Patent application title: USE OF A SEED SPECIFIC PROMOTER TO DRIVE ODP1 EXPRESSION IN CRUCIFEROUS OILSEED PLANTS TO INCREASE OIL CONTENT WHILE MAINTAINING NORMAL GERMINATION
Inventors:
Howard Glenn Damude (Hockessin, DE, US)
John D. Everard (Wilmington, DE, US)
Knut Meyer (Wilmington, DE, US)
Knut Meyer (Wilmington, DE, US)
Kevin G. Ripp (Wilmington, DE, US)
Kevin G. Ripp (Wilmington, DE, US)
Kevin L. Stecca (Bear, DE, US)
Kevin L. Stecca (Bear, DE, US)
Assignees:
E.I. DU PONT DE NEMOURS AND COMPANY
IPC8 Class: AC12N1582FI
USPC Class:
800281
Class name: Multicellular living organisms and unmodified parts thereof and related processes method of introducing a polynucleotide molecule into or rearrangement of genetic material within a plant or plant part the polynucleotide alters fat, fatty oil, ester-type wax, or fatty acid production in the plant
Publication date: 2013-12-26
Patent application number: 20130347142
Abstract:
A recombinant DNA construct comprising a polynucleotide encoding an ODP1
polypeptide operably linked to a sucrose synthase 2 promoter where this
construct can be used to increase oil content in the seeds of a
cruciferous oilseed plant while maintaining normal germination is
disclosed. A method for increasing oil content in the seeds of a
cruciferous oilseed plant while maintaining normal germination using this
construct is also disclosed.Claims:
1. A recombinant DNA construct comprising a polynucleotide encoding an
ODP1 polypeptide operably linked to a sucrose synthase 2 promoter (SUS2)
wherein the SUS2 promoter comprises a nucleotide sequence having at least
95% sequence identity to SEQ ID NO:43, wherein said nucleotide sequence
has seed specific promoter activity and wherein the amino acid sequence
of said ODP1 polypeptide has at least 90% sequence identity to SEQ ID
NO:39 and comprises two APETALA2 (AP2) domains and wherein expression of
said ODP1 polypeptide increases oil content in the seeds of a cruciferous
oilseed plant while maintaining normal germination.
2. The recombinant DNA construct of claim 1 wherein the amino acid sequence of said ODP1 polypeptide has at least 95% sequence identity to SEQ ID NO:39.
3. The recombinant DNA construct of claim 1 wherein the amino acid sequence of said ODP1 polypeptide comprises SEQ ID NO:39.
4. The recombinant DNA construct of claim 1 wherein the sucrose synthase 2 promoter comprises the nucleotide sequence of SEQ ID NO:43.
5. The recombinant DNA construct of claim 2, wherein the sucrose synthase 2 promoter comprises the nucleotide sequence of SEQ ID NO:43.
6. The recombinant DNA construct of claim 3, wherein the sucrose synthase 2 promoter comprises the nucleotide sequence of SEQ ID NO:43.
7. The recombinant DNA construct of claim 1 wherein the oilseed plant is canola or Arabidopsis.
8. A transgenic cruciferous oilseed plant comprising in its genome the recombinant DNA construct of claim 1.
9. The transgenic cruciferous oilseed plant of claim 8 wherein the cruciferous oilseed plant is selected from the group consisting of canola and Arabidopsis.
10. A transgenic seed obtained from the plant of claim 8, wherein said seed comprises in its genome said recombinant DNA construct.
11. The transgenic cruciferous oilseed plant of claim 8, wherein the amino acid sequence of said ODP1 polypeptide comprises SEQ ID NO:39.
12. The transgenic cruciferous oilseed plant of claim 8, wherein the sucrose synthase 2 promoter comprises the nucleotide sequence of SEQ ID NO:43 and wherein the amino acid sequence of said ODP1 polypeptide has at least 95% sequence identity to SEQ ID NO:39.
13. The transgenic cruciferous oilseed plant of claim 8, wherein the sucrose synthase 2 promoter comprises the nucleotide sequence of SEQ ID NO:43 and wherein the amino acid sequence of said ODP1 polypeptide comprises SEQ ID NO:39.
14. A transgenic seed obtained from the plant of claim 8, wherein said seed comprises in its genome said recombinant DNA construct and wherein the amino acid sequence of said ODP1 polypeptide comprises SEQ ID NO:39.
15. A transgenic seed obtained from the plant of claim 8, wherein said seed comprises in its genome said recombinant DNA construct and wherein the sucrose synthase 2 promoter comprises the nucleotide sequence of SEQ ID NO:43 and wherein the amino acid sequence of ODP1 polypeptide has at least 95% sequence identity to SEQ ID NO:39.
16. A transgenic seed obtained from the plant of claim 8, wherein said seed comprises in its genome said recombinant DNA construct and wherein the sucrose synthase 2 promoter comprises the nucleotide sequence of SEQ ID NO:43 and wherein the amino acid sequence of said ODP1 polypeptide comprises SEQ ID NO:39.
17. A method for producing a transgenic cruciferous oilseed plant comprising transforming a cruciferous oilseed plant cell with the recombinant DNA construct of claim 1 and regenerating a transgenic cruciferous oilseed plant from the transformed cruciferous oilseed plant cell, wherein the transgenic cruciferous oilseed plant comprises in its genome said recombinant DNA construct.
18. The method of claim 17 wherein the cruciferous oilseed plant is selected from the group consisting of canola and Arabidopsis.
19. A method for increasing oil content in seeds of a transgenic cruciferous oilseed plant while maintaining normal germination, said method comprising: (a) transforming a cruciferous oilseed plant cell with a recombinant DNA construct comprising a polynucleotide encoding an ODP1 polypeptide, wherein the amino acid sequence of said ODP1 polypeptide has at least 90% sequence identity to SEQ ID NO:39 and comprises two APETALA2 (AP2) domains, said polynucleotide being operably linked to a promoter having a nucleotide sequence at least 95% identical to SEQ ID NO: 43, wherein said nucleotide sequence has seed specific promoter activity; (b) regenerating a transgenic cruciferous oilseed plant from the transformed cell of step (a), wherein said plant comprises the recombinant DNA construct; (c) obtaining a transgenic progeny plant derived from the transgenic cruciferous oilseed plant of step (b), wherein the transgenic progeny plant comprises in its genome the recombinant DNA construct; (d) assaying the transgenic progeny plant obtained from step (c) for oil level and germination; and (e) selecting those transgenic progeny plants having seeds comprising said recombinant DNA construct and having an increased level of oil and normal germination when compared to seeds obtained from a control cruciferous oilseed plant, wherein said control plant does not comprise the recombinant DNA construct.
20. The method of claim 19 wherein the amino acid sequence of the ODP1 polypeptide comprises the sequence of SEQ ID NO:39.
21. The method of claim 19 wherein the promoter comprises SEQ ID NO:43.
22. The method of claim 21 wherein the ODP1 polypeptide comprises at least 95% sequence identity to SEQ ID NO: 39.
23. The method of claim 19 wherein the cruciferous oilseed plant is canola or Arabidopsis.
Description:
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application is a divisional of U.S. patent application Ser. No. 12/752,175, filed Apr. 1, 2010, which claims the benefit of U.S. Provisional Application No. 61/165,548, filed Apr. 1, 2009, the entire content of which is herein incorporated by reference.
REFERENCE TO SEQUENCE LISTING SUBMITTED ELECTRONICAL
[0002] The official copy of the sequence listing is submitted electronically via EFS-Web as an ASCII formatted sequence listing with a file named 429238seqlist.txt, created on Feb. 12, 2013, and having a size of 604 KB and is filed concurrently with the specification. The sequence listing contained in this ASCII formatted document is part of the specification and is herein incorporated by reference in its entirety
FIELD OF THE INVENTION
[0003] This invention is in the field of biotechnology, in particular, this pertains to increasing oil content while maintaining normal germination in a cruciferous oilseed plant using a seed specific promoter to drive expression of ODP1.
BACKGROUND OF THE INVENTION
[0004] Plant lipids have a variety of industrial and nutritional uses and are central to plant membrane function and climatic adaptation. These lipids represent a vast array of chemical structures, and these structures determine the physiological and industrial properties of the lipid. Many of these structures result either directly or indirectly from metabolic processes that alter the degree of unsaturation of the lipid. Different metabolic regimes in different plants produce these altered lipids, and either domestication of exotic plant species or modification of agronomically adapted species is usually required to produce economically large amounts of the desired lipid.
[0005] There are serious limitations to using mutagenesis to alter fatty acid composition and content. Screens will rarely uncover mutations that a) result in a dominant ("gain-of-function") phenotype, b) are in genes that are essential for plant growth, and c) are in an enzyme that is not rate-limiting and that is encoded by more than one gene. In cases where desired phenotypes are available in mutant corn lines, their introgression into elite lines by traditional breeding techniques is slow and expensive, since the desired oil compositions are likely the result of several recessive genes.
[0006] Recent molecular and cellular biology techniques offer the potential for overcoming some of the limitations of the mutagenesis approach, including the need for extensive breeding. Some of the particularly useful technologies are seed-specific expression of foreign genes in transgenic plants (see Goldberg et al (1989) Cell 56:149-160), and the use of antisense RNA to inhibit plant target genes in a dominant and tissue-specific manner (see van der Krol et al (1988) Gene 72:45-50). Other advances include the transfer of foreign genes into elite commercial varieties of commercial oilcrops, such as soybean (Chee et al (1989) Plant Physiol. 91:1212-1218; Christou et al (1989) Proc. Natl. Acad. Sci. U.S.A. 86:7500-7504; Hinchee et al (1988) Bio/Technology 6:915-922; EPO publication 0 301 749 A2), rapeseed (De Block et al (1989) Plant Physiol. 91:694-701), and sunflower (Everett et al (1987) Bio/Technology 5:1201-1204), and the use of genes as restriction fragment length polymorphism (RFLP) markers in a breeding program, which makes introgression of recessive traits into elite lines rapid and less expensive (Tanksley et al (1989) Bio/Technology 7:257-264). However, application of each of these technologies requires identification and isolation of commercially-important genes.
[0007] Transcription factors generally bind DNA in a sequence-specific manner and either activate or repress transcription initiation. The specific mechanisms of these interactions remain to be fully elucidated. At least three types of separate domains have been identified within transcription factors. One is necessary for sequence-specific DNA recognition, one for the activation/repression of transcriptional initiation, and one for the formation of protein-protein interactions (such as dimerization). Studies indicate that many plant transcription factors can be grouped into distinct classes based on their conserved DNA binding domains (Katagiri F and Chua N H, 1992, Trends Genet. 8:22-27; Menkens A E, Schindler U and Cashmore A R, 1995, Trends in Biochem Sci. 13:506-510; Martin C and Paz-Ares J, 1997, Trends Genet. 13:67-73). Each member of these families interacts and binds with distinct DNA sequence motifs that are often found in multiple gene promoters controlled by different regulatory signals.
[0008] Several transcription factor families have been identified in plants. For example, nucleotide sequences encoding the following transcription factors families have been identified: Alfin-like, AP2 (APETALA2) and EREBPs (ethylene-responsive element binding proteins), ARF, AUX/IAA, bHLH, bZIP, C2C2 (Zn), C2C2 (Co-like), C2C2 (Dof), C2C2 (GATA), C2C2 (YABBY), C2H2 (Zn), C3H-type, CCAAT, CCAAT HAP3, CCAAT HAP5, CPP (Zn), DRAP1, E2F/DP, GARP, GRAS, HMG-BOX, HOMEO BOX, HSF, Jumanji, LFY, LIM, MADS Box, MYB, NAC, NIN-like, Polycomb-like, RAV-like, SBP, TCP, TFIID, Transfactor, Trihelix, TUBBY, and WRKY.
[0009] WO 2005/075655 published on Aug. 18, 2005 describes an AP2 domain transcription factor ODP2 (ovule development protein 2) and methods of U.S. Pat. No. 7,157,621 which issued on Jan. 2, 2007, describes the alteration of oil traits in plants through controlled expression of selected genes in plants.
[0010] The AP2/ERF family of proteins is a plant-specific class of putative transcription factors that have been shown to regulate a wide-variety of developmental processes and are characterized by the presence of an AP2/ERF DNA binding domain. The AP2/ERF proteins have been subdivided into two distinct subfamilies based on whether they contain one (ERF subfamily) or two (AP2 subfamily) DNA binding domains.
[0011] Specifically, AP2 (APETALA2) and EREBPs (ethylene-responsive element binding proteins) are the prototypic members of a family of transcription factors unique to plants, whose distinguishing characteristic is that they contain the so-called AP2 DNA-binding domain. AP2/EREBP genes form a large multigene family, and they play a variety of roles throughout the plant life cycle. AP2/EREBP genes are key regulators of several developmental processes, including floral organ identity determination and leaf epidermal cell identity. In Arabidopsis thaliana, the homeotic gene APETALA2 (AP2) has been shown to control three salient processes during development: (1) the specification of flower organ identity throughout floral organogenesis (Jofuku et al., Plant Cell 6:1211-1225, 1994); (2) establishment of flower meristem identity (Irish and Sussex, Plant Cell 2:8:741-753, 1990); and (3) the temporal and spatial regulation of flower homeotic gene activity (Drews et al., Cell 65:6:991-1002, 1991). DNA sequence analysis suggests that AP2 encodes a theoretical polypeptide of 432 aa, with a distinct 68 aa repeated motif termed the AP2 domain. This domain has been shown to be essential for AP2 functions and contains within the 68 aa, an eighteen amino acid core region that is predicted to form an amphipathic α-helix (Jofuku et al., Plant Cell 6:1211-1225, 1994). AP2-like domain-containing transcription factors have been also been identified in both Arabidopsis thaliana (Okamuro et al., Proc. Natl. Acad. Sci. USA 94:7076-7081, 1997) and in tobacco with the identification of the ethylene responsive element binding proteins (EREBPs) (Ohme-Takagi and Shinshi, Plant Cell 7:2:173-182, 1995). In Arabidopsis, these RAP2 (related to AP2) genes encode two distinct subfamilies of AP2 domain-containing proteins designated AP2-like and EREBP-like (Okamuro et al., Proc. Natl. Acad. Sci. USA 94:7076-7081, 1997). In vitro DNA binding has not been shown to date using the RAP2 proteins. Based upon the presence of two highly conserved motifs YRG and RAYD within the AP2 domain, it has been proposed that binding DNA binding occurs in a manner similar to that of AP2 proteins.
[0012] As was noted above, regulation of transcription of most eukaryotic genes is coordinated through sequence-specific binding of proteins to the promoter region located upstream of the gene. Many of these protein-binding sequences have been conserved during evolution and are found in a wide variety of organisms. One such feature is the "CCAAT" sequence element (Edwards et al, 1998, Plant Physiol. 117:1015-1022). CCAAT boxes are a feature of gene promoters in many eukaryotes including several plant gene promoters.
[0013] HAP proteins constitute a large family of transcription factors first identified in yeast. They combine to from a heteromeric protein complex that activates transcription by binding to CCAAT boxes in eukaryotic promoters. The orthologous HAP proteins display a high degree of evolutionary conservation in their functional domains in all species studied to date (Li et al., 1991, Nucleic Acids Res. 20:1087-1091).
[0014] WO 00/28058 published on May 18, 2000 describes HAP3-type CCAAT-box binding transcriptional activator polynucleotides and polypeptides, especially, the leafy cotyledon 1 transcriptional activator (LEC1) polynucleotides and polypeptides.
[0015] WO 99/67405 describes leafy cotyledon1 genes and their uses.
[0016] The human, murine and plant homologues of CCAAT-binding proteins have been isolated and characterized based on their sequence similarity with their yeast counterparts (Li et al., 1991, Nucleic Acids Res. 20:1087-1091). This high degree of sequence homology translates remarkably into functional interchangeability among orthologue proteins of different species (Sinha et al, 1995, Proc. Natl. Acad. Sci. USA 92:1624-1628). Unlike yeast, multiple forms of each HAP homolog have been identified in plants (Edwards et al, 1998, Plant Physiol. 117:1015-1022).
[0017] Molecular and genetic analysis revealed HAP members to be involved in the control of diverse and critical biological processes ranging from development and cell cycle regulation to metabolic control and homeostasis (Lotan et al, 1998, Cell 93:1195-1205; Lopez et al, 1996, Proc. Natl. Acad. Sci. USA 93:1049-1053). In yeast, HAPs are involved in the transcriptional control of metabolic processes such as the regulation of catabolic derepression of cycl and other genes involved in respiration (Becker et al., 1991, Proc. Natl. Acad. Sci. USA 88:1968-1972).
[0018] In mammalian systems, several reports describe HAPs as direct or indirect regulators of several important genes involved in lipid biosynthesis such as fatty acid synthase (Roder et al, 1997, Gene 184:21-26), farnesyl diphosphate (FPP) synthase (Jackson et al, 1995, J. Biol. Chem. 270:21445-21448; Ericsson et al, 1996, J. Biol. Chem. 217:24359-24364), glycerol-3-phosphate acyltransferase (GPA, Jackson et al, 1997), acetyl-CoA carboxylase (ACC, Lopez et al, 1996, Proc. Natl. Acad. Sci. USA 93:1049-1053) and 3-hydroxy-3-methylglutaryl-coenzyme A (HMG-CoA) synthase (Jackson et al, 1995, J. Biol. Chem. 270:21445-21448), among others.
[0019] In addition, other CCAAT-binding transcription factors have also been reported to be involved in different aspects of the control of lipid biosynthesis and adipocyte growth and differentiation in mammalian systems (see McKnight et al, 1989).
[0020] It appears that the currently available evidence to date points to a family of proteins of the CCAAT-binding transcription factors as important modulators of metabolism and lipid biosynthesis in mammalian systems. Such a determination has not been made for plant systems.
[0021] Other polypeptides that influence ovule and embryo development and stimulate cell growth, such as, Lec1, Kn1, WUSCHEL, Zwille and Aintegumeta (ANT) allow for increased transformation efficiencies when expressed in plants. See, for example, U.S. Application No. 2003/0135889, herein incorporated by reference. In fact, a maize Lec1 homologue of the Arabidopsis embryogenesis controlling gene AtLEC1, has been shown to increase oil content and transformation efficiencies in plants. See, for example, WO 03001902 and U.S. Pat. No. 6,512,165.
[0022] The putative AP2/EREBP transcription factor WRINKLED1 (WRI1) is involved in the regulation of seed storage metabolism in Arabidopsis (Cermac and Benning, 2004, Plant J. 40:575-585). Expression of the WRI1® cDNA under the control of the CaMV 35S promoter led to increased seed oil content. Oil-accumulating seedlings, however, showed aberrant development consistent with a prolonged embryonic state. Nucleic acid molecules encoding WRINKLED1-LIKE polypeptides and methods of use are also described in International Publication No. WO 2006/00732 A2.
[0023] Because transcription factors regulate transcription and orchestrate gene expression in plants and other organisms, control of transcription factor gene expression provides a powerful means for altering plant phenotype. The transformation of plants with transcription factors, however, can result in aberrant development based on the overexpression and/or ectopic expression of the transcription factor. In the current invention, it has been found that use of a seed specific promoter, such as SUS2 from Arabidopsis, can drive expression of an ODP1 gene thereby increasing oil content in the seeds of a cruciferous oilseed plant without negatively affecting germination and seedling establishment.
SUMMARY OF THE INVENTION
[0024] In a first embodiment, the present invention concerns a recombinant DNA construct comprising a polynucleotide encoding an ODP1 polypeptide operably linked to a sucrose synthase 2 promoter wherein said construct increases oil content in the seeds of a cruciferous oilseed plant while maintaining normal germination and further wherein the amino acid sequence of said ODP1 polypeptide has at least 80%, at least 90%, at least 95% or 100% sequence identity to a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:39, and SEQ ID NO:41.
[0025] In another embodiment, the present invention concerns a recombinant construct comprising a sucrose synthase 2 promoter which comprises: (a) the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73, or (b) a nucleotide sequence comprising a functional fragment of the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73.
[0026] In another embodiment, the present invention concerns a transgenic cruciferous oilseed plant comprising in its genome the recombinant DNA construct of the invention. Also included are transgenic seeds obtained from such transgenic cruciferous oilseed plants, wherein the transgenic seed comprises in its genome the recombinant DNA construct of the invention.
[0027] In another embodiment, the present invention concerns a method for producing a transgenic cruciferous oilseed plant comprising transforming a cruciferous oilseed plant cell with the recombinant construct of the invention and regenerating a transgenic plant from the transformed plant cell, wherein the transgenic cruciferous oilseed plant comprises in its genome the recombinant DNA construct of the invention.
[0028] In another embodiment, the present invention concerns a method for increasing oil content in seeds of a transgenic cruciferous oilseed plant while maintaining normal germination, said method comprising:
[0029] (a) transforming a cruciferous oilseed plant cell with a recombinant DNA construct comprising a polynucleotide encoding an ODP1 polypeptide, wherein the amino acid sequence of said ODP1 polypeptide has at least 80%, at least 90% or at least 95% sequence identity with a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:39, and SEQ ID NO:41, said sequence being operably linked to a seed specific promoter;
[0030] (b) regenerating a transgenic cruciferous oilseed plant from the transformed cell of step (a), wherein said plant comprises the recombinant DNA construct;
[0031] (c) obtaining a transgenic progeny plant derived from the transgenic cruciferous oilseed plant of step (b), wherein the transgenic progeny plant comprises in its genome the recombinant DNA construct;
[0032] (d) assaying the transgenic progeny plant obtained from step (c) for oil level and germination; and
[0033] (e) selecting those transgenic progeny plants having seeds with an increased level of oil and normal germination when compared to seeds obtained from a control cruciferous oilseed plant, wherein said control plant does not comprise the recombinant DNA construct.
[0034] In another embodiment, the present invention concerns a method of the invention wherein the ODP1 polypeptide is a maize ODP1 polypeptide and, more specifically, the amino acid sequence of the ODP1 polypeptide comprises the sequence of SEQ ID NO:37. In addition, the seed specific promoter can be a sucrose synthase 2 promoter and, more specifically, the nucleotide sequence of sucrose synthase 2 promoter comprises (a) the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73; or (b) a nucleotide sequence comprising a functional fragment of the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73.
[0035] In another embodiment, the present invention concerns oil or by-products obtained from transgenic seed of the invention.
[0036] In another embodiment, the cruciferous oilseed plant or seed of any of the compositions or methods of the present invention can be canola or Arabidopsis or other plant species including but not limited to the following: Barbarea vulgaris, Brassica campestris, Brassica carinata, Brassica elongate, Brassica fruticulosa, Brassica hirta, Brassica juncea, Brassica napus, Brassica narinosa, Brassica nigra, Brassica oleracea, Brassica perviridis, Brassica rapa, Brassica rupestris, Brassica septiceps, Brassica tournefortii, Brassica verna, Camelina sativa, Crambe abyssinica, Lepidium campestre, Raphanus sativus, Sinapis alba.
BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE LISTING
[0037] The invention can be more fully understood from the following detailed description and the accompanying drawings and Sequence Listing which form a part of this application.
[0038] FIG. 1A-1C show a multiple alignment of the ODP1 polypeptides of Zea mays (SEQ ID NO:37), Glycine max (SEQ ID NO:39), Momordica charantia (SEQ ID NO:41), and the WRINKLED1 gene from Arabidopsis thaliana (SEQ ID NO:42; NCBI GI NO. 32364685). The multiple alignment was assembled using the Clustal V method of alignment with the default parameters. Residues that match SEQ ID NO:37 exactly are enclosed in a box. Above the alignment is shown a consensus sequence. A residue is shown in the consensus sequence when all residues at that position are identical.
[0039] FIG. 2 shows the percent sequence identity and divergence for each pair of polypeptides from the multiple alignment of FIG. 1A-1C.
[0040] SEQ ID NO:1 is the nucleotide sequence of vector pKS121/BS.
[0041] SEQ ID NO:2 is the nucleotide sequence of vector pDsRedxKS121/BS.
[0042] SEQ ID NO:3 is the nucleotide sequence of vector pKS332,
[0043] SEQ ID NO:4 is the nucleotide sequence of PCR primer MWG345.
[0044] SEQ ID NO:5 is the nucleotide sequence of PCR primer MWG346.
[0045] SEQ ID NO:6 is the nucleotide sequence of vector pKS336,
[0046] SEQ ID NO:7 is the nucleotide sequence of the T-DNA of the plant transformation vector pZBL120×KS336.
[0047] SEQ ID NO:8 is the nucleotide sequence of PCR primer MWG339.
[0048] SEQ ID NO:9 is the nucleotide sequence of PCR primer MWG340.
[0049] SEQ ID NO:10 is the nucleotide sequence of vector pKS333.
[0050] SEQ ID NO:11 is the nucleotide sequence of the T-DNA of the plant transformation vector pZBL120×KS333.
[0051] SEQ ID NO:12 is the nucleotide sequence of PCR primer MWG341.
[0052] SEQ ID NO:13 is the nucleotide sequence of PCR primer MWG342.
[0053] SEQ ID NO:14 is the nucleotide sequence of vector pKS334.
[0054] SEQ ID NO:15 is the nucleotide sequence of the T-DNA of the plant transformation vector pZBL120×KS334.
[0055] SEQ ID NO:16 is the nucleotide sequence of vector pKR132.
[0056] SEQ ID NO:17 is the nucleotide sequence of vector pKR627.
[0057] SEQ ID NO:18 is the nucleotide sequence of vector KS294.
[0058] SEQ ID NO:19 is the nucleotide sequence of vector pKR1142.
[0059] SEQ ID NO:20 is the nucleotide sequence of vector pKR1141.
[0060] SEQ ID NO:21 is the nucleotide sequence of PCR primer SuSy-5.
[0061] SEQ ID NO:22 is the nucleotide sequence of PCR primer SuSy-3.
[0062] SEQ ID NO:23 is the nucleotide sequence of vector pLF122.
[0063] SEQ ID NO:24 is the nucleotide sequence of vector pKR1155.
[0064] SEQ ID NO:25 is the nucleotide sequence of vector pKR1158.
[0065] SEQ ID NO:26 is the nucleotide sequence of vector pKR1167.
[0066] SEQ ID NO:27 is the nucleotide sequence of vector pKR92.
[0067] SEQ ID NO:28 is the nucleotide sequence of vector pKR1223.
[0068] SEQ ID NO:29 is the nucleotide sequence of vector pKR268.
[0069] SEQ ID NO:30 is the nucleotide sequence of vector pKR1143.
[0070] SEQ ID NO:31 is the nucleotide sequence of vector pKR1147.
[0071] SEQ ID NO:32 is the nucleotide sequence of vector pKR1220.
[0072] SEQ ID NO:33 is the nucleotide sequence of vector pKR1144.
[0073] SEQ ID NO:34 is the nucleotide sequence of vector pKR1149.
[0074] SEQ ID NO:35 is the nucleotide sequence of vector pKR1221.
[0075] SEQ ID NO:36 is the nucleotide sequence of the maize ODP1 coding region from cDNA clone cde1c.pk003.o22.
[0076] SEQ ID NO:37 is the amino acid sequence of the maize ODP1 encoded by SEQ ID NO:36. SEQ ID NO:37 is identical to SEQ ID NO:320 in U.S. Pat. No. 7,157,621.
[0077] SEQ ID NO:38 is the nucleotide sequence of the soybean ODP1 coding region from cDNA clone se3.pk0003.f5.
[0078] SEQ ID NO:39 is the amino acid sequence of the soybean ODP1 encoded by SEQ ID NO:38. SEQ ID NO:39 is identical to SEQ ID NO:481 in U.S. Pat. No. 7,157,621.
[0079] SEQ ID NO:40 is the nucleotide sequence of the Momordica charantia ODP1 coding region from cDNA clone fds1n.pk015.115.
[0080] SEQ ID NO:41 is the amino acid sequence of the Momordica charantia ODP1 encoded by SEQ ID NO:40. SEQ ID NO:41 is identical to SEQ ID NO:477 in U.S. Pat. No. 7,157,621.
[0081] SEQ ID NO:42 is the amino acid sequence of WRINKLED1 (WRI1) from Arabidopsis thaliana and corresponds to NCBI GI NO. 32364685.
[0082] SEQ ID NO:43 is the nucleotide sequence of the sucrose synthase 2 (SUS2) promoter from Arabidopsis thaliana that is present in vector pKR1223.
[0083] SEQ ID NO:44 is the nucleotide sequence of the canola SUS2 homolog.
[0084] SEQ ID NO:45 is the amino acid sequence of the canola SUS2 homolog encoded by SEQ ID NO:44.
[0085] SEQ ID NO:46 is the nucleotide sequence of primer a.
[0086] SEQ ID NO:47 is the nucleotide sequence of primer b.
[0087] SEQ ID NO:48 is the nucleotide sequence of primer c.
[0088] SEQ ID NO:49 is the nucleotide sequence of primer d.
[0089] SEQ ID NO:50 is the nucleotide sequence of "PvuII rapa cons", a genomic sequence of canola variety NS1822BC that was generated with primers a and b.
[0090] SEQ ID NO:51 is the nucleotide sequence of "1,6 DraI gene cons", a genomic sequence of canola variety NS1822BC that was generated with primers c and d.
[0091] SEQ ID NO:52 is the nucleotide sequence of primer SA188.
[0092] SEQ ID NO:53 is the nucleotide sequence of primer SA189.
[0093] SEQ ID NO:54 is the nucleotide sequence of primer SA190.
[0094] SEQ ID NO:55 is the nucleotide sequence of primer SA191.
[0095] SEQ ID NO:56 is the nucleotide sequence of "BN SUS2 prom1/PCR blunt", which is derived from 1,6 DraI gene cons (SEQ ID NO:51).
[0096] SEQ ID NO:57 is the nucleotide sequence of "BN SUS2 prom2/PCR blunt", which is derived from PvuII rapa cons (SEQ ID NO:50).
[0097] SEQ ID NO:58 is the nucleotide sequence of vector KS427.
[0098] SEQ ID NO:59 is the nucleotide sequence of vector KS 130.
[0099] SEQ ID NO:60 is the nucleotide sequence of vector KS432.
[0100] SEQ ID NO:61 is the nucleotide sequence of vector ARALO80,
[0101] SEQ ID NO:62 is the nucleotide sequence of primer D6 fwd.
[0102] SEQ ID NO:63 is the nucleotide sequence of primer D6 rev,
[0103] SEQ ID NO:64 is the nucleotide sequence of vector KS 119.
[0104] SEQ ID NO:65 is the nucleotide sequence of vector KS430.
[0105] SEQ ID NO:66 is the nucleotide sequence of vector ARALO78.
[0106] SEQ ID NO:67 is the nucleotide sequence of vector KS428.
[0107] SEQ ID NO:68 is the nucleotide sequence of vector KS429.
[0108] SEQ ID NO:69 is the nucleotide sequence of vector ARALO77.
[0109] SEQ ID NO:70 is the nucleotide sequence of vector KS431.
[0110] SEQ ID NO:71 is the nucleotide sequence of vector ARALO79.
[0111] SEQ ID NO:72 is the nucleotide sequence of the sucrose synthase 2-1 (BnSUS2-1) promoter from Brassica napus that is present in BN SUS2 prom1/PCR blunt.
[0112] SEQ ID NO:73 is the nucleotide sequence of the sucrose synthase 2-2 (BnSUS2-2) promoter from Brassica napus that is present in BN SUS2 prom2/PCR blunt.
[0113] The sequence descriptions and Sequence Listing attached hereto comply with the rules governing nucleotide and/or amino acid sequence disclosures in patent applications as set forth in 37 C.F.R. §1.821-1.825.
[0114] The Sequence Listing contains the one letter code for nucleotide sequence characters and the three letter codes for amino acids as defined in conformity with the IUPAC-IUBMB standards described in Nucleic Acids Res. 13:3021-3030 (1985) and in the Biochemical J. 219 (No. 2):345-373 (1984) which are herein incorporated by reference. The symbols and format used for nucleotide and amino acid sequence data comply with the rules set forth in 37 C.F.R. §1.822.
DETAILED DESCRIPTION OF THE INVENTION
[0115] All patents, patent applications, and publications cited herein are incorporated by reference in their entirety.
[0116] As used herein and in the appended claims, the singular forms "a", "an", and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a plant" includes a plurality of such plants, reference to "a cell" includes one or more cells and equivalents thereof known to those skilled in the art, and so forth.
[0117] Units, prefixes, and symbols may be denoted in their SI accepted form. Unless otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxyl orientation, respectively. Numeric ranges recited within the specification are inclusive of the numbers defining the range and include each integer within the defined range. Amino acids may be referred to herein by either commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted single-letter codes. Unless otherwise provided for, software, electrical, and electronics terms as used herein are as defined in The New IEEE Standard Dictionary of Electrical and Electronics Terms (5th edition, 1993). The terms defined below are more fully defined by reference to the specification as a whole.
[0118] In the context of this disclosure, a number of terms and abbreviations are used. The following definitions are provided.
[0119] The term "ODP1" refers to an ovule development protein 1 that is involved with increasing oil content.
[0120] The term "sucrose synthase" (SUS) refers to an enzyme used in carbohydrate metabolism that catalyzes the reversible conversion of sucrose and uridine diphosphate (UDP) to UDP-glucose and fructose in vitro. The terms "Arabidopsis sucrose synthase 2", "AtSuSy" and "AtSUS2") are used interchangeably herein. The Arabidopsis sucrose synthase 2 gene is from genomic locus At5g49190,
[0121] The term "germination" refers to the initial stages in the growth of a seed to form a seedling.
[0122] The term "recombinant" refers to an artificial combination of two otherwise separated segments of sequence, e.g., by chemical synthesis or by the manipulation of isolated segments of nucleic acids by genetic engineering techniques.
[0123] The terms "recombinant construct", "expression construct", "chimeric construct", "construct", and "recombinant DNA construct" are used interchangeably herein. A recombinant construct comprises an artificial combination of nucleic acid fragments, e.g., regulatory and coding sequences that are not found together in nature. For example, a chimeric construct may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. Such a construct may be used by itself or may be used in conjunction with a vector. If a vector is used, then the choice of vector is dependent upon the method that will be used to transform host cells as is well known to those skilled in the art. For example, a plasmid vector can be used. The skilled artisan is well aware of the genetic elements that must be present on the vector in order to successfully transform, select and propagate host cells comprising any of the isolated nucleic acid fragments of the invention. The skilled artisan will also recognize that different independent transformation events will result in different levels and patterns of expression (Jones et al., EMBO J. 4:2411-2418 (1985); De Almeida et al., Mol. Gen. Genetics 218:78-86 (1989)), and thus that multiple events must be screened in order to obtain lines displaying the desired expression level and pattern. Such screening may be accomplished by Southern analysis of DNA, Northern analysis of mRNA expression, immunoblotting analysis of protein expression, or phenotypic analysis, among others.
[0124] This construct may comprise any combination of deoxyribonucleotides, ribonucleotides, and/or modified nucleotides. The construct may be transcribed to form an RNA, wherein the RNA may be capable of forming a double-stranded RNA and/or hairpin structure. This construct may be expressed in the cell, or isolated or synthetically produced. The construct may further comprise a promoter, or other sequences which facilitate manipulation or expression of the construct.
[0125] As used herein, "encodes" or "encoding" refers to a DNA sequence which can be processed to generate an RNA and/or polypeptide.
[0126] As used herein, "expression" or "expressing" refers to production of a functional product, such as, the generation of an RNA transcript from an introduced construct, an endogenous DNA sequence, or a stably incorporated heterologous DNA sequence. The term may also refer to a polypeptide produced from an mRNA generated from any of the above DNA precursors. Thus, expression of a nucleic acid fragment may refer to transcription of the nucleic acid fragment (e.g., transcription resulting in mRNA or other functional RNA) and/or translation of RNA into a precursor or mature protein (polypeptide).
[0127] As used herein, "heterologous" with respect to a sequence means a sequence that originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. For example, with respect to a nucleic acid, it can be a nucleic acid that originates from a foreign species, or is synthetically designed, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. A heterologous protein may originate from a foreign species or, if from the same species, is substantially modified from its original form by deliberate human intervention.
[0128] "Plant" includes reference to whole plants, plant organs, plant tissues, seeds and plant cells and progeny of same. Plant cells include, without limitation, cells from seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores.
[0129] The term "plant parts" includes differentiated and undifferentiated tissues including, but not limited to the following: roots, stems, shoots, leaves, pollen, seeds, tumor tissue and various forms of cells and culture (e.g., single cells, protoplasts, embryos and callus tissue). The plant tissue may be in plant or in a plant organ, tissue or cell culture.
[0130] The term "plant organ" refers to plant tissue or group of tissues that constitute a morphologically and functionally distinct part of a plant.
[0131] "Progeny" comprises any subsequent generation of a plant. Progeny will inherit, and stably segregate, genes and transgenes from its parent plant(s).
[0132] The term "introduced" means providing a nucleic acid (e.g., expression construct) or protein into a cell. Introduced includes reference to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell, and includes reference to the transient provision of a nucleic acid or protein to the cell. Introduced includes reference to stable or transient transformation methods, as well as sexually crossing. Thus, "introduced" in the context of inserting a nucleic acid fragment (e.g., a recombinant DNA construct/expression construct) into ac ell, means "transfection" or "transformation" or "transduction" and includes reference to the incorporation of a nucleic acid fragment into a eukaryotic or prokaryotic cell where the nucleic acid fragment may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
[0133] The term "genome" as it applies to a plant cells encompasses not only chromosomal DNA found within the nucleus, but organelle DNA found within subcellular components (e.g., mitochondrial, plastid) of the cell.
[0134] The term "isolated" refers to material, such as a nucleic acid or a protein, which is: (1) substantially or essentially free from components which normally accompany or interact with the material as found in its naturally occurring environment or (2) if the material is in its natural environment, the material has been altered by deliberate human intervention to a composition and/or placed at a locus in the cell other than the locus native to the material.
[0135] As used herein, "nucleic acid" means a polynucleotide and includes single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases. Nucleic acids may also include fragments and modified nucleotides. Thus, the terms "polynucleotide", "nucleic acid sequence", "nucleotide sequence" or "nucleic acid fragment" are used interchangeably and is a polymer of RNA or DNA that is single- or double-stranded, optionally containing synthetic, non-natural or altered nucleotide bases. Nucleotides (usually found in their 5'-monophosphate form) are referred to by their single letter designation as follows: "A" for adenylate or deoxyadenylate (for RNA or DNA, respectively), "C" for cytidylate or deosycytidylate, "G" for guanylate or deoxyguanylate, "U" for uridlate, "T" for deosythymidylate, "R" for purines (A or G), "Y" for pyrimidines (C or T), "K" for G or T, "H" for A or C or T, "I" for inosine, and "N" for any nucleotide.
[0136] The terms "subfragment that is functionally equivalent" and "functionally equivalent subfragment" are used interchangeably herein. These terms refer to a portion or subsequence of an isolated nucleic acid fragment in which the ability to alter gene expression or produce a certain phenotype is retained whether or not the fragment or subfragment encodes an active enzyme. For example, the fragment or subfragment can be used in the design of chimeric genes to produce the desired phenotype in a transformed plant. Chimeric genes can be designed for use in suppression by linking a nucleic acid fragment or subfragment thereof, whether or not it encodes an active enzyme, in the sense or antisense orientation relative to a plant promoter sequence.
[0137] The term "conserved domain" or "motif" means a set of amino acids conserved at specific positions along an aligned sequence of evolutionarily related proteins. While amino acids at other positions can vary between homologous proteins, amino acids that are highly conserved at specific positions indicate amino acids that are essential in the structure, the stability, or the activity of a protein. Because they are identified by their high degree of conservation in aligned sequences of a family of protein homologues, they can be used as identifiers, or "signatures", to determine if a protein with a newly determined sequence belongs to a previously identified protein family.
[0138] The terms "homology", "homologous", "substantially similar" and "corresponding substantially" are used interchangeably herein. They refer to nucleic acid fragments wherein changes in one or more nucleotide bases do not affect the ability of the nucleic acid fragment to mediate gene expression or produce a certain phenotype. These terms also refer to modifications of the nucleic acid fragments of the instant invention such as deletion or insertion of one or more nucleotides that do not substantially alter the functional properties of the resulting nucleic acid fragment relative to the initial, unmodified fragment. It is therefore understood, as those skilled in the art will appreciate, that the invention encompasses more than the specific exemplary sequences.
[0139] "Sequence identity" or "identity" in the context of nucleic acid or polypeptide sequences refers to the nucleic acid bases or amino acid residues in two sequences that are the same when aligned for maximum correspondence over a specified comparison window.
[0140] Thus, "percentage of sequence identity" refers to the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the results by 100 to yield the percentage of sequence identity. Useful examples of percent sequence identities include, but are not limited to, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95%, or any integer percentage from 50% to 100%. These identities can be determined using any of the programs described herein.
[0141] Sequence alignments and percent identity or similarity calculations may be determined using a variety of comparison methods designed to detect homologous sequences including, but not limited to, the MegAlign® program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wis.). Within the context of this application it will be understood that where sequence analysis software is used for analysis, that the results of the analysis will be based on the "default values" of the program referenced, unless otherwise specified. As used herein "default values" will mean any set of values or parameters that originally load with the software when first initialized.
[0142] The "Clustal V method of alignment" corresponds to the alignment method labeled Clustal V (described by Higgins and Sharp, CABIOS. 5:151-153 (1989); Higgins, D. G. et al. (1992) Comput. Appl. Biosci. 8:189-191) and found in the MEGALIGN® program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wis.). For multiple alignments, the default values correspond to GAP PENALTY=10 and GAP LENGTH PENALTY=10. Default parameters for pairwise alignments and calculation of percent identity of protein sequences using the Clustal method are KTUPLE=1, GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5. For nucleic acids these parameters are KTUPLE=2, GAP PENALTY=5, WINDOW=4 and DIAGONALS SAVED=4. After alignment of the sequences using the Clustal V program, it is possible to obtain a "percent identity" by viewing the "sequence distances" table in the same program.
[0143] "BLASTN method of alignment" is an algorithm provided by the National Center for Biotechnology Information (NCBI) to compare nucleotide sequences using default parameters.
[0144] It is well understood by one skilled in the art that many levels of sequence identity are useful in identifying polypeptides, from other species, wherein such polypeptides have the same or similar function or activity. Useful examples of percent identities include, but are not limited to, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95%, or any integer percentage from 50% to 100%. Indeed, any integer amino acid identity from 50% to 100% may be useful in describing the present invention, such as 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%. Also, of interest is any full-length or partial complement of this isolated nucleotide fragment.
[0145] "Gene" refers to a nucleic acid fragment that expresses a specific protein, including regulatory sequences preceding (5' non-coding sequences) and following (3' non-coding sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its own regulatory sequences. "Chimeric gene" refers to any gene that is not a native gene, comprising regulatory and coding sequences that are not found together in nature. Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. A "foreign" gene refers to a gene not normally found in the host organism, but that is introduced into the host organism by gene transfer. Foreign genes can comprise native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene that has been introduced into the genome by a transformation procedure.
[0146] The term "genome" as it applies to a plant cells encompasses not only chromosomal DNA found within the nucleus, but organelle DNA found within subcellular components (e.g., mitochondrial, plastid) of the cell.
[0147] "Coding sequence" refers to a DNA sequence that codes for a specific amino acid sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5' non-coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences may include, but are not limited to: promoters, translation leader sequences, introns, polyadenylation recognition sequences, RNA processing sites, effector binding sites and stem-loop structures.
[0148] "Promoter" refers to a DNA sequence capable of controlling the expression of a coding sequence or functional RNA. The promoter sequence consists of proximal and more distal upstream elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a DNA sequence that can stimulate promoter activity, and may be an innate element of the promoter or a heterologous element inserted to enhance the level or tissue-specificity of a promoter. Promoters may be derived in their entirety from a native gene, or be composed of different elements derived from different promoters found in nature, or even comprise synthetic DNA segments. It is understood by those skilled in the art that different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental conditions. It is further recognized that since in most cases the exact boundaries of regulatory sequences have not been completely defined, DNA fragments of some variation may have identical promoter activity. Promoters that cause a gene to be expressed in most cell types at most times are commonly referred to as "constitutive promoters". New promoters of various types useful in plant cells are constantly being discovered; numerous examples may be found in the compilation by Okamuro, J. K., and Goldberg, R. B. Biochemistry of Plants 15:1-82 (1989).
[0149] "Functional variants" of the regulatory sequences (e.g., promoters) are also encompassed by the compositions of the present invention. Functional variants include, for example, the native regulatory sequences of the invention having one or more nucleotide substitutions, deletions or insertions. Functional variants of the invention may be created by site-directed nutagenesis, induced mutation, or may occur as allelic variants (polymorphisms).
[0150] As used herein, a "functional fragment" of a regulatory sequence (e.g. a promoter) is a functional variant formed by one or more deletions from a larger regulatory element. For example, the 5' portion of a sequence with promoter activity may be deleted without abolishing promoter activity, as described by Zhu et al., Plant Cell 7:1681-1689 (1995). Such variants should retain promoter activity, particularly the ability to drive expression in seed or seed tissues. Activity can be measured by Northern blot analysis, reporter activity measurements when using transcriptional fusions, and the like. See, for example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.).
[0151] "Translation leader sequence" refers to a polynucleotide sequence located between the promoter sequence of a gene and the coding sequence. The translation leader sequence is present in the fully processed mRNA upstream of the translation start sequence. The translation leader sequence may affect processing of the primary transcript to mRNA, mRNA stability or translation efficiency. Examples of translation leader sequences have been described (Turner, R. and Foster, G. D., Mol. Biotechnol. 3:225-236 (1995)).
[0152] "3' non-coding sequences", "transcription terminator" or "termination sequences" refer to DNA sequences located downstream of a coding sequence and include polyadenylation recognition sequences and other sequences encoding regulatory signals capable of affecting mRNA processing or gene expression. The polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is exemplified by Ingelbrecht, I. L., et al. Plant Cell 1:671-680 (1989).
[0153] "RNA transcript" refers to the product resulting from RNA polymerase-catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect complementary copy of the DNA sequence, it is referred to as the primary transcript. A RNA transcript is referred to as the mature RNA when it is a RNA sequence derived from post-transcriptional processing of the primary transcript. "Messenger RNA" or "mRNA" refers to the RNA that is without introns and that can be translated into protein by the cell. "cDNA" refers to a DNA that is complementary to, and synthesized from, a mRNA template using the enzyme reverse transcriptase. The cDNA can be single-stranded or converted into double-stranded form using the Klenow fragment of DNA polymerase I. "Sense" RNA refers to RNA transcript that includes the mRNA and can be translated into protein within a cell or in vitro. "Antisense RNA" refers to an RNA transcript that is complementary to all or part of a target primary transcript or mRNA, and that blocks the expression of a target gene (U.S. Pat. No. 5,107,065). The complementarity of an antisense RNA may be with any part of the specific gene transcript, i.e., at the 5' non-coding sequence, 3' non-coding sequence, introns, or the coding sequence. "Functional RNA" refers to antisense RNA, ribozyme RNA, or other RNA that may not be translated but yet has an effect on cellular processes. The terms "complement" and "reverse complement" are used interchangeably herein with respect to mRNA transcripts, and are meant to define the antisense RNA of the message.
[0154] The term "operably linked" refers to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is regulated by the other. For example, a promoter is operably linked with a coding sequence when it is capable of regulating the expression of that coding sequence (i.e., the coding sequence is under the transcriptional control of the promoter). Coding sequences can be operably linked to regulatory sequences in a sense or antisense orientation. In another example, the complementary RNA regions of the invention can be operably linked, either directly or indirectly, 5' to the target mRNA, or 3' to the target mRNA, or within the target mRNA, or a first complementary region is 5' and its complement is 3' to the target mRNA.
[0155] Standard recombinant DNA and molecular cloning techniques used herein are well known in the art and are described more fully in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory: Cold Spring Harbor, N.Y. (1989). Transformation methods are well known to those skilled in the art and are described infra.
[0156] "PCR" or "polymerase chain reaction" is a technique for the synthesis of large quantities of specific DNA segments and consists of a series of repetitive cycles (Perkin Elmer Cetus Instruments, Norwalk, Conn.). Typically, the double-stranded DNA is heat denatured, the two primers complementary to the 3' boundaries of the target segment are annealed at low temperature and then extended at an intermediate temperature. One set of these three consecutive steps is referred to as a "cycle".
[0157] The terms "plasmid", "vector" and "cassette" refer to an extra chromosomal element often carrying genes that are not part of the central metabolism of the cell, and usually in the form of circular double-stranded DNA fragments. Such elements may be autonomously replicating sequences, genome integrating sequences, phage or nucleotide sequences, linear or circular, of a single- or double-stranded DNA or RNA, derived from any source, in which a number of nucleotide sequences have been joined or recombined into a unique construction which is capable of introducing a promoter fragment and DNA sequence for a selected gene product along with appropriate 3' untranslated sequence into a cell. "Transformation cassette" refers to a specific vector containing a foreign gene and having elements in addition to the foreign gene that facilitates transformation of a particular host cell. "Expression cassette" refers to a specific vector containing a foreign gene and having elements in addition to the foreign gene that allow for enhanced expression of that gene in a foreign host (i.e., to a discrete nucleic acid fragment into which a nucleic acid sequence or fragment can be moved.)
[0158] The term "expression", as used herein, refers to the production of a functional end-product (e.g., a mRNA or a protein [either precursor or mature]).
[0159] "Stable transformation" refers to the transfer of a nucleic acid fragment into a genome of a host organism, including both nuclear and organellar genomes, resulting in genetically stable inheritance. In contrast, "transient transformation" refers to the transfer of a nucleic acid fragment into the nucleus, or DNA-containing organelle, of a host organism resulting in gene expression without integration or stable inheritance. Host organisms containing the transformed nucleic acid fragments are referred to as "transgenic" organisms.
[0160] As used herein, "transgenic" refers to a plant or a cell which comprises within its genome a heterologous polynucleotide. Preferably, the heterologous polynucleotide is stably integrated within the genome such that the polynucleotide is passed on to successive generations. The heterologous polynucleotide may be integrated into the genome alone or as part of an expression construct. Transgenic is used herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by the presence of heterologous nucleic acid including those transgenics initially so altered as well as those created by sexual crosses or asexual propagation from the initial transgenic. The term "transgenic" as used herein does not encompass the alteration of the genome (chromosomal or extra-chromosomal) by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant transposition, or spontaneous mutation.
[0161] The present invention concerns a recombinant DNA construct comprising a polynucleotide encoding an ODP1 polypeptide operably linked to a sucrose synthase 2 promoter wherein said construct increases oil content in the seeds of a cruciferous oilseed plant while maintaining normal germination and further wherein the amino acid sequence of said ODP1 polypeptide has at least 80% sequence identity to a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:39, and SEQ ID NO:41.
[0162] In another embodiment, the sequence identity can be at least 90% or 95%.
[0163] In another embodiment the ODP1 polypeptide comprises a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:39, and SEQ ID NO:41.
[0164] In another embodiment, the sucrose synthase 2 promoter comprises: (a) the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73; or (b) a nucleotide sequence comprising a functional fragment of the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73.
[0165] ODP1 is a member of the APETALA2 (AP2) family of proteins that play a role in a variety of biological events including, but not limited to, oil content. The AP2/ERF family of proteins is a plant-specific class of putative transcription factors that have been shown to regulate a wide-variety of developmental processes and are characterized by the presence of an AP2/ERF DNA binding domain. The AP2/ERF proteins have been subdivided into two distinct subfamilies based on whether they contain one (ERF subfamily) or two (AP2 subfamily) DNA binding domains.
[0166] Specifically, AP2 (APETALA2) and EREBPs (ethylene-responsive element binding proteins) are the prototypic members of a family of transcription factors unique to plants, whose distinguishing characteristic is that they contain the so-called AP2 DNA-binding domain. AP2/EREBP genes form a large multigene family, and they play a variety of roles throughout the plant life cycle. AP2/EREBP genes are key regulators of several developmental processes, including floral organ identity determination and leaf epidermal cell identity. In Arabidopsis thaliana, the homeotic gene APETALA2 (AP2) has been shown to control three salient processes during development: (1) the specification of flower organ identity throughout floral organogenesis (Jofuku et al., Plant Cell 6:1211-1225, 1994); (2) establishment of flower meristem identity (Irish and Sussex, Plant Cell 2:8:741-753, 1990); and (3) the temporal and spatial regulation of flower homeotic gene activity (Drews et al., Cell 65:6:991-1002, 1991). DNA sequence analysis suggests that AP2 encodes a theoretical polypeptide of 432 aa, with a distinct 68 aa repeated motif termed the AP2 domain. This domain has been shown to be essential for AP2 functions and contains within the 68 aa, an eighteen amino acid core region that is predicted to form an amphipathic α-helix (Jofuku et al., Plant Cell 6:1211-1225, 1994). Apt-like domain-containing transcription factors have been also been identified in both Arabidopsis thaliana (Okamuro et al., Proc. Natl. Acad. Sci. USA 94:7076-7081, 1997) and in tobacco with the identification of the ethylene responsive element binding proteins (EREBPs) (Ohme-Takagi and Shinshi, Plant Cell 7:2:173-182, 1995). In Arabidopsis, these RAP2 (related to AP2) genes encode two distinct subfamilies of AP2 domain-containing proteins designated AP2-like and EREBP-like (Okamuro et al., Proc. Natl. Acad. Sci. USA 94:7076-7081, 1997). In vitro DNA binding has not been shown to date using the RAP2 proteins. Based upon the presence of two highly conserved motifs YRG and RAYD within the AP2 domain, it has been proposed that binding DNA binding occurs in a manner similar to that of AP2 proteins.
[0167] In another embodiment, the present invention concerns a transgenic cruciferous oilseed plant comprising in its genome the recombinant DNA construct of the invention. Also of interest is a transgenic seed obtained from a transgenic plant as described herein, wherein said seed comprises in its genome a recombinant DNA construct of the invention.
[0168] In still another aspect, the present invention concerns a method for producing a transgenic cruciferous oilseed plant comprising transforming a cruciferous oilseed plant cell with a recombinant construct of the invention and regenerating a transgenic plant from the transformed plant cell.
[0169] This invention concerns a transgenic seed obtained from a transgenic plant made by a method of the invention, wherein said seed comprises in its genome a recombinant DNA construct of the invention.
[0170] In another aspect, the present invention concerns a method for increasing oil content in seeds of a transgenic cruciferous oilseed plant while maintaining normal germination, said method comprising:
[0171] (a) transforming a cruciferous oilseed plant cell with a recombinant DNA construct comprising a polynucleotide encoding an ODP1 polypeptide, wherein the amino acid sequence of said ODP1 polypeptide has at least 80%, at least 90% or at least 95% sequence identity with a sequence selected from the group consisting of SEQ ID NO:37, SEQ ID NO:39, and SEQ ID NO:41, said sequence being operably linked to a seed specific promoter;
[0172] (b) regenerating a transgenic cruciferous oilseed plant from the transformed cell of step (a), wherein said plant comprises the recombinant DNA construct;
[0173] (c) obtaining a transgenic progeny plant derived from the transgenic cruciferous oilseed plant of step (b), wherein the transgenic progeny plant comprises in its genome the recombinant DNA construct;
[0174] (d) assaying the transgenic progeny plant obtained from step (c) for oil level and germination; and
[0175] (e) selecting those transgenic progeny plants having seeds with an increased level of oil and normal germination when compared to seeds obtained from a control cruciferous oilseed plant, wherein said control plant does not comprise the recombinant DNA construct.
[0176] Preferably, the ODP1 polypeptide is a maize ODP1 polypeptide and, more preferably, the amino acid sequence of the ODP1 polypeptide comprises the sequence of SEQ ID NO:37.
[0177] With respect to the seed specific promoter, it can be a sucrose synthase 2 promoter and preferably, the nucleotide sequence of sucrose synthase 2 promoter comprises: (a) the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73; or (b) a nucleotide sequence comprising a functional fragment of the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73.
[0178] The transgenic cruciferous oil seeds described herein of the invention can be processed to yield oil and/or seed by-products.
[0179] In another embodiment, the present invention concerns a recombinant DNA construct comprising a polynucleotide encoding a heterologous polypeptide operably linked to a sucrose synthase 2 promoter, wherein the sucrose synthase 2 promoter comprises: (a) the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73; (b) a nucleotide sequence comprising a functional fragment of the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73; or (c) a nucleotide sequence with at least 80%, at least 90% or at least 95% sequence identity to the nucleotide sequence of SEQ ID NO:43, SEQ ID NO:72 or SEQ ID NO:73; wherein the nucleotide sequence of (a), (b) or (c) has seed-specific promoter activity in a plant. The invention also concerns a transgenic plant, plant cell and seed comprising the recombinant DNA construct. The transgenic plant may be a transgenic cruciferous plant.
[0180] The nucleotide and deduced amino acid sequence of the canola SUS2 homolog transcript model are set forth as SEQ ID NO:44 and SEQ ID NO:45, respectively.
[0181] NCBI GI NO. 150912532 is the nucleotide sequence of the 5'-end of a Brassica oleracea cDNA.
[0182] SEQ ID NO:72 is the nucleotide sequence of the sucrose synthase 2-1 (BnSUS2-1) promoter from Brassica napus that is present in BN SUS2 prom1/PCR blunt. Comparison of SEQ ID NO:72 with SEQ ID NO:44 and NCBI GI NO. 150912532 indicate that nucleotide 427 is at or near the beginning of the 5'-Untranslated region of the canola SUS2 gene. Consequently, a fragment comprising nucleotides 1-426 of SEQ ID NO:72 would be expected to have seed-specific promoter activity in a plant.
[0183] SEQ ID NO:73 is the nucleotide sequence of the sucrose synthase 2-2 (BnSUS2-2) promoter from Brassica napus that is present in BN SUS2 prom2/PCR blunt. Comparison of SEQ ID NO:73 with SEQ ID NO:44 and NCBI GI NO. 150912532 indicate that nucleotide 1766 is at or near the beginning of the 5'-Untranslated region of the canola SUS2 gene. Consequently, a fragment comprising nucleotides 1-1765 of SEQ ID NO:73 would be expected to have seed-specific promoter activity in a plant.
[0184] The cruciferous oilseed plant (or seed) of any of the compositions or methods of the present invention can be canola or Arabidopsis or other plant species including but not limited to the following: Barbarea vulgaris, Brassica campestris, Brassica carinata, Brassica elongate, Brassica fruticulosa, Brassica hirta, Brassica juncea, Brassica napus, Brassica narinosa, Brassica nigra, Brassica oleracea, Brassica perviridis, Brassica rapa, Brassica rupestris, Brassica septiceps, Brassica tournefortii, Brassica verna, Camelina sativa, Crambe abyssinica, Lepidium campestre, Raphanus sativus, Sinapis alba.
[0185] Methods of isolating seed oils are well known in the art: (Young et al., Processing of Fats and Oils, In The Lipid Handbook, Gunstone et al., eds., Chapter 5 pp 253-257; Chapman & Hall: London (1994)). Seed by-products include but are not limited to the following: meal, lecithin, gums, free fatty acids, pigments, soap, stearine, tocopherols, sterols and volatiles.
[0186] The production of edible vegetable oils including canola oil involves two overall processes, mechanical pressing and extraction, and further processing to remove impurities. The techniques used are similar for most vegetable oils produced from the seeds of plants. The crushing and extraction processes utilized by the canola industry today produce very little change to the fatty acid profile of the oil and the nutritional qualities of the meal.
[0187] For example, canola seeds are crushed into two component parts, oil and meal, which are then further manufactured into a wide variety of products.
Further manufacturing, called refining, improves the color, flavor and shelf life of canola oil.
[0188] Canola oil is extracted in several stages. The first stage in processing canola is to roll or flake the seed. This ruptures cells and makes the oil easier to extract. Next the flaked or rolled seeds are cooked and subjected to a mild pressing process which removes some of the oil and compresses the seeds into large chunks called "cake fragments." The cake fragments undergo further processing to remove most of the remaining oil. The oil extracted during each step is combined. The oil is then subjected to processing according to the end product requirements. Different treatments are used to process salad oils, margarines, and shortenings.
[0189] Specifically, canola seed is cleaned by a number of different methods including air aspiration, indent cylinder cleaning, sieve screening, or a combination of these. Cleaning ensures that the seed is free of extraneous plant and other foreign material which is referred to in the industry as "dockage". Seed generally contains less than 2.5% dockage following the cleaning process. Seed that has been cleaned is ready for subsequent crushing into canola oil and meal.
[0190] Seed which will be processed for oil and meal is preconditioned using mild heat treatment, and moisture is then adjusted to improve subsequent oil extraction. Following preconditioning, canola seed is next crushed and flaked and then heated slightly. These processes help to maximize oil recovery. The canola flakes are then "prepressed" in screw presses or expellers to reduce the oil content from about 42% in the seed (on an 8% moisture basis) to between 16-20%. Screw pressing also compresses the flakes into more dense cakes (called "press cake") which facilitates oil extraction.
[0191] Press cake which results from seed processing is next subjected to one of two types of oil extraction to remove much of the remaining oil. Oil may be extracted using either hexane ("solvent") extraction or by "cold-pressing" (also referred to as "expeller pressing"). The end-market into which the oil is sold generally dictates which form of extraction will be used. Hexane is the extraction medium used for the bulk of canola oil which is sold into the commodity grocery chain market as well as to the food industry. Cold-pressed canola oil represents a much smaller volume sold to consumers and is generally marketed in specialty food stores. Both extraction processes result in an oil essentially bland in taste, light yellow in color, and with excellent nutritional and stability properties.
[0192] Hexane extraction reduces the oil content of the press cake to very low levels. Oil recovery from canola seed is approximately 96% when this form of extraction is used. This is accomplished by maximizing contact of the hexane with the press cake through a series of soakings or washings. Residual hexane in the extracted press cake and oil is easily removed by evaporation at low temperature. Solvent residues in oils and meals, when produced in accordance with good manufacturing practice, can be said to be truly insignificant.
[0193] The oil which is produced during the extraction process is referred to as "crude oil" because it contains various compounds which must be removed to ensure a product with good stability and shelf-life. These impurities include phospholipids, mucilaginous gums, free fatty acids, color pigments and fine meal particles. Different methods are used to remove these by-products including water precipitation or organic acids in combination with water. Once removed, these by-products are added to the canola meal fraction in order to increase its feeding value (energy) and make it an even more nutritious product.
[0194] Following water precipitation and/or organic acid processing, the oil will still contain color compounds which, if not removed would make it unattractive to the consumer and also reduce its stability. These compounds are extracted through a process called bleaching. In contrast to what may be implied by the term, bleaching does not involve the use of harsh chemicals. Instead, during the bleaching process, the oil is moved through a natural, diatomaceous clay to remove color compounds and other by-products.
[0195] Deodorization is the final step in the refining of all vegetable oils, including canola. Deodorization involves the use of steam distillation with the objective being the removal of any residual compounds which, if retained, could impart an adverse odor and taste to the oil. The oil produced is referred to as "refined oil".
[0196] In still another embodiment, this invention concerns a transgenic progeny plant obtained from the plant of claim 7 or 12, wherein said transgenic progeny plant comprises the recombinant DNA construct.
[0197] There are a variety of methods for the regeneration of plants from plant tissue. The particular method of regeneration will depend on the starting plant tissue and the particular plant species to be regenerated. The regeneration, development and cultivation of plants from single plant protoplast transformants or from various transformed explants is well known in the art (Weissbach and Weissbach, In: Methods for Plant Molecular Biology, (Eds.), Academic: San Diego, Calif. (1988)). This regeneration and growth process typically includes the steps of selection of transformed cells and culturing those individualized cells through the usual stages of embryonic development through the rooted plantlet stage. Transgenic embryos and seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant growth medium such as soil. Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants. Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important lines. Conversely, pollen from plants of these important lines is used to pollinate regenerated plants. A transgenic plant of the present invention containing a desired polypeptide is cultivated using methods well known to one skilled in the art.
[0198] Normal germination of transgenic plant seed is defined as germination frequency that is very similar to the germination frequency of seed of the untransformed variety under produced under identical conditions.
[0199] In addition to the above discussed procedures, practitioners are familiar with the standard resource materials which describe specific conditions and procedures for: the construction, manipulation and isolation of macromolecules (e.g., DNA molecules, plasmids, etc.); the generation of recombinant DNA fragments and recombinant expression constructs; and, the screening and isolating of clones. See, for example: Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor: NY (1989); Maliga et al., Methods in Plant Molecular Biology, Cold Spring Harbor: NY (1995); Birren et al., Genome Analysis: Detecting Genes, Vol. 1, Cold Spring Harbor: NY (1998); Birren et al., Genome Analysis: Analyzing DNA, Vol. 2, Cold Spring Harbor: NY (1998); Plant Molecular Biology: A Laboratory Manual, eds. Clark, Springer: NY (1997).
[0200] Examples of cruciferous oilseed plants that can be used to practice the invention include, but are not limited to, Brassica species, and Arabidopsis thaliana.
[0201] Assays for gene expression based on the transient expression of cloned nucleic acid constructs have been developed by introducing the nucleic acid molecules into plant cells by polyethylene glycol treatment, electroporation, or particle bombardment (Marcotte et al., Nature 335:454-457 (1988); Marcotte et al., Plant Cell 1:523-532 (1989); McCarty et al., Cell 66:895-905 (1991); Hattori et al., Genes Dev. 6:609-618 (1992); Goff et al., EMBO J. 9:2517-2522 (1990)).
[0202] Transient expression systems may be used to functionally dissect gene constructs (see generally, Maliga et al., Methods in Plant Molecular Biology, Cold Spring Harbor Press (1995)). It is understood that any of the nucleic acid molecules of the present invention can be introduced into a plant cell in a permanent or transient manner in combination with other genetic elements such as vectors, promoters, enhancers etc.
[0203] In addition to the above discussed procedures, practitioners are familiar with the standard resource materials which describe specific conditions and procedures for the construction, manipulation and isolation of macromolecules (e.g., DNA molecules, plasmids, etc.), generation of recombinant organisms and the screening and isolating of clones, (see for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press (1989); Maliga et al., Methods in Plant Molecular Biology, Cold Spring Harbor Press (1995); Birren et al., Genome Analysis: Detecting Genes, 1, Cold Spring Harbor, N.Y. (1998); Birren et al., Genome Analysis: Analyzing DNA, 2, Cold Spring Harbor, N.Y. (1998); Plant Molecular Biology: A Laboratory Manual, eds. Clark, Springer, New York (1997)).
EXAMPLES
[0204] The present invention is further defined in the following Examples, in which parts and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be understood that these Examples, while indicating preferred embodiments of the invention, are given by way of illustration only. From the above discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usages and conditions. Thus, various modifications of the invention in addition to those shown and described herein will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims.
[0205] The meaning of abbreviations is as follows: "seq" means second(s), "min" means minute(s), "h" means hour(s), "d" means day(s), "μL" means microliter(s), "mL" means milliliter(s), "L" means liter(s), "μM" means micromolar, "mM" means millimolar, "M" means molar, "mmol" means millimole(s), "μmole" mean micromole(s), "g" means gram(s), "μg" means microgram(s), "ng" means nanogram(s), "U" means unit(s), "bp" means base pair(s) and "kB" means kilobase(s).
Example 1
Construction of Vector pZBL120×KS336 for Expression of a Zea mays ODP1 Under Control of a Beta-Conglycinin Promoter
[0206] Plasmid pKS332 was constructed via a number of different intermediate vectors. The AscI cassette containing Kti3 Promoter::Not/::Kti3 Terminator from pKS121 (PCT Application No. WO 02/00904) was blunt-end cloned into the NotI (filled-in) site on pBLUESCRIPT® II SK+ (Stratagene) to give pKS121/BS (Seq ID NO:1). The NcoI/NotI fragment from expression vector pDsRed-Express (Clontech) was blunt-end cloned into the NotI (filled-in) site of pKS121/BS to give pDsRedxKS121/BS (SEQ ID NO:2). The BamHI cassette containing Kti3 Promoter::DsRed::Kti3 Terminator in pDS-RED×KS121/BS (SEQ ID NO:1) was ligated into the BamHI site of pKS123 (PCT Application No. WO 02/08269) to give pKS332 (SEQ ID NO:3). A DNA fragment encoding the ODP1 polypeptide from maize, Zm-ODP1, described in U.S. Pat. No. 7,157,621, was synthesized by PCR with primers to introduce NotI sites at both ends. Applicants cDNA clone cde1c.pk003.o22 (SEQ ID NO:319 in U.S. Pat. No. 7,157,621) was used as template in a PCR reaction using primers MWG345 (SEQ ID NO:4) and MWG346 (SEQ ID NO:5). The resulting PCR product was digested with NotI restriction enzyme and ligated into the NotI site of pKS332 to give pKS336 (SEQ ID NO:6). Plasmid pKS336 contains the ZM-ODP1 protein-coding region of cDNA clone cde1c.pk003.o22 fused at its 5' terminus with the promoter of the soybean gene for the α'-subunit of β-conglycinin (Beachy et al. (1985) EMBO J. 4:3047-3053) and at its 3' end with the terminator sequence from the phaseolin gene of common bean, Phaseolus vulgaris (Doyle et al. (1986) J. Biol. Chem. 261:9228-9238). The β-conglycinin promoter directs strong seed-specific expression of transgenes in transformed plants.
[0207] A 5.9 kb DNA fragment containing the ZM-ODP1 and DsRed expression cassettes was excised from KS336 using the restriction enzyme AscI and the ends were filled-in with T4 DNA polymerase (Promega, Madison, USA). This fragment was ligated to linearized DNA of the Agrobacterium tumefaciens binary vector pZBL120, which had been linearized with EcoRI and BamHI and the ends filled-in, to give pZBL120×KS336. The T-DNA of the plant transformation vector pZBL120×KS336 is set forth as SEQ ID NO:7.
[0208] It is noted that the binary vector pZBL120 is identical to the pZBL1 binary vector (American Type Culture Collection Accession No. 209128) described in U.S. Pat. No. 5,968,793, except the NOS promoter was replaced with a 963 bp 35S promoter (NCBI Accession No. V00141; also known as NCBI General Indentifier No. 58821) from nucleotide 6494 to 7456 in the NOS Promoter::nptII::OCS Terminator cassette. The new 35S Promoter::nptII::OCS Terminator cassette serves as a kanamycin (Kan) resistance plant selection marker in pZBL120.
Example 2
Generation and Analysis of Oil Content of Transgenic Arabidopsis Lines Containing a Beta-Conglycinin Promoter::ZM-ODP1::Phaseolin Terminator Expression Cassette
[0209] Plasmid DNA of pZBL120×KS336, containing the beta-conglycinin promoter::ZM-ODP1::phaseolin terminator expression cassette, was introduced into Agrobacterium tumefaciens NTL4 (Luo et al, Molecular Plant-Microbe Interactions (2001) 14(1):98-103) by electroporation. Briefly, 1 μg plasmid DNA was mixed with 100 μL of electro-competent cells on ice. The cell suspension was transferred to a 100 μL electroporation cuvette (1 mm gap width) and electroporated using a BIORAD electroporator set to 1 kV, 4000 and 25 μF. Cells were transferred to 1 mL LB medium and incubated for 2 h at 30° C. Cells were plated onto LB medium containing 50 μg/mL kanamycin. Plates were incubated at 30° C. for 60 h. Recombinant Agrobacterium cultures (500 mL LB, 50 μg/mL kanamycin) were inoculated from single colonies of transformed agrobacterium cells and grown at 30° C. for 60 h. Cells were harvested by centrifugation (5000×g, 10 min) and resuspended in 1 L of 5% (W/V) sucrose containing 0.05% (V/V) Silwet. Arabidopsis plants were grown in soil at a density of 30 plants per 100 cm2 pot in METRO-MIX® 360 soil mixture for 4 weeks (22° C., 16 h light/8 h dark, 100 μE m-2s-1). Plants were repeatedly dipped into the Agrobacterium suspension harboring the binary vector pZBL120×KS336 and kept in a dark, high humidity environment for 24 h. Plants were grown for three to four weeks under standard plant growth conditions described above and plant material was harvested and dried for one week at ambient temperatures in paper bags. Seeds were harvested using a 0.425 mm mesh brass sieve.
[0210] Cleaned Arabidopsis seeds (2 grams, corresponding to about 100,000 seeds) were sterilized by washes in 45 mL of 80% ethanol, 0.01% TRITON® X-100, followed by 45 mL of 30% (V/V) household bleach in water, 0.01% TRITON® X-100 and finally by repeated rinsing in sterile water. Aliquots of 20,000 seeds were transferred to square plates (20×20 cm) containing 150 mL of sterile plant growth medium comprised of 0.5×MS salts, 0.53% (W/V) sorbitol, 0.05 MES/KOH (pH 5.8), 200 μg/mL TIMENTIN®, and 50 μg/mL kanamycin solidified with 10 g/L agar. Homogeneous dispersion of the seed on the medium was facilitated by mixing the aqueous seed suspension with an equal volume of melted plant growth medium. Plates were incubated under standard growth conditions for ten days. Kanamycin-resistant seedlings were transferred to plant growth medium without selective agent and grown for one week before transfer to soil. Plants were grown to maturity and T2 seeds were harvested and plated on selective media containing kanamycin. Approximately 100 events were generated in this manner. Wild-type (WT) control plants were grown in the same flat containing pZBL120×KS336 T1 plants. T2 seed were harvested and oil content was measured by NMR as follows.
[0211] NMR Based Analysis of Seed Oil Content:
[0212] Seed oil content was determined using a Maran Ultra NMR analyzer (Resonance Instruments Ltd, Whitney, Oxfordshire, UK). Samples (e.g., batches of Arabidopsis seed ranging in weight between 5 and 200 mg) were placed into pre-weighed 2 mL polypropylene tubes (Corning Inc, Corning N.Y., USA; Part no. 430917) previously labeled with unique bar code identifiers. Samples were then placed into 96 place carriers and processed through the following series of steps by an ADEPT COBRA 600® SCARA robotic system:
[0213] 1. pick up tube (the robotic arm was fitted with a vacuum pickup devise);
[0214] 2. read bar code;
[0215] 3. expose tube to antistatic device (ensured that Arabidopsis seed were not adhering to the tube walls);
[0216] 4. weigh tube (containing the sample), to 0.0001 g precision;
[0217] 5. take NMR reading; measured as the intensity of the proton spin echo 1 msec after a 22.95 MHz signal had been applied to the sample (data was collected for 32 NMR scans per sample);
[0218] 6. return tube to rack; and
[0219] 7. repeat process with next tube. Bar codes, tubes weights and NMR readings were recorded by a computer connected to the system. Sample weight was determined by subtracting the polypropylene tube weight from the weight of the tube containing the sample.
[0220] Seed oil content (on a % seed weight basis) of Arabidopsis seed was calculated as follows:
mg oil=(NMR signal-2.1112)/37.514;
% oil=[(mg oil)/1000]/[g of seed sample weight]×100.
[0221] Prior to establishing this formula, Arabidopsis seed oil was extracted as follows. Approximately 5 g of mature Arabidopsis seed (cv Columbia) were ground to a fine powder using a mortar and pestle. The powder was placed into a 33×94 mm paper thimble (Ahlstrom #7100-3394; Ahlstrom, Mount Holly Springs, Pa., USA) and the oil extracted during approximately 40 extraction cycles with petroleum ether (BP 39.9-51.7° C.) in a Soxhlet apparatus. The extract was allowed to cool and the crude oil was recovered by removing the solvent under vacuum in a rotary evaporator. Calibration parameters were determined by precisely weighing 11 standard samples of partially purified Arabidopsis oil (samples contained 3.6, 6.3, 7.9, 9.6, 12.8, 16.3, 20.3, 28.2, 32.1, 39.9 and 60 mg of partially purified Arabidopsis oil) weighed to a precision of 0.0001 g) into 2 mL polypropylene tubes (Corning Inc, Corning N.Y., USA; Part no. 430917) and subjecting them to NMR analysis. A calibration curve of oil content (% seed weight basis) to NMR value was established.
[0222] Seed oil content of most transgenic lines was increased when compared to oil content of seed collected from wild-type control plants grown in the same flat. The phenotype of two representative transgenic lines, C00536 and C00576, are described below in detail. Kanamycin-resistant T2 seedlings were transferred from selective growth media to soil. For C00536, thirteen T2 plants were grown with four wild-type (WT) control plants. For C00576 ten T2 plants were grown with seven WT control plants. Plants were grown to maturity, T3 seed were harvested from individual plants and subjected to oil quantitation by NMR.
[0223] Data are summarized in Table 1. Presence of the pZBL120×KS336 transgene is associated with an increase in oil content in transgenic T3 seed when compared Arabidopsis plants of identical genetic background that lack the transgene.
TABLE-US-00001 TABLE 1 Oil Content of T3 Seed of pZBL120xKS336 Transgenics Exp Event ID Plant # % Oil 1 C00536 1 45.7 1 C00536 2 45.1 1 C00536 3 45.0 1 C00536 4 44.6 1 C00536 5 44.0 1 C00536 6 43.7 1 C00536 7 43.5 1 C00536 8 42.8 1 C00536 9 42.7 1 C00536 10 42.0 1 C00536 11 42.0 1 C00536 12 41.9 1 C00536 13 39.9 1 C00536 AVG 43.3 1 WT 1 39.5 1 WT 2 37.5 1 WT 3 37.0 1 WT 4 34.7 1 WT AVG 37.2 2 C00576 1 48.0 2 C00576 2 47.9 2 C00576 3 45.9 2 C00576 4 45.3 2 C00576 5 44.5 2 C00576 6 43.7 2 C00576 7 43.6 2 C00576 8 42.1 2 C00576 9 41.9 2 C00576 10 41.0 2 C00576 AVG 44.4 2 WT 1 42.2 2 WT 2 40.9 2 WT 3 40.4 2 WT 4 39.3 2 WT 5 38.7 2 WT 6 38.0 2 WT 7 37.8 2 WT AVG 39.6
[0224] Transgenic T3 seed selections that no longer segregated for the DsRed marker gene were identified by visual inspection using a suitable light source. For both events non-segregating transgenic seed were planted in soil alongside untransformed WT plants.
[0225] T4 seed were harvested from individual T3 plants and WT controls. Oil content was measured by NMR (Table 2). Presence of the pZBL120×KS336 transgene is associated with an increase in oil content in transgenic T4 seed when compared to Arabidopsis plants of identical genetic background that lack the transgene.
TABLE-US-00002 TABLE 2 Oil Content of T4 Seed of pZBL120xKS336 Transgenics Exp Event ID Plant # % Oil 1 C00536 1 46.5 1 C00536 2 46.5 1 C00536 3 46.4 1 C00536 4 46.3 1 C00536 5 46.3 1 C00536 6 46.2 1 C00536 7 46.2 1 C00536 8 46.2 1 C00536 9 46.2 1 C00536 10 46.1 1 C00536 11 46.0 1 C00536 12 45.8 1 C00536 13 45.2 1 C00536 14 45.1 1 C00536 15 45.1 1 C00536 16 44.5 1 C00536 17 43.5 1 C00536 18 43.4 1 C00536 AVG 45.6 1 WT 1 44.8 1 WT 2 44.6 1 WT 3 42.3 1 WT 4 42.1 1 WT 5 42.0 1 WT AVG 43.2 2 C00536 1 45.7 2 C00536 2 45.6 2 C00536 3 45.6 2 C00536 4 45.4 2 C00536 5 45.4 2 C00536 6 45.4 2 C00536 7 45.4 2 C00536 8 45.4 2 C00536 9 45.4 2 C00536 10 45.1 2 C00536 11 45.1 2 C00536 12 45.0 2 C00536 13 44.8 2 C00536 14 44.7 2 C00536 15 44.6 2 C00536 16 44.5 2 C00536 17 43.5 2 C00536 18 43.1 2 C00536 AVG 45.0 2 WT 1 43.8 2 WT 2 43.3 2 WT 3 42.3 2 WT 4 41.8 2 WT 5 41.5 2 WT 6 40.2 2 WT AVG 42.1 3 C00576 1 45.3 3 C00576 2 44.8 3 C00576 3 44.7 3 C00576 4 44.7 3 C00576 5 44.4 3 C00576 6 44.2 3 C00576 7 44.2 3 C00576 8 44.2 3 C00576 9 44.2 3 C00576 10 44.0 3 C00576 11 43.8 3 C00576 12 43.3 3 C00576 13 43.1 3 C00576 14 43.0 3 C00576 15 41.8 3 C00576 16 41.1 3 C00576 AVG 43.8 3 WT 1 43.8 3 WT 2 42.9 3 WT 3 42.4 3 WT 4 41.9 3 WT 5 41.6 3 WT 6 40.3 3 WT 7 37.5 3 WT 8 41.1 3 WT AVG 41.4 4 C00576 1 46.6 4 C00576 2 46.4 4 C00576 3 46.3 4 C00576 4 46.2 4 C00576 5 46.2 4 C00576 6 46.2 4 C00576 7 46.2 4 C00576 8 45.7 4 C00576 9 45.7 4 C00576 10 45.6 4 C00576 11 45.6 4 C00576 12 45.4 4 C00576 13 45.4 4 C00576 14 45.1 4 C00576 15 45.0 4 C00576 16 44.3 4 C00576 17 44.2 4 C00576 AVG 45.7 4 WT 1 44.7 4 WT 2 44.6 4 WT 3 44.4 4 WT 4 43.7 4 WT 5 43.5 4 WT 6 42.2 4 WT AVG 43.9
[0226] A total of five flats were planted using WT seed and homozygous T4 seed of C00536 and C00576. Twenty-four transgenic T4 plants were grown alongside twelve WT plants. Plants were grown to maturity. From each flat WT and transgenic seed were bulk-harvested. Oil content of bulk seed samples was measured by NMR (Table 3). Presence of the pZBL120×KS336 transgene is associated with an increase in oil content in transgenic T5 seed when compared to Arabidopsis plants of identical genetic background that lack the transgene.
[0227] Seed oil content in a given plant is a highly variable trait that responds strongly to plant growth conditions (Li Y, Beisson F, Pollard M, Ohlrogge J (2006) Oil content of Arabidopsis seeds: The influence of seed anatomy, light and plant-to-plant variation, Phytochemistry 67:904-915). It is therefore important that an increase in oil content associated with a particular strategy is observed in multiple environments, over several generations and under conditions that allow for maximal oil accumulation by isogenic control lines. The increase in oil content associated with presence of the pZBL120×KS336 transgene was consistently observed over three generations and in different growth chambers. The average oil increase associated with two different pZBL120×KS336 transgenic events was at least 2% points and as high as 3.6% points (i.e., an oil increase of as high as 8.5% compared to untransformed WT seed). This oil increase was observed under growth conditions in which untransformed Arabidopsis seed produced the expected levels of oil, indicating that oil seed storage lipid accumulation was operating at maximum levels.
TABLE-US-00003 TABLE 3 Oil Content of T5 Seed of pZBL120xKS336 Transgenics Δ Oil (% Flat ID Event ID Oil (%) Points) Δ quadratureOil (%) A C00576 45.1 1.7 3.9 WT 43.5 B C00576 46.4 1.9 4.2 WT 44.5 C C00576 44.8 2.3 5.5 WT 42.5 D C00576 45.5 2.0 4.7 WT 43.4 E C00576 44.6 2.0 4.7 WT 42.6 AVG C00576 2.0 4.6 A C00536 45.9 3.3 7.8 WT 42.6 B C00536 45.8 3.4 8.1 WT 42.4 C C00536 46.7 4.7 11.2 WT 42.0 D C00536 44.7 3.9 9.6 WT 40.8 E C00536 46.2 2.6 6.0 WT 43.5 AVG C00536 3.6 8.5
Example 3
Construction of Vector pZBL120×KS333 for Expression of a Momordica charantia ODP1 Under Control of a Beta-Conglycinin Promoter
[0228] An ODP1 protein-coding region from balsam pear (Momordica charantia) described in detail in U.S. Pat. No. 7,157,621 was synthesized by PCR with primers to introduce NotI sites at both ends of the gene. Applicants cDNA clone fds1n.pk015.115 was used a template in the PCR reaction using primers MWG339 (SEQ ID NO:8) and MWG340 (SEQ ID NO:9). The resulting PCR product was digested with NotI restriction enzyme and ligated into the NotI site of pKS332 to give pKS333 (SEQ ID NO:10).
[0229] A 6.1 kb DNA fragment containing the MC-ODP1 and DsRed expression cassettes was excised from KS333 using the restriction enzyme AscI, the ends were filled-in with T4 DNA polymerase (Promega, Madison, USA) and the fragment was blunt-end ligated to DNA of the Agrobacterium tumefaciens binary vector pZBL120, which had been linearized with EcoRI and BamHI and the ends filled-in. The resulting plant transformation vector was designated pZBL120×KS333, and the T-DNA of this vector is set forth as SEQ Ill NO:11.
Example 4
Construction of Vector pZBL120×KS334 for Expression of a Glycine max ODP1 Under Control of a Beta-Conglycinin Promoter
[0230] An ODP1 protein-coding region from soybean described in detail in U.S. Pat. No. 7,157,621 was synthesized by PCR with primers to introduce NotI sites at both ends of the gene. Applicants cDNA clone se3.pk0003.f5 was used as template in the PCR reaction using primers MWG341 (SEQ ID NO:12) and MWG342 (SEQ ID NO:13). The resulting PCR product was digested with NotI restriction enzyme and ligated into the NotI site of pKS332 to give pKS334 (SEQ ID NO:14).
[0231] A 6.1 kb DNA fragment containing the GM-ODP1 and DsRed expression cassettes was excised from KS334 using the restriction enzyme AscI, the ends were filled-in with T4 DNA polymerase (Promega, Madison, USA) and the fragment was blunt-end ligated to DNA of the Agrobacterium tumefaciens binary vector pZBL120, which had been linearized with EcoRI and BamHI and the ends filled-in. The resulting plant transformation vector was designated pZBL120×KS334, and the T-DNA of this vector is set forth as SEQ ID NO:15.
Example 5
Generation of Arabidospis Lines Transformed with Momordica charantia ODP1 or Glycine max ODP1 and Analysis of Seed Oil Content
[0232] Binary vector constructs pZBL120×KS333 (Momordica charantia ODP1) and pZBL120×KS334 (Glycine max ODP1) were used for Arabidopsis transformation using the floral dip method as described above. Transgenic lines were selected on plant growth media containing kanamycin. 75 and 190 lines were generated with pZBL120×KS333 and pZBL120×KS334, respectively. T1 plants of all lines were grown with 13 untransformed WT plants in the same growth chamber. Plants were grown to maturity. Seed were harvested form individual plants and oil content was measured by NMR (TABLE 4)
TABLE-US-00004 TABLE 4 Oil Content of T2 seed of pZBL120xKS333 and pZBL120xKS334 Transgenics Arabidopsis Line # of Plants % Oil Range Average % Oil pZBL120xKS333 77 25.5-46.6 41.7 pZBL120xKS334 180 16.0-48.1 40.7 WT 13 31.9-43.2 39.1
[0233] T2 seed of two representative transgenic lines, 4445 (pZBL120×KS333) and 4485 (pZBL120×KS334), had an oil content of 45.1% and 45.2% respectively. T2 seed of these two lines were germinated on selective media, seedlings were transferred to soil, T2 plants were grown to maturity and T3 seed were harvested. After one more round of germination on selective media and seed production for each event five flats were planted with 24 kanamycin-resistant 4445 or 4485 seedlings and 12 WT seedlings. Plants were grown to maturity. From each flat WT and transgenic seed were bulk-harvested. Oil content of bulk seed samples was measured by NMR (Table 5). Presence of the pZBL120×KS333 or pZBL120×KS334 transgenes is associated with an increase in oil content in transgenic T5 seed when compared to Arabidopsis plants of identical genetic background that lack the transgene.
TABLE-US-00005 TABLE 5 Oil Content of T5 seed of pZBL120xKS333 and pZBL120xKS334 Transgenics Event Δ quadratureOil Flat ID Construct ID Oil (%) (% Points) Δ Oil (%) A pZBL120xKS333 4445 44.9 0.7 1.5 WT 44.2 B pZBL120xKS333 4445 45.3 1.8 4.0 WT 43.6 C pZBL120xKS333 4445 46.0 2.4 5.4 WT 43.7 D pZBL120xKS333 4445 44.6 1.4 3.2 WT 43.2 E pZBL120xKS333 4445 43.2 -0.6 -1.4 WT 43.8 AVG pZBL120xKS333 1.1 2.5 A pZBL120xKS334 4485 45.4 2.8 6.7 WT 42.5 B pZBL120xKS334 4485 44.4 1.3 3.1 WT 43.1 C pZBL120xKS334 4485 44.5 1.7 4.0 WT 42.8 D pZBL120xKS334 4485 45.1 1.5 3.3 WT 43.7 E pZBL120xKS334 4485 45.4 1.6 3.8 WT 43.8 AVG pZBL120xKS334 1.8 4.2
[0234] The oil increase associated with presence of the Momordica charantia ODP1 transgene (pZBL120×KS333) is 1.1% points (i.e., an oil increase of 2.5% compared to untransformed WT seed).
[0235] The oil increase associated with presence of the Glycine max ODP1 transgene (pZBL120×KS334) is 1.8% points (i.e., an oil increase of 4.2% compared to untransformed WT seed).
Example 6
Compositional Analysis of Arabidopsis Seed Transformed with Zea mays ODP1, Momordica charantia ODP1 or Glycine max ODP1
[0236] T5 seed of Arabidopsis events C00536, 4445 and 4485 carrying pZBL120×KS336 (Zea mays ODP), pZBL120×KS333 (Momordica charantia ODP1) and pZBL120×KS334 (Glycine max ODP1) transgenes, respectively, and WT seed derived from plants grown alongside each of the transgenic events were subjected to compositional analysis as described below. Seed weight was measured by determining the weight of 100 seed. This analysis was performed in triplicate.
[0237] Tissue preparation.
[0238] Arabidopsis seed (approximately 0.5 g in a 1/2×2'' polycarbonate vial) was ground to a homogeneous paste in a GENOGRINDER® (3×30 sec at 1400 strokes per minute, with a 15 sec interval between each round of agitation). After the second round of agitation the vials were removed and the Arabidopsis paste was scraped from the walls with a spatula prior to the last burst of agitation.
[0239] Determination of Protein Content:
[0240] Protein contents were estimated by combustion analysis on a Thermo FINNIGAN® Flash 1112EA combustion analyzer running in the NCS mode (vanadium pentoxide was omitted) according to instructions of the manufacturer. Triplicate samples of the ground pastes, 4-8 mg, weighed to an accuracy of 0.001 mg on a METTLER-TOLEDO® MX5 micro balance, were used for analysis. Protein contents were calculated by multiplying % N, determined by the analyzer, by 6.25. Final protein contents were expressed on a % tissue weight basis.
[0241] Determination of Non-Structural Carbohydrate Content:
[0242] Sub-samples (30-35 mg) of the ground paste were weighed (to an accuracy of 0.1 mg) into 13×100 mm glass tubes; the tubes had TEFLON® lined screw-cap closures. Three replicates were prepared for each sample tested.
[0243] Lipid extraction was performed by adding 2 ml aliquots of heptane to each tube. The tubes were vortex mixed and placed into an ultrasonic bath (VWR Scientific Model 750D) filled with water heated to 60° C. The samples were sonicated at full-power (˜360 W) for 15 min and were then centrifuged (5 min×1700 g). The supernatants were transferred to clean 13×100 mm glass tubes and the pellets were extracted 2 more times with heptane (2 ml, second extraction; 1 ml third extraction) with the supernatants from each extraction being pooled. After lipid extraction 1 ml acetone was added to the pellets and after vortex mixing, to fully disperse the material, they were taken to dryness in a Speedvac.
[0244] Non-Structural Carbohydrate Extraction and Analysis.
[0245] Two ml of 80% ethanol was added to the dried pellets from above. The samples were thoroughly vortex mixed until the plant material was fully dispersed in the solvent prior to sonication at 60° C. for 15 min. After centrifugation, 5 min×1700 g, the supernatants were decanted into clean 13×100 mm glass tubes. Two more extractions with 80% ethanol were performed and the supernatants from each were pooled. The extracted pellets were suspended in acetone and dried (as above). An internal standard quadrature-phenyl glucopyranoside (100 μl of a 0.5000+/-0.0010 g/100 ml stock) was added to each extract prior to drying in a Speedvac. The extracts were maintained in a desiccator until further analysis.
[0246] The acetone dried powders from above were suspended in 0.9 ml MOPS (3-N[Morpholino]propane-sulfonic acid; 50 mM, 5 mM CaCl2, pH 7.0) buffer containing 100 U of heat-stable quadrature-amylase (from Bacillus licheniformis; Sigma A-4551). Samples were placed in a heat block (90° C.) for 75 min and were vortex mixed every 15 min. Samples were then allowed to cool to room temperature and 0.6 ml acetate buffer (285 mM, pH 4.5) containing 5 U amyloglucosidase (Roche 110 202 367 001) was added to each. Samples were incubated for 15-18 h at 55° C. in a water bath fitted with a reciprocating shaker; standards of soluble potato starch (Sigma S-2630) were included to ensure that starch digestion went to completion.
[0247] Post-digestion the released carbohydrates were extracted prior to analysis. Absolute ethanol (6 ml) was added to each tube and after vortex mixing the samples were sonicated for 15 min at 60° C. Samples were centrifuged (5 min×1700 g) and the supernatants were decanted into clean 13×100 mm glass tubes. The pellets were extracted 2 more times with 3 ml of 80% ethanol and the resulting supernatants were pooled. Internal standard (100 ul quadrature-phenyl glucopyranoside, as above) was added to each sample prior to drying in a Speedvac.
[0248] Sample Preparation and Analysis.
[0249] The dried samples from the soluble and starch extractions described above were solubilized in anhydrous pyridine (Sigma-Aldrich P57506) containing 30 mg/ml of hydroxylamine HCl (Sigma-Aldrich 159417). Samples were placed on an orbital shaker (300 rpm) overnight and were then heated for 1 hr (75° C.) with vigorous vortex mixing applied every 15 min. After cooling to room temperature, 1 ml hexamethyldisilazane (Sigma-Aldrich H-4875) and 100 μl trifluoroacetic acid (Sigma-Aldrich T-6508) were added. The samples were vortex mixed and the precipitates were allowed to settle prior to transferring the supernatants to GC sample vials.
[0250] Samples were analyzed on an Agilent 6890 gas chromatograph fitted with a DB-17MS capillary column (15 m×0.32 mm×0.25 um film). Inlet and detector temperatures were both 275° C. After injection (2 μl, 20:1 split) the initial column temperature (150° C.) was increased to 180° C. at a rate of 3° C./min and then at 25° C./min to a final temperature of 320° C. The final temperature was maintained for 10 min. The carrier gas was H2 at a linear velocity of 51 cm/sec. Detection was by flame ionization. Data analysis was performed using Agilent ChemStation software. Each sugar was quantified relative to the internal standard and detector responses were applied for each individual carbohydrate (calculated from standards run with each set of samples). Final carbohydrate concentrations were expressed on a tissue weight basis.
TABLE-US-00006 TABLE 6 Composition Analysis of pZBL120xKS336, pZBL120xKS333 and pZBL120xKS334 Transgenic Seed and WT Control Seed Seed fructose Oil (%, Weight (μg mg-1 Construct Event ID NMR) Protein % (μg) seed) pZBL120xKS336 C00536 46.7 15.7 24 0.6 WT 42 18.1 24 1 Δ quadratureTG/WT 11.2 -13.3 0.0 -40.0 % glucose sucrose raffinose stachyose (μg mg-1 (μg mg-1 (μg mg-1 (μg mg-1 Construct Event ID seed) seed) seed) seed) pZBL120xKS336 C00536 8.5 17.2 0.4 2.1 WT 12.1 29.2 0.8 3.1 ΔquadratureTG/WT -29.8 -41.1 -50.0 -32.3 % Seed fructose Oil (%, Weight (μg mg-1 Construct Event ID NMR) Protein % (μg) seed) pZBL120xKS333 4445 46 15 21.7 1 WT 43.7 14.8 20.7 1.2 Δ quadratureTG/WT 5.3 1.4 4.8 -16.7 % glucose sucrose raffinose stachyose (μg mg-1 (μg mg-1 (μg mg-1 (μg mg-1 Construct Event ID seed) seed) seed) seed) pZBL120xKS333 4445 7.8 14.6 0.5 2 WT 10.3 26.6 0.6 3.6 Δ quadratureTG/WT -24.3 -45.1 -16.7 -44.4 % Seed fructose Oil (%, Protein Weight (μg mg-1 Construct Event ID NMR) % (μg) seed) pZBL120xKS334 4485 45.4 14.8 20.3 0.6 WT 42.5 14.5 20.7 0.9 Δ TG/WT 6.8 2.1 -1.9 -33.3 % glucose sucrose raffinose stachyose (μg mg-1 (μg mg-1 (μg mg-1 (μg mg-1 Construct Event ID seed) seed) seed) seed) pZBL120xKS334 4485 6.3 11.7 0.5 1.6 WT 10.4 30.4 0.7 3.3 Δ quadratureTG/WT -39.4 -61.5 -28.6 -51.5 %
[0251] Table 6 shows that a reduction of soluble carbohydrates is consistently associated with presence of the pZBL120×KS333, 334 and 336 transgenes. There is no consistent change in protein content or seed weight that can be attributed to the pZBL120×KS333, 334 and 336 transgenes.
Example 7
Germination Assays of Arabidopsis Seed Transformed with Zea mays ODP1, Momordica charantia ODP1 or Glycine max ODP1
[0252] T5 seed of Arabidopsis events C00536, 4445 and 4485 carrying pZBL120×KS336 (Zea mays ODP1), pZBL120×KS333 (Momordica charantia ODP1) and pZBL120×KS334 (Glycine max ODP1) transgenes, respectively, were subjected to germination assays on standard Arabidopsis growth media (see above) containing either 10 g L-1 sucrose or equimolar amounts of sorbitol (5.3 g L-1). Seeds were surface-sterilized and homogeneous dispersion of the seed on the medium was facilitated by mixing the aqueous seed suspension with an equal volume of melted plant growth medium containing the either sucrose or sorbitol. Plates were incubated under standard conditions (22° C., 16 h light/8 h dark, 100 μE m-2s-1) and germination rate and seedling phenotype was scored 14 days after plating (Table 7).
TABLE-US-00007 TABLE 7 Germination Assays for pZBL120xKS336, pZBL120xKS333 and pZBL120xKS334 Transgenic Seeds Altered Total Seedling No Healthy Media Seed Morphology Germination Seedlings Line ID Type (#) (#) (#) (#) C00536 sucrose 93 69 2 22 C00536 sucrose 84 50 3 31 C00536 sucrose 90 73 3 14 C00536 sorbitol 95 6 89 0 C00536 sorbitol 112 24 88 0 C00536 sorbitol 100 49 51 0 4445 sucrose 82 24 22 36 4445 sucrose 63 24 7 32 4445 sucrose 94 36 12 46 4445 sorbitol 106 70 36 0 4445 sorbitol 119 77 42 0 4445 sorbitol 106 97 9 0 4485 sucrose 98 50 48 0 4485 sucrose 109 37 70 2 4485 sucrose 129 80 39 10 4485 sorbitol 131 24 107 0 4485 sorbitol 128 25 103 0 4485 sorbitol 127 23 102 2 Altered Seedling No Healthy Media Morphology Germination Seedlings Line ID Type (%) (%) (%) C00536 sucrose 74.2 2.2 23.7 C00536 sucrose 59.5 3.6 36.9 C00536 sucrose 81.1 3.3 15.6 AVG 71.6 3.0 25.4 C00536 sorbitol 6.3 93.7 0.0 C00536 sorbitol 21.4 78.6 0.0 C00536 sorbitol 49.0 51.0 0.0 AVG 25.6 74.4 0.0 4445 sucrose 29.3 26.8 43.9 4445 sucrose 38.1 11.1 50.8 4445 sucrose 38.3 12.8 48.9 AVG 35.2 16.9 47.9 4445 sorbitol 66.0 34.0 0.0 4445 sorbitol 64.7 35.3 0.0 4445 sorbitol 91.5 8.5 0.0 AVG 74.1 25.9 0.0 4485 sucrose 51.0 49.0 0.0 4485 sucrose 33.9 64.2 1.8 4485 sucrose 62.0 30.2 7.8 AVG 49.0 47.8 3.2 4485 sorbitol 18.3 81.7 0.0 4485 sorbitol 19.5 80.5 0.0 4485 sorbitol 18.1 80.3 1.6 AVG 18.7 80.8 0.5
[0253] It is evident that germination and/or seedling development is significantly affected in all events analyzed. Germination is improved in the presence of sucrose; however, in events carrying pZBL120×KS336 and pZBL120×KS334 the seed germinating on sucrose containing media gave rise to seedlings with altered morphology, namely the presence of leaf structures that fail to become green and which resemble non-photosynthetic cotyledon tissue.
[0254] Total fatty acid (FA) composition and content of seedling tissue of C00536, 4485 and WT seedlings were measured 14 days after plating on media containing 10 g L-1 sucrose. Briefly, seedling tissue was frozen on dry ice or by incubation in a -80° C. freezer for two h followed by lyophilization for 48 h.
[0255] Dried seedling tissue was ground to a fine powder using a GENOGRINDER® vial (1/2''×2'' polycarbonate) and a steel ball (SPEX Centriprep (Metuchen, N.J., U.S.A.). Grinding time was 30 sec at 1450 oscillations per min. Ten mg of tissue were weighed into Eppendorf tubes. The tissue was extracted using 100 μL heptane at room temperature under continuous shaking for 2 h. Heptane extracts were cleared by centrifugation and 25 quadratureL of extract was derivatized to fatty acid methyl esters as follows. One mL of a 25% sodium methoxide stock solution was added to 24 mL of HPLC grade methanol. Sodium methoxide was stored under an inert gas.
[0256] Five μL of a 17:0 TAG (Nu-Chek Prep, Elysian, Minn., USA) stock solution (10 mg/mL) was combined with 25 μL of heptane tissue extract in a glass culture tube and 500 μL of 1% sodium methoxide was added. Samples were derivatized in a water bath at 50° C. for 15 min. Samples were allowed to cool to RT and 1 mL of 1M NaCl was added followed by brief mixing. FAMEs were extracted into 1 mL of heptane and 4 μL sample were quantitated by GC analysis (Table 8).
TABLE-US-00008 TABLE 8 Fatty Acid Composition and Total Fatty Acid Content of Seedling Tissue of WT Plants and pZBL120xKS334 and pZBL120xKS336 Transgenic Plants Grown on Sucrose-Containing Media % Total FA Total FA Event ID 16:0 18:0 18:1 18:2 18:3 20:0 20:1 20:2 22:0 (% DW) WT 13.5 10.0 42.3 15.2 15.6 1.0 0.9 0.0 1.5 4.3 4485 11.8 2.7 13.2 26.6 26.7 2.5 13.5 2.2 0.7 18.6 C00536 7.9 2.3 15.0 17.7 32.4 3.2 18.2 2.0 1.2 21.9
[0257] Table 8 demonstrates that seedling tissue of transgenic lines carrying pZBL120×KS334 and pZBL120×KS336 transgenes showed increased fatty acid content when compared to WT seedlings. Moreover, the fatty acid profile of transgenic seedling tissue is similar to that of Arabidopsis WT seed in that it contains significant levels (>15%) of C20 fatty acids.
[0258] In summary, use of a strong heterologous seed storage protein promoter (soybean β-conglycinin promoter) for expression in Arabidopsis of ODP1 genes from a diverse range of plant species belonging to the families of Leguminosae, Cucurbitaceae and Poaceae, resulted in increased seed storage lipid accumulation at the expense of soluble carbohydrates. However, seed germination and seedling establishment was negatively affected in transgenic lines expressing ODP1 genes under control of a strong heterologous seed storage protein promoter.
Example 8
Construction of Arabidopsis Expression Vector pKR1223 for Expression of Zea mays ODP Under Control of the Seed-Specific, Low Strength Arabidopsis Sucrose Synthase Promoter
[0259] The present example describes the synthesis of Arabidopsis expression vector pKR1223 which allows for expression of the Zea mays ODP gene under control of the promoter of an Arabidopsis sucrose synthase gene (At5g49190). Additionally, vector pKR1223 provides seed-specific expression of the DsRed gene in order to visualize positive transformants as well as constitutive expression of the npt gene for selection on kanamycin.
[0260] Plasmid pKR132 (SEQ ID NO:16) which is described in PCT Publication No. WO 2004/071467 (the contents of which are incorporated by reference), was digested with BamHI/SalI and the fragment containing the soy albumin promoter was cloned into the BamHI/XhoI fragment of the pCR-Blunt® cloning vector (Invitrogen Corporation) to produce the starting vector pKR627 (SEQ ID NO:17).
[0261] Plasmid KS294 (SEQ ID NO:18) contains a NotI site flanked by the SCP1 promoter and the phaseolin transcription terminator (SCP1Pro::NotI::PhasTerm). The SCP1 promoter is a synthetic constitutive promoter comprising a portion of the CaMV 35S promoter (Odell et al. (1985) Nature 313:810-812) and the Rsyn7-Syn II Core synthetic consensus promoter (U.S. Pat. Nos. 6,072,050 and 6,555,673, the contents of which are incorporated by reference). See also, for example, US20030226166, Table 13 (the contents of which are incorporated by reference). Downstream of this element is the Tobacco Mosaic Virus (TMV) omega 5'-UTR translational enhancer element (Gallie et al. (1992) Nucleic Acid Research 20:4631-4638), followed by the NotI site and the 3' transcription termination region of the phaseolin gene (Doyle et al., (1986) J. Biol. Chem. 261:9228-9238). The XbaI fragment of KS294 (SEQ ID NO:18), containing the SCP1Pro::NotI::PhasTerm cassette, was cloned into the XbaI site of pKR627 (SEQ ID NO:17) to produce pKR1142 (SEQ ID NO:19).
[0262] The BamHI fragment of KS334 (SEQ ID NO:14; Example 1), containing the Kti3Pro:DsRed:Kti3Term cassette, was cloned into the BamHI site of pKR278 (SEQ ID NO:20), which was previously described in U.S. Patent Publication No. US20080095915 (the contents of which are incorporated by reference), to produce vector pKR1141 (SEQ ID NO:20).
[0263] Genomic DNA was isolated from 3 week-old wild-type Arabidopsis col-0 seedlings using the DNEASY® Plant Mini Kit (Qiagen, Valencia, Calif.) and following the manufacture's protocol. An Arabidopsis Sucrose Synthase ("AtSuSy"; "AtSUS2") promoter derived from gene At5g49190 was PCR-amplified from Arabidopsis genomic DNA using oligonucleotides SuSy-5 (SEQ ID NO:21) and SuSy-3 (SEQ ID NO:22) with the PHUSION® High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland), following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR®-BLUNT® cloning vector using the ZERO BLUNT® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pLF122 (SEQ ID NO:23).
[0264] The BamHI/NotI fragment of pLF122 (SEQ ID NO:23), containing the AtSuSy promoter, was cloned into the BamHI/NotI fragment of pKR1142 (SEQ ID NO:19), containing the phaseolin terminator, to produce pKR1155 (SEQ ID NO:24).
[0265] The Asp718/BsiWI fragment of pKR1155 (SEQ ID NO:24), containing the AtSuSy promoter, was cloned into the BsiWI site of pKR1141 (SEQ ID NO:20), to produce pKR1158 (SEQ ID NO:25).
[0266] The NotI fragment of KS336 (SEQ ID NO:6; Example 1), containing the corn ODP, was cloned into the NotI site of pKR1158 (SEQ ID NO:25), to produce pKR1167 (SEQ ID NO:26).
[0267] The AscI fragment of pKR1167 (SEQ ID NO:26), containing the corn ODP gene, was cloned into the AscI fragment of pKR92 (SEQ ID NO:27) which was previously described in WO2007/061845 (published on May 31, 2007, the contents of which are herein incorporated by reference) to produce pKR1223 (SEQ ID NO:28).
Example 9
Construction of Arabidopsis Expression Vector pKR1220 for Expression of the Corn ODP Under Control of the Seed-Specific, Medium-Strength Soy Annexin Promoter
[0268] The present example describes the synthesis of Arabidopsis expression vector pKR1220 which allows for seed-specific expression of the corn ODP gene under control of the soy annexin promoter. Additionally, vector pKR1220 provides seed-specific expression of the DsRed gene in order to visualize positive transformants and constituitive expression of the npt gene for selection on kanamycin.
[0269] The BsiWI fragment of pKR268 (SEQ ID NO:29; which is described in PCT Publication No. WO 04/071467, the contents of which are herein incorporated by reference), containing the AnnexinPro::NotI::BD30Term cassette, was cloned into the BsiWI site of pKR1141 (SEQ ID NO:20) to give pKR1143 (SEQ ID NO:30).
[0270] The NotI fragment of KS336 (SEQ ID NO:6), containing the corn ODP1 gene, was cloned into the NotI site of pKR1143 (SEQ ID NO:30), to produce pKR1147 (SEQ ID NO:31).
[0271] The AscI fragment of pKR1147 (SEQ ID NO:31), containing the corn ODP1 gene, was cloned into the AscI fragment of pKR92 (SEQ ID NO:27) to produce pKR1220 (SEQ ID NO:32).
Example 10
Construction of Arabidopsis Expression Vector pKR1221 for Expression of the Corn ODP Under Control of the Constitutive, Medium Strength SCP1 Promoter
[0272] The present example describes the synthesis of Arabidopsis expression vector pKR1221 which allows for constituitive expression of the corn ODP1 gene under control of the SCP1 promoter. Additionally, vector pKR1221 provides seed-specific expression of the DSred gene in order to visualize positive transformants and constituitive expression of the npt gene for selection on kanamycin.
[0273] The Asp718/BsiWI fragment of pKR1142 (SEQ ID NO:19), containing the SCP1Pro::NotI::PhasTerm cassette, was cloned into the BsiWI site of pKR1141 (SEQ ID NO:20), to produce pKR1144 (SEQ ID NO:33).
[0274] The NotI fragment of KS336 (SEQ ID NO:6), containing the corn ODP1, was cloned into the NotI site of pKR1144 (SEQ ID NO:33), to produce pKR1149 (SEQ ID NO:34).
[0275] The AscI fragment of pKR1149 (SEQ ID NO:34), containing the corn ODP1 gene, was cloned into the AscI fragment of pKR92 (SEQ ID NO:27) to produce pKR1221 (SEQ ID NO:35).
Example 11
Generation and Analysis of T2 Seed of Arabidopsis Lines Transformed with Corn ODP Under Control of the SCP1, Annexin or Sucrose Synthase Promoters
[0276] Plasmid DNA of pKR1220, pKR1221 and pKR1223 was introduced into Agrobacterium tumefaciens NTL4 (Luo et al, Molecular Plant-Microbe Interactions (2001) 14(1):98-103) by electroporation. Briefly, 1 μg plasmid DNA was mixed with 100 μL of electro-competent cells on ice. The cell suspension was transferred to a 100 μL electroporation cuvette (1 mm gap width) and electroporated using a BIORAD electroporator set to 1 kV, 4000 and 25 μF. Cells were transferred to 1 mL LB medium and incubated for 2 h at 30° C. Cells were plated onto LB medium containing 50 μg/mL kanamycin. Plates were incubated at 30° C. for 60 h. Recombinant Agrobacterium cultures (500 mL LB, 50 μg/mL kanamycin) were inoculated from single colonies of transformed Agrobacterium cells and grown at 30° C. for 60 h. Cells were harvested by centrifugation (5000×g, 10 min) and resuspended in 1 L of 5% (W/V) sucrose containing 0.05% (V/V) Silwet. Arabidopsis plants were grown in soil at a density of 30 plants per 100 cm2 pot in METRO-MIX® 360 soil mixture for 4 weeks (22° C., 16 h light/8 h dark, 100 μE m-2s-1). Plants were repeatedly dipped into the Agrobacterium suspension harboring the relevant binary vector and kept in a dark, high humidity environment for 24 h. Plants were grown for three to four weeks under standard plant growth conditions described above and plant material was harvested and dried for one week at ambient temperatures in paper bags. Seeds were harvested using a 0.425 mm mesh brass sieve.
[0277] Cleaned Arabidopsis seeds (2 grams, corresponding to about 100,000 seeds) were sterilized by washes in 45 mL of 80% ethanol, 0.01% TRITON® X-100, followed by 45 mL of 30% (V/V) household bleach in water, 0.01% TRITON® X-100 and finally by repeated rinsing in sterile water. Aliquots of 20,000 seeds were transferred to square plates (20×20 cm) containing 150 mL of sterile plant growth medium comprised of 0.5×MS salts, 0.53% (W/V) sorbitol, 0.05 MES/KOH (pH 5.8), 200 μg/mL TIMENTIN®, and 50 μg/mL kanamycin solidified with 10 g/L agar. Homogeneous dispersion of the seed on the medium was facilitated by mixing the aqueous seed suspension with an equal volume of melted plant growth medium. Plates were incubated under standard growth conditions for ten days. Kanamycin-resistant seedlings were transferred to plant growth medium without selective agent and grown for one week before transfer to soil. Plants were grown to maturity and T2 seeds were harvested and plated on selective media containing kanamycin. Approximately 100 events were generated in this manner. Wild-type control plants were grown in the same flat containing transgenic T1 plants. T2 seeds were harvested and oil content was measured by NMR (Tables 9 and 10).
TABLE-US-00009 TABLE 9 Data from Germination Assays for T2 Seed of pKR1220, pKR1221 and pKR1223 Transgenics on Selective Medium Containing Kanamycin and Sorbitol Trans- No Healthy Total genic Germi- Seed- Δ Oil Event Seed Seed ASM* KanS nation Lings % ID pKR (#) (#) (#) (#) (#) (#) points 35634 1220 122 110 11 12 31 68 2.6 36062 1220 134 127 25 7 85 17 2.4 35637 1220 147 133 16 14 100 17 2.4 36066 1220 143 123 22 20 59 42 2 35636 1220 116 105 19 11 62 24 1.7 36059 1220 101 85 14 16 52 19 1.6 36104 1221 104 104 6 0 96 2 4.7 36078 1221 83 66 0 17 66 0 3 36087 1221 93 89 0 4 89 0 2 36090 1221 103 103 1 0 98 4 1.9 36101 1221 134 126 0 8 126 0 1.7 36122 1221 108 92 0 16 92 0 1.7 36162 1223 92 83 8 9 20 55 5.3 36210 1223 112 111 2 1 21 88 4.4 36151 1223 144 142 66 2 40 36 3.6 36194 1223 94 91 14 3 11 66 3.4 36157 1223 101 77 14 24 10 53 3.4 36181 1223 160 149 15 11 88 46 3.3 36199 1223 103 95 17 8 12 66 3.2 36208 1223 119 110 22 9 20 68 3.1 36161 1223 134 120 19 14 33 68 3 36200 1223 144 140 0 4 101 39 2.8 36154 1223 110 99 10 11 7 82 2.7 36209 1223 109 106 10 3 31 65 2.6 36179 1223 172 147 10 25 68 69 2.6 36180 1223 162 149 16 13 51 82 2.6 36213 1223 146 127 22 19 57 48 2.4 36206 1223 86 79 17 7 0 62 2.2 *ASM denotes Altered Seedling Morphology
TABLE-US-00010 TABLE 10 Results from Germination Assays for T2 Seed of pKR1220, pKR1221 and pKR1223 Transgenics on Selective Medium Containing Kanamycin and Sorbitol % No % Healthy Δ Oil % Event ID pKR % ASM* Germination Seedlings Points 35634 1220 10.0 28.2 61.8 2.6 36062 1220 19.7 66.9 13.4 2.4 35637 1220 12.0 75.2 12.8 2.4 36066 1220 17.9 48.0 34.1 2.0 35636 1220 18.1 59.0 22.9 1.7 36059 1220 16.5 61.2 22.4 1.6 AVG 15.7 56.4 27.9 2.1 36104 1221 5.8 92.3 1.9 4.7 36078 1221 0.0 100.0 0.0 3.0 36087 1221 0.0 100.0 0.0 2.0 36090 1221 1.0 95.1 3.9 1.9 36101 1221 0.0 100.0 0.0 1.7 36122 1221 0.0 100.0 0.0 1.7 AVG 1.1 97.9 1.0 2.5 36162 1223 9.6 24.1 66.3 5.3 36210 1223 1.8 18.9 79.3 4.4 36151 1223 46.5 28.2 25.4 3.6 36194 1223 15.4 12.1 72.5 3.4 36157 1223 18.2 13.0 68.8 3.4 36181 1223 10.1 59.1 30.9 3.3 36199 1223 17.9 12.6 69.5 3.2 36208 1223 20.0 18.2 61.8 3.1 36161 1223 15.8 27.5 56.7 3.0 36200 1223 0.0 72.1 27.9 2.8 36154 1223 10.1 7.1 82.8 2.7 36209 1223 9.4 29.2 61.3 2.6 36179 1223 6.8 46.3 46.9 2.6 36180 1223 10.7 34.2 55.0 2.6 36213 1223 17.3 44.9 37.8 2.4 36206 1223 21.5 0.0 78.5 2.2 AVG 14.4 28.0 57.6 3.2 *"ASM" denotes Altered Seedling Morphology
Example 12
Analysis of T3 and T4 Seed of Arabidopsis Plants Transformed with Zea mays ODP Under Control of the Arabidopsis Sucrose Synthase Promoter
[0278] T2 seeds of pKR1223 transformation events 36162, 36180 and 36181 were germinated on selective media containing kanamycin. Twenty-four kanamycin-resistant seedlings were planted in soil along side twelve untransformed WT Arabidopsis plants. Plants were grown to maturity and T3 seed samples were harvested from individual T2 plants. A bulk seed sample was generated from all WT plants in a given flat. Oil content was measured by NMR (Table 11).
TABLE-US-00011 TABLE 11 Oil Content of T3 Seed of pKR1223 Transgenics Plant % Event # oil 36162 1 44.6 36162 2 44.5 36162 3 44.4 36162 4 44.3 36162 5 44.3 36162 6 44.2 36162 7 44.2 36162 8 43.9 36162 9 43.8 36162 10 43.7 36162 11 43.7 36162 12 43.7 36162 13 43.7 36162 14 43.7 36162 15 43.6 36162 16 43.5 36162 17 43.5 36162 18 43.5 36162 19 43.4 36162 20 43.0 36162 21 42.8 36162 22 42.2 36162 23 41.8 36162 24 36.4 36162 AVG 43.4 WT in 36162 Exp. AVG 41.8 36180 1 44.5 36180 2 44.3 36180 3 43.8 36180 4 43.8 36180 5 43.7 36180 6 43.6 36180 7 43.6 36180 8 43.6 36180 9 43.5 36180 10 43.4 36180 11 43.3 36180 12 43.3 36180 13 43.3 36180 14 43.3 36180 15 43.2 36180 16 43.2 36180 17 43.1 36180 18 43.1 36180 19 42.9 36180 20 42.9 36180 21 42.8 36180 22 42.8 36180 23 42.7 36180 24 42.6 36180 AVG 43.3 WT in 36180 Exp. AVG 41.9 36181 1 47.2 36181 2 46.3 36181 3 46.2 36181 4 46.1 36181 5 45.9 36181 6 45.7 36181 7 45.4 36181 8 45.0 36181 9 45.0 36181 10 45.0 36181 11 45.0 36181 12 44.9 36181 13 44.9 36181 14 44.8 36181 15 44.7 36181 16 44.6 36181 17 44.5 36181 18 44.4 36181 19 44.4 36181 20 43.8 36181 21 43.8 36181 22 43.6 36181 23 43.3 36181 24 42.6 36181 AVG 44.9 WT in 36181 Exp. AVG 41.9
[0279] Transgenic T3 seed selections of events 36180 and 36162 that no longer segregated for the DsRed marker gene were identified by visual inspection using a suitable light source. These T3 selections that were homozygous for the pKR1223 transgene were subjected to germination assays on plant growth media containing sucrose or sorbitol as described above (Table 12).
TABLE-US-00012 TABLE 12 Germination Assays for T3 Seed of pKR1223 Transgenics Total No Healthy Media Seed ASM * Germination Seedlings Event Type (#) (#) (#) (#) 36180 sucrose 83 0 0 83 36180 sucrose 111 0 0 111 36180 sucrose 110 0 0 110 36180 sorbitol 121 0 0 121 36180 sorbitol 128 0 0 128 36180 sorbitol 118 0 0 118 36162 sucrose 88 0 0 88 36162 sucrose 111 1 1 109 36162 sucrose 90 0 0 90 36162 sorbitol 97 0 0 97 36162 sorbitol 103 0 0 103 36162 sorbitol 107 2 0 105 No Healthy Media ASM * Germination Seedlings Event Type (%) (%) (%) 36180 sucrose 0.0 0.0 100.0 36180 sucrose 0.0 0.0 100.0 36180 sucrose 0.0 0.0 100.0 36180 sucrose AVG 0.0 0.0 100.0 36180 sorbitol 0.0 0.0 100.0 36180 sorbitol 0.0 0.0 100.0 36180 sorbitol 0.0 0.0 100.0 36180 sorbitol AVG 0.0 0.0 100.0 36162 sucrose 0.0 0.0 100.0 36162 sucrose 0.9 0.9 98.2 36162 sucrose 0.0 0.0 100.0 36162 sucrose AVG 0.3 0.3 99.4 36162 sorbitol 0.0 0.0 100.0 36162 sorbitol 0.0 0.0 100.0 36162 sorbitol 1.9 0.0 98.1 36162 sorbitol AVG 0.6 0.0 99.4 * "ASM" denotes Altered Seedling Morphology
[0280] Transgenic T3 seed selections of events 36180 and 36162 that no longer segregated for the DsRed marker gene were identified by visual inspection using a suitable light source. In case of event 36181 no T3 seed selections could be identified that did not segregate for the DS red marker in a total of 24 progeny seed samples derived from 24 kanamycin-resistant T2 plants. Moreover, when T3 seed were plated on selective agarose media, 25% of seed failed to germinate and 25% of the seedlings were sensitive to kanamycin. It is concluded that the transgene insertion in event 36181 can only be maintained in the heterozygous state. The homozygous nature of T3 seed selections of events 36180 and 36162 suggests that the seed phenotype of event 36181 is related to the transgene insertion site and not the transgene itself. It is believed that a gene that is important for development of viable seed was disrupted by the transgene insertion.
[0281] T3 seed selections of events 36180 and 36162 that were homozygous for the transgene insertion and T3 seed selections of event 36181 that were heterozygous for the transgene insertion were germinated on selective media containing kanamycin. Three flats were planted for every transgenic event as follows: 24 seedlings were planted in each flat next to 12 WT seedlings at identical developmental stage. Plants were grown to maturity for approximately eight weeks and seed were harvested in bulk from all transgenic and WT plants in a given flat. Oil content of seed was measured by NMR as described in Example 1. Results are summarized in Table 13. In all three events presence of the pKR1223-derived transgene leads to an increase in oil content that ranges between 0.7 and 2.2% points (1.6-5.4%).
TABLE-US-00013 TABLE 13 Oil Content of T4 Seed of pKR1223 Transgenics Event Δ Oil (% Flat ID ID Oil (%) Points) Δ Oil (%) A 36181 42.8 2.2 5.4 WT 40.6 B 36181 43.5 2.1 5.2 WT 41.4 C 36181 40.8 1.5 4.0 WT 39.2 AVG 2.0 4.9 A 36180 44.5 1.8 4.2 WT 42.7 B 36180 43.6 1.9 4.6 WT 41.7 C 36180 43.2 1.2 2.8 WT 42.0 AVG 1.6 3.9 A 36162 43.3 1.4 3.4 WT 41.9 B 36162 43.6 0.7 1.6 WT 42.9 C 36162 43.8 1.0 2.4 WT 42.7 AVG 1.0 2.5
[0282] T4 seed of events 36162 and 36180 were subjected to compositional analysis as described in Example 6.
TABLE-US-00014 TABLE 14 Composition of pKR1223 Transgenic T4 Seed and WT Control Seed Seed Fructose Oil (%, Protein Weight (μg mg-1 Event NMR) (%) (μg) seed) 36162 43.3 14.94 20.33 2.13 WT 41.9 15.05 19 2.39 Δ quadratureTG/WT 3.3 -0.7 7.0 -10.9 % Glucose Sucrose Raffinose Stachyose (μg mg-1 (μg mg-1 (μg mg-1 (μg mg-1 Event seed) seed) seed) seed) 36162 4.82 11.32 0.56 1.52 WT 5.17 14.28 0.64 1.58 ΔTG/WT -6.8 -20.7 -12.5 -3.8 % Seed Fructose Oil (%, Protein Weight (μg mg-1 Event NMR) (%) (μg) seed) 36180 43.6 15.17 21 2.07 WT 41.7 15.16 21 2.45 Δ TG/WT 4.6 0.1 0.0 -15.5 % Glucose Sucrose Raffinose Stachyose (μg mg-1 (μg mg-1 (μg mg-1 (μg mg-1 Event seed) seed) seed) seed) 36180 4.49 11.14 0.5 1.46 WT 4.97 14.08 0.57 1.45 Δ quadratureTG/WT -9.7 -20.9 -12.3 0.7 %
[0283] A reduction of soluble carbohydrates (mainly sucrose) was consistently associated with the presence of the pKR1223 transgene in events 36162 and 36180. There was no consistent change in protein content or seed weight that can be attributed to the presence of the transgene.
[0284] In summary, use of a promoter of the Arabidopsis sucrose synthase (SUS2) gene (At5g49190) for expression of maize ODP1 resulted in increased seed storage lipid accumulation at the expense of soluble carbohydrates. Seed germination and seedling establishment was not affected.
Example 13
Identification of Seed Specific Promoters to Drive ODP1 Expression in Cruciferous Oilseed Plants
[0285] The sucrose synthase gene family and the role of specific gene family members during seed development, specifically the mobilization of sucrose for seed storage compound biosynthesis, has been described (Ruuska S A, Girke T, Benning C and Ohlrogge J B (2002) Contrapuntal networks of gene expression during Arabidopsis seed filling. Plant Cell 14: 1191-1206; Baud S, Vaultier M-N and Rochat C (2004) Structure and expression profile of the sucrose synthase multigene family in Arabidopsis. J Exp Bot 55: 397-409; and Baud S and Graham I A (2006) A spatiotemporal analysis of enzymatic activities associated with carbon metabolism in wild-type and mutant embryos of Arabidopsis using in situ histochemistry. Plant J 46: 155-169). The current invention describes the unexpected utility of a promoter sequence of a specific gene family member, At5g49190, to direct expression of heterologous ODP1 genes in a manner that allows for increased accumulation of oil during seed development of cruciferous oil seed without affecting germination and seedling establishment of the resulting seed. At5g49190 is expressed during seedling development in synchrony with accumulation of oil and protein (supra). Genes homologous to At5g49190 can be identified in other plant species based on sequence similarity to the At5g49190 gene product and expression pattern of the homolog during seed development. One skilled in the art will recognize that promoter sequences of these genes will have utility for expression of ODP1 genes for increased oil biosynthesis in cruciferous oil seed which is accompanied by unaltered seed germination and seedling establishment.
Example 14
Identification of Canola Promoters to Drive ODP1 Expression in Cruciferous Oilseed Plants
[0286] Public EST and genomic sequence collections of Canola were searched with the deduced amino acid sequence of At5g49190 (AtSUS2). Several ESTs and genomic sequences were identified and assembled into a single contiguous sequence that represents a transcript model of the canola homolog of At5g49190. The nucleotide and deduced amino acid sequence of the canola SUS2 homolog transcript model are set forth as SEQ ID NO:44 and SEQ ID NO:45, respectively.
[0287] Primers a (SEQ ID NO:46) b (SEQ ID NO:47) c (SEQ ID NO:48) and d (SEQ ID NO:49) were used in genome walking experiments according to manufacturer instructions (Clontech, CA, USA). Briefly genomic DNA of Pioneer Hi-Bred International, Inc., spring canola variety NS1822BC was isolated using standard protocols and digested with PvuII or DraI. After adaptor ligation PCR PvuII and DraI-digested genomic DNA was used as template in PCR reactions with Primer a (SEQ ID:46) and Primer c (SEQ ID NO:48), respectively. PCR products generated with primers a (SEQ ID NO: 46) and c (SEQ ID NO:48) were amplified with primers b (SEQ ID NO:47) and d (SEQ ID NO:49), respectively. In both rounds of PCR experiments adaptor specific primers were used with primers a-d. Use of primers a and b generated PCR products of 2.1 kb. Primers c and d generated PCR products of 0.7 kb. These PCR products were cloned using the PCR blunt cloning system (Invitrogen, CA, USA) and sequenced.
[0288] SEQ ID NO:50 (PvuII rapa cons) is genomic sequence of canola variety NS1822BC that was generated with primers a and b. It is comprised of 312 bp of a canola SUS2 homolog and 1924 bp of sequence upstream of the inferred start codon of the SUS2 gene. This 1924 bp sequence (including the 5' untranslated region) is designated the BnSUS2-2 promoter (SEQ ID NO:73).
[0289] SEQ ID NO:51 (1,6 DraI gene cons) is genomic sequence of canola variety NS1822BC that was generated with primers c and d. It is comprised of 37 bp of a canola SUS2 gene and 586 bp of sequence upstream of the inferred start codon of the SUS2 gene. This 586 bp sequence (including the 5' untranslated region) is designated the BnSUS2-1 promoter (SEQ ID NO:72).
[0290] Plasmid DNA of clone #6 containing 1,6 DraI gene cons (SEQ ID NO:51) was used in a PCR reaction with primers SA188 (SEQ ID NO:52) and SA189 (SEQ ID NO:53) using PHUSION® DNA polymerase (New England Biolabs, Inc.). Plasmid DNA of clone #45 containing PvuII rapa cons (SEQ ID NO:50) was used in a PCR reaction with primers SA190 (SEQ ID NO:54) and SA191 (SEQ ID NO:55). PCR products from both reactions were cloned into PCR blunt (Invitrogen, CA, USA) according to manufacturer instructions and sequenced. BN SUS2 prom1/PCR blunt is derived from 1,6 DraI gene cons (SEQ ID NO:51). It's sequence is set forth as SEQ ID NO:56. BN SUS2 prom2/PCR blunt is derived from PvuII rapa cons (SEQ ID NO:50). It's sequence is set forth as SEQ ID NO:57.
[0291] BN SUS2 prom1/PCR blunt (SEQ ID NO:56) was linearized with XbaI and NotI and ligated with a NotI-XbaI fragment from KS332 (SEQ ID NO:3) containing Phas terminator and Kti promoter DS red gene and Kti terminator cassette to give KS427 (SEQ ID NO:58). KS427 (SEQ ID NO:58) was linearized with NotI. A delta-6 desaturase gene of Mortierella alpina was excised from KS130 (SEQ ID NO:59) using NotI and ligated to NotI linearized KS427 (SEQ ID NO:58) to give KS432 (SEQ ID NO:60). Expression cassettes for DSred and delta-6 desaturase genes were excised as a single DNA fragment by digestion with AscI and inserted into AscI linearized pKR92 (SEQ ID NO:27) to give ARALO80 (SEQ ID NO:61). The ARALO80 vector contains the following expression unit: BnSUS2-1 promoter::M. alpina delta-6 desaturase::phaseolin terminator.
[0292] Prior to this KS130 (SEQ ID NO:59) was constructed as follows: Plasmid DNA of CGR-5, which is described in U.S. Pat. No. 5,968,809, was used in a PCR reaction with primers D6 fwd (SEQ ID NO:62) and D6 rev (SEQ ID NO:63). The PCR product was digested with NotI and ligated to NotI-linearized and de-phosphorylated KS119 vector (SEQ ID NO:64) to give KS130 (SEQ ID NO:59). Vector KS119 (SEQ ID NO:64) is described in International Publication No. WO2004071467.
[0293] The maize ODP1 gene was excised from KS336 (SEQ ID NO:6) using NotI and ligated to NotI linearized KS427 (SEQ ID NO:58) to give KS430 (SEQ ID NO:65). Expression cassettes for DSred and maize ODP1 genes were excised as a single fragment by digestion with AscI and inserted into AscI linearized pKR92 (SEQ ID NO:27) to give ARALO78 (SEQ ID NO:66). The ARALO78 vector contains the following expression unit: BnSUS2-1 promoter::ZM-ODP1::phaseolin terminator.
[0294] BN SUS2 pro2/PCR blunt (SEQ ID NO:57) was linearized with XbaI and NotI and ligated with a NotI-XbaI fragment from KS332 (SEQ ID NO:3) containing Phas terminator and Kti promoter DS red gene and Kti terminator cassette to give KS428 (SEQ ID NO:67). KS428 (SEQ ID NO:67) was linearized with NotI. The delta-6 desaturase gene was exised from KS130 (SEQ ID NO:59) using NotI and ligated to NotI-linearized KS428 (SEQ ID NO:67) to give KS429 (SEQ ID NO:68). Expression cassettes for DSred and delta-6 desaturase genes were excised as a single DNA fragment by digestion with AscI and inserted into AscI linearized pKR92 (SEQ ID NO:27) to give ARALO77 (SEQ ID NO:69). The ARALO77 vector contains the following expression unit: BnSUS2-2 promoter::M. alpina delta-6 desaturase::phaseolin terminator.
[0295] The maize ODP1 gene was excised from KS336 (SEQ ID NO:6) using NotI and ligated to NotI-linearized KS428 (SEQ ID NO:67) to give KS431 (SEQ ID NO:70). Expression cassettes for DSred and maize ODP1 genes were excised by digestion with AscI and inserted into AscI linearized pKR92 (SEQ ID NO:27 to give ARALO79 (SEQ ID NO:71). The ARALO79 vector contains the following expression unit: BnSUS2-2 promoter::ZM-ODP1::phaseolin terminator.
[0296] Plasmid DNA of ARALO77, ARALO78, ARALO79 and ARALO80 were used for Agrobacterium-mediated transformation of Arabidopsis plants as described in Example 2.
Example 15
Analysis of Progeny Seed of Arabidopsis Plants Transformed with Zea mays ODP Under Control of Canola Sucrose Synthase Promoters
[0297] Oil content of progeny seed (e.g., T2 seed) of transgenic lines generated with ARALO78 and ARALO79 can be measured by NMR as described in Example 2. Progeny seed (e.g., T2 seed) of transgenic events generated with ARALO78 and ARALO79 are expected to show increased oil content when compared to seed of untransformed control plants grown alongside the transgenic events.
Sequence CWU
1
1
7415280DNAArtificial Sequencevector pKS121/BS 1ggccgccacc gcggtggagc
tccagctttt gttcccttta gtgagggtta attgcgcgct 60tggcgtaatc atggtcatag
ctgtttcctg tgtgaaattg ttatccgctc acaattccac 120acaacatacg agccggaagc
ataaagtgta aagcctgggg tgcctaatga gtgagctaac 180tcacattaat tgcgttgcgc
tcactgcccg ctttccagtc gggaaacctg tcgtgccagc 240tgcattaatg aatcggccaa
cgcgcgggga gaggcggttt gcgtattggg cgctcttccg 300cttcctcgct cactgactcg
ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc 360actcaaaggc ggtaatacgg
ttatccacag aatcagggga taacgcagga aagaacatgt 420gagcaaaagg ccagcaaaag
gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc 480ataggctccg cccccctgac
gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa 540acccgacagg actataaaga
taccaggcgt ttccccctgg aagctccctc gtgcgctctc 600ctgttccgac cctgccgctt
accggatacc tgtccgcctt tctcccttcg ggaagcgtgg 660cgctttctca tagctcacgc
tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc 720tgggctgtgt gcacgaaccc
cccgttcagc ccgaccgctg cgccttatcc ggtaactatc 780gtcttgagtc caacccggta
agacacgact tatcgccact ggcagcagcc actggtaaca 840ggattagcag agcgaggtat
gtaggcggtg ctacagagtt cttgaagtgg tggcctaact 900acggctacac tagaaggaca
gtatttggta tctgcgctct gctgaagcca gttaccttcg 960gaaaaagagt tggtagctct
tgatccggca aacaaaccac cgctggtagc ggtggttttt 1020ttgtttgcaa gcagcagatt
acgcgcagaa aaaaaggatc tcaagaagat cctttgatct 1080tttctacggg gtctgacgct
cagtggaacg aaaactcacg ttaagggatt ttggtcatga 1140gattatcaaa aaggatcttc
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa 1200tctaaagtat atatgagtaa
acttggtctg acagttacca atgcttaatc agtgaggcac 1260ctatctcagc gatctgtcta
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga 1320taactacgat acgggagggc
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc 1380cacgctcacc ggctccagat
ttatcagcaa taaaccagcc agccggaagg gccgagcgca 1440gaagtggtcc tgcaacttta
tccgcctcca tccagtctat taattgttgc cgggaagcta 1500gagtaagtag ttcgccagtt
aatagtttgc gcaacgttgt tgccattgct acaggcatcg 1560tggtgtcacg ctcgtcgttt
ggtatggctt cattcagctc cggttcccaa cgatcaaggc 1620gagttacatg atcccccatg
ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg 1680ttgtcagaag taagttggcc
gcagtgttat cactcatggt tatggcagca ctgcataatt 1740ctcttactgt catgccatcc
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt 1800cattctgaga atagtgtatg
cggcgaccga gttgctcttg cccggcgtca atacgggata 1860ataccgcgcc acatagcaga
actttaaaag tgctcatcat tggaaaacgt tcttcggggc 1920gaaaactctc aaggatctta
ccgctgttga gatccagttc gatgtaaccc actcgtgcac 1980ccaactgatc ttcagcatct
tttactttca ccagcgtttc tgggtgagca aaaacaggaa 2040ggcaaaatgc cgcaaaaaag
ggaataaggg cgacacggaa atgttgaata ctcatactct 2100tcctttttca atattattga
agcatttatc agggttattg tctcatgagc ggatacatat 2160ttgaatgtat ttagaaaaat
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc 2220cacctgacgc gccctgtagc
ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg 2280tgaccgctac acttgccagc
gccctagcgc ccgctccttt cgctttcttc ccttcctttc 2340tcgccacgtt cgccggcttt
ccccgtcaag ctctaaatcg ggggctccct ttagggttcc 2400gatttagtgc tttacggcac
ctcgacccca aaaaacttga ttagggtgat ggttcacgta 2460gtgggccatc gccctgatag
acggtttttc gccctttgac gttggagtcc acgttcttta 2520atagtggact cttgttccaa
actggaacaa cactcaaccc tatctcggtc tattcttttg 2580atttataagg gattttgccg
atttcggcct attggttaaa aaatgagctg atttaacaaa 2640aatttaacgc gaattttaac
aaaatattaa cgcttacaat ttccattcgc cattcaggct 2700gcgcaactgt tgggaagggc
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa 2760agggggatgt gctgcaaggc
gattaagttg ggtaacgcca gggttttccc agtcacgacg 2820ttgtaaaacg acggccagtg
agcgcgcgta atacgactca ctatagggcg aattgggtac 2880cgggcccccc ctcgaggtcg
acggtatcga taagcttgat atcgaattcc tgcagcccgg 2940gggatccact agttctagag
cggcgcgccg tcgacggata taatgagccg taaacaaaga 3000tgattaagta gtaattaata
cgtactagta aaagtggcaa aagataacga gaaagaacca 3060atttctttgc attcggcctt
agcggaaggc atatataagc tttgattatt ttatttagtg 3120taatgatttc gtacaaccaa
agcatttatt tagtactctc acacttgtgt cgcggccgct 3180tggggggcta tggaagactt
tcttagttag ttgtgtgaat aagcaatgtt gggagaatcg 3240ggactactta taggatagga
ataaaacaga aaagtattaa gtgctaatga aatatttaga 3300ctgataatta aaatcttcac
gtatgtccac ttgatataaa aacgtcagga ataaaggaag 3360tacagtagaa tttaaaggta
ctctttttat atatacccgt gttctctttt tggctagcta 3420gttgcataaa aaataatcta
tatttttatc attattttaa atatcttatg agatggtaaa 3480tatttatcat aatttttttt
actattattt attatttgtg tgtgtaatac atatagaagt 3540taattacaaa ttttatttac
tttttcatta ttttgatatg attcaccatt aatttagtgt 3600tattatttat aatagttcat
tttaatcttt ttgtatatat tatgcgtgca gtactttttt 3660cctacatata actactatta
cattttattt atataatatt tttattaatg aattttcgtg 3720ataatatgta atattgttca
ttattatttc agatttttta aaaatatttg tgttattatt 3780tatgaaatat gtaatttttt
tagtatttga ttttatgatg ataaagtgtt ctaaattcaa 3840aagaaggggg aaagcgtaaa
cattaaaaaa cgtcatcaaa caaaaacaaa atcttgttaa 3900taaagataaa actgtttgtt
ttgatcactg ttatttcgta atataaaaac attatttata 3960tttatattgt tgacaaccaa
atttgcctat caaatctaac caatataatg catgcgtggc 4020aggtaatgta ctaccatgaa
cttaagtcat gacataataa accgtgaatc tgaccaatgc 4080atgtacctan ctaaattgta
tttgtgacac gaagcaaatg attcaattca caatggagat 4140gggaaacaaa taatgaagaa
cccagaacta agaaagcttt tctgaaaaat aaaataaagg 4200caatgtcaaa agtatactgc
atcatcagtc cagaaagcac atgatatttt tttatcagta 4260tcaatgcagc tagttttatt
ttacaatatc gatatagcta gtttaaatat attgcagcta 4320gatttataaa tatttgtgtt
attatttatc atttgtgtaa tcctgttttt agtattttag 4380tttatatatg atgataatgt
attccaaatt taaaagaagg gaaataaatt taaacaagaa 4440aaaaagtcat caaacaaaaa
acaaatgaaa gggtggaaag atgttaccat gtaatgtgaa 4500tgttacagta tttcttttat
tatagagtta acaaattaac taatatgatt ttgttaataa 4560tgataaaata ttttttttat
tattatttca taatataaaa atagtttact taatataaaa 4620aaaattctat cgttcacaac
aaagttggcc acctaattta accatgcatg tacccatgga 4680ccatattagg taaccatcaa
acctgatgaa gagataaaga gatgaagact taagtcataa 4740cacaaaacca taaaaaacaa
aaatacaatc aaccgtcaat ctgaccaatg catgaaaaag 4800ctgcaatagt gagtggcgac
acaaagcaca tgattttctt acaacggaga taaaaccaaa 4860aaaatatttc atgaacaacc
tagaacaaat aaagctttta tataataaat atataaataa 4920ataaaggcta tggaataata
tacttcaata tatttggatt aaataaattg ttggcggggt 4980tgatatattt atacacacct
aaagtcactt caatctcatt ttcacttaac ttttattttt 5040tttttctttt tatttatcat
aaagagaata ttgataatat actttttaac atatttttat 5100gacatttttt attggtgaaa
acttattaaa aatcataaat tttgtaagtt agatttattt 5160aaagagttcc tcttcttatt
ttaaattttt taataaattt ttaaataact aaaatttgtg 5220ttaaaaatgt taaaaaagtg
tgttattaac ccttctcttc gaggatccaa gcttggcgcg 528025968DNAArtificial
Sequencevector pDsRedxKS121/BS;DNA 2catggcctcc tccgaggacg tcatcaagga
gttcatgcgc ttcaaggtgc gcatggaggg 60ctccgtgaac ggccacgagt tcgagatcga
gggcgagggc gagggccgcc cctacgaggg 120cacccagacc gccaagctga aggtgaccaa
gggcggcccc ctgcccttcg cctgggacat 180cctgtccccc cagttccagt acggctccaa
ggtgtacgtg aagcaccccg ccgacatccc 240cgactacaag aagctgtcct tccccgaggg
cttcaagtgg gagcgcgtga tgaacttcga 300ggacggcggc gtggtgaccg tgacccagga
ctcctccctg caggacggct ccttcatcta 360caaggtgaag ttcatcggcg tgaacttccc
ctccgacggc cccgtaatgc agaagaagac 420tatgggctgg gaggcctcca ccgagcgcct
gtacccccgc gacggcgtgc tgaagggcga 480gatccacaag gccctgaagc tgaaggacgg
cggccactac ctggtggagt tcaagtccat 540ctacatggcc aagaagcccg tgcagctgcc
cggctactac tacgtggact ccaagctgga 600catcacctcc cacaacgagg actacaccat
cgtggagcag tacgagcgcg ccgagggccg 660ccaccacctg ttcctgtagc ggccggccgc
gacacaagtg tgagagtact aaataaatgc 720tttggttgta cgaaatcatt acactaaata
aaataatcaa agcttatata tgccttccgc 780taaggccgaa tgcaaagaaa ttggttcttt
ctcgttatct tttgccactt ttactagtac 840gtattaatta ctacttaatc atctttgttt
acggctcatt atatccgtcg acggcgcgcc 900gctctagaac tagtggatcc cccgggctgc
aggaattcga tatcaagctt atcgataccg 960tcgacctcga gggggggccc ggtacccaat
tcgccctata gtgagtcgta ttacgcgcgc 1020tcactggccg tcgttttaca acgtcgtgac
tgggaaaacc ctggcgttac ccaacttaat 1080cgccttgcag cacatccccc tttcgccagc
tggcgtaata gcgaagaggc ccgcaccgat 1140cgcccttccc aacagttgcg cagcctgaat
ggcgaatgga aattgtaagc gttaatattt 1200tgttaaaatt cgcgttaaat ttttgttaaa
tcagctcatt ttttaaccaa taggccgaaa 1260tcggcaaaat cccttataaa tcaaaagaat
agaccgagat agggttgagt gttgttccag 1320tttggaacaa gagtccacta ttaaagaacg
tggactccaa cgtcaaaggg cgaaaaaccg 1380tctatcaggg cgatggccca ctacgtgaac
catcacccta atcaagtttt ttggggtcga 1440ggtgccgtaa agcactaaat cggaacccta
aagggagccc ccgatttaga gcttgacggg 1500gaaagccggc gaacgtggcg agaaaggaag
ggaagaaagc gaaaggagcg ggcgctaggg 1560cgctggcaag tgtagcggtc acgctgcgcg
taaccaccac acccgccgcg cttaatgcgc 1620cgctacaggg cgcgtcaggt ggcacttttc
ggggaaatgt gcgcggaacc cctatttgtt 1680tatttttcta aatacattca aatatgtatc
cgctcatgag acaataaccc tgataaatgc 1740ttcaataata ttgaaaaagg aagagtatga
gtattcaaca tttccgtgtc gcccttattc 1800ccttttttgc ggcattttgc cttcctgttt
ttgctcaccc agaaacgctg gtgaaagtaa 1860aagatgctga agatcagttg ggtgcacgag
tgggttacat cgaactggat ctcaacagcg 1920gtaagatcct tgagagtttt cgccccgaag
aacgttttcc aatgatgagc acttttaaag 1980ttctgctatg tggcgcggta ttatcccgta
ttgacgccgg gcaagagcaa ctcggtcgcc 2040gcatacacta ttctcagaat gacttggttg
agtactcacc agtcacagaa aagcatctta 2100cggatggcat gacagtaaga gaattatgca
gtgctgccat aaccatgagt gataacactg 2160cggccaactt acttctgaca acgatcggag
gaccgaagga gctaaccgct tttttgcaca 2220acatggggga tcatgtaact cgccttgatc
gttgggaacc ggagctgaat gaagccatac 2280caaacgacga gcgtgacacc acgatgcctg
tagcaatggc aacaacgttg cgcaaactat 2340taactggcga actacttact ctagcttccc
ggcaacaatt aatagactgg atggaggcgg 2400ataaagttgc aggaccactt ctgcgctcgg
cccttccggc tggctggttt attgctgata 2460aatctggagc cggtgagcgt gggtctcgcg
gtatcattgc agcactgggg ccagatggta 2520agccctcccg tatcgtagtt atctacacga
cggggagtca ggcaactatg gatgaacgaa 2580atagacagat cgctgagata ggtgcctcac
tgattaagca ttggtaactg tcagaccaag 2640tttactcata tatactttag attgatttaa
aacttcattt ttaatttaaa aggatctagg 2700tgaagatcct ttttgataat ctcatgacca
aaatccctta acgtgagttt tcgttccact 2760gagcgtcaga ccccgtagaa aagatcaaag
gatcttcttg agatcctttt tttctgcgcg 2820taatctgctg cttgcaaaca aaaaaaccac
cgctaccagc ggtggtttgt ttgccggatc 2880aagagctacc aactcttttt ccgaaggtaa
ctggcttcag cagagcgcag ataccaaata 2940ctgtccttct agtgtagccg tagttaggcc
accacttcaa gaactctgta gcaccgccta 3000catacctcgc tctgctaatc ctgttaccag
tggctgctgc cagtggcgat aagtcgtgtc 3060ttaccgggtt ggactcaaga cgatagttac
cggataaggc gcagcggtcg ggctgaacgg 3120ggggttcgtg cacacagccc agcttggagc
gaacgaccta caccgaactg agatacctac 3180agcgtgagct atgagaaagc gccacgcttc
ccgaagggag aaaggcggac aggtatccgg 3240taagcggcag ggtcggaaca ggagagcgca
cgagggagct tccaggggga aacgcctggt 3300atctttatag tcctgtcggg tttcgccacc
tctgacttga gcgtcgattt ttgtgatgct 3360cgtcaggggg gcggagccta tggaaaaacg
ccagcaacgc ggccttttta cggttcctgg 3420ccttttgctg gccttttgct cacatgttct
ttcctgcgtt atcccctgat tctgtggata 3480accgtattac cgcctttgag tgagctgata
ccgctcgccg cagccgaacg accgagcgca 3540gcgagtcagt gagcgaggaa gcggaagagc
gcccaatacg caaaccgcct ctccccgcgc 3600gttggccgat tcattaatgc agctggcacg
acaggtttcc cgactggaaa gcgggcagtg 3660agcgcaacgc aattaatgtg agttagctca
ctcattaggc accccaggct ttacacttta 3720tgcttccggc tcgtatgttg tgtggaattg
tgagcggata acaatttcac acaggaaaca 3780gctatgacca tgattacgcc aagcgcgcaa
ttaaccctca ctaaagggaa caaaagctgg 3840agctccaccg cggtggcggc ccgcgccaag
cttggatcct cgaagagaag ggttaataac 3900acactttttt aacattttta acacaaattt
tagttattta aaaatttatt aaaaaattta 3960aaataagaag aggaactctt taaataaatc
taacttacaa aatttatgat ttttaataag 4020ttttcaccaa taaaaaatgt cataaaaata
tgttaaaaag tatattatca atattctctt 4080tatgataaat aaaaagaaaa aaaaaataaa
agttaagtga aaatgagatt gaagtgactt 4140taggtgtgta taaatatatc aaccccgcca
acaatttatt taatccaaat atattgaagt 4200atattattcc atagccttta tttatttata
tatttattat ataaaagctt tatttgttct 4260aggttgttca tgaaatattt ttttggtttt
atctccgttg taagaaaatc atgtgctttg 4320tgtcgccact cactattgca gctttttcat
gcattggtca gattgacggt tgattgtatt 4380tttgtttttt atggttttgt gttatgactt
aagtcttcat ctctttatct cttcatcagg 4440tttgatggtt acctaatatg gtccatgggt
acatgcatgg ttaaattagg tggccaactt 4500tgttgtgaac gatagaattt tttttatatt
aagtaaacta tttttatatt atgaaataat 4560aataaaaaaa atattttatc attattaaca
aaatcatatt agttaatttg ttaactctat 4620aataaaagaa atactgtaac attcacatta
catggtaaca tctttccacc ctttcatttg 4680ttttttgttt gatgactttt tttcttgttt
aaatttattt cccttctttt aaatttggaa 4740tacattatca tcatatataa actaaaatac
taaaaacagg attacacaaa tgataaataa 4800taacacaaat atttataaat ctagctgcaa
tatatttaaa ctagctatat cgatattgta 4860aaataaaact agctgcattg atactgataa
aaaaatatca tgtgctttct ggactgatga 4920tgcagtatac ttttgacatt gcctttattt
tatttttcag aaaagctttc ttagttctgg 4980gttcttcatt atttgtttcc catctccatt
gtgaattgaa tcatttgctt cgtgtcacaa 5040atacaattta gntaggtaca tgcattggtc
agattcacgg tttattatgt catgacttaa 5100gttcatggta gtacattacc tgccacgcat
gcattatatt ggttagattt gataggcaaa 5160tttggttgtc aacaatataa atataaataa
tgtttttata ttacgaaata acagtgatca 5220aaacaaacag ttttatcttt attaacaaga
ttttgttttt gtttgatgac gttttttaat 5280gtttacgctt tcccccttct tttgaattta
gaacacttta tcatcataaa atcaaatact 5340aaaaaaatta catatttcat aaataataac
acaaatattt ttaaaaaatc tgaaataata 5400atgaacaata ttacatatta tcacgaaaat
tcattaataa aaatattata taaataaaat 5460gtaatagtag ttatatgtag gaaaaaagta
ctgcacgcat aatatataca aaaagattaa 5520aatgaactat tataaataat aacactaaat
taatggtgaa tcatatcaaa ataatgaaaa 5580agtaaataaa atttgtaatt aacttctata
tgtattacac acacaaataa taaataatag 5640taaaaaaaat tatgataaat atttaccatc
tcataagata tttaaaataa tgataaaaat 5700atagattatt ttttatgcaa ctagctagcc
aaaaagagaa cacgggtata tataaaaaga 5760gtacctttaa attctactgt acttccttta
ttcctgacgt ttttatatca agtggacata 5820cgtgaagatt ttaattatca gtctaaatat
ttcattagca cttaatactt ttctgtttta 5880ttcctatcct ataagtagtc ccgattctcc
caacattgct tattcacaca actaactaag 5940aaagtcttcc atagcccccc aagcggcc
5968310058DNAArtificial Sequencevector
pKS332 3gatcctcgaa gagaagggtt aataacacac ttttttaaca tttttaacac aaattttagt
60tatttaaaaa tttattaaaa aatttaaaat aagaagagga actctttaaa taaatctaac
120ttacaaaatt tatgattttt aataagtttt caccaataaa aaatgtcata aaaatatgtt
180aaaaagtata ttatcaatat tctctttatg ataaataaaa agaaaaaaaa aataaaagtt
240aagtgaaaat gagattgaag tgactttagg tgtgtataaa tatatcaacc ccgccaacaa
300tttatttaat ccaaatatat tgaagtatat tattccatag cctttattta tttatatatt
360tattatataa aagctttatt tgttctaggt tgttcatgaa atattttttt ggttttatct
420ccgttgtaag aaaatcatgt gctttgtgtc gccactcact attgcagctt tttcatgcat
480tggtcagatt gacggttgat tgtatttttg ttttttatgg ttttgtgtta tgacttaagt
540cttcatctct ttatctcttc atcaggtttg atggttacct aatatggtcc atgggtacat
600gcatggttaa attaggtggc caactttgtt gtgaacgata gaattttttt tatattaagt
660aaactatttt tatattatga aataataata aaaaaaatat tttatcatta ttaacaaaat
720catattagtt aatttgttaa ctctataata aaagaaatac tgtaacattc acattacatg
780gtaacatctt tccacccttt catttgtttt ttgtttgatg actttttttc ttgtttaaat
840ttatttccct tcttttaaat ttggaataca ttatcatcat atataaacta aaatactaaa
900aacaggatta cacaaatgat aaataataac acaaatattt ataaatctag ctgcaatata
960tttaaactag ctatatcgat attgtaaaat aaaactagct gcattgatac tgataaaaaa
1020atatcatgtg ctttctggac tgatgatgca gtatactttt gacattgcct ttattttatt
1080tttcagaaaa gctttcttag ttctgggttc ttcattattt gtttcccatc tccattgtga
1140attgaatcat ttgcttcgtg tcacaaatac aatttagnta ggtacatgca ttggtcagat
1200tcacggttta ttatgtcatg acttaagttc atggtagtac attacctgcc acgcatgcat
1260tatattggtt agatttgata ggcaaatttg gttgtcaaca atataaatat aaataatgtt
1320tttatattac gaaataacag tgatcaaaac aaacagtttt atctttatta acaagatttt
1380gtttttgttt gatgacgttt tttaatgttt acgctttccc ccttcttttg aatttagaac
1440actttatcat cataaaatca aatactaaaa aaattacata tttcataaat aataacacaa
1500atatttttaa aaaatctgaa ataataatga acaatattac atattatcac gaaaattcat
1560taataaaaat attatataaa taaaatgtaa tagtagttat atgtaggaaa aaagtactgc
1620acgcataata tatacaaaaa gattaaaatg aactattata aataataaca ctaaattaat
1680ggtgaatcat atcaaaataa tgaaaaagta aataaaattt gtaattaact tctatatgta
1740ttacacacac aaataataaa taatagtaaa aaaaattatg ataaatattt accatctcat
1800aagatattta aaataatgat aaaaatatag attatttttt atgcaactag ctagccaaaa
1860agagaacacg ggtatatata aaaagagtac ctttaaattc tactgtactt cctttattcc
1920tgacgttttt atatcaagtg gacatacgtg aagattttaa ttatcagtct aaatatttca
1980ttagcactta atacttttct gttttattcc tatcctataa gtagtcccga ttctcccaac
2040attgcttatt cacacaacta actaagaaag tcttccatag ccccccaagc ggcccatggc
2100ctcctccgag gacgtcatca aggagttcat gcgcttcaag gtgcgcatgg agggctccgt
2160gaacggccac gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca
2220gaccgccaag ctgaaggtga ccaagggcgg ccccctgccc ttcgcctggg acatcctgtc
2280cccccagttc cagtacggct ccaaggtgta cgtgaagcac cccgccgaca tccccgacta
2340caagaagctg tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg
2400cggcgtggtg accgtgaccc aggactcctc cctgcaggac ggctccttca tctacaaggt
2460gaagttcatc ggcgtgaact tcccctccga cggccccgta atgcagaaga agactatggg
2520ctgggaggcc tccaccgagc gcctgtaccc ccgcgacggc gtgctgaagg gcgagatcca
2580caaggccctg aagctgaagg acggcggcca ctacctggtg gagttcaagt ccatctacat
2640ggccaagaag cccgtgcagc tgcccggcta ctactacgtg gactccaagc tggacatcac
2700ctcccacaac gaggactaca ccatcgtgga gcagtacgag cgcgccgagg gccgccacca
2760cctgttcctg tagcggccgg ccgcgacaca agtgtgagag tactaaataa atgctttggt
2820tgtacgaaat cattacacta aataaaataa tcaaagctta tatatgcctt ccgctaaggc
2880cgaatgcaaa gaaattggtt ctttctcgtt atcttttgcc acttttacta gtacgtatta
2940attactactt aatcatcttt gtttacggct cattatatcc gtcgacggcg cgccgctcta
3000gaactagtgg atccgtcgac ggcgcgcccg atcatccgga tatagttcct cctttcagca
3060aaaaacccct caagacccgt ttagaggccc caaggggtta tgctagttat tgctcagcgg
3120tggcagcagc caactcagct tcctttcggg ctttgttagc agccggatcg atccaagctg
3180tacctcacta ttcctttgcc ctcggacgag tgctggggcg tcggtttcca ctatcggcga
3240gtacttctac acagccatcg gtccagacgg ccgcgcttct gcgggcgatt tgtgtacgcc
3300cgacagtccc ggctccggat cggacgattg cgtcgcatcg accctgcgcc caagctgcat
3360catcgaaatt gccgtcaacc aagctctgat agagttggtc aagaccaatg cggagcatat
3420acgcccggag ccgcggcgat cctgcaagct ccggatgcct ccgctcgaag tagcgcgtct
3480gctgctccat acaagccaac cacggcctcc agaagaagat gttggcgacc tcgtattggg
3540aatccccgaa catcgcctcg ctccagtcaa tgaccgctgt tatgcggcca ttgtccgtca
3600ggacattgtt ggagccgaaa tccgcgtgca cgaggtgccg gacttcgggg cagtcctcgg
3660cccaaagcat cagctcatcg agagcctgcg cgacggacgc actgacggtg tcgtccatca
3720cagtttgcca gtgatacaca tggggatcag caatcgcgca tatgaaatca cgccatgtag
3780tgtattgacc gattccttgc ggtccgaatg ggccgaaccc gctcgtctgg ctaagatcgg
3840ccgcagcgat cgcatccata gcctccgcga ccggctgcag aacagcgggc agttcggttt
3900caggcaggtc ttgcaacgtg acaccctgtg cacggcggga gatgcaatag gtcaggctct
3960cgctgaattc cccaatgtca agcacttccg gaatcgggag cgcggccgat gcaaagtgcc
4020gataaacata acgatctttg tagaaaccat cggcgcagct atttacccgc aggacatatc
4080cacgccctcc tacatcgaag ctgaaagcac gagattcttc gccctccgag agctgcatca
4140ggtcggagac gctgtcgaac ttttcgatca gaaacttctc gacagacgtc gcggtgagtt
4200caggcttttc catgggtata tctccttctt aaagttaaac aaaattattt ctagagggaa
4260accgttgtgg tctccctata gtgagtcgta ttaatttcgc gggatcgaga tcgatccaat
4320tccaatccca caaaaatctg agcttaacag cacagttgct cctctcagag cagaatcggg
4380tattcaacac cctcatatca actactacgt tgtgtataac ggtccacatg ccggtatata
4440cgatgactgg ggttgtacaa aggcggcaac aaacggcgtt cccggagttg cacacaagaa
4500atttgccact attacagagg caagagcagc agctgacgcg tacacaacaa gtcagcaaac
4560agacaggttg aacttcatcc ccaaaggaga agctcaactc aagcccaaga gctttgctaa
4620ggccctaaca agcccaccaa agcaaaaagc ccactggctc acgctaggaa ccaaaaggcc
4680cagcagtgat ccagccccaa aagagatctc ctttgccccg gagattacaa tggacgattt
4740cctctatctt tacgatctag gaaggaagtt cgaaggtgaa ggtgacgaca ctatgttcac
4800cactgataat gagaaggtta gcctcttcaa tttcagaaag aatgctgacc cacagatggt
4860tagagaggcc tacgcagcag gtctcatcaa gacgatctac ccgagtaaca atctccagga
4920gatcaaatac cttcccaaga aggttaaaga tgcagtcaaa agattcagga ctaattgcat
4980caagaacaca gagaaagaca tatttctcaa gatcagaagt actattccag tatggacgat
5040tcaaggcttg cttcataaac caaggcaagt aatagagatt ggagtctcta aaaaggtagt
5100tcctactgaa tctaaggcca tgcatggagt ctaagattca aatcgaggat ctaacagaac
5160tcgccgtgaa gactggcgaa cagttcatac agagtctttt acgactcaat gacaagaaga
5220aaatcttcgt caacatggtg gagcacgaca ctctggtcta ctccaaaaat gtcaaagata
5280cagtctcaga agaccaaagg gctattgaga cttttcaaca aaggataatt tcgggaaacc
5340tcctcggatt ccattgccca gctatctgtc acttcatcga aaggacagta gaaaaggaag
5400gtggctccta caaatgccat cattgcgata aaggaaaggc tatcattcaa gatgcctctg
5460ccgacagtgg tcccaaagat ggacccccac ccacgaggag catcgtggaa aaagaagacg
5520ttccaaccac gtcttcaaag caagtggatt gatgtgacat ctccactgac gtaagggatg
5580acgcacaatc ccactatcct tcgcaagacc cttcctctat ataaggaagt tcatttcatt
5640tggagaggac acgctcgagc tcatttctct attacttcag ccataacaaa agaactcttt
5700tctcttctta ttaaaccatg aaaaagcctg aactcaccgc gacgtctgtc gagaagtttc
5760tgatcgaaaa gttcgacagc gtctccgacc tgatgcagct ctcggagggc gaagaatctc
5820gtgctttcag cttcgatgta ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg
5880atggtttcta caaagatcgt tatgtttatc ggcactttgc atcggccgcg ctcccgattc
5940cggaagtgct tgacattggg gaattcagcg agagcctgac ctattgcatc tcccgccgtg
6000cacagggtgt cacgttgcaa gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg
6060tcgcggaggc catggatgcg atcgctgcgg ccgatcttag ccagacgagc gggttcggcc
6120cattcggacc gcaaggaatc ggtcaataca ctacatggcg tgatttcata tgcgcgattg
6180ctgatcccca tgtgtatcac tggcaaactg tgatggacga caccgtcagt gcgtccgtcg
6240cgcaggctct cgatgagctg atgctttggg ccgaggactg ccccgaagtc cggcacctcg
6300tgcacgcgga tttcggctcc aacaatgtcc tgacggacaa tggccgcata acagcggtca
6360ttgactggag cgaggcgatg ttcggggatt cccaatacga ggtcgccaac atcttcttct
6420ggaggccgtg gttggcttgt atggagcagc agacgcgcta cttcgagcgg aggcatccgg
6480agcttgcagg atcgccgcgg ctccgggcgt atatgctccg cattggtctt gaccaactct
6540atcagagctt ggttgacggc aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg
6600caatcgtccg atccggagcc gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg
6660ccgtctggac cgatggctgt gtagaagtac tcgccgatag tggaaaccga cgccccagca
6720ctcgtccgag ggcaaaggaa tagtgaggta cctaaagaag gagtgcgtcg aagcagatcg
6780ttcaaacatt tggcaataaa gtttcttaag attgaatcct gttgccggtc ttgcgatgat
6840tatcatataa tttctgttga attacgttaa gcatgtaata attaacatgt aatgcatgac
6900gttatttatg agatgggttt ttatgattag agtcccgcaa ttatacattt aatacgcgat
6960agaaaacaaa atatagcgcg caaactagga taaattatcg cgcgcggtgt catctatgtt
7020actagatcga tgtcgaatct gatcaacctg cattaatgaa tcggccaacg cgcggggaga
7080ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc
7140gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa
7200tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt
7260aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa
7320aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt
7380ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg
7440tccgcctttc tcccttcggg aagcgtggcg ctttctcaat gctcacgctg taggtatctc
7500agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc
7560gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta
7620tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct
7680acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc
7740tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa
7800caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa
7860aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa
7920aactcacgtt aagggatttt ggtcatgaca ttaacctata aaaataggcg tatcacgagg
7980ccctttcgtc tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg
8040gagacggtca cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg
8100tcagcgggtg ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta
8160ctgagagtgc accatatgga catattgtcg ttagaacgcg gctacaatta atacataacc
8220ttatgtatca tacacatacg atttaggtga cactatagaa cggcgcgcca agcttttgat
8280ccatgccctt catttgccgc ttattaatta atttggtaac agtccgtact aatcagttac
8340ttatccttcc cccatcataa ttaatcttgg tagtctcgaa tgccacaaca ctgactagtc
8400tcttggatca taagaaaaag ccaaggaaca aaagaagaca aaacacaatg agagtatcct
8460ttgcatagca atgtctaagt tcataaaatt caaacaaaaa cgcaatcaca cacagtggac
8520atcacttatc cactagctga tcaggatcgc cgcgtcaaga aaaaaaaact ggaccccaaa
8580agccatgcac aacaacacgt actcacaaag gtgtcaatcg agcagcccaa aacattcacc
8640aactcaaccc atcatgagcc ctcacatttg ttgtttctaa cccaacctca aactcgtatt
8700ctcttccgcc acctcatttt tgtttatttc aacacccgtc aaactgcatg ccaccccgtg
8760gccaaatgtc catgcatgtt aacaagacct atgactataa atagctgcaa tctcggccca
8820ggttttcatc atcaagaacc agttcaatat cctagtacac cgtattaaag aatttaagat
8880atactgcggc cgcaagtatg aactaaaatg catgtaggtg taagagctca tggagagcat
8940ggaatattgt atccgaccat gtaacagtat aataactgag ctccatctca cttcttctat
9000gaataaacaa aggatgttat gatatattaa cactctatct atgcacctta ttgttctatg
9060ataaatttcc tcttattatt ataaatcatc tgaatcgtga cggcttatgg aatgcttcaa
9120atagtacaaa aacaaatgtg tactataaga ctttctaaac aattctaacc ttagcattgt
9180gaacgagaca taagtgttaa gaagacataa caattataat ggaagaagtt tgtctccatt
9240tatatattat atattaccca cttatgtatt atattaggat gttaaggaga cataacaatt
9300ataaagagag aagtttgtat ccatttatat attatatact acccatttat atattatact
9360tatccactta tttaatgtct ttataaggtt tgatccatga tatttctaat attttagttg
9420atatgtatat gaaagggtac tatttgaact ctcttactct gtataaaggt tggatcatcc
9480ttaaagtggg tctatttaat tttattgctt cttacagata aaaaaaaaat tatgagttgg
9540tttgataaaa tattgaagga tttaaaataa taataaataa catataatat atgtatataa
9600atttattata atataacatt tatctataaa aaagtaaata ttgtcataaa tctatacaat
9660cgtttagcct tgctggacga atctcaatta tttaaacgag agtaaacata tttgactttt
9720tggttattta acaaattatt atttaacact atatgaaatt ttttttttta tcagcaaaga
9780ataaaattaa attaagaagg acaatggtgt cccaatcctt atacaaccaa cttccacaag
9840aaagtcaagt cagagacaac aaaaaaacaa gcaaaggaaa ttttttaatt tgagttgtct
9900tgtttgctgc ataatttatg cagtaaaaca ctacacataa cccttttagc agtagagcaa
9960tggttgaccg tgtgcttagc ttcttttatt ttattttttt atcagcaaag aataaataaa
10020ataaaatgag acacttcagg gatgtttcaa caagcttg
10058434DNAArtificial SequencePCR primer MWG345 4gaattcgcgg ccgcatggag
agatctcaac ggca 34534DNAArtificial
SequencePCR primer MWG346 5gaattcgcgg ccgcttagtt gcacacactg atca
34611251DNAArtificial Sequencevector pKS336
6ggccgcatgg agagatctca acggcagtct cctccgccac cgtcgccgtc ctcctcctcg
60tcctccgtct ccgcggacac cgtcctcgtc cctcccggaa agaggcggag ggcggcgacg
120gccaaggccg gcgccgagcc taataagagg atccgcaagg accccgccgc cgccgccgcg
180gggaagagga gctccgtcta caggggagtc accaggcaca ggtggacggg caggttcgag
240gcgcatctct gggacaagca ctgcctcgcc gcgctccaca acaagaagaa aggcaggcaa
300gtctacctgg gggcgtatga cagcgaggag gcagctgctc gtgcctatga cctcgcagct
360ctcaagtact ggggtcctga gactctgctc aacttccctg tggaggatta ctccagcgag
420atgccggaga tggaggccgt gtcccgggag gagtacctgg cctccctccg ccgcaggagc
480agcggcttct ccaggggcgt ctccaagtac agaggcgtcg ccaggcatca ccacaacggg
540aggtgggagg cacggattgg gcgagtcttt gggaacaagt acctctactt gggaacattt
600gacactcaag aagaggcagc caaggcctat gaccttgcgg ccattgaata ccgtggcgtc
660aatgctgtaa ccaacttcga catcagctgc tacctggacc acccgctgtt cctggcacag
720ctccaacagg agccacaggt ggtgccggca ctcaaccaag aacctcaacc tgatcagagc
780gaaaccggaa ctacagagca agagccggag tcaagcgaag ccaagacacc ggatggcagt
840gcagaacccg atgagaacgc ggtgcctgac gacaccgcgg agcccctcac cacagtcgac
900gacagcatcg aagagggctt gtggagccct tgcatggatt acgagctaga caccatgtcg
960agaccaaact ttggcagctc aatcaatctg agcgagtggt tcgctgacgc agacttcgac
1020tgcaacatcg gatgcctgtt cgatgggtgt tctgcggctg acgaaggaag caaggatggt
1080gtaggtctgg cagatttcag tctgtttgag gcaggtgatg tccagctgaa ggatgttctt
1140tcggatatgg aagaggggat acaacctcca gcgatgatca gtgtgtgcaa cgcggccgca
1200agtatgaact aaaatgcatg taggtgtaag agctcatgga gagcatggaa tattgtatcc
1260gaccatgtaa cagtataata actgagctcc atctcacttc ttctatgaat aaacaaagga
1320tgttatgata tattaacact ctatctatgc accttattgt tctatgataa atttcctctt
1380attattataa atcatctgaa tcgtgacggc ttatggaatg cttcaaatag tacaaaaaca
1440aatgtgtact ataagacttt ctaaacaatt ctaaccttag cattgtgaac gagacataag
1500tgttaagaag acataacaat tataatggaa gaagtttgtc tccatttata tattatatat
1560tacccactta tgtattatat taggatgtta aggagacata acaattataa agagagaagt
1620ttgtatccat ttatatatta tatactaccc atttatatat tatacttatc cacttattta
1680atgtctttat aaggtttgat ccatgatatt tctaatattt tagttgatat gtatatgaaa
1740gggtactatt tgaactctct tactctgtat aaaggttgga tcatccttaa agtgggtcta
1800tttaatttta ttgcttctta cagataaaaa aaaaattatg agttggtttg ataaaatatt
1860gaaggattta aaataataat aaataacata taatatatgt atataaattt attataatat
1920aacatttatc tataaaaaag taaatattgt cataaatcta tacaatcgtt tagccttgct
1980ggacgaatct caattattta aacgagagta aacatatttg actttttggt tatttaacaa
2040attattattt aacactatat gaaatttttt tttttatcag caaagaataa aattaaatta
2100agaaggacaa tggtgtccca atccttatac aaccaacttc cacaagaaag tcaagtcaga
2160gacaacaaaa aaacaagcaa aggaaatttt ttaatttgag ttgtcttgtt tgctgcataa
2220tttatgcagt aaaacactac acataaccct tttagcagta gagcaatggt tgaccgtgtg
2280cttagcttct tttattttat ttttttatca gcaaagaata aataaaataa aatgagacac
2340ttcagggatg tttcaacaag cttggatcct cgaagagaag ggttaataac acactttttt
2400aacattttta acacaaattt tagttattta aaaatttatt aaaaaattta aaataagaag
2460aggaactctt taaataaatc taacttacaa aatttatgat ttttaataag ttttcaccaa
2520taaaaaatgt cataaaaata tgttaaaaag tatattatca atattctctt tatgataaat
2580aaaaagaaaa aaaaaataaa agttaagtga aaatgagatt gaagtgactt taggtgtgta
2640taaatatatc aaccccgcca acaatttatt taatccaaat atattgaagt atattattcc
2700atagccttta tttatttata tatttattat ataaaagctt tatttgttct aggttgttca
2760tgaaatattt ttttggtttt atctccgttg taagaaaatc atgtgctttg tgtcgccact
2820cactattgca gctttttcat gcattggtca gattgacggt tgattgtatt tttgtttttt
2880atggttttgt gttatgactt aagtcttcat ctctttatct cttcatcagg tttgatggtt
2940acctaatatg gtccatgggt acatgcatgg ttaaattagg tggccaactt tgttgtgaac
3000gatagaattt tttttatatt aagtaaacta tttttatatt atgaaataat aataaaaaaa
3060atattttatc attattaaca aaatcatatt agttaatttg ttaactctat aataaaagaa
3120atactgtaac attcacatta catggtaaca tctttccacc ctttcatttg ttttttgttt
3180gatgactttt tttcttgttt aaatttattt cccttctttt aaatttggaa tacattatca
3240tcatatataa actaaaatac taaaaacagg attacacaaa tgataaataa taacacaaat
3300atttataaat ctagctgcaa tatatttaaa ctagctatat cgatattgta aaataaaact
3360agctgcattg atactgataa aaaaatatca tgtgctttct ggactgatga tgcagtatac
3420ttttgacatt gcctttattt tatttttcag aaaagctttc ttagttctgg gttcttcatt
3480atttgtttcc catctccatt gtgaattgaa tcatttgctt cgtgtcacaa atacaattta
3540gntaggtaca tgcattggtc agattcacgg tttattatgt catgacttaa gttcatggta
3600gtacattacc tgccacgcat gcattatatt ggttagattt gataggcaaa tttggttgtc
3660aacaatataa atataaataa tgtttttata ttacgaaata acagtgatca aaacaaacag
3720ttttatcttt attaacaaga ttttgttttt gtttgatgac gttttttaat gtttacgctt
3780tcccccttct tttgaattta gaacacttta tcatcataaa atcaaatact aaaaaaatta
3840catatttcat aaataataac acaaatattt ttaaaaaatc tgaaataata atgaacaata
3900ttacatatta tcacgaaaat tcattaataa aaatattata taaataaaat gtaatagtag
3960ttatatgtag gaaaaaagta ctgcacgcat aatatataca aaaagattaa aatgaactat
4020tataaataat aacactaaat taatggtgaa tcatatcaaa ataatgaaaa agtaaataaa
4080atttgtaatt aacttctata tgtattacac acacaaataa taaataatag taaaaaaaat
4140tatgataaat atttaccatc tcataagata tttaaaataa tgataaaaat atagattatt
4200ttttatgcaa ctagctagcc aaaaagagaa cacgggtata tataaaaaga gtacctttaa
4260attctactgt acttccttta ttcctgacgt ttttatatca agtggacata cgtgaagatt
4320ttaattatca gtctaaatat ttcattagca cttaatactt ttctgtttta ttcctatcct
4380ataagtagtc ccgattctcc caacattgct tattcacaca actaactaag aaagtcttcc
4440atagcccccc aagcggccca tggcctcctc cgaggacgtc atcaaggagt tcatgcgctt
4500caaggtgcgc atggagggct ccgtgaacgg ccacgagttc gagatcgagg gcgagggcga
4560gggccgcccc tacgagggca cccagaccgc caagctgaag gtgaccaagg gcggccccct
4620gcccttcgcc tgggacatcc tgtcccccca gttccagtac ggctccaagg tgtacgtgaa
4680gcaccccgcc gacatccccg actacaagaa gctgtccttc cccgagggct tcaagtggga
4740gcgcgtgatg aacttcgagg acggcggcgt ggtgaccgtg acccaggact cctccctgca
4800ggacggctcc ttcatctaca aggtgaagtt catcggcgtg aacttcccct ccgacggccc
4860cgtaatgcag aagaagacta tgggctggga ggcctccacc gagcgcctgt acccccgcga
4920cggcgtgctg aagggcgaga tccacaaggc cctgaagctg aaggacggcg gccactacct
4980ggtggagttc aagtccatct acatggccaa gaagcccgtg cagctgcccg gctactacta
5040cgtggactcc aagctggaca tcacctccca caacgaggac tacaccatcg tggagcagta
5100cgagcgcgcc gagggccgcc accacctgtt cctgtagcgg ccggccgcga cacaagtgtg
5160agagtactaa ataaatgctt tggttgtacg aaatcattac actaaataaa ataatcaaag
5220cttatatatg ccttccgcta aggccgaatg caaagaaatt ggttctttct cgttatcttt
5280tgccactttt actagtacgt attaattact acttaatcat ctttgtttac ggctcattat
5340atccgtcgac ggcgcgccgc tctagaacta gtggatccgt cgacggcgcg cccgatcatc
5400cggatatagt tcctcctttc agcaaaaaac ccctcaagac ccgtttagag gccccaaggg
5460gttatgctag ttattgctca gcggtggcag cagccaactc agcttccttt cgggctttgt
5520tagcagccgg atcgatccaa gctgtacctc actattcctt tgccctcgga cgagtgctgg
5580ggcgtcggtt tccactatcg gcgagtactt ctacacagcc atcggtccag acggccgcgc
5640ttctgcgggc gatttgtgta cgcccgacag tcccggctcc ggatcggacg attgcgtcgc
5700atcgaccctg cgcccaagct gcatcatcga aattgccgtc aaccaagctc tgatagagtt
5760ggtcaagacc aatgcggagc atatacgccc ggagccgcgg cgatcctgca agctccggat
5820gcctccgctc gaagtagcgc gtctgctgct ccatacaagc caaccacggc ctccagaaga
5880agatgttggc gacctcgtat tgggaatccc cgaacatcgc ctcgctccag tcaatgaccg
5940ctgttatgcg gccattgtcc gtcaggacat tgttggagcc gaaatccgcg tgcacgaggt
6000gccggacttc ggggcagtcc tcggcccaaa gcatcagctc atcgagagcc tgcgcgacgg
6060acgcactgac ggtgtcgtcc atcacagttt gccagtgata cacatgggga tcagcaatcg
6120cgcatatgaa atcacgccat gtagtgtatt gaccgattcc ttgcggtccg aatgggccga
6180acccgctcgt ctggctaaga tcggccgcag cgatcgcatc catagcctcc gcgaccggct
6240gcagaacagc gggcagttcg gtttcaggca ggtcttgcaa cgtgacaccc tgtgcacggc
6300gggagatgca ataggtcagg ctctcgctga attccccaat gtcaagcact tccggaatcg
6360ggagcgcggc cgatgcaaag tgccgataaa cataacgatc tttgtagaaa ccatcggcgc
6420agctatttac ccgcaggaca tatccacgcc ctcctacatc gaagctgaaa gcacgagatt
6480cttcgccctc cgagagctgc atcaggtcgg agacgctgtc gaacttttcg atcagaaact
6540tctcgacaga cgtcgcggtg agttcaggct tttccatggg tatatctcct tcttaaagtt
6600aaacaaaatt atttctagag ggaaaccgtt gtggtctccc tatagtgagt cgtattaatt
6660tcgcgggatc gagatcgatc caattccaat cccacaaaaa tctgagctta acagcacagt
6720tgctcctctc agagcagaat cgggtattca acaccctcat atcaactact acgttgtgta
6780taacggtcca catgccggta tatacgatga ctggggttgt acaaaggcgg caacaaacgg
6840cgttcccgga gttgcacaca agaaatttgc cactattaca gaggcaagag cagcagctga
6900cgcgtacaca acaagtcagc aaacagacag gttgaacttc atccccaaag gagaagctca
6960actcaagccc aagagctttg ctaaggccct aacaagccca ccaaagcaaa aagcccactg
7020gctcacgcta ggaaccaaaa ggcccagcag tgatccagcc ccaaaagaga tctcctttgc
7080cccggagatt acaatggacg atttcctcta tctttacgat ctaggaagga agttcgaagg
7140tgaaggtgac gacactatgt tcaccactga taatgagaag gttagcctct tcaatttcag
7200aaagaatgct gacccacaga tggttagaga ggcctacgca gcaggtctca tcaagacgat
7260ctacccgagt aacaatctcc aggagatcaa ataccttccc aagaaggtta aagatgcagt
7320caaaagattc aggactaatt gcatcaagaa cacagagaaa gacatatttc tcaagatcag
7380aagtactatt ccagtatgga cgattcaagg cttgcttcat aaaccaaggc aagtaataga
7440gattggagtc tctaaaaagg tagttcctac tgaatctaag gccatgcatg gagtctaaga
7500ttcaaatcga ggatctaaca gaactcgccg tgaagactgg cgaacagttc atacagagtc
7560ttttacgact caatgacaag aagaaaatct tcgtcaacat ggtggagcac gacactctgg
7620tctactccaa aaatgtcaaa gatacagtct cagaagacca aagggctatt gagacttttc
7680aacaaaggat aatttcggga aacctcctcg gattccattg cccagctatc tgtcacttca
7740tcgaaaggac agtagaaaag gaaggtggct cctacaaatg ccatcattgc gataaaggaa
7800aggctatcat tcaagatgcc tctgccgaca gtggtcccaa agatggaccc ccacccacga
7860ggagcatcgt ggaaaaagaa gacgttccaa ccacgtcttc aaagcaagtg gattgatgtg
7920acatctccac tgacgtaagg gatgacgcac aatcccacta tccttcgcaa gacccttcct
7980ctatataagg aagttcattt catttggaga ggacacgctc gagctcattt ctctattact
8040tcagccataa caaaagaact cttttctctt cttattaaac catgaaaaag cctgaactca
8100ccgcgacgtc tgtcgagaag tttctgatcg aaaagttcga cagcgtctcc gacctgatgc
8160agctctcgga gggcgaagaa tctcgtgctt tcagcttcga tgtaggaggg cgtggatatg
8220tcctgcgggt aaatagctgc gccgatggtt tctacaaaga tcgttatgtt tatcggcact
8280ttgcatcggc cgcgctcccg attccggaag tgcttgacat tggggaattc agcgagagcc
8340tgacctattg catctcccgc cgtgcacagg gtgtcacgtt gcaagacctg cctgaaaccg
8400aactgcccgc tgttctgcag ccggtcgcgg aggccatgga tgcgatcgct gcggccgatc
8460ttagccagac gagcgggttc ggcccattcg gaccgcaagg aatcggtcaa tacactacat
8520ggcgtgattt catatgcgcg attgctgatc cccatgtgta tcactggcaa actgtgatgg
8580acgacaccgt cagtgcgtcc gtcgcgcagg ctctcgatga gctgatgctt tgggccgagg
8640actgccccga agtccggcac ctcgtgcacg cggatttcgg ctccaacaat gtcctgacgg
8700acaatggccg cataacagcg gtcattgact ggagcgaggc gatgttcggg gattcccaat
8760acgaggtcgc caacatcttc ttctggaggc cgtggttggc ttgtatggag cagcagacgc
8820gctacttcga gcggaggcat ccggagcttg caggatcgcc gcggctccgg gcgtatatgc
8880tccgcattgg tcttgaccaa ctctatcaga gcttggttga cggcaatttc gatgatgcag
8940cttgggcgca gggtcgatgc gacgcaatcg tccgatccgg agccgggact gtcgggcgta
9000cacaaatcgc ccgcagaagc gcggccgtct ggaccgatgg ctgtgtagaa gtactcgccg
9060atagtggaaa ccgacgcccc agcactcgtc cgagggcaaa ggaatagtga ggtacctaaa
9120gaaggagtgc gtcgaagcag atcgttcaaa catttggcaa taaagtttct taagattgaa
9180tcctgttgcc ggtcttgcga tgattatcat ataatttctg ttgaattacg ttaagcatgt
9240aataattaac atgtaatgca tgacgttatt tatgagatgg gtttttatga ttagagtccc
9300gcaattatac atttaatacg cgatagaaaa caaaatatag cgcgcaaact aggataaatt
9360atcgcgcgcg gtgtcatcta tgttactaga tcgatgtcga atctgatcaa cctgcattaa
9420tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg
9480ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag
9540gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa
9600ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc
9660cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca
9720ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg
9780accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct
9840caatgctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt
9900gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag
9960tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc
10020agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac
10080actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga
10140gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc
10200aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg
10260gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gacattaacc
10320tataaaaata ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa
10380aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatgccggg
10440agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac
10500tatgcggcat cagagcagat tgtactgaga gtgcaccata tggacatatt gtcgttagaa
10560cgcggctaca attaatacat aaccttatgt atcatacaca tacgatttag gtgacactat
10620agaacggcgc gccaagcttt tgatccatgc ccttcatttg ccgcttatta attaatttgg
10680taacagtccg tactaatcag ttacttatcc ttcccccatc ataattaatc ttggtagtct
10740cgaatgccac aacactgact agtctcttgg atcataagaa aaagccaagg aacaaaagaa
10800gacaaaacac aatgagagta tcctttgcat agcaatgtct aagttcataa aattcaaaca
10860aaaacgcaat cacacacagt ggacatcact tatccactag ctgatcagga tcgccgcgtc
10920aagaaaaaaa aactggaccc caaaagccat gcacaacaac acgtactcac aaaggtgtca
10980atcgagcagc ccaaaacatt caccaactca acccatcatg agccctcaca tttgttgttt
11040ctaacccaac ctcaaactcg tattctcttc cgccacctca tttttgttta tttcaacacc
11100cgtcaaactg catgccaccc cgtggccaaa tgtccatgca tgttaacaag acctatgact
11160ataaatagct gcaatctcgg cccaggtttt catcatcaag aaccagttca atatcctagt
11220acaccgtatt aaagaattta agatatactg c
1125179060DNAArtificial SequenceT-DNA of vector pZBL120xKS336 7aattacaacg
gtatatatcc tgccgtcgac tctagaggat ccgcgccgtc gacggatata 60atgagccgta
aacaaagatg attaagtagt aattaatacg tactagtaaa agtggcaaaa 120gataacgaga
aagaaccaat ttctttgcat tcggccttag cggaaggcat atataagctt 180tgattatttt
atttagtgta atgatttcgt acaaccaaag catttattta gtactctcac 240acttgtgtcg
cggccggccg ctacaggaac aggtggtggc ggccctcggc gcgctcgtac 300tgctccacga
tggtgtagtc ctcgttgtgg gaggtgatgt ccagcttgga gtccacgtag 360tagtagccgg
gcagctgcac gggcttcttg gccatgtaga tggacttgaa ctccaccagg 420tagtggccgc
cgtccttcag cttcagggcc ttgtggatct cgcccttcag cacgccgtcg 480cgggggtaca
ggcgctcggt ggaggcctcc cagcccatag tcttcttctg cattacgggg 540ccgtcggagg
ggaagttcac gccgatgaac ttcaccttgt agatgaagga gccgtcctgc 600agggaggagt
cctgggtcac ggtcaccacg ccgccgtcct cgaagttcat cacgcgctcc 660cacttgaagc
cctcggggaa ggacagcttc ttgtagtcgg ggatgtcggc ggggtgcttc 720acgtacacct
tggagccgta ctggaactgg ggggacagga tgtcccaggc gaagggcagg 780gggccgccct
tggtcacctt cagcttggcg gtctgggtgc cctcgtaggg gcggccctcg 840ccctcgccct
cgatctcgaa ctcgtggccg ttcacggagc cctccatgcg caccttgaag 900cgcatgaact
ccttgatgac gtcctcggag gaggccatgg gccgcttggg gggctatgga 960agactttctt
agttagttgt gtgaataagc aatgttggga gaatcgggac tacttatagg 1020ataggaataa
aacagaaaag tattaagtgc taatgaaata tttagactga taattaaaat 1080cttcacgtat
gtccacttga tataaaaacg tcaggaataa aggaagtaca gtagaattta 1140aaggtactct
ttttatatat acccgtgttc tctttttggc tagctagttg cataaaaaat 1200aatctatatt
tttatcatta ttttaaatat cttatgagat ggtaaatatt tatcataatt 1260ttttttacta
ttatttatta tttgtgtgtg taatacatat agaagttaat tacaaatttt 1320atttactttt
tcattatttt gatatgattc accattaatt tagtgttatt atttataata 1380gttcatttta
atctttttgt atatattatg cgtgcagtac ttttttccta catataacta 1440ctattacatt
ttatttatat aatattttta ttaatgaatt ttcgtgataa tatgtaatat 1500tgttcattat
tatttcagat tttttaaaaa tatttgtgtt attatttatg aaatatgtaa 1560tttttttagt
atttgatttt atgatgataa agtgttctaa attcaaaaga agggggaaag 1620cgtaaacatt
aaaaaacgtc atcaaacaaa aacaaaatct tgttaataaa gataaaactg 1680tttgttttga
tcactgttat ttcgtaatat aaaaacatta tttatattta tattgttgac 1740aaccaaattt
gcctatcaaa tctaaccaat ataatgcatg cgtggcaggt aatgtactac 1800catgaactta
agtcatgaca taataaaccg tgaatctgac caatgcatgt acctanctaa 1860attgtatttg
tgacacgaag caaatgattc aattcacaat ggagatggga aacaaataat 1920gaagaaccca
gaactaagaa agcttttctg aaaaataaaa taaaggcaat gtcaaaagta 1980tactgcatca
tcagtccaga aagcacatga tattttttta tcagtatcaa tgcagctagt 2040tttattttac
aatatcgata tagctagttt aaatatattg cagctagatt tataaatatt 2100tgtgttatta
tttatcattt gtgtaatcct gtttttagta ttttagttta tatatgatga 2160taatgtattc
caaatttaaa agaagggaaa taaatttaaa caagaaaaaa agtcatcaaa 2220caaaaaacaa
atgaaagggt ggaaagatgt taccatgtaa tgtgaatgtt acagtatttc 2280ttttattata
gagttaacaa attaactaat atgattttgt taataatgat aaaatatttt 2340ttttattatt
atttcataat ataaaaatag tttacttaat ataaaaaaaa ttctatcgtt 2400cacaacaaag
ttggccacct aatttaacca tgcatgtacc catggaccat attaggtaac 2460catcaaacct
gatgaagaga taaagagatg aagacttaag tcataacaca aaaccataaa 2520aaacaaaaat
acaatcaacc gtcaatctga ccaatgcatg aaaaagctgc aatagtgagt 2580ggcgacacaa
agcacatgat tttcttacaa cggagataaa accaaaaaaa tatttcatga 2640acaacctaga
acaaataaag cttttatata ataaatatat aaataaataa aggctatgga 2700ataatatact
tcaatatatt tggattaaat aaattgttgg cggggttgat atatttatac 2760acacctaaag
tcacttcaat ctcattttca cttaactttt attttttttt tctttttatt 2820tatcataaag
agaatattga taatatactt tttaacatat ttttatgaca ttttttattg 2880gtgaaaactt
attaaaaatc ataaattttg taagttagat ttatttaaag agttcctctt 2940cttattttaa
attttttaat aaatttttaa ataactaaaa tttgtgttaa aaatgttaaa 3000aaagtgtgtt
attaaccctt ctcttcgagg atccaagctt gttgaaacat ccctgaagtg 3060tctcatttta
ttttatttat tctttgctga taaaaaaata aaataaaaga agctaagcac 3120acggtcaacc
attgctctac tgctaaaagg gttatgtgta gtgttttact gcataaatta 3180tgcagcaaac
aagacaactc aaattaaaaa atttcctttg cttgtttttt tgttgtctct 3240gacttgactt
tcttgtggaa gttggttgta taaggattgg gacaccattg tccttcttaa 3300tttaatttta
ttctttgctg ataaaaaaaa aaatttcata tagtgttaaa taataatttg 3360ttaaataacc
aaaaagtcaa atatgtttac tctcgtttaa ataattgaga ttcgtccagc 3420aaggctaaac
gattgtatag atttatgaca atatttactt ttttatagat aaatgttata 3480ttataataaa
tttatataca tatattatat gttatttatt attattttaa atccttcaat 3540attttatcaa
accaactcat aatttttttt ttatctgtaa gaagcaataa aattaaatag 3600acccacttta
aggatgatcc aacctttata cagagtaaga gagttcaaat agtacccttt 3660catatacata
tcaactaaaa tattagaaat atcatggatc aaaccttata aagacattaa 3720ataagtggat
aagtataata tataaatggg tagtatataa tatataaatg gatacaaact 3780tctctcttta
taattgttat gtctccttaa catcctaata taatacataa gtgggtaata 3840tataatatat
aaatggagac aaacttcttc cattataatt gttatgtctt cttaacactt 3900atgtctcgtt
cacaatgcta aggttagaat tgtttagaaa gtcttatagt acacatttgt 3960ttttgtacta
tttgaagcat tccataagcc gtcacgattc agatgattta taataataag 4020aggaaattta
tcatagaaca ataaggtgca tagatagagt gttaatatat cataacatcc 4080tttgtttatt
catagaagaa gtgagatgga gctcagttat tatactgtta catggtcgga 4140tacaatattc
catgctctcc atgagctctt acacctacat gcattttagt tcatacttgc 4200ggccgcgttg
cacacactga tcatcgctgg aggttgtatc ccctcttcca tatccgaaag 4260aacatccttc
agctggacat cacctgcctc aaacagactg aaatctgcca gacctacacc 4320atccttgctt
ccttcgtcag ccgcagaaca cccatcgaac aggcatccga tgttgcagtc 4380gaagtctgcg
tcagcgaacc actcgctcag attgattgag ctgccaaagt ttggtctcga 4440catggtgtct
agctcgtaat ccatgcaagg gctccacaag ccctcttcga tgctgtcgtc 4500gactgtggtg
aggggctccg cggtgtcgtc aggcaccgcg ttctcatcgg gttctgcact 4560gccatccggt
gtcttggctt cgcttgactc cggctcttgc tctgtagttc cggtttcgct 4620ctgatcaggt
tgaggttctt ggttgagtgc cggcaccacc tgtggctcct gttggagctg 4680tgccaggaac
agcgggtggt ccaggtagca gctgatgtcg aagttggtta cagcattgac 4740gccacggtat
tcaatggccg caaggtcata ggccttggct gcctcttctt gagtgtcaaa 4800tgttcccaag
tagaggtact tgttcccaaa gactcgccca atccgtgcct cccacctccc 4860gttgtggtga
tgcctggcga cgcctctgta cttggagacg cccctggaga agccgctgct 4920cctgcggcgg
agggaggcca ggtactcctc ccgggacacg gcctccatct ccggcatctc 4980gctggagtaa
tcctccacag ggaagttgag cagagtctca ggaccccagt acttgagagc 5040tgcgaggtca
taggcacgag cagctgcctc ctcgctgtca tacgccccca ggtagacttg 5100cctgcctttc
ttcttgttgt ggagcgcggc gaggcagtgc ttgtcccaga gatgcgcctc 5160gaacctgccc
gtccacctgt gcctggtgac tcccctgtag acggagctcc tcttccccgc 5220ggcggcggcg
gcggggtcct tgcggatcct cttattaggc tcggcgccgg ccttggccgt 5280cgccgccctc
cgcctctttc cgggagggac gaggacggtg tccgcggaga cggaggacga 5340ggaggaggac
ggcgacggtg gcggaggaga ctgccgttga gatctctcca tgcggccgca 5400gtatatctta
aattctttaa tacggtgtac taggatattg aactggttct tgatgatgaa 5460aacctgggcc
gagattgcag ctatttatag tcataggtct tgttaacatg catggacatt 5520tggccacggg
gtggcatgca gtttgacggg tgttgaaata aacaaaaatg aggtggcgga 5580agagaatacg
agtttgaggt tgggttagaa acaacaaatg tgagggctca tgatgggttg 5640agttggtgaa
tgttttgggc tgctcgattg acacctttgt gagtacgtgt tgttgtgcat 5700ggcttttggg
gtccagtttt tttttcttga cgcggcgatc ctgatcagct agtggataag 5760tgatgtccac
tgtgtgtgat tgcgtttttg tttgaatttt atgaacttag acattgctat 5820gcaaaggata
ctctcattgt gttttgtctt cttttgttcc ttggcttttt cttatgatcc 5880aagagactag
tcagtgttgt ggcattcgag actaccaaga ttaattatga tgggggaagg 5940ataagtaact
gattagtacg gactgttacc aaattaatta ataagcggca aatgaagggc 6000atggatcaaa
agcttggcgc gaattcactg gccgtcgttt tacaacgtcg tgactgggaa 6060aaccctggcg
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 6120aatagcgaag
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 6180tggatcgatc
cgtcgatcga ccaaagcggc catcgtgcct ccccactcct gcagttcggg 6240ggcatggatg
cgcggatagc cgctgctggt ttcctggatg ccgacggatt tgcactgccg 6300gtagaactcc
gcgaggtcgt ccagcctcag gcagcagctg aaccaactcg cgaggggatc 6360gagcccctgc
tgagcctcga catgttgtcg caaaattcgc cctggacccg cccaacgatt 6420tgtcgtcact
gtcaaggttt gacctgcact tcatttgggg cccacataca ccaaaaaaat 6480gctgcataat
tctcggggca gcaagtcggt tacccggccg ccgtgctgga ccgggttgaa 6540tggtgcccgt
aactttcggt agagcggacg gccaatactc aacttcaagg aatctcaccc 6600atgcgcgccg
gcggggaacc ggagttccct tcagtgaacg ttattagttc gccgctcggt 6660gtgtcgtaga
tactagcccc tggggccttt tgaaatttga ataagattta tgtaatcagt 6720cttttaggtt
tgaccggttc tgccgctttt tttaaaattg gatttgtaat aataaaacgc 6780aattgtttgt
tattgtggcg ctctatcata gatgtcgcta taaacctatt cagcacaata 6840tattgttttc
attttaatat tgtacatata agtagtaggg tacaatcagt aaattgaacg 6900gagaatatta
ttcataaaaa tacgatagta acgggtgata tattcattag aatgaaccga 6960aaccggcggt
aaggatctga gctacacatg ctcaggtttt ttacaacgtg cacaacagaa 7020ttgaaagcaa
atatcatgcg atcataggcg tctcgcatat ctcattaaag cagggggtgg 7080gcgaagaact
ccagcatgag atccccgcgc tggaggatca tccagccggc gtcccggaaa 7140acgattccga
agcccaacct ttcatagaag gcggcggtgg aatcgaaatc tcgtgatggc 7200aggttgggcg
tcgcttggtc ggtcatttcg aaccccagag tcccgctcag aagaactcgt 7260caagaaggcg
atagaaggcg atgcgctgcg aatcgggagc ggcgataccg taaagcacga 7320ggaagcggtc
agcccattcg ccgccaagct cttcagcaat atcacgggta gccaacgcta 7380tgtcctgata
gcggtccgcc acacccagcc ggccacagtc gatgaatcca gaaaagcggc 7440cattttccac
catgatattc ggcaagcagg catcgccatg ggtcacgacg agatcctcgc 7500cgtcgggcat
gcgcgccttg agcctggcga acagttcggc tggcgcgagc ccctgatgct 7560cttcgtccag
atcatcctga tcgacaagac cggcttccat ccgagtacgt gctcgctcga 7620tgcgatgttt
cgcttggtgg tcgaatgggc aggtagccgg atcaagcgta tgcagccgcc 7680gcattgcatc
agccatgatg gatactttct cggcaggagc aaggtgagat gacaggagat 7740cctgccccgg
cacttcgccc aatagcagcc agtcccttcc cgcttcagtg acaacgtcga 7800gcacagctgc
gcaaggaacg cccgtcgtgg ccagccacga tagccgcgct gcctcgtcct 7860gcagttcatt
cagggcaccg gacaggtcgg tcttgacaaa aagaaccggg cgcccctgcg 7920ctgacagccg
gaacacggcg gcatcagagc agccgattgt ctgttgtgcc cagtcatagc 7980cgaatagcct
ctccacccaa gcggccggag aacctgcgtg caatccatct tgttcaatca 8040tgcgaaacga
tccccgcaag cttggagact ggtgatttca gcgtgtcctc tccaaatgaa 8100atgaacttcc
ttatatagag gaagggtctt gcgaaggata gtgggattgt gcgtcatccc 8160ttacgtcagt
ggagatatca catcaatcca cttgctttga agacgtggtt ggaacgtctt 8220ctttttccac
gatgctcctc gtgggtgggg gtccatcttt gggaccactg tcggcagagg 8280catcttcaac
gatggccttt cctttatcgc aatgatggca tttgtaggag ccaccttcct 8340tttccactat
cttcacaata aagtgacaga tagctgggca atggaatccg aggaggtttc 8400cggatattac
cctttgttga aaagtctcaa ttgccctttg gtcttctgag actgtatctt 8460tgatattttt
ggagtagaca agcgtgtcgt gctccaccat gttgacgaag attttcttct 8520tgtcattgag
tcgtaagaga ctctgtatga actgttcgcc agtctttacg gcgagttctg 8580ttaggtcctc
tatttgaatc tttgactcca tggcctttga ttcagtggga actacctttt 8640tagagactcc
aatctctatt acttgccttg gtttgtgaag caagccttga atcgtccata 8700ctggaatagt
acttctgatc ttgagaaata tatctttctc tgtgttcttg atgcagttag 8760tcctgaatct
tttgactgca tctttaacct tcttgggaag gtatttgatc tcctggagat 8820tattgctcgg
gtagatcgtc ttgatgagac ctgctgcgta agcctctcta accatctgtg 8880ggttagcatt
ctttctgaaa ttgaaaaggc taatcttctc attatcagtg gtgaacatgg 8940tatcgtcacc
ttctccgtcg aacttcctga ctagatcgta gagatagagg aagtcgtcca 9000ttgtgatctc
tggggcaaag gagatctgaa ttatcattta caattgaata tatcctgcca
9060834DNAArtificial SequencePCR primer MWG339 8gaattcgcgg ccgccatgag
aaggtctccc tctg 34934DNAArtificial
SequencePCR primer MWG340 9gaattcgcgg ccgctcaaac cctaaattca cacg
341011299DNAArtificial Sequencevector pKS333
10ggccgccatg agaaggtctc cctctgtttc tacttcctcc tcctcctcct cctcctgcgt
60cggcggcggc ggcttcgaca gcaataatct caatctcgcc gcccctccgc gccggccgca
120atcggagaag accggagcga aacgccggaa gcggaatcag gacgacgcca aatgcgagat
180tgagaatcgt aacggtaata acaacaacag cagcaacaac aatgcctctt ccggccgccg
240gagctccatt tacagaggag tcactaggca ccgatggacc ggccggttcg aagcgcatct
300ctgggacaag agttcgtgga atagcattca gaacaaaaaa ggaaggcaag tttatttggg
360agcatacgat aacgaggaag ctgccgcccg aacttatgac ctcgctgccc tcaagtactg
420gggtcccgga accaccctca atttcccggt agagtcgtac aggaatgaaa tagaagaaat
480gcggaaagtt acgaaggagg agtatttggc gtcgttacgg cggcggagca gcggattttc
540gagaggcgta tcgaagtacc gcggcgtggc ccgccaccac cacaacggcc ggtgggaggc
600gcggatcggc cgtgttttcg gaagcaaata tctttacctg ggaacttaca acacacaaga
660ggaagcagca gcagcatatg acatggctgc aattgagtac agaggggtca atgcagtgac
720caatttcgac atcagcaatt acattgggcg gctggagaat aaatcatcag tttttccagc
780agcagagcag cccctacagc ccaactgctc ccctgcttcc tcttctgagg aaggcgaagt
840agtacagcag caacagcaac agacgacgat ggcgttctca ggctcgcccc tccagttccc
900gtcgatggag aacagcccga cgacaatgga ggaggatcat gatctgcatt ggtcattcct
960agacacgggg ttcgtgcagg tccccgacct ccccctcgag aagtctggcg aattgcctga
1020cctgttcttt gatgagatcg ggttcgagga cgacatcggg ttgatattcg aggcgagctt
1080ggaagacgag aggtgcgggg aggggggtga gaagttagaa gatgtgggga aaatggagat
1140gatgaagagt gatcatgagg agagggggtt gttctcgact acttcgccat cttcgtcgtc
1200gataaccacc tcggtttcgt gtgaatttag ggtttgagcg gccgcaagta tgaactaaaa
1260tgcatgtagg tgtaagagct catggagagc atggaatatt gtatccgacc atgtaacagt
1320ataataactg agctccatct cacttcttct atgaataaac aaaggatgtt atgatatatt
1380aacactctat ctatgcacct tattgttcta tgataaattt cctcttatta ttataaatca
1440tctgaatcgt gacggcttat ggaatgcttc aaatagtaca aaaacaaatg tgtactataa
1500gactttctaa acaattctaa ccttagcatt gtgaacgaga cataagtgtt aagaagacat
1560aacaattata atggaagaag tttgtctcca tttatatatt atatattacc cacttatgta
1620ttatattagg atgttaagga gacataacaa ttataaagag agaagtttgt atccatttat
1680atattatata ctacccattt atatattata cttatccact tatttaatgt ctttataagg
1740tttgatccat gatatttcta atattttagt tgatatgtat atgaaagggt actatttgaa
1800ctctcttact ctgtataaag gttggatcat ccttaaagtg ggtctattta attttattgc
1860ttcttacaga taaaaaaaaa attatgagtt ggtttgataa aatattgaag gatttaaaat
1920aataataaat aacatataat atatgtatat aaatttatta taatataaca tttatctata
1980aaaaagtaaa tattgtcata aatctataca atcgtttagc cttgctggac gaatctcaat
2040tatttaaacg agagtaaaca tatttgactt tttggttatt taacaaatta ttatttaaca
2100ctatatgaaa tttttttttt tatcagcaaa gaataaaatt aaattaagaa ggacaatggt
2160gtcccaatcc ttatacaacc aacttccaca agaaagtcaa gtcagagaca acaaaaaaac
2220aagcaaagga aattttttaa tttgagttgt cttgtttgct gcataattta tgcagtaaaa
2280cactacacat aaccctttta gcagtagagc aatggttgac cgtgtgctta gcttctttta
2340ttttattttt ttatcagcaa agaataaata aaataaaatg agacacttca gggatgtttc
2400aacaagcttg gatcctcgaa gagaagggtt aataacacac ttttttaaca tttttaacac
2460aaattttagt tatttaaaaa tttattaaaa aatttaaaat aagaagagga actctttaaa
2520taaatctaac ttacaaaatt tatgattttt aataagtttt caccaataaa aaatgtcata
2580aaaatatgtt aaaaagtata ttatcaatat tctctttatg ataaataaaa agaaaaaaaa
2640aataaaagtt aagtgaaaat gagattgaag tgactttagg tgtgtataaa tatatcaacc
2700ccgccaacaa tttatttaat ccaaatatat tgaagtatat tattccatag cctttattta
2760tttatatatt tattatataa aagctttatt tgttctaggt tgttcatgaa atattttttt
2820ggttttatct ccgttgtaag aaaatcatgt gctttgtgtc gccactcact attgcagctt
2880tttcatgcat tggtcagatt gacggttgat tgtatttttg ttttttatgg ttttgtgtta
2940tgacttaagt cttcatctct ttatctcttc atcaggtttg atggttacct aatatggtcc
3000atgggtacat gcatggttaa attaggtggc caactttgtt gtgaacgata gaattttttt
3060tatattaagt aaactatttt tatattatga aataataata aaaaaaatat tttatcatta
3120ttaacaaaat catattagtt aatttgttaa ctctataata aaagaaatac tgtaacattc
3180acattacatg gtaacatctt tccacccttt catttgtttt ttgtttgatg actttttttc
3240ttgtttaaat ttatttccct tcttttaaat ttggaataca ttatcatcat atataaacta
3300aaatactaaa aacaggatta cacaaatgat aaataataac acaaatattt ataaatctag
3360ctgcaatata tttaaactag ctatatcgat attgtaaaat aaaactagct gcattgatac
3420tgataaaaaa atatcatgtg ctttctggac tgatgatgca gtatactttt gacattgcct
3480ttattttatt tttcagaaaa gctttcttag ttctgggttc ttcattattt gtttcccatc
3540tccattgtga attgaatcat ttgcttcgtg tcacaaatac aatttagnta ggtacatgca
3600ttggtcagat tcacggttta ttatgtcatg acttaagttc atggtagtac attacctgcc
3660acgcatgcat tatattggtt agatttgata ggcaaatttg gttgtcaaca atataaatat
3720aaataatgtt tttatattac gaaataacag tgatcaaaac aaacagtttt atctttatta
3780acaagatttt gtttttgttt gatgacgttt tttaatgttt acgctttccc ccttcttttg
3840aatttagaac actttatcat cataaaatca aatactaaaa aaattacata tttcataaat
3900aataacacaa atatttttaa aaaatctgaa ataataatga acaatattac atattatcac
3960gaaaattcat taataaaaat attatataaa taaaatgtaa tagtagttat atgtaggaaa
4020aaagtactgc acgcataata tatacaaaaa gattaaaatg aactattata aataataaca
4080ctaaattaat ggtgaatcat atcaaaataa tgaaaaagta aataaaattt gtaattaact
4140tctatatgta ttacacacac aaataataaa taatagtaaa aaaaattatg ataaatattt
4200accatctcat aagatattta aaataatgat aaaaatatag attatttttt atgcaactag
4260ctagccaaaa agagaacacg ggtatatata aaaagagtac ctttaaattc tactgtactt
4320cctttattcc tgacgttttt atatcaagtg gacatacgtg aagattttaa ttatcagtct
4380aaatatttca ttagcactta atacttttct gttttattcc tatcctataa gtagtcccga
4440ttctcccaac attgcttatt cacacaacta actaagaaag tcttccatag ccccccaagc
4500ggcccatggc ctcctccgag gacgtcatca aggagttcat gcgcttcaag gtgcgcatgg
4560agggctccgt gaacggccac gagttcgaga tcgagggcga gggcgagggc cgcccctacg
4620agggcaccca gaccgccaag ctgaaggtga ccaagggcgg ccccctgccc ttcgcctggg
4680acatcctgtc cccccagttc cagtacggct ccaaggtgta cgtgaagcac cccgccgaca
4740tccccgacta caagaagctg tccttccccg agggcttcaa gtgggagcgc gtgatgaact
4800tcgaggacgg cggcgtggtg accgtgaccc aggactcctc cctgcaggac ggctccttca
4860tctacaaggt gaagttcatc ggcgtgaact tcccctccga cggccccgta atgcagaaga
4920agactatggg ctgggaggcc tccaccgagc gcctgtaccc ccgcgacggc gtgctgaagg
4980gcgagatcca caaggccctg aagctgaagg acggcggcca ctacctggtg gagttcaagt
5040ccatctacat ggccaagaag cccgtgcagc tgcccggcta ctactacgtg gactccaagc
5100tggacatcac ctcccacaac gaggactaca ccatcgtgga gcagtacgag cgcgccgagg
5160gccgccacca cctgttcctg tagcggccgg ccgcgacaca agtgtgagag tactaaataa
5220atgctttggt tgtacgaaat cattacacta aataaaataa tcaaagctta tatatgcctt
5280ccgctaaggc cgaatgcaaa gaaattggtt ctttctcgtt atcttttgcc acttttacta
5340gtacgtatta attactactt aatcatcttt gtttacggct cattatatcc gtcgacggcg
5400cgggccgctc tagaactagt ggatccgtcg acggcgcgcc cgatcatccg gatatagttc
5460ctcctttcag caaaaaaccc ctcaagaccc gtttagaggc cccaaggggt tatgctagtt
5520attgctcagc ggtggcagca gccaactcag cttcctttcg ggctttgtta gcagccggat
5580cgatccaagc tgtacctcac tattcctttg ccctcggacg agtgctgggg cgtcggtttc
5640cactatcggc gagtacttct acacagccat cggtccagac ggccgcgctt ctgcgggcga
5700tttgtgtacg cccgacagtc ccggctccgg atcggacgat tgcgtcgcat cgaccctgcg
5760cccaagctgc atcatcgaaa ttgccgtcaa ccaagctctg atagagttgg tcaagaccaa
5820tgcggagcat atacgcccgg agccgcggcg atcctgcaag ctccggatgc ctccgctcga
5880agtagcgcgt ctgctgctcc atacaagcca accacggcct ccagaagaag atgttggcga
5940cctcgtattg ggaatccccg aacatcgcct cgctccagtc aatgaccgct gttatgcggc
6000cattgtccgt caggacattg ttggagccga aatccgcgtg cacgaggtgc cggacttcgg
6060ggcagtcctc ggcccaaagc atcagctcat cgagagcctg cgcgacggac gcactgacgg
6120tgtcgtccat cacagtttgc cagtgataca catggggatc agcaatcgcg catatgaaat
6180cacgccatgt agtgtattga ccgattcctt gcggtccgaa tgggccgaac ccgctcgtct
6240ggctaagatc ggccgcagcg atcgcatcca tagcctccgc gaccggctgc agaacagcgg
6300gcagttcggt ttcaggcagg tcttgcaacg tgacaccctg tgcacggcgg gagatgcaat
6360aggtcaggct ctcgctgaat tccccaatgt caagcacttc cggaatcggg agcgcggccg
6420atgcaaagtg ccgataaaca taacgatctt tgtagaaacc atcggcgcag ctatttaccc
6480gcaggacata tccacgccct cctacatcga agctgaaagc acgagattct tcgccctccg
6540agagctgcat caggtcggag acgctgtcga acttttcgat cagaaacttc tcgacagacg
6600tcgcggtgag ttcaggcttt tccatgggta tatctccttc ttaaagttaa acaaaattat
6660ttctagaggg aaaccgttgt ggtctcccta tagtgagtcg tattaatttc gcgggatcga
6720gatcgatcca attccaatcc cacaaaaatc tgagcttaac agcacagttg ctcctctcag
6780agcagaatcg ggtattcaac accctcatat caactactac gttgtgtata acggtccaca
6840tgccggtata tacgatgact ggggttgtac aaaggcggca acaaacggcg ttcccggagt
6900tgcacacaag aaatttgcca ctattacaga ggcaagagca gcagctgacg cgtacacaac
6960aagtcagcaa acagacaggt tgaacttcat ccccaaagga gaagctcaac tcaagcccaa
7020gagctttgct aaggccctaa caagcccacc aaagcaaaaa gcccactggc tcacgctagg
7080aaccaaaagg cccagcagtg atccagcccc aaaagagatc tcctttgccc cggagattac
7140aatggacgat ttcctctatc tttacgatct aggaaggaag ttcgaaggtg aaggtgacga
7200cactatgttc accactgata atgagaaggt tagcctcttc aatttcagaa agaatgctga
7260cccacagatg gttagagagg cctacgcagc aggtctcatc aagacgatct acccgagtaa
7320caatctccag gagatcaaat accttcccaa gaaggttaaa gatgcagtca aaagattcag
7380gactaattgc atcaagaaca cagagaaaga catatttctc aagatcagaa gtactattcc
7440agtatggacg attcaaggct tgcttcataa accaaggcaa gtaatagaga ttggagtctc
7500taaaaaggta gttcctactg aatctaaggc catgcatgga gtctaagatt caaatcgagg
7560atctaacaga actcgccgtg aagactggcg aacagttcat acagagtctt ttacgactca
7620atgacaagaa gaaaatcttc gtcaacatgg tggagcacga cactctggtc tactccaaaa
7680atgtcaaaga tacagtctca gaagaccaaa gggctattga gacttttcaa caaaggataa
7740tttcgggaaa cctcctcgga ttccattgcc cagctatctg tcacttcatc gaaaggacag
7800tagaaaagga aggtggctcc tacaaatgcc atcattgcga taaaggaaag gctatcattc
7860aagatgcctc tgccgacagt ggtcccaaag atggaccccc acccacgagg agcatcgtgg
7920aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga ttgatgtgac atctccactg
7980acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct atataaggaa
8040gttcatttca tttggagagg acacgctcga gctcatttct ctattacttc agccataaca
8100aaagaactct tttctcttct tattaaacca tgaaaaagcc tgaactcacc gcgacgtctg
8160tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag ctctcggagg
8220gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc ctgcgggtaa
8280atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt gcatcggccg
8340cgctcccgat tccggaagtg cttgacattg gggaattcag cgagagcctg acctattgca
8400tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa ctgcccgctg
8460ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt agccagacga
8520gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg cgtgatttca
8580tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac gacaccgtca
8640gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac tgccccgaag
8700tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac aatggccgca
8760taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac gaggtcgcca
8820acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc tacttcgagc
8880ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc cgcattggtc
8940ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct tgggcgcagg
9000gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt cgggcgtaca caaatcgccc
9060gcagaagcgc ggccgtctgg accgatggct gtgtagaagt actcgccgat agtggaaacc
9120gacgccccag cactcgtccg agggcaaagg aatagtgagg tacctaaaga aggagtgcgt
9180cgaagcagat cgttcaaaca tttggcaata aagtttctta agattgaatc ctgttgccgg
9240tcttgcgatg attatcatat aatttctgtt gaattacgtt aagcatgtaa taattaacat
9300gtaatgcatg acgttattta tgagatgggt ttttatgatt agagtcccgc aattatacat
9360ttaatacgcg atagaaaaca aaatatagcg cgcaaactag gataaattat cgcgcgcggt
9420gtcatctatg ttactagatc gatgtcgaat ctgatcaacc tgcattaatg aatcggccaa
9480cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct cactgactcg
9540ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg
9600ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag
9660gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac
9720gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga
9780taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt
9840accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca atgctcacgc
9900tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc
9960cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta
10020agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat
10080gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca
10140gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct
10200tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt
10260acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct
10320cagtggaacg aaaactcacg ttaagggatt ttggtcatga cattaaccta taaaaatagg
10380cgtatcacga ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac
10440atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag cagacaagcc
10500cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta tgcggcatca
10560gagcagattg tactgagagt gcaccatatg gacatattgt cgttagaacg cggctacaat
10620taatacataa ccttatgtat catacacata cgatttaggt gacactatag aacggcgcgc
10680caagcttttg atccatgccc ttcatttgcc gcttattaat taatttggta acagtccgta
10740ctaatcagtt acttatcctt cccccatcat aattaatctt ggtagtctcg aatgccacaa
10800cactgactag tctcttggat cataagaaaa agccaaggaa caaaagaaga caaaacacaa
10860tgagagtatc ctttgcatag caatgtctaa gttcataaaa ttcaaacaaa aacgcaatca
10920cacacagtgg acatcactta tccactagct gatcaggatc gccgcgtcaa gaaaaaaaaa
10980ctggacccca aaagccatgc acaacaacac gtactcacaa aggtgtcaat cgagcagccc
11040aaaacattca ccaactcaac ccatcatgag ccctcacatt tgttgtttct aacccaacct
11100caaactcgta ttctcttccg ccacctcatt tttgtttatt tcaacacccg tcaaactgca
11160tgccaccccg tggccaaatg tccatgcatg ttaacaagac ctatgactat aaatagctgc
11220aatctcggcc caggttttca tcatcaagaa ccagttcaat atcctagtac accgtattaa
11280agaatttaag atatactgc
11299119142DNAArtificial SequenceT-DNA of vector pZBL120xKS333
11aattacaacg gtatatatcc tgccgtcgac tctagaggat ccgcgccgtc gacggatcca
60ctagttctag agcggcccgc gccgtcgacg gatataatga gccgtaaaca aagatgatta
120agtagtaatt aatacgtact agtaaaagtg gcaaaagata acgagaaaga accaatttct
180ttgcattcgg ccttagcgga aggcatatat aagctttgat tattttattt agtgtaatga
240tttcgtacaa ccaaagcatt tatttagtac tctcacactt gtgtcgcggc cggccgctac
300aggaacaggt ggtggcggcc ctcggcgcgc tcgtactgct ccacgatggt gtagtcctcg
360ttgtgggagg tgatgtccag cttggagtcc acgtagtagt agccgggcag ctgcacgggc
420ttcttggcca tgtagatgga cttgaactcc accaggtagt ggccgccgtc cttcagcttc
480agggccttgt ggatctcgcc cttcagcacg ccgtcgcggg ggtacaggcg ctcggtggag
540gcctcccagc ccatagtctt cttctgcatt acggggccgt cggaggggaa gttcacgccg
600atgaacttca ccttgtagat gaaggagccg tcctgcaggg aggagtcctg ggtcacggtc
660accacgccgc cgtcctcgaa gttcatcacg cgctcccact tgaagccctc ggggaaggac
720agcttcttgt agtcggggat gtcggcgggg tgcttcacgt acaccttgga gccgtactgg
780aactgggggg acaggatgtc ccaggcgaag ggcagggggc cgcccttggt caccttcagc
840ttggcggtct gggtgccctc gtaggggcgg ccctcgccct cgccctcgat ctcgaactcg
900tggccgttca cggagccctc catgcgcacc ttgaagcgca tgaactcctt gatgacgtcc
960tcggaggagg ccatgggccg cttggggggc tatggaagac tttcttagtt agttgtgtga
1020ataagcaatg ttgggagaat cgggactact tataggatag gaataaaaca gaaaagtatt
1080aagtgctaat gaaatattta gactgataat taaaatcttc acgtatgtcc acttgatata
1140aaaacgtcag gaataaagga agtacagtag aatttaaagg tactcttttt atatataccc
1200gtgttctctt tttggctagc tagttgcata aaaaataatc tatattttta tcattatttt
1260aaatatctta tgagatggta aatatttatc ataatttttt ttactattat ttattatttg
1320tgtgtgtaat acatatagaa gttaattaca aattttattt actttttcat tattttgata
1380tgattcacca ttaatttagt gttattattt ataatagttc attttaatct ttttgtatat
1440attatgcgtg cagtactttt ttcctacata taactactat tacattttat ttatataata
1500tttttattaa tgaattttcg tgataatatg taatattgtt cattattatt tcagattttt
1560taaaaatatt tgtgttatta tttatgaaat atgtaatttt tttagtattt gattttatga
1620tgataaagtg ttctaaattc aaaagaaggg ggaaagcgta aacattaaaa aacgtcatca
1680aacaaaaaca aaatcttgtt aataaagata aaactgtttg ttttgatcac tgttatttcg
1740taatataaaa acattattta tatttatatt gttgacaacc aaatttgcct atcaaatcta
1800accaatataa tgcatgcgtg gcaggtaatg tactaccatg aacttaagtc atgacataat
1860aaaccgtgaa tctgaccaat gcatgtacct anctaaattg tatttgtgac acgaagcaaa
1920tgattcaatt cacaatggag atgggaaaca aataatgaag aacccagaac taagaaagct
1980tttctgaaaa ataaaataaa ggcaatgtca aaagtatact gcatcatcag tccagaaagc
2040acatgatatt tttttatcag tatcaatgca gctagtttta ttttacaata tcgatatagc
2100tagtttaaat atattgcagc tagatttata aatatttgtg ttattattta tcatttgtgt
2160aatcctgttt ttagtatttt agtttatata tgatgataat gtattccaaa tttaaaagaa
2220gggaaataaa tttaaacaag aaaaaaagtc atcaaacaaa aaacaaatga aagggtggaa
2280agatgttacc atgtaatgtg aatgttacag tatttctttt attatagagt taacaaatta
2340actaatatga ttttgttaat aatgataaaa tatttttttt attattattt cataatataa
2400aaatagttta cttaatataa aaaaaattct atcgttcaca acaaagttgg ccacctaatt
2460taaccatgca tgtacccatg gaccatatta ggtaaccatc aaacctgatg aagagataaa
2520gagatgaaga cttaagtcat aacacaaaac cataaaaaac aaaaatacaa tcaaccgtca
2580atctgaccaa tgcatgaaaa agctgcaata gtgagtggcg acacaaagca catgattttc
2640ttacaacgga gataaaacca aaaaaatatt tcatgaacaa cctagaacaa ataaagcttt
2700tatataataa atatataaat aaataaaggc tatggaataa tatacttcaa tatatttgga
2760ttaaataaat tgttggcggg gttgatatat ttatacacac ctaaagtcac ttcaatctca
2820ttttcactta acttttattt tttttttctt tttatttatc ataaagagaa tattgataat
2880atacttttta acatattttt atgacatttt ttattggtga aaacttatta aaaatcataa
2940attttgtaag ttagatttat ttaaagagtt cctcttctta ttttaaattt tttaataaat
3000ttttaaataa ctaaaatttg tgttaaaaat gttaaaaaag tgtgttatta acccttctct
3060tcgaggatcc aagcttgttg aaacatccct gaagtgtctc attttatttt atttattctt
3120tgctgataaa aaaataaaat aaaagaagct aagcacacgg tcaaccattg ctctactgct
3180aaaagggtta tgtgtagtgt tttactgcat aaattatgca gcaaacaaga caactcaaat
3240taaaaaattt cctttgcttg tttttttgtt gtctctgact tgactttctt gtggaagttg
3300gttgtataag gattgggaca ccattgtcct tcttaattta attttattct ttgctgataa
3360aaaaaaaaat ttcatatagt gttaaataat aatttgttaa ataaccaaaa agtcaaatat
3420gtttactctc gtttaaataa ttgagattcg tccagcaagg ctaaacgatt gtatagattt
3480atgacaatat ttactttttt atagataaat gttatattat aataaattta tatacatata
3540ttatatgtta tttattatta ttttaaatcc ttcaatattt tatcaaacca actcataatt
3600ttttttttat ctgtaagaag caataaaatt aaatagaccc actttaagga tgatccaacc
3660tttatacaga gtaagagagt tcaaatagta ccctttcata tacatatcaa ctaaaatatt
3720agaaatatca tggatcaaac cttataaaga cattaaataa gtggataagt ataatatata
3780aatgggtagt atataatata taaatggata caaacttctc tctttataat tgttatgtct
3840ccttaacatc ctaatataat acataagtgg gtaatatata atatataaat ggagacaaac
3900ttcttccatt ataattgtta tgtcttctta acacttatgt ctcgttcaca atgctaaggt
3960tagaattgtt tagaaagtct tatagtacac atttgttttt gtactatttg aagcattcca
4020taagccgtca cgattcagat gatttataat aataagagga aatttatcat agaacaataa
4080ggtgcataga tagagtgtta atatatcata acatcctttg tttattcata gaagaagtga
4140gatggagctc agttattata ctgttacatg gtcggataca atattccatg ctctccatga
4200gctcttacac ctacatgcat tttagttcat acttgcggcc gctcaaaccc taaattcaca
4260cgaaaccgag gtggttatcg acgacgaaga tggcgaagta gtcgagaaca accccctctc
4320ctcatgatca ctcttcatca tctccatttt ccccacatct tctaacttct cacccccctc
4380cccgcacctc tcgtcttcca agctcgcctc gaatatcaac ccgatgtcgt cctcgaaccc
4440gatctcatca aagaacaggt caggcaattc gccagacttc tcgaggggga ggtcggggac
4500ctgcacgaac cccgtgtcta ggaatgacca atgcagatca tgatcctcct ccattgtcgt
4560cgggctgttc tccatcgacg ggaactggag gggcgagcct gagaacgcca tcgtcgtctg
4620ttgctgttgc tgctgtacta cttcgccttc ctcagaagag gaagcagggg agcagttggg
4680ctgtaggggc tgctctgctg ctggaaaaac tgatgattta ttctccagcc gcccaatgta
4740attgctgatg tcgaaattgg tcactgcatt gacccctctg tactcaattg cagccatgtc
4800atatgctgct gctgcttcct cttgtgtgtt gtaagttccc aggtaaagat atttgcttcc
4860gaaaacacgg ccgatccgcg cctcccaccg gccgttgtgg tggtggcggg ccacgccgcg
4920gtacttcgat acgcctctcg aaaatccgct gctccgccgc cgtaacgacg ccaaatactc
4980ctccttcgta actttccgca tttcttctat ttcattcctg tacgactcta ccgggaaatt
5040gagggtggtt ccgggacccc agtacttgag ggcagcgagg tcataagttc gggcggcagc
5100ttcctcgtta tcgtatgctc ccaaataaac ttgccttcct tttttgttct gaatgctatt
5160ccacgaactc ttgtcccaga gatgcgcttc gaaccggccg gtccatcggt gcctagtgac
5220tcctctgtaa atggagctcc ggcggccgga agaggcattg ttgttgctgc tgttgttgtt
5280attaccgtta cgattctcaa tctcgcattt ggcgtcgtcc tgattccgct tccggcgttt
5340cgctccggtc ttctccgatt gcggccggcg cggaggggcg gcgagattga gattattgct
5400gtcgaagccg ccgccgccga cgcaggagga ggaggaggag gaggaagtag aaacagaggg
5460agaccttctc atggcggccg cagtatatct taaattcttt aatacggtgt actaggatat
5520tgaactggtt cttgatgatg aaaacctggg ccgagattgc agctatttat agtcataggt
5580cttgttaaca tgcatggaca tttggccacg gggtggcatg cagtttgacg ggtgttgaaa
5640taaacaaaaa tgaggtggcg gaagagaata cgagtttgag gttgggttag aaacaacaaa
5700tgtgagggct catgatgggt tgagttggtg aatgttttgg gctgctcgat tgacaccttt
5760gtgagtacgt gttgttgtgc atggcttttg gggtccagtt tttttttctt gacgcggcga
5820tcctgatcag ctagtggata agtgatgtcc actgtgtgtg attgcgtttt tgtttgaatt
5880ttatgaactt agacattgct atgcaaagga tactctcatt gtgttttgtc ttcttttgtt
5940ccttggcttt ttcttatgat ccaagagact agtcagtgtt gtggcattcg agactaccaa
6000gattaattat gatgggggaa ggataagtaa ctgattagta cggactgtta ccaaattaat
6060taataagcgg caaatgaagg gcatggatca aaagcttggc gcgaattcac tggccgtcgt
6120tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca
6180tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca
6240gttgcgcagc ctgaatggcg aatggatcga tccgtcgatc gaccaaagcg gccatcgtgc
6300ctccccactc ctgcagttcg ggggcatgga tgcgcggata gccgctgctg gtttcctgga
6360tgccgacgga tttgcactgc cggtagaact ccgcgaggtc gtccagcctc aggcagcagc
6420tgaaccaact cgcgagggga tcgagcccct gctgagcctc gacatgttgt cgcaaaattc
6480gccctggacc cgcccaacga tttgtcgtca ctgtcaaggt ttgacctgca cttcatttgg
6540ggcccacata caccaaaaaa atgctgcata attctcgggg cagcaagtcg gttacccggc
6600cgccgtgctg gaccgggttg aatggtgccc gtaactttcg gtagagcgga cggccaatac
6660tcaacttcaa ggaatctcac ccatgcgcgc cggcggggaa ccggagttcc cttcagtgaa
6720cgttattagt tcgccgctcg gtgtgtcgta gatactagcc cctggggcct tttgaaattt
6780gaataagatt tatgtaatca gtcttttagg tttgaccggt tctgccgctt tttttaaaat
6840tggatttgta ataataaaac gcaattgttt gttattgtgg cgctctatca tagatgtcgc
6900tataaaccta ttcagcacaa tatattgttt tcattttaat attgtacata taagtagtag
6960ggtacaatca gtaaattgaa cggagaatat tattcataaa aatacgatag taacgggtga
7020tatattcatt agaatgaacc gaaaccggcg gtaaggatct gagctacaca tgctcaggtt
7080ttttacaacg tgcacaacag aattgaaagc aaatatcatg cgatcatagg cgtctcgcat
7140atctcattaa agcagggggt gggcgaagaa ctccagcatg agatccccgc gctggaggat
7200catccagccg gcgtcccgga aaacgattcc gaagcccaac ctttcataga aggcggcggt
7260ggaatcgaaa tctcgtgatg gcaggttggg cgtcgcttgg tcggtcattt cgaaccccag
7320agtcccgctc agaagaactc gtcaagaagg cgatagaagg cgatgcgctg cgaatcggga
7380gcggcgatac cgtaaagcac gaggaagcgg tcagcccatt cgccgccaag ctcttcagca
7440atatcacggg tagccaacgc tatgtcctga tagcggtccg ccacacccag ccggccacag
7500tcgatgaatc cagaaaagcg gccattttcc accatgatat tcggcaagca ggcatcgcca
7560tgggtcacga cgagatcctc gccgtcgggc atgcgcgcct tgagcctggc gaacagttcg
7620gctggcgcga gcccctgatg ctcttcgtcc agatcatcct gatcgacaag accggcttcc
7680atccgagtac gtgctcgctc gatgcgatgt ttcgcttggt ggtcgaatgg gcaggtagcc
7740ggatcaagcg tatgcagccg ccgcattgca tcagccatga tggatacttt ctcggcagga
7800gcaaggtgag atgacaggag atcctgcccc ggcacttcgc ccaatagcag ccagtccctt
7860cccgcttcag tgacaacgtc gagcacagct gcgcaaggaa cgcccgtcgt ggccagccac
7920gatagccgcg ctgcctcgtc ctgcagttca ttcagggcac cggacaggtc ggtcttgaca
7980aaaagaaccg ggcgcccctg cgctgacagc cggaacacgg cggcatcaga gcagccgatt
8040gtctgttgtg cccagtcata gccgaatagc ctctccaccc aagcggccgg agaacctgcg
8100tgcaatccat cttgttcaat catgcgaaac gatccccgca agcttggaga ctggtgattt
8160cagcgtgtcc tctccaaatg aaatgaactt ccttatatag aggaagggtc ttgcgaagga
8220tagtgggatt gtgcgtcatc ccttacgtca gtggagatat cacatcaatc cacttgcttt
8280gaagacgtgg ttggaacgtc ttctttttcc acgatgctcc tcgtgggtgg gggtccatct
8340ttgggaccac tgtcggcaga ggcatcttca acgatggcct ttcctttatc gcaatgatgg
8400catttgtagg agccaccttc cttttccact atcttcacaa taaagtgaca gatagctggg
8460caatggaatc cgaggaggtt tccggatatt accctttgtt gaaaagtctc aattgccctt
8520tggtcttctg agactgtatc tttgatattt ttggagtaga caagcgtgtc gtgctccacc
8580atgttgacga agattttctt cttgtcattg agtcgtaaga gactctgtat gaactgttcg
8640ccagtcttta cggcgagttc tgttaggtcc tctatttgaa tctttgactc catggccttt
8700gattcagtgg gaactacctt tttagagact ccaatctcta ttacttgcct tggtttgtga
8760agcaagcctt gaatcgtcca tactggaata gtacttctga tcttgagaaa tatatctttc
8820tctgtgttct tgatgcagtt agtcctgaat cttttgactg catctttaac cttcttggga
8880aggtatttga tctcctggag attattgctc gggtagatcg tcttgatgag acctgctgcg
8940taagcctctc taaccatctg tgggttagca ttctttctga aattgaaaag gctaatcttc
9000tcattatcag tggtgaacat ggtatcgtca ccttctccgt cgaacttcct gactagatcg
9060tagagataga ggaagtcgtc cattgtgatc tctggggcaa aggagatctg aattatcatt
9120tacaattgaa tatatcctgc ca
91421234DNAArtificial SequencePCR primer MWG341 12gaattcgcgg ccgcatgaag
aggtctccag catc 341334DNAArtificial
SequencePCR primer MWG342 13gaattcgcgg ccgctcatag atctagagca tagt
341411304DNAArtificial Sequencevector pKS334
14ggccgcatga agaggtctcc agcatcttct tgttcatcat ctacttcctc tgttgggttt
60gaagctccca ttgaaaaaag aaggcctaag catccaagga ggaataattt gaagtcacaa
120aaatgcaagc agaaccaaac caccactggt ggcagaagaa gctctatcta tagaggagtt
180acaaggcata ggtggacagg gaggtttgaa gctcacctat gggataagag ctcttggaac
240aacattcaga gcaagaaggg tcgacaagtt tatttggggg catatgatac tgaagaatct
300gcagcccgta cctatgacct tgcagccctt aaatactggg gaaaagatgc aaccctgaat
360ttcccgatag aaacttatac caaggagctc gaggaaatgg acaaggtttc aagagaagaa
420tatttggctt ctttgcggcg ccaaagcagt ggcttttcta gaggcctgtc taagtaccgt
480ggggttgcta ggcatcatca taatggtcgc tgggaagcac gaattggaag agtatgcgga
540aacaagtacc tctacttggg gacatataaa actcaagagg aggcagcagt ggcatatgac
600atggcagcaa tagagtaccg tggagtcaat gcagtgacca attttgacat aagcaactac
660atggacaaaa taaagaagaa aaatgaccaa acccaacaac aacaaacaga agcacaaacg
720gaaacagttc ctaactcctc tgactctgaa gaagtagaag tagaacaaca gacaacaaca
780ataaccacac cacccccatc tgaaaatctg cacatgccac cacagcagca ccaagttcaa
840tacacccccc atgtctctcc aagggaagaa gaatcatcat cactgatcac aattatggac
900catgtgcttg agcaggatct gccatggagc ttcatgtaca ctggcttgtc tcagtttcaa
960gatccaaact tggctttctg caaaggtgat gatgacttgg tgggcatgtt tgatagtgca
1020gggtttgagg aagacattga ttttctgttc agcactcaac ctggtgatga gactgagagt
1080gatgtcaaca atatgagcgc agttttggat agtgttgagt gtggagacac aaatggggct
1140ggtggaagca tgatgcatgt ggataacaag cagaagatag tatcatttgc ttcttcacca
1200tcatctacaa ctacagtttc ttgtgactat gctctagatc tagcggccgc aagtatgaac
1260taaaatgcat gtaggtgtaa gagctcatgg agagcatgga atattgtatc cgaccatgta
1320acagtataat aactgagctc catctcactt cttctatgaa taaacaaagg atgttatgat
1380atattaacac tctatctatg caccttattg ttctatgata aatttcctct tattattata
1440aatcatctga atcgtgacgg cttatggaat gcttcaaata gtacaaaaac aaatgtgtac
1500tataagactt tctaaacaat tctaacctta gcattgtgaa cgagacataa gtgttaagaa
1560gacataacaa ttataatgga agaagtttgt ctccatttat atattatata ttacccactt
1620atgtattata ttaggatgtt aaggagacat aacaattata aagagagaag tttgtatcca
1680tttatatatt atatactacc catttatata ttatacttat ccacttattt aatgtcttta
1740taaggtttga tccatgatat ttctaatatt ttagttgata tgtatatgaa agggtactat
1800ttgaactctc ttactctgta taaaggttgg atcatcctta aagtgggtct atttaatttt
1860attgcttctt acagataaaa aaaaaattat gagttggttt gataaaatat tgaaggattt
1920aaaataataa taaataacat ataatatatg tatataaatt tattataata taacatttat
1980ctataaaaaa gtaaatattg tcataaatct atacaatcgt ttagccttgc tggacgaatc
2040tcaattattt aaacgagagt aaacatattt gactttttgg ttatttaaca aattattatt
2100taacactata tgaaattttt ttttttatca gcaaagaata aaattaaatt aagaaggaca
2160atggtgtccc aatccttata caaccaactt ccacaagaaa gtcaagtcag agacaacaaa
2220aaaacaagca aaggaaattt tttaatttga gttgtcttgt ttgctgcata atttatgcag
2280taaaacacta cacataaccc ttttagcagt agagcaatgg ttgaccgtgt gcttagcttc
2340ttttatttta tttttttatc agcaaagaat aaataaaata aaatgagaca cttcagggat
2400gtttcaacaa gcttggatcc tcgaagagaa gggttaataa cacacttttt taacattttt
2460aacacaaatt ttagttattt aaaaatttat taaaaaattt aaaataagaa gaggaactct
2520ttaaataaat ctaacttaca aaatttatga tttttaataa gttttcacca ataaaaaatg
2580tcataaaaat atgttaaaaa gtatattatc aatattctct ttatgataaa taaaaagaaa
2640aaaaaaataa aagttaagtg aaaatgagat tgaagtgact ttaggtgtgt ataaatatat
2700caaccccgcc aacaatttat ttaatccaaa tatattgaag tatattattc catagccttt
2760atttatttat atatttatta tataaaagct ttatttgttc taggttgttc atgaaatatt
2820tttttggttt tatctccgtt gtaagaaaat catgtgcttt gtgtcgccac tcactattgc
2880agctttttca tgcattggtc agattgacgg ttgattgtat ttttgttttt tatggttttg
2940tgttatgact taagtcttca tctctttatc tcttcatcag gtttgatggt tacctaatat
3000ggtccatggg tacatgcatg gttaaattag gtggccaact ttgttgtgaa cgatagaatt
3060ttttttatat taagtaaact atttttatat tatgaaataa taataaaaaa aatattttat
3120cattattaac aaaatcatat tagttaattt gttaactcta taataaaaga aatactgtaa
3180cattcacatt acatggtaac atctttccac cctttcattt gttttttgtt tgatgacttt
3240ttttcttgtt taaatttatt tcccttcttt taaatttgga atacattatc atcatatata
3300aactaaaata ctaaaaacag gattacacaa atgataaata ataacacaaa tatttataaa
3360tctagctgca atatatttaa actagctata tcgatattgt aaaataaaac tagctgcatt
3420gatactgata aaaaaatatc atgtgctttc tggactgatg atgcagtata cttttgacat
3480tgcctttatt ttatttttca gaaaagcttt cttagttctg ggttcttcat tatttgtttc
3540ccatctccat tgtgaattga atcatttgct tcgtgtcaca aatacaattt agntaggtac
3600atgcattggt cagattcacg gtttattatg tcatgactta agttcatggt agtacattac
3660ctgccacgca tgcattatat tggttagatt tgataggcaa atttggttgt caacaatata
3720aatataaata atgtttttat attacgaaat aacagtgatc aaaacaaaca gttttatctt
3780tattaacaag attttgtttt tgtttgatga cgttttttaa tgtttacgct ttcccccttc
3840ttttgaattt agaacacttt atcatcataa aatcaaatac taaaaaaatt acatatttca
3900taaataataa cacaaatatt tttaaaaaat ctgaaataat aatgaacaat attacatatt
3960atcacgaaaa ttcattaata aaaatattat ataaataaaa tgtaatagta gttatatgta
4020ggaaaaaagt actgcacgca taatatatac aaaaagatta aaatgaacta ttataaataa
4080taacactaaa ttaatggtga atcatatcaa aataatgaaa aagtaaataa aatttgtaat
4140taacttctat atgtattaca cacacaaata ataaataata gtaaaaaaaa ttatgataaa
4200tatttaccat ctcataagat atttaaaata atgataaaaa tatagattat tttttatgca
4260actagctagc caaaaagaga acacgggtat atataaaaag agtaccttta aattctactg
4320tacttccttt attcctgacg tttttatatc aagtggacat acgtgaagat tttaattatc
4380agtctaaata tttcattagc acttaatact tttctgtttt attcctatcc tataagtagt
4440cccgattctc ccaacattgc ttattcacac aactaactaa gaaagtcttc catagccccc
4500caagcggccc atggcctcct ccgaggacgt catcaaggag ttcatgcgct tcaaggtgcg
4560catggagggc tccgtgaacg gccacgagtt cgagatcgag ggcgagggcg agggccgccc
4620ctacgagggc acccagaccg ccaagctgaa ggtgaccaag ggcggccccc tgcccttcgc
4680ctgggacatc ctgtcccccc agttccagta cggctccaag gtgtacgtga agcaccccgc
4740cgacatcccc gactacaaga agctgtcctt ccccgagggc ttcaagtggg agcgcgtgat
4800gaacttcgag gacggcggcg tggtgaccgt gacccaggac tcctccctgc aggacggctc
4860cttcatctac aaggtgaagt tcatcggcgt gaacttcccc tccgacggcc ccgtaatgca
4920gaagaagact atgggctggg aggcctccac cgagcgcctg tacccccgcg acggcgtgct
4980gaagggcgag atccacaagg ccctgaagct gaaggacggc ggccactacc tggtggagtt
5040caagtccatc tacatggcca agaagcccgt gcagctgccc ggctactact acgtggactc
5100caagctggac atcacctccc acaacgagga ctacaccatc gtggagcagt acgagcgcgc
5160cgagggccgc caccacctgt tcctgtagcg gccggccgcg acacaagtgt gagagtacta
5220aataaatgct ttggttgtac gaaatcatta cactaaataa aataatcaaa gcttatatat
5280gccttccgct aaggccgaat gcaaagaaat tggttctttc tcgttatctt ttgccacttt
5340tactagtacg tattaattac tacttaatca tctttgttta cggctcatta tatccgtcga
5400cggcgcgggc cgctctagaa ctagtggatc cgtcgacggc gcgcccgatc atccggatat
5460agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa ggggttatgc
5520tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt tgttagcagc
5580cggatcgatc caagctgtac ctcactattc ctttgccctc ggacgagtgc tggggcgtcg
5640gtttccacta tcggcgagta cttctacaca gccatcggtc cagacggccg cgcttctgcg
5700ggcgatttgt gtacgcccga cagtcccggc tccggatcgg acgattgcgt cgcatcgacc
5760ctgcgcccaa gctgcatcat cgaaattgcc gtcaaccaag ctctgataga gttggtcaag
5820accaatgcgg agcatatacg cccggagccg cggcgatcct gcaagctccg gatgcctccg
5880ctcgaagtag cgcgtctgct gctccataca agccaaccac ggcctccaga agaagatgtt
5940ggcgacctcg tattgggaat ccccgaacat cgcctcgctc cagtcaatga ccgctgttat
6000gcggccattg tccgtcagga cattgttgga gccgaaatcc gcgtgcacga ggtgccggac
6060ttcggggcag tcctcggccc aaagcatcag ctcatcgaga gcctgcgcga cggacgcact
6120gacggtgtcg tccatcacag tttgccagtg atacacatgg ggatcagcaa tcgcgcatat
6180gaaatcacgc catgtagtgt attgaccgat tccttgcggt ccgaatgggc cgaacccgct
6240cgtctggcta agatcggccg cagcgatcgc atccatagcc tccgcgaccg gctgcagaac
6300agcgggcagt tcggtttcag gcaggtcttg caacgtgaca ccctgtgcac ggcgggagat
6360gcaataggtc aggctctcgc tgaattcccc aatgtcaagc acttccggaa tcgggagcgc
6420ggccgatgca aagtgccgat aaacataacg atctttgtag aaaccatcgg cgcagctatt
6480tacccgcagg acatatccac gccctcctac atcgaagctg aaagcacgag attcttcgcc
6540ctccgagagc tgcatcaggt cggagacgct gtcgaacttt tcgatcagaa acttctcgac
6600agacgtcgcg gtgagttcag gcttttccat gggtatatct ccttcttaaa gttaaacaaa
6660attatttcta gagggaaacc gttgtggtct ccctatagtg agtcgtatta atttcgcggg
6720atcgagatcg atccaattcc aatcccacaa aaatctgagc ttaacagcac agttgctcct
6780ctcagagcag aatcgggtat tcaacaccct catatcaact actacgttgt gtataacggt
6840ccacatgccg gtatatacga tgactggggt tgtacaaagg cggcaacaaa cggcgttccc
6900ggagttgcac acaagaaatt tgccactatt acagaggcaa gagcagcagc tgacgcgtac
6960acaacaagtc agcaaacaga caggttgaac ttcatcccca aaggagaagc tcaactcaag
7020cccaagagct ttgctaaggc cctaacaagc ccaccaaagc aaaaagccca ctggctcacg
7080ctaggaacca aaaggcccag cagtgatcca gccccaaaag agatctcctt tgccccggag
7140attacaatgg acgatttcct ctatctttac gatctaggaa ggaagttcga aggtgaaggt
7200gacgacacta tgttcaccac tgataatgag aaggttagcc tcttcaattt cagaaagaat
7260gctgacccac agatggttag agaggcctac gcagcaggtc tcatcaagac gatctacccg
7320agtaacaatc tccaggagat caaatacctt cccaagaagg ttaaagatgc agtcaaaaga
7380ttcaggacta attgcatcaa gaacacagag aaagacatat ttctcaagat cagaagtact
7440attccagtat ggacgattca aggcttgctt cataaaccaa ggcaagtaat agagattgga
7500gtctctaaaa aggtagttcc tactgaatct aaggccatgc atggagtcta agattcaaat
7560cgaggatcta acagaactcg ccgtgaagac tggcgaacag ttcatacaga gtcttttacg
7620actcaatgac aagaagaaaa tcttcgtcaa catggtggag cacgacactc tggtctactc
7680caaaaatgtc aaagatacag tctcagaaga ccaaagggct attgagactt ttcaacaaag
7740gataatttcg ggaaacctcc tcggattcca ttgcccagct atctgtcact tcatcgaaag
7800gacagtagaa aaggaaggtg gctcctacaa atgccatcat tgcgataaag gaaaggctat
7860cattcaagat gcctctgccg acagtggtcc caaagatgga cccccaccca cgaggagcat
7920cgtggaaaaa gaagacgttc caaccacgtc ttcaaagcaa gtggattgat gtgacatctc
7980cactgacgta agggatgacg cacaatccca ctatccttcg caagaccctt cctctatata
8040aggaagttca tttcatttgg agaggacacg ctcgagctca tttctctatt acttcagcca
8100taacaaaaga actcttttct cttcttatta aaccatgaaa aagcctgaac tcaccgcgac
8160gtctgtcgag aagtttctga tcgaaaagtt cgacagcgtc tccgacctga tgcagctctc
8220ggagggcgaa gaatctcgtg ctttcagctt cgatgtagga gggcgtggat atgtcctgcg
8280ggtaaatagc tgcgccgatg gtttctacaa agatcgttat gtttatcggc actttgcatc
8340ggccgcgctc ccgattccgg aagtgcttga cattggggaa ttcagcgaga gcctgaccta
8400ttgcatctcc cgccgtgcac agggtgtcac gttgcaagac ctgcctgaaa ccgaactgcc
8460cgctgttctg cagccggtcg cggaggccat ggatgcgatc gctgcggccg atcttagcca
8520gacgagcggg ttcggcccat tcggaccgca aggaatcggt caatacacta catggcgtga
8580tttcatatgc gcgattgctg atccccatgt gtatcactgg caaactgtga tggacgacac
8640cgtcagtgcg tccgtcgcgc aggctctcga tgagctgatg ctttgggccg aggactgccc
8700cgaagtccgg cacctcgtgc acgcggattt cggctccaac aatgtcctga cggacaatgg
8760ccgcataaca gcggtcattg actggagcga ggcgatgttc ggggattccc aatacgaggt
8820cgccaacatc ttcttctgga ggccgtggtt ggcttgtatg gagcagcaga cgcgctactt
8880cgagcggagg catccggagc ttgcaggatc gccgcggctc cgggcgtata tgctccgcat
8940tggtcttgac caactctatc agagcttggt tgacggcaat ttcgatgatg cagcttgggc
9000gcagggtcga tgcgacgcaa tcgtccgatc cggagccggg actgtcgggc gtacacaaat
9060cgcccgcaga agcgcggccg tctggaccga tggctgtgta gaagtactcg ccgatagtgg
9120aaaccgacgc cccagcactc gtccgagggc aaaggaatag tgaggtacct aaagaaggag
9180tgcgtcgaag cagatcgttc aaacatttgg caataaagtt tcttaagatt gaatcctgtt
9240gccggtcttg cgatgattat catataattt ctgttgaatt acgttaagca tgtaataatt
9300aacatgtaat gcatgacgtt atttatgaga tgggttttta tgattagagt cccgcaatta
9360tacatttaat acgcgataga aaacaaaata tagcgcgcaa actaggataa attatcgcgc
9420gcggtgtcat ctatgttact agatcgatgt cgaatctgat caacctgcat taatgaatcg
9480gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg
9540actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa
9600tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc
9660aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc
9720ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat
9780aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc
9840cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcaatgct
9900cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg
9960aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc
10020cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga
10080ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa
10140ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta
10200gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc
10260agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg
10320acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgacatta acctataaaa
10380ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct
10440gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac
10500aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg
10560catcagagca gattgtactg agagtgcacc atatggacat attgtcgtta gaacgcggct
10620acaattaata cataacctta tgtatcatac acatacgatt taggtgacac tatagaacgg
10680cgcgccaagc ttttgatcca tgcccttcat ttgccgctta ttaattaatt tggtaacagt
10740ccgtactaat cagttactta tccttccccc atcataatta atcttggtag tctcgaatgc
10800cacaacactg actagtctct tggatcataa gaaaaagcca aggaacaaaa gaagacaaaa
10860cacaatgaga gtatcctttg catagcaatg tctaagttca taaaattcaa acaaaaacgc
10920aatcacacac agtggacatc acttatccac tagctgatca ggatcgccgc gtcaagaaaa
10980aaaaactgga ccccaaaagc catgcacaac aacacgtact cacaaaggtg tcaatcgagc
11040agcccaaaac attcaccaac tcaacccatc atgagccctc acatttgttg tttctaaccc
11100aacctcaaac tcgtattctc ttccgccacc tcatttttgt ttatttcaac acccgtcaaa
11160ctgcatgcca ccccgtggcc aaatgtccat gcatgttaac aagacctatg actataaata
11220gctgcaatct cggcccaggt tttcatcatc aagaaccagt tcaatatcct agtacaccgt
11280attaaagaat ttaagatata ctgc
11304159147DNAArtificial SequenceT-DNA of vector pZBL120xKS334
15aattacaacg gtatatatcc tgccgtcgac tctagaggat ccgcgccgtc gacggatcca
60ctagttctag agcggcccgc gccgtcgacg gatataatga gccgtaaaca aagatgatta
120agtagtaatt aatacgtact agtaaaagtg gcaaaagata acgagaaaga accaatttct
180ttgcattcgg ccttagcgga aggcatatat aagctttgat tattttattt agtgtaatga
240tttcgtacaa ccaaagcatt tatttagtac tctcacactt gtgtcgcggc cggccgctac
300aggaacaggt ggtggcggcc ctcggcgcgc tcgtactgct ccacgatggt gtagtcctcg
360ttgtgggagg tgatgtccag cttggagtcc acgtagtagt agccgggcag ctgcacgggc
420ttcttggcca tgtagatgga cttgaactcc accaggtagt ggccgccgtc cttcagcttc
480agggccttgt ggatctcgcc cttcagcacg ccgtcgcggg ggtacaggcg ctcggtggag
540gcctcccagc ccatagtctt cttctgcatt acggggccgt cggaggggaa gttcacgccg
600atgaacttca ccttgtagat gaaggagccg tcctgcaggg aggagtcctg ggtcacggtc
660accacgccgc cgtcctcgaa gttcatcacg cgctcccact tgaagccctc ggggaaggac
720agcttcttgt agtcggggat gtcggcgggg tgcttcacgt acaccttgga gccgtactgg
780aactgggggg acaggatgtc ccaggcgaag ggcagggggc cgcccttggt caccttcagc
840ttggcggtct gggtgccctc gtaggggcgg ccctcgccct cgccctcgat ctcgaactcg
900tggccgttca cggagccctc catgcgcacc ttgaagcgca tgaactcctt gatgacgtcc
960tcggaggagg ccatgggccg cttggggggc tatggaagac tttcttagtt agttgtgtga
1020ataagcaatg ttgggagaat cgggactact tataggatag gaataaaaca gaaaagtatt
1080aagtgctaat gaaatattta gactgataat taaaatcttc acgtatgtcc acttgatata
1140aaaacgtcag gaataaagga agtacagtag aatttaaagg tactcttttt atatataccc
1200gtgttctctt tttggctagc tagttgcata aaaaataatc tatattttta tcattatttt
1260aaatatctta tgagatggta aatatttatc ataatttttt ttactattat ttattatttg
1320tgtgtgtaat acatatagaa gttaattaca aattttattt actttttcat tattttgata
1380tgattcacca ttaatttagt gttattattt ataatagttc attttaatct ttttgtatat
1440attatgcgtg cagtactttt ttcctacata taactactat tacattttat ttatataata
1500tttttattaa tgaattttcg tgataatatg taatattgtt cattattatt tcagattttt
1560taaaaatatt tgtgttatta tttatgaaat atgtaatttt tttagtattt gattttatga
1620tgataaagtg ttctaaattc aaaagaaggg ggaaagcgta aacattaaaa aacgtcatca
1680aacaaaaaca aaatcttgtt aataaagata aaactgtttg ttttgatcac tgttatttcg
1740taatataaaa acattattta tatttatatt gttgacaacc aaatttgcct atcaaatcta
1800accaatataa tgcatgcgtg gcaggtaatg tactaccatg aacttaagtc atgacataat
1860aaaccgtgaa tctgaccaat gcatgtacct anctaaattg tatttgtgac acgaagcaaa
1920tgattcaatt cacaatggag atgggaaaca aataatgaag aacccagaac taagaaagct
1980tttctgaaaa ataaaataaa ggcaatgtca aaagtatact gcatcatcag tccagaaagc
2040acatgatatt tttttatcag tatcaatgca gctagtttta ttttacaata tcgatatagc
2100tagtttaaat atattgcagc tagatttata aatatttgtg ttattattta tcatttgtgt
2160aatcctgttt ttagtatttt agtttatata tgatgataat gtattccaaa tttaaaagaa
2220gggaaataaa tttaaacaag aaaaaaagtc atcaaacaaa aaacaaatga aagggtggaa
2280agatgttacc atgtaatgtg aatgttacag tatttctttt attatagagt taacaaatta
2340actaatatga ttttgttaat aatgataaaa tatttttttt attattattt cataatataa
2400aaatagttta cttaatataa aaaaaattct atcgttcaca acaaagttgg ccacctaatt
2460taaccatgca tgtacccatg gaccatatta ggtaaccatc aaacctgatg aagagataaa
2520gagatgaaga cttaagtcat aacacaaaac cataaaaaac aaaaatacaa tcaaccgtca
2580atctgaccaa tgcatgaaaa agctgcaata gtgagtggcg acacaaagca catgattttc
2640ttacaacgga gataaaacca aaaaaatatt tcatgaacaa cctagaacaa ataaagcttt
2700tatataataa atatataaat aaataaaggc tatggaataa tatacttcaa tatatttgga
2760ttaaataaat tgttggcggg gttgatatat ttatacacac ctaaagtcac ttcaatctca
2820ttttcactta acttttattt tttttttctt tttatttatc ataaagagaa tattgataat
2880atacttttta acatattttt atgacatttt ttattggtga aaacttatta aaaatcataa
2940attttgtaag ttagatttat ttaaagagtt cctcttctta ttttaaattt tttaataaat
3000ttttaaataa ctaaaatttg tgttaaaaat gttaaaaaag tgtgttatta acccttctct
3060tcgaggatcc aagcttgttg aaacatccct gaagtgtctc attttatttt atttattctt
3120tgctgataaa aaaataaaat aaaagaagct aagcacacgg tcaaccattg ctctactgct
3180aaaagggtta tgtgtagtgt tttactgcat aaattatgca gcaaacaaga caactcaaat
3240taaaaaattt cctttgcttg tttttttgtt gtctctgact tgactttctt gtggaagttg
3300gttgtataag gattgggaca ccattgtcct tcttaattta attttattct ttgctgataa
3360aaaaaaaaat ttcatatagt gttaaataat aatttgttaa ataaccaaaa agtcaaatat
3420gtttactctc gtttaaataa ttgagattcg tccagcaagg ctaaacgatt gtatagattt
3480atgacaatat ttactttttt atagataaat gttatattat aataaattta tatacatata
3540ttatatgtta tttattatta ttttaaatcc ttcaatattt tatcaaacca actcataatt
3600ttttttttat ctgtaagaag caataaaatt aaatagaccc actttaagga tgatccaacc
3660tttatacaga gtaagagagt tcaaatagta ccctttcata tacatatcaa ctaaaatatt
3720agaaatatca tggatcaaac cttataaaga cattaaataa gtggataagt ataatatata
3780aatgggtagt atataatata taaatggata caaacttctc tctttataat tgttatgtct
3840ccttaacatc ctaatataat acataagtgg gtaatatata atatataaat ggagacaaac
3900ttcttccatt ataattgtta tgtcttctta acacttatgt ctcgttcaca atgctaaggt
3960tagaattgtt tagaaagtct tatagtacac atttgttttt gtactatttg aagcattcca
4020taagccgtca cgattcagat gatttataat aataagagga aatttatcat agaacaataa
4080ggtgcataga tagagtgtta atatatcata acatcctttg tttattcata gaagaagtga
4140gatggagctc agttattata ctgttacatg gtcggataca atattccatg ctctccatga
4200gctcttacac ctacatgcat tttagttcat acttgcggcc gctagatcta gagcatagtc
4260acaagaaact gtagttgtag atgatggtga agaagcaaat gatactatct tctgcttgtt
4320atccacatgc atcatgcttc caccagcccc atttgtgtct ccacactcaa cactatccaa
4380aactgcgctc atattgttga catcactctc agtctcatca ccaggttgag tgctgaacag
4440aaaatcaatg tcttcctcaa accctgcact atcaaacatg cccaccaagt catcatcacc
4500tttgcagaaa gccaagtttg gatcttgaaa ctgagacaag ccagtgtaca tgaagctcca
4560tggcagatcc tgctcaagca catggtccat aattgtgatc agtgatgatg attcttcttc
4620ccttggagag acatgggggg tgtattgaac ttggtgctgc tgtggtggca tgtgcagatt
4680ttcagatggg ggtggtgtgg ttattgttgt tgtctgttgt tctacttcta cttcttcaga
4740gtcagaggag ttaggaactg tttccgtttg tgcttctgtt tgttgttgtt gggtttggtc
4800atttttcttc tttattttgt ccatgtagtt gcttatgtca aaattggtca ctgcattgac
4860tccacggtac tctattgctg ccatgtcata tgccactgct gcctcctctt gagttttata
4920tgtccccaag tagaggtact tgtttccgca tactcttcca attcgtgctt cccagcgacc
4980attatgatga tgcctagcaa ccccacggta cttagacagg cctctagaaa agccactgct
5040ttggcgccgc aaagaagcca aatattcttc tcttgaaacc ttgtccattt cctcgagctc
5100cttggtataa gtttctatcg ggaaattcag ggttgcatct tttccccagt atttaagggc
5160tgcaaggtca taggtacggg ctgcagattc ttcagtatca tatgccccca aataaacttg
5220tcgacccttc ttgctctgaa tgttgttcca agagctctta tcccataggt gagcttcaaa
5280cctccctgtc cacctatgcc ttgtaactcc tctatagata gagcttcttc tgccaccagt
5340ggtggtttgg ttctgcttgc atttttgtga cttcaaatta ttcctccttg gatgcttagg
5400ccttcttttt tcaatgggag cttcaaaccc aacagaggaa gtagatgatg aacaagaaga
5460tgctggagac ctcttcatgc ggccgcagta tatcttaaat tctttaatac ggtgtactag
5520gatattgaac tggttcttga tgatgaaaac ctgggccgag attgcagcta tttatagtca
5580taggtcttgt taacatgcat ggacatttgg ccacggggtg gcatgcagtt tgacgggtgt
5640tgaaataaac aaaaatgagg tggcggaaga gaatacgagt ttgaggttgg gttagaaaca
5700acaaatgtga gggctcatga tgggttgagt tggtgaatgt tttgggctgc tcgattgaca
5760cctttgtgag tacgtgttgt tgtgcatggc ttttggggtc cagttttttt ttcttgacgc
5820ggcgatcctg atcagctagt ggataagtga tgtccactgt gtgtgattgc gtttttgttt
5880gaattttatg aacttagaca ttgctatgca aaggatactc tcattgtgtt ttgtcttctt
5940ttgttccttg gctttttctt atgatccaag agactagtca gtgttgtggc attcgagact
6000accaagatta attatgatgg gggaaggata agtaactgat tagtacggac tgttaccaaa
6060ttaattaata agcggcaaat gaagggcatg gatcaaaagc ttggcgcgaa ttcactggcc
6120gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca
6180gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc
6240caacagttgc gcagcctgaa tggcgaatgg atcgatccgt cgatcgacca aagcggccat
6300cgtgcctccc cactcctgca gttcgggggc atggatgcgc ggatagccgc tgctggtttc
6360ctggatgccg acggatttgc actgccggta gaactccgcg aggtcgtcca gcctcaggca
6420gcagctgaac caactcgcga ggggatcgag cccctgctga gcctcgacat gttgtcgcaa
6480aattcgccct ggacccgccc aacgatttgt cgtcactgtc aaggtttgac ctgcacttca
6540tttggggccc acatacacca aaaaaatgct gcataattct cggggcagca agtcggttac
6600ccggccgccg tgctggaccg ggttgaatgg tgcccgtaac tttcggtaga gcggacggcc
6660aatactcaac ttcaaggaat ctcacccatg cgcgccggcg gggaaccgga gttcccttca
6720gtgaacgtta ttagttcgcc gctcggtgtg tcgtagatac tagcccctgg ggccttttga
6780aatttgaata agatttatgt aatcagtctt ttaggtttga ccggttctgc cgcttttttt
6840aaaattggat ttgtaataat aaaacgcaat tgtttgttat tgtggcgctc tatcatagat
6900gtcgctataa acctattcag cacaatatat tgttttcatt ttaatattgt acatataagt
6960agtagggtac aatcagtaaa ttgaacggag aatattattc ataaaaatac gatagtaacg
7020ggtgatatat tcattagaat gaaccgaaac cggcggtaag gatctgagct acacatgctc
7080aggtttttta caacgtgcac aacagaattg aaagcaaata tcatgcgatc ataggcgtct
7140cgcatatctc attaaagcag ggggtgggcg aagaactcca gcatgagatc cccgcgctgg
7200aggatcatcc agccggcgtc ccggaaaacg attccgaagc ccaacctttc atagaaggcg
7260gcggtggaat cgaaatctcg tgatggcagg ttgggcgtcg cttggtcggt catttcgaac
7320cccagagtcc cgctcagaag aactcgtcaa gaaggcgata gaaggcgatg cgctgcgaat
7380cgggagcggc gataccgtaa agcacgagga agcggtcagc ccattcgccg ccaagctctt
7440cagcaatatc acgggtagcc aacgctatgt cctgatagcg gtccgccaca cccagccggc
7500cacagtcgat gaatccagaa aagcggccat tttccaccat gatattcggc aagcaggcat
7560cgccatgggt cacgacgaga tcctcgccgt cgggcatgcg cgccttgagc ctggcgaaca
7620gttcggctgg cgcgagcccc tgatgctctt cgtccagatc atcctgatcg acaagaccgg
7680cttccatccg agtacgtgct cgctcgatgc gatgtttcgc ttggtggtcg aatgggcagg
7740tagccggatc aagcgtatgc agccgccgca ttgcatcagc catgatggat actttctcgg
7800caggagcaag gtgagatgac aggagatcct gccccggcac ttcgcccaat agcagccagt
7860cccttcccgc ttcagtgaca acgtcgagca cagctgcgca aggaacgccc gtcgtggcca
7920gccacgatag ccgcgctgcc tcgtcctgca gttcattcag ggcaccggac aggtcggtct
7980tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa cacggcggca tcagagcagc
8040cgattgtctg ttgtgcccag tcatagccga atagcctctc cacccaagcg gccggagaac
8100ctgcgtgcaa tccatcttgt tcaatcatgc gaaacgatcc ccgcaagctt ggagactggt
8160gatttcagcg tgtcctctcc aaatgaaatg aacttcctta tatagaggaa gggtcttgcg
8220aaggatagtg ggattgtgcg tcatccctta cgtcagtgga gatatcacat caatccactt
8280gctttgaaga cgtggttgga acgtcttctt tttccacgat gctcctcgtg ggtgggggtc
8340catctttggg accactgtcg gcagaggcat cttcaacgat ggcctttcct ttatcgcaat
8400gatggcattt gtaggagcca ccttcctttt ccactatctt cacaataaag tgacagatag
8460ctgggcaatg gaatccgagg aggtttccgg atattaccct ttgttgaaaa gtctcaattg
8520ccctttggtc ttctgagact gtatctttga tatttttgga gtagacaagc gtgtcgtgct
8580ccaccatgtt gacgaagatt ttcttcttgt cattgagtcg taagagactc tgtatgaact
8640gttcgccagt ctttacggcg agttctgtta ggtcctctat ttgaatcttt gactccatgg
8700cctttgattc agtgggaact acctttttag agactccaat ctctattact tgccttggtt
8760tgtgaagcaa gccttgaatc gtccatactg gaatagtact tctgatcttg agaaatatat
8820ctttctctgt gttcttgatg cagttagtcc tgaatctttt gactgcatct ttaaccttct
8880tgggaaggta tttgatctcc tggagattat tgctcgggta gatcgtcttg atgagacctg
8940ctgcgtaagc ctctctaacc atctgtgggt tagcattctt tctgaaattg aaaaggctaa
9000tcttctcatt atcagtggtg aacatggtat cgtcaccttc tccgtcgaac ttcctgacta
9060gatcgtagag atagaggaag tcgtccattg tgatctctgg ggcaaaggag atctgaatta
9120tcatttacaa ttgaatatat cctgcca
9147163983DNAArtificial Sequencevector pKR132 16ctagagtcga cctgcaggca
tgcaagcttg gcgtaatcat ggtcatagct gtttcctgtg 60tgaaattgtt atccgctcac
aattccacac aacatacgag ccggaagcat aaagtgtaaa 120gcctggggtg cctaatgagt
gagctaactc acattaattg cgttgcgctc actgcccgct 180ttccagtcgg gaaacctgtc
gtgccagctg cattaatgaa tcggccaacg cgcggggaga 240ggcggtttgc gtattgggcg
ctcttccgct tcctcgctca ctgactcgct gcgctcggtc 300gttcggctgc ggcgagcggt
atcagctcac tcaaaggcgg taatacggtt atccacagaa 360tcaggggata acgcaggaaa
gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt 420aaaaaggccg cgttgctggc
gtttttccat aggctccgcc cccctgacga gcatcacaaa 480aatcgacgct caagtcagag
gtggcgaaac ccgacaggac tataaagata ccaggcgttt 540ccccctggaa gctccctcgt
gcgctctcct gttccgaccc tgccgcttac cggatacctg 600tccgcctttc tcccttcggg
aagcgtggcg ctttctcata gctcacgctg taggtatctc 660agttcggtgt aggtcgttcg
ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc 720gaccgctgcg ccttatccgg
taactatcgt cttgagtcca acccggtaag acacgactta 780tcgccactgg cagcagccac
tggtaacagg attagcagag cgaggtatgt aggcggtgct 840acagagttct tgaagtggtg
gcctaactac ggctacacta gaaggacagt atttggtatc 900tgcgctctgc tgaagccagt
taccttcgga aaaagagttg gtagctcttg atccggcaaa 960caaaccaccg ctggtagcgg
tggttttttt gtttgcaagc agcagattac gcgcagaaaa 1020aaaggatctc aagaagatcc
tttgatcttt tctacggggt ctgacgctca gtggaacgaa 1080aactcacgtt aagggatttt
ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 1140ttaaattaaa aatgaagttt
taaatcaatc taaagtatat atgagtaaac ttggtctgac 1200agttaccaat gcttaatcag
tgaggcacct atctcagcga tctgtctatt tcgttcatcc 1260atagttgcct gactccccgt
cgtgtagata actacgatac gggagggctt accatctggc 1320cccagtgctg caatgatacc
gcgagaccca cgctcaccgg ctccagattt atcagcaata 1380aaccagccag ccggaagggc
cgagcgcaga agtggtcctg caactttatc cgcctccatc 1440cagtctatta attgttgccg
ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc 1500aacgttgttg ccattgctac
aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca 1560ttcagctccg gttcccaacg
atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa 1620gcggttagct ccttcggtcc
tccgatcgtt gtcagaagta agttggccgc agtgttatca 1680ctcatggtta tggcagcact
gcataattct cttactgtca tgccatccgt aagatgcttt 1740tctgtgactg gtgagtactc
aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt 1800tgctcttgcc cggcgtcaat
acgggataat accgcgccac atagcagaac tttaaaagtg 1860ctcatcattg gaaaacgttc
ttcggggcga aaactctcaa ggatcttacc gctgttgaga 1920tccagttcga tgtaacccac
tcgtgcaccc aactgatctt cagcatcttt tactttcacc 1980agcgtttctg ggtgagcaaa
aacaggaagg caaaatgccg caaaaaaggg aataagggcg 2040acacggaaat gttgaatact
catactcttc ctttttcaat attattgaag catttatcag 2100ggttattgtc tcatgagcgg
atacatattt gaatgtattt agaaaaataa acaaataggg 2160gttccgcgca catttccccg
aaaagtgcca cctgacgtct aagaaaccat tattatcatg 2220acattaacct ataaaaatag
gcgtatcacg aggccctttc gtctcgcgcg tttcggtgat 2280gacggtgaaa acctctgaca
catgcagctc ccggagacgg tcacagcttg tctgtaagcg 2340gatgccggga gcagacaagc
ccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc 2400tggcttaact atgcggcatc
agagcagatt gtactgagag tgcaccatat gcggtgtgaa 2460ataccgcaca gatgcgtaag
gagaaaatac cgcatcaggc gccattcgcc attcaggctg 2520cgcaactgtt gggaagggcg
atcggtgcgg gcctcttcgc tattacgcca gctggcgaaa 2580gggggatgtg ctgcaaggcg
attaagttgg gtaacgccag ggttttccca gtcacgacgt 2640tgtaaaacga cggccagtga
attcgagctc ggtacccggg gatcctctag acctgcaggc 2700caactgcgtt tggggctcca
gattaaacga cgccgtttcg ttcctttcgc ttcacggctt 2760aacgatgtcg tttctgtctg
tgcccaaaaa ataaaggcat ttgttatttg caccagatat 2820ttactaagtg caccctagtt
tgacaagtag gcgataatta caaatagatg cggtgcaaat 2880aataaatttt gaaggaaata
attacaaaag aacagaactt atatttactt tattttaaaa 2940aactaaaatg aaagaacaaa
aaaagtaaaa aatacaaaaa atgtgcttta accactttca 3000ttatttgtta cagaaagtat
gattctactc aaattgatct gttgtatctg gtgctgcctt 3060gtcacactgg cgatttcaat
cccctaaaga tatggtgcaa actgcgaagt gatcaatatc 3120tgctcggtta atttagatta
attaataata ttcaacgtga tgtaccaaaa aaagacaatt 3180ttttgctcca ttgacaaatt
aaacctcatc aaggtaattt ccaaacctat aagcaaaaaa 3240atttcacatt aattggcccg
caatcctatt agtcttatta tactagagta ggaaaaaaaa 3300caattacaca acttgtctta
ttattctcta tgctaatgaa tatttttccc ttttgttaga 3360aatcagtgtt tcctaattta
ttgagtatta attccactca ccgcatatat ttaccgttga 3420ataagaaaat tttacacata
attcttttta agataaataa tttttttata ctagatctta 3480tatgattacg tgaagccaag
tgggttatac taatgatata taatgtttga tagtaatcag 3540tttataaacc aaatgcatgg
aaatgttacg tggaagcacg taaattaaca agcattgaag 3600caaatgcagc caccgcacca
aaaccacccc acttcacttc cacgtaccat attccatgca 3660actacaacac cctaaaactt
caataaatgc ccccaccttc acttcacttc acccatcaat 3720agcaagcggc cgcgaagtta
aaagcaatgt tgtcacttgt cgtactaaca catgatgtga 3780tagtttatgc tagctagcta
taacataagc tgtctctgag tgtgttgtat attaataaag 3840atcatcactg gtgaatggtg
atcgtgtacg taccctactt agtaggcaat ggaagcactt 3900agagtgtgct ttgtgcatgg
ccttgcctct gttttgagac ttttgtaatg ttttcgagtt 3960taaatctttg cctttgcgta
cgt 3983174746DNAArtificial
Sequencevector pKR627 17gatcctctag acctgcaggc caactgcgtt tggggctcca
gattaaacga cgccgtttcg 60ttcctttcgc ttcacggctt aacgatgtcg tttctgtctg
tgcccaaaaa ataaaggcat 120ttgttatttg caccagatat ttactaagtg caccctagtt
tgacaagtag gcgataatta 180caaatagatg cggtgcaaat aataaatttt gaaggaaata
attacaaaag aacagaactt 240atatttactt tattttaaaa aactaaaatg aaagaacaaa
aaaagtaaaa aatacaaaaa 300atgtgcttta accactttca ttatttgtta cagaaagtat
gattctactc aaattgatct 360gttgtatctg gtgctgcctt gtcacactgg cgatttcaat
cccctaaaga tatggtgcaa 420actgcgaagt gatcaatatc tgctcggtta atttagatta
attaataata ttcaacgtga 480tgtaccaaaa aaagacaatt ttttgctcca ttgacaaatt
aaacctcatc aaggtaattt 540ccaaacctat aagcaaaaaa atttcacatt aattggcccg
caatcctatt agtcttatta 600tactagagta ggaaaaaaaa caattacaca acttgtctta
ttattctcta tgctaatgaa 660tatttttccc ttttgttaga aatcagtgtt tcctaattta
ttgagtatta attccactca 720ccgcatatat ttaccgttga ataagaaaat tttacacata
attcttttta agataaataa 780tttttttata ctagatctta tatgattacg tgaagccaag
tgggttatac taatgatata 840taatgtttga tagtaatcag tttataaacc aaatgcatgg
aaatgttacg tggaagcacg 900taaattaaca agcattgaag caaatgcagc caccgcacca
aaaccacccc acttcacttc 960cacgtaccat attccatgca actacaacac cctaaaactt
caataaatgc ccccaccttc 1020acttcacttc acccatcaat agcaagcggc cgcgaagtta
aaagcaatgt tgtcacttgt 1080cgtactaaca catgatgtga tagtttatgc tagctagcta
taacataagc tgtctctgag 1140tgtgttgtat attaataaag atcatcactg gtgaatggtg
atcgtgtacg taccctactt 1200agtaggcaat ggaagcactt agagtgtgct ttgtgcatgg
ccttgcctct gttttgagac 1260ttttgtaatg ttttcgagtt taaatctttg cctttgcgta
cgtctagagt cgagcatgca 1320tctagagggc ccaattcgcc ctatagtgag tcgtattaca
attcactggc cgtcgtttta 1380caacgtcgtg actgggaaaa ccctggcgtt acccaactta
atcgccttgc agcacatccc 1440cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg
atcgcccttc ccaacagttg 1500cgcagcctat acgtacggca gtttaaggtt tacacctata
aaagagagag ccgttatcgt 1560ctgtttgtgg atgtacagag tgatattatt gacacgccgg
ggcgacggat ggtgatcccc 1620ctggccagtg cacgtctgct gtcagataaa gtctcccgtg
aactttaccc ggtggtgcat 1680atcggggatg aaagctggcg catgatgacc accgatatgg
ccagtgtgcc ggtctccgtt 1740atcggggaag aagtggctga tctcagccac cgcgaaaatg
acatcaaaaa cgccattaac 1800ctgatgttct ggggaatata aatgtcaggc atgagattat
caaaaaggat cttcacctag 1860atccttttca cgtagaaagc cagtccgcag aaacggtgct
gaccccggat gaatgtcagc 1920tactgggcta tctggacaag ggaaaacgca agcgcaaaga
gaaagcaggt agcttgcagt 1980gggcttacat ggcgatagct agactgggcg gttttatgga
cagcaagcga accggaattg 2040ccagctgggg cgccctctgg taaggttggg aagccctgca
aagtaaactg gatggctttc 2100tcgccgccaa ggatctgatg gcgcagggga tcaagctctg
atcaagagac aggatgagga 2160tcgtttcgca tgattgaaca agatggattg cacgcaggtt
ctccggccgc ttgggtggag 2220aggctattcg gctatgactg ggcacaacag acaatcggct
gctctgatgc cgccgtgttc 2280cggctgtcag cgcaggggcg cccggttctt tttgtcaaga
ccgacctgtc cggtgccctg 2340aatgaactgc aagacgaggc agcgcggcta tcgtggctgg
ccacgacggg cgttccttgc 2400gcagctgtgc tcgacgttgt cactgaagcg ggaagggact
ggctgctatt gggcgaagtg 2460ccggggcagg atctcctgtc atctcacctt gctcctgccg
agaaagtatc catcatggct 2520gatgcaatgc ggcggctgca tacgcttgat ccggctacct
gcccattcga ccaccaagcg 2580aaacatcgca tcgagcgagc acgtactcgg atggaagccg
gtcttgtcga tcaggatgat 2640ctggacgaag agcatcaggg gctcgcgcca gccgaactgt
tcgccaggct caaggcgagc 2700atgcccgacg gcgaggatct cgtcgtgacc catggcgatg
cctgcttgcc gaatatcatg 2760gtggaaaatg gccgcttttc tggattcatc gactgtggcc
ggctgggtgt ggcggaccgc 2820tatcaggaca tagcgttggc tacccgtgat attgctgaag
agcttggcgg cgaatgggct 2880gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt
cgcagcgcat cgccttctat 2940cgccttcttg acgagttctt ctgaattatt aacgcttaca
atttcctgat gcggtatttt 3000ctccttacgc atctgtgcgg tatttcacac cgcatacagg
tggcactttt cggggaaatg 3060tgcgcggaac ccctatttgt ttatttttct aaatacattc
aaatatgtat ccgctcatga 3120gacaataacc ctgataaatg cttcaataat agcacgtgag
gagggccacc atggccaagt 3180tgaccagtgc cgttccggtg ctcaccgcgc gcgacgtcgc
cggagcggtc gagttctgga 3240ccgaccggct cgggttctcc cgggacttcg tggaggacga
cttcgccggt gtggtccggg 3300acgacgtgac cctgttcatc agcgcggtcc aggaccaggt
ggtgccggac aacaccctgg 3360cctgggtgtg ggtgcgcggc ctggacgagc tgtacgccga
gtggtcggag gtcgtgtcca 3420cgaacttccg ggacgcctcc gggccggcca tgaccgagat
cggcgagcag ccgtgggggc 3480gggagttcgc cctgcgcgac ccggccggca actgcgtgca
cttcgtggcc gaggagcagg 3540actgacacgt gctaaaactt catttttaat ttaaaaggat
ctaggtgaag atcctttttg 3600ataatctcat gaccaaaatc ccttaacgtg agttttcgtt
ccactgagcg tcagaccccg 3660tagaaaagat caaaggatct tcttgagatc ctttttttct
gcgcgtaatc tgctgcttgc 3720aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc
ggatcaagag ctaccaactc 3780tttttccgaa ggtaactggc ttcagcagag cgcagatacc
aaatactgtc cttctagtgt 3840agccgtagtt aggccaccac ttcaagaact ctgtagcacc
gcctacatac ctcgctctgc 3900taatcctgtt accagtggct gctgccagtg gcgataagtc
gtgtcttacc gggttggact 3960caagacgata gttaccggat aaggcgcagc ggtcgggctg
aacggggggt tcgtgcacac 4020agcccagctt ggagcgaacg acctacaccg aactgagata
cctacagcgt gagctatgag 4080aaagcgccac gcttcccgaa gggagaaagg cggacaggta
tccggtaagc ggcagggtcg 4140gaacaggaga gcgcacgagg gagcttccag ggggaaacgc
ctggtatctt tatagtcctg 4200tcgggtttcg ccacctctga cttgagcgtc gatttttgtg
atgctcgtca ggggggcgga 4260gcctatggaa aaacgccagc aacgcggcct ttttacggtt
cctgggcttt tgctggcctt 4320ttgctcacat gttctttcct gcgttatccc ctgattctgt
ggataaccgt attaccgcct 4380ttgagtgagc tgataccgct cgccgcagcc gaacgaccga
gcgcagcgag tcagtgagcg 4440aggaagcgga agagcgccca atacgcaaac cgcctctccc
cgcgcgttgg ccgattcatt 4500aatgcagctg gcacgacagg tttcccgact ggaaagcggg
cagtgagcgc aacgcaatta 4560atgtgagtta gctcactcat taggcacccc aggctttaca
ctttatgctt ccggctcgta 4620tgttgtgtgg aattgtgagc ggataacaat ttcacacagg
aaacagctat gaccatgatt 4680acgccaagct atttaggtga cgcgttagaa tactcaagct
atgcatcaag cttggtaccg 4740agctcg
4746184330DNAArtificial Sequencevector KS294
18agcttggaat tcgggatctg agtctagaaa tccgtcaaca tggtggagca cgacactctc
60gtctactcca agaatatcaa agatacagtc tcagaagacc aaagggctat tgagactttt
120caacaaaggg taatatcggg aaacctcctc ggattccatt gcccagctat ctgtcacttc
180atcaaaagga cagtagaaaa ggaaggtggc acctacaaat gccatcattg cgataaagga
240aaggctatcg ttcaagatgc ctctgccgac agtggtccca aagatggacc cccacccacg
300aggagcatcg tggaaaaaga agacgttcca accacgtctt caaagcaagt ggattgatgt
360gatgatccta tgcgtatggt atgacgtgtg ttcaagatga tgacttcaaa cctacctatg
420acgtatggta tgaacgtgtg tcgactgatg acttagatcc actcgagcgg ctataaatac
480gtacctacgc accctgcgct accatcccta gagctgcagc ttatttttac aacaattacc
540aacaacaaca aacaacaaac aacattacaa ttactattta caattacagt cgacccggga
600tcgtacctct agggtggcgg ccgcaagtat gaactaaaat gcatgtaggt gtaagagctc
660atggagagca tggaatattg tatccgacca tgtaacagta taataactga gctccatctc
720acttcttcta tgaataaaca aaggatgtta tgatatatta acactctatc tatgcacctt
780attgttctat gataaatttc ctcttattat tataaatcat ctgaatcgtg acggcttatg
840gaatgcttca aatagtacaa aaacaaatgt gtactataag actttctaaa caattctaac
900cttagcattg tgaacgagac ataagtgtta agaagacata acaattataa tggaagaagt
960ttgtctccat ttatatatta tatattaccc acttatgtat tatattagga tgttaaggag
1020acataacaat tataaagaga gaagtttgta tccatttata tattatatac tacccattta
1080tatattatac ttatccactt atttaatgtc tttataaggt ttgatccatg atatttctaa
1140tattttagtt gatatgtata tgaaagggta ctatttgaac tctcttactc tgtataaagg
1200ttggatcatc cttaaagtgg gtctatttaa ttttattgct tcttacagat aaaaaaaaaa
1260ttatgagttg gtttgataaa atattgaagg atttaaaata ataataaata acatataata
1320tatgtatata aatttattat aatataacat ttatctataa aaaagtaaat attgtcataa
1380atctatacaa tcgtttagcc ttgctggacg aatctcaatt atttaaacga gagtaaacat
1440atttgacttt ttggttattt aacaaattat tatttaacac tatatgaaat tttttttttt
1500atcagcaaag aataaaatta aattaagaag gacaatggtg tcccaatcct tatacaacca
1560acttccacaa gaaagtcaag tcagagacaa caaaaaaaca agcaaaggaa attttttaat
1620ttgagttgtc ttgtttgctg cataatttat gcagtaaaac actacacata acccttttag
1680cagtagagca atggttgacc gtgtgcttag cttcttttat tttatttttt tatcagcaaa
1740gaataaataa aataaaatga gacacttcag ggatgtttca acaagctcta gactggaatt
1800cgtcgacggc gcgcccgatc atccggatat agttcctcct ttcagcaaaa aacccctcaa
1860gacccgttta gaggccccaa ggggttatgc tagttattgc tcagcggtgg cagcagccaa
1920ctcagcttcc tttcgggctt tgttagcagc cggatcgatc caagctgtac ctcactattc
1980ctttgccctc ggacgagtgc tggggcgtcg gtttccacta tcggcgagta cttctacaca
2040gccatcggtc cagacggccg cgcttctgcg ggcgatttgt gtacgcccga cagtcccggc
2100tccggatcgg acgattgcgt cgcatcgacc ctgcgcccaa gctgcatcat cgaaattgcc
2160gtcaaccaag ctctgataga gttggtcaag accaatgcgg agcatatacg cccggagccg
2220cggcgatcct gcaagctccg gatgcctccg ctcgaagtag cgcgtctgct gctccataca
2280agccaaccac ggcctccaga agaagatgtt ggcgacctcg tattgggaat ccccgaacat
2340cgcctcgctc cagtcaatga ccgctgttat gcggccattg tccgtcagga cattgttgga
2400gccgaaatcc gcgtgcacga ggtgccggac ttcggggcag tcctcggccc aaagcatcag
2460ctcatcgaga gcctgcgcga cggacgcact gacggtgtcg tccatcacag tttgccagtg
2520atacacatgg ggatcagcaa tcgcgcatat gaaatcacgc catgtagtgt attgaccgat
2580tccttgcggt ccgaatgggc cgaacccgct cgtctggcta agatcggccg cagcgatcgc
2640atccatagcc tccgcgaccg gctgcagaac agcgggcagt tcggtttcag gcaggtcttg
2700caacgtgaca ccctgtgcac ggcgggagat gcaataggtc aggctctcgc tgaattcccc
2760aatgtcaagc acttccggaa tcgggagcgc ggccgatgca aagtgccgat aaacataacg
2820atctttgtag aaaccatcgg cgcagctatt tacccgcagg acatatccac gccctcctac
2880atcgaagctg aaagcacgag attcttcgcc ctccgagagc tgcatcaggt cggagacgct
2940gtcgaacttt tcgatcagaa acttctcgac agacgtcgcg gtgagttcag gcttttccat
3000gggtatatct ccttcttaaa gttaaacaaa attatttcta gagggaaacc gttgtggtct
3060ccctatagtg agtcgtatta atttcgcggg atcgagatct gatcaacctg cattaatgaa
3120tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca
3180ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg
3240taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc
3300agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc
3360cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac
3420tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc
3480tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcaat
3540gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc
3600acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca
3660acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag
3720cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta
3780gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg
3840gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc
3900agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt
3960ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaca ttaacctata
4020aaaataggcg tatcacgagg ccctttcgtc tcgcgcgttt cggtgatgac ggtgaaaacc
4080tctgacacat gcagctcccg gagacggtca cagcttgtct gtaagcggat gccgggagca
4140gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg tcggggctgg cttaactatg
4200cggcatcaga gcagattgta ctgagagtgc accatatgga catattgtcg ttagaacgcg
4260gctacaatta atacataacc ttatgtatca tacacatacg atttaggtga cactatagaa
4320cggcgcgcca
4330195195DNAArtificial Sequencevector pKR1142 19ctagagggcc caattcgccc
tatagtgagt cgtattacaa ttcactggcc gtcgttttac 60aacgtcgtga ctgggaaaac
cctggcgtta cccaacttaa tcgccttgca gcacatcccc 120ctttcgccag ctggcgtaat
agcgaagagg cccgcaccga tcgcccttcc caacagttgc 180gcagcctata cgtacggcag
tttaaggttt acacctataa aagagagagc cgttatcgtc 240tgtttgtgga tgtacagagt
gatattattg acacgccggg gcgacggatg gtgatccccc 300tggccagtgc acgtctgctg
tcagataaag tctcccgtga actttacccg gtggtgcata 360tcggggatga aagctggcgc
atgatgacca ccgatatggc cagtgtgccg gtctccgtta 420tcggggaaga agtggctgat
ctcagccacc gcgaaaatga catcaaaaac gccattaacc 480tgatgttctg gggaatataa
atgtcaggca tgagattatc aaaaaggatc ttcacctaga 540tccttttcac gtagaaagcc
agtccgcaga aacggtgctg accccggatg aatgtcagct 600actgggctat ctggacaagg
gaaaacgcaa gcgcaaagag aaagcaggta gcttgcagtg 660ggcttacatg gcgatagcta
gactgggcgg ttttatggac agcaagcgaa ccggaattgc 720cagctggggc gccctctggt
aaggttggga agccctgcaa agtaaactgg atggctttct 780cgccgccaag gatctgatgg
cgcaggggat caagctctga tcaagagaca ggatgaggat 840cgtttcgcat gattgaacaa
gatggattgc acgcaggttc tccggccgct tgggtggaga 900ggctattcgg ctatgactgg
gcacaacaga caatcggctg ctctgatgcc gccgtgttcc 960ggctgtcagc gcaggggcgc
ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga 1020atgaactgca agacgaggca
gcgcggctat cgtggctggc cacgacgggc gttccttgcg 1080cagctgtgct cgacgttgtc
actgaagcgg gaagggactg gctgctattg ggcgaagtgc 1140cggggcagga tctcctgtca
tctcaccttg ctcctgccga gaaagtatcc atcatggctg 1200atgcaatgcg gcggctgcat
acgcttgatc cggctacctg cccattcgac caccaagcga 1260aacatcgcat cgagcgagca
cgtactcgga tggaagccgg tcttgtcgat caggatgatc 1320tggacgaaga gcatcagggg
ctcgcgccag ccgaactgtt cgccaggctc aaggcgagca 1380tgcccgacgg cgaggatctc
gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg 1440tggaaaatgg ccgcttttct
ggattcatcg actgtggccg gctgggtgtg gcggaccgct 1500atcaggacat agcgttggct
acccgtgata ttgctgaaga gcttggcggc gaatgggctg 1560accgcttcct cgtgctttac
ggtatcgccg ctcccgattc gcagcgcatc gccttctatc 1620gccttcttga cgagttcttc
tgaattatta acgcttacaa tttcctgatg cggtattttc 1680tccttacgca tctgtgcggt
atttcacacc gcatacaggt ggcacttttc ggggaaatgt 1740gcgcggaacc cctatttgtt
tatttttcta aatacattca aatatgtatc cgctcatgag 1800acaataaccc tgataaatgc
ttcaataata gcacgtgagg agggccacca tggccaagtt 1860gaccagtgcc gttccggtgc
tcaccgcgcg cgacgtcgcc ggagcggtcg agttctggac 1920cgaccggctc gggttctccc
gggacttcgt ggaggacgac ttcgccggtg tggtccggga 1980cgacgtgacc ctgttcatca
gcgcggtcca ggaccaggtg gtgccggaca acaccctggc 2040ctgggtgtgg gtgcgcggcc
tggacgagct gtacgccgag tggtcggagg tcgtgtccac 2100gaacttccgg gacgcctccg
ggccggccat gaccgagatc ggcgagcagc cgtgggggcg 2160ggagttcgcc ctgcgcgacc
cggccggcaa ctgcgtgcac ttcgtggccg aggagcagga 2220ctgacacgtg ctaaaacttc
atttttaatt taaaaggatc taggtgaaga tcctttttga 2280taatctcatg accaaaatcc
cttaacgtga gttttcgttc cactgagcgt cagaccccgt 2340agaaaagatc aaaggatctt
cttgagatcc tttttttctg cgcgtaatct gctgcttgca 2400aacaaaaaaa ccaccgctac
cagcggtggt ttgtttgccg gatcaagagc taccaactct 2460ttttccgaag gtaactggct
tcagcagagc gcagatacca aatactgtcc ttctagtgta 2520gccgtagtta ggccaccact
tcaagaactc tgtagcaccg cctacatacc tcgctctgct 2580aatcctgtta ccagtggctg
ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 2640aagacgatag ttaccggata
aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 2700gcccagcttg gagcgaacga
cctacaccga actgagatac ctacagcgtg agctatgaga 2760aagcgccacg cttcccgaag
ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 2820aacaggagag cgcacgaggg
agcttccagg gggaaacgcc tggtatcttt atagtcctgt 2880cgggtttcgc cacctctgac
ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 2940cctatggaaa aacgccagca
acgcggcctt tttacggttc ctgggctttt gctggccttt 3000tgctcacatg ttctttcctg
cgttatcccc tgattctgtg gataaccgta ttaccgcctt 3060tgagtgagct gataccgctc
gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 3120ggaagcggaa gagcgcccaa
tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 3180atgcagctgg cacgacaggt
ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 3240tgtgagttag ctcactcatt
aggcacccca ggctttacac tttatgcttc cggctcgtat 3300gttgtgtgga attgtgagcg
gataacaatt tcacacagga aacagctatg accatgatta 3360cgccaagcta tttaggtgac
gcgttagaat actcaagcta tgcatcaagc ttggtaccga 3420gctcggatcc tctagaaatc
cgtcaacatg gtggagcacg acactctcgt ctactccaag 3480aatatcaaag atacagtctc
agaagaccaa agggctattg agacttttca acaaagggta 3540atatcgggaa acctcctcgg
attccattgc ccagctatct gtcacttcat caaaaggaca 3600gtagaaaagg aaggtggcac
ctacaaatgc catcattgcg ataaaggaaa ggctatcgtt 3660caagatgcct ctgccgacag
tggtcccaaa gatggacccc cacccacgag gagcatcgtg 3720gaaaaagaag acgttccaac
cacgtcttca aagcaagtgg attgatgtga tgatcctatg 3780cgtatggtat gacgtgtgtt
caagatgatg acttcaaacc tacctatgac gtatggtatg 3840aacgtgtgtc gactgatgac
ttagatccac tcgagcggct ataaatacgt acctacgcac 3900cctgcgctac catccctaga
gctgcagctt atttttacaa caattaccaa caacaacaaa 3960caacaaacaa cattacaatt
actatttaca attacagtcg acccgggatc gtacctctag 4020ggtggcggcc gcaagtatga
actaaaatgc atgtaggtgt aagagctcat ggagagcatg 4080gaatattgta tccgaccatg
taacagtata ataactgagc tccatctcac ttcttctatg 4140aataaacaaa ggatgttatg
atatattaac actctatcta tgcaccttat tgttctatga 4200taaatttcct cttattatta
taaatcatct gaatcgtgac ggcttatgga atgcttcaaa 4260tagtacaaaa acaaatgtgt
actataagac tttctaaaca attctaacct tagcattgtg 4320aacgagacat aagtgttaag
aagacataac aattataatg gaagaagttt gtctccattt 4380atatattata tattacccac
ttatgtatta tattaggatg ttaaggagac ataacaatta 4440taaagagaga agtttgtatc
catttatata ttatatacta cccatttata tattatactt 4500atccacttat ttaatgtctt
tataaggttt gatccatgat atttctaata ttttagttga 4560tatgtatatg aaagggtact
atttgaactc tcttactctg tataaaggtt ggatcatcct 4620taaagtgggt ctatttaatt
ttattgcttc ttacagataa aaaaaaaatt atgagttggt 4680ttgataaaat attgaaggat
ttaaaataat aataaataac atataatata tgtatataaa 4740tttattataa tataacattt
atctataaaa aagtaaatat tgtcataaat ctatacaatc 4800gtttagcctt gctggacgaa
tctcaattat ttaaacgaga gtaaacatat ttgacttttt 4860ggttatttaa caaattatta
tttaacacta tatgaaattt ttttttttat cagcaaagaa 4920taaaattaaa ttaagaagga
caatggtgtc ccaatcctta tacaaccaac ttccacaaga 4980aagtcaagtc agagacaaca
aaaaaacaag caaaggaaat tttttaattt gagttgtctt 5040gtttgctgca taatttatgc
agtaaaacac tacacataac ccttttagca gtagagcaat 5100ggttgaccgt gtgcttagct
tcttttattt tattttttta tcagcaaaga ataaataaaa 5160taaaatgaga cacttcaggg
atgtttcaac aagct 5195208314DNAArtificial
Sequencevector pKR1141 20gatccgtcga cggcgcgccc gatcatccgg atatagttcc
tcctttcagc aaaaaacccc 60tcaagacccg tttagaggcc ccaaggggtt atgctagtta
ttgctcagcg gtggcagcag 120ccaactcagc ttcctttcgg gctttgttag cagccggatc
gatccaagct gtacctcact 180attcctttgc cctcggacga gtgctggggc gtcggtttcc
actatcggcg agtacttcta 240cacagccatc ggtccagacg gccgcgcttc tgcgggcgat
ttgtgtacgc ccgacagtcc 300cggctccgga tcggacgatt gcgtcgcatc gaccctgcgc
ccaagctgca tcatcgaaat 360tgccgtcaac caagctctga tagagttggt caagaccaat
gcggagcata tacgcccgga 420gccgcggcga tcctgcaagc tccggatgcc tccgctcgaa
gtagcgcgtc tgctgctcca 480tacaagccaa ccacggcctc cagaagaaga tgttggcgac
ctcgtattgg gaatccccga 540acatcgcctc gctccagtca atgaccgctg ttatgcggcc
attgtccgtc aggacattgt 600tggagccgaa atccgcgtgc acgaggtgcc ggacttcggg
gcagtcctcg gcccaaagca 660tcagctcatc gagagcctgc gcgacggacg cactgacggt
gtcgtccatc acagtttgcc 720agtgatacac atggggatca gcaatcgcgc atatgaaatc
acgccatgta gtgtattgac 780cgattccttg cggtccgaat gggccgaacc cgctcgtctg
gctaagatcg gccgcagcga 840tcgcatccat agcctccgcg accggctgca gaacagcggg
cagttcggtt tcaggcaggt 900cttgcaacgt gacaccctgt gcacggcggg agatgcaata
ggtcaggctc tcgctgaatt 960ccccaatgtc aagcacttcc ggaatcggga gcgcggccga
tgcaaagtgc cgataaacat 1020aacgatcttt gtagaaacca tcggcgcagc tatttacccg
caggacatat ccacgccctc 1080ctacatcgaa gctgaaagca cgagattctt cgccctccga
gagctgcatc aggtcggaga 1140cgctgtcgaa cttttcgatc agaaacttct cgacagacgt
cgcggtgagt tcaggctttt 1200ccatgggtat atctccttct taaagttaaa caaaattatt
tctagaggga aaccgttgtg 1260gtctccctat agtgagtcgt attaatttcg cgggatcgag
atcgatccaa ttccaatccc 1320acaaaaatct gagcttaaca gcacagttgc tcctctcaga
gcagaatcgg gtattcaaca 1380ccctcatatc aactactacg ttgtgtataa cggtccacat
gccggtatat acgatgactg 1440gggttgtaca aaggcggcaa caaacggcgt tcccggagtt
gcacacaaga aatttgccac 1500tattacagag gcaagagcag cagctgacgc gtacacaaca
agtcagcaaa cagacaggtt 1560gaacttcatc cccaaaggag aagctcaact caagcccaag
agctttgcta aggccctaac 1620aagcccacca aagcaaaaag cccactggct cacgctagga
accaaaaggc ccagcagtga 1680tccagcccca aaagagatct cctttgcccc ggagattaca
atggacgatt tcctctatct 1740ttacgatcta ggaaggaagt tcgaaggtga aggtgacgac
actatgttca ccactgataa 1800tgagaaggtt agcctcttca atttcagaaa gaatgctgac
ccacagatgg ttagagaggc 1860ctacgcagca ggtctcatca agacgatcta cccgagtaac
aatctccagg agatcaaata 1920ccttcccaag aaggttaaag atgcagtcaa aagattcagg
actaattgca tcaagaacac 1980agagaaagac atatttctca agatcagaag tactattcca
gtatggacga ttcaaggctt 2040gcttcataaa ccaaggcaag taatagagat tggagtctct
aaaaaggtag ttcctactga 2100atctaaggcc atgcatggag tctaagattc aaatcgagga
tctaacagaa ctcgccgtga 2160agactggcga acagttcata cagagtcttt tacgactcaa
tgacaagaag aaaatcttcg 2220tcaacatggt ggagcacgac actctggtct actccaaaaa
tgtcaaagat acagtctcag 2280aagaccaaag ggctattgag acttttcaac aaaggataat
ttcgggaaac ctcctcggat 2340tccattgccc agctatctgt cacttcatcg aaaggacagt
agaaaaggaa ggtggctcct 2400acaaatgcca tcattgcgat aaaggaaagg ctatcattca
agatgcctct gccgacagtg 2460gtcccaaaga tggaccccca cccacgagga gcatcgtgga
aaaagaagac gttccaacca 2520cgtcttcaaa gcaagtggat tgatgtgaca tctccactga
cgtaagggat gacgcacaat 2580cccactatcc ttcgcaagac ccttcctcta tataaggaag
ttcatttcat ttggagagga 2640cacgctcgag ctcatttctc tattacttca gccataacaa
aagaactctt ttctcttctt 2700attaaaccat gaaaaagcct gaactcaccg cgacgtctgt
cgagaagttt ctgatcgaaa 2760agttcgacag cgtctccgac ctgatgcagc tctcggaggg
cgaagaatct cgtgctttca 2820gcttcgatgt aggagggcgt ggatatgtcc tgcgggtaaa
tagctgcgcc gatggtttct 2880acaaagatcg ttatgtttat cggcactttg catcggccgc
gctcccgatt ccggaagtgc 2940ttgacattgg ggaattcagc gagagcctga cctattgcat
ctcccgccgt gcacagggtg 3000tcacgttgca agacctgcct gaaaccgaac tgcccgctgt
tctgcagccg gtcgcggagg 3060ccatggatgc gatcgctgcg gccgatctta gccagacgag
cgggttcggc ccattcggac 3120cgcaaggaat cggtcaatac actacatggc gtgatttcat
atgcgcgatt gctgatcccc 3180atgtgtatca ctggcaaact gtgatggacg acaccgtcag
tgcgtccgtc gcgcaggctc 3240tcgatgagct gatgctttgg gccgaggact gccccgaagt
ccggcacctc gtgcacgcgg 3300atttcggctc caacaatgtc ctgacggaca atggccgcat
aacagcggtc attgactgga 3360gcgaggcgat gttcggggat tcccaatacg aggtcgccaa
catcttcttc tggaggccgt 3420ggttggcttg tatggagcag cagacgcgct acttcgagcg
gaggcatccg gagcttgcag 3480gatcgccgcg gctccgggcg tatatgctcc gcattggtct
tgaccaactc tatcagagct 3540tggttgacgg caatttcgat gatgcagctt gggcgcaggg
tcgatgcgac gcaatcgtcc 3600gatccggagc cgggactgtc gggcgtacac aaatcgcccg
cagaagcgcg gccgtctgga 3660ccgatggctg tgtagaagta ctcgccgata gtggaaaccg
acgccccagc actcgtccga 3720gggcaaagga atagtgaggt acctaaagaa ggagtgcgtc
gaagcagatc gttcaaacat 3780ttggcaataa agtttcttaa gattgaatcc tgttgccggt
cttgcgatga ttatcatata 3840atttctgttg aattacgtta agcatgtaat aattaacatg
taatgcatga cgttatttat 3900gagatgggtt tttatgatta gagtcccgca attatacatt
taatacgcga tagaaaacaa 3960aatatagcgc gcaaactagg ataaattatc gcgcgcggtg
tcatctatgt tactagatcg 4020atgtcgaatc gatcaacctg cattaatgaa tcggccaacg
cgcggggaga ggcggtttgc 4080gtattgggcg ctcttccgct tcctcgctca ctgactcgct
gcgctcggtc gttcggctgc 4140ggcgagcggt atcagctcac tcaaaggcgg taatacggtt
atccacagaa tcaggggata 4200acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc
caggaaccgt aaaaaggccg 4260cgttgctggc gtttttccat aggctccgcc cccctgacga
gcatcacaaa aatcgacgct 4320caagtcagag gtggcgaaac ccgacaggac tataaagata
ccaggcgttt ccccctggaa 4380gctccctcgt gcgctctcct gttccgaccc tgccgcttac
cggatacctg tccgcctttc 4440tcccttcggg aagcgtggcg ctttctcaat gctcacgctg
taggtatctc agttcggtgt 4500aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc
cgttcagccc gaccgctgcg 4560ccttatccgg taactatcgt cttgagtcca acccggtaag
acacgactta tcgccactgg 4620cagcagccac tggtaacagg attagcagag cgaggtatgt
aggcggtgct acagagttct 4680tgaagtggtg gcctaactac ggctacacta gaaggacagt
atttggtatc tgcgctctgc 4740tgaagccagt taccttcgga aaaagagttg gtagctcttg
atccggcaaa caaaccaccg 4800ctggtagcgg tggttttttt gtttgcaagc agcagattac
gcgcagaaaa aaaggatctc 4860aagaagatcc tttgatcttt tctacggggt ctgacgctca
gtggaacgaa aactcacgtt 4920aagggatttt ggtcatgaca ttaacctata aaaataggcg
tatcacgagg ccctttcgtc 4980tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat
gcagctcccg gagacggtca 5040cagcttgtct gtaagcggat gccgggagca gacaagcccg
tcagggcgcg tcagcgggtg 5100ttggcgggtg tcggggctgg cttaactatg cggcatcaga
gcagattgta ctgagagtgc 5160accatatgga catattgtcg ttagaacgcg gctacaatta
atacataacc ttatgtatca 5220tacacatacg atttaggtga cactatagaa cggcgcgcca
agcttggatc tcctgcagga 5280tctggccggc cggatctcgt acggatcctc gaagagaagg
gttaataaca cactttttta 5340acatttttaa cacaaatttt agttatttaa aaatttatta
aaaaatttaa aataagaaga 5400ggaactcttt aaataaatct aacttacaaa atttatgatt
tttaataagt tttcaccaat 5460aaaaaatgtc ataaaaatat gttaaaaagt atattatcaa
tattctcttt atgataaata 5520aaaagaaaaa aaaaataaaa gttaagtgaa aatgagattg
aagtgacttt aggtgtgtat 5580aaatatatca accccgccaa caatttattt aatccaaata
tattgaagta tattattcca 5640tagcctttat ttatttatat atttattata taaaagcttt
atttgttcta ggttgttcat 5700gaaatatttt tttggtttta tctccgttgt aagaaaatca
tgtgctttgt gtcgccactc 5760actattgcag ctttttcatg cattggtcag attgacggtt
gattgtattt ttgtttttta 5820tggttttgtg ttatgactta agtcttcatc tctttatctc
ttcatcaggt ttgatggtta 5880cctaatatgg tccatgggta catgcatggt taaattaggt
ggccaacttt gttgtgaacg 5940atagaatttt ttttatatta agtaaactat ttttatatta
tgaaataata ataaaaaaaa 6000tattttatca ttattaacaa aatcatatta gttaatttgt
taactctata ataaaagaaa 6060tactgtaaca ttcacattac atggtaacat ctttccaccc
tttcatttgt tttttgtttg 6120atgacttttt ttcttgttta aatttatttc ccttctttta
aatttggaat acattatcat 6180catatataaa ctaaaatact aaaaacagga ttacacaaat
gataaataat aacacaaata 6240tttataaatc tagctgcaat atatttaaac tagctatatc
gatattgtaa aataaaacta 6300gctgcattga tactgataaa aaaatatcat gtgctttctg
gactgatgat gcagtatact 6360tttgacattg cctttatttt atttttcaga aaagctttct
tagttctggg ttcttcatta 6420tttgtttccc atctccattg tgaattgaat catttgcttc
gtgtcacaaa tacaatttag 6480ntaggtacat gcattggtca gattcacggt ttattatgtc
atgacttaag ttcatggtag 6540tacattacct gccacgcatg cattatattg gttagatttg
ataggcaaat ttggttgtca 6600acaatataaa tataaataat gtttttatat tacgaaataa
cagtgatcaa aacaaacagt 6660tttatcttta ttaacaagat tttgtttttg tttgatgacg
ttttttaatg tttacgcttt 6720cccccttctt ttgaatttag aacactttat catcataaaa
tcaaatacta aaaaaattac 6780atatttcata aataataaca caaatatttt taaaaaatct
gaaataataa tgaacaatat 6840tacatattat cacgaaaatt cattaataaa aatattatat
aaataaaatg taatagtagt 6900tatatgtagg aaaaaagtac tgcacgcata atatatacaa
aaagattaaa atgaactatt 6960ataaataata acactaaatt aatggtgaat catatcaaaa
taatgaaaaa gtaaataaaa 7020tttgtaatta acttctatat gtattacaca cacaaataat
aaataatagt aaaaaaaatt 7080atgataaata tttaccatct cataagatat ttaaaataat
gataaaaata tagattattt 7140tttatgcaac tagctagcca aaaagagaac acgggtatat
ataaaaagag tacctttaaa 7200ttctactgta cttcctttat tcctgacgtt tttatatcaa
gtggacatac gtgaagattt 7260taattatcag tctaaatatt tcattagcac ttaatacttt
tctgttttat tcctatccta 7320taagtagtcc cgattctccc aacattgctt attcacacaa
ctaactaaga aagtcttcca 7380tagcccccca agcggcccat ggcctcctcc gaggacgtca
tcaaggagtt catgcgcttc 7440aaggtgcgca tggagggctc cgtgaacggc cacgagttcg
agatcgaggg cgagggcgag 7500ggccgcccct acgagggcac ccagaccgcc aagctgaagg
tgaccaaggg cggccccctg 7560cccttcgcct gggacatcct gtccccccag ttccagtacg
gctccaaggt gtacgtgaag 7620caccccgccg acatccccga ctacaagaag ctgtccttcc
ccgagggctt caagtgggag 7680cgcgtgatga acttcgagga cggcggcgtg gtgaccgtga
cccaggactc ctccctgcag 7740gacggctcct tcatctacaa ggtgaagttc atcggcgtga
acttcccctc cgacggcccc 7800gtaatgcaga agaagactat gggctgggag gcctccaccg
agcgcctgta cccccgcgac 7860ggcgtgctga agggcgagat ccacaaggcc ctgaagctga
aggacggcgg ccactacctg 7920gtggagttca agtccatcta catggccaag aagcccgtgc
agctgcccgg ctactactac 7980gtggactcca agctggacat cacctcccac aacgaggact
acaccatcgt ggagcagtac 8040gagcgcgccg agggccgcca ccacctgttc ctgtagcggc
cggccgcgac acaagtgtga 8100gagtactaaa taaatgcttt ggttgtacga aatcattaca
ctaaataaaa taatcaaagc 8160ttatatatgc cttccgctaa ggccgaatgc aaagaaattg
gttctttctc gttatctttt 8220gccactttta ctagtacgta ttaattacta cttaatcatc
tttgtttacg gctcattata 8280tccgtcgacg gcgcgggccg ctctagaact agtg
83142135DNAArtificial SequencePCR primer SuSy-5
21tcctgcaggt ctactcttta catgttcttt actcc
352234DNAArtificial SequencePCR primer SuSy-3 22agcggccgcg attttttctc
agaggcaaaa acac 34235531DNAArtificial
Sequencevector pLF122 23cctgaattct gcagatatcc atcacactgg cggccgctcg
agcatgcatc tagagggccc 60aattcgccct atagtgagtc gtattacaat tcactggccg
tcgttttaca acgtcgtgac 120tgggaaaacc ctggcgttac ccaacttaat cgccttgcag
cacatccccc tttcgccagc 180tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc
aacagttgcg cagcctatac 240gtacggcagt ttaaggttta cacctataaa agagagagcc
gttatcgtct gtttgtggat 300gtacagagtg atattattga cacgccgggg cgacggatgg
tgatccccct ggccagtgca 360cgtctgctgt cagataaagt ctcccgtgaa ctttacccgg
tggtgcatat cggggatgaa 420agctggcgca tgatgaccac cgatatggcc agtgtgccgg
tctccgttat cggggaagaa 480gtggctgatc tcagccaccg cgaaaatgac atcaaaaacg
ccattaacct gatgttctgg 540ggaatataaa tgtcaggcat gagattatca aaaaggatct
tcacctagat ccttttcacg 600tagaaagcca gtccgcagaa acggtgctga ccccggatga
atgtcagcta ctgggctatc 660tggacaaggg aaaacgcaag cgcaaagaga aagcaggtag
cttgcagtgg gcttacatgg 720cgatagctag actgggcggt tttatggaca gcaagcgaac
cggaattgcc agctggggcg 780ccctctggta aggttgggaa gccctgcaaa gtaaactgga
tggctttctc gccgccaagg 840atctgatggc gcaggggatc aagctctgat caagagacag
gatgaggatc gtttcgcatg 900attgaacaag atggattgca cgcaggttct ccggccgctt
gggtggagag gctattcggc 960tatgactggg cacaacagac aatcggctgc tctgatgccg
ccgtgttccg gctgtcagcg 1020caggggcgcc cggttctttt tgtcaagacc gacctgtccg
gtgccctgaa tgaactgcaa 1080gacgaggcag cgcggctatc gtggctggcc acgacgggcg
ttccttgcgc agctgtgctc 1140gacgttgtca ctgaagcggg aagggactgg ctgctattgg
gcgaagtgcc ggggcaggat 1200ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca
tcatggctga tgcaatgcgg 1260cggctgcata cgcttgatcc ggctacctgc ccattcgacc
accaagcgaa acatcgcatc 1320gagcgagcac gtactcggat ggaagccggt cttgtcgatc
aggatgatct ggacgaagag 1380catcaggggc tcgcgccagc cgaactgttc gccaggctca
aggcgagcat gcccgacggc 1440gaggatctcg tcgtgaccca tggcgatgcc tgcttgccga
atatcatggt ggaaaatggc 1500cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg
cggaccgcta tcaggacata 1560gcgttggcta cccgtgatat tgctgaagag cttggcggcg
aatgggctga ccgcttcctc 1620gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg
ccttctatcg ccttcttgac 1680gagttcttct gaattattaa cgcttacaat ttcctgatgc
ggtattttct ccttacgcat 1740ctgtgcggta tttcacaccg catacaggtg gcacttttcg
gggaaatgtg cgcggaaccc 1800ctatttgttt atttttctaa atacattcaa atatgtatcc
gctcatgaga caataaccct 1860gataaatgct tcaataatag cacgtgagga gggccaccat
ggccaagttg accagtgccg 1920ttccggtgct caccgcgcgc gacgtcgccg gagcggtcga
gttctggacc gaccggctcg 1980ggttctcccg ggacttcgtg gaggacgact tcgccggtgt
ggtccgggac gacgtgaccc 2040tgttcatcag cgcggtccag gaccaggtgg tgccggacaa
caccctggcc tgggtgtggg 2100tgcgcggcct ggacgagctg tacgccgagt ggtcggaggt
cgtgtccacg aacttccggg 2160acgcctccgg gccggccatg accgagatcg gcgagcagcc
gtgggggcgg gagttcgccc 2220tgcgcgaccc ggccggcaac tgcgtgcact tcgtggccga
ggagcaggac tgacacgtgc 2280taaaacttca tttttaattt aaaaggatct aggtgaagat
cctttttgat aatctcatga 2340ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc
agaccccgta gaaaagatca 2400aaggatcttc ttgagatcct ttttttctgc gcgtaatctg
ctgcttgcaa acaaaaaaac 2460caccgctacc agcggtggtt tgtttgccgg atcaagagct
accaactctt tttccgaagg 2520taactggctt cagcagagcg cagataccaa atactgtcct
tctagtgtag ccgtagttag 2580gccaccactt caagaactct gtagcaccgc ctacatacct
cgctctgcta atcctgttac 2640cagtggctgc tgccagtggc gataagtcgt gtcttaccgg
gttggactca agacgatagt 2700taccggataa ggcgcagcgg tcgggctgaa cggggggttc
gtgcacacag cccagcttgg 2760agcgaacgac ctacaccgaa ctgagatacc tacagcgtga
gctatgagaa agcgccacgc 2820ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg
cagggtcgga acaggagagc 2880gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
tagtcctgtc gggtttcgcc 2940acctctgact tgagcgtcga tttttgtgat gctcgtcagg
ggggcggagc ctatggaaaa 3000acgccagcaa cgcggccttt ttacggttcc tgggcttttg
ctggcctttt gctcacatgt 3060tctttcctgc gttatcccct gattctgtgg ataaccgtat
taccgccttt gagtgagctg 3120ataccgctcg ccgcagccga acgaccgagc gcagcgagtc
agtgagcgag gaagcggaag 3180agcgcccaat acgcaaaccg cctctccccg cgcgttggcc
gattcattaa tgcagctggc 3240acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa
cgcaattaat gtgagttagc 3300tcactcatta ggcaccccag gctttacact ttatgcttcc
ggctcgtatg ttgtgtggaa 3360ttgtgagcgg ataacaattt cacacaggaa acagctatga
ccatgattac gccaagctat 3420ttaggtgacg cgttagaata ctcaagctat gcatcaagct
tggtaccgag ctcggatcca 3480ctagtaacgg ccgccagtgt gctggaattc aggtcctgca
ggtctactct ttacatgttc 3540tttactccgt ctcaaaattt cctttttttg ttggctctct
ccgaacgagt tggagaaatc 3600gttaacccta atcgaagatc tagattcctc tacatacgtt
tgatctctct ctcagtatgg 3660attacaaagc gccaaggaga tactactcac acggagttgt
tgcgagacag caagatttcg 3720caacagatat agttacgaga agaagacctt atgtccctta
cgaccgtcca aataagtttt 3780caaggagtct ggtttggacg tcaaaagagt acaaatcacc
cgagggcaat aatatgccaa 3840ggaccaatga tgtgtcaccg aaaccaccag ttttaggttt
ggcgaggaag aatgctgctt 3900gtgggccaat gagatcttct agtctcagaa aatgggtatg
taagtattgg aaagatggaa 3960agtgcaagag gggtgagcag tgccagttct tacactcttg
gtcttgtttc cctggattgg 4020ccatggtagc ttctcttgaa gggcacaata aggaactaaa
ggggatcgct ctccctgagg 4080gttcagataa actcttttca gtcagtattg atggtacatt
gcgagtttgg gactgcaatt 4140ctggtcagtg tgtacattcc atcaaccttg acgcagaagc
agggtctcta atcagtgaag 4200gcccttgggt tttccttggc ttgccaaacg ctataaaggc
ttttaacgtt caaaccagtc 4260aagatttgca tcttcaagca gcaggggtgg ttggtcaggt
gaatgcaatg actattgcaa 4320acggaatgct ttttgctgga acaagttctg gtagtatctt
agtctggaaa gctactacag 4380actctgagtc tgatccattc aaatacttga catctcttga
gggacatagt ggtgaagtca 4440cttgttttgc tgttggaggt caaatgctat actctggttc
tgtcgataaa acaatcaaga 4500tgtgggatct caacaccctg caatgtataa tgaccctgaa
gcaacatacc ggcactgtca 4560cttcactctt atgttgggat aaatgtttga tatcgtcttc
cttggatggg accataaaag 4620tttgggctta ttctgaaaac ggaatcttga aagttgttca
aactcgcaga caagaacaga 4680gtagtgttca tgctctttct ggtatgcatg atgcagaagc
caaaccgata atattctgct 4740cttaccaaaa cggaaccgtt ggcattttcg acctaccatc
ttttcaagaa agaggaagga 4800tgttctctac gcacacgatc gccacactca caattggtcc
tcaaggattg ttattcagtg 4860gagacgagag tggtaacttg cgtgtatgga ccttagctgc
tggcaacaaa gtttagtctt 4920ttcgactaaa gaattctgat ttaattttgt ggtttatatg
ttgagttaac tgttaagaga 4980gttttatttt gtaataggtg tatcagtcaa taaacaatct
ttgtatcaac caaatgtaat 5040ttttctcgtt aattcgattt cagagttttt actttaagat
aaacaaactc tttcacacat 5100catttaatga aagtggagaa gcttaaaaaa caaacaaaga
aactgatcca tttttggcgg 5160gtcttcttct actcttattc atatgtgtta acgaactata
gcgtaaaatt cagagcaagc 5220gatctccgat ttgaacgtgg ctatcaccgg aggcccacca
ctacgggcga tacgctctaa 5280gtgaggatta aagtgctctg gtggtgacgt tgaagaaact
cgcccatggt ttttgttatc 5340tctgcagcca agtgtcgttc tttcttcgcc acttctcatc
aagctacagt gaatttaaaa 5400atggcgtctt tctttgatct cgtatacata agctggattg
gtttcttaaa caaattcctc 5460tccttttggg tcttctgggt ttgccttgta agtgtttgtg
tttttgcctc tgagaaaaaa 5520tcgcggccgc t
5531246644DNAArtificial Sequencevector pKR1155
24ggccgcaagt atgaactaaa atgcatgtag gtgtaagagc tcatggagag catggaatat
60tgtatccgac catgtaacag tataataact gagctccatc tcacttcttc tatgaataaa
120caaaggatgt tatgatatat taacactcta tctatgcacc ttattgttct atgataaatt
180tcctcttatt attataaatc atctgaatcg tgacggctta tggaatgctt caaatagtac
240aaaaacaaat gtgtactata agactttcta aacaattcta accttagcat tgtgaacgag
300acataagtgt taagaagaca taacaattat aatggaagaa gtttgtctcc atttatatat
360tatatattac ccacttatgt attatattag gatgttaagg agacataaca attataaaga
420gagaagtttg tatccattta tatattatat actacccatt tatatattat acttatccac
480ttatttaatg tctttataag gtttgatcca tgatatttct aatattttag ttgatatgta
540tatgaaaggg tactatttga actctcttac tctgtataaa ggttggatca tccttaaagt
600gggtctattt aattttattg cttcttacag ataaaaaaaa aattatgagt tggtttgata
660aaatattgaa ggatttaaaa taataataaa taacatataa tatatgtata taaatttatt
720ataatataac atttatctat aaaaaagtaa atattgtcat aaatctatac aatcgtttag
780ccttgctgga cgaatctcaa ttatttaaac gagagtaaac atatttgact ttttggttat
840ttaacaaatt attatttaac actatatgaa attttttttt ttatcagcaa agaataaaat
900taaattaaga aggacaatgg tgtcccaatc cttatacaac caacttccac aagaaagtca
960agtcagagac aacaaaaaaa caagcaaagg aaatttttta atttgagttg tcttgtttgc
1020tgcataattt atgcagtaaa acactacaca taaccctttt agcagtagag caatggttga
1080ccgtgtgctt agcttctttt attttatttt tttatcagca aagaataaat aaaataaaat
1140gagacacttc agggatgttt caacaagctc tagagggccc aattcgccct atagtgagtc
1200gtattacaat tcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac
1260ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc
1320ccgcaccgat cgcccttccc aacagttgcg cagcctatac gtacggcagt ttaaggttta
1380cacctataaa agagagagcc gttatcgtct gtttgtggat gtacagagtg atattattga
1440cacgccgggg cgacggatgg tgatccccct ggccagtgca cgtctgctgt cagataaagt
1500ctcccgtgaa ctttacccgg tggtgcatat cggggatgaa agctggcgca tgatgaccac
1560cgatatggcc agtgtgccgg tctccgttat cggggaagaa gtggctgatc tcagccaccg
1620cgaaaatgac atcaaaaacg ccattaacct gatgttctgg ggaatataaa tgtcaggcat
1680gagattatca aaaaggatct tcacctagat ccttttcacg tagaaagcca gtccgcagaa
1740acggtgctga ccccggatga atgtcagcta ctgggctatc tggacaaggg aaaacgcaag
1800cgcaaagaga aagcaggtag cttgcagtgg gcttacatgg cgatagctag actgggcggt
1860tttatggaca gcaagcgaac cggaattgcc agctggggcg ccctctggta aggttgggaa
1920gccctgcaaa gtaaactgga tggctttctc gccgccaagg atctgatggc gcaggggatc
1980aagctctgat caagagacag gatgaggatc gtttcgcatg attgaacaag atggattgca
2040cgcaggttct ccggccgctt gggtggagag gctattcggc tatgactggg cacaacagac
2100aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc cggttctttt
2160tgtcaagacc gacctgtccg gtgccctgaa tgaactgcaa gacgaggcag cgcggctatc
2220gtggctggcc acgacgggcg ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg
2280aagggactgg ctgctattgg gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc
2340tcctgccgag aaagtatcca tcatggctga tgcaatgcgg cggctgcata cgcttgatcc
2400ggctacctgc ccattcgacc accaagcgaa acatcgcatc gagcgagcac gtactcggat
2460ggaagccggt cttgtcgatc aggatgatct ggacgaagag catcaggggc tcgcgccagc
2520cgaactgttc gccaggctca aggcgagcat gcccgacggc gaggatctcg tcgtgaccca
2580tggcgatgcc tgcttgccga atatcatggt ggaaaatggc cgcttttctg gattcatcga
2640ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta cccgtgatat
2700tgctgaagag cttggcggcg aatgggctga ccgcttcctc gtgctttacg gtatcgccgc
2760tcccgattcg cagcgcatcg ccttctatcg ccttcttgac gagttcttct gaattattaa
2820cgcttacaat ttcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg
2880catacaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa
2940atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatag
3000cacgtgagga gggccaccat ggccaagttg accagtgccg ttccggtgct caccgcgcgc
3060gacgtcgccg gagcggtcga gttctggacc gaccggctcg ggttctcccg ggacttcgtg
3120gaggacgact tcgccggtgt ggtccgggac gacgtgaccc tgttcatcag cgcggtccag
3180gaccaggtgg tgccggacaa caccctggcc tgggtgtggg tgcgcggcct ggacgagctg
3240tacgccgagt ggtcggaggt cgtgtccacg aacttccggg acgcctccgg gccggccatg
3300accgagatcg gcgagcagcc gtgggggcgg gagttcgccc tgcgcgaccc ggccggcaac
3360tgcgtgcact tcgtggccga ggagcaggac tgacacgtgc taaaacttca tttttaattt
3420aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag
3480ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct
3540ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt
3600tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg
3660cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt caagaactct
3720gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc
3780gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg
3840tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa
3900ctgagatacc tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg
3960gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg
4020ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga
4080tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt
4140ttacggttcc tgggcttttg ctggcctttt gctcacatgt tctttcctgc gttatcccct
4200gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg ccgcagccga
4260acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcccaat acgcaaaccg
4320cctctccccg cgcgttggcc gattcattaa tgcagctggc acgacaggtt tcccgactgg
4380aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc tcactcatta ggcaccccag
4440gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt
4500cacacaggaa acagctatga ccatgattac gccaagctat ttaggtgacg cgttagaata
4560ctcaagctat gcatcaagct tggtaccgag ctcggatcca ctagtaacgg ccgccagtgt
4620gctggaattc aggtcctgca ggtctactct ttacatgttc tttactccgt ctcaaaattt
4680cctttttttg ttggctctct ccgaacgagt tggagaaatc gttaacccta atcgaagatc
4740tagattcctc tacatacgtt tgatctctct ctcagtatgg attacaaagc gccaaggaga
4800tactactcac acggagttgt tgcgagacag caagatttcg caacagatat agttacgaga
4860agaagacctt atgtccctta cgaccgtcca aataagtttt caaggagtct ggtttggacg
4920tcaaaagagt acaaatcacc cgagggcaat aatatgccaa ggaccaatga tgtgtcaccg
4980aaaccaccag ttttaggttt ggcgaggaag aatgctgctt gtgggccaat gagatcttct
5040agtctcagaa aatgggtatg taagtattgg aaagatggaa agtgcaagag gggtgagcag
5100tgccagttct tacactcttg gtcttgtttc cctggattgg ccatggtagc ttctcttgaa
5160gggcacaata aggaactaaa ggggatcgct ctccctgagg gttcagataa actcttttca
5220gtcagtattg atggtacatt gcgagtttgg gactgcaatt ctggtcagtg tgtacattcc
5280atcaaccttg acgcagaagc agggtctcta atcagtgaag gcccttgggt tttccttggc
5340ttgccaaacg ctataaaggc ttttaacgtt caaaccagtc aagatttgca tcttcaagca
5400gcaggggtgg ttggtcaggt gaatgcaatg actattgcaa acggaatgct ttttgctgga
5460acaagttctg gtagtatctt agtctggaaa gctactacag actctgagtc tgatccattc
5520aaatacttga catctcttga gggacatagt ggtgaagtca cttgttttgc tgttggaggt
5580caaatgctat actctggttc tgtcgataaa acaatcaaga tgtgggatct caacaccctg
5640caatgtataa tgaccctgaa gcaacatacc ggcactgtca cttcactctt atgttgggat
5700aaatgtttga tatcgtcttc cttggatggg accataaaag tttgggctta ttctgaaaac
5760ggaatcttga aagttgttca aactcgcaga caagaacaga gtagtgttca tgctctttct
5820ggtatgcatg atgcagaagc caaaccgata atattctgct cttaccaaaa cggaaccgtt
5880ggcattttcg acctaccatc ttttcaagaa agaggaagga tgttctctac gcacacgatc
5940gccacactca caattggtcc tcaaggattg ttattcagtg gagacgagag tggtaacttg
6000cgtgtatgga ccttagctgc tggcaacaaa gtttagtctt ttcgactaaa gaattctgat
6060ttaattttgt ggtttatatg ttgagttaac tgttaagaga gttttatttt gtaataggtg
6120tatcagtcaa taaacaatct ttgtatcaac caaatgtaat ttttctcgtt aattcgattt
6180cagagttttt actttaagat aaacaaactc tttcacacat catttaatga aagtggagaa
6240gcttaaaaaa caaacaaaga aactgatcca tttttggcgg gtcttcttct actcttattc
6300atatgtgtta acgaactata gcgtaaaatt cagagcaagc gatctccgat ttgaacgtgg
6360ctatcaccgg aggcccacca ctacgggcga tacgctctaa gtgaggatta aagtgctctg
6420gtggtgacgt tgaagaaact cgcccatggt ttttgttatc tctgcagcca agtgtcgttc
6480tttcttcgcc acttctcatc aagctacagt gaatttaaaa atggcgtctt tctttgatct
6540cgtatacata agctggattg gtttcttaaa caaattcctc tccttttggg tcttctgggt
6600ttgccttgta agtgtttgtg tttttgcctc tgagaaaaaa tcgc
66442511736DNAArtificial Sequencevector pKR1158 25gtacgagatc cggccggcca
gatcctgcag gagatccaag cttggcgcgc cgttctatag 60tgtcacctaa atcgtatgtg
tatgatacat aaggttatgt attaattgta gccgcgttct 120aacgacaata tgtccatatg
gtgcactctc agtacaatct gctctgatgc cgcatagtta 180agccagcccc gacacccgcc
aacacccgct gacgcgccct gacgggcttg tctgctcccg 240gcatccgctt acagacaagc
tgtgaccgtc tccgggagct gcatgtgtca gaggttttca 300ccgtcatcac cgaaacgcgc
gagacgaaag ggcctcgtga tacgcctatt tttataggtt 360aatgtcatga ccaaaatccc
ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 420gaaaagatca aaggatcttc
ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 480acaaaaaaac caccgctacc
agcggtggtt tgtttgccgg atcaagagct accaactctt 540tttccgaagg taactggctt
cagcagagcg cagataccaa atactgtcct tctagtgtag 600ccgtagttag gccaccactt
caagaactct gtagcaccgc ctacatacct cgctctgcta 660atcctgttac cagtggctgc
tgccagtggc gataagtcgt gtcttaccgg gttggactca 720agacgatagt taccggataa
ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 780cccagcttgg agcgaacgac
ctacaccgaa ctgagatacc tacagcgtga gcattgagaa 840agcgccacgc ttcccgaagg
gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 900acaggagagc gcacgaggga
gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 960gggtttcgcc acctctgact
tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 1020ctatggaaaa acgccagcaa
cgcggccttt ttacggttcc tggccttttg ctggcctttt 1080gctcacatgt tctttcctgc
gttatcccct gattctgtgg ataaccgtat taccgccttt 1140gagtgagctg ataccgctcg
ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 1200gaagcggaag agcgcccaat
acgcaaaccg cctctccccg cgcgttggcc gattcattaa 1260tgcaggttga tcgattcgac
atcgatctag taacatagat gacaccgcgc gcgataattt 1320atcctagttt gcgcgctata
ttttgttttc tatcgcgtat taaatgtata attgcgggac 1380tctaatcata aaaacccatc
tcataaataa cgtcatgcat tacatgttaa ttattacatg 1440cttaacgtaa ttcaacagaa
attatatgat aatcatcgca agaccggcaa caggattcaa 1500tcttaagaaa ctttattgcc
aaatgtttga acgatctgct tcgacgcact ccttctttag 1560gtacctcact attcctttgc
cctcggacga gtgctggggc gtcggtttcc actatcggcg 1620agtacttcta cacagccatc
ggtccagacg gccgcgcttc tgcgggcgat ttgtgtacgc 1680ccgacagtcc cggctccgga
tcggacgatt gcgtcgcatc gaccctgcgc ccaagctgca 1740tcatcgaaat tgccgtcaac
caagctctga tagagttggt caagaccaat gcggagcata 1800tacgcccgga gccgcggcga
tcctgcaagc tccggatgcc tccgctcgaa gtagcgcgtc 1860tgctgctcca tacaagccaa
ccacggcctc cagaagaaga tgttggcgac ctcgtattgg 1920gaatccccga acatcgcctc
gctccagtca atgaccgctg ttatgcggcc attgtccgtc 1980aggacattgt tggagccgaa
atccgcgtgc acgaggtgcc ggacttcggg gcagtcctcg 2040gcccaaagca tcagctcatc
gagagcctgc gcgacggacg cactgacggt gtcgtccatc 2100acagtttgcc agtgatacac
atggggatca gcaatcgcgc atatgaaatc acgccatgta 2160gtgtattgac cgattccttg
cggtccgaat gggccgaacc cgctcgtctg gctaagatcg 2220gccgcagcga tcgcatccat
ggcctccgcg accggctgca gaacagcggg cagttcggtt 2280tcaggcaggt cttgcaacgt
gacaccctgt gcacggcggg agatgcaata ggtcaggctc 2340tcgctgaatt ccccaatgtc
aagcacttcc ggaatcggga gcgcggccga tgcaaagtgc 2400cgataaacat aacgatcttt
gtagaaacca tcggcgcagc tatttacccg caggacatat 2460ccacgccctc ctacatcgaa
gctgaaagca cgagattctt cgccctccga gagctgcatc 2520aggtcggaga cgctgtcgaa
cttttcgatc agaaacttct cgacagacgt cgcggtgagt 2580tcaggctttt tcatggttta
ataagaagag aaaagagttc ttttgttatg gctgaagtaa 2640tagagaaatg agctcgagcg
tgtcctctcc aaatgaaatg aacttcctta tatagaggaa 2700gggtcttgcg aaggatagtg
ggattgtgcg tcatccctta cgtcagtgga gatgtcacat 2760caatccactt gctttgaaga
cgtggttgga acgtcttctt tttccacgat gctcctcgtg 2820ggtgggggtc catctttggg
accactgtcg gcagaggcat cttgaatgat agcctttcct 2880ttatcgcaat gatggcattt
gtaggagcca ccttcctttt ctactgtcct ttcgatgaag 2940tgacagatag ctgggcaatg
gaatccgagg aggtttcccg aaattatcct ttgttgaaaa 3000gtctcaatag ccctttggtc
ttctgagact gtatctttga catttttgga gtagaccaga 3060gtgtcgtgct ccaccatgtt
gacgaagatt ttcttcttgt cattgagtcg taaaagactc 3120tgtatgaact gttcgccagt
cttcacggcg agttctgtta gatcctcgat ttgaatctta 3180gactccatgc atggccttag
attcagtagg aactaccttt ttagagactc caatctctat 3240tacttgcctt ggtttatgaa
gcaagccttg aatcgtccat actggaatag tacttctgat 3300cttgagaaat atgtctttct
ctgtgttctt gatgcaatta gtcctgaatc ttttgactgc 3360atctttaacc ttcttgggaa
ggtatttgat ctcctggaga ttgttactcg ggtagatcgt 3420cttgatgaga cctgctgcgt
aggcctctct aaccatctgt gggtcagcat tctttctgaa 3480attgaagagg ctaaccttct
cattatcagt ggtgaacata gtgtcgtcac cttcaccttc 3540gaacttcctt cctagatcgt
aaagatagag gaaatcgtcc attgtaatct ccggggcaaa 3600ggagatctct tttggggctg
gatcactgct gggccttttg gttcctagcg tgagccagtg 3660ggctttttgc tttggtgggc
ttgttagggc cttagcaaag ctcttgggct tgagttgagc 3720ttctcctttg gggatgaagt
tcaacctgtc tgtttgctga cttgttgtgt acgcgtcagc 3780tgctgctctt gcctctgtaa
tagtggcaaa tttcttgtgt gcaactccgg gaacgccgtt 3840tgttgccgcc tttgtacaac
cccagtcatc gtatataccg gcatgtggac cgttatacac 3900aacgtagtag ttgatatgag
ggtgttgaat acccgattct gctctgagag gagcaactgt 3960gctgttaagc tcagattttt
gtgggattgg aattggatcg atctcgatcc cgcgaaatta 4020atacgactca ctatagggag
accacaacgg tttccctcta gaaataattt tgtttaactt 4080taagaaggag atatacccat
ggaaaagcct gaactcaccg cgacgtctgt cgagaagttt 4140ctgatcgaaa agttcgacag
cgtctccgac ctgatgcagc tctcggaggg cgaagaatct 4200cgtgctttca gcttcgatgt
aggagggcgt ggatatgtcc tgcgggtaaa tagctgcgcc 4260gatggtttct acaaagatcg
ttatgtttat cggcactttg catcggccgc gctcccgatt 4320ccggaagtgc ttgacattgg
ggaattcagc gagagcctga cctattgcat ctcccgccgt 4380gcacagggtg tcacgttgca
agacctgcct gaaaccgaac tgcccgctgt tctgcagccg 4440gtcgcggagg ctatggatgc
gatcgctgcg gccgatctta gccagacgag cgggttcggc 4500ccattcggac cgcaaggaat
cggtcaatac actacatggc gtgatttcat atgcgcgatt 4560gctgatcccc atgtgtatca
ctggcaaact gtgatggacg acaccgtcag tgcgtccgtc 4620gcgcaggctc tcgatgagct
gatgctttgg gccgaggact gccccgaagt ccggcacctc 4680gtgcacgcgg atttcggctc
caacaatgtc ctgacggaca atggccgcat aacagcggtc 4740attgactgga gcgaggcgat
gttcggggat tcccaatacg aggtcgccaa catcttcttc 4800tggaggccgt ggttggcttg
tatggagcag cagacgcgct acttcgagcg gaggcatccg 4860gagcttgcag gatcgccgcg
gctccgggcg tatatgctcc gcattggtct tgaccaactc 4920tatcagagct tggttgacgg
caatttcgat gatgcagctt gggcgcaggg tcgatgcgac 4980gcaatcgtcc gatccggagc
cgggactgtc gggcgtacac aaatcgcccg cagaagcgcg 5040gccgtctgga ccgatggctg
tgtagaagta ctcgccgata gtggaaaccg acgccccagc 5100actcgtccga gggcaaagga
atagtgaggt acagcttgga tcgatccggc tgctaacaaa 5160gcccgaaagg aagctgagtt
ggctgctgcc accgctgagc aataactagc ataacccctt 5220ggggcctcta aacgggtctt
gaggggtttt ttgctgaaag gaggaactat atccggatga 5280tcgggcgcgc cgtcgacgga
tccactagtt ctagagcggc ccgcgccgtc gacggatata 5340atgagccgta aacaaagatg
attaagtagt aattaatacg tactagtaaa agtggcaaaa 5400gataacgaga aagaaccaat
ttctttgcat tcggccttag cggaaggcat atataagctt 5460tgattatttt atttagtgta
atgatttcgt acaaccaaag catttattta gtactctcac 5520acttgtgtcg cggccggccg
ctacaggaac aggtggtggc ggccctcggc gcgctcgtac 5580tgctccacga tggtgtagtc
ctcgttgtgg gaggtgatgt ccagcttgga gtccacgtag 5640tagtagccgg gcagctgcac
gggcttcttg gccatgtaga tggacttgaa ctccaccagg 5700tagtggccgc cgtccttcag
cttcagggcc ttgtggatct cgcccttcag cacgccgtcg 5760cgggggtaca ggcgctcggt
ggaggcctcc cagcccatag tcttcttctg cattacgggg 5820ccgtcggagg ggaagttcac
gccgatgaac ttcaccttgt agatgaagga gccgtcctgc 5880agggaggagt cctgggtcac
ggtcaccacg ccgccgtcct cgaagttcat cacgcgctcc 5940cacttgaagc cctcggggaa
ggacagcttc ttgtagtcgg ggatgtcggc ggggtgcttc 6000acgtacacct tggagccgta
ctggaactgg ggggacagga tgtcccaggc gaagggcagg 6060gggccgccct tggtcacctt
cagcttggcg gtctgggtgc cctcgtaggg gcggccctcg 6120ccctcgccct cgatctcgaa
ctcgtggccg ttcacggagc cctccatgcg caccttgaag 6180cgcatgaact ccttgatgac
gtcctcggag gaggccatgg gccgcttggg gggctatgga 6240agactttctt agttagttgt
gtgaataagc aatgttggga gaatcgggac tacttatagg 6300ataggaataa aacagaaaag
tattaagtgc taatgaaata tttagactga taattaaaat 6360cttcacgtat gtccacttga
tataaaaacg tcaggaataa aggaagtaca gtagaattta 6420aaggtactct ttttatatat
acccgtgttc tctttttggc tagctagttg cataaaaaat 6480aatctatatt tttatcatta
ttttaaatat cttatgagat ggtaaatatt tatcataatt 6540ttttttacta ttatttatta
tttgtgtgtg taatacatat agaagttaat tacaaatttt 6600atttactttt tcattatttt
gatatgattc accattaatt tagtgttatt atttataata 6660gttcatttta atctttttgt
atatattatg cgtgcagtac ttttttccta catataacta 6720ctattacatt ttatttatat
aatattttta ttaatgaatt ttcgtgataa tatgtaatat 6780tgttcattat tatttcagat
tttttaaaaa tatttgtgtt attatttatg aaatatgtaa 6840tttttttagt atttgatttt
atgatgataa agtgttctaa attcaaaaga agggggaaag 6900cgtaaacatt aaaaaacgtc
atcaaacaaa aacaaaatct tgttaataaa gataaaactg 6960tttgttttga tcactgttat
ttcgtaatat aaaaacatta tttatattta tattgttgac 7020aaccaaattt gcctatcaaa
tctaaccaat ataatgcatg cgtggcaggt aatgtactac 7080catgaactta agtcatgaca
taataaaccg tgaatctgac caatgcatgt acctanctaa 7140attgtatttg tgacacgaag
caaatgattc aattcacaat ggagatggga aacaaataat 7200gaagaaccca gaactaagaa
agcttttctg aaaaataaaa taaaggcaat gtcaaaagta 7260tactgcatca tcagtccaga
aagcacatga tattttttta tcagtatcaa tgcagctagt 7320tttattttac aatatcgata
tagctagttt aaatatattg cagctagatt tataaatatt 7380tgtgttatta tttatcattt
gtgtaatcct gtttttagta ttttagttta tatatgatga 7440taatgtattc caaatttaaa
agaagggaaa taaatttaaa caagaaaaaa agtcatcaaa 7500caaaaaacaa atgaaagggt
ggaaagatgt taccatgtaa tgtgaatgtt acagtatttc 7560ttttattata gagttaacaa
attaactaat atgattttgt taataatgat aaaatatttt 7620ttttattatt atttcataat
ataaaaatag tttacttaat ataaaaaaaa ttctatcgtt 7680cacaacaaag ttggccacct
aatttaacca tgcatgtacc catggaccat attaggtaac 7740catcaaacct gatgaagaga
taaagagatg aagacttaag tcataacaca aaaccataaa 7800aaacaaaaat acaatcaacc
gtcaatctga ccaatgcatg aaaaagctgc aatagtgagt 7860ggcgacacaa agcacatgat
tttcttacaa cggagataaa accaaaaaaa tatttcatga 7920acaacctaga acaaataaag
cttttatata ataaatatat aaataaataa aggctatgga 7980ataatatact tcaatatatt
tggattaaat aaattgttgg cggggttgat atatttatac 8040acacctaaag tcacttcaat
ctcattttca cttaactttt attttttttt tctttttatt 8100tatcataaag agaatattga
taatatactt tttaacatat ttttatgaca ttttttattg 8160gtgaaaactt attaaaaatc
ataaattttg taagttagat ttatttaaag agttcctctt 8220cttattttaa attttttaat
aaatttttaa ataactaaaa tttgtgttaa aaatgttaaa 8280aaagtgtgtt attaaccctt
ctcttcgagg atccgtaccg agctcggatc cactagtaac 8340ggccgccagt gtgctggaat
tcaggtcctg caggtctact ctttacatgt tctttactcc 8400gtctcaaaat ttcctttttt
tgttggctct ctccgaacga gttggagaaa tcgttaaccc 8460taatcgaaga tctagattcc
tctacatacg tttgatctct ctctcagtat ggattacaaa 8520gcgccaagga gatactactc
acacggagtt gttgcgagac agcaagattt cgcaacagat 8580atagttacga gaagaagacc
ttatgtccct tacgaccgtc caaataagtt ttcaaggagt 8640ctggtttgga cgtcaaaaga
gtacaaatca cccgagggca ataatatgcc aaggaccaat 8700gatgtgtcac cgaaaccacc
agttttaggt ttggcgagga agaatgctgc ttgtgggcca 8760atgagatctt ctagtctcag
aaaatgggta tgtaagtatt ggaaagatgg aaagtgcaag 8820aggggtgagc agtgccagtt
cttacactct tggtcttgtt tccctggatt ggccatggta 8880gcttctcttg aagggcacaa
taaggaacta aaggggatcg ctctccctga gggttcagat 8940aaactctttt cagtcagtat
tgatggtaca ttgcgagttt gggactgcaa ttctggtcag 9000tgtgtacatt ccatcaacct
tgacgcagaa gcagggtctc taatcagtga aggcccttgg 9060gttttccttg gcttgccaaa
cgctataaag gcttttaacg ttcaaaccag tcaagatttg 9120catcttcaag cagcaggggt
ggttggtcag gtgaatgcaa tgactattgc aaacggaatg 9180ctttttgctg gaacaagttc
tggtagtatc ttagtctgga aagctactac agactctgag 9240tctgatccat tcaaatactt
gacatctctt gagggacata gtggtgaagt cacttgtttt 9300gctgttggag gtcaaatgct
atactctggt tctgtcgata aaacaatcaa gatgtgggat 9360ctcaacaccc tgcaatgtat
aatgaccctg aagcaacata ccggcactgt cacttcactc 9420ttatgttggg ataaatgttt
gatatcgtct tccttggatg ggaccataaa agtttgggct 9480tattctgaaa acggaatctt
gaaagttgtt caaactcgca gacaagaaca gagtagtgtt 9540catgctcttt ctggtatgca
tgatgcagaa gccaaaccga taatattctg ctcttaccaa 9600aacggaaccg ttggcatttt
cgacctacca tcttttcaag aaagaggaag gatgttctct 9660acgcacacga tcgccacact
cacaattggt cctcaaggat tgttattcag tggagacgag 9720agtggtaact tgcgtgtatg
gaccttagct gctggcaaca aagtttagtc ttttcgacta 9780aagaattctg atttaatttt
gtggtttata tgttgagtta actgttaaga gagttttatt 9840ttgtaatagg tgtatcagtc
aataaacaat ctttgtatca accaaatgta atttttctcg 9900ttaattcgat ttcagagttt
ttactttaag ataaacaaac tctttcacac atcatttaat 9960gaaagtggag aagcttaaaa
aacaaacaaa gaaactgatc catttttggc gggtcttctt 10020ctactcttat tcatatgtgt
taacgaacta tagcgtaaaa ttcagagcaa gcgatctccg 10080atttgaacgt ggctatcacc
ggaggcccac cactacgggc gatacgctct aagtgaggat 10140taaagtgctc tggtggtgac
gttgaagaaa ctcgcccatg gtttttgtta tctctgcagc 10200caagtgtcgt tctttcttcg
ccacttctca tcaagctaca gtgaatttaa aaatggcgtc 10260tttctttgat ctcgtataca
taagctggat tggtttctta aacaaattcc tctccttttg 10320ggtcttctgg gtttgccttg
taagtgtttg tgtttttgcc tctgagaaaa aatcgcggcc 10380gcaagtatga actaaaatgc
atgtaggtgt aagagctcat ggagagcatg gaatattgta 10440tccgaccatg taacagtata
ataactgagc tccatctcac ttcttctatg aataaacaaa 10500ggatgttatg atatattaac
actctatcta tgcaccttat tgttctatga taaatttcct 10560cttattatta taaatcatct
gaatcgtgac ggcttatgga atgcttcaaa tagtacaaaa 10620acaaatgtgt actataagac
tttctaaaca attctaacct tagcattgtg aacgagacat 10680aagtgttaag aagacataac
aattataatg gaagaagttt gtctccattt atatattata 10740tattacccac ttatgtatta
tattaggatg ttaaggagac ataacaatta taaagagaga 10800agtttgtatc catttatata
ttatatacta cccatttata tattatactt atccacttat 10860ttaatgtctt tataaggttt
gatccatgat atttctaata ttttagttga tatgtatatg 10920aaagggtact atttgaactc
tcttactctg tataaaggtt ggatcatcct taaagtgggt 10980ctatttaatt ttattgcttc
ttacagataa aaaaaaaatt atgagttggt ttgataaaat 11040attgaaggat ttaaaataat
aataaataac atataatata tgtatataaa tttattataa 11100tataacattt atctataaaa
aagtaaatat tgtcataaat ctatacaatc gtttagcctt 11160gctggacgaa tctcaattat
ttaaacgaga gtaaacatat ttgacttttt ggttatttaa 11220caaattatta tttaacacta
tatgaaattt ttttttttat cagcaaagaa taaaattaaa 11280ttaagaagga caatggtgtc
ccaatcctta tacaaccaac ttccacaaga aagtcaagtc 11340agagacaaca aaaaaacaag
caaaggaaat tttttaattt gagttgtctt gtttgctgca 11400taatttatgc agtaaaacac
tacacataac ccttttagca gtagagcaat ggttgaccgt 11460gtgcttagct tcttttattt
tattttttta tcagcaaaga ataaataaaa taaaatgaga 11520cacttcaggg atgtttcaac
aagctctaga gggcccaatt cgccctatag tgagtcgtat 11580tacaattcac tggccgtcgt
tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 11640cttaatcgcc ttgcagcaca
tccccctttc gccagctggc gtaatagcga agaggcccgc 11700accgatcgcc cttcccaaca
gttgcgcagc ctatac 117362612929DNAArtificial
Sequencevector pKR1167 26ggccgcaagt atgaactaaa atgcatgtag gtgtaagagc
tcatggagag catggaatat 60tgtatccgac catgtaacag tataataact gagctccatc
tcacttcttc tatgaataaa 120caaaggatgt tatgatatat taacactcta tctatgcacc
ttattgttct atgataaatt 180tcctcttatt attataaatc atctgaatcg tgacggctta
tggaatgctt caaatagtac 240aaaaacaaat gtgtactata agactttcta aacaattcta
accttagcat tgtgaacgag 300acataagtgt taagaagaca taacaattat aatggaagaa
gtttgtctcc atttatatat 360tatatattac ccacttatgt attatattag gatgttaagg
agacataaca attataaaga 420gagaagtttg tatccattta tatattatat actacccatt
tatatattat acttatccac 480ttatttaatg tctttataag gtttgatcca tgatatttct
aatattttag ttgatatgta 540tatgaaaggg tactatttga actctcttac tctgtataaa
ggttggatca tccttaaagt 600gggtctattt aattttattg cttcttacag ataaaaaaaa
aattatgagt tggtttgata 660aaatattgaa ggatttaaaa taataataaa taacatataa
tatatgtata taaatttatt 720ataatataac atttatctat aaaaaagtaa atattgtcat
aaatctatac aatcgtttag 780ccttgctgga cgaatctcaa ttatttaaac gagagtaaac
atatttgact ttttggttat 840ttaacaaatt attatttaac actatatgaa attttttttt
ttatcagcaa agaataaaat 900taaattaaga aggacaatgg tgtcccaatc cttatacaac
caacttccac aagaaagtca 960agtcagagac aacaaaaaaa caagcaaagg aaatttttta
atttgagttg tcttgtttgc 1020tgcataattt atgcagtaaa acactacaca taaccctttt
agcagtagag caatggttga 1080ccgtgtgctt agcttctttt attttatttt tttatcagca
aagaataaat aaaataaaat 1140gagacacttc agggatgttt caacaagctc tagagggccc
aattcgccct atagtgagtc 1200gtattacaat tcactggccg tcgttttaca acgtcgtgac
tgggaaaacc ctggcgttac 1260ccaacttaat cgccttgcag cacatccccc tttcgccagc
tggcgtaata gcgaagaggc 1320ccgcaccgat cgcccttccc aacagttgcg cagcctatac
gtacgagatc cggccggcca 1380gatcctgcag gagatccaag cttggcgcgc cgttctatag
tgtcacctaa atcgtatgtg 1440tatgatacat aaggttatgt attaattgta gccgcgttct
aacgacaata tgtccatatg 1500gtgcactctc agtacaatct gctctgatgc cgcatagtta
agccagcccc gacacccgcc 1560aacacccgct gacgcgccct gacgggcttg tctgctcccg
gcatccgctt acagacaagc 1620tgtgaccgtc tccgggagct gcatgtgtca gaggttttca
ccgtcatcac cgaaacgcgc 1680gagacgaaag ggcctcgtga tacgcctatt tttataggtt
aatgtcatga ccaaaatccc 1740ttaacgtgag ttttcgttcc actgagcgtc agaccccgta
gaaaagatca aaggatcttc 1800ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa
acaaaaaaac caccgctacc 1860agcggtggtt tgtttgccgg atcaagagct accaactctt
tttccgaagg taactggctt 1920cagcagagcg cagataccaa atactgtcct tctagtgtag
ccgtagttag gccaccactt 1980caagaactct gtagcaccgc ctacatacct cgctctgcta
atcctgttac cagtggctgc 2040tgccagtggc gataagtcgt gtcttaccgg gttggactca
agacgatagt taccggataa 2100ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag
cccagcttgg agcgaacgac 2160ctacaccgaa ctgagatacc tacagcgtga gcattgagaa
agcgccacgc ttcccgaagg 2220gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga
acaggagagc gcacgaggga 2280gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc
gggtttcgcc acctctgact 2340tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc
ctatggaaaa acgccagcaa 2400cgcggccttt ttacggttcc tggccttttg ctggcctttt
gctcacatgt tctttcctgc 2460gttatcccct gattctgtgg ataaccgtat taccgccttt
gagtgagctg ataccgctcg 2520ccgcagccga acgaccgagc gcagcgagtc agtgagcgag
gaagcggaag agcgcccaat 2580acgcaaaccg cctctccccg cgcgttggcc gattcattaa
tgcaggttga tcgattcgac 2640atcgatctag taacatagat gacaccgcgc gcgataattt
atcctagttt gcgcgctata 2700ttttgttttc tatcgcgtat taaatgtata attgcgggac
tctaatcata aaaacccatc 2760tcataaataa cgtcatgcat tacatgttaa ttattacatg
cttaacgtaa ttcaacagaa 2820attatatgat aatcatcgca agaccggcaa caggattcaa
tcttaagaaa ctttattgcc 2880aaatgtttga acgatctgct tcgacgcact ccttctttag
gtacctcact attcctttgc 2940cctcggacga gtgctggggc gtcggtttcc actatcggcg
agtacttcta cacagccatc 3000ggtccagacg gccgcgcttc tgcgggcgat ttgtgtacgc
ccgacagtcc cggctccgga 3060tcggacgatt gcgtcgcatc gaccctgcgc ccaagctgca
tcatcgaaat tgccgtcaac 3120caagctctga tagagttggt caagaccaat gcggagcata
tacgcccgga gccgcggcga 3180tcctgcaagc tccggatgcc tccgctcgaa gtagcgcgtc
tgctgctcca tacaagccaa 3240ccacggcctc cagaagaaga tgttggcgac ctcgtattgg
gaatccccga acatcgcctc 3300gctccagtca atgaccgctg ttatgcggcc attgtccgtc
aggacattgt tggagccgaa 3360atccgcgtgc acgaggtgcc ggacttcggg gcagtcctcg
gcccaaagca tcagctcatc 3420gagagcctgc gcgacggacg cactgacggt gtcgtccatc
acagtttgcc agtgatacac 3480atggggatca gcaatcgcgc atatgaaatc acgccatgta
gtgtattgac cgattccttg 3540cggtccgaat gggccgaacc cgctcgtctg gctaagatcg
gccgcagcga tcgcatccat 3600ggcctccgcg accggctgca gaacagcggg cagttcggtt
tcaggcaggt cttgcaacgt 3660gacaccctgt gcacggcggg agatgcaata ggtcaggctc
tcgctgaatt ccccaatgtc 3720aagcacttcc ggaatcggga gcgcggccga tgcaaagtgc
cgataaacat aacgatcttt 3780gtagaaacca tcggcgcagc tatttacccg caggacatat
ccacgccctc ctacatcgaa 3840gctgaaagca cgagattctt cgccctccga gagctgcatc
aggtcggaga cgctgtcgaa 3900cttttcgatc agaaacttct cgacagacgt cgcggtgagt
tcaggctttt tcatggttta 3960ataagaagag aaaagagttc ttttgttatg gctgaagtaa
tagagaaatg agctcgagcg 4020tgtcctctcc aaatgaaatg aacttcctta tatagaggaa
gggtcttgcg aaggatagtg 4080ggattgtgcg tcatccctta cgtcagtgga gatgtcacat
caatccactt gctttgaaga 4140cgtggttgga acgtcttctt tttccacgat gctcctcgtg
ggtgggggtc catctttggg 4200accactgtcg gcagaggcat cttgaatgat agcctttcct
ttatcgcaat gatggcattt 4260gtaggagcca ccttcctttt ctactgtcct ttcgatgaag
tgacagatag ctgggcaatg 4320gaatccgagg aggtttcccg aaattatcct ttgttgaaaa
gtctcaatag ccctttggtc 4380ttctgagact gtatctttga catttttgga gtagaccaga
gtgtcgtgct ccaccatgtt 4440gacgaagatt ttcttcttgt cattgagtcg taaaagactc
tgtatgaact gttcgccagt 4500cttcacggcg agttctgtta gatcctcgat ttgaatctta
gactccatgc atggccttag 4560attcagtagg aactaccttt ttagagactc caatctctat
tacttgcctt ggtttatgaa 4620gcaagccttg aatcgtccat actggaatag tacttctgat
cttgagaaat atgtctttct 4680ctgtgttctt gatgcaatta gtcctgaatc ttttgactgc
atctttaacc ttcttgggaa 4740ggtatttgat ctcctggaga ttgttactcg ggtagatcgt
cttgatgaga cctgctgcgt 4800aggcctctct aaccatctgt gggtcagcat tctttctgaa
attgaagagg ctaaccttct 4860cattatcagt ggtgaacata gtgtcgtcac cttcaccttc
gaacttcctt cctagatcgt 4920aaagatagag gaaatcgtcc attgtaatct ccggggcaaa
ggagatctct tttggggctg 4980gatcactgct gggccttttg gttcctagcg tgagccagtg
ggctttttgc tttggtgggc 5040ttgttagggc cttagcaaag ctcttgggct tgagttgagc
ttctcctttg gggatgaagt 5100tcaacctgtc tgtttgctga cttgttgtgt acgcgtcagc
tgctgctctt gcctctgtaa 5160tagtggcaaa tttcttgtgt gcaactccgg gaacgccgtt
tgttgccgcc tttgtacaac 5220cccagtcatc gtatataccg gcatgtggac cgttatacac
aacgtagtag ttgatatgag 5280ggtgttgaat acccgattct gctctgagag gagcaactgt
gctgttaagc tcagattttt 5340gtgggattgg aattggatcg atctcgatcc cgcgaaatta
atacgactca ctatagggag 5400accacaacgg tttccctcta gaaataattt tgtttaactt
taagaaggag atatacccat 5460ggaaaagcct gaactcaccg cgacgtctgt cgagaagttt
ctgatcgaaa agttcgacag 5520cgtctccgac ctgatgcagc tctcggaggg cgaagaatct
cgtgctttca gcttcgatgt 5580aggagggcgt ggatatgtcc tgcgggtaaa tagctgcgcc
gatggtttct acaaagatcg 5640ttatgtttat cggcactttg catcggccgc gctcccgatt
ccggaagtgc ttgacattgg 5700ggaattcagc gagagcctga cctattgcat ctcccgccgt
gcacagggtg tcacgttgca 5760agacctgcct gaaaccgaac tgcccgctgt tctgcagccg
gtcgcggagg ctatggatgc 5820gatcgctgcg gccgatctta gccagacgag cgggttcggc
ccattcggac cgcaaggaat 5880cggtcaatac actacatggc gtgatttcat atgcgcgatt
gctgatcccc atgtgtatca 5940ctggcaaact gtgatggacg acaccgtcag tgcgtccgtc
gcgcaggctc tcgatgagct 6000gatgctttgg gccgaggact gccccgaagt ccggcacctc
gtgcacgcgg atttcggctc 6060caacaatgtc ctgacggaca atggccgcat aacagcggtc
attgactgga gcgaggcgat 6120gttcggggat tcccaatacg aggtcgccaa catcttcttc
tggaggccgt ggttggcttg 6180tatggagcag cagacgcgct acttcgagcg gaggcatccg
gagcttgcag gatcgccgcg 6240gctccgggcg tatatgctcc gcattggtct tgaccaactc
tatcagagct tggttgacgg 6300caatttcgat gatgcagctt gggcgcaggg tcgatgcgac
gcaatcgtcc gatccggagc 6360cgggactgtc gggcgtacac aaatcgcccg cagaagcgcg
gccgtctgga ccgatggctg 6420tgtagaagta ctcgccgata gtggaaaccg acgccccagc
actcgtccga gggcaaagga 6480atagtgaggt acagcttgga tcgatccggc tgctaacaaa
gcccgaaagg aagctgagtt 6540ggctgctgcc accgctgagc aataactagc ataacccctt
ggggcctcta aacgggtctt 6600gaggggtttt ttgctgaaag gaggaactat atccggatga
tcgggcgcgc cgtcgacgga 6660tccactagtt ctagagcggc ccgcgccgtc gacggatata
atgagccgta aacaaagatg 6720attaagtagt aattaatacg tactagtaaa agtggcaaaa
gataacgaga aagaaccaat 6780ttctttgcat tcggccttag cggaaggcat atataagctt
tgattatttt atttagtgta 6840atgatttcgt acaaccaaag catttattta gtactctcac
acttgtgtcg cggccggccg 6900ctacaggaac aggtggtggc ggccctcggc gcgctcgtac
tgctccacga tggtgtagtc 6960ctcgttgtgg gaggtgatgt ccagcttgga gtccacgtag
tagtagccgg gcagctgcac 7020gggcttcttg gccatgtaga tggacttgaa ctccaccagg
tagtggccgc cgtccttcag 7080cttcagggcc ttgtggatct cgcccttcag cacgccgtcg
cgggggtaca ggcgctcggt 7140ggaggcctcc cagcccatag tcttcttctg cattacgggg
ccgtcggagg ggaagttcac 7200gccgatgaac ttcaccttgt agatgaagga gccgtcctgc
agggaggagt cctgggtcac 7260ggtcaccacg ccgccgtcct cgaagttcat cacgcgctcc
cacttgaagc cctcggggaa 7320ggacagcttc ttgtagtcgg ggatgtcggc ggggtgcttc
acgtacacct tggagccgta 7380ctggaactgg ggggacagga tgtcccaggc gaagggcagg
gggccgccct tggtcacctt 7440cagcttggcg gtctgggtgc cctcgtaggg gcggccctcg
ccctcgccct cgatctcgaa 7500ctcgtggccg ttcacggagc cctccatgcg caccttgaag
cgcatgaact ccttgatgac 7560gtcctcggag gaggccatgg gccgcttggg gggctatgga
agactttctt agttagttgt 7620gtgaataagc aatgttggga gaatcgggac tacttatagg
ataggaataa aacagaaaag 7680tattaagtgc taatgaaata tttagactga taattaaaat
cttcacgtat gtccacttga 7740tataaaaacg tcaggaataa aggaagtaca gtagaattta
aaggtactct ttttatatat 7800acccgtgttc tctttttggc tagctagttg cataaaaaat
aatctatatt tttatcatta 7860ttttaaatat cttatgagat ggtaaatatt tatcataatt
ttttttacta ttatttatta 7920tttgtgtgtg taatacatat agaagttaat tacaaatttt
atttactttt tcattatttt 7980gatatgattc accattaatt tagtgttatt atttataata
gttcatttta atctttttgt 8040atatattatg cgtgcagtac ttttttccta catataacta
ctattacatt ttatttatat 8100aatattttta ttaatgaatt ttcgtgataa tatgtaatat
tgttcattat tatttcagat 8160tttttaaaaa tatttgtgtt attatttatg aaatatgtaa
tttttttagt atttgatttt 8220atgatgataa agtgttctaa attcaaaaga agggggaaag
cgtaaacatt aaaaaacgtc 8280atcaaacaaa aacaaaatct tgttaataaa gataaaactg
tttgttttga tcactgttat 8340ttcgtaatat aaaaacatta tttatattta tattgttgac
aaccaaattt gcctatcaaa 8400tctaaccaat ataatgcatg cgtggcaggt aatgtactac
catgaactta agtcatgaca 8460taataaaccg tgaatctgac caatgcatgt acctanctaa
attgtatttg tgacacgaag 8520caaatgattc aattcacaat ggagatggga aacaaataat
gaagaaccca gaactaagaa 8580agcttttctg aaaaataaaa taaaggcaat gtcaaaagta
tactgcatca tcagtccaga 8640aagcacatga tattttttta tcagtatcaa tgcagctagt
tttattttac aatatcgata 8700tagctagttt aaatatattg cagctagatt tataaatatt
tgtgttatta tttatcattt 8760gtgtaatcct gtttttagta ttttagttta tatatgatga
taatgtattc caaatttaaa 8820agaagggaaa taaatttaaa caagaaaaaa agtcatcaaa
caaaaaacaa atgaaagggt 8880ggaaagatgt taccatgtaa tgtgaatgtt acagtatttc
ttttattata gagttaacaa 8940attaactaat atgattttgt taataatgat aaaatatttt
ttttattatt atttcataat 9000ataaaaatag tttacttaat ataaaaaaaa ttctatcgtt
cacaacaaag ttggccacct 9060aatttaacca tgcatgtacc catggaccat attaggtaac
catcaaacct gatgaagaga 9120taaagagatg aagacttaag tcataacaca aaaccataaa
aaacaaaaat acaatcaacc 9180gtcaatctga ccaatgcatg aaaaagctgc aatagtgagt
ggcgacacaa agcacatgat 9240tttcttacaa cggagataaa accaaaaaaa tatttcatga
acaacctaga acaaataaag 9300cttttatata ataaatatat aaataaataa aggctatgga
ataatatact tcaatatatt 9360tggattaaat aaattgttgg cggggttgat atatttatac
acacctaaag tcacttcaat 9420ctcattttca cttaactttt attttttttt tctttttatt
tatcataaag agaatattga 9480taatatactt tttaacatat ttttatgaca ttttttattg
gtgaaaactt attaaaaatc 9540ataaattttg taagttagat ttatttaaag agttcctctt
cttattttaa attttttaat 9600aaatttttaa ataactaaaa tttgtgttaa aaatgttaaa
aaagtgtgtt attaaccctt 9660ctcttcgagg atccgtaccg agctcggatc cactagtaac
ggccgccagt gtgctggaat 9720tcaggtcctg caggtctact ctttacatgt tctttactcc
gtctcaaaat ttcctttttt 9780tgttggctct ctccgaacga gttggagaaa tcgttaaccc
taatcgaaga tctagattcc 9840tctacatacg tttgatctct ctctcagtat ggattacaaa
gcgccaagga gatactactc 9900acacggagtt gttgcgagac agcaagattt cgcaacagat
atagttacga gaagaagacc 9960ttatgtccct tacgaccgtc caaataagtt ttcaaggagt
ctggtttgga cgtcaaaaga 10020gtacaaatca cccgagggca ataatatgcc aaggaccaat
gatgtgtcac cgaaaccacc 10080agttttaggt ttggcgagga agaatgctgc ttgtgggcca
atgagatctt ctagtctcag 10140aaaatgggta tgtaagtatt ggaaagatgg aaagtgcaag
aggggtgagc agtgccagtt 10200cttacactct tggtcttgtt tccctggatt ggccatggta
gcttctcttg aagggcacaa 10260taaggaacta aaggggatcg ctctccctga gggttcagat
aaactctttt cagtcagtat 10320tgatggtaca ttgcgagttt gggactgcaa ttctggtcag
tgtgtacatt ccatcaacct 10380tgacgcagaa gcagggtctc taatcagtga aggcccttgg
gttttccttg gcttgccaaa 10440cgctataaag gcttttaacg ttcaaaccag tcaagatttg
catcttcaag cagcaggggt 10500ggttggtcag gtgaatgcaa tgactattgc aaacggaatg
ctttttgctg gaacaagttc 10560tggtagtatc ttagtctgga aagctactac agactctgag
tctgatccat tcaaatactt 10620gacatctctt gagggacata gtggtgaagt cacttgtttt
gctgttggag gtcaaatgct 10680atactctggt tctgtcgata aaacaatcaa gatgtgggat
ctcaacaccc tgcaatgtat 10740aatgaccctg aagcaacata ccggcactgt cacttcactc
ttatgttggg ataaatgttt 10800gatatcgtct tccttggatg ggaccataaa agtttgggct
tattctgaaa acggaatctt 10860gaaagttgtt caaactcgca gacaagaaca gagtagtgtt
catgctcttt ctggtatgca 10920tgatgcagaa gccaaaccga taatattctg ctcttaccaa
aacggaaccg ttggcatttt 10980cgacctacca tcttttcaag aaagaggaag gatgttctct
acgcacacga tcgccacact 11040cacaattggt cctcaaggat tgttattcag tggagacgag
agtggtaact tgcgtgtatg 11100gaccttagct gctggcaaca aagtttagtc ttttcgacta
aagaattctg atttaatttt 11160gtggtttata tgttgagtta actgttaaga gagttttatt
ttgtaatagg tgtatcagtc 11220aataaacaat ctttgtatca accaaatgta atttttctcg
ttaattcgat ttcagagttt 11280ttactttaag ataaacaaac tctttcacac atcatttaat
gaaagtggag aagcttaaaa 11340aacaaacaaa gaaactgatc catttttggc gggtcttctt
ctactcttat tcatatgtgt 11400taacgaacta tagcgtaaaa ttcagagcaa gcgatctccg
atttgaacgt ggctatcacc 11460ggaggcccac cactacgggc gatacgctct aagtgaggat
taaagtgctc tggtggtgac 11520gttgaagaaa ctcgcccatg gtttttgtta tctctgcagc
caagtgtcgt tctttcttcg 11580ccacttctca tcaagctaca gtgaatttaa aaatggcgtc
tttctttgat ctcgtataca 11640taagctggat tggtttctta aacaaattcc tctccttttg
ggtcttctgg gtttgccttg 11700taagtgtttg tgtttttgcc tctgagaaaa aatcgcggcc
gcatggagag atctcaacgg 11760cagtctcctc cgccaccgtc gccgtcctcc tcctcgtcct
ccgtctccgc ggacaccgtc 11820ctcgtccctc ccggaaagag gcggagggcg gcgacggcca
aggccggcgc cgagcctaat 11880aagaggatcc gcaaggaccc cgccgccgcc gccgcgggga
agaggagctc cgtctacagg 11940ggagtcacca ggcacaggtg gacgggcagg ttcgaggcgc
atctctggga caagcactgc 12000ctcgccgcgc tccacaacaa gaagaaaggc aggcaagtct
acctgggggc gtatgacagc 12060gaggaggcag ctgctcgtgc ctatgacctc gcagctctca
agtactgggg tcctgagact 12120ctgctcaact tccctgtgga ggattactcc agcgagatgc
cggagatgga ggccgtgtcc 12180cgggaggagt acctggcctc cctccgccgc aggagcagcg
gcttctccag gggcgtctcc 12240aagtacagag gcgtcgccag gcatcaccac aacgggaggt
gggaggcacg gattgggcga 12300gtctttggga acaagtacct ctacttggga acatttgaca
ctcaagaaga ggcagccaag 12360gcctatgacc ttgcggccat tgaataccgt ggcgtcaatg
ctgtaaccaa cttcgacatc 12420agctgctacc tggaccaccc gctgttcctg gcacagctcc
aacaggagcc acaggtggtg 12480ccggcactca accaagaacc tcaacctgat cagagcgaaa
ccggaactac agagcaagag 12540ccggagtcaa gcgaagccaa gacaccggat ggcagtgcag
aacccgatga gaacgcggtg 12600cctgacgaca ccgcggagcc cctcaccaca gtcgacgaca
gcatcgaaga gggcttgtgg 12660agcccttgca tggattacga gctagacacc atgtcgagac
caaactttgg cagctcaatc 12720aatctgagcg agtggttcgc tgacgcagac ttcgactgca
acatcggatg cctgttcgat 12780gggtgttctg cggctgacga aggaagcaag gatggtgtag
gtctggcaga tttcagtctg 12840tttgaggcag gtgatgtcca gctgaaggat gttctttcgg
atatggaaga ggggatacaa 12900cctccagcga tgatcagtgt gtgcaacgc
129292713268DNAArtificial Sequencevector pKR92
27cgcgcctcga gtgggcggat cccccgggct gcaggaattc actggccgtc gttttacaac
60gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt
120tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca
180gcctgaatgg cgaatggatc gatccatcgc gatgtacctt ttgttagtca gcctctcgat
240tgctcatcgt cattacacag taccgaagtt tgatcgatct agtaacatag atgacaccgc
300gcgcgataat ttatcctagt ttgcgcgcta tattttgttt tctatcgcgt attaaatgta
360taattgcggg actctaatca taaaaaccca tctcataaat aacgtcatgc attacatgtt
420aattattaca tgcttaacgt aattcaacag aaattatatg ataatcatcg caagaccggc
480aacaggattc aatcttaaga aactttattg ccaaatgttt gaacgatctg cttcgacgca
540ctccttcttt actccaccat ctcgtcctta ttgaaaacgt gggtagcacc aaaacgaatc
600aagtcgctgg aactgaagtt accaatcacg ctggatgatt tgccagttgg attaatcttg
660cctttccccg catgaataat attgatgaat gcatgcgtga ggggtagttc gatgttggca
720atagctgcaa ttgccgcgac atcctccaac gagcataatt cttcagaaaa atagcgatgt
780tccatgttgt cagggcatgc atgatgcacg ttatgaggtg acggtgctag gcagtattcc
840ctcaaagttt catagtcagt atcatattca tcattgcatt cctgcaagag agaattgaga
900cgcaatccac acgctgcggc aaccttccgg cgttcgtggt ctatttgctc ttggacgttg
960caaacgtaag tgttggatcg atccggggtg ggcgaagaac tccagcatga gatccccgcg
1020ctggaggatc atccagccgg cgtcccggaa aacgattccg aagcccaacc tttcatagaa
1080ggcggcggtg gaatcgaaat ctcgtgatgg caggttgggc gtcgcttggt cggtcatttc
1140gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc gatgcgctgc
1200gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc gccgccaagc
1260tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc cacacccagc
1320cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt cggcaagcag
1380gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt gagcctggcg
1440aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg atcgacaaga
1500ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gtcgaatggg
1560caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat ggatactttc
1620tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc caatagcagc
1680cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg
1740gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc ggacaggtcg
1800gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc ggcatcagag
1860cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca agcggccgga
1920gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atccccgcaa gcttggagac
1980tggtgatttc agcgtgtcct ctccaaatga aatgaacttc cttatataga ggaagggtct
2040tgcgaaggat agtgggattg tgcgtcatcc cttacgtcag tggagatatc acatcaatcc
2100acttgctttg aagacgtggt tggaacgtct tctttttcca cgatgctcct cgtgggtggg
2160ggtccatctt tgggaccact gtcggcagag gcatcttcaa cgatggcctt tcctttatcg
2220caatgatggc atttgtagga gccaccttcc ttttccacta tcttcacaat aaagtgacag
2280atagctgggc aatggaatcc gaggaggttt ccggatatta ccctttgttg aaaagtctca
2340attgcccttt ggtcttctga gactgtatct ttgatatttt tggagtagac aagcgtgtcg
2400tgctccacca tgttgacgaa gattttcttc ttgtcattga gtcgtaagag actctgtatg
2460aactgttcgc cagtctttac ggcgagttct gttaggtcct ctatttgaat ctttgactcc
2520atggcctttg attcagtggg aactaccttt ttagagactc caatctctat tacttgcctt
2580ggtttgtgaa gcaagccttg aatcgtccat actggaatag tacttctgat cttgagaaat
2640atatctttct ctgtgttctt gatgcagtta gtcctgaatc ttttgactgc atctttaacc
2700ttcttgggaa ggtatttgat ctcctggaga ttattgctcg ggtagatcgt cttgatgaga
2760cctgctgcgt aagcctctct aaccatctgt gggttagcat tctttctgaa attgaaaagg
2820ctaatcttct cattatcagt ggtgaacatg gtatcgtcac cttctccgtc gaacttcctg
2880actagatcgt agagatagag gaagtcgtcc attgtgatct ctggggcaaa ggagtctgaa
2940ttaattcgat atggtggatt tatcacaaat gggacccgcc gccgacagag gtgtgatgtt
3000aggccaggac tttgaaaatt tgcgcaacta tcgtatagtg gccgacaaat tgacgccgag
3060ttgacagact gcctagcatt tgagtgaatt atgtgaggta atgggctaca ctgaattggt
3120agctcaaact gtcagtattt atgtatatga gtgtatattt tcgcataatc tcagaccaat
3180ctgaagatga aatgggtatc tgggaatggc gaaatcaagg catcgatcgt gaagtttctc
3240atctaagccc ccatttggac gtgaatgtag acacgtcgaa ataaagattt ccgaattaga
3300ataatttgtt tattgctttc gcctataaat acgacggatc gtaatttgtc gttttatcaa
3360aatgtacttt cattttataa taacgctgcg gacatctaca tttttgaatt gaaaaaaaat
3420tggtaattac tctttctttt tctccatatt gaccatcata ctcattgctg atccatgtag
3480atttcccgga catgaagcca tttacaattg aatatatcct gccgccgctg ccgctttgca
3540cccggtggag cttgcatgtt ggtttctacg cagaactgag ccggttaggc agataatttc
3600cattgagaac tgagccatgt gcaccttccc cccaacacgg tgagcgacgg ggcaacggag
3660tgatccacat gggactttta aacatcatcc gtcggatggc gttgcgagag aagcagtcga
3720tccgtgagat cagccgacgc accgggcagg cgcgcaacac gatcgcaaag tatttgaacg
3780caggtacaat cgagccgacg ttcacgcgga acgaccaagc aagctagctt taatgcggta
3840gtttatcaca gttaaattgc taacgcagtc aggcaccgtg tatgaaatct aacaatgcgc
3900tcatcgtcat cctcggcacc gtcaccctgg atgctgtagg cataggcttg gttatgccgg
3960tactgccggg cctcttgcgg gatatcgtcc attccgacag catcgccagt cactatggcg
4020tgctgctagc gctatatgcg ttgatgcaat ttctatgcgc acccgttctc ggagcactgt
4080ccgaccgctt tggccgccgc ccagtcctgc tcgcttcgct acttggagcc actatcgact
4140acgcgatcat ggcgaccaca cccgtcctgt ggtccaaccc ctccgctgct atagtgcagt
4200cggcttctga cgttcagtgc agccgtcttc tgaaaacgac atgtcgcaca agtcctaagt
4260tacgcgacag gctgccgccc tgcccttttc ctggcgtttt cttgtcgcgt gttttagtcg
4320cataaagtag aatacttgcg actagaaccg gagacattac gccatgaaca agagcgccgc
4380cgctggcctg ctgggctatg cccgcgtcag caccgacgac caggacttga ccaaccaacg
4440ggccgaactg cacgcggccg gctgcaccaa gctgttttcc gagaagatca ccggcaccag
4500gcgcgaccgc ccggagctgg ccaggatgct tgaccaccta cgccctggcg acgttgtgac
4560agtgaccagg ctagaccgcc tggcccgcag cacccgcgac ctactggaca ttgccgagcg
4620catccaggag gccggcgcgg gcctgcgtag cctggcagag ccgtgggccg acaccaccac
4680gccggccggc cgcatggtgt tgaccgtgtt cgccggcatt gccgagttcg agcgttccct
4740aatcatcgac cgcacccgga gcgggcgcga ggccgccaag gcccgaggcg tgaagtttgg
4800cccccgccct accctcaccc cggcacagat cgcgcacgcc cgcgagctga tcgaccagga
4860aggccgcacc gtgaaagagg cggctgcact gcttggcgtg catcgctcga ccctgtaccg
4920cgcacttgag cgcagcgagg aagtgacgcc caccgaggcc aggcggcgcg gtgccttccg
4980tgaggacgca ttgaccgagg ccgacgccct ggcggccgcc gagaatgaac gccaagagga
5040acaagcatga aaccgcacca ggacggccag gacgaaccgt ttttcattac cgaagagatc
5100gaggcggaga tgatcgcggc cgggtacgtg ttcgagccgc ccgcgcacgt ctcaaccgtg
5160cggctgcatg aaatcctggc cggtttgtct gatgccaagc tggcggcctg gccggccagc
5220ttggccgctg aagaaaccga gcgccgccgt ctaaaaaggt gatgtgtatt tgagtaaaac
5280agcttgcgtc atgcggtcgc tgcgtatatg atgcgatgag taaataaaca aatacgcaag
5340ggaacgcatg aagttatcgc tgtacttaac cagaaaggcg ggtcaggcaa gacgaccatc
5400gcaacccatc tagcccgcgc cctgcaactc gccggggccg atgttctgtt agtcgattcc
5460gatccccagg gcagtgcccg cgattgggcg gccgtgcggg aagatcaacc gctaaccgtt
5520gtcggcatcg accgcccgac gattgaccgc gacgtgaagg ccatcggccg gcgcgacttc
5580gtagtgatcg acggagcgcc ccaggcggcg gacttggctg tgtccgcgat caaggcagcc
5640gacttcgtgc tgattccggt gcagccaagc ccttacgaca tatgggccac cgccgacctg
5700gtggagctgg ttaagcagcg cattgaggtc acggatggaa ggctacaagc ggcctttgtc
5760gtgtcgcggg cgatcaaagg cacgcgcatc ggcggtgagg ttgccgaggc gctggccggg
5820tacgagctgc ccattcttga gtcccgtatc acgcagcgcg tgagctaccc aggcactgcc
5880gccgccggca caaccgttct tgaatcagaa cccgagggcg acgctgcccg cgaggtccag
5940gcgctggccg ctgaaattaa atcaaaactc atttgagtta atgaggtaaa gagaaaatga
6000gcaaaagcac aaacacgcta agtgccggcc gtccgagcgc acgcagcagc aaggctgcaa
6060cgttggccag cctggcagac acgccagcca tgaagcgggt caactttcag ttgccggcgg
6120aggatcacac caagctgaag atgtacgcgg tacgccaagg caagaccatt accgagctgc
6180tatctgaata catcgcgcag ctaccagagt aaatgagcaa atgaataaat gagtagatga
6240attttagcgg ctaaaggagg cggcatggaa aatcaagaac aaccaggcac cgacgccgtg
6300gaatgcccca tgtgtggagg aacgggcggt tggccaggcg taagcggctg ggttgtctgc
6360cggccctgca atggcactgg aacccccaag cccgaggaat cggcgtgagc ggtcgcaaac
6420catccggccc ggtacaaatc ggcgcggcgc tgggtgatga cctggtggag aagttgaagg
6480ccgcgcaggc cgcccagcgg caacgcatcg aggcagaagc acgccccggt gaatcgtggc
6540aagcggccgc tgatcgaatc cgcaaagaat cccggcaacc gccggcagcc ggtgcgccgt
6600cgattaggaa gccgcccaag ggcgacgagc aaccagattt tttcgttccg atgctctatg
6660acgtgggcac ccgcgatagt cgcagcatca tggacgtggc cgttttccgt ctgtcgaagc
6720gtgaccgacg agctggcgag gtgatccgct acgagcttcc agacgggcac gtagaggttt
6780ccgcagggcc ggccggcatg gccagtgtgt gggattacga cctggtactg atggcggttt
6840cccatctaac cgaatccatg aaccgatacc gggaagggaa gggagacaag cccggccgcg
6900tgttccgtcc acacgttgcg gacgtactca agttctgccg gcgagccgat ggcggaaagc
6960agaaagacga cctggtagaa acctgcattc ggttaaacac cacgcacgtt gccatgcagc
7020gtacgaagaa ggccaagaac ggccgcctgg tgacggtatc cgagggtgaa gccttgatta
7080gccgctacaa gatcgtaaag agcgaaaccg ggcggccgga gtacatcgag atcgagctag
7140ctgattggat gtaccgcgag atcacagaag gcaagaaccc ggacgtgctg acggttcacc
7200ccgattactt tttgatcgat cccggcatcg gccgttttct ctaccgcctg gcacgccgcg
7260ccgcaggcaa ggcagaagcc agatggttgt tcaagacgat ctacgaacgc agtggcagcg
7320ccggagagtt caagaagttc tgtttcaccg tgcgcaagct gatcgggtca aatgacctgc
7380cggagtacga tttgaaggag gaggcggggc aggctggccc gatcctagtc atgcgctacc
7440gcaacctgat cgagggcgaa gcatccgccg gttcctaatg tacggagcag atgctagggc
7500aaattgccct agcaggggaa aaaggtcgaa aaggtctctt tcctgtggat agcacgtaca
7560ttgggaaccc aaagccgtac attgggaacc ggaacccgta cattgggaac ccaaagccgt
7620acattgggaa ccggtcacac atgtaagtga ctgatataaa agagaaaaaa ggcgattttt
7680ccgcctaaaa ctctttaaaa cttattaaaa ctcttaaaac ccgcctggcc tgtgcataac
7740tgtctggcca gcgcacagcc gaagagctgc aaaaagcgcc tacccttcgg tcgctgcgct
7800ccctacgccc cgccgcttcg cgtcggccta tcgcggccgc tggccgctca aaaatggctg
7860gcctacggcc aggcaatcta ccagggcgcg gacaagccgc gccgtcgcca ctcgaccgcc
7920ggcgcccaca tcaaggcacc ctgcctcgcg cgtttcggtg atgacggtga aaacctctga
7980cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa
8040gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gcgcagccat gacccagtca
8100cgtagcgata gcggagtgta tactggctta actatgcggc atcagagcag attgtactga
8160gagtgcacca tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa taccgcatca
8220ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag
8280cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag
8340gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc
8400tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc
8460agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc
8520tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt
8580cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg
8640ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat
8700ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag
8760ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt
8820ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc
8880cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta
8940gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag
9000atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga
9060ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa
9120gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa
9180tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc
9240ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga
9300taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa
9360gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt
9420gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg
9480ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc
9540aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg
9600gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag
9660cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt
9720actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt
9780caacacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaag
9840acctgcaggg gggggggggc gctgaggtct gcctcgtgaa gaaggtgttg ctgactcata
9900ccaggcctga atcgccccat catccagcca gaaagtgagg gagccacggt tgatgagagc
9960tttgttgtag gtggaccagt tggtgatttt gaacttttgc tttgccacgg aacggtctgc
10020gttgtcggga agatgcgtga tctgatcctt caactcagca aaagttcgat ttattcaaca
10080aagccgccgt cccgtcaagt cagcgtaatg ctctgccagt gttacaacca attaaccaat
10140tctgattaga aaaactcatc gagcatcaaa tgaaactgca atttattcat atcaggatta
10200tcaataccat atttttgaaa aagccgtttc tgtaatgaag gagaaaactc accgaggcag
10260ttccatagga tggcaagatc ctggtatcgg tctgcgattc cgactcgtcc aacatcaata
10320caacctatta atttcccctc gtcaaaaata aggttatcaa gtgagaaatc accatgagtg
10380acgactgaat ccggtgagaa tggcaaaagc ttatgcattt ctttccagac ttgttcaaca
10440ggccagccat tacgctcgtc atcaaaatca ctcgcatcaa ccaaaccgtt attcattcgt
10500gattgcgcct gagcgagacg aaatacgcga tcgctgttaa aaggacaatt acaaacagga
10560atcgaatgca accggcgcag gaacactgcc agcgcatcaa caatattttc acctgaatca
10620ggatattctt ctaatacctg gaatgctgtt ttcccgggga tcgcagtggt gagtaaccat
10680gcatcatcag gagtacggat aaaatgcttg atggtcggaa gaggcataaa ttccgtcagc
10740cagtttagtc tgaccatctc atctgtaaca tcattggcaa cgctaccttt gccatgtttc
10800agaaacaact ctggcgcatc gggcttccca tacaatcgat agattgtcgc acctgattgc
10860ccgacattat cgcgagccca tttataccca tataaatcag catccatgtt ggaatttaat
10920cgcggcctcg agcaagacgt ttcccgttga atatggctca taacacccct tgtattactg
10980tttatgtaag cagacagttt tattgttcat gatgatatat ttttatcttg tgcaatgtaa
11040catcagagat tttgagacac aacgtggctt tccccccccc ccctgcaggt caattcggtc
11100gatatggcta ttacgaagaa ggctcgtgcg cggagtcccg tgaactttcc cacgcaacaa
11160gtgaaccgca ccgggtttgc cggaggccat ttcgttaaaa tgcgcagcca tggctgcttc
11220gtccagcatg gcgtaatact gatcctcgtc ttcggctggc ggtatattgc cgatgggctt
11280caaaagccgc cgtggttgaa ccagtctatc cattccaagg tagcgaactc gaccgcttcg
11340aagctcctcc atggtccacg ccgatgaatg acctcggcct tgtaaagacc gttgatcgct
11400tctgcgaggg cgttgtcgtg ctgtcgccga cgcttccgat agatggctcg atacctgctt
11460ctgccaaccg ctcggaatag cgaaaggaca cgtattgaac accgcgatcc gagtgatgca
11520ctaggccgcc atgagcggga cgccgatcat gatgagcctc ctcgagggca tcgaggacaa
11580agcctgcatg tgctgtccgg ctcgcccgcc atccgacaat gcgacgggcg aagacgtcga
11640tcacgaaggc cacgtagacg aagccctccc aagtggcgac ataagtacgg acatgcgcaa
11700aggctttccc ggtttgtcgc tgatggtgca agagacgctg aagcgcgatc cgatgcgcag
11760gcatctgttc gtcttccgcg gtcgtggcgg tggcctgatc aaggtcactc gccgaagagc
11820tgcatgattg gctcgaaacc gagcggggga aattgtcgcg cagttctccc gtcgccgagg
11880cgataaatta catgctcaag cgatgggatg gcattacgtc attcctcgat gacggcccga
11940tttgcctgac gaacaatgct gccgaacgaa cgctcagagg ctatgtactc ggcaggaagt
12000catggctgtt tgccggatcg gatcgttgtg ctgaacgtgc ggcgttcatg gcgacactga
12060tcatgagcgc caagctcaat aacatcgatc cgcaggcctg gcttgccgac gtccgcgccg
12120accttgcgga cgctccgatc agcaggcttg agcaacagct gccgtggaac tggacatcca
12180agacactgag tgctcaggcg gcctgacctg cggccttcac cggatactta ccccattatc
12240gcagattgcg atgaagcatc agcgtcattc agcaatcttg ccaaagtatg caggctcgcg
12300agaatcgacg tgcgaaaccg gctggttgcg ccaaagatcc gcttgcggag cggtcgaaca
12360ttcatgctgg gacttcaaga ggtcgagtag aggaagaacc ggaaaggttg caccggaaaa
12420tatgcgttcc tttggagagc gcctcatgga cgtgaacaaa tcgcccggac caaggatgcc
12480acggatacaa aagctcgcga agctcggtcc cgtgggtgtt ctgtcgtctc gttgtacaac
12540gaaatccatt cccattccgc gctcaagatg gcttcccctc ggcagttcat cagggctaaa
12600tcaatctagc cgacttgtcc ggtgaaatgg gctgcactcc aacagaaaca atcaaacaaa
12660catacacagc gacttattca cacgagctca aattacaacg gtatatatcc tgccagtcag
12720catcatcaca ccaaaagtta ggcccgaata gtttgaaatt agaaagctcg caattgaggt
12780ctacaggcca aattcgctct tagccgtaca atattactca ccggtgcgat gccccccatc
12840gtaggtgaag gtggaaatta atgatccatc ttgagaccac aggcccacaa cagctaccag
12900tttcctcaag ggtccaccaa aaacgtaagc gcttacgtac atggtcgata agaaaaggca
12960atttgtagat gttaacatcc aacgtcgctt tcagggatcg atccaatacg caaaccgcct
13020ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc cgactggaaa
13080gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc accccaggct
13140ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata acaatttcac
13200acaggaaaca gctatgacca tgattacgcc aagcttgcat gcctgcaggt cgactctaga
13260ggatctgg
132682820921DNAArtificial Sequencevector pKR1223 28cgcgccagat cctctagagt
cgacctgcag gcatgcaagc ttggcgtaat catggtcata 60gctgtttcct gtgtgaaatt
gttatccgct cacaattcca cacaacatac gagccggaag 120cataaagtgt aaagcctggg
gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg 180ctcactgccc gctttccagt
cgggaaacct gtcgtgccag ctgcattaat gaatcggcca 240acgcgcgggg agaggcggtt
tgcgtattgg atcgatccct gaaagcgacg ttggatgtta 300acatctacaa attgcctttt
cttatcgacc atgtacgtaa gcgcttacgt ttttggtgga 360cccttgagga aactggtagc
tgttgtgggc ctgtggtctc aagatggatc attaatttcc 420accttcacct acgatggggg
gcatcgcacc ggtgagtaat attgtacggc taagagcgaa 480tttggcctgt agacctcaat
tgcgagcttt ctaatttcaa actattcggg cctaactttt 540ggtgtgatga tgctgactgg
caggatatat accgttgtaa tttgagctcg tgtgaataag 600tcgctgtgta tgtttgtttg
attgtttctg ttggagtgca gcccatttca ccggacaagt 660cggctagatt gatttagccc
tgatgaactg ccgaggggaa gccatcttga gcgcggaatg 720ggaatggatt tcgttgtaca
acgagacgac agaacaccca cgggaccgag cttcgcgagc 780ttttgtatcc gtggcatcct
tggtccgggc gatttgttca cgtccatgag gcgctctcca 840aaggaacgca tattttccgg
tgcaaccttt ccggttcttc ctctactcga cctcttgaag 900tcccagcatg aatgttcgac
cgctccgcaa gcggatcttt ggcgcaacca gccggtttcg 960cacgtcgatt ctcgcgagcc
tgcatacttt ggcaagattg ctgaatgacg ctgatgcttc 1020atcgcaatct gcgataatgg
ggtaagtatc cggtgaaggc cgcaggtcag gccgcctgag 1080cactcagtgt cttggatgtc
cagttccacg gcagctgttg ctcaagcctg ctgatcggag 1140cgtccgcaag gtcggcgcgg
acgtcggcaa gccaggcctg cggatcgatg ttattgagct 1200tggcgctcat gatcagtgtc
gccatgaacg ccgcacgttc agcacaacga tccgatccgg 1260caaacagcca tgacttcctg
ccgagtacat agcctctgag cgttcgttcg gcagcattgt 1320tcgtcaggca aatcgggccg
tcatcgagga atgacgtaat gccatcccat cgcttgagca 1380tgtaatttat cgcctcggcg
acgggagaac tgcgcgacaa tttcccccgc tcggtttcga 1440gccaatcatg cagctcttcg
gcgagtgacc ttgatcaggc caccgccacg accgcggaag 1500acgaacagat gcctgcgcat
cggatcgcgc ttcagcgtct cttgcaccat cagcgacaaa 1560ccgggaaagc ctttgcgcat
gtccgtactt atgtcgccac ttgggagggc ttcgtctacg 1620tggccttcgt gatcgacgtc
ttcgcccgtc gcattgtcgg atggcgggcg agccggacag 1680cacatgcagg ctttgtcctc
gatgccctcg aggaggctca tcatgatcgg cgtcccgctc 1740atggcggcct agtgcatcac
tcggatcgcg gtgttcaata cgtgtccttt cgctattccg 1800agcggttggc agaagcaggt
atcgagccat ctatcggaag cgtcggcgac agcacgacaa 1860cgccctcgca gaagcgatca
acggtcttta caaggccgag gtcattcatc ggcgtggacc 1920atggaggagc ttcgaagcgg
tcgagttcgc taccttggaa tggatagact ggttcaacca 1980cggcggcttt tgaagcccat
cggcaatata ccgccagccg aagacgagga tcagtattac 2040gccatgctgg acgaagcagc
catggctgcg cattttaacg aaatggcctc cggcaaaccc 2100ggtgcggttc acttgttgcg
tgggaaagtt cacgggactc cgcgcacgag ccttcttcgt 2160aatagccata tcgaccgaat
tgacctgcag gggggggggg gaaagccacg ttgtgtctca 2220aaatctctga tgttacattg
cacaagataa aaatatatca tcatgaacaa taaaactgtc 2280tgcttacata aacagtaata
caaggggtgt tatgagccat attcaacggg aaacgtcttg 2340ctcgaggccg cgattaaatt
ccaacatgga tgctgattta tatgggtata aatgggctcg 2400cgataatgtc gggcaatcag
gtgcgacaat ctatcgattg tatgggaagc ccgatgcgcc 2460agagttgttt ctgaaacatg
gcaaaggtag cgttgccaat gatgttacag atgagatggt 2520cagactaaac tggctgacgg
aatttatgcc tcttccgacc atcaagcatt ttatccgtac 2580tcctgatgat gcatggttac
tcaccactgc gatccccggg aaaacagcat tccaggtatt 2640agaagaatat cctgattcag
gtgaaaatat tgttgatgcg ctggcagtgt tcctgcgccg 2700gttgcattcg attcctgttt
gtaattgtcc ttttaacagc gatcgcgtat ttcgtctcgc 2760tcaggcgcaa tcacgaatga
ataacggttt ggttgatgcg agtgattttg atgacgagcg 2820taatggctgg cctgttgaac
aagtctggaa agaaatgcat aagcttttgc cattctcacc 2880ggattcagtc gtcactcatg
gtgatttctc acttgataac cttatttttg acgaggggaa 2940attaataggt tgtattgatg
ttggacgagt cggaatcgca gaccgatacc aggatcttgc 3000catcctatgg aactgcctcg
gtgagttttc tccttcatta cagaaacggc tttttcaaaa 3060atatggtatt gataatcctg
atatgaataa attgcagttt catttgatgc tcgatgagtt 3120tttctaatca gaattggtta
attggttgta acactggcag agcattacgc tgacttgacg 3180ggacggcggc tttgttgaat
aaatcgaact tttgctgagt tgaaggatca gatcacgcat 3240cttcccgaca acgcagaccg
ttccgtggca aagcaaaagt tcaaaatcac caactggtcc 3300acctacaaca aagctctcat
caaccgtggc tccctcactt tctggctgga tgatggggcg 3360attcaggcct ggtatgagtc
agcaacacct tcttcacgag gcagacctca gcgccccccc 3420ccccctgcag gtcttttcca
atgatgagca cttttaaagt tctgctatgt ggcgcggtat 3480tatcccgtgt tgacgccggg
caagagcaac tcggtcgccg catacactat tctcagaatg 3540acttggttga gtactcacca
gtcacagaaa agcatcttac ggatggcatg acagtaagag 3600aattatgcag tgctgccata
accatgagtg ataacactgc ggccaactta cttctgacaa 3660cgatcggagg accgaaggag
ctaaccgctt ttttgcacaa catgggggat catgtaactc 3720gccttgatcg ttgggaaccg
gagctgaatg aagccatacc aaacgacgag cgtgacacca 3780cgatgcctgt agcaatggca
acaacgttgc gcaaactatt aactggcgaa ctacttactc 3840tagcttcccg gcaacaatta
atagactgga tggaggcgga taaagttgca ggaccacttc 3900tgcgctcggc ccttccggct
ggctggttta ttgctgataa atctggagcc ggtgagcgtg 3960ggtctcgcgg tatcattgca
gcactggggc cagatggtaa gccctcccgt atcgtagtta 4020tctacacgac ggggagtcag
gcaactatgg atgaacgaaa tagacagatc gctgagatag 4080gtgcctcact gattaagcat
tggtaactgt cagaccaagt ttactcatat atactttaga 4140ttgatttaaa acttcatttt
taatttaaaa ggatctaggt gaagatcctt tttgataatc 4200tcatgaccaa aatcccttaa
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 4260agatcaaagg atcttcttga
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 4320aaaaaccacc gctaccagcg
gtggtttgtt tgccggatca agagctacca actctttttc 4380cgaaggtaac tggcttcagc
agagcgcaga taccaaatac tgtccttcta gtgtagccgt 4440agttaggcca ccacttcaag
aactctgtag caccgcctac atacctcgct ctgctaatcc 4500tgttaccagt ggctgctgcc
agtggcgata agtcgtgtct taccgggttg gactcaagac 4560gatagttacc ggataaggcg
cagcggtcgg gctgaacggg gggttcgtgc acacagccca 4620gcttggagcg aacgacctac
accgaactga gatacctaca gcgtgagcta tgagaaagcg 4680ccacgcttcc cgaagggaga
aaggcggaca ggtatccggt aagcggcagg gtcggaacag 4740gagagcgcac gagggagctt
ccagggggaa acgcctggta tctttatagt cctgtcgggt 4800ttcgccacct ctgacttgag
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 4860ggaaaaacgc cagcaacgcg
gcctttttac ggttcctggc cttttgctgg ccttttgctc 4920acatgttctt tcctgcgtta
tcccctgatt ctgtggataa ccgtattacc gcctttgagt 4980gagctgatac cgctcgccgc
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 5040cggaagagcg cctgatgcgg
tattttctcc ttacgcatct gtgcggtatt tcacaccgca 5100tatggtgcac tctcagtaca
atctgctctg atgccgcata gttaagccag tatacactcc 5160gctatcgcta cgtgactggg
tcatggctgc gccccgacac ccgccaacac ccgctgacgc 5220gccctgacgg gcttgtctgc
tcccggcatc cgcttacaga caagctgtga ccgtctccgg 5280gagctgcatg tgtcagaggt
tttcaccgtc atcaccgaaa cgcgcgaggc agggtgcctt 5340gatgtgggcg ccggcggtcg
agtggcgacg gcgcggcttg tccgcgccct ggtagattgc 5400ctggccgtag gccagccatt
tttgagcggc cagcggccgc gataggccga cgcgaagcgg 5460cggggcgtag ggagcgcagc
gaccgaaggg taggcgcttt ttgcagctct tcggctgtgc 5520gctggccaga cagttatgca
caggccaggc gggttttaag agttttaata agttttaaag 5580agttttaggc ggaaaaatcg
ccttttttct cttttatatc agtcacttac atgtgtgacc 5640ggttcccaat gtacggcttt
gggttcccaa tgtacgggtt ccggttccca atgtacggct 5700ttgggttccc aatgtacgtg
ctatccacag gaaagagacc ttttcgacct ttttcccctg 5760ctagggcaat ttgccctagc
atctgctccg tacattagga accggcggat gcttcgccct 5820cgatcaggtt gcggtagcgc
atgactagga tcgggccagc ctgccccgcc tcctccttca 5880aatcgtactc cggcaggtca
tttgacccga tcagcttgcg cacggtgaaa cagaacttct 5940tgaactctcc ggcgctgcca
ctgcgttcgt agatcgtctt gaacaaccat ctggcttctg 6000ccttgcctgc ggcgcggcgt
gccaggcggt agagaaaacg gccgatgccg ggatcgatca 6060aaaagtaatc ggggtgaacc
gtcagcacgt ccgggttctt gccttctgtg atctcgcggt 6120acatccaatc agctagctcg
atctcgatgt actccggccg cccggtttcg ctctttacga 6180tcttgtagcg gctaatcaag
gcttcaccct cggataccgt caccaggcgg ccgttcttgg 6240ccttcttcgt acgctgcatg
gcaacgtgcg tggtgtttaa ccgaatgcag gtttctacca 6300ggtcgtcttt ctgctttccg
ccatcggctc gccggcagaa cttgagtacg tccgcaacgt 6360gtggacggaa cacgcggccg
ggcttgtctc ccttcccttc ccggtatcgg ttcatggatt 6420cggttagatg ggaaaccgcc
atcagtacca ggtcgtaatc ccacacactg gccatgccgg 6480ccggccctgc ggaaacctct
acgtgcccgt ctggaagctc gtagcggatc acctcgccag 6540ctcgtcggtc acgcttcgac
agacggaaaa cggccacgtc catgatgctg cgactatcgc 6600gggtgcccac gtcatagagc
atcggaacga aaaaatctgg ttgctcgtcg cccttgggcg 6660gcttcctaat cgacggcgca
ccggctgccg gcggttgccg ggattctttg cggattcgat 6720cagcggccgc ttgccacgat
tcaccggggc gtgcttctgc ctcgatgcgt tgccgctggg 6780cggcctgcgc ggccttcaac
ttctccacca ggtcatcacc cagcgccgcg ccgatttgta 6840ccgggccgga tggtttgcga
ccgctcacgc cgattcctcg ggcttggggg ttccagtgcc 6900attgcagggc cggcagacaa
cccagccgct tacgcctggc caaccgcccg ttcctccaca 6960catggggcat tccacggcgt
cggtgcctgg ttgttcttga ttttccatgc cgcctccttt 7020agccgctaaa attcatctac
tcatttattc atttgctcat ttactctggt agctgcgcga 7080tgtattcaga tagcagctcg
gtaatggtct tgccttggcg taccgcgtac atcttcagct 7140tggtgtgatc ctccgccggc
aactgaaagt tgacccgctt catggctggc gtgtctgcca 7200ggctggccaa cgttgcagcc
ttgctgctgc gtgcgctcgg acggccggca cttagcgtgt 7260ttgtgctttt gctcattttc
tctttacctc attaactcaa atgagttttg atttaatttc 7320agcggccagc gcctggacct
cgcgggcagc gtcgccctcg ggttctgatt caagaacggt 7380tgtgccggcg gcggcagtgc
ctgggtagct cacgcgctgc gtgatacggg actcaagaat 7440gggcagctcg tacccggcca
gcgcctcggc aacctcaccg ccgatgcgcg tgcctttgat 7500cgcccgcgac acgacaaagg
ccgcttgtag ccttccatcc gtgacctcaa tgcgctgctt 7560aaccagctcc accaggtcgg
cggtggccca tatgtcgtaa gggcttggct gcaccggaat 7620cagcacgaag tcggctgcct
tgatcgcgga cacagccaag tccgccgcct ggggcgctcc 7680gtcgatcact acgaagtcgc
gccggccgat ggccttcacg tcgcggtcaa tcgtcgggcg 7740gtcgatgccg acaacggtta
gcggttgatc ttcccgcacg gccgcccaat cgcgggcact 7800gccctgggga tcggaatcga
ctaacagaac atcggccccg gcgagttgca gggcgcgggc 7860tagatgggtt gcgatggtcg
tcttgcctga cccgcctttc tggttaagta cagcgataac 7920ttcatgcgtt cccttgcgta
tttgtttatt tactcatcgc atcatatacg cagcgaccgc 7980atgacgcaag ctgttttact
caaatacaca tcaccttttt agacggcggc gctcggtttc 8040ttcagcggcc aagctggccg
gccaggccgc cagcttggca tcagacaaac cggccaggat 8100ttcatgcagc cgcacggttg
agacgtgcgc gggcggctcg aacacgtacc cggccgcgat 8160catctccgcc tcgatctctt
cggtaatgaa aaacggttcg tcctggccgt cctggtgcgg 8220tttcatgctt gttcctcttg
gcgttcattc tcggcggccg ccagggcgtc ggcctcggtc 8280aatgcgtcct cacggaaggc
accgcgccgc ctggcctcgg tgggcgtcac ttcctcgctg 8340cgctcaagtg cgcggtacag
ggtcgagcga tgcacgccaa gcagtgcagc cgcctctttc 8400acggtgcggc cttcctggtc
gatcagctcg cgggcgtgcg cgatctgtgc cggggtgagg 8460gtagggcggg ggccaaactt
cacgcctcgg gccttggcgg cctcgcgccc gctccgggtg 8520cggtcgatga ttagggaacg
ctcgaactcg gcaatgccgg cgaacacggt caacaccatg 8580cggccggccg gcgtggtggt
gtcggcccac ggctctgcca ggctacgcag gcccgcgccg 8640gcctcctgga tgcgctcggc
aatgtccagt aggtcgcggg tgctgcgggc caggcggtct 8700agcctggtca ctgtcacaac
gtcgccaggg cgtaggtggt caagcatcct ggccagctcc 8760gggcggtcgc gcctggtgcc
ggtgatcttc tcggaaaaca gcttggtgca gccggccgcg 8820tgcagttcgg cccgttggtt
ggtcaagtcc tggtcgtcgg tgctgacgcg ggcatagccc 8880agcaggccag cggcggcgct
cttgttcatg gcgtaatgtc tccggttcta gtcgcaagta 8940ttctacttta tgcgactaaa
acacgcgaca agaaaacgcc aggaaaaggg cagggcggca 9000gcctgtcgcg taacttagga
cttgtgcgac atgtcgtttt cagaagacgg ctgcactgaa 9060cgtcagaagc cgactgcact
atagcagcgg aggggttgga ccacaggacg ggtgtggtcg 9120ccatgatcgc gtagtcgata
gtggctccaa gtagcgaagc gagcaggact gggcggcggc 9180caaagcggtc ggacagtgct
ccgagaacgg gtgcgcatag aaattgcatc aacgcatata 9240gcgctagcag cacgccatag
tgactggcga tgctgtcgga atggacgata tcccgcaaga 9300ggcccggcag taccggcata
accaagccta tgcctacagc atccagggtg acggtgccga 9360ggatgacgat gagcgcattg
ttagatttca tacacggtgc ctgactgcgt tagcaattta 9420actgtgataa actaccgcat
taaagctagc ttgcttggtc gttccgcgtg aacgtcggct 9480cgattgtacc tgcgttcaaa
tactttgcga tcgtgttgcg cgcctgcccg gtgcgtcggc 9540tgatctcacg gatcgactgc
ttctctcgca acgccatccg acggatgatg tttaaaagtc 9600ccatgtggat cactccgttg
ccccgtcgct caccgtgttg gggggaaggt gcacatggct 9660cagttctcaa tggaaattat
ctgcctaacc ggctcagttc tgcgtagaaa ccaacatgca 9720agctccaccg ggtgcaaagc
ggcagcggcg gcaggatata ttcaattgta aatggcttca 9780tgtccgggaa atctacatgg
atcagcaatg agtatgatgg tcaatatgga gaaaaagaaa 9840gagtaattac caattttttt
tcaattcaaa aatgtagatg tccgcagcgt tattataaaa 9900tgaaagtaca ttttgataaa
acgacaaatt acgatccgtc gtatttatag gcgaaagcaa 9960taaacaaatt attctaattc
ggaaatcttt atttcgacgt gtctacattc acgtccaaat 10020gggggcttag atgagaaact
tcacgatcga tgccttgatt tcgccattcc cagataccca 10080tttcatcttc agattggtct
gagattatgc gaaaatatac actcatatac ataaatactg 10140acagtttgag ctaccaattc
agtgtagccc attacctcac ataattcact caaatgctag 10200gcagtctgtc aactcggcgt
caatttgtcg gccactatac gatagttgcg caaattttca 10260aagtcctggc ctaacatcac
acctctgtcg gcggcgggtc ccatttgtga taaatccacc 10320atatcgaatt aattcagact
cctttgcccc agagatcaca atggacgact tcctctatct 10380ctacgatcta gtcaggaagt
tcgacggaga aggtgacgat accatgttca ccactgataa 10440tgagaagatt agccttttca
atttcagaaa gaatgctaac ccacagatgg ttagagaggc 10500ttacgcagca ggtctcatca
agacgatcta cccgagcaat aatctccagg agatcaaata 10560ccttcccaag aaggttaaag
atgcagtcaa aagattcagg actaactgca tcaagaacac 10620agagaaagat atatttctca
agatcagaag tactattcca gtatggacga ttcaaggctt 10680gcttcacaaa ccaaggcaag
taatagagat tggagtctct aaaaaggtag ttcccactga 10740atcaaaggcc atggagtcaa
agattcaaat agaggaccta acagaactcg ccgtaaagac 10800tggcgaacag ttcatacaga
gtctcttacg actcaatgac aagaagaaaa tcttcgtcaa 10860catggtggag cacgacacgc
ttgtctactc caaaaatatc aaagatacag tctcagaaga 10920ccaaagggca attgagactt
ttcaacaaag ggtaatatcc ggaaacctcc tcggattcca 10980ttgcccagct atctgtcact
ttattgtgaa gatagtggaa aaggaaggtg gctcctacaa 11040atgccatcat tgcgataaag
gaaaggccat cgttgaagat gcctctgccg acagtggtcc 11100caaagatgga cccccaccca
cgaggagcat cgtggaaaaa gaagacgttc caaccacgtc 11160ttcaaagcaa gtggattgat
gtgatatctc cactgacgta agggatgacg cacaatccca 11220ctatccttcg caagaccctt
cctctatata aggaagttca tttcatttgg agaggacacg 11280ctgaaatcac cagtctccaa
gcttgcgggg atcgtttcgc atgattgaac aagatggatt 11340gcacgcaggt tctccggccg
cttgggtgga gaggctattc ggctatgact gggcacaaca 11400gacaatcggc tgctctgatg
ccgccgtgtt ccggctgtca gcgcaggggc gcccggttct 11460ttttgtcaag accgacctgt
ccggtgccct gaatgaactg caggacgagg cagcgcggct 11520atcgtggctg gccacgacgg
gcgttccttg cgcagctgtg ctcgacgttg tcactgaagc 11580gggaagggac tggctgctat
tgggcgaagt gccggggcag gatctcctgt catctcacct 11640tgctcctgcc gagaaagtat
ccatcatggc tgatgcaatg cggcggctgc atacgcttga 11700tccggctacc tgcccattcg
accaccaagc gaaacatcgc atcgagcgag cacgtactcg 11760gatggaagcc ggtcttgtcg
atcaggatga tctggacgaa gagcatcagg ggctcgcgcc 11820agccgaactg ttcgccaggc
tcaaggcgcg catgcccgac ggcgaggatc tcgtcgtgac 11880ccatggcgat gcctgcttgc
cgaatatcat ggtggaaaat ggccgctttt ctggattcat 11940cgactgtggc cggctgggtg
tggcggaccg ctatcaggac atagcgttgg ctacccgtga 12000tattgctgaa gagcttggcg
gcgaatgggc tgaccgcttc ctcgtgcttt acggtatcgc 12060cgctcccgat tcgcagcgca
tcgccttcta tcgccttctt gacgagttct tctgagcggg 12120actctggggt tcgaaatgac
cgaccaagcg acgcccaacc tgccatcacg agatttcgat 12180tccaccgccg ccttctatga
aaggttgggc ttcggaatcg ttttccggga cgccggctgg 12240atgatcctcc agcgcgggga
tctcatgctg gagttcttcg cccaccccgg atcgatccaa 12300cacttacgtt tgcaacgtcc
aagagcaaat agaccacgaa cgccggaagg ttgccgcagc 12360gtgtggattg cgtctcaatt
ctctcttgca ggaatgcaat gatgaatatg atactgacta 12420tgaaactttg agggaatact
gcctagcacc gtcacctcat aacgtgcatc atgcatgccc 12480tgacaacatg gaacatcgct
atttttctga agaattatgc tcgttggagg atgtcgcggc 12540aattgcagct attgccaaca
tcgaactacc cctcacgcat gcattcatca atattattca 12600tgcggggaaa ggcaagatta
atccaactgg caaatcatcc agcgtgattg gtaacttcag 12660ttccagcgac ttgattcgtt
ttggtgctac ccacgttttc aataaggacg agatggtgga 12720gtaaagaagg agtgcgtcga
agcagatcgt tcaaacattt ggcaataaag tttcttaaga 12780ttgaatcctg ttgccggtct
tgcgatgatt atcatataat ttctgttgaa ttacgttaag 12840catgtaataa ttaacatgta
atgcatgacg ttatttatga gatgggtttt tatgattaga 12900gtcccgcaat tatacattta
atacgcgata gaaaacaaaa tatagcgcgc aaactaggat 12960aaattatcgc gcgcggtgtc
atctatgtta ctagatcgat caaacttcgg tactgtgtaa 13020tgacgatgag caatcgagag
gctgactaac aaaaggtaca tcgcgatgga tcgatccatt 13080cgccattcag gctgcgcaac
tgttgggaag ggcgatcggt gcgggcctct tcgctattac 13140gccagctggc gaaaggggga
tgtgctgcaa ggcgattaag ttgggtaacg ccagggtttt 13200cccagtcacg acgttgtaaa
acgacggcca gtgaattcct gcagcccggg ggatccgccc 13260actcgaggcg cgccgtcgac
ggatataatg agccgtaaac aaagatgatt aagtagtaat 13320taatacgtac tagtaaaagt
ggcaaaagat aacgagaaag aaccaatttc tttgcattcg 13380gccttagcgg aaggcatata
taagctttga ttattttatt tagtgtaatg atttcgtaca 13440accaaagcat ttatttagta
ctctcacact tgtgtcgcgg ccggccgcta caggaacagg 13500tggtggcggc cctcggcgcg
ctcgtactgc tccacgatgg tgtagtcctc gttgtgggag 13560gtgatgtcca gcttggagtc
cacgtagtag tagccgggca gctgcacggg cttcttggcc 13620atgtagatgg acttgaactc
caccaggtag tggccgccgt ccttcagctt cagggccttg 13680tggatctcgc ccttcagcac
gccgtcgcgg gggtacaggc gctcggtgga ggcctcccag 13740cccatagtct tcttctgcat
tacggggccg tcggagggga agttcacgcc gatgaacttc 13800accttgtaga tgaaggagcc
gtcctgcagg gaggagtcct gggtcacggt caccacgccg 13860ccgtcctcga agttcatcac
gcgctcccac ttgaagccct cggggaagga cagcttcttg 13920tagtcgggga tgtcggcggg
gtgcttcacg tacaccttgg agccgtactg gaactggggg 13980gacaggatgt cccaggcgaa
gggcaggggg ccgcccttgg tcaccttcag cttggcggtc 14040tgggtgccct cgtaggggcg
gccctcgccc tcgccctcga tctcgaactc gtggccgttc 14100acggagccct ccatgcgcac
cttgaagcgc atgaactcct tgatgacgtc ctcggaggag 14160gccatgggcc gcttgggggg
ctatggaaga ctttcttagt tagttgtgtg aataagcaat 14220gttgggagaa tcgggactac
ttataggata ggaataaaac agaaaagtat taagtgctaa 14280tgaaatattt agactgataa
ttaaaatctt cacgtatgtc cacttgatat aaaaacgtca 14340ggaataaagg aagtacagta
gaatttaaag gtactctttt tatatatacc cgtgttctct 14400ttttggctag ctagttgcat
aaaaaataat ctatattttt atcattattt taaatatctt 14460atgagatggt aaatatttat
cataattttt tttactatta tttattattt gtgtgtgtaa 14520tacatataga agttaattac
aaattttatt tactttttca ttattttgat atgattcacc 14580attaatttag tgttattatt
tataatagtt cattttaatc tttttgtata tattatgcgt 14640gcagtacttt tttcctacat
ataactacta ttacatttta tttatataat atttttatta 14700atgaattttc gtgataatat
gtaatattgt tcattattat ttcagatttt ttaaaaatat 14760ttgtgttatt atttatgaaa
tatgtaattt ttttagtatt tgattttatg atgataaagt 14820gttctaaatt caaaagaagg
gggaaagcgt aaacattaaa aaacgtcatc aaacaaaaac 14880aaaatcttgt taataaagat
aaaactgttt gttttgatca ctgttatttc gtaatataaa 14940aacattattt atatttatat
tgttgacaac caaatttgcc tatcaaatct aaccaatata 15000atgcatgcgt ggcaggtaat
gtactaccat gaacttaagt catgacataa taaaccgtga 15060atctgaccaa tgcatgtacc
tanctaaatt gtatttgtga cacgaagcaa atgattcaat 15120tcacaatgga gatgggaaac
aaataatgaa gaacccagaa ctaagaaagc ttttctgaaa 15180aataaaataa aggcaatgtc
aaaagtatac tgcatcatca gtccagaaag cacatgatat 15240ttttttatca gtatcaatgc
agctagtttt attttacaat atcgatatag ctagtttaaa 15300tatattgcag ctagatttat
aaatatttgt gttattattt atcatttgtg taatcctgtt 15360tttagtattt tagtttatat
atgatgataa tgtattccaa atttaaaaga agggaaataa 15420atttaaacaa gaaaaaaagt
catcaaacaa aaaacaaatg aaagggtgga aagatgttac 15480catgtaatgt gaatgttaca
gtatttcttt tattatagag ttaacaaatt aactaatatg 15540attttgttaa taatgataaa
atattttttt tattattatt tcataatata aaaatagttt 15600acttaatata aaaaaaattc
tatcgttcac aacaaagttg gccacctaat ttaaccatgc 15660atgtacccat ggaccatatt
aggtaaccat caaacctgat gaagagataa agagatgaag 15720acttaagtca taacacaaaa
ccataaaaaa caaaaataca atcaaccgtc aatctgacca 15780atgcatgaaa aagctgcaat
agtgagtggc gacacaaagc acatgatttt cttacaacgg 15840agataaaacc aaaaaaatat
ttcatgaaca acctagaaca aataaagctt ttatataata 15900aatatataaa taaataaagg
ctatggaata atatacttca atatatttgg attaaataaa 15960ttgttggcgg ggttgatata
tttatacaca cctaaagtca cttcaatctc attttcactt 16020aacttttatt ttttttttct
ttttatttat cataaagaga atattgataa tatacttttt 16080aacatatttt tatgacattt
tttattggtg aaaacttatt aaaaatcata aattttgtaa 16140gttagattta tttaaagagt
tcctcttctt attttaaatt ttttaataaa tttttaaata 16200actaaaattt gtgttaaaaa
tgttaaaaaa gtgtgttatt aacccttctc ttcgaggatc 16260cgtaccgagc tcggatccac
tagtaacggc cgccagtgtg ctggaattca ggtcctgcag 16320gtctactctt tacatgttct
ttactccgtc tcaaaatttc ctttttttgt tggctctctc 16380cgaacgagtt ggagaaatcg
ttaaccctaa tcgaagatct agattcctct acatacgttt 16440gatctctctc tcagtatgga
ttacaaagcg ccaaggagat actactcaca cggagttgtt 16500gcgagacagc aagatttcgc
aacagatata gttacgagaa gaagacctta tgtcccttac 16560gaccgtccaa ataagttttc
aaggagtctg gtttggacgt caaaagagta caaatcaccc 16620gagggcaata atatgccaag
gaccaatgat gtgtcaccga aaccaccagt tttaggtttg 16680gcgaggaaga atgctgcttg
tgggccaatg agatcttcta gtctcagaaa atgggtatgt 16740aagtattgga aagatggaaa
gtgcaagagg ggtgagcagt gccagttctt acactcttgg 16800tcttgtttcc ctggattggc
catggtagct tctcttgaag ggcacaataa ggaactaaag 16860gggatcgctc tccctgaggg
ttcagataaa ctcttttcag tcagtattga tggtacattg 16920cgagtttggg actgcaattc
tggtcagtgt gtacattcca tcaaccttga cgcagaagca 16980gggtctctaa tcagtgaagg
cccttgggtt ttccttggct tgccaaacgc tataaaggct 17040tttaacgttc aaaccagtca
agatttgcat cttcaagcag caggggtggt tggtcaggtg 17100aatgcaatga ctattgcaaa
cggaatgctt tttgctggaa caagttctgg tagtatctta 17160gtctggaaag ctactacaga
ctctgagtct gatccattca aatacttgac atctcttgag 17220ggacatagtg gtgaagtcac
ttgttttgct gttggaggtc aaatgctata ctctggttct 17280gtcgataaaa caatcaagat
gtgggatctc aacaccctgc aatgtataat gaccctgaag 17340caacataccg gcactgtcac
ttcactctta tgttgggata aatgtttgat atcgtcttcc 17400ttggatggga ccataaaagt
ttgggcttat tctgaaaacg gaatcttgaa agttgttcaa 17460actcgcagac aagaacagag
tagtgttcat gctctttctg gtatgcatga tgcagaagcc 17520aaaccgataa tattctgctc
ttaccaaaac ggaaccgttg gcattttcga cctaccatct 17580tttcaagaaa gaggaaggat
gttctctacg cacacgatcg ccacactcac aattggtcct 17640caaggattgt tattcagtgg
agacgagagt ggtaacttgc gtgtatggac cttagctgct 17700ggcaacaaag tttagtcttt
tcgactaaag aattctgatt taattttgtg gtttatatgt 17760tgagttaact gttaagagag
ttttattttg taataggtgt atcagtcaat aaacaatctt 17820tgtatcaacc aaatgtaatt
tttctcgtta attcgatttc agagttttta ctttaagata 17880aacaaactct ttcacacatc
atttaatgaa agtggagaag cttaaaaaac aaacaaagaa 17940actgatccat ttttggcggg
tcttcttcta ctcttattca tatgtgttaa cgaactatag 18000cgtaaaattc agagcaagcg
atctccgatt tgaacgtggc tatcaccgga ggcccaccac 18060tacgggcgat acgctctaag
tgaggattaa agtgctctgg tggtgacgtt gaagaaactc 18120gcccatggtt tttgttatct
ctgcagccaa gtgtcgttct ttcttcgcca cttctcatca 18180agctacagtg aatttaaaaa
tggcgtcttt ctttgatctc gtatacataa gctggattgg 18240tttcttaaac aaattcctct
ccttttgggt cttctgggtt tgccttgtaa gtgtttgtgt 18300ttttgcctct gagaaaaaat
cgcggccgca tggagagatc tcaacggcag tctcctccgc 18360caccgtcgcc gtcctcctcc
tcgtcctccg tctccgcgga caccgtcctc gtccctcccg 18420gaaagaggcg gagggcggcg
acggccaagg ccggcgccga gcctaataag aggatccgca 18480aggaccccgc cgccgccgcc
gcggggaaga ggagctccgt ctacagggga gtcaccaggc 18540acaggtggac gggcaggttc
gaggcgcatc tctgggacaa gcactgcctc gccgcgctcc 18600acaacaagaa gaaaggcagg
caagtctacc tgggggcgta tgacagcgag gaggcagctg 18660ctcgtgccta tgacctcgca
gctctcaagt actggggtcc tgagactctg ctcaacttcc 18720ctgtggagga ttactccagc
gagatgccgg agatggaggc cgtgtcccgg gaggagtacc 18780tggcctccct ccgccgcagg
agcagcggct tctccagggg cgtctccaag tacagaggcg 18840tcgccaggca tcaccacaac
gggaggtggg aggcacggat tgggcgagtc tttgggaaca 18900agtacctcta cttgggaaca
tttgacactc aagaagaggc agccaaggcc tatgaccttg 18960cggccattga ataccgtggc
gtcaatgctg taaccaactt cgacatcagc tgctacctgg 19020accacccgct gttcctggca
cagctccaac aggagccaca ggtggtgccg gcactcaacc 19080aagaacctca acctgatcag
agcgaaaccg gaactacaga gcaagagccg gagtcaagcg 19140aagccaagac accggatggc
agtgcagaac ccgatgagaa cgcggtgcct gacgacaccg 19200cggagcccct caccacagtc
gacgacagca tcgaagaggg cttgtggagc ccttgcatgg 19260attacgagct agacaccatg
tcgagaccaa actttggcag ctcaatcaat ctgagcgagt 19320ggttcgctga cgcagacttc
gactgcaaca tcggatgcct gttcgatggg tgttctgcgg 19380ctgacgaagg aagcaaggat
ggtgtaggtc tggcagattt cagtctgttt gaggcaggtg 19440atgtccagct gaaggatgtt
ctttcggata tggaagaggg gatacaacct ccagcgatga 19500tcagtgtgtg caacgcggcc
gcaagtatga actaaaatgc atgtaggtgt aagagctcat 19560ggagagcatg gaatattgta
tccgaccatg taacagtata ataactgagc tccatctcac 19620ttcttctatg aataaacaaa
ggatgttatg atatattaac actctatcta tgcaccttat 19680tgttctatga taaatttcct
cttattatta taaatcatct gaatcgtgac ggcttatgga 19740atgcttcaaa tagtacaaaa
acaaatgtgt actataagac tttctaaaca attctaacct 19800tagcattgtg aacgagacat
aagtgttaag aagacataac aattataatg gaagaagttt 19860gtctccattt atatattata
tattacccac ttatgtatta tattaggatg ttaaggagac 19920ataacaatta taaagagaga
agtttgtatc catttatata ttatatacta cccatttata 19980tattatactt atccacttat
ttaatgtctt tataaggttt gatccatgat atttctaata 20040ttttagttga tatgtatatg
aaagggtact atttgaactc tcttactctg tataaaggtt 20100ggatcatcct taaagtgggt
ctatttaatt ttattgcttc ttacagataa aaaaaaaatt 20160atgagttggt ttgataaaat
attgaaggat ttaaaataat aataaataac atataatata 20220tgtatataaa tttattataa
tataacattt atctataaaa aagtaaatat tgtcataaat 20280ctatacaatc gtttagcctt
gctggacgaa tctcaattat ttaaacgaga gtaaacatat 20340ttgacttttt ggttatttaa
caaattatta tttaacacta tatgaaattt ttttttttat 20400cagcaaagaa taaaattaaa
ttaagaagga caatggtgtc ccaatcctta tacaaccaac 20460ttccacaaga aagtcaagtc
agagacaaca aaaaaacaag caaaggaaat tttttaattt 20520gagttgtctt gtttgctgca
taatttatgc agtaaaacac tacacataac ccttttagca 20580gtagagcaat ggttgaccgt
gtgcttagct tcttttattt tattttttta tcagcaaaga 20640ataaataaaa taaaatgaga
cacttcaggg atgtttcaac aagctctaga gggcccaatt 20700cgccctatag tgagtcgtat
tacaattcac tggccgtcgt tttacaacgt cgtgactggg 20760aaaaccctgg cgttacccaa
cttaatcgcc ttgcagcaca tccccctttc gccagctggc 20820gtaatagcga agaggcccgc
accgatcgcc cttcccaaca gttgcgcagc ctatacgtac 20880gagatccggc cggccagatc
ctgcaggaga tccaagcttg g 20921294906DNAArtificial
Sequencevector pKR268 29ggccgcatga gccgtaaagg ttcaatacaa cgagtgcttg
ttttcttagg gacaagcatt 60gtacttatgt atgattctgt gtaaccatga gtcttccacg
ttgtactaat gtgaagggca 120aaaataaaac acagaacaag ttcgtttttc tcaaataatg
tgaaggtaga aaatggaacc 180atgcctcctc tcttgcatgt gatttaaaat attagcagat
ggtaccgtac gtgggcggat 240cccccgggct gcaggaattc actggccgtc gttttacaac
gtcgtgactg ggaaaaccct 300ggcgttaccc aacttaatcg ccttgcagca catccccctt
tcgccagctg gcgtaatagc 360gaagaggccc gcaccgatcg cccttcccaa cagttgcgca
gcctgaatgg cgaatggcgc 420ctgatgcggt attttctcct tacgcatctg tgcggtattt
cacaccgcat atggtgcact 480ctcagtacaa tctgctctga tgccgcatag ttaagccagc
cccgacaccc gccaacaccc 540gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg
cttacagaca agctgtgacc 600gtctccggga gctgcatgtg tcagaggttt tcaccgtcat
caccgaaacg cgcgagacga 660aagggcctcg tgatacgcct atttttatag gttaatgtca
tgataataat ggtttcttag 720acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc
ctatttgttt atttttctaa 780atacattcaa atatgtatcc gctcatgaga caataaccct
gataaatgct tcaataatat 840tgaaaaagga agagtatgag tattcaacat ttccgtgtcg
cccttattcc cttttttgcg 900gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg
tgaaagtaaa agatgctgaa 960gatcagttgg gtgcacgagt gggttacatc gaactggatc
tcaacagcgg taagatcctt 1020gagagttttc gccccgaaga acgttttcca atgatgagca
cttttaaagt tctgctatgt 1080ggcgcggtat tatcccgtat tgacgccggg caagagcaac
tcggtcgccg catacactat 1140tctcagaatg acttggttga gtactcacca gtcacagaaa
agcatcttac ggatggcatg 1200acagtaagag aattatgcag tgctgccata accatgagtg
ataacactgc ggccaactta 1260cttctgacaa cgatcggagg accgaaggag ctaaccgctt
ttttgcacaa catgggggat 1320catgtaactc gccttgatcg ttgggaaccg gagctgaatg
aagccatacc aaacgacgag 1380cgtgacacca cgatgcctgt agcaatggca acaacgttgc
gcaaactatt aactggcgaa 1440ctacttactc tagcttcccg gcaacaatta atagactgga
tggaggcgga taaagttgca 1500ggaccacttc tgcgctcggc ccttccggct ggctggttta
ttgctgataa atctggagcc 1560ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc
cagatggtaa gccctcccgt 1620atcgtagtta tctacacgac ggggagtcag gcaactatgg
atgaacgaaa tagacagatc 1680gctgagatag gtgcctcact gattaagcat tggtaactgt
cagaccaagt ttactcatat 1740atactttaga ttgatttaaa acttcatttt taatttaaaa
ggatctaggt gaagatcctt 1800tttgataatc tcatgaccaa aatcccttaa cgtgagtttt
cgttccactg agcgtcagac 1860cccgtagaaa agatcaaagg atcttcttga gatccttttt
ttctgcgcgt aatctgctgc 1920ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt
tgccggatca agagctacca 1980actctttttc cgaaggtaac tggcttcagc agagcgcaga
taccaaatac tgtccttcta 2040gtgtagccgt agttaggcca ccacttcaag aactctgtag
caccgcctac atacctcgct 2100ctgctaatcc tgttaccagt ggctgctgcc agtggcgata
agtcgtgtct taccgggttg 2160gactcaagac gatagttacc ggataaggcg cagcggtcgg
gctgaacggg gggttcgtgc 2220acacagccca gcttggagcg aacgacctac accgaactga
gatacctaca gcgtgagcta 2280tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca
ggtatccggt aagcggcagg 2340gtcggaacag gagagcgcac gagggagctt ccagggggaa
acgcctggta tctttatagt 2400cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt
tgtgatgctc gtcagggggg 2460cggagcctat ggaaaaacgc cagcaacgcg gcctttttac
ggttcctggc cttttgctgg 2520ccttttgctc acatgttctt tcctgcgtta tcccctgatt
ctgtggataa ccgtattacc 2580gcctttgagt gagctgatac cgctcgccgc agccgaacga
ccgagcgcag cgagtcagtg 2640agcgaggaag cggaagagcg cccaatacgc aaaccgcctc
tccccgcgcg ttggccgatt 2700cattaatgca gctggcacga caggtttccc gactggaaag
cgggcagtga gcgcaacgca 2760attaatgtga gttagctcac tcattaggca ccccaggctt
tacactttat gcttccggct 2820cgtatgttgt gtggaattgt gagcggataa caatttcaca
caggaaacag ctatgaccat 2880gattacgcca agcttgcatg cctgcaggtc gactcgacgt
acgatcccac atgcaagttt 2940ttatttcaat cccttttcct ttgaataact gaccaagaac
aacaagaaaa aaaaaaaaaa 3000agaaaaggat cattttgaaa ggatattttt cgctcctatt
caaatactgt atttttacca 3060aaaaaactgt atttttccta cactctcaag ctttgttttt
cgcttcgact ctcatgattt 3120ccttcatatg ccaatcactc tatttataaa tggcataagg
tagtgtgaac aattgcaaag 3180cttgtcatca aaagcttgca atgtacaaat taatgttttt
catgcctttc aaaattatct 3240gcacccccta gctattaatc taacatctaa gtaaggctag
tgaatttttt cgaatagtca 3300tgcagtgcat taatttcccc gtgactattt tggctttgac
tccaacactg gccccgtaca 3360tccgtccctc attacatgaa aagaaatatt gtttatattc
ttaattaaaa atattgtccc 3420ttctaaattt tcatatagtt aattattata ttactttttt
ctctattcta ttagttctat 3480tttcaaatta ttatttatgc atatgtaaag tacattatat
ttttgctata tacttaaata 3540tttctaaatt attaaaaaaa gactgatatg aaaaatttat
tctttttaaa gctatatcat 3600tttatatata ctttttcttt tcttttcttt cattttctat
tcaatttaat aagaaataaa 3660ttttgtaaat ttttatttat caatttataa aaatatttta
ctttatatgt tttttcacat 3720ttttgttaaa caaatcatat cattatgatt gaaagagagg
aaattgacag tgagtaataa 3780gtgatgagaa aaaaatgtgt tatttcctaa aaaaaaccta
aacaaacatg tatctactct 3840ctatttcatc tatctctcat ttcatttttc tctttatctc
tttctttatt tttttatcat 3900atcatttcac attaattatt tttactctct ttattttttc
tctctatccc tctcttattt 3960ccactcatat atacactcca aaattggggc atgcctttat
cactactcta tctcctccac 4020taaatcattt aaatgaaact gaaaagcatt ggcaagtctc
ctcccctcct caagtgattt 4080ccaactcagc attggcatct aattgattca gtatatctat
tgcatgtgta aaagtctttc 4140cacaatacat aactattaat taatcttaaa taaataaagg
ataaaatatt tttttttctt 4200cataaaatta aaatatgtta ttttttgttt agatgtatat
tcgaataaat ctaaatatat 4260gataatgatt ttttatattg attaaacata taatcaatat
taaatatgat atttttttat 4320ataggttgta cacataattt tataaggata aaaaatatga
taaaaataaa ttttaaatat 4380ttttatattt acgagaaaaa aaaatatttt agccataaat
aaatgaccag catattttac 4440aaccttagta attcataaat tcctatatgt atatttgaaa
ttaaaaacag ataatcgtta 4500agggaaggaa tcctacgtca tctcttgcca tttgtttttc
atgcaaacag aaagggacga 4560aaaaccacct caccatgaat cactcttcac accattttta
ctagcaaaca agtctcaaca 4620actgaagcca gctctctttc cgtttctttt tacaacactt
tctttgaaat agtagtattt 4680ttttttcaca tgatttatta acgtgccaaa agatgcttat
tgaatagagt gcacatttgt 4740aatgtactac taattagaac atgaaaaagc attgttctaa
cacgataatc ctgtgaaggc 4800gttaactcca aagatccaat ttcactatat aaattgtgac
gaaagcaaaa tgaattcaca 4860tagctgagag agaaaggaaa ggttaactaa gaagcaatac
ttcagc 49063010528DNAArtificial Sequencevector pKR1143
30gtacgagatc cggccggcca gatcctgcag gagatccaag cttggcgcgc cgttctatag
60tgtcacctaa atcgtatgtg tatgatacat aaggttatgt attaattgta gccgcgttct
120aacgacaata tgtccatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta
180agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg
240gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca
300ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt
360aatgtcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta
420gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa
480acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt
540tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag
600ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta
660atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca
720agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag
780cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gcattgagaa
840agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga
900acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc
960gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc
1020ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt
1080gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt
1140gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag
1200gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa
1260tgcaggttga tcgattcgac atcgatctag taacatagat gacaccgcgc gcgataattt
1320atcctagttt gcgcgctata ttttgttttc tatcgcgtat taaatgtata attgcgggac
1380tctaatcata aaaacccatc tcataaataa cgtcatgcat tacatgttaa ttattacatg
1440cttaacgtaa ttcaacagaa attatatgat aatcatcgca agaccggcaa caggattcaa
1500tcttaagaaa ctttattgcc aaatgtttga acgatctgct tcgacgcact ccttctttag
1560gtacctcact attcctttgc cctcggacga gtgctggggc gtcggtttcc actatcggcg
1620agtacttcta cacagccatc ggtccagacg gccgcgcttc tgcgggcgat ttgtgtacgc
1680ccgacagtcc cggctccgga tcggacgatt gcgtcgcatc gaccctgcgc ccaagctgca
1740tcatcgaaat tgccgtcaac caagctctga tagagttggt caagaccaat gcggagcata
1800tacgcccgga gccgcggcga tcctgcaagc tccggatgcc tccgctcgaa gtagcgcgtc
1860tgctgctcca tacaagccaa ccacggcctc cagaagaaga tgttggcgac ctcgtattgg
1920gaatccccga acatcgcctc gctccagtca atgaccgctg ttatgcggcc attgtccgtc
1980aggacattgt tggagccgaa atccgcgtgc acgaggtgcc ggacttcggg gcagtcctcg
2040gcccaaagca tcagctcatc gagagcctgc gcgacggacg cactgacggt gtcgtccatc
2100acagtttgcc agtgatacac atggggatca gcaatcgcgc atatgaaatc acgccatgta
2160gtgtattgac cgattccttg cggtccgaat gggccgaacc cgctcgtctg gctaagatcg
2220gccgcagcga tcgcatccat ggcctccgcg accggctgca gaacagcggg cagttcggtt
2280tcaggcaggt cttgcaacgt gacaccctgt gcacggcggg agatgcaata ggtcaggctc
2340tcgctgaatt ccccaatgtc aagcacttcc ggaatcggga gcgcggccga tgcaaagtgc
2400cgataaacat aacgatcttt gtagaaacca tcggcgcagc tatttacccg caggacatat
2460ccacgccctc ctacatcgaa gctgaaagca cgagattctt cgccctccga gagctgcatc
2520aggtcggaga cgctgtcgaa cttttcgatc agaaacttct cgacagacgt cgcggtgagt
2580tcaggctttt tcatggttta ataagaagag aaaagagttc ttttgttatg gctgaagtaa
2640tagagaaatg agctcgagcg tgtcctctcc aaatgaaatg aacttcctta tatagaggaa
2700gggtcttgcg aaggatagtg ggattgtgcg tcatccctta cgtcagtgga gatgtcacat
2760caatccactt gctttgaaga cgtggttgga acgtcttctt tttccacgat gctcctcgtg
2820ggtgggggtc catctttggg accactgtcg gcagaggcat cttgaatgat agcctttcct
2880ttatcgcaat gatggcattt gtaggagcca ccttcctttt ctactgtcct ttcgatgaag
2940tgacagatag ctgggcaatg gaatccgagg aggtttcccg aaattatcct ttgttgaaaa
3000gtctcaatag ccctttggtc ttctgagact gtatctttga catttttgga gtagaccaga
3060gtgtcgtgct ccaccatgtt gacgaagatt ttcttcttgt cattgagtcg taaaagactc
3120tgtatgaact gttcgccagt cttcacggcg agttctgtta gatcctcgat ttgaatctta
3180gactccatgc atggccttag attcagtagg aactaccttt ttagagactc caatctctat
3240tacttgcctt ggtttatgaa gcaagccttg aatcgtccat actggaatag tacttctgat
3300cttgagaaat atgtctttct ctgtgttctt gatgcaatta gtcctgaatc ttttgactgc
3360atctttaacc ttcttgggaa ggtatttgat ctcctggaga ttgttactcg ggtagatcgt
3420cttgatgaga cctgctgcgt aggcctctct aaccatctgt gggtcagcat tctttctgaa
3480attgaagagg ctaaccttct cattatcagt ggtgaacata gtgtcgtcac cttcaccttc
3540gaacttcctt cctagatcgt aaagatagag gaaatcgtcc attgtaatct ccggggcaaa
3600ggagatctct tttggggctg gatcactgct gggccttttg gttcctagcg tgagccagtg
3660ggctttttgc tttggtgggc ttgttagggc cttagcaaag ctcttgggct tgagttgagc
3720ttctcctttg gggatgaagt tcaacctgtc tgtttgctga cttgttgtgt acgcgtcagc
3780tgctgctctt gcctctgtaa tagtggcaaa tttcttgtgt gcaactccgg gaacgccgtt
3840tgttgccgcc tttgtacaac cccagtcatc gtatataccg gcatgtggac cgttatacac
3900aacgtagtag ttgatatgag ggtgttgaat acccgattct gctctgagag gagcaactgt
3960gctgttaagc tcagattttt gtgggattgg aattggatcg atctcgatcc cgcgaaatta
4020atacgactca ctatagggag accacaacgg tttccctcta gaaataattt tgtttaactt
4080taagaaggag atatacccat ggaaaagcct gaactcaccg cgacgtctgt cgagaagttt
4140ctgatcgaaa agttcgacag cgtctccgac ctgatgcagc tctcggaggg cgaagaatct
4200cgtgctttca gcttcgatgt aggagggcgt ggatatgtcc tgcgggtaaa tagctgcgcc
4260gatggtttct acaaagatcg ttatgtttat cggcactttg catcggccgc gctcccgatt
4320ccggaagtgc ttgacattgg ggaattcagc gagagcctga cctattgcat ctcccgccgt
4380gcacagggtg tcacgttgca agacctgcct gaaaccgaac tgcccgctgt tctgcagccg
4440gtcgcggagg ctatggatgc gatcgctgcg gccgatctta gccagacgag cgggttcggc
4500ccattcggac cgcaaggaat cggtcaatac actacatggc gtgatttcat atgcgcgatt
4560gctgatcccc atgtgtatca ctggcaaact gtgatggacg acaccgtcag tgcgtccgtc
4620gcgcaggctc tcgatgagct gatgctttgg gccgaggact gccccgaagt ccggcacctc
4680gtgcacgcgg atttcggctc caacaatgtc ctgacggaca atggccgcat aacagcggtc
4740attgactgga gcgaggcgat gttcggggat tcccaatacg aggtcgccaa catcttcttc
4800tggaggccgt ggttggcttg tatggagcag cagacgcgct acttcgagcg gaggcatccg
4860gagcttgcag gatcgccgcg gctccgggcg tatatgctcc gcattggtct tgaccaactc
4920tatcagagct tggttgacgg caatttcgat gatgcagctt gggcgcaggg tcgatgcgac
4980gcaatcgtcc gatccggagc cgggactgtc gggcgtacac aaatcgcccg cagaagcgcg
5040gccgtctgga ccgatggctg tgtagaagta ctcgccgata gtggaaaccg acgccccagc
5100actcgtccga gggcaaagga atagtgaggt acagcttgga tcgatccggc tgctaacaaa
5160gcccgaaagg aagctgagtt ggctgctgcc accgctgagc aataactagc ataacccctt
5220ggggcctcta aacgggtctt gaggggtttt ttgctgaaag gaggaactat atccggatga
5280tcgggcgcgc cgtcgacgga tccactagtt ctagagcggc ccgcgccgtc gacggatata
5340atgagccgta aacaaagatg attaagtagt aattaatacg tactagtaaa agtggcaaaa
5400gataacgaga aagaaccaat ttctttgcat tcggccttag cggaaggcat atataagctt
5460tgattatttt atttagtgta atgatttcgt acaaccaaag catttattta gtactctcac
5520acttgtgtcg cggccggccg ctacaggaac aggtggtggc ggccctcggc gcgctcgtac
5580tgctccacga tggtgtagtc ctcgttgtgg gaggtgatgt ccagcttgga gtccacgtag
5640tagtagccgg gcagctgcac gggcttcttg gccatgtaga tggacttgaa ctccaccagg
5700tagtggccgc cgtccttcag cttcagggcc ttgtggatct cgcccttcag cacgccgtcg
5760cgggggtaca ggcgctcggt ggaggcctcc cagcccatag tcttcttctg cattacgggg
5820ccgtcggagg ggaagttcac gccgatgaac ttcaccttgt agatgaagga gccgtcctgc
5880agggaggagt cctgggtcac ggtcaccacg ccgccgtcct cgaagttcat cacgcgctcc
5940cacttgaagc cctcggggaa ggacagcttc ttgtagtcgg ggatgtcggc ggggtgcttc
6000acgtacacct tggagccgta ctggaactgg ggggacagga tgtcccaggc gaagggcagg
6060gggccgccct tggtcacctt cagcttggcg gtctgggtgc cctcgtaggg gcggccctcg
6120ccctcgccct cgatctcgaa ctcgtggccg ttcacggagc cctccatgcg caccttgaag
6180cgcatgaact ccttgatgac gtcctcggag gaggccatgg gccgcttggg gggctatgga
6240agactttctt agttagttgt gtgaataagc aatgttggga gaatcgggac tacttatagg
6300ataggaataa aacagaaaag tattaagtgc taatgaaata tttagactga taattaaaat
6360cttcacgtat gtccacttga tataaaaacg tcaggaataa aggaagtaca gtagaattta
6420aaggtactct ttttatatat acccgtgttc tctttttggc tagctagttg cataaaaaat
6480aatctatatt tttatcatta ttttaaatat cttatgagat ggtaaatatt tatcataatt
6540ttttttacta ttatttatta tttgtgtgtg taatacatat agaagttaat tacaaatttt
6600atttactttt tcattatttt gatatgattc accattaatt tagtgttatt atttataata
6660gttcatttta atctttttgt atatattatg cgtgcagtac ttttttccta catataacta
6720ctattacatt ttatttatat aatattttta ttaatgaatt ttcgtgataa tatgtaatat
6780tgttcattat tatttcagat tttttaaaaa tatttgtgtt attatttatg aaatatgtaa
6840tttttttagt atttgatttt atgatgataa agtgttctaa attcaaaaga agggggaaag
6900cgtaaacatt aaaaaacgtc atcaaacaaa aacaaaatct tgttaataaa gataaaactg
6960tttgttttga tcactgttat ttcgtaatat aaaaacatta tttatattta tattgttgac
7020aaccaaattt gcctatcaaa tctaaccaat ataatgcatg cgtggcaggt aatgtactac
7080catgaactta agtcatgaca taataaaccg tgaatctgac caatgcatgt acctanctaa
7140attgtatttg tgacacgaag caaatgattc aattcacaat ggagatggga aacaaataat
7200gaagaaccca gaactaagaa agcttttctg aaaaataaaa taaaggcaat gtcaaaagta
7260tactgcatca tcagtccaga aagcacatga tattttttta tcagtatcaa tgcagctagt
7320tttattttac aatatcgata tagctagttt aaatatattg cagctagatt tataaatatt
7380tgtgttatta tttatcattt gtgtaatcct gtttttagta ttttagttta tatatgatga
7440taatgtattc caaatttaaa agaagggaaa taaatttaaa caagaaaaaa agtcatcaaa
7500caaaaaacaa atgaaagggt ggaaagatgt taccatgtaa tgtgaatgtt acagtatttc
7560ttttattata gagttaacaa attaactaat atgattttgt taataatgat aaaatatttt
7620ttttattatt atttcataat ataaaaatag tttacttaat ataaaaaaaa ttctatcgtt
7680cacaacaaag ttggccacct aatttaacca tgcatgtacc catggaccat attaggtaac
7740catcaaacct gatgaagaga taaagagatg aagacttaag tcataacaca aaaccataaa
7800aaacaaaaat acaatcaacc gtcaatctga ccaatgcatg aaaaagctgc aatagtgagt
7860ggcgacacaa agcacatgat tttcttacaa cggagataaa accaaaaaaa tatttcatga
7920acaacctaga acaaataaag cttttatata ataaatatat aaataaataa aggctatgga
7980ataatatact tcaatatatt tggattaaat aaattgttgg cggggttgat atatttatac
8040acacctaaag tcacttcaat ctcattttca cttaactttt attttttttt tctttttatt
8100tatcataaag agaatattga taatatactt tttaacatat ttttatgaca ttttttattg
8160gtgaaaactt attaaaaatc ataaattttg taagttagat ttatttaaag agttcctctt
8220cttattttaa attttttaat aaatttttaa ataactaaaa tttgtgttaa aaatgttaaa
8280aaagtgtgtt attaaccctt ctcttcgagg atccgtacga tcccacatgc aagtttttat
8340ttcaatccct tttcctttga ataactgacc aagaacaaca agaaaaaaaa aaaaaaagaa
8400aaggatcatt ttgaaaggat atttttcgct cctattcaaa tactgtattt ttaccaaaaa
8460aactgtattt ttcctacact ctcaagcttt gtttttcgct tcgactctca tgatttcctt
8520catatgccaa tcactctatt tataaatggc ataaggtagt gtgaacaatt gcaaagcttg
8580tcatcaaaag cttgcaatgt acaaattaat gtttttcatg cctttcaaaa ttatctgcac
8640cccctagcta ttaatctaac atctaagtaa ggctagtgaa ttttttcgaa tagtcatgca
8700gtgcattaat ttccccgtga ctattttggc tttgactcca acactggccc cgtacatccg
8760tccctcatta catgaaaaga aatattgttt atattcttaa ttaaaaatat tgtcccttct
8820aaattttcat atagttaatt attatattac ttttttctct attctattag ttctattttc
8880aaattattat ttatgcatat gtaaagtaca ttatattttt gctatatact taaatatttc
8940taaattatta aaaaaagact gatatgaaaa atttattctt tttaaagcta tatcatttta
9000tatatacttt ttcttttctt ttctttcatt ttctattcaa tttaataaga aataaatttt
9060gtaaattttt atttatcaat ttataaaaat attttacttt atatgttttt tcacattttt
9120gttaaacaaa tcatatcatt atgattgaaa gagaggaaat tgacagtgag taataagtga
9180tgagaaaaaa atgtgttatt tcctaaaaaa aacctaaaca aacatgtatc tactctctat
9240ttcatctatc tctcatttca tttttctctt tatctctttc tttatttttt tatcatatca
9300tttcacatta attattttta ctctctttat tttttctctc tatccctctc ttatttccac
9360tcatatatac actccaaaat tggggcatgc ctttatcact actctatctc ctccactaaa
9420tcatttaaat gaaactgaaa agcattggca agtctcctcc cctcctcaag tgatttccaa
9480ctcagcattg gcatctaatt gattcagtat atctattgca tgtgtaaaag tctttccaca
9540atacataact attaattaat cttaaataaa taaaggataa aatatttttt tttcttcata
9600aaattaaaat atgttatttt ttgtttagat gtatattcga ataaatctaa atatatgata
9660atgatttttt atattgatta aacatataat caatattaaa tatgatattt ttttatatag
9720gttgtacaca taattttata aggataaaaa atatgataaa aataaatttt aaatattttt
9780atatttacga gaaaaaaaaa tattttagcc ataaataaat gaccagcata ttttacaacc
9840ttagtaattc ataaattcct atatgtatat ttgaaattaa aaacagataa tcgttaaggg
9900aaggaatcct acgtcatctc ttgccatttg tttttcatgc aaacagaaag ggacgaaaaa
9960ccacctcacc atgaatcact cttcacacca tttttactag caaacaagtc tcaacaactg
10020aagccagctc tctttccgtt tctttttaca acactttctt tgaaatagta gtattttttt
10080ttcacatgat ttattaacgt gccaaaagat gcttattgaa tagagtgcac atttgtaatg
10140tactactaat tagaacatga aaaagcattg ttctaacacg ataatcctgt gaaggcgtta
10200actccaaaga tccaatttca ctatataaat tgtgacgaaa gcaaaatgaa ttcacatagc
10260tgagagagaa aggaaaggtt aactaagaag caatacttca gcggccgcat gagccgtaaa
10320ggttcaatac aacgagtgct tgttttctta gggacaagca ttgtacttat gtatgattct
10380gtgtaaccat gagtcttcca cgttgtacta atgtgaaggg caaaaataaa acacagaaca
10440agttcgtttt tctcaaataa tgtgaaggta gaaaatggaa ccatgcctcc tctcttgcat
10500gtgatttaaa atattagcag atggtacc
105283111721DNAArtificial Sequencevector pKR1147 31ggccgcatga gccgtaaagg
ttcaatacaa cgagtgcttg ttttcttagg gacaagcatt 60gtacttatgt atgattctgt
gtaaccatga gtcttccacg ttgtactaat gtgaagggca 120aaaataaaac acagaacaag
ttcgtttttc tcaaataatg tgaaggtaga aaatggaacc 180atgcctcctc tcttgcatgt
gatttaaaat attagcagat ggtaccgtac gagatccggc 240cggccagatc ctgcaggaga
tccaagcttg gcgcgccgtt ctatagtgtc acctaaatcg 300tatgtgtatg atacataagg
ttatgtatta attgtagccg cgttctaacg acaatatgtc 360catatggtgc actctcagta
caatctgctc tgatgccgca tagttaagcc agccccgaca 420cccgccaaca cccgctgacg
cgccctgacg ggcttgtctg ctcccggcat ccgcttacag 480acaagctgtg accgtctccg
ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa 540acgcgcgaga cgaaagggcc
tcgtgatacg cctattttta taggttaatg tcatgaccaa 600aatcccttaa cgtgagtttt
cgttccactg agcgtcagac cccgtagaaa agatcaaagg 660atcttcttga gatccttttt
ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc 720gctaccagcg gtggtttgtt
tgccggatca agagctacca actctttttc cgaaggtaac 780tggcttcagc agagcgcaga
taccaaatac tgtccttcta gtgtagccgt agttaggcca 840ccacttcaag aactctgtag
caccgcctac atacctcgct ctgctaatcc tgttaccagt 900ggctgctgcc agtggcgata
agtcgtgtct taccgggttg gactcaagac gatagttacc 960ggataaggcg cagcggtcgg
gctgaacggg gggttcgtgc acacagccca gcttggagcg 1020aacgacctac accgaactga
gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc 1080cgaagggaga aaggcggaca
ggtatccggt aagcggcagg gtcggaacag gagagcgcac 1140gagggagctt ccagggggaa
acgcctggta tctttatagt cctgtcgggt ttcgccacct 1200ctgacttgag cgtcgatttt
tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 1260cagcaacgcg gcctttttac
ggttcctggc cttttgctgg ccttttgctc acatgttctt 1320tcctgcgtta tcccctgatt
ctgtggataa ccgtattacc gcctttgagt gagctgatac 1380cgctcgccgc agccgaacga
ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 1440cccaatacgc aaaccgcctc
tccccgcgcg ttggccgatt cattaatgca ggttgatcga 1500ttcgacatcg atctagtaac
atagatgaca ccgcgcgcga taatttatcc tagtttgcgc 1560gctatatttt gttttctatc
gcgtattaaa tgtataattg cgggactcta atcataaaaa 1620cccatctcat aaataacgtc
atgcattaca tgttaattat tacatgctta acgtaattca 1680acagaaatta tatgataatc
atcgcaagac cggcaacagg attcaatctt aagaaacttt 1740attgccaaat gtttgaacga
tctgcttcga cgcactcctt ctttaggtac ctcactattc 1800ctttgccctc ggacgagtgc
tggggcgtcg gtttccacta tcggcgagta cttctacaca 1860gccatcggtc cagacggccg
cgcttctgcg ggcgatttgt gtacgcccga cagtcccggc 1920tccggatcgg acgattgcgt
cgcatcgacc ctgcgcccaa gctgcatcat cgaaattgcc 1980gtcaaccaag ctctgataga
gttggtcaag accaatgcgg agcatatacg cccggagccg 2040cggcgatcct gcaagctccg
gatgcctccg ctcgaagtag cgcgtctgct gctccataca 2100agccaaccac ggcctccaga
agaagatgtt ggcgacctcg tattgggaat ccccgaacat 2160cgcctcgctc cagtcaatga
ccgctgttat gcggccattg tccgtcagga cattgttgga 2220gccgaaatcc gcgtgcacga
ggtgccggac ttcggggcag tcctcggccc aaagcatcag 2280ctcatcgaga gcctgcgcga
cggacgcact gacggtgtcg tccatcacag tttgccagtg 2340atacacatgg ggatcagcaa
tcgcgcatat gaaatcacgc catgtagtgt attgaccgat 2400tccttgcggt ccgaatgggc
cgaacccgct cgtctggcta agatcggccg cagcgatcgc 2460atccatggcc tccgcgaccg
gctgcagaac agcgggcagt tcggtttcag gcaggtcttg 2520caacgtgaca ccctgtgcac
ggcgggagat gcaataggtc aggctctcgc tgaattcccc 2580aatgtcaagc acttccggaa
tcgggagcgc ggccgatgca aagtgccgat aaacataacg 2640atctttgtag aaaccatcgg
cgcagctatt tacccgcagg acatatccac gccctcctac 2700atcgaagctg aaagcacgag
attcttcgcc ctccgagagc tgcatcaggt cggagacgct 2760gtcgaacttt tcgatcagaa
acttctcgac agacgtcgcg gtgagttcag gctttttcat 2820ggtttaataa gaagagaaaa
gagttctttt gttatggctg aagtaataga gaaatgagct 2880cgagcgtgtc ctctccaaat
gaaatgaact tccttatata gaggaagggt cttgcgaagg 2940atagtgggat tgtgcgtcat
cccttacgtc agtggagatg tcacatcaat ccacttgctt 3000tgaagacgtg gttggaacgt
cttctttttc cacgatgctc ctcgtgggtg ggggtccatc 3060tttgggacca ctgtcggcag
aggcatcttg aatgatagcc tttcctttat cgcaatgatg 3120gcatttgtag gagccacctt
ccttttctac tgtcctttcg atgaagtgac agatagctgg 3180gcaatggaat ccgaggaggt
ttcccgaaat tatcctttgt tgaaaagtct caatagccct 3240ttggtcttct gagactgtat
ctttgacatt tttggagtag accagagtgt cgtgctccac 3300catgttgacg aagattttct
tcttgtcatt gagtcgtaaa agactctgta tgaactgttc 3360gccagtcttc acggcgagtt
ctgttagatc ctcgatttga atcttagact ccatgcatgg 3420ccttagattc agtaggaact
acctttttag agactccaat ctctattact tgccttggtt 3480tatgaagcaa gccttgaatc
gtccatactg gaatagtact tctgatcttg agaaatatgt 3540ctttctctgt gttcttgatg
caattagtcc tgaatctttt gactgcatct ttaaccttct 3600tgggaaggta tttgatctcc
tggagattgt tactcgggta gatcgtcttg atgagacctg 3660ctgcgtaggc ctctctaacc
atctgtgggt cagcattctt tctgaaattg aagaggctaa 3720ccttctcatt atcagtggtg
aacatagtgt cgtcaccttc accttcgaac ttccttccta 3780gatcgtaaag atagaggaaa
tcgtccattg taatctccgg ggcaaaggag atctcttttg 3840gggctggatc actgctgggc
cttttggttc ctagcgtgag ccagtgggct ttttgctttg 3900gtgggcttgt tagggcctta
gcaaagctct tgggcttgag ttgagcttct cctttgggga 3960tgaagttcaa cctgtctgtt
tgctgacttg ttgtgtacgc gtcagctgct gctcttgcct 4020ctgtaatagt ggcaaatttc
ttgtgtgcaa ctccgggaac gccgtttgtt gccgcctttg 4080tacaacccca gtcatcgtat
ataccggcat gtggaccgtt atacacaacg tagtagttga 4140tatgagggtg ttgaataccc
gattctgctc tgagaggagc aactgtgctg ttaagctcag 4200atttttgtgg gattggaatt
ggatcgatct cgatcccgcg aaattaatac gactcactat 4260agggagacca caacggtttc
cctctagaaa taattttgtt taactttaag aaggagatat 4320acccatggaa aagcctgaac
tcaccgcgac gtctgtcgag aagtttctga tcgaaaagtt 4380cgacagcgtc tccgacctga
tgcagctctc ggagggcgaa gaatctcgtg ctttcagctt 4440cgatgtagga gggcgtggat
atgtcctgcg ggtaaatagc tgcgccgatg gtttctacaa 4500agatcgttat gtttatcggc
actttgcatc ggccgcgctc ccgattccgg aagtgcttga 4560cattggggaa ttcagcgaga
gcctgaccta ttgcatctcc cgccgtgcac agggtgtcac 4620gttgcaagac ctgcctgaaa
ccgaactgcc cgctgttctg cagccggtcg cggaggctat 4680ggatgcgatc gctgcggccg
atcttagcca gacgagcggg ttcggcccat tcggaccgca 4740aggaatcggt caatacacta
catggcgtga tttcatatgc gcgattgctg atccccatgt 4800gtatcactgg caaactgtga
tggacgacac cgtcagtgcg tccgtcgcgc aggctctcga 4860tgagctgatg ctttgggccg
aggactgccc cgaagtccgg cacctcgtgc acgcggattt 4920cggctccaac aatgtcctga
cggacaatgg ccgcataaca gcggtcattg actggagcga 4980ggcgatgttc ggggattccc
aatacgaggt cgccaacatc ttcttctgga ggccgtggtt 5040ggcttgtatg gagcagcaga
cgcgctactt cgagcggagg catccggagc ttgcaggatc 5100gccgcggctc cgggcgtata
tgctccgcat tggtcttgac caactctatc agagcttggt 5160tgacggcaat ttcgatgatg
cagcttgggc gcagggtcga tgcgacgcaa tcgtccgatc 5220cggagccggg actgtcgggc
gtacacaaat cgcccgcaga agcgcggccg tctggaccga 5280tggctgtgta gaagtactcg
ccgatagtgg aaaccgacgc cccagcactc gtccgagggc 5340aaaggaatag tgaggtacag
cttggatcga tccggctgct aacaaagccc gaaaggaagc 5400tgagttggct gctgccaccg
ctgagcaata actagcataa ccccttgggg cctctaaacg 5460ggtcttgagg ggttttttgc
tgaaaggagg aactatatcc ggatgatcgg gcgcgccgtc 5520gacggatcca ctagttctag
agcggcccgc gccgtcgacg gatataatga gccgtaaaca 5580aagatgatta agtagtaatt
aatacgtact agtaaaagtg gcaaaagata acgagaaaga 5640accaatttct ttgcattcgg
ccttagcgga aggcatatat aagctttgat tattttattt 5700agtgtaatga tttcgtacaa
ccaaagcatt tatttagtac tctcacactt gtgtcgcggc 5760cggccgctac aggaacaggt
ggtggcggcc ctcggcgcgc tcgtactgct ccacgatggt 5820gtagtcctcg ttgtgggagg
tgatgtccag cttggagtcc acgtagtagt agccgggcag 5880ctgcacgggc ttcttggcca
tgtagatgga cttgaactcc accaggtagt ggccgccgtc 5940cttcagcttc agggccttgt
ggatctcgcc cttcagcacg ccgtcgcggg ggtacaggcg 6000ctcggtggag gcctcccagc
ccatagtctt cttctgcatt acggggccgt cggaggggaa 6060gttcacgccg atgaacttca
ccttgtagat gaaggagccg tcctgcaggg aggagtcctg 6120ggtcacggtc accacgccgc
cgtcctcgaa gttcatcacg cgctcccact tgaagccctc 6180ggggaaggac agcttcttgt
agtcggggat gtcggcgggg tgcttcacgt acaccttgga 6240gccgtactgg aactgggggg
acaggatgtc ccaggcgaag ggcagggggc cgcccttggt 6300caccttcagc ttggcggtct
gggtgccctc gtaggggcgg ccctcgccct cgccctcgat 6360ctcgaactcg tggccgttca
cggagccctc catgcgcacc ttgaagcgca tgaactcctt 6420gatgacgtcc tcggaggagg
ccatgggccg cttggggggc tatggaagac tttcttagtt 6480agttgtgtga ataagcaatg
ttgggagaat cgggactact tataggatag gaataaaaca 6540gaaaagtatt aagtgctaat
gaaatattta gactgataat taaaatcttc acgtatgtcc 6600acttgatata aaaacgtcag
gaataaagga agtacagtag aatttaaagg tactcttttt 6660atatataccc gtgttctctt
tttggctagc tagttgcata aaaaataatc tatattttta 6720tcattatttt aaatatctta
tgagatggta aatatttatc ataatttttt ttactattat 6780ttattatttg tgtgtgtaat
acatatagaa gttaattaca aattttattt actttttcat 6840tattttgata tgattcacca
ttaatttagt gttattattt ataatagttc attttaatct 6900ttttgtatat attatgcgtg
cagtactttt ttcctacata taactactat tacattttat 6960ttatataata tttttattaa
tgaattttcg tgataatatg taatattgtt cattattatt 7020tcagattttt taaaaatatt
tgtgttatta tttatgaaat atgtaatttt tttagtattt 7080gattttatga tgataaagtg
ttctaaattc aaaagaaggg ggaaagcgta aacattaaaa 7140aacgtcatca aacaaaaaca
aaatcttgtt aataaagata aaactgtttg ttttgatcac 7200tgttatttcg taatataaaa
acattattta tatttatatt gttgacaacc aaatttgcct 7260atcaaatcta accaatataa
tgcatgcgtg gcaggtaatg tactaccatg aacttaagtc 7320atgacataat aaaccgtgaa
tctgaccaat gcatgtacct anctaaattg tatttgtgac 7380acgaagcaaa tgattcaatt
cacaatggag atgggaaaca aataatgaag aacccagaac 7440taagaaagct tttctgaaaa
ataaaataaa ggcaatgtca aaagtatact gcatcatcag 7500tccagaaagc acatgatatt
tttttatcag tatcaatgca gctagtttta ttttacaata 7560tcgatatagc tagtttaaat
atattgcagc tagatttata aatatttgtg ttattattta 7620tcatttgtgt aatcctgttt
ttagtatttt agtttatata tgatgataat gtattccaaa 7680tttaaaagaa gggaaataaa
tttaaacaag aaaaaaagtc atcaaacaaa aaacaaatga 7740aagggtggaa agatgttacc
atgtaatgtg aatgttacag tatttctttt attatagagt 7800taacaaatta actaatatga
ttttgttaat aatgataaaa tatttttttt attattattt 7860cataatataa aaatagttta
cttaatataa aaaaaattct atcgttcaca acaaagttgg 7920ccacctaatt taaccatgca
tgtacccatg gaccatatta ggtaaccatc aaacctgatg 7980aagagataaa gagatgaaga
cttaagtcat aacacaaaac cataaaaaac aaaaatacaa 8040tcaaccgtca atctgaccaa
tgcatgaaaa agctgcaata gtgagtggcg acacaaagca 8100catgattttc ttacaacgga
gataaaacca aaaaaatatt tcatgaacaa cctagaacaa 8160ataaagcttt tatataataa
atatataaat aaataaaggc tatggaataa tatacttcaa 8220tatatttgga ttaaataaat
tgttggcggg gttgatatat ttatacacac ctaaagtcac 8280ttcaatctca ttttcactta
acttttattt tttttttctt tttatttatc ataaagagaa 8340tattgataat atacttttta
acatattttt atgacatttt ttattggtga aaacttatta 8400aaaatcataa attttgtaag
ttagatttat ttaaagagtt cctcttctta ttttaaattt 8460tttaataaat ttttaaataa
ctaaaatttg tgttaaaaat gttaaaaaag tgtgttatta 8520acccttctct tcgaggatcc
gtacgatccc acatgcaagt ttttatttca atcccttttc 8580ctttgaataa ctgaccaaga
acaacaagaa aaaaaaaaaa aaagaaaagg atcattttga 8640aaggatattt ttcgctccta
ttcaaatact gtatttttac caaaaaaact gtatttttcc 8700tacactctca agctttgttt
ttcgcttcga ctctcatgat ttccttcata tgccaatcac 8760tctatttata aatggcataa
ggtagtgtga acaattgcaa agcttgtcat caaaagcttg 8820caatgtacaa attaatgttt
ttcatgcctt tcaaaattat ctgcaccccc tagctattaa 8880tctaacatct aagtaaggct
agtgaatttt ttcgaatagt catgcagtgc attaatttcc 8940ccgtgactat tttggctttg
actccaacac tggccccgta catccgtccc tcattacatg 9000aaaagaaata ttgtttatat
tcttaattaa aaatattgtc ccttctaaat tttcatatag 9060ttaattatta tattactttt
ttctctattc tattagttct attttcaaat tattatttat 9120gcatatgtaa agtacattat
atttttgcta tatacttaaa tatttctaaa ttattaaaaa 9180aagactgata tgaaaaattt
attcttttta aagctatatc attttatata tactttttct 9240tttcttttct ttcattttct
attcaattta ataagaaata aattttgtaa atttttattt 9300atcaatttat aaaaatattt
tactttatat gttttttcac atttttgtta aacaaatcat 9360atcattatga ttgaaagaga
ggaaattgac agtgagtaat aagtgatgag aaaaaaatgt 9420gttatttcct aaaaaaaacc
taaacaaaca tgtatctact ctctatttca tctatctctc 9480atttcatttt tctctttatc
tctttcttta tttttttatc atatcatttc acattaatta 9540tttttactct ctttattttt
tctctctatc cctctcttat ttccactcat atatacactc 9600caaaattggg gcatgccttt
atcactactc tatctcctcc actaaatcat ttaaatgaaa 9660ctgaaaagca ttggcaagtc
tcctcccctc ctcaagtgat ttccaactca gcattggcat 9720ctaattgatt cagtatatct
attgcatgtg taaaagtctt tccacaatac ataactatta 9780attaatctta aataaataaa
ggataaaata tttttttttc ttcataaaat taaaatatgt 9840tattttttgt ttagatgtat
attcgaataa atctaaatat atgataatga ttttttatat 9900tgattaaaca tataatcaat
attaaatatg atattttttt atataggttg tacacataat 9960tttataagga taaaaaatat
gataaaaata aattttaaat atttttatat ttacgagaaa 10020aaaaaatatt ttagccataa
ataaatgacc agcatatttt acaaccttag taattcataa 10080attcctatat gtatatttga
aattaaaaac agataatcgt taagggaagg aatcctacgt 10140catctcttgc catttgtttt
tcatgcaaac agaaagggac gaaaaaccac ctcaccatga 10200atcactcttc acaccatttt
tactagcaaa caagtctcaa caactgaagc cagctctctt 10260tccgtttctt tttacaacac
tttctttgaa atagtagtat ttttttttca catgatttat 10320taacgtgcca aaagatgctt
attgaataga gtgcacattt gtaatgtact actaattaga 10380acatgaaaaa gcattgttct
aacacgataa tcctgtgaag gcgttaactc caaagatcca 10440atttcactat ataaattgtg
acgaaagcaa aatgaattca catagctgag agagaaagga 10500aaggttaact aagaagcaat
acttcagcgg ccgcatggag agatctcaac ggcagtctcc 10560tccgccaccg tcgccgtcct
cctcctcgtc ctccgtctcc gcggacaccg tcctcgtccc 10620tcccggaaag aggcggaggg
cggcgacggc caaggccggc gccgagccta ataagaggat 10680ccgcaaggac cccgccgccg
ccgccgcggg gaagaggagc tccgtctaca ggggagtcac 10740caggcacagg tggacgggca
ggttcgaggc gcatctctgg gacaagcact gcctcgccgc 10800gctccacaac aagaagaaag
gcaggcaagt ctacctgggg gcgtatgaca gcgaggaggc 10860agctgctcgt gcctatgacc
tcgcagctct caagtactgg ggtcctgaga ctctgctcaa 10920cttccctgtg gaggattact
ccagcgagat gccggagatg gaggccgtgt cccgggagga 10980gtacctggcc tccctccgcc
gcaggagcag cggcttctcc aggggcgtct ccaagtacag 11040aggcgtcgcc aggcatcacc
acaacgggag gtgggaggca cggattgggc gagtctttgg 11100gaacaagtac ctctacttgg
gaacatttga cactcaagaa gaggcagcca aggcctatga 11160ccttgcggcc attgaatacc
gtggcgtcaa tgctgtaacc aacttcgaca tcagctgcta 11220cctggaccac ccgctgttcc
tggcacagct ccaacaggag ccacaggtgg tgccggcact 11280caaccaagaa cctcaacctg
atcagagcga aaccggaact acagagcaag agccggagtc 11340aagcgaagcc aagacaccgg
atggcagtgc agaacccgat gagaacgcgg tgcctgacga 11400caccgcggag cccctcacca
cagtcgacga cagcatcgaa gagggcttgt ggagcccttg 11460catggattac gagctagaca
ccatgtcgag accaaacttt ggcagctcaa tcaatctgag 11520cgagtggttc gctgacgcag
acttcgactg caacatcgga tgcctgttcg atgggtgttc 11580tgcggctgac gaaggaagca
aggatggtgt aggtctggca gatttcagtc tgtttgaggc 11640aggtgatgtc cagctgaagg
atgttctttc ggatatggaa gaggggatac aacctccagc 11700gatgatcagt gtgtgcaacg c
117213219713DNAArtificial
Sequencevector pKR1220 32cgcgccagat cctctagagt cgacctgcag gcatgcaagc
ttggcgtaat catggtcata 60gctgtttcct gtgtgaaatt gttatccgct cacaattcca
cacaacatac gagccggaag 120cataaagtgt aaagcctggg gtgcctaatg agtgagctaa
ctcacattaa ttgcgttgcg 180ctcactgccc gctttccagt cgggaaacct gtcgtgccag
ctgcattaat gaatcggcca 240acgcgcgggg agaggcggtt tgcgtattgg atcgatccct
gaaagcgacg ttggatgtta 300acatctacaa attgcctttt cttatcgacc atgtacgtaa
gcgcttacgt ttttggtgga 360cccttgagga aactggtagc tgttgtgggc ctgtggtctc
aagatggatc attaatttcc 420accttcacct acgatggggg gcatcgcacc ggtgagtaat
attgtacggc taagagcgaa 480tttggcctgt agacctcaat tgcgagcttt ctaatttcaa
actattcggg cctaactttt 540ggtgtgatga tgctgactgg caggatatat accgttgtaa
tttgagctcg tgtgaataag 600tcgctgtgta tgtttgtttg attgtttctg ttggagtgca
gcccatttca ccggacaagt 660cggctagatt gatttagccc tgatgaactg ccgaggggaa
gccatcttga gcgcggaatg 720ggaatggatt tcgttgtaca acgagacgac agaacaccca
cgggaccgag cttcgcgagc 780ttttgtatcc gtggcatcct tggtccgggc gatttgttca
cgtccatgag gcgctctcca 840aaggaacgca tattttccgg tgcaaccttt ccggttcttc
ctctactcga cctcttgaag 900tcccagcatg aatgttcgac cgctccgcaa gcggatcttt
ggcgcaacca gccggtttcg 960cacgtcgatt ctcgcgagcc tgcatacttt ggcaagattg
ctgaatgacg ctgatgcttc 1020atcgcaatct gcgataatgg ggtaagtatc cggtgaaggc
cgcaggtcag gccgcctgag 1080cactcagtgt cttggatgtc cagttccacg gcagctgttg
ctcaagcctg ctgatcggag 1140cgtccgcaag gtcggcgcgg acgtcggcaa gccaggcctg
cggatcgatg ttattgagct 1200tggcgctcat gatcagtgtc gccatgaacg ccgcacgttc
agcacaacga tccgatccgg 1260caaacagcca tgacttcctg ccgagtacat agcctctgag
cgttcgttcg gcagcattgt 1320tcgtcaggca aatcgggccg tcatcgagga atgacgtaat
gccatcccat cgcttgagca 1380tgtaatttat cgcctcggcg acgggagaac tgcgcgacaa
tttcccccgc tcggtttcga 1440gccaatcatg cagctcttcg gcgagtgacc ttgatcaggc
caccgccacg accgcggaag 1500acgaacagat gcctgcgcat cggatcgcgc ttcagcgtct
cttgcaccat cagcgacaaa 1560ccgggaaagc ctttgcgcat gtccgtactt atgtcgccac
ttgggagggc ttcgtctacg 1620tggccttcgt gatcgacgtc ttcgcccgtc gcattgtcgg
atggcgggcg agccggacag 1680cacatgcagg ctttgtcctc gatgccctcg aggaggctca
tcatgatcgg cgtcccgctc 1740atggcggcct agtgcatcac tcggatcgcg gtgttcaata
cgtgtccttt cgctattccg 1800agcggttggc agaagcaggt atcgagccat ctatcggaag
cgtcggcgac agcacgacaa 1860cgccctcgca gaagcgatca acggtcttta caaggccgag
gtcattcatc ggcgtggacc 1920atggaggagc ttcgaagcgg tcgagttcgc taccttggaa
tggatagact ggttcaacca 1980cggcggcttt tgaagcccat cggcaatata ccgccagccg
aagacgagga tcagtattac 2040gccatgctgg acgaagcagc catggctgcg cattttaacg
aaatggcctc cggcaaaccc 2100ggtgcggttc acttgttgcg tgggaaagtt cacgggactc
cgcgcacgag ccttcttcgt 2160aatagccata tcgaccgaat tgacctgcag gggggggggg
gaaagccacg ttgtgtctca 2220aaatctctga tgttacattg cacaagataa aaatatatca
tcatgaacaa taaaactgtc 2280tgcttacata aacagtaata caaggggtgt tatgagccat
attcaacggg aaacgtcttg 2340ctcgaggccg cgattaaatt ccaacatgga tgctgattta
tatgggtata aatgggctcg 2400cgataatgtc gggcaatcag gtgcgacaat ctatcgattg
tatgggaagc ccgatgcgcc 2460agagttgttt ctgaaacatg gcaaaggtag cgttgccaat
gatgttacag atgagatggt 2520cagactaaac tggctgacgg aatttatgcc tcttccgacc
atcaagcatt ttatccgtac 2580tcctgatgat gcatggttac tcaccactgc gatccccggg
aaaacagcat tccaggtatt 2640agaagaatat cctgattcag gtgaaaatat tgttgatgcg
ctggcagtgt tcctgcgccg 2700gttgcattcg attcctgttt gtaattgtcc ttttaacagc
gatcgcgtat ttcgtctcgc 2760tcaggcgcaa tcacgaatga ataacggttt ggttgatgcg
agtgattttg atgacgagcg 2820taatggctgg cctgttgaac aagtctggaa agaaatgcat
aagcttttgc cattctcacc 2880ggattcagtc gtcactcatg gtgatttctc acttgataac
cttatttttg acgaggggaa 2940attaataggt tgtattgatg ttggacgagt cggaatcgca
gaccgatacc aggatcttgc 3000catcctatgg aactgcctcg gtgagttttc tccttcatta
cagaaacggc tttttcaaaa 3060atatggtatt gataatcctg atatgaataa attgcagttt
catttgatgc tcgatgagtt 3120tttctaatca gaattggtta attggttgta acactggcag
agcattacgc tgacttgacg 3180ggacggcggc tttgttgaat aaatcgaact tttgctgagt
tgaaggatca gatcacgcat 3240cttcccgaca acgcagaccg ttccgtggca aagcaaaagt
tcaaaatcac caactggtcc 3300acctacaaca aagctctcat caaccgtggc tccctcactt
tctggctgga tgatggggcg 3360attcaggcct ggtatgagtc agcaacacct tcttcacgag
gcagacctca gcgccccccc 3420ccccctgcag gtcttttcca atgatgagca cttttaaagt
tctgctatgt ggcgcggtat 3480tatcccgtgt tgacgccggg caagagcaac tcggtcgccg
catacactat tctcagaatg 3540acttggttga gtactcacca gtcacagaaa agcatcttac
ggatggcatg acagtaagag 3600aattatgcag tgctgccata accatgagtg ataacactgc
ggccaactta cttctgacaa 3660cgatcggagg accgaaggag ctaaccgctt ttttgcacaa
catgggggat catgtaactc 3720gccttgatcg ttgggaaccg gagctgaatg aagccatacc
aaacgacgag cgtgacacca 3780cgatgcctgt agcaatggca acaacgttgc gcaaactatt
aactggcgaa ctacttactc 3840tagcttcccg gcaacaatta atagactgga tggaggcgga
taaagttgca ggaccacttc 3900tgcgctcggc ccttccggct ggctggttta ttgctgataa
atctggagcc ggtgagcgtg 3960ggtctcgcgg tatcattgca gcactggggc cagatggtaa
gccctcccgt atcgtagtta 4020tctacacgac ggggagtcag gcaactatgg atgaacgaaa
tagacagatc gctgagatag 4080gtgcctcact gattaagcat tggtaactgt cagaccaagt
ttactcatat atactttaga 4140ttgatttaaa acttcatttt taatttaaaa ggatctaggt
gaagatcctt tttgataatc 4200tcatgaccaa aatcccttaa cgtgagtttt cgttccactg
agcgtcagac cccgtagaaa 4260agatcaaagg atcttcttga gatccttttt ttctgcgcgt
aatctgctgc ttgcaaacaa 4320aaaaaccacc gctaccagcg gtggtttgtt tgccggatca
agagctacca actctttttc 4380cgaaggtaac tggcttcagc agagcgcaga taccaaatac
tgtccttcta gtgtagccgt 4440agttaggcca ccacttcaag aactctgtag caccgcctac
atacctcgct ctgctaatcc 4500tgttaccagt ggctgctgcc agtggcgata agtcgtgtct
taccgggttg gactcaagac 4560gatagttacc ggataaggcg cagcggtcgg gctgaacggg
gggttcgtgc acacagccca 4620gcttggagcg aacgacctac accgaactga gatacctaca
gcgtgagcta tgagaaagcg 4680ccacgcttcc cgaagggaga aaggcggaca ggtatccggt
aagcggcagg gtcggaacag 4740gagagcgcac gagggagctt ccagggggaa acgcctggta
tctttatagt cctgtcgggt 4800ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc
gtcagggggg cggagcctat 4860ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc
cttttgctgg ccttttgctc 4920acatgttctt tcctgcgtta tcccctgatt ctgtggataa
ccgtattacc gcctttgagt 4980gagctgatac cgctcgccgc agccgaacga ccgagcgcag
cgagtcagtg agcgaggaag 5040cggaagagcg cctgatgcgg tattttctcc ttacgcatct
gtgcggtatt tcacaccgca 5100tatggtgcac tctcagtaca atctgctctg atgccgcata
gttaagccag tatacactcc 5160gctatcgcta cgtgactggg tcatggctgc gccccgacac
ccgccaacac ccgctgacgc 5220gccctgacgg gcttgtctgc tcccggcatc cgcttacaga
caagctgtga ccgtctccgg 5280gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa
cgcgcgaggc agggtgcctt 5340gatgtgggcg ccggcggtcg agtggcgacg gcgcggcttg
tccgcgccct ggtagattgc 5400ctggccgtag gccagccatt tttgagcggc cagcggccgc
gataggccga cgcgaagcgg 5460cggggcgtag ggagcgcagc gaccgaaggg taggcgcttt
ttgcagctct tcggctgtgc 5520gctggccaga cagttatgca caggccaggc gggttttaag
agttttaata agttttaaag 5580agttttaggc ggaaaaatcg ccttttttct cttttatatc
agtcacttac atgtgtgacc 5640ggttcccaat gtacggcttt gggttcccaa tgtacgggtt
ccggttccca atgtacggct 5700ttgggttccc aatgtacgtg ctatccacag gaaagagacc
ttttcgacct ttttcccctg 5760ctagggcaat ttgccctagc atctgctccg tacattagga
accggcggat gcttcgccct 5820cgatcaggtt gcggtagcgc atgactagga tcgggccagc
ctgccccgcc tcctccttca 5880aatcgtactc cggcaggtca tttgacccga tcagcttgcg
cacggtgaaa cagaacttct 5940tgaactctcc ggcgctgcca ctgcgttcgt agatcgtctt
gaacaaccat ctggcttctg 6000ccttgcctgc ggcgcggcgt gccaggcggt agagaaaacg
gccgatgccg ggatcgatca 6060aaaagtaatc ggggtgaacc gtcagcacgt ccgggttctt
gccttctgtg atctcgcggt 6120acatccaatc agctagctcg atctcgatgt actccggccg
cccggtttcg ctctttacga 6180tcttgtagcg gctaatcaag gcttcaccct cggataccgt
caccaggcgg ccgttcttgg 6240ccttcttcgt acgctgcatg gcaacgtgcg tggtgtttaa
ccgaatgcag gtttctacca 6300ggtcgtcttt ctgctttccg ccatcggctc gccggcagaa
cttgagtacg tccgcaacgt 6360gtggacggaa cacgcggccg ggcttgtctc ccttcccttc
ccggtatcgg ttcatggatt 6420cggttagatg ggaaaccgcc atcagtacca ggtcgtaatc
ccacacactg gccatgccgg 6480ccggccctgc ggaaacctct acgtgcccgt ctggaagctc
gtagcggatc acctcgccag 6540ctcgtcggtc acgcttcgac agacggaaaa cggccacgtc
catgatgctg cgactatcgc 6600gggtgcccac gtcatagagc atcggaacga aaaaatctgg
ttgctcgtcg cccttgggcg 6660gcttcctaat cgacggcgca ccggctgccg gcggttgccg
ggattctttg cggattcgat 6720cagcggccgc ttgccacgat tcaccggggc gtgcttctgc
ctcgatgcgt tgccgctggg 6780cggcctgcgc ggccttcaac ttctccacca ggtcatcacc
cagcgccgcg ccgatttgta 6840ccgggccgga tggtttgcga ccgctcacgc cgattcctcg
ggcttggggg ttccagtgcc 6900attgcagggc cggcagacaa cccagccgct tacgcctggc
caaccgcccg ttcctccaca 6960catggggcat tccacggcgt cggtgcctgg ttgttcttga
ttttccatgc cgcctccttt 7020agccgctaaa attcatctac tcatttattc atttgctcat
ttactctggt agctgcgcga 7080tgtattcaga tagcagctcg gtaatggtct tgccttggcg
taccgcgtac atcttcagct 7140tggtgtgatc ctccgccggc aactgaaagt tgacccgctt
catggctggc gtgtctgcca 7200ggctggccaa cgttgcagcc ttgctgctgc gtgcgctcgg
acggccggca cttagcgtgt 7260ttgtgctttt gctcattttc tctttacctc attaactcaa
atgagttttg atttaatttc 7320agcggccagc gcctggacct cgcgggcagc gtcgccctcg
ggttctgatt caagaacggt 7380tgtgccggcg gcggcagtgc ctgggtagct cacgcgctgc
gtgatacggg actcaagaat 7440gggcagctcg tacccggcca gcgcctcggc aacctcaccg
ccgatgcgcg tgcctttgat 7500cgcccgcgac acgacaaagg ccgcttgtag ccttccatcc
gtgacctcaa tgcgctgctt 7560aaccagctcc accaggtcgg cggtggccca tatgtcgtaa
gggcttggct gcaccggaat 7620cagcacgaag tcggctgcct tgatcgcgga cacagccaag
tccgccgcct ggggcgctcc 7680gtcgatcact acgaagtcgc gccggccgat ggccttcacg
tcgcggtcaa tcgtcgggcg 7740gtcgatgccg acaacggtta gcggttgatc ttcccgcacg
gccgcccaat cgcgggcact 7800gccctgggga tcggaatcga ctaacagaac atcggccccg
gcgagttgca gggcgcgggc 7860tagatgggtt gcgatggtcg tcttgcctga cccgcctttc
tggttaagta cagcgataac 7920ttcatgcgtt cccttgcgta tttgtttatt tactcatcgc
atcatatacg cagcgaccgc 7980atgacgcaag ctgttttact caaatacaca tcaccttttt
agacggcggc gctcggtttc 8040ttcagcggcc aagctggccg gccaggccgc cagcttggca
tcagacaaac cggccaggat 8100ttcatgcagc cgcacggttg agacgtgcgc gggcggctcg
aacacgtacc cggccgcgat 8160catctccgcc tcgatctctt cggtaatgaa aaacggttcg
tcctggccgt cctggtgcgg 8220tttcatgctt gttcctcttg gcgttcattc tcggcggccg
ccagggcgtc ggcctcggtc 8280aatgcgtcct cacggaaggc accgcgccgc ctggcctcgg
tgggcgtcac ttcctcgctg 8340cgctcaagtg cgcggtacag ggtcgagcga tgcacgccaa
gcagtgcagc cgcctctttc 8400acggtgcggc cttcctggtc gatcagctcg cgggcgtgcg
cgatctgtgc cggggtgagg 8460gtagggcggg ggccaaactt cacgcctcgg gccttggcgg
cctcgcgccc gctccgggtg 8520cggtcgatga ttagggaacg ctcgaactcg gcaatgccgg
cgaacacggt caacaccatg 8580cggccggccg gcgtggtggt gtcggcccac ggctctgcca
ggctacgcag gcccgcgccg 8640gcctcctgga tgcgctcggc aatgtccagt aggtcgcggg
tgctgcgggc caggcggtct 8700agcctggtca ctgtcacaac gtcgccaggg cgtaggtggt
caagcatcct ggccagctcc 8760gggcggtcgc gcctggtgcc ggtgatcttc tcggaaaaca
gcttggtgca gccggccgcg 8820tgcagttcgg cccgttggtt ggtcaagtcc tggtcgtcgg
tgctgacgcg ggcatagccc 8880agcaggccag cggcggcgct cttgttcatg gcgtaatgtc
tccggttcta gtcgcaagta 8940ttctacttta tgcgactaaa acacgcgaca agaaaacgcc
aggaaaaggg cagggcggca 9000gcctgtcgcg taacttagga cttgtgcgac atgtcgtttt
cagaagacgg ctgcactgaa 9060cgtcagaagc cgactgcact atagcagcgg aggggttgga
ccacaggacg ggtgtggtcg 9120ccatgatcgc gtagtcgata gtggctccaa gtagcgaagc
gagcaggact gggcggcggc 9180caaagcggtc ggacagtgct ccgagaacgg gtgcgcatag
aaattgcatc aacgcatata 9240gcgctagcag cacgccatag tgactggcga tgctgtcgga
atggacgata tcccgcaaga 9300ggcccggcag taccggcata accaagccta tgcctacagc
atccagggtg acggtgccga 9360ggatgacgat gagcgcattg ttagatttca tacacggtgc
ctgactgcgt tagcaattta 9420actgtgataa actaccgcat taaagctagc ttgcttggtc
gttccgcgtg aacgtcggct 9480cgattgtacc tgcgttcaaa tactttgcga tcgtgttgcg
cgcctgcccg gtgcgtcggc 9540tgatctcacg gatcgactgc ttctctcgca acgccatccg
acggatgatg tttaaaagtc 9600ccatgtggat cactccgttg ccccgtcgct caccgtgttg
gggggaaggt gcacatggct 9660cagttctcaa tggaaattat ctgcctaacc ggctcagttc
tgcgtagaaa ccaacatgca 9720agctccaccg ggtgcaaagc ggcagcggcg gcaggatata
ttcaattgta aatggcttca 9780tgtccgggaa atctacatgg atcagcaatg agtatgatgg
tcaatatgga gaaaaagaaa 9840gagtaattac caattttttt tcaattcaaa aatgtagatg
tccgcagcgt tattataaaa 9900tgaaagtaca ttttgataaa acgacaaatt acgatccgtc
gtatttatag gcgaaagcaa 9960taaacaaatt attctaattc ggaaatcttt atttcgacgt
gtctacattc acgtccaaat 10020gggggcttag atgagaaact tcacgatcga tgccttgatt
tcgccattcc cagataccca 10080tttcatcttc agattggtct gagattatgc gaaaatatac
actcatatac ataaatactg 10140acagtttgag ctaccaattc agtgtagccc attacctcac
ataattcact caaatgctag 10200gcagtctgtc aactcggcgt caatttgtcg gccactatac
gatagttgcg caaattttca 10260aagtcctggc ctaacatcac acctctgtcg gcggcgggtc
ccatttgtga taaatccacc 10320atatcgaatt aattcagact cctttgcccc agagatcaca
atggacgact tcctctatct 10380ctacgatcta gtcaggaagt tcgacggaga aggtgacgat
accatgttca ccactgataa 10440tgagaagatt agccttttca atttcagaaa gaatgctaac
ccacagatgg ttagagaggc 10500ttacgcagca ggtctcatca agacgatcta cccgagcaat
aatctccagg agatcaaata 10560ccttcccaag aaggttaaag atgcagtcaa aagattcagg
actaactgca tcaagaacac 10620agagaaagat atatttctca agatcagaag tactattcca
gtatggacga ttcaaggctt 10680gcttcacaaa ccaaggcaag taatagagat tggagtctct
aaaaaggtag ttcccactga 10740atcaaaggcc atggagtcaa agattcaaat agaggaccta
acagaactcg ccgtaaagac 10800tggcgaacag ttcatacaga gtctcttacg actcaatgac
aagaagaaaa tcttcgtcaa 10860catggtggag cacgacacgc ttgtctactc caaaaatatc
aaagatacag tctcagaaga 10920ccaaagggca attgagactt ttcaacaaag ggtaatatcc
ggaaacctcc tcggattcca 10980ttgcccagct atctgtcact ttattgtgaa gatagtggaa
aaggaaggtg gctcctacaa 11040atgccatcat tgcgataaag gaaaggccat cgttgaagat
gcctctgccg acagtggtcc 11100caaagatgga cccccaccca cgaggagcat cgtggaaaaa
gaagacgttc caaccacgtc 11160ttcaaagcaa gtggattgat gtgatatctc cactgacgta
agggatgacg cacaatccca 11220ctatccttcg caagaccctt cctctatata aggaagttca
tttcatttgg agaggacacg 11280ctgaaatcac cagtctccaa gcttgcgggg atcgtttcgc
atgattgaac aagatggatt 11340gcacgcaggt tctccggccg cttgggtgga gaggctattc
ggctatgact gggcacaaca 11400gacaatcggc tgctctgatg ccgccgtgtt ccggctgtca
gcgcaggggc gcccggttct 11460ttttgtcaag accgacctgt ccggtgccct gaatgaactg
caggacgagg cagcgcggct 11520atcgtggctg gccacgacgg gcgttccttg cgcagctgtg
ctcgacgttg tcactgaagc 11580gggaagggac tggctgctat tgggcgaagt gccggggcag
gatctcctgt catctcacct 11640tgctcctgcc gagaaagtat ccatcatggc tgatgcaatg
cggcggctgc atacgcttga 11700tccggctacc tgcccattcg accaccaagc gaaacatcgc
atcgagcgag cacgtactcg 11760gatggaagcc ggtcttgtcg atcaggatga tctggacgaa
gagcatcagg ggctcgcgcc 11820agccgaactg ttcgccaggc tcaaggcgcg catgcccgac
ggcgaggatc tcgtcgtgac 11880ccatggcgat gcctgcttgc cgaatatcat ggtggaaaat
ggccgctttt ctggattcat 11940cgactgtggc cggctgggtg tggcggaccg ctatcaggac
atagcgttgg ctacccgtga 12000tattgctgaa gagcttggcg gcgaatgggc tgaccgcttc
ctcgtgcttt acggtatcgc 12060cgctcccgat tcgcagcgca tcgccttcta tcgccttctt
gacgagttct tctgagcggg 12120actctggggt tcgaaatgac cgaccaagcg acgcccaacc
tgccatcacg agatttcgat 12180tccaccgccg ccttctatga aaggttgggc ttcggaatcg
ttttccggga cgccggctgg 12240atgatcctcc agcgcgggga tctcatgctg gagttcttcg
cccaccccgg atcgatccaa 12300cacttacgtt tgcaacgtcc aagagcaaat agaccacgaa
cgccggaagg ttgccgcagc 12360gtgtggattg cgtctcaatt ctctcttgca ggaatgcaat
gatgaatatg atactgacta 12420tgaaactttg agggaatact gcctagcacc gtcacctcat
aacgtgcatc atgcatgccc 12480tgacaacatg gaacatcgct atttttctga agaattatgc
tcgttggagg atgtcgcggc 12540aattgcagct attgccaaca tcgaactacc cctcacgcat
gcattcatca atattattca 12600tgcggggaaa ggcaagatta atccaactgg caaatcatcc
agcgtgattg gtaacttcag 12660ttccagcgac ttgattcgtt ttggtgctac ccacgttttc
aataaggacg agatggtgga 12720gtaaagaagg agtgcgtcga agcagatcgt tcaaacattt
ggcaataaag tttcttaaga 12780ttgaatcctg ttgccggtct tgcgatgatt atcatataat
ttctgttgaa ttacgttaag 12840catgtaataa ttaacatgta atgcatgacg ttatttatga
gatgggtttt tatgattaga 12900gtcccgcaat tatacattta atacgcgata gaaaacaaaa
tatagcgcgc aaactaggat 12960aaattatcgc gcgcggtgtc atctatgtta ctagatcgat
caaacttcgg tactgtgtaa 13020tgacgatgag caatcgagag gctgactaac aaaaggtaca
tcgcgatgga tcgatccatt 13080cgccattcag gctgcgcaac tgttgggaag ggcgatcggt
gcgggcctct tcgctattac 13140gccagctggc gaaaggggga tgtgctgcaa ggcgattaag
ttgggtaacg ccagggtttt 13200cccagtcacg acgttgtaaa acgacggcca gtgaattcct
gcagcccggg ggatccgccc 13260actcgaggcg cgccgtcgac ggatataatg agccgtaaac
aaagatgatt aagtagtaat 13320taatacgtac tagtaaaagt ggcaaaagat aacgagaaag
aaccaatttc tttgcattcg 13380gccttagcgg aaggcatata taagctttga ttattttatt
tagtgtaatg atttcgtaca 13440accaaagcat ttatttagta ctctcacact tgtgtcgcgg
ccggccgcta caggaacagg 13500tggtggcggc cctcggcgcg ctcgtactgc tccacgatgg
tgtagtcctc gttgtgggag 13560gtgatgtcca gcttggagtc cacgtagtag tagccgggca
gctgcacggg cttcttggcc 13620atgtagatgg acttgaactc caccaggtag tggccgccgt
ccttcagctt cagggccttg 13680tggatctcgc ccttcagcac gccgtcgcgg gggtacaggc
gctcggtgga ggcctcccag 13740cccatagtct tcttctgcat tacggggccg tcggagggga
agttcacgcc gatgaacttc 13800accttgtaga tgaaggagcc gtcctgcagg gaggagtcct
gggtcacggt caccacgccg 13860ccgtcctcga agttcatcac gcgctcccac ttgaagccct
cggggaagga cagcttcttg 13920tagtcgggga tgtcggcggg gtgcttcacg tacaccttgg
agccgtactg gaactggggg 13980gacaggatgt cccaggcgaa gggcaggggg ccgcccttgg
tcaccttcag cttggcggtc 14040tgggtgccct cgtaggggcg gccctcgccc tcgccctcga
tctcgaactc gtggccgttc 14100acggagccct ccatgcgcac cttgaagcgc atgaactcct
tgatgacgtc ctcggaggag 14160gccatgggcc gcttgggggg ctatggaaga ctttcttagt
tagttgtgtg aataagcaat 14220gttgggagaa tcgggactac ttataggata ggaataaaac
agaaaagtat taagtgctaa 14280tgaaatattt agactgataa ttaaaatctt cacgtatgtc
cacttgatat aaaaacgtca 14340ggaataaagg aagtacagta gaatttaaag gtactctttt
tatatatacc cgtgttctct 14400ttttggctag ctagttgcat aaaaaataat ctatattttt
atcattattt taaatatctt 14460atgagatggt aaatatttat cataattttt tttactatta
tttattattt gtgtgtgtaa 14520tacatataga agttaattac aaattttatt tactttttca
ttattttgat atgattcacc 14580attaatttag tgttattatt tataatagtt cattttaatc
tttttgtata tattatgcgt 14640gcagtacttt tttcctacat ataactacta ttacatttta
tttatataat atttttatta 14700atgaattttc gtgataatat gtaatattgt tcattattat
ttcagatttt ttaaaaatat 14760ttgtgttatt atttatgaaa tatgtaattt ttttagtatt
tgattttatg atgataaagt 14820gttctaaatt caaaagaagg gggaaagcgt aaacattaaa
aaacgtcatc aaacaaaaac 14880aaaatcttgt taataaagat aaaactgttt gttttgatca
ctgttatttc gtaatataaa 14940aacattattt atatttatat tgttgacaac caaatttgcc
tatcaaatct aaccaatata 15000atgcatgcgt ggcaggtaat gtactaccat gaacttaagt
catgacataa taaaccgtga 15060atctgaccaa tgcatgtacc tanctaaatt gtatttgtga
cacgaagcaa atgattcaat 15120tcacaatgga gatgggaaac aaataatgaa gaacccagaa
ctaagaaagc ttttctgaaa 15180aataaaataa aggcaatgtc aaaagtatac tgcatcatca
gtccagaaag cacatgatat 15240ttttttatca gtatcaatgc agctagtttt attttacaat
atcgatatag ctagtttaaa 15300tatattgcag ctagatttat aaatatttgt gttattattt
atcatttgtg taatcctgtt 15360tttagtattt tagtttatat atgatgataa tgtattccaa
atttaaaaga agggaaataa 15420atttaaacaa gaaaaaaagt catcaaacaa aaaacaaatg
aaagggtgga aagatgttac 15480catgtaatgt gaatgttaca gtatttcttt tattatagag
ttaacaaatt aactaatatg 15540attttgttaa taatgataaa atattttttt tattattatt
tcataatata aaaatagttt 15600acttaatata aaaaaaattc tatcgttcac aacaaagttg
gccacctaat ttaaccatgc 15660atgtacccat ggaccatatt aggtaaccat caaacctgat
gaagagataa agagatgaag 15720acttaagtca taacacaaaa ccataaaaaa caaaaataca
atcaaccgtc aatctgacca 15780atgcatgaaa aagctgcaat agtgagtggc gacacaaagc
acatgatttt cttacaacgg 15840agataaaacc aaaaaaatat ttcatgaaca acctagaaca
aataaagctt ttatataata 15900aatatataaa taaataaagg ctatggaata atatacttca
atatatttgg attaaataaa 15960ttgttggcgg ggttgatata tttatacaca cctaaagtca
cttcaatctc attttcactt 16020aacttttatt ttttttttct ttttatttat cataaagaga
atattgataa tatacttttt 16080aacatatttt tatgacattt tttattggtg aaaacttatt
aaaaatcata aattttgtaa 16140gttagattta tttaaagagt tcctcttctt attttaaatt
ttttaataaa tttttaaata 16200actaaaattt gtgttaaaaa tgttaaaaaa gtgtgttatt
aacccttctc ttcgaggatc 16260cgtacgatcc cacatgcaag tttttatttc aatccctttt
cctttgaata actgaccaag 16320aacaacaaga aaaaaaaaaa aaaagaaaag gatcattttg
aaaggatatt tttcgctcct 16380attcaaatac tgtattttta ccaaaaaaac tgtatttttc
ctacactctc aagctttgtt 16440tttcgcttcg actctcatga tttccttcat atgccaatca
ctctatttat aaatggcata 16500aggtagtgtg aacaattgca aagcttgtca tcaaaagctt
gcaatgtaca aattaatgtt 16560tttcatgcct ttcaaaatta tctgcacccc ctagctatta
atctaacatc taagtaaggc 16620tagtgaattt tttcgaatag tcatgcagtg cattaatttc
cccgtgacta ttttggcttt 16680gactccaaca ctggccccgt acatccgtcc ctcattacat
gaaaagaaat attgtttata 16740ttcttaatta aaaatattgt cccttctaaa ttttcatata
gttaattatt atattacttt 16800tttctctatt ctattagttc tattttcaaa ttattattta
tgcatatgta aagtacatta 16860tatttttgct atatacttaa atatttctaa attattaaaa
aaagactgat atgaaaaatt 16920tattcttttt aaagctatat cattttatat atactttttc
ttttcttttc tttcattttc 16980tattcaattt aataagaaat aaattttgta aatttttatt
tatcaattta taaaaatatt 17040ttactttata tgttttttca catttttgtt aaacaaatca
tatcattatg attgaaagag 17100aggaaattga cagtgagtaa taagtgatga gaaaaaaatg
tgttatttcc taaaaaaaac 17160ctaaacaaac atgtatctac tctctatttc atctatctct
catttcattt ttctctttat 17220ctctttcttt atttttttat catatcattt cacattaatt
atttttactc tctttatttt 17280ttctctctat ccctctctta tttccactca tatatacact
ccaaaattgg ggcatgcctt 17340tatcactact ctatctcctc cactaaatca tttaaatgaa
actgaaaagc attggcaagt 17400ctcctcccct cctcaagtga tttccaactc agcattggca
tctaattgat tcagtatatc 17460tattgcatgt gtaaaagtct ttccacaata cataactatt
aattaatctt aaataaataa 17520aggataaaat attttttttt cttcataaaa ttaaaatatg
ttattttttg tttagatgta 17580tattcgaata aatctaaata tatgataatg attttttata
ttgattaaac atataatcaa 17640tattaaatat gatatttttt tatataggtt gtacacataa
ttttataagg ataaaaaata 17700tgataaaaat aaattttaaa tatttttata tttacgagaa
aaaaaaatat tttagccata 17760aataaatgac cagcatattt tacaacctta gtaattcata
aattcctata tgtatatttg 17820aaattaaaaa cagataatcg ttaagggaag gaatcctacg
tcatctcttg ccatttgttt 17880ttcatgcaaa cagaaaggga cgaaaaacca cctcaccatg
aatcactctt cacaccattt 17940ttactagcaa acaagtctca acaactgaag ccagctctct
ttccgtttct ttttacaaca 18000ctttctttga aatagtagta tttttttttc acatgattta
ttaacgtgcc aaaagatgct 18060tattgaatag agtgcacatt tgtaatgtac tactaattag
aacatgaaaa agcattgttc 18120taacacgata atcctgtgaa ggcgttaact ccaaagatcc
aatttcacta tataaattgt 18180gacgaaagca aaatgaattc acatagctga gagagaaagg
aaaggttaac taagaagcaa 18240tacttcagcg gccgcatgga gagatctcaa cggcagtctc
ctccgccacc gtcgccgtcc 18300tcctcctcgt cctccgtctc cgcggacacc gtcctcgtcc
ctcccggaaa gaggcggagg 18360gcggcgacgg ccaaggccgg cgccgagcct aataagagga
tccgcaagga ccccgccgcc 18420gccgccgcgg ggaagaggag ctccgtctac aggggagtca
ccaggcacag gtggacgggc 18480aggttcgagg cgcatctctg ggacaagcac tgcctcgccg
cgctccacaa caagaagaaa 18540ggcaggcaag tctacctggg ggcgtatgac agcgaggagg
cagctgctcg tgcctatgac 18600ctcgcagctc tcaagtactg gggtcctgag actctgctca
acttccctgt ggaggattac 18660tccagcgaga tgccggagat ggaggccgtg tcccgggagg
agtacctggc ctccctccgc 18720cgcaggagca gcggcttctc caggggcgtc tccaagtaca
gaggcgtcgc caggcatcac 18780cacaacggga ggtgggaggc acggattggg cgagtctttg
ggaacaagta cctctacttg 18840ggaacatttg acactcaaga agaggcagcc aaggcctatg
accttgcggc cattgaatac 18900cgtggcgtca atgctgtaac caacttcgac atcagctgct
acctggacca cccgctgttc 18960ctggcacagc tccaacagga gccacaggtg gtgccggcac
tcaaccaaga acctcaacct 19020gatcagagcg aaaccggaac tacagagcaa gagccggagt
caagcgaagc caagacaccg 19080gatggcagtg cagaacccga tgagaacgcg gtgcctgacg
acaccgcgga gcccctcacc 19140acagtcgacg acagcatcga agagggcttg tggagccctt
gcatggatta cgagctagac 19200accatgtcga gaccaaactt tggcagctca atcaatctga
gcgagtggtt cgctgacgca 19260gacttcgact gcaacatcgg atgcctgttc gatgggtgtt
ctgcggctga cgaaggaagc 19320aaggatggtg taggtctggc agatttcagt ctgtttgagg
caggtgatgt ccagctgaag 19380gatgttcttt cggatatgga agaggggata caacctccag
cgatgatcag tgtgtgcaac 19440gcggccgcat gagccgtaaa ggttcaatac aacgagtgct
tgttttctta gggacaagca 19500ttgtacttat gtatgattct gtgtaaccat gagtcttcca
cgttgtacta atgtgaaggg 19560caaaaataaa acacagaaca agttcgtttt tctcaaataa
tgtgaaggta gaaaatggaa 19620ccatgcctcc tctcttgcat gtgatttaaa atattagcag
atggtaccgt acgagatccg 19680gccggccaga tcctgcagga gatccaagct tgg
197133310287DNAArtificial Sequencevector pKR1144
33gtacgagatc cggccggcca gatcctgcag gagatccaag cttggcgcgc cgttctatag
60tgtcacctaa atcgtatgtg tatgatacat aaggttatgt attaattgta gccgcgttct
120aacgacaata tgtccatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta
180agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg
240gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca
300ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt
360aatgtcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta
420gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa
480acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt
540tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag
600ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta
660atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca
720agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag
780cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gcattgagaa
840agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga
900acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc
960gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc
1020ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt
1080gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt
1140gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag
1200gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa
1260tgcaggttga tcgattcgac atcgatctag taacatagat gacaccgcgc gcgataattt
1320atcctagttt gcgcgctata ttttgttttc tatcgcgtat taaatgtata attgcgggac
1380tctaatcata aaaacccatc tcataaataa cgtcatgcat tacatgttaa ttattacatg
1440cttaacgtaa ttcaacagaa attatatgat aatcatcgca agaccggcaa caggattcaa
1500tcttaagaaa ctttattgcc aaatgtttga acgatctgct tcgacgcact ccttctttag
1560gtacctcact attcctttgc cctcggacga gtgctggggc gtcggtttcc actatcggcg
1620agtacttcta cacagccatc ggtccagacg gccgcgcttc tgcgggcgat ttgtgtacgc
1680ccgacagtcc cggctccgga tcggacgatt gcgtcgcatc gaccctgcgc ccaagctgca
1740tcatcgaaat tgccgtcaac caagctctga tagagttggt caagaccaat gcggagcata
1800tacgcccgga gccgcggcga tcctgcaagc tccggatgcc tccgctcgaa gtagcgcgtc
1860tgctgctcca tacaagccaa ccacggcctc cagaagaaga tgttggcgac ctcgtattgg
1920gaatccccga acatcgcctc gctccagtca atgaccgctg ttatgcggcc attgtccgtc
1980aggacattgt tggagccgaa atccgcgtgc acgaggtgcc ggacttcggg gcagtcctcg
2040gcccaaagca tcagctcatc gagagcctgc gcgacggacg cactgacggt gtcgtccatc
2100acagtttgcc agtgatacac atggggatca gcaatcgcgc atatgaaatc acgccatgta
2160gtgtattgac cgattccttg cggtccgaat gggccgaacc cgctcgtctg gctaagatcg
2220gccgcagcga tcgcatccat ggcctccgcg accggctgca gaacagcggg cagttcggtt
2280tcaggcaggt cttgcaacgt gacaccctgt gcacggcggg agatgcaata ggtcaggctc
2340tcgctgaatt ccccaatgtc aagcacttcc ggaatcggga gcgcggccga tgcaaagtgc
2400cgataaacat aacgatcttt gtagaaacca tcggcgcagc tatttacccg caggacatat
2460ccacgccctc ctacatcgaa gctgaaagca cgagattctt cgccctccga gagctgcatc
2520aggtcggaga cgctgtcgaa cttttcgatc agaaacttct cgacagacgt cgcggtgagt
2580tcaggctttt tcatggttta ataagaagag aaaagagttc ttttgttatg gctgaagtaa
2640tagagaaatg agctcgagcg tgtcctctcc aaatgaaatg aacttcctta tatagaggaa
2700gggtcttgcg aaggatagtg ggattgtgcg tcatccctta cgtcagtgga gatgtcacat
2760caatccactt gctttgaaga cgtggttgga acgtcttctt tttccacgat gctcctcgtg
2820ggtgggggtc catctttggg accactgtcg gcagaggcat cttgaatgat agcctttcct
2880ttatcgcaat gatggcattt gtaggagcca ccttcctttt ctactgtcct ttcgatgaag
2940tgacagatag ctgggcaatg gaatccgagg aggtttcccg aaattatcct ttgttgaaaa
3000gtctcaatag ccctttggtc ttctgagact gtatctttga catttttgga gtagaccaga
3060gtgtcgtgct ccaccatgtt gacgaagatt ttcttcttgt cattgagtcg taaaagactc
3120tgtatgaact gttcgccagt cttcacggcg agttctgtta gatcctcgat ttgaatctta
3180gactccatgc atggccttag attcagtagg aactaccttt ttagagactc caatctctat
3240tacttgcctt ggtttatgaa gcaagccttg aatcgtccat actggaatag tacttctgat
3300cttgagaaat atgtctttct ctgtgttctt gatgcaatta gtcctgaatc ttttgactgc
3360atctttaacc ttcttgggaa ggtatttgat ctcctggaga ttgttactcg ggtagatcgt
3420cttgatgaga cctgctgcgt aggcctctct aaccatctgt gggtcagcat tctttctgaa
3480attgaagagg ctaaccttct cattatcagt ggtgaacata gtgtcgtcac cttcaccttc
3540gaacttcctt cctagatcgt aaagatagag gaaatcgtcc attgtaatct ccggggcaaa
3600ggagatctct tttggggctg gatcactgct gggccttttg gttcctagcg tgagccagtg
3660ggctttttgc tttggtgggc ttgttagggc cttagcaaag ctcttgggct tgagttgagc
3720ttctcctttg gggatgaagt tcaacctgtc tgtttgctga cttgttgtgt acgcgtcagc
3780tgctgctctt gcctctgtaa tagtggcaaa tttcttgtgt gcaactccgg gaacgccgtt
3840tgttgccgcc tttgtacaac cccagtcatc gtatataccg gcatgtggac cgttatacac
3900aacgtagtag ttgatatgag ggtgttgaat acccgattct gctctgagag gagcaactgt
3960gctgttaagc tcagattttt gtgggattgg aattggatcg atctcgatcc cgcgaaatta
4020atacgactca ctatagggag accacaacgg tttccctcta gaaataattt tgtttaactt
4080taagaaggag atatacccat ggaaaagcct gaactcaccg cgacgtctgt cgagaagttt
4140ctgatcgaaa agttcgacag cgtctccgac ctgatgcagc tctcggaggg cgaagaatct
4200cgtgctttca gcttcgatgt aggagggcgt ggatatgtcc tgcgggtaaa tagctgcgcc
4260gatggtttct acaaagatcg ttatgtttat cggcactttg catcggccgc gctcccgatt
4320ccggaagtgc ttgacattgg ggaattcagc gagagcctga cctattgcat ctcccgccgt
4380gcacagggtg tcacgttgca agacctgcct gaaaccgaac tgcccgctgt tctgcagccg
4440gtcgcggagg ctatggatgc gatcgctgcg gccgatctta gccagacgag cgggttcggc
4500ccattcggac cgcaaggaat cggtcaatac actacatggc gtgatttcat atgcgcgatt
4560gctgatcccc atgtgtatca ctggcaaact gtgatggacg acaccgtcag tgcgtccgtc
4620gcgcaggctc tcgatgagct gatgctttgg gccgaggact gccccgaagt ccggcacctc
4680gtgcacgcgg atttcggctc caacaatgtc ctgacggaca atggccgcat aacagcggtc
4740attgactgga gcgaggcgat gttcggggat tcccaatacg aggtcgccaa catcttcttc
4800tggaggccgt ggttggcttg tatggagcag cagacgcgct acttcgagcg gaggcatccg
4860gagcttgcag gatcgccgcg gctccgggcg tatatgctcc gcattggtct tgaccaactc
4920tatcagagct tggttgacgg caatttcgat gatgcagctt gggcgcaggg tcgatgcgac
4980gcaatcgtcc gatccggagc cgggactgtc gggcgtacac aaatcgcccg cagaagcgcg
5040gccgtctgga ccgatggctg tgtagaagta ctcgccgata gtggaaaccg acgccccagc
5100actcgtccga gggcaaagga atagtgaggt acagcttgga tcgatccggc tgctaacaaa
5160gcccgaaagg aagctgagtt ggctgctgcc accgctgagc aataactagc ataacccctt
5220ggggcctcta aacgggtctt gaggggtttt ttgctgaaag gaggaactat atccggatga
5280tcgggcgcgc cgtcgacgga tccactagtt ctagagcggc ccgcgccgtc gacggatata
5340atgagccgta aacaaagatg attaagtagt aattaatacg tactagtaaa agtggcaaaa
5400gataacgaga aagaaccaat ttctttgcat tcggccttag cggaaggcat atataagctt
5460tgattatttt atttagtgta atgatttcgt acaaccaaag catttattta gtactctcac
5520acttgtgtcg cggccggccg ctacaggaac aggtggtggc ggccctcggc gcgctcgtac
5580tgctccacga tggtgtagtc ctcgttgtgg gaggtgatgt ccagcttgga gtccacgtag
5640tagtagccgg gcagctgcac gggcttcttg gccatgtaga tggacttgaa ctccaccagg
5700tagtggccgc cgtccttcag cttcagggcc ttgtggatct cgcccttcag cacgccgtcg
5760cgggggtaca ggcgctcggt ggaggcctcc cagcccatag tcttcttctg cattacgggg
5820ccgtcggagg ggaagttcac gccgatgaac ttcaccttgt agatgaagga gccgtcctgc
5880agggaggagt cctgggtcac ggtcaccacg ccgccgtcct cgaagttcat cacgcgctcc
5940cacttgaagc cctcggggaa ggacagcttc ttgtagtcgg ggatgtcggc ggggtgcttc
6000acgtacacct tggagccgta ctggaactgg ggggacagga tgtcccaggc gaagggcagg
6060gggccgccct tggtcacctt cagcttggcg gtctgggtgc cctcgtaggg gcggccctcg
6120ccctcgccct cgatctcgaa ctcgtggccg ttcacggagc cctccatgcg caccttgaag
6180cgcatgaact ccttgatgac gtcctcggag gaggccatgg gccgcttggg gggctatgga
6240agactttctt agttagttgt gtgaataagc aatgttggga gaatcgggac tacttatagg
6300ataggaataa aacagaaaag tattaagtgc taatgaaata tttagactga taattaaaat
6360cttcacgtat gtccacttga tataaaaacg tcaggaataa aggaagtaca gtagaattta
6420aaggtactct ttttatatat acccgtgttc tctttttggc tagctagttg cataaaaaat
6480aatctatatt tttatcatta ttttaaatat cttatgagat ggtaaatatt tatcataatt
6540ttttttacta ttatttatta tttgtgtgtg taatacatat agaagttaat tacaaatttt
6600atttactttt tcattatttt gatatgattc accattaatt tagtgttatt atttataata
6660gttcatttta atctttttgt atatattatg cgtgcagtac ttttttccta catataacta
6720ctattacatt ttatttatat aatattttta ttaatgaatt ttcgtgataa tatgtaatat
6780tgttcattat tatttcagat tttttaaaaa tatttgtgtt attatttatg aaatatgtaa
6840tttttttagt atttgatttt atgatgataa agtgttctaa attcaaaaga agggggaaag
6900cgtaaacatt aaaaaacgtc atcaaacaaa aacaaaatct tgttaataaa gataaaactg
6960tttgttttga tcactgttat ttcgtaatat aaaaacatta tttatattta tattgttgac
7020aaccaaattt gcctatcaaa tctaaccaat ataatgcatg cgtggcaggt aatgtactac
7080catgaactta agtcatgaca taataaaccg tgaatctgac caatgcatgt acctanctaa
7140attgtatttg tgacacgaag caaatgattc aattcacaat ggagatggga aacaaataat
7200gaagaaccca gaactaagaa agcttttctg aaaaataaaa taaaggcaat gtcaaaagta
7260tactgcatca tcagtccaga aagcacatga tattttttta tcagtatcaa tgcagctagt
7320tttattttac aatatcgata tagctagttt aaatatattg cagctagatt tataaatatt
7380tgtgttatta tttatcattt gtgtaatcct gtttttagta ttttagttta tatatgatga
7440taatgtattc caaatttaaa agaagggaaa taaatttaaa caagaaaaaa agtcatcaaa
7500caaaaaacaa atgaaagggt ggaaagatgt taccatgtaa tgtgaatgtt acagtatttc
7560ttttattata gagttaacaa attaactaat atgattttgt taataatgat aaaatatttt
7620ttttattatt atttcataat ataaaaatag tttacttaat ataaaaaaaa ttctatcgtt
7680cacaacaaag ttggccacct aatttaacca tgcatgtacc catggaccat attaggtaac
7740catcaaacct gatgaagaga taaagagatg aagacttaag tcataacaca aaaccataaa
7800aaacaaaaat acaatcaacc gtcaatctga ccaatgcatg aaaaagctgc aatagtgagt
7860ggcgacacaa agcacatgat tttcttacaa cggagataaa accaaaaaaa tatttcatga
7920acaacctaga acaaataaag cttttatata ataaatatat aaataaataa aggctatgga
7980ataatatact tcaatatatt tggattaaat aaattgttgg cggggttgat atatttatac
8040acacctaaag tcacttcaat ctcattttca cttaactttt attttttttt tctttttatt
8100tatcataaag agaatattga taatatactt tttaacatat ttttatgaca ttttttattg
8160gtgaaaactt attaaaaatc ataaattttg taagttagat ttatttaaag agttcctctt
8220cttattttaa attttttaat aaatttttaa ataactaaaa tttgtgttaa aaatgttaaa
8280aaagtgtgtt attaaccctt ctcttcgagg atccgtaccg agctcggatc ctctagaaat
8340ccgtcaacat ggtggagcac gacactctcg tctactccaa gaatatcaaa gatacagtct
8400cagaagacca aagggctatt gagacttttc aacaaagggt aatatcggga aacctcctcg
8460gattccattg cccagctatc tgtcacttca tcaaaaggac agtagaaaag gaaggtggca
8520cctacaaatg ccatcattgc gataaaggaa aggctatcgt tcaagatgcc tctgccgaca
8580gtggtcccaa agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa
8640ccacgtcttc aaagcaagtg gattgatgtg atgatcctat gcgtatggta tgacgtgtgt
8700tcaagatgat gacttcaaac ctacctatga cgtatggtat gaacgtgtgt cgactgatga
8760cttagatcca ctcgagcggc tataaatacg tacctacgca ccctgcgcta ccatccctag
8820agctgcagct tatttttaca acaattacca acaacaacaa acaacaaaca acattacaat
8880tactatttac aattacagtc gacccgggat cgtacctcta gggtggcggc cgcaagtatg
8940aactaaaatg catgtaggtg taagagctca tggagagcat ggaatattgt atccgaccat
9000gtaacagtat aataactgag ctccatctca cttcttctat gaataaacaa aggatgttat
9060gatatattaa cactctatct atgcacctta ttgttctatg ataaatttcc tcttattatt
9120ataaatcatc tgaatcgtga cggcttatgg aatgcttcaa atagtacaaa aacaaatgtg
9180tactataaga ctttctaaac aattctaacc ttagcattgt gaacgagaca taagtgttaa
9240gaagacataa caattataat ggaagaagtt tgtctccatt tatatattat atattaccca
9300cttatgtatt atattaggat gttaaggaga cataacaatt ataaagagag aagtttgtat
9360ccatttatat attatatact acccatttat atattatact tatccactta tttaatgtct
9420ttataaggtt tgatccatga tatttctaat attttagttg atatgtatat gaaagggtac
9480tatttgaact ctcttactct gtataaaggt tggatcatcc ttaaagtggg tctatttaat
9540tttattgctt cttacagata aaaaaaaaat tatgagttgg tttgataaaa tattgaagga
9600tttaaaataa taataaataa catataatat atgtatataa atttattata atataacatt
9660tatctataaa aaagtaaata ttgtcataaa tctatacaat cgtttagcct tgctggacga
9720atctcaatta tttaaacgag agtaaacata tttgactttt tggttattta acaaattatt
9780atttaacact atatgaaatt ttttttttta tcagcaaaga ataaaattaa attaagaagg
9840acaatggtgt cccaatcctt atacaaccaa cttccacaag aaagtcaagt cagagacaac
9900aaaaaaacaa gcaaaggaaa ttttttaatt tgagttgtct tgtttgctgc ataatttatg
9960cagtaaaaca ctacacataa cccttttagc agtagagcaa tggttgaccg tgtgcttagc
10020ttcttttatt ttattttttt atcagcaaag aataaataaa ataaaatgag acacttcagg
10080gatgtttcaa caagctctag agggcccaat tcgccctata gtgagtcgta ttacaattca
10140ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc
10200cttgcagcac atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc
10260ccttcccaac agttgcgcag cctatac
102873411480DNAArtificial Sequencevector pKR1149 34ggccgcaagt atgaactaaa
atgcatgtag gtgtaagagc tcatggagag catggaatat 60tgtatccgac catgtaacag
tataataact gagctccatc tcacttcttc tatgaataaa 120caaaggatgt tatgatatat
taacactcta tctatgcacc ttattgttct atgataaatt 180tcctcttatt attataaatc
atctgaatcg tgacggctta tggaatgctt caaatagtac 240aaaaacaaat gtgtactata
agactttcta aacaattcta accttagcat tgtgaacgag 300acataagtgt taagaagaca
taacaattat aatggaagaa gtttgtctcc atttatatat 360tatatattac ccacttatgt
attatattag gatgttaagg agacataaca attataaaga 420gagaagtttg tatccattta
tatattatat actacccatt tatatattat acttatccac 480ttatttaatg tctttataag
gtttgatcca tgatatttct aatattttag ttgatatgta 540tatgaaaggg tactatttga
actctcttac tctgtataaa ggttggatca tccttaaagt 600gggtctattt aattttattg
cttcttacag ataaaaaaaa aattatgagt tggtttgata 660aaatattgaa ggatttaaaa
taataataaa taacatataa tatatgtata taaatttatt 720ataatataac atttatctat
aaaaaagtaa atattgtcat aaatctatac aatcgtttag 780ccttgctgga cgaatctcaa
ttatttaaac gagagtaaac atatttgact ttttggttat 840ttaacaaatt attatttaac
actatatgaa attttttttt ttatcagcaa agaataaaat 900taaattaaga aggacaatgg
tgtcccaatc cttatacaac caacttccac aagaaagtca 960agtcagagac aacaaaaaaa
caagcaaagg aaatttttta atttgagttg tcttgtttgc 1020tgcataattt atgcagtaaa
acactacaca taaccctttt agcagtagag caatggttga 1080ccgtgtgctt agcttctttt
attttatttt tttatcagca aagaataaat aaaataaaat 1140gagacacttc agggatgttt
caacaagctc tagagggccc aattcgccct atagtgagtc 1200gtattacaat tcactggccg
tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac 1260ccaacttaat cgccttgcag
cacatccccc tttcgccagc tggcgtaata gcgaagaggc 1320ccgcaccgat cgcccttccc
aacagttgcg cagcctatac gtacgagatc cggccggcca 1380gatcctgcag gagatccaag
cttggcgcgc cgttctatag tgtcacctaa atcgtatgtg 1440tatgatacat aaggttatgt
attaattgta gccgcgttct aacgacaata tgtccatatg 1500gtgcactctc agtacaatct
gctctgatgc cgcatagtta agccagcccc gacacccgcc 1560aacacccgct gacgcgccct
gacgggcttg tctgctcccg gcatccgctt acagacaagc 1620tgtgaccgtc tccgggagct
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc 1680gagacgaaag ggcctcgtga
tacgcctatt tttataggtt aatgtcatga ccaaaatccc 1740ttaacgtgag ttttcgttcc
actgagcgtc agaccccgta gaaaagatca aaggatcttc 1800ttgagatcct ttttttctgc
gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc 1860agcggtggtt tgtttgccgg
atcaagagct accaactctt tttccgaagg taactggctt 1920cagcagagcg cagataccaa
atactgtcct tctagtgtag ccgtagttag gccaccactt 1980caagaactct gtagcaccgc
ctacatacct cgctctgcta atcctgttac cagtggctgc 2040tgccagtggc gataagtcgt
gtcttaccgg gttggactca agacgatagt taccggataa 2100ggcgcagcgg tcgggctgaa
cggggggttc gtgcacacag cccagcttgg agcgaacgac 2160ctacaccgaa ctgagatacc
tacagcgtga gcattgagaa agcgccacgc ttcccgaagg 2220gagaaaggcg gacaggtatc
cggtaagcgg cagggtcgga acaggagagc gcacgaggga 2280gcttccaggg ggaaacgcct
ggtatcttta tagtcctgtc gggtttcgcc acctctgact 2340tgagcgtcga tttttgtgat
gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa 2400cgcggccttt ttacggttcc
tggccttttg ctggcctttt gctcacatgt tctttcctgc 2460gttatcccct gattctgtgg
ataaccgtat taccgccttt gagtgagctg ataccgctcg 2520ccgcagccga acgaccgagc
gcagcgagtc agtgagcgag gaagcggaag agcgcccaat 2580acgcaaaccg cctctccccg
cgcgttggcc gattcattaa tgcaggttga tcgattcgac 2640atcgatctag taacatagat
gacaccgcgc gcgataattt atcctagttt gcgcgctata 2700ttttgttttc tatcgcgtat
taaatgtata attgcgggac tctaatcata aaaacccatc 2760tcataaataa cgtcatgcat
tacatgttaa ttattacatg cttaacgtaa ttcaacagaa 2820attatatgat aatcatcgca
agaccggcaa caggattcaa tcttaagaaa ctttattgcc 2880aaatgtttga acgatctgct
tcgacgcact ccttctttag gtacctcact attcctttgc 2940cctcggacga gtgctggggc
gtcggtttcc actatcggcg agtacttcta cacagccatc 3000ggtccagacg gccgcgcttc
tgcgggcgat ttgtgtacgc ccgacagtcc cggctccgga 3060tcggacgatt gcgtcgcatc
gaccctgcgc ccaagctgca tcatcgaaat tgccgtcaac 3120caagctctga tagagttggt
caagaccaat gcggagcata tacgcccgga gccgcggcga 3180tcctgcaagc tccggatgcc
tccgctcgaa gtagcgcgtc tgctgctcca tacaagccaa 3240ccacggcctc cagaagaaga
tgttggcgac ctcgtattgg gaatccccga acatcgcctc 3300gctccagtca atgaccgctg
ttatgcggcc attgtccgtc aggacattgt tggagccgaa 3360atccgcgtgc acgaggtgcc
ggacttcggg gcagtcctcg gcccaaagca tcagctcatc 3420gagagcctgc gcgacggacg
cactgacggt gtcgtccatc acagtttgcc agtgatacac 3480atggggatca gcaatcgcgc
atatgaaatc acgccatgta gtgtattgac cgattccttg 3540cggtccgaat gggccgaacc
cgctcgtctg gctaagatcg gccgcagcga tcgcatccat 3600ggcctccgcg accggctgca
gaacagcggg cagttcggtt tcaggcaggt cttgcaacgt 3660gacaccctgt gcacggcggg
agatgcaata ggtcaggctc tcgctgaatt ccccaatgtc 3720aagcacttcc ggaatcggga
gcgcggccga tgcaaagtgc cgataaacat aacgatcttt 3780gtagaaacca tcggcgcagc
tatttacccg caggacatat ccacgccctc ctacatcgaa 3840gctgaaagca cgagattctt
cgccctccga gagctgcatc aggtcggaga cgctgtcgaa 3900cttttcgatc agaaacttct
cgacagacgt cgcggtgagt tcaggctttt tcatggttta 3960ataagaagag aaaagagttc
ttttgttatg gctgaagtaa tagagaaatg agctcgagcg 4020tgtcctctcc aaatgaaatg
aacttcctta tatagaggaa gggtcttgcg aaggatagtg 4080ggattgtgcg tcatccctta
cgtcagtgga gatgtcacat caatccactt gctttgaaga 4140cgtggttgga acgtcttctt
tttccacgat gctcctcgtg ggtgggggtc catctttggg 4200accactgtcg gcagaggcat
cttgaatgat agcctttcct ttatcgcaat gatggcattt 4260gtaggagcca ccttcctttt
ctactgtcct ttcgatgaag tgacagatag ctgggcaatg 4320gaatccgagg aggtttcccg
aaattatcct ttgttgaaaa gtctcaatag ccctttggtc 4380ttctgagact gtatctttga
catttttgga gtagaccaga gtgtcgtgct ccaccatgtt 4440gacgaagatt ttcttcttgt
cattgagtcg taaaagactc tgtatgaact gttcgccagt 4500cttcacggcg agttctgtta
gatcctcgat ttgaatctta gactccatgc atggccttag 4560attcagtagg aactaccttt
ttagagactc caatctctat tacttgcctt ggtttatgaa 4620gcaagccttg aatcgtccat
actggaatag tacttctgat cttgagaaat atgtctttct 4680ctgtgttctt gatgcaatta
gtcctgaatc ttttgactgc atctttaacc ttcttgggaa 4740ggtatttgat ctcctggaga
ttgttactcg ggtagatcgt cttgatgaga cctgctgcgt 4800aggcctctct aaccatctgt
gggtcagcat tctttctgaa attgaagagg ctaaccttct 4860cattatcagt ggtgaacata
gtgtcgtcac cttcaccttc gaacttcctt cctagatcgt 4920aaagatagag gaaatcgtcc
attgtaatct ccggggcaaa ggagatctct tttggggctg 4980gatcactgct gggccttttg
gttcctagcg tgagccagtg ggctttttgc tttggtgggc 5040ttgttagggc cttagcaaag
ctcttgggct tgagttgagc ttctcctttg gggatgaagt 5100tcaacctgtc tgtttgctga
cttgttgtgt acgcgtcagc tgctgctctt gcctctgtaa 5160tagtggcaaa tttcttgtgt
gcaactccgg gaacgccgtt tgttgccgcc tttgtacaac 5220cccagtcatc gtatataccg
gcatgtggac cgttatacac aacgtagtag ttgatatgag 5280ggtgttgaat acccgattct
gctctgagag gagcaactgt gctgttaagc tcagattttt 5340gtgggattgg aattggatcg
atctcgatcc cgcgaaatta atacgactca ctatagggag 5400accacaacgg tttccctcta
gaaataattt tgtttaactt taagaaggag atatacccat 5460ggaaaagcct gaactcaccg
cgacgtctgt cgagaagttt ctgatcgaaa agttcgacag 5520cgtctccgac ctgatgcagc
tctcggaggg cgaagaatct cgtgctttca gcttcgatgt 5580aggagggcgt ggatatgtcc
tgcgggtaaa tagctgcgcc gatggtttct acaaagatcg 5640ttatgtttat cggcactttg
catcggccgc gctcccgatt ccggaagtgc ttgacattgg 5700ggaattcagc gagagcctga
cctattgcat ctcccgccgt gcacagggtg tcacgttgca 5760agacctgcct gaaaccgaac
tgcccgctgt tctgcagccg gtcgcggagg ctatggatgc 5820gatcgctgcg gccgatctta
gccagacgag cgggttcggc ccattcggac cgcaaggaat 5880cggtcaatac actacatggc
gtgatttcat atgcgcgatt gctgatcccc atgtgtatca 5940ctggcaaact gtgatggacg
acaccgtcag tgcgtccgtc gcgcaggctc tcgatgagct 6000gatgctttgg gccgaggact
gccccgaagt ccggcacctc gtgcacgcgg atttcggctc 6060caacaatgtc ctgacggaca
atggccgcat aacagcggtc attgactgga gcgaggcgat 6120gttcggggat tcccaatacg
aggtcgccaa catcttcttc tggaggccgt ggttggcttg 6180tatggagcag cagacgcgct
acttcgagcg gaggcatccg gagcttgcag gatcgccgcg 6240gctccgggcg tatatgctcc
gcattggtct tgaccaactc tatcagagct tggttgacgg 6300caatttcgat gatgcagctt
gggcgcaggg tcgatgcgac gcaatcgtcc gatccggagc 6360cgggactgtc gggcgtacac
aaatcgcccg cagaagcgcg gccgtctgga ccgatggctg 6420tgtagaagta ctcgccgata
gtggaaaccg acgccccagc actcgtccga gggcaaagga 6480atagtgaggt acagcttgga
tcgatccggc tgctaacaaa gcccgaaagg aagctgagtt 6540ggctgctgcc accgctgagc
aataactagc ataacccctt ggggcctcta aacgggtctt 6600gaggggtttt ttgctgaaag
gaggaactat atccggatga tcgggcgcgc cgtcgacgga 6660tccactagtt ctagagcggc
ccgcgccgtc gacggatata atgagccgta aacaaagatg 6720attaagtagt aattaatacg
tactagtaaa agtggcaaaa gataacgaga aagaaccaat 6780ttctttgcat tcggccttag
cggaaggcat atataagctt tgattatttt atttagtgta 6840atgatttcgt acaaccaaag
catttattta gtactctcac acttgtgtcg cggccggccg 6900ctacaggaac aggtggtggc
ggccctcggc gcgctcgtac tgctccacga tggtgtagtc 6960ctcgttgtgg gaggtgatgt
ccagcttgga gtccacgtag tagtagccgg gcagctgcac 7020gggcttcttg gccatgtaga
tggacttgaa ctccaccagg tagtggccgc cgtccttcag 7080cttcagggcc ttgtggatct
cgcccttcag cacgccgtcg cgggggtaca ggcgctcggt 7140ggaggcctcc cagcccatag
tcttcttctg cattacgggg ccgtcggagg ggaagttcac 7200gccgatgaac ttcaccttgt
agatgaagga gccgtcctgc agggaggagt cctgggtcac 7260ggtcaccacg ccgccgtcct
cgaagttcat cacgcgctcc cacttgaagc cctcggggaa 7320ggacagcttc ttgtagtcgg
ggatgtcggc ggggtgcttc acgtacacct tggagccgta 7380ctggaactgg ggggacagga
tgtcccaggc gaagggcagg gggccgccct tggtcacctt 7440cagcttggcg gtctgggtgc
cctcgtaggg gcggccctcg ccctcgccct cgatctcgaa 7500ctcgtggccg ttcacggagc
cctccatgcg caccttgaag cgcatgaact ccttgatgac 7560gtcctcggag gaggccatgg
gccgcttggg gggctatgga agactttctt agttagttgt 7620gtgaataagc aatgttggga
gaatcgggac tacttatagg ataggaataa aacagaaaag 7680tattaagtgc taatgaaata
tttagactga taattaaaat cttcacgtat gtccacttga 7740tataaaaacg tcaggaataa
aggaagtaca gtagaattta aaggtactct ttttatatat 7800acccgtgttc tctttttggc
tagctagttg cataaaaaat aatctatatt tttatcatta 7860ttttaaatat cttatgagat
ggtaaatatt tatcataatt ttttttacta ttatttatta 7920tttgtgtgtg taatacatat
agaagttaat tacaaatttt atttactttt tcattatttt 7980gatatgattc accattaatt
tagtgttatt atttataata gttcatttta atctttttgt 8040atatattatg cgtgcagtac
ttttttccta catataacta ctattacatt ttatttatat 8100aatattttta ttaatgaatt
ttcgtgataa tatgtaatat tgttcattat tatttcagat 8160tttttaaaaa tatttgtgtt
attatttatg aaatatgtaa tttttttagt atttgatttt 8220atgatgataa agtgttctaa
attcaaaaga agggggaaag cgtaaacatt aaaaaacgtc 8280atcaaacaaa aacaaaatct
tgttaataaa gataaaactg tttgttttga tcactgttat 8340ttcgtaatat aaaaacatta
tttatattta tattgttgac aaccaaattt gcctatcaaa 8400tctaaccaat ataatgcatg
cgtggcaggt aatgtactac catgaactta agtcatgaca 8460taataaaccg tgaatctgac
caatgcatgt acctanctaa attgtatttg tgacacgaag 8520caaatgattc aattcacaat
ggagatggga aacaaataat gaagaaccca gaactaagaa 8580agcttttctg aaaaataaaa
taaaggcaat gtcaaaagta tactgcatca tcagtccaga 8640aagcacatga tattttttta
tcagtatcaa tgcagctagt tttattttac aatatcgata 8700tagctagttt aaatatattg
cagctagatt tataaatatt tgtgttatta tttatcattt 8760gtgtaatcct gtttttagta
ttttagttta tatatgatga taatgtattc caaatttaaa 8820agaagggaaa taaatttaaa
caagaaaaaa agtcatcaaa caaaaaacaa atgaaagggt 8880ggaaagatgt taccatgtaa
tgtgaatgtt acagtatttc ttttattata gagttaacaa 8940attaactaat atgattttgt
taataatgat aaaatatttt ttttattatt atttcataat 9000ataaaaatag tttacttaat
ataaaaaaaa ttctatcgtt cacaacaaag ttggccacct 9060aatttaacca tgcatgtacc
catggaccat attaggtaac catcaaacct gatgaagaga 9120taaagagatg aagacttaag
tcataacaca aaaccataaa aaacaaaaat acaatcaacc 9180gtcaatctga ccaatgcatg
aaaaagctgc aatagtgagt ggcgacacaa agcacatgat 9240tttcttacaa cggagataaa
accaaaaaaa tatttcatga acaacctaga acaaataaag 9300cttttatata ataaatatat
aaataaataa aggctatgga ataatatact tcaatatatt 9360tggattaaat aaattgttgg
cggggttgat atatttatac acacctaaag tcacttcaat 9420ctcattttca cttaactttt
attttttttt tctttttatt tatcataaag agaatattga 9480taatatactt tttaacatat
ttttatgaca ttttttattg gtgaaaactt attaaaaatc 9540ataaattttg taagttagat
ttatttaaag agttcctctt cttattttaa attttttaat 9600aaatttttaa ataactaaaa
tttgtgttaa aaatgttaaa aaagtgtgtt attaaccctt 9660ctcttcgagg atccgtaccg
agctcggatc ctctagaaat ccgtcaacat ggtggagcac 9720gacactctcg tctactccaa
gaatatcaaa gatacagtct cagaagacca aagggctatt 9780gagacttttc aacaaagggt
aatatcggga aacctcctcg gattccattg cccagctatc 9840tgtcacttca tcaaaaggac
agtagaaaag gaaggtggca cctacaaatg ccatcattgc 9900gataaaggaa aggctatcgt
tcaagatgcc tctgccgaca gtggtcccaa agatggaccc 9960ccacccacga ggagcatcgt
ggaaaaagaa gacgttccaa ccacgtcttc aaagcaagtg 10020gattgatgtg atgatcctat
gcgtatggta tgacgtgtgt tcaagatgat gacttcaaac 10080ctacctatga cgtatggtat
gaacgtgtgt cgactgatga cttagatcca ctcgagcggc 10140tataaatacg tacctacgca
ccctgcgcta ccatccctag agctgcagct tatttttaca 10200acaattacca acaacaacaa
acaacaaaca acattacaat tactatttac aattacagtc 10260gacccgggat cgtacctcta
gggtggcggc cgcatggaga gatctcaacg gcagtctcct 10320ccgccaccgt cgccgtcctc
ctcctcgtcc tccgtctccg cggacaccgt cctcgtccct 10380cccggaaaga ggcggagggc
ggcgacggcc aaggccggcg ccgagcctaa taagaggatc 10440cgcaaggacc ccgccgccgc
cgccgcgggg aagaggagct ccgtctacag gggagtcacc 10500aggcacaggt ggacgggcag
gttcgaggcg catctctggg acaagcactg cctcgccgcg 10560ctccacaaca agaagaaagg
caggcaagtc tacctggggg cgtatgacag cgaggaggca 10620gctgctcgtg cctatgacct
cgcagctctc aagtactggg gtcctgagac tctgctcaac 10680ttccctgtgg aggattactc
cagcgagatg ccggagatgg aggccgtgtc ccgggaggag 10740tacctggcct ccctccgccg
caggagcagc ggcttctcca ggggcgtctc caagtacaga 10800ggcgtcgcca ggcatcacca
caacgggagg tgggaggcac ggattgggcg agtctttggg 10860aacaagtacc tctacttggg
aacatttgac actcaagaag aggcagccaa ggcctatgac 10920cttgcggcca ttgaataccg
tggcgtcaat gctgtaacca acttcgacat cagctgctac 10980ctggaccacc cgctgttcct
ggcacagctc caacaggagc cacaggtggt gccggcactc 11040aaccaagaac ctcaacctga
tcagagcgaa accggaacta cagagcaaga gccggagtca 11100agcgaagcca agacaccgga
tggcagtgca gaacccgatg agaacgcggt gcctgacgac 11160accgcggagc ccctcaccac
agtcgacgac agcatcgaag agggcttgtg gagcccttgc 11220atggattacg agctagacac
catgtcgaga ccaaactttg gcagctcaat caatctgagc 11280gagtggttcg ctgacgcaga
cttcgactgc aacatcggat gcctgttcga tgggtgttct 11340gcggctgacg aaggaagcaa
ggatggtgta ggtctggcag atttcagtct gtttgaggca 11400ggtgatgtcc agctgaagga
tgttctttcg gatatggaag aggggataca acctccagcg 11460atgatcagtg tgtgcaacgc
114803519472DNAArtificial
Sequencevector pKR1221 35cgcgccagat cctctagagt cgacctgcag gcatgcaagc
ttggcgtaat catggtcata 60gctgtttcct gtgtgaaatt gttatccgct cacaattcca
cacaacatac gagccggaag 120cataaagtgt aaagcctggg gtgcctaatg agtgagctaa
ctcacattaa ttgcgttgcg 180ctcactgccc gctttccagt cgggaaacct gtcgtgccag
ctgcattaat gaatcggcca 240acgcgcgggg agaggcggtt tgcgtattgg atcgatccct
gaaagcgacg ttggatgtta 300acatctacaa attgcctttt cttatcgacc atgtacgtaa
gcgcttacgt ttttggtgga 360cccttgagga aactggtagc tgttgtgggc ctgtggtctc
aagatggatc attaatttcc 420accttcacct acgatggggg gcatcgcacc ggtgagtaat
attgtacggc taagagcgaa 480tttggcctgt agacctcaat tgcgagcttt ctaatttcaa
actattcggg cctaactttt 540ggtgtgatga tgctgactgg caggatatat accgttgtaa
tttgagctcg tgtgaataag 600tcgctgtgta tgtttgtttg attgtttctg ttggagtgca
gcccatttca ccggacaagt 660cggctagatt gatttagccc tgatgaactg ccgaggggaa
gccatcttga gcgcggaatg 720ggaatggatt tcgttgtaca acgagacgac agaacaccca
cgggaccgag cttcgcgagc 780ttttgtatcc gtggcatcct tggtccgggc gatttgttca
cgtccatgag gcgctctcca 840aaggaacgca tattttccgg tgcaaccttt ccggttcttc
ctctactcga cctcttgaag 900tcccagcatg aatgttcgac cgctccgcaa gcggatcttt
ggcgcaacca gccggtttcg 960cacgtcgatt ctcgcgagcc tgcatacttt ggcaagattg
ctgaatgacg ctgatgcttc 1020atcgcaatct gcgataatgg ggtaagtatc cggtgaaggc
cgcaggtcag gccgcctgag 1080cactcagtgt cttggatgtc cagttccacg gcagctgttg
ctcaagcctg ctgatcggag 1140cgtccgcaag gtcggcgcgg acgtcggcaa gccaggcctg
cggatcgatg ttattgagct 1200tggcgctcat gatcagtgtc gccatgaacg ccgcacgttc
agcacaacga tccgatccgg 1260caaacagcca tgacttcctg ccgagtacat agcctctgag
cgttcgttcg gcagcattgt 1320tcgtcaggca aatcgggccg tcatcgagga atgacgtaat
gccatcccat cgcttgagca 1380tgtaatttat cgcctcggcg acgggagaac tgcgcgacaa
tttcccccgc tcggtttcga 1440gccaatcatg cagctcttcg gcgagtgacc ttgatcaggc
caccgccacg accgcggaag 1500acgaacagat gcctgcgcat cggatcgcgc ttcagcgtct
cttgcaccat cagcgacaaa 1560ccgggaaagc ctttgcgcat gtccgtactt atgtcgccac
ttgggagggc ttcgtctacg 1620tggccttcgt gatcgacgtc ttcgcccgtc gcattgtcgg
atggcgggcg agccggacag 1680cacatgcagg ctttgtcctc gatgccctcg aggaggctca
tcatgatcgg cgtcccgctc 1740atggcggcct agtgcatcac tcggatcgcg gtgttcaata
cgtgtccttt cgctattccg 1800agcggttggc agaagcaggt atcgagccat ctatcggaag
cgtcggcgac agcacgacaa 1860cgccctcgca gaagcgatca acggtcttta caaggccgag
gtcattcatc ggcgtggacc 1920atggaggagc ttcgaagcgg tcgagttcgc taccttggaa
tggatagact ggttcaacca 1980cggcggcttt tgaagcccat cggcaatata ccgccagccg
aagacgagga tcagtattac 2040gccatgctgg acgaagcagc catggctgcg cattttaacg
aaatggcctc cggcaaaccc 2100ggtgcggttc acttgttgcg tgggaaagtt cacgggactc
cgcgcacgag ccttcttcgt 2160aatagccata tcgaccgaat tgacctgcag gggggggggg
gaaagccacg ttgtgtctca 2220aaatctctga tgttacattg cacaagataa aaatatatca
tcatgaacaa taaaactgtc 2280tgcttacata aacagtaata caaggggtgt tatgagccat
attcaacggg aaacgtcttg 2340ctcgaggccg cgattaaatt ccaacatgga tgctgattta
tatgggtata aatgggctcg 2400cgataatgtc gggcaatcag gtgcgacaat ctatcgattg
tatgggaagc ccgatgcgcc 2460agagttgttt ctgaaacatg gcaaaggtag cgttgccaat
gatgttacag atgagatggt 2520cagactaaac tggctgacgg aatttatgcc tcttccgacc
atcaagcatt ttatccgtac 2580tcctgatgat gcatggttac tcaccactgc gatccccggg
aaaacagcat tccaggtatt 2640agaagaatat cctgattcag gtgaaaatat tgttgatgcg
ctggcagtgt tcctgcgccg 2700gttgcattcg attcctgttt gtaattgtcc ttttaacagc
gatcgcgtat ttcgtctcgc 2760tcaggcgcaa tcacgaatga ataacggttt ggttgatgcg
agtgattttg atgacgagcg 2820taatggctgg cctgttgaac aagtctggaa agaaatgcat
aagcttttgc cattctcacc 2880ggattcagtc gtcactcatg gtgatttctc acttgataac
cttatttttg acgaggggaa 2940attaataggt tgtattgatg ttggacgagt cggaatcgca
gaccgatacc aggatcttgc 3000catcctatgg aactgcctcg gtgagttttc tccttcatta
cagaaacggc tttttcaaaa 3060atatggtatt gataatcctg atatgaataa attgcagttt
catttgatgc tcgatgagtt 3120tttctaatca gaattggtta attggttgta acactggcag
agcattacgc tgacttgacg 3180ggacggcggc tttgttgaat aaatcgaact tttgctgagt
tgaaggatca gatcacgcat 3240cttcccgaca acgcagaccg ttccgtggca aagcaaaagt
tcaaaatcac caactggtcc 3300acctacaaca aagctctcat caaccgtggc tccctcactt
tctggctgga tgatggggcg 3360attcaggcct ggtatgagtc agcaacacct tcttcacgag
gcagacctca gcgccccccc 3420ccccctgcag gtcttttcca atgatgagca cttttaaagt
tctgctatgt ggcgcggtat 3480tatcccgtgt tgacgccggg caagagcaac tcggtcgccg
catacactat tctcagaatg 3540acttggttga gtactcacca gtcacagaaa agcatcttac
ggatggcatg acagtaagag 3600aattatgcag tgctgccata accatgagtg ataacactgc
ggccaactta cttctgacaa 3660cgatcggagg accgaaggag ctaaccgctt ttttgcacaa
catgggggat catgtaactc 3720gccttgatcg ttgggaaccg gagctgaatg aagccatacc
aaacgacgag cgtgacacca 3780cgatgcctgt agcaatggca acaacgttgc gcaaactatt
aactggcgaa ctacttactc 3840tagcttcccg gcaacaatta atagactgga tggaggcgga
taaagttgca ggaccacttc 3900tgcgctcggc ccttccggct ggctggttta ttgctgataa
atctggagcc ggtgagcgtg 3960ggtctcgcgg tatcattgca gcactggggc cagatggtaa
gccctcccgt atcgtagtta 4020tctacacgac ggggagtcag gcaactatgg atgaacgaaa
tagacagatc gctgagatag 4080gtgcctcact gattaagcat tggtaactgt cagaccaagt
ttactcatat atactttaga 4140ttgatttaaa acttcatttt taatttaaaa ggatctaggt
gaagatcctt tttgataatc 4200tcatgaccaa aatcccttaa cgtgagtttt cgttccactg
agcgtcagac cccgtagaaa 4260agatcaaagg atcttcttga gatccttttt ttctgcgcgt
aatctgctgc ttgcaaacaa 4320aaaaaccacc gctaccagcg gtggtttgtt tgccggatca
agagctacca actctttttc 4380cgaaggtaac tggcttcagc agagcgcaga taccaaatac
tgtccttcta gtgtagccgt 4440agttaggcca ccacttcaag aactctgtag caccgcctac
atacctcgct ctgctaatcc 4500tgttaccagt ggctgctgcc agtggcgata agtcgtgtct
taccgggttg gactcaagac 4560gatagttacc ggataaggcg cagcggtcgg gctgaacggg
gggttcgtgc acacagccca 4620gcttggagcg aacgacctac accgaactga gatacctaca
gcgtgagcta tgagaaagcg 4680ccacgcttcc cgaagggaga aaggcggaca ggtatccggt
aagcggcagg gtcggaacag 4740gagagcgcac gagggagctt ccagggggaa acgcctggta
tctttatagt cctgtcgggt 4800ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc
gtcagggggg cggagcctat 4860ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc
cttttgctgg ccttttgctc 4920acatgttctt tcctgcgtta tcccctgatt ctgtggataa
ccgtattacc gcctttgagt 4980gagctgatac cgctcgccgc agccgaacga ccgagcgcag
cgagtcagtg agcgaggaag 5040cggaagagcg cctgatgcgg tattttctcc ttacgcatct
gtgcggtatt tcacaccgca 5100tatggtgcac tctcagtaca atctgctctg atgccgcata
gttaagccag tatacactcc 5160gctatcgcta cgtgactggg tcatggctgc gccccgacac
ccgccaacac ccgctgacgc 5220gccctgacgg gcttgtctgc tcccggcatc cgcttacaga
caagctgtga ccgtctccgg 5280gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa
cgcgcgaggc agggtgcctt 5340gatgtgggcg ccggcggtcg agtggcgacg gcgcggcttg
tccgcgccct ggtagattgc 5400ctggccgtag gccagccatt tttgagcggc cagcggccgc
gataggccga cgcgaagcgg 5460cggggcgtag ggagcgcagc gaccgaaggg taggcgcttt
ttgcagctct tcggctgtgc 5520gctggccaga cagttatgca caggccaggc gggttttaag
agttttaata agttttaaag 5580agttttaggc ggaaaaatcg ccttttttct cttttatatc
agtcacttac atgtgtgacc 5640ggttcccaat gtacggcttt gggttcccaa tgtacgggtt
ccggttccca atgtacggct 5700ttgggttccc aatgtacgtg ctatccacag gaaagagacc
ttttcgacct ttttcccctg 5760ctagggcaat ttgccctagc atctgctccg tacattagga
accggcggat gcttcgccct 5820cgatcaggtt gcggtagcgc atgactagga tcgggccagc
ctgccccgcc tcctccttca 5880aatcgtactc cggcaggtca tttgacccga tcagcttgcg
cacggtgaaa cagaacttct 5940tgaactctcc ggcgctgcca ctgcgttcgt agatcgtctt
gaacaaccat ctggcttctg 6000ccttgcctgc ggcgcggcgt gccaggcggt agagaaaacg
gccgatgccg ggatcgatca 6060aaaagtaatc ggggtgaacc gtcagcacgt ccgggttctt
gccttctgtg atctcgcggt 6120acatccaatc agctagctcg atctcgatgt actccggccg
cccggtttcg ctctttacga 6180tcttgtagcg gctaatcaag gcttcaccct cggataccgt
caccaggcgg ccgttcttgg 6240ccttcttcgt acgctgcatg gcaacgtgcg tggtgtttaa
ccgaatgcag gtttctacca 6300ggtcgtcttt ctgctttccg ccatcggctc gccggcagaa
cttgagtacg tccgcaacgt 6360gtggacggaa cacgcggccg ggcttgtctc ccttcccttc
ccggtatcgg ttcatggatt 6420cggttagatg ggaaaccgcc atcagtacca ggtcgtaatc
ccacacactg gccatgccgg 6480ccggccctgc ggaaacctct acgtgcccgt ctggaagctc
gtagcggatc acctcgccag 6540ctcgtcggtc acgcttcgac agacggaaaa cggccacgtc
catgatgctg cgactatcgc 6600gggtgcccac gtcatagagc atcggaacga aaaaatctgg
ttgctcgtcg cccttgggcg 6660gcttcctaat cgacggcgca ccggctgccg gcggttgccg
ggattctttg cggattcgat 6720cagcggccgc ttgccacgat tcaccggggc gtgcttctgc
ctcgatgcgt tgccgctggg 6780cggcctgcgc ggccttcaac ttctccacca ggtcatcacc
cagcgccgcg ccgatttgta 6840ccgggccgga tggtttgcga ccgctcacgc cgattcctcg
ggcttggggg ttccagtgcc 6900attgcagggc cggcagacaa cccagccgct tacgcctggc
caaccgcccg ttcctccaca 6960catggggcat tccacggcgt cggtgcctgg ttgttcttga
ttttccatgc cgcctccttt 7020agccgctaaa attcatctac tcatttattc atttgctcat
ttactctggt agctgcgcga 7080tgtattcaga tagcagctcg gtaatggtct tgccttggcg
taccgcgtac atcttcagct 7140tggtgtgatc ctccgccggc aactgaaagt tgacccgctt
catggctggc gtgtctgcca 7200ggctggccaa cgttgcagcc ttgctgctgc gtgcgctcgg
acggccggca cttagcgtgt 7260ttgtgctttt gctcattttc tctttacctc attaactcaa
atgagttttg atttaatttc 7320agcggccagc gcctggacct cgcgggcagc gtcgccctcg
ggttctgatt caagaacggt 7380tgtgccggcg gcggcagtgc ctgggtagct cacgcgctgc
gtgatacggg actcaagaat 7440gggcagctcg tacccggcca gcgcctcggc aacctcaccg
ccgatgcgcg tgcctttgat 7500cgcccgcgac acgacaaagg ccgcttgtag ccttccatcc
gtgacctcaa tgcgctgctt 7560aaccagctcc accaggtcgg cggtggccca tatgtcgtaa
gggcttggct gcaccggaat 7620cagcacgaag tcggctgcct tgatcgcgga cacagccaag
tccgccgcct ggggcgctcc 7680gtcgatcact acgaagtcgc gccggccgat ggccttcacg
tcgcggtcaa tcgtcgggcg 7740gtcgatgccg acaacggtta gcggttgatc ttcccgcacg
gccgcccaat cgcgggcact 7800gccctgggga tcggaatcga ctaacagaac atcggccccg
gcgagttgca gggcgcgggc 7860tagatgggtt gcgatggtcg tcttgcctga cccgcctttc
tggttaagta cagcgataac 7920ttcatgcgtt cccttgcgta tttgtttatt tactcatcgc
atcatatacg cagcgaccgc 7980atgacgcaag ctgttttact caaatacaca tcaccttttt
agacggcggc gctcggtttc 8040ttcagcggcc aagctggccg gccaggccgc cagcttggca
tcagacaaac cggccaggat 8100ttcatgcagc cgcacggttg agacgtgcgc gggcggctcg
aacacgtacc cggccgcgat 8160catctccgcc tcgatctctt cggtaatgaa aaacggttcg
tcctggccgt cctggtgcgg 8220tttcatgctt gttcctcttg gcgttcattc tcggcggccg
ccagggcgtc ggcctcggtc 8280aatgcgtcct cacggaaggc accgcgccgc ctggcctcgg
tgggcgtcac ttcctcgctg 8340cgctcaagtg cgcggtacag ggtcgagcga tgcacgccaa
gcagtgcagc cgcctctttc 8400acggtgcggc cttcctggtc gatcagctcg cgggcgtgcg
cgatctgtgc cggggtgagg 8460gtagggcggg ggccaaactt cacgcctcgg gccttggcgg
cctcgcgccc gctccgggtg 8520cggtcgatga ttagggaacg ctcgaactcg gcaatgccgg
cgaacacggt caacaccatg 8580cggccggccg gcgtggtggt gtcggcccac ggctctgcca
ggctacgcag gcccgcgccg 8640gcctcctgga tgcgctcggc aatgtccagt aggtcgcggg
tgctgcgggc caggcggtct 8700agcctggtca ctgtcacaac gtcgccaggg cgtaggtggt
caagcatcct ggccagctcc 8760gggcggtcgc gcctggtgcc ggtgatcttc tcggaaaaca
gcttggtgca gccggccgcg 8820tgcagttcgg cccgttggtt ggtcaagtcc tggtcgtcgg
tgctgacgcg ggcatagccc 8880agcaggccag cggcggcgct cttgttcatg gcgtaatgtc
tccggttcta gtcgcaagta 8940ttctacttta tgcgactaaa acacgcgaca agaaaacgcc
aggaaaaggg cagggcggca 9000gcctgtcgcg taacttagga cttgtgcgac atgtcgtttt
cagaagacgg ctgcactgaa 9060cgtcagaagc cgactgcact atagcagcgg aggggttgga
ccacaggacg ggtgtggtcg 9120ccatgatcgc gtagtcgata gtggctccaa gtagcgaagc
gagcaggact gggcggcggc 9180caaagcggtc ggacagtgct ccgagaacgg gtgcgcatag
aaattgcatc aacgcatata 9240gcgctagcag cacgccatag tgactggcga tgctgtcgga
atggacgata tcccgcaaga 9300ggcccggcag taccggcata accaagccta tgcctacagc
atccagggtg acggtgccga 9360ggatgacgat gagcgcattg ttagatttca tacacggtgc
ctgactgcgt tagcaattta 9420actgtgataa actaccgcat taaagctagc ttgcttggtc
gttccgcgtg aacgtcggct 9480cgattgtacc tgcgttcaaa tactttgcga tcgtgttgcg
cgcctgcccg gtgcgtcggc 9540tgatctcacg gatcgactgc ttctctcgca acgccatccg
acggatgatg tttaaaagtc 9600ccatgtggat cactccgttg ccccgtcgct caccgtgttg
gggggaaggt gcacatggct 9660cagttctcaa tggaaattat ctgcctaacc ggctcagttc
tgcgtagaaa ccaacatgca 9720agctccaccg ggtgcaaagc ggcagcggcg gcaggatata
ttcaattgta aatggcttca 9780tgtccgggaa atctacatgg atcagcaatg agtatgatgg
tcaatatgga gaaaaagaaa 9840gagtaattac caattttttt tcaattcaaa aatgtagatg
tccgcagcgt tattataaaa 9900tgaaagtaca ttttgataaa acgacaaatt acgatccgtc
gtatttatag gcgaaagcaa 9960taaacaaatt attctaattc ggaaatcttt atttcgacgt
gtctacattc acgtccaaat 10020gggggcttag atgagaaact tcacgatcga tgccttgatt
tcgccattcc cagataccca 10080tttcatcttc agattggtct gagattatgc gaaaatatac
actcatatac ataaatactg 10140acagtttgag ctaccaattc agtgtagccc attacctcac
ataattcact caaatgctag 10200gcagtctgtc aactcggcgt caatttgtcg gccactatac
gatagttgcg caaattttca 10260aagtcctggc ctaacatcac acctctgtcg gcggcgggtc
ccatttgtga taaatccacc 10320atatcgaatt aattcagact cctttgcccc agagatcaca
atggacgact tcctctatct 10380ctacgatcta gtcaggaagt tcgacggaga aggtgacgat
accatgttca ccactgataa 10440tgagaagatt agccttttca atttcagaaa gaatgctaac
ccacagatgg ttagagaggc 10500ttacgcagca ggtctcatca agacgatcta cccgagcaat
aatctccagg agatcaaata 10560ccttcccaag aaggttaaag atgcagtcaa aagattcagg
actaactgca tcaagaacac 10620agagaaagat atatttctca agatcagaag tactattcca
gtatggacga ttcaaggctt 10680gcttcacaaa ccaaggcaag taatagagat tggagtctct
aaaaaggtag ttcccactga 10740atcaaaggcc atggagtcaa agattcaaat agaggaccta
acagaactcg ccgtaaagac 10800tggcgaacag ttcatacaga gtctcttacg actcaatgac
aagaagaaaa tcttcgtcaa 10860catggtggag cacgacacgc ttgtctactc caaaaatatc
aaagatacag tctcagaaga 10920ccaaagggca attgagactt ttcaacaaag ggtaatatcc
ggaaacctcc tcggattcca 10980ttgcccagct atctgtcact ttattgtgaa gatagtggaa
aaggaaggtg gctcctacaa 11040atgccatcat tgcgataaag gaaaggccat cgttgaagat
gcctctgccg acagtggtcc 11100caaagatgga cccccaccca cgaggagcat cgtggaaaaa
gaagacgttc caaccacgtc 11160ttcaaagcaa gtggattgat gtgatatctc cactgacgta
agggatgacg cacaatccca 11220ctatccttcg caagaccctt cctctatata aggaagttca
tttcatttgg agaggacacg 11280ctgaaatcac cagtctccaa gcttgcgggg atcgtttcgc
atgattgaac aagatggatt 11340gcacgcaggt tctccggccg cttgggtgga gaggctattc
ggctatgact gggcacaaca 11400gacaatcggc tgctctgatg ccgccgtgtt ccggctgtca
gcgcaggggc gcccggttct 11460ttttgtcaag accgacctgt ccggtgccct gaatgaactg
caggacgagg cagcgcggct 11520atcgtggctg gccacgacgg gcgttccttg cgcagctgtg
ctcgacgttg tcactgaagc 11580gggaagggac tggctgctat tgggcgaagt gccggggcag
gatctcctgt catctcacct 11640tgctcctgcc gagaaagtat ccatcatggc tgatgcaatg
cggcggctgc atacgcttga 11700tccggctacc tgcccattcg accaccaagc gaaacatcgc
atcgagcgag cacgtactcg 11760gatggaagcc ggtcttgtcg atcaggatga tctggacgaa
gagcatcagg ggctcgcgcc 11820agccgaactg ttcgccaggc tcaaggcgcg catgcccgac
ggcgaggatc tcgtcgtgac 11880ccatggcgat gcctgcttgc cgaatatcat ggtggaaaat
ggccgctttt ctggattcat 11940cgactgtggc cggctgggtg tggcggaccg ctatcaggac
atagcgttgg ctacccgtga 12000tattgctgaa gagcttggcg gcgaatgggc tgaccgcttc
ctcgtgcttt acggtatcgc 12060cgctcccgat tcgcagcgca tcgccttcta tcgccttctt
gacgagttct tctgagcggg 12120actctggggt tcgaaatgac cgaccaagcg acgcccaacc
tgccatcacg agatttcgat 12180tccaccgccg ccttctatga aaggttgggc ttcggaatcg
ttttccggga cgccggctgg 12240atgatcctcc agcgcgggga tctcatgctg gagttcttcg
cccaccccgg atcgatccaa 12300cacttacgtt tgcaacgtcc aagagcaaat agaccacgaa
cgccggaagg ttgccgcagc 12360gtgtggattg cgtctcaatt ctctcttgca ggaatgcaat
gatgaatatg atactgacta 12420tgaaactttg agggaatact gcctagcacc gtcacctcat
aacgtgcatc atgcatgccc 12480tgacaacatg gaacatcgct atttttctga agaattatgc
tcgttggagg atgtcgcggc 12540aattgcagct attgccaaca tcgaactacc cctcacgcat
gcattcatca atattattca 12600tgcggggaaa ggcaagatta atccaactgg caaatcatcc
agcgtgattg gtaacttcag 12660ttccagcgac ttgattcgtt ttggtgctac ccacgttttc
aataaggacg agatggtgga 12720gtaaagaagg agtgcgtcga agcagatcgt tcaaacattt
ggcaataaag tttcttaaga 12780ttgaatcctg ttgccggtct tgcgatgatt atcatataat
ttctgttgaa ttacgttaag 12840catgtaataa ttaacatgta atgcatgacg ttatttatga
gatgggtttt tatgattaga 12900gtcccgcaat tatacattta atacgcgata gaaaacaaaa
tatagcgcgc aaactaggat 12960aaattatcgc gcgcggtgtc atctatgtta ctagatcgat
caaacttcgg tactgtgtaa 13020tgacgatgag caatcgagag gctgactaac aaaaggtaca
tcgcgatgga tcgatccatt 13080cgccattcag gctgcgcaac tgttgggaag ggcgatcggt
gcgggcctct tcgctattac 13140gccagctggc gaaaggggga tgtgctgcaa ggcgattaag
ttgggtaacg ccagggtttt 13200cccagtcacg acgttgtaaa acgacggcca gtgaattcct
gcagcccggg ggatccgccc 13260actcgaggcg cgccgtcgac ggatataatg agccgtaaac
aaagatgatt aagtagtaat 13320taatacgtac tagtaaaagt ggcaaaagat aacgagaaag
aaccaatttc tttgcattcg 13380gccttagcgg aaggcatata taagctttga ttattttatt
tagtgtaatg atttcgtaca 13440accaaagcat ttatttagta ctctcacact tgtgtcgcgg
ccggccgcta caggaacagg 13500tggtggcggc cctcggcgcg ctcgtactgc tccacgatgg
tgtagtcctc gttgtgggag 13560gtgatgtcca gcttggagtc cacgtagtag tagccgggca
gctgcacggg cttcttggcc 13620atgtagatgg acttgaactc caccaggtag tggccgccgt
ccttcagctt cagggccttg 13680tggatctcgc ccttcagcac gccgtcgcgg gggtacaggc
gctcggtgga ggcctcccag 13740cccatagtct tcttctgcat tacggggccg tcggagggga
agttcacgcc gatgaacttc 13800accttgtaga tgaaggagcc gtcctgcagg gaggagtcct
gggtcacggt caccacgccg 13860ccgtcctcga agttcatcac gcgctcccac ttgaagccct
cggggaagga cagcttcttg 13920tagtcgggga tgtcggcggg gtgcttcacg tacaccttgg
agccgtactg gaactggggg 13980gacaggatgt cccaggcgaa gggcaggggg ccgcccttgg
tcaccttcag cttggcggtc 14040tgggtgccct cgtaggggcg gccctcgccc tcgccctcga
tctcgaactc gtggccgttc 14100acggagccct ccatgcgcac cttgaagcgc atgaactcct
tgatgacgtc ctcggaggag 14160gccatgggcc gcttgggggg ctatggaaga ctttcttagt
tagttgtgtg aataagcaat 14220gttgggagaa tcgggactac ttataggata ggaataaaac
agaaaagtat taagtgctaa 14280tgaaatattt agactgataa ttaaaatctt cacgtatgtc
cacttgatat aaaaacgtca 14340ggaataaagg aagtacagta gaatttaaag gtactctttt
tatatatacc cgtgttctct 14400ttttggctag ctagttgcat aaaaaataat ctatattttt
atcattattt taaatatctt 14460atgagatggt aaatatttat cataattttt tttactatta
tttattattt gtgtgtgtaa 14520tacatataga agttaattac aaattttatt tactttttca
ttattttgat atgattcacc 14580attaatttag tgttattatt tataatagtt cattttaatc
tttttgtata tattatgcgt 14640gcagtacttt tttcctacat ataactacta ttacatttta
tttatataat atttttatta 14700atgaattttc gtgataatat gtaatattgt tcattattat
ttcagatttt ttaaaaatat 14760ttgtgttatt atttatgaaa tatgtaattt ttttagtatt
tgattttatg atgataaagt 14820gttctaaatt caaaagaagg gggaaagcgt aaacattaaa
aaacgtcatc aaacaaaaac 14880aaaatcttgt taataaagat aaaactgttt gttttgatca
ctgttatttc gtaatataaa 14940aacattattt atatttatat tgttgacaac caaatttgcc
tatcaaatct aaccaatata 15000atgcatgcgt ggcaggtaat gtactaccat gaacttaagt
catgacataa taaaccgtga 15060atctgaccaa tgcatgtacc tanctaaatt gtatttgtga
cacgaagcaa atgattcaat 15120tcacaatgga gatgggaaac aaataatgaa gaacccagaa
ctaagaaagc ttttctgaaa 15180aataaaataa aggcaatgtc aaaagtatac tgcatcatca
gtccagaaag cacatgatat 15240ttttttatca gtatcaatgc agctagtttt attttacaat
atcgatatag ctagtttaaa 15300tatattgcag ctagatttat aaatatttgt gttattattt
atcatttgtg taatcctgtt 15360tttagtattt tagtttatat atgatgataa tgtattccaa
atttaaaaga agggaaataa 15420atttaaacaa gaaaaaaagt catcaaacaa aaaacaaatg
aaagggtgga aagatgttac 15480catgtaatgt gaatgttaca gtatttcttt tattatagag
ttaacaaatt aactaatatg 15540attttgttaa taatgataaa atattttttt tattattatt
tcataatata aaaatagttt 15600acttaatata aaaaaaattc tatcgttcac aacaaagttg
gccacctaat ttaaccatgc 15660atgtacccat ggaccatatt aggtaaccat caaacctgat
gaagagataa agagatgaag 15720acttaagtca taacacaaaa ccataaaaaa caaaaataca
atcaaccgtc aatctgacca 15780atgcatgaaa aagctgcaat agtgagtggc gacacaaagc
acatgatttt cttacaacgg 15840agataaaacc aaaaaaatat ttcatgaaca acctagaaca
aataaagctt ttatataata 15900aatatataaa taaataaagg ctatggaata atatacttca
atatatttgg attaaataaa 15960ttgttggcgg ggttgatata tttatacaca cctaaagtca
cttcaatctc attttcactt 16020aacttttatt ttttttttct ttttatttat cataaagaga
atattgataa tatacttttt 16080aacatatttt tatgacattt tttattggtg aaaacttatt
aaaaatcata aattttgtaa 16140gttagattta tttaaagagt tcctcttctt attttaaatt
ttttaataaa tttttaaata 16200actaaaattt gtgttaaaaa tgttaaaaaa gtgtgttatt
aacccttctc ttcgaggatc 16260cgtaccgagc tcggatcctc tagaaatccg tcaacatggt
ggagcacgac actctcgtct 16320actccaagaa tatcaaagat acagtctcag aagaccaaag
ggctattgag acttttcaac 16380aaagggtaat atcgggaaac ctcctcggat tccattgccc
agctatctgt cacttcatca 16440aaaggacagt agaaaaggaa ggtggcacct acaaatgcca
tcattgcgat aaaggaaagg 16500ctatcgttca agatgcctct gccgacagtg gtcccaaaga
tggaccccca cccacgagga 16560gcatcgtgga aaaagaagac gttccaacca cgtcttcaaa
gcaagtggat tgatgtgatg 16620atcctatgcg tatggtatga cgtgtgttca agatgatgac
ttcaaaccta cctatgacgt 16680atggtatgaa cgtgtgtcga ctgatgactt agatccactc
gagcggctat aaatacgtac 16740ctacgcaccc tgcgctacca tccctagagc tgcagcttat
ttttacaaca attaccaaca 16800acaacaaaca acaaacaaca ttacaattac tatttacaat
tacagtcgac ccgggatcgt 16860acctctaggg tggcggccgc atggagagat ctcaacggca
gtctcctccg ccaccgtcgc 16920cgtcctcctc ctcgtcctcc gtctccgcgg acaccgtcct
cgtccctccc ggaaagaggc 16980ggagggcggc gacggccaag gccggcgccg agcctaataa
gaggatccgc aaggaccccg 17040ccgccgccgc cgcggggaag aggagctccg tctacagggg
agtcaccagg cacaggtgga 17100cgggcaggtt cgaggcgcat ctctgggaca agcactgcct
cgccgcgctc cacaacaaga 17160agaaaggcag gcaagtctac ctgggggcgt atgacagcga
ggaggcagct gctcgtgcct 17220atgacctcgc agctctcaag tactggggtc ctgagactct
gctcaacttc cctgtggagg 17280attactccag cgagatgccg gagatggagg ccgtgtcccg
ggaggagtac ctggcctccc 17340tccgccgcag gagcagcggc ttctccaggg gcgtctccaa
gtacagaggc gtcgccaggc 17400atcaccacaa cgggaggtgg gaggcacgga ttgggcgagt
ctttgggaac aagtacctct 17460acttgggaac atttgacact caagaagagg cagccaaggc
ctatgacctt gcggccattg 17520aataccgtgg cgtcaatgct gtaaccaact tcgacatcag
ctgctacctg gaccacccgc 17580tgttcctggc acagctccaa caggagccac aggtggtgcc
ggcactcaac caagaacctc 17640aacctgatca gagcgaaacc ggaactacag agcaagagcc
ggagtcaagc gaagccaaga 17700caccggatgg cagtgcagaa cccgatgaga acgcggtgcc
tgacgacacc gcggagcccc 17760tcaccacagt cgacgacagc atcgaagagg gcttgtggag
cccttgcatg gattacgagc 17820tagacaccat gtcgagacca aactttggca gctcaatcaa
tctgagcgag tggttcgctg 17880acgcagactt cgactgcaac atcggatgcc tgttcgatgg
gtgttctgcg gctgacgaag 17940gaagcaagga tggtgtaggt ctggcagatt tcagtctgtt
tgaggcaggt gatgtccagc 18000tgaaggatgt tctttcggat atggaagagg ggatacaacc
tccagcgatg atcagtgtgt 18060gcaacgcggc cgcaagtatg aactaaaatg catgtaggtg
taagagctca tggagagcat 18120ggaatattgt atccgaccat gtaacagtat aataactgag
ctccatctca cttcttctat 18180gaataaacaa aggatgttat gatatattaa cactctatct
atgcacctta ttgttctatg 18240ataaatttcc tcttattatt ataaatcatc tgaatcgtga
cggcttatgg aatgcttcaa 18300atagtacaaa aacaaatgtg tactataaga ctttctaaac
aattctaacc ttagcattgt 18360gaacgagaca taagtgttaa gaagacataa caattataat
ggaagaagtt tgtctccatt 18420tatatattat atattaccca cttatgtatt atattaggat
gttaaggaga cataacaatt 18480ataaagagag aagtttgtat ccatttatat attatatact
acccatttat atattatact 18540tatccactta tttaatgtct ttataaggtt tgatccatga
tatttctaat attttagttg 18600atatgtatat gaaagggtac tatttgaact ctcttactct
gtataaaggt tggatcatcc 18660ttaaagtggg tctatttaat tttattgctt cttacagata
aaaaaaaaat tatgagttgg 18720tttgataaaa tattgaagga tttaaaataa taataaataa
catataatat atgtatataa 18780atttattata atataacatt tatctataaa aaagtaaata
ttgtcataaa tctatacaat 18840cgtttagcct tgctggacga atctcaatta tttaaacgag
agtaaacata tttgactttt 18900tggttattta acaaattatt atttaacact atatgaaatt
ttttttttta tcagcaaaga 18960ataaaattaa attaagaagg acaatggtgt cccaatcctt
atacaaccaa cttccacaag 19020aaagtcaagt cagagacaac aaaaaaacaa gcaaaggaaa
ttttttaatt tgagttgtct 19080tgtttgctgc ataatttatg cagtaaaaca ctacacataa
cccttttagc agtagagcaa 19140tggttgaccg tgtgcttagc ttcttttatt ttattttttt
atcagcaaag aataaataaa 19200ataaaatgag acacttcagg gatgtttcaa caagctctag
agggcccaat tcgccctata 19260gtgagtcgta ttacaattca ctggccgtcg ttttacaacg
tcgtgactgg gaaaaccctg 19320gcgttaccca acttaatcgc cttgcagcac atcccccttt
cgccagctgg cgtaatagcg 19380aagaggcccg caccgatcgc ccttcccaac agttgcgcag
cctatacgta cgagatccgg 19440ccggccagat cctgcaggag atccaagctt gg
19472361188DNAZea mays 36atggagagat ctcaacggca
gtctcctccg ccaccgtcgc cgtcctcctc ctcgtcctcc 60gtctccgcgg acaccgtcct
cgtccctccc ggaaagaggc ggagggcggc gacggccaag 120gccggcgccg agcctaataa
gaggatccgc aaggaccccg ccgccgccgc cgcggggaag 180aggagctccg tctacagggg
agtcaccagg cacaggtgga cgggcaggtt cgaggcgcat 240ctctgggaca agcactgcct
cgccgcgctc cacaacaaga agaaaggcag gcaagtctac 300ctgggggcgt atgacagcga
ggaggcagct gctcgtgcct atgacctcgc agctctcaag 360tactggggtc ctgagactct
gctcaacttc cctgtggagg attactccag cgagatgccg 420gagatggagg ccgtgtcccg
ggaggagtac ctggcctccc tccgccgcag gagcagcggc 480ttctccaggg gcgtctccaa
gtacagaggc gtcgccaggc atcaccacaa cgggaggtgg 540gaggcacgga ttgggcgagt
ctttgggaac aagtacctct acttgggaac atttgacact 600caagaagagg cagccaaggc
ctatgacctt gcggccattg aataccgtgg cgtcaatgct 660gtaaccaact tcgacatcag
ctgctacctg gaccacccgc tgttcctggc acagctccaa 720caggagccac aggtggtgcc
ggcactcaac caagaacctc aacctgatca gagcgaaacc 780ggaactacag agcaagagcc
ggagtcaagc gaagccaaga caccggatgg cagtgcagaa 840cccgatgaga acgcggtgcc
tgacgacacc gcggagcccc tcaccacagt cgacgacagc 900atcgaagagg gcttgtggag
cccttgcatg gattacgagc tagacaccat gtcgagacca 960aactttggca gctcaatcaa
tctgagcgag tggttcgctg acgcagactt cgactgcaac 1020atcggatgcc tgttcgatgg
gtgttctgcg gctgacgaag gaagcaagga tggtgtaggt 1080ctggcagatt tcagtctgtt
tgaggcaggt gatgtccagc tgaaggatgt tctttcggat 1140atggaagagg ggatacaacc
tccagcgatg atcagtgtgt gcaactaa 118837395PRTZea mays 37Met
Glu Arg Ser Gln Arg Gln Ser Pro Pro Pro Pro Ser Pro Ser Ser 1
5 10 15 Ser Ser Ser Ser Val Ser
Ala Asp Thr Val Leu Val Pro Pro Gly Lys 20
25 30 Arg Arg Arg Ala Ala Thr Ala Lys Ala Gly
Ala Glu Pro Asn Lys Arg 35 40
45 Ile Arg Lys Asp Pro Ala Ala Ala Ala Ala Gly Lys Arg Ser
Ser Val 50 55 60
Tyr Arg Gly Val Thr Arg His Arg Trp Thr Gly Arg Phe Glu Ala His 65
70 75 80 Leu Trp Asp Lys His
Cys Leu Ala Ala Leu His Asn Lys Lys Lys Gly 85
90 95 Arg Gln Val Tyr Leu Gly Ala Tyr Asp Ser
Glu Glu Ala Ala Ala Arg 100 105
110 Ala Tyr Asp Leu Ala Ala Leu Lys Tyr Trp Gly Pro Glu Thr Leu
Leu 115 120 125 Asn
Phe Pro Val Glu Asp Tyr Ser Ser Glu Met Pro Glu Met Glu Ala 130
135 140 Val Ser Arg Glu Glu Tyr
Leu Ala Ser Leu Arg Arg Arg Ser Ser Gly 145 150
155 160 Phe Ser Arg Gly Val Ser Lys Tyr Arg Gly Val
Ala Arg His His His 165 170
175 Asn Gly Arg Trp Glu Ala Arg Ile Gly Arg Val Phe Gly Asn Lys Tyr
180 185 190 Leu Tyr
Leu Gly Thr Phe Asp Thr Gln Glu Glu Ala Ala Lys Ala Tyr 195
200 205 Asp Leu Ala Ala Ile Glu Tyr
Arg Gly Val Asn Ala Val Thr Asn Phe 210 215
220 Asp Ile Ser Cys Tyr Leu Asp His Pro Leu Phe Leu
Ala Gln Leu Gln 225 230 235
240 Gln Glu Pro Gln Val Val Pro Ala Leu Asn Gln Glu Pro Gln Pro Asp
245 250 255 Gln Ser Glu
Thr Gly Thr Thr Glu Gln Glu Pro Glu Ser Ser Glu Ala 260
265 270 Lys Thr Pro Asp Gly Ser Ala Glu
Pro Asp Glu Asn Ala Val Pro Asp 275 280
285 Asp Thr Ala Glu Pro Leu Thr Thr Val Asp Asp Ser Ile
Glu Glu Gly 290 295 300
Leu Trp Ser Pro Cys Met Asp Tyr Glu Leu Asp Thr Met Ser Arg Pro 305
310 315 320 Asn Phe Gly Ser
Ser Ile Asn Leu Ser Glu Trp Phe Ala Asp Ala Asp 325
330 335 Phe Asp Cys Asn Ile Gly Cys Leu Phe
Asp Gly Cys Ser Ala Ala Asp 340 345
350 Glu Gly Ser Lys Asp Gly Val Gly Leu Ala Asp Phe Ser Leu
Phe Glu 355 360 365
Ala Gly Asp Val Gln Leu Lys Asp Val Leu Ser Asp Met Glu Glu Gly 370
375 380 Ile Gln Pro Pro Ala
Met Ile Ser Val Cys Asn 385 390 395
381239DNAGlycine max 38atgaagaggt ctccagcatc ttcttgttca tcatctactt
cctctgttgg gtttgaagct 60cccattgaaa aaagaaggcc taagcatcca aggaggaata
atttgaagtc acaaaaatgc 120aagcagaacc aaaccaccac tggtggcaga agaagctcta
tctatagagg agttacaagg 180cataggtgga cagggaggtt tgaagctcac ctatgggata
agagctcttg gaacaacatt 240cagagcaaga agggtcgaca agtttatttg ggggcatatg
atactgaaga atctgcagcc 300cgtacctatg accttgcagc ccttaaatac tggggaaaag
atgcaaccct gaatttcccg 360atagaaactt ataccaagga gctcgaggaa atggacaagg
tttcaagaga agaatatttg 420gcttctttgc ggcgccaaag cagtggcttt tctagaggcc
tgtctaagta ccgtggggtt 480gctaggcatc atcataatgg tcgctgggaa gcacgaattg
gaagagtatg cggaaacaag 540tacctctact tggggacata taaaactcaa gaggaggcag
cagtggcata tgacatggca 600gcaatagagt accgtggagt caatgcagtg accaattttg
acataagcaa ctacatggac 660aaaataaaga agaaaaatga ccaaacccaa caacaacaaa
cagaagcaca aacggaaaca 720gttcctaact cctctgactc tgaagaagta gaagtagaac
aacagacaac aacaataacc 780acaccacccc catctgaaaa tctgcacatg ccaccacagc
agcaccaagt tcaatacacc 840ccccatgtct ctccaaggga agaagaatca tcatcactga
tcacaattat ggaccatgtg 900cttgagcagg atctgccatg gagcttcatg tacactggct
tgtctcagtt tcaagatcca 960aacttggctt tctgcaaagg tgatgatgac ttggtgggca
tgtttgatag tgcagggttt 1020gaggaagaca ttgattttct gttcagcact caacctggtg
atgagactga gagtgatgtc 1080aacaatatga gcgcagtttt ggatagtgtt gagtgtggag
acacaaatgg ggctggtgga 1140agcatgatgc atgtggataa caagcagaag atagtatcat
ttgcttcttc accatcatct 1200acaactacag tttcttgtga ctatgctcta gatctatga
123939412PRTGlycine max 39Met Lys Arg Ser Pro Ala
Ser Ser Cys Ser Ser Ser Thr Ser Ser Val 1 5
10 15 Gly Phe Glu Ala Pro Ile Glu Lys Arg Arg Pro
Lys His Pro Arg Arg 20 25
30 Asn Asn Leu Lys Ser Gln Lys Cys Lys Gln Asn Gln Thr Thr Thr
Gly 35 40 45 Gly
Arg Arg Ser Ser Ile Tyr Arg Gly Val Thr Arg His Arg Trp Thr 50
55 60 Gly Arg Phe Glu Ala His
Leu Trp Asp Lys Ser Ser Trp Asn Asn Ile 65 70
75 80 Gln Ser Lys Lys Gly Arg Gln Val Tyr Leu Gly
Ala Tyr Asp Thr Glu 85 90
95 Glu Ser Ala Ala Arg Thr Tyr Asp Leu Ala Ala Leu Lys Tyr Trp Gly
100 105 110 Lys Asp
Ala Thr Leu Asn Phe Pro Ile Glu Thr Tyr Thr Lys Glu Leu 115
120 125 Glu Glu Met Asp Lys Val Ser
Arg Glu Glu Tyr Leu Ala Ser Leu Arg 130 135
140 Arg Gln Ser Ser Gly Phe Ser Arg Gly Leu Ser Lys
Tyr Arg Gly Val 145 150 155
160 Ala Arg His His His Asn Gly Arg Trp Glu Ala Arg Ile Gly Arg Val
165 170 175 Cys Gly Asn
Lys Tyr Leu Tyr Leu Gly Thr Tyr Lys Thr Gln Glu Glu 180
185 190 Ala Ala Val Ala Tyr Asp Met Ala
Ala Ile Glu Tyr Arg Gly Val Asn 195 200
205 Ala Val Thr Asn Phe Asp Ile Ser Asn Tyr Met Asp Lys
Ile Lys Lys 210 215 220
Lys Asn Asp Gln Thr Gln Gln Gln Gln Thr Glu Ala Gln Thr Glu Thr 225
230 235 240 Val Pro Asn Ser
Ser Asp Ser Glu Glu Val Glu Val Glu Gln Gln Thr 245
250 255 Thr Thr Ile Thr Thr Pro Pro Pro Ser
Glu Asn Leu His Met Pro Pro 260 265
270 Gln Gln His Gln Val Gln Tyr Thr Pro His Val Ser Pro Arg
Glu Glu 275 280 285
Glu Ser Ser Ser Leu Ile Thr Ile Met Asp His Val Leu Glu Gln Asp 290
295 300 Leu Pro Trp Ser Phe
Met Tyr Thr Gly Leu Ser Gln Phe Gln Asp Pro 305 310
315 320 Asn Leu Ala Phe Cys Lys Gly Asp Asp Asp
Leu Val Gly Met Phe Asp 325 330
335 Ser Ala Gly Phe Glu Glu Asp Ile Asp Phe Leu Phe Ser Thr Gln
Pro 340 345 350 Gly
Asp Glu Thr Glu Ser Asp Val Asn Asn Met Ser Ala Val Leu Asp 355
360 365 Ser Val Glu Cys Gly Asp
Thr Asn Gly Ala Gly Gly Ser Met Met His 370 375
380 Val Asp Asn Lys Gln Lys Ile Val Ser Phe Ala
Ser Ser Pro Ser Ser 385 390 395
400 Thr Thr Thr Val Ser Cys Asp Tyr Ala Leu Asp Leu
405 410 401230DNAMomordica charantia 40atgagaaggt
ctccctctgt ttctacttcc tcctcctcct cctcctcctg cgtcggcggc 60ggcggcttcg
acagcaataa tctcaatctc gccgcccctc cgcgccggcc gcaatcggag 120aagaccggag
cgaaacgccg gaagcggaat caggacgacg ccaaatgcga gattgagaat 180cgtaacggta
ataacaacaa cagcagcaac aacaatgcct cttccggccg ccggagctcc 240atttacagag
gagtcactag gcaccgatgg accggccggt tcgaagcgca tctctgggac 300aagagttcgt
ggaatagcat tcagaacaaa aaaggaaggc aagtttattt gggagcatac 360gataacgagg
aagctgccgc ccgaacttat gacctcgctg ccctcaagta ctggggtccc 420ggaaccaccc
tcaatttccc ggtagagtcg tacaggaatg aaatagaaga aatgcggaaa 480gttacgaagg
aggagtattt ggcgtcgtta cggcggcgga gcagcggatt ttcgagaggc 540gtatcgaagt
accgcggcgt ggcccgccac caccacaacg gccggtggga ggcgcggatc 600ggccgtgttt
tcggaagcaa atatctttac ctgggaactt acaacacaca agaggaagca 660gcagcagcat
atgacatggc tgcaattgag tacagagggg tcaatgcagt gaccaatttc 720gacatcagca
attacattgg gcggctggag aataaatcat cagtttttcc agcagcagag 780cagcccctac
agcccaactg ctcccctgct tcctcttctg aggaaggcga agtagtacag 840cagcaacagc
aacagacgac gatggcgttc tcaggctcgc ccctccagtt cccgtcgatg 900gagaacagcc
cgacgacaat ggaggaggat catgatctgc attggtcatt cctagacacg 960gggttcgtgc
aggtccccga cctccccctc gagaagtctg gcgaattgcc tgacctgttc 1020tttgatgaga
tcgggttcga ggacgacatc gggttgatat tcgaggcgag cttggaagac 1080gagaggtgcg
gggagggggg tgagaagtta gaagatgtgg ggaaaatgga gatgatgaag 1140agtgatcatg
aggagagggg gttgttctcg actacttcgc catcttcgtc gtcgataacc 1200acctcggttt
cgtgtgaatt tagggtttga
123041409PRTMomordica charantia 41Met Arg Arg Ser Pro Ser Val Ser Thr Ser
Ser Ser Ser Ser Ser Ser 1 5 10
15 Cys Val Gly Gly Gly Gly Phe Asp Ser Asn Asn Leu Asn Leu Ala
Ala 20 25 30 Pro
Pro Arg Arg Pro Gln Ser Glu Lys Thr Gly Ala Lys Arg Arg Lys 35
40 45 Arg Asn Gln Asp Asp Ala
Lys Cys Glu Ile Glu Asn Arg Asn Gly Asn 50 55
60 Asn Asn Asn Ser Ser Asn Asn Asn Ala Ser Ser
Gly Arg Arg Ser Ser 65 70 75
80 Ile Tyr Arg Gly Val Thr Arg His Arg Trp Thr Gly Arg Phe Glu Ala
85 90 95 His Leu
Trp Asp Lys Ser Ser Trp Asn Ser Ile Gln Asn Lys Lys Gly 100
105 110 Arg Gln Val Tyr Leu Gly Ala
Tyr Asp Asn Glu Glu Ala Ala Ala Arg 115 120
125 Thr Tyr Asp Leu Ala Ala Leu Lys Tyr Trp Gly Pro
Gly Thr Thr Leu 130 135 140
Asn Phe Pro Val Glu Ser Tyr Arg Asn Glu Ile Glu Glu Met Arg Lys 145
150 155 160 Val Thr Lys
Glu Glu Tyr Leu Ala Ser Leu Arg Arg Arg Ser Ser Gly 165
170 175 Phe Ser Arg Gly Val Ser Lys Tyr
Arg Gly Val Ala Arg His His His 180 185
190 Asn Gly Arg Trp Glu Ala Arg Ile Gly Arg Val Phe Gly
Ser Lys Tyr 195 200 205
Leu Tyr Leu Gly Thr Tyr Asn Thr Gln Glu Glu Ala Ala Ala Ala Tyr 210
215 220 Asp Met Ala Ala
Ile Glu Tyr Arg Gly Val Asn Ala Val Thr Asn Phe 225 230
235 240 Asp Ile Ser Asn Tyr Ile Gly Arg Leu
Glu Asn Lys Ser Ser Val Phe 245 250
255 Pro Ala Ala Glu Gln Pro Leu Gln Pro Asn Cys Ser Pro Ala
Ser Ser 260 265 270
Ser Glu Glu Gly Glu Val Val Gln Gln Gln Gln Gln Gln Thr Thr Met
275 280 285 Ala Phe Ser Gly
Ser Pro Leu Gln Phe Pro Ser Met Glu Asn Ser Pro 290
295 300 Thr Thr Met Glu Glu Asp His Asp
Leu His Trp Ser Phe Leu Asp Thr 305 310
315 320 Gly Phe Val Gln Val Pro Asp Leu Pro Leu Glu Lys
Ser Gly Glu Leu 325 330
335 Pro Asp Leu Phe Phe Asp Glu Ile Gly Phe Glu Asp Asp Ile Gly Leu
340 345 350 Ile Phe Glu
Ala Ser Leu Glu Asp Glu Arg Cys Gly Glu Gly Gly Glu 355
360 365 Lys Leu Glu Asp Val Gly Lys Met
Glu Met Met Lys Ser Asp His Glu 370 375
380 Glu Arg Gly Leu Phe Ser Thr Thr Ser Pro Ser Ser Ser
Ser Ile Thr 385 390 395
400 Thr Ser Val Ser Cys Glu Phe Arg Val 405
42430PRTArabidopsis thaliana 42Met Lys Lys Arg Leu Thr Thr Ser Thr Cys
Ser Ser Ser Pro Ser Ser 1 5 10
15 Ser Val Ser Ser Ser Thr Thr Thr Ser Ser Pro Ile Gln Ser Glu
Ala 20 25 30 Pro
Arg Pro Lys Arg Ala Lys Arg Ala Lys Lys Ser Ser Pro Ser Gly 35
40 45 Asp Lys Ser His Asn Pro
Thr Ser Pro Ala Ser Thr Arg Arg Ser Ser 50 55
60 Ile Tyr Arg Gly Val Thr Arg His Arg Trp Thr
Gly Arg Phe Glu Ala 65 70 75
80 His Leu Trp Asp Lys Ser Ser Trp Asn Ser Ile Gln Asn Lys Lys Gly
85 90 95 Lys Gln
Val Tyr Leu Gly Ala Tyr Asp Ser Glu Glu Ala Ala Ala His 100
105 110 Thr Tyr Asp Leu Ala Ala Leu
Lys Tyr Trp Gly Pro Asp Thr Ile Leu 115 120
125 Asn Phe Pro Ala Glu Thr Tyr Thr Lys Glu Leu Glu
Glu Met Gln Arg 130 135 140
Val Thr Lys Glu Glu Tyr Leu Ala Ser Leu Arg Arg Gln Ser Ser Gly 145
150 155 160 Phe Ser Arg
Gly Val Ser Lys Tyr Arg Gly Val Ala Arg His His His 165
170 175 Asn Gly Arg Trp Glu Ala Arg Ile
Gly Arg Val Phe Gly Asn Lys Tyr 180 185
190 Leu Tyr Leu Gly Thr Tyr Asn Thr Gln Glu Glu Ala Ala
Ala Ala Tyr 195 200 205
Asp Met Ala Ala Ile Glu Tyr Arg Gly Ala Asn Ala Val Thr Asn Phe 210
215 220 Asp Ile Ser Asn
Tyr Ile Asp Arg Leu Lys Lys Lys Gly Val Phe Pro 225 230
235 240 Phe Pro Val Asn Gln Ala Asn His Gln
Glu Gly Ile Leu Val Glu Ala 245 250
255 Lys Gln Glu Val Glu Thr Arg Glu Ala Lys Glu Glu Pro Arg
Glu Glu 260 265 270
Val Lys Gln Gln Tyr Val Glu Glu Pro Pro Gln Glu Glu Glu Glu Lys
275 280 285 Glu Glu Glu Lys
Ala Glu Gln Gln Glu Ala Glu Ile Val Gly Tyr Ser 290
295 300 Glu Glu Ala Ala Val Val Asn Cys
Cys Ile Asp Ser Ser Thr Ile Met 305 310
315 320 Glu Met Asp Arg Cys Gly Asp Asn Asn Glu Leu Ala
Trp Asn Phe Cys 325 330
335 Met Met Asp Thr Gly Phe Ser Pro Phe Leu Thr Asp Gln Asn Leu Ala
340 345 350 Asn Glu Asn
Pro Ile Glu Tyr Pro Glu Leu Phe Asn Glu Leu Ala Phe 355
360 365 Glu Asp Asn Ile Asp Phe Met Phe
Asp Asp Gly Lys His Glu Cys Leu 370 375
380 Asn Leu Glu Asn Leu Asp Cys Cys Val Val Gly Arg Glu
Ser Pro Pro 385 390 395
400 Ser Ser Ser Ser Pro Leu Ser Cys Leu Ser Thr Asp Ser Ala Ser Ser
405 410 415 Thr Thr Thr Thr
Thr Thr Ser Val Ser Cys Asn Tyr Leu Val 420
425 430 432004DNAArabidopsis thaliana 43ggtctactct
ttacatgttc tttactccgt ctcaaaattt cctttttttg ttggctctct 60ccgaacgagt
tggagaaatc gttaacccta atcgaagatc tagattcctc tacatacgtt 120tgatctctct
ctcagtatgg attacaaagc gccaaggaga tactactcac acggagttgt 180tgcgagacag
caagatttcg caacagatat agttacgaga agaagacctt atgtccctta 240cgaccgtcca
aataagtttt caaggagtct ggtttggacg tcaaaagagt acaaatcacc 300cgagggcaat
aatatgccaa ggaccaatga tgtgtcaccg aaaccaccag ttttaggttt 360ggcgaggaag
aatgctgctt gtgggccaat gagatcttct agtctcagaa aatgggtatg 420taagtattgg
aaagatggaa agtgcaagag gggtgagcag tgccagttct tacactcttg 480gtcttgtttc
cctggattgg ccatggtagc ttctcttgaa gggcacaata aggaactaaa 540ggggatcgct
ctccctgagg gttcagataa actcttttca gtcagtattg atggtacatt 600gcgagtttgg
gactgcaatt ctggtcagtg tgtacattcc atcaaccttg acgcagaagc 660agggtctcta
atcagtgaag gcccttgggt tttccttggc ttgccaaacg ctataaaggc 720ttttaacgtt
caaaccagtc aagatttgca tcttcaagca gcaggggtgg ttggtcaggt 780gaatgcaatg
actattgcaa acggaatgct ttttgctgga acaagttctg gtagtatctt 840agtctggaaa
gctactacag actctgagtc tgatccattc aaatacttga catctcttga 900gggacatagt
ggtgaagtca cttgttttgc tgttggaggt caaatgctat actctggttc 960tgtcgataaa
acaatcaaga tgtgggatct caacaccctg caatgtataa tgaccctgaa 1020gcaacatacc
ggcactgtca cttcactctt atgttgggat aaatgtttga tatcgtcttc 1080cttggatggg
accataaaag tttgggctta ttctgaaaac ggaatcttga aagttgttca 1140aactcgcaga
caagaacaga gtagtgttca tgctctttct ggtatgcatg atgcagaagc 1200caaaccgata
atattctgct cttaccaaaa cggaaccgtt ggcattttcg acctaccatc 1260ttttcaagaa
agaggaagga tgttctctac gcacacgatc gccacactca caattggtcc 1320tcaaggattg
ttattcagtg gagacgagag tggtaacttg cgtgtatgga ccttagctgc 1380tggcaacaaa
gtttagtctt ttcgactaaa gaattctgat ttaattttgt ggtttatatg 1440ttgagttaac
tgttaagaga gttttatttt gtaataggtg tatcagtcaa taaacaatct 1500ttgtatcaac
caaatgtaat ttttctcgtt aattcgattt cagagttttt actttaagat 1560aaacaaactc
tttcacacat catttaatga aagtggagaa gcttaaaaaa caaacaaaga 1620aactgatcca
tttttggcgg gtcttcttct actcttattc atatgtgtta acgaactata 1680gcgtaaaatt
cagagcaagc gatctccgat ttgaacgtgg ctatcaccgg aggcccacca 1740ctacgggcga
tacgctctaa gtgaggatta aagtgctctg gtggtgacgt tgaagaaact 1800cgcccatggt
ttttgttatc tctgcagcca agtgtcgttc tttcttcgcc acttctcatc 1860aagctacagt
gaatttaaaa atggcgtctt tctttgatct cgtatacata agctggattg 1920gtttcttaaa
caaattcctc tccttttggg tcttctgggt ttgccttgta agtgtttgtg 1980tttttgcctc
tgagaaaaaa tcgc
2004442790DNABrassica napusmisc_feature(2776)..(2776)n is a, c, g, or t
44ggccacttct catcatgtta cagggaccat aaaaatggcg tatttcttca gccccgggta
60taaatacaca catgatcctg tggtggttts ttccacaagt tacatctcct tctggttttt
120gtattgcaag tgtttgtgtt ttttgcctcc gagagaaaat catgccgacc ggtaggttcg
180agacgatgcg tgaatgggtc cacgacgcca tctctgctca acgcaatgag ctcctctctc
240ttttttccag atacgtagct caggggaaag ggatactgya gtcccaccag ctgattgacg
300agttcctcaa gactgtgaaa gtggatggaa ctacagaaga tcttaagaat cgtcccttca
360tgaaagttct gcagtctgca gaggaagcca tagttttgcc tccctttgtt gcsctggcga
420ttcgtcccag acctggtgtt agagaatatg tccgtgtgaa tgtctacgag ctgagcgtag
480accatttaac tgtttctgag tatcttcggt tcaaggaaga gctcgttaat ggccatgcca
540atgggaatta tcttctcgag cttgattttg aaccgttcaa cgcaacgttt cctcgtccaa
600ctcggtcatc atctattggg aatggggttc agttcctcaa ccgtcacctc tcgtcaatca
660tgttccgtaa caaagacagc ttggagcctt tgcttgagtt tctccgcact cacaaacatg
720acggccgtgc catgatgctg aatgatcgaa tacagaacat ccgcacactt caggaagctt
780tggcgagggc agaggagttc ctctctaaac ttcctttggc tacaccatac tctgaattcg
840aatttgract acaagggatg ggatttgaga ggggatgggg tgacacgkca cagaaggttt
900cagaaatggt gcatctmctt ctggacatac tccaggcacc tgatccttct gtcttggaga
960cgtttcttgg aaggattcct atggtgttca atgtygtkat tttgtctccg catggctact
1020ttggccaagc caatgtcttg ggtcttcctg atactggtgg acaggttgtc tacattcttg
1080atcaagtacg tgctttggaa agcgagatgc tcctyaggat acagaagcaa ggactggatg
1140ttactccaaa gattctcatt gtaacaaggt tgataccaga agcagaagga acaacatgca
1200accagaggtt agaaaargtw agcggtacag aacacrcaca tattctrcga ataccrtttm
1260ggactgaaaa gggcattctt cgcaagtgga tctcgaggtt tgatgtctgg ccatacctgg
1320agactttcgc agaggatgca tcaaatgaaa ttgctgcgga gttgcaaggt gtgccaaatc
1380tcatcattgg caactacagt gatgggaatc tcgtggcttc tttgttagct tgtaagctag
1440gcgtgataca gtgcaatatt gctcatgctt tggagaaaac caagtatcca gagtctgaca
1500tttactggag aaaccatgaa gataagtatc attttgcaag tcagttcact gcggacctaa
1560ttgccatgaa taatgctgat ttcatcatca ccagcacata ccaagagatc gctggaagca
1620aaaacaaagt tgggcaatac garagccaca cagctttcac ccttcctggt ctttacagag
1680ttgtkcatgg aatcaatgtc tttgatccca agtttaatat agtctctcca ggagctgata
1740tgaccatata cttyccwtat tctgacaagg aaagaagact aactgccctt catgagtcwa
1800ttgaagaact yctgtttagc agygaacaga atgttgagca tgttggtttt ctkagcgacc
1860agwygaagcc aatcattttc tccatggcca gacttgacag agtgaaaaac ttgactgggc
1920tagttgagtg ctatgccaas aacrgcaasc tgagagaggt tgcgaaccty sttgtastwg
1980gtggctacgt ggacgtgaat cagtccaggg acagagagga aatggctgag atacaaaaga
2040tgcacagcct ratcaagcag tatggtttac acggtgagtt caggtggata gctgctcaaa
2100tgaaccgtgc tmggaacggt gagctttacc gttatatcgc agacacwaaa ggtgtttttg
2160ttcagcctgc tttctatgaa gcktttgggc tcacagttgt ggaatcaatg acttgtgggc
2220tcccaacgtt tgctacatgt catggtggac ctgcggagat catcgagaat ggagtttctg
2280gcttccacat cgacccwtat catccagaac agsttgcaac tactttggtc agcttcttyg
2340agacctgcaa cgctgatcca agtcactggg agaaaatctc tgatggaggg cttaagcgaa
2400tctatgaaag gtacacatgg aagaagtact cagagaggct gcttacgctg gctggtgtct
2460attcattctg gaaacatgtg tctaagcttg aaaggagaga aacacgacgt tacctagaga
2520tgttttactc tctcaagtat cgtgatctgg ccaattcaat cccactggca actgatgagc
2580attgagcaag ctatggttgg attctaatac ttgctgcact ccctgttgtg tgtttctgtt
2640atctttgaat aaataagcta ttgtcggctt ttgtttccat gactagtttg gttttcagac
2700ttttcctgtt gttttcttga tatgaataac aagtatcgtt gagttctaag ctcggcatta
2760aataacttgt cgtgtnggaa agcttactga
279045807PRTBrassica napusmisc_feature(40)..(40)Xaa can be any naturally
occurring amino acid 45Met Pro Thr Gly Arg Phe Glu Thr Met Arg Glu Trp
Val His Asp Ala 1 5 10
15 Ile Ser Ala Gln Arg Asn Glu Leu Leu Ser Leu Phe Ser Arg Tyr Val
20 25 30 Ala Gln Gly
Lys Gly Ile Leu Xaa Ser His Gln Leu Ile Asp Glu Phe 35
40 45 Leu Lys Thr Val Lys Val Asp Gly
Thr Thr Glu Asp Leu Lys Asn Arg 50 55
60 Pro Phe Met Lys Val Leu Gln Ser Ala Glu Glu Ala Ile
Val Leu Pro 65 70 75
80 Pro Phe Val Ala Leu Ala Ile Arg Pro Arg Pro Gly Val Arg Glu Tyr
85 90 95 Val Arg Val Asn
Val Tyr Glu Leu Ser Val Asp His Leu Thr Val Ser 100
105 110 Glu Tyr Leu Arg Phe Lys Glu Glu Leu
Val Asn Gly His Ala Asn Gly 115 120
125 Asn Tyr Leu Leu Glu Leu Asp Phe Glu Pro Phe Asn Ala Thr
Phe Pro 130 135 140
Arg Pro Thr Arg Ser Ser Ser Ile Gly Asn Gly Val Gln Phe Leu Asn 145
150 155 160 Arg His Leu Ser Ser
Ile Met Phe Arg Asn Lys Asp Ser Leu Glu Pro 165
170 175 Leu Leu Glu Phe Leu Arg Thr His Lys His
Asp Gly Arg Ala Met Met 180 185
190 Leu Asn Asp Arg Ile Gln Asn Ile Arg Thr Leu Gln Glu Ala Leu
Ala 195 200 205 Arg
Ala Glu Glu Phe Leu Ser Lys Leu Pro Leu Ala Thr Pro Tyr Ser 210
215 220 Glu Phe Glu Phe Xaa Leu
Gln Gly Met Gly Phe Glu Arg Gly Trp Gly 225 230
235 240 Asp Thr Xaa Gln Lys Val Ser Glu Met Val His
Leu Leu Leu Asp Ile 245 250
255 Leu Gln Ala Pro Asp Pro Ser Val Leu Glu Thr Phe Leu Gly Arg Ile
260 265 270 Pro Met
Val Phe Asn Val Val Ile Leu Ser Pro His Gly Tyr Phe Gly 275
280 285 Gln Ala Asn Val Leu Gly Leu
Pro Asp Thr Gly Gly Gln Val Val Tyr 290 295
300 Ile Leu Asp Gln Val Arg Ala Leu Glu Ser Glu Met
Leu Leu Arg Ile 305 310 315
320 Gln Lys Gln Gly Leu Asp Val Thr Pro Lys Ile Leu Ile Val Thr Arg
325 330 335 Leu Ile Pro
Glu Ala Glu Gly Thr Thr Cys Asn Gln Arg Leu Glu Lys 340
345 350 Val Ser Gly Thr Glu His Xaa His
Ile Leu Arg Ile Pro Phe Arg Thr 355 360
365 Glu Lys Gly Ile Leu Arg Lys Trp Ile Ser Arg Phe Asp
Val Trp Pro 370 375 380
Tyr Leu Glu Thr Phe Ala Glu Asp Ala Ser Asn Glu Ile Ala Ala Glu 385
390 395 400 Leu Gln Gly Val
Pro Asn Leu Ile Ile Gly Asn Tyr Ser Asp Gly Asn 405
410 415 Leu Val Ala Ser Leu Leu Ala Cys Lys
Leu Gly Val Ile Gln Cys Asn 420 425
430 Ile Ala His Ala Leu Glu Lys Thr Lys Tyr Pro Glu Ser Asp
Ile Tyr 435 440 445
Trp Arg Asn His Glu Asp Lys Tyr His Phe Ala Ser Gln Phe Thr Ala 450
455 460 Asp Leu Ile Ala Met
Asn Asn Ala Asp Phe Ile Ile Thr Ser Thr Tyr 465 470
475 480 Gln Glu Ile Ala Gly Ser Lys Asn Lys Val
Gly Gln Tyr Glu Ser His 485 490
495 Thr Ala Phe Thr Leu Pro Gly Leu Tyr Arg Val Val His Gly Ile
Asn 500 505 510 Val
Phe Asp Pro Lys Phe Asn Ile Val Ser Pro Gly Ala Asp Met Thr 515
520 525 Ile Tyr Phe Pro Tyr Ser
Asp Lys Glu Arg Arg Leu Thr Ala Leu His 530 535
540 Glu Ser Ile Glu Glu Leu Leu Phe Ser Ser Glu
Gln Asn Val Glu His 545 550 555
560 Val Gly Phe Leu Ser Asp Gln Xaa Lys Pro Ile Ile Phe Ser Met Ala
565 570 575 Arg Leu
Asp Arg Val Lys Asn Leu Thr Gly Leu Val Glu Cys Tyr Ala 580
585 590 Xaa Asn Xaa Xaa Leu Arg Glu
Val Ala Asn Leu Xaa Val Xaa Gly Gly 595 600
605 Tyr Val Asp Val Asn Gln Ser Arg Asp Arg Glu Glu
Met Ala Glu Ile 610 615 620
Gln Lys Met His Ser Leu Ile Lys Gln Tyr Gly Leu His Gly Glu Phe 625
630 635 640 Arg Trp Ile
Ala Ala Gln Met Asn Arg Ala Arg Asn Gly Glu Leu Tyr 645
650 655 Arg Tyr Ile Ala Asp Thr Lys Gly
Val Phe Val Gln Pro Ala Phe Tyr 660 665
670 Glu Ala Phe Gly Leu Thr Val Val Glu Ser Met Thr Cys
Gly Leu Pro 675 680 685
Thr Phe Ala Thr Cys His Gly Gly Pro Ala Glu Ile Ile Glu Asn Gly 690
695 700 Val Ser Gly Phe
His Ile Asp Pro Tyr His Pro Glu Gln Xaa Ala Thr 705 710
715 720 Thr Leu Val Ser Phe Phe Glu Thr Cys
Asn Ala Asp Pro Ser His Trp 725 730
735 Glu Lys Ile Ser Asp Gly Gly Leu Lys Arg Ile Tyr Glu Arg
Tyr Thr 740 745 750
Trp Lys Lys Tyr Ser Glu Arg Leu Leu Thr Leu Ala Gly Val Tyr Ser
755 760 765 Phe Trp Lys His
Val Ser Lys Leu Glu Arg Arg Glu Thr Arg Arg Tyr 770
775 780 Leu Glu Met Phe Tyr Ser Leu Lys
Tyr Arg Asp Leu Ala Asn Ser Ile 785 790
795 800 Pro Leu Ala Thr Asp Glu His 805
4626DNAArtificial sequencePrimer a 46ccttgcaaaa cttaagatca aaagtc
264726DNAArtificial sequencePrimer
b 47ctatagatgg gatgaagctg ctctcg
264824DNAArtificial sequencePrimer c 48agagaggagc tcattgcgtt gagc
244924DNAArtificial sequencePrimer d
49cccattcacg catcgtctcg aacc
24502234DNABrassica napusmisc_feature(859)..(859)n is a, c, g, or t
50ctatagatgg gatgaagctg ctctcgacaa atctgataaa actaaagaag gttagtaatc
60aatttttaca aaatcataga ttattttttt cattgaatta tttttatgct ataccaagaa
120ttgtatttta gtatttgttt taactacata taatagaatt aactacatat aaattaacta
180aacttaaaat aaaaatagat ttgtttcctg aaattatttt aagaatatat atgtatatat
240ctaaaatctt agacttagat agatttttct atctatctat tttggttact taaaataaat
300aaatttgtat aaataattgt atagttatca aaaattaaaa ctaatttttt taaagttgtt
360gatatataaa atactaaaga tttaacgatt aagtatttat ttaagtatag aattttgttt
420tttttttaag tttagttatg aagttgttaa ttatattaaa acaaaacaat atttcgaaat
480tttattatca tattcgaata tatttttttt agtgatgatg tatgaattat tatcataatt
540tgaaagttta ctaaaaaata tatcaacatg aattgtaata tatgagttat taccttaacc
600aaaattataa attaacatta aatataatta tatatgtcat atttagccat acaatgtgtc
660atcaatatta atagtcatgt caatattaca taatgccaat attatgctac ttaaacccca
720aatcccctaa ctcccgttaa gtagccaaat tcataaatat acttattcga caaaataaaa
780aactttaaaa tatttactaa tccgaccatg cacaagcatc cattccctat tccattgcca
840cgggataaca atgcaaccna ctcctcaaaa aaagaaaaat tcaagctctt ttgcaaaaaa
900aaataaaata attttaacac ctaaaatttt ttgtttccaa acttctacag ggaacacaca
960taaaagaaaa agaggacgtc cactcggatc acgcaacaaa ccaaaaggtg tgtcatgact
1020cctaagatat aatatttcct tattcaaaat cataccattt taaattatga atgtatttcg
1080tagtccacca gatatgtaat ccaccagcgt tcaaaccaaa gttttatgat tgtaagttta
1140agtgaattat aataatatat tcttcacggt atcttttcat aactaattga gttatcaaac
1200ttgatcgcac atgtggcttt gataggtgtg acttttatgg tatacaattc tttcaaccta
1260aaaacattat tgttcctcaa tatcttacat tatgcttgac tgcaacaaaa tattttctca
1320tctgttttct tcctttaaac caatttatta tcatctattt cctgacattt taatccatcc
1380acctatgtca aaaacttata gaaaatgtca acttccaaac aaaacataat tgaacttcgc
1440aaataaattc ttaataatat taaaaaatgt tacttaatta tttcttcaac cccattttcc
1500gcgcgtagcg cggacaaaga ctctagttaa atatagaagt ttccgattct catcgtataa
1560aacggtgact ttggcgggct ttcatgtgta acaaattggt ttaacaaacc actgcctagt
1620cgtttagtgt agaatcagcg catggaactc cgattggagc gtgactttca cgtgccggag
1680gcccaccacc acagcgggcg ttacgctcta agaatctcgc ccacggtttt cttcatctgc
1740cccccgccaa gtgtcttcct cgttcgccac ttctcaccaa gttacaggaa ccctaaaaat
1800ggcctttctt cagccccggc tataatacac acatgatcct atagtgggtt cttccacaag
1860ttacatctcc ttctggattg tacatttcaa gtgtttgtgt tttttctgcc tctgagagaa
1920aatcatgccg acgggtaggt tcgagacgat gcgtgaatgg gttcacgacg ccatctctgc
1980tcaacgcaat gagctcctct ctctcttttc caggtatctc tctctctctt actgaatatg
2040cgttacatat ataagttcag tacatgcatt gtcactttgt caactttcaa cagttgagag
2100tagagcatgt taaaaaaaaa agttagttcg ttttacttgc atgtgtgttg tggttagtct
2160caggaggagt aatgctttgg tttgctatgt ttagatacgt agctcagggg aaagggatac
2220tgcagtccca ccag
223451632DNABrassica napus 51ccaatttatt atcatctatt tcctgacatt ttaatccatc
cacctatgtc aaaaacttat 60agaaaatgtc aacttccaaa caaaacataa ttgaacttcg
caaataaatt cttaataata 120ttaaaaaatg ttacttaatt atttcttcaa ccccattttc
cgcgcgtagc gcggacaaag 180actctagtta aatatagaag tttccgattc tcatcgtata
aaacggtgac tttggcgggc 240tttcatgtgt aacaaattgg tttaacaaac cactgcctag
tcgtttagtg tagaatcagc 300gcatggaact ccgattggag cgtgactttc acgtrccgga
ggcccaccac cwcagcgggc 360gttacgctct aagaatctcg cccacggttt tcttcatctc
ccccccgcca agtgtctccc 420tcgttcgcca cttctcatca tgttacaggg accataaaaa
tggcgtattt cttcagcccc 480gggtataaat acacacatga tcctgtggtg ggttcttcca
caagttacat ctccttctgg 540tttttgtatt gcaagtgttt gtattttttg cctccgagag
aaaatcatgc cgaccggtag 600gttcgagacg atgcgtgaat gggcctgaat tc
6325235DNAArtificial sequencePrimer SA188
52ggcgcgcccc aatttattat catctatttc ctgac
355334DNAArtificial sequencePrimer SA189 53gcggccgcga ttttctctcg
gaggcaaaaa atac 345436DNAArtificial
sequencePrimer SA190 54ggcgcgccct atagatggga tgaagctgct ctcgac
365536DNAArtificial sequencePrimer SA191 55gcggccgcga
ttttctctca gaggcagaaa aaacac
36564114DNAArtificial sequencePlasmid BN SUS2 prom1/PCR blunt
56cctgaattct gcagatatcc atcacactgg cggccgctcg agcatgcatc tagagggccc
60aattcgccct atagtgagtc gtattacaat tcactggccg tcgttttaca acgtcgtgac
120tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc
180tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctatac
240gtacggcagt ttaaggttta cacctataaa agagagagcc gttatcgtct gtttgtggat
300gtacagagtg atattattga cacgccgggg cgacggatgg tgatccccct ggccagtgca
360cgtctgctgt cagataaagt ctcccgtgaa ctttacccgg tggtgcatat cggggatgaa
420agctggcgca tgatgaccac cgatatggcc agtgtgccgg tctccgttat cggggaagaa
480gtggctgatc tcagccaccg cgaaaatgac atcaaaaacg ccattaacct gatgttctgg
540ggaatataaa tgtcaggcat gagattatca aaaaggatct tcacctagat ccttttcacg
600tagaaagcca gtccgcagaa acggtgctga ccccggatga atgtcagcta ctgggctatc
660tggacaaggg aaaacgcaag cgcaaagaga aagcaggtag cttgcagtgg gcttacatgg
720cgatagctag actgggcggt tttatggaca gcaagcgaac cggaattgcc agctggggcg
780ccctctggta aggttgggaa gccctgcaaa gtaaactgga tggctttctt gccgccaagg
840atctgatggc gcaggggatc aagctctgat caagagacag gatgaggatc gtttcgcatg
900attgaacaag atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc
960tatgactggg cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg
1020caggggcgcc cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcaa
1080gacgaggcag cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgtgctc
1140gacgttgtca ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat
1200ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg
1260cggctgcata cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acatcgcatc
1320gagcgagcac gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag
1380catcaggggc tcgcgccagc cgaactgttc gccaggctca aggcgagcat gcccgacggc
1440gaggatctcg tcgtgaccca tggcgatgcc tgcttgccga atatcatggt ggaaaatggc
1500cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata
1560gcgttggcta cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc
1620gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac
1680gagttcttct gaattattaa cgcttacaat ttcctgatgc ggtattttct ccttacgcat
1740ctgtgcggta tttcacaccg catcaggtgg cacttttcgg ggaaatgtgc gcggaacccc
1800tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg
1860ataaatgctt caataatagc acgtgaggag ggccaccatg gccaagttga ccagtgccgt
1920tccggtgctc accgcgcgcg acgtcgccgg agcggtcgag ttctggaccg accggctcgg
1980gttctcccgg gacttcgtgg aggacgactt cgccggtgtg gtccgggacg acgtgaccct
2040gttcatcagc gcggtccagg accaggtggt gccggacaac accctggcct gggtgtgggt
2100gcgcggcctg gacgagctgt acgccgagtg gtcggaggtc gtgtccacga acttccggga
2160cgcctccggg ccggccatga ccgagatcgg cgagcagccg tgggggcggg agttcgccct
2220gcgcgacccg gccggcaact gcgtgcactt cgtggccgag gagcaggact gacacgtgct
2280aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac
2340caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa
2400aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc
2460accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt
2520aactggcttc agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagg
2580ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc
2640agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt
2700accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga
2760gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct
2820tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg
2880cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca
2940cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa
3000cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt
3060ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga
3120taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga
3180gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca
3240cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct
3300cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat
3360tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg ccaagctatt
3420taggtgacgc gttagaatac tcaagctatg catcaagctt ggtaccgagc tcggatccac
3480tagtaacggc cgccagtgtg ctggaattca ggggcgcgcc ccaatttatt atcatctatt
3540tcctgacatt ttaatccatc cacctatgtc aaaaacttat agaaaatgtc aacttccaaa
3600caaaacataa ttgaacttcg caaataaatt cttaataata ttaaaaaatg ttacttaatt
3660atttcttcaa ccccattttc cgcgcgtagc gcggacaaag actctagtta aatatagaag
3720tttccgattc tcatcgtata aaacggtgac tttggcgggc tttcatgtgt aacaaattgg
3780tttaacaaac cactgcctag tcgtttagtg tagaatcagc gcatggaact ccgattggag
3840cgtgactttc acgtgccgga ggcccaccac cacagcgggc gttacgctct aagaatctcg
3900cccacggttt tcttcatctc ccccccgcca agtgtctccc tcgttcgcca cttctcatca
3960tgttacaggg accataaaaa tggcgtattt cttcagcccc gggtataaat acacacatga
4020tcctgtggtg ggttcttcca caagttacat ctccttctgg tttttgtatt gcaagtgttt
4080gtattttttg cctccgagag aaaatcgcgg ccgc
4114575452DNAArtificial sequencePlasmid BN SUS2 prom2/PCR blunt
57cctgaattct gcagatatcc atcacactgg cggccgctcg agcatgcatc tagagggccc
60aattcgccct atagtgagtc gtattacaat tcactggccg tcgttttaca acgtcgtgac
120tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc
180tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctatac
240gtacggcagt ttaaggttta cacctataaa agagagagcc gttatcgtct gtttgtggat
300gtacagagtg atattattga cacgccgggg cgacggatgg tgatccccct ggccagtgca
360cgtctgctgt cagataaagt ctcccgtgaa ctttacccgg tggtgcatat cggggatgaa
420agctggcgca tgatgaccac cgatatggcc agtgtgccgg tctccgttat cggggaagaa
480gtggctgatc tcagccaccg cgaaaatgac atcaaaaacg ccattaacct gatgttctgg
540ggaatataaa tgtcaggcat gagattatca aaaaggatct tcacctagat ccttttcacg
600tagaaagcca gtccgcagaa acggtgctga ccccggatga atgtcagcta ctgggctatc
660tggacaaggg aaaacgcaag cgcaaagaga aagcaggtag cttgcagtgg gcttacatgg
720cgatagctag actgggcggt tttatggaca gcaagcgaac cggaattgcc agctggggcg
780ccctctggta aggttgggaa gccctgcaaa gtaaactgga tggctttctt gccgccaagg
840atctgatggc gcaggggatc aagctctgat caagagacag gatgaggatc gtttcgcatg
900attgaacaag atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc
960tatgactggg cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg
1020caggggcgcc cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcaa
1080gacgaggcag cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgtgctc
1140gacgttgtca ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat
1200ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg
1260cggctgcata cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acatcgcatc
1320gagcgagcac gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag
1380catcaggggc tcgcgccagc cgaactgttc gccaggctca aggcgagcat gcccgacggc
1440gaggatctcg tcgtgaccca tggcgatgcc tgcttgccga atatcatggt ggaaaatggc
1500cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata
1560gcgttggcta cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc
1620gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac
1680gagttcttct gaattattaa cgcttacaat ttcctgatgc ggtattttct ccttacgcat
1740ctgtgcggta tttcacaccg catcaggtgg cacttttcgg ggaaatgtgc gcggaacccc
1800tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg
1860ataaatgctt caataatagc acgtgaggag ggccaccatg gccaagttga ccagtgccgt
1920tccggtgctc accgcgcgcg acgtcgccgg agcggtcgag ttctggaccg accggctcgg
1980gttctcccgg gacttcgtgg aggacgactt cgccggtgtg gtccgggacg acgtgaccct
2040gttcatcagc gcggtccagg accaggtggt gccggacaac accctggcct gggtgtgggt
2100gcgcggcctg gacgagctgt acgccgagtg gtcggaggtc gtgtccacga acttccggga
2160cgcctccggg ccggccatga ccgagatcgg cgagcagccg tgggggcggg agttcgccct
2220gcgcgacccg gccggcaact gcgtgcactt cgtggccgag gagcaggact gacacgtgct
2280aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac
2340caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa
2400aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc
2460accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt
2520aactggcttc agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagg
2580ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc
2640agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt
2700accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga
2760gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct
2820tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg
2880cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca
2940cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa
3000cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt
3060ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga
3120taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga
3180gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca
3240cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct
3300cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat
3360tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg ccaagctatt
3420taggtgacgc gttagaatac tcaagctatg catcaagctt ggtaccgagc tcggatccac
3480tagtaacggc cgccagtgtg ctggaattca ggggcgcgcc ctatagatgg gatgaagctg
3540ctctcgacaa atctgataaa actaaagaag gttagtaatc aatttttaca aaatcataga
3600ttattttttt cattgaatta tttttatgct ataccaagaa ttgtatttta gtatttgttt
3660taactacata taatagaatt aactacatat aaattaacta aacttaaaat aaaaatagat
3720ttgtttcctg aaattatttt aagaatatat atgtatatat ctaaaatctt agacttagat
3780agatttttct atctatctat tttggttact taaaataaat aaatttgtat aaataattgt
3840atagttatca aaaattaaaa ctaatttttt taaagttgtt gatatataaa atactaaaga
3900tttaacgatt aagtatttat ttaagtatag aattttgttt tttttttaag tttagttatg
3960aagttgttaa ttatattaaa acaaaacaat atttcgaaat tttattatca tattcgaata
4020tatttttttt agtgatgatg tatgaattat tatcataatt tgaaagttta ctaaaaaata
4080tatcaacatg aattgtaata tatgagttat taccttaacc aaaattataa attaacatta
4140aatataatta tatatgtcat atttagccat acaatgtgtc atcaatatta atagtcatgt
4200caatattaca taatgccaat attatgctac ttaaacccca aatcccctaa ctcccgttaa
4260gtagccaaat tcataaatat acttattcga caaaataaaa aactttaaaa tatttactaa
4320tccgaccatg cacaagcatc cattccctat tccattgcca cgggataaca atgcaaccna
4380ctcctcaaaa aaagaaaaat tcaagctctt ttgcaaaaaa aaataaaata attttaacac
4440ctaaaatttt ttgtttccaa acttctacag ggaacacaca taaaagaaaa agaggacgtc
4500cactcggatc acgcaacaaa ccaaaaggtg tgtcatgact cctaagatat aatatttcct
4560tattcaaaat cataccattt taaattatga atgtatttcg tagtccacca gatatgtaat
4620ccaccagcgt tcaaaccaaa gttttatgat tgtaagttta agtgaattat aataatatat
4680tcttcacggt atcttttcat aactaattga gttatcaaac ttgatcgcac atgtggcttt
4740gataggtgtg acttttatgg tatacaattc tttcaaccta aaaacattat tgttcctcaa
4800tatcttacat tatgcttgac tgcaacaaaa tattttctca tctgttttct tcctttaaac
4860caatttatta tcatctattt cctgacattt taatccatcc acctatgtca aaaacttata
4920gaaaatgtca acttccaaac aaaacataat tgaacttcgc aaataaattc ttaataatat
4980taaaaaatgt tacttaatta tttcttcaac cccattttcc gcgcgtagcg cggacaaaga
5040ctctagttaa atatagaagt ttccgattct catcgtataa aacggtgact ttggcgggct
5100ttcatgtgta acaaattggt ttaacaaacc actgcctagt cgtttagtgt agaatcagcg
5160catggaactc cgattggagc gtgactttca cgtgccggag gcccaccacc acagcgggcg
5220ttacgctcta agaatctcgc ccacggtttt cttcatctgc cccccgccaa gtgtcttcct
5280cgttcgccac ttctcaccaa gttacaggaa ccctaaaaat ggcctttctt cagccccggc
5340tataatacac acatgatcct atagtgggtt cttccacaag ttacatctcc ttctggattg
5400tacatttcaa gtgtttgtgt tttttctgcc tctgagagaa aatcgcggcc gc
5452588227DNAArtificial sequenceVector KS427 58ctagagggcc caattcgccc
tatagtgagt cgtattacaa ttcactggcc gtcgttttac 60aacgtcgtga ctgggaaaac
cctggcgtta cccaacttaa tcgccttgca gcacatcccc 120ctttcgccag ctggcgtaat
agcgaagagg cccgcaccga tcgcccttcc caacagttgc 180gcagcctata cgtacggcag
tttaaggttt acacctataa aagagagagc cgttatcgtc 240tgtttgtgga tgtacagagt
gatattattg acacgccggg gcgacggatg gtgatccccc 300tggccagtgc acgtctgctg
tcagataaag tctcccgtga actttacccg gtggtgcata 360tcggggatga aagctggcgc
atgatgacca ccgatatggc cagtgtgccg gtctccgtta 420tcggggaaga agtggctgat
ctcagccacc gcgaaaatga catcaaaaac gccattaacc 480tgatgttctg gggaatataa
atgtcaggca tgagattatc aaaaaggatc ttcacctaga 540tccttttcac gtagaaagcc
agtccgcaga aacggtgctg accccggatg aatgtcagct 600actgggctat ctggacaagg
gaaaacgcaa gcgcaaagag aaagcaggta gcttgcagtg 660ggcttacatg gcgatagcta
gactgggcgg ttttatggac agcaagcgaa ccggaattgc 720cagctggggc gccctctggt
aaggttggga agccctgcaa agtaaactgg atggctttct 780tgccgccaag gatctgatgg
cgcaggggat caagctctga tcaagagaca ggatgaggat 840cgtttcgcat gattgaacaa
gatggattgc acgcaggttc tccggccgct tgggtggaga 900ggctattcgg ctatgactgg
gcacaacaga caatcggctg ctctgatgcc gccgtgttcc 960ggctgtcagc gcaggggcgc
ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga 1020atgaactgca agacgaggca
gcgcggctat cgtggctggc cacgacgggc gttccttgcg 1080cagctgtgct cgacgttgtc
actgaagcgg gaagggactg gctgctattg ggcgaagtgc 1140cggggcagga tctcctgtca
tctcaccttg ctcctgccga gaaagtatcc atcatggctg 1200atgcaatgcg gcggctgcat
acgcttgatc cggctacctg cccattcgac caccaagcga 1260aacatcgcat cgagcgagca
cgtactcgga tggaagccgg tcttgtcgat caggatgatc 1320tggacgaaga gcatcagggg
ctcgcgccag ccgaactgtt cgccaggctc aaggcgagca 1380tgcccgacgg cgaggatctc
gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg 1440tggaaaatgg ccgcttttct
ggattcatcg actgtggccg gctgggtgtg gcggaccgct 1500atcaggacat agcgttggct
acccgtgata ttgctgaaga gcttggcggc gaatgggctg 1560accgcttcct cgtgctttac
ggtatcgccg ctcccgattc gcagcgcatc gccttctatc 1620gccttcttga cgagttcttc
tgaattatta acgcttacaa tttcctgatg cggtattttc 1680tccttacgca tctgtgcggt
atttcacacc gcatcaggtg gcacttttcg gggaaatgtg 1740cgcggaaccc ctatttgttt
atttttctaa atacattcaa atatgtatcc gctcatgaga 1800caataaccct gataaatgct
tcaataatag cacgtgagga gggccaccat ggccaagttg 1860accagtgccg ttccggtgct
caccgcgcgc gacgtcgccg gagcggtcga gttctggacc 1920gaccggctcg ggttctcccg
ggacttcgtg gaggacgact tcgccggtgt ggtccgggac 1980gacgtgaccc tgttcatcag
cgcggtccag gaccaggtgg tgccggacaa caccctggcc 2040tgggtgtggg tgcgcggcct
ggacgagctg tacgccgagt ggtcggaggt cgtgtccacg 2100aacttccggg acgcctccgg
gccggccatg accgagatcg gcgagcagcc gtgggggcgg 2160gagttcgccc tgcgcgaccc
ggccggcaac tgcgtgcact tcgtggccga ggagcaggac 2220tgacacgtgc taaaacttca
tttttaattt aaaaggatct aggtgaagat cctttttgat 2280aatctcatga ccaaaatccc
ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 2340gaaaagatca aaggatcttc
ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 2400acaaaaaaac caccgctacc
agcggtggtt tgtttgccgg atcaagagct accaactctt 2460tttccgaagg taactggctt
cagcagagcg cagataccaa atactgttct tctagtgtag 2520ccgtagttag gccaccactt
caagaactct gtagcaccgc ctacatacct cgctctgcta 2580atcctgttac cagtggctgc
tgccagtggc gataagtcgt gtcttaccgg gttggactca 2640agacgatagt taccggataa
ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 2700cccagcttgg agcgaacgac
ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 2760agcgccacgc ttcccgaagg
gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 2820acaggagagc gcacgaggga
gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 2880gggtttcgcc acctctgact
tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 2940ctatggaaaa acgccagcaa
cgcggccttt ttacggttcc tggccttttg ctggcctttt 3000gctcacatgt tctttcctgc
gttatcccct gattctgtgg ataaccgtat taccgccttt 3060gagtgagctg ataccgctcg
ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 3120gaagcggaag agcgcccaat
acgcaaaccg cctctccccg cgcgttggcc gattcattaa 3180tgcagctggc acgacaggtt
tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 3240gtgagttagc tcactcatta
ggcaccccag gctttacact ttatgcttcc ggctcgtatg 3300ttgtgtggaa ttgtgagcgg
ataacaattt cacacaggaa acagctatga ccatgattac 3360gccaagctat ttaggtgacg
cgttagaata ctcaagctat gcatcaagct tggtaccgag 3420ctcggatcca ctagtaacgg
ccgccagtgt gctggaattc aggggcgcgc cccaatttat 3480tatcatctat ttcctgacat
tttaatccat ccacctatgt caaaaactta tagaaaatgt 3540caacttccaa acaaaacata
attgaacttc gcaaataaat tcttaataat attaaaaaat 3600gttacttaat tatttcttca
accccatttt ccgcgcgtag cgcggacaaa gactctagtt 3660aaatatagaa gtttccgatt
ctcatcgtat aaaacggtga ctttggcggg ctttcatgtg 3720taacaaattg gtttaacaaa
ccactgccta gtcgtttagt gtagaatcag cgcatggaac 3780tccgattgga gcgtgacttt
cacgtgccgg aggcccacca ccacagcggg cgttacgctc 3840taagaatctc gcccacggtt
ttcttcatct cccccccgcc aagtgtctcc ctcgttcgcc 3900acttctcatc atgttacagg
gaccataaaa atggcgtatt tcttcagccc cgggtataaa 3960tacacacatg atcctgtggt
gggttcttcc acaagttaca tctccttctg gtttttgtat 4020tgcaagtgtt tgtatttttt
gcctccgaga gaaaatcgcg gccgcaagta tgaactaaaa 4080tgcatgtagg tgtaagagct
catggagagc atggaatatt gtatccgacc atgtaacagt 4140ataataactg agctccatct
cacttcttct atgaataaac aaaggatgtt atgatatatt 4200aacactctat ctatgcacct
tattgttcta tgataaattt cctcttatta ttataaatca 4260tctgaatcgt gacggcttat
ggaatgcttc aaatagtaca aaaacaaatg tgtactataa 4320gactttctaa acaattctaa
ccttagcatt gtgaacgaga cataagtgtt aagaagacat 4380aacaattata atggaagaag
tttgtctcca tttatatatt atatattacc cacttatgta 4440ttatattagg atgttaagga
gacataacaa ttataaagag agaagtttgt atccatttat 4500atattatata ctacccattt
atatattata cttatccact tatttaatgt ctttataagg 4560tttgatccat gatatttcta
atattttagt tgatatgtat atgaaagggt actatttgaa 4620ctctcttact ctgtataaag
gttggatcat ccttaaagtg ggtctattta attttattgc 4680ttcttacaga taaaaaaaaa
attatgagtt ggtttgataa aatattgaag gatttaaaat 4740aataataaat aacatataat
atatgtatat aaatttatta taatataaca tttatctata 4800aaaaagtaaa tattgtcata
aatctataca atcgtttagc cttgctggac gaatctcaat 4860tatttaaacg agagtaaaca
tatttgactt tttggttatt taacaaatta ttatttaaca 4920ctatatgaaa tttttttttt
tatcagcaaa gaataaaatt aaattaagaa ggacaatggt 4980gtcccaatcc ttatacaacc
aacttccaca agaaagtcaa gtcagagaca acaaaaaaac 5040aagcaaagga aattttttaa
tttgagttgt cttgtttgct gcataattta tgcagtaaaa 5100cactacacat aaccctttta
gcagtagagc aatggttgac cgtgtgctta gcttctttta 5160ttttattttt ttatcagcaa
agaataaata aaataaaatg agacacttca gggatgtttc 5220aacaagcttg gatcctcgaa
gagaagggtt aataacacac ttttttaaca tttttaacac 5280aaattttagt tatttaaaaa
tttattaaaa aatttaaaat aagaagagga actctttaaa 5340taaatctaac ttacaaaatt
tatgattttt aataagtttt caccaataaa aaatgtcata 5400aaaatatgtt aaaaagtata
ttatcaatat tctctttatg ataaataaaa agaaaaaaaa 5460aataaaagtt aagtgaaaat
gagattgaag tgactttagg tgtgtataaa tatatcaacc 5520ccgccaacaa tttatttaat
ccaaatatat tgaagtatat tattccatag cctttattta 5580tttatatatt tattatataa
aagctttatt tgttctaggt tgttcatgaa atattttttt 5640ggttttatct ccgttgtaag
aaaatcatgt gctttgtgtc gccactcact attgcagctt 5700tttcatgcat tggtcagatt
gacggttgat tgtatttttg ttttttatgg ttttgtgtta 5760tgacttaagt cttcatctct
ttatctcttc atcaggtttg atggttacct aatatggtcc 5820atgggtacat gcatggttaa
attaggtggc caactttgtt gtgaacgata gaattttttt 5880tatattaagt aaactatttt
tatattatga aataataata aaaaaaatat tttatcatta 5940ttaacaaaat catattagtt
aatttgttaa ctctataata aaagaaatac tgtaacattc 6000acattacatg gtaacatctt
tccacccttt catttgtttt ttgtttgatg actttttttc 6060ttgtttaaat ttatttccct
tcttttaaat ttggaataca ttatcatcat atataaacta 6120aaatactaaa aacaggatta
cacaaatgat aaataataac acaaatattt ataaatctag 6180ctgcaatata tttaaactag
ctatatcgat attgtaaaat aaaactagct gcattgatac 6240tgataaaaaa atatcatgtg
ctttctggac tgatgatgca gtatactttt gacattgcct 6300ttattttatt tttcagaaaa
gctttcttag ttctgggttc ttcattattt gtttcccatc 6360tccattgtga attgaatcat
ttgcttcgtg tcacaaatac aatttagnta ggtacatgca 6420ttggtcagat tcacggttta
ttatgtcatg acttaagttc atggtagtac attacctgcc 6480acgcatgcat tatattggtt
agatttgata ggcaaatttg gttgtcaaca atataaatat 6540aaataatgtt tttatattac
gaaataacag tgatcaaaac aaacagtttt atctttatta 6600acaagatttt gtttttgttt
gatgacgttt tttaatgttt acgctttccc ccttcttttg 6660aatttagaac actttatcat
cataaaatca aatactaaaa aaattacata tttcataaat 6720aataacacaa atatttttaa
aaaatctgaa ataataatga acaatattac atattatcac 6780gaaaattcat taataaaaat
attatataaa taaaatgtaa tagtagttat atgtaggaaa 6840aaagtactgc acgcataata
tatacaaaaa gattaaaatg aactattata aataataaca 6900ctaaattaat ggtgaatcat
atcaaaataa tgaaaaagta aataaaattt gtaattaact 6960tctatatgta ttacacacac
aaataataaa taatagtaaa aaaaattatg ataaatattt 7020accatctcat aagatattta
aaataatgat aaaaatatag attatttttt atgcaactag 7080ctagccaaaa agagaacacg
ggtatatata aaaagagtac ctttaaattc tactgtactt 7140cctttattcc tgacgttttt
atatcaagtg gacatacgtg aagattttaa ttatcagtct 7200aaatatttca ttagcactta
atacttttct gttttattcc tatcctataa gtagtcccga 7260ttctcccaac attgcttatt
cacacaacta actaagaaag tcttccatag ccccccaagc 7320ggcccatggc ctcctccgag
gacgtcatca aggagttcat gcgcttcaag gtgcgcatgg 7380agggctccgt gaacggccac
gagttcgaga tcgagggcga gggcgagggc cgcccctacg 7440agggcaccca gaccgccaag
ctgaaggtga ccaagggcgg ccccctgccc ttcgcctggg 7500acatcctgtc cccccagttc
cagtacggct ccaaggtgta cgtgaagcac cccgccgaca 7560tccccgacta caagaagctg
tccttccccg agggcttcaa gtgggagcgc gtgatgaact 7620tcgaggacgg cggcgtggtg
accgtgaccc aggactcctc cctgcaggac ggctccttca 7680tctacaaggt gaagttcatc
ggcgtgaact tcccctccga cggccccgta atgcagaaga 7740agactatggg ctgggaggcc
tccaccgagc gcctgtaccc ccgcgacggc gtgctgaagg 7800gcgagatcca caaggccctg
aagctgaagg acggcggcca ctacctggtg gagttcaagt 7860ccatctacat ggccaagaag
cccgtgcagc tgcccggcta ctactacgtg gactccaagc 7920tggacatcac ctcccacaac
gaggactaca ccatcgtgga gcagtacgag cgcgccgagg 7980gccgccacca cctgttcctg
tagcggccgg ccgcgacaca agtgtgagag tactaaataa 8040atgctttggt tgtacgaaat
cattacacta aataaaataa tcaaagctta tatatgcctt 8100ccgctaaggc cgaatgcaaa
gaaattggtt ctttctcgtt atcttttgcc acttttacta 8160gtacgtatta attactactt
aatcatcttt gtttacggct cattatatcc gtcgacggcg 8220cgccgct
8227595704DNAArtificial
sequenceVector KS130 59ggccgcaagt atgaactaaa atgcatgtag gtgtaagagc
tcatggagag catggaatat 60tgtatccgac catgtaacag tataataact gagctccatc
tcacttcttc tatgaataaa 120caaaggatgt tatgatatat taacactcta tctatgcacc
ttattgttct atgataaatt 180tcctcttatt attataaatc atctgaatcg tgacggctta
tggaatgctt caaatagtac 240aaaaacaaat gtgtactata agactttcta aacaattcta
accttagcat tgtgaacgag 300acataagtgt taagaagaca taacaattat aatggaagaa
gtttgtctcc atttatatat 360tatatattac ccacttatgt attatattag gatgttaagg
agacataaca attataaaga 420gagaagtttg tatccattta tatattatat actacccatt
tatatattat acttatccac 480ttatttaatg tctttataag gtttgatcca tgatatttct
aatattttag ttgatatgta 540tatgaaaggg tactatttga actctcttac tctgtataaa
ggttggatca tccttaaagt 600gggtctattt aattttattg cttcttacag ataaaaaaaa
aattatgagt tggtttgata 660aaatattgaa ggatttaaaa taataataaa taacatataa
tatatgtata taaatttatt 720ataatataac atttatctat aaaaaagtaa atattgtcat
aaatctatac aatcgtttag 780ccttgctgga cgaatctcaa ttatttaaac gagagtaaac
atatttgact ttttggttat 840ttaacaaatt attatttaac actatatgaa attttttttt
ttatcagcaa agaataaaat 900taaattaaga aggacaatgg tgtcccaatc cttatacaac
caacttccac aagaaagtca 960agtcagagac aacaaaaaaa caagcaaagg aaatttttta
atttgagttg tcttgtttgc 1020tgcataattt atgcagtaaa acactacaca taaccctttt
agcagtagag caatggttga 1080ccgtgtgctt agcttctttt attttatttt tttatcagca
aagaataaat aaaataaaat 1140gagacacttc agggatgttt caacaagctt ggatccgtcg
acggcgcgcc cgatcatccg 1200gatatagttc ctcctttcag caaaaaaccc ctcaagaccc
gtttagaggc cccaaggggt 1260tatgctagtt attgctcagc ggtggcagca gccaactcag
cttcctttcg ggctttgtta 1320gcagccggat cgatccaagc tgtacctcac tattcctttg
ccctcggacg agtgctgggg 1380cgtcggtttc cactatcggc gagtacttct acacagccat
cggtccagac ggccgcgctt 1440ctgcgggcga tttgtgtacg cccgacagtc ccggctccgg
atcggacgat tgcgtcgcat 1500cgaccctgcg cccaagctgc atcatcgaaa ttgccgtcaa
ccaagctctg atagagttgg 1560tcaagaccaa tgcggagcat atacgcccgg agccgcggcg
atcctgcaag ctccggatgc 1620ctccgctcga agtagcgcgt ctgctgctcc atacaagcca
accacggcct ccagaagaag 1680atgttggcga cctcgtattg ggaatccccg aacatcgcct
cgctccagtc aatgaccgct 1740gttatgcggc cattgtccgt caggacattg ttggagccga
aatccgcgtg cacgaggtgc 1800cggacttcgg ggcagtcctc ggcccaaagc atcagctcat
cgagagcctg cgcgacggac 1860gcactgacgg tgtcgtccat cacagtttgc cagtgataca
catggggatc agcaatcgcg 1920catatgaaat cacgccatgt agtgtattga ccgattcctt
gcggtccgaa tgggccgaac 1980ccgctcgtct ggctaagatc ggccgcagcg atcgcatcca
tagcctccgc gaccggctgc 2040agaacagcgg gcagttcggt ttcaggcagg tcttgcaacg
tgacaccctg tgcacggcgg 2100gagatgcaat aggtcaggct ctcgctgaat tccccaatgt
caagcacttc cggaatcggg 2160agcgcggccg atgcaaagtg ccgataaaca taacgatctt
tgtagaaacc atcggcgcag 2220ctatttaccc gcaggacata tccacgccct cctacatcga
agctgaaagc acgagattct 2280tcgccctccg agagctgcat caggtcggag acgctgtcga
acttttcgat cagaaacttc 2340tcgacagacg tcgcggtgag ttcaggcttt tccatgggta
tatctccttc ttaaagttaa 2400acaaaattat ttctagaggg aaaccgttgt ggtctcccta
tagtgagtcg tattaatttc 2460gcgggatcga gatctgatca acctgcatta atgaatcggc
caacgcgcgg ggagaggcgg 2520tttgcgtatt gggcgctctt ccgcttcctc gctcactgac
tcgctgcgct cggtcgttcg 2580gctgcggcga gcggtatcag ctcactcaaa ggcggtaata
cggttatcca cagaatcagg 2640ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa
aaggccagga accgtaaaaa 2700ggccgcgttg ctggcgtttt tccataggct ccgcccccct
gacgagcatc acaaaaatcg 2760acgctcaagt cagaggtggc gaaacccgac aggactataa
agataccagg cgtttccccc 2820tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg
cttaccggat acctgtccgc 2880ctttctccct tcgggaagcg tggcgctttc tcaatgctca
cgctgtaggt atctcagttc 2940ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa
ccccccgttc agcccgaccg 3000ctgcgcctta tccggtaact atcgtcttga gtccaacccg
gtaagacacg acttatcgcc 3060actggcagca gccactggta acaggattag cagagcgagg
tatgtaggcg gtgctacaga 3120gttcttgaag tggtggccta actacggcta cactagaagg
acagtatttg gtatctgcgc 3180tctgctgaag ccagttacct tcggaaaaag agttggtagc
tcttgatccg gcaaacaaac 3240caccgctggt agcggtggtt tttttgtttg caagcagcag
attacgcgca gaaaaaaagg 3300atctcaagaa gatcctttga tcttttctac ggggtctgac
gctcagtgga acgaaaactc 3360acgttaaggg attttggtca tgacattaac ctataaaaat
aggcgtatca cgaggccctt 3420tcgtctcgcg cgtttcggtg atgacggtga aaacctctga
cacatgcagc tcccggagac 3480ggtcacagct tgtctgtaag cggatgccgg gagcagacaa
gcccgtcagg gcgcgtcagc 3540gggtgttggc gggtgtcggg gctggcttaa ctatgcggca
tcagagcaga ttgtactgag 3600agtgcaccat atggacatat tgtcgttaga acgcggctac
aattaataca taaccttatg 3660tatcatacac atacgattta ggtgacacta tagaacggcg
cgccaagctt ttgatccatg 3720cccttcattt gccgcttatt aattaatttg gtaacagtcc
gtactaatca gttacttatc 3780cttcccccat cataattaat cttggtagtc tcgaatgcca
caacactgac tagtctcttg 3840gatcataaga aaaagccaag gaacaaaaga agacaaaaca
caatgagagt atcctttgca 3900tagcaatgtc taagttcata aaattcaaac aaaaacgcaa
tcacacacag tggacatcac 3960ttatccacta gctgatcagg atcgccgcgt caagaaaaaa
aaactggacc ccaaaagcca 4020tgcacaacaa cacgtactca caaaggtgtc aatcgagcag
cccaaaacat tcaccaactc 4080aacccatcat gagccctcac atttgttgtt tctaacccaa
cctcaaactc gtattctctt 4140ccgccacctc atttttgttt atttcaacac ccgtcaaact
gcatgccacc ccgtggccaa 4200atgtccatgc atgttaacaa gacctatgac tataaatagc
tgcaatctcg gcccaggttt 4260tcatcatcaa gaaccagttc aatatcctag tacaccgtat
taaagaattt aagatatact 4320gcggccgcat ggctgctgct cccagtgtga ggacgtttac
tcgggccgag gttttgaatg 4380ccgaggctct gaatgagggc aagaaggatg ccgaggcacc
cttcttgatg atcatcgaca 4440acaaggtgta cgatgtccgc gagttcgtcc ctgatcatcc
cggtggaagt gtgattctca 4500cgcacgttgg caaggacggc actgacgtct ttgacacttt
tcaccccgag gctgcttggg 4560agactcttgc caacttttac gttggtgata ttgacgagag
cgaccgcgat atcaagaatg 4620atgactttgc ggccgaggtc cgcaagctgc gtaccttgtt
ccagtctctt ggttactacg 4680attcttccaa ggcatactac gccttcaagg tctcgttcaa
cctctgcatc tggggtttgt 4740cgacggtcat tgtggccaag tggggccaga cctcgaccct
cgccaacgtg ctctcggctg 4800cgcttttggg tctgttctgg cagcagtgcg gatggttggc
tcacgacttt ttgcatcacc 4860aggtcttcca ggaccgtttc tggggtgatc ttttcggcgc
cttcttggga ggtgtctgcc 4920agggcttctc gtcctcgtgg tggaaggaca agcacaacac
tcaccacgcc gcccccaacg 4980tccacggcga ggatcccgac attgacaccc accctctgtt
gacctggagt gagcatgcgt 5040tggagatgtt ctcggatgtc ccagatgagg agctgacccg
catgtggtcg cgtttcatgg 5100tcctgaacca gacctggttt tacttcccca ttctctcgtt
tgcccgtctc tcctggtgcc 5160tccagtccat tctctttgtg ctgcctaacg gtcaggccca
caagccctcg ggcgcgcgtg 5220tgcccatctc gttggtcgag cagctgtcgc ttgcgatgca
ctggacctgg tacctcgcca 5280ccatgttcct gttcatcaag gatcccgtca acatgctggt
gtactttttg gtgtcgcagg 5340cggtgtgcgg aaacttgttg gcgatcgtgt tctcgctcaa
ccacaacggt atgcctgtga 5400tctcgaagga ggaggcggtc gatatggatt tcttcacgaa
gcagatcatc acgggtcgtg 5460atgtccaccc gggtctattt gccaactggt tcacgggtgg
attgaactat cagatcgagc 5520accacttgtt cccttcgatg cctcgccaca acttttcaaa
gatccagcct gctgtcgaga 5580ccctgtgcaa aaagtacaat gtccgatacc acaccaccgg
tatgatcgag ggaactgcag 5640aggtctttag ccgtctgaac gaggtctcca aggctgcctc
caagatgggt aaggcgcagt 5700aagc
5704609609DNAArtificial sequenceVector KS432
60ggccgcaagt atgaactaaa atgcatgtag gtgtaagagc tcatggagag catggaatat
60tgtatccgac catgtaacag tataataact gagctccatc tcacttcttc tatgaataaa
120caaaggatgt tatgatatat taacactcta tctatgcacc ttattgttct atgataaatt
180tcctcttatt attataaatc atctgaatcg tgacggctta tggaatgctt caaatagtac
240aaaaacaaat gtgtactata agactttcta aacaattcta accttagcat tgtgaacgag
300acataagtgt taagaagaca taacaattat aatggaagaa gtttgtctcc atttatatat
360tatatattac ccacttatgt attatattag gatgttaagg agacataaca attataaaga
420gagaagtttg tatccattta tatattatat actacccatt tatatattat acttatccac
480ttatttaatg tctttataag gtttgatcca tgatatttct aatattttag ttgatatgta
540tatgaaaggg tactatttga actctcttac tctgtataaa ggttggatca tccttaaagt
600gggtctattt aattttattg cttcttacag ataaaaaaaa aattatgagt tggtttgata
660aaatattgaa ggatttaaaa taataataaa taacatataa tatatgtata taaatttatt
720ataatataac atttatctat aaaaaagtaa atattgtcat aaatctatac aatcgtttag
780ccttgctgga cgaatctcaa ttatttaaac gagagtaaac atatttgact ttttggttat
840ttaacaaatt attatttaac actatatgaa attttttttt ttatcagcaa agaataaaat
900taaattaaga aggacaatgg tgtcccaatc cttatacaac caacttccac aagaaagtca
960agtcagagac aacaaaaaaa caagcaaagg aaatttttta atttgagttg tcttgtttgc
1020tgcataattt atgcagtaaa acactacaca taaccctttt agcagtagag caatggttga
1080ccgtgtgctt agcttctttt attttatttt tttatcagca aagaataaat aaaataaaat
1140gagacacttc agggatgttt caacaagctt ggatcctcga agagaagggt taataacaca
1200cttttttaac atttttaaca caaattttag ttatttaaaa atttattaaa aaatttaaaa
1260taagaagagg aactctttaa ataaatctaa cttacaaaat ttatgatttt taataagttt
1320tcaccaataa aaaatgtcat aaaaatatgt taaaaagtat attatcaata ttctctttat
1380gataaataaa aagaaaaaaa aaataaaagt taagtgaaaa tgagattgaa gtgactttag
1440gtgtgtataa atatatcaac cccgccaaca atttatttaa tccaaatata ttgaagtata
1500ttattccata gcctttattt atttatatat ttattatata aaagctttat ttgttctagg
1560ttgttcatga aatatttttt tggttttatc tccgttgtaa gaaaatcatg tgctttgtgt
1620cgccactcac tattgcagct ttttcatgca ttggtcagat tgacggttga ttgtattttt
1680gttttttatg gttttgtgtt atgacttaag tcttcatctc tttatctctt catcaggttt
1740gatggttacc taatatggtc catgggtaca tgcatggtta aattaggtgg ccaactttgt
1800tgtgaacgat agaatttttt ttatattaag taaactattt ttatattatg aaataataat
1860aaaaaaaata ttttatcatt attaacaaaa tcatattagt taatttgtta actctataat
1920aaaagaaata ctgtaacatt cacattacat ggtaacatct ttccaccctt tcatttgttt
1980tttgtttgat gacttttttt cttgtttaaa tttatttccc ttcttttaaa tttggaatac
2040attatcatca tatataaact aaaatactaa aaacaggatt acacaaatga taaataataa
2100cacaaatatt tataaatcta gctgcaatat atttaaacta gctatatcga tattgtaaaa
2160taaaactagc tgcattgata ctgataaaaa aatatcatgt gctttctgga ctgatgatgc
2220agtatacttt tgacattgcc tttattttat ttttcagaaa agctttctta gttctgggtt
2280cttcattatt tgtttcccat ctccattgtg aattgaatca tttgcttcgt gtcacaaata
2340caatttagnt aggtacatgc attggtcaga ttcacggttt attatgtcat gacttaagtt
2400catggtagta cattacctgc cacgcatgca ttatattggt tagatttgat aggcaaattt
2460ggttgtcaac aatataaata taaataatgt ttttatatta cgaaataaca gtgatcaaaa
2520caaacagttt tatctttatt aacaagattt tgtttttgtt tgatgacgtt ttttaatgtt
2580tacgctttcc cccttctttt gaatttagaa cactttatca tcataaaatc aaatactaaa
2640aaaattacat atttcataaa taataacaca aatattttta aaaaatctga aataataatg
2700aacaatatta catattatca cgaaaattca ttaataaaaa tattatataa ataaaatgta
2760atagtagtta tatgtaggaa aaaagtactg cacgcataat atatacaaaa agattaaaat
2820gaactattat aaataataac actaaattaa tggtgaatca tatcaaaata atgaaaaagt
2880aaataaaatt tgtaattaac ttctatatgt attacacaca caaataataa ataatagtaa
2940aaaaaattat gataaatatt taccatctca taagatattt aaaataatga taaaaatata
3000gattattttt tatgcaacta gctagccaaa aagagaacac gggtatatat aaaaagagta
3060cctttaaatt ctactgtact tcctttattc ctgacgtttt tatatcaagt ggacatacgt
3120gaagatttta attatcagtc taaatatttc attagcactt aatacttttc tgttttattc
3180ctatcctata agtagtcccg attctcccaa cattgcttat tcacacaact aactaagaaa
3240gtcttccata gccccccaag cggcccatgg cctcctccga ggacgtcatc aaggagttca
3300tgcgcttcaa ggtgcgcatg gagggctccg tgaacggcca cgagttcgag atcgagggcg
3360agggcgaggg ccgcccctac gagggcaccc agaccgccaa gctgaaggtg accaagggcg
3420gccccctgcc cttcgcctgg gacatcctgt ccccccagtt ccagtacggc tccaaggtgt
3480acgtgaagca ccccgccgac atccccgact acaagaagct gtccttcccc gagggcttca
3540agtgggagcg cgtgatgaac ttcgaggacg gcggcgtggt gaccgtgacc caggactcct
3600ccctgcagga cggctccttc atctacaagg tgaagttcat cggcgtgaac ttcccctccg
3660acggccccgt aatgcagaag aagactatgg gctgggaggc ctccaccgag cgcctgtacc
3720cccgcgacgg cgtgctgaag ggcgagatcc acaaggccct gaagctgaag gacggcggcc
3780actacctggt ggagttcaag tccatctaca tggccaagaa gcccgtgcag ctgcccggct
3840actactacgt ggactccaag ctggacatca cctcccacaa cgaggactac accatcgtgg
3900agcagtacga gcgcgccgag ggccgccacc acctgttcct gtagcggccg gccgcgacac
3960aagtgtgaga gtactaaata aatgctttgg ttgtacgaaa tcattacact aaataaaata
4020atcaaagctt atatatgcct tccgctaagg ccgaatgcaa agaaattggt tctttctcgt
4080tatcttttgc cacttttact agtacgtatt aattactact taatcatctt tgtttacggc
4140tcattatatc cgtcgacggc gcgccgctct agagggccca attcgcccta tagtgagtcg
4200tattacaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc
4260caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc
4320cgcaccgatc gcccttccca acagttgcgc agcctatacg tacggcagtt taaggtttac
4380acctataaaa gagagagccg ttatcgtctg tttgtggatg tacagagtga tattattgac
4440acgccggggc gacggatggt gatccccctg gccagtgcac gtctgctgtc agataaagtc
4500tcccgtgaac tttacccggt ggtgcatatc ggggatgaaa gctggcgcat gatgaccacc
4560gatatggcca gtgtgccggt ctccgttatc ggggaagaag tggctgatct cagccaccgc
4620gaaaatgaca tcaaaaacgc cattaacctg atgttctggg gaatataaat gtcaggcatg
4680agattatcaa aaaggatctt cacctagatc cttttcacgt agaaagccag tccgcagaaa
4740cggtgctgac cccggatgaa tgtcagctac tgggctatct ggacaaggga aaacgcaagc
4800gcaaagagaa agcaggtagc ttgcagtggg cttacatggc gatagctaga ctgggcggtt
4860ttatggacag caagcgaacc ggaattgcca gctggggcgc cctctggtaa ggttgggaag
4920ccctgcaaag taaactggat ggctttcttg ccgccaagga tctgatggcg caggggatca
4980agctctgatc aagagacagg atgaggatcg tttcgcatga ttgaacaaga tggattgcac
5040gcaggttctc cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca
5100atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt
5160gtcaagaccg acctgtccgg tgccctgaat gaactgcaag acgaggcagc gcggctatcg
5220tggctggcca cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga
5280agggactggc tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct
5340cctgccgaga aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg
5400gctacctgcc cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg
5460gaagccggtc ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc
5520gaactgttcg ccaggctcaa ggcgagcatg cccgacggcg aggatctcgt cgtgacccat
5580ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac
5640tgtggccggc tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt
5700gctgaagagc ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct
5760cccgattcgc agcgcatcgc cttctatcgc cttcttgacg agttcttctg aattattaac
5820gcttacaatt tcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc
5880atcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat
5940acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatagca
6000cgtgaggagg gccaccatgg ccaagttgac cagtgccgtt ccggtgctca ccgcgcgcga
6060cgtcgccgga gcggtcgagt tctggaccga ccggctcggg ttctcccggg acttcgtgga
6120ggacgacttc gccggtgtgg tccgggacga cgtgaccctg ttcatcagcg cggtccagga
6180ccaggtggtg ccggacaaca ccctggcctg ggtgtgggtg cgcggcctgg acgagctgta
6240cgccgagtgg tcggaggtcg tgtccacgaa cttccgggac gcctccgggc cggccatgac
6300cgagatcggc gagcagccgt gggggcggga gttcgccctg cgcgacccgg ccggcaactg
6360cgtgcacttc gtggccgagg agcaggactg acacgtgcta aaacttcatt tttaatttaa
6420aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt
6480ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt
6540ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg
6600tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca
6660gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt
6720agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga
6780taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc
6840gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact
6900gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga
6960caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg
7020aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt
7080tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt
7140acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga
7200ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac
7260gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc
7320tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa
7380agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc
7440tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca
7500cacaggaaac agctatgacc atgattacgc caagctattt aggtgacgcg ttagaatact
7560caagctatgc atcaagcttg gtaccgagct cggatccact agtaacggcc gccagtgtgc
7620tggaattcag gggcgcgccc caatttatta tcatctattt cctgacattt taatccatcc
7680acctatgtca aaaacttata gaaaatgtca acttccaaac aaaacataat tgaacttcgc
7740aaataaattc ttaataatat taaaaaatgt tacttaatta tttcttcaac cccattttcc
7800gcgcgtagcg cggacaaaga ctctagttaa atatagaagt ttccgattct catcgtataa
7860aacggtgact ttggcgggct ttcatgtgta acaaattggt ttaacaaacc actgcctagt
7920cgtttagtgt agaatcagcg catggaactc cgattggagc gtgactttca cgtgccggag
7980gcccaccacc acagcgggcg ttacgctcta agaatctcgc ccacggtttt cttcatctcc
8040cccccgccaa gtgtctccct cgttcgccac ttctcatcat gttacaggga ccataaaaat
8100ggcgtatttc ttcagccccg ggtataaata cacacatgat cctgtggtgg gttcttccac
8160aagttacatc tccttctggt ttttgtattg caagtgtttg tattttttgc ctccgagaga
8220aaatcgcggc cgcatggctg ctgctcccag tgtgaggacg tttactcggg ccgaggtttt
8280gaatgccgag gctctgaatg agggcaagaa ggatgccgag gcacccttct tgatgatcat
8340cgacaacaag gtgtacgatg tccgcgagtt cgtccctgat catcccggtg gaagtgtgat
8400tctcacgcac gttggcaagg acggcactga cgtctttgac acttttcacc ccgaggctgc
8460ttgggagact cttgccaact tttacgttgg tgatattgac gagagcgacc gcgatatcaa
8520gaatgatgac tttgcggccg aggtccgcaa gctgcgtacc ttgttccagt ctcttggtta
8580ctacgattct tccaaggcat actacgcctt caaggtctcg ttcaacctct gcatctgggg
8640tttgtcgacg gtcattgtgg ccaagtgggg ccagacctcg accctcgcca acgtgctctc
8700ggctgcgctt ttgggtctgt tctggcagca gtgcggatgg ttggctcacg actttttgca
8760tcaccaggtc ttccaggacc gtttctgggg tgatcttttc ggcgccttct tgggaggtgt
8820ctgccagggc ttctcgtcct cgtggtggaa ggacaagcac aacactcacc acgccgcccc
8880caacgtccac ggcgaggatc ccgacattga cacccaccct ctgttgacct ggagtgagca
8940tgcgttggag atgttctcgg atgtcccaga tgaggagctg acccgcatgt ggtcgcgttt
9000catggtcctg aaccagacct ggttttactt ccccattctc tcgtttgccc gtctctcctg
9060gtgcctccag tccattctct ttgtgctgcc taacggtcag gcccacaagc cctcgggcgc
9120gcgtgtgccc atctcgttgg tcgagcagct gtcgcttgcg atgcactgga cctggtacct
9180cgccaccatg ttcctgttca tcaaggatcc cgtcaacatg ctggtgtact ttttggtgtc
9240gcaggcggtg tgcggaaact tgttggcgat cgtgttctcg ctcaaccaca acggtatgcc
9300tgtgatctcg aaggaggagg cggtcgatat ggatttcttc acgaagcaga tcatcacggg
9360tcgtgatgtc cacccgggtc tatttgccaa ctggttcacg ggtggattga actatcagat
9420cgagcaccac ttgttccctt cgatgcctcg ccacaacttt tcaaagatcc agcctgctgt
9480cgagaccctg tgcaaaaagt acaatgtccg ataccacacc accggtatga tcgagggaac
9540tgcagaggtc tttagccgtc tgaacgaggt ctccaaggct gcctccaaga tgggtaaggc
9600gcagtaagc
96096119404DNAArtificial sequenceVector ARALO80 61cgcgcctcga gtgggcggat
cccccgggct gcaggaattc actggccgtc gttttacaac 60gtcgtgactg ggaaaaccct
ggcgttaccc aacttaatcg ccttgcagca catccccctt 120tcgccagctg gcgtaatagc
gaagaggccc gcaccgatcg cccttcccaa cagttgcgca 180gcctgaatgg cgaatggatc
gatccatcgc gatgtacctt ttgttagtca gcctctcgat 240tgctcatcgt cattacacag
taccgaagtt tgatcgatct agtaacatag atgacaccgc 300gcgcgataat ttatcctagt
ttgcgcgcta tattttgttt tctatcgcgt attaaatgta 360taattgcggg actctaatca
taaaaaccca tctcataaat aacgtcatgc attacatgtt 420aattattaca tgcttaacgt
aattcaacag aaattatatg ataatcatcg caagaccggc 480aacaggattc aatcttaaga
aactttattg ccaaatgttt gaacgatctg cttcgacgca 540ctccttcttt actccaccat
ctcgtcctta ttgaaaacgt gggtagcacc aaaacgaatc 600aagtcgctgg aactgaagtt
accaatcacg ctggatgatt tgccagttgg attaatcttg 660cctttccccg catgaataat
attgatgaat gcatgcgtga ggggtagttc gatgttggca 720atagctgcaa ttgccgcgac
atcctccaac gagcataatt cttcagaaaa atagcgatgt 780tccatgttgt cagggcatgc
atgatgcacg ttatgaggtg acggtgctag gcagtattcc 840ctcaaagttt catagtcagt
atcatattca tcattgcatt cctgcaagag agaattgaga 900cgcaatccac acgctgcggc
aaccttccgg cgttcgtggt ctatttgctc ttggacgttg 960caaacgtaag tgttggatcg
atccggggtg ggcgaagaac tccagcatga gatccccgcg 1020ctggaggatc atccagccgg
cgtcccggaa aacgattccg aagcccaacc tttcatagaa 1080ggcggcggtg gaatcgaaat
ctcgtgatgg caggttgggc gtcgcttggt cggtcatttc 1140gaaccccaga gtcccgctca
gaagaactcg tcaagaaggc gatagaaggc gatgcgctgc 1200gaatcgggag cggcgatacc
gtaaagcacg aggaagcggt cagcccattc gccgccaagc 1260tcttcagcaa tatcacgggt
agccaacgct atgtcctgat agcggtccgc cacacccagc 1320cggccacagt cgatgaatcc
agaaaagcgg ccattttcca ccatgatatt cggcaagcag 1380gcatcgccat gggtcacgac
gagatcctcg ccgtcgggca tgcgcgcctt gagcctggcg 1440aacagttcgg ctggcgcgag
cccctgatgc tcttcgtcca gatcatcctg atcgacaaga 1500ccggcttcca tccgagtacg
tgctcgctcg atgcgatgtt tcgcttggtg gtcgaatggg 1560caggtagccg gatcaagcgt
atgcagccgc cgcattgcat cagccatgat ggatactttc 1620tcggcaggag caaggtgaga
tgacaggaga tcctgccccg gcacttcgcc caatagcagc 1680cagtcccttc ccgcttcagt
gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg 1740gccagccacg atagccgcgc
tgcctcgtcc tgcagttcat tcagggcacc ggacaggtcg 1800gtcttgacaa aaagaaccgg
gcgcccctgc gctgacagcc ggaacacggc ggcatcagag 1860cagccgattg tctgttgtgc
ccagtcatag ccgaatagcc tctccaccca agcggccgga 1920gaacctgcgt gcaatccatc
ttgttcaatc atgcgaaacg atccccgcaa gcttggagac 1980tggtgatttc agcgtgtcct
ctccaaatga aatgaacttc cttatataga ggaagggtct 2040tgcgaaggat agtgggattg
tgcgtcatcc cttacgtcag tggagatatc acatcaatcc 2100acttgctttg aagacgtggt
tggaacgtct tctttttcca cgatgctcct cgtgggtggg 2160ggtccatctt tgggaccact
gtcggcagag gcatcttcaa cgatggcctt tcctttatcg 2220caatgatggc atttgtagga
gccaccttcc ttttccacta tcttcacaat aaagtgacag 2280atagctgggc aatggaatcc
gaggaggttt ccggatatta ccctttgttg aaaagtctca 2340attgcccttt ggtcttctga
gactgtatct ttgatatttt tggagtagac aagcgtgtcg 2400tgctccacca tgttgacgaa
gattttcttc ttgtcattga gtcgtaagag actctgtatg 2460aactgttcgc cagtctttac
ggcgagttct gttaggtcct ctatttgaat ctttgactcc 2520atggcctttg attcagtggg
aactaccttt ttagagactc caatctctat tacttgcctt 2580ggtttgtgaa gcaagccttg
aatcgtccat actggaatag tacttctgat cttgagaaat 2640atatctttct ctgtgttctt
gatgcagtta gtcctgaatc ttttgactgc atctttaacc 2700ttcttgggaa ggtatttgat
ctcctggaga ttattgctcg ggtagatcgt cttgatgaga 2760cctgctgcgt aagcctctct
aaccatctgt gggttagcat tctttctgaa attgaaaagg 2820ctaatcttct cattatcagt
ggtgaacatg gtatcgtcac cttctccgtc gaacttcctg 2880actagatcgt agagatagag
gaagtcgtcc attgtgatct ctggggcaaa ggagatctga 2940attaattcga tatggtggat
ttatcacaaa tgggacccgc cgccgacaga ggtgtgatgt 3000taggccagga ctttgaaaat
ttgcgcaact atcgtatagt ggccgacaaa ttgacgccga 3060gttgacagac tgcctagcat
ttgagtgaat tatgtgaggt aatgggctac actgaattgg 3120tagctcaaac tgtcagtatt
tatgtatatg agtgtatatt ttcgcataat ctcagaccaa 3180tctgaagatg aaatgggtat
ctgggaatgg cgaaatcaag gcatcgatcg tgaagtttct 3240catctaagcc cccatttgga
cgtgaatgta gacacgtcga aataaagatt tccgaattag 3300aataatttgt ttattgcttt
cgcctataaa tacgacggat cgtaatttgt cgttttatca 3360aaatgtactt tcattttata
ataacgctgc ggacatctac atttttgaat tgaaaaaaaa 3420ttggtaatta ctctttcttt
ttctccatat tgaccatcat actcattgct gatccatgta 3480gatttcccgg acatgaagcc
atttacaatt gaatatatcc tgccgccgct gccgctttgc 3540acccggtgga gcttgcatgt
tggtttctac gcagaactga gccggttagg cagataattt 3600ccattgagaa ctgagccatg
tgcaccttcc ccccaacacg gtgagcgacg gggcaacgga 3660gtgatccaca tgggactttt
aaacatcatc cgtcggatgg cgttgcgaga gaagcagtcg 3720atccgtgaga tcagccgacg
caccgggcag gcgcgcaaca cgatcgcaaa gtatttgaac 3780gcaggtacaa tcgagccgac
gttcacgcgg aacgaccaag caagctagct ttaatgcggt 3840agtttatcac agttaaattg
ctaacgcagt caggcaccgt gtatgaaatc taacaatgcg 3900ctcatcgtca tcctcggcac
cgtcaccctg gatgctgtag gcataggctt ggttatgccg 3960gtactgccgg gcctcttgcg
ggatatcgtc cattccgaca gcatcgccag tcactatggc 4020gtgctgctag cgctatatgc
gttgatgcaa tttctatgcg cacccgttct cggagcactg 4080tccgaccgct ttggccgccg
cccagtcctg ctcgcttcgc tacttggagc cactatcgac 4140tacgcgatca tggcgaccac
acccgtcctg tggtccaacc cctccgctgc tatagtgcag 4200tcggcttctg acgttcagtg
cagccgtctt ctgaaaacga catgtcgcac aagtcctaag 4260ttacgcgaca ggctgccgcc
ctgccctttt cctggcgttt tcttgtcgcg tgttttagtc 4320gcataaagta gaatacttgc
gactagaacc ggagacatta cgccatgaac aagagcgccg 4380ccgctggcct gctgggctat
gcccgcgtca gcaccgacga ccaggacttg accaaccaac 4440gggccgaact gcacgcggcc
ggctgcacca agctgttttc cgagaagatc accggcacca 4500ggcgcgaccg cccggagctg
gccaggatgc ttgaccacct acgccctggc gacgttgtga 4560cagtgaccag gctagaccgc
ctggcccgca gcacccgcga cctactggac attgccgagc 4620gcatccagga ggccggcgcg
ggcctgcgta gcctggcaga gccgtgggcc gacaccacca 4680cgccggccgg ccgcatggtg
ttgaccgtgt tcgccggcat tgccgagttc gagcgttccc 4740taatcatcga ccgcacccgg
agcgggcgcg aggccgccaa ggcccgaggc gtgaagtttg 4800gcccccgccc taccctcacc
ccggcacaga tcgcgcacgc ccgcgagctg atcgaccagg 4860aaggccgcac cgtgaaagag
gcggctgcac tgcttggcgt gcatcgctcg accctgtacc 4920gcgcacttga gcgcagcgag
gaagtgacgc ccaccgaggc caggcggcgc ggtgccttcc 4980gtgaggacgc attgaccgag
gccgacgccc tggcggccgc cgagaatgaa cgccaagagg 5040aacaagcatg aaaccgcacc
aggacggcca ggacgaaccg tttttcatta ccgaagagat 5100cgaggcggag atgatcgcgg
ccgggtacgt gttcgagccg cccgcgcacg tctcaaccgt 5160gcggctgcat gaaatcctgg
ccggtttgtc tgatgccaag ctggcggcct ggccggccag 5220cttggccgct gaagaaaccg
agcgccgccg tctaaaaagg tgatgtgtat ttgagtaaaa 5280cagcttgcgt catgcggtcg
ctgcgtatat gatgcgatga gtaaataaac aaatacgcaa 5340gggaacgcat gaagttatcg
ctgtacttaa ccagaaaggc gggtcaggca agacgaccat 5400cgcaacccat ctagcccgcg
ccctgcaact cgccggggcc gatgttctgt tagtcgattc 5460cgatccccag ggcagtgccc
gcgattgggc ggccgtgcgg gaagatcaac cgctaaccgt 5520tgtcggcatc gaccgcccga
cgattgaccg cgacgtgaag gccatcggcc ggcgcgactt 5580cgtagtgatc gacggagcgc
cccaggcggc ggacttggct gtgtccgcga tcaaggcagc 5640cgacttcgtg ctgattccgg
tgcagccaag cccttacgac atatgggcca ccgccgacct 5700ggtggagctg gttaagcagc
gcattgaggt cacggatgga aggctacaag cggcctttgt 5760cgtgtcgcgg gcgatcaaag
gcacgcgcat cggcggtgag gttgccgagg cgctggccgg 5820gtacgagctg cccattcttg
agtcccgtat cacgcagcgc gtgagctacc caggcactgc 5880cgccgccggc acaaccgttc
ttgaatcaga acccgagggc gacgctgccc gcgaggtcca 5940ggcgctggcc gctgaaatta
aatcaaaact catttgagtt aatgaggtaa agagaaaatg 6000agcaaaagca caaacacgct
aagtgccggc cgtccgagcg cacgcagcag caaggctgca 6060acgttggcca gcctggcaga
cacgccagcc atgaagcggg tcaactttca gttgccggcg 6120gaggatcaca ccaagctgaa
gatgtacgcg gtacgccaag gcaagaccat taccgagctg 6180ctatctgaat acatcgcgca
gctaccagag taaatgagca aatgaataaa tgagtagatg 6240aattttagcg gctaaaggag
gcggcatgga aaatcaagaa caaccaggca ccgacgccgt 6300ggaatgcccc atgtgtggag
gaacgggcgg ttggccaggc gtaagcggct gggttgtctg 6360ccggccctgc aatggcactg
gaacccccaa gcccgaggaa tcggcgtgag cggtcgcaaa 6420ccatccggcc cggtacaaat
cggcgcggcg ctgggtgatg acctggtgga gaagttgaag 6480gccgcgcagg ccgcccagcg
gcaacgcatc gaggcagaag cacgccccgg tgaatcgtgg 6540caagcggccg ctgatcgaat
ccgcaaagaa tcccggcaac cgccggcagc cggtgcgccg 6600tcgattagga agccgcccaa
gggcgacgag caaccagatt ttttcgttcc gatgctctat 6660gacgtgggca cccgcgatag
tcgcagcatc atggacgtgg ccgttttccg tctgtcgaag 6720cgtgaccgac gagctggcga
ggtgatccgc tacgagcttc cagacgggca cgtagaggtt 6780tccgcagggc cggccggcat
ggccagtgtg tgggattacg acctggtact gatggcggtt 6840tcccatctaa ccgaatccat
gaaccgatac cgggaaggga agggagacaa gcccggccgc 6900gtgttccgtc cacacgttgc
ggacgtactc aagttctgcc ggcgagccga tggcggaaag 6960cagaaagacg acctggtaga
aacctgcatt cggttaaaca ccacgcacgt tgccatgcag 7020cgtacgaaga aggccaagaa
cggccgcctg gtgacggtat ccgagggtga agccttgatt 7080agccgctaca agatcgtaaa
gagcgaaacc gggcggccgg agtacatcga gatcgagcta 7140gctgattgga tgtaccgcga
gatcacagaa ggcaagaacc cggacgtgct gacggttcac 7200cccgattact ttttgatcga
tcccggcatc ggccgttttc tctaccgcct ggcacgccgc 7260gccgcaggca aggcagaagc
cagatggttg ttcaagacga tctacgaacg cagtggcagc 7320gccggagagt tcaagaagtt
ctgtttcacc gtgcgcaagc tgatcgggtc aaatgacctg 7380ccggagtacg atttgaagga
ggaggcgggg caggctggcc cgatcctagt catgcgctac 7440cgcaacctga tcgagggcga
agcatccgcc ggttcctaat gtacggagca gatgctaggg 7500caaattgccc tagcagggga
aaaaggtcga aaaggtctct ttcctgtgga tagcacgtac 7560attgggaacc caaagccgta
cattgggaac cggaacccgt acattgggaa cccaaagccg 7620tacattggga accggtcaca
catgtaagtg actgatataa aagagaaaaa aggcgatttt 7680tccgcctaaa actctttaaa
acttattaaa actcttaaaa cccgcctggc ctgtgcataa 7740ctgtctggcc agcgcacagc
cgaagagctg caaaaagcgc ctacccttcg gtcgctgcgc 7800tccctacgcc ccgccgcttc
gcgtcggcct atcgcggccg ctggccgctc aaaaatggct 7860ggcctacggc caggcaatct
accagggcgc ggacaagccg cgccgtcgcc actcgaccgc 7920cggcgcccac atcaaggcac
cctgcctcgc gcgtttcggt gatgacggtg aaaacctctg 7980acacatgcag ctcccggaga
cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca 8040agcccgtcag ggcgcgtcag
cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc 8100acgtagcgat agcggagtgt
atactggctt aactatgcgg catcagagca gattgtactg 8160agagtgcacc atatgcggtg
tgaaataccg cacagatgcg taaggagaaa ataccgcatc 8220aggcgctctt ccgcttcctc
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga 8280gcggtatcag ctcactcaaa
ggcggtaata cggttatcca cagaatcagg ggataacgca 8340ggaaagaaca tgtgagcaaa
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg 8400ctggcgtttt tccataggct
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt 8460cagaggtggc gaaacccgac
aggactataa agataccagg cgtttccccc tggaagctcc 8520ctcgtgcgct ctcctgttcc
gaccctgccg cttaccggat acctgtccgc ctttctccct 8580tcgggaagcg tggcgctttc
tcatagctca cgctgtaggt atctcagttc ggtgtaggtc 8640gttcgctcca agctgggctg
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta 8700tccggtaact atcgtcttga
gtccaacccg gtaagacacg acttatcgcc actggcagca 8760gccactggta acaggattag
cagagcgagg tatgtaggcg gtgctacaga gttcttgaag 8820tggtggccta actacggcta
cactagaagg acagtatttg gtatctgcgc tctgctgaag 8880ccagttacct tcggaaaaag
agttggtagc tcttgatccg gcaaacaaac caccgctggt 8940agcggtggtt tttttgtttg
caagcagcag attacgcgca gaaaaaaagg atctcaagaa 9000gatcctttga tcttttctac
ggggtctgac gctcagtgga acgaaaactc acgttaaggg 9060attttggtca tgagattatc
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga 9120agttttaaat caatctaaag
tatatatgag taaacttggt ctgacagtta ccaatgctta 9180atcagtgagg cacctatctc
agcgatctgt ctatttcgtt catccatagt tgcctgactc 9240cccgtcgtgt agataactac
gatacgggag ggcttaccat ctggccccag tgctgcaatg 9300ataccgcgag acccacgctc
accggctcca gatttatcag caataaacca gccagccgga 9360agggccgagc gcagaagtgg
tcctgcaact ttatccgcct ccatccagtc tattaattgt 9420tgccgggaag ctagagtaag
tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt 9480gctacaggca tcgtggtgtc
acgctcgtcg tttggtatgg cttcattcag ctccggttcc 9540caacgatcaa ggcgagttac
atgatccccc atgttgtgca aaaaagcggt tagctccttc 9600ggtcctccga tcgttgtcag
aagtaagttg gccgcagtgt tatcactcat ggttatggca 9660gcactgcata attctcttac
tgtcatgcca tccgtaagat gcttttctgt gactggtgag 9720tactcaacca agtcattctg
agaatagtgt atgcggcgac cgagttgctc ttgcccggcg 9780tcaacacggg ataataccgc
gccacatagc agaactttaa aagtgctcat cattggaaaa 9840gacctgcagg gggggggggg
cgctgaggtc tgcctcgtga agaaggtgtt gctgactcat 9900accaggcctg aatcgcccca
tcatccagcc agaaagtgag ggagccacgg ttgatgagag 9960ctttgttgta ggtggaccag
ttggtgattt tgaacttttg ctttgccacg gaacggtctg 10020cgttgtcggg aagatgcgtg
atctgatcct tcaactcagc aaaagttcga tttattcaac 10080aaagccgccg tcccgtcaag
tcagcgtaat gctctgccag tgttacaacc aattaaccaa 10140ttctgattag aaaaactcat
cgagcatcaa atgaaactgc aatttattca tatcaggatt 10200atcaatacca tatttttgaa
aaagccgttt ctgtaatgaa ggagaaaact caccgaggca 10260gttccatagg atggcaagat
cctggtatcg gtctgcgatt ccgactcgtc caacatcaat 10320acaacctatt aatttcccct
cgtcaaaaat aaggttatca agtgagaaat caccatgagt 10380gacgactgaa tccggtgaga
atggcaaaag cttatgcatt tctttccaga cttgttcaac 10440aggccagcca ttacgctcgt
catcaaaatc actcgcatca accaaaccgt tattcattcg 10500tgattgcgcc tgagcgagac
gaaatacgcg atcgctgtta aaaggacaat tacaaacagg 10560aatcgaatgc aaccggcgca
ggaacactgc cagcgcatca acaatatttt cacctgaatc 10620aggatattct tctaatacct
ggaatgctgt tttcccgggg atcgcagtgg tgagtaacca 10680tgcatcatca ggagtacgga
taaaatgctt gatggtcgga agaggcataa attccgtcag 10740ccagtttagt ctgaccatct
catctgtaac atcattggca acgctacctt tgccatgttt 10800cagaaacaac tctggcgcat
cgggcttccc atacaatcga tagattgtcg cacctgattg 10860cccgacatta tcgcgagccc
atttataccc atataaatca gcatccatgt tggaatttaa 10920tcgcggcctc gagcaagacg
tttcccgttg aatatggctc ataacacccc ttgtattact 10980gtttatgtaa gcagacagtt
ttattgttca tgatgatata tttttatctt gtgcaatgta 11040acatcagaga ttttgagaca
caacgtggct ttcccccccc cccctgcagg tcaattcggt 11100cgatatggct attacgaaga
aggctcgtgc gcggagtccc gtgaactttc ccacgcaaca 11160agtgaaccgc accgggtttg
ccggaggcca tttcgttaaa atgcgcagcc atggctgctt 11220cgtccagcat ggcgtaatac
tgatcctcgt cttcggctgg cggtatattg ccgatgggct 11280tcaaaagccg ccgtggttga
accagtctat ccattccaag gtagcgaact cgaccgcttc 11340gaagctcctc catggtccac
gccgatgaat gacctcggcc ttgtaaagac cgttgatcgc 11400ttctgcgagg gcgttgtcgt
gctgtcgccg acgcttccga tagatggctc gatacctgct 11460tctgccaacc gctcggaata
gcgaaaggac acgtattgaa caccgcgatc cgagtgatgc 11520actaggccgc catgagcggg
acgccgatca tgatgagcct cctcgagggc atcgaggaca 11580aagcctgcat gtgctgtccg
gctcgcccgc catccgacaa tgcgacgggc gaagacgtcg 11640atcacgaagg ccacgtagac
gaagccctcc caagtggcga cataagtacg gacatgcgca 11700aaggctttcc cggtttgtcg
ctgatggtgc aagagacgct gaagcgcgat ccgatgcgca 11760ggcatctgtt cgtcttccgc
ggtcgtggcg gtggcctgat caaggtcact cgccgaagag 11820ctgcatgatt ggctcgaaac
cgagcggggg aaattgtcgc gcagttctcc cgtcgccgag 11880gcgataaatt acatgctcaa
gcgatgggat ggcattacgt cattcctcga tgacggcccg 11940atttgcctga cgaacaatgc
tgccgaacga acgctcagag gctatgtact cggcaggaag 12000tcatggctgt ttgccggatc
ggatcgttgt gctgaacgtg cggcgttcat ggcgacactg 12060atcatgagcg ccaagctcaa
taacatcgat ccgcaggcct ggcttgccga cgtccgcgcc 12120gaccttgcgg acgctccgat
cagcaggctt gagcaacagc tgccgtggaa ctggacatcc 12180aagacactga gtgctcaggc
ggcctgacct gcggccttca ccggatactt accccattat 12240cgcagattgc gatgaagcat
cagcgtcatt cagcaatctt gccaaagtat gcaggctcgc 12300gagaatcgac gtgcgaaacc
ggctggttgc gccaaagatc cgcttgcgga gcggtcgaac 12360attcatgctg ggacttcaag
aggtcgagta gaggaagaac cggaaaggtt gcaccggaaa 12420atatgcgttc ctttggagag
cgcctcatgg acgtgaacaa atcgcccgga ccaaggatgc 12480cacggataca aaagctcgcg
aagctcggtc ccgtgggtgt tctgtcgtct cgttgtacaa 12540cgaaatccat tcccattccg
cgctcaagat ggcttcccct cggcagttca tcagggctaa 12600atcaatctag ccgacttgtc
cggtgaaatg ggctgcactc caacagaaac aatcaaacaa 12660acatacacag cgacttattc
acacgagctc aaattacaac ggtatatatc ctgccagtca 12720gcatcatcac accaaaagtt
aggcccgaat agtttgaaat tagaaagctc gcaattgagg 12780tctacaggcc aaattcgctc
ttagccgtac aatattactc accggtgcga tgccccccat 12840cgtaggtgaa ggtggaaatt
aatgatccat cttgagacca caggcccaca acagctacca 12900gtttcctcaa gggtccacca
aaaacgtaag cgcttacgta catggtcgat aagaaaaggc 12960aatttgtaga tgttaacatc
caacgtcgct ttcagggatc gatccaatac gcaaaccgcc 13020tctccccgcg cgttggccga
ttcattaatg cagctggcac gacaggtttc ccgactggaa 13080agcgggcagt gagcgcaacg
caattaatgt gagttagctc actcattagg caccccaggc 13140tttacacttt atgcttccgg
ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 13200cacaggaaac agctatgacc
atgattacgc caagcttgca tgcctgcagg tcgactctag 13260aggatctggc gcgccccaat
ttattatcat ctatttcctg acattttaat ccatccacct 13320atgtcaaaaa cttatagaaa
atgtcaactt ccaaacaaaa cataattgaa cttcgcaaat 13380aaattcttaa taatattaaa
aaatgttact taattatttc ttcaacccca ttttccgcgc 13440gtagcgcgga caaagactct
agttaaatat agaagtttcc gattctcatc gtataaaacg 13500gtgactttgg cgggctttca
tgtgtaacaa attggtttaa caaaccactg cctagtcgtt 13560tagtgtagaa tcagcgcatg
gaactccgat tggagcgtga ctttcacgtg ccggaggccc 13620accaccacag cgggcgttac
gctctaagaa tctcgcccac ggttttcttc atctcccccc 13680cgccaagtgt ctccctcgtt
cgccacttct catcatgtta cagggaccat aaaaatggcg 13740tatttcttca gccccgggta
taaatacaca catgatcctg tggtgggttc ttccacaagt 13800tacatctcct tctggttttt
gtattgcaag tgtttgtatt ttttgcctcc gagagaaaat 13860cgcggccgca tggctgctgc
tcccagtgtg aggacgttta ctcgggccga ggttttgaat 13920gccgaggctc tgaatgaggg
caagaaggat gccgaggcac ccttcttgat gatcatcgac 13980aacaaggtgt acgatgtccg
cgagttcgtc cctgatcatc ccggtggaag tgtgattctc 14040acgcacgttg gcaaggacgg
cactgacgtc tttgacactt ttcaccccga ggctgcttgg 14100gagactcttg ccaactttta
cgttggtgat attgacgaga gcgaccgcga tatcaagaat 14160gatgactttg cggccgaggt
ccgcaagctg cgtaccttgt tccagtctct tggttactac 14220gattcttcca aggcatacta
cgccttcaag gtctcgttca acctctgcat ctggggtttg 14280tcgacggtca ttgtggccaa
gtggggccag acctcgaccc tcgccaacgt gctctcggct 14340gcgcttttgg gtctgttctg
gcagcagtgc ggatggttgg ctcacgactt tttgcatcac 14400caggtcttcc aggaccgttt
ctggggtgat cttttcggcg ccttcttggg aggtgtctgc 14460cagggcttct cgtcctcgtg
gtggaaggac aagcacaaca ctcaccacgc cgcccccaac 14520gtccacggcg aggatcccga
cattgacacc caccctctgt tgacctggag tgagcatgcg 14580ttggagatgt tctcggatgt
cccagatgag gagctgaccc gcatgtggtc gcgtttcatg 14640gtcctgaacc agacctggtt
ttacttcccc attctctcgt ttgcccgtct ctcctggtgc 14700ctccagtcca ttctctttgt
gctgcctaac ggtcaggccc acaagccctc gggcgcgcgt 14760gtgcccatct cgttggtcga
gcagctgtcg cttgcgatgc actggacctg gtacctcgcc 14820accatgttcc tgttcatcaa
ggatcccgtc aacatgctgg tgtacttttt ggtgtcgcag 14880gcggtgtgcg gaaacttgtt
ggcgatcgtg ttctcgctca accacaacgg tatgcctgtg 14940atctcgaagg aggaggcggt
cgatatggat ttcttcacga agcagatcat cacgggtcgt 15000gatgtccacc cgggtctatt
tgccaactgg ttcacgggtg gattgaacta tcagatcgag 15060caccacttgt tcccttcgat
gcctcgccac aacttttcaa agatccagcc tgctgtcgag 15120accctgtgca aaaagtacaa
tgtccgatac cacaccaccg gtatgatcga gggaactgca 15180gaggtcttta gccgtctgaa
cgaggtctcc aaggctgcct ccaagatggg taaggcgcag 15240taagcggccg caagtatgaa
ctaaaatgca tgtaggtgta agagctcatg gagagcatgg 15300aatattgtat ccgaccatgt
aacagtataa taactgagct ccatctcact tcttctatga 15360ataaacaaag gatgttatga
tatattaaca ctctatctat gcaccttatt gttctatgat 15420aaatttcctc ttattattat
aaatcatctg aatcgtgacg gcttatggaa tgcttcaaat 15480agtacaaaaa caaatgtgta
ctataagact ttctaaacaa ttctaacctt agcattgtga 15540acgagacata agtgttaaga
agacataaca attataatgg aagaagtttg tctccattta 15600tatattatat attacccact
tatgtattat attaggatgt taaggagaca taacaattat 15660aaagagagaa gtttgtatcc
atttatatat tatatactac ccatttatat attatactta 15720tccacttatt taatgtcttt
ataaggtttg atccatgata tttctaatat tttagttgat 15780atgtatatga aagggtacta
tttgaactct cttactctgt ataaaggttg gatcatcctt 15840aaagtgggtc tatttaattt
tattgcttct tacagataaa aaaaaaatta tgagttggtt 15900tgataaaata ttgaaggatt
taaaataata ataaataaca tataatatat gtatataaat 15960ttattataat ataacattta
tctataaaaa agtaaatatt gtcataaatc tatacaatcg 16020tttagccttg ctggacgaat
ctcaattatt taaacgagag taaacatatt tgactttttg 16080gttatttaac aaattattat
ttaacactat atgaaatttt tttttttatc agcaaagaat 16140aaaattaaat taagaaggac
aatggtgtcc caatccttat acaaccaact tccacaagaa 16200agtcaagtca gagacaacaa
aaaaacaagc aaaggaaatt ttttaatttg agttgtcttg 16260tttgctgcat aatttatgca
gtaaaacact acacataacc cttttagcag tagagcaatg 16320gttgaccgtg tgcttagctt
cttttatttt atttttttat cagcaaagaa taaataaaat 16380aaaatgagac acttcaggga
tgtttcaaca agcttggatc ctcgaagaga agggttaata 16440acacactttt ttaacatttt
taacacaaat tttagttatt taaaaattta ttaaaaaatt 16500taaaataaga agaggaactc
tttaaataaa tctaacttac aaaatttatg atttttaata 16560agttttcacc aataaaaaat
gtcataaaaa tatgttaaaa agtatattat caatattctc 16620tttatgataa ataaaaagaa
aaaaaaaata aaagttaagt gaaaatgaga ttgaagtgac 16680tttaggtgtg tataaatata
tcaaccccgc caacaattta tttaatccaa atatattgaa 16740gtatattatt ccatagcctt
tatttattta tatatttatt atataaaagc tttatttgtt 16800ctaggttgtt catgaaatat
ttttttggtt ttatctccgt tgtaagaaaa tcatgtgctt 16860tgtgtcgcca ctcactattg
cagctttttc atgcattggt cagattgacg gttgattgta 16920tttttgtttt ttatggtttt
gtgttatgac ttaagtcttc atctctttat ctcttcatca 16980ggtttgatgg ttacctaata
tggtccatgg gtacatgcat ggttaaatta ggtggccaac 17040tttgttgtga acgatagaat
tttttttata ttaagtaaac tatttttata ttatgaaata 17100ataataaaaa aaatatttta
tcattattaa caaaatcata ttagttaatt tgttaactct 17160ataataaaag aaatactgta
acattcacat tacatggtaa catctttcca ccctttcatt 17220tgttttttgt ttgatgactt
tttttcttgt ttaaatttat ttcccttctt ttaaatttgg 17280aatacattat catcatatat
aaactaaaat actaaaaaca ggattacaca aatgataaat 17340aataacacaa atatttataa
atctagctgc aatatattta aactagctat atcgatattg 17400taaaataaaa ctagctgcat
tgatactgat aaaaaaatat catgtgcttt ctggactgat 17460gatgcagtat acttttgaca
ttgcctttat tttatttttc agaaaagctt tcttagttct 17520gggttcttca ttatttgttt
cccatctcca ttgtgaattg aatcatttgc ttcgtgtcac 17580aaatacaatt tagntaggta
catgcattgg tcagattcac ggtttattat gtcatgactt 17640aagttcatgg tagtacatta
cctgccacgc atgcattata ttggttagat ttgataggca 17700aatttggttg tcaacaatat
aaatataaat aatgttttta tattacgaaa taacagtgat 17760caaaacaaac agttttatct
ttattaacaa gattttgttt ttgtttgatg acgtttttta 17820atgtttacgc tttccccctt
cttttgaatt tagaacactt tatcatcata aaatcaaata 17880ctaaaaaaat tacatatttc
ataaataata acacaaatat ttttaaaaaa tctgaaataa 17940taatgaacaa tattacatat
tatcacgaaa attcattaat aaaaatatta tataaataaa 18000atgtaatagt agttatatgt
aggaaaaaag tactgcacgc ataatatata caaaaagatt 18060aaaatgaact attataaata
ataacactaa attaatggtg aatcatatca aaataatgaa 18120aaagtaaata aaatttgtaa
ttaacttcta tatgtattac acacacaaat aataaataat 18180agtaaaaaaa attatgataa
atatttacca tctcataaga tatttaaaat aatgataaaa 18240atatagatta ttttttatgc
aactagctag ccaaaaagag aacacgggta tatataaaaa 18300gagtaccttt aaattctact
gtacttcctt tattcctgac gtttttatat caagtggaca 18360tacgtgaaga ttttaattat
cagtctaaat atttcattag cacttaatac ttttctgttt 18420tattcctatc ctataagtag
tcccgattct cccaacattg cttattcaca caactaacta 18480agaaagtctt ccatagcccc
ccaagcggcc catggcctcc tccgaggacg tcatcaagga 18540gttcatgcgc ttcaaggtgc
gcatggaggg ctccgtgaac ggccacgagt tcgagatcga 18600gggcgagggc gagggccgcc
cctacgaggg cacccagacc gccaagctga aggtgaccaa 18660gggcggcccc ctgcccttcg
cctgggacat cctgtccccc cagttccagt acggctccaa 18720ggtgtacgtg aagcaccccg
ccgacatccc cgactacaag aagctgtcct tccccgaggg 18780cttcaagtgg gagcgcgtga
tgaacttcga ggacggcggc gtggtgaccg tgacccagga 18840ctcctccctg caggacggct
ccttcatcta caaggtgaag ttcatcggcg tgaacttccc 18900ctccgacggc cccgtaatgc
agaagaagac tatgggctgg gaggcctcca ccgagcgcct 18960gtacccccgc gacggcgtgc
tgaagggcga gatccacaag gccctgaagc tgaaggacgg 19020cggccactac ctggtggagt
tcaagtccat ctacatggcc aagaagcccg tgcagctgcc 19080cggctactac tacgtggact
ccaagctgga catcacctcc cacaacgagg actacaccat 19140cgtggagcag tacgagcgcg
ccgagggccg ccaccacctg ttcctgtagc ggccggccgc 19200gacacaagtg tgagagtact
aaataaatgc tttggttgta cgaaatcatt acactaaata 19260aaataatcaa agcttatata
tgccttccgc taaggccgaa tgcaaagaaa ttggttcttt 19320ctcgttatct tttgccactt
ttactagtac gtattaatta ctacttaatc atctttgttt 19380acggctcatt atatccgtcg
acgg 194046230DNAArtificial
sequencePrimer D6 fwd 62gaattcgcgg ccgcatggct gctgctccca
306330DNAArtificial sequencePrimer D6 rev
63gaattcgcgg ccgcttactg cgccttaccc
30644322DNAArtificial sequenceVector KS119 64agcttttgat ccatgccctt
catttgccgc ttattaatta atttggtaac agtccgtact 60aatcagttac ttatccttcc
cccatcataa ttaatcttgg tagtctcgaa tgccacaaca 120ctgactagtc tcttggatca
taagaaaaag ccaaggaaca aaagaagaca aaacacaatg 180agagtatcct ttgcatagca
atgtctaagt tcataaaatt caaacaaaaa cgcaatcaca 240cacagtggac atcacttatc
cactagctga tcaggatcgc cgcgtcaaga aaaaaaaact 300ggaccccaaa agccatgcac
aacaacacgt actcacaaag gtgtcaatcg agcagcccaa 360aacattcacc aactcaaccc
atcatgagcc ctcacatttg ttgtttctaa cccaacctca 420aactcgtatt ctcttccgcc
acctcatttt tgtttatttc aacacccgtc aaactgcatg 480ccaccccgtg gccaaatgtc
catgcatgtt aacaagacct atgactataa atagctgcaa 540tctcggccca ggttttcatc
atcaagaacc agttcaatat cctagtacac cgtattaaag 600aatttaagat atactgcggc
cgcaagtatg aactaaaatg catgtaggtg taagagctca 660tggagagcat ggaatattgt
atccgaccat gtaacagtat aataactgag ctccatctca 720cttcttctat gaataaacaa
aggatgttat gatatattaa cactctatct atgcacctta 780ttgttctatg ataaatttcc
tcttattatt ataaatcatc tgaatcgtga cggcttatgg 840aatgcttcaa atagtacaaa
aacaaatgtg tactataaga ctttctaaac aattctaacc 900ttagcattgt gaacgagaca
taagtgttaa gaagacataa caattataat ggaagaagtt 960tgtctccatt tatatattat
atattaccca cttatgtatt atattaggat gttaaggaga 1020cataacaatt ataaagagag
aagtttgtat ccatttatat attatatact acccatttat 1080atattatact tatccactta
tttaatgtct ttataaggtt tgatccatga tatttctaat 1140attttagttg atatgtatat
gaaagggtac tatttgaact ctcttactct gtataaaggt 1200tggatcatcc ttaaagtggg
tctatttaat tttattgctt cttacagata aaaaaaaaat 1260tatgagttgg tttgataaaa
tattgaagga tttaaaataa taataaataa catataatat 1320atgtatataa atttattata
atataacatt tatctataaa aaagtaaata ttgtcataaa 1380tctatacaat cgtttagcct
tgctggacga atctcaatta tttaaacgag agtaaacata 1440tttgactttt tggttattta
acaaattatt atttaacact atatgaaatt ttttttttta 1500tcagcaaaga ataaaattaa
attaagaagg acaatggtgt cccaatcctt atacaaccaa 1560cttccacaag aaagtcaagt
cagagacaac aaaaaaacaa gcaaaggaaa ttttttaatt 1620tgagttgtct tgtttgctgc
ataatttatg cagtaaaaca ctacacataa cccttttagc 1680agtagagcaa tggttgaccg
tgtgcttagc ttcttttatt ttattttttt atcagcaaag 1740aataaataaa ataaaatgag
acacttcagg gatgtttcaa caagcttgga tccgtcgacg 1800gcgcgcccga tcatccggat
atagttcctc ctttcagcaa aaaacccctc aagacccgtt 1860tagaggcccc aaggggttat
gctagttatt gctcagcggt ggcagcagcc aactcagctt 1920cctttcgggc tttgttagca
gccggatcga tccaagctgt acctcactat tcctttgccc 1980tcggacgagt gctggggcgt
cggtttccac tatcggcgag tacttctaca cagccatcgg 2040tccagacggc cgcgcttctg
cgggcgattt gtgtacgccc gacagtcccg gctccggatc 2100ggacgattgc gtcgcatcga
ccctgcgccc aagctgcatc atcgaaattg ccgtcaacca 2160agctctgata gagttggtca
agaccaatgc ggagcatata cgcccggagc cgcggcgatc 2220ctgcaagctc cggatgcctc
cgctcgaagt agcgcgtctg ctgctccata caagccaacc 2280acggcctcca gaagaagatg
ttggcgacct cgtattggga atccccgaac atcgcctcgc 2340tccagtcaat gaccgctgtt
atgcggccat tgtccgtcag gacattgttg gagccgaaat 2400ccgcgtgcac gaggtgccgg
acttcggggc agtcctcggc ccaaagcatc agctcatcga 2460gagcctgcgc gacggacgca
ctgacggtgt cgtccatcac agtttgccag tgatacacat 2520ggggatcagc aatcgcgcat
atgaaatcac gccatgtagt gtattgaccg attccttgcg 2580gtccgaatgg gccgaacccg
ctcgtctggc taagatcggc cgcagcgatc gcatccatag 2640cctccgcgac cggctgcaga
acagcgggca gttcggtttc aggcaggtct tgcaacgtga 2700caccctgtgc acggcgggag
atgcaatagg tcaggctctc gctgaattcc ccaatgtcaa 2760gcacttccgg aatcgggagc
gcggccgatg caaagtgccg ataaacataa cgatctttgt 2820agaaaccatc ggcgcagcta
tttacccgca ggacatatcc acgccctcct acatcgaagc 2880tgaaagcacg agattcttcg
ccctccgaga gctgcatcag gtcggagacg ctgtcgaact 2940tttcgatcag aaacttctcg
acagacgtcg cggtgagttc aggcttttcc atgggtatat 3000ctccttctta aagttaaaca
aaattatttc tagagggaaa ccgttgtggt ctccctatag 3060tgagtcgtat taatttcgcg
ggatcgagat ctgatcaacc tgcattaatg aatcggccaa 3120cgcgcgggga gaggcggttt
gcgtattggg cgctcttccg cttcctcgct cactgactcg 3180ctgcgctcgg tcgttcggct
gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg 3240ttatccacag aatcagggga
taacgcagga aagaacatgt gagcaaaagg ccagcaaaag 3300gccaggaacc gtaaaaaggc
cgcgttgctg gcgtttttcc ataggctccg cccccctgac 3360gagcatcaca aaaatcgacg
ctcaagtcag aggtggcgaa acccgacagg actataaaga 3420taccaggcgt ttccccctgg
aagctccctc gtgcgctctc ctgttccgac cctgccgctt 3480accggatacc tgtccgcctt
tctcccttcg ggaagcgtgg cgctttctca atgctcacgc 3540tgtaggtatc tcagttcggt
gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc 3600cccgttcagc ccgaccgctg
cgccttatcc ggtaactatc gtcttgagtc caacccggta 3660agacacgact tatcgccact
ggcagcagcc actggtaaca ggattagcag agcgaggtat 3720gtaggcggtg ctacagagtt
cttgaagtgg tggcctaact acggctacac tagaaggaca 3780gtatttggta tctgcgctct
gctgaagcca gttaccttcg gaaaaagagt tggtagctct 3840tgatccggca aacaaaccac
cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt 3900acgcgcagaa aaaaaggatc
tcaagaagat cctttgatct tttctacggg gtctgacgct 3960cagtggaacg aaaactcacg
ttaagggatt ttggtcatga cattaaccta taaaaatagg 4020cgtatcacga ggccctttcg
tctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac 4080atgcagctcc cggagacggt
cacagcttgt ctgtaagcgg atgccgggag cagacaagcc 4140cgtcagggcg cgtcagcggg
tgttggcggg tgtcggggct ggcttaacta tgcggcatca 4200gagcagattg tactgagagt
gcaccatatg gacatattgt cgttagaacg cggctacaat 4260taatacataa ccttatgtat
catacacata cgatttaggt gacactatag aacggcgcgc 4320ca
4322659420DNAArtificial
sequenceVector KS430 65ggccgcaagt atgaactaaa atgcatgtag gtgtaagagc
tcatggagag catggaatat 60tgtatccgac catgtaacag tataataact gagctccatc
tcacttcttc tatgaataaa 120caaaggatgt tatgatatat taacactcta tctatgcacc
ttattgttct atgataaatt 180tcctcttatt attataaatc atctgaatcg tgacggctta
tggaatgctt caaatagtac 240aaaaacaaat gtgtactata agactttcta aacaattcta
accttagcat tgtgaacgag 300acataagtgt taagaagaca taacaattat aatggaagaa
gtttgtctcc atttatatat 360tatatattac ccacttatgt attatattag gatgttaagg
agacataaca attataaaga 420gagaagtttg tatccattta tatattatat actacccatt
tatatattat acttatccac 480ttatttaatg tctttataag gtttgatcca tgatatttct
aatattttag ttgatatgta 540tatgaaaggg tactatttga actctcttac tctgtataaa
ggttggatca tccttaaagt 600gggtctattt aattttattg cttcttacag ataaaaaaaa
aattatgagt tggtttgata 660aaatattgaa ggatttaaaa taataataaa taacatataa
tatatgtata taaatttatt 720ataatataac atttatctat aaaaaagtaa atattgtcat
aaatctatac aatcgtttag 780ccttgctgga cgaatctcaa ttatttaaac gagagtaaac
atatttgact ttttggttat 840ttaacaaatt attatttaac actatatgaa attttttttt
ttatcagcaa agaataaaat 900taaattaaga aggacaatgg tgtcccaatc cttatacaac
caacttccac aagaaagtca 960agtcagagac aacaaaaaaa caagcaaagg aaatttttta
atttgagttg tcttgtttgc 1020tgcataattt atgcagtaaa acactacaca taaccctttt
agcagtagag caatggttga 1080ccgtgtgctt agcttctttt attttatttt tttatcagca
aagaataaat aaaataaaat 1140gagacacttc agggatgttt caacaagctt ggatcctcga
agagaagggt taataacaca 1200cttttttaac atttttaaca caaattttag ttatttaaaa
atttattaaa aaatttaaaa 1260taagaagagg aactctttaa ataaatctaa cttacaaaat
ttatgatttt taataagttt 1320tcaccaataa aaaatgtcat aaaaatatgt taaaaagtat
attatcaata ttctctttat 1380gataaataaa aagaaaaaaa aaataaaagt taagtgaaaa
tgagattgaa gtgactttag 1440gtgtgtataa atatatcaac cccgccaaca atttatttaa
tccaaatata ttgaagtata 1500ttattccata gcctttattt atttatatat ttattatata
aaagctttat ttgttctagg 1560ttgttcatga aatatttttt tggttttatc tccgttgtaa
gaaaatcatg tgctttgtgt 1620cgccactcac tattgcagct ttttcatgca ttggtcagat
tgacggttga ttgtattttt 1680gttttttatg gttttgtgtt atgacttaag tcttcatctc
tttatctctt catcaggttt 1740gatggttacc taatatggtc catgggtaca tgcatggtta
aattaggtgg ccaactttgt 1800tgtgaacgat agaatttttt ttatattaag taaactattt
ttatattatg aaataataat 1860aaaaaaaata ttttatcatt attaacaaaa tcatattagt
taatttgtta actctataat 1920aaaagaaata ctgtaacatt cacattacat ggtaacatct
ttccaccctt tcatttgttt 1980tttgtttgat gacttttttt cttgtttaaa tttatttccc
ttcttttaaa tttggaatac 2040attatcatca tatataaact aaaatactaa aaacaggatt
acacaaatga taaataataa 2100cacaaatatt tataaatcta gctgcaatat atttaaacta
gctatatcga tattgtaaaa 2160taaaactagc tgcattgata ctgataaaaa aatatcatgt
gctttctgga ctgatgatgc 2220agtatacttt tgacattgcc tttattttat ttttcagaaa
agctttctta gttctgggtt 2280cttcattatt tgtttcccat ctccattgtg aattgaatca
tttgcttcgt gtcacaaata 2340caatttagnt aggtacatgc attggtcaga ttcacggttt
attatgtcat gacttaagtt 2400catggtagta cattacctgc cacgcatgca ttatattggt
tagatttgat aggcaaattt 2460ggttgtcaac aatataaata taaataatgt ttttatatta
cgaaataaca gtgatcaaaa 2520caaacagttt tatctttatt aacaagattt tgtttttgtt
tgatgacgtt ttttaatgtt 2580tacgctttcc cccttctttt gaatttagaa cactttatca
tcataaaatc aaatactaaa 2640aaaattacat atttcataaa taataacaca aatattttta
aaaaatctga aataataatg 2700aacaatatta catattatca cgaaaattca ttaataaaaa
tattatataa ataaaatgta 2760atagtagtta tatgtaggaa aaaagtactg cacgcataat
atatacaaaa agattaaaat 2820gaactattat aaataataac actaaattaa tggtgaatca
tatcaaaata atgaaaaagt 2880aaataaaatt tgtaattaac ttctatatgt attacacaca
caaataataa ataatagtaa 2940aaaaaattat gataaatatt taccatctca taagatattt
aaaataatga taaaaatata 3000gattattttt tatgcaacta gctagccaaa aagagaacac
gggtatatat aaaaagagta 3060cctttaaatt ctactgtact tcctttattc ctgacgtttt
tatatcaagt ggacatacgt 3120gaagatttta attatcagtc taaatatttc attagcactt
aatacttttc tgttttattc 3180ctatcctata agtagtcccg attctcccaa cattgcttat
tcacacaact aactaagaaa 3240gtcttccata gccccccaag cggcccatgg cctcctccga
ggacgtcatc aaggagttca 3300tgcgcttcaa ggtgcgcatg gagggctccg tgaacggcca
cgagttcgag atcgagggcg 3360agggcgaggg ccgcccctac gagggcaccc agaccgccaa
gctgaaggtg accaagggcg 3420gccccctgcc cttcgcctgg gacatcctgt ccccccagtt
ccagtacggc tccaaggtgt 3480acgtgaagca ccccgccgac atccccgact acaagaagct
gtccttcccc gagggcttca 3540agtgggagcg cgtgatgaac ttcgaggacg gcggcgtggt
gaccgtgacc caggactcct 3600ccctgcagga cggctccttc atctacaagg tgaagttcat
cggcgtgaac ttcccctccg 3660acggccccgt aatgcagaag aagactatgg gctgggaggc
ctccaccgag cgcctgtacc 3720cccgcgacgg cgtgctgaag ggcgagatcc acaaggccct
gaagctgaag gacggcggcc 3780actacctggt ggagttcaag tccatctaca tggccaagaa
gcccgtgcag ctgcccggct 3840actactacgt ggactccaag ctggacatca cctcccacaa
cgaggactac accatcgtgg 3900agcagtacga gcgcgccgag ggccgccacc acctgttcct
gtagcggccg gccgcgacac 3960aagtgtgaga gtactaaata aatgctttgg ttgtacgaaa
tcattacact aaataaaata 4020atcaaagctt atatatgcct tccgctaagg ccgaatgcaa
agaaattggt tctttctcgt 4080tatcttttgc cacttttact agtacgtatt aattactact
taatcatctt tgtttacggc 4140tcattatatc cgtcgacggc gcgccgctct agagggccca
attcgcccta tagtgagtcg 4200tattacaatt cactggccgt cgttttacaa cgtcgtgact
gggaaaaccc tggcgttacc 4260caacttaatc gccttgcagc acatccccct ttcgccagct
ggcgtaatag cgaagaggcc 4320cgcaccgatc gcccttccca acagttgcgc agcctatacg
tacggcagtt taaggtttac 4380acctataaaa gagagagccg ttatcgtctg tttgtggatg
tacagagtga tattattgac 4440acgccggggc gacggatggt gatccccctg gccagtgcac
gtctgctgtc agataaagtc 4500tcccgtgaac tttacccggt ggtgcatatc ggggatgaaa
gctggcgcat gatgaccacc 4560gatatggcca gtgtgccggt ctccgttatc ggggaagaag
tggctgatct cagccaccgc 4620gaaaatgaca tcaaaaacgc cattaacctg atgttctggg
gaatataaat gtcaggcatg 4680agattatcaa aaaggatctt cacctagatc cttttcacgt
agaaagccag tccgcagaaa 4740cggtgctgac cccggatgaa tgtcagctac tgggctatct
ggacaaggga aaacgcaagc 4800gcaaagagaa agcaggtagc ttgcagtggg cttacatggc
gatagctaga ctgggcggtt 4860ttatggacag caagcgaacc ggaattgcca gctggggcgc
cctctggtaa ggttgggaag 4920ccctgcaaag taaactggat ggctttcttg ccgccaagga
tctgatggcg caggggatca 4980agctctgatc aagagacagg atgaggatcg tttcgcatga
ttgaacaaga tggattgcac 5040gcaggttctc cggccgcttg ggtggagagg ctattcggct
atgactgggc acaacagaca 5100atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc
aggggcgccc ggttcttttt 5160gtcaagaccg acctgtccgg tgccctgaat gaactgcaag
acgaggcagc gcggctatcg 5220tggctggcca cgacgggcgt tccttgcgca gctgtgctcg
acgttgtcac tgaagcggga 5280agggactggc tgctattggg cgaagtgccg gggcaggatc
tcctgtcatc tcaccttgct 5340cctgccgaga aagtatccat catggctgat gcaatgcggc
ggctgcatac gcttgatccg 5400gctacctgcc cattcgacca ccaagcgaaa catcgcatcg
agcgagcacg tactcggatg 5460gaagccggtc ttgtcgatca ggatgatctg gacgaagagc
atcaggggct cgcgccagcc 5520gaactgttcg ccaggctcaa ggcgagcatg cccgacggcg
aggatctcgt cgtgacccat 5580ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc
gcttttctgg attcatcgac 5640tgtggccggc tgggtgtggc ggaccgctat caggacatag
cgttggctac ccgtgatatt 5700gctgaagagc ttggcggcga atgggctgac cgcttcctcg
tgctttacgg tatcgccgct 5760cccgattcgc agcgcatcgc cttctatcgc cttcttgacg
agttcttctg aattattaac 5820gcttacaatt tcctgatgcg gtattttctc cttacgcatc
tgtgcggtat ttcacaccgc 5880atcaggtggc acttttcggg gaaatgtgcg cggaacccct
atttgtttat ttttctaaat 5940acattcaaat atgtatccgc tcatgagaca ataaccctga
taaatgcttc aataatagca 6000cgtgaggagg gccaccatgg ccaagttgac cagtgccgtt
ccggtgctca ccgcgcgcga 6060cgtcgccgga gcggtcgagt tctggaccga ccggctcggg
ttctcccggg acttcgtgga 6120ggacgacttc gccggtgtgg tccgggacga cgtgaccctg
ttcatcagcg cggtccagga 6180ccaggtggtg ccggacaaca ccctggcctg ggtgtgggtg
cgcggcctgg acgagctgta 6240cgccgagtgg tcggaggtcg tgtccacgaa cttccgggac
gcctccgggc cggccatgac 6300cgagatcggc gagcagccgt gggggcggga gttcgccctg
cgcgacccgg ccggcaactg 6360cgtgcacttc gtggccgagg agcaggactg acacgtgcta
aaacttcatt tttaatttaa 6420aaggatctag gtgaagatcc tttttgataa tctcatgacc
aaaatccctt aacgtgagtt 6480ttcgttccac tgagcgtcag accccgtaga aaagatcaaa
ggatcttctt gagatccttt 6540ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca
ccgctaccag cggtggtttg 6600tttgccggat caagagctac caactctttt tccgaaggta
actggcttca gcagagcgca 6660gataccaaat actgttcttc tagtgtagcc gtagttaggc
caccacttca agaactctgt 6720agcaccgcct acatacctcg ctctgctaat cctgttacca
gtggctgctg ccagtggcga 6780taagtcgtgt cttaccgggt tggactcaag acgatagtta
ccggataagg cgcagcggtc 6840gggctgaacg gggggttcgt gcacacagcc cagcttggag
cgaacgacct acaccgaact 6900gagataccta cagcgtgagc tatgagaaag cgccacgctt
cccgaaggga gaaaggcgga 6960caggtatccg gtaagcggca gggtcggaac aggagagcgc
acgagggagc ttccaggggg 7020aaacgcctgg tatctttata gtcctgtcgg gtttcgccac
ctctgacttg agcgtcgatt 7080tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac
gccagcaacg cggccttttt 7140acggttcctg gccttttgct ggccttttgc tcacatgttc
tttcctgcgt tatcccctga 7200ttctgtggat aaccgtatta ccgcctttga gtgagctgat
accgctcgcc gcagccgaac 7260gaccgagcgc agcgagtcag tgagcgagga agcggaagag
cgcccaatac gcaaaccgcc 7320tctccccgcg cgttggccga ttcattaatg cagctggcac
gacaggtttc ccgactggaa 7380agcgggcagt gagcgcaacg caattaatgt gagttagctc
actcattagg caccccaggc 7440tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt
gtgagcggat aacaatttca 7500cacaggaaac agctatgacc atgattacgc caagctattt
aggtgacgcg ttagaatact 7560caagctatgc atcaagcttg gtaccgagct cggatccact
agtaacggcc gccagtgtgc 7620tggaattcag gggcgcgccc caatttatta tcatctattt
cctgacattt taatccatcc 7680acctatgtca aaaacttata gaaaatgtca acttccaaac
aaaacataat tgaacttcgc 7740aaataaattc ttaataatat taaaaaatgt tacttaatta
tttcttcaac cccattttcc 7800gcgcgtagcg cggacaaaga ctctagttaa atatagaagt
ttccgattct catcgtataa 7860aacggtgact ttggcgggct ttcatgtgta acaaattggt
ttaacaaacc actgcctagt 7920cgtttagtgt agaatcagcg catggaactc cgattggagc
gtgactttca cgtgccggag 7980gcccaccacc acagcgggcg ttacgctcta agaatctcgc
ccacggtttt cttcatctcc 8040cccccgccaa gtgtctccct cgttcgccac ttctcatcat
gttacaggga ccataaaaat 8100ggcgtatttc ttcagccccg ggtataaata cacacatgat
cctgtggtgg gttcttccac 8160aagttacatc tccttctggt ttttgtattg caagtgtttg
tattttttgc ctccgagaga 8220aaatcgcggc cgcatggaga gatctcaacg gcagtctcct
ccgccaccgt cgccgtcctc 8280ctcctcgtcc tccgtctccg cggacaccgt cctcgtccct
cccggaaaga ggcggagggc 8340ggcgacggcc aaggccggcg ccgagcctaa taagaggatc
cgcaaggacc ccgccgccgc 8400cgccgcgggg aagaggagct ccgtctacag gggagtcacc
aggcacaggt ggacgggcag 8460gttcgaggcg catctctggg acaagcactg cctcgccgcg
ctccacaaca agaagaaagg 8520caggcaagtc tacctggggg cgtatgacag cgaggaggca
gctgctcgtg cctatgacct 8580cgcagctctc aagtactggg gtcctgagac tctgctcaac
ttccctgtgg aggattactc 8640cagcgagatg ccggagatgg aggccgtgtc ccgggaggag
tacctggcct ccctccgccg 8700caggagcagc ggcttctcca ggggcgtctc caagtacaga
ggcgtcgcca ggcatcacca 8760caacgggagg tgggaggcac ggattgggcg agtctttggg
aacaagtacc tctacttggg 8820aacatttgac actcaagaag aggcagccaa ggcctatgac
cttgcggcca ttgaataccg 8880tggcgtcaat gctgtaacca acttcgacat cagctgctac
ctggaccacc cgctgttcct 8940ggcacagctc caacaggagc cacaggtggt gccggcactc
aaccaagaac ctcaacctga 9000tcagagcgaa accggaacta cagagcaaga gccggagtca
agcgaagcca agacaccgga 9060tggcagtgca gaacccgatg agaacgcggt gcctgacgac
accgcggagc ccctcaccac 9120agtcgacgac agcatcgaag agggcttgtg gagcccttgc
atggattacg agctagacac 9180catgtcgaga ccaaactttg gcagctcaat caatctgagc
gagtggttcg ctgacgcaga 9240cttcgactgc aacatcggat gcctgttcga tgggtgttct
gcggctgacg aaggaagcaa 9300ggatggtgta ggtctggcag atttcagtct gtttgaggca
ggtgatgtcc agctgaagga 9360tgttctttcg gatatggaag aggggataca acctccagcg
atgatcagtg tgtgcaacgc 94206619215DNAArtificial sequenceVector ARALO78
66cgcgcctcga gtgggcggat cccccgggct gcaggaattc actggccgtc gttttacaac
60gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt
120tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca
180gcctgaatgg cgaatggatc gatccatcgc gatgtacctt ttgttagtca gcctctcgat
240tgctcatcgt cattacacag taccgaagtt tgatcgatct agtaacatag atgacaccgc
300gcgcgataat ttatcctagt ttgcgcgcta tattttgttt tctatcgcgt attaaatgta
360taattgcggg actctaatca taaaaaccca tctcataaat aacgtcatgc attacatgtt
420aattattaca tgcttaacgt aattcaacag aaattatatg ataatcatcg caagaccggc
480aacaggattc aatcttaaga aactttattg ccaaatgttt gaacgatctg cttcgacgca
540ctccttcttt actccaccat ctcgtcctta ttgaaaacgt gggtagcacc aaaacgaatc
600aagtcgctgg aactgaagtt accaatcacg ctggatgatt tgccagttgg attaatcttg
660cctttccccg catgaataat attgatgaat gcatgcgtga ggggtagttc gatgttggca
720atagctgcaa ttgccgcgac atcctccaac gagcataatt cttcagaaaa atagcgatgt
780tccatgttgt cagggcatgc atgatgcacg ttatgaggtg acggtgctag gcagtattcc
840ctcaaagttt catagtcagt atcatattca tcattgcatt cctgcaagag agaattgaga
900cgcaatccac acgctgcggc aaccttccgg cgttcgtggt ctatttgctc ttggacgttg
960caaacgtaag tgttggatcg atccggggtg ggcgaagaac tccagcatga gatccccgcg
1020ctggaggatc atccagccgg cgtcccggaa aacgattccg aagcccaacc tttcatagaa
1080ggcggcggtg gaatcgaaat ctcgtgatgg caggttgggc gtcgcttggt cggtcatttc
1140gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc gatgcgctgc
1200gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc gccgccaagc
1260tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc cacacccagc
1320cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt cggcaagcag
1380gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt gagcctggcg
1440aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg atcgacaaga
1500ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gtcgaatggg
1560caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat ggatactttc
1620tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc caatagcagc
1680cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg
1740gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc ggacaggtcg
1800gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc ggcatcagag
1860cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca agcggccgga
1920gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atccccgcaa gcttggagac
1980tggtgatttc agcgtgtcct ctccaaatga aatgaacttc cttatataga ggaagggtct
2040tgcgaaggat agtgggattg tgcgtcatcc cttacgtcag tggagatatc acatcaatcc
2100acttgctttg aagacgtggt tggaacgtct tctttttcca cgatgctcct cgtgggtggg
2160ggtccatctt tgggaccact gtcggcagag gcatcttcaa cgatggcctt tcctttatcg
2220caatgatggc atttgtagga gccaccttcc ttttccacta tcttcacaat aaagtgacag
2280atagctgggc aatggaatcc gaggaggttt ccggatatta ccctttgttg aaaagtctca
2340attgcccttt ggtcttctga gactgtatct ttgatatttt tggagtagac aagcgtgtcg
2400tgctccacca tgttgacgaa gattttcttc ttgtcattga gtcgtaagag actctgtatg
2460aactgttcgc cagtctttac ggcgagttct gttaggtcct ctatttgaat ctttgactcc
2520atggcctttg attcagtggg aactaccttt ttagagactc caatctctat tacttgcctt
2580ggtttgtgaa gcaagccttg aatcgtccat actggaatag tacttctgat cttgagaaat
2640atatctttct ctgtgttctt gatgcagtta gtcctgaatc ttttgactgc atctttaacc
2700ttcttgggaa ggtatttgat ctcctggaga ttattgctcg ggtagatcgt cttgatgaga
2760cctgctgcgt aagcctctct aaccatctgt gggttagcat tctttctgaa attgaaaagg
2820ctaatcttct cattatcagt ggtgaacatg gtatcgtcac cttctccgtc gaacttcctg
2880actagatcgt agagatagag gaagtcgtcc attgtgatct ctggggcaaa ggagatctga
2940attaattcga tatggtggat ttatcacaaa tgggacccgc cgccgacaga ggtgtgatgt
3000taggccagga ctttgaaaat ttgcgcaact atcgtatagt ggccgacaaa ttgacgccga
3060gttgacagac tgcctagcat ttgagtgaat tatgtgaggt aatgggctac actgaattgg
3120tagctcaaac tgtcagtatt tatgtatatg agtgtatatt ttcgcataat ctcagaccaa
3180tctgaagatg aaatgggtat ctgggaatgg cgaaatcaag gcatcgatcg tgaagtttct
3240catctaagcc cccatttgga cgtgaatgta gacacgtcga aataaagatt tccgaattag
3300aataatttgt ttattgcttt cgcctataaa tacgacggat cgtaatttgt cgttttatca
3360aaatgtactt tcattttata ataacgctgc ggacatctac atttttgaat tgaaaaaaaa
3420ttggtaatta ctctttcttt ttctccatat tgaccatcat actcattgct gatccatgta
3480gatttcccgg acatgaagcc atttacaatt gaatatatcc tgccgccgct gccgctttgc
3540acccggtgga gcttgcatgt tggtttctac gcagaactga gccggttagg cagataattt
3600ccattgagaa ctgagccatg tgcaccttcc ccccaacacg gtgagcgacg gggcaacgga
3660gtgatccaca tgggactttt aaacatcatc cgtcggatgg cgttgcgaga gaagcagtcg
3720atccgtgaga tcagccgacg caccgggcag gcgcgcaaca cgatcgcaaa gtatttgaac
3780gcaggtacaa tcgagccgac gttcacgcgg aacgaccaag caagctagct ttaatgcggt
3840agtttatcac agttaaattg ctaacgcagt caggcaccgt gtatgaaatc taacaatgcg
3900ctcatcgtca tcctcggcac cgtcaccctg gatgctgtag gcataggctt ggttatgccg
3960gtactgccgg gcctcttgcg ggatatcgtc cattccgaca gcatcgccag tcactatggc
4020gtgctgctag cgctatatgc gttgatgcaa tttctatgcg cacccgttct cggagcactg
4080tccgaccgct ttggccgccg cccagtcctg ctcgcttcgc tacttggagc cactatcgac
4140tacgcgatca tggcgaccac acccgtcctg tggtccaacc cctccgctgc tatagtgcag
4200tcggcttctg acgttcagtg cagccgtctt ctgaaaacga catgtcgcac aagtcctaag
4260ttacgcgaca ggctgccgcc ctgccctttt cctggcgttt tcttgtcgcg tgttttagtc
4320gcataaagta gaatacttgc gactagaacc ggagacatta cgccatgaac aagagcgccg
4380ccgctggcct gctgggctat gcccgcgtca gcaccgacga ccaggacttg accaaccaac
4440gggccgaact gcacgcggcc ggctgcacca agctgttttc cgagaagatc accggcacca
4500ggcgcgaccg cccggagctg gccaggatgc ttgaccacct acgccctggc gacgttgtga
4560cagtgaccag gctagaccgc ctggcccgca gcacccgcga cctactggac attgccgagc
4620gcatccagga ggccggcgcg ggcctgcgta gcctggcaga gccgtgggcc gacaccacca
4680cgccggccgg ccgcatggtg ttgaccgtgt tcgccggcat tgccgagttc gagcgttccc
4740taatcatcga ccgcacccgg agcgggcgcg aggccgccaa ggcccgaggc gtgaagtttg
4800gcccccgccc taccctcacc ccggcacaga tcgcgcacgc ccgcgagctg atcgaccagg
4860aaggccgcac cgtgaaagag gcggctgcac tgcttggcgt gcatcgctcg accctgtacc
4920gcgcacttga gcgcagcgag gaagtgacgc ccaccgaggc caggcggcgc ggtgccttcc
4980gtgaggacgc attgaccgag gccgacgccc tggcggccgc cgagaatgaa cgccaagagg
5040aacaagcatg aaaccgcacc aggacggcca ggacgaaccg tttttcatta ccgaagagat
5100cgaggcggag atgatcgcgg ccgggtacgt gttcgagccg cccgcgcacg tctcaaccgt
5160gcggctgcat gaaatcctgg ccggtttgtc tgatgccaag ctggcggcct ggccggccag
5220cttggccgct gaagaaaccg agcgccgccg tctaaaaagg tgatgtgtat ttgagtaaaa
5280cagcttgcgt catgcggtcg ctgcgtatat gatgcgatga gtaaataaac aaatacgcaa
5340gggaacgcat gaagttatcg ctgtacttaa ccagaaaggc gggtcaggca agacgaccat
5400cgcaacccat ctagcccgcg ccctgcaact cgccggggcc gatgttctgt tagtcgattc
5460cgatccccag ggcagtgccc gcgattgggc ggccgtgcgg gaagatcaac cgctaaccgt
5520tgtcggcatc gaccgcccga cgattgaccg cgacgtgaag gccatcggcc ggcgcgactt
5580cgtagtgatc gacggagcgc cccaggcggc ggacttggct gtgtccgcga tcaaggcagc
5640cgacttcgtg ctgattccgg tgcagccaag cccttacgac atatgggcca ccgccgacct
5700ggtggagctg gttaagcagc gcattgaggt cacggatgga aggctacaag cggcctttgt
5760cgtgtcgcgg gcgatcaaag gcacgcgcat cggcggtgag gttgccgagg cgctggccgg
5820gtacgagctg cccattcttg agtcccgtat cacgcagcgc gtgagctacc caggcactgc
5880cgccgccggc acaaccgttc ttgaatcaga acccgagggc gacgctgccc gcgaggtcca
5940ggcgctggcc gctgaaatta aatcaaaact catttgagtt aatgaggtaa agagaaaatg
6000agcaaaagca caaacacgct aagtgccggc cgtccgagcg cacgcagcag caaggctgca
6060acgttggcca gcctggcaga cacgccagcc atgaagcggg tcaactttca gttgccggcg
6120gaggatcaca ccaagctgaa gatgtacgcg gtacgccaag gcaagaccat taccgagctg
6180ctatctgaat acatcgcgca gctaccagag taaatgagca aatgaataaa tgagtagatg
6240aattttagcg gctaaaggag gcggcatgga aaatcaagaa caaccaggca ccgacgccgt
6300ggaatgcccc atgtgtggag gaacgggcgg ttggccaggc gtaagcggct gggttgtctg
6360ccggccctgc aatggcactg gaacccccaa gcccgaggaa tcggcgtgag cggtcgcaaa
6420ccatccggcc cggtacaaat cggcgcggcg ctgggtgatg acctggtgga gaagttgaag
6480gccgcgcagg ccgcccagcg gcaacgcatc gaggcagaag cacgccccgg tgaatcgtgg
6540caagcggccg ctgatcgaat ccgcaaagaa tcccggcaac cgccggcagc cggtgcgccg
6600tcgattagga agccgcccaa gggcgacgag caaccagatt ttttcgttcc gatgctctat
6660gacgtgggca cccgcgatag tcgcagcatc atggacgtgg ccgttttccg tctgtcgaag
6720cgtgaccgac gagctggcga ggtgatccgc tacgagcttc cagacgggca cgtagaggtt
6780tccgcagggc cggccggcat ggccagtgtg tgggattacg acctggtact gatggcggtt
6840tcccatctaa ccgaatccat gaaccgatac cgggaaggga agggagacaa gcccggccgc
6900gtgttccgtc cacacgttgc ggacgtactc aagttctgcc ggcgagccga tggcggaaag
6960cagaaagacg acctggtaga aacctgcatt cggttaaaca ccacgcacgt tgccatgcag
7020cgtacgaaga aggccaagaa cggccgcctg gtgacggtat ccgagggtga agccttgatt
7080agccgctaca agatcgtaaa gagcgaaacc gggcggccgg agtacatcga gatcgagcta
7140gctgattgga tgtaccgcga gatcacagaa ggcaagaacc cggacgtgct gacggttcac
7200cccgattact ttttgatcga tcccggcatc ggccgttttc tctaccgcct ggcacgccgc
7260gccgcaggca aggcagaagc cagatggttg ttcaagacga tctacgaacg cagtggcagc
7320gccggagagt tcaagaagtt ctgtttcacc gtgcgcaagc tgatcgggtc aaatgacctg
7380ccggagtacg atttgaagga ggaggcgggg caggctggcc cgatcctagt catgcgctac
7440cgcaacctga tcgagggcga agcatccgcc ggttcctaat gtacggagca gatgctaggg
7500caaattgccc tagcagggga aaaaggtcga aaaggtctct ttcctgtgga tagcacgtac
7560attgggaacc caaagccgta cattgggaac cggaacccgt acattgggaa cccaaagccg
7620tacattggga accggtcaca catgtaagtg actgatataa aagagaaaaa aggcgatttt
7680tccgcctaaa actctttaaa acttattaaa actcttaaaa cccgcctggc ctgtgcataa
7740ctgtctggcc agcgcacagc cgaagagctg caaaaagcgc ctacccttcg gtcgctgcgc
7800tccctacgcc ccgccgcttc gcgtcggcct atcgcggccg ctggccgctc aaaaatggct
7860ggcctacggc caggcaatct accagggcgc ggacaagccg cgccgtcgcc actcgaccgc
7920cggcgcccac atcaaggcac cctgcctcgc gcgtttcggt gatgacggtg aaaacctctg
7980acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca
8040agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc
8100acgtagcgat agcggagtgt atactggctt aactatgcgg catcagagca gattgtactg
8160agagtgcacc atatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc
8220aggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga
8280gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca
8340ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg
8400ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt
8460cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc
8520ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct
8580tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc
8640gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta
8700tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca
8760gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag
8820tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag
8880ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt
8940agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa
9000gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg
9060attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga
9120agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta
9180atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc
9240cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg
9300ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga
9360agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt
9420tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt
9480gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc
9540caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt tagctccttc
9600ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca
9660gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag
9720tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg
9780tcaacacggg ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa
9840gacctgcagg gggggggggg cgctgaggtc tgcctcgtga agaaggtgtt gctgactcat
9900accaggcctg aatcgcccca tcatccagcc agaaagtgag ggagccacgg ttgatgagag
9960ctttgttgta ggtggaccag ttggtgattt tgaacttttg ctttgccacg gaacggtctg
10020cgttgtcggg aagatgcgtg atctgatcct tcaactcagc aaaagttcga tttattcaac
10080aaagccgccg tcccgtcaag tcagcgtaat gctctgccag tgttacaacc aattaaccaa
10140ttctgattag aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt
10200atcaatacca tatttttgaa aaagccgttt ctgtaatgaa ggagaaaact caccgaggca
10260gttccatagg atggcaagat cctggtatcg gtctgcgatt ccgactcgtc caacatcaat
10320acaacctatt aatttcccct cgtcaaaaat aaggttatca agtgagaaat caccatgagt
10380gacgactgaa tccggtgaga atggcaaaag cttatgcatt tctttccaga cttgttcaac
10440aggccagcca ttacgctcgt catcaaaatc actcgcatca accaaaccgt tattcattcg
10500tgattgcgcc tgagcgagac gaaatacgcg atcgctgtta aaaggacaat tacaaacagg
10560aatcgaatgc aaccggcgca ggaacactgc cagcgcatca acaatatttt cacctgaatc
10620aggatattct tctaatacct ggaatgctgt tttcccgggg atcgcagtgg tgagtaacca
10680tgcatcatca ggagtacgga taaaatgctt gatggtcgga agaggcataa attccgtcag
10740ccagtttagt ctgaccatct catctgtaac atcattggca acgctacctt tgccatgttt
10800cagaaacaac tctggcgcat cgggcttccc atacaatcga tagattgtcg cacctgattg
10860cccgacatta tcgcgagccc atttataccc atataaatca gcatccatgt tggaatttaa
10920tcgcggcctc gagcaagacg tttcccgttg aatatggctc ataacacccc ttgtattact
10980gtttatgtaa gcagacagtt ttattgttca tgatgatata tttttatctt gtgcaatgta
11040acatcagaga ttttgagaca caacgtggct ttcccccccc cccctgcagg tcaattcggt
11100cgatatggct attacgaaga aggctcgtgc gcggagtccc gtgaactttc ccacgcaaca
11160agtgaaccgc accgggtttg ccggaggcca tttcgttaaa atgcgcagcc atggctgctt
11220cgtccagcat ggcgtaatac tgatcctcgt cttcggctgg cggtatattg ccgatgggct
11280tcaaaagccg ccgtggttga accagtctat ccattccaag gtagcgaact cgaccgcttc
11340gaagctcctc catggtccac gccgatgaat gacctcggcc ttgtaaagac cgttgatcgc
11400ttctgcgagg gcgttgtcgt gctgtcgccg acgcttccga tagatggctc gatacctgct
11460tctgccaacc gctcggaata gcgaaaggac acgtattgaa caccgcgatc cgagtgatgc
11520actaggccgc catgagcggg acgccgatca tgatgagcct cctcgagggc atcgaggaca
11580aagcctgcat gtgctgtccg gctcgcccgc catccgacaa tgcgacgggc gaagacgtcg
11640atcacgaagg ccacgtagac gaagccctcc caagtggcga cataagtacg gacatgcgca
11700aaggctttcc cggtttgtcg ctgatggtgc aagagacgct gaagcgcgat ccgatgcgca
11760ggcatctgtt cgtcttccgc ggtcgtggcg gtggcctgat caaggtcact cgccgaagag
11820ctgcatgatt ggctcgaaac cgagcggggg aaattgtcgc gcagttctcc cgtcgccgag
11880gcgataaatt acatgctcaa gcgatgggat ggcattacgt cattcctcga tgacggcccg
11940atttgcctga cgaacaatgc tgccgaacga acgctcagag gctatgtact cggcaggaag
12000tcatggctgt ttgccggatc ggatcgttgt gctgaacgtg cggcgttcat ggcgacactg
12060atcatgagcg ccaagctcaa taacatcgat ccgcaggcct ggcttgccga cgtccgcgcc
12120gaccttgcgg acgctccgat cagcaggctt gagcaacagc tgccgtggaa ctggacatcc
12180aagacactga gtgctcaggc ggcctgacct gcggccttca ccggatactt accccattat
12240cgcagattgc gatgaagcat cagcgtcatt cagcaatctt gccaaagtat gcaggctcgc
12300gagaatcgac gtgcgaaacc ggctggttgc gccaaagatc cgcttgcgga gcggtcgaac
12360attcatgctg ggacttcaag aggtcgagta gaggaagaac cggaaaggtt gcaccggaaa
12420atatgcgttc ctttggagag cgcctcatgg acgtgaacaa atcgcccgga ccaaggatgc
12480cacggataca aaagctcgcg aagctcggtc ccgtgggtgt tctgtcgtct cgttgtacaa
12540cgaaatccat tcccattccg cgctcaagat ggcttcccct cggcagttca tcagggctaa
12600atcaatctag ccgacttgtc cggtgaaatg ggctgcactc caacagaaac aatcaaacaa
12660acatacacag cgacttattc acacgagctc aaattacaac ggtatatatc ctgccagtca
12720gcatcatcac accaaaagtt aggcccgaat agtttgaaat tagaaagctc gcaattgagg
12780tctacaggcc aaattcgctc ttagccgtac aatattactc accggtgcga tgccccccat
12840cgtaggtgaa ggtggaaatt aatgatccat cttgagacca caggcccaca acagctacca
12900gtttcctcaa gggtccacca aaaacgtaag cgcttacgta catggtcgat aagaaaaggc
12960aatttgtaga tgttaacatc caacgtcgct ttcagggatc gatccaatac gcaaaccgcc
13020tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa
13080agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc
13140tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca
13200cacaggaaac agctatgacc atgattacgc caagcttgca tgcctgcagg tcgactctag
13260aggatctggc gcgccccaat ttattatcat ctatttcctg acattttaat ccatccacct
13320atgtcaaaaa cttatagaaa atgtcaactt ccaaacaaaa cataattgaa cttcgcaaat
13380aaattcttaa taatattaaa aaatgttact taattatttc ttcaacccca ttttccgcgc
13440gtagcgcgga caaagactct agttaaatat agaagtttcc gattctcatc gtataaaacg
13500gtgactttgg cgggctttca tgtgtaacaa attggtttaa caaaccactg cctagtcgtt
13560tagtgtagaa tcagcgcatg gaactccgat tggagcgtga ctttcacgtg ccggaggccc
13620accaccacag cgggcgttac gctctaagaa tctcgcccac ggttttcttc atctcccccc
13680cgccaagtgt ctccctcgtt cgccacttct catcatgtta cagggaccat aaaaatggcg
13740tatttcttca gccccgggta taaatacaca catgatcctg tggtgggttc ttccacaagt
13800tacatctcct tctggttttt gtattgcaag tgtttgtatt ttttgcctcc gagagaaaat
13860cgcggccgca tggagagatc tcaacggcag tctcctccgc caccgtcgcc gtcctcctcc
13920tcgtcctccg tctccgcgga caccgtcctc gtccctcccg gaaagaggcg gagggcggcg
13980acggccaagg ccggcgccga gcctaataag aggatccgca aggaccccgc cgccgccgcc
14040gcggggaaga ggagctccgt ctacagggga gtcaccaggc acaggtggac gggcaggttc
14100gaggcgcatc tctgggacaa gcactgcctc gccgcgctcc acaacaagaa gaaaggcagg
14160caagtctacc tgggggcgta tgacagcgag gaggcagctg ctcgtgccta tgacctcgca
14220gctctcaagt actggggtcc tgagactctg ctcaacttcc ctgtggagga ttactccagc
14280gagatgccgg agatggaggc cgtgtcccgg gaggagtacc tggcctccct ccgccgcagg
14340agcagcggct tctccagggg cgtctccaag tacagaggcg tcgccaggca tcaccacaac
14400gggaggtggg aggcacggat tgggcgagtc tttgggaaca agtacctcta cttgggaaca
14460tttgacactc aagaagaggc agccaaggcc tatgaccttg cggccattga ataccgtggc
14520gtcaatgctg taaccaactt cgacatcagc tgctacctgg accacccgct gttcctggca
14580cagctccaac aggagccaca ggtggtgccg gcactcaacc aagaacctca acctgatcag
14640agcgaaaccg gaactacaga gcaagagccg gagtcaagcg aagccaagac accggatggc
14700agtgcagaac ccgatgagaa cgcggtgcct gacgacaccg cggagcccct caccacagtc
14760gacgacagca tcgaagaggg cttgtggagc ccttgcatgg attacgagct agacaccatg
14820tcgagaccaa actttggcag ctcaatcaat ctgagcgagt ggttcgctga cgcagacttc
14880gactgcaaca tcggatgcct gttcgatggg tgttctgcgg ctgacgaagg aagcaaggat
14940ggtgtaggtc tggcagattt cagtctgttt gaggcaggtg atgtccagct gaaggatgtt
15000ctttcggata tggaagaggg gatacaacct ccagcgatga tcagtgtgtg caacgcggcc
15060gcaagtatga actaaaatgc atgtaggtgt aagagctcat ggagagcatg gaatattgta
15120tccgaccatg taacagtata ataactgagc tccatctcac ttcttctatg aataaacaaa
15180ggatgttatg atatattaac actctatcta tgcaccttat tgttctatga taaatttcct
15240cttattatta taaatcatct gaatcgtgac ggcttatgga atgcttcaaa tagtacaaaa
15300acaaatgtgt actataagac tttctaaaca attctaacct tagcattgtg aacgagacat
15360aagtgttaag aagacataac aattataatg gaagaagttt gtctccattt atatattata
15420tattacccac ttatgtatta tattaggatg ttaaggagac ataacaatta taaagagaga
15480agtttgtatc catttatata ttatatacta cccatttata tattatactt atccacttat
15540ttaatgtctt tataaggttt gatccatgat atttctaata ttttagttga tatgtatatg
15600aaagggtact atttgaactc tcttactctg tataaaggtt ggatcatcct taaagtgggt
15660ctatttaatt ttattgcttc ttacagataa aaaaaaaatt atgagttggt ttgataaaat
15720attgaaggat ttaaaataat aataaataac atataatata tgtatataaa tttattataa
15780tataacattt atctataaaa aagtaaatat tgtcataaat ctatacaatc gtttagcctt
15840gctggacgaa tctcaattat ttaaacgaga gtaaacatat ttgacttttt ggttatttaa
15900caaattatta tttaacacta tatgaaattt ttttttttat cagcaaagaa taaaattaaa
15960ttaagaagga caatggtgtc ccaatcctta tacaaccaac ttccacaaga aagtcaagtc
16020agagacaaca aaaaaacaag caaaggaaat tttttaattt gagttgtctt gtttgctgca
16080taatttatgc agtaaaacac tacacataac ccttttagca gtagagcaat ggttgaccgt
16140gtgcttagct tcttttattt tattttttta tcagcaaaga ataaataaaa taaaatgaga
16200cacttcaggg atgtttcaac aagcttggat cctcgaagag aagggttaat aacacacttt
16260tttaacattt ttaacacaaa ttttagttat ttaaaaattt attaaaaaat ttaaaataag
16320aagaggaact ctttaaataa atctaactta caaaatttat gatttttaat aagttttcac
16380caataaaaaa tgtcataaaa atatgttaaa aagtatatta tcaatattct ctttatgata
16440aataaaaaga aaaaaaaaat aaaagttaag tgaaaatgag attgaagtga ctttaggtgt
16500gtataaatat atcaaccccg ccaacaattt atttaatcca aatatattga agtatattat
16560tccatagcct ttatttattt atatatttat tatataaaag ctttatttgt tctaggttgt
16620tcatgaaata tttttttggt tttatctccg ttgtaagaaa atcatgtgct ttgtgtcgcc
16680actcactatt gcagcttttt catgcattgg tcagattgac ggttgattgt atttttgttt
16740tttatggttt tgtgttatga cttaagtctt catctcttta tctcttcatc aggtttgatg
16800gttacctaat atggtccatg ggtacatgca tggttaaatt aggtggccaa ctttgttgtg
16860aacgatagaa ttttttttat attaagtaaa ctatttttat attatgaaat aataataaaa
16920aaaatatttt atcattatta acaaaatcat attagttaat ttgttaactc tataataaaa
16980gaaatactgt aacattcaca ttacatggta acatctttcc accctttcat ttgttttttg
17040tttgatgact ttttttcttg tttaaattta tttcccttct tttaaatttg gaatacatta
17100tcatcatata taaactaaaa tactaaaaac aggattacac aaatgataaa taataacaca
17160aatatttata aatctagctg caatatattt aaactagcta tatcgatatt gtaaaataaa
17220actagctgca ttgatactga taaaaaaata tcatgtgctt tctggactga tgatgcagta
17280tacttttgac attgccttta ttttattttt cagaaaagct ttcttagttc tgggttcttc
17340attatttgtt tcccatctcc attgtgaatt gaatcatttg cttcgtgtca caaatacaat
17400ttagntaggt acatgcattg gtcagattca cggtttatta tgtcatgact taagttcatg
17460gtagtacatt acctgccacg catgcattat attggttaga tttgataggc aaatttggtt
17520gtcaacaata taaatataaa taatgttttt atattacgaa ataacagtga tcaaaacaaa
17580cagttttatc tttattaaca agattttgtt tttgtttgat gacgtttttt aatgtttacg
17640ctttccccct tcttttgaat ttagaacact ttatcatcat aaaatcaaat actaaaaaaa
17700ttacatattt cataaataat aacacaaata tttttaaaaa atctgaaata ataatgaaca
17760atattacata ttatcacgaa aattcattaa taaaaatatt atataaataa aatgtaatag
17820tagttatatg taggaaaaaa gtactgcacg cataatatat acaaaaagat taaaatgaac
17880tattataaat aataacacta aattaatggt gaatcatatc aaaataatga aaaagtaaat
17940aaaatttgta attaacttct atatgtatta cacacacaaa taataaataa tagtaaaaaa
18000aattatgata aatatttacc atctcataag atatttaaaa taatgataaa aatatagatt
18060attttttatg caactagcta gccaaaaaga gaacacgggt atatataaaa agagtacctt
18120taaattctac tgtacttcct ttattcctga cgtttttata tcaagtggac atacgtgaag
18180attttaatta tcagtctaaa tatttcatta gcacttaata cttttctgtt ttattcctat
18240cctataagta gtcccgattc tcccaacatt gcttattcac acaactaact aagaaagtct
18300tccatagccc cccaagcggc ccatggcctc ctccgaggac gtcatcaagg agttcatgcg
18360cttcaaggtg cgcatggagg gctccgtgaa cggccacgag ttcgagatcg agggcgaggg
18420cgagggccgc ccctacgagg gcacccagac cgccaagctg aaggtgacca agggcggccc
18480cctgcccttc gcctgggaca tcctgtcccc ccagttccag tacggctcca aggtgtacgt
18540gaagcacccc gccgacatcc ccgactacaa gaagctgtcc ttccccgagg gcttcaagtg
18600ggagcgcgtg atgaacttcg aggacggcgg cgtggtgacc gtgacccagg actcctccct
18660gcaggacggc tccttcatct acaaggtgaa gttcatcggc gtgaacttcc cctccgacgg
18720ccccgtaatg cagaagaaga ctatgggctg ggaggcctcc accgagcgcc tgtacccccg
18780cgacggcgtg ctgaagggcg agatccacaa ggccctgaag ctgaaggacg gcggccacta
18840cctggtggag ttcaagtcca tctacatggc caagaagccc gtgcagctgc ccggctacta
18900ctacgtggac tccaagctgg acatcacctc ccacaacgag gactacacca tcgtggagca
18960gtacgagcgc gccgagggcc gccaccacct gttcctgtag cggccggccg cgacacaagt
19020gtgagagtac taaataaatg ctttggttgt acgaaatcat tacactaaat aaaataatca
19080aagcttatat atgccttccg ctaaggccga atgcaaagaa attggttctt tctcgttatc
19140ttttgccact tttactagta cgtattaatt actacttaat catctttgtt tacggctcat
19200tatatccgtc gacgg
19215679565DNAArtificial sequenceVector KS428 67ctagagggcc caattcgccc
tatagtgagt cgtattacaa ttcactggcc gtcgttttac 60aacgtcgtga ctgggaaaac
cctggcgtta cccaacttaa tcgccttgca gcacatcccc 120ctttcgccag ctggcgtaat
agcgaagagg cccgcaccga tcgcccttcc caacagttgc 180gcagcctata cgtacggcag
tttaaggttt acacctataa aagagagagc cgttatcgtc 240tgtttgtgga tgtacagagt
gatattattg acacgccggg gcgacggatg gtgatccccc 300tggccagtgc acgtctgctg
tcagataaag tctcccgtga actttacccg gtggtgcata 360tcggggatga aagctggcgc
atgatgacca ccgatatggc cagtgtgccg gtctccgtta 420tcggggaaga agtggctgat
ctcagccacc gcgaaaatga catcaaaaac gccattaacc 480tgatgttctg gggaatataa
atgtcaggca tgagattatc aaaaaggatc ttcacctaga 540tccttttcac gtagaaagcc
agtccgcaga aacggtgctg accccggatg aatgtcagct 600actgggctat ctggacaagg
gaaaacgcaa gcgcaaagag aaagcaggta gcttgcagtg 660ggcttacatg gcgatagcta
gactgggcgg ttttatggac agcaagcgaa ccggaattgc 720cagctggggc gccctctggt
aaggttggga agccctgcaa agtaaactgg atggctttct 780tgccgccaag gatctgatgg
cgcaggggat caagctctga tcaagagaca ggatgaggat 840cgtttcgcat gattgaacaa
gatggattgc acgcaggttc tccggccgct tgggtggaga 900ggctattcgg ctatgactgg
gcacaacaga caatcggctg ctctgatgcc gccgtgttcc 960ggctgtcagc gcaggggcgc
ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga 1020atgaactgca agacgaggca
gcgcggctat cgtggctggc cacgacgggc gttccttgcg 1080cagctgtgct cgacgttgtc
actgaagcgg gaagggactg gctgctattg ggcgaagtgc 1140cggggcagga tctcctgtca
tctcaccttg ctcctgccga gaaagtatcc atcatggctg 1200atgcaatgcg gcggctgcat
acgcttgatc cggctacctg cccattcgac caccaagcga 1260aacatcgcat cgagcgagca
cgtactcgga tggaagccgg tcttgtcgat caggatgatc 1320tggacgaaga gcatcagggg
ctcgcgccag ccgaactgtt cgccaggctc aaggcgagca 1380tgcccgacgg cgaggatctc
gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg 1440tggaaaatgg ccgcttttct
ggattcatcg actgtggccg gctgggtgtg gcggaccgct 1500atcaggacat agcgttggct
acccgtgata ttgctgaaga gcttggcggc gaatgggctg 1560accgcttcct cgtgctttac
ggtatcgccg ctcccgattc gcagcgcatc gccttctatc 1620gccttcttga cgagttcttc
tgaattatta acgcttacaa tttcctgatg cggtattttc 1680tccttacgca tctgtgcggt
atttcacacc gcatcaggtg gcacttttcg gggaaatgtg 1740cgcggaaccc ctatttgttt
atttttctaa atacattcaa atatgtatcc gctcatgaga 1800caataaccct gataaatgct
tcaataatag cacgtgagga gggccaccat ggccaagttg 1860accagtgccg ttccggtgct
caccgcgcgc gacgtcgccg gagcggtcga gttctggacc 1920gaccggctcg ggttctcccg
ggacttcgtg gaggacgact tcgccggtgt ggtccgggac 1980gacgtgaccc tgttcatcag
cgcggtccag gaccaggtgg tgccggacaa caccctggcc 2040tgggtgtggg tgcgcggcct
ggacgagctg tacgccgagt ggtcggaggt cgtgtccacg 2100aacttccggg acgcctccgg
gccggccatg accgagatcg gcgagcagcc gtgggggcgg 2160gagttcgccc tgcgcgaccc
ggccggcaac tgcgtgcact tcgtggccga ggagcaggac 2220tgacacgtgc taaaacttca
tttttaattt aaaaggatct aggtgaagat cctttttgat 2280aatctcatga ccaaaatccc
ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 2340gaaaagatca aaggatcttc
ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 2400acaaaaaaac caccgctacc
agcggtggtt tgtttgccgg atcaagagct accaactctt 2460tttccgaagg taactggctt
cagcagagcg cagataccaa atactgttct tctagtgtag 2520ccgtagttag gccaccactt
caagaactct gtagcaccgc ctacatacct cgctctgcta 2580atcctgttac cagtggctgc
tgccagtggc gataagtcgt gtcttaccgg gttggactca 2640agacgatagt taccggataa
ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 2700cccagcttgg agcgaacgac
ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 2760agcgccacgc ttcccgaagg
gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 2820acaggagagc gcacgaggga
gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 2880gggtttcgcc acctctgact
tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 2940ctatggaaaa acgccagcaa
cgcggccttt ttacggttcc tggccttttg ctggcctttt 3000gctcacatgt tctttcctgc
gttatcccct gattctgtgg ataaccgtat taccgccttt 3060gagtgagctg ataccgctcg
ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 3120gaagcggaag agcgcccaat
acgcaaaccg cctctccccg cgcgttggcc gattcattaa 3180tgcagctggc acgacaggtt
tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat 3240gtgagttagc tcactcatta
ggcaccccag gctttacact ttatgcttcc ggctcgtatg 3300ttgtgtggaa ttgtgagcgg
ataacaattt cacacaggaa acagctatga ccatgattac 3360gccaagctat ttaggtgacg
cgttagaata ctcaagctat gcatcaagct tggtaccgag 3420ctcggatcca ctagtaacgg
ccgccagtgt gctggaattc aggggcgcgc cctatagatg 3480ggatgaagct gctctcgaca
aatctgataa aactaaagaa ggttagtaat caatttttac 3540aaaatcatag attatttttt
tcattgaatt atttttatgc tataccaaga attgtatttt 3600agtatttgtt ttaactacat
ataatagaat taactacata taaattaact aaacttaaaa 3660taaaaataga tttgtttcct
gaaattattt taagaatata tatgtatata tctaaaatct 3720tagacttaga tagatttttc
tatctatcta ttttggttac ttaaaataaa taaatttgta 3780taaataattg tatagttatc
aaaaattaaa actaattttt ttaaagttgt tgatatataa 3840aatactaaag atttaacgat
taagtattta tttaagtata gaattttgtt ttttttttaa 3900gtttagttat gaagttgtta
attatattaa aacaaaacaa tatttcgaaa ttttattatc 3960atattcgaat atattttttt
tagtgatgat gtatgaatta ttatcataat ttgaaagttt 4020actaaaaaat atatcaacat
gaattgtaat atatgagtta ttaccttaac caaaattata 4080aattaacatt aaatataatt
atatatgtca tatttagcca tacaatgtgt catcaatatt 4140aatagtcatg tcaatattac
ataatgccaa tattatgcta cttaaacccc aaatccccta 4200actcccgtta agtagccaaa
ttcataaata tacttattcg acaaaataaa aaactttaaa 4260atatttacta atccgaccat
gcacaagcat ccattcccta ttccattgcc acgggataac 4320aatgcaaccn actcctcaaa
aaaagaaaaa ttcaagctct tttgcaaaaa aaaataaaat 4380aattttaaca cctaaaattt
tttgtttcca aacttctaca gggaacacac ataaaagaaa 4440aagaggacgt ccactcggat
cacgcaacaa accaaaaggt gtgtcatgac tcctaagata 4500taatatttcc ttattcaaaa
tcataccatt ttaaattatg aatgtatttc gtagtccacc 4560agatatgtaa tccaccagcg
ttcaaaccaa agttttatga ttgtaagttt aagtgaatta 4620taataatata ttcttcacgg
tatcttttca taactaattg agttatcaaa cttgatcgca 4680catgtggctt tgataggtgt
gacttttatg gtatacaatt ctttcaacct aaaaacatta 4740ttgttcctca atatcttaca
ttatgcttga ctgcaacaaa atattttctc atctgttttc 4800ttcctttaaa ccaatttatt
atcatctatt tcctgacatt ttaatccatc cacctatgtc 4860aaaaacttat agaaaatgtc
aacttccaaa caaaacataa ttgaacttcg caaataaatt 4920cttaataata ttaaaaaatg
ttacttaatt atttcttcaa ccccattttc cgcgcgtagc 4980gcggacaaag actctagtta
aatatagaag tttccgattc tcatcgtata aaacggtgac 5040tttggcgggc tttcatgtgt
aacaaattgg tttaacaaac cactgcctag tcgtttagtg 5100tagaatcagc gcatggaact
ccgattggag cgtgactttc acgtgccgga ggcccaccac 5160cacagcgggc gttacgctct
aagaatctcg cccacggttt tcttcatctg ccccccgcca 5220agtgtcttcc tcgttcgcca
cttctcacca agttacagga accctaaaaa tggcctttct 5280tcagccccgg ctataataca
cacatgatcc tatagtgggt tcttccacaa gttacatctc 5340cttctggatt gtacatttca
agtgtttgtg ttttttctgc ctctgagaga aaatcgcggc 5400cgcaagtatg aactaaaatg
catgtaggtg taagagctca tggagagcat ggaatattgt 5460atccgaccat gtaacagtat
aataactgag ctccatctca cttcttctat gaataaacaa 5520aggatgttat gatatattaa
cactctatct atgcacctta ttgttctatg ataaatttcc 5580tcttattatt ataaatcatc
tgaatcgtga cggcttatgg aatgcttcaa atagtacaaa 5640aacaaatgtg tactataaga
ctttctaaac aattctaacc ttagcattgt gaacgagaca 5700taagtgttaa gaagacataa
caattataat ggaagaagtt tgtctccatt tatatattat 5760atattaccca cttatgtatt
atattaggat gttaaggaga cataacaatt ataaagagag 5820aagtttgtat ccatttatat
attatatact acccatttat atattatact tatccactta 5880tttaatgtct ttataaggtt
tgatccatga tatttctaat attttagttg atatgtatat 5940gaaagggtac tatttgaact
ctcttactct gtataaaggt tggatcatcc ttaaagtggg 6000tctatttaat tttattgctt
cttacagata aaaaaaaaat tatgagttgg tttgataaaa 6060tattgaagga tttaaaataa
taataaataa catataatat atgtatataa atttattata 6120atataacatt tatctataaa
aaagtaaata ttgtcataaa tctatacaat cgtttagcct 6180tgctggacga atctcaatta
tttaaacgag agtaaacata tttgactttt tggttattta 6240acaaattatt atttaacact
atatgaaatt ttttttttta tcagcaaaga ataaaattaa 6300attaagaagg acaatggtgt
cccaatcctt atacaaccaa cttccacaag aaagtcaagt 6360cagagacaac aaaaaaacaa
gcaaaggaaa ttttttaatt tgagttgtct tgtttgctgc 6420ataatttatg cagtaaaaca
ctacacataa cccttttagc agtagagcaa tggttgaccg 6480tgtgcttagc ttcttttatt
ttattttttt atcagcaaag aataaataaa ataaaatgag 6540acacttcagg gatgtttcaa
caagcttgga tcctcgaaga gaagggttaa taacacactt 6600ttttaacatt tttaacacaa
attttagtta tttaaaaatt tattaaaaaa tttaaaataa 6660gaagaggaac tctttaaata
aatctaactt acaaaattta tgatttttaa taagttttca 6720ccaataaaaa atgtcataaa
aatatgttaa aaagtatatt atcaatattc tctttatgat 6780aaataaaaag aaaaaaaaaa
taaaagttaa gtgaaaatga gattgaagtg actttaggtg 6840tgtataaata tatcaacccc
gccaacaatt tatttaatcc aaatatattg aagtatatta 6900ttccatagcc tttatttatt
tatatattta ttatataaaa gctttatttg ttctaggttg 6960ttcatgaaat atttttttgg
ttttatctcc gttgtaagaa aatcatgtgc tttgtgtcgc 7020cactcactat tgcagctttt
tcatgcattg gtcagattga cggttgattg tatttttgtt 7080ttttatggtt ttgtgttatg
acttaagtct tcatctcttt atctcttcat caggtttgat 7140ggttacctaa tatggtccat
gggtacatgc atggttaaat taggtggcca actttgttgt 7200gaacgataga atttttttta
tattaagtaa actattttta tattatgaaa taataataaa 7260aaaaatattt tatcattatt
aacaaaatca tattagttaa tttgttaact ctataataaa 7320agaaatactg taacattcac
attacatggt aacatctttc caccctttca tttgtttttt 7380gtttgatgac tttttttctt
gtttaaattt atttcccttc ttttaaattt ggaatacatt 7440atcatcatat ataaactaaa
atactaaaaa caggattaca caaatgataa ataataacac 7500aaatatttat aaatctagct
gcaatatatt taaactagct atatcgatat tgtaaaataa 7560aactagctgc attgatactg
ataaaaaaat atcatgtgct ttctggactg atgatgcagt 7620atacttttga cattgccttt
attttatttt tcagaaaagc tttcttagtt ctgggttctt 7680cattatttgt ttcccatctc
cattgtgaat tgaatcattt gcttcgtgtc acaaatacaa 7740tttagntagg tacatgcatt
ggtcagattc acggtttatt atgtcatgac ttaagttcat 7800ggtagtacat tacctgccac
gcatgcatta tattggttag atttgatagg caaatttggt 7860tgtcaacaat ataaatataa
ataatgtttt tatattacga aataacagtg atcaaaacaa 7920acagttttat ctttattaac
aagattttgt ttttgtttga tgacgttttt taatgtttac 7980gctttccccc ttcttttgaa
tttagaacac tttatcatca taaaatcaaa tactaaaaaa 8040attacatatt tcataaataa
taacacaaat atttttaaaa aatctgaaat aataatgaac 8100aatattacat attatcacga
aaattcatta ataaaaatat tatataaata aaatgtaata 8160gtagttatat gtaggaaaaa
agtactgcac gcataatata tacaaaaaga ttaaaatgaa 8220ctattataaa taataacact
aaattaatgg tgaatcatat caaaataatg aaaaagtaaa 8280taaaatttgt aattaacttc
tatatgtatt acacacacaa ataataaata atagtaaaaa 8340aaattatgat aaatatttac
catctcataa gatatttaaa ataatgataa aaatatagat 8400tattttttat gcaactagct
agccaaaaag agaacacggg tatatataaa aagagtacct 8460ttaaattcta ctgtacttcc
tttattcctg acgtttttat atcaagtgga catacgtgaa 8520gattttaatt atcagtctaa
atatttcatt agcacttaat acttttctgt tttattccta 8580tcctataagt agtcccgatt
ctcccaacat tgcttattca cacaactaac taagaaagtc 8640ttccatagcc ccccaagcgg
cccatggcct cctccgagga cgtcatcaag gagttcatgc 8700gcttcaaggt gcgcatggag
ggctccgtga acggccacga gttcgagatc gagggcgagg 8760gcgagggccg cccctacgag
ggcacccaga ccgccaagct gaaggtgacc aagggcggcc 8820ccctgccctt cgcctgggac
atcctgtccc cccagttcca gtacggctcc aaggtgtacg 8880tgaagcaccc cgccgacatc
cccgactaca agaagctgtc cttccccgag ggcttcaagt 8940gggagcgcgt gatgaacttc
gaggacggcg gcgtggtgac cgtgacccag gactcctccc 9000tgcaggacgg ctccttcatc
tacaaggtga agttcatcgg cgtgaacttc ccctccgacg 9060gccccgtaat gcagaagaag
actatgggct gggaggcctc caccgagcgc ctgtaccccc 9120gcgacggcgt gctgaagggc
gagatccaca aggccctgaa gctgaaggac ggcggccact 9180acctggtgga gttcaagtcc
atctacatgg ccaagaagcc cgtgcagctg cccggctact 9240actacgtgga ctccaagctg
gacatcacct cccacaacga ggactacacc atcgtggagc 9300agtacgagcg cgccgagggc
cgccaccacc tgttcctgta gcggccggcc gcgacacaag 9360tgtgagagta ctaaataaat
gctttggttg tacgaaatca ttacactaaa taaaataatc 9420aaagcttata tatgccttcc
gctaaggccg aatgcaaaga aattggttct ttctcgttat 9480cttttgccac ttttactagt
acgtattaat tactacttaa tcatctttgt ttacggctca 9540ttatatccgt cgacggcgcg
ccgct 95656810947DNAArtificial
sequenceVector KS429 68ggccgcaagt atgaactaaa atgcatgtag gtgtaagagc
tcatggagag catggaatat 60tgtatccgac catgtaacag tataataact gagctccatc
tcacttcttc tatgaataaa 120caaaggatgt tatgatatat taacactcta tctatgcacc
ttattgttct atgataaatt 180tcctcttatt attataaatc atctgaatcg tgacggctta
tggaatgctt caaatagtac 240aaaaacaaat gtgtactata agactttcta aacaattcta
accttagcat tgtgaacgag 300acataagtgt taagaagaca taacaattat aatggaagaa
gtttgtctcc atttatatat 360tatatattac ccacttatgt attatattag gatgttaagg
agacataaca attataaaga 420gagaagtttg tatccattta tatattatat actacccatt
tatatattat acttatccac 480ttatttaatg tctttataag gtttgatcca tgatatttct
aatattttag ttgatatgta 540tatgaaaggg tactatttga actctcttac tctgtataaa
ggttggatca tccttaaagt 600gggtctattt aattttattg cttcttacag ataaaaaaaa
aattatgagt tggtttgata 660aaatattgaa ggatttaaaa taataataaa taacatataa
tatatgtata taaatttatt 720ataatataac atttatctat aaaaaagtaa atattgtcat
aaatctatac aatcgtttag 780ccttgctgga cgaatctcaa ttatttaaac gagagtaaac
atatttgact ttttggttat 840ttaacaaatt attatttaac actatatgaa attttttttt
ttatcagcaa agaataaaat 900taaattaaga aggacaatgg tgtcccaatc cttatacaac
caacttccac aagaaagtca 960agtcagagac aacaaaaaaa caagcaaagg aaatttttta
atttgagttg tcttgtttgc 1020tgcataattt atgcagtaaa acactacaca taaccctttt
agcagtagag caatggttga 1080ccgtgtgctt agcttctttt attttatttt tttatcagca
aagaataaat aaaataaaat 1140gagacacttc agggatgttt caacaagctt ggatcctcga
agagaagggt taataacaca 1200cttttttaac atttttaaca caaattttag ttatttaaaa
atttattaaa aaatttaaaa 1260taagaagagg aactctttaa ataaatctaa cttacaaaat
ttatgatttt taataagttt 1320tcaccaataa aaaatgtcat aaaaatatgt taaaaagtat
attatcaata ttctctttat 1380gataaataaa aagaaaaaaa aaataaaagt taagtgaaaa
tgagattgaa gtgactttag 1440gtgtgtataa atatatcaac cccgccaaca atttatttaa
tccaaatata ttgaagtata 1500ttattccata gcctttattt atttatatat ttattatata
aaagctttat ttgttctagg 1560ttgttcatga aatatttttt tggttttatc tccgttgtaa
gaaaatcatg tgctttgtgt 1620cgccactcac tattgcagct ttttcatgca ttggtcagat
tgacggttga ttgtattttt 1680gttttttatg gttttgtgtt atgacttaag tcttcatctc
tttatctctt catcaggttt 1740gatggttacc taatatggtc catgggtaca tgcatggtta
aattaggtgg ccaactttgt 1800tgtgaacgat agaatttttt ttatattaag taaactattt
ttatattatg aaataataat 1860aaaaaaaata ttttatcatt attaacaaaa tcatattagt
taatttgtta actctataat 1920aaaagaaata ctgtaacatt cacattacat ggtaacatct
ttccaccctt tcatttgttt 1980tttgtttgat gacttttttt cttgtttaaa tttatttccc
ttcttttaaa tttggaatac 2040attatcatca tatataaact aaaatactaa aaacaggatt
acacaaatga taaataataa 2100cacaaatatt tataaatcta gctgcaatat atttaaacta
gctatatcga tattgtaaaa 2160taaaactagc tgcattgata ctgataaaaa aatatcatgt
gctttctgga ctgatgatgc 2220agtatacttt tgacattgcc tttattttat ttttcagaaa
agctttctta gttctgggtt 2280cttcattatt tgtttcccat ctccattgtg aattgaatca
tttgcttcgt gtcacaaata 2340caatttagnt aggtacatgc attggtcaga ttcacggttt
attatgtcat gacttaagtt 2400catggtagta cattacctgc cacgcatgca ttatattggt
tagatttgat aggcaaattt 2460ggttgtcaac aatataaata taaataatgt ttttatatta
cgaaataaca gtgatcaaaa 2520caaacagttt tatctttatt aacaagattt tgtttttgtt
tgatgacgtt ttttaatgtt 2580tacgctttcc cccttctttt gaatttagaa cactttatca
tcataaaatc aaatactaaa 2640aaaattacat atttcataaa taataacaca aatattttta
aaaaatctga aataataatg 2700aacaatatta catattatca cgaaaattca ttaataaaaa
tattatataa ataaaatgta 2760atagtagtta tatgtaggaa aaaagtactg cacgcataat
atatacaaaa agattaaaat 2820gaactattat aaataataac actaaattaa tggtgaatca
tatcaaaata atgaaaaagt 2880aaataaaatt tgtaattaac ttctatatgt attacacaca
caaataataa ataatagtaa 2940aaaaaattat gataaatatt taccatctca taagatattt
aaaataatga taaaaatata 3000gattattttt tatgcaacta gctagccaaa aagagaacac
gggtatatat aaaaagagta 3060cctttaaatt ctactgtact tcctttattc ctgacgtttt
tatatcaagt ggacatacgt 3120gaagatttta attatcagtc taaatatttc attagcactt
aatacttttc tgttttattc 3180ctatcctata agtagtcccg attctcccaa cattgcttat
tcacacaact aactaagaaa 3240gtcttccata gccccccaag cggcccatgg cctcctccga
ggacgtcatc aaggagttca 3300tgcgcttcaa ggtgcgcatg gagggctccg tgaacggcca
cgagttcgag atcgagggcg 3360agggcgaggg ccgcccctac gagggcaccc agaccgccaa
gctgaaggtg accaagggcg 3420gccccctgcc cttcgcctgg gacatcctgt ccccccagtt
ccagtacggc tccaaggtgt 3480acgtgaagca ccccgccgac atccccgact acaagaagct
gtccttcccc gagggcttca 3540agtgggagcg cgtgatgaac ttcgaggacg gcggcgtggt
gaccgtgacc caggactcct 3600ccctgcagga cggctccttc atctacaagg tgaagttcat
cggcgtgaac ttcccctccg 3660acggccccgt aatgcagaag aagactatgg gctgggaggc
ctccaccgag cgcctgtacc 3720cccgcgacgg cgtgctgaag ggcgagatcc acaaggccct
gaagctgaag gacggcggcc 3780actacctggt ggagttcaag tccatctaca tggccaagaa
gcccgtgcag ctgcccggct 3840actactacgt ggactccaag ctggacatca cctcccacaa
cgaggactac accatcgtgg 3900agcagtacga gcgcgccgag ggccgccacc acctgttcct
gtagcggccg gccgcgacac 3960aagtgtgaga gtactaaata aatgctttgg ttgtacgaaa
tcattacact aaataaaata 4020atcaaagctt atatatgcct tccgctaagg ccgaatgcaa
agaaattggt tctttctcgt 4080tatcttttgc cacttttact agtacgtatt aattactact
taatcatctt tgtttacggc 4140tcattatatc cgtcgacggc gcgccgctct agagggccca
attcgcccta tagtgagtcg 4200tattacaatt cactggccgt cgttttacaa cgtcgtgact
gggaaaaccc tggcgttacc 4260caacttaatc gccttgcagc acatccccct ttcgccagct
ggcgtaatag cgaagaggcc 4320cgcaccgatc gcccttccca acagttgcgc agcctatacg
tacggcagtt taaggtttac 4380acctataaaa gagagagccg ttatcgtctg tttgtggatg
tacagagtga tattattgac 4440acgccggggc gacggatggt gatccccctg gccagtgcac
gtctgctgtc agataaagtc 4500tcccgtgaac tttacccggt ggtgcatatc ggggatgaaa
gctggcgcat gatgaccacc 4560gatatggcca gtgtgccggt ctccgttatc ggggaagaag
tggctgatct cagccaccgc 4620gaaaatgaca tcaaaaacgc cattaacctg atgttctggg
gaatataaat gtcaggcatg 4680agattatcaa aaaggatctt cacctagatc cttttcacgt
agaaagccag tccgcagaaa 4740cggtgctgac cccggatgaa tgtcagctac tgggctatct
ggacaaggga aaacgcaagc 4800gcaaagagaa agcaggtagc ttgcagtggg cttacatggc
gatagctaga ctgggcggtt 4860ttatggacag caagcgaacc ggaattgcca gctggggcgc
cctctggtaa ggttgggaag 4920ccctgcaaag taaactggat ggctttcttg ccgccaagga
tctgatggcg caggggatca 4980agctctgatc aagagacagg atgaggatcg tttcgcatga
ttgaacaaga tggattgcac 5040gcaggttctc cggccgcttg ggtggagagg ctattcggct
atgactgggc acaacagaca 5100atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc
aggggcgccc ggttcttttt 5160gtcaagaccg acctgtccgg tgccctgaat gaactgcaag
acgaggcagc gcggctatcg 5220tggctggcca cgacgggcgt tccttgcgca gctgtgctcg
acgttgtcac tgaagcggga 5280agggactggc tgctattggg cgaagtgccg gggcaggatc
tcctgtcatc tcaccttgct 5340cctgccgaga aagtatccat catggctgat gcaatgcggc
ggctgcatac gcttgatccg 5400gctacctgcc cattcgacca ccaagcgaaa catcgcatcg
agcgagcacg tactcggatg 5460gaagccggtc ttgtcgatca ggatgatctg gacgaagagc
atcaggggct cgcgccagcc 5520gaactgttcg ccaggctcaa ggcgagcatg cccgacggcg
aggatctcgt cgtgacccat 5580ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc
gcttttctgg attcatcgac 5640tgtggccggc tgggtgtggc ggaccgctat caggacatag
cgttggctac ccgtgatatt 5700gctgaagagc ttggcggcga atgggctgac cgcttcctcg
tgctttacgg tatcgccgct 5760cccgattcgc agcgcatcgc cttctatcgc cttcttgacg
agttcttctg aattattaac 5820gcttacaatt tcctgatgcg gtattttctc cttacgcatc
tgtgcggtat ttcacaccgc 5880atcaggtggc acttttcggg gaaatgtgcg cggaacccct
atttgtttat ttttctaaat 5940acattcaaat atgtatccgc tcatgagaca ataaccctga
taaatgcttc aataatagca 6000cgtgaggagg gccaccatgg ccaagttgac cagtgccgtt
ccggtgctca ccgcgcgcga 6060cgtcgccgga gcggtcgagt tctggaccga ccggctcggg
ttctcccggg acttcgtgga 6120ggacgacttc gccggtgtgg tccgggacga cgtgaccctg
ttcatcagcg cggtccagga 6180ccaggtggtg ccggacaaca ccctggcctg ggtgtgggtg
cgcggcctgg acgagctgta 6240cgccgagtgg tcggaggtcg tgtccacgaa cttccgggac
gcctccgggc cggccatgac 6300cgagatcggc gagcagccgt gggggcggga gttcgccctg
cgcgacccgg ccggcaactg 6360cgtgcacttc gtggccgagg agcaggactg acacgtgcta
aaacttcatt tttaatttaa 6420aaggatctag gtgaagatcc tttttgataa tctcatgacc
aaaatccctt aacgtgagtt 6480ttcgttccac tgagcgtcag accccgtaga aaagatcaaa
ggatcttctt gagatccttt 6540ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca
ccgctaccag cggtggtttg 6600tttgccggat caagagctac caactctttt tccgaaggta
actggcttca gcagagcgca 6660gataccaaat actgttcttc tagtgtagcc gtagttaggc
caccacttca agaactctgt 6720agcaccgcct acatacctcg ctctgctaat cctgttacca
gtggctgctg ccagtggcga 6780taagtcgtgt cttaccgggt tggactcaag acgatagtta
ccggataagg cgcagcggtc 6840gggctgaacg gggggttcgt gcacacagcc cagcttggag
cgaacgacct acaccgaact 6900gagataccta cagcgtgagc tatgagaaag cgccacgctt
cccgaaggga gaaaggcgga 6960caggtatccg gtaagcggca gggtcggaac aggagagcgc
acgagggagc ttccaggggg 7020aaacgcctgg tatctttata gtcctgtcgg gtttcgccac
ctctgacttg agcgtcgatt 7080tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac
gccagcaacg cggccttttt 7140acggttcctg gccttttgct ggccttttgc tcacatgttc
tttcctgcgt tatcccctga 7200ttctgtggat aaccgtatta ccgcctttga gtgagctgat
accgctcgcc gcagccgaac 7260gaccgagcgc agcgagtcag tgagcgagga agcggaagag
cgcccaatac gcaaaccgcc 7320tctccccgcg cgttggccga ttcattaatg cagctggcac
gacaggtttc ccgactggaa 7380agcgggcagt gagcgcaacg caattaatgt gagttagctc
actcattagg caccccaggc 7440tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt
gtgagcggat aacaatttca 7500cacaggaaac agctatgacc atgattacgc caagctattt
aggtgacgcg ttagaatact 7560caagctatgc atcaagcttg gtaccgagct cggatccact
agtaacggcc gccagtgtgc 7620tggaattcag gggcgcgccc tatagatggg atgaagctgc
tctcgacaaa tctgataaaa 7680ctaaagaagg ttagtaatca atttttacaa aatcatagat
tatttttttc attgaattat 7740ttttatgcta taccaagaat tgtattttag tatttgtttt
aactacatat aatagaatta 7800actacatata aattaactaa acttaaaata aaaatagatt
tgtttcctga aattatttta 7860agaatatata tgtatatatc taaaatctta gacttagata
gatttttcta tctatctatt 7920ttggttactt aaaataaata aatttgtata aataattgta
tagttatcaa aaattaaaac 7980taattttttt aaagttgttg atatataaaa tactaaagat
ttaacgatta agtatttatt 8040taagtataga attttgtttt ttttttaagt ttagttatga
agttgttaat tatattaaaa 8100caaaacaata tttcgaaatt ttattatcat attcgaatat
atttttttta gtgatgatgt 8160atgaattatt atcataattt gaaagtttac taaaaaatat
atcaacatga attgtaatat 8220atgagttatt accttaacca aaattataaa ttaacattaa
atataattat atatgtcata 8280tttagccata caatgtgtca tcaatattaa tagtcatgtc
aatattacat aatgccaata 8340ttatgctact taaaccccaa atcccctaac tcccgttaag
tagccaaatt cataaatata 8400cttattcgac aaaataaaaa actttaaaat atttactaat
ccgaccatgc acaagcatcc 8460attccctatt ccattgccac gggataacaa tgcaaccnac
tcctcaaaaa aagaaaaatt 8520caagctcttt tgcaaaaaaa aataaaataa ttttaacacc
taaaattttt tgtttccaaa 8580cttctacagg gaacacacat aaaagaaaaa gaggacgtcc
actcggatca cgcaacaaac 8640caaaaggtgt gtcatgactc ctaagatata atatttcctt
attcaaaatc ataccatttt 8700aaattatgaa tgtatttcgt agtccaccag atatgtaatc
caccagcgtt caaaccaaag 8760ttttatgatt gtaagtttaa gtgaattata ataatatatt
cttcacggta tcttttcata 8820actaattgag ttatcaaact tgatcgcaca tgtggctttg
ataggtgtga cttttatggt 8880atacaattct ttcaacctaa aaacattatt gttcctcaat
atcttacatt atgcttgact 8940gcaacaaaat attttctcat ctgttttctt cctttaaacc
aatttattat catctatttc 9000ctgacatttt aatccatcca cctatgtcaa aaacttatag
aaaatgtcaa cttccaaaca 9060aaacataatt gaacttcgca aataaattct taataatatt
aaaaaatgtt acttaattat 9120ttcttcaacc ccattttccg cgcgtagcgc ggacaaagac
tctagttaaa tatagaagtt 9180tccgattctc atcgtataaa acggtgactt tggcgggctt
tcatgtgtaa caaattggtt 9240taacaaacca ctgcctagtc gtttagtgta gaatcagcgc
atggaactcc gattggagcg 9300tgactttcac gtgccggagg cccaccacca cagcgggcgt
tacgctctaa gaatctcgcc 9360cacggttttc ttcatctgcc ccccgccaag tgtcttcctc
gttcgccact tctcaccaag 9420ttacaggaac cctaaaaatg gcctttcttc agccccggct
ataatacaca catgatccta 9480tagtgggttc ttccacaagt tacatctcct tctggattgt
acatttcaag tgtttgtgtt 9540ttttctgcct ctgagagaaa atcgcggccg catggctgct
gctcccagtg tgaggacgtt 9600tactcgggcc gaggttttga atgccgaggc tctgaatgag
ggcaagaagg atgccgaggc 9660acccttcttg atgatcatcg acaacaaggt gtacgatgtc
cgcgagttcg tccctgatca 9720tcccggtgga agtgtgattc tcacgcacgt tggcaaggac
ggcactgacg tctttgacac 9780ttttcacccc gaggctgctt gggagactct tgccaacttt
tacgttggtg atattgacga 9840gagcgaccgc gatatcaaga atgatgactt tgcggccgag
gtccgcaagc tgcgtacctt 9900gttccagtct cttggttact acgattcttc caaggcatac
tacgccttca aggtctcgtt 9960caacctctgc atctggggtt tgtcgacggt cattgtggcc
aagtggggcc agacctcgac 10020cctcgccaac gtgctctcgg ctgcgctttt gggtctgttc
tggcagcagt gcggatggtt 10080ggctcacgac tttttgcatc accaggtctt ccaggaccgt
ttctggggtg atcttttcgg 10140cgccttcttg ggaggtgtct gccagggctt ctcgtcctcg
tggtggaagg acaagcacaa 10200cactcaccac gccgccccca acgtccacgg cgaggatccc
gacattgaca cccaccctct 10260gttgacctgg agtgagcatg cgttggagat gttctcggat
gtcccagatg aggagctgac 10320ccgcatgtgg tcgcgtttca tggtcctgaa ccagacctgg
ttttacttcc ccattctctc 10380gtttgcccgt ctctcctggt gcctccagtc cattctcttt
gtgctgccta acggtcaggc 10440ccacaagccc tcgggcgcgc gtgtgcccat ctcgttggtc
gagcagctgt cgcttgcgat 10500gcactggacc tggtacctcg ccaccatgtt cctgttcatc
aaggatcccg tcaacatgct 10560ggtgtacttt ttggtgtcgc aggcggtgtg cggaaacttg
ttggcgatcg tgttctcgct 10620caaccacaac ggtatgcctg tgatctcgaa ggaggaggcg
gtcgatatgg atttcttcac 10680gaagcagatc atcacgggtc gtgatgtcca cccgggtcta
tttgccaact ggttcacggg 10740tggattgaac tatcagatcg agcaccactt gttcccttcg
atgcctcgcc acaacttttc 10800aaagatccag cctgctgtcg agaccctgtg caaaaagtac
aatgtccgat accacaccac 10860cggtatgatc gagggaactg cagaggtctt tagccgtctg
aacgaggtct ccaaggctgc 10920ctccaagatg ggtaaggcgc agtaagc
109476920742DNAArtificial sequenceVector ARALO77
69cgcgcctcga gtgggcggat cccccgggct gcaggaattc actggccgtc gttttacaac
60gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt
120tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca
180gcctgaatgg cgaatggatc gatccatcgc gatgtacctt ttgttagtca gcctctcgat
240tgctcatcgt cattacacag taccgaagtt tgatcgatct agtaacatag atgacaccgc
300gcgcgataat ttatcctagt ttgcgcgcta tattttgttt tctatcgcgt attaaatgta
360taattgcggg actctaatca taaaaaccca tctcataaat aacgtcatgc attacatgtt
420aattattaca tgcttaacgt aattcaacag aaattatatg ataatcatcg caagaccggc
480aacaggattc aatcttaaga aactttattg ccaaatgttt gaacgatctg cttcgacgca
540ctccttcttt actccaccat ctcgtcctta ttgaaaacgt gggtagcacc aaaacgaatc
600aagtcgctgg aactgaagtt accaatcacg ctggatgatt tgccagttgg attaatcttg
660cctttccccg catgaataat attgatgaat gcatgcgtga ggggtagttc gatgttggca
720atagctgcaa ttgccgcgac atcctccaac gagcataatt cttcagaaaa atagcgatgt
780tccatgttgt cagggcatgc atgatgcacg ttatgaggtg acggtgctag gcagtattcc
840ctcaaagttt catagtcagt atcatattca tcattgcatt cctgcaagag agaattgaga
900cgcaatccac acgctgcggc aaccttccgg cgttcgtggt ctatttgctc ttggacgttg
960caaacgtaag tgttggatcg atccggggtg ggcgaagaac tccagcatga gatccccgcg
1020ctggaggatc atccagccgg cgtcccggaa aacgattccg aagcccaacc tttcatagaa
1080ggcggcggtg gaatcgaaat ctcgtgatgg caggttgggc gtcgcttggt cggtcatttc
1140gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc gatgcgctgc
1200gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc gccgccaagc
1260tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc cacacccagc
1320cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt cggcaagcag
1380gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt gagcctggcg
1440aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg atcgacaaga
1500ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gtcgaatggg
1560caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat ggatactttc
1620tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc caatagcagc
1680cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg
1740gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc ggacaggtcg
1800gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc ggcatcagag
1860cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca agcggccgga
1920gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atccccgcaa gcttggagac
1980tggtgatttc agcgtgtcct ctccaaatga aatgaacttc cttatataga ggaagggtct
2040tgcgaaggat agtgggattg tgcgtcatcc cttacgtcag tggagatatc acatcaatcc
2100acttgctttg aagacgtggt tggaacgtct tctttttcca cgatgctcct cgtgggtggg
2160ggtccatctt tgggaccact gtcggcagag gcatcttcaa cgatggcctt tcctttatcg
2220caatgatggc atttgtagga gccaccttcc ttttccacta tcttcacaat aaagtgacag
2280atagctgggc aatggaatcc gaggaggttt ccggatatta ccctttgttg aaaagtctca
2340attgcccttt ggtcttctga gactgtatct ttgatatttt tggagtagac aagcgtgtcg
2400tgctccacca tgttgacgaa gattttcttc ttgtcattga gtcgtaagag actctgtatg
2460aactgttcgc cagtctttac ggcgagttct gttaggtcct ctatttgaat ctttgactcc
2520atggcctttg attcagtggg aactaccttt ttagagactc caatctctat tacttgcctt
2580ggtttgtgaa gcaagccttg aatcgtccat actggaatag tacttctgat cttgagaaat
2640atatctttct ctgtgttctt gatgcagtta gtcctgaatc ttttgactgc atctttaacc
2700ttcttgggaa ggtatttgat ctcctggaga ttattgctcg ggtagatcgt cttgatgaga
2760cctgctgcgt aagcctctct aaccatctgt gggttagcat tctttctgaa attgaaaagg
2820ctaatcttct cattatcagt ggtgaacatg gtatcgtcac cttctccgtc gaacttcctg
2880actagatcgt agagatagag gaagtcgtcc attgtgatct ctggggcaaa ggagatctga
2940attaattcga tatggtggat ttatcacaaa tgggacccgc cgccgacaga ggtgtgatgt
3000taggccagga ctttgaaaat ttgcgcaact atcgtatagt ggccgacaaa ttgacgccga
3060gttgacagac tgcctagcat ttgagtgaat tatgtgaggt aatgggctac actgaattgg
3120tagctcaaac tgtcagtatt tatgtatatg agtgtatatt ttcgcataat ctcagaccaa
3180tctgaagatg aaatgggtat ctgggaatgg cgaaatcaag gcatcgatcg tgaagtttct
3240catctaagcc cccatttgga cgtgaatgta gacacgtcga aataaagatt tccgaattag
3300aataatttgt ttattgcttt cgcctataaa tacgacggat cgtaatttgt cgttttatca
3360aaatgtactt tcattttata ataacgctgc ggacatctac atttttgaat tgaaaaaaaa
3420ttggtaatta ctctttcttt ttctccatat tgaccatcat actcattgct gatccatgta
3480gatttcccgg acatgaagcc atttacaatt gaatatatcc tgccgccgct gccgctttgc
3540acccggtgga gcttgcatgt tggtttctac gcagaactga gccggttagg cagataattt
3600ccattgagaa ctgagccatg tgcaccttcc ccccaacacg gtgagcgacg gggcaacgga
3660gtgatccaca tgggactttt aaacatcatc cgtcggatgg cgttgcgaga gaagcagtcg
3720atccgtgaga tcagccgacg caccgggcag gcgcgcaaca cgatcgcaaa gtatttgaac
3780gcaggtacaa tcgagccgac gttcacgcgg aacgaccaag caagctagct ttaatgcggt
3840agtttatcac agttaaattg ctaacgcagt caggcaccgt gtatgaaatc taacaatgcg
3900ctcatcgtca tcctcggcac cgtcaccctg gatgctgtag gcataggctt ggttatgccg
3960gtactgccgg gcctcttgcg ggatatcgtc cattccgaca gcatcgccag tcactatggc
4020gtgctgctag cgctatatgc gttgatgcaa tttctatgcg cacccgttct cggagcactg
4080tccgaccgct ttggccgccg cccagtcctg ctcgcttcgc tacttggagc cactatcgac
4140tacgcgatca tggcgaccac acccgtcctg tggtccaacc cctccgctgc tatagtgcag
4200tcggcttctg acgttcagtg cagccgtctt ctgaaaacga catgtcgcac aagtcctaag
4260ttacgcgaca ggctgccgcc ctgccctttt cctggcgttt tcttgtcgcg tgttttagtc
4320gcataaagta gaatacttgc gactagaacc ggagacatta cgccatgaac aagagcgccg
4380ccgctggcct gctgggctat gcccgcgtca gcaccgacga ccaggacttg accaaccaac
4440gggccgaact gcacgcggcc ggctgcacca agctgttttc cgagaagatc accggcacca
4500ggcgcgaccg cccggagctg gccaggatgc ttgaccacct acgccctggc gacgttgtga
4560cagtgaccag gctagaccgc ctggcccgca gcacccgcga cctactggac attgccgagc
4620gcatccagga ggccggcgcg ggcctgcgta gcctggcaga gccgtgggcc gacaccacca
4680cgccggccgg ccgcatggtg ttgaccgtgt tcgccggcat tgccgagttc gagcgttccc
4740taatcatcga ccgcacccgg agcgggcgcg aggccgccaa ggcccgaggc gtgaagtttg
4800gcccccgccc taccctcacc ccggcacaga tcgcgcacgc ccgcgagctg atcgaccagg
4860aaggccgcac cgtgaaagag gcggctgcac tgcttggcgt gcatcgctcg accctgtacc
4920gcgcacttga gcgcagcgag gaagtgacgc ccaccgaggc caggcggcgc ggtgccttcc
4980gtgaggacgc attgaccgag gccgacgccc tggcggccgc cgagaatgaa cgccaagagg
5040aacaagcatg aaaccgcacc aggacggcca ggacgaaccg tttttcatta ccgaagagat
5100cgaggcggag atgatcgcgg ccgggtacgt gttcgagccg cccgcgcacg tctcaaccgt
5160gcggctgcat gaaatcctgg ccggtttgtc tgatgccaag ctggcggcct ggccggccag
5220cttggccgct gaagaaaccg agcgccgccg tctaaaaagg tgatgtgtat ttgagtaaaa
5280cagcttgcgt catgcggtcg ctgcgtatat gatgcgatga gtaaataaac aaatacgcaa
5340gggaacgcat gaagttatcg ctgtacttaa ccagaaaggc gggtcaggca agacgaccat
5400cgcaacccat ctagcccgcg ccctgcaact cgccggggcc gatgttctgt tagtcgattc
5460cgatccccag ggcagtgccc gcgattgggc ggccgtgcgg gaagatcaac cgctaaccgt
5520tgtcggcatc gaccgcccga cgattgaccg cgacgtgaag gccatcggcc ggcgcgactt
5580cgtagtgatc gacggagcgc cccaggcggc ggacttggct gtgtccgcga tcaaggcagc
5640cgacttcgtg ctgattccgg tgcagccaag cccttacgac atatgggcca ccgccgacct
5700ggtggagctg gttaagcagc gcattgaggt cacggatgga aggctacaag cggcctttgt
5760cgtgtcgcgg gcgatcaaag gcacgcgcat cggcggtgag gttgccgagg cgctggccgg
5820gtacgagctg cccattcttg agtcccgtat cacgcagcgc gtgagctacc caggcactgc
5880cgccgccggc acaaccgttc ttgaatcaga acccgagggc gacgctgccc gcgaggtcca
5940ggcgctggcc gctgaaatta aatcaaaact catttgagtt aatgaggtaa agagaaaatg
6000agcaaaagca caaacacgct aagtgccggc cgtccgagcg cacgcagcag caaggctgca
6060acgttggcca gcctggcaga cacgccagcc atgaagcggg tcaactttca gttgccggcg
6120gaggatcaca ccaagctgaa gatgtacgcg gtacgccaag gcaagaccat taccgagctg
6180ctatctgaat acatcgcgca gctaccagag taaatgagca aatgaataaa tgagtagatg
6240aattttagcg gctaaaggag gcggcatgga aaatcaagaa caaccaggca ccgacgccgt
6300ggaatgcccc atgtgtggag gaacgggcgg ttggccaggc gtaagcggct gggttgtctg
6360ccggccctgc aatggcactg gaacccccaa gcccgaggaa tcggcgtgag cggtcgcaaa
6420ccatccggcc cggtacaaat cggcgcggcg ctgggtgatg acctggtgga gaagttgaag
6480gccgcgcagg ccgcccagcg gcaacgcatc gaggcagaag cacgccccgg tgaatcgtgg
6540caagcggccg ctgatcgaat ccgcaaagaa tcccggcaac cgccggcagc cggtgcgccg
6600tcgattagga agccgcccaa gggcgacgag caaccagatt ttttcgttcc gatgctctat
6660gacgtgggca cccgcgatag tcgcagcatc atggacgtgg ccgttttccg tctgtcgaag
6720cgtgaccgac gagctggcga ggtgatccgc tacgagcttc cagacgggca cgtagaggtt
6780tccgcagggc cggccggcat ggccagtgtg tgggattacg acctggtact gatggcggtt
6840tcccatctaa ccgaatccat gaaccgatac cgggaaggga agggagacaa gcccggccgc
6900gtgttccgtc cacacgttgc ggacgtactc aagttctgcc ggcgagccga tggcggaaag
6960cagaaagacg acctggtaga aacctgcatt cggttaaaca ccacgcacgt tgccatgcag
7020cgtacgaaga aggccaagaa cggccgcctg gtgacggtat ccgagggtga agccttgatt
7080agccgctaca agatcgtaaa gagcgaaacc gggcggccgg agtacatcga gatcgagcta
7140gctgattgga tgtaccgcga gatcacagaa ggcaagaacc cggacgtgct gacggttcac
7200cccgattact ttttgatcga tcccggcatc ggccgttttc tctaccgcct ggcacgccgc
7260gccgcaggca aggcagaagc cagatggttg ttcaagacga tctacgaacg cagtggcagc
7320gccggagagt tcaagaagtt ctgtttcacc gtgcgcaagc tgatcgggtc aaatgacctg
7380ccggagtacg atttgaagga ggaggcgggg caggctggcc cgatcctagt catgcgctac
7440cgcaacctga tcgagggcga agcatccgcc ggttcctaat gtacggagca gatgctaggg
7500caaattgccc tagcagggga aaaaggtcga aaaggtctct ttcctgtgga tagcacgtac
7560attgggaacc caaagccgta cattgggaac cggaacccgt acattgggaa cccaaagccg
7620tacattggga accggtcaca catgtaagtg actgatataa aagagaaaaa aggcgatttt
7680tccgcctaaa actctttaaa acttattaaa actcttaaaa cccgcctggc ctgtgcataa
7740ctgtctggcc agcgcacagc cgaagagctg caaaaagcgc ctacccttcg gtcgctgcgc
7800tccctacgcc ccgccgcttc gcgtcggcct atcgcggccg ctggccgctc aaaaatggct
7860ggcctacggc caggcaatct accagggcgc ggacaagccg cgccgtcgcc actcgaccgc
7920cggcgcccac atcaaggcac cctgcctcgc gcgtttcggt gatgacggtg aaaacctctg
7980acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca
8040agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc
8100acgtagcgat agcggagtgt atactggctt aactatgcgg catcagagca gattgtactg
8160agagtgcacc atatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc
8220aggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga
8280gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca
8340ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg
8400ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt
8460cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc
8520ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct
8580tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc
8640gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta
8700tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca
8760gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag
8820tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag
8880ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt
8940agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa
9000gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg
9060attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga
9120agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta
9180atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc
9240cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg
9300ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga
9360agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt
9420tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt
9480gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc
9540caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt tagctccttc
9600ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca
9660gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag
9720tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg
9780tcaacacggg ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa
9840gacctgcagg gggggggggg cgctgaggtc tgcctcgtga agaaggtgtt gctgactcat
9900accaggcctg aatcgcccca tcatccagcc agaaagtgag ggagccacgg ttgatgagag
9960ctttgttgta ggtggaccag ttggtgattt tgaacttttg ctttgccacg gaacggtctg
10020cgttgtcggg aagatgcgtg atctgatcct tcaactcagc aaaagttcga tttattcaac
10080aaagccgccg tcccgtcaag tcagcgtaat gctctgccag tgttacaacc aattaaccaa
10140ttctgattag aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt
10200atcaatacca tatttttgaa aaagccgttt ctgtaatgaa ggagaaaact caccgaggca
10260gttccatagg atggcaagat cctggtatcg gtctgcgatt ccgactcgtc caacatcaat
10320acaacctatt aatttcccct cgtcaaaaat aaggttatca agtgagaaat caccatgagt
10380gacgactgaa tccggtgaga atggcaaaag cttatgcatt tctttccaga cttgttcaac
10440aggccagcca ttacgctcgt catcaaaatc actcgcatca accaaaccgt tattcattcg
10500tgattgcgcc tgagcgagac gaaatacgcg atcgctgtta aaaggacaat tacaaacagg
10560aatcgaatgc aaccggcgca ggaacactgc cagcgcatca acaatatttt cacctgaatc
10620aggatattct tctaatacct ggaatgctgt tttcccgggg atcgcagtgg tgagtaacca
10680tgcatcatca ggagtacgga taaaatgctt gatggtcgga agaggcataa attccgtcag
10740ccagtttagt ctgaccatct catctgtaac atcattggca acgctacctt tgccatgttt
10800cagaaacaac tctggcgcat cgggcttccc atacaatcga tagattgtcg cacctgattg
10860cccgacatta tcgcgagccc atttataccc atataaatca gcatccatgt tggaatttaa
10920tcgcggcctc gagcaagacg tttcccgttg aatatggctc ataacacccc ttgtattact
10980gtttatgtaa gcagacagtt ttattgttca tgatgatata tttttatctt gtgcaatgta
11040acatcagaga ttttgagaca caacgtggct ttcccccccc cccctgcagg tcaattcggt
11100cgatatggct attacgaaga aggctcgtgc gcggagtccc gtgaactttc ccacgcaaca
11160agtgaaccgc accgggtttg ccggaggcca tttcgttaaa atgcgcagcc atggctgctt
11220cgtccagcat ggcgtaatac tgatcctcgt cttcggctgg cggtatattg ccgatgggct
11280tcaaaagccg ccgtggttga accagtctat ccattccaag gtagcgaact cgaccgcttc
11340gaagctcctc catggtccac gccgatgaat gacctcggcc ttgtaaagac cgttgatcgc
11400ttctgcgagg gcgttgtcgt gctgtcgccg acgcttccga tagatggctc gatacctgct
11460tctgccaacc gctcggaata gcgaaaggac acgtattgaa caccgcgatc cgagtgatgc
11520actaggccgc catgagcggg acgccgatca tgatgagcct cctcgagggc atcgaggaca
11580aagcctgcat gtgctgtccg gctcgcccgc catccgacaa tgcgacgggc gaagacgtcg
11640atcacgaagg ccacgtagac gaagccctcc caagtggcga cataagtacg gacatgcgca
11700aaggctttcc cggtttgtcg ctgatggtgc aagagacgct gaagcgcgat ccgatgcgca
11760ggcatctgtt cgtcttccgc ggtcgtggcg gtggcctgat caaggtcact cgccgaagag
11820ctgcatgatt ggctcgaaac cgagcggggg aaattgtcgc gcagttctcc cgtcgccgag
11880gcgataaatt acatgctcaa gcgatgggat ggcattacgt cattcctcga tgacggcccg
11940atttgcctga cgaacaatgc tgccgaacga acgctcagag gctatgtact cggcaggaag
12000tcatggctgt ttgccggatc ggatcgttgt gctgaacgtg cggcgttcat ggcgacactg
12060atcatgagcg ccaagctcaa taacatcgat ccgcaggcct ggcttgccga cgtccgcgcc
12120gaccttgcgg acgctccgat cagcaggctt gagcaacagc tgccgtggaa ctggacatcc
12180aagacactga gtgctcaggc ggcctgacct gcggccttca ccggatactt accccattat
12240cgcagattgc gatgaagcat cagcgtcatt cagcaatctt gccaaagtat gcaggctcgc
12300gagaatcgac gtgcgaaacc ggctggttgc gccaaagatc cgcttgcgga gcggtcgaac
12360attcatgctg ggacttcaag aggtcgagta gaggaagaac cggaaaggtt gcaccggaaa
12420atatgcgttc ctttggagag cgcctcatgg acgtgaacaa atcgcccgga ccaaggatgc
12480cacggataca aaagctcgcg aagctcggtc ccgtgggtgt tctgtcgtct cgttgtacaa
12540cgaaatccat tcccattccg cgctcaagat ggcttcccct cggcagttca tcagggctaa
12600atcaatctag ccgacttgtc cggtgaaatg ggctgcactc caacagaaac aatcaaacaa
12660acatacacag cgacttattc acacgagctc aaattacaac ggtatatatc ctgccagtca
12720gcatcatcac accaaaagtt aggcccgaat agtttgaaat tagaaagctc gcaattgagg
12780tctacaggcc aaattcgctc ttagccgtac aatattactc accggtgcga tgccccccat
12840cgtaggtgaa ggtggaaatt aatgatccat cttgagacca caggcccaca acagctacca
12900gtttcctcaa gggtccacca aaaacgtaag cgcttacgta catggtcgat aagaaaaggc
12960aatttgtaga tgttaacatc caacgtcgct ttcagggatc gatccaatac gcaaaccgcc
13020tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa
13080agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc
13140tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca
13200cacaggaaac agctatgacc atgattacgc caagcttgca tgcctgcagg tcgactctag
13260aggatctggc gcgccctata gatgggatga agctgctctc gacaaatctg ataaaactaa
13320agaaggttag taatcaattt ttacaaaatc atagattatt tttttcattg aattattttt
13380atgctatacc aagaattgta ttttagtatt tgttttaact acatataata gaattaacta
13440catataaatt aactaaactt aaaataaaaa tagatttgtt tcctgaaatt attttaagaa
13500tatatatgta tatatctaaa atcttagact tagatagatt tttctatcta tctattttgg
13560ttacttaaaa taaataaatt tgtataaata attgtatagt tatcaaaaat taaaactaat
13620ttttttaaag ttgttgatat ataaaatact aaagatttaa cgattaagta tttatttaag
13680tatagaattt tgtttttttt ttaagtttag ttatgaagtt gttaattata ttaaaacaaa
13740acaatatttc gaaattttat tatcatattc gaatatattt tttttagtga tgatgtatga
13800attattatca taatttgaaa gtttactaaa aaatatatca acatgaattg taatatatga
13860gttattacct taaccaaaat tataaattaa cattaaatat aattatatat gtcatattta
13920gccatacaat gtgtcatcaa tattaatagt catgtcaata ttacataatg ccaatattat
13980gctacttaaa ccccaaatcc cctaactccc gttaagtagc caaattcata aatatactta
14040ttcgacaaaa taaaaaactt taaaatattt actaatccga ccatgcacaa gcatccattc
14100cctattccat tgccacggga taacaatgca accnactcct caaaaaaaga aaaattcaag
14160ctcttttgca aaaaaaaata aaataatttt aacacctaaa attttttgtt tccaaacttc
14220tacagggaac acacataaaa gaaaaagagg acgtccactc ggatcacgca acaaaccaaa
14280aggtgtgtca tgactcctaa gatataatat ttccttattc aaaatcatac cattttaaat
14340tatgaatgta tttcgtagtc caccagatat gtaatccacc agcgttcaaa ccaaagtttt
14400atgattgtaa gtttaagtga attataataa tatattcttc acggtatctt ttcataacta
14460attgagttat caaacttgat cgcacatgtg gctttgatag gtgtgacttt tatggtatac
14520aattctttca acctaaaaac attattgttc ctcaatatct tacattatgc ttgactgcaa
14580caaaatattt tctcatctgt tttcttcctt taaaccaatt tattatcatc tatttcctga
14640cattttaatc catccaccta tgtcaaaaac ttatagaaaa tgtcaacttc caaacaaaac
14700ataattgaac ttcgcaaata aattcttaat aatattaaaa aatgttactt aattatttct
14760tcaaccccat tttccgcgcg tagcgcggac aaagactcta gttaaatata gaagtttccg
14820attctcatcg tataaaacgg tgactttggc gggctttcat gtgtaacaaa ttggtttaac
14880aaaccactgc ctagtcgttt agtgtagaat cagcgcatgg aactccgatt ggagcgtgac
14940tttcacgtgc cggaggccca ccaccacagc gggcgttacg ctctaagaat ctcgcccacg
15000gttttcttca tctgcccccc gccaagtgtc ttcctcgttc gccacttctc accaagttac
15060aggaacccta aaaatggcct ttcttcagcc ccggctataa tacacacatg atcctatagt
15120gggttcttcc acaagttaca tctccttctg gattgtacat ttcaagtgtt tgtgtttttt
15180ctgcctctga gagaaaatcg cggccgcatg gctgctgctc ccagtgtgag gacgtttact
15240cgggccgagg ttttgaatgc cgaggctctg aatgagggca agaaggatgc cgaggcaccc
15300ttcttgatga tcatcgacaa caaggtgtac gatgtccgcg agttcgtccc tgatcatccc
15360ggtggaagtg tgattctcac gcacgttggc aaggacggca ctgacgtctt tgacactttt
15420caccccgagg ctgcttggga gactcttgcc aacttttacg ttggtgatat tgacgagagc
15480gaccgcgata tcaagaatga tgactttgcg gccgaggtcc gcaagctgcg taccttgttc
15540cagtctcttg gttactacga ttcttccaag gcatactacg ccttcaaggt ctcgttcaac
15600ctctgcatct ggggtttgtc gacggtcatt gtggccaagt ggggccagac ctcgaccctc
15660gccaacgtgc tctcggctgc gcttttgggt ctgttctggc agcagtgcgg atggttggct
15720cacgactttt tgcatcacca ggtcttccag gaccgtttct ggggtgatct tttcggcgcc
15780ttcttgggag gtgtctgcca gggcttctcg tcctcgtggt ggaaggacaa gcacaacact
15840caccacgccg cccccaacgt ccacggcgag gatcccgaca ttgacaccca ccctctgttg
15900acctggagtg agcatgcgtt ggagatgttc tcggatgtcc cagatgagga gctgacccgc
15960atgtggtcgc gtttcatggt cctgaaccag acctggtttt acttccccat tctctcgttt
16020gcccgtctct cctggtgcct ccagtccatt ctctttgtgc tgcctaacgg tcaggcccac
16080aagccctcgg gcgcgcgtgt gcccatctcg ttggtcgagc agctgtcgct tgcgatgcac
16140tggacctggt acctcgccac catgttcctg ttcatcaagg atcccgtcaa catgctggtg
16200tactttttgg tgtcgcaggc ggtgtgcgga aacttgttgg cgatcgtgtt ctcgctcaac
16260cacaacggta tgcctgtgat ctcgaaggag gaggcggtcg atatggattt cttcacgaag
16320cagatcatca cgggtcgtga tgtccacccg ggtctatttg ccaactggtt cacgggtgga
16380ttgaactatc agatcgagca ccacttgttc ccttcgatgc ctcgccacaa cttttcaaag
16440atccagcctg ctgtcgagac cctgtgcaaa aagtacaatg tccgatacca caccaccggt
16500atgatcgagg gaactgcaga ggtctttagc cgtctgaacg aggtctccaa ggctgcctcc
16560aagatgggta aggcgcagta agcggccgca agtatgaact aaaatgcatg taggtgtaag
16620agctcatgga gagcatggaa tattgtatcc gaccatgtaa cagtataata actgagctcc
16680atctcacttc ttctatgaat aaacaaagga tgttatgata tattaacact ctatctatgc
16740accttattgt tctatgataa atttcctctt attattataa atcatctgaa tcgtgacggc
16800ttatggaatg cttcaaatag tacaaaaaca aatgtgtact ataagacttt ctaaacaatt
16860ctaaccttag cattgtgaac gagacataag tgttaagaag acataacaat tataatggaa
16920gaagtttgtc tccatttata tattatatat tacccactta tgtattatat taggatgtta
16980aggagacata acaattataa agagagaagt ttgtatccat ttatatatta tatactaccc
17040atttatatat tatacttatc cacttattta atgtctttat aaggtttgat ccatgatatt
17100tctaatattt tagttgatat gtatatgaaa gggtactatt tgaactctct tactctgtat
17160aaaggttgga tcatccttaa agtgggtcta tttaatttta ttgcttctta cagataaaaa
17220aaaaattatg agttggtttg ataaaatatt gaaggattta aaataataat aaataacata
17280taatatatgt atataaattt attataatat aacatttatc tataaaaaag taaatattgt
17340cataaatcta tacaatcgtt tagccttgct ggacgaatct caattattta aacgagagta
17400aacatatttg actttttggt tatttaacaa attattattt aacactatat gaaatttttt
17460tttttatcag caaagaataa aattaaatta agaaggacaa tggtgtccca atccttatac
17520aaccaacttc cacaagaaag tcaagtcaga gacaacaaaa aaacaagcaa aggaaatttt
17580ttaatttgag ttgtcttgtt tgctgcataa tttatgcagt aaaacactac acataaccct
17640tttagcagta gagcaatggt tgaccgtgtg cttagcttct tttattttat ttttttatca
17700gcaaagaata aataaaataa aatgagacac ttcagggatg tttcaacaag cttggatcct
17760cgaagagaag ggttaataac acactttttt aacattttta acacaaattt tagttattta
17820aaaatttatt aaaaaattta aaataagaag aggaactctt taaataaatc taacttacaa
17880aatttatgat ttttaataag ttttcaccaa taaaaaatgt cataaaaata tgttaaaaag
17940tatattatca atattctctt tatgataaat aaaaagaaaa aaaaaataaa agttaagtga
18000aaatgagatt gaagtgactt taggtgtgta taaatatatc aaccccgcca acaatttatt
18060taatccaaat atattgaagt atattattcc atagccttta tttatttata tatttattat
18120ataaaagctt tatttgttct aggttgttca tgaaatattt ttttggtttt atctccgttg
18180taagaaaatc atgtgctttg tgtcgccact cactattgca gctttttcat gcattggtca
18240gattgacggt tgattgtatt tttgtttttt atggttttgt gttatgactt aagtcttcat
18300ctctttatct cttcatcagg tttgatggtt acctaatatg gtccatgggt acatgcatgg
18360ttaaattagg tggccaactt tgttgtgaac gatagaattt tttttatatt aagtaaacta
18420tttttatatt atgaaataat aataaaaaaa atattttatc attattaaca aaatcatatt
18480agttaatttg ttaactctat aataaaagaa atactgtaac attcacatta catggtaaca
18540tctttccacc ctttcatttg ttttttgttt gatgactttt tttcttgttt aaatttattt
18600cccttctttt aaatttggaa tacattatca tcatatataa actaaaatac taaaaacagg
18660attacacaaa tgataaataa taacacaaat atttataaat ctagctgcaa tatatttaaa
18720ctagctatat cgatattgta aaataaaact agctgcattg atactgataa aaaaatatca
18780tgtgctttct ggactgatga tgcagtatac ttttgacatt gcctttattt tatttttcag
18840aaaagctttc ttagttctgg gttcttcatt atttgtttcc catctccatt gtgaattgaa
18900tcatttgctt cgtgtcacaa atacaattta gntaggtaca tgcattggtc agattcacgg
18960tttattatgt catgacttaa gttcatggta gtacattacc tgccacgcat gcattatatt
19020ggttagattt gataggcaaa tttggttgtc aacaatataa atataaataa tgtttttata
19080ttacgaaata acagtgatca aaacaaacag ttttatcttt attaacaaga ttttgttttt
19140gtttgatgac gttttttaat gtttacgctt tcccccttct tttgaattta gaacacttta
19200tcatcataaa atcaaatact aaaaaaatta catatttcat aaataataac acaaatattt
19260ttaaaaaatc tgaaataata atgaacaata ttacatatta tcacgaaaat tcattaataa
19320aaatattata taaataaaat gtaatagtag ttatatgtag gaaaaaagta ctgcacgcat
19380aatatataca aaaagattaa aatgaactat tataaataat aacactaaat taatggtgaa
19440tcatatcaaa ataatgaaaa agtaaataaa atttgtaatt aacttctata tgtattacac
19500acacaaataa taaataatag taaaaaaaat tatgataaat atttaccatc tcataagata
19560tttaaaataa tgataaaaat atagattatt ttttatgcaa ctagctagcc aaaaagagaa
19620cacgggtata tataaaaaga gtacctttaa attctactgt acttccttta ttcctgacgt
19680ttttatatca agtggacata cgtgaagatt ttaattatca gtctaaatat ttcattagca
19740cttaatactt ttctgtttta ttcctatcct ataagtagtc ccgattctcc caacattgct
19800tattcacaca actaactaag aaagtcttcc atagcccccc aagcggccca tggcctcctc
19860cgaggacgtc atcaaggagt tcatgcgctt caaggtgcgc atggagggct ccgtgaacgg
19920ccacgagttc gagatcgagg gcgagggcga gggccgcccc tacgagggca cccagaccgc
19980caagctgaag gtgaccaagg gcggccccct gcccttcgcc tgggacatcc tgtcccccca
20040gttccagtac ggctccaagg tgtacgtgaa gcaccccgcc gacatccccg actacaagaa
20100gctgtccttc cccgagggct tcaagtggga gcgcgtgatg aacttcgagg acggcggcgt
20160ggtgaccgtg acccaggact cctccctgca ggacggctcc ttcatctaca aggtgaagtt
20220catcggcgtg aacttcccct ccgacggccc cgtaatgcag aagaagacta tgggctggga
20280ggcctccacc gagcgcctgt acccccgcga cggcgtgctg aagggcgaga tccacaaggc
20340cctgaagctg aaggacggcg gccactacct ggtggagttc aagtccatct acatggccaa
20400gaagcccgtg cagctgcccg gctactacta cgtggactcc aagctggaca tcacctccca
20460caacgaggac tacaccatcg tggagcagta cgagcgcgcc gagggccgcc accacctgtt
20520cctgtagcgg ccggccgcga cacaagtgtg agagtactaa ataaatgctt tggttgtacg
20580aaatcattac actaaataaa ataatcaaag cttatatatg ccttccgcta aggccgaatg
20640caaagaaatt ggttctttct cgttatcttt tgccactttt actagtacgt attaattact
20700acttaatcat ctttgtttac ggctcattat atccgtcgac gg
207427010758DNAArtificial sequenceVector KS431 70ggccgcaagt atgaactaaa
atgcatgtag gtgtaagagc tcatggagag catggaatat 60tgtatccgac catgtaacag
tataataact gagctccatc tcacttcttc tatgaataaa 120caaaggatgt tatgatatat
taacactcta tctatgcacc ttattgttct atgataaatt 180tcctcttatt attataaatc
atctgaatcg tgacggctta tggaatgctt caaatagtac 240aaaaacaaat gtgtactata
agactttcta aacaattcta accttagcat tgtgaacgag 300acataagtgt taagaagaca
taacaattat aatggaagaa gtttgtctcc atttatatat 360tatatattac ccacttatgt
attatattag gatgttaagg agacataaca attataaaga 420gagaagtttg tatccattta
tatattatat actacccatt tatatattat acttatccac 480ttatttaatg tctttataag
gtttgatcca tgatatttct aatattttag ttgatatgta 540tatgaaaggg tactatttga
actctcttac tctgtataaa ggttggatca tccttaaagt 600gggtctattt aattttattg
cttcttacag ataaaaaaaa aattatgagt tggtttgata 660aaatattgaa ggatttaaaa
taataataaa taacatataa tatatgtata taaatttatt 720ataatataac atttatctat
aaaaaagtaa atattgtcat aaatctatac aatcgtttag 780ccttgctgga cgaatctcaa
ttatttaaac gagagtaaac atatttgact ttttggttat 840ttaacaaatt attatttaac
actatatgaa attttttttt ttatcagcaa agaataaaat 900taaattaaga aggacaatgg
tgtcccaatc cttatacaac caacttccac aagaaagtca 960agtcagagac aacaaaaaaa
caagcaaagg aaatttttta atttgagttg tcttgtttgc 1020tgcataattt atgcagtaaa
acactacaca taaccctttt agcagtagag caatggttga 1080ccgtgtgctt agcttctttt
attttatttt tttatcagca aagaataaat aaaataaaat 1140gagacacttc agggatgttt
caacaagctt ggatcctcga agagaagggt taataacaca 1200cttttttaac atttttaaca
caaattttag ttatttaaaa atttattaaa aaatttaaaa 1260taagaagagg aactctttaa
ataaatctaa cttacaaaat ttatgatttt taataagttt 1320tcaccaataa aaaatgtcat
aaaaatatgt taaaaagtat attatcaata ttctctttat 1380gataaataaa aagaaaaaaa
aaataaaagt taagtgaaaa tgagattgaa gtgactttag 1440gtgtgtataa atatatcaac
cccgccaaca atttatttaa tccaaatata ttgaagtata 1500ttattccata gcctttattt
atttatatat ttattatata aaagctttat ttgttctagg 1560ttgttcatga aatatttttt
tggttttatc tccgttgtaa gaaaatcatg tgctttgtgt 1620cgccactcac tattgcagct
ttttcatgca ttggtcagat tgacggttga ttgtattttt 1680gttttttatg gttttgtgtt
atgacttaag tcttcatctc tttatctctt catcaggttt 1740gatggttacc taatatggtc
catgggtaca tgcatggtta aattaggtgg ccaactttgt 1800tgtgaacgat agaatttttt
ttatattaag taaactattt ttatattatg aaataataat 1860aaaaaaaata ttttatcatt
attaacaaaa tcatattagt taatttgtta actctataat 1920aaaagaaata ctgtaacatt
cacattacat ggtaacatct ttccaccctt tcatttgttt 1980tttgtttgat gacttttttt
cttgtttaaa tttatttccc ttcttttaaa tttggaatac 2040attatcatca tatataaact
aaaatactaa aaacaggatt acacaaatga taaataataa 2100cacaaatatt tataaatcta
gctgcaatat atttaaacta gctatatcga tattgtaaaa 2160taaaactagc tgcattgata
ctgataaaaa aatatcatgt gctttctgga ctgatgatgc 2220agtatacttt tgacattgcc
tttattttat ttttcagaaa agctttctta gttctgggtt 2280cttcattatt tgtttcccat
ctccattgtg aattgaatca tttgcttcgt gtcacaaata 2340caatttagnt aggtacatgc
attggtcaga ttcacggttt attatgtcat gacttaagtt 2400catggtagta cattacctgc
cacgcatgca ttatattggt tagatttgat aggcaaattt 2460ggttgtcaac aatataaata
taaataatgt ttttatatta cgaaataaca gtgatcaaaa 2520caaacagttt tatctttatt
aacaagattt tgtttttgtt tgatgacgtt ttttaatgtt 2580tacgctttcc cccttctttt
gaatttagaa cactttatca tcataaaatc aaatactaaa 2640aaaattacat atttcataaa
taataacaca aatattttta aaaaatctga aataataatg 2700aacaatatta catattatca
cgaaaattca ttaataaaaa tattatataa ataaaatgta 2760atagtagtta tatgtaggaa
aaaagtactg cacgcataat atatacaaaa agattaaaat 2820gaactattat aaataataac
actaaattaa tggtgaatca tatcaaaata atgaaaaagt 2880aaataaaatt tgtaattaac
ttctatatgt attacacaca caaataataa ataatagtaa 2940aaaaaattat gataaatatt
taccatctca taagatattt aaaataatga taaaaatata 3000gattattttt tatgcaacta
gctagccaaa aagagaacac gggtatatat aaaaagagta 3060cctttaaatt ctactgtact
tcctttattc ctgacgtttt tatatcaagt ggacatacgt 3120gaagatttta attatcagtc
taaatatttc attagcactt aatacttttc tgttttattc 3180ctatcctata agtagtcccg
attctcccaa cattgcttat tcacacaact aactaagaaa 3240gtcttccata gccccccaag
cggcccatgg cctcctccga ggacgtcatc aaggagttca 3300tgcgcttcaa ggtgcgcatg
gagggctccg tgaacggcca cgagttcgag atcgagggcg 3360agggcgaggg ccgcccctac
gagggcaccc agaccgccaa gctgaaggtg accaagggcg 3420gccccctgcc cttcgcctgg
gacatcctgt ccccccagtt ccagtacggc tccaaggtgt 3480acgtgaagca ccccgccgac
atccccgact acaagaagct gtccttcccc gagggcttca 3540agtgggagcg cgtgatgaac
ttcgaggacg gcggcgtggt gaccgtgacc caggactcct 3600ccctgcagga cggctccttc
atctacaagg tgaagttcat cggcgtgaac ttcccctccg 3660acggccccgt aatgcagaag
aagactatgg gctgggaggc ctccaccgag cgcctgtacc 3720cccgcgacgg cgtgctgaag
ggcgagatcc acaaggccct gaagctgaag gacggcggcc 3780actacctggt ggagttcaag
tccatctaca tggccaagaa gcccgtgcag ctgcccggct 3840actactacgt ggactccaag
ctggacatca cctcccacaa cgaggactac accatcgtgg 3900agcagtacga gcgcgccgag
ggccgccacc acctgttcct gtagcggccg gccgcgacac 3960aagtgtgaga gtactaaata
aatgctttgg ttgtacgaaa tcattacact aaataaaata 4020atcaaagctt atatatgcct
tccgctaagg ccgaatgcaa agaaattggt tctttctcgt 4080tatcttttgc cacttttact
agtacgtatt aattactact taatcatctt tgtttacggc 4140tcattatatc cgtcgacggc
gcgccgctct agagggccca attcgcccta tagtgagtcg 4200tattacaatt cactggccgt
cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc 4260caacttaatc gccttgcagc
acatccccct ttcgccagct ggcgtaatag cgaagaggcc 4320cgcaccgatc gcccttccca
acagttgcgc agcctatacg tacggcagtt taaggtttac 4380acctataaaa gagagagccg
ttatcgtctg tttgtggatg tacagagtga tattattgac 4440acgccggggc gacggatggt
gatccccctg gccagtgcac gtctgctgtc agataaagtc 4500tcccgtgaac tttacccggt
ggtgcatatc ggggatgaaa gctggcgcat gatgaccacc 4560gatatggcca gtgtgccggt
ctccgttatc ggggaagaag tggctgatct cagccaccgc 4620gaaaatgaca tcaaaaacgc
cattaacctg atgttctggg gaatataaat gtcaggcatg 4680agattatcaa aaaggatctt
cacctagatc cttttcacgt agaaagccag tccgcagaaa 4740cggtgctgac cccggatgaa
tgtcagctac tgggctatct ggacaaggga aaacgcaagc 4800gcaaagagaa agcaggtagc
ttgcagtggg cttacatggc gatagctaga ctgggcggtt 4860ttatggacag caagcgaacc
ggaattgcca gctggggcgc cctctggtaa ggttgggaag 4920ccctgcaaag taaactggat
ggctttcttg ccgccaagga tctgatggcg caggggatca 4980agctctgatc aagagacagg
atgaggatcg tttcgcatga ttgaacaaga tggattgcac 5040gcaggttctc cggccgcttg
ggtggagagg ctattcggct atgactgggc acaacagaca 5100atcggctgct ctgatgccgc
cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt 5160gtcaagaccg acctgtccgg
tgccctgaat gaactgcaag acgaggcagc gcggctatcg 5220tggctggcca cgacgggcgt
tccttgcgca gctgtgctcg acgttgtcac tgaagcggga 5280agggactggc tgctattggg
cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct 5340cctgccgaga aagtatccat
catggctgat gcaatgcggc ggctgcatac gcttgatccg 5400gctacctgcc cattcgacca
ccaagcgaaa catcgcatcg agcgagcacg tactcggatg 5460gaagccggtc ttgtcgatca
ggatgatctg gacgaagagc atcaggggct cgcgccagcc 5520gaactgttcg ccaggctcaa
ggcgagcatg cccgacggcg aggatctcgt cgtgacccat 5580ggcgatgcct gcttgccgaa
tatcatggtg gaaaatggcc gcttttctgg attcatcgac 5640tgtggccggc tgggtgtggc
ggaccgctat caggacatag cgttggctac ccgtgatatt 5700gctgaagagc ttggcggcga
atgggctgac cgcttcctcg tgctttacgg tatcgccgct 5760cccgattcgc agcgcatcgc
cttctatcgc cttcttgacg agttcttctg aattattaac 5820gcttacaatt tcctgatgcg
gtattttctc cttacgcatc tgtgcggtat ttcacaccgc 5880atcaggtggc acttttcggg
gaaatgtgcg cggaacccct atttgtttat ttttctaaat 5940acattcaaat atgtatccgc
tcatgagaca ataaccctga taaatgcttc aataatagca 6000cgtgaggagg gccaccatgg
ccaagttgac cagtgccgtt ccggtgctca ccgcgcgcga 6060cgtcgccgga gcggtcgagt
tctggaccga ccggctcggg ttctcccggg acttcgtgga 6120ggacgacttc gccggtgtgg
tccgggacga cgtgaccctg ttcatcagcg cggtccagga 6180ccaggtggtg ccggacaaca
ccctggcctg ggtgtgggtg cgcggcctgg acgagctgta 6240cgccgagtgg tcggaggtcg
tgtccacgaa cttccgggac gcctccgggc cggccatgac 6300cgagatcggc gagcagccgt
gggggcggga gttcgccctg cgcgacccgg ccggcaactg 6360cgtgcacttc gtggccgagg
agcaggactg acacgtgcta aaacttcatt tttaatttaa 6420aaggatctag gtgaagatcc
tttttgataa tctcatgacc aaaatccctt aacgtgagtt 6480ttcgttccac tgagcgtcag
accccgtaga aaagatcaaa ggatcttctt gagatccttt 6540ttttctgcgc gtaatctgct
gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 6600tttgccggat caagagctac
caactctttt tccgaaggta actggcttca gcagagcgca 6660gataccaaat actgttcttc
tagtgtagcc gtagttaggc caccacttca agaactctgt 6720agcaccgcct acatacctcg
ctctgctaat cctgttacca gtggctgctg ccagtggcga 6780taagtcgtgt cttaccgggt
tggactcaag acgatagtta ccggataagg cgcagcggtc 6840gggctgaacg gggggttcgt
gcacacagcc cagcttggag cgaacgacct acaccgaact 6900gagataccta cagcgtgagc
tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 6960caggtatccg gtaagcggca
gggtcggaac aggagagcgc acgagggagc ttccaggggg 7020aaacgcctgg tatctttata
gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 7080tttgtgatgc tcgtcagggg
ggcggagcct atggaaaaac gccagcaacg cggccttttt 7140acggttcctg gccttttgct
ggccttttgc tcacatgttc tttcctgcgt tatcccctga 7200ttctgtggat aaccgtatta
ccgcctttga gtgagctgat accgctcgcc gcagccgaac 7260gaccgagcgc agcgagtcag
tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 7320tctccccgcg cgttggccga
ttcattaatg cagctggcac gacaggtttc ccgactggaa 7380agcgggcagt gagcgcaacg
caattaatgt gagttagctc actcattagg caccccaggc 7440tttacacttt atgcttccgg
ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 7500cacaggaaac agctatgacc
atgattacgc caagctattt aggtgacgcg ttagaatact 7560caagctatgc atcaagcttg
gtaccgagct cggatccact agtaacggcc gccagtgtgc 7620tggaattcag gggcgcgccc
tatagatggg atgaagctgc tctcgacaaa tctgataaaa 7680ctaaagaagg ttagtaatca
atttttacaa aatcatagat tatttttttc attgaattat 7740ttttatgcta taccaagaat
tgtattttag tatttgtttt aactacatat aatagaatta 7800actacatata aattaactaa
acttaaaata aaaatagatt tgtttcctga aattatttta 7860agaatatata tgtatatatc
taaaatctta gacttagata gatttttcta tctatctatt 7920ttggttactt aaaataaata
aatttgtata aataattgta tagttatcaa aaattaaaac 7980taattttttt aaagttgttg
atatataaaa tactaaagat ttaacgatta agtatttatt 8040taagtataga attttgtttt
ttttttaagt ttagttatga agttgttaat tatattaaaa 8100caaaacaata tttcgaaatt
ttattatcat attcgaatat atttttttta gtgatgatgt 8160atgaattatt atcataattt
gaaagtttac taaaaaatat atcaacatga attgtaatat 8220atgagttatt accttaacca
aaattataaa ttaacattaa atataattat atatgtcata 8280tttagccata caatgtgtca
tcaatattaa tagtcatgtc aatattacat aatgccaata 8340ttatgctact taaaccccaa
atcccctaac tcccgttaag tagccaaatt cataaatata 8400cttattcgac aaaataaaaa
actttaaaat atttactaat ccgaccatgc acaagcatcc 8460attccctatt ccattgccac
gggataacaa tgcaaccnac tcctcaaaaa aagaaaaatt 8520caagctcttt tgcaaaaaaa
aataaaataa ttttaacacc taaaattttt tgtttccaaa 8580cttctacagg gaacacacat
aaaagaaaaa gaggacgtcc actcggatca cgcaacaaac 8640caaaaggtgt gtcatgactc
ctaagatata atatttcctt attcaaaatc ataccatttt 8700aaattatgaa tgtatttcgt
agtccaccag atatgtaatc caccagcgtt caaaccaaag 8760ttttatgatt gtaagtttaa
gtgaattata ataatatatt cttcacggta tcttttcata 8820actaattgag ttatcaaact
tgatcgcaca tgtggctttg ataggtgtga cttttatggt 8880atacaattct ttcaacctaa
aaacattatt gttcctcaat atcttacatt atgcttgact 8940gcaacaaaat attttctcat
ctgttttctt cctttaaacc aatttattat catctatttc 9000ctgacatttt aatccatcca
cctatgtcaa aaacttatag aaaatgtcaa cttccaaaca 9060aaacataatt gaacttcgca
aataaattct taataatatt aaaaaatgtt acttaattat 9120ttcttcaacc ccattttccg
cgcgtagcgc ggacaaagac tctagttaaa tatagaagtt 9180tccgattctc atcgtataaa
acggtgactt tggcgggctt tcatgtgtaa caaattggtt 9240taacaaacca ctgcctagtc
gtttagtgta gaatcagcgc atggaactcc gattggagcg 9300tgactttcac gtgccggagg
cccaccacca cagcgggcgt tacgctctaa gaatctcgcc 9360cacggttttc ttcatctgcc
ccccgccaag tgtcttcctc gttcgccact tctcaccaag 9420ttacaggaac cctaaaaatg
gcctttcttc agccccggct ataatacaca catgatccta 9480tagtgggttc ttccacaagt
tacatctcct tctggattgt acatttcaag tgtttgtgtt 9540ttttctgcct ctgagagaaa
atcgcggccg catggagaga tctcaacggc agtctcctcc 9600gccaccgtcg ccgtcctcct
cctcgtcctc cgtctccgcg gacaccgtcc tcgtccctcc 9660cggaaagagg cggagggcgg
cgacggccaa ggccggcgcc gagcctaata agaggatccg 9720caaggacccc gccgccgccg
ccgcggggaa gaggagctcc gtctacaggg gagtcaccag 9780gcacaggtgg acgggcaggt
tcgaggcgca tctctgggac aagcactgcc tcgccgcgct 9840ccacaacaag aagaaaggca
ggcaagtcta cctgggggcg tatgacagcg aggaggcagc 9900tgctcgtgcc tatgacctcg
cagctctcaa gtactggggt cctgagactc tgctcaactt 9960ccctgtggag gattactcca
gcgagatgcc ggagatggag gccgtgtccc gggaggagta 10020cctggcctcc ctccgccgca
ggagcagcgg cttctccagg ggcgtctcca agtacagagg 10080cgtcgccagg catcaccaca
acgggaggtg ggaggcacgg attgggcgag tctttgggaa 10140caagtacctc tacttgggaa
catttgacac tcaagaagag gcagccaagg cctatgacct 10200tgcggccatt gaataccgtg
gcgtcaatgc tgtaaccaac ttcgacatca gctgctacct 10260ggaccacccg ctgttcctgg
cacagctcca acaggagcca caggtggtgc cggcactcaa 10320ccaagaacct caacctgatc
agagcgaaac cggaactaca gagcaagagc cggagtcaag 10380cgaagccaag acaccggatg
gcagtgcaga acccgatgag aacgcggtgc ctgacgacac 10440cgcggagccc ctcaccacag
tcgacgacag catcgaagag ggcttgtgga gcccttgcat 10500ggattacgag ctagacacca
tgtcgagacc aaactttggc agctcaatca atctgagcga 10560gtggttcgct gacgcagact
tcgactgcaa catcggatgc ctgttcgatg ggtgttctgc 10620ggctgacgaa ggaagcaagg
atggtgtagg tctggcagat ttcagtctgt ttgaggcagg 10680tgatgtccag ctgaaggatg
ttctttcgga tatggaagag gggatacaac ctccagcgat 10740gatcagtgtg tgcaacgc
107587120553DNAArtificial
sequenceVector ARALO79 71cgcgcctcga gtgggcggat cccccgggct gcaggaattc
actggccgtc gttttacaac 60gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg
ccttgcagca catccccctt 120tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg
cccttcccaa cagttgcgca 180gcctgaatgg cgaatggatc gatccatcgc gatgtacctt
ttgttagtca gcctctcgat 240tgctcatcgt cattacacag taccgaagtt tgatcgatct
agtaacatag atgacaccgc 300gcgcgataat ttatcctagt ttgcgcgcta tattttgttt
tctatcgcgt attaaatgta 360taattgcggg actctaatca taaaaaccca tctcataaat
aacgtcatgc attacatgtt 420aattattaca tgcttaacgt aattcaacag aaattatatg
ataatcatcg caagaccggc 480aacaggattc aatcttaaga aactttattg ccaaatgttt
gaacgatctg cttcgacgca 540ctccttcttt actccaccat ctcgtcctta ttgaaaacgt
gggtagcacc aaaacgaatc 600aagtcgctgg aactgaagtt accaatcacg ctggatgatt
tgccagttgg attaatcttg 660cctttccccg catgaataat attgatgaat gcatgcgtga
ggggtagttc gatgttggca 720atagctgcaa ttgccgcgac atcctccaac gagcataatt
cttcagaaaa atagcgatgt 780tccatgttgt cagggcatgc atgatgcacg ttatgaggtg
acggtgctag gcagtattcc 840ctcaaagttt catagtcagt atcatattca tcattgcatt
cctgcaagag agaattgaga 900cgcaatccac acgctgcggc aaccttccgg cgttcgtggt
ctatttgctc ttggacgttg 960caaacgtaag tgttggatcg atccggggtg ggcgaagaac
tccagcatga gatccccgcg 1020ctggaggatc atccagccgg cgtcccggaa aacgattccg
aagcccaacc tttcatagaa 1080ggcggcggtg gaatcgaaat ctcgtgatgg caggttgggc
gtcgcttggt cggtcatttc 1140gaaccccaga gtcccgctca gaagaactcg tcaagaaggc
gatagaaggc gatgcgctgc 1200gaatcgggag cggcgatacc gtaaagcacg aggaagcggt
cagcccattc gccgccaagc 1260tcttcagcaa tatcacgggt agccaacgct atgtcctgat
agcggtccgc cacacccagc 1320cggccacagt cgatgaatcc agaaaagcgg ccattttcca
ccatgatatt cggcaagcag 1380gcatcgccat gggtcacgac gagatcctcg ccgtcgggca
tgcgcgcctt gagcctggcg 1440aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca
gatcatcctg atcgacaaga 1500ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt
tcgcttggtg gtcgaatggg 1560caggtagccg gatcaagcgt atgcagccgc cgcattgcat
cagccatgat ggatactttc 1620tcggcaggag caaggtgaga tgacaggaga tcctgccccg
gcacttcgcc caatagcagc 1680cagtcccttc ccgcttcagt gacaacgtcg agcacagctg
cgcaaggaac gcccgtcgtg 1740gccagccacg atagccgcgc tgcctcgtcc tgcagttcat
tcagggcacc ggacaggtcg 1800gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc
ggaacacggc ggcatcagag 1860cagccgattg tctgttgtgc ccagtcatag ccgaatagcc
tctccaccca agcggccgga 1920gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg
atccccgcaa gcttggagac 1980tggtgatttc agcgtgtcct ctccaaatga aatgaacttc
cttatataga ggaagggtct 2040tgcgaaggat agtgggattg tgcgtcatcc cttacgtcag
tggagatatc acatcaatcc 2100acttgctttg aagacgtggt tggaacgtct tctttttcca
cgatgctcct cgtgggtggg 2160ggtccatctt tgggaccact gtcggcagag gcatcttcaa
cgatggcctt tcctttatcg 2220caatgatggc atttgtagga gccaccttcc ttttccacta
tcttcacaat aaagtgacag 2280atagctgggc aatggaatcc gaggaggttt ccggatatta
ccctttgttg aaaagtctca 2340attgcccttt ggtcttctga gactgtatct ttgatatttt
tggagtagac aagcgtgtcg 2400tgctccacca tgttgacgaa gattttcttc ttgtcattga
gtcgtaagag actctgtatg 2460aactgttcgc cagtctttac ggcgagttct gttaggtcct
ctatttgaat ctttgactcc 2520atggcctttg attcagtggg aactaccttt ttagagactc
caatctctat tacttgcctt 2580ggtttgtgaa gcaagccttg aatcgtccat actggaatag
tacttctgat cttgagaaat 2640atatctttct ctgtgttctt gatgcagtta gtcctgaatc
ttttgactgc atctttaacc 2700ttcttgggaa ggtatttgat ctcctggaga ttattgctcg
ggtagatcgt cttgatgaga 2760cctgctgcgt aagcctctct aaccatctgt gggttagcat
tctttctgaa attgaaaagg 2820ctaatcttct cattatcagt ggtgaacatg gtatcgtcac
cttctccgtc gaacttcctg 2880actagatcgt agagatagag gaagtcgtcc attgtgatct
ctggggcaaa ggagatctga 2940attaattcga tatggtggat ttatcacaaa tgggacccgc
cgccgacaga ggtgtgatgt 3000taggccagga ctttgaaaat ttgcgcaact atcgtatagt
ggccgacaaa ttgacgccga 3060gttgacagac tgcctagcat ttgagtgaat tatgtgaggt
aatgggctac actgaattgg 3120tagctcaaac tgtcagtatt tatgtatatg agtgtatatt
ttcgcataat ctcagaccaa 3180tctgaagatg aaatgggtat ctgggaatgg cgaaatcaag
gcatcgatcg tgaagtttct 3240catctaagcc cccatttgga cgtgaatgta gacacgtcga
aataaagatt tccgaattag 3300aataatttgt ttattgcttt cgcctataaa tacgacggat
cgtaatttgt cgttttatca 3360aaatgtactt tcattttata ataacgctgc ggacatctac
atttttgaat tgaaaaaaaa 3420ttggtaatta ctctttcttt ttctccatat tgaccatcat
actcattgct gatccatgta 3480gatttcccgg acatgaagcc atttacaatt gaatatatcc
tgccgccgct gccgctttgc 3540acccggtgga gcttgcatgt tggtttctac gcagaactga
gccggttagg cagataattt 3600ccattgagaa ctgagccatg tgcaccttcc ccccaacacg
gtgagcgacg gggcaacgga 3660gtgatccaca tgggactttt aaacatcatc cgtcggatgg
cgttgcgaga gaagcagtcg 3720atccgtgaga tcagccgacg caccgggcag gcgcgcaaca
cgatcgcaaa gtatttgaac 3780gcaggtacaa tcgagccgac gttcacgcgg aacgaccaag
caagctagct ttaatgcggt 3840agtttatcac agttaaattg ctaacgcagt caggcaccgt
gtatgaaatc taacaatgcg 3900ctcatcgtca tcctcggcac cgtcaccctg gatgctgtag
gcataggctt ggttatgccg 3960gtactgccgg gcctcttgcg ggatatcgtc cattccgaca
gcatcgccag tcactatggc 4020gtgctgctag cgctatatgc gttgatgcaa tttctatgcg
cacccgttct cggagcactg 4080tccgaccgct ttggccgccg cccagtcctg ctcgcttcgc
tacttggagc cactatcgac 4140tacgcgatca tggcgaccac acccgtcctg tggtccaacc
cctccgctgc tatagtgcag 4200tcggcttctg acgttcagtg cagccgtctt ctgaaaacga
catgtcgcac aagtcctaag 4260ttacgcgaca ggctgccgcc ctgccctttt cctggcgttt
tcttgtcgcg tgttttagtc 4320gcataaagta gaatacttgc gactagaacc ggagacatta
cgccatgaac aagagcgccg 4380ccgctggcct gctgggctat gcccgcgtca gcaccgacga
ccaggacttg accaaccaac 4440gggccgaact gcacgcggcc ggctgcacca agctgttttc
cgagaagatc accggcacca 4500ggcgcgaccg cccggagctg gccaggatgc ttgaccacct
acgccctggc gacgttgtga 4560cagtgaccag gctagaccgc ctggcccgca gcacccgcga
cctactggac attgccgagc 4620gcatccagga ggccggcgcg ggcctgcgta gcctggcaga
gccgtgggcc gacaccacca 4680cgccggccgg ccgcatggtg ttgaccgtgt tcgccggcat
tgccgagttc gagcgttccc 4740taatcatcga ccgcacccgg agcgggcgcg aggccgccaa
ggcccgaggc gtgaagtttg 4800gcccccgccc taccctcacc ccggcacaga tcgcgcacgc
ccgcgagctg atcgaccagg 4860aaggccgcac cgtgaaagag gcggctgcac tgcttggcgt
gcatcgctcg accctgtacc 4920gcgcacttga gcgcagcgag gaagtgacgc ccaccgaggc
caggcggcgc ggtgccttcc 4980gtgaggacgc attgaccgag gccgacgccc tggcggccgc
cgagaatgaa cgccaagagg 5040aacaagcatg aaaccgcacc aggacggcca ggacgaaccg
tttttcatta ccgaagagat 5100cgaggcggag atgatcgcgg ccgggtacgt gttcgagccg
cccgcgcacg tctcaaccgt 5160gcggctgcat gaaatcctgg ccggtttgtc tgatgccaag
ctggcggcct ggccggccag 5220cttggccgct gaagaaaccg agcgccgccg tctaaaaagg
tgatgtgtat ttgagtaaaa 5280cagcttgcgt catgcggtcg ctgcgtatat gatgcgatga
gtaaataaac aaatacgcaa 5340gggaacgcat gaagttatcg ctgtacttaa ccagaaaggc
gggtcaggca agacgaccat 5400cgcaacccat ctagcccgcg ccctgcaact cgccggggcc
gatgttctgt tagtcgattc 5460cgatccccag ggcagtgccc gcgattgggc ggccgtgcgg
gaagatcaac cgctaaccgt 5520tgtcggcatc gaccgcccga cgattgaccg cgacgtgaag
gccatcggcc ggcgcgactt 5580cgtagtgatc gacggagcgc cccaggcggc ggacttggct
gtgtccgcga tcaaggcagc 5640cgacttcgtg ctgattccgg tgcagccaag cccttacgac
atatgggcca ccgccgacct 5700ggtggagctg gttaagcagc gcattgaggt cacggatgga
aggctacaag cggcctttgt 5760cgtgtcgcgg gcgatcaaag gcacgcgcat cggcggtgag
gttgccgagg cgctggccgg 5820gtacgagctg cccattcttg agtcccgtat cacgcagcgc
gtgagctacc caggcactgc 5880cgccgccggc acaaccgttc ttgaatcaga acccgagggc
gacgctgccc gcgaggtcca 5940ggcgctggcc gctgaaatta aatcaaaact catttgagtt
aatgaggtaa agagaaaatg 6000agcaaaagca caaacacgct aagtgccggc cgtccgagcg
cacgcagcag caaggctgca 6060acgttggcca gcctggcaga cacgccagcc atgaagcggg
tcaactttca gttgccggcg 6120gaggatcaca ccaagctgaa gatgtacgcg gtacgccaag
gcaagaccat taccgagctg 6180ctatctgaat acatcgcgca gctaccagag taaatgagca
aatgaataaa tgagtagatg 6240aattttagcg gctaaaggag gcggcatgga aaatcaagaa
caaccaggca ccgacgccgt 6300ggaatgcccc atgtgtggag gaacgggcgg ttggccaggc
gtaagcggct gggttgtctg 6360ccggccctgc aatggcactg gaacccccaa gcccgaggaa
tcggcgtgag cggtcgcaaa 6420ccatccggcc cggtacaaat cggcgcggcg ctgggtgatg
acctggtgga gaagttgaag 6480gccgcgcagg ccgcccagcg gcaacgcatc gaggcagaag
cacgccccgg tgaatcgtgg 6540caagcggccg ctgatcgaat ccgcaaagaa tcccggcaac
cgccggcagc cggtgcgccg 6600tcgattagga agccgcccaa gggcgacgag caaccagatt
ttttcgttcc gatgctctat 6660gacgtgggca cccgcgatag tcgcagcatc atggacgtgg
ccgttttccg tctgtcgaag 6720cgtgaccgac gagctggcga ggtgatccgc tacgagcttc
cagacgggca cgtagaggtt 6780tccgcagggc cggccggcat ggccagtgtg tgggattacg
acctggtact gatggcggtt 6840tcccatctaa ccgaatccat gaaccgatac cgggaaggga
agggagacaa gcccggccgc 6900gtgttccgtc cacacgttgc ggacgtactc aagttctgcc
ggcgagccga tggcggaaag 6960cagaaagacg acctggtaga aacctgcatt cggttaaaca
ccacgcacgt tgccatgcag 7020cgtacgaaga aggccaagaa cggccgcctg gtgacggtat
ccgagggtga agccttgatt 7080agccgctaca agatcgtaaa gagcgaaacc gggcggccgg
agtacatcga gatcgagcta 7140gctgattgga tgtaccgcga gatcacagaa ggcaagaacc
cggacgtgct gacggttcac 7200cccgattact ttttgatcga tcccggcatc ggccgttttc
tctaccgcct ggcacgccgc 7260gccgcaggca aggcagaagc cagatggttg ttcaagacga
tctacgaacg cagtggcagc 7320gccggagagt tcaagaagtt ctgtttcacc gtgcgcaagc
tgatcgggtc aaatgacctg 7380ccggagtacg atttgaagga ggaggcgggg caggctggcc
cgatcctagt catgcgctac 7440cgcaacctga tcgagggcga agcatccgcc ggttcctaat
gtacggagca gatgctaggg 7500caaattgccc tagcagggga aaaaggtcga aaaggtctct
ttcctgtgga tagcacgtac 7560attgggaacc caaagccgta cattgggaac cggaacccgt
acattgggaa cccaaagccg 7620tacattggga accggtcaca catgtaagtg actgatataa
aagagaaaaa aggcgatttt 7680tccgcctaaa actctttaaa acttattaaa actcttaaaa
cccgcctggc ctgtgcataa 7740ctgtctggcc agcgcacagc cgaagagctg caaaaagcgc
ctacccttcg gtcgctgcgc 7800tccctacgcc ccgccgcttc gcgtcggcct atcgcggccg
ctggccgctc aaaaatggct 7860ggcctacggc caggcaatct accagggcgc ggacaagccg
cgccgtcgcc actcgaccgc 7920cggcgcccac atcaaggcac cctgcctcgc gcgtttcggt
gatgacggtg aaaacctctg 7980acacatgcag ctcccggaga cggtcacagc ttgtctgtaa
gcggatgccg ggagcagaca 8040agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg
ggcgcagcca tgacccagtc 8100acgtagcgat agcggagtgt atactggctt aactatgcgg
catcagagca gattgtactg 8160agagtgcacc atatgcggtg tgaaataccg cacagatgcg
taaggagaaa ataccgcatc 8220aggcgctctt ccgcttcctc gctcactgac tcgctgcgct
cggtcgttcg gctgcggcga 8280gcggtatcag ctcactcaaa ggcggtaata cggttatcca
cagaatcagg ggataacgca 8340ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga
accgtaaaaa ggccgcgttg 8400ctggcgtttt tccataggct ccgcccccct gacgagcatc
acaaaaatcg acgctcaagt 8460cagaggtggc gaaacccgac aggactataa agataccagg
cgtttccccc tggaagctcc 8520ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat
acctgtccgc ctttctccct 8580tcgggaagcg tggcgctttc tcatagctca cgctgtaggt
atctcagttc ggtgtaggtc 8640gttcgctcca agctgggctg tgtgcacgaa ccccccgttc
agcccgaccg ctgcgcctta 8700tccggtaact atcgtcttga gtccaacccg gtaagacacg
acttatcgcc actggcagca 8760gccactggta acaggattag cagagcgagg tatgtaggcg
gtgctacaga gttcttgaag 8820tggtggccta actacggcta cactagaagg acagtatttg
gtatctgcgc tctgctgaag 8880ccagttacct tcggaaaaag agttggtagc tcttgatccg
gcaaacaaac caccgctggt 8940agcggtggtt tttttgtttg caagcagcag attacgcgca
gaaaaaaagg atctcaagaa 9000gatcctttga tcttttctac ggggtctgac gctcagtgga
acgaaaactc acgttaaggg 9060attttggtca tgagattatc aaaaaggatc ttcacctaga
tccttttaaa ttaaaaatga 9120agttttaaat caatctaaag tatatatgag taaacttggt
ctgacagtta ccaatgctta 9180atcagtgagg cacctatctc agcgatctgt ctatttcgtt
catccatagt tgcctgactc 9240cccgtcgtgt agataactac gatacgggag ggcttaccat
ctggccccag tgctgcaatg 9300ataccgcgag acccacgctc accggctcca gatttatcag
caataaacca gccagccgga 9360agggccgagc gcagaagtgg tcctgcaact ttatccgcct
ccatccagtc tattaattgt 9420tgccgggaag ctagagtaag tagttcgcca gttaatagtt
tgcgcaacgt tgttgccatt 9480gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg
cttcattcag ctccggttcc 9540caacgatcaa ggcgagttac atgatccccc atgttgtgca
aaaaagcggt tagctccttc 9600ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt
tatcactcat ggttatggca 9660gcactgcata attctcttac tgtcatgcca tccgtaagat
gcttttctgt gactggtgag 9720tactcaacca agtcattctg agaatagtgt atgcggcgac
cgagttgctc ttgcccggcg 9780tcaacacggg ataataccgc gccacatagc agaactttaa
aagtgctcat cattggaaaa 9840gacctgcagg gggggggggg cgctgaggtc tgcctcgtga
agaaggtgtt gctgactcat 9900accaggcctg aatcgcccca tcatccagcc agaaagtgag
ggagccacgg ttgatgagag 9960ctttgttgta ggtggaccag ttggtgattt tgaacttttg
ctttgccacg gaacggtctg 10020cgttgtcggg aagatgcgtg atctgatcct tcaactcagc
aaaagttcga tttattcaac 10080aaagccgccg tcccgtcaag tcagcgtaat gctctgccag
tgttacaacc aattaaccaa 10140ttctgattag aaaaactcat cgagcatcaa atgaaactgc
aatttattca tatcaggatt 10200atcaatacca tatttttgaa aaagccgttt ctgtaatgaa
ggagaaaact caccgaggca 10260gttccatagg atggcaagat cctggtatcg gtctgcgatt
ccgactcgtc caacatcaat 10320acaacctatt aatttcccct cgtcaaaaat aaggttatca
agtgagaaat caccatgagt 10380gacgactgaa tccggtgaga atggcaaaag cttatgcatt
tctttccaga cttgttcaac 10440aggccagcca ttacgctcgt catcaaaatc actcgcatca
accaaaccgt tattcattcg 10500tgattgcgcc tgagcgagac gaaatacgcg atcgctgtta
aaaggacaat tacaaacagg 10560aatcgaatgc aaccggcgca ggaacactgc cagcgcatca
acaatatttt cacctgaatc 10620aggatattct tctaatacct ggaatgctgt tttcccgggg
atcgcagtgg tgagtaacca 10680tgcatcatca ggagtacgga taaaatgctt gatggtcgga
agaggcataa attccgtcag 10740ccagtttagt ctgaccatct catctgtaac atcattggca
acgctacctt tgccatgttt 10800cagaaacaac tctggcgcat cgggcttccc atacaatcga
tagattgtcg cacctgattg 10860cccgacatta tcgcgagccc atttataccc atataaatca
gcatccatgt tggaatttaa 10920tcgcggcctc gagcaagacg tttcccgttg aatatggctc
ataacacccc ttgtattact 10980gtttatgtaa gcagacagtt ttattgttca tgatgatata
tttttatctt gtgcaatgta 11040acatcagaga ttttgagaca caacgtggct ttcccccccc
cccctgcagg tcaattcggt 11100cgatatggct attacgaaga aggctcgtgc gcggagtccc
gtgaactttc ccacgcaaca 11160agtgaaccgc accgggtttg ccggaggcca tttcgttaaa
atgcgcagcc atggctgctt 11220cgtccagcat ggcgtaatac tgatcctcgt cttcggctgg
cggtatattg ccgatgggct 11280tcaaaagccg ccgtggttga accagtctat ccattccaag
gtagcgaact cgaccgcttc 11340gaagctcctc catggtccac gccgatgaat gacctcggcc
ttgtaaagac cgttgatcgc 11400ttctgcgagg gcgttgtcgt gctgtcgccg acgcttccga
tagatggctc gatacctgct 11460tctgccaacc gctcggaata gcgaaaggac acgtattgaa
caccgcgatc cgagtgatgc 11520actaggccgc catgagcggg acgccgatca tgatgagcct
cctcgagggc atcgaggaca 11580aagcctgcat gtgctgtccg gctcgcccgc catccgacaa
tgcgacgggc gaagacgtcg 11640atcacgaagg ccacgtagac gaagccctcc caagtggcga
cataagtacg gacatgcgca 11700aaggctttcc cggtttgtcg ctgatggtgc aagagacgct
gaagcgcgat ccgatgcgca 11760ggcatctgtt cgtcttccgc ggtcgtggcg gtggcctgat
caaggtcact cgccgaagag 11820ctgcatgatt ggctcgaaac cgagcggggg aaattgtcgc
gcagttctcc cgtcgccgag 11880gcgataaatt acatgctcaa gcgatgggat ggcattacgt
cattcctcga tgacggcccg 11940atttgcctga cgaacaatgc tgccgaacga acgctcagag
gctatgtact cggcaggaag 12000tcatggctgt ttgccggatc ggatcgttgt gctgaacgtg
cggcgttcat ggcgacactg 12060atcatgagcg ccaagctcaa taacatcgat ccgcaggcct
ggcttgccga cgtccgcgcc 12120gaccttgcgg acgctccgat cagcaggctt gagcaacagc
tgccgtggaa ctggacatcc 12180aagacactga gtgctcaggc ggcctgacct gcggccttca
ccggatactt accccattat 12240cgcagattgc gatgaagcat cagcgtcatt cagcaatctt
gccaaagtat gcaggctcgc 12300gagaatcgac gtgcgaaacc ggctggttgc gccaaagatc
cgcttgcgga gcggtcgaac 12360attcatgctg ggacttcaag aggtcgagta gaggaagaac
cggaaaggtt gcaccggaaa 12420atatgcgttc ctttggagag cgcctcatgg acgtgaacaa
atcgcccgga ccaaggatgc 12480cacggataca aaagctcgcg aagctcggtc ccgtgggtgt
tctgtcgtct cgttgtacaa 12540cgaaatccat tcccattccg cgctcaagat ggcttcccct
cggcagttca tcagggctaa 12600atcaatctag ccgacttgtc cggtgaaatg ggctgcactc
caacagaaac aatcaaacaa 12660acatacacag cgacttattc acacgagctc aaattacaac
ggtatatatc ctgccagtca 12720gcatcatcac accaaaagtt aggcccgaat agtttgaaat
tagaaagctc gcaattgagg 12780tctacaggcc aaattcgctc ttagccgtac aatattactc
accggtgcga tgccccccat 12840cgtaggtgaa ggtggaaatt aatgatccat cttgagacca
caggcccaca acagctacca 12900gtttcctcaa gggtccacca aaaacgtaag cgcttacgta
catggtcgat aagaaaaggc 12960aatttgtaga tgttaacatc caacgtcgct ttcagggatc
gatccaatac gcaaaccgcc 13020tctccccgcg cgttggccga ttcattaatg cagctggcac
gacaggtttc ccgactggaa 13080agcgggcagt gagcgcaacg caattaatgt gagttagctc
actcattagg caccccaggc 13140tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt
gtgagcggat aacaatttca 13200cacaggaaac agctatgacc atgattacgc caagcttgca
tgcctgcagg tcgactctag 13260aggatctggc gcgccctata gatgggatga agctgctctc
gacaaatctg ataaaactaa 13320agaaggttag taatcaattt ttacaaaatc atagattatt
tttttcattg aattattttt 13380atgctatacc aagaattgta ttttagtatt tgttttaact
acatataata gaattaacta 13440catataaatt aactaaactt aaaataaaaa tagatttgtt
tcctgaaatt attttaagaa 13500tatatatgta tatatctaaa atcttagact tagatagatt
tttctatcta tctattttgg 13560ttacttaaaa taaataaatt tgtataaata attgtatagt
tatcaaaaat taaaactaat 13620ttttttaaag ttgttgatat ataaaatact aaagatttaa
cgattaagta tttatttaag 13680tatagaattt tgtttttttt ttaagtttag ttatgaagtt
gttaattata ttaaaacaaa 13740acaatatttc gaaattttat tatcatattc gaatatattt
tttttagtga tgatgtatga 13800attattatca taatttgaaa gtttactaaa aaatatatca
acatgaattg taatatatga 13860gttattacct taaccaaaat tataaattaa cattaaatat
aattatatat gtcatattta 13920gccatacaat gtgtcatcaa tattaatagt catgtcaata
ttacataatg ccaatattat 13980gctacttaaa ccccaaatcc cctaactccc gttaagtagc
caaattcata aatatactta 14040ttcgacaaaa taaaaaactt taaaatattt actaatccga
ccatgcacaa gcatccattc 14100cctattccat tgccacggga taacaatgca accnactcct
caaaaaaaga aaaattcaag 14160ctcttttgca aaaaaaaata aaataatttt aacacctaaa
attttttgtt tccaaacttc 14220tacagggaac acacataaaa gaaaaagagg acgtccactc
ggatcacgca acaaaccaaa 14280aggtgtgtca tgactcctaa gatataatat ttccttattc
aaaatcatac cattttaaat 14340tatgaatgta tttcgtagtc caccagatat gtaatccacc
agcgttcaaa ccaaagtttt 14400atgattgtaa gtttaagtga attataataa tatattcttc
acggtatctt ttcataacta 14460attgagttat caaacttgat cgcacatgtg gctttgatag
gtgtgacttt tatggtatac 14520aattctttca acctaaaaac attattgttc ctcaatatct
tacattatgc ttgactgcaa 14580caaaatattt tctcatctgt tttcttcctt taaaccaatt
tattatcatc tatttcctga 14640cattttaatc catccaccta tgtcaaaaac ttatagaaaa
tgtcaacttc caaacaaaac 14700ataattgaac ttcgcaaata aattcttaat aatattaaaa
aatgttactt aattatttct 14760tcaaccccat tttccgcgcg tagcgcggac aaagactcta
gttaaatata gaagtttccg 14820attctcatcg tataaaacgg tgactttggc gggctttcat
gtgtaacaaa ttggtttaac 14880aaaccactgc ctagtcgttt agtgtagaat cagcgcatgg
aactccgatt ggagcgtgac 14940tttcacgtgc cggaggccca ccaccacagc gggcgttacg
ctctaagaat ctcgcccacg 15000gttttcttca tctgcccccc gccaagtgtc ttcctcgttc
gccacttctc accaagttac 15060aggaacccta aaaatggcct ttcttcagcc ccggctataa
tacacacatg atcctatagt 15120gggttcttcc acaagttaca tctccttctg gattgtacat
ttcaagtgtt tgtgtttttt 15180ctgcctctga gagaaaatcg cggccgcatg gagagatctc
aacggcagtc tcctccgcca 15240ccgtcgccgt cctcctcctc gtcctccgtc tccgcggaca
ccgtcctcgt ccctcccgga 15300aagaggcgga gggcggcgac ggccaaggcc ggcgccgagc
ctaataagag gatccgcaag 15360gaccccgccg ccgccgccgc ggggaagagg agctccgtct
acaggggagt caccaggcac 15420aggtggacgg gcaggttcga ggcgcatctc tgggacaagc
actgcctcgc cgcgctccac 15480aacaagaaga aaggcaggca agtctacctg ggggcgtatg
acagcgagga ggcagctgct 15540cgtgcctatg acctcgcagc tctcaagtac tggggtcctg
agactctgct caacttccct 15600gtggaggatt actccagcga gatgccggag atggaggccg
tgtcccggga ggagtacctg 15660gcctccctcc gccgcaggag cagcggcttc tccaggggcg
tctccaagta cagaggcgtc 15720gccaggcatc accacaacgg gaggtgggag gcacggattg
ggcgagtctt tgggaacaag 15780tacctctact tgggaacatt tgacactcaa gaagaggcag
ccaaggccta tgaccttgcg 15840gccattgaat accgtggcgt caatgctgta accaacttcg
acatcagctg ctacctggac 15900cacccgctgt tcctggcaca gctccaacag gagccacagg
tggtgccggc actcaaccaa 15960gaacctcaac ctgatcagag cgaaaccgga actacagagc
aagagccgga gtcaagcgaa 16020gccaagacac cggatggcag tgcagaaccc gatgagaacg
cggtgcctga cgacaccgcg 16080gagcccctca ccacagtcga cgacagcatc gaagagggct
tgtggagccc ttgcatggat 16140tacgagctag acaccatgtc gagaccaaac tttggcagct
caatcaatct gagcgagtgg 16200ttcgctgacg cagacttcga ctgcaacatc ggatgcctgt
tcgatgggtg ttctgcggct 16260gacgaaggaa gcaaggatgg tgtaggtctg gcagatttca
gtctgtttga ggcaggtgat 16320gtccagctga aggatgttct ttcggatatg gaagagggga
tacaacctcc agcgatgatc 16380agtgtgtgca acgcggccgc aagtatgaac taaaatgcat
gtaggtgtaa gagctcatgg 16440agagcatgga atattgtatc cgaccatgta acagtataat
aactgagctc catctcactt 16500cttctatgaa taaacaaagg atgttatgat atattaacac
tctatctatg caccttattg 16560ttctatgata aatttcctct tattattata aatcatctga
atcgtgacgg cttatggaat 16620gcttcaaata gtacaaaaac aaatgtgtac tataagactt
tctaaacaat tctaacctta 16680gcattgtgaa cgagacataa gtgttaagaa gacataacaa
ttataatgga agaagtttgt 16740ctccatttat atattatata ttacccactt atgtattata
ttaggatgtt aaggagacat 16800aacaattata aagagagaag tttgtatcca tttatatatt
atatactacc catttatata 16860ttatacttat ccacttattt aatgtcttta taaggtttga
tccatgatat ttctaatatt 16920ttagttgata tgtatatgaa agggtactat ttgaactctc
ttactctgta taaaggttgg 16980atcatcctta aagtgggtct atttaatttt attgcttctt
acagataaaa aaaaaattat 17040gagttggttt gataaaatat tgaaggattt aaaataataa
taaataacat ataatatatg 17100tatataaatt tattataata taacatttat ctataaaaaa
gtaaatattg tcataaatct 17160atacaatcgt ttagccttgc tggacgaatc tcaattattt
aaacgagagt aaacatattt 17220gactttttgg ttatttaaca aattattatt taacactata
tgaaattttt ttttttatca 17280gcaaagaata aaattaaatt aagaaggaca atggtgtccc
aatccttata caaccaactt 17340ccacaagaaa gtcaagtcag agacaacaaa aaaacaagca
aaggaaattt tttaatttga 17400gttgtcttgt ttgctgcata atttatgcag taaaacacta
cacataaccc ttttagcagt 17460agagcaatgg ttgaccgtgt gcttagcttc ttttatttta
tttttttatc agcaaagaat 17520aaataaaata aaatgagaca cttcagggat gtttcaacaa
gcttggatcc tcgaagagaa 17580gggttaataa cacacttttt taacattttt aacacaaatt
ttagttattt aaaaatttat 17640taaaaaattt aaaataagaa gaggaactct ttaaataaat
ctaacttaca aaatttatga 17700tttttaataa gttttcacca ataaaaaatg tcataaaaat
atgttaaaaa gtatattatc 17760aatattctct ttatgataaa taaaaagaaa aaaaaaataa
aagttaagtg aaaatgagat 17820tgaagtgact ttaggtgtgt ataaatatat caaccccgcc
aacaatttat ttaatccaaa 17880tatattgaag tatattattc catagccttt atttatttat
atatttatta tataaaagct 17940ttatttgttc taggttgttc atgaaatatt tttttggttt
tatctccgtt gtaagaaaat 18000catgtgcttt gtgtcgccac tcactattgc agctttttca
tgcattggtc agattgacgg 18060ttgattgtat ttttgttttt tatggttttg tgttatgact
taagtcttca tctctttatc 18120tcttcatcag gtttgatggt tacctaatat ggtccatggg
tacatgcatg gttaaattag 18180gtggccaact ttgttgtgaa cgatagaatt ttttttatat
taagtaaact atttttatat 18240tatgaaataa taataaaaaa aatattttat cattattaac
aaaatcatat tagttaattt 18300gttaactcta taataaaaga aatactgtaa cattcacatt
acatggtaac atctttccac 18360cctttcattt gttttttgtt tgatgacttt ttttcttgtt
taaatttatt tcccttcttt 18420taaatttgga atacattatc atcatatata aactaaaata
ctaaaaacag gattacacaa 18480atgataaata ataacacaaa tatttataaa tctagctgca
atatatttaa actagctata 18540tcgatattgt aaaataaaac tagctgcatt gatactgata
aaaaaatatc atgtgctttc 18600tggactgatg atgcagtata cttttgacat tgcctttatt
ttatttttca gaaaagcttt 18660cttagttctg ggttcttcat tatttgtttc ccatctccat
tgtgaattga atcatttgct 18720tcgtgtcaca aatacaattt agntaggtac atgcattggt
cagattcacg gtttattatg 18780tcatgactta agttcatggt agtacattac ctgccacgca
tgcattatat tggttagatt 18840tgataggcaa atttggttgt caacaatata aatataaata
atgtttttat attacgaaat 18900aacagtgatc aaaacaaaca gttttatctt tattaacaag
attttgtttt tgtttgatga 18960cgttttttaa tgtttacgct ttcccccttc ttttgaattt
agaacacttt atcatcataa 19020aatcaaatac taaaaaaatt acatatttca taaataataa
cacaaatatt tttaaaaaat 19080ctgaaataat aatgaacaat attacatatt atcacgaaaa
ttcattaata aaaatattat 19140ataaataaaa tgtaatagta gttatatgta ggaaaaaagt
actgcacgca taatatatac 19200aaaaagatta aaatgaacta ttataaataa taacactaaa
ttaatggtga atcatatcaa 19260aataatgaaa aagtaaataa aatttgtaat taacttctat
atgtattaca cacacaaata 19320ataaataata gtaaaaaaaa ttatgataaa tatttaccat
ctcataagat atttaaaata 19380atgataaaaa tatagattat tttttatgca actagctagc
caaaaagaga acacgggtat 19440atataaaaag agtaccttta aattctactg tacttccttt
attcctgacg tttttatatc 19500aagtggacat acgtgaagat tttaattatc agtctaaata
tttcattagc acttaatact 19560tttctgtttt attcctatcc tataagtagt cccgattctc
ccaacattgc ttattcacac 19620aactaactaa gaaagtcttc catagccccc caagcggccc
atggcctcct ccgaggacgt 19680catcaaggag ttcatgcgct tcaaggtgcg catggagggc
tccgtgaacg gccacgagtt 19740cgagatcgag ggcgagggcg agggccgccc ctacgagggc
acccagaccg ccaagctgaa 19800ggtgaccaag ggcggccccc tgcccttcgc ctgggacatc
ctgtcccccc agttccagta 19860cggctccaag gtgtacgtga agcaccccgc cgacatcccc
gactacaaga agctgtcctt 19920ccccgagggc ttcaagtggg agcgcgtgat gaacttcgag
gacggcggcg tggtgaccgt 19980gacccaggac tcctccctgc aggacggctc cttcatctac
aaggtgaagt tcatcggcgt 20040gaacttcccc tccgacggcc ccgtaatgca gaagaagact
atgggctggg aggcctccac 20100cgagcgcctg tacccccgcg acggcgtgct gaagggcgag
atccacaagg ccctgaagct 20160gaaggacggc ggccactacc tggtggagtt caagtccatc
tacatggcca agaagcccgt 20220gcagctgccc ggctactact acgtggactc caagctggac
atcacctccc acaacgagga 20280ctacaccatc gtggagcagt acgagcgcgc cgagggccgc
caccacctgt tcctgtagcg 20340gccggccgcg acacaagtgt gagagtacta aataaatgct
ttggttgtac gaaatcatta 20400cactaaataa aataatcaaa gcttatatat gccttccgct
aaggccgaat gcaaagaaat 20460tggttctttc tcgttatctt ttgccacttt tactagtacg
tattaattac tacttaatca 20520tctttgttta cggctcatta tatccgtcga cgg
2055372586DNABrassica napus 72ccaatttatt atcatctatt
tcctgacatt ttaatccatc cacctatgtc aaaaacttat 60agaaaatgtc aacttccaaa
caaaacataa ttgaacttcg caaataaatt cttaataata 120ttaaaaaatg ttacttaatt
atttcttcaa ccccattttc cgcgcgtagc gcggacaaag 180actctagtta aatatagaag
tttccgattc tcatcgtata aaacggtgac tttggcgggc 240tttcatgtgt aacaaattgg
tttaacaaac cactgcctag tcgtttagtg tagaatcagc 300gcatggaact ccgattggag
cgtgactttc acgtrccgga ggcccaccac cwcagcgggc 360gttacgctct aagaatctcg
cccacggttt tcttcatctc ccccccgcca agtgtctccc 420tcgttcgcca cttctcatca
tgttacaggg accataaaaa tggcgtattt cttcagcccc 480gggtataaat acacacatga
tcctgtggtg ggttcttcca caagttacat ctccttctgg 540tttttgtatt gcaagtgttt
gtattttttg cctccgagag aaaatc 586731924DNABrassica
napusmisc_feature(859)..(859)n is a, c, g, or t 73ctatagatgg gatgaagctg
ctctcgacaa atctgataaa actaaagaag gttagtaatc 60aatttttaca aaatcataga
ttattttttt cattgaatta tttttatgct ataccaagaa 120ttgtatttta gtatttgttt
taactacata taatagaatt aactacatat aaattaacta 180aacttaaaat aaaaatagat
ttgtttcctg aaattatttt aagaatatat atgtatatat 240ctaaaatctt agacttagat
agatttttct atctatctat tttggttact taaaataaat 300aaatttgtat aaataattgt
atagttatca aaaattaaaa ctaatttttt taaagttgtt 360gatatataaa atactaaaga
tttaacgatt aagtatttat ttaagtatag aattttgttt 420tttttttaag tttagttatg
aagttgttaa ttatattaaa acaaaacaat atttcgaaat 480tttattatca tattcgaata
tatttttttt agtgatgatg tatgaattat tatcataatt 540tgaaagttta ctaaaaaata
tatcaacatg aattgtaata tatgagttat taccttaacc 600aaaattataa attaacatta
aatataatta tatatgtcat atttagccat acaatgtgtc 660atcaatatta atagtcatgt
caatattaca taatgccaat attatgctac ttaaacccca 720aatcccctaa ctcccgttaa
gtagccaaat tcataaatat acttattcga caaaataaaa 780aactttaaaa tatttactaa
tccgaccatg cacaagcatc cattccctat tccattgcca 840cgggataaca atgcaaccna
ctcctcaaaa aaagaaaaat tcaagctctt ttgcaaaaaa 900aaataaaata attttaacac
ctaaaatttt ttgtttccaa acttctacag ggaacacaca 960taaaagaaaa agaggacgtc
cactcggatc acgcaacaaa ccaaaaggtg tgtcatgact 1020cctaagatat aatatttcct
tattcaaaat cataccattt taaattatga atgtatttcg 1080tagtccacca gatatgtaat
ccaccagcgt tcaaaccaaa gttttatgat tgtaagttta 1140agtgaattat aataatatat
tcttcacggt atcttttcat aactaattga gttatcaaac 1200ttgatcgcac atgtggcttt
gataggtgtg acttttatgg tatacaattc tttcaaccta 1260aaaacattat tgttcctcaa
tatcttacat tatgcttgac tgcaacaaaa tattttctca 1320tctgttttct tcctttaaac
caatttatta tcatctattt cctgacattt taatccatcc 1380acctatgtca aaaacttata
gaaaatgtca acttccaaac aaaacataat tgaacttcgc 1440aaataaattc ttaataatat
taaaaaatgt tacttaatta tttcttcaac cccattttcc 1500gcgcgtagcg cggacaaaga
ctctagttaa atatagaagt ttccgattct catcgtataa 1560aacggtgact ttggcgggct
ttcatgtgta acaaattggt ttaacaaacc actgcctagt 1620cgtttagtgt agaatcagcg
catggaactc cgattggagc gtgactttca cgtgccggag 1680gcccaccacc acagcgggcg
ttacgctcta agaatctcgc ccacggtttt cttcatctgc 1740cccccgccaa gtgtcttcct
cgttcgccac ttctcaccaa gttacaggaa ccctaaaaat 1800ggcctttctt cagccccggc
tataatacac acatgatcct atagtgggtt cttccacaag 1860ttacatctcc ttctggattg
tacatttcaa gtgtttgtgt tttttctgcc tctgagagaa 1920aatc
192474444PRTArtificial
SequenceODP1 consensus sequence 74Met Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa
Xaa Xaa Xaa Xaa Ser Xaa1 5 10
15 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa 20 25 30 Xaa
Xaa Xaa Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 35
40 45 Arg Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 50 55
60 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Arg Ser65 70 75
80 Ser Xaa Tyr Arg Gly Val Thr Arg His Arg Trp Thr Gly Arg Phe Glu
85 90 95 Ala His Leu
Trp Asp Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Lys 100
105 110 Xaa Gly Xaa Gln Val Tyr Leu Gly
Ala Tyr Asp Xaa Glu Glu Xaa Ala 115 120
125 Ala Xaa Xaa Tyr Asp Leu Ala Ala Leu Lys Tyr Trp Gly
Xaa Xaa Xaa 130 135 140
Xaa Leu Asn Phe Pro Xaa Glu Xaa Tyr Xaa Xaa Glu Xaa Xaa Glu Met145
150 155 160 Xaa Xaa Val Xaa Xaa
Glu Glu Tyr Leu Ala Ser Leu Arg Arg Xaa Ser 165
170 175 Ser Gly Phe Ser Arg Gly Xaa Ser Lys Tyr
Arg Gly Val Ala Arg His 180 185
190 His His Asn Gly Arg Trp Glu Ala Arg Ile Gly Arg Val Xaa Gly
Xaa 195 200 205 Lys
Tyr Leu Tyr Leu Gly Thr Xaa Xaa Thr Gln Glu Glu Ala Ala Xaa 210
215 220 Ala Tyr Asp Xaa Ala Ala
Ile Glu Tyr Arg Gly Xaa Asn Ala Val Thr225 230
235 240 Asn Phe Asp Ile Ser Xaa Tyr Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa 245 250
255 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
260 265 270 Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Xaa Xaa 275
280 285 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa 290 295
300 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa305 310 315
320 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Trp Xaa Xaa Xaa Xaa
325 330 335 Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 340
345 350 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa 355 360
365 Phe Xaa Xaa Xaa Ile Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa
Xaa Xaa 370 375 380
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa385
390 395 400 Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 405
410 415 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa 420 425
430 Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa 435
440
User Contributions:
Comment about this patent or add new information about this topic: