Patent application title: ANTIGEN DELIVERY PLATFORMS
Inventors:
Michael Franti (Redding, CT, US)
Anders Lilja (Somerville, MA, US)
Rebecca Loomis (Philadelphia, PA, US)
Peter W. Mason (Somerville, MA, US)
Assignees:
GlaxoSmithKline Biologicals, s.a.
IPC8 Class: AC07K14005FI
USPC Class:
Class name:
Publication date: 2022-07-07
Patent application number: 20220213149
Abstract:
This disclosure provides platforms for delivery of herpes virus proteins
to cells, particularly proteins that form complexes in vivo. In some
embodiments these proteins and the complexes they form elicit potent
neutralizing antibodies. Thus, presentation of herpes virus proteins
using the disclosed platforms permits the generation of broad and potent
immune responses useful for vaccine development.
Claims:
1-46. (canceled)
47. A composition comprising an RNA delivery system and RNA molecules comprising a sequence that encodes a Varicella zoster virus (VZV) glycoprotein gE immunogen.
48. The composition of claim 47, wherein the RNA delivery system is lipid nanoparticles.
49. The composition of claim 47, wherein the RNA delivery system is a cationic nanoemulsion.
50. The composition of claim 48, the lipid nanoparticles comprising a PEGylated lipid, cholesterol, 1,2-distearoyl-sn-glycero-3-phosphocholine (DSPC), and a cationic lipid.
51. The composition of claim 47, the RNA molecules comprising at least one modified nucleotide.
52. The composition of claim 51, the at least one modified nucleotide selected from the group comprising pseudouridine, N6-methyladenosine, 5-methylcytidine, and 5-methyluridine.
53. The composition of claim 47, the RNA molecules further comprising a 3' poly A tail.
54. The composition of claim 47, the RNA molecules further comprising a 5' Cap.
55. The composition of claim 54, the 5' Cap having a Cap 0 structure or a Cap 1 structure.
56. The composition of claim 47, the RNA molecules being self-replicating RNA.
57. The composition of claim 50, the cationic lipid comprising a tertiary amine.
58. The composition of claim 48, the lipid nanoparticles encapsulating at least half of the RNA molecules.
59. The composition of claim 48, the RNA molecules further comprising a 5' Cap 1 structure and a 3' poly A tail; and the lipid nanoparticles comprising a PEGylated lipid, cholesterol, DSPC, and a cationic lipid comprising a tertiary amine.
60. A method of eliciting an immune response in an individual, the method comprising administering to the individual an effective amount of the composition of claim 47.
61. The method of claim 60 comprising administering two or more doses of the composition.
62. A method of eliciting an immune response in an individual, the method comprising administering to the individual an effective amount of the composition of claim 50.
63. The method of claim 62 comprising administering two or more doses of the composition.
64. A method of eliciting an immune response in an individual, the method comprising administering to the individual an effective amount of the composition of claim 59.
65. The method of claim 64 comprising administering two or more doses of the composition.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This is a Divisional of U.S. patent application Ser. No. 16/114,621, filed Aug. 28, 2018, which is a Continuation of U.S. patent application Ser. No. 13/878,835, filed Oct. 10, 2013 (now abandoned), which is the U.S. National Stage application submitted under 35 U.S.C. § 371 for International Application No. PCT/US2011/55834, filed Oct. 11, 2011, which claims priority to U.S. Provisional Application No. 61/391,960, filed Oct. 11, 2010, all of which are incorporated herein by reference in their entireties.
SUBMISSION OF SEQUENCE LISTING ON ASCII TEXT FILE
[0002] The content of the following submission on ASCII text file is incorporated herein by reference in its entirety: a computer readable form (CRF) of the Sequence Listing (file name: VN54076D1_US_Seq_Listing.txt; created Jun. 28, 2021, size: 614,614 bytes).
BACKGROUND
[0003] Herpes viruses are widespread and cause a wide range of diseases in humans that in the worst cases can lead to substantial morbidity and mortality, primarily in immunocompromised individuals (e.g., transplant recipients and HIV-infected individuals). Humans are susceptible to infection by at least eight herpes viruses. Herpes simplex virus-1 (HSV-1, HHV-1), Herpes simplex virus-2 (HSV-2, HHV-2) and Varicella zoster virus (VZV, HHV-3) are alpha-subfamily viruses; cytomegalovirus (CMV, HHV-5) and Roseoloviruses (HHV-6 and HHV-7) are beta-subfamily viruses; and Epstein-Barr virus (EBV, HHV-4) and Kaposi's sarcoma-associated herpesvirus (KSHV, HHV-8) are gamma-subfamily viruses.
[0004] CMV infection leads to substantial morbidity and mortality in immunocompromised individuals (e.g., transplant recipients and HIV-infected individuals), and congenital infection can result in devastating defects in neurological development in neonates. CMV envelope glycoproteins gB, gH, gL, gM and gN represent attractive vaccine candidates as they are expressed on the viral surface and can elicit protective virus-neutralizing humoral immune responses. Some CMV vaccine strategies have targeted the major surface glycoprotein B (gB), which can induce a dominant antibody response (Go and Pollard, JID 197:1631-1633 (2008)). CMV glycoprotein gB can induce a neutralizing antibody response, and a large fraction of the antibodies that neutralize infection of fibroblasts in sera from CMV-positive patients is directed against gB (Britt 1990). Similarly, it has been reported that gH and gM/gN are targets of the immune response to natural infection (Urban et al (1996) J. Gen. Virol. 77(Pt. 7):1537-47; Mach et al (2000) J. Virol. 74(24):11881-92).
[0005] Complexes of CMV proteins are also attractive vaccine candidates because they appear to be involved in important processes in the viral life cycle. For example, the gH/gL/gO complex seems to have important roles in both fibroblast and epithelial/endothelial cell entry. The prevailing model suggests that the gH/gL/gO complex mediates infection of fibroblasts. hCMV gO-null mutants produce small plaques on fibroblasts and very low titer virus, indicating a role in entry (Dunn (2003), Proc. Natl. Acad. Sci. USA 100:14223-28; Hobom (2000) J. Virol. 74:7720-29). Recent studies suggest that gO is not incorporated into virions with gH/gL, but may act as a molecular chaperone, increasing gH/gL export from the ER to the Golgi apparatus and incorporation into virions (Ryckman (2009) J. Virol 82:60-70). Through pulse-chase experiments, it was shown that small amounts of gO remain bound to gH/gL for long periods of time but most gO dissociates and/or is degraded from the gH/gL/gO complex, as it is not found in extracellular virions or secreted from cells. When gO was deleted from a clinical strain of CMV (TR), those viral particles had significantly reduced amounts of gH/gL incorporated into the virion. Additionally, deletion of gO from TR virus also inhibited entry into epithelial and endothelial cells, suggesting that gH/gL is also required for epithelial/endothelial cell entry (Wille (2010) J. Virol. 84(5):2585-96).
[0006] CMV gH/gL can also associate with UL128, UL130, and UL131A (referred to here as UL131) and form a pentameric complex that is required for entry into several cell types, including epithelial cells, endothelial cells, and dendritic cells (Hahn et al (2004) J. Virol. 78(18):10023-33; Wang and Shenk (2005) Proc. Natl. Acad. Sci USA 102(50):18153-8; Gerna et al (2005) J. Gen. Virol. 84(Pt 6):1431-6; Ryckman et al (2008) J. Virol. 82:60-70). In contrast, this complex is not required for infection of fibroblasts. Laboratory hCMV isolates carry mutations in the UL128-UL131 locus, and mutations arise in clinical isolates after only a few passages in cultured fibroblasts (Akter et al (2003) J. Gen. Virol. 84(Pt 5):1117-22). During natural infection, the pentameric complex elicits antibodies that neutralize infection of epithelial cells and endothelial cells (and likely any other cell type where the pentameric complex mediates viral entry) with very high potency (Macagno et al (2010) J. Virol. 84(2):1005-13). It also appears that antibodies to this complex contribute significantly to the ability of human sera to neutralize infection of epithelial cells (Genini et al (2011) J. Clin. Virol. 52(2):113-8).
[0007] U.S. Pat. No. 5,767,250 discloses methods for making certain CMV protein complexes that contain gH and gL. The complexes are produced by introducing a DNA construct that encodes gH and a DNA construct that encodes gL into a cell so that the gH and gL are co-expressed.
[0008] WO 2004/076645 describes recombinant DNA molecules that encode CMV proteins. According to this document, combinations of distinct DNA molecules that encode different CMV proteins can be introduced into cells to cause co-expression of the encoded CMV proteins. When gM and gN were co-expressed in this way, they formed a disulfide-linked complex. Rabbits immunized with DNA constructs that produced the gM/gN complex or with a DNA construct encoding gB produced equivalent neutralizing antibody responses.
[0009] A need exists for nucleic acids that encode two or more herpes virus proteins, for methods of expressing two or more herpes virus proteins in the same cell, and for immunization methods that produce better immune responses.
SUMMARY OF THE INVENTION
[0010] The invention relates to platforms for co-delivery of two or more herpesvirus proteins, such as cytomegalovirus (CMV) proteins, to cells, particularly proteins that form complexes in vivo. In one aspect, the invention is a recombinant polycistronic nucleic acid molecule that contains a first sequence encoding a first herpesvirus (e.g., CMV) protein or fragment thereof, and a second sequence encoding a second herpesvirus (e.g., CMV) protein or fragment thereof.
[0011] For example, the invention provides a self-replicating RNA molecule comprising a polynucleotide which comprises a) a first nucleotide sequence encoding a first protein or fragment thereof from a herpes virus; and b) a second nucleotide sequence encoding a second protein or fragment thereof from the herpes virus. The first nucleotide sequence and second nucleotide sequence are operably linked to one or more control elements so that when the self-replicating RNA molecule is introduced into a suitable cell, the first and second herpes virus proteins or fragments thereof are produced in an amount sufficient for the formation of a complex in the cell that contains the first and second proteins or fragments. Preferably, the first protein and the second protein are not the same protein or fragments of the same protein, the first protein is not a fragment of the second protein, and the second protein is not a fragment of the first protein. The first nucleotide sequence can be operably linked to a first control element and the second nucleotide sequence can be operably linked to a second control element.
[0012] The self-replicating RNA molecule can further comprise a third nucleotide sequence encoding a third protein or fragment thereof from said herpes virus, optionally a fourth nucleotide sequence encoding a fourth protein or fragment thereof from said herpes virus; and optionally a fifth nucleotide sequence encoding a fifth protein or fragment thereof from said herpes virus. When sequences encoding additional proteins or fragments from a herpes virus are present (i.e., the third, fourth and fifth nucleotide sequences) they are operably linked to one or more control elements. In one example of a pentacistronic construct, the first nucleotide sequence is operably linked to a first control element, the second nucleotide sequence is operably linked to a second control element, the third nucleotide sequence is operably linked to a third control element, the fourth nucleotide sequence is operably linked to a fourth control element, and the fifth nucleotide sequence is operably linked to a fifth control element. The control elements present in the construct (e.g., first, second, third, fourth and fifth control elements) can be independently selected from the group consisting of a subgenomic promoter, an IRES, and a viral (e.g., FMDV) 2A site.
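The arrangement described above, in which each coding sequence is operably linked to its own control element, can be pictured with a small data-structure sketch. The Python below is purely illustrative: the class names, the function `describe_construct`, and the text layout it returns are hypothetical and are not part of the disclosure. The check that each coding sequence encodes a different protein reflects only the stated preference, not a requirement.

```python
from dataclasses import dataclass
from enum import Enum


class ControlElement(Enum):
    """Control element types named in the text."""
    SUBGENOMIC_PROMOTER = "subgenomic promoter"
    IRES = "IRES"
    VIRAL_2A = "viral 2A site"


@dataclass(frozen=True)
class Cistron:
    """One coding sequence and the control element it is linked to (hypothetical model)."""
    protein: str
    control: ControlElement


def describe_construct(cistrons):
    """Return a text layout of a polycistronic construct after basic checks."""
    if len(cistrons) < 2:
        raise ValueError("a polycistronic construct needs at least two coding sequences")
    names = [c.protein for c in cistrons]
    # Assumption: enforce the stated preference that the encoded proteins differ.
    if len(set(names)) != len(names):
        raise ValueError("each coding sequence should encode a different protein")
    return "-".join(f"[{c.control.value}]{c.protein}" for c in cistrons)


# Pentacistronic example: five proteins, each behind its own subgenomic promoter.
pentacistronic = [
    Cistron(p, ControlElement.SUBGENOMIC_PROMOTER)
    for p in ("gH", "gL", "UL128", "UL130", "UL131")
]
layout = describe_construct(pentacistronic)
```

Because the control element is chosen independently for each coding sequence, a mixed construct (for example, a subgenomic promoter followed by an IRES or a 2A site) is modeled simply by passing a different `ControlElement` value per `Cistron`.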
[0013] The herpes virus can be HSV-1, HSV-2, VZV, EBV type 1, EBV type 2, CMV, HHV-6 type A, HHV-6 type B, HHV-7 or HHV-8. In some embodiments, the recombinant polycistronic nucleic acid molecule (e.g., self-replicating RNA) encodes gH or a fragment thereof and gL or a fragment thereof of any one of these herpes viruses. In more particular embodiments, the herpes virus is CMV or VZV.
[0014] When the recombinant polycistronic nucleic acid molecule (e.g., self-replicating RNA) encodes two or more VZV proteins, the proteins can be selected from the group consisting of gB, gE, gH, gI, gL and a fragment (e.g., of at least 10 amino acids) thereof. In some embodiments, the recombinant polycistronic nucleic acid molecule (e.g., self-replicating RNA) encodes VZV gH or a fragment thereof and VZV gL or a fragment thereof.
[0015] In a particular example, the invention provides a self-replicating RNA molecule comprising a polynucleotide which comprises a) a first sequence encoding a first cytomegalovirus (CMV) protein or fragment thereof; and b) a second sequence encoding a second CMV protein or fragment thereof. The first sequence and second sequence are operably linked to one or more control elements so that when the self-replicating RNA molecule is introduced into a suitable cell, the first and second CMV proteins are produced in an amount sufficient for the formation of a complex in the cell that contains the first and second CMV proteins or fragments.
[0016] The first CMV protein and the second CMV protein are independently selected from the group consisting of gB, gH, gL, gO, gM, gN, UL128, UL130, UL131, and a fragment of any one of the foregoing. Preferably, the first CMV protein and the second CMV protein are not the same protein or fragments of the same protein, the first CMV protein is not a fragment of the second CMV protein, and the second CMV protein is not a fragment of the first CMV protein. If desired, the self-replicating RNA molecule can further comprise a third sequence encoding a third CMV protein, wherein the third sequence is operably linked to a control element. Similarly, additional sequences encoding additional CMV proteins (e.g., a fourth sequence encoding a fourth CMV protein, a fifth sequence encoding a fifth CMV protein) can be included. The control elements can be independently selected from the group consisting of a subgenomic promoter, an IRES, and a viral 2A site.
[0017] In some embodiments, the self-replicating nucleic acid molecule encodes the CMV proteins gH and gL. In other embodiments, the self-replicating RNA molecule encodes the CMV proteins gH, gL, and gO. In other embodiments, the self-replicating RNA molecule encodes the CMV proteins gH, gL, UL128, UL130 and UL131.
[0018] The self-replicating RNA molecule can be an alphavirus replicon. In such instances, the alphavirus replicon can be delivered in the form of an alphavirus replicon particle (VRP). The self-replicating RNA molecule can also be in the form of a "naked" RNA molecule.
[0019] The invention also relates to a recombinant DNA molecule that encodes a self-replicating RNA molecule as described herein. In some embodiments, the recombinant DNA molecule is a plasmid. In some embodiments, the recombinant DNA molecule includes a mammalian promoter that drives transcription of the encoded self-replicating RNA molecule.
[0020] The invention also relates to compositions that comprise a self-replicating RNA molecule as described herein and a pharmaceutically acceptable vehicle. The self-replicating RNA molecule can be "naked." In some embodiments, the composition comprises a self-replicating RNA molecule that encodes the CMV proteins gH and gL. In other embodiments, the composition further comprises a self-replicating RNA molecule that encodes the CMV protein gB. The composition can also contain an RNA delivery system such as a liposome, a polymeric nanoparticle, an oil-in-water cationic nanoemulsion or combinations thereof. For example, the self-replicating RNA molecule can be encapsulated in a liposome.
[0021] In certain embodiments, the composition comprises a VRP that contains an alphavirus replicon that encodes two or more CMV proteins. In some embodiments, the VRP comprises a replicon that encodes CMV gH and gL. If desired, the composition can further comprise a second VRP containing a replicon that encodes CMV gB. The composition can also comprise an adjuvant.
[0022] The invention also relates to methods of forming a CMV protein complex. In some embodiments a self-replicating RNA encoding two or more CMV proteins is delivered to a cell, the cell is maintained under conditions suitable for expression of the CMV proteins, wherein a CMV protein complex is formed. In other embodiments, a VRP that contains a self-replicating RNA encoding two or more CMV proteins is delivered to a cell, the cell is maintained under conditions suitable for expression of the CMV proteins, wherein a CMV protein complex is formed. The method can be used to form a CMV protein complex in a cell in vivo.
[0023] The invention also relates to a method for inducing an immune response in an individual. In some embodiments, a self-replicating RNA encoding two or more CMV proteins is administered to the individual. The self-replicating RNA molecule can be administered as a composition that contains an RNA delivery system, such as a liposome. In other embodiments, a VRP that contains a self-replicating RNA encoding two or more CMV proteins is administered to the individual. In preferred embodiments, the self-replicating RNA molecule encodes CMV proteins gH and gL. Preferably, the induced immune response comprises the production of neutralizing anti-CMV antibodies. More preferably, the neutralizing antibodies are complement-independent.
[0024] The invention also relates to a method of inhibiting CMV entry into a cell comprising contacting the cell with a self-replicating RNA molecule that encodes two or more CMV proteins, such as gH and gL. The cell can be selected from the group consisting of an epithelial cell, an endothelial cell, a fibroblast and combinations thereof. In some embodiments, the cell is contacted with a VRP that contains a self-replicating RNA encoding two or more CMV proteins.
[0025] The invention also relates to the use of a self-replicating RNA molecule that encodes two or more CMV proteins (e.g., a VRP, a composition comprising the self-replicating RNA molecule and a liposome) to form a CMV protein complex in a cell, to induce an immune response or to inhibit CMV entry into a cell.
BRIEF DESCRIPTION OF THE DRAWINGS
[0026] FIG. 1 is a schematic of CMV identifying known glycoprotein complexes involved in CMV entry into target cells. Envelope glycoproteins represent attractive vaccine candidates as they are expressed on the viral surface and can elicit protective and long lasting virus-neutralizing humoral immune responses. The structural glycoproteins mediating these processes can be divided into two classes: those that are conserved throughout the herpes virus family and those that are not. Among those that are conserved are gB, gH, gL, gM and gN. Many of these glycoproteins form complexes with one another (gH/gL/±gO; gH/gL/UL128/UL130/UL131; gM/gN) to facilitate localization to the viral surface and to carry out their functions in viral attachment, entry and cell fusion.
[0027] FIGS. 2A-2F are schematics of CMV constructs. FIG. 2A, Schematic of the gB constructs ("gB FL", full-length gB; soluble gBs "gB sol 750" and "gB sol 692") described in Example 1. Two different soluble versions of gB were constructed; gB sol 750 lacks the transmembrane spanning domain and cytoplasmic domain, gB sol 692 also lacks a hydrophobic region and is similar to the gB sol described in Reap et al. (2007) Clin. Vacc. Immunol. 14:748-55. FIG. 2B, Schematic of the gB replicon vectors used to produce viral replicon particles (VRPs). FIG. 2C, Schematic of the gH constructs ("gH FL", full-length gH; soluble gH "gH sol") described in Example 1. A single soluble version of gH was constructed which lacked the transmembrane spanning domain. FIG. 2D, Schematic of the gH replicon vectors used to produce VRPs. FIG. 2E, Schematic of gL construct described in Example 1. FIG. 2F, Schematic of the gL replicon vector used to produce VRPs. In FIGS. 2B, 2D and 2F, "NSP1," "NSP2," "NSP3," and "NSP4," are alphavirus nonstructural proteins 1-4, respectively, required for replication of the virus.
[0028] FIGS. 3A and 3B show that mice immunized with gB (FL, sol 750, sol 692) or gH (FL, sol) VRPs induced antibody responses that were neutralizing in the presence of guinea pig complement. The neutralization assay was done by pre-incubating the CMV virus strain TB40UL32E-GFP (which encodes the enhanced green fluorescent protein-GFP, Sampaio et al (2005) J. Virol. 79(5):2754-67), with mouse sera and guinea pig complement before infection of ARPE-19 epithelial cells. Five days post-infection, the number of GFP positive cells was determined. FIG. 3A, Serum dilution curves for all sera analyzed in ARPE-19 cells in the presence of complement. FIG. 3B, 50% neutralization titers for the sera samples. Virus incubated with pre-immune sera yielded low neutralization at low dilutions (1:40-1:80). gB (FL, sol 750, sol 692) sera had very strong neutralizing activity with 50% neutralization titers between 1:1800-1:2100. All gB immunized mice yielded a similar neutralization profile. gH (FL, sol) sera had neutralizing activity with 50% neutralization titers around 1:160. See Example 1.
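As a rough illustration of how a 50% neutralization titer such as those reported for FIG. 3B could be read off a serum dilution curve, the following Python sketch interpolates on a log scale between the two dilutions that bracket 50% neutralization. The function name, the choice of log-linear interpolation, and the example numbers are assumptions made for illustration only; they are not the assay data or the analysis procedure of Example 1.

```python
import math


def neutralization_titer_50(dilutions, percent_neutralization):
    """Return the reciprocal serum dilution giving 50% neutralization, or None.

    dilutions: reciprocal dilutions in increasing order (e.g. 40 for 1:40).
    percent_neutralization: percent reduction in GFP-positive cells relative
        to a virus-only control, one value per dilution.
    """
    points = list(zip(dilutions, percent_neutralization))
    for (d_lo, n_lo), (d_hi, n_hi) in zip(points, points[1:]):
        # Look for the first pair of adjacent dilutions that brackets 50%.
        if n_lo >= 50.0 >= n_hi and n_lo != n_hi:
            frac = (n_lo - 50.0) / (n_lo - n_hi)
            log_titer = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_titer
    return None  # the curve never crosses 50% within the tested range


# Hypothetical serum dilution curve (not data from the disclosure).
example_dilutions = [100, 400, 1600, 6400]
example_neutralization = [90.0, 70.0, 30.0, 5.0]
titer = neutralization_titer_50(example_dilutions, example_neutralization)
```

Interpolating on the logarithm of the dilution is a common convention because sera are tested in serial (multiplicative) dilutions; a four-parameter curve fit would be an alternative where more points are available.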
[0029] FIG. 4A is a schematic illustration of monocistronic replicons encoding green fluorescent protein (GFP) or red fluorescent protein (mCherry) and a bicistronic replicon encoding GFP and mCherry. "NSP1," "NSP2," "NSP3," and "NSP4," are alphavirus nonstructural proteins 1-4, respectively. The polycistronic alphavirus replicon system was designed by making modifications to the existing alphavirus replicon system to accommodate multiple subgenomic promoters driving genes of interest.
[0030] FIG. 4B shows fluorescence plots of FACS analysis of BHKV cells infected with VRPs containing mono- and bicistronic RNAs. Polycistronic alphavirus VRPs yield more cells expressing both genes of interest at approximately equal amounts (GFP and mCherry; 72.48%) than co-infection of GFP VRP+mCherry VRP (26.30%). See Example 2.
[0031] FIG. 5A is a schematic illustration of construction of polycistronic alphavirus replicon constructs encoding gH/gL and gH/gL/gO.
[0032] FIG. 5B shows that gH/gL form a complex in vitro. VRPs containing replicons encoding gH, gL, gO, gH/gL or gH/gL/gO were produced in BHKV cells. The resulting VRPs were used to infect ARPE-19 cells to demonstrate complex formation in vitro. The alphavirus infected ARPE-19 cells were harvested and analyzed for the presence of gH and gL. ARPE-19 cells infected with VRPs encoding gH/gL produced disulfide linked complexes of gH/gL (seen in the absence of DTT and heat). gO did not detectably alter the gH/gL association. The left hand blot shows expression of gH protein. The right hand blot shows expression of gL protein. Molecular weight markers are indicated between the blots. •=monomeric gH, ••=monomeric gL, <=heterodimer (gH+gL), *=dimer of heterodimers.
[0033] FIG. 5C shows immunoprecipitation of gH and gH/gL complexes from BHKV cells infected with VRPs. Immunoprecipitation was performed using mouse IgG antibodies as a control (Lanes 2, 4, 7, and 10) or mouse anti-gH antibodies (Genway) to immunoprecipitate gH (Lanes 3, 5, 8, and 11). Western blots were performed using pooled rabbit anti-gL antibody and rabbit anti-gH antibody. Lanes 1, 6, and 9 show gH protein (upper band ~75 kDa) and gL protein (lower band ~30 kDa) for reference. Lanes 2 and 3 are from lysates infected with gH-VRP. Lane 2 shows that the control antibody did not immunoprecipitate gH. Lane 3 shows the anti-gH antibody immunoprecipitated gH. Lanes 4 and 5 are from lysates infected with gL-VRP only. No gH protein was immunoprecipitated. Lanes 7 and 8 are from lysates infected with bicistronic gH/gL-VRP. Lane 8 shows that gL was immunoprecipitated using the gH antibody (see asterisk). Lanes 10 and 11 are from lysates infected with tricistronic gH/gL/gO-VRP. Lane 11 shows that gL was immunoprecipitated using the gH antibody (see asterisk). Molecular weight markers are also shown (MW). See Example 3.
[0034] FIGS. 6A-6C show that VRPs that effect gH/gL complex formation in vitro induce a potent immune response to CMV which is qualitatively and quantitatively superior to the response to gB VRPs. FIG. 6A and FIG. 6B show serum dilution curves for gH, gL, gO, gH+gL, gH+gL+gO, gH/gL and gH/gL/gO VRP-immunized mice in neutralization of TB40-UL32-EGFP infection of ARPE-19 cells in the presence (FIG. 6A) or absence (FIG. 6B) of complement. Various dilutions of sera were pre-incubated with TB40UL32E-GFP in the presence or absence of guinea pig complement and then added to ARPE-19 epithelial cells. After 5 days of infection with the virus, GFP-positive cells were counted. FIG. 6C is a graph showing 50% neutralization titers obtained in the presence and absence of complement. "3wp3," three weeks post-third immunization. VRPs expressing single CMV proteins (gH, gL, gO VRPs or co-administered gH, gL and gO VRPs) did not enhance neutralizing activity beyond that of gH alone. In contrast, sera from mice immunized with bicistronic gH/gL or tricistronic gH/gL/gO VRPs demonstrated potent neutralizing responses. Moreover, the potent neutralizing responses were similar in the presence and absence of guinea pig complement, showing that polycistronic VRPs successfully induced a complement-independent immune response. See Example 4.
[0035] FIGS. 7A and 7B show that VRPs that effect gH/gL complex formation in vitro induced antibodies that potently neutralized infection of MRC-5 fibroblast cells. FIG. 7A shows serum dilution curves for gH, gL, gO, gH+gL, gH+gL+gO, gH/gL and gH/gL/gO VRP-immunized mice in MRC-5 cells in the absence of complement. Various dilutions of sera were pre-incubated with TB40GFP in the presence or absence of guinea pig complement and then added to MRC-5 fibroblast cells. After 5 days of infection with the virus, GFP-positive cells were counted. FIG. 7B is a graph showing 50% neutralization titers obtained in a MRC-5 fibroblast cell model in the absence of complement. "3wp3," three weeks post-third immunization. VRPs expressing single CMV proteins (gH, gL, gO VRPs or co-administered gH, gL and gO VRPs) did not enhance neutralizing activity beyond that of gH alone. In contrast, sera from mice immunized with bicistronic gH/gL or tricistronic gH/gL/gO VRPs demonstrated extremely potent neutralizing responses. See Example 4.
[0036] FIGS. 8A and 8B are graphs showing that the neutralizing antibodies induced by delivery of the polycistronic VRPs were cross-neutralizing antibodies. The sera from mice immunized with gH/gL and gH/gL/gO VRPs were able to neutralize TB40UL32E-GFP and VR1814 clinical strains of CMV in both ARPE-19 epithelial cells (FIG. 8A) and MRC-5 fibroblast cells (FIG. 8B) in the absence of guinea pig complement in an IE-1 neutralization assay.
[0037] FIG. 9 is a graph showing that the neutralizing antibodies elicited against gH FL/gL are complement-independent and similar to natural immunity in titer. Mice were immunized with gB FL or gH FL/gL VRPs at 1×10^6 IU, 3 times, 3 weeks apart before the terminal bleed. Sera were analyzed for ability to neutralize TB40UL32E-EGFP CMV infection of ARPE-19 cells in the presence and absence of guinea pig complement in a neutralization assay. Unlike antibodies elicited by gB, antibodies elicited by gH FL/gL are complement-independent. Furthermore, gH FL/gL antibodies in these vaccinated mice were similar in titer to those found in naturally infected human subjects.
[0038] FIG. 10 shows a plasmid map for pVCR modified gH-SGPgL-SGPgO.
[0039] FIG. 11 shows a plasmid map for pVCR modified gH-SGPgL.
[0040] FIG. 12 shows a plasmid map for pVCR modified gH sol-SGPgL.
[0041] FIG. 13 shows a plasmid map for pVCR modified gH sol-SGPgL-SGPgO.
[0042] FIGS. 14A-14H show the nucleotide sequence (SEQ ID NO: 83) of the plasmid encoding the A160 self-replicating RNA molecule which encodes CMV surface glycoprotein H (gH) and CMV surface glycoprotein L (gL). The nucleotide sequences encoding gH and gL are underlined.
[0043] FIGS. 15A-15H show the nucleotide sequence (SEQ ID NO: 84) of the plasmid encoding the A322 self-replicating RNA molecule which encodes the soluble form of CMV surface glycoprotein H (gHsol) and CMV surface glycoprotein L (gL). The nucleotide sequences encoding gHsol and gL are underlined.
[0044] FIGS. 16A-16H show the nucleotide sequence (SEQ ID NO: 85) of the plasmid encoding the A323 self-replicating RNA molecule which encodes CMV surface glycoprotein B (gB). The nucleotide sequence encoding gB is underlined.
[0045] FIGS. 17A and 17B are histograms showing 50% neutralizing titers of sera from mice that were immunized with VRP or self-replicating RNA. FIG. 17A shows 50% neutralizing titers against human CMV strain TB40UL32E-EGFP ("TB40") on ARPE-19 cells, and FIG. 17B shows 50% neutralizing titers against human CMV strain 8819 on ARPE-19 cells.
[0046] FIG. 18 is a schematic of pentacistronic RNA replicons, A526 (SEQ ID NO:56), A527 (SEQ ID NO:57), A554 (SEQ ID NO:65), A555 (SEQ ID NO:66) and A556 (SEQ ID NO:67), that encode five CMV proteins. Subgenomic promoters are shown by arrows, other control elements are labeled.
[0047] FIG. 19 is a fluorescence histogram showing that BHKV cells transfected with the A527 RNA replicon express the gH/gL/UL128/UL130/UL131 pentameric complex. Cell stain was performed using antibodies that bind a conformational epitope present on the pentameric complex (Macagno (2010) J. Virol. 84(2):1005-13).
[0048] FIG. 20 is a schematic and graph. The schematic shows bicistronic RNA replicons, A160 and A531-A537, that encode CMV gH and gL. The graph shows neutralizing activity of immune sera from mice immunized with VRPs that contained the replicons.
[0049] FIG. 21 is a graph showing anti-VZV protein antibody response in immune sera from mice immunized with monocistronic RNA replicons that encoded VZV proteins or bicistronic RNA replicons that encoded VZV gE and gI, or gH and gL. The mice were immunized with 7 µg RNA formulated with a CNE (see, Example 7).
[0050] FIG. 22 is a graph showing anti-VZV protein antibody response in immune sera from mice immunized with monocistronic RNA replicons that encoded VZV proteins or bicistronic RNA replicons that encoded VZV gE and gI, or gH and gL. The mice were immunized with 1 µg RNA formulated with a CNE (see, Example 7).
DETAILED DESCRIPTION
[0051] The invention provides platforms for co-delivery of herpesvirus proteins, such as cytomegalovirus (CMV) proteins, to cells, particularly proteins that form complexes in vivo. In some embodiments, these proteins and the complexes they form elicit potent neutralizing antibodies. The immune response produced by co-delivery of herpesvirus (e.g., CMV) proteins, particularly those that form complexes in vivo (e.g., gH/gL), can be superior to the immune response produced using other approaches. For example, an RNA molecule (e.g., a replicon) that encodes both gH and gL of CMV can induce better neutralizing titers and/or protective immunity in comparison to an RNA molecule that encodes gB, an RNA molecule that encodes gH, an RNA molecule that encodes gL, or even a mixture of RNA molecules that individually encode gH or gL. Further, a replicon encoding gH/gL/UL128/UL130/UL131 can provide responses superior to those encoding only gH/gL.
[0052] In a general aspect, the invention relates to platforms for delivery of two or more herpesvirus (e.g., CMV) proteins to cells. The platforms comprise recombinant polycistronic nucleic acid molecules that contain a first sequence encoding a first herpesvirus (e.g., CMV) protein or fragment thereof, and a second sequence encoding a second herpesvirus (e.g., CMV) protein or fragment thereof. If desired, one or more additional sequences encoding additional proteins, for example, a third herpesvirus (e.g., CMV) protein or fragment thereof, a fourth herpesvirus (e.g., CMV) protein or fragment thereof, a fifth herpesvirus (e.g., CMV) protein or fragment thereof, etc., can be present in the recombinant polycistronic nucleic acid molecule. The sequences encoding herpesvirus (e.g., CMV) proteins or fragments thereof are operably linked to one or more suitable control elements so that the herpesvirus (e.g., CMV) proteins or fragments are produced by a cell that contains the recombinant polycistronic nucleic acid.
[0053] In the polycistronic nucleic acids described herein, the encoded first and second herpesvirus proteins or fragments, and the encoded third, fourth and fifth herpesvirus proteins or fragments, if present, generally and preferably are from the same herpesvirus. In certain examples, all herpesvirus proteins or fragments encoded by a polycistronic vector are CMV proteins or VZV proteins.
[0054] The recombinant polycistronic nucleic acid molecules described herein provide the advantage of delivering sequences that encode two or more herpesvirus (e.g., CMV) proteins to a cell, and driving the expression of the herpesvirus (e.g., CMV) proteins at sufficient levels to result in the formation of a protein complex containing the two or more herpesvirus (e.g., CMV) proteins in vivo. Using this approach, the two or more encoded herpesvirus (e.g., CMV) proteins can be expressed at sufficient intracellular levels for the formation of herpesvirus (e.g., CMV) protein complexes (e.g., gH/gL). For example, the encoded herpesvirus (e.g., CMV) proteins or fragments thereof can be expressed at substantially the same level, or if desired, at different levels by selecting appropriate expression control sequences (e.g., promoters, IRES, 2A site, etc.). This is a significantly more efficient way to produce protein complexes in vivo than co-delivering two or more individual DNA molecules that encode different herpesvirus (e.g., CMV) proteins to the same cell, which can be inefficient and highly variable. See, e.g., WO 2004/076645.
[0055] The recombinant polycistronic nucleic acid molecule can be based on any desired nucleic acid such as DNA (e.g., plasmid or viral DNA) or RNA. Any suitable DNA or RNA can be used as the nucleic acid vector that carries the open reading frames that encode herpesvirus (e.g., CMV) proteins or fragments thereof. Suitable nucleic acid vectors have the capacity to carry and drive expression of more than one protein gene. Such nucleic acid vectors are known in the art and include, for example, plasmids, DNA obtained from DNA viruses such as vaccinia virus vectors (e.g., NYVAC, see U.S. Pat. No. 5,494,807), and poxvirus vectors (e.g., ALVAC canarypox vector, Sanofi Pasteur), and RNA obtained from suitable RNA viruses such as an alphavirus. If desired, the recombinant polycistronic nucleic acid molecule can be modified, e.g., contain modified nucleobases and/or linkages as described further herein. Preferably, the polycistronic nucleic acid molecule is an RNA molecule.
[0056] In some aspects, the recombinant polycistronic nucleic acid molecule is a DNA molecule such as plasmid DNA. Such DNA molecules can, for example, encode a polycistronic replicon and contain a mammalian promoter that drives transcription of the replicon. Recombinant polycistronic nucleic acid molecules of this type can be administered to a mammal and then be transcribed in situ to produce a polycistronic replicon that expresses herpesvirus proteins.
[0057] In some aspects, the invention is a polycistronic nucleic acid molecule that contains a sequence encoding a herpesvirus gH or fragment thereof, and a herpesvirus gL or a fragment thereof. The gH and gL proteins, or fragments thereof, can be from any desired herpes virus such as HSV-1, HSV-2, VZV, EBV type 1, EBV type 2, CMV, HHV-6 type A, HHV-6 type B, HHV-7, KSHV, and the like. Preferably, the herpesvirus is VZV, HSV-2, HSV-1, EBV (type 1 or type 2) or CMV. More preferably, the herpesvirus is VZV, HSV-2 or CMV. Even more preferably, the herpesvirus is CMV. The sequences of gH and gL proteins and of nucleic acids that encode the proteins from these viruses are well known in the art. Exemplary sequences are identified in Table 1. The polycistronic nucleic acid molecule can contain a first sequence encoding a gH protein disclosed in Table 1, or a fragment thereof, or a sequence that is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical thereto. The polycistronic nucleic acid molecule can also contain a second sequence encoding a gL protein disclosed in Table 1, or a fragment thereof, or a sequence that is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical thereto.
TABLE 1
Virus                  gH accession number    gL accession number
HSV-1 (HHV-1)          NP_044623.1            NP_044602.1
HSV-2 (HHV-2)          NP_044491.1            NP_044470.1
VZV (HHV-3)            NP_040160.1            NP_040182.1
EBV type 1 (HHV-4)     YP_401700.1            YP_401678.1
EBV type 2 (HHV-4)     YP_001129496.1         YP_001129472.1
CMV (HHV-5)            YP_081523.1            YP_081555.1
HHV-6 type A           NP_042941.1            NP_042975.1
HHV-6 type B           NP_050229.1            NP_050261.1
HHV-7                  YP_073788.1            YP_073820.1
KSHV (HHV-8)           YP_001129375.1         YP_001129399.1
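By way of illustration only, the identity thresholds recited above (e.g., at least about 90% identical) can be checked with a short calculation. The following Python sketch assumes that the two amino acid sequences have already been aligned to equal length with gaps written as "-", and it assumes one particular definition of percent identity (identical columns divided by all columns in which at least one sequence has a residue). The disclosure does not specify an alignment method or a denominator, and the function names and example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences (gaps written as '-').

    Assumption: columns where both sequences have a gap are ignored, and a
    residue aligned against a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # gap-only column carries no information
        columns += 1
        if a == b:
            matches += 1
    if columns == 0:
        raise ValueError("no aligned residues to compare")
    return 100.0 * matches / columns


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 90.0) -> bool:
    """True if the pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from Table 1.
    reference = "MKTAYIAKQR"
    variant = "MKTAYIAKQK"
    print(percent_identity(reference, variant))
    print(meets_identity_threshold(reference, variant, 90.0))
```

In practice the aligned inputs would come from a pairwise alignment of a candidate sequence against a reference sequence such as one identified in Table 1; the choice of alignment tool and scoring parameters is outside the scope of this sketch.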
[0058] In this description of the invention, to facilitate a clear description of the nucleic acids, particular sequence components are referred to as a "first sequence," a "second sequence," etc. It is to be understood that the first and second sequences can appear in any desired order or orientation, and that no particular order or orientation is intended by the words "first", "second" etc. Similarly, protein complexes are referred to by listing the proteins that are present in the complex, e.g., gH/gL. This is intended to describe the complex by the proteins that are present in the complex and does not indicate relative amounts of the proteins or the order or orientation of sequences that encode the proteins on a recombinant nucleic acid.
[0059] Certain preferred embodiments, such as alphavirus VRP and self-replicating RNA that contain sequences encoding CMV proteins, are further described herein. It is intended that the sequences encoding CMV proteins in such preferred embodiments can be replaced with sequences encoding proteins, such as gH and gL, from other herpesviruses.
Alphavirus VRP Platforms
[0060] In some embodiments, CMV proteins are delivered to a cell using alphavirus replicon particles (VRP) which employ polycistronic replicons (or vectors) as described below. As used herein, "polycistronic" includes bicistronic vectors as well as vectors comprising three or more cistrons. Cistrons in a polycistronic vector can encode CMV proteins from the same CMV strains or from different CMV strains. The cistrons can be oriented in any 5'-3' order. Any nucleotide sequence encoding a CMV protein can be used to produce the protein. Exemplary sequences useful for preparing the polycistronic nucleic acids that encode two or more CMV proteins or fragments thereof are described herein.
[0061] As used herein, the term "alphavirus" has its conventional meaning in the art and includes various species such as Venezuelan equine encephalitis virus (VEE; e.g., Trinidad donkey, TC83CR, etc.), Semliki Forest virus (SFV), Sindbis virus, Ross River virus, Western equine encephalitis virus, Eastern equine encephalitis virus, Chikungunya virus, S.A. AR86 virus, Everglades virus, Mucambo virus, Barmah Forest virus, Middelburg virus, Pixuna virus, O'nyong-nyong virus, Getah virus, Sagiyama virus, Bebaru virus, Mayaro virus, Una virus, Aura virus, Whataroa virus, Babanki virus, Kyzylagach virus, Highlands J virus, Fort Morgan virus, Ndumu virus, and Buggy Creek virus. The term alphavirus may also include chimeric alphaviruses (e.g., as described by Perri et al., (2003) J. Virol. 77(19):10394-403) that contain genome sequences from more than one alphavirus.
[0062] An "alphavirus replicon particle" (VRP) or "replicon particle" is an alphavirus replicon packaged with alphavirus structural proteins.
[0063] An "alphavirus replicon" (or "replicon") is an RNA molecule which can direct its own amplification in vivo in a target cell. The replicon encodes the polymerase(s) which catalyze RNA amplification (nsP1, nsP2, nsP3, nsP4) and contains cis RNA sequences required for replication which are recognized and utilized by the encoded polymerase(s). An alphavirus replicon typically contains the following ordered elements: 5' viral sequences required in cis for replication, sequences which encode biologically active alphavirus nonstructural proteins (nsP1, nsP2, nsP3, nsP4), 3' viral sequences required in cis for replication, and a polyadenylate tract. An alphavirus replicon also may contain one or more viral subgenomic "junction region" promoters directing the expression of heterologous nucleotide sequences, which may, in certain embodiments, be modified in order to increase or reduce viral transcription of the subgenomic fragment and heterologous sequence(s) to be expressed. Other control elements can be used, as described below.
[0064] Alphavirus replicons encoding CMV proteins are used to produce VRPs. Such alphavirus replicons comprise sequences encoding at least two CMV proteins or fragments thereof. These sequences are operably linked to one or more suitable control elements, such as a subgenomic promoter, an IRES (e.g., EMCV, EV71), and a viral 2A site, which can be the same or different. Delivery of components of these complexes using the polycistronic vectors disclosed herein is an efficient way of providing nucleic acid sequences that encode two or more CMV proteins in desired relative amounts; whereas if multiple alphavirus constructs were used to deliver individual CMV proteins for complex formation, efficient co-delivery of VRPs would be required.
[0065] Any combination of suitable control elements can be used in any order. In one example, a single subgenomic promoter is operably linked to two sequences encoding two different CMV proteins, and an IRES is positioned between the two coding sequences. In another example, two sequences that encode two different CMV proteins are operably linked to separate promoters. In still another example, the two sequences that encode two different CMV proteins are operably linked to a single promoter. The two sequences that encode two different CMV proteins are linked to each other through a nucleotide sequence encoding a viral 2A site, and thus encode a single amino acid chain that contains the amino acid sequences of both CMV proteins. The viral 2A site in this context is used to generate two CMV proteins from the encoded polyprotein.
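The three example arrangements above can be summarized schematically. The following Python sketch is only an illustrative model, not a representation of any actual construct: it lists the ordered elements of a bicistronic cassette and groups the coding sequences into open reading frames, treating a subgenomic promoter ("SGP") or IRES as starting a new reading frame and a 2A site as keeping its neighbors in one reading frame. The labels, function names, and default gH/gL cargo are assumptions chosen for illustration.

```python
from typing import List

# Control elements that can sit between the two coding sequences.
ALLOWED_LINKERS = ("IRES", "SGP", "2A")


def bicistronic_layout(linker: str, first: str = "gH",
                       second: str = "gL") -> List[str]:
    """Ordered 5'-to-3' elements of a schematic bicistronic cassette."""
    if linker not in ALLOWED_LINKERS:
        raise ValueError(f"linker must be one of {ALLOWED_LINKERS}")
    return ["SGP", first, linker, second]


def open_reading_frames(elements: List[str]) -> List[List[str]]:
    """Group coding sequences into open reading frames.

    A subgenomic promoter or IRES closes the current reading frame;
    a 2A site leaves the flanking sequences in the same reading frame.
    """
    frames: List[List[str]] = []
    current: List[str] = []
    for element in elements:
        if element in ("SGP", "IRES"):
            if current:
                frames.append(current)
                current = []
        elif element == "2A":
            continue  # neighbors remain part of one polyprotein
        else:
            current.append(element)
    if current:
        frames.append(current)
    return frames


if __name__ == "__main__":
    for linker in ALLOWED_LINKERS:
        layout = bicistronic_layout(linker)
        print(linker, layout, open_reading_frames(layout))
```

Under this model the IRES and dual-promoter arrangements each yield two separate reading frames, whereas the 2A arrangement yields a single reading frame encoding both proteins, consistent with the polyprotein described above.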
[0066] Subgenomic Promoters
[0067] Subgenomic promoters, also known as junction region promoters, can be used to regulate protein expression. Alphaviral subgenomic promoters regulate expression of alphaviral structural proteins. See Strauss and Strauss, "The alphaviruses: gene expression, replication, and evolution," Microbiol Rev. 1994 September; 58(3):491-562. A polycistronic polynucleotide can comprise a subgenomic promoter from any alphavirus. When two or more subgenomic promoters are present in a polycistronic polynucleotide, the promoters can be the same or different. For example, the subgenomic promoter can have the sequence CTCTCTACGGCTAACCTGAATGGA (SEQ ID NO: 1). In certain embodiments, subgenomic promoters can be modified in order to increase or reduce viral transcription of the proteins. See U.S. Pat. No. 6,592,874.
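As a minimal sketch of how the exemplary promoter sequence might be located in a construct, the following Python code searches a nucleotide sequence for exact copies of SEQ ID NO: 1. It assumes exact matching on the sense strand only and treats U and T as equivalent so that DNA and RNA inputs are both accepted; the function name and the test construct are hypothetical, and modified promoters of the kind referenced above would not be found by an exact search.

```python
from typing import List

# Exemplary subgenomic promoter sequence (SEQ ID NO: 1), written as DNA.
SUBGENOMIC_PROMOTER = "CTCTCTACGGCTAACCTGAATGGA"


def find_subgenomic_promoters(sequence: str) -> List[int]:
    """Return 0-based start positions of exact promoter matches.

    The input may be DNA or RNA; U is converted to T before searching.
    Only the given strand is searched (an assumption of this sketch).
    """
    normalized = sequence.upper().replace("U", "T")
    positions: List[int] = []
    start = normalized.find(SUBGENOMIC_PROMOTER)
    while start != -1:
        positions.append(start)
        start = normalized.find(SUBGENOMIC_PROMOTER, start + 1)
    return positions


if __name__ == "__main__":
    # Hypothetical construct with two promoter copies, one written as RNA.
    construct = ("AAAA" + SUBGENOMIC_PROMOTER + "GGGG"
                 + SUBGENOMIC_PROMOTER.replace("T", "U"))
    print(find_subgenomic_promoters(construct))
```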
[0068] Internal Ribosomal Entry Site (IRES)
[0069] In some embodiments, one or more control elements is an internal ribosomal entry site (IRES). An IRES allows multiple proteins to be made from a single mRNA transcript as ribosomes bind to each IRES and initiate translation in the absence of a 5'-cap, which is normally required to initiate translation of protein in eukaryotic cells. For example, the IRES can be EV71 (SEQ ID NO:50) or EMCV (SEQ ID NO:49).
[0070] Viral 2A Site
[0071] The FMDV 2A protein is a short peptide that serves to separate the structural proteins of FMDV from a nonstructural protein (FMDV 2B). Early work on this peptide suggested that it acts as an autocatalytic protease, but other work (e.g., Donnelly et al., (2001), J. Gen. Virol. 82, 1013-1025) suggests that this short sequence and the following single amino acid of FMDV 2B (Gly) act as a translational stop-start. Regardless of the precise mode of action, the sequence can be inserted between two polypeptides to effect the production of multiple individual polypeptides from a single open reading frame. In the context of this invention, FMDV 2A sequences can be inserted between the sequences encoding at least two CMV proteins, allowing for their synthesis as part of a single open reading frame. For example, the open reading frame may encode a gH protein and a gL protein separated by a sequence encoding a viral 2A site. A single mRNA is transcribed; then, during the translation step, the gH and gL peptides are produced separately due to the activity of the viral 2A site. Any suitable viral 2A sequence may be used. Often, a viral 2A site comprises the consensus sequence Asp-Val/Ile-Glu-X-Asn-Pro-Gly-Pro, where X is any amino acid (SEQ ID NO: 2). For example, the Foot and Mouth Disease Virus 2A peptide sequence is DVESNPGP (SEQ ID NO: 3). See Trichas et al., "Use of the viral 2A peptide for bicistronic expression in transgenic mice," BMC Biol. 2008 Sep. 15; 6:40, and Halpin et al., "Self-processing 2A-polyproteins--a system for co-ordinate expression of multiple proteins in transgenic plants," Plant J. 1999 February; 17(4):453-9.
[0072] In some embodiments an alphavirus replicon is a chimeric replicon, such as a VEE-Sindbis chimeric replicon (VCR) or a VEE strain TC83 replicon (TC83R) or a TC83-Sindbis chimeric replicon (TC83CR). In some embodiments a VCR contains the packaging signal and 3' UTR from a Sindbis replicon in place of sequences in nsP3 and at the 3' end of the VEE replicon; see Perri et al., J. Virol. 77, 10394-403, 2003. In some embodiments, a TC83CR contains the packaging signal and 3' UTR from a Sindbis replicon in place of sequences in nsP3 and at the 3' end of a VEE strain TC83 replicon.
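The consensus sequence Asp-Val/Ile-Glu-X-Asn-Pro-Gly-Pro can be written in single-letter code as a simple pattern. The following Python sketch, offered only as an illustration, scans an amino acid sequence for that consensus using a regular expression. The function name and the toy polyprotein are hypothetical; the sketch reports where the motif occurs and makes no assumption about the exact position at which the two polypeptides are separated.

```python
import re
from typing import List, Tuple

# Asp-Val/Ile-Glu-X-Asn-Pro-Gly-Pro in single-letter code; X is any residue.
VIRAL_2A_CONSENSUS = re.compile(r"D[VI]E[A-Z]NPGP")


def find_2a_sites(protein: str) -> List[Tuple[int, str]]:
    """Return (0-based start, matched motif) for each consensus 2A site."""
    return [(match.start(), match.group())
            for match in VIRAL_2A_CONSENSUS.finditer(protein.upper())]


if __name__ == "__main__":
    # Hypothetical polyprotein: placeholder residues flanking the FMDV
    # 2A peptide sequence DVESNPGP (SEQ ID NO: 3).
    polyprotein = "MAAA" + "DVESNPGP" + "MKK"
    print(find_2a_sites(polyprotein))
```

A check of this kind could be used, for example, to confirm that a designed single open reading frame contains the intended 2A site between the two coding sequences.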
Producing VRPs
[0073] Methods of preparing VRPs are well known in the art. In some embodiments an alphavirus is assembled into a VRP using a packaging cell. An "alphavirus packaging cell" (or "packaging cell") is a cell that contains one or more alphavirus structural protein expression cassettes and that produces recombinant alphavirus particles after introduction of an alphavirus replicon, eukaryotic layered vector initiation system (e.g., U.S. Pat. No. 5,814,482), or recombinant alphavirus particle. The one or more different alphavirus structural protein cassettes serve as "helpers" by providing the alphavirus structural proteins. An "alphavirus structural protein cassette" is an expression cassette that encodes one or more alphavirus structural proteins and comprises at least one and up to five copies (i.e., 1, 2, 3, 4, or 5) of an alphavirus replicase recognition sequence. Structural protein expression cassettes typically comprise, from 5' to 3', a 5' sequence which initiates transcription of alphavirus RNA, an optional alphavirus subgenomic region promoter, a nucleotide sequence encoding the alphavirus structural protein, a 3' untranslated region (which also directs RNA transcription), and a polyA tract. See, e.g., WO 2010/019437.
[0074] In preferred embodiments two different alphavirus structural protein cassettes ("split" defective helpers) are used in a packaging cell to minimize recombination events which could produce a replication-competent virus. In some embodiments an alphavirus structural protein cassette encodes the capsid protein (C) but not either of the glycoproteins (E2 and E1). In some embodiments an alphavirus structural protein cassette encodes the capsid protein and either the E1 or E2 glycoproteins (but not both). In some embodiments an alphavirus structural protein cassette encodes the E2 and E1 glycoproteins but not the capsid protein. In some embodiments an alphavirus structural protein cassette encodes the E1 or E2 glycoprotein (but not both) and not the capsid protein.
[0075] In some embodiments, VRPs are produced by the simultaneous introduction of replicons and helper RNAs into cells of various sources. Under these conditions, for example, BHKV cells (1×10⁷) are electroporated at, for example, 220 volts, 1000 µF, 2 manual pulses, with 10 µg replicon RNA:6 µg defective helper Cap RNA:10 µg defective helper Gly RNA, and alphavirus-containing supernatant is collected ~24 hours later. Replicons and/or helpers can also be introduced in DNA forms which launch suitable RNAs within the transfected cells.
[0076] A packaging cell may be a mammalian cell or a non-mammalian cell, such as an insect (e.g., SF9) or avian cell (e.g., a primary chick or duck fibroblast or fibroblast cell line). See U.S. Pat. No. 7,445,924. Avian sources of cells include, but are not limited to, avian embryonic stem cells such as EB66® (VIVALIS); chicken cells, including chicken embryonic stem cells such as EBx® cells, chicken embryonic fibroblasts, and chicken embryonic germ cells; duck cells such as the AGE1.CR and AGE1.CR.pIX cell lines (ProBioGen), which are described, for example, in Vaccine 27:4975-4982 (2009) and WO2005/042728; and geese cells. In some embodiments, a packaging cell is a primary duck fibroblast or duck retinal cell line, such as AGE.CR (PROBIOGEN).
[0077] Mammalian sources of cells for simultaneous nucleic acid introduction and/or packaging cells include, but are not limited to, human or non-human primate cells, including PerC6 (PER.C6) cells (CRUCELL N.V.), which are described, for example, in WO 01/38362 and WO 02/40665, as well as deposited under ECACC deposit number 96022940; MRC-5 (ATCC CCL-171); WI-38 (ATCC CCL-75); fetal rhesus lung cells (ATCC CL-160); human embryonic kidney cells (e.g., 293 cells, typically transformed by sheared adenovirus type 5 DNA); VERO cells (from monkey kidneys); cells of horse, cow (e.g., MDBK cells), sheep, dog (e.g., MDCK cells from dog kidneys, ATCC CCL34 MDCK (NBL2) or MDCK 33016, deposit number DSM ACC 2219 as described in WO 97/37001), cat, and rodent (e.g., hamster cells such as BHK21-F, HKCC cells, or Chinese hamster ovary (CHO) cells), and may be obtained from a wide variety of developmental stages, including for example, adult, neonatal, fetal, and embryo.
[0078] In some embodiments a packaging cell is stably transformed with one or more structural protein expression cassette(s). Structural protein expression cassettes can be introduced into cells using standard recombinant DNA techniques, including transferrin-polycation-mediated DNA transfer, transfection with naked or encapsulated nucleic acids, liposome-mediated cellular fusion, intracellular transportation of DNA-coated latex beads, protoplast fusion, viral infection, electroporation, "gene gun" methods, and DEAE- or calcium phosphate-mediated transfection. Structural protein expression cassettes typically are introduced into a host cell as DNA molecules, but can also be introduced as in vitro-transcribed RNA. Each expression cassette can be introduced separately or substantially simultaneously.
[0079] In some embodiments, stable alphavirus packaging cell lines are used to produce recombinant alphavirus particles. These are alphavirus-permissive cells comprising DNA cassettes expressing the defective helper RNA stably integrated into their genomes. See Polo et al., Proc. Natl. Acad. Sci. USA 96, 4598-603, 1999. The helper RNAs are constitutively expressed but the alphavirus structural proteins are not, because the genes are under the control of an alphavirus subgenomic promoter (Polo et al., 1999). Upon introduction of an alphavirus replicon into the genome of a packaging cell by transfection or VRP infection, replicase enzymes are produced and trigger expression of the capsid and glycoprotein genes on the helper RNAs, and output VRPs are produced. Introduction of the replicon can be accomplished by a variety of methods, including both transfection and infection with a seed stock of alphavirus replicon particles. The packaging cell is then incubated under conditions and for a time sufficient to produce packaged alphavirus replicon particles in the culture supernatant.
[0080] Thus, packaging cells allow VRPs to act as self-propagating viruses. This technology allows VRPs to be produced in much the same manner, and using the same equipment, as that used for live attenuated vaccines or other viral vectors that have producer cell lines available, such as replication-incompetent adenovirus vectors grown in cells expressing the adenovirus E1A and E1B genes.
[0081] In some embodiments, a two-step process is used: the first step comprises producing a seed stock of alphavirus replicon particles by transfecting a packaging cell with a replicon RNA or plasmid DNA-based replicon. A much larger stock of replicon particles is then produced in a second step, by infecting a fresh culture of packaging cells with the seed stock. This infection can be performed using various multiplicities of infection (MOI), including an MOI=0.00001, 0.00005, 0.0001, 0.0005, 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1.0, 3, 5, 10 or 20. In some embodiments infection is performed at a low MOI (e.g., less than 1). Over time, replicon particles can be harvested from packaging cells infected with the seed stock. In some embodiments, replicon particles can then be passaged in yet larger cultures of naive packaging cells by repeated low-multiplicity infection, resulting in commercial scale preparations with the same high titer.
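The multiplicity of infection used in the second step relates the number of infectious replicon particles added to the number of packaging cells. As a minimal sketch, and assuming the conventional definition MOI = (infectious units added)/(number of cells) with the seed stock titer expressed in infectious units per mL, the following Python code computes the seed stock volume needed for a target MOI. The function names and the example cell number and titer are hypothetical and are not taken from the disclosure.

```python
def seed_volume_ml(target_moi: float, cell_count: float,
                   titer_iu_per_ml: float) -> float:
    """Volume of seed stock (mL) that delivers the target MOI.

    Assumes MOI = infectious units added / number of cells, and a titer
    given in infectious units (IU) per mL.
    """
    if target_moi <= 0 or cell_count <= 0 or titer_iu_per_ml <= 0:
        raise ValueError("all inputs must be positive")
    return target_moi * cell_count / titer_iu_per_ml


def achieved_moi(volume_ml: float, titer_iu_per_ml: float,
                 cell_count: float) -> float:
    """MOI obtained when a given volume of seed stock is added."""
    if volume_ml < 0 or titer_iu_per_ml <= 0 or cell_count <= 0:
        raise ValueError("volume must be non-negative; others positive")
    return volume_ml * titer_iu_per_ml / cell_count


if __name__ == "__main__":
    # Hypothetical values: 1e7 packaging cells, seed stock at 1e8 IU/mL.
    for moi in (0.001, 0.01, 0.1, 1.0):
        print(moi, seed_volume_ml(moi, 1e7, 1e8))
```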
Self-Replicating RNA Platforms
[0082] Two or more CMV proteins can be produced by expression of recombinant nucleic acids that encode the proteins in the cells of a subject. Preferably, the recombinant nucleic acid molecules encode two or more CMV proteins, e.g., are polycistronic. As defined above, "polycistronic" includes bicistronic. Preferred nucleic acids that can be administered to a subject to cause the production of CMV proteins are self-replicating RNA molecules. The self-replicating RNA molecules of the invention are based on the genomic RNA of RNA viruses, but lack the genes encoding one or more structural proteins. The self-replicating RNA molecules are capable of being translated to produce non-structural proteins of the RNA virus and CMV proteins encoded by the self-replicating RNA.
[0083] The self-replicating RNA generally contains at least one or more genes selected from the group consisting of viral replicase, viral proteases, viral helicases and other nonstructural viral proteins, and also comprise 5'- and 3'-end cis-active replication sequences, and a heterologous sequences that encodes two or more desired CMV proteins. A subgenomic promoter that directs expression of the heterologous sequence(s) can be included in the self-replicating RNA. If desired, a heterologous sequence may be fused in frame to other coding regions in the self-replicating RNA and/or may be under the control of an internal ribosome entry site (IRES).
[0084] Self-replicating RNA molecules of the invention can be designed so that the self-replicating RNA molecule cannot induce production of infectious viral particles. This can be achieved, for example, by omitting one or more viral genes encoding structural proteins that are necessary for the production of viral particles in the self-replicating RNA. For example, when the self-replicating RNA molecule is based on an alpha virus, such as Sindbis virus (SIN), Semliki forest virus and Venezuelan equine encephalitis virus (VEE), one or more genes encoding viral structural proteins, such as capsid and/or envelope glycoproteins, can be omitted. If desired, self-replicating RNA molecules of the invention can be designed to induce production of infectious viral particles that are attenuated or virulent, or to produce viral particles that are capable of a single round of subsequent infection.
[0085] A self-replicating RNA molecule can, when delivered to a vertebrate cell even without any proteins, lead to the production of multiple daughter RNAs by transcription from itself (or from an antisense copy of itself). The self-replicating RNA can be directly translated after delivery to a cell, and this translation provides a RNA-dependent RNA polymerase which then produces transcripts from the delivered RNA. Thus the delivered RNA leads to the production of multiple daughter RNAs. These transcripts are antisense relative to the delivered RNA and may be translated themselves to provide in situ expression of encoded CMV protein, or may be transcribed to provide further transcripts with the same sense as the delivered RNA which are translated to provide in situ expression of the encoded CMV protein(s).
[0086] One suitable system for achieving self-replication is to use an alphavirus-based RNA replicon, such as an alphavirus replicon as described herein. These +stranded replicons are translated after delivery to a cell to produce a replicase (or replicase-transcriptase). The replicase is translated as a polyprotein which auto cleaves to provide a replication complex which creates genomic -strand copies of the +strand delivered RNA. These -strand transcripts can themselves be transcribed to give further copies of the +stranded parent RNA and also to give rise to one or more subgenomic transcript which encodes two or more CMV proteins. Translation of the subgenomic transcript thus leads to in situ expression of the CMV protein(s) by the infected cell. Suitable alphavirus replicons can use a replicase from a Sindbis virus, a Semliki forest virus, an eastern equine encephalitis virus, a Venezuelan equine encephalitis virus, etc.
[0087] A preferred self-replicating RNA molecule thus encodes (i) a RNA-dependent RNA polymerase which can transcribe RNA from the self-replicating RNA molecule and (ii) two or more CMV proteins or fragments thereof. The polymerase can be an alphavirus replicase e.g. comprising alphavirus protein nsP4. Protein nsP4 is the key catalytic component of the replicase.
[0088] Whereas natural alphavirus genomes encode structural virion proteins in addition to the non-structural replicase polyprotein, it is preferred that an alphavirus based self-replicating RNA molecule of the invention does not encode all alphavirus structural proteins. Thus the self replicating RNA can lead to the production of genomic RNA copies of itself in a cell, but not to the production of RNA-containing alphavirus virions. The inability to produce these virions means that, unlike a wild-type alphavirus, the self-replicating RNA molecule cannot perpetuate itself in infectious form. The alphavirus structural proteins which are necessary for perpetuation in wild-type viruses are absent from self replicating RNAs of the invention and their place is taken by gene(s) encoding the desired gene product (CMV protein or fragment thereof), such that the subgenomic transcript encodes the desired gene product rather than the structural alphavirus virion proteins.
[0089] Thus a self-replicating RNA molecule useful with the invention have two sequences that encode different CMV proteins or fragments thereof. The sequences encoding the CMV proteins or fragments can be in any desired orientation, and can be operably linked to the same or separate promoters. If desired, the sequences encoding the CMV proteins or fragments can be part of a single open reading frame. In some embodiments the RNA may have one or more additional (downstream) sequences or open reading frames e.g. that encode other additional CMV proteins or fragments thereof. A self-replicating RNA molecule can have a 5' sequence which is compatible with the encoded replicase.
[0090] In one aspect, the self-replicating RNA molecule is derived from or based on an alphavirus, such as an alphavirus replicon as defined herein. In other aspects, the self-replicating RNA molecule is derived from or based on a virus other than an alphavirus, preferably, a positive-stranded RNA viruses, and more preferably a picornavirus, flavivirus, rubivirus, pestivirus, hepacivirus, calicivirus, or coronavirus. Suitable wild-type alphavirus sequences are well-known and are available from sequence depositories, such as the American Type Culture Collection, Rockville, Md. Representative examples of suitable alphaviruses include Aura (ATCC VR-368), Bebaru virus (ATCC VR-600, ATCC VR-1240), Cabassou (ATCC VR-922), Chikungunya virus (ATCC VR-64, ATCC VR-1241), Eastern equine encephalomyelitis virus (ATCC VR-65, ATCC VR-1242), Fort Morgan (ATCC VR-924), Getah virus (ATCC VR-369, ATCC VR-1243), Kyzylagach (ATCC VR-927), Mayaro virus (ATCC VR-66; ATCC VR-1277), Middleburg (ATCC VR-370), Mucambo virus (ATCC VR-580, ATCC VR-1244), Ndumu (ATCC VR-371), Pixuna virus (ATCC VR-372, ATCC VR-1245), Ross River virus (ATCC VR-373, ATCC VR-1246), Semliki Forest (ATCC VR-67, ATCC VR-1247), Sindbis virus (ATCC VR-68, ATCC VR-1248), Tonate (ATCC VR-925), Triniti (ATCC VR-469), Una (ATCC VR-374), Venezuelan equine encephalomyelitis (ATCC VR-69, ATCC VR-923, ATCC VR-1250 ATCC VR-1249, ATCC VR-532), Western equine encephalomyelitis (ATCC VR-70, ATCC VR-1251, ATCC VR-622, ATCC VR-1252), Whataroa (ATCC VR-926), and Y-62-33 (ATCC VR-375).
[0091] The self-replicating RNA molecules of the invention can contain one or more modified nucleotides and therefore have improved stability and be resistant to degradation and clearance in vivo, and other advantages. Without wishing to be bound by any particular theory, it is believed that self-replicating RNA molecules that contain modified nucleotides avoid or reduce stimulation of endosomal and cytoplasmic immune receptors when the self-replicating RNA is delivered into a cell. This permits self-replication, amplification and expression of protein to occur. This also reduces safety concerns relative to self-replicating RNA that does not contain modified nucleotides, because the self-replicating RNA that contains modified nucleotides reduce activation of the innate immune system and subsequent undesired consequences (e.g., inflammation at injection site, irritation at injection site, pain, and the like). It is also believed that the RNA molecules produced as a result of self-replication are recognized as foreign nucleic acids by the cytoplasmic immune receptors. Thus, self-replicating RNA molecules that contain modified nucleotides provide for efficient amplification of the RNA in a host cell and expression of CMV proteins, as well as adjuvant effects.
[0092] The RNA sequence can be modified with respect to its codon usage, for example, to increase translation efficacy and half-life of the RNA. A poly A tail (e.g., of about 30 adenosine residues or more) may be attached to the 3' end of the RNA to increase its half-life. The 5' end of the RNA may be capped with a modified ribonucleotide with the structure m7G (5') ppp (5') N (cap 0 structure) or a derivative thereof, which can be incorporated during RNA synthesis or can be enzymatically engineered after RNA transcription (e.g., by using Vaccinia Virus Capping Enzyme (VCE) consisting of mRNA triphosphatase, guanylyl-transferase and guanine-7-methytransferase, which catalyzes the construction of N7-monomethylated cap 0 structures). Cap 0 structure can provide stability and translational efficacy to the RNA molecule. The 5' cap of the RNA molecule may be further modified by a 2'-O-Methyltransferase which results in the generation of a cap 1 structure (m7Gppp [m2'-O] N), which may further increases translation efficacy. A cap 1 structure may also increase in vivo potency.
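For illustration, the poly A tail criterion mentioned above (e.g., about 30 adenosine residues or more) can be checked on a sequence string. The following Python sketch measures the uninterrupted run of adenosines at the 3' end of an RNA sequence written 5' to 3'. The function names, the default minimum of 30, and the toy sequences are assumptions of this sketch; a 5' cap is a chemical structure that is not represented in a plain sequence string and is therefore not modeled here.

```python
import re


def poly_a_tail_length(rna: str) -> int:
    """Length of the uninterrupted run of A residues at the 3' end."""
    match = re.search(r"A+$", rna.upper())
    return len(match.group()) if match else 0


def has_minimum_poly_a(rna: str, minimum: int = 30) -> bool:
    """True if the 3' poly A run has at least `minimum` residues."""
    return poly_a_tail_length(rna) >= minimum


if __name__ == "__main__":
    # Hypothetical toy transcript body followed by a 35-residue tail.
    transcript = "GGCUACGUUC" + "A" * 35
    print(poly_a_tail_length(transcript))
    print(has_minimum_poly_a(transcript))
```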
[0093] As used herein, "modified nucleotide" refers to a nucleotide that contains one or more chemical modifications (e.g., substitutions) in or on the nitrogenous base of the nucleoside (e.g., cytosine (C), thymine (T) or uracil (U), adenine (A) or guanine (G)). If desired, a self-replicating RNA molecule can contain chemical modifications in or on the sugar moiety of the nucleoside (e.g., ribose, deoxyribose, modified ribose, modified deoxyribose, six-membered sugar analog, or open-chain sugar analog), or the phosphate.
[0094] The self-replicating RNA molecules can contain at least one modified nucleotide that preferably is not part of the 5' cap (e.g., in addition to the modifications that are part of the 5' cap). Accordingly, the self-replicating RNA molecule can contain a modified nucleotide at a single position, can contain a particular modified nucleotide (e.g., pseudouridine, N6-methyladenosine, 5-methylcytidine, 5-methyluridine) at two or more positions, or can contain two, three, four, five, six, seven, eight, nine, ten or more modified nucleotides (e.g., each at one or more positions). Preferably, the self-replicating RNA molecules comprise modified nucleotides that contain a modification on or in the nitrogenous base, but do not contain modified sugar or phosphate moieties.
[0095] In some examples, between 0.001% and 99% or 100% of the nucleotides in a self-replicating RNA molecule are modified nucleotides. For example, 0.001%-25%, 0.01%-25%, 0.1%-25%, or 1%-25% of the nucleotides in a self-replicating RNA molecule are modified nucleotides.
[0096] In other examples, between 0.001% and 99% or 100% of a particular unmodified nucleotide in a self-replicating RNA molecule is replaced with a modified nucleotide. For example, about 1% of the nucleotides in the self-replicating RNA molecule that contain uridine can be modified, such as by replacement of uridine with pseudouridine. In other examples, the desired amount (percentage) of two, three, or four particular nucleotides (nucleotides that contain uridine, cytidine, guanosine, or adenine) in a self-replicating RNA molecule are modified nucleotides. For example, 0.001%-25%, 0.01%-25%, 0.1%-25%, or 1%-25% of a particular nucleotide in a self-replicating RNA molecule are modified nucleotides. In other examples, 0.001%-20%, 0.001%-15%, 0.001%-10%, 0.01%-20%, 0.01%-15%, 0.1%-25%, 0.01%-10%, 1%-20%, 1%-15%, 1%-10%, or about 5%, about 10%, about 15%, about 20% of a particular nucleotide in a self-replicating RNA molecule are modified nucleotides.
[0097] It is preferred that less than 100% of the nucleotides in a self-replicating RNA molecule are modified nucleotides. It is also preferred that less than 100% of a particular nucleotide in a self-replicating RNA molecule are modified nucleotides. Thus, preferred self-replicating RNA molecules comprise at least some unmodified nucleotides.
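The two kinds of percentage described above (modified nucleotides as a share of all nucleotides, and as a share of one particular nucleotide) can be illustrated with a minimal Python sketch. The one-letter symbols "P" and "M" for pseudouridine and 5-methylcytidine, the function names, and the default 0.001%-25% window are assumptions made for illustration only; they are not notation used in this disclosure.

```python
from collections import Counter

# Hypothetical one-letter symbols for modified nucleotides (illustration only).
MODIFIED_TO_PARENT = {
    "P": "U",  # pseudouridine, counted against uridine positions
    "M": "C",  # 5-methylcytidine, counted against cytidine positions
}


def modification_summary(seq):
    """Return (overall %, {parent nucleotide: % of its positions modified})."""
    counts = Counter(seq)
    total = len(seq)
    per_parent = {}
    for parent in set(MODIFIED_TO_PARENT.values()):
        # Positions carrying any modified form of this parent nucleotide.
        modified = sum(counts[m] for m, p in MODIFIED_TO_PARENT.items() if p == parent)
        pool = modified + counts[parent]
        per_parent[parent] = 100.0 * modified / pool if pool else 0.0
    modified_total = sum(counts[m] for m in MODIFIED_TO_PARENT)
    overall = 100.0 * modified_total / total if total else 0.0
    return overall, per_parent


def in_disclosed_range(percent, low=0.001, high=25.0):
    """Check a percentage against one of the example windows (default 0.001%-25%)."""
    return low <= percent <= high


# Toy 100-nucleotide sequence: 1 pseudouridine among 20 uridine-type positions.
toy = "P" + "U" * 19 + "A" * 20 + "G" * 20 + "C" * 40
overall_pct, by_parent = modification_summary(toy)
print(overall_pct, by_parent["U"], in_disclosed_range(overall_pct))
```

In the toy sequence the overall fraction and the uridine-specific fraction differ, which is why the text states the two percentages separately.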
[0098] There are more than 96 naturally occurring nucleoside modifications found on mammalian RNA. See, e.g., Limbach et al., Nucleic Acids Research, 22(12):2183-2196 (1994). The preparation of nucleotides and modified nucleotides and nucleosides is well known in the art, e.g. from U.S. Pat. Nos. 4,373,071, 4,458,066, 4,500,707, 4,668,777, 4,973,679, 5,047,524, 5,132,418, 5,153,319, 5,262,530, 5,700,642, all of which are incorporated herein by reference in their entirety, and many modified nucleosides and modified nucleotides are commercially available.
[0099] Modified nucleobases which can be incorporated into modified nucleosides and nucleotides and be present in the RNA molecules include: m5C (5-methylcytidine), m5U (5-methyluridine), m6A (N6-methyladenosine), s2U (2-thiouridine), Um (2'-O-methyluridine), m1A (1-methyladenosine); m2A (2-methyladenosine); Am (2'-O-methyladenosine); ms2m6A (2-methylthio-N6-methyladenosine); i6A (N6-isopentenyladenosine); ms2i6A (2-methylthio-N6-isopentenyladenosine); io6A (N6-(cis-hydroxyisopentenyl)adenosine); ms2io6A (2-methylthio-N6-(cis-hydroxyisopentenyl)adenosine); g6A (N6-glycinylcarbamoyladenosine); t6A (N6-threonylcarbamoyladenosine); ms2t6A (2-methylthio-N6-threonylcarbamoyladenosine); m6t6A (N6-methyl-N6-threonylcarbamoyladenosine); hn6A (N6-hydroxynorvalylcarbamoyladenosine); ms2hn6A (2-methylthio-N6-hydroxynorvalylcarbamoyladenosine); Ar(p) (2'-O-ribosyladenosine (phosphate)); I (inosine); m1I (1-methylinosine); m1Im (1,2'-O-dimethylinosine); m3C (3-methylcytidine); Cm (2'-O-methylcytidine); s2C (2-thiocytidine); ac4C (N4-acetylcytidine); f5C (5-formylcytidine); m5Cm (5,2'-O-dimethylcytidine); ac4Cm (N4-acetyl-2'-O-methylcytidine); k2C (lysidine); m1G (1-methylguanosine); m2G (N2-methylguanosine); m7G (7-methylguanosine); Gm (2'-O-methylguanosine); m22G (N2,N2-dimethylguanosine); m2Gm (N2,2'-O-dimethylguanosine); m22Gm (N2,N2,2'-O-trimethylguanosine); Gr(p) (2'-O-ribosylguanosine (phosphate)); yW (wybutosine); o2yW (peroxywybutosine); OHyW (hydroxywybutosine); OHyW* (undermodified hydroxywybutosine); imG (wyosine); mimG (methylguanosine); Q (queuosine); oQ (epoxyqueuosine); galQ (galactosyl-queuosine); manQ (mannosyl-queuosine); preQ0 (7-cyano-7-deazaguanosine); preQ1 (7-aminomethyl-7-deazaguanosine); G* (archaeosine); D (dihydrouridine); m5Um (5,2'-O-dimethyluridine); s4U (4-thiouridine); m5s2U (5-methyl-2-thiouridine); s2Um (2-thio-2'-O-methyluridine); acp3U (3-(3-amino-3-carboxypropyl)uridine); ho5U (5-hydroxyuridine); mo5U (5-methoxyuridine); cmo5U (uridine 5-oxyacetic acid); mcmo5U (uridine 5-oxyacetic acid methyl ester); chm5U (5-(carboxyhydroxymethyl)uridine); mchm5U (5-(carboxyhydroxymethyl)uridine methyl ester); mcm5U (5-methoxycarbonylmethyluridine); mcm5Um (5-methoxycarbonylmethyl-2'-O-methyluridine); mcm5s2U (5-methoxycarbonylmethyl-2-thiouridine); nm5s2U (5-aminomethyl-2-thiouridine); mnm5U (5-methylaminomethyluridine); mnm5s2U (5-methylaminomethyl-2-thiouridine); mnm5se2U (5-methylaminomethyl-2-selenouridine); ncm5U (5-carbamoylmethyluridine); ncm5Um (5-carbamoylmethyl-2'-O-methyluridine); cmnm5U (5-carboxymethylaminomethyluridine); cmnm5Um (5-carboxymethylaminomethyl-2'-O-methyluridine); cmnm5s2U (5-carboxymethylaminomethyl-2-thiouridine); m62A (N6,N6-dimethyladenosine); Im (2'-O-methylinosine); m4C (N4-methylcytidine); m4Cm (N4,2'-O-dimethylcytidine); hm5C (5-hydroxymethylcytidine); m3U (3-methyluridine); cm5U (5-carboxymethyluridine); m6Am (N6,2'-O-dimethyladenosine); m62Am (N6,N6,2'-O-trimethyladenosine); m2,7G (N2,7-dimethylguanosine); m2,2,7G (N2,N2,7-trimethylguanosine); m3Um (3,2'-O-dimethyluridine); m5D (5-methyldihydrouridine); f5Cm (5-formyl-2'-O-methylcytidine); m1Gm (1,2'-O-dimethylguanosine); m1Am (1,2'-O-dimethyladenosine); tm5U (5-taurinomethyluridine); tm5s2U (5-taurinomethyl-2-thiouridine); imG-14 (4-demethyl guanosine); imG2 (isoguanosine); ac6A (N6-acetyladenosine), hypoxanthine, inosine, 8-oxo-adenine, 7-substituted derivatives thereof, dihydrouracil, pseudouracil, 2-thiouracil, 4-thiouracil, 5-aminouracil, 5-(C₁-C₆)-alkyluracil, 5-methyluracil, 5-(C₂-C₆)-alkenyluracil, 5-(C₂-C₆)-alkynyluracil, 5-(hydroxymethyl)uracil, 5-chlorouracil, 5-fluorouracil, 5-bromouracil, 5-hydroxycytosine, 5-(C₁-C₆)-alkylcytosine, 5-methylcytosine, 5-(C₂-C₆)-alkenylcytosine, 5-(C₂-C₆)-alkynylcytosine, 5-chlorocytosine, 5-fluorocytosine, 5-bromocytosine, N2-dimethylguanine, 7-deazaguanine, 8-azaguanine, 7-deaza-7-substituted guanine, 7-deaza-7-(C₂-C₆)alkynylguanine, 7-deaza-8-substituted guanine, 8-hydroxyguanine, 6-thioguanine, 8-oxoguanine, 2-aminopurine, 2-amino-6-chloropurine, 2,4-diaminopurine, 2,6-diaminopurine, 8-azapurine, substituted 7-deazapurine, 7-deaza-7-substituted purine, 7-deaza-8-substituted purine, hydrogen (abasic residue), m5C, m5U, m6A, s2U, Ψ, or 2'-O-methyl-U. Any one or any combination of these modified nucleobases may be included in the self-replicating RNA of the invention. Many of these modified nucleobases and their corresponding ribonucleosides are available from commercial suppliers.
[0100] If desired, the self-replicating RNA molecule can contain phosphoramidate, phosphorothioate, and/or methylphosphonate linkages.
[0101] Self-replicating RNA molecules that comprise at least one modified nucleotide can be prepared using any suitable method. Several suitable methods are known in the art for producing RNA molecules that contain modified nucleotides. For example, a self-replicating RNA molecule that contains modified nucleotides can be prepared by transcribing (e.g., in vitro transcription) a DNA that encodes the self-replicating RNA molecule using a suitable DNA-dependent RNA polymerase, such as T7 phage RNA polymerase, SP6 phage RNA polymerase, T3 phage RNA polymerase, and the like, or mutants of these polymerases which allow efficient incorporation of modified nucleotides into RNA molecules. The transcription reaction will contain nucleotides and modified nucleotides, and other components that support the activity of the selected polymerase, such as a suitable buffer, and suitable salts. The incorporation of nucleotide analogs into a self-replicating RNA may be engineered, for example, to alter the stability of such RNA molecules, to increase resistance against RNases, to establish replication after introduction into appropriate host cells ("infectivity" of the RNA), and/or to induce or reduce innate and adaptive immune responses.
[0102] Suitable synthetic methods can be used alone, or in combination with one or more other methods (e.g., recombinant DNA or RNA technology), to produce a self-replicating RNA molecule that contains one or more modified nucleotides. Suitable methods for de novo synthesis are well-known in the art and can be adapted for particular applications. Exemplary methods include, for example, chemical synthesis using suitable protecting groups such as CEM (Masuda et al., (2007) Nucleic Acids Symposium Series 51:3-4), the β-cyanoethyl phosphoramidite method (Beaucage S L et al. (1981) Tetrahedron Lett 22:1859); nucleoside H-phosphonate method (Garegg P et al. (1986) Tetrahedron Lett 27:4051-4; Froehler B C et al. (1986) Nucl Acid Res 14:5399-407; Garegg P et al. (1986) Tetrahedron Lett 27:4055-8; Gaffney B L et al. (1988) Tetrahedron Lett 29:2619-22). These chemistries can be performed or adapted for use with automated nucleic acid synthesizers that are commercially available. Additional suitable synthetic methods are disclosed in Uhlmann et al. (1990) Chem Rev 90:544-84, and Goodchild J (1990) Bioconjugate Chem 1: 165. Nucleic acid synthesis can also be performed using suitable recombinant methods that are well-known and conventional in the art, including cloning, processing, and/or expression of polynucleotides and gene products encoded by such polynucleotides. DNA shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic polynucleotides are examples of known techniques that can be used to design and engineer polynucleotide sequences. Site-directed mutagenesis can be used to alter nucleic acids and the encoded proteins, for example, to insert new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, introduce mutations and the like. Suitable methods for transcription, translation and expression of nucleic acid sequences are known and conventional in the art. 
(See generally, Current Protocols in Molecular Biology, Vol. 2, Ed. Ausubel, et al., Greene Publish. Assoc. & Wiley Interscience, Ch. 13, 1988; Glover, DNA Cloning, Vol. II, IRL Press, Wash., D.C., Ch. 3, 1986; Bitter, et al., in Methods in Enzymology 153:516-544 (1987); The Molecular Biology of the Yeast Saccharomyces, Eds. Strathern et al., Cold Spring Harbor Press, Vols. I and II, 1982; and Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, 1989.)
[0103] The presence and/or quantity of one or more modified nucleotides in a self-replicating RNA molecule can be determined using any suitable method. For example, a self-replicating RNA can be digested to monophosphates (e.g., using nuclease P1) and dephosphorylated (e.g., using a suitable phosphatase such as CIAP), and the resulting nucleosides analyzed by reversed phase HPLC (e.g., using a YMC Pack ODS-AQ column (5 micron, 4.6 × 250 mm) and eluting with a gradient of 30% B (0-5 min) to 100% B (5-13 min), then at 100% B (13-40 min); flow rate 0.7 ml/min; UV detection at 260 nm; column temperature 30 °C; buffer A: 20 mM acetic acid-ammonium acetate pH 3.5; buffer B: 20 mM acetic acid-ammonium acetate pH 3.5/methanol [90/10]).
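The example gradient can be written as a simple time-to-composition function. The following Python sketch assumes one reading of the program that is not spelled out in the text: 30% B is held from 0 to 5 min, B then rises linearly to 100% between 5 and 13 min, and 100% B is held until 40 min. The function name is hypothetical.

```python
def percent_b(t_min):
    """Percent buffer B at time t_min (minutes) under the assumed gradient reading."""
    if not 0 <= t_min <= 40:
        raise ValueError("time is outside the 0-40 min run")
    if t_min <= 5:
        return 30.0  # assumed isocratic hold at 30% B
    if t_min <= 13:
        # assumed linear ramp from 30% B at 5 min to 100% B at 13 min
        return 30.0 + (100.0 - 30.0) * (t_min - 5) / (13 - 5)
    return 100.0  # hold at 100% B until the end of the run


for t in (0, 5, 9, 13, 40):
    print(t, percent_b(t))
```

If the intended program instead ramps from the start of the run, only the first two branches would need to change.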
[0104] The self-replicating RNA may be associated with a delivery system. The self-replicating RNA may be administered with or without an adjuvant.
RNA Delivery Systems
[0105] The self-replicating RNA described herein are suitable for delivery in a variety of modalities, such as naked RNA delivery or in combination with lipids, polymers or other compounds that facilitate entry into the cells. Self-replicating RNA molecules can be introduced into target cells or subjects using any suitable technique, e.g., by direct injection, microinjection, electroporation, lipofection, biolistics, and the like. The self-replicating RNA molecule may also be introduced into cells by way of receptor-mediated endocytosis. See e.g., U.S. Pat. No. 6,090,619; Wu and Wu, J. Biol. Chem., 263:14621 (1988); and Curiel et al., Proc. Natl. Acad. Sci. USA, 88:8850 (1991). For example, U.S. Pat. No. 6,083,741 discloses introducing an exogenous nucleic acid into mammalian cells by associating the nucleic acid to a polycation moiety (e.g., poly-L-lysine having 3-100 lysine residues (SEQ ID NO:4)), which is itself coupled to an integrin receptor-binding moiety (e.g., a cyclic peptide having the sequence Arg-Gly-Asp (SEQ ID NO:5)).
[0106] The self-replicating RNA molecules can be delivered into cells via amphiphiles. See e.g., U.S. Pat. No. 6,071,890. Typically, a nucleic acid molecule may form a complex with the cationic amphiphile. Mammalian cells contacted with the complex can readily take it up.
[0107] The self-replicating RNA can be delivered as naked RNA (e.g. merely as an aqueous solution of RNA) but, to enhance entry into cells and also subsequent intercellular effects, the self-replicating RNA is preferably administered in combination with a delivery system, such as a particulate or emulsion delivery system. A large number of delivery systems are well known to those of skill in the art. Such delivery systems include, for example liposome-based delivery (Debs and Zhu (1993) WO 93/24640; Mannino and Gould-Fogerite (1988) BioTechniques 6(7): 682-691; Rose U.S. Pat. No. 5,279,833; Brigham (1991) WO 91/06309; and Felgner et al. (1987) Proc. Natl. Acad. Sci. USA 84: 7413-7414), as well as use of viral vectors (e.g., adenoviral (see, e.g., Berns et al. (1995) Ann. NY Acad. Sci. 772: 95-104; Ali et al. (1994) Gene Ther. 1: 367-384; and Haddada et al. (1995) Curr. Top. Microbiol. Immunol. 199 (Pt 3): 297-306 for review), papillomaviral, retroviral (see, e.g., Buchscher et al. (1992) J. Virol. 66(5) 2731-2739; Johann et al. (1992) J. Virol. 66 (5): 1635-1640 (1992); Sommerfelt et al., (1990) Virol. 176:58-59; Wilson et al. (1989) J. Virol. 63:2374-2378; Miller et al., J. Virol. 65:2220-2224 (1991); Wong-Staal et al., PCT/US94/05700, and Rosenburg and Fauci (1993) in Fundamental Immunology, Third Edition Paul (ed) Raven Press, Ltd., New York and the references therein, and Yu et al., Gene Therapy (1994) supra.), and adeno-associated viral vectors (see, West et al. (1987) Virology 160:38-47; Carter et al. (1989) U.S. Pat. No. 4,797,368; Carter et al. WO 93/24641 (1993); Kotin (1994) Human Gene Therapy 5:793-801; Muzyczka (1994) J. Clin. Invst. 94:1351 and Samulski (supra) for an overview of AAV vectors; see also, Lebkowski, U.S. Pat. No. 5,173,414; Tratschin et al. (1985) Mol. Cell. Biol. 5(11):3251-3260; Tratschin, et al. (1984) Mol. Cell. Biol., 4:2072-2081; Hermonat and Muzyczka (1984) Proc. Natl. Acad. Sci. USA, 81:6466-6470; McLaughlin et al. 
(1988) and Samulski et al. (1989) J. Virol., 63:03822-3828), and the like.
[0108] Three particularly useful delivery systems are (i) liposomes, (ii) non-toxic and biodegradable polymer microparticles, and (iii) cationic submicron oil-in-water emulsions.
[0109] Liposomes
[0110] Various amphiphilic lipids can form bilayers in an aqueous environment to encapsulate an RNA-containing aqueous core as a liposome. These lipids can have an anionic, cationic or zwitterionic hydrophilic head group. Formation of liposomes from anionic phospholipids dates back to the 1960s, and cationic liposome-forming lipids have been studied since the 1990s. Some phospholipids are anionic whereas others are zwitterionic. Suitable classes of phospholipid include, but are not limited to, phosphatidylethanolamines, phosphatidylcholines, phosphatidylserines, and phosphatidylglycerols, and some useful phospholipids are listed in Table 2. Useful cationic lipids include, but are not limited to, dioleoyl trimethylammonium propane (DOTAP), 1,2-distearyloxy-N,N-dimethyl-3-aminopropane (DSDMA), 1,2-dioleyloxy-N,N-dimethyl-3-aminopropane (DODMA), 1,2-dilinoleyloxy-N,N-dimethyl-3-aminopropane (DLinDMA), 1,2-dilinolenyloxy-N,N-dimethyl-3-aminopropane (DLenDMA). Zwitterionic lipids include, but are not limited to, acyl zwitterionic lipids and ether zwitterionic lipids. Examples of useful zwitterionic lipids are DPPC, DOPC and dodecylphosphocholine. The lipids can be saturated or unsaturated.
[0111] Liposomes can be formed from a single lipid or from a mixture of lipids. A mixture may comprise (i) a mixture of anionic lipids, (ii) a mixture of cationic lipids, (iii) a mixture of zwitterionic lipids, (iv) a mixture of anionic lipids and cationic lipids, (v) a mixture of anionic lipids and zwitterionic lipids, (vi) a mixture of zwitterionic lipids and cationic lipids, or (vii) a mixture of anionic lipids, cationic lipids and zwitterionic lipids. Similarly, a mixture may comprise both saturated and unsaturated lipids. For example, a mixture may comprise DSPC (zwitterionic, saturated), DLinDMA (cationic, unsaturated), and/or DMPG (anionic, saturated). Where a mixture of lipids is used, not all of the component lipids in the mixture need to be amphiphilic e.g. one or more amphiphilic lipids can be mixed with cholesterol.
[0112] The hydrophilic portion of a lipid can be PEGylated (i.e. modified by covalent attachment of a polyethylene glycol). This modification can increase stability and prevent non-specific adsorption of the liposomes. For instance, lipids can be conjugated to PEG using techniques such as those disclosed in Heyes et al. (2005) J Controlled Release 107:276-87.
[0113] A mixture of DSPC, DLinDMA, PEG-DMG and cholesterol can be used to form liposomes. A separate aspect of the invention is a liposome comprising DSPC, DLinDMA, PEG-DMG and cholesterol. This liposome preferably encapsulates RNA, such as a self-replicating RNA e.g. encoding an immunogen.
[0114] Liposomes are usually divided into three groups: multilamellar vesicles (MLV); small unilamellar vesicles (SUV); and large unilamellar vesicles (LUV). MLVs have multiple bilayers in each vesicle, forming several separate aqueous compartments. SUVs and LUVs have a single bilayer encapsulating an aqueous core; SUVs typically have a diameter ≤50 nm, and LUVs have a diameter >50 nm. Liposomes useful with the invention are ideally LUVs with a diameter in the range of 50-220 nm. For a composition comprising a population of LUVs with different diameters: (i) at least 80% by number should have diameters in the range of 20-220 nm, (ii) the average diameter (Zav, by intensity) of the population is ideally in the range of 40-200 nm, and/or (iii) the diameters should have a polydispersity index <0.2.
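The three population criteria above can be checked independently, since the text joins them with "and/or". The following Python sketch is illustrative only: the function names are hypothetical, and the Z-average diameter and polydispersity index are assumed to be supplied as values already reported by a sizing instrument rather than computed from the diameter list.

```python
def fraction_in_range(diameters_nm, low=20.0, high=220.0):
    """Number fraction of particles with diameters within [low, high] nm."""
    if not diameters_nm:
        raise ValueError("no diameters supplied")
    inside = sum(1 for d in diameters_nm if low <= d <= high)
    return inside / len(diameters_nm)


def check_luv_population(diameters_nm, z_average_nm, pdi):
    """Evaluate criteria (i)-(iii) separately and return one flag per criterion."""
    return {
        "number_fraction_ok": fraction_in_range(diameters_nm) >= 0.80,  # (i)
        "z_average_ok": 40.0 <= z_average_nm <= 200.0,                  # (ii)
        "pdi_ok": pdi < 0.2,                                            # (iii)
    }


# Toy population: 8 of 10 diameters fall in 20-220 nm.
sizes = [15, 60, 90, 100, 120, 150, 180, 200, 210, 230]
print(check_luv_population(sizes, z_average_nm=140.0, pdi=0.12))
```

Returning one flag per criterion, rather than a single pass/fail value, mirrors the "and/or" wording and leaves the choice of which criteria to require to the user.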
[0115] Techniques for preparing suitable liposomes are well known in the art e.g. see Liposomes: Methods and Protocols, Volume 1: Pharmaceutical Nanocarriers: Methods and Protocols. (ed. Weissig). Humana Press, 2009. ISBN 160327359X; Liposome Technology, volumes I, II & III. (ed. Gregoriadis). Informa Healthcare, 2006; and Functional Polymer Colloids and Microparticles volume 4 (Microspheres, microcapsules & liposomes). (eds. Arshady & Guyot). Citus Books, 2002. One useful method involves mixing (i) an ethanolic solution of the lipids (ii) an aqueous solution of the nucleic acid and (iii) buffer, followed by mixing, equilibration, dilution and purification (Heyes et al. (2005) J Controlled Release 107:276-87.).
[0116] RNA is preferably encapsulated within the liposomes, and so the liposome forms an outer layer around an aqueous RNA-containing core. This encapsulation has been found to protect RNA from RNase digestion. The liposomes can include some external RNA (e.g. on the surface of the liposomes), but preferably, at least half of the RNA (and ideally substantially all of it) is encapsulated.
[0117] Polymeric Microparticles
[0118] Various polymers can form microparticles to encapsulate or adsorb RNA. The use of a substantially non-toxic polymer means that a recipient can safely receive the particles, and the use of a biodegradable polymer means that the particles can be metabolised after delivery to avoid long-term persistence. Useful polymers are also sterilisable, to assist in preparing pharmaceutical grade formulations.
[0119] Suitable non-toxic and biodegradable polymers include, but are not limited to, poly(α-hydroxy acids), polyhydroxy butyric acids, polylactones (including polycaprolactones), polydioxanones, polyvalerolactone, polyorthoesters, polyanhydrides, polycyanoacrylates, tyrosine-derived polycarbonates, polyvinyl-pyrrolidinones or polyester-amides, and combinations thereof.
[0120] In some embodiments, the microparticles are formed from poly(α-hydroxy acids), such as poly(lactides) ("PLA"), copolymers of lactide and glycolide such as poly(D,L-lactide-co-glycolide) ("PLG"), and copolymers of D,L-lactide and caprolactone. Useful PLG polymers include those having a lactide/glycolide molar ratio ranging, for example, from 20:80 to 80:20 e.g. 25:75, 40:60, 45:55, 55:45, 60:40, 75:25. Useful PLG polymers include those having a molecular weight between, for example, 5,000-200,000 Da e.g. between 10,000-100,000, 20,000-70,000, 40,000-50,000 Da.
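The relationship between a lactide/glycolide molar ratio and the corresponding weight composition, and between molecular weight and approximate chain length, can be sketched as follows. This Python example is an illustration under stated assumptions, not part of the disclosure: the repeat-unit masses (72.06 g/mol for the lactic unit C3H4O2 and 58.04 g/mol for the glycolic unit C2H2O2) are standard values supplied here, end groups are ignored, and all function names are hypothetical.

```python
LACTIC_UNIT_G_PER_MOL = 72.06    # C3H4O2 repeat unit (assumed standard value)
GLYCOLIC_UNIT_G_PER_MOL = 58.04  # C2H2O2 repeat unit (assumed standard value)


def lactide_mole_fraction(lactide, glycolide):
    """Mole fraction of lactide units for a ratio such as 75:25."""
    return lactide / (lactide + glycolide)


def ratio_in_example_range(lactide, glycolide):
    """True if the molar ratio lies between 20:80 and 80:20 inclusive."""
    return 0.20 <= lactide_mole_fraction(lactide, glycolide) <= 0.80


def lactide_weight_fraction(lactide, glycolide):
    """Weight fraction contributed by lactic repeat units."""
    la = lactide * LACTIC_UNIT_G_PER_MOL
    ga = glycolide * GLYCOLIC_UNIT_G_PER_MOL
    return la / (la + ga)


def approx_repeat_units(mw_da, lactide, glycolide):
    """Approximate number of repeat units per chain, ignoring end groups."""
    mean_unit = (lactide * LACTIC_UNIT_G_PER_MOL
                 + glycolide * GLYCOLIC_UNIT_G_PER_MOL) / (lactide + glycolide)
    return mw_da / mean_unit


print(ratio_in_example_range(75, 25))
print(round(lactide_weight_fraction(50, 50), 3))
print(round(approx_repeat_units(50_000, 50, 50)))
```

Because the lactic unit is heavier than the glycolic unit, the weight fraction of lactide is somewhat higher than its mole fraction at any given ratio.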
[0121] The microparticles ideally have a diameter in the range of 0.02 µm to 8 µm. For a composition comprising a population of microparticles with different diameters, at least 80% by number should have diameters in the range of 0.03-7 µm.
[0122] Techniques for preparing suitable microparticles are well known in the art e.g. see Functional Polymer Colloids and Microparticles volume 4 (Microspheres, microcapsules & liposomes). (eds. Arshady & Guyot). Citus Books, 2002; Polymers in Drug Delivery. (eds. Uchegbu & Schatzlein). CRC Press, 2006. (in particular chapter 7) and Microparticulate Systems for the Delivery of Proteins and Vaccines. (eds. Cohen & Bernstein). CRC Press, 1996. To facilitate adsorption of RNA, a microparticle may include a cationic surfactant and/or lipid e.g. as disclosed in O'Hagan et al. (2001) J Virology 75:9037-9043; and Singh et al. (2003) Pharmaceutical Research 20: 247-251. An alternative way of making polymeric microparticles is by molding and curing e.g. as disclosed in WO2009/132206.
[0123] Microparticles of the invention can have a zeta potential of between 40-100 mV. RNA can be adsorbed to the microparticles, and adsorption is facilitated by including cationic materials (e.g. cationic lipids) in the microparticle.
[0124] Oil-In-Water Cationic Emulsions
[0125] Oil-in-water emulsions are known for adjuvanting influenza vaccines e.g. the MF59™ adjuvant in the FLUAD™ product, and the AS03 adjuvant in the PREPANDRIX™ product. RNA delivery can be accomplished with the use of an oil-in-water emulsion, provided that the emulsion includes one or more cationic molecules. For instance, a cationic lipid can be included in the emulsion to provide a positively charged droplet surface to which negatively-charged RNA can attach.
[0126] The emulsion comprises one or more oils. Suitable oil(s) include those from, for example, an animal (such as fish) or a vegetable source. The oil is ideally biodegradable (metabolizable) and biocompatible. Sources for vegetable oils include nuts, seeds and grains. Peanut oil, soybean oil, coconut oil, and olive oil, the most commonly available, exemplify the nut oils. Jojoba oil can be used e.g. obtained from the jojoba bean. Seed oils include safflower oil, cottonseed oil, sunflower seed oil, sesame seed oil and the like. In the grain group, corn oil is the most readily available, but the oil of other cereal grains such as wheat, oats, rye, rice, teff, triticale and the like may also be used. 6-10 carbon fatty acid esters of glycerol and 1,2-propanediol, while not occurring naturally in seed oils, may be prepared by hydrolysis, separation and esterification of the appropriate materials starting from the nut and seed oils. Fats and oils from mammalian milk are metabolizable and so may be used. The procedures for separation, purification, saponification and other means necessary for obtaining pure oils from animal sources are well known in the art.
[0127] Most fish contain metabolizable oils which may be readily recovered. For example, cod liver oil, shark liver oils, and whale oil such as spermaceti exemplify several of the fish oils which may be used herein. A number of branched chain oils are synthesized biochemically in 5-carbon isoprene units and are generally referred to as terpenoids. Squalane, the saturated analog to squalene, can also be used. Fish oils, including squalene and squalane, are readily available from commercial sources or may be obtained by methods known in the art.
[0128] Other useful oils are the tocopherols, particularly in combination with squalene. Where the oil phase of an emulsion includes a tocopherol, any of the α, β, γ, δ, or ξ tocopherols can be used, but α-tocopherols are preferred. D-α-tocopherol and DL-α-tocopherol can both be used. A preferred α-tocopherol is DL-α-tocopherol. An oil combination comprising squalene and a tocopherol (e.g. DL-α-tocopherol) can be used.
[0129] Preferred emulsions comprise squalene, a shark liver oil which is a branched, unsaturated terpenoid (C₃₀H₅₀; [(CH₃)₂C[=CHCH₂CH₂C(CH₃)]₂=CHCH₂-]₂; 2,6,10,15,19,23-hexamethyl-2,6,10,14,18,22-tetracosahexaene; CAS RN 7683-64-9).
[0130] The oil in the emulsion may comprise a combination of oils e.g. squalene and at least one further oil.
[0131] The aqueous component of the emulsion can be plain water (e.g. w.f.i.) or can include further components e.g. solutes. For instance, it may include salts to form a buffer e.g. citrate or phosphate salts, such as sodium salts. Typical buffers include: a phosphate buffer; a Tris buffer; a borate buffer; a succinate buffer; a histidine buffer; or a citrate buffer. A buffered aqueous phase is preferred, and buffers will typically be included in the 5-20 mM range.
[0132] The emulsion also includes a cationic lipid. Preferably this lipid is a surfactant so that it can facilitate formation and stabilization of the emulsion. Useful cationic lipids generally contain a nitrogen atom that is positively charged under physiological conditions e.g. as a tertiary or quaternary amine. This nitrogen can be in the hydrophilic head group of an amphiphilic surfactant. Useful cationic lipids include, but are not limited to: 1,2-dioleoyloxy-3-(trimethylammonio)propane (DOTAP), 3'-[N-(N',N'-Dimethylaminoethane)-carbamoyl]Cholesterol (DC Cholesterol), dimethyldioctadecyl-ammonium (DDA e.g. the bromide), 1,2-Dimyristoyl-3-Trimethyl-AmmoniumPropane (DMTAP), dipalmitoyl(C16:0)trimethyl ammonium propane (DPTAP), distearoyltrimethylammonium propane (DSTAP). Other useful cationic lipids are: benzalkonium chloride (BAK), benzethonium chloride, cetrimide (which contains tetradecyltrimethylammonium bromide and possibly small amounts of dodecyltrimethylammonium bromide and hexadecyltrimethyl ammonium bromide), cetylpyridinium chloride (CPC), cetyl trimethylammonium chloride (CTAC), N,N',N'-polyoxyethylene (10)-N-tallow-1,3-diaminopropane, dodecyltrimethylammonium bromide, hexadecyltrimethyl-ammonium bromide, mixed alkyl-trimethyl-ammonium bromide, benzyldimethyldodecylammonium chloride, benzyldimethylhexadecyl-ammonium chloride, benzyltrimethylammonium methoxide, cetyldimethylethylammonium bromide, dimethyldioctadecyl ammonium bromide (DDAB), methylbenzethonium chloride, decamethonium chloride, methyl mixed trialkyl ammonium chloride, methyl trioctylammonium chloride, N,N-dimethyl-N-[2 (2-methyl-4-(1,1,3,3-tetramethylbutyl)-phenoxy]-ethoxy)ethyl]-benzenemethanaminium chloride (DEBDA), dialkyldimethylammonium salts, [1-(2,3-dioleyloxy)-propyl]-N,N,N-trimethylammonium chloride, 1,2-diacyl-3-(trimethylammonio)propane (acyl group=dimyristoyl, dipalmitoyl, distearoyl, dioleoyl), 1,2-diacyl-3-(dimethylammonio)propane (acyl group=dimyristoyl, dipalmitoyl, distearoyl, dioleoyl), 1,2-dioleoyl-3-(4'-trimethyl-ammonio)butanoyl-sn-glycerol, 1,2-dioleoyl 3-succinyl-sn-glycerol choline ester, cholesteryl (4'-trimethylammonio) butanoate, N-alkyl pyridinium salts (e.g. cetylpyridinium bromide and cetylpyridinium chloride), N-alkylpiperidinium salts, dicationic bolaform electrolytes (C12Me6; C12BU6), dialkylglycetylphosphorylcholine, lysolecithin, L-α dioleoylphosphatidylethanolamine, cholesterol hemisuccinate choline ester, lipopolyamines, including but not limited to dioctadecylamidoglycylspermine (DOGS), dipalmitoyl phosphatidylethanol-amidospermine (DPPES), lipopoly-L (or D)-lysine (LPLL, LPDL), poly(L (or D)-lysine) conjugated to N-glutarylphosphatidylethanolamine, didodecyl glutamate ester with pendant amino group (C^GluPhCnN), ditetradecyl glutamate ester with pendant amino group (C14GluCnN+), cationic derivatives of cholesterol, including but not limited to cholesteryl-3 β-oxysuccinamidoethylenetrimethylammonium salt, cholesteryl-3 β-oxysuccinamidoethylene-dimethylamine, cholesteryl-3 β-carboxyamidoethylenetrimethylammonium salt, and cholesteryl-3 β-carboxyamidoethylenedimethylamine. Other useful cationic lipids are described in US 2008/0085870 and US 2008/0057080, which are incorporated herein by reference. The cationic lipid is preferably biodegradable (metabolizable) and biocompatible.
[0133] In addition to the oil and cationic lipid, an emulsion can include a non-ionic surfactant and/or a zwitterionic surfactant. Such surfactants include, but are not limited to: the polyoxyethylene sorbitan ester surfactants (commonly referred to as the Tweens), especially polysorbate 20 and polysorbate 80; copolymers of ethylene oxide (EO), propylene oxide (PO), and/or butylene oxide (BO), sold under the DOWFAX™ tradename, such as linear EO/PO block copolymers; octoxynols, which can vary in the number of repeating ethoxy (oxy-1,2-ethanediyl) groups, with octoxynol-9 (Triton X-100, or t-octylphenoxypolyethoxyethanol) being of particular interest; (octylphenoxy)polyethoxyethanol (IGEPAL CA-630/NP-40); phospholipids such as phosphatidylcholine (lecithin); polyoxyethylene fatty ethers derived from lauryl, cetyl, stearyl and oleyl alcohols (known as Brij surfactants), such as triethyleneglycol monolauryl ether (Brij 30); polyoxyethylene-9-lauryl ether; and sorbitan esters (commonly known as the Spans), such as sorbitan trioleate (Span 85) and sorbitan monolaurate. Preferred surfactants for including in the emulsion are polysorbate 80 (Tween 80; polyoxyethylene sorbitan monooleate), Span 85 (sorbitan trioleate), lecithin and Triton X-100.
[0134] Mixtures of these surfactants can be included in the emulsion e.g. Tween 80/Span 85 mixtures, or Tween 80/Triton-X100 mixtures. A combination of a polyoxyethylene sorbitan ester such as polyoxyethylene sorbitan monooleate (Tween 80) and an octoxynol such as t-octylphenoxy-polyethoxyethanol (Triton X-100) is also suitable. Another useful combination comprises laureth 9 plus a polyoxyethylene sorbitan ester and/or an octoxynol. Useful mixtures can comprise a surfactant with a HLB value in the range of 10-20 (e.g. polysorbate 80, with a HLB of 15.0) and a surfactant with a HLB value in the range of 1-10 (e.g. sorbitan trioleate, with a HLB of 1.8).
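For mixtures of a high-HLB and a low-HLB surfactant, the HLB of the blend can be estimated. The following Python sketch uses the weight-averaging convention commonly applied to surfactant blends; that convention is an assumption introduced here for illustration and is not stated in this disclosure. Only the two HLB values (15.0 for polysorbate 80 and 1.8 for sorbitan trioleate) come from the text, and the function name is hypothetical.

```python
def blend_hlb(components):
    """Weight-averaged HLB of a blend given (weight, hlb) pairs.

    Assumes the common convention that blend HLB is the weight-fraction-weighted
    mean of the component HLB values.
    """
    total_weight = sum(weight for weight, _ in components)
    if total_weight <= 0:
        raise ValueError("total surfactant weight must be positive")
    return sum(weight * hlb for weight, hlb in components) / total_weight


POLYSORBATE_80_HLB = 15.0     # value given in the text
SORBITAN_TRIOLEATE_HLB = 1.8  # value given in the text

# Equal weights of the two surfactants (an arbitrary illustrative ratio).
print(round(blend_hlb([(1.0, POLYSORBATE_80_HLB), (1.0, SORBITAN_TRIOLEATE_HLB)]), 2))
```

Under this convention, shifting the weight ratio toward polysorbate 80 moves the blend HLB toward 15.0, and shifting it toward sorbitan trioleate moves it toward 1.8.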
[0135] Preferred amounts of oil (% by volume) in the final emulsion are between 2% and 20%, e.g. 5-15%, 6-14%, 7-13%, or 8-12%. A squalene content of about 4-6% or about 9-11% is particularly useful.
[0136] Preferred amounts of surfactants (% by weight) in the final emulsion are between 0.001% and 8%. For example: polyoxyethylene sorbitan esters (such as polysorbate 80) 0.2 to 4%, in particular between 0.4-0.6%, between 0.45-0.55%, about 0.5% or between 1.5-2%, between 1.8-2.2%, between 1.9-2.1%, about 2%, or 0.85-0.95%, or about 1%; sorbitan esters (such as sorbitan trioleate) 0.02 to 2%, in particular about 0.5% or about 1%; octyl- or nonylphenoxy polyoxyethanols (such as Triton X-100) 0.001 to 0.1%, in particular 0.005 to 0.02%; polyoxyethylene ethers (such as laureth 9) 0.1 to 8%, preferably 0.1 to 10% and in particular 0.1 to 1% or about 0.5%.
[0137] The absolute amounts of oil and surfactant, and their ratio, can be varied within wide limits while still forming an emulsion. A skilled person can easily vary the relative proportions of the components to obtain a desired emulsion, but a weight ratio of between 4:1 and 5:1 for oil and surfactant is typical (excess oil).
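The weight-ratio guideline above can be checked with simple arithmetic. The sketch below is illustrative only: the function names and the example masses are hypothetical, and the only values taken from the text are the 4:1 and 5:1 bounds.

```python
def oil_to_surfactant_ratio(oil_mass_mg, surfactant_mass_mg):
    """Return the oil:surfactant weight ratio as a single number (oil / surfactant)."""
    if oil_mass_mg <= 0 or surfactant_mass_mg <= 0:
        raise ValueError("masses must be positive")
    return oil_mass_mg / surfactant_mass_mg


def is_typical_ratio(ratio, low=4.0, high=5.0):
    """True if the ratio lies within the 4:1 to 5:1 range described as typical."""
    return low <= ratio <= high


# Hypothetical masses per ml of emulsion, chosen only to illustrate the check.
ratio = oil_to_surfactant_ratio(43.0, 10.0)
print(ratio, is_typical_ratio(ratio))  # 4.3 True
```

A ratio outside this range does not by itself mean that no emulsion forms, since the text notes that the proportions can be varied within wide limits.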
[0138] An important parameter for ensuring immunostimulatory activity of an emulsion, particularly in large animals, is the oil droplet size (diameter). The most effective emulsions have a droplet size in the submicron range. Suitably the droplet sizes will be in the range 50-750 nm. Most usefully the average droplet size is less than 250 nm, e.g. less than 200 nm or less than 150 nm. The average droplet size is usefully in the range of 80-180 nm. Ideally, at least 80% (by number) of the emulsion's oil droplets are less than 250 nm in diameter, and preferably at least 90%. Apparatuses for determining the average droplet size in an emulsion, and the size distribution, are commercially available. These typically use the techniques of dynamic light scattering and/or single-particle optical sensing, e.g. the Accusizer™ and Nicomp™ series of instruments available from Particle Sizing Systems (Santa Barbara, USA), or the Zetasizer™ instruments from Malvern Instruments (UK), or the Particle Size Distribution Analyzer instruments from Horiba (Kyoto, Japan).
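The size criteria above can be expressed as a short check on measured droplet diameters. The following Python sketch assumes, for illustration, that per-droplet diameters are available as a plain list (as single-particle sensing might provide) and uses a simple number-based mean; commercial instruments may report differently weighted averages. The function names and the sample values are hypothetical.

```python
from statistics import mean


def summarize_droplets(diameters_nm, cutoff_nm=250.0):
    """Return the number-based mean diameter and the fraction of droplets below the cutoff."""
    if not diameters_nm:
        raise ValueError("at least one diameter is required")
    below = sum(1 for d in diameters_nm if d < cutoff_nm)
    return {
        "mean_nm": mean(diameters_nm),
        "fraction_below_cutoff": below / len(diameters_nm),
    }


def meets_size_targets(summary, mean_range=(80.0, 180.0), min_fraction=0.80):
    """Check the 80-180 nm average and the 'at least 80% below 250 nm' criteria."""
    low, high = mean_range
    return (low <= summary["mean_nm"] <= high
            and summary["fraction_below_cutoff"] >= min_fraction)


# Hypothetical diameters in nm, for illustration only.
sample = [95, 110, 120, 130, 140, 150, 160, 175, 190, 260]
summary = summarize_droplets(sample)
print(summary["mean_nm"], summary["fraction_below_cutoff"], meets_size_targets(summary))
```

Passing `min_fraction=0.90` applies the stricter "preferably at least 90%" criterion instead.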
[0139] Ideally, the distribution of droplet sizes (by number) has only one maximum i.e. there is a single population of droplets distributed around an average (mode), rather than having two maxima. Preferred emulsions have a polydispersity of <0.4 e.g. 0.3, 0.2, or less.
[0140] Suitable emulsions with submicron droplets and a narrow size distribution can be obtained by the use of microfluidization. This technique reduces average oil droplet size by propelling streams of input components through geometrically fixed channels at high pressure and high velocity. These streams contact channel walls, chamber walls and each other. The resulting shear, impact and cavitation forces cause a reduction in droplet size. Repeated steps of microfluidization can be performed until an emulsion with a desired droplet size average and distribution is achieved.
[0141] As an alternative to microfluidization, thermal methods can be used to cause phase inversion. These methods can also provide a submicron emulsion with a tight particle size distribution.
[0142] Preferred emulsions can be filter sterilized, i.e. their droplets can pass through a 220 nm filter. As well as providing sterilization, this procedure also removes any large droplets in the emulsion.
[0143] In certain embodiments, the cationic lipid in the emulsion is DOTAP. The cationic oil-in-water emulsion may comprise from about 0.5 mg/ml to about 25 mg/ml DOTAP. For example, the cationic oil-in-water emulsion may comprise DOTAP at from about 0.5 mg/ml to about 25 mg/ml, from about 0.6 mg/ml to about 25 mg/ml, from about 0.7 mg/ml to about 25 mg/ml, from about 0.8 mg/ml to about 25 mg/ml, from about 0.9 mg/ml to about 25 mg/ml, from about 1.0 mg/ml to about 25 mg/ml, from about 1.1 mg/ml to about 25 mg/ml, from about 1.2 mg/ml to about 25 mg/ml, from about 1.3 mg/ml to about 25 mg/ml, from about 1.4 mg/ml to about 25 mg/ml, from about 1.5 mg/ml to about 25 mg/ml, from about 1.6 mg/ml to about 25 mg/ml, from about 1.7 mg/ml to about 25 mg/ml, from about 0.5 mg/ml to about 24 mg/ml, from about 0.5 mg/ml to about 22 mg/ml, from about 0.5 mg/ml to about 20 mg/ml, from about 0.5 mg/ml to about 18 mg/ml, from about 0.5 mg/ml to about 15 mg/ml, from about 0.5 mg/ml to about 12 mg/ml, from about 0.5 mg/ml to about 10 mg/ml, from about 0.5 mg/ml to about 5 mg/ml, from about 0.5 mg/ml to about 2 mg/ml, from about 0.5 mg/ml to about 1.9 mg/ml, from about 0.5 mg/ml to about 1.8 mg/ml, from about 0.5 mg/ml to about 1.7 mg/ml, from about 0.5 mg/ml to about 1.6 mg/ml, from about 0.6 mg/ml to about 1.6 mg/ml, from about 0.7 mg/ml to about 1.6 mg/ml, from about 0.8 mg/ml to about 1.6 mg/ml, about 0.5 mg/ml, about 0.6 mg/ml, about 0.7 mg/ml, about 0.8 mg/ml, about 0.9 mg/ml, about 1.0 mg/ml, about 1.1 mg/ml, about 1.2 mg/ml, about 1.3 mg/ml, about 1.4 mg/ml, about 1.5 mg/ml, about 1.6 mg/ml, about 12 mg/ml, about 18 mg/ml, about 20 mg/ml, about 21.8 mg/ml, about 24 mg/ml, etc. In an exemplary embodiment, the cationic oil-in-water emulsion comprises from about 0.8 mg/ml to about 1.6 mg/ml DOTAP, such as 0.8 mg/ml, 1.2 mg/ml, 1.4 mg/ml or 1.6 mg/ml.
[0144] In certain embodiments, the cationic lipid is DC Cholesterol. The cationic oil-in-water emulsion may comprise from about 0.1 mg/ml to about 5 mg/ml DC Cholesterol. For example, the cationic oil-in-water emulsion may comprise DC Cholesterol at from about 0.1 mg/ml to about 5 mg/ml, from about 0.2 mg/ml to about 5 mg/ml, from about 0.3 mg/ml to about 5 mg/ml, from about 0.4 mg/ml to about 5 mg/ml, from about 0.5 mg/ml to about 5 mg/ml, from about 0.62 mg/ml to about 5 mg/ml, from about 1 mg/ml to about 5 mg/ml, from about 1.5 mg/ml to about 5 mg/ml, from about 2 mg/ml to about 5 mg/ml, from about 2.46 mg/ml to about 5 mg/ml, from about 3 mg/ml to about 5 mg/ml, from about 3.5 mg/ml to about 5 mg/ml, from about 4 mg/ml to about 5 mg/ml, from about 4.5 mg/ml to about 5 mg/ml, from about 0.1 mg/ml to about 4.92 mg/ml, from about 0.1 mg/ml to about 4.5 mg/ml, from about 0.1 mg/ml to about 4 mg/ml, from about 0.1 mg/ml to about 3.5 mg/ml, from about 0.1 mg/ml to about 3 mg/ml, from about 0.1 mg/ml to about 2.46 mg/ml, from about 0.1 mg/ml to about 2 mg/ml, from about 0.1 mg/ml to about 1.5 mg/ml, from about 0.1 mg/ml to about 1 mg/ml, from about 0.1 mg/ml to about 0.62 mg/ml, about 0.15 mg/ml, about 0.3 mg/ml, about 0.6 mg/ml, about 0.62 mg/ml, about 0.9 mg/ml, about 1.2 mg/ml, about 2.46 mg/ml, about 4.92 mg/ml, etc. In an exemplary embodiment, the cationic oil-in-water emulsion comprises from about 0.62 mg/ml to about 4.92 mg/ml DC Cholesterol, such as 2.46 mg/ml.
[0145] In certain embodiments, the cationic lipid is DDA. The cationic oil-in-water emulsion may comprise from about 0.1 mg/ml to about 5 mg/ml DDA. For example, the cationic oil-in-water emulsion may comprise DDA at from about 0.1 mg/ml to about 5 mg/ml, from about 0.1 mg/ml to about 4.5 mg/ml, from about 0.1 mg/ml to about 4 mg/ml, from about 0.1 mg/ml to about 3.5 mg/ml, from about 0.1 mg/ml to about 3 mg/ml, from about 0.1 mg/ml to about 2.5 mg/ml, from about 0.1 mg/ml to about 2 mg/ml, from about 0.1 mg/ml to about 1.5 mg/ml, from about 0.1 mg/ml to about 1.45 mg/ml, from about 0.2 mg/ml to about 5 mg/ml, from about 0.3 mg/ml to about 5 mg/ml, from about 0.4 mg/ml to about 5 mg/ml, from about 0.5 mg/ml to about 5 mg/ml, from about 0.6 mg/ml to about 5 mg/ml, from about 0.73 mg/ml to about 5 mg/ml, from about 0.8 mg/ml to about 5 mg/ml, from about 0.9 mg/ml to about 5 mg/ml, from about 1.0 mg/ml to about 5 mg/ml, from about 1.2 mg/ml to about 5 mg/ml, from about 1.45 mg/ml to about 5 mg/ml, from about 2 mg/ml to about 5 mg/ml, from about 2.5 mg/ml to about 5 mg/ml, from about 3 mg/ml to about 5 mg/ml, from about 3.5 mg/ml to about 5 mg/ml, from about 4 mg/ml to about 5 mg/ml, from about 4.5 mg/ml to about 5 mg/ml, about 1.2 mg/ml, about 1.45 mg/ml, etc. Alternatively, the cationic oil-in-water emulsion may comprise DDA at about 20 mg/ml, about 21 mg/ml, about 21.5 mg/ml, about 21.6 mg/ml, about 25 mg/ml. In an exemplary embodiment, the cationic oil-in-water emulsion comprises from about 0.73 mg/ml to about 1.45 mg/ml DDA, such as 1.45 mg/ml.
[0146] Catheters or like devices may be used to deliver the self-replicating RNA molecules of the invention, as naked RNA or in combination with a delivery system, into a target organ or tissue. Suitable catheters are disclosed in, e.g., U.S. Pat. Nos. 4,186,745; 5,397,307; 5,547,472; 5,674,192; and 6,129,705, all of which are incorporated herein by reference.
[0147] The present invention includes the use of suitable delivery systems, such as liposomes, polymer microparticles or submicron emulsion microparticles with encapsulated or adsorbed self-replicating RNA, to deliver a self-replicating RNA molecule that encodes two or more CMV proteins, for example, to elicit an immune response alone, or in combination with another macromolecule. The invention includes liposomes, microparticles and submicron emulsions with adsorbed and/or encapsulated self-replicating RNA molecules, and combinations thereof.
[0148] The self-replicating RNA molecules associated with liposomes and submicron emulsion microparticles can be effectively delivered to a host cell, and can induce an immune response to the protein encoded by the self-replicating RNA.
[0149] Polycistronic self-replicating RNA molecules that encode CMV proteins, and VRPs produced using polycistronic alphavirus replicons, can be used to form CMV protein complexes in a cell. Complexes include, but are not limited to, gB/gH/gL; gH/gL; gH/gL/gO; gM/gN; gH/gL/UL128/UL130/UL131; and UL128/UL130/UL131.
[0150] In some embodiments combinations of VRPs are delivered to a cell. Combinations include, but are not limited to:
[0151] 1. a gH/gL VRP and another VRP;
[0152] 2. a gH/gL VRP and a gB VRP;
[0153] 3. a gH/gL/gO VRP and a gB VRP;
[0154] 4. a gB VRP and a gH/gL/UL128/UL130/UL131 VRP;
[0155] 5. a gB VRP and a UL128/UL130/UL131 VRP;
[0156] 6. a gB VRP and a gM/gN VRP;
[0157] 7. a gB VRP, a gH/gL VRP, and a UL128/UL130/UL131 VRP;
[0158] 8. a gB VRP, a gH/gL/gO VRP, and a UL128/UL130/UL131 VRP;
[0159] 9. a gB VRP, a gM/gN VRP, a gH/gL VRP, and a UL128/UL130/UL131 VRP;
[0160] 10. a gB VRP, a gM/gN VRP, a gH/gL/gO VRP, and a UL128/UL130/UL131 VRP;
[0161] 11. a gH/gL VRP and a UL128/UL130/UL131 VRP; and
[0162] In some embodiments combinations of self-replicating RNA molecules are delivered to a cell. Combinations include, but are not limited to:
[0163] 1. a self-replicating RNA molecule encoding gH/gL and a self-replicating RNA molecule encoding another protein;
[0164] 2. a self-replicating RNA molecule encoding gH and gL and a self-replicating RNA molecule encoding gB;
[0165] 3. a self-replicating RNA molecule encoding gH, gL and gO and a self-replicating RNA molecule encoding gB;
[0166] 4. a self-replicating RNA molecule encoding gB and a self-replicating RNA molecule encoding gH, gL, UL128, UL130 and UL131;
[0167] 5. a self-replicating RNA molecule encoding gB and a self-replicating RNA molecule encoding UL128, UL130 and UL131;
[0168] 6. a self-replicating RNA molecule encoding gB and a self-replicating RNA molecule encoding gM and gN;
[0169] 7. a self-replicating RNA molecule encoding gB, a self-replicating RNA molecule encoding gH and gL, and a self-replicating RNA molecule encoding UL128, UL130 and UL131;
[0170] 8. a self-replicating RNA molecule encoding gB, a self-replicating RNA molecule encoding gH, gL, and gO, and a self-replicating RNA molecule encoding UL128, UL130 and UL131;
[0171] 9. a self-replicating RNA molecule encoding gB, a self-replicating RNA molecule encoding gM and gN, a self-replicating RNA molecule encoding gH and gL, and a self-replicating RNA molecule encoding UL128, UL130 and UL131;
[0172] 10. a self-replicating RNA molecule encoding gB, a self-replicating RNA molecule encoding gM and gN, a self-replicating RNA molecule encoding gH, gL and gO, and a self-replicating RNA molecule encoding UL128, UL130 and UL131;
[0173] 11. a self-replicating RNA molecule encoding gH and gL, and a self-replicating RNA molecule encoding UL128, UL130 and UL131; and
CMV Proteins
[0174] Suitable CMV proteins include gB, gH, gL, and gO, and can be from any CMV strain. Other suitable CMV proteins include UL128, UL130 and UL131, and can be from any CMV strain. For example, CMV proteins can be from Merlin, AD169, VR1814, Towne, Toledo, TR, PH, TB40, or Fix strains of CMV. Exemplary CMV proteins and fragments are described herein. These proteins and fragments can be encoded by any suitable nucleotide sequence, including sequences that are codon optimized or deoptimized for expression in a desired host, such as a human cell. Exemplary sequences of CMV proteins and nucleic acids encoding the proteins are provided in Table 2.
TABLE 2
Full length gH polynucleotide (CMV gH FL)          SEQ ID NO: 31
Full length gH polypeptide (CMV gH FL)             SEQ ID NO: 32
Full length gL polynucleotide (CMV gL FL)          SEQ ID NO: 35
Full length gL polypeptide (CMV gL FL)             SEQ ID NO: 36
Full length gO polynucleotide (CMV gO FL)          SEQ ID NO: 41
Full length gO polypeptide (CMV gO FL)             SEQ ID NO: 42
gH sol polynucleotide (CMV gH sol)                 SEQ ID NO: 33
gH sol polypeptide (CMV gH sol)                    SEQ ID NO: 34
Full length UL128 polynucleotide (CMV UL128 FL)    SEQ ID NO: 43
Full length UL128 polypeptide (CMV UL128 FL)       SEQ ID NO: 44
Full length UL130 polynucleotide (CMV UL130 FL)    SEQ ID NO: 45
Full length UL130 polypeptide (CMV UL130 FL)       SEQ ID NO: 46
Full length UL131 polynucleotide (CMV UL131 FL)    SEQ ID NO: 47
Full length UL131 polypeptide (CMV UL131 FL)       SEQ ID NO: 48
Full length gB polynucleotide (CMV gB FL)          SEQ ID NO: 25
Full length gB polypeptide (CMV gB FL)             SEQ ID NO: 26
gB sol 750 polynucleotide (CMV gB 750)             SEQ ID NO: 27
gB sol 750 polypeptide (CMV gB 750)                SEQ ID NO: 28
gB sol 692 polynucleotide (CMV gB 692)             SEQ ID NO: 29
gB sol 692 polypeptide (CMV gB 692)                SEQ ID NO: 30
Full length gM polynucleotide (CMV gM FL)          SEQ ID NO: 37
Full length gM polypeptide (CMV gM FL)             SEQ ID NO: 38
Full length gN polynucleotide (CMV gN FL)          SEQ ID NO: 39
Full length gN polypeptide (CMV gN FL)             SEQ ID NO: 40
[0175] CMV gB Proteins
[0176] A gB protein can be full length or can omit one or more regions of the protein. Alternatively, fragments of a gB protein can be used. gB amino acids are numbered according to the full-length gB amino acid sequence (CMV gB FL) shown in SEQ ID NO: 26, which is 907 amino acids long. Suitable regions of a gB protein, which can be excluded from the full-length protein or included as fragments, include: the signal sequence (amino acids 1-24), a gB-DLD disintegrin-like domain (amino acids 57-146), a furin cleavage site (amino acids 459-460), a heptad repeat region (amino acids 679-693), a membrane spanning domain (amino acids 751-771), and a cytoplasmic domain from amino acids 771-906. In some embodiments a gB protein includes amino acids 67-86 (Neutralizing Epitope AD2) and/or amino acids 532-635 (Immunodominant Epitope AD1). Specific examples of gB fragments include "gB sol 692," which includes the first 692 amino acids of gB, and "gB sol 750," which includes the first 750 amino acids of gB. The signal sequence, amino acids 1-24, can be present or absent from gB sol 692 and gB sol 750 as desired. Optionally, the gB protein can be a gB fragment of 10 amino acids or longer. For example, the number of amino acids in the fragment can comprise 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, 600, 625, 650, 675, 700, 725, 750, 775, 800, 825, 850, or 875 amino acids. 
A gB fragment can begin at any of residue number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 
412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 620, 621, 622, 623, 624, 625, 626, 627, 628, 629, 630, 631, 632, 633, 634, 635, 636, 637, 638, 639, 640, 641, 642, 643, 644, 645, 646, 647, 648, 649, 650, 651, 652, 653, 654, 655, 656, 657, 658, 659, 660, 661, 662, 663, 664, 665, 666, 667, 668, 669, 670, 671, 672, 673, 674, 675, 676, 677, 678, 679, 680, 681, 682, 683, 684, 685, 686, 687, 688, 689, 690, 691, 692, 693, 694, 695, 696, 697, 698, 699, 700, 701, 702, 703, 704, 705, 706, 707, 708, 709, 710, 711, 712, 713, 714, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727, 728, 729, 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 782, 783, 784, 785, 786, 787, 788, 789, 790, 791, 792, 793, 794, 795, 796, 797, 798, 799, 800, 801, 802, 803, 804, 805, 806, 807, 808, 809, 810, 811, 
812, 813, 814, 815, 816, 817, 818, 819, 820, 821, 822, 823, 824, 825, 826, 827, 828, 829, 830, 831, 832, 833, 834, 835, 836, 837, 838, 839, 840, 841, 842, 843, 844, 845, 846, 847, 848, 849, 850, 851, 852, 853, 854, 855, 856, 857, 858, 859, 860, 861, 862, 863, 864, 865, 866, 867, 868, 869, 870, 871, 872, 873, 874, 875, 876, 877, 878, 879, 880, 881, 882, 883, 884, 885, 886, 887, 888, 889, 890, 891, 892, 893, 894, 895, 896, or 897.
[0177] Optionally, a gB fragment can extend further into the N-terminus by 5, 10, 20, or 30 amino acids from the starting residue of the fragment. Optionally, a gB fragment can extend further into the C-terminus by 5, 10, 20, or 30 amino acids from the last residue of the fragment.
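The residue numbering, named regions and fragment definitions for gB described above can be sketched in Python as follows. This is an illustration under stated assumptions: the placeholder string is NOT the sequence of SEQ ID NO: 26 (only its stated length of 907 is used), the names `GB_REGIONS` and `fragment` are hypothetical, and clipping an extension at the ends of the protein is a design choice made for this sketch rather than something stated in the text.

```python
GB_LENGTH = 907  # length of full-length gB (SEQ ID NO: 26) as stated in the text

# Region boundaries (1-based, inclusive) copied from the description of gB.
GB_REGIONS = {
    "signal sequence": (1, 24),
    "disintegrin-like domain": (57, 146),
    "furin cleavage site": (459, 460),
    "heptad repeat region": (679, 693),
    "membrane spanning domain": (751, 771),
    "neutralizing epitope AD2": (67, 86),
    "immunodominant epitope AD1": (532, 635),
}


def fragment(sequence, start, length, n_ext=0, c_ext=0):
    """Return (first, last, residues) for a fragment given in 1-based numbering.

    The fragment covers `length` residues from `start`, optionally extended by
    n_ext residues toward the N-terminus and c_ext toward the C-terminus.
    Coordinates are clipped to the ends of the sequence (an assumption).
    """
    if not 1 <= start <= len(sequence):
        raise ValueError("start residue is outside the sequence")
    if length < 1:
        raise ValueError("length must be at least 1")
    first = max(1, start - n_ext)
    last = min(len(sequence), start + length - 1 + c_ext)
    return first, last, sequence[first - 1:last]


# Placeholder of the right length; hypothetical, not the real gB sequence.
placeholder = ("ACDEFGHIKLMNPQRSTVWY" * 46)[:GB_LENGTH]

# "gB sol 692": the first 692 residues, with the signal sequence present.
first, last, residues = fragment(placeholder, 1, 692)
print(first, last, len(residues))  # 1 692 692

# The AD2 epitope (residues 67-86) extended by 10 residues on each side.
start, end = GB_REGIONS["neutralizing epitope AD2"]
print(fragment(placeholder, start, end - start + 1, n_ext=10, c_ext=10)[:2])  # (57, 96)
```

The same function applies to the other proteins once their full-length sequences are substituted, since each is numbered from its own full-length sequence.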
[0178] CMV gH Proteins
[0179] In some embodiments, a gH protein is a full-length gH protein (CMV gH FL, SEQ ID NO: 32, for example, which is a 742 amino acid protein). gH has a membrane spanning domain and a cytoplasmic domain extending from position 716 to position 743. Removing amino acids from 717 to 743 provides a soluble gH (e.g., CMV gH sol, SEQ ID NO: 34). In some embodiments the gH protein can be a gH fragment of 10 amino acids or longer. For example, the number of amino acids in the fragment can comprise 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, 600, 625, 650, 675, 700, or 725 amino acids. 
A gH fragment can begin at any of residue number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 
412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 620, 621, 622, 623, 624, 625, 626, 627, 628, 629, 630, 631, 632, 633, 634, 635, 636, 637, 638, 639, 640, 641, 642, 643, 644, 645, 646, 647, 648, 649, 650, 651, 652, 653, 654, 655, 656, 657, 658, 659, 660, 661, 662, 663, 664, 665, 666, 667, 668, 669, 670, 671, 672, 673, 674, 675, 676, 677, 678, 679, 680, 681, 682, 683, 684, 685, 686, 687, 688, 689, 690, 691, 692, 693, 694, 695, 696, 697, 698, 699, 700, 701, 702, 703, 704, 705, 706, 707, 708, 709, 710, 711, 712, 713, 714, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727, 728, 729, 730, 731, or 732.
[0180] gH residues are numbered according to the full-length gH amino acid sequence (CMV gH FL) shown in SEQ ID NO: 32. Optionally, a gH fragment can extend further into the N-terminus by 5, 10, 20, or 30 amino acids from the starting residue of the fragment. Optionally, a gH fragment can extend further into the C-terminus by 5, 10, 20, or 30 amino acids from the last residue of the fragment.
[0181] CMV gL Proteins
[0182] In some embodiments a gL protein is a full-length gL protein (CMV gL FL, SEQ ID NO: 36, for example, which is a 278 amino acid protein). In some embodiments a gL fragment can be used. For example, the number of amino acids in the fragment can comprise 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 225, or 250 amino acids. A gL fragment can begin at any of residue number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, or 268.
[0183] gL residues are numbered according to the full-length gL amino acid sequence (CMV gL FL) shown in SEQ ID NO: 36. Optionally, a gL fragment can extend further into the N-terminus by 5, 10, 20, or 30 amino acids from the starting residue of the fragment. Optionally, a gL fragment can extend further into the C-terminus by 5, 10, 20, or 30 amino acids from the last residue of the fragment.
[0184] CMV gO Proteins
[0185] In some embodiments, a gO protein is a full-length gO protein (CMV gO FL, SEQ ID NO: 42, for example, which is a 472 amino acid protein). In some embodiments the gO protein can be a gO fragment of 10 amino acids or longer. For example, the number of amino acids in the fragment can comprise 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, or 450 amino acids. A gO fragment can begin at any of residue number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323, 324, 325, 326, 
327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, or 462.
[0186] gO residues are numbered according to the full-length gO amino acid sequence (CMV gO FL) shown in SEQ ID NO: 42. Optionally, a gO fragment can extend further into the N-terminus by 5, 10, 20, or 30 amino acids from the starting residue of the fragment. Optionally, a gO fragment can extend further into the C-terminus by 5, 10, 20, or 30 amino acids from the last residue of the fragment.
[0187] CMV gM Proteins
[0188] In some embodiments, a gM protein is a full-length gM protein (CMV gM FL, SEQ ID NO: 38, for example, which is a 371 amino acid protein). In some embodiments the gM protein can be a gM fragment of 10 amino acids or longer. For example, the number of amino acids in the fragment can comprise 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, or 350 amino acids. A gM fragment can begin at any of residue number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 
331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, or 361.
[0189] gM residues are numbered according to the full-length gM amino acid sequence (CMV gM FL) shown in SEQ ID NO: 38. Optionally, a gM fragment can extend further into the N-terminus by 5, 10, 20, or 30 amino acids from the starting residue of the fragment. Optionally, a gM fragment can extend further into the C-terminus by 5, 10, 20, or 30 amino acids from the last residue of the fragment.
[0190] CMV gN Proteins
[0191] In some embodiments, a gN protein is a full-length gN protein (CMV gN FL, SEQ ID NO: 40, for example, which is a 135 amino acid protein). In some embodiments the gN protein can be a gN fragment of 10 amino acids or longer. For example, the number of amino acids in the fragment can comprise 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, or 125 amino acids. A gN fragment can begin at any of residue number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, or 125.
[0192] gN residues are numbered according to the full-length gN amino acid sequence (CMV gN FL) shown in SEQ ID NO: 40. Optionally, a gN fragment can extend further into the N-terminus by 5, 10, 20, or 30 amino acids from the starting residue of the fragment. Optionally, a gN fragment can extend further into the C-terminus by 5, 10, 20, or 30 amino acids from the last residue of the fragment.
[0193] CMV UL128 Proteins
[0194] In some embodiments, a UL128 protein is a full-length UL128 protein (CMV UL128 FL, SEQ ID NO: 44, for example, which is a 171 amino acid protein). In some embodiments the UL128 protein can be a UL128 fragment of 10 amino acids or longer. For example, the number of amino acids in the fragment can comprise 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, or 150 amino acids. A UL128 fragment can begin at any of residue number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, or 161.
[0195] UL128 residues are numbered according to the full-length UL128 amino acid sequence (CMV UL128 FL) shown in SEQ ID NO: 44. Optionally, a UL128 fragment can extend further into the N-terminus by 5, 10, 20, or 30 amino acids from the starting residue of the fragment. Optionally, a UL128 fragment can extend further into the C-terminus by 5, 10, 20, or 30 amino acids from the last residue of the fragment.
[0196] CMV UL130 Proteins
[0197] In some embodiments, a UL130 protein is a full-length UL130 protein (CMV UL130 FL, SEQ ID NO: 46, for example, which is a 214 amino acid protein). In some embodiments the UL130 protein can be a UL130 fragment of 10 amino acids or longer. For example, the number of amino acids in the fragment can comprise 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 175, or 200 amino acids. A UL130 fragment can begin at any of residue number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, or 204.
[0198] UL130 residues are numbered according to the full-length UL130 amino acid sequence (CMV UL130 FL) shown in SEQ ID NO: 46. Optionally, a UL130 fragment can extend further into the N-terminus by 5, 10, 20, or 30 amino acids from the starting residue of the fragment. Optionally, a UL130 fragment can extend further into the C-terminus by 5, 10, 20, or 30 amino acids from the last residue of the fragment.
[0199] CMV UL131 Proteins
[0200] In some embodiments, a UL131 protein is a full-length UL131 protein (CMV UL131 FL, SEQ ID NO: 48, for example, which is a 129 amino acid protein). In some embodiments the UL131 protein can be a UL131 fragment of 10 amino acids or longer. For example, the number of amino acids in the fragment can comprise 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, or 125 amino acids. A UL131 fragment can begin at any of residue number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, or 119.
[0201] UL131 residues are numbered according to the full-length UL131 amino acid sequence (CMV UL131 FL) shown in SEQ ID NO: 48. Optionally, a UL131 fragment can extend further into the N-terminus by 5, 10, 20, or 30 amino acids from the starting residue of the fragment. Optionally, a UL131 fragment can extend further into the C-terminus by 5, 10, 20, or 30 amino acids from the last residue of the fragment.
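The fragment conventions recited above for gM, gN, UL128, UL130, and UL131 can be illustrated with a short Python sketch. It is offered only as an illustration: the function names are hypothetical, and clipping an extended fragment to the ends of the full-length sequence is an assumption of the sketch rather than a statement of the disclosure. The protein lengths, the 10 amino acid minimum, and the highest listed starting residue (full length minus 10) are taken from the text.

```python
# Full-length protein sizes quoted in the text (amino acids).
FULL_LENGTH = {"gM": 371, "gN": 135, "UL128": 171, "UL130": 214, "UL131": 129}
MIN_FRAGMENT = 10  # fragments are "10 amino acids or longer"


def last_listed_start(protein):
    """Highest starting residue listed in the text: full length minus 10."""
    return FULL_LENGTH[protein] - MIN_FRAGMENT


def fragment_bounds(protein, start, length, n_ext=0, c_ext=0):
    """Return the 1-based inclusive (first, last) residues of a fragment.

    n_ext and c_ext model the optional 5, 10, 20, or 30 residue extensions
    toward the N-terminus and C-terminus. Clipping to the full-length
    sequence is an assumption of this sketch.
    """
    total = FULL_LENGTH[protein]
    if length < MIN_FRAGMENT:
        raise ValueError("fragment must be at least 10 amino acids long")
    if not 1 <= start <= last_listed_start(protein):
        raise ValueError("start residue is outside the listed range")
    first = max(1, start - n_ext)
    last = min(total, start + length - 1 + c_ext)
    return first, last


if __name__ == "__main__":
    print(last_listed_start("gM"))                              # 361
    print(fragment_bounds("gN", 20, 50))                        # (20, 69)
    print(fragment_bounds("gN", 20, 50, n_ext=30, c_ext=10))    # (1, 79)
```

Residues are numbered from 1, in keeping with the full-length reference sequences cited for each protein.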
[0202] As stated above, the invention relates to recombinant polycistronic nucleic acid molecules that contain a first sequence encoding a first herpesvirus protein or fragment thereof, and a second sequence encoding a second herpesvirus protein or fragment thereof. Accordingly, the foregoing description of certain preferred embodiments, such as alphavirus VRPs and self-replicating RNAs that contain sequences encoding two or more CMV proteins or fragments thereof, is illustrative of the invention but does not limit the scope of the invention. It will be appreciated that the sequences encoding CMV proteins in such preferred embodiments, can be replaced with sequences encoding proteins, such as gH and gL, or fragments thereof that are 10 amino acids long or longer, from other herpesviruses such as HHV-1, HHV-2, HHV-3, HHV-4, HHV-6, HHV-7 and HHV-8. For example, suitable VZV (HHV-3) proteins include gB, gE, gH, gI, and gL, and fragments thereof that are 10 amino acids long or longer, and can be from any VZV strain. For example, VZV proteins or fragments thereof can be from pOka, Dumas, HJO, CA123, or DR strains of VZV. These exemplary VZV proteins and fragments thereof can be encoded by any suitable nucleotide sequence, including sequences that are codon optimized or deoptimized for expression in a desired host, such as a human cell. Exemplary sequences of VZV proteins are provided herein.
[0203] For example, in one embodiment, the polycistronic nucleic acid molecule contains a first sequence encoding a VZV gH protein or fragment thereof, and a second sequence encoding a VZV gL protein or fragment thereof.
[0204] In some embodiments, each of the sequences encoding a herpes virus protein or fragment that are present in the polycistronic nucleic acid molecule is operably linked to its own control elements. For example, each sequence encoding a herpes virus protein or fragment is operably linked to its own subgenomic promoter. Thus the polycistronic nucleic acid molecule, such as an alphavirus replicon, can contain two, three, four or five subgenomic promoters, each of which controls expression of a herpes virus protein or fragment. When this type of polycistronic nucleic acid molecule is a self-replicating RNA, such as an alphavirus replicon, it can be packaged as a VRP, or associated or formulated with an RNA delivery system.
Methods and Uses
[0205] In some embodiments, self-replicating RNA molecules or VRPs are administered to an individual to stimulate an immune response. In such embodiments, self-replicating RNA molecules or VRPs typically are present in a composition which may comprise a pharmaceutically acceptable carrier and, optionally, an adjuvant. See, e.g., U.S. Pat. Nos. 6,299,884; 7,641,911; 7,306,805; and US 2007/0207090.
[0206] The immune response can comprise a humoral immune response, a cell-mediated immune response, or both. In some embodiments an immune response is induced against each delivered CMV protein. A cell-mediated immune response can comprise a Helper T-cell (Th) response, a CD8+ cytotoxic T-cell (CTL) response, or both. In some embodiments the immune response comprises a humoral immune response, and the antibodies are neutralizing antibodies. Neutralizing antibodies block viral infection of cells. CMV infects epithelial cells and also fibroblast cells. In some embodiments the immune response reduces or prevents infection of both cell types. Neutralizing antibody responses can be complement-dependent or complement-independent. In some embodiments the neutralizing antibody response is complement-independent. In some embodiments the neutralizing antibody response is cross-neutralizing; i.e., an antibody generated against an administered composition neutralizes a CMV virus of a strain other than the strain used in the composition.
[0207] A useful measure of antibody potency in the art is "50% neutralization titer." To determine 50% neutralization titer, serum from immunized animals is diluted to assess how dilute serum can be yet retain the ability to block entry of 50% of viruses into cells. For example, a titer of 700 means that serum retained the ability to neutralize 50% of virus after being diluted 700-fold. Thus, higher titers indicate more potent neutralizing antibody responses. In some embodiments, this titer is in a range having a lower limit of about 200, about 400, about 600, about 800, about 1000, about 1500, about 2000, about 2500, about 3000, about 3500, about 4000, about 4500, about 5000, about 5500, about 6000, about 6500, or about 7000. The 50% neutralization titer range can have an upper limit of about 400, about 600, about 800, about 1000, about 1500, about 2000, about 2500, about 3000, about 3500, about 4000, about 4500, about 5000, about 5500, about 6000, about 6500, about 7000, about 8000, about 9000, about 10000, about 11000, about 12000, about 13000, about 14000, about 15000, about 16000, about 17000, about 18000, about 19000, about 20000, about 21000, about 22000, about 23000, about 24000, about 25000, about 26000, about 27000, about 28000, about 29000, or about 30000. For example, the 50% neutralization titer can be about 3000 to about 6500. "About" means plus or minus 10% of the recited value. Neutralization titer can be measured as described in the specific examples, below.
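As a purely illustrative sketch of how a 50% neutralization titer might be read off a serum dilution series, the following Python code interpolates on a log-dilution scale between the two dilutions that bracket 50% neutralization. The interpolation method, the function names, and the example numbers are assumptions made for illustration; the disclosure itself specifies only that the titer is the dilution at which 50% of virus is still neutralized, and that "about" means plus or minus 10% of the recited value.

```python
import math


def percent_neutralization(gfp_with_serum, gfp_virus_only):
    """Percent reduction in GFP-positive cells relative to virus alone."""
    return 100.0 * (1.0 - gfp_with_serum / gfp_virus_only)


def nt50(dilutions, neutralization):
    """Estimate the reciprocal dilution giving 50% neutralization.

    dilutions: reciprocal serum dilutions (e.g. 320 for 1:320).
    neutralization: percent neutralization measured at each dilution.
    Uses log-linear interpolation (an assumption of this sketch) and
    returns None if 50% is not bracketed by the data.
    """
    pairs = sorted(zip(dilutions, neutralization))
    for (d_lo, n_lo), (d_hi, n_hi) in zip(pairs, pairs[1:]):
        if n_lo >= 50.0 >= n_hi:
            if n_lo == n_hi:
                return float(d_lo)
            frac = (n_lo - 50.0) / (n_lo - n_hi)
            log_d = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_d
    return None


def about(value, recited, tol=0.10):
    """True if value lies within plus or minus 10% of the recited value."""
    return abs(value - recited) <= tol * recited


if __name__ == "__main__":
    # Hypothetical dilution series, not data from the examples.
    titer = nt50([320, 640, 1280, 2560], [90.0, 75.0, 50.0, 20.0])
    print(round(titer))        # 1280
    print(about(650, 600))     # True
```

A higher value returned by `nt50` corresponds to serum that can be diluted further while still neutralizing half of the virus, which is the sense in which higher titers indicate more potent responses.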
[0208] An immune response can be stimulated by administering VRPs or self-replicating RNA to an individual, typically a mammal, including a human. In some embodiments the immune response induced is a protective immune response, i.e., the response reduces the risk or severity of CMV infection. Stimulating a protective immune response is particularly desirable in populations that are at elevated risk from CMV infection and disease. For example, at-risk populations include solid organ transplant (SOT) patients, bone marrow transplant patients, and hematopoietic stem cell transplant (HSCT) patients. VRPs can be administered to a transplant donor pre-transplant, or a transplant recipient pre- and/or post-transplant. Because vertical transmission from mother to child is a common source of infection in infants, administering VRPs or self-replicating RNA to a woman who can become pregnant is particularly useful.
[0209] Any suitable route of administration can be used. For example, a composition can be administered intra-muscularly, intra-peritoneally, sub-cutaneously, or trans-dermally. Some embodiments will be administered through an intra-mucosal route such as intra-orally, intra-nasally, intra-vaginally, and intra-rectally. Compositions can be administered according to any suitable schedule.
[0210] All patents, patent applications, and references cited in this disclosure, including nucleotide and amino acid sequences referred to by accession number, are expressly incorporated herein by reference. The above disclosure is a general description. A more complete understanding can be obtained by reference to the following specific examples, which are provided for purposes of illustration only.
Example 1
Delivery of Individual CMV Antigens Using a VRP Platform
[0211] Each of CMV glycoproteins gB and gH induces neutralizing responses, and gB is the dominant antigen among antibodies in human sera that neutralize infection of fibroblasts (Britt et al. (1990) J. Virol. 64(3):1079-85). The following experiments demonstrate in mice a neutralizing response against these antigens delivered using a VRP platform.
[0212] Each CMV antigen was cloned into a pcDNA-6His vector (Invitrogen) and tested for protein expression before cloning into an alphavirus replicon vector, pVCR 2.1 SalI/XbaI, derived from the plasmid described by Perri et al. (J. Virol. 77(19):10394-10403 (2003)), producing the constructs shown in FIG. 2. pVCR 2.1 SalI/XbaI is a self-replicating RNA vector that, when electroporated with defective helper capsid and glycoprotein RNA, forms an infectious alphavirus particle.
[0213] pVCR vectors were used to make RNA which was electroporated into baby hamster kidney (BHKV) cells in the presence of defective helper capsid and glycoprotein RNAs derived from Venezuelan equine encephalitis virus (VEE). After electroporation, the supernatant containing secreted alphavirus vector particles (VRPs) was collected, purified, titered, and used for mouse immunization studies. Mice were immunized with 1×10⁶ infectious units (IU)/mouse in a series of two immunizations, three weeks apart. The terminal bleed was three weeks after the second immunization.
[0214] Monocistronic gB, gH and gL VRPs
[0215] Two different versions of soluble gB were constructed: "gB sol 750" lacks the transmembrane spanning domain and cytoplasmic domain; and "gB sol 692" also lacks a hydrophobic region (FIG. 2A) and is similar to the Reap et al. construct. A soluble gH which lacks the transmembrane spanning domain and cytoplasmic domain ("gH sol 716") was also constructed (FIG. 2C). Sera from immunized mice were screened in several assays. Immunoblot (data not shown) and immunofluorescence assays were used to confirm specific antibody responses to the antigens. Neutralization assays were used to demonstrate that the elicited antibody responses were able to neutralize CMV infection.
[0216] Sera from immunized mice were examined by immunofluorescence for recognition of gB in 293T cells transfected with constructs expressing gB-6His. Cells were probed with either anti-His antibodies ("anti-6His"), a monoclonal gB antibody ("anti-gB 27-156"), or collected pooled mouse sera. Pre-immune serum was negative in all cases. In cells transfected with constructs expressing gB FL-6His, fixed, and permeabilized, anti-6His staining revealed an expression pattern of surface expression with a punctate cytoplasmic pattern most likely corresponding to the endocytic/exocytic trafficking pathway. Both anti-gB 27-156 and the pooled mouse sera showed a similar expression pattern. Sera from mice immunized with each of gB FL VRPs, gB sol 750 VRPs, and gB sol 692 VRPs showed the same expression pattern.
[0217] Mice immunized with gH FL VRPs and gH sol 716 VRPs produced antibodies specific to gH. Immunofluorescence analysis of 293T cells transfected with constructs expressing gH FL-6His detected strong recognition of gH by anti-6His, anti-gH, and pooled mouse sera. Sera collected from mice immunized with gL VRPs produced a specific antibody response as determined by immunoblot analysis and immunofluorescence. gL VRPs failed to elicit a neutralizing response.
[0218] Sera from mice immunized with gB VRPs or gH VRPs were analyzed for the presence of neutralizing antibodies using a CMV neutralization assay. Sera at various dilutions were pre-incubated with CMV virus TB40UL32EGFP ("TB40-GFP," a clinical isolate engineered to express GFP) and then added to ARPE-19 epithelial cells and incubated for 5 days. At 5 days post-infection, the GFP-positive cells were counted. In this assay, cells incubated with serum containing neutralizing antibodies have fewer GFP-positive cells compared to cells incubated with virus alone or with virus incubated with pre-immune sera. Sera from mice immunized with gB VRPs, gB FL VRPs, gB sol 750 VRPs, or gB sol 692 VRPs had strong neutralizing activity in the presence of guinea pig complement (50% neutralization titer at a serum dilution of 1:1280-1:2560; FIG. 3). Sera from mice immunized with gH FL VRPs or gH sol VRPs had some neutralizing activity that was independent of guinea pig complement (FIG. 3).
Example 2
Construction of Polycistronic Alphavirus Vectors
[0219] CMV produces several multi-protein complexes during infection. To determine whether a single replicon expressing all components of a desired complex can be used to produce the CMV complex in a subject, or whether components of the complex could be co-delivered from multiple replicon vectors, we designed a platform that allows controlled expression of multiple CMV proteins.
[0220] An alphavirus vector (pVCR 2.1 SalI/XbaI) was modified to allow assembly of multiple subgenomic promoters (SGP) and genes of interest (GOI). The pVCR 2.1 SalI/XbaI ApaI site at 11026-31 bp was changed from GGGCCC (SEQ ID NO: 7) to GGCGCC (SEQ ID NO: 8). ClaI and PmlI restriction sites were added in the region immediately downstream of the first subgenomic promoter and SalI-XbaI insert sites. The sequence at 7727-7754 bp was changed from ctcgatgtacttccgaggaactgatgtg (SEQ ID NO: 9) to ATCGATGTACTTCCGAGGAACTCACGTG (SEQ ID NO: 10).
[0221] A shuttling vector system was designed to allow insertion of a GOI directly downstream of a SGP using the SalI-XbaI sites. pcDNA 3.1 (-) C was modified as follows. Three SalI sites were deleted: positions 1046-1051 bp, 3332-3337 bp and 5519-21, 1-3 bp from GTCGAC (SEQ ID NO: 11) to GTCTAC (SEQ ID NO: 12). pcDNA 3.1 (-) C was modified to mutate an XbaI site at position 916-921 bp from TCTAGA (SEQ ID NO: 13) to TCAAGA (SEQ ID NO: 14). pcDNA 3.1 (-) C was modified to add a ClaI site and SacII site at positions 942-947 (ClaI) and 950-955 (SacII) bp from ctggatatctgcag (SEQ ID NO: 15) to ATCGATATCCGCGG (SEQ ID NO: 16).
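The sequence changes recited for the pVCR 2.1 and pcDNA 3.1 (-) C vectors can be checked with a short Python sketch that looks for the relevant recognition sequences before and after each change. The helper names are hypothetical and the sketch considers only the six enzymes named in this example; the before and after sequences are copied from the text, and the recognition sequences are the standard ones for these enzymes.

```python
# Recognition sequences of the enzymes named in this example.
SITES = {
    "ApaI": "GGGCCC",
    "ClaI": "ATCGAT",
    "PmlI": "CACGTG",
    "SalI": "GTCGAC",
    "XbaI": "TCTAGA",
    "SacII": "CCGCGG",
}


def sites_in(seq):
    """Return the names of the listed enzymes whose site occurs in seq."""
    seq = seq.upper()
    return {name for name, site in SITES.items() if site in seq}


def point_changes(before, after):
    """List (position, old base, new base) for equal-length sequences."""
    return [
        (i + 1, b, a)
        for i, (b, a) in enumerate(zip(before.upper(), after.upper()))
        if b != a
    ]


# (before, after) pairs as recited in the text.
CHANGES = {
    "pVCR ApaI removal": ("GGGCCC", "GGCGCC"),
    "pVCR ClaI/PmlI addition": (
        "ctcgatgtacttccgaggaactgatgtg",
        "ATCGATGTACTTCCGAGGAACTCACGTG",
    ),
    "pcDNA SalI removal": ("GTCGAC", "GTCTAC"),
    "pcDNA XbaI removal": ("TCTAGA", "TCAAGA"),
    "pcDNA ClaI/SacII addition": ("ctggatatctgcag", "ATCGATATCCGCGG"),
}

if __name__ == "__main__":
    for label, (before, after) in CHANGES.items():
        print(label, sorted(sites_in(before)), "->", sorted(sites_in(after)),
              point_changes(before, after))
```

In each case a one- to five-base substitution either destroys an existing site or creates the new sites without changing the length of the region, so downstream coordinates are unaffected.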
[0222] Once the restriction sites were added and the resulting sequence was verified, the region from bp 7611-7689 (ctataactctctacggctaacctgaatggactacgacatagtctagtcgaccaagcctctagacggcgcgcccaccca) (SEQ ID NO: 17) was amplified from the modified pVCR 2.1 alphavirus vector using the following primers:
TABLE-US-00003
Forward SGP S-X Not F: (SEQ ID NO: 18) 5' ATAAGAATGCGGCCGCCTATAACTCTCTACGGCTAACC 3'
Reverse SGP S-X Cla R: (SEQ ID NO: 19) 5' CCATCGATTGGGTGGGCGCGCCGTCTAG 3'
or
Forward SGP S-X Cla F: (SEQ ID NO: 20) 5' CCATCGATCTATAACTCTCTACGGCTAACC 3'
and
Reverse SGP S-X Sac R: (SEQ ID NO: 21) 5' TCCCCGCGGTGGGTGGGCGCGCCGTCTAG 3'.
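The relationship between the four primers and the amplified region can be illustrated with the following Python sketch, which finds the 3' portion of each primer that matches the end of the SEQ ID NO: 17 region and checks that the remaining 5' tail carries the restriction site implied by the primer name. The function names are hypothetical, and reading the primer names as NotI, ClaI, and SacII tails is an inference from the cloning sites described in the text; the sequences are copied from the text.

```python
# Region amplified from the modified pVCR 2.1 vector (SEQ ID NO: 17).
TEMPLATE = (
    "ctataactctctacggctaacctgaatggactacgacatagtctagtcgaccaagcctctagacggc"
    "gcgcccaccca"
).upper()

# name: (sequence, is_reverse_primer, restriction site expected in the 5' tail)
PRIMERS = {
    "SGP S-X Not F": ("ATAAGAATGCGGCCGCCTATAACTCTCTACGGCTAACC", False, "GCGGCCGC"),
    "SGP S-X Cla R": ("CCATCGATTGGGTGGGCGCGCCGTCTAG", True, "ATCGAT"),
    "SGP S-X Cla F": ("CCATCGATCTATAACTCTCTACGGCTAACC", False, "ATCGAT"),
    "SGP S-X Sac R": ("TCCCCGCGGTGGGTGGGCGCGCCGTCTAG", True, "CCGCGG"),
}


def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def annealing_length(primer, template, reverse):
    """Length of the primer's 3' portion that matches the template end.

    A forward primer is compared with the start of the top strand; a
    reverse primer is compared with the start of the reverse complement.
    """
    target = revcomp(template) if reverse else template.upper()
    best = 0
    for n in range(1, len(primer) + 1):
        if target.startswith(primer[-n:]):
            best = n
    return best


def tail(primer, template, reverse):
    """The 5' part of the primer that does not anneal to the template."""
    return primer[: len(primer) - annealing_length(primer, template, reverse)]


if __name__ == "__main__":
    for name, (seq, is_rev, site) in PRIMERS.items():
        n = annealing_length(seq, TEMPLATE, is_rev)
        print(name, "anneals over", n, "nt; tail has site:",
              site in tail(seq, TEMPLATE, is_rev))
```

The amplified region itself retains the SalI (GTCGAC) and XbaI (TCTAGA) sites, so each amplified cassette carries a subgenomic promoter followed by the SalI-XbaI insertion sites and is flanked by the tail sites used for cloning.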
[0223] The amplified regions were added into the modified pcDNA 3.1(-)C vector to make shuttling vectors (pcDNA SV) between appropriate sites (NotI-ClaI or ClaI-SacII). Insertion of the NotI-SGP Sal-Xba-ClaI forms pcDNA SV cassette 2; insertion of the ClaI-SGP Sal-Xba-SacII forms pcDNA SV cassette 3. These SV cassettes were sequenced. The pcDNA SV cassette 2 contains an additional 12 bp between the XbaI site and the ClaI site (CCACTGTGATCG) (SEQ ID NO: 22) because the ClaI site was not cut in the pcDNA SV cassette 2 vector. A PmlI site was therefore added. For pcDNA SV cassette 2, the PmlI site was inserted at bp 1012 (CACGTG) (SEQ ID NO: 23). For cassette 3, the PmlI site was added at bp 935-940 (ACTGTG (SEQ ID NO: 24) was changed to CACGTG (SEQ ID NO: 23)).
[0224] For each polycistronic vector the first gene was inserted directly into the pVCR 2.1 modified vector using the SalI-XbaI sites. The second gene was ligated into pcDNA SV cassette 2 using SalI-XbaI and either excised using NotI-PmlI or NotI-SacII, or amplified by PCR using primers for NotI-ClaI and digested using NotI and ClaI. The resulting insert SGP-SalI-GOI-XbaI was ligated into the modified pVCR 2.1 vector using NotI-PmlI, NotI-SacII, or NotI-ClaI sites. The NotI-ClaI insert was used only when a desired gene in the construct contained a PmlI site.
[0225] In some cases a third gene was ligated into pcDNA SV cassette 3 using SalI-XbaI and either excised using PmlI-SacII, or amplified by PCR using primers for ClaI-SacII and digested using ClaI and SacII. The resulting insert SGP-SalI-GOI-XbaI was ligated into the modified pVCR 2.1 using PmlI-SacII or ClaI-SacII.
[0226] SalI-XbaI digestion was used to validate construction of the polycistronic vector DNA. After digestion with SalI-XbaI, agarose gel electrophoresis was performed to confirm the presence of the GOIs. The polycistronic vector DNA was then linearized with PmeI overnight, purified using Qiagen's PCR purification kit, and used as template to make RNA using the Ambion mMessage mMachine kit. RNA quality was checked by running a sample aliquot on an RNA agarose gel.
[0227] Expression from a Polycistronic Vector
[0228] Fluorescent proteins GFP (green fluorescent protein) and mCherry (red fluorescent protein) were used as the GOIs to assess the ability of the polycistronic vector to express two proteins. We prepared a bicistronic vector in which GFP would be expressed using a first subgenomic promoter and mCherry would be expressed from a second subgenomic promoter (FIG. 4A). Polynucleotides containing coding sequences for these proteins were inserted using SalI-XbaI sites. The first polynucleotide (GFP) was inserted directly into the modified alphavirus replicon vector. The second polynucleotide (mCherry) was inserted first into a shuttling vector that contains a subgenomic promoter directly upstream of the coding sequence. A fragment containing both the second subgenomic promoter and the second polynucleotide was isolated and ligated into the modified alphavirus replicon vector containing the first polynucleotide, providing an alphavirus replicon with multiple subgenomic promoters.
[0229] VRPs were produced in BHKV cells by electroporating replicon RNAs with defective helper RNAs for Cap and Gly. The VRPs were harvested 24 hours after electroporation and used to infect BHKV cells at a multiplicity of infection (MOI) of 20 infectious units (IU) per cell.
[0230] The experiment tested four sets of VRPs: one VRP expressing only GFP; one VRP expressing only mCherry; a co-infection with one VRP expressing only GFP and one VRP expressing only mCherry, both at MOI of 20 IU/cell; and one VRP containing the bicistronic vector GFP(1)-SGPmCherry(2). VRP-infected BHKV cells were examined 24 hours post-infection to determine percent of colocalization. Nearly all the cells were positive for GFP or mCherry when singly infected. Cells infected with two separate VRPs appeared either green or red. Very few cells were yellow, indicating that few cells expressed GFP and mCherry at equal levels and that there was a low level of co-infection. These data were confirmed using FACS analysis (FIG. 4B).
[0231] In contrast, cells infected with alphavirus containing the bicistronic vector GFP(1)-SGPmCherry(2) were all yellow, which indicates approximately equal expression of GFP and mCherry. This study demonstrates that multiple proteins can be expressed successfully from a single polycistronic alphavirus replicon vector.
Example 3
Production of CMV Complexes
[0232] This example demonstrates that CMV protein complexes can be formed in a cell after delivery of the complex components from a polycistronic alphavirus replicon vector.
[0233] gH/gL and gH/gL/gO Complexes
[0234] Polycistronic gH/gL and gH/gL/gO alphavirus replicons were constructed as described above (shown schematically in FIG. 5A). VRPs containing gH, gL, gO, gH/gL and gH/gL/gO encoding replicons were produced in BHKV cells as described above and used to infect BHKV cells to demonstrate complex formation in vitro. VRP infected ARPE-19 cells produced disulfide linked complexes of gH/gL. gO did not detectably alter gH/gL association (FIG. 5B).
[0235] Immunofluorescence studies were conducted to evaluate the localization of gH and gL delivered alone and when delivered using a polycistronic alphavirus to look at relocalization of the proteins when co-expressed. gH localization did not appear to change in the presence or absence of gL, or gL/gO. gL localization did change when in the presence of gH and gH/gO.
[0236] Finally, gH/gL association was examined via immunoprecipitation. A commercial gH antibody (Genway) was used to investigate the association of gH and gL. In all cases, the gH antibody efficiently immunoprecipitated gH (FIG. 5C). When no gH was present, gL was not immunoprecipitated. When gL was expressed in the presence of gH or gH/gO, there was association of gL with gH (FIG. 5C).
[0237] The relocalization of gL in the presence of gH and the association of gH/gL (with or without gO) indicates that all components of the polycistronic alphavirus replicons were expressed and associated to form a complex.
Example 4
[0238] VRPs that Effect gH/gL Complex Formation In Vitro Induce Potent Immune Response to CMV which is Qualitatively and Quantitatively Superior to the Immune Response Elicited to gB VRPs.
[0239] This example demonstrates the induction of robust immune responses to complexes formed by delivering polycistronic gH/gL VRPs or gH/gL/gO VRPs compared with immune responses obtained using VRPs delivering single components or single-component VRPs administered in combination or to responses elicited by gB VRPs.
[0240] Mice were infected three times with VRPs administered 3 weeks apart (10⁶ IU per mouse; 5 BALB/c mice/group). Sera collected from immunizations with single and polycistronic VRPs were screened for neutralizing antibodies using a CMV neutralization assay as described above. Neutralization titer was measured as follows. Various dilutions of sera were pre-incubated with TB40-UL32-EGFP in the presence or absence of guinea pig complement and then added to ARPE-19 epithelial cells or MRC-5 fibroblast cells and incubated for 5 days. After 5 days infection with the virus, GFP-positive cells were counted. Results for the ARPE-19 cells are shown in FIG. 6A, FIG. 6B, and FIG. 6C. Results for the MRC-5 cells are shown in FIG. 7A and FIG. 7B.
[0241] Sera from mice immunized with gH FL VRPs had low complement-independent neutralizing activity (FIG. 6A and FIG. 6B). No neutralizing activity was observed using sera from mice immunized with only gL or gO in the presence or absence of guinea pig complement (FIG. 6C). Pooled sera from immunization with several CMV gB proteins (gB FL, gB sol 750, and gB sol 692) demonstrated strong neutralizing activity in the presence of guinea pig complement, with a 50% neutralization titer at 1:1280 sera dilution. However, there was no neutralizing activity in the absence of guinea pig complement in ARPE-19 cells for the pooled gB sera. VRPs expressing single CMV proteins (gH- or gL-VRPs or co-administering gH-, gL-, and gO-VRPs at 10⁶ IU/mouse/VRP) did not enhance neutralizing activity beyond that of gH alone.
[0242] In contrast, sera from mice immunized with bicistronic gH/gL or tricistronic gH/gL/gO VRPs (1×10⁶ IU/mouse) demonstrated robust neutralizing responses. Moreover, the responses were similar in the presence and absence of guinea pig complement, showing that polycistronic VRPs successfully induced a complement-independent immune response (FIG. 6C). The 50% neutralization titer was 1:3500-6400+ sera dilution in ARPE-19 cells with TB40-GFP CMV virus. This titer is approximately 3- to 4-fold higher than the 50% complement-dependent neutralization titer for gB pooled sera.
[0243] Results in the MRC-5 fibroblast cells were similar to those in ARPE-19 cells (FIGS. 7A and 7B). Sera from mice immunized with bicistronic gH/gL or tricistronic gH/gL/gO VRPs demonstrated strong neutralizing activity compared to sera from mice immunized with VRPs encoding gH alone, gL alone, or gO alone and to sera from mice immunized by coadministration of gH VRPs and gL VRPs, or coadministration of gH VRPs, gL VRPs, and gO VRPs. These results demonstrate that administration of the polycistronic VRPs induced an immune response that provides good complement-independent neutralization of CMV infection of fibroblast cells. To assess the breadth and potency of the gH/gL immune sera against different strains of CMV, we compared the ability of the sera to block infection of fibroblasts and epithelial cells with six different strains of CMV. FIG. 8 shows that the gH/gL sera potently neutralize infection of both cell types with a broad range of strains.
[0244] These data also demonstrate strong neutralizing activity for sera from mice immunized with the polycistronic VRPs but not with mixed pools of VRPs expressing only one protein. This shows that polycistronic replicons that encode the components of a protein complex on a single replicon result in efficient production of the complex in situ. Moreover, because Merlin strain CMV proteins were used to stimulate these responses, the in vitro data obtained using TB40 strain CMV virus demonstrate that the neutralizing antibodies induced by delivery of the polycistronic VRPs are cross-neutralizing antibodies.
Example 5
[0245] RNA Synthesis
[0246] Plasmid DNA encoding alphavirus replicons (see FIGS. 14-16) served as a template for synthesis of RNA in vitro. Alphavirus replicons contain the genetic elements required for RNA replication but lack those encoding gene products necessary for particle assembly; the structural genes of the alphavirus genome are replaced by sequences encoding a heterologous protein. Upon delivery of the replicons to eukaryotic cells, the positive-stranded RNA is translated to produce four non-structural proteins, which together replicate the genomic RNA and transcribe abundant subgenomic mRNAs encoding the heterologous gene product or gene of interest (GOI). Due to the lack of expression of the alphavirus structural proteins, replicons are incapable of inducing the generation of infectious particles. A bacteriophage (T7 or SP6) promoter upstream of the alphavirus cDNA facilitates the synthesis of the replicon RNA in vitro and the hepatitis delta virus (HDV) ribozyme immediately downstream of the poly(A)-tail generates the correct 3'-end through its self-cleaving activity.
[0247] In order to allow the formation of an antigenic protein complex, the expression of the individual components of said complex in the same cell is of paramount importance. In theory, this can be accomplished by co-transfecting cells with the genes encoding the individual components. However, in case of non-virally or VRP delivered alphavirus replicon RNAs, this strategy is hampered by inefficient co-delivery of multiple RNAs to the same cell or, alternatively, by inefficient launch of multiple self-replicating RNAs in an individual cell. A potentially more efficient way to facilitate co-expression of components of a protein complex is to deliver the respective genes as part of the same self-replicating RNA molecule. To this end, we engineered alphavirus replicon constructs encoding multiple genes of interest. Every GOI is preceded by its own subgenomic promoter which is recognized by the alphavirus transcription machinery. Thereby, multiple subgenomic messenger RNA species are synthesized in an individual cell allowing the assembly of multi-component protein complexes.
[0248] Following linearization of the plasmid DNA downstream of the HDV ribozyme with a suitable restriction endonuclease, run-off transcripts were synthesized in vitro using T7 bacteriophage derived DNA-dependent RNA polymerase. Transcriptions were performed for 2 hours at 37.degree. C. in the presence of 7.5 mM of each of the nucleoside triphosphates (ATP, CTP, GTP and UTP) following the instructions provided by the manufacturer (Ambion, Austin, Tex.). Following transcription, the template DNA was digested with TURBO DNase (Ambion, Austin, Tex.). The replicon RNA was precipitated with LiCl and reconstituted in nuclease-free water. Uncapped RNA was capped post-transcriptionally with Vaccinia Capping Enzyme (VCE) using the ScriptCap m.sup.7G Capping System (Epicentre Biotechnologies, Madison, Wis.) as outlined in the user manual. Post-transcriptionally capped RNA was precipitated with LiCl and reconstituted in nuclease-free water. The concentration of the RNA samples was determined by measuring the optical density at 260 nm. Integrity of the in vitro transcripts was confirmed by denaturing agarose gel electrophoresis.
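The optical density measurement described above can be turned into a concentration with a short calculation. The sketch below is illustrative only: the conversion factor of 40 .mu.g/mL per A260 unit for single-stranded RNA is a commonly used convention that the text does not state, and the function name, dilution factor and path length are hypothetical parameters.

```python
# Assumed convention (not stated in the text): an A260 of 1.0 at a 1 cm
# path length corresponds to roughly 40 ug/mL of single-stranded RNA.
RNA_UG_PER_ML_PER_A260 = 40.0


def rna_concentration_ug_per_ml(a260, dilution_factor=1.0, path_length_cm=1.0):
    """Estimate RNA concentration (ug/mL) of the undiluted sample from A260."""
    if a260 < 0 or dilution_factor <= 0 or path_length_cm <= 0:
        raise ValueError("absorbance must be >= 0; dilution and path length > 0")
    # Normalize to a 1 cm path, convert to ug/mL, then correct for dilution.
    return (a260 / path_length_cm) * RNA_UG_PER_ML_PER_A260 * dilution_factor


# Hypothetical reading: A260 = 0.25 measured on a 1:100 dilution.
example_conc = rna_concentration_ug_per_ml(0.25, dilution_factor=100)
```

With these hypothetical inputs the estimate is 1000 .mu.g/mL, that is, about 1 .mu.g/.mu.L.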
[0249] Lipid Nanoparticle (LNP) Formulation
[0250] 1,2-dilinoleyloxy-N,N-dimethyl-3-aminopropane (DlinDMA) was synthesized using a previously published procedure [Heyes, J., Palmer, L., Bremner, K., MacLachlan, I. Cationic lipid saturation influences intracellular delivery of encapsulated nucleic acids. Journal of Controlled Release, 107: 276-287 (2005)]. 1, 2-Diastearoyl-sn-glycero-3-phosphocholine (DSPC) was purchased from Genzyme. Cholesterol was obtained from Sigma-Aldrich (St. Louis, Mo.). 1, 2-dimyristoyl-sn-glycero-3-phosphoethanolamine-N-[methoxy(polyethylene glycol)-2000] (ammonium salt) (PEG DMG 2000) was obtained from Avanti Polar Lipids.
[0251] LNPs (RV01(14)) were formulated using the following method. 150 .mu.g batch, (PES hollow fibers and no mustang): Fresh lipid stock solutions in ethanol were prepared. 37 mg of DlinDMA, 11.8 mg of DSPC, 27.8 mg of Cholesterol and 8.07 mg of PEG DMG 2000 were weighed and dissolved in 7.55 mL of ethanol. The freshly prepared lipid stock solution was gently rocked at 37.degree. C. for about 15 min to form a homogeneous mixture. Then, 453 .mu.L of the stock was added to 1.547 mL ethanol to make a working lipid stock solution of 2 mL. This amount of lipids was used to form LNPs with 150 .mu.g RNA at an 8:1 N:P (Nitrogen to Phosphate) ratio. The protonatable nitrogen on DlinDMA (the cationic lipid) and phosphates on the RNA are used for this calculation. Each .mu.g of self-replicating RNA molecule was assumed to contain 3 nmoles of anionic phosphate, and each .mu.g of DlinDMA was assumed to contain 1.6 nmoles of cationic nitrogen. A 2 mL working solution of RNA was also prepared from a stock solution of .about.1 .mu.g/.mu.L in 100 mM citrate buffer (pH 6) (Teknova). Three 20 mL glass vials (with stir bars) were rinsed with RNase Away solution (Molecular BioProducts) and washed with plenty of MilliQ water before use to decontaminate the vials of RNases. One of the vials was used for the RNA working solution and the others for collecting the lipid and RNA mixes (as described later). The working lipid and RNA solutions were heated at 37.degree. C. for 10 min before being loaded into 3 cc luer-lok syringes (BD Medical). 2 mL of citrate buffer (pH 6) was loaded in another 3 cc syringe. Syringes containing RNA and the lipids were connected to a T mixer (PEEK.TM. 500 .mu.m ID junction) using FEP tubing ([fluorinated ethylene-propylene] 2 mm ID.times.3 mm OD, Idex Health Science, Oak Harbor, Wash.). The outlet from the T mixer was also FEP tubing (2 mm ID.times.3 mm OD). The third syringe containing the citrate buffer was connected to a separate piece of tubing (2 mm ID.times.3 mm OD). All syringes were then driven at a flow rate of 7 mL/min using a syringe pump (from kdScientific, model no. KDS-220). The tube outlets were positioned to collect the mixtures in a 20 mL glass vial (while stirring). The stir bar was taken out and the ethanol/aqueous solution was allowed to equilibrate to room temperature for 1 h. Then the mixture was loaded in a 5 cc syringe (BD Medical), which was fitted to a piece of FEP tubing (2 mm ID.times.3 mm OD), and in another 5 cc syringe with an equal length of FEP tubing, an equal volume of 100 mM citrate buffer (pH 6) was loaded. The two syringes were driven at 7 mL/min flow rate using a syringe pump and the final mixture was collected in a 20 mL glass vial (while stirring). Next, LNPs were concentrated to 2 mL and dialyzed against 10-15 volumes of 1.times.PBS (from Teknova) using the Tangential Flow Filtration (TFF) system before recovering the final product. The TFF system and hollow fiber filtration membranes were purchased from Spectrum Labs and were used according to the manufacturer's guidelines. Polyethersulfone (PES) hollow fiber filtration membranes (part number P-C1-100E-100-01N) with a 100 kD pore size cutoff and 20 cm.sup.2 surface area were used. For in vitro and in vivo experiments, formulations were diluted to the required RNA concentration with 1.times.PBS (from Teknova).
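The N:P ratio stated in the formulation method can be checked from the quantities given in the text. The following is a minimal sketch of that arithmetic; the per-microgram phosphate and nitrogen values are the assumptions stated in the method, while the function and variable names are hypothetical.

```python
# Assumptions taken from the text of the formulation method.
NMOL_PHOSPHATE_PER_UG_RNA = 3.0      # anionic phosphate per ug of RNA
NMOL_NITROGEN_PER_UG_DLINDMA = 1.6   # cationic nitrogen per ug of DlinDMA


def lipid_in_aliquot_ug(mass_mg, stock_volume_ml, aliquot_ul):
    """Mass (ug) of a lipid carried over when an aliquot of stock is taken."""
    concentration_ug_per_ml = mass_mg * 1000.0 / stock_volume_ml
    return concentration_ug_per_ml * (aliquot_ul / 1000.0)


def n_to_p_ratio(dlindma_ug, rna_ug):
    """Molar ratio of protonatable nitrogen to RNA phosphate."""
    nitrogen_nmol = dlindma_ug * NMOL_NITROGEN_PER_UG_DLINDMA
    phosphate_nmol = rna_ug * NMOL_PHOSPHATE_PER_UG_RNA
    return nitrogen_nmol / phosphate_nmol


# 37 mg DlinDMA in 7.55 mL ethanol; 453 uL of that stock goes into the
# 2 mL working solution, which is combined with 150 ug of RNA.
dlindma_ug = lipid_in_aliquot_ug(37.0, 7.55, 453.0)
ratio = n_to_p_ratio(dlindma_ug, 150.0)
```

Under these assumptions the working solution carries about 2220 .mu.g of DlinDMA and the ratio comes out at roughly 7.9, consistent with the nominal 8:1.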
[0252] Particle Size
[0253] Particle size was measured using a Zetasizer Nano ZS (Malvern Instruments, Worcestershire, UK) according to the manufacturer's instructions. Particle sizes are reported as the Z average with the polydispersity index (pdi). Liposomes were diluted in 1.times.PBS before measurement.
[0254] Encapsulation Efficiency and RNA Concentration
[0255] The percentage of encapsulated RNA and RNA concentration were determined by Quant-iT RiboGreen RNA reagent kit (Invitrogen). Manufacturer's instructions were followed in the assay. The ribosomal RNA standard provided in the kit was used to generate a standard curve. LNPs either obtained from method 1 or methods 2-5 were diluted ten fold or one hundred fold respectively in 1.times. TE buffer (from kit), before addition of the dye. Separately, LNPs were diluted ten or 100 fold in 1.times. TE buffer containing 0.5% Triton X (Sigma-Aldrich), before addition of the dye. Thereafter an equal amount of dye was added to each solution and then .about.180 .mu.L of each solution after dye addition was loaded in duplicate into a 96 well tissue culture plate (obtained from VWR, catalog #353072). The fluorescence (Ex 485 nm, Em 528 nm) was read on a microplate reader (from BioTek Instruments, Inc.).
[0256] Triton X was used to disrupt the LNPs, providing a fluorescence reading corresponding to the total RNA amount and the sample without Triton X provided fluorescence corresponding to the unencapsulated RNA. % RNA encapsulation was determined as follows: LNP RNA Encapsulation (%)=[(F.sub.t-F.sub.i)/F.sub.t].times.100, where F.sub.t is the fluorescence intensity of LNPs with triton X addition and F.sub.i is the fluorescence intensity of the LNP solution without detergent addition. These values (F.sub.t and F.sub.i) were obtained after subtraction from blank (1.times.TE buffer) fluorescence intensity. The concentration of encapsulated RNA was obtained by comparing F.sub.t-F.sub.i with the standard curve generated. All LNP formulations were dosed in vivo based on the encapsulated dose.
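The encapsulation formula above can be sketched in code as follows. The blank subtraction follows the text; the linear standard curve (slope and intercept) used to convert fluorescence to concentration is an assumption for illustration, since the text only says that the value was compared with the standard curve. All names are hypothetical.

```python
def encapsulation_percent(f_total_raw, f_free_raw, f_blank=0.0):
    """Percent encapsulated RNA: [(Ft - Fi) / Ft] x 100 after blank subtraction.

    f_total_raw: fluorescence of LNPs with Triton X (total RNA)
    f_free_raw:  fluorescence of LNPs without detergent (unencapsulated RNA)
    f_blank:     fluorescence of the 1x TE buffer blank
    """
    f_t = f_total_raw - f_blank
    f_i = f_free_raw - f_blank
    if f_t <= 0:
        raise ValueError("blank-corrected total fluorescence must be positive")
    return (f_t - f_i) / f_t * 100.0


def encapsulated_rna_conc(f_total_raw, f_free_raw, f_blank,
                          slope, intercept, dilution_factor):
    """Encapsulated RNA concentration, ASSUMING a linear standard curve
    of the form fluorescence = slope * concentration + intercept."""
    signal = (f_total_raw - f_blank) - (f_free_raw - f_blank)
    return (signal - intercept) / slope * dilution_factor
```

For example, hypothetical readings of 1050 (with Triton X), 150 (without detergent) and 50 (blank) give blank-corrected values of 1000 and 100, and therefore 90% encapsulation.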
[0257] Viral Replicon Particles (VRP)
[0258] To compare RNA vaccines to traditional RNA-vectored approaches for achieving in vivo expression of reporter genes or antigens, we utilized viral replicon particles (VRPs), produced in BHK cells by the methods described by Perri et al. (J. Virol 77(19):10394-10403 (2003)), coding for expression of the same antigens as the corresponding RNA constructs. In this system, the antigen consisted of alphavirus chimeric replicons (VCR) derived from the genome of Venezuelan equine encephalitis virus (VEEV) engineered to contain the 3' terminal sequences (3' UTR) of Sindbis virus and a Sindbis virus packaging signal (PS) (see FIG. 2 of Perri et al). The replicons were packaged into VRPs by co-electroporating them into baby hamster kidney (BHK) cells along with defective helper RNAs encoding the Sindbis virus capsid and glycoprotein genes (see FIG. 2 of Perri et al). The VRPs were then harvested and partially purified by ultracentrifugation on a sucrose cushion and concentrated on an Amicon concentrator. The resulting VRP stock was titrated by standard methods and inoculated into animals in culture fluid or other isotonic buffers. [Perri et al. An alphavirus replicon particle chimera derived from Venezuelan equine encephalitis and Sindbis viruses is a potent gene-based vaccine delivery vector. J. Virol. 77: 10394-10403 (2003).]
[0259] Murine Immunogenicity Studies
[0260] Groups of 10 female BALB/c mice aged 8-10 weeks and weighing about 20 g were immunized with 1.times.10.sup.6 IU (VRP) or 1.0 .mu.g (RNA) at day 0, 21 and 42 with bleeds taken 3 weeks after the 2.sup.nd and 3 weeks after the 3.sup.rd vaccinations. All animals were injected in the quadriceps in the two hind legs each getting an equivalent volume (50 .mu.l per site).
[0261] Micro Neutralization Assay
[0262] Serum samples were tested for the presence of neutralizing antibodies by an infection reduction neutralization test. Two-fold serial dilutions of HI-serum (in DMEM with 10% HI FBS) were added to an equal volume of CMV (strain TB40 or clinical isolate 8819) previously titered to give approximately 200 IU/50 .mu.l. The VR1814, Towne, AD169 strains and the clinical isolate 8822 were also used. Serum/virus mixtures were incubated for 2 hours at 37.degree. C. and 5% CO2, to allow virus neutralization to occur, and then 50 .mu.l of this mixture (containing approximately 200 IU) was inoculated on duplicate wells of ARPE-19 cells in 96 half well plates. Plates were incubated for 40-44 hours. Unless otherwise noted, the number of positive infected foci was determined by immunostaining with an AlexaFluor 488 conjugated IE1 CMV monoclonal antibody followed by automated counting. The neutralization titer is defined as the reciprocal of the serum dilution producing a 50% reduction in number of positive virus foci per well, relative to controls (no serum).
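The titer definition above (the reciprocal serum dilution giving a 50% reduction in infected foci relative to no-serum controls) can be illustrated with a short calculation. This is a sketch under stated assumptions: the text does not say how the 50% point was located between tested dilutions, so log-linear interpolation between the two bracketing two-fold dilutions is assumed here, and all names and example counts are hypothetical.

```python
import math


def percent_reduction(foci, control_foci):
    """Percent reduction in infected foci relative to the no-serum control."""
    return (1.0 - foci / control_foci) * 100.0


def neutralization_titer_50(reciprocal_dilutions, foci_counts, control_foci):
    """Reciprocal dilution at 50% reduction, or None if never bracketed.

    reciprocal_dilutions must be in ascending order (e.g. 100, 200, 400).
    Interpolation is linear in log2(dilution), an ASSUMED convention.
    """
    reductions = [percent_reduction(f, control_foci) for f in foci_counts]
    for i in range(len(reductions) - 1):
        upper, lower = reductions[i], reductions[i + 1]
        if upper >= 50.0 > lower:
            fraction = (upper - 50.0) / (upper - lower)
            log_lo = math.log2(reciprocal_dilutions[i])
            log_hi = math.log2(reciprocal_dilutions[i + 1])
            return 2.0 ** (log_lo + fraction * (log_hi - log_lo))
    return None  # 50% point lies outside the tested dilution range


# Hypothetical counts: control wells average 200 foci.
titer = neutralization_titer_50([100, 200, 400, 800], [20, 60, 140, 180], 200)
```

In this hypothetical series the reduction falls from 70% at 1:200 to 30% at 1:400, so the interpolated titer lies halfway between them on the log scale, at about 283.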
[0263] Immunogenicity of gH/gL VRPs and LNP Formulated RNA
[0264] The A323 replicon that expresses the surface glycoprotein B (gB) of CMV, the A160 replicon that expresses the membrane complex of the full-length glycoprotein H and L (gH/gL) and the A322 replicon that expresses the membrane complex of the soluble form of glycoprotein H and L (gHsol/gL) were used for this experiment. BALB/c mice, 10 animals per group, were given bilateral intramuscular vaccinations (50 .mu.L per leg) on days 0, 21 and 42 with VRPs expressing gB (1.times.10.sup.6 IU), VRPs expressing gH/gL (1.times.10.sup.6 IU), VRPs expressing gHsol/gL (1.times.10.sup.6 IU) and PBS as the controls. The three test groups received self-replicating RNA (A160, A322 or A323) formulated in LNP (RV01(14)). Serum was collected for immunological analysis on days 39 (3wp2) and 63 (3wp3).
Results
[0265] The size and percentage of encapsulated RNA in the RV01(14) formulations made for the experiment are shown in Table 3.
TABLE 3
RV# | Lipid Composition (% moles of total) | RNA | pKa of cationic lipid | Particle Size Zav (nm) | pdI | Percent RNA Encapsulation
RV01 (14) | DlinDMA 40%, DSPC 10%, Chol 48%, PEG DMG 2k 2% | gB FL | 5.8 | 170 | 0.098 | 88.3
RV01 (14) | DlinDMA 40%, DSPC 10%, Chol 48%, PEG DMG 2k 2% | gH FL/gL | 5.8 | 168.8 | 0.144 | 87.4
RV01 (14) | DlinDMA 40%, DSPC 10%, Chol 48%, PEG DMG 2k 2% | gHsol/gL | 5.8 | 162 | 0.131 | 90
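The molar composition listed in Table 3 can be cross-checked against the lipid masses weighed out in the LNP formulation method. The sketch below does this conversion; the masses are from the text, but the molecular weights are approximate values assumed for illustration and are not given in the text.

```python
# Weighed masses (mg) from the LNP formulation method in the text.
MASS_MG = {
    "DlinDMA": 37.0,
    "DSPC": 11.8,
    "Cholesterol": 27.8,
    "PEG DMG 2000": 8.07,
}

# Approximate molecular weights (g/mol). These are ASSUMED values for
# illustration; the text does not state them.
MW_G_PER_MOL = {
    "DlinDMA": 616.1,
    "DSPC": 790.2,
    "Cholesterol": 386.7,
    "PEG DMG 2000": 2693.3,
}


def mole_percent(mass_mg, mw):
    """Convert per-lipid masses to mole percent of total lipid."""
    mmol = {name: mass_mg[name] / mw[name] for name in mass_mg}
    total = sum(mmol.values())
    return {name: 100.0 * amount / total for name, amount in mmol.items()}


composition = mole_percent(MASS_MG, MW_G_PER_MOL)
```

With the assumed molecular weights, the result rounds to 40% DlinDMA, 10% DSPC, 48% cholesterol and 2% PEG DMG 2000, matching the composition reported in the table.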
[0266] The 50% neutralizing titers for the terminal sera (day 63, three weeks after final vaccination) are shown in Table 4.
TABLE 4
50% neutralizing titers measured on ARPE-19 cells with HCMV TB40 and HCMV 8819.
Immunogen | Dose | TB40 pool #1 | TB40 pool #2 | TB40 average | 8819 pool #1 | 8819 pool #2 | 8819 average
Preimmune serum | -- | 126 | 212 | 169 | 50 | 50 | 50
gB FL VRP | 10.sup.6 IU | 1332 | 295 | 814 | 5085 | 1031 | 3058
gB FL RNA-RV01(14) | 1 .mu.g | 686 | 179 | 433 | 1261 | 557 | 909
gH FL/gL VRP | 10.sup.6 IU | 1425 | 1624 | 1525 | 2496 | 1374 | 1935
gH FL/gL RNA-RV01(14) | 1 .mu.g | 6196 | 6390 | 6293 | 5800 | 10267 | 8034
gH sol/gL VRP | 10.sup.6 IU | 2375 | 2254 | 2315 | 1733 | 1924 | 1829
gH sol/gL RNA-RV01(14) | 1 .mu.g | 4600 | 2062 | 3331 | 2912 | 1533 | 2223
[0267] RNAs expressing either a full-length or a presumed soluble form of the HCMV gH/gL complex elicit high titers of neutralizing antibody, as assayed on epithelial cells using two different HCMV strains. The average titers elicited by the gH/gL RNAs are at least as high as the average titers for the corresponding gH/gL VRPs (see FIG. 17).
Example 6. Bicistronic and Pentacistronic Nucleic Acids Encoding CMV Proteins
[0268] Additional bicistronic and pentacistronic alphavirus replicons that express glycoprotein complexes from human cytomegalovirus (HCMV) were prepared, and are shown schematically in FIGS. 18 and 20. The alphavirus replicons were based on Venezuelan equine encephalitis virus (VEE). The replicons were packaged into viral replicon particles (VRPs), encapsulated in lipid nanoparticles (LNP), or formulated with a cationic nanoemulsion (CNE). Expression of the encoded HCMV proteins and protein complexes from each of the replicons was confirmed by immunoblot, co-immunoprecipitation, and flow cytometry. Flow cytometry was used to verify expression of the pentameric gH/gL/UL128/UL130/UL131 complex from pentacistronic replicons encoding the protein components of the complex, using human monoclonal antibodies specific to conformational epitopes present on the pentameric complex (Macagno et al (2010), J. Virol. 84(2):1005-13). FIG. 19 shows that these antibodies bind to BHK-V cells transfected with replicon RNA expressing the HCMV gH/gL/UL128/UL130/UL131 pentameric complex (A527). Similar results were obtained when cells were infected with VRPs made from the same replicon construct. This shows that replicons designed to express the pentameric complex do indeed express the desired antigen and not the potential byproduct gH/gL.
[0269] The VRPs, RNA encapsulated in LNPs, and RNA formulated with CNE were used to immunize BALB/c mice by intramuscular injections in the rear quadriceps. The mice were immunized three times, three weeks apart, and serum samples were collected prior to each immunization as well as three weeks after the third and final immunization. The sera were evaluated in microneutralization assays to measure the potency of the neutralizing antibody response that was elicited by the vaccinations. The titers are expressed as 50% neutralizing titer.
[0270] The immunogenicity of a number of different configurations of a bicistronic expression cassette for a soluble HCMV gH/gL complex in VRPs was assessed. FIG. 20 shows that VRPs expressing the membrane-anchored, full-length gH/gL complex elicited potent neutralizing antibodies at slightly higher titers than the soluble complex (gHsol/gL) expressed from a similar bicistronic expression cassette. Changing the order of the genes encoding gHsol and gL or replacing one of the subgenomic promoters with an IRES or an FMDV 2A site did not substantially improve immunogenicity.
[0271] The breadth and potency of HCMV neutralizing activity in sera from mice immunized with VEE/SIN VRPs expressing gH/gL was assessed by using the sera to block infection of fibroblasts and epithelial cells with different strains of HCMV. Table 5 shows that gH/gL immune sera were broadly and potently neutralizing against six different strains of HCMV on both cell types in the absence of complement. Addition of complement had a slight negative effect on the neutralizing potency of the sera.
TABLE 5
Neutralizing antibody titers in sera from mice immunized with pVCR-derived VRPs expressing gH/gL.
HCMV Strain | Cell | Without complement | With complement
Towne | Fibroblasts (MRC-5) | 5244 | 4081
AD169 | Fibroblasts (MRC-5) | 2126 | 2208
TB40-UL32-EGFP | Fibroblasts (MRC-5) | 678 | 505
VR1814 | Fibroblasts (MRC-5) | 4764 | 2126
TB40-UL32-EGFP | Epithelial cells (ARPE-19) | 5602 | 3247
VR1814 | Epithelial cells (ARPE-19) | 6510 | 2420
8819 (clinical isolate) | Epithelial cells (ARPE-19) | 8706 | 5242
8822 (clinical isolate) | Epithelial cells (ARPE-19) | 3427 | 2684
[0272] The immunogenicity of LNP-encapsulated RNAs encoding the pentameric complex (A526 and A527) compared to LNP-encapsulated RNA (A160) and VRPs (pVCR modified gH-SGPgL) expressing gH/gL was assessed. Table 6 shows that replicons expressing the pentameric complex elicited more potently neutralizing antibodies than replicons expressing gH/gL.
TABLE 6
Neutralizing antibody titers.
Replicon | Titer post 1.sup.st | Titer post 2.sup.nd | Titer post 3.sup.rd
C313 pVCR modified gH-SGP-gL VRP 10.sup.6 IU | 126 | 6,296 | 26,525
A160 gH FL/gL 1 .mu.g LNP | 347 | 9,848 | 42,319
A526 Pentameric 2A 1 .mu.g LNP | 179 | 12,210 | 80,000
A527 Pentameric IRES 1 .mu.g LNP | 1,510 | 51,200 | 130,000
[0273] The pentacistronic VEE-based RNA replicon that elicited the highest titers of neutralizing antibodies (A527) was packaged as VRPs and the immunogenicity of the VRPs was compared to that of gH/gL-expressing VRPs and LNP-encapsulated replicons expressing gH/gL and the pentameric complex. Table 7 shows that VRPs expressing the pentameric complex elicited higher titers of neutralizing antibodies than VRPs expressing gH/gL. Moreover, 10.sup.6 infectious units of VRPs are at least as potent as 1 .mu.g of LNP-encapsulated RNA when the VRPs and the RNA encoded the same protein complexes.
TABLE 7
Neutralizing antibody titers. Sera were collected three weeks after the second immunization.
Replicon | 50% Neutralizing Titer
A160 gH FL/gL VRP 10.sup.6 IU | 14,833
A527 Pentameric IRES VRP 10.sup.6 IU | 51,200
A160 gH FL/gL LNP 0.01 .mu.g | 4,570
A160 gH FL/gL LNP 0.1 .mu.g | 9,415
A160 gH FL/gL LNP 1 .mu.g | 14,427
A527 Pentameric IRES 0.01 .mu.g LNP | 12,693
A527 Pentameric IRES 0.1 .mu.g LNP | 10,309
A527 Pentameric IRES 1 .mu.g LNP | 43,157
[0274] The breadth and potency of HCMV neutralizing activity in sera from mice immunized with VEE-based RNA encoding the pentameric complex (A527) was assessed by using the sera to block infection of fibroblasts and epithelial cells with different strains of HCMV. Table 8 shows that anti-gH/gL/UL128/UL130/UL131 immune sera broadly and potently neutralized infection of epithelial cells. This effect was complement independent. In contrast, the sera had a reduced or undetectable effect on infection of fibroblasts. These results are what is expected for immune sera that contain mostly antibodies specific for the gH/gL/UL128/UL130/UL131 pentameric complex, because the pentameric complex is not required for infection of fibroblasts and, consequently, antibodies to UL128, UL130, and UL131 do not block infection of fibroblasts (Adler et al (2006), J. Gen. Virol. 87(Pt. 9):2451-60; Wang and Shenk (2005), Proc. Natl. Acad. Sci. USA 102(50):18153-8). Thus, these data demonstrate that the pentacistronic replicons encoding the gH/gL/UL128/UL130/UL131 pentameric complex specifically elicit antibodies to the complex in vivo.
TABLE 8
Neutralizing antibody titers in sera from mice immunized with the A527 RNA replicon encapsulated in LNPs. The replicon expresses the HCMV pentameric complex using subgenomic promoters and IRESes.
HCMV Strain | Cell | Without complement | With complement
Towne | Fibroblasts (MRC-5) | 3433 | 1574
AD169 | Fibroblasts (MRC-5) | 2292 | <1000
TB40-UL32-EGFP | Fibroblasts (MRC-5) | <1000 | <1000
VR1814 | Fibroblasts (MRC-5) | 4683 | 1324
TB40-UL32-EGFP | Epithelial cells (ARPE-19) | 86991 | 59778
VR1814 | Epithelial cells (ARPE-19) | 82714 | 37293
8819 (clinical isolate) | Epithelial cells (ARPE-19) | 94418 | 43269
8822 (clinical isolate) | Epithelial cells (ARPE-19) | 85219 | 49742
[0275] To see if bicistronic and pentacistronic replicons expressing the gH/gL and pentameric complexes would elicit neutralizing antibodies in different formulations, cotton rats were immunized with bicistronic or pentacistronic replicons mixed with a cationic nanoemulsion (CNE). Table 9 shows that replicons in CNE elicited comparable neutralizing antibody titers to the same replicons encapsulated in LNPs.
TABLE 9
Neutralizing antibody titers. The sera were collected three weeks after the second immunization.
Replicon | 50% Neutralizing Titer
A160 gH FL/gL VRP 10.sup.6 IU | 594
A160 gH FL/gL 1 .mu.g LNP | 141
A527 Pentameric IRES 1 .mu.g LNP | 4,416
A160 gH FL/gL 1 .mu.g CNE | 413
A527 Pentameric IRES 1 .mu.g CNE | 4,411
Example 7. Replicons Encoding VZV Proteins
[0276] Nucleic acids encoding VZV proteins were cloned into a VEE replicon vector to produce monocistronic replicons that encode gB, gH, gL, gE, and gI, and to produce bicistronic replicons that encode gH/gL or gE/gI. In the bicistronic replicons, expression of each VZV open reading frame was driven by a separate subgenomic promoter.
[0277] To prepare replicon RNA, plasmid encoding the replicon was linearized by digestion with PmeI, and the linearized plasmid was extracted with phenol/chloroform/isoamyl alcohol, precipitated in sodium acetate/ethanol and resuspended in 20 .mu.l of RNase-free water.
[0278] RNA was prepared by in vitro transcription of 1 .mu.g of linearized DNA using the MEGAscript T7 kit (AMBION #AM1333). A 20 .mu.l reaction was set up according to the manufacturer's instructions without cap analog and incubated for 2 hours at 32.degree. C. TURBO DNase (1 .mu.l) was added and the mixture was incubated for 30 min. at 32.degree. C. RNase-free water (30 .mu.l) and ammonium acetate solution (30 .mu.l) were added. The solution was mixed and chilled for at least 30 min at -20.degree. C. Then the solution was centrifuged at maximum speed for 25 min. at 4.degree. C. The supernatant was discarded, and the pellet was rinsed with 70% ethanol and again centrifuged at maximum speed for 10 min. at 4.degree. C. The pellet was air dried and resuspended in 50 .mu.l of RNase-free water. The concentration of RNA was measured and quality was checked on a denaturing gel.
[0279] The RNA was capped using the ScriptCap m7G Capping System (Epicentre #SCCE0625). The reaction was scaled by combining the RNA and RNase-free water. The RNA was then denatured for 5-10 min at 65.degree. C. The denatured RNA was transferred quickly to ice and the following reagents were added in the following order: ScriptCap Capping Buffer, 10 mM GTP, freshly prepared 2 mM SAM, ScriptGuard RNase inhibitor, and ScriptCap Capping Enzyme. The mixture was incubated for 60 min at 37.degree. C. The reaction was stopped by adding RNase-free water and 7.5 M LiCl, mixing well and storing the mixture for at least 30 min at -20.degree. C. Then, the mixture was centrifuged at maximum speed for 25 min. at 4.degree. C., the pellet was rinsed with 70% ethanol and again centrifuged at maximum speed for 10 min. at 4.degree. C., and the pellet was air dried. The pellet was resuspended in RNase-free water. The concentration of RNA was measured and quality was checked on a denaturing gel.
RNA Transfection
[0280] Cells (BHK-V cells) were seeded on 6-well plates and brought to 90-95% confluence at the time of transfection. For each transfection 3 .mu.g of RNA was diluted in 50 mL OPTIMEM media in a first tube. Lipofectamine 2000 was added to a second tube containing 50 mL OPTIMEM media. The first and second tubes were combined and kept for 20 min at room temperature. The culture media in the 6-well plates were replaced with fresh media, and the RNA-Lipofectamine complex was placed onto the cells and mixed by gently rocking the plate. The plates were incubated for 24 hours at 37.degree. C. in a CO.sub.2 incubator.
[0281] Expression of the VZV proteins in transfected cells was assessed by western blot and immunofluorescence. For western blots, lysates of transfected cells were separated by electrophoresis (5 .mu.g total proteins/lane) and blotted. A cleared viral suspension (7 .mu.g total protein/lane) derived from the OKA/Merck vaccine strain was used as a positive control. Blots were probed using commercially available antibodies (1:1000 dilution) that bind VZV proteins.
[0282] For immunofluorescence, transfected cells were harvested and seeded in a 96-well plate, and intracellular staining was performed using commercially available mouse mAbs (dilution range 1:100 to 1:400). Cell pellets were fixed and permeabilized with Citofix-Citoperm solutions. A secondary reagent, Alexa488 labelled goat anti-mouse F(ab').sub.2 (1:400 final dilution), was used.
[0283] Expression of VZV proteins gE and gI was detected in cells transfected with monocistronic constructs (gE or gI), and expression of both gE and gI was detected in cells transfected with a bicistronic gE/gI construct in western blots using commercially available mouse antibodies, 13B1 for gE and 8C4 for gI. Expression of VZV protein gB was detected in cells transfected with a monocistronic construct encoding gB, by immunofluorescence using commercially available antibody 10G6. Expression of the VZV protein complex gH/gL, was detected by immunofluorescence in cells transfected with monocistronic gH and monocistronic gL, or with a bicistronic gH/gL construct. The gH/gL complex was detected using commercially available antibody SG3.
Murine Immunogenicity Studies
[0284] Groups of 8 female BALB/c mice aged 6-8 weeks and weighing about 20 g were immunized intramuscularly with 7.0 or 1.0 .mu.g of replicon RNA formulated with a CNE or LNP (RV01) at day 0, 21 and 42. Blood samples were taken from the immunized animals 3 weeks after the 2nd immunization and 3 weeks after the 3rd immunization. The groups are shown in Table 10.
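TABLE 10
Study | Group | Antigen | Dose (micrograms) | Formulation
Study 1 | 1 | YFP | 7 | CNE
Study 1 | 2 | YFP | 1 | CNE
Study 1 | 3 | gB | 7 | CNE
Study 1 | 4 | gB | 1 | CNE
Study 1 | 5 | gE | 7 | CNE
Study 1 | 6 | gE | 1 | CNE
Study 1 | 7 | gH | 7 | CNE
Study 1 | 8 | gH | 1 | CNE
Study 1 | 9 | gI | 7 | CNE
Study 1 | 10 | gI | 1 | CNE
Study 1 | 11 | gL | 7 | CNE
Study 1 | 12 | gL | 1 | CNE
Study 1 | 13 | gE/gI | 7 | CNE
Study 1 | 14 | gE/gI | 1 | CNE
Study 1 | 15 | gH/gL | 7 | CNE
Study 1 | 16 | gH/gL | 1 | CNE
Study 2 | 1 | gB | 1 | RV01
Study 2 | 2 | gE | 1 | RV01
Study 2 | 3 | gH | 1 | RV01
Study 2 | 4 | gI | 1 | RV01
Study 2 | 5 | gL | 1 | RV01
Study 2 | 6 | gE/gI | 1 | RV01
Study 2 | 7 | gH/gL | 1 | RV01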
Immune Response to VZV Antigens
[0285] Serum samples were tested for the presence of antibodies to gB, by intracellular staining of VZV-replicon transfected MRC-5 cells. MRC-5 cells were maintained in Dulbecco Modified Eagle's Medium with 10% fetal bovine serum. VZV Oka strain inoculum (obtained from ATCC) was used to infect MRC-5 cell culture and infected whole cells were used for subpassage of virus. The ratio between infected and un-infected cells was 1:10. At 30 hrs post infection, cells were trypsin-dispersed for seeding in a 96-well plate to perform an intracellular staining with pools of mice sera (dilution range 1:200 to 1:800) obtained after immunization. Commercial mAbs were used as controls to quantify the infection level. Cell pellets were fixed and permeabilized with Citofix-Citoperm solutions. A secondary reagent, Alexa488 labelled goat anti-mouse F(ab')2 was used (1:400 final dilution).
[0286] Commercial antibodies to gB (10G6), gH (SG3), and gE (13B1 (SBA) and 8612 (Millipore)) were used as positive controls, and each intracellularly stained infected MRC-5 cells. Immune sera obtained 3 weeks after the third immunization with either 1 or 7 .mu.g of RNA formulated with CNE or LNP were diluted 1/200, 1/400 and 1/800 and used to intracellularly stain infected MRC-5 cells. The results are shown in FIG. 21 (Study 1, groups 1, 5, 7, 9, 11, 13 and 15, CNE formulation) and FIG. 22 (Study 2, groups 1-7, LNP formulation).
Neutralizing Assay
[0287] Each immunized mouse serum was serially diluted in two-fold increments starting at 1:20 in standard culture medium, and added to an equal volume of VZV suspension in the presence of guinea pig complement. After incubation for 1 hour at 37.degree. C., the human epithelial cell line A549 was added. Infected cells were quantified after one week of culture by counting the plaques formed in the culture under a microscope. From the plaque number the % inhibition at each serum dilution was calculated. A chart for each serum sample was made by plotting the value of % inhibition against the dilution factor on a logarithmic scale. Subsequently an approximate line relating dilution factor and % inhibition was drawn. Then the 50% neutralization titer was determined as the dilution factor where the line crossed the value of 50% inhibition.
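The graphical determination of the 50% neutralization titer described above can be approximated numerically. The sketch below is an assumption about how such a line might be computed: the text describes drawing an approximate line by eye, whereas this example substitutes an ordinary least-squares fit of % inhibition against log10 of the dilution factor. Function names and the example data are hypothetical.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def neutralization_titer_from_line(dilution_factors, percent_inhibition):
    """Dilution factor at which the fitted line crosses 50% inhibition."""
    log_dilutions = [math.log10(d) for d in dilution_factors]
    slope, intercept = fit_line(log_dilutions, percent_inhibition)
    if slope == 0:
        raise ValueError("inhibition does not change with dilution")
    return 10.0 ** ((50.0 - intercept) / slope)


# Hypothetical two-fold series starting at 1:20.
example_titer = neutralization_titer_from_line([20, 40, 80, 160],
                                               [90.0, 70.0, 50.0, 30.0])
```

In this hypothetical series the points fall exactly on a line that crosses 50% inhibition at a dilution factor of 80. Real data would scatter around the line, and a fit restricted to the roughly linear middle of the curve may be more appropriate.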
[0288] Table 11 shows that sera obtained from mice immunized with monocistronic gE, bicistronic gE/gI, and bicistronic gH/gL contained robust neutralizing antibody titers.
TABLE 11
Neutralization titers of pooled sera from mice immunized with 7 .mu.g RNA
Control (YFP) | gB | gE | gI | gE/gI | gH | gL | gH/gL
<20 | <20 | 1111 | <20 | 440 | <20 | <20 | 1070
<20 | <20 | 413 | 51 | >2560 | <20 | <20 | >2560
<20 | <20 | >2560 | <20 | 1031 | <20 | <20 | >2560
<20 | 20 | 2128 | <20 | 1538 | <20 | <20 | >2560
<20 | 20 | 861 | <20 | 636 | 20 | <20 | >2560
<20 | <20 | 1390 | <20 | 2339 | <20 | <20 | >2560
<20 | <20 | 969 | <20 | 1903 | <20 | <20 | 900
<20 | <20 | 1011 | 20 | 1969 | 20 | <20 | >2560
<20* | <20* | <20* | <20* | <20* | <20* | <20* | <20*
*pre-immune pooled sera
REFERENCES
[0289] Britt W J, Alford C A. Cytomegalovirus. In Fields B N, Knipe D M, Howley P M (ed.). Fields Virology, 3.sup.rd edition, Philadelphia, Pa.: Lippincott/Raven; 1996. p. 2493-523.
[0290] Chee M S, Bankier A T, Beck S, Bohni R, Brown C M, Cerny R, Horsnell T, Hutchinson C A, Kouzarides T, Martignetti J A, Preddie E, Satchwell S C, Tomlinson P, Weston K M and Barrell B G. 1990. Analysis of the protein-coding content of the sequence of human cytomegalovirus strain AD169. Curr. Top. Microbiol. Immunol. 154:125-70.
[0291] Davison A J, Dolan A, Akter P, Addison C, Dargan D J, Alcendor D J, McGeoch D J and Hayward G S. 2003. The human cytomegalovirus genome revisited: comparison with the chimpanzee cytomegalovirus genome. J. Gen. Virol. 84:17-28. (Erratum, 84:1053).
[0292] Crumpacker C S and Wadhwa S. 2005. Cytomegalovirus, p 1786-1800. In G. L. Mandell, J. E. Bennett, and R. Dolin (ed.), Principles and practice of infectious diseases, vol 2. Elsevier, Philadelphia, Pa.
[0293] Pomeroy C and Englund J A. 1987. Cytomegalovirus: epidemiology and infection control. Am J Infect Control 15: 107-119.
[0294] Murphy E, Yu D, Grimwood J, Schmutz J, Dickson M, Jarvis M A, Nelson J A, Myers R M and Shenk T E. 2003. Coding potential of laboratory and clinical strains of cytomegalovirus. Proc. Natl. Acad. Sci. USA 100:14976-81.
[0295] Mocarski E S and Tan Courcelle C. 2001. Cytomegalovirus and their replication, p. 2629-73. In D M Knipe and P M Howley (ed.) Fields Virology, 4.sup.th edition, vol. 2. Lippincott Williams and Wilkins, Philadelphia, Pa.
[0296] Compton T. 2004. Receptors and immune sensors: the complex entry path of human cytomegalovirus. Trends Cell. Bio. 14(1): 5-8.
[0297] Britt W J and Alford C A. 2004. Human cytomegalovirus virion proteins. Hum. Immunol. 65:395-402.
[0298] Varnum S M, Streblow D N, Monroe M E, Smith P, Auberry K J, Pasa-Tolic L, Wang D, Camp II D G, Rodland K, Wiley, Britt W, Shenk T, Smith R D and Nelson J A. 2004. Identification of proteins in human cytomegalovirus (HCMV) particles: the HCMV proteome. J. Virol. 78:10960-66. (Erratum, 78:13395).
[0299] Ljungman P, Griffiths P and Paya C. 2002. Definitions of cytomegalovirus infection and disease in transplant recipients. Clin. Infect. Dis. 34:1094-97.
[0300] Rubin R. 2002. Clinical approach to infection in the compromised host, p. 573-679. In R. Rubin and L S Young (ed), Infection in the organ transplant recipient. Kluwer Academic Press, New York, N.Y.
[0301] Stagno S and Britt W J. 2005. Cytomegalovirus, p. 389-424. In J S Remington and J O Klein (ed), Infectious diseases of the fetus and newborn infant, 6.sup.th edition. W B Saunders, Philadelphia, Pa.
[0302] Britt W J, Vugler L, Butfiloski E J and Stephens E B. 1990. Cell surface expression of human cytomegalovirus (HCMV) gp55-116 (gB): use of HCMV-vaccinia recombinant virus infected cells in analysis of the human neutralizing antibody response. J. Virol. 64:1079-85.
[0303] Reap E A, Dryga S A, Morris J, Rivers B, Norberg P K, Olmsted R A and Chulay J D. 2007. Cellular and Humoral Immune Responses to Alphavirus Replicon Vaccines expressing Cytomegalovirus pp65, IE1 and gB proteins. Clin. Vacc. Immunol. 14:748-55.
[0304] Balasuriya U B R, Heidner H W, Hedges J F, Williams J C, Davis N L, Johnston R E and MacLachlan N J. 2000. Expression of the two major envelope proteins of equine arteritis virus as a heterodimer is necessary for induction of neutralizing antibodies in mice immunized with recombinant Venezuelan equine encephalitis virus replicon particles. J. Virol. 74:10623-30.
[0305] Dunn W, Chou C, Li H, Hai R, Patterson D, Stolc V, Zhu H and Liu F. 2003. Functional profiling of a human cytomegalovirus genome. Proc. Natl. Acad. Sci. USA 100:14223-28.
[0306] Hobom U, Brune W, Messerle M, Hahn G and Koszinowski U H. 2000. Fast screening procedures for random transposon libraries of cloned herpesvirus genomes: mutational analysis of human cytomegalovirus envelope glycoprotein genes. J. Virol. 74:7720-29.
[0307] Ryckman B J, Chase M C and Johnson D C. 2009. HCMV TR strain glycoprotein O acts as a chaperone promoting gH/gL incorporation into virions, but is not present in virions. J. Virol.
[0308] Wille P T, Knoche A J, Nelson J A, Jarvis M A and Johnson D C. 2009. An HCMV gO-null mutant fails to incorporate gH/gL into the virion envelope and is unable to enter fibroblasts, epithelial, and endothelial cells. J. Virol.
[0309] Shimamura M, Mach M and Britt W J. 2006. Human Cytomegalovirus infection elicits a glycoprotein M (gM)/gN-specific virus-neutralizing antibody response. J. Virol. 80:4591-4600.
[0310] Cha T A, Tom E, Kemble G W, Duke G M, Mocarski E S and Spaete R R. 1996. Human cytomegalovirus clinical isolates carry at least 19 genes not found in laboratory strains. J. Virol. 70:78-83.
[0311] Wang D and Shenk T. 2005. Human cytomegalovirus virion protein complex required for epithelial and endothelial cell tropism. Proc. Natl. Acad. Sci. USA 102:18153-58.
[0312] Adler B, Scrivano L, Ruzsics Z, Rupp B, Sinzger C and Koszinowski U. 2006. Role of human cytomegalovirus UL131A in cell type-specific virus entry and release. J. Gen. Virol. 87:2451-60.
[0313] Ryckman B J, Rainish B L, Chase M C, Borton J A, Nelson J A, Jarvis M A and Johnson D C. 2008. Characterization of the human cytomegalovirus gH/gL/UL128-UL131 complex that mediates entry into epithelial and endothelial cells. J. Virol. 82:60-70.
[0313] SEQUENCES CMV gB FL (SEQ ID NO: 25): 1 - atggaaagccggatctggtgcctggtcgtgtgcgtgaacctgtgcatcgtgtgcctgggagc cgccgtgagcagcagcagcaccagaggcaccagcgccacacacagccaccacagcagccaca ccacctctgccgcccacagcagatccggcagcgtgtcccagagagtgaccagcagccagacc gtgtcccacggcgtgaacgagacaatctacaacaccaccctgaagtacggcgacgtcgtggg cgtgaataccaccaagtacccctacagagtgtgcagcatggcccagggcaccgacctgatca gattcgagcggaacatcgtgtgcaccagcatgaagcccatcaacgaggacctggacgagggc atcatggtggtgtacaagagaaacatcgtggcccacaccttcaaagtgcgggtgtaccagaa ggtgctgaccttccggcggagctacgcctacatccacaccacatacctgctgggcagcaaca ccgagtacgtggcccctcccatgtgggagatccaccacatcaacagccacagccagtgctac agcagctacagccgcgtgatcgccggcacagtgttcgtggcctaccaccgggacagctacga gaacaagaccatgcagctgatgcccgacgactacagcaacacccacagcaccagatacgtga ccgtgaaggaccagtggcacagcagaggcagcacctggctgtaccgggagacatgcaacctg aactgcatggtcaccatcaccaccgccagaagcaagtacccttaccacttcttcgccacctc caccggcgacgtggtggacatcagccccttctacaacggcaccaaccggaacgccagctact tcggcgagaacgccgacaagttcttcatcttccccaactacaccatcgtgtccgacttcggc agacccaacagcgctctggaaacccacagactggtggcctttctggaacgggccgacagcgt gatcagctgggacatccaggacgagaagaacgtgacctgccagctgaccttctgggaggcct ctgagagaaccatcagaagcgaggccgaggacagctaccacttcagcagcgccaagatgacc gccaccttcctgagcaagaaacaggaagtgaacatgagcgactccgccctggactgcgtgag ggacgaggccatcaacaagctgcagcagatcttcaacaccagctacaaccagacctacgaga agtatggcaatgtgtccgtgttcgagacaacaggcggcctggtggtgttctggcagggcatc aagcagaaaagcctggtggagctggaacggctcgccaaccggtccagcctgaacctgaccca caaccggaccaagcggagcaccgacggcaacaacgcaacccacctgtccaacatggaaagcg tgcacaacctggtgtacgcacagctgcagttcacctacgacaccctgcggggctacatcaac agagccctggcccagatcgccgaggcttggtgcgtggaccagcggcggaccctggaagtgtt caaagagctgtccaagatcaaccccagcgccatcctgagcgccatctacaacaagcctatcg ccgccagattcatgggcgacgtgctgggcctggccagctgcgtgaccatcaaccagaccagc gtgaaggtgctgcgggacatgaacgtgaaagagagcccaggccgctgctactccagacccgt ggtcatcttcaacttcgccaacagctcctacgtgcagtacggccagctgggcgaggacaacg agatcctgctggggaaccaccggaccgaggaatgccagctgcccagcctgaagatctttatc 
gccggcaacagcgcctacgagtatgtggactacctgttcaagcggatgatcgacctgagcag catctccaccgtggacagcatgatcgccctggacatcgaccccctggaaaacaccgacttcc gggtgctggaactgtacagccagaaagagctgcggagcagcaacgtgttcgacctggaagag atcatgcgggagttcaacagctacaagcagcgcgtgaaatacgtggaggacaaggtggtgga ccccctgcctccttacctgaagggcctggacgacctgatgagcggactgggcgctgccggaa aagccgtgggagtggccattggagctgtgggcggagctgtggcctctgtcgtggaaggcgtc gccacctttctgaagaaccccttcggcgccttcaccatcatcctggtggccattgccgtcgt gatcatcacctacctgatctacacccggcagcggagactgtgtacccagcccctgcagaacc tgttcccctacctggtgtccgccgatggcaccacagtgaccagcggctccaccaaggatacc agcctgcaggccccacccagctacgaagagagcgtgtacaacagcggcagaaagggccctgg ccctcccagctctgatgccagcacagccgcccctccctacaccaacgagcaggcctaccaga tgctgctggccctggctagactggatgccgagcagagggcccagcagaacggcaccgacagc ctggatggcagaaccggcacccaggacaagggccagaagcccaacctgctggaccggctgcg gcaccggaagaacggctaccggcacctgaaggacagcgacgaggaagagaacgtctgataa - 2727 CMV gB FL (SEQ ID NO: 26): MESRIWCLVVCVNLCIVCLGAAVSSSSTRGTSATHSHHSSHTTSAAHSRSGSVSQRVTSSQT VSHGVNETIYNTTLKYGDVVGVNTTKYPYRVCSMAQGTDLIRFERNIVCTSMKPINEDLDEG IMVVYKRNIVAHTFKVRVYQKVLTFRRSYAYIHTTYLLGSNTEYVAPPMWEIHHINSHSQCY SSYSRVIAGTVFVAYHRDSYENKTMQLMPDDYSNTHSTRYVTVKDQWHSRGSTWLYRETCNL NCMVTITTARSKYPYHFFATSTGDVVDISPFYNGTNRNASYFGENADKFFIFPNYTIVSDFG RPNSALETHRLVAFLERADSVISWDIQDEKNVTCQLTFWEASERTIRSEAEDSYHFSSAKMT ATFLSKKQEVNMSDSALDCVRDEAINKLQQIFNTSYNQTYEKYGNVSVFETTGGLVVFWQGI KQKSLVELERLANRSSLNLTHNRTKRSTDGNNATHLSNMESVHNLVYAQLQFTYDTLRGYIN RALAQIAEAWCVDQRRTLEVFKELSKINPSAILSAIYNKPIAARFMGDVLGLASCVTINQTS VKVLRDMNVKESPGRCYSRPVVIFNFANSSYVQYGQLGEDNEILLGNHRTEECQLPSLKIFI AGNSAYEYVDYLFKRMIDLSSISTVDSMIALDIDPLENTDFRVLELYSQKELRSSNVFDLEE IMREFNSYKQRVKYVEDKVVDPLPPYLKGLDDLMSGLGAAGKAVGVAIGAVGGAVASVVEGV ATFLKNPFGAFTIILVAIAVVIITYLIYTRQRRLCTQPLQNLFPYLVSADGTTVTSGSTKDT SLQAPPSYEESVYNSGRKGPGPPSSDASTAAPPYTNEQAYQMLLALARLDAEQRAQQNGTDS LDGRTGTQDKGQKPNLLDRLRHRKNGYRHLKDSDEEENV-- CMV gB sol 750 (SEQ ID NO: 27): 1- atggaaagccggatctggtgcctggtcgtgtgcgtgaacctgtgcatcgtgtgcctgggagc cgccgtgagcagcagcagcaccagaggcaccagcgccacacacagccaccacagcagccaca 
ccacctctgccgcccacagcagatccggcagcgtgtcccagagagtgaccagcagccagacc gtgtcccacggcgtgaacgagacaatctacaacaccaccctgaagtacggcgacgtcgtggg cgtgaataccaccaagtacccctacagagtgtgcagcatggcccagggcaccgacctgatca gattcgagcggaacatcgtgtgcaccagcatgaagcccatcaacgaggacctggacgagggc atcatggtggtgtacaagagaaacatcgtggcccacaccttcaaagtgcgggtgtaccagaa ggtgctgaccttccggcggagctacgcctacatccacaccacatacctgctgggcagcaaca ccgagtacgtggcccctcccatgtgggagatccaccacatcaacagccacagccagtgctac agcagctacagccgcgtgatcgccggcacagtgttcgtggcctaccaccgggacagctacga gaacaagaccatgcagctgatgcccgacgactacagcaacacccacagcaccagatacgtga ccgtgaaggaccagtggcacagcagaggcagcacctggctgtaccgggagacatgcaacctg aactgcatggtcaccatcaccaccgccagaagcaagtacccttaccacttcttcgccacctc caccggcgacgtggtggacatcagccccttctacaacggcaccaaccggaacgccagctact tcggcgagaacgccgacaagttcttcatcttccccaactacaccatcgtgtccgacttcggc agacccaacagcgctctggaaacccacagactggtggcctttctggaacgggccgacagcgt gatcagctgggacatccaggacgagaagaacgtgacctgccagctgaccttctgggaggcct ctgagagaaccatcagaagcgaggccgaggacagctaccacttcagcagcgccaagatgacc gccaccttcctgagcaagaaacaggaagtgaacatgagcgactccgccctggactgcgtgag ggacgaggccatcaacaagctgcagcagatcttcaacaccagctacaaccagacctacgaga agtatggcaatgtgtccgtgttcgagacaacaggcggcctggtggtgttctggcagggcatc aagcagaaaagcctggtggagctggaacggctcgccaaccggtccagcctgaacctgaccca caaccggaccaagcggagcaccgacggcaacaacgcaacccacctgtccaacatggaaagcg tgcacaacctggtgtacgcacagctgcagttcacctacgacaccctgcggggctacatcaac agagccctggcccagatcgccgaggcttggtgcgtggaccagcggcggaccctggaagtgtt caaagagctgtccaagatcaaccccagcgccatcctgagcgccatctacaacaagcctatcg ccgccagattcatgggcgacgtgctgggcctggccagctgcgtgaccatcaaccagaccagc gtgaaggtgctgcgggacatgaacgtgaaagagagcccaggccgctgctactccagacccgt ggtcatcttcaacttcgccaacagctcctacgtgcagtacggccagctgggcgaggacaacg agatcctgctggggaaccaccggaccgaggaatgccagctgcccagcctgaagatctttatc gccggcaacagcgcctacgagtatgtggactacctgttcaagcggatgatcgacctgagcag catctccaccgtggacagcatgatcgccctggacatcgaccccctggaaaacaccgacttcc gggtgctggaactgtacagccagaaagagctgcggagcagcaacgtgttcgacctggaagag 
atcatgcgggagttcaacagctacaagcagcgcgtgaaatacgtggaggacaaggtggtgga ccccctgcctccttacctgaagggcctggacgacctgatgagcggactgggcgctgccggaa aagccgtgggagtggccattggagctgtgggcggagctgtggcctctgtcgtggaaggcgtc gccacctttctgaagaactgataa - 2256 Cmv gB sol 750 (SEQ ID NO: 28): MESRIWCLVVCVNLCIVCLGAAVSSSSTRGTSATHSHHSSHTTSAAHSRSGSVSQRVTSSQT VSHGVNETIYNTTLKYGDVVGVNTTKYPYRVCSMAQGTDLIRFERNIVCTSMKPINEDLDEG IMVVYKRNIVAHTFKVRVYQKVLTFRRSYAYIHTTYLLGSNTEYVAPPMWEIHHINSHSQCY SSYSRVIAGTVFVAYHRDSYENKTMQLMPDDYSNTHSTRYVTVKDQWHSRGSTWLYRETCNL NCMVTITTARSKYPYHFFATSTGDVVDISPFYNGTNRNASYFGENADKFFIFPNYTIVSDFG RPNSALETHRLVAFLERADSVISWDIQDEKNVTCQLTFWEASERTIRSEAEDSYHFSSAKMT ATFLSKKQEVNMSDSALDCVRDEAINKLQQIFNTSYNQTYEKYGNVSVFETTGGLVVFWQGI KQKSLVELERLANRSSLNLTHNRTKRSTDGNNATHLSNMESVHNLVYAQLQFTYDTLRGYIN RALAQIAEAWCVDQRRTLEVFKELSKINPSAILSAIYNKPIAARFMGDVLGLASCVTINQTS VKVLRDMNVKESPGRCYSRPVVIFNFANSSYVQYGQLGEDNEILLGNHRTEECQLPSLKIFI AGNSAYEYVDYLFKRMIDLSSISTVDSMIALDIDPLENTDFRVLELYSQKELRSSNVFDLEE IMREFNSYKQRVKYVEDKVVDPLPPYLKGLDDLMSGLGAAGKAVGVAIGAVGGAVASVVEGV ATFLKN-- CMV gB sol 692 (SEQ ID NO: 29): 1- atggaaagccggatctggtgcctggtcgtgtgcgtgaacctgtgcatcgtgtgcctgggagc cgccgtgagcagcagcagcaccagaggcaccagcgccacacacagccaccacagcagccaca ccacctctgccgcccacagcagatccggcagcgtgtcccagagagtgaccagcagccagacc gtgtcccacggcgtgaacgagacaatctacaacaccaccctgaagtacggcgacgtcgtggg cgtgaataccaccaagtacccctacagagtgtgcagcatggcccagggcaccgacctgatca gattcgagcggaacatcgtgtgcaccagcatgaagcccatcaacgaggacctggacgagggc atcatggtggtgtacaagagaaacatcgtggcccacaccttcaaagtgcgggtgtaccagaa ggtgctgaccttccggcggagctacgcctacatccacaccacatacctgctgggcagcaaca ccgagtacgtggcccctcccatgtgggagatccaccacatcaacagccacagccagtgctac agcagctacagccgcgtgatcgccggcacagtgttcgtggcctaccaccgggacagctacga gaacaagaccatgcagctgatgcccgacgactacagcaacacccacagcaccagatacgtga ccgtgaaggaccagtggcacagcagaggcagcacctggctgtaccgggagacatgcaacctg aactgcatggtcaccatcaccaccgccagaagcaagtacccttaccacttcttcgccacctc caccggcgacgtggtggacatcagccccttctacaacggcaccaaccggaacgccagctact tcggcgagaacgccgacaagttcttcatcttccccaactacaccatcgtgtccgacttcggc 
agacccaacagcgctctggaaacccacagactggtggcctttctggaacgggccgacagcgt gatcagctgggacatccaggacgagaagaacgtgacctgccagctgaccttctgggaggcct ctgagagaaccatcagaagcgaggccgaggacagctaccacttcagcagcgccaagatgacc gccaccttcctgagcaagaaacaggaagtgaacatgagcgactccgccctggactgcgtgag ggacgaggccatcaacaagctgcagcagatcttcaacaccagctacaaccagacctacgaga agtatggcaatgtgtccgtgttcgagacaacaggcggcctggtggtgttctggcagggcatc aagcagaaaagcctggtggagctggaacggctcgccaaccggtccagcctgaacctgaccca caaccggaccaagcggagcaccgacggcaacaacgcaacccacctgtccaacatggaaagcg tgcacaacctggtgtacgcacagctgcagttcacctacgacaccctgcggggctacatcaac agagccctggcccagatcgccgaggcttggtgcgtggaccagcggcggaccctggaagtgtt caaagagctgtccaagatcaaccccagcgccatcctgagcgccatctacaacaagcctatcg ccgccagattcatgggcgacgtgctgggcctggccagctgcgtgaccatcaaccagaccagc gtgaaggtgctgcgggacatgaacgtgaaagagagcccaggccgctgctactccagacccgt ggtcatcttcaacttcgccaacagctcctacgtgcagtacggccagctgggcgaggacaacg agatcctgctggggaaccaccggaccgaggaatgccagctgcccagcctgaagatctttatc gccggcaacagcgcctacgagtatgtggactacctgttcaagcggatgatcgacctgagcag catctccaccgtggacagcatgatcgccctggacatcgaccccctggaaaacaccgacttcc gggtgctggaactgtacagccagaaagagctgcggagcagcaacgtgttcgacctggaagag atcatgcgggagttcaacagctacaagcagtgataa - 2082 Cmv gB sol 692 (SEQ ID NO: 30); MESRIWCLVVCVNLCIVCLGAAVSSSSTRGTSATHSHHSSHTTSAAHSRSGSVSQRVT SSQTVSHGVNETIYNTTLKYGDVVGVNTTKYPYRVCSMAQGTDLIRFERNIVCTSMK PINEDLDEGIMVVYKRNIVAHTFKVRVYQKVLTFRRSYAYIHTTYLLGSNTEYVAPP MWEIHHINSHSQCYSSYSRVIAGTVFVAYHRDSYENKTMQLMPDDYSNTHSTRYVT VKDQWHSRGSTWLYRETCNLNCMVTITTARSKYPYHFFATSTGDVVDISPFYNGTN RNASYFGENADKFFIFPNYTIVSDFGRPNSALETHRLVAFLERADSVISWDIQDEKNV TCQLTFWEASERTIRSEAEDSYHFSSAKMTATFLSKKQEVNMSDSALDCVRDEAINK LQQIFNTSYNQTYEKYGNVSVFETTGGLVVFWQGIKQKSLVELERLANRSSLNLTHN RTKRSTDGNNATHLSNMESVHNLVYAQLQFTYDTLRGYINRALAQIAEAWCVDQR RTLEVFKELSKINPSAILSAIYNKPIAARFMGDVLGLASCVTINQTSVKVLRDMNVKE SPGRCYSRPVVIFNFANSSYVQYGQLGEDNEILLGNHRTEECQLPSLKIFIAGNSAYEY VDYLFKRMIDLSSISTVDSMIALDIDPLENTDFRVLELYSQKELRSSNVFDLEEIMREF NSYKQ- CMV gH FL (SEQ ID NO: 31): 1- 
atgaggcctggcctgccctcctacctgatcatcctggccgtgtgcctgttcagccacctgctgtccagcagata- c ggcgccgaggccgtgagcgagcccctggacaaggctttccacctgctgctgaacacctacggcagacccatccg- g tttctgcgggagaacaccacccagtgcacctacaacagcagcctgcggaacagcaccgtcgtgagagagaacgc- c atcagcttcaactttttccagagctacaaccagtactacgtgttccacatgcccagatgcctgtttgccggccc- t ctggccgagcagttcctgaaccaggtggacctgaccgagacactggaaagataccagcagcggctgaataccta- c gccctggtgtccaaggacctggccagctaccggtcctttagccagcagctcaaggctcaggatagcctcggcga- g cagcctaccaccgtgccccctcccatcgacctgagcatcccccacgtgtggatgcctccccagaccacccctca- c ggctggaccgagagccacaccacctccggcctgcacagaccccacttcaaccagacctgcatcctgttcgacgg- c cacgacctgctgtttagcaccgtgaccccctgcctgcaccagggcttctacctgatcgacgagctgagatacgt- g aagatcaccctgaccgaggatttcttcgtggtcaccgtgtccatcgacgacgacacccccatgctgctgatctt- c ggccacctgcccagagtgctgttcaaggccccctaccagcgggacaacttcatcctgcggcagaccgagaagca- c gagctgctggtgctggtcaagaaggaccagctgaaccggcactcctacctgaaggaccccgacttcctggacgc- c gccctggacttcaactacctggacctgagcgccctgctgagaaacagcttccacagatacgccgtggacgtgct- g aagtccggacggtgccagatgctcgatcggcggaccgtggagatggccttcgcctatgccctcgccctgttcgc- c gctgccagacaggaagaggctggcgcccaggtgtcagtgcccagagccctggatagacaggccgccctgctgca- g atccaggaattcatgatcacctgcctgagccagaccccccctagaaccaccctgctgctgtaccccacagccgt- g gatctggccaagagggccctgtggacccccaaccagatcaccgacatcacaagcctcgtgcggctcgtgtacat- c ctgagcaagcagaaccagcagcacctgatcccccagtgggccctgagacagatcgccgacttcgccctgaagct- g cacaagacccatctggccagctttctgagcgccttcgccaggcaggaactgtacctgatgggcagcctggtcca- c agcatgctggtgcataccaccgagcggcgggagatcttcatcgtggagacaggcctgtgtagcctggccgagct- g tcccactttacccagctgctggcccaccctcaccacgagtacctgagcgacctgtacaccccctgcagcagcag- c ggcagacgggaccacagcctggaacggctgaccagactgttccccgatgccaccgtgcctgctacagtgcctgc- c gccctgtccatcctgtccaccatgcagcccagcaccctggaaaccttccccgacctgttctgcctgcccctggg- c gagagctttagcgccctgaccgtgtccgagcacgtgtcctacatcgtgaccaatcagtacctgatcaagggcat- c agctaccccgtgtccaccacagtcgtgggccagagcctgatcatcacccagaccgacagccagaccaagtgcga- g 
ctgacccggaacatgcacaccacacacagcatcaccgtggccctgaacatcagcctggaaaactgcgctttctg- t cagtctgccctgctggaatacgacgatacccagggcgtgatcaacatcatgtacatgcacgacagcgacgacgt- g ctgttcgccctggacccctacaacgaggtggtggtgtccagcccccggacccactacctgatgctgctgaagaa- c ggcaccgtgctggaagtgaccgacgtggtggtggacgccaccgacagcagactgctgatgatgagcgtgtacgc- c ctgagcgccatcatcggcatctacctgctgtaccggatgctgaaaacctgctgataa - 2232 Cmv gH FL (SEQ ID NO: 32); MRPGLPSYLIILAVCLFSHLLSSRYGAEAVSEPLDKAFHLLLNTYGRPIRFLRENTTQCTYN SSLRNSTVVRENAISFNFFQSYNQYYVFHMPRCLFAGPLAEQFLNQVDLTETLERYQQRLNT YALVSKDLASYRSFSQQLKAQDSLGEQPTTVPPPIDLSIPHVWMPPQTTPHGWTESHTTSGL HRPHFNQTCILFDGHDLLFSTVTPCLHQGFYLIDELRYVKITLTEDFFVVTVSIDDDTPMLL IFGHLPRVLFKAPYQRDNFILRQTEKHELLVLVKKDQLNRHSYLKDPDFLDAALDFNYLDLS ALLRNSFHRYAVDVLKSGRCQMLDRRTVEMAFAYALALFAAARQEEAGAQVSVPRALDRQAA LLQIQEFMITCLSQTPPRTTLLLYPTAVDLAKRALWTPNQITDITSLVRLVYILSKQNQQHL IPQWALRQIADFALKLHKTHLASFLSAFARQELYLMGSLVHSMLVHTTERREIFIVETGLCS LAELSHFTQLLAHPHHEYLSDLYTPCSSSGRRDHSLERLTRLFPDATVPATVPAALSILSTM QPSTLETFPDLFCLPLGESFSALTVSEHVSYIVTNQYLIKGISYPVSTTVVGQSLIITQTDS QTKCELTRNMHTTHSITVALNISLENCAFCQSALLEYDDTQGVINIMYMHDSDDVLFALDPY
NEVVVSSPRTHYLMLLKNGTVLEVTDVVVDATDSRLLMMSVYALSAIIGIYLLYRMLKTC-- CMV gH sol (SEQ ID NO: 33): 1- atgaggcctggcctgccctcctacctgatcatcctggccgtgtgcctgttcagccacctgct gtccagcagatacggcgccgaggccgtgagcgagcccctggacaaggctttccacctgctgc tgaacacctacggcagacccatccggtttctgcgggagaacaccacccagtgcacctacaac agcagcctgcggaacagcaccgtcgtgagagagaacgccatcagcttcaactttttccagag ctacaaccagtactacgtgttccacatgcccagatgcctgtttgccggccctctggccgagc agttcctgaaccaggtggacctgaccgagacactggaaagataccagcagcggctgaatacc tacgccctggtgtccaaggacctggccagctaccggtcctttagccagcagctcaaggctca ggatagcctcggcgagcagcctaccaccgtgccccctcccatcgacctgagcatcccccacg tgtggatgcctccccagaccacccctcacggctggaccgagagccacaccacctccggcctg cacagaccccacttcaaccagacctgcatcctgttcgacggccacgacctgctgtttagcac cgtgaccccctgcctgcaccagggcttctacctgatcgacgagctgagatacgtgaagatca ccctgaccgaggatttcttcgtggtcaccgtgtccatcgacgacgacacccccatgctgctg atcttcggccacctgcccagagtgctgttcaaggccccctaccagcgggacaacttcatcct gcggcagaccgagaagcacgagctgctggtgctggtcaagaaggaccagctgaaccggcact cctacctgaaggaccccgacttcctggacgccgccctggacttcaactacctggacctgagc gccctgctgagaaacagcttccacagatacgccgtggacgtgctgaagtccggacggtgcca gatgctcgatcggcggaccgtggagatggccttcgcctatgccctcgccctgttcgccgctg ccagacaggaagaggctggcgcccaggtgtcagtgcccagagccctggatagacaggccgcc ctgctgcagatccaggaattcatgatcacctgcctgagccagaccccccctagaaccaccct gctgctgtaccccacagccgtggatctggccaagagggccctgtggacccccaaccagatca ccgacatcacaagcctcgtgcggctcgtgtacatcctgagcaagcagaaccagcagcacctg atcccccagtgggccctgagacagatcgccgacttcgccctgaagctgcacaagacccatct ggccagctttctgagcgccttcgccaggcaggaactgtacctgatgggcagcctggtccaca gcatgctggtgcataccaccgagcggcgggagatcttcatcgtggagacaggcctgtgtagc ctggccgagctgtcccactttacccagctgctggcccaccctcaccacgagtacctgagcga cctgtacaccccctgcagcagcagcggcagacgggaccacagcctggaacggctgaccagac tgttccccgatgccaccgtgcctgctacagtgcctgccgccctgtccatcctgtccaccatg cagcccagcaccctggaaaccttccccgacctgttctgcctgcccctgggcgagagctttag cgccctgaccgtgtccgagcacgtgtcctacatcgtgaccaatcagtacctgatcaagggca tcagctaccccgtgtccaccacagtcgtgggccagagcctgatcatcacccagaccgacagc 
cagaccaagtgcgagctgacccggaacatgcacaccacacacagcatcaccgtggccctgaa catcagcctggaaaactgcgctttctgtcagtctgccctgctggaatacgacgatacccagg gcgtgatcaacatcatgtacatgcacgacagcgacgacgtgctgttcgccctggacccctac aacgaggtggtggtgtccagcccccggacccactacctgatgctgctgaagaacggcaccgt gctggaagtgaccgacgtggtggtggacgccaccgactgataa - 2151 CMV gH sol (SEQ ID NO: 34); MRPGLPSYLIILAVCLFSHLLSSRYGAEAVSEPLDKAFHLLLNTYGRPIRFLRENTTQCTYN SSLRNSTVVRENAISFNFFQSYNQYYVFHMPRCLFAGPLAEQFLNQVDLTETLERYQQRLNT YALVSKDLASYRSFSQQLKAQDSLGEQPTTVPPPIDLSIPHVWMPPQTTPHGWTESHTTSGL HRPHFNQTCILFDGHDLLFSTVTPCLHQGFYLIDELRYVKITLTEDFFVVTVSIDDDTPMLL IFGHLPRVLFKAPYQRDNFILRQTEKHELLVLVKKDQLNRHSYLKDPDFLDAALDFNYLDLS ALLRNSFHRYAVDVLKSGRCQMLDRRTVEMAFAYALALFAAARQEEAGAQVSVPRALDRQAA LLQIQEFMITCLSQTPPRTTLLLYPTAVDLAKRALWTPNQITDITSLVRLVYILSKQNQQHL IPQWALRQIADFALKLHKTHLASFLSAFARQELYLMGSLVHSMLVHTTERREIFIVETGLCS LAELSHFTQLLAHPHHEYLSDLYTPCSSSGRRDHSLERLTRLFPDATVPATVPAALSILSTM QPSTLETFPDLFCLPLGESFSALTVSEHVSYIVTNQYLIKGISYPVSTTVVGQSLIITQTDS QTKCELTRNMHTTHSITVALNISLENCAFCQSALLEYDDTQGVINIMYMHDSDDVLFALDPY NEVVVSSPRTHYLMLLKNGTVLEVTDVVVDATD-- CMV gL fl (SEQ ID NO: 35): 1- atgtgcagaaggcccgactgcggcttcagcttcagccctggacccgtgatcctgctgtggtg ctgcctgctgctgcctatcgtgtcctctgccgccgtgtctgtggcccctacagccgccgaga aggtgccagccgagtgccccgagctgaccagaagatgcctgctgggcgaggtgttcgagggc gacaagtacgagagctggctgcggcccctggtcaacgtgaccggcagagatggccccctgag ccagctgatccggtacagacccgtgacccccgaggccgccaatagcgtgctgctggacgagg ccttcctggataccctggccctgctgtacaacaaccccgaccagctgagagccctgctgacc ctgctgtccagcgacaccgcccccagatggatgaccgtgatgcggggctacagcgagtgtgg agatggcagccctgccgtgtacacctgcgtggacgacctgtgcagaggctacgacctgacca gactgagctacggccggtccatcttcacagagcacgtgctgggcttcgagctggtgcccccc agcctgttcaacgtggtggtggccatccggaacgaggccaccagaaccaacagagccgtgcg gctgcctgtgtctacagccgctgcacctgagggcatcacactgttctacggcctgtacaacg ccgtgaaagagttctgcctccggcaccagctggatccccccctgctgagacacctggacaag tactacgccggcctgcccccagagctgaagcagaccagagtgaacctgcccgcccacagcag atatggccctcaggccgtggacgccagatgataa - 840 CMV gL FL (SEQ ID NO: 36); 
MCRRPDCGFSFSPGPVILLWCCLLLPIVSSAAVSVAPTAAEKVPAECPELTRRCLLGEVFEG DKYESWLRPLVNVTGRDGPLSQLIRYRPVTPEAANSVLLDEAFLDTLALLYNNPDQLRALLT LLSSDTAPRWMTVMRGYSECGDGSPAVYTCVDDLCRGYDLTRLSYGRSIFTEHVLGFELVPP SLFNVVVAIRNEATRTNRAVRLPVSTAAAPEGITLFYGLYNAVKEFCLRHQLDPPLLRHLDK YYAGLPPELKQTRVNLPAHSRYGPQAVDAR-- CMV gM FL (SEQ ID NO: 37): 1- atggcccccagccacgtggacaaagtgaacacccggacttggagcgccagcatcgtgttcat ggtgctgaccttcgtgaacgtgtccgtgcacctggtgctgtccaacttcccccacctgggct acccctgcgtgtactaccacgtggtggacttcgagcggctgaacatgagcgcctacaacgtg atgcacctgcacacccccatgctgtttctggacagcgtgcagctcgtgtgctacgccgtgtt catgcagctggtgtttctggccgtgaccatctactacctcgtgtgctggatcaagatcagca tgcggaaggacaagggcatgagcctgaaccagagcacccgggacatcagctacatgggcgac agcctgaccgccttcctgttcatcctgagcatggacaccttccagctgttcaccctgaccat gagcttccggctgcccagcatgatcgccttcatggccgccgtgcactttttctgtctgacca tcttcaacgtgtccatggtcacccagtaccggtcctacaagcggagcctgttcttcttctcc cggctgcaccccaagctgaagggcaccgtgcagttccggaccctgatcgtgaacctggtgga ggtggccctgggcttcaataccaccgtggtggctatggccctgtgctacggcttcggcaaca acttcttcgtgcggaccggccatatggtgctggccgtgttcgtggtgtacgccatcatcagc atcatctactttctgctgatcgaggccgtgttcttccagtacgtgaaggtgcagttcggcta ccatctgggcgcctttttcggcctgtgcggcctgatctaccccatcgtgcagtacgacacct tcctgagcaacgagtaccggaccggcatcagctggtccttcggaatgctgttcttcatctgg gccatgttcaccacctgcagagccgtgcggtacttcagaggcagaggcagcggctccgtgaa gtaccaggccctggccacagcctctggcgaagaggtggccgccctgagccaccacgacagcc tggaaagcagacggctgcgggaggaagaggacgacgacgacgaggacttcgaggacgcctga taa - 1119 CMV gM FL (SEQ ID NO: 38); MAPSHVDKVNTRTWSASIVFMVLTFVNVSVHLVLSNFPHLGYPCVYYHVVDFERLNMSAYNV MHLHTPMLFLDSVQLVCYAVFMQLVFLAVTIYYLVCWIKISMRKDKGMSLNQSTRDISYMGD SLTAFLFILSMDTFQLFTLTMSFRLPSMIAFMAAVHFFCLTIFNVSMVTQYRSYKRSLFFFS RLHPKLKGTVQFRTLIVNLVEVALGFNTTVVAMALCYGFGNNFFVRTGHMVLAVFVVYAIIS IIYFLLIEAVFFQYVKVQFGYHLGAFFGLCGLIYPIVQYDTFLSNEYRTGISWSFGMLFFIW AMFTTCRAVRYFRGRGSGSVKYQALATASGEEVAALSHHDSLESRRLREEEDDDDEDFEDA- - CMV gN FL (SEQ ID NO: 39): 1- atggaatggaacaccctggtcctgggcctgctggtgctgtctgtcgtggccagcagcaacaa 
cacatccacagccagcacccctagacctagcagcagcacccacgccagcactaccgtgaagg ctaccaccgtggccaccacaagcaccaccactgctaccagcaccagctccaccacctctgcc aagcctggctctaccacacacgaccccaacgtgatgaggccccacgcccacaacgacttcta caacgctcactgcaccagccacatgtacgagctgtccctgagcagctttgccgcctggtgga ccatgctgaacgccctgatcctgatgggcgccttctgcatcgtgctgcggcactgctgcttc cagaacttcaccgccaccaccaccaagggctactgataa - 411 CMV gN FL (SEQ ID NO: 40); MEWNTLVLGLLVLSVVASSNNTSTASTPRPSSSTHASTTVKATTVATTSTTTATSTSSTTSA KPGSTTHDPNVMRPHAHNDFYNAHCTSHMYELSLSSFAAWWTMLNALILMGAFCIVLRHCCF QNFTATTTKGY-- CMV gO FL (SEQ ID NO: 41): 1- atgggcaagaaagaaatgatcatggtcaagggcatccccaagatcatgctgctgattagcat cacctttctgctgctgtccctgatcaactgcaacgtgctggtcaacagccggggcaccagaa gatcctggccctacaccgtgctgtcctaccggggcaaagagatcctgaagaagcagaaagag gacatcctgaagcggctgatgagcaccagcagcgacggctaccggttcctgatgtaccccag ccagcagaaattccacgccatcgtgatcagcatggacaagttcccccaggactacatcctgg ccggacccatccggaacgacagcatcacccacatgtggttcgacttctacagcacccagctg cggaagcccgccaaatacgtgtacagcgagtacaaccacaccgcccacaagatcaccctgag gcctcccccttgtggcaccgtgcccagcatgaactgcctgagcgagatgctgaacgtgtcca agcggaacgacaccggcgagaagggctgcggcaacttcaccaccttcaaccccatgttcttc aacgtgccccggtggaacaccaagctgtacatcggcagcaacaaagtgaacgtggacagcca gaccatctactttctgggcctgaccgccctgctgctgagatacgcccagcggaactgcaccc ggtccttctacctggtcaacgccatgagccggaacctgttccgggtgcccaagtacatcaac ggcaccaagctgaagaacaccatgcggaagctgaagcggaagcaggccctggtcaaagagca gccccagaagaagaacaagaagtcccagagcaccaccaccccctacctgagctacaccacct ccaccgccttcaacgtgaccaccaacgtgacctacagcgccacagccgccgtgaccagagtg gccacaagcaccaccggctaccggcccgacagcaactttatgaagtccatcatggccaccca gctgagagatctggccacctgggtgtacaccaccctgcggtacagaaacgagcccttctgca agcccgaccggaacagaaccgccgtgagcgagttcatgaagaatacccacgtgctgatcaga aacgagacaccctacaccatctacggcaccctggacatgagcagcctgtactacaacgagac aatgagcgtggagaacgagacagccagcgacaacaacgaaaccacccccacctcccccagca cccggttccagcggaccttcatcgaccccctgtgggactacctggacagcctgctgttcctg gacaagatccggaacttcagcctgcagctgcccgcctacggcaatctgaccccccctgagca 
cagaagggccgccaacctgagcaccctgaacagcctgtggtggtggagccagtgataa - 1422 CMV gO FL (SEQ ID NO: 42); MGKKEMIMVKGIPKIMLLISITFLLLSLINCNVLVNSRGTRRSWPYTVLSYRGKEILKKQKE DILKRLMSTSSDGYRFLMYPSQQKFHAIVISMDKFPQDYILAGPIRNDSITHMWFDFYSTQL RKPAKYVYSEYNHTAHKITLRPPPCGTVPSMNCLSEMLNVSKRNDTGEKGCGNFTTFNPMFF NVPRWNTKLYIGSNKVNVDSQTIYFLGLTALLLRYAQRNCTRSFYLVNAMSRNLFRVPKYIN GTKLKNTMRKLKRKQALVKEQPQKKNKKSQSTTTPYLSYTTSTAFNVTTNVTYSATAAVTRV ATSTTGYRPDSNFMKSIMATQLRDLATWVYTTLRYRNEPFCKPDRNRTAVSEFMKNTHVLIR NETPYTIYGTLDMSSLYYNETMSVENETASDNNETTPTSPSTRFQRTFIDPLWDYLDSLLFL DKIRNFSLQLPAYGNLTPPEHRRAANLSTLNSLWWWSQ-- CMV UL128 FL (SEQ ID NO: 43): 1- atgagccccaaggacctgacccccttcctgacaaccctgtggctgctcctgggccatagcag agtgcctagagtgcgggccgaggaatgctgcgagttcatcaacgtgaaccacccccccgagc ggtgctacgacttcaagatgtgcaaccggttcaccgtggccctgagatgccccgacggcgaa gtgtgctacagccccgagaaaaccgccgagatccggggcatcgtgaccaccatgacccacag cctgacccggcaggtggtgcacaacaagctgaccagctgcaactacaaccccctgtacctgg aagccgacggccggatcagatgcggcaaagtgaacgacaaggcccagtacctgctgggagcc gccggaagcgtgccctaccggtggatcaacctggaatacgacaagatcacccggatcgtggg cctggaccagtacctggaaagcgtgaagaagcacaagcggctggacgtgtgcagagccaaga tgggctacatgctgcagtgataa - 519 CMV UL128 FL (SEQ ID NO: 44); MSPKDLTPFLTTLWLLLGHSRVPRVRAEECCEFINVNHPPERCYDFKMCNRFTVALRCPDGE VCYSPEKTAEIRGIVTTMTHSLTRQVVHNKLTSCNYNPLYLEADGRIRCGKVNDKAQYLLGA AGSVPYRWINLEYDKITRIVGLDQYLESVKKHKRLDVCRAKMGYMLQ-- CMV UL130 FL (SEQ ID NO: 45): 1- atgctgcggctgctgctgagacaccacttccactgcctgctgctgtgtgccgtgtgggccac cccttgtctggccagcccttggagcaccctgaccgccaaccagaaccctagccccccttggt ccaagctgacctacagcaagccccacgacgccgccaccttctactgcccctttctgtacccc agccctcccagaagccccctgcagttcagcggcttccagagagtgtccaccggccctgagtg ccggaacgagacactgtacctgctgtacaaccgggagggccagacactggtggagcggagca gcacctgggtgaaaaaagtgatctggtatctgagcggccggaaccagaccatcctgcagcgg atgcccagaaccgccagcaagcccagcgacggcaacgtgcagatcagcgtggaggacgccaa aatcttcggcgcccacatggtgcccaagcagaccaagctgctgagattcgtggtcaacgacg gcaccagatatcagatgtgcgtgatgaagctggaaagctgggcccacgtgttccgggactac 
tccgtgagcttccaggtccggctgaccttcaccgaggccaacaaccagacctacaccttctg cacccaccccaacctgatcgtgtgataa - 648 CMV UL130 FL (SEQ ID NO: 46); MLRLLLRHHFHCLLLCAVWATPCLASPWSTLTANQNPSPPWSKLTYSKPHDAATFYCPFLYP SPPRSPLQFSGFQRVSTGPECRNETLYLLYNREGQTLVERSSTWVKKVIWYLSGRNQTILQR MPRTASKPSDGNVQISVEDAKIFGAHMVPKQTKLLRFVVNDGTRYQMCVMKLESWAHVERDY SVSFQVRLTFTEANNQTYTFCTHPNLIV-- CMV UL131 FL (SEQ ID NO: 47): 1- atgcggctgtgcagagtgtggctgtccgtgtgcctgtgtgccgtggtgctgggccagtgcca gagagagacagccgagaagaacgactactaccgggtgccccactactgggatgcctgcagca gagccctgcccgaccagacccggtacaaatacgtggagcagctcgtggacctgaccctgaac taccactacgacgccagccacggcctggacaacttcgacgtgctgaagcggatcaacgtgac cgaggtgtccctgctgatcagcgacttccggcggcagaacagaagaggcggcaccaacaagc ggaccaccttcaacgccgctggctctctggcccctcacgccagatccctggaattcagcgtg cggctgttcgccaactgataa - 393 CMV UL131 FL (SEQ ID NO: 48); MRLCRVWLSVCLCAVVLGQCQRETAEKNDYYRVPHYWDACSRALPDQTRYKYVEQLVDLTLN YHYDASHGLDNFDVLKRINVTEVSLLISDFRRQNRRGGTNKRTTFNAAGSLAPHARSLEFSV RLFAN-- EMCV IRES nucleotide sequence (SEQ ID NO: 49); aacgttactggccgaagccgcttggaataaggccggtgtgcgtttgtctatatgttattttc caccatattgccgtcttttggcaatgtgagggcccggaaacctggccctgtcttcttgacga gcattcctaggggtctttcccctctcgccaaaggaatgcaaggtctgttgaatgtcgtgaag gaagcagttcctctggaagcttcttgaagacaaacaacgtctgtagcgaccctttgcaggca gcggaaccccccacctggcgacaggtgcctctgcggccaaaagccacgtgtataagatacac ctgcaaaggcggcacaaccccagtgccacgttgtgagttggatagttgtggaaagagtcaaa tggctctcctcaagcgtattcaacaaggggctgaaggatgcccagaaggtaccccattgtat gggatctgatctggggcctcggtgcacatgctttacatgtgtttagtcgaggttaaaaaaac gtctaggccccccgaaccacggggacgtggttttcctttgaaaaacacgataat EV71 IRES nucleotide sequence (SEQ ID NO: 50); gtacctttgtacgcctgttttataccccctccctgatttgcaacttagaagcaacgcaaacc agatcaatagtaggtgtgacataccagtcgcatcttgatcaagcacttctgtatccccggac cgagtatcaatagactgtgcacacggttgaaggagaaaacgtccgttacccggctaactact tcgagaagcctagtaacgccattgaagttgcagagtgtttcgctcagcactccccccgtgta gatcaggtcgatgagtcaccgcattccccacgggcgaccgtggcggtggctgcgttggcggc ctgcctatggggtaacccataggacgctctaatacggacatggcgtgaagagtctattgagc 
tagttagtagtcctccggcccctgaatgcggctaatcctaactgcggagcacatacccttaa tccaaagggcagtgtgtcgtaacgggcaactctgcagcggaaccgactactttgggtgtccg tgtttctttttattcttgtattggctgcttatggtgacaattaaagaattgttaccatatag ctattggattggccatccagtgtcaaacagagctattgtatatctctttgttggattcacac ctctcactcttgaaacgttacacaccctcaattacattatactgctgaacacgaagcg VEE Subgenomic Promoter (SEQ ID NO: 51): 5'-CTCTCTACGGCTAACCTGAATGGA-3' pVCR modified vector gH sol-SGP gL (SEQ ID NO: 52): cgcgtcggctacaattaatacataaccttatgtatcatacacatacgatttaggtgacacta tagatgggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcac gttgacatcgaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttga ggtagaagccaagcaggtcactgataatgaccatgctaatgccagagcgttttcgcatctgg cttcaaaactgatcgaaacggaggtggacccatccgacacgatccttgacattggaagtgcg cccgcccgcagaatgtattctaagcacaagtatcattgtatctgtccgatgagatgtgcgga agatccggacagattgtataagtatgcaactaagctgaagaaaaactgtaaggaaataactg ataaggaattggacaagaaaatgaaggagctcgccgccgtcatgagcgaccctgacctggaa actgagactatgtgcctccacgacgacgagtcgtgtcgctacgaagggcaagtcgctgttta ccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaataagggagtta
gagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctggagca tatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggcct atgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatt tgaaaccatccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggac ttactgaggagctggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatg tcggtgtgagactatagttagttgcgacgggtacgtcgttaaaagaatagctatcagtccag gcctgtatgggaagccttcaggctatgctgctacgatgcaccgcgagggattcttgtgctgc aaagtgacagacacattgaacggggagagggtctcttttcccgtgtgcacgtatgtgccagc tacattgtgtgaccaaatgactggcatactggcaacagatgtcagtgcggacgacgcgcaaa aactgctggttgggctcaaccagcgtatagtcgtcaacggtcgcacccagagaaacaccaat accatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggtgggcaaaggaata taaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtcatggggt gttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaacc atcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacatt ggagatcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctc tcattaccgccgaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgt gaagccgaggagttgcgcgcagctctaccacctttggcagctgatgttgaggagcccactct ggaagccgatgtagacttgatgttacaagaggctggggccggctcagtggagacacctcgtg gcttgataaaggttaccagctacgctggcgaggacaagatcggctcttacgctgtgctttct ccgcaggctgtactcaagagtgaaaaattatcttgcatccaccctctcgctgaacaagtcat agtgataacacactctggccgaaaagggcgttatgccgtggaaccataccatggtaaagtag tggtgccagagggacatgcaatacccgtccaggactttcaagctctgagtgaaagtgccacc attgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccacacatggagg agcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcgaat acctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctc acaggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgacc agccgctccttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctg gcatcattaaaagcgcagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgt gcagaaattataagggacgtcaagaaaatgaaagggctggacgtcaatgccagaactgtgga ctcagtgctcttgaatggatgcaaacaccccgtagagaccctgtatattgacgaagcttttg cttgtcatgcaggtactctcagagcgctcatagccattataagacctaaaaaggcagtgctc 
tgcggggatcccaaacagtgcggtttttttaacatgatgtgcctgaaagtgcattttaacca cgagatttgcacacaagtcttccacaaaagcatctctcgccgttgcactaaatctgtgactt cggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccgaaagagactaag attgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcacttgttt cagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctg cctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcct ctgtacgcacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgt gtggaaaacactagccggcgacccatggataaaaacactgactgccaagtaccctgggaatt tcactgccacgatagaggagtggcaagcagagcatgatgccatcatgaggcacatcttggag agaccggaccctaccgacgtcttccagaataaggcaaacgtgtgttgggccaaggctttagt gccggtgctgaagaccgctggcatagacatgaccactgaacaatggaacactgtggattatt ttgaaacggacaaagctcactcagcagagatagtattgaaccaactatgcgtgaggttcttt ggactcgatctggactccggtctattttctgcacccactgttccgttatccattaggaataa tcactgggataactccccgtcgcctaacatgtacgggctgaataaagaagtggtccgtcagc tctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatgacatgaac actggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgcc tcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagca aattgaagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggtt gactggttgtcagaccggcctgaggctaccttcagagctcggctggatttaggcatcccagg tgatgtgcccaaatatgacataatatttgttaatgtgaggaccccatataaataccatcact atcagcagtgtgaagaccatgccattaagcttagcatgttgaccaagaaagcttgtctgcat ctgaatcccggcggaacctgtgtcagcataggttatggttacgctgacagggccagcgaaag catcattggtgctatagcgcggcagttcaagttttcccgggtatgcaaaccgaaatcctcac ttgaagagacggaagttctgtttgtattcattgggtacgatcgcaaggcccgtacgcacaat ccttacaagctttcatcaaccttgaccaacatttatacaggttccagactccacgaagccgg atgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaaggagtgatta taaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataagaaa ttcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgc agctaaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtg acaaacagttggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaag tcagtagcgattccactgttgtccaccggcatcttttccgggaacaaagatcgactaaccca 
atcattgaaccatttgctgacagctttagacaccactgatgcagatgtagccatatactgca gggacaagaaatgggaaatgactctcaaggaagcagtggctaggagagaagcagtggaggag atatgcatatccgacgactcttcagtgacagaacctgatgcagagctggtgagggtgcatcc gaagagttctttggctggaaggaagggctacagcacaagcgatggcaaaactttctcatatt tggaagggaccaagtttcaccaggcggccaaggatatagcagaaattaatgccatgtggccc gttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagcatgagcagtat taggtcgaaatgccccgtcgaagagtcggaagcctcctcaccacctagcacgctgccttgct tgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaa attactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatcca atgctcccagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatc tcgtggaaacaccaccggtagacgagactccggagccatcggcagagaaccaatccacagag gggacacctgaacaaccaccacttataaccgaggatgagaccaggactagaacgcctgagcc gatcatcatcgaagaggaagaagaggatagcataagtttgctgtcagatggcccgacccacc aggtgctgcaagtcgaggcagacattcacgggccgccctctgtatctagctcatcctggtcc attcctcatgcatccgactttgatgtggacagtttatccatacttgacaccctggagggagc tagcgtgaccagcggggcaacgtcagccgagactaactcttacttcgcaaagagtatggagt ttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctccacatcccgctccg cgcacaagaacaccgtcacttgcacccagcagggcctgctcgagagggatcacgggagaaac cgtgggatacgcggttacacacaatagcgagggcttcttgctatgcaaagttactgacacag taaaaggagaacgggtatcgttccctgtgtgcacgtacatcccggccaccataaactcgaga accagcctggtctccaacccgccaggcgtaaatagggtgattacaagagaggagtttgaggc gttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatcttttcctccgacaccg gtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtgttggag aggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtgg agaacatgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggca gaaggaaaagtggagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaa ccgtgccttttcaagccccaaggtcgcagtggaagcctgtaacgccatgttgaaagagaact ttccgactgtggcttcttactgtattattccagagtacgatgcctatttggacatggttgac ggagcttcatgctgcttagacactgccagtttttgccctgcaaagctgcgcagctttccaaa gaaacactcctatttggaacccacaatacgatcggcagtgccttcagcgatccagaacacgc 
tccagaacgtcctggcagctgccacaaaaagaaattgcaatgtcacgcaaatgagagaattg cccgtattggattcggcggcctttaatgtggaatgcttcaagaaatatgcgtgtaataatga atattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggtaaattaca ttaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaatatg ttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactcc aggaacaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctag caacagcgtatctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgctt ccgaacattcatacactgtttgatatgtcggctgaagactttgacgctattatagccgagca cttccagcctggggattgtgttctggaaactgacatcgcgtcgtttgataaaagtgaggacg acgccatggctctgaccgcgttaatgattctggaagacttaggtgtggacgcagagctgttg acgctgattgaggcggctttcggcgaaatttcatcaatacatttgcccactaaaactaaatt taaattcggagccatgatgaaatctggaatgttcctcacactgtttgtgaacacagtcatta acattgtaatcgcaagcagagtgttgagagaacggctaaccggatcaccatgtgcagcattc attggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcagacaggtgcgc cacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgccttatt tctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagacccc ctaaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacag gagaagggcattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgt gcaaggcagtagaatcaaggtatgaaaccgtaggaacttccatcatagttatggccatgact actctagctagcagtgttaaatcattcagctacctgagaggggcccctataactctctacgg ctaacctgaatggactacgacatagtctagtcgacgccaccatgaggcctggcctgccctcc tacctgatcatcctggccgtgtgcctgttcagccacctgctgtccagcagatacggcgccga ggccgtgagcgagcccctggacaaggctttccacctgctgctgaacacctacggcagaccca tccggtttctgcgggagaacaccacccagtgcacctacaacagcagcctgcggaacagcacc gtcgtgagagagaacgccatcagcttcaactttttccagagctacaaccagtactacgtgtt ccacatgcccagatgcctgtttgccggccctctggccgagcagttcctgaaccaggtggacc tgaccgagacactggaaagataccagcagcggctgaatacctacgccctggtgtccaaggac ctggccagctaccggtcctttagccagcagctcaaggctcaggatagcctcggcgagcagcc taccaccgtgccccctcccatcgacctgagcatcccccacgtgtggatgcctccccagacca cccctcacggctggaccgagagccacaccacctccggcctgcacagaccccacttcaaccag acctgcatcctgttcgacggccacgacctgctgtttagcaccgtgaccccctgcctgcacca 
gggcttctacctgatcgacgagctgagatacgtgaagatcaccctgaccgaggatttcttcg tggtcaccgtgtccatcgacgacgacacccccatgctgctgatcttcggccacctgcccaga gtgctgttcaaggccccctaccagcgggacaacttcatcctgcggcagaccgagaagcacga gctgctggtgctggtcaagaaggaccagctgaaccggcactcctacctgaaggaccccgact tcctggacgccgccctggacttcaactacctggacctgagcgccctgctgagaaacagcttc cacagatacgccgtggacgtgctgaagtccggacggtgccagatgctcgatcggcggaccgt ggagatggccttcgcctatgccctcgccctgttcgccgctgccagacaggaagaggctggcg cccaggtgtcagtgcccagagccctggatagacaggccgccctgctgcagatccaggaattc atgatcacctgcctgagccagaccccccctagaaccaccctgctgctgtaccccacagccgt ggatctggccaagagggccctgtggacccccaaccagatcaccgacatcacaagcctcgtgc ggctcgtgtacatcctgagcaagcagaaccagcagcacctgatcccccagtgggccctgaga cagatcgccgacttcgccctgaagctgcacaagacccatctggccagctttctgagcgcctt cgccaggcaggaactgtacctgatgggcagcctggtccacagcatgctggtgcataccaccg agcggcgggagatcttcatcgtggagacaggcctgtgtagcctggccgagctgtcccacttt acccagctgctggcccaccctcaccacgagtacctgagcgacctgtacaccccctgcagcag cagcggcagacgggaccacagcctggaacggctgaccagactgttccccgatgccaccgtgc ctgctacagtgcctgccgccctgtccatcctgtccaccatgcagcccagcaccctggaaacc ttccccgacctgttctgcctgcccctgggcgagagctttagcgccctgaccgtgtccgagca cgtgtcctacatcgtgaccaatcagtacctgatcaagggcatcagctaccccgtgtccacca cagtcgtgggccagagcctgatcatcacccagaccgacagccagaccaagtgcgagctgacc cggaacatgcacaccacacacagcatcaccgtggccctgaacatcagcctggaaaactgcgc tttctgtcagtctgccctgctggaatacgacgatacccagggcgtgatcaacatcatgtaca tgcacgacagcgacgacgtgctgttcgccctggacccctacaacgaggtggtggtgtccagc ccccggacccactacctgatgctgctgaagaacggcaccgtgctggaagtgaccgacgtggt ggtggacgccaccgactgataatctagacggcgcgcccacccagcggccgcctataactctc tacggctaacctgaatggactacgacatagtctagtcgacgccaccatgtgcagaaggcccg actgcggcttcagcttcagccctggacccgtgatcctgctgtggtgctgcctgctgctgcct atcgtgtcctctgccgccgtgtctgtggcccctacagccgccgagaaggtgccagccgagtg ccccgagctgaccagaagatgcctgctgggcgaggtgttcgagggcgacaagtacgagagct ggctgcggcccctggtcaacgtgaccggcagagatggccccctgagccagctgatccggtac agacccgtgacccccgaggccgccaatagcgtgctgctggacgaggccttcctggataccct 
ggccctgctgtacaacaaccccgaccagctgagagccctgctgaccctgctgtccagcgaca ccgcccccagatggatgaccgtgatgcggggctacagcgagtgtggagatggcagccctgcc gtgtacacctgcgtggacgacctgtgcagaggctacgacctgaccagactgagctacggccg gtccatcttcacagagcacgtgctgggcttcgagctggtgccccccagcctgttcaacgtgg tggtggccatccggaacgaggccaccagaaccaacagagccgtgcggctgcctgtgtctaca gccgctgcacctgagggcatcacactgttctacggcctgtacaacgccgtgaaagagttctg cctccggcaccagctggatccccccctgctgagacacctggacaagtactacgccggcctgc ccccagagctgaagcagaccagagtgaacctgcccgcccacagcagatatggccctcaggcc gtggacgccagatgataatctagacggcgcgcccacccaatcgatgtacttccgaggaactc acgtgcataatgcatcaggctggtacattagatccccgcttaccgcgggcaatatagcaaca ctaaaaactcgatgtacttccgaggaagcgcagtgcataatgctgcgcagtgttgccacata accactatattaaccatttatctagcggacgccaaaaactcaatgtatttctgaggaagcgt ggtgcataatgccacgcagcgtctgcataacttttattatttcttttattaatcaacaaaat tttgtttttaacatttcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaagggtc ggcatggcatctccacctcctcgcggtccgacctgggcatccgaaggaggacgcacgtccac tcggatggctaagggagagccacgagctcctgtttaaaccagctccaattcgccctatagtg agtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctggc gttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaaga ggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccct gtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgcc agcgccctagcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctt tccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgctttacggcacc tcgaccccaaaaaacttgattagggtgatggttcacgtagtgggccatcgccctgatagacg gtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactgg aacaacactcaaccctatctcggtctattcttttgatttataagggattttgccgatttcgg cctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatatta acgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttattt ttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaata atattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttg cggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaa gatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttga 
gagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcg cggtattatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcag aatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatgacagtaag agaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaa cgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgc cttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgat gcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagctt cccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcg gcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcgg tatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacgg ggagtcaggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgatt aagcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttca tttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaatccctt aacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttga gatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggt ggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagag cgcagataccaaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactct gtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcga taagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgg gctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgaga tacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggta tccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcct ggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgc tcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggc cttttgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataacc gtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgcagcgag tcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggcc gattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacg caattaatgtgagttagctcactcattaggcaccccaggctttacactttatgctcccggct cgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatga ttacgccaagcgcgcaattaaccctcactaaagggaacaaaagctgggtaccggcgcca
pVCR modified vector gH FL-SGP gL (SEQ ID NO: 53):
cgcgtcggctacaattaatacataaccttatgtatcatacacatacgatttaggtgacacta tagatgggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcac gttgacatcgaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttga ggtagaagccaagcaggtcactgataatgaccatgctaatgccagagcgttttcgcatctgg cttcaaaactgatcgaaacggaggtggacccatccgacacgatccttgacattggaagtgcg cccgcccgcagaatgtattctaagcacaagtatcattgtatctgtccgatgagatgtgcgga agatccggacagattgtataagtatgcaactaagctgaagaaaaactgtaaggaaataactg ataaggaattggacaagaaaatgaaggagctcgccgccgtcatgagcgaccctgacctggaa actgagactatgtgcctccacgacgacgagtcgtgtcgctacgaagggcaagtcgctgttta ccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaataagggagtta gagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctggagca tatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggcct atgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatt tgaaaccatccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggac ttactgaggagctggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatg tcggtgtgagactatagttagttgcgacgggtacgtcgttaaaagaatagctatcagtccag gcctgtatgggaagccttcaggctatgctgctacgatgcaccgcgagggattcttgtgctgc aaagtgacagacacattgaacggggagagggtctcttttcccgtgtgcacgtatgtgccagc tacattgtgtgaccaaatgactggcatactggcaacagatgtcagtgcggacgacgcgcaaa aactgctggttgggctcaaccagcgtatagtcgtcaacggtcgcacccagagaaacaccaat accatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggtgggcaaaggaata taaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtcatggggt gttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaacc atcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacatt ggagatcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctc tcattaccgccgaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgt gaagccgaggagttgcgcgcagctctaccacctttggcagctgatgttgaggagcccactct ggaagccgatgtagacttgatgttacaagaggctggggccggctcagtggagacacctcgtg gcttgataaaggttaccagctacgctggcgaggacaagatcggctcttacgctgtgctttct ccgcaggctgtactcaagagtgaaaaattatcttgcatccaccctctcgctgaacaagtcat agtgataacacactctggccgaaaagggcgttatgccgtggaaccataccatggtaaagtag
tggtgccagagggacatgcaatacccgtccaggactttcaagctctgagtgaaagtgccacc
attgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccacacatggagg agcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcgaat acctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctc acaggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgacc agccgctccttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctg gcatcattaaaagcgcagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgt gcagaaattataagggacgtcaagaaaatgaaagggctggacgtcaatgccagaactgtgga ctcagtgctcttgaatggatgcaaacaccccgtagagaccctgtatattgacgaagcttttg cttgtcatgcaggtactctcagagcgctcatagccattataagacctaaaaaggcagtgctc tgcggggatcccaaacagtgcggtttttttaacatgatgtgcctgaaagtgcattttaacca cgagatttgcacacaagtcttccacaaaagcatctctcgccgttgcactaaatctgtgactt cggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccgaaagagactaag attgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcacttgttt cagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctg cctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcct ctgtacgcacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgt gtggaaaacactagccggcgacccatggataaaaacactgactgccaagtaccctgggaatt tcactgccacgatagaggagtggcaagcagagcatgatgccatcatgaggcacatcttggag agaccggaccctaccgacgtcttccagaataaggcaaacgtgtgttgggccaaggctttagt gccggtgctgaagaccgctggcatagacatgaccactgaacaatggaacactgtggattatt ttgaaacggacaaagctcactcagcagagatagtattgaaccaactatgcgtgaggttcttt ggactcgatctggactccggtctattttctgcacccactgttccgttatccattaggaataa tcactgggataactccccgtcgcctaacatgtacgggctgaataaagaagtggtccgtcagc tctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatgacatgaac actggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgcc tcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagca aattgaagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggtt gactggttgtcagaccggcctgaggctaccttcagagctcggctggatttaggcatcccagg tgatgtgcccaaatatgacataatatttgttaatgtgaggaccccatataaataccatcact atcagcagtgtgaagaccatgccattaagcttagcatgttgaccaagaaagcttgtctgcat ctgaatcccggcggaacctgtgtcagcataggttatggttacgctgacagggccagcgaaag 
catcattggtgctatagcgcggcagttcaagttttcccgggtatgcaaaccgaaatcctcac ttgaagagacggaagttctgtttgtattcattgggtacgatcgcaaggcccgtacgcacaat ccttacaagctttcatcaaccttgaccaacatttatacaggttccagactccacgaagccgg atgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaaggagtgatta taaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataagaaa ttcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgc agctaaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtg acaaacagttggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaag tcagtagcgattccactgttgtccaccggcatcttttccgggaacaaagatcgactaaccca atcattgaaccatttgctgacagctttagacaccactgatgcagatgtagccatatactgca gggacaagaaatgggaaatgactctcaaggaagcagtggctaggagagaagcagtggaggag atatgcatatccgacgactcttcagtgacagaacctgatgcagagctggtgagggtgcatcc gaagagttctttggctggaaggaagggctacagcacaagcgatggcaaaactttctcatatt tggaagggaccaagtttcaccaggcggccaaggatatagcagaaattaatgccatgtggccc gttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagcatgagcagtat taggtcgaaatgccccgtcgaagagtcggaagcctcctcaccacctagcacgctgccttgct tgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaa attactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatcca atgctcccagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatc tcgtggaaacaccaccggtagacgagactccggagccatcggcagagaaccaatccacagag gggacacctgaacaaccaccacttataaccgaggatgagaccaggactagaacgcctgagcc gatcatcatcgaagaggaagaagaggatagcataagtttgctgtcagatggcccgacccacc aggtgctgcaagtcgaggcagacattcacgggccgccctctgtatctagctcatcctggtcc attcctcatgcatccgactttgatgtggacagtttatccatacttgacaccctggagggagc tagcgtgaccagcggggcaacgtcagccgagactaactcttacttcgcaaagagtatggagt ttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctccacatcccgctccg cgcacaagaacaccgtcacttgcacccagcagggcctgctcgagagggatcacgggagaaac cgtgggatacgcggttacacacaatagcgagggcttcttgctatgcaaagttactgacacag taaaaggagaacgggtatcgttccctgtgtgcacgtacatcccggccaccataaactcgaga accagcctggtctccaacccgccaggcgtaaatagggtgattacaagagaggagtttgaggc gttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatcttttcctccgacaccg 
gtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtgttggag aggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtgg agaacatgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggca gaaggaaaagtggagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaa ccgtgccttttcaagccccaaggtcgcagtggaagcctgtaacgccatgttgaaagagaact ttccgactgtggcttcttactgtattattccagagtacgatgcctatttggacatggttgac ggagcttcatgctgcttagacactgccagtttttgccctgcaaagctgcgcagctttccaaa gaaacactcctatttggaacccacaatacgatcggcagtgccttcagcgatccagaacacgc tccagaacgtcctggcagctgccacaaaaagaaattgcaatgtcacgcaaatgagagaattg cccgtattggattcggcggcctttaatgtggaatgcttcaagaaatatgcgtgtaataatga atattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggtaaattaca ttaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaatatg ttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactcc aggaacaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctag caacagcgtatctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgctt ccgaacattcatacactgtttgatatgtcggctgaagactttgacgctattatagccgagca cttccagcctggggattgtgttctggaaactgacatcgcgtcgtttgataaaagtgaggacg acgccatggctctgaccgcgttaatgattctggaagacttaggtgtggacgcagagctgttg acgctgattgaggcggctttcggcgaaatttcatcaatacatttgcccactaaaactaaatt taaattcggagccatgatgaaatctggaatgttcctcacactgtttgtgaacacagtcatta acattgtaatcgcaagcagagtgttgagagaacggctaaccggatcaccatgtgcagcattc attggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcagacaggtgcgc cacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgccttatt tctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagacccc ctaaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacag gagaagggcattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgt gcaaggcagtagaatcaaggtatgaaaccgtaggaacttccatcatagttatggccatgact actctagctagcagtgttaaatcattcagctacctgagaggggcccctataactctctacgg ctaacctgaatggactacgacatagtctagtcgacgccaccatgaggcctggcctgccctcc tacctgatcatcctggccgtgtgcctgttcagccacctgctgtccagcagatacggcgccga 
ggccgtgagcgagcccctggacaaggctttccacctgctgctgaacacctacggcagaccca tccggtttctgcgggagaacaccacccagtgcacctacaacagcagcctgcggaacagcacc gtcgtgagagagaacgccatcagcttcaactttttccagagctacaaccagtactacgtgtt ccacatgcccagatgcctgtttgccggccctctggccgagcagttcctgaaccaggtggacc tgaccgagacactggaaagataccagcagcggctgaatacctacgccctggtgtccaaggac ctggccagctaccggtcctttagccagcagctcaaggctcaggatagcctcggcgagcagcc taccaccgtgccccctcccatcgacctgagcatcccccacgtgtggatgcctccccagacca cccctcacggctggaccgagagccacaccacctccggcctgcacagaccccacttcaaccag acctgcatcctgttcgacggccacgacctgctgtttagcaccgtgaccccctgcctgcacca gggcttctacctgatcgacgagctgagatacgtgaagatcaccctgaccgaggatttcttcg tggtcaccgtgtccatcgacgacgacacccccatgctgctgatcttcggccacctgcccaga gtgctgttcaaggccccctaccagcgggacaacttcatcctgcggcagaccgagaagcacga gctgctggtgctggtcaagaaggaccagctgaaccggcactcctacctgaaggaccccgact tcctggacgccgccctggacttcaactacctggacctgagcgccctgctgagaaacagcttc cacagatacgccgtggacgtgctgaagtccggacggtgccagatgctcgatcggcggaccgt ggagatggccttcgcctatgccctcgccctgttcgccgctgccagacaggaagaggctggcg cccaggtgtcagtgcccagagccctggatagacaggccgccctgctgcagatccaggaattc atgatcacctgcctgagccagaccccccctagaaccaccctgctgctgtaccccacagccgt ggatctggccaagagggccctgtggacccccaaccagatcaccgacatcacaagcctcgtgc ggctcgtgtacatcctgagcaagcagaaccagcagcacctgatcccccagtgggccctgaga cagatcgccgacttcgccctgaagctgcacaagacccatctggccagctttctgagcgcctt cgccaggcaggaactgtacctgatgggcagcctggtccacagcatgctggtgcataccaccg agcggcgggagatcttcatcgtggagacaggcctgtgtagcctggccgagctgtcccacttt acccagctgctggcccaccctcaccacgagtacctgagcgacctgtacaccccctgcagcag cagcggcagacgggaccacagcctggaacggctgaccagactgttccccgatgccaccgtgc ctgctacagtgcctgccgccctgtccatcctgtccaccatgcagcccagcaccctggaaacc ttccccgacctgttctgcctgcccctgggcgagagctttagcgccctgaccgtgtccgagca cgtgtcctacatcgtgaccaatcagtacctgatcaagggcatcagctaccccgtgtccacca cagtcgtgggccagagcctgatcatcacccagaccgacagccagaccaagtgcgagctgacc cggaacatgcacaccacacacagcatcaccgtggccctgaacatcagcctggaaaactgcgc tttctgtcagtctgccctgctggaatacgacgatacccagggcgtgatcaacatcatgtaca 
tgcacgacagcgacgacgtgctgttcgccctggacccctacaacgaggtggtggtgtccagc ccccggacccactacctgatgctgctgaagaacggcaccgtgctggaagtgaccgacgtggt ggtggacgccaccgacagcagactgctgatgatgagcgtgtacgccctgagcgccatcatcg gcatctacctgctgtaccggatgctgaaaacctgctgataatctagacggcgcgcccaccca gcggccgcctataactctctacggctaacctgaatggactacgacatagtctagtcgacgcc accatgtgcagaaggcccgactgcggcttcagcttcagccctggacccgtgatcctgctgtg gtgctgcctgctgctgcctatcgtgtcctctgccgccgtgtctgtggcccctacagccgccg agaaggtgccagccgagtgccccgagctgaccagaagatgcctgctgggcgaggtgttcgag ggcgacaagtacgagagctggctgcggcccctggtcaacgtgaccggcagagatggccccct gagccagctgatccggtacagacccgtgacccccgaggccgccaatagcgtgctgctggacg aggccttcctggataccctggccctgctgtacaacaaccccgaccagctgagagccctgctg accctgctgtccagcgacaccgcccccagatggatgaccgtgatgcggggctacagcgagtg tggagatggcagccctgccgtgtacacctgcgtggacgacctgtgcagaggctacgacctga ccagactgagctacggccggtccatcttcacagagcacgtgctgggcttcgagctggtgccc cccagcctgttcaacgtggtggtggccatccggaacgaggccaccagaaccaacagagccgt gcggctgcctgtgtctacagccgctgcacctgagggcatcacactgttctacggcctgtaca acgccgtgaaagagttctgcctccggcaccagctggatccccccctgctgagacacctggac aagtactacgccggcctgcccccagagctgaagcagaccagagtgaacctgcccgcccacag cagatatggccctcaggccgtggacgccagatgataatctagacggcgcgcccacccaatcg atgtacttccgaggaactcacgtgcataatgcatcaggctggtacattagatccccgcttac cgcgggcaatatagcaacactaaaaactcgatgtacttccgaggaagcgcagtgcataatgc tgcgcagtgttgccacataaccactatattaaccatttatctagcggacgccaaaaactcaa tgtatttctgaggaagcgtggtgcataatgccacgcagcgtctgcataacttttattatttc ttttattaatcaacaaaattttgtttttaacatttcaaaaaaaaaaaaaaaaaaaaaaaaaa aaaaaaaaaaaaaagggtcggcatggcatctccacctcctcgcggtccgacctgggcatccg aaggaggacgcacgtccactcggatggctaagggagagccacgagctcctgtttaaaccagc tccaattcgccctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcg tgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgcca gctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaat ggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgcag cgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttcccttcctttc 
tcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccga tttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgg gccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtg gactcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataa gggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgc gaattttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcgg aacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataac cctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtc gcccttattcccttttttgcggcattttgccttcctgtttttgctcacccagaaacgctggt gaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctca acagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcactttt aaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcg ccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatctta cggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcg gccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacat gggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacg acgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggc gaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgc aggaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccg gtgagcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccctcccgtatc gtagttatctacacgacggggagtcaggcaactatggatgaacgaaatagacagatcgctga gataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatacttt agattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataat ctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaa gatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaa aaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaag gtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttagg ccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccag tggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccg gataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaac gacctacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaag 
ggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggag cttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttga gcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcgg cctttttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcc cctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccg aacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgc ctctccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaa gcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggcttt acactttatgctcccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacag gaaacagctatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaag ctgggtaccggcgcca
pVCR modified vector gH sol-SGP gL-SGP gO (SEQ ID NO: 54):
cgcgtcggctacaattaatacataaccttatgtatcatacacatacgatttaggtgacacta tagatgggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcac gttgacatcgaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttga ggtagaagccaagcaggtcactgataatgaccatgctaatgccagagcgttttcgcatctgg cttcaaaactgatcgaaacggaggtggacccatccgacacgatccttgacattggaagtgcg cccgcccgcagaatgtattctaagcacaagtatcattgtatctgtccgatgagatgtgcgga agatccggacagattgtataagtatgcaactaagctgaagaaaaactgtaaggaaataactg ataaggaattggacaagaaaatgaaggagctcgccgccgtcatgagcgaccctgacctggaa actgagactatgtgcctccacgacgacgagtcgtgtcgctacgaagggcaagtcgctgttta ccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaataagggagtta gagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctggagca tatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggcct atgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatt tgaaaccatccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggac ttactgaggagctggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatg tcggtgtgagactatagttagttgcgacgggtacgtcgttaaaagaatagctatcagtccag gcctgtatgggaagccttcaggctatgctgctacgatgcaccgcgagggattcttgtgctgc aaagtgacagacacattgaacggggagagggtctcttttcccgtgtgcacgtatgtgccagc tacattgtgtgaccaaatgactggcatactggcaacagatgtcagtgcggacgacgcgcaaa aactgctggttgggctcaaccagcgtatagtcgtcaacggtcgcacccagagaaacaccaat
accatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggtgggcaaaggaata taaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtcatggggt gttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaacc atcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacatt ggagatcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctc tcattaccgccgaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgt gaagccgaggagttgcgcgcagctctaccacctttggcagctgatgttgaggagcccactct ggaagccgatgtagacttgatgttacaagaggctggggccggctcagtggagacacctcgtg gcttgataaaggttaccagctacgctggcgaggacaagatcggctcttacgctgtgctttct ccgcaggctgtactcaagagtgaaaaattatcttgcatccaccctctcgctgaacaagtcat agtgataacacactctggccgaaaagggcgttatgccgtggaaccataccatggtaaagtag tggtgccagagggacatgcaatacccgtccaggactttcaagctctgagtgaaagtgccacc attgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccacacatggagg agcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcgaat acctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctc acaggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgacc agccgctccttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctg gcatcattaaaagcgcagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgt gcagaaattataagggacgtcaagaaaatgaaagggctggacgtcaatgccagaactgtgga ctcagtgctcttgaatggatgcaaacaccccgtagagaccctgtatattgacgaagcttttg cttgtcatgcaggtactctcagagcgctcatagccattataagacctaaaaaggcagtgctc tgcggggatcccaaacagtgcggtttttttaacatgatgtgcctgaaagtgcattttaacca cgagatttgcacacaagtcttccacaaaagcatctctcgccgttgcactaaatctgtgactt cggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccgaaagagactaag attgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcacttgttt cagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctg cctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcct ctgtacgcacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgt gtggaaaacactagccggcgacccatggataaaaacactgactgccaagtaccctgggaatt tcactgccacgatagaggagtggcaagcagagcatgatgccatcatgaggcacatcttggag agaccggaccctaccgacgtcttccagaataaggcaaacgtgtgttgggccaaggctttagt 
gccggtgctgaagaccgctggcatagacatgaccactgaacaatggaacactgtggattatt
ttgaaacggacaaagctcactcagcagagatagtattgaaccaactatgcgtgaggttcttt ggactcgatctggactccggtctattttctgcacccactgttccgttatccattaggaataa tcactgggataactccccgtcgcctaacatgtacgggctgaataaagaagtggtccgtcagc tctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatgacatgaac actggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgcc tcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagca aattgaagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggtt gactggttgtcagaccggcctgaggctaccttcagagctcggctggatttaggcatcccagg tgatgtgcccaaatatgacataatatttgttaatgtgaggaccccatataaataccatcact atcagcagtgtgaagaccatgccattaagcttagcatgttgaccaagaaagcttgtctgcat ctgaatcccggcggaacctgtgtcagcataggttatggttacgctgacagggccagcgaaag catcattggtgctatagcgcggcagttcaagttttcccgggtatgcaaaccgaaatcctcac ttgaagagacggaagttctgtttgtattcattgggtacgatcgcaaggcccgtacgcacaat ccttacaagctttcatcaaccttgaccaacatttatacaggttccagactccacgaagccgg atgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaaggagtgatta taaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataagaaa ttcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgc agctaaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtg acaaacagttggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaag tcagtagcgattccactgttgtccaccggcatcttttccgggaacaaagatcgactaaccca atcattgaaccatttgctgacagctttagacaccactgatgcagatgtagccatatactgca gggacaagaaatgggaaatgactctcaaggaagcagtggctaggagagaagcagtggaggag atatgcatatccgacgactcttcagtgacagaacctgatgcagagctggtgagggtgcatcc gaagagttctttggctggaaggaagggctacagcacaagcgatggcaaaactttctcatatt tggaagggaccaagtttcaccaggcggccaaggatatagcagaaattaatgccatgtggccc gttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagcatgagcagtat taggtcgaaatgccccgtcgaagagtcggaagcctcctcaccacctagcacgctgccttgct tgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaa attactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatcca atgctcccagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatc tcgtggaaacaccaccggtagacgagactccggagccatcggcagagaaccaatccacagag 
gggacacctgaacaaccaccacttataaccgaggatgagaccaggactagaacgcctgagcc gatcatcatcgaagaggaagaagaggatagcataagtttgctgtcagatggcccgacccacc aggtgctgcaagtcgaggcagacattcacgggccgccctctgtatctagctcatcctggtcc attcctcatgcatccgactttgatgtggacagtttatccatacttgacaccctggagggagc tagcgtgaccagcggggcaacgtcagccgagactaactcttacttcgcaaagagtatggagt ttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctccacatcccgctccg cgcacaagaacaccgtcacttgcacccagcagggcctgctcgagagggatcacgggagaaac cgtgggatacgcggttacacacaatagcgagggcttcttgctatgcaaagttactgacacag taaaaggagaacgggtatcgttccctgtgtgcacgtacatcccggccaccataaactcgaga accagcctggtctccaacccgccaggcgtaaatagggtgattacaagagaggagtttgaggc gttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatcttttcctccgacaccg gtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtgttggag aggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtgg agaacatgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggca gaaggaaaagtggagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaa ccgtgccttttcaagccccaaggtcgcagtggaagcctgtaacgccatgttgaaagagaact ttccgactgtggcttcttactgtattattccagagtacgatgcctatttggacatggttgac ggagcttcatgctgcttagacactgccagtttttgccctgcaaagctgcgcagctttccaaa gaaacactcctatttggaacccacaatacgatcggcagtgccttcagcgatccagaacacgc tccagaacgtcctggcagctgccacaaaaagaaattgcaatgtcacgcaaatgagagaattg cccgtattggattcggcggcctttaatgtggaatgcttcaagaaatatgcgtgtaataatga atattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggtaaattaca ttaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaatatg ttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactcc aggaacaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctag caacagcgtatctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgctt ccgaacattcatacactgtttgatatgtcggctgaagactttgacgctattatagccgagca cttccagcctggggattgtgttctggaaactgacatcgcgtcgtttgataaaagtgaggacg acgccatggctctgaccgcgttaatgattctggaagacttaggtgtggacgcagagctgttg acgctgattgaggcggctttcggcgaaatttcatcaatacatttgcccactaaaactaaatt 
taaattcggagccatgatgaaatctggaatgttcctcacactgtttgtgaacacagtcatta acattgtaatcgcaagcagagtgttgagagaacggctaaccggatcaccatgtgcagcattc attggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcagacaggtgcgc cacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgccttatt tctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagacccc ctaaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacag gagaagggcattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgt gcaaggcagtagaatcaaggtatgaaaccgtaggaacttccatcatagttatggccatgact actctagctagcagtgttaaatcattcagctacctgagaggggcccctataactctctacgg ctaacctgaatggactacgacatagtctagtcgacgccaccatgaggcctggcctgccctcc tacctgatcatcctggccgtgtgcctgttcagccacctgctgtccagcagatacggcgccga ggccgtgagcgagcccctggacaaggctttccacctgctgctgaacacctacggcagaccca tccggtttctgcgggagaacaccacccagtgcacctacaacagcagcctgcggaacagcacc gtcgtgagagagaacgccatcagcttcaactttttccagagctacaaccagtactacgtgtt ccacatgcccagatgcctgtttgccggccctctggccgagcagttcctgaaccaggtggacc tgaccgagacactggaaagataccagcagcggctgaatacctacgccctggtgtccaaggac ctggccagctaccggtcctttagccagcagctcaaggctcaggatagcctcggcgagcagcc taccaccgtgccccctcccatcgacctgagcatcccccacgtgtggatgcctccccagacca cccctcacggctggaccgagagccacaccacctccggcctgcacagaccccacttcaaccag acctgcatcctgttcgacggccacgacctgctgtttagcaccgtgaccccctgcctgcacca gggcttctacctgatcgacgagctgagatacgtgaagatcaccctgaccgaggatttcttcg tggtcaccgtgtccatcgacgacgacacccccatgctgctgatcttcggccacctgcccaga gtgctgttcaaggccccctaccagcgggacaacttcatcctgcggcagaccgagaagcacga gctgctggtgctggtcaagaaggaccagctgaaccggcactcctacctgaaggaccccgact tcctggacgccgccctggacttcaactacctggacctgagcgccctgctgagaaacagcttc cacagatacgccgtggacgtgctgaagtccggacggtgccagatgctcgatcggcggaccgt ggagatggccttcgcctatgccctcgccctgttcgccgctgccagacaggaagaggctggcg cccaggtgtcagtgcccagagccctggatagacaggccgccctgctgcagatccaggaattc atgatcacctgcctgagccagaccccccctagaaccaccctgctgctgtaccccacagccgt ggatctggccaagagggccctgtggacccccaaccagatcaccgacatcacaagcctcgtgc ggctcgtgtacatcctgagcaagcagaaccagcagcacctgatcccccagtgggccctgaga 
cagatcgccgacttcgccctgaagctgcacaagacccatctggccagctttctgagcgcctt cgccaggcaggaactgtacctgatgggcagcctggtccacagcatgctggtgcataccaccg agcggcgggagatcttcatcgtggagacaggcctgtgtagcctggccgagctgtcccacttt acccagctgctggcccaccctcaccacgagtacctgagcgacctgtacaccccctgcagcag cagcggcagacgggaccacagcctggaacggctgaccagactgttccccgatgccaccgtgc ctgctacagtgcctgccgccctgtccatcctgtccaccatgcagcccagcaccctggaaacc ttccccgacctgttctgcctgcccctgggcgagagctttagcgccctgaccgtgtccgagca cgtgtcctacatcgtgaccaatcagtacctgatcaagggcatcagctaccccgtgtccacca cagtcgtgggccagagcctgatcatcacccagaccgacagccagaccaagtgcgagctgacc cggaacatgcacaccacacacagcatcaccgtggccctgaacatcagcctggaaaactgcgc tttctgtcagtctgccctgctggaatacgacgatacccagggcgtgatcaacatcatgtaca tgcacgacagcgacgacgtgctgttcgccctggacccctacaacgaggtggtggtgtccagc ccccggacccactacctgatgctgctgaagaacggcaccgtgctggaagtgaccgacgtggt ggtggacgccaccgactgataatctagacggcgcgcccacccagcggccgcctataactctc tacggctaacctgaatggactacgacatagtctagtcgacgccaccatgtgcagaaggcccg actgcggcttcagcttcagccctggacccgtgatcctgctgtggtgctgcctgctgctgcct atcgtgtcctctgccgccgtgtctgtggcccctacagccgccgagaaggtgccagccgagtg ccccgagctgaccagaagatgcctgctgggcgaggtgttcgagggcgacaagtacgagagct ggctgcggcccctggtcaacgtgaccggcagagatggccccctgagccagctgatccggtac agacccgtgacccccgaggccgccaatagcgtgctgctggacgaggccttcctggataccct ggccctgctgtacaacaaccccgaccagctgagagccctgctgaccctgctgtccagcgaca ccgcccccagatggatgaccgtgatgcggggctacagcgagtgtggagatggcagccctgcc gtgtacacctgcgtggacgacctgtgcagaggctacgacctgaccagactgagctacggccg gtccatcttcacagagcacgtgctgggcttcgagctggtgccccccagcctgttcaacgtgg tggtggccatccggaacgaggccaccagaaccaacagagccgtgcggctgcctgtgtctaca gccgctgcacctgagggcatcacactgttctacggcctgtacaacgccgtgaaagagttctg cctccggcaccagctggatccccccctgctgagacacctggacaagtactacgccggcctgc ccccagagctgaagcagaccagagtgaacctgcccgcccacagcagatatggccctcaggcc gtggacgccagatgataatctagacggcgcgcccacccaatcgatctataactctctacggc taacctgaatggactacgacatagtctagtcgacgccaccatgggcaagaaagaaatgatca tggtcaagggcatccccaagatcatgctgctgattagcatcacctttctgctgctgtccctg 
atcaactgcaacgtgctggtcaacagccggggcaccagaagatcctggccctacaccgtgct gtcctaccggggcaaagagatcctgaagaagcagaaagaggacatcctgaagcggctgatga gcaccagcagcgacggctaccggttcctgatgtaccccagccagcagaaattccacgccatc gtgatcagcatggacaagttcccccaggactacatcctggccggacccatccggaacgacag catcacccacatgtggttcgacttctacagcacccagctgcggaagcccgccaaatacgtgt acagcgagtacaaccacaccgcccacaagatcaccctgaggcctcccccttgtggcaccgtg cccagcatgaactgcctgagcgagatgctgaacgtgtccaagcggaacgacaccggcgagaa gggctgcggcaacttcaccaccttcaaccccatgttcttcaacgtgccccggtggaacacca agctgtacatcggcagcaacaaagtgaacgtggacagccagaccatctactttctgggcctg accgccctgctgctgagatacgcccagcggaactgcacccggtccttctacctggtcaacgc catgagccggaacctgttccgggtgcccaagtacatcaacggcaccaagctgaagaacacca tgcggaagctgaagcggaagcaggccctggtcaaagagcagccccagaagaagaacaagaag tcccagagcaccaccaccccctacctgagctacaccacctccaccgccttcaacgtgaccac caacgtgacctacagcgccacagccgccgtgaccagagtggccacaagcaccaccggctacc ggcccgacagcaactttatgaagtccatcatggccacccagctgagagatctggccacctgg gtgtacaccaccctgcggtacagaaacgagcccttctgcaagcccgaccggaacagaaccgc cgtgagcgagttcatgaagaatacccacgtgctgatcagaaacgagacaccctacaccatct acggcaccctggacatgagcagcctgtactacaacgagacaatgagcgtggagaacgagaca gccagcgacaacaacgaaaccacccccacctcccccagcacccggttccagcggaccttcat cgaccccctgtgggactacctggacagcctgctgttcctggacaagatccggaacttcagcc tgcagctgcccgcctacggcaatctgaccccccctgagcacagaagggccgccaacctgagc accctgaacagcctgtggtggtggagccagtgataatctagacggcgcgcccacccaccgcg ggcaatatagcaacactaaaaactcgatgtacttccgaggaagcgcagtgcataatgctgcg cagtgttgccacataaccactatattaaccatttatctagcggacgccaaaaactcaatgta tttctgaggaagcgtggtgcataatgccacgcagcgtctgcataacttttattatttctttt attaatcaacaaaattttgtttttaacatttcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa aaaaaaaaaagggtcggcatggcatctccacctcctcgcggtccgacctgggcatccgaagg aggacgcacgtccactcggatggctaagggagagccacgagctcctgtttaaaccagctcca attcgccctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgac tgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctg gcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcg 
aatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtg accgctacacttgccagcgccctagcgcccgctcctttcgctttcttcccttcctttctcgc cacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgattta gtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggcca tcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggact cttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataaggga ttttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaat tttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacc cctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctg ataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgccc ttattcccttttttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaa gtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacag cggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaag ttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgc atacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacgga tggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggcca acttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggg gatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacga gcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaac tacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcagga ccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtga gcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgtag ttatctacacgacggggagtcaggcaactatggatgaacgaaatagacagatcgctgagata ggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagat tgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctca tgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatc aaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaacc accgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaa ctggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttaggccac cacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggc tgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggata 
aggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacc tacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggag aaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttc cagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgt cgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctt tttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctg attctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacg accgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctct ccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgg gcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttacac tttatgctcccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaa cagctatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctgg gtaccggcgcca pVCR modified vector gH FL-SGP gL-SGP gO (SEQ ID NO: 55): cgcgtcggctacaattaatacataaccttatgtatcatacacatacgatttaggtgacacta tagatgggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcac gttgacatcgaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttga ggtagaagccaagcaggtcactgataatgaccatgctaatgccagagcgttttcgcatctgg cttcaaaactgatcgaaacggaggtggacccatccgacacgatccttgacattggaagtgcg cccgcccgcagaatgtattctaagcacaagtatcattgtatctgtccgatgagatgtgcgga agatccggacagattgtataagtatgcaactaagctgaagaaaaactgtaaggaaataactg ataaggaattggacaagaaaatgaaggagctcgccgccgtcatgagcgaccctgacctggaa actgagactatgtgcctccacgacgacgagtcgtgtcgctacgaagggcaagtcgctgttta ccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaataagggagtta gagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctggagca tatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggcct atgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatt tgaaaccatccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggac ttactgaggagctggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatg tcggtgtgagactatagttagttgcgacgggtacgtcgttaaaagaatagctatcagtccag gcctgtatgggaagccttcaggctatgctgctacgatgcaccgcgagggattcttgtgctgc aaagtgacagacacattgaacggggagagggtctcttttcccgtgtgcacgtatgtgccagc 
tacattgtgtgaccaaatgactggcatactggcaacagatgtcagtgcggacgacgcgcaaa aactgctggttgggctcaaccagcgtatagtcgtcaacggtcgcacccagagaaacaccaat accatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggtgggcaaaggaata taaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtcatggggt gttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaacc atcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacatt ggagatcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctc tcattaccgccgaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgt gaagccgaggagttgcgcgcagctctaccacctttggcagctgatgttgaggagcccactct ggaagccgatgtagacttgatgttacaagaggctggggccggctcagtggagacacctcgtg gcttgataaaggttaccagctacgctggcgaggacaagatcggctcttacgctgtgctttct ccgcaggctgtactcaagagtgaaaaattatcttgcatccaccctctcgctgaacaagtcat agtgataacacactctggccgaaaagggcgttatgccgtggaaccataccatggtaaagtag tggtgccagagggacatgcaatacccgtccaggactttcaagctctgagtgaaagtgccacc attgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccacacatggagg agcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcgaat acctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctc acaggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgacc agccgctccttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctg gcatcattaaaagcgcagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgt gcagaaattataagggacgtcaagaaaatgaaagggctggacgtcaatgccagaactgtgga ctcagtgctcttgaatggatgcaaacaccccgtagagaccctgtatattgacgaagcttttg cttgtcatgcaggtactctcagagcgctcatagccattataagacctaaaaaggcagtgctc tgcggggatcccaaacagtgcggtttttttaacatgatgtgcctgaaagtgcattttaacca cgagatttgcacacaagtcttccacaaaagcatctctcgccgttgcactaaatctgtgactt cggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccgaaagagactaag attgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcacttgttt cagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctg cctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcct ctgtacgcacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgt gtggaaaacactagccggcgacccatggataaaaacactgactgccaagtaccctgggaatt 
tcactgccacgatagaggagtggcaagcagagcatgatgccatcatgaggcacatcttggag
agaccggaccctaccgacgtcttccagaataaggcaaacgtgtgttgggccaaggctttagt gccggtgctgaagaccgctggcatagacatgaccactgaacaatggaacactgtggattatt ttgaaacggacaaagctcactcagcagagatagtattgaaccaactatgcgtgaggttcttt ggactcgatctggactccggtctattttctgcacccactgttccgttatccattaggaataa tcactgggataactccccgtcgcctaacatgtacgggctgaataaagaagtggtccgtcagc tctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatgacatgaac actggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgcc tcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagca aattgaagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggtt gactggttgtcagaccggcctgaggctaccttcagagctcggctggatttaggcatcccagg tgatgtgcccaaatatgacataatatttgttaatgtgaggaccccatataaataccatcact atcagcagtgtgaagaccatgccattaagcttagcatgttgaccaagaaagcttgtctgcat ctgaatcccggcggaacctgtgtcagcataggttatggttacgctgacagggccagcgaaag catcattggtgctatagcgcggcagttcaagttttcccgggtatgcaaaccgaaatcctcac ttgaagagacggaagttctgtttgtattcattgggtacgatcgcaaggcccgtacgcacaat ccttacaagctttcatcaaccttgaccaacatttatacaggttccagactccacgaagccgg atgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaaggagtgatta taaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataagaaa ttcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgc agctaaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtg acaaacagttggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaag tcagtagcgattccactgttgtccaccggcatcttttccgggaacaaagatcgactaaccca atcattgaaccatttgctgacagctttagacaccactgatgcagatgtagccatatactgca gggacaagaaatgggaaatgactctcaaggaagcagtggctaggagagaagcagtggaggag atatgcatatccgacgactcttcagtgacagaacctgatgcagagctggtgagggtgcatcc gaagagttctttggctggaaggaagggctacagcacaagcgatggcaaaactttctcatatt tggaagggaccaagtttcaccaggcggccaaggatatagcagaaattaatgccatgtggccc gttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagcatgagcagtat taggtcgaaatgccccgtcgaagagtcggaagcctcctcaccacctagcacgctgccttgct tgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaa attactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatcca 
atgctcccagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatc tcgtggaaacaccaccggtagacgagactccggagccatcggcagagaaccaatccacagag gggacacctgaacaaccaccacttataaccgaggatgagaccaggactagaacgcctgagcc gatcatcatcgaagaggaagaagaggatagcataagtttgctgtcagatggcccgacccacc aggtgctgcaagtcgaggcagacattcacgggccgccctctgtatctagctcatcctggtcc attcctcatgcatccgactttgatgtggacagtttatccatacttgacaccctggagggagc tagcgtgaccagcggggcaacgtcagccgagactaactcttacttcgcaaagagtatggagt ttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctccacatcccgctccg cgcacaagaacaccgtcacttgcacccagcagggcctgctcgagagggatcacgggagaaac cgtgggatacgcggttacacacaatagcgagggcttcttgctatgcaaagttactgacacag taaaaggagaacgggtatcgttccctgtgtgcacgtacatcccggccaccataaactcgaga accagcctggtctccaacccgccaggcgtaaatagggtgattacaagagaggagtttgaggc gttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatcttttcctccgacaccg gtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtgttggag aggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtgg agaacatgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggca gaaggaaaagtggagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaa ccgtgccttttcaagccccaaggtcgcagtggaagcctgtaacgccatgttgaaagagaact ttccgactgtggcttcttactgtattattccagagtacgatgcctatttggacatggttgac ggagcttcatgctgcttagacactgccagtttttgccctgcaaagctgcgcagctttccaaa gaaacactcctatttggaacccacaatacgatcggcagtgccttcagcgatccagaacacgc tccagaacgtcctggcagctgccacaaaaagaaattgcaatgtcacgcaaatgagagaattg cccgtattggattcggcggcctttaatgtggaatgcttcaagaaatatgcgtgtaataatga atattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggtaaattaca ttaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaatatg ttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactcc aggaacaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctag caacagcgtatctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgctt ccgaacattcatacactgtttgatatgtcggctgaagactttgacgctattatagccgagca cttccagcctggggattgtgttctggaaactgacatcgcgtcgtttgataaaagtgaggacg 
acgccatggctctgaccgcgttaatgattctggaagacttaggtgtggacgcagagctgttg acgctgattgaggcggctttcggcgaaatttcatcaatacatttgcccactaaaactaaatt taaattcggagccatgatgaaatctggaatgttcctcacactgtttgtgaacacagtcatta acattgtaatcgcaagcagagtgttgagagaacggctaaccggatcaccatgtgcagcattc attggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcagacaggtgcgc cacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgccttatt tctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagacccc ctaaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacag gagaagggcattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgt gcaaggcagtagaatcaaggtatgaaaccgtaggaacttccatcatagttatggccatgact actctagctagcagtgttaaatcattcagctacctgagaggggcccctataactctctacgg ctaacctgaatggactacgacatagtctagtcgacgccaccatgaggcctggcctgccctcc tacctgatcatcctggccgtgtgcctgttcagccacctgctgtccagcagatacggcgccga ggccgtgagcgagcccctggacaaggctttccacctgctgctgaacacctacggcagaccca tccggtttctgcgggagaacaccacccagtgcacctacaacagcagcctgcggaacagcacc gtcgtgagagagaacgccatcagcttcaactttttccagagctacaaccagtactacgtgtt ccacatgcccagatgcctgtttgccggccctctggccgagcagttcctgaaccaggtggacc tgaccgagacactggaaagataccagcagcggctgaatacctacgccctggtgtccaaggac ctggccagctaccggtcctttagccagcagctcaaggctcaggatagcctcggcgagcagcc taccaccgtgccccctcccatcgacctgagcatcccccacgtgtggatgcctccccagacca cccctcacggctggaccgagagccacaccacctccggcctgcacagaccccacttcaaccag acctgcatcctgttcgacggccacgacctgctgtttagcaccgtgaccccctgcctgcacca gggcttctacctgatcgacgagctgagatacgtgaagatcaccctgaccgaggatttcttcg tggtcaccgtgtccatcgacgacgacacccccatgctgctgatcttcggccacctgcccaga gtgctgttcaaggccccctaccagcgggacaacttcatcctgcggcagaccgagaagcacga gctgctggtgctggtcaagaaggaccagctgaaccggcactcctacctgaaggaccccgact tcctggacgccgccctggacttcaactacctggacctgagcgccctgctgagaaacagcttc cacagatacgccgtggacgtgctgaagtccggacggtgccagatgctcgatcggcggaccgt ggagatggccttcgcctatgccctcgccctgttcgccgctgccagacaggaagaggctggcg cccaggtgtcagtgcccagagccctggatagacaggccgccctgctgcagatccaggaattc atgatcacctgcctgagccagaccccccctagaaccaccctgctgctgtaccccacagccgt 
ggatctggccaagagggccctgtggacccccaaccagatcaccgacatcacaagcctcgtgc ggctcgtgtacatcctgagcaagcagaaccagcagcacctgatcccccagtgggccctgaga cagatcgccgacttcgccctgaagctgcacaagacccatctggccagctttctgagcgcctt cgccaggcaggaactgtacctgatgggcagcctggtccacagcatgctggtgcataccaccg agcggcgggagatcttcatcgtggagacaggcctgtgtagcctggccgagctgtcccacttt acccagctgctggcccaccctcaccacgagtacctgagcgacctgtacaccccctgcagcag cagcggcagacgggaccacagcctggaacggctgaccagactgttccccgatgccaccgtgc ctgctacagtgcctgccgccctgtccatcctgtccaccatgcagcccagcaccctggaaacc ttccccgacctgttctgcctgcccctgggcgagagctttagcgccctgaccgtgtccgagca cgtgtcctacatcgtgaccaatcagtacctgatcaagggcatcagctaccccgtgtccacca cagtcgtgggccagagcctgatcatcacccagaccgacagccagaccaagtgcgagctgacc cggaacatgcacaccacacacagcatcaccgtggccctgaacatcagcctggaaaactgcgc tttctgtcagtctgccctgctggaatacgacgatacccagggcgtgatcaacatcatgtaca tgcacgacagcgacgacgtgctgttcgccctggacccctacaacgaggtggtggtgtccagc ccccggacccactacctgatgctgctgaagaacggcaccgtgctggaagtgaccgacgtggt ggtggacgccaccgacagcagactgctgatgatgagcgtgtacgccctgagcgccatcatcg gcatctacctgctgtaccggatgctgaaaacctgctgataatctagacggcgcgcccaccca gcggccgcctataactctctacggctaacctgaatggactacgacatagtctagtcgacgcc accatgtgcagaaggcccgactgcggcttcagcttcagccctggacccgtgatcctgctgtg gtgctgcctgctgctgcctatcgtgtcctctgccgccgtgtctgtggcccctacagccgccg agaaggtgccagccgagtgccccgagctgaccagaagatgcctgctgggcgaggtgttcgag ggcgacaagtacgagagctggctgcggcccctggtcaacgtgaccggcagagatggccccct gagccagctgatccggtacagacccgtgacccccgaggccgccaatagcgtgctgctggacg aggccttcctggataccctggccctgctgtacaacaaccccgaccagctgagagccctgctg accctgctgtccagcgacaccgcccccagatggatgaccgtgatgcggggctacagcgagtg tggagatggcagccctgccgtgtacacctgcgtggacgacctgtgcagaggctacgacctga ccagactgagctacggccggtccatcttcacagagcacgtgctgggcttcgagctggtgccc cccagcctgttcaacgtggtggtggccatccggaacgaggccaccagaaccaacagagccgt gcggctgcctgtgtctacagccgctgcacctgagggcatcacactgttctacggcctgtaca acgccgtgaaagagttctgcctccggcaccagctggatccccccctgctgagacacctggac aagtactacgccggcctgcccccagagctgaagcagaccagagtgaacctgcccgcccacag 
cagatatggccctcaggccgtggacgccagatgataatctagacggcgcgcccacccaatcg atctataactctctacggctaacctgaatggactacgacatagtctagtcgacgccaccatg ggcaagaaagaaatgatcatggtcaagggcatccccaagatcatgctgctgattagcatcac ctttctgctgctgtccctgatcaactgcaacgtgctggtcaacagccggggcaccagaagat cctggccctacaccgtgctgtcctaccggggcaaagagatcctgaagaagcagaaagaggac atcctgaagcggctgatgagcaccagcagcgacggctaccggttcctgatgtaccccagcca gcagaaattccacgccatcgtgatcagcatggacaagttcccccaggactacatcctggccg gacccatccggaacgacagcatcacccacatgtggttcgacttctacagcacccagctgcgg aagcccgccaaatacgtgtacagcgagtacaaccacaccgcccacaagatcaccctgaggcc tcccccttgtggcaccgtgcccagcatgaactgcctgagcgagatgctgaacgtgtccaagc ggaacgacaccggcgagaagggctgcggcaacttcaccaccttcaaccccatgttcttcaac gtgccccggtggaacaccaagctgtacatcggcagcaacaaagtgaacgtggacagccagac catctactttctgggcctgaccgccctgctgctgagatacgcccagcggaactgcacccggt ccttctacctggtcaacgccatgagccggaacctgttccgggtgcccaagtacatcaacggc accaagctgaagaacaccatgcggaagctgaagcggaagcaggccctggtcaaagagcagcc ccagaagaagaacaagaagtcccagagcaccaccaccccctacctgagctacaccacctcca ccgccttcaacgtgaccaccaacgtgacctacagcgccacagccgccgtgaccagagtggcc acaagcaccaccggctaccggcccgacagcaactttatgaagtccatcatggccacccagct gagagatctggccacctgggtgtacaccaccctgcggtacagaaacgagcccttctgcaagc ccgaccggaacagaaccgccgtgagcgagttcatgaagaatacccacgtgctgatcagaaac gagacaccctacaccatctacggcaccctggacatgagcagcctgtactacaacgagacaat gagcgtggagaacgagacagccagcgacaacaacgaaaccacccccacctcccccagcaccc ggttccagcggaccttcatcgaccccctgtgggactacctggacagcctgctgttcctggac aagatccggaacttcagcctgcagctgcccgcctacggcaatctgaccccccctgagcacag aagggccgccaacctgagcaccctgaacagcctgtggtggtggagccagtgataatctagac ggcgcgcccacccaccgcgggcaatatagcaacactaaaaactcgatgtacttccgaggaag cgcagtgcataatgctgcgcagtgttgccacataaccactatattaaccatttatctagcgg acgccaaaaactcaatgtatttctgaggaagcgtggtgcataatgccacgcagcgtctgcat aacttttattatttcttttattaatcaacaaaattttgtttttaacatttcaaaaaaaaaaa aaaaaaaaaaaaaaaaaaaaaaaaaaaaagggtcggcatggcatctccacctcctcgcggtc cgacctgggcatccgaaggaggacgcacgtccactcggatggctaagggagagccacgagct 
cctgtttaaaccagctccaattcgccctatagtgagtcgtattacgcgcgctcactggccgt cgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcac atccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacag ttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgt ggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctt tcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctc cctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtga tggttcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtcca cgttctttaatagtggactcttgttccaaactggaacaacactcaaccctatctcggtctat tcttttgatttataagggattttgccgatttcggcctattggttaaaaaatgagctgattta acaaaaatttaacgcgaattttaacaaaatattaacgcttacaatttaggtggcacttttcg gggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgc tcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgagtatt caacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgtttttgctca cccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttaca tcgaactggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttcca atgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggca agagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtca cagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatg agtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgc ttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatg aagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgc aaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatgga ggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttattgctg ataaatctggagccggtgagcgtgggtctcgcggtatcattgcagcactggggccagatggt aagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaa tagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagttt actcatatatactttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaag atcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtc agaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgct gcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctacca 
actctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgttcttctagt gtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgc taatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactca agacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcc cagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagctatgagaaagcg ccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacagga gagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcg ccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaa acgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcacatgttc tttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgatac cgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcc caatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacgacagg tttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcatta ggcaccccaggctttacactttatgctcccggctcgtatgttgtgtggaattgtgagcggat aacaatttcacacaggaaacagctatgaccatgattacgccaagcgcgcaattaaccctcac taaagggaacaaaagctgggtaccggcgcca A526 Vector: SGP-gH-SGP-gL-SGP-UL128-2A-UL130-2Amod-UL131 (SEQ ID NO: 56): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G 
TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTAGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G
TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T 
TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G 
CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C 
ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGGCCTGGCC- T GCCCTCCTACCTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCG- T GAGCGAGCCCCTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGA- A CACCACCCAGTGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACT- T TTTCCAGAGCTACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGT- T CCTGAACCAGGTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCA- A GGACCTGGCCAGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCG- T GCCCCCTCCCATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGA- G CCACACCACCTCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGT- T TAGCACCGTGACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGA- C CGAGGATTTCTTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCA- G AGTGCTGTTCAAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGC- T GGTCAAGAAGGACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCA- A CTACCTGGACCTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGT- G CCAGATGCTCGATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGG- A AGAGGCTGGCGCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCA- T GATCACCTGCCTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGA- G GGCCCTGTGGACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGA- A CCAGCAGCACCTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATC- T GGCCAGCTTTCTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGC- A TACCACCGAGCGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCC- A GCTGCTGGCCCACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACC- A CAGCCTGGAACGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCC- T 
GTCCACCATGCAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCG- C CCTGACCGTGTCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGT- C CACCACAGTCGTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACA- T GCACACCACACACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGC- T GGAATACGACGATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGG- A CCCCTACAACGAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGG- A AGTGACCGACGTGGTGGTGGACGCCACCGACAGCAGACTGCTGATGATGAGCGTGTACGCCCTGAGCGCCATCA- T CGGCATCTACCTGCTGTACCGGATGCTGAAAACCTGCTGATAATCTAGAGGCCCCTATAACTCTCTACGGCTAA- C CTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGTGCAGAAGGCCCGACTGCGGCTTCAGCTTCAGCCCTG- G ACCCGTGATCCTGCTGTGGTGCTGCCTGCTGCTGCCTATCGTGTCCTCTGCCGCCGTGTCTGTGGCCCCTACAG- C CGCCGAGAAGGTGCCAGCCGAGTGCCCCGAGCTGACCAGAAGATGCCTGCTGGGCGAGGTGTTCGAGGGCGACA- A GTACGAGAGCTGGCTGCGGCCCCTGGTCAACGTGACCGGCAGAGATGGCCCCCTGAGCCAGCTGATCCGGTACA- G ACCCGTGACCCCCGAGGCCGCCAATAGCGTGCTGCTGGACGAGGCCTTCCTGGATACCCTGGCCCTGCTGTACA- A CAACCCCGACCAGCTGAGAGCCCTGCTGACCCTGCTGTCCAGCGACACCGCCCCCAGATGGATGACCGTGATGC- G GGGCTACAGCGAGTGTGGAGATGGCAGCCCTGCCGTGTACACCTGCGTGGACGACCTGTGCAGAGGCTACGACC- T GACCAGACTGAGCTACGGCCGGTCCATCTTCACAGAGCACGTGCTGGGCTTCGAGCTGGTGCCCCCCAGCCTGT- T CAACGTGGTGGTGGCCATCCGGAACGAGGCCACCAGAACCAACAGAGCCGTGCGGCTGCCTGTGTCTACAGCCG- C TGCACCTGAGGGCATCACACTGTTCTACGGCCTGTACAACGCCGTGAAAGAGTTCTGCCTCCGGCACCAGCTGG- A TCCCCCCCTGCTGAGACACCTGGACAAGTACTACGCCGGCCTGCCCCCAGAGCTGAAGCAGACCAGAGTGAACC- T GCCCGCCCACAGCAGATATGGCCCTCAGGCCGTGGACGCCAGATGATAACGCCGGCGGCCCCTATAACTCTCTA- C GGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGCCCCAAGGACCTGACCCCCTTCCTGACA- A CCCTGTGGCTGCTCCTGGGCCATAGCAGAGTGCCTAGAGTGCGGGCCGAGGAATGCTGCGAGTTCATCAACGTG- A ACCACCCCCCCGAGCGGTGCTACGACTTCAAGATGTGCAACCGGTTCACCGTGGCCCTGAGATGCCCCGACGGC- G AAGTGTGCTACAGCCCCGAGAAAACCGCCGAGATCCGGGGCATCGTGACCACCATGACCCACAGCCTGACCCGG- C AGGTGGTGCACAACAAGCTGACCAGCTGCAACTACAACCCCCTGTACCTGGAAGCCGACGGCCGGATCAGATGC- G 
GCAAAGTGAACGACAAGGCCCAGTACCTGCTGGGAGCCGCCGGAAGCGTGCCCTACCGGTGGATCAACCTGGAA-
T ACGACAAGATCACCCGGATCGTGGGCCTGGACCAGTACCTGGAAAGCGTGAAGAAGCACAAGCGGCTGGACGTG- T GCAGAGCCAAGATGGGCTACATGCTGCAGCTGTTGAATTTTGACCTTCTTAAGCTTGCGGGAGACGTCGAGTCC- A ACCCCGGGCCCATGCTGCGGCTGCTGCTGAGACACCACTTCCACTGCCTGCTGCTGTGTGCCGTGTGGGCCACC- C CTTGTCTGGCCAGCCCTTGGAGCACCCTGACCGCCAACCAGAACCCTAGCCCCCCTTGGTCCAAGCTGACCTAC- A GCAAGCCCCACGACGCCGCCACCTTCTACTGCCCCTTTCTGTACCCCAGCCCTCCCAGAAGCCCCCTGCAGTTC- A GCGGCTTCCAGAGAGTGTCCACCGGCCCTGAGTGCCGGAACGAGACACTGTACCTGCTGTACAACCGGGAGGGC- C AGACACTGGTGGAGCGGAGCAGCACCTGGGTGAAAAAAGTGATCTGGTATCTGAGCGGCCGGAACCAGACCATC- C TGCAGCGGATGCCCAGAACCGCCAGCAAGCCCAGCGACGGCAACGTGCAGATCAGCGTGGAGGACGCCAAAATC- T TCGGCGCCCACATGGTGCCCAAGCAGACCAAGCTGCTGAGATTCGTGGTCAACGACGGCACCAGATATCAGATG- T GCGTGATGAAGCTGGAAAGCTGGGCCCACGTGTTCCGGGACTACTCCGTGAGCTTCCAGGTCCGGCTGACCTTC- A CCGAGGCCAACAACCAGACCTACACCTTCTGCACCCACCCCAACCTGATCGTGCTGCTGAACTTCGACCTGCTG- A AGCTGGCCGGCGACGTGGAGAGCAACCCCGGCCCCCATATGCGGCTGTGCAGAGTGTGGCTGTCCGTGTGCCTG- T GTGCCGTGGTGCTGGGCCAGTGCCAGAGAGAGACAGCCGAGAAGAACGACTACTACCGGGTGCCCCACTACTGG- G ATGCCTGCAGCAGAGCCCTGCCCGACCAGACCCGGTACAAATACGTGGAGCAGCTCGTGGACCTGACCCTGAAC- T ACCACTACGACGCCAGCCACGGCCTGGACAACTTCGACGTGCTGAAGCGGATCAACGTGACCGAGGTGTCCCTG- C TGATCAGCGACTTCCGGCGGCAGAACAGAAGAGGCGGCACCAACAAGCGGACCACCTTCAACGCCGCTGGCTCT- C TGGCCCCTCACGCCAGATCCCTGGAATTCAGCGTGCGGCTGTTCGCCAACTGATAACGTTGCATCCTGCAGGAT- A CAGCAGCAATTGGCAAGCTGCTTACATAGAACTCGCGGCGATTGGCATGCCGCCTTAAAATTTTTATTTTATTT- T TCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA- G GGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGAAGGAGGACGCACGTCCACTCGGATGG- C TAAGGGAGAGCCACGTTTAAACGCTAGAGCAAGACGTTTCCCGTTGAATATGGCTCATAACACCCCTTGTATTA- C TGTTTATGTAAGCAGACAGTTTTATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCAGAGATTT- T GAGACACAACGTGGCTTTGTTGAATAAATCGAACTTTTGCTGAGTTGAAGGATCAGATCACGCATCTTCCCGAC- A ACGCAGACCGTTCCGTGGCAAAGCAAAAGTTCAAAATCACCAACTGGTCCACCTACAACAAAGCTCTCATCAAC- C GTGGCTCCCTCACTTTCTGGCTGGATGATGGGGCGATTCAGGCCTGGTATGAGTCAGCAACACCTTCTTCACGA- G 
GCAGACCTCAGCGCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTC- A TGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCT- C ACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGA- A GATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCT- G ACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTT- C CCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGC- G TTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCC- C CCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCA- C CACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGA- A AGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTT- C GAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGA- A GATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATC- A AAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAAC- T TGGTCTGACAGTTATTAGAAAAATTCATCCAGCAGACGATAAAACGCAATACGCTGGCTATCCGGTGCCGCAAT- G CCATACAGCACCAGAAAACGATCCGCCCATTCGCCGCCCAGTTCTTCCGCAATATCACGGGTGGCCAGCGCAAT- A TCCTGATAACGATCCGCCACGCCCAGACGGCCGCAATCAATAAAGCCGCTAAAACGGCCATTTTCCACCATAAT- G TTCGGCAGGCACGCATCACCATGGGTCACCACCAGATCTTCGCCATCCGGCATGCTCGCTTTCAGACGCGCAAA- C AGCTCTGCCGGTGCCAGGCCCTGATGTTCTTCATCCAGATCATCCTGATCCACCAGGCCCGCTTCCATACGGGT- A CGCGCACGTTCAATACGATGTTTCGCCTGATGATCAAACGGACAGGTCGCCGGGTCCAGGGTATGCAGACGACG- C ATGGCATCCGCCATAATGCTCACTTTTTCTGCCGGCGCCAGATGGCTAGACAGCAGATCCTGACCCGGCACTTC- G CCCAGCAGCAGCCAATCACGGCCCGCTTCGGTCACCACATCCAGCACCGCCGCACACGGAACACCGGTGGTGGC- C AGCCAGCTCAGACGCGCCGCTTCATCCTGCAGCTCGTTCAGCGCACCGCTCAGATCGGTTTTCACAAACAGCAC- C GGACGACCCTGCGCGCTCAGACGAAACACCGCCGCATCAGAGCAGCCAATGGTCTGCTGCGCCCAATCATAGCC- A AACAGACGTTCCACCCACGCTGCCGGGCTACCCGCATGCAGGCCATCCTGTTCAATCATACTCTTCCTTTTTCA- A TATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACA- A 
ATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCG- T TAAATTTTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAA- T AGACCGAGATAGGGTTGAGTGGCCGCTACAGGGCGCTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGG- G CGTTTCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGG- T AACGCCAGGGTTTTCCCAGTCACACGCGTAATACGACTCACTATAG A527 Vector: SGP-gH-SGP-gL-SGP-UL128-EMCV-UL130-EV71-UL131 (SEQ ID NO: 57): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G 
ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTAGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G 
CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A
TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T 
TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGGCCTGGCC- T GCCCTCCTACCTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCG- T GAGCGAGCCCCTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGA- A CACCACCCAGTGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACT- T TTTCCAGAGCTACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGT- T CCTGAACCAGGTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCA- A GGACCTGGCCAGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCG- T GCCCCCTCCCATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGA- G CCACACCACCTCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGT- T TAGCACCGTGACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGA- C CGAGGATTTCTTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCA- G AGTGCTGTTCAAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGC- T GGTCAAGAAGGACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCA- A CTACCTGGACCTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGT- G CCAGATGCTCGATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGG- A AGAGGCTGGCGCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCA- T GATCACCTGCCTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGA- G GGCCCTGTGGACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGA- A CCAGCAGCACCTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATC- T 
GGCCAGCTTTCTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGC- A TACCACCGAGCGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCC- A GCTGCTGGCCCACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACC- A CAGCCTGGAACGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCC- T GTCCACCATGCAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCG- C CCTGACCGTGTCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGT- C CACCACAGTCGTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACA- T GCACACCACACACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGC- T GGAATACGACGATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGG- A CCCCTACAACGAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGG- A AGTGACCGACGTGGTGGTGGACGCCACCGACAGCAGACTGCTGATGATGAGCGTGTACGCCCTGAGCGCCATCA- T CGGCATCTACCTGCTGTACCGGATGCTGAAAACCTGCTGATAATCTAGAGGCCCCTATAACTCTCTACGGCTAA- C CTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGTGCAGAAGGCCCGACTGCGGCTTCAGCTTCAGCCCTG- G ACCCGTGATCCTGCTGTGGTGCTGCCTGCTGCTGCCTATCGTGTCCTCTGCCGCCGTGTCTGTGGCCCCTACAG- C CGCCGAGAAGGTGCCAGCCGAGTGCCCCGAGCTGACCAGAAGATGCCTGCTGGGCGAGGTGTTCGAGGGCGACA- A GTACGAGAGCTGGCTGCGGCCCCTGGTCAACGTGACCGGCAGAGATGGCCCCCTGAGCCAGCTGATCCGGTACA- G ACCCGTGACCCCCGAGGCCGCCAATAGCGTGCTGCTGGACGAGGCCTTCCTGGATACCCTGGCCCTGCTGTACA- A CAACCCCGACCAGCTGAGAGCCCTGCTGACCCTGCTGTCCAGCGACACCGCCCCCAGATGGATGACCGTGATGC- G GGGCTACAGCGAGTGTGGAGATGGCAGCCCTGCCGTGTACACCTGCGTGGACGACCTGTGCAGAGGCTACGACC- T GACCAGACTGAGCTACGGCCGGTCCATCTTCACAGAGCACGTGCTGGGCTTCGAGCTGGTGCCCCCCAGCCTGT- T CAACGTGGTGGTGGCCATCCGGAACGAGGCCACCAGAACCAACAGAGCCGTGCGGCTGCCTGTGTCTACAGCCG- C TGCACCTGAGGGCATCACACTGTTCTACGGCCTGTACAACGCCGTGAAAGAGTTCTGCCTCCGGCACCAGCTGG- A TCCCCCCCTGCTGAGACACCTGGACAAGTACTACGCCGGCCTGCCCCCAGAGCTGAAGCAGACCAGAGTGAACC- T GCCCGCCCACAGCAGATATGGCCCTCAGGCCGTGGACGCCAGATGATAACGCCGGCGGCCCCTATAACTCTCTA- C GGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGCCCCAAGGACCTGACCCCCTTCCTGACA- A 
CCCTGTGGCTGCTCCTGGGCCATAGCAGAGTGCCTAGAGTGCGGGCCGAGGAATGCTGCGAGTTCATCAACGTG- A ACCACCCCCCCGAGCGGTGCTACGACTTCAAGATGTGCAACCGGTTCACCGTGGCCCTGAGATGCCCCGACGGC- G AAGTGTGCTACAGCCCCGAGAAAACCGCCGAGATCCGGGGCATCGTGACCACCATGACCCACAGCCTGACCCGG- C AGGTGGTGCACAACAAGCTGACCAGCTGCAACTACAACCCCCTGTACCTGGAAGCCGACGGCCGGATCAGATGC- G GCAAAGTGAACGACAAGGCCCAGTACCTGCTGGGAGCCGCCGGAAGCGTGCCCTACCGGTGGATCAACCTGGAA- T ACGACAAGATCACCCGGATCGTGGGCCTGGACCAGTACCTGGAAAGCGTGAAGAAGCACAAGCGGCTGGACGTG- T GCAGAGCCAAGATGGGCTACATGCTGCAGTGATAAGGCGCGCCAACGTTACTGGCCGAAGCCGCTTGGAATAAG- G CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTG- G CCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCG- T GAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACC- C CCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCC- A GTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA- A GGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGT- C GAGGTTAAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATATGCTG- C GGCTGCTGCTGAGACACCACTTCCACTGCCTGCTGCTGTGTGCCGTGTGGGCCACCCCTTGTCTGGCCAGCCCT- T GGAGCACCCTGACCGCCAACCAGAACCCTAGCCCCCCTTGGTCCAAGCTGACCTACAGCAAGCCCCACGACGCC- G CCACCTTCTACTGCCCCTTTCTGTACCCCAGCCCTCCCAGAAGCCCCCTGCAGTTCAGCGGCTTCCAGAGAGTG- T CCACCGGCCCTGAGTGCCGGAACGAGACACTGTACCTGCTGTACAACCGGGAGGGCCAGACACTGGTGGAGCGG- A GCAGCACCTGGGTGAAAAAAGTGATCTGGTATCTGAGCGGCCGGAACCAGACCATCCTGCAGCGGATGCCCAGA- A CCGCCAGCAAGCCCAGCGACGGCAACGTGCAGATCAGCGTGGAGGACGCCAAAATCTTCGGAGCCCACATGGTG- C CCAAGCAGACCAAGCTGCTGAGATTCGTGGTCAACGACGGCACCAGATATCAGATGTGCGTGATGAAGCTGGAA- A GCTGGGCCCACGTGTTCCGGGACTACTCCGTGAGCTTCCAGGTCCGGCTGACCTTCACCGAGGCCAACAACCAG- A CCTACACCTTCTGCACCCACCCCAACCTGATCGTGTGATAAGTACCTTTGTACGCCTGTTTTATACCCCCTCCC- T GATTTGCAACTTAGAAGCAACGCAAACCAGATCAATAGTAGGTGTGACATACCAGTCGCATCTTGATCAAGCAC- T TCTGTATCCCCGGACCGAGTATCAATAGACTGTGCACACGGTTGAAGGAGAAAACGTCCGTTACCCGGCTAACT- A 
CTTCGAGAAGCCTAGTAACGCCATTGAAGTTGCAGAGTGTTTCGCTCAGCACTCCCCCCGTGTAGATCAGGTCG- A TGAGTCACCGCATTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCTATGGGGTAACCCATAGG- A CGCTCTAATACGGACATGGCGTGAAGAGTCTATTGAGCTAGTTAGTAGTCCTCCGGCCCCTGAATGCGGCTAAT- C CTAACTGCGGAGCACATACCCTTAATCCAAAGGGCAGTGTGTCGTAACGGGCAACTCTGCAGCGGAACCGACTA- C TTTGGGTGTCCGTGTTTCTTTTTATTCTTGTATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAG- C TATTGGATTGGCCATCCAGTGTCAAACAGAGCTATTGTATATCTCTTTGTTGGATTCACACCTCTCACTCTTGA- A ACGTTACACACCCTCAATTACATTATACTGCTGAACACGAAGCGCATATGCGGCTGTGCAGAGTGTGGCTGTCC- G TGTGCCTGTGTGCCGTGGTGCTGGGCCAGTGCCAGAGAGAGACAGCCGAGAAGAACGACTACTACCGGGTGCCC- C ACTACTGGGATGCCTGCAGCAGAGCCCTGCCCGACCAGACCCGGTACAAATACGTGGAGCAGCTCGTGGACCTG- A CCCTGAACTACCACTACGACGCCAGCCACGGCCTGGACAACTTCGACGTGCTGAAGCGGATCAACGTGACCGAG- G TGTCCCTGCTGATCAGCGACTTCCGGCGGCAGAACAGAAGAGGCGGCACCAACAAGCGGACCACCTTCAACGCC- G CTGGCTCTCTGGCCCCTCACGCCAGATCCCTGGAATTCAGCGTGCGGCTGTTCGCCAACTGATAACGTTGCATC- C TGCAGGATACAGCAGCAATTGGCAAGCTGCTTACATAGAACTCGCGGCGATTGGCATGCCGCCTTAAAATTTTT- A TTTTATTTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATATTTCAAAAAAAAAAAAAAAAAAAAAAAAAA- A AAAAAAAAGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGAAGGAGGACGCACGTCCA- C TCGGATGGCTAAGGGAGAGCCACGTTTAAACGCTAGAGCAAGACGTTTCCCGTTGAATATGGCTCATAACACCC- C TTGTATTACTGTTTATGTAAGCAGACAGTTTTATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACAT- C AGAGATTTTGAGACACAACGTGGCTTTGTTGAATAAATCGAACTTTTGCTGAGTTGAAGGATCAGATCACGCAT- C TTCCCGACAACGCAGACCGTTCCGTGGCAAAGCAAAAGTTCAAAATCACCAACTGGTCCACCTACAACAAAGCT- C TCATCAACCGTGGCTCCCTCACTTTCTGGCTGGATGATGGGGCGATTCAGGCCTGGTATGAGTCAGCAACACCT- T CTTCACGAGGCAGACCTCAGCGCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTG- A AGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGC- T TCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGA- T TTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTC- C GCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATAC- C 
AGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGT-
T ATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATG- C ACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACAT- G CAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGG- C TAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCA- G AGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACG- A TCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCAT- G AGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATA- T GAGTAAACTTGGTCTGACAGTTATTAGAAAAATTCATCCAGCAGACGATAAAACGCAATACGCTGGCTATCCGG- T GCCGCAATGCCATACAGCACCAGAAAACGATCCGCCCATTCGCCGCCCAGTTCTTCCGCAATATCACGGGTGGC- C AGCGCAATATCCTGATAACGATCCGCCACGCCCAGACGGCCGCAATCAATAAAGCCGCTAAAACGGCCATTTTC- C ACCATAATGTTCGGCAGGCACGCATCACCATGGGTCACCACCAGATCTTCGCCATCCGGCATGCTCGCTTTCAG- A CGCGCAAACAGCTCTGCCGGTGCCAGGCCCTGATGTTCTTCATCCAGATCATCCTGATCCACCAGGCCCGCTTC- C ATACGGGTACGCGCACGTTCAATACGATGTTTCGCCTGATGATCAAACGGACAGGTCGCCGGGTCCAGGGTATG- C AGACGACGCATGGCATCCGCCATAATGCTCACTTTTTCTGCCGGCGCCAGATGGCTAGACAGCAGATCCTGACC- C GGCACTTCGCCCAGCAGCAGCCAATCACGGCCCGCTTCGGTCACCACATCCAGCACCGCCGCACACGGAACACC- G GTGGTGGCCAGCCAGCTCAGACGCGCCGCTTCATCCTGCAGCTCGTTCAGCGCACCGCTCAGATCGGTTTTCAC- A AACAGCACCGGACGACCCTGCGCGCTCAGACGAAACACCGCCGCATCAGAGCAGCCAATGGTCTGCTGCGCCCA- A TCATAGCCAAACAGACGTTCCACCCACGCTGCCGGGCTACCCGCATGCAGGCCATCCTGTTCAATCATACTCTT- C CTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAA- A AATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTTA- A AATTCGCGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAA- T CAAAAGAATAGACCGAGATAGGGTTGAGTGGCCGCTACAGGGCGCTCCCATTCGCCATTCAGGCTGCGCAACTG- T TGGGAAGGGCGTTTCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGAT- T AAGTTGGGTAACGCCAGGGTTTTCCCAGTCACACGCGTAATACGACTCACTATAG A531 Vector: SGP-gHso1-SGP-gL (SEQ ID NO: 58): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G 
ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTCGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A 
GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G 
ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G 
CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGGCCTGGCC-
T GCCCTCCTACCTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCG- T GAGCGAGCCCCTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGA- A CACCACCCAGTGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACT- T TTTCCAGAGCTACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGT- T CCTGAACCAGGTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCA- A GGACCTGGCCAGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCG- T GCCCCCTCCCATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGA- G CCACACCACCTCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGT- T TAGCACCGTGACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGA- C CGAGGATTTCTTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCA- G AGTGCTGTTCAAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGC- T GGTCAAGAAGGACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCA- A CTACCTGGACCTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGT- G CCAGATGCTCGATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGG- A AGAGGCTGGCGCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCA- T GATCACCTGCCTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGA- G GGCCCTGTGGACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGA- A CCAGCAGCACCTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATC- T GGCCAGCTTTCTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGC- A TACCACCGAGCGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCC- A GCTGCTGGCCCACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACC- A CAGCCTGGAACGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCC- T GTCCACCATGCAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCG- C CCTGACCGTGTCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGT- C CACCACAGTCGTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACA- T 
GCACACCACACACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGC- T GGAATACGACGATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGG- A CCCCTACAACGAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGG- A AGTGACCGACGTGGTGGTGGACGCCACCGACTGATAATCTAGAGGCCCCTATAACTCTCTACGGCTAACCTGAA- T GGACTACGACATAGTCTAGTCCGCCAAGATGTGCAGAAGGCCCGACTGCGGCTTCAGCTTCAGCCCTGGACCCG- T GATCCTGCTGTGGTGCTGCCTGCTGCTGCCTATCGTGTCCTCTGCCGCCGTGTCTGTGGCCCCTACAGCCGCCG- A GAAGGTGCCAGCCGAGTGCCCCGAGCTGACCAGAAGATGCCTGCTGGGCGAGGTGTTCGAGGGCGACAAGTACG- A GAGCTGGCTGCGGCCCCTGGTCAACGTGACCGGCAGAGATGGCCCCCTGAGCCAGCTGATCCGGTACAGACCCG- T GACCCCCGAGGCCGCCAATAGCGTGCTGCTGGACGAGGCCTTCCTGGATACCCTGGCCCTGCTGTACAACAACC- C CGACCAGCTGAGAGCCCTGCTGACCCTGCTGTCCAGCGACACCGCCCCCAGATGGATGACCGTGATGCGGGGCT- A CAGCGAGTGTGGAGATGGCAGCCCTGCCGTGTACACCTGCGTGGACGACCTGTGCAGAGGCTACGACCTGACCA- G ACTGAGCTACGGCCGGTCCATCTTCACAGAGCACGTGCTGGGCTTCGAGCTGGTGCCCCCCAGCCTGTTCAACG- T GGTGGTGGCCATCCGGAACGAGGCCACCAGAACCAACAGAGCCGTGCGGCTGCCTGTGTCTACAGCCGCTGCAC- C TGAGGGCATCACACTGTTCTACGGCCTGTACAACGCCGTGAAAGAGTTCTGCCTCCGGCACCAGCTGGATCCCC- C CCTGCTGAGACACCTGGACAAGTACTACGCCGGCCTGCCCCCAGAGCTGAAGCAGACCAGAGTGAACCTGCCCG- C CCACAGCAGATATGGCCCTCAGGCCGTGGACGCCAGATGATAAGCGGCCGCATACAGCAGCAATTGGCAAGCTG- C TTACATAGAACTCGCGGCGATTGGCATGCCGCCTTAAAATTTTTATTTTATTTTTCTTTTCTTTTCCGAATCGG- A TTTTGTTTTTAATATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGTCGGCATGGCATCTCCAC- C TCCTCGCGGTCCGACCTGGGCATCCGAAGGAGGACGCACGTCCACTCGGATGGCTAAGGGAGAGCCACGTTTAA- A CACGTGATATCTGGCCTCATGGGCCTTCCTTTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGC- A TTAACATGGTCATAGCTGTTTCCTTGCGTATTGGGCGCTCTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGG- T CGTTCGGGTAAAGCCTGGGGTGCCTAATGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTT- G CTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAA- C CCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCC- G CTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCT- C 
AGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTT- A TCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAG- G ATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAG- A ACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAA- A CAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGA- A GATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAG- A TTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGA- G TAAACTTGGTCTGACAGTTATTAGAAAAATTCATCCAGCAGACGATAAAACGCAATACGCTGGCTATCCGGTGC- C GCAATGCCATACAGCACCAGAAAACGATCCGCCCATTCGCCGCCCAGTTCTTCCGCAATATCACGGGTGGCCAG- C GCAATATCCTGATAACGATCCGCCACGCCCAGACGGCCGCAATCAATAAAGCCGCTAAAACGGCCATTTTCCAC- C ATAATGTTCGGCAGGCACGCATCACCATGGGTCACCACCAGATCTTCGCCATCCGGCATGCTCGCTTTCAGACG- C GCAAACAGCTCTGCCGGTGCCAGGCCCTGATGTTCTTCATCCAGATCATCCTGATCCACCAGGCCCGCTTCCAT- A CGGGTACGCGCACGTTCAATACGATGTTTCGCCTGATGATCAAACGGACAGGTCGCCGGGTCCAGGGTATGCAG- A CGACGCATGGCATCCGCCATAATGCTCACTTTTTCTGCCGGCGCCAGATGGCTAGACAGCAGATCCTGACCCGG- C ACTTCGCCCAGCAGCAGCCAATCACGGCCCGCTTCGGTCACCACATCCAGCACCGCCGCACACGGAACACCGGT- G GTGGCCAGCCAGCTCAGACGCGCCGCTTCATCCTGCAGCTCGTTCAGCGCACCGCTCAGATCGGTTTTCACAAA- C AGCACCGGACGACCCTGCGCGCTCAGACGAAACACCGCCGCATCAGAGCAGCCAATGGTCTGCTGCGCCCAATC- A TAGCCAAACAGACGTTCCACCCACGCTGCCGGGCTACCCGCATGCAGGCCATCCTGTTCAATCATACTCTTCCT- T TTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAA- T AAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTTAAAA- T TCGCGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCA- A AAGAATAGACCGAGATAGGGTTGAGTGGCCGCTACAGGGCGCTCCCATTCGCCATTCAGGCTGCGCAACTGTTG- G GAAGGGCGTTTCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAA- G TTGGGTAACGCCAGGGTTTTCCCAGTCACACGCGTAATACGACTCACTATAG A532 Vector: SGP-gHso1-2A-gL (SEQ ID NO: 59): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G 
ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTCGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A 
GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT-
G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G 
CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGGCCTGGCC- T 
GCCCTCCTACCTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCG- T GAGCGAGCCCCTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGA- A CACCACCCAGTGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACT- T TTTCCAGAGCTACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGT- T CCTGAACCAGGTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCA- A GGACCTGGCCAGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCG- T GCCCCCTCCCATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGA- G CCACACCACCTCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGT- T TAGCACCGTGACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGA- C CGAGGATTTCTTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCA- G AGTGCTGTTCAAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGC- T GGTCAAGAAGGACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCA- A CTACCTGGACCTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGT- G CCAGATGCTCGATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGG- A AGAGGCTGGCGCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCA- T GATCACCTGCCTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGA- G GGCCCTGTGGACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGA- A CCAGCAGCACCTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATC- T GGCCAGCTTTCTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGC- A TACCACCGAGCGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCC- A GCTGCTGGCCCACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACC- A CAGCCTGGAACGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCC- T GTCCACCATGCAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCG- C CCTGACCGTGTCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGT- C CACCACAGTCGTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACA- T 
GCACACCACACACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGC- T GGAATACGACGATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGG- A CCCCTACAACGAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGG- A AGTGACCGACGTGGTGGTGGACGCCACCGACCTGTTGAATTTTGACCTTCTTAAGCTTGCGGGAGACGTCGAGT- C CAACCCCGGGCCCATGTGCAGAAGGCCCGACTGCGGCTTCAGCTTCAGCCCTGGACCCGTGATCCTGCTGTGGT- G CTGCCTGCTGCTGCCTATCGTGTCCTCTGCCGCCGTGTCTGTGGCCCCTACAGCCGCCGAGAAGGTGCCAGCCG- A GTGCCCCGAGCTGACCAGAAGATGCCTGCTGGGCGAGGTGTTCGAGGGCGACAAGTACGAGAGCTGGCTGCGGC- C CCTGGTCAACGTGACCGGCAGAGATGGCCCCCTGAGCCAGCTGATCCGGTACAGACCCGTGACCCCCGAGGCCG- C CAATAGCGTGCTGCTGGACGAGGCCTTCCTGGATACCCTGGCCCTGCTGTACAACAACCCCGACCAGCTGAGAG- C CCTGCTGACCCTGCTGTCCAGCGACACCGCCCCCAGATGGATGACCGTGATGCGGGGCTACAGCGAGTGTGGAG- A TGGCAGCCCTGCCGTGTACACCTGCGTGGACGACCTGTGCAGAGGCTACGACCTGACCAGACTGAGCTACGGCC- G GTCCATCTTCACAGAGCACGTGCTGGGCTTCGAGCTGGTGCCCCCCAGCCTGTTCAACGTGGTGGTGGCCATCC- G GAACGAGGCCACCAGAACCAACAGAGCCGTGCGGCTGCCTGTGTCTACAGCCGCTGCACCTGAGGGCATCACAC- T GTTCTACGGCCTGTACAACGCCGTGAAAGAGTTCTGCCTCCGGCACCAGCTGGATCCCCCCCTGCTGAGACACC- T GGACAAGTACTACGCCGGCCTGCCCCCAGAGCTGAAGCAGACCAGAGTGAACCTGCCCGCCCACAGCAGATATG- G CCCTCAGGCCGTGGACGCCAGATGATAAGCGGCCGCATACAGCAGCAATTGGCAAGCTGCTTACATAGAACTCG- C GGCGATTGGCATGCCGCCTTAAAATTTTTATTTTATTTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATA- T TTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGA- C CTGGGCATCCGAAGGAGGACGCACGTCCACTCGGATGGCTAAGGGAGAGCCACGTTTAAACACGTGATATCTGG- C CTCATGGGCCTTCCTTTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAACATGGTCATA- G CTGTTTCCTTGCGTATTGGGCGCTCTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGGTAAAGC- C TGGGGTGCCTAATGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCA- T AGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATA- A AGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCT- G TCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGT- C 
GTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCG- T CTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAG- G TATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGAACAGTATTTGGTAT- C TGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGG- T AGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTT- T TCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGAT- C TTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGA- C AGTTATTAGAAAAATTCATCCAGCAGACGATAAAACGCAATACGCTGGCTATCCGGTGCCGCAATGCCATACAG- C ACCAGAAAACGATCCGCCCATTCGCCGCCCAGTTCTTCCGCAATATCACGGGTGGCCAGCGCAATATCCTGATA- A CGATCCGCCACGCCCAGACGGCCGCAATCAATAAAGCCGCTAAAACGGCCATTTTCCACCATAATGTTCGGCAG- G CACGCATCACCATGGGTCACCACCAGATCTTCGCCATCCGGCATGCTCGCTTTCAGACGCGCAAACAGCTCTGC- C GGTGCCAGGCCCTGATGTTCTTCATCCAGATCATCCTGATCCACCAGGCCCGCTTCCATACGGGTACGCGCACG- T TCAATACGATGTTTCGCCTGATGATCAAACGGACAGGTCGCCGGGTCCAGGGTATGCAGACGACGCATGGCATC- C GCCATAATGCTCACTTTTTCTGCCGGCGCCAGATGGCTAGACAGCAGATCCTGACCCGGCACTTCGCCCAGCAG- C AGCCAATCACGGCCCGCTTCGGTCACCACATCCAGCACCGCCGCACACGGAACACCGGTGGTGGCCAGCCAGCT- C AGACGCGCCGCTTCATCCTGCAGCTCGTTCAGCGCACCGCTCAGATCGGTTTTCACAAACAGCACCGGACGACC- C TGCGCGCTCAGACGAAACACCGCCGCATCAGAGCAGCCAATGGTCTGCTGCGCCCAATCATAGCCAAACAGACG- T TCCACCCACGCTGCCGGGCTACCCGCATGCAGGCCATCCTGTTCAATCATACTCTTCCTTTTTCAATATTATTG- A AGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGT- T CCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTT- T GTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCGAG- A TAGGGTTGAGTGGCCGCTACAGGGCGCTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGTTTCGG- T GCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAG- G GTTTTCCCAGTCACACGCGTAATACGACTCACTATAG A533 Vector: SGP-gHso1-EV71-gL (SEQ ID NO: 60): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA-
G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTCGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A 
GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G 
ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G 
CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGGCCTGGCC- T 
GCCCTCCTACCTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCG- T GAGCGAGCCCCTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGA- A CACCACCCAGTGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACT- T TTTCCAGAGCTACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGT- T CCTGAACCAGGTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCA- A GGACCTGGCCAGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCG- T GCCCCCTCCCATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGA- G CCACACCACCTCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGT- T TAGCACCGTGACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGA- C CGAGGATTTCTTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCA- G AGTGCTGTTCAAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGC- T GGTCAAGAAGGACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCA- A CTACCTGGACCTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGT- G CCAGATGCTCGATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGG- A AGAGGCTGGCGCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCA- T GATCACCTGCCTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGA- G GGCCCTGTGGACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGA- A CCAGCAGCACCTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATC- T GGCCAGCTTTCTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGC- A TACCACCGAGCGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCC- A GCTGCTGGCCCACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACC- A CAGCCTGGAACGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCC- T GTCCACCATGCAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCG- C CCTGACCGTGTCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGT- C CACCACAGTCGTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACA- T
GCACACCACACACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGC- T GGAATACGACGATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGG- A CCCCTACAACGAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGG- A AGTGACCGACGTGGTGGTGGACGCCACCGACTGATAATCTAGATTAAAACAGCTGTGGGTTGTTCCCACCCACA- G GGCCCACTGGGCGCTAGCACTCTGATTTTACGAAATCCTTGTGCGCCTGTTTTATATCCCTTCCCTAATTCGAA- A CGTAGAAGCAATGCGCACCACTGATCAATAGTAGGCGTAACGCGCCAGTTACGTCATGATCAAGCATATCTGTT- C CCCCGGACTGAGTATCAATAGACTGCTTACGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGA- G AAGCCCAGTAACACCATGGAAGCTGCAGGGTGTTTCGCTCAGCACTTCCCCCGTGTAGATCAGGTCGATGAGCC- A CTGCAATCCCCACAGGTGACTGTGGCAGTGGCTGCGTTGGCGGCCTGCCTATGGGGAGACCCATAGGACGCTCT- A ATGTGGACATGGTGCGAAGAGCCTATTGAGCTAGTTAGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCTAACT- G CGGAGCACATGCCTTCAACCCAGAGGGTAGTGTGTCGTAATGGGCAACTCTGCAGCGGAACCGACTACTTTGGG- T GTCCGTGTTTCTTTTTATTCTTATATTGGCTGCTTATGGTGACAATTACAGAATTGTTACCATATAGCTATTGG- A TTGGCCATCCGGTGTGTAATAGAGCTGTTATATACCTATTTGTTGGCTTTGTACCACTAACTTTAAAATCTATA- A CTACCCTCAACTTTATATTAACCCTCAATACAGTTGAACATGTGCAGAAGGCCCGACTGCGGCTTCAGCTTCAG- C CCTGGACCCGTGATCCTGCTGTGGTGCTGCCTGCTGCTGCCTATCGTGTCCTCTGCCGCCGTGTCTGTGGCCCC- T ACAGCCGCCGAGAAGGTGCCAGCCGAGTGCCCCGAGCTGACCAGAAGATGCCTGCTGGGCGAGGTGTTCGAGGG- C GACAAGTACGAGAGCTGGCTGCGGCCCCTGGTCAACGTGACCGGCAGAGATGGCCCCCTGAGCCAGCTGATCCG- G TACAGACCCGTGACCCCCGAGGCCGCCAATAGCGTGCTGCTGGACGAGGCCTTCCTGGATACCCTGGCCCTGCT- G TACAACAACCCCGACCAGCTGAGAGCCCTGCTGACCCTGCTGTCCAGCGACACCGCCCCCAGATGGATGACCGT- G ATGCGGGGCTACAGCGAGTGTGGAGATGGCAGCCCTGCCGTGTACACCTGCGTGGACGACCTGTGCAGAGGCTA- C GACCTGACCAGACTGAGCTACGGCCGGTCCATCTTCACAGAGCACGTGCTGGGCTTCGAGCTGGTGCCCCCCAG- C CTGTTCAACGTGGTGGTGGCCATCCGGAACGAGGCCACCAGAACCAACAGAGCCGTGCGGCTGCCTGTGTCTAC- A GCCGCTGCACCTGAGGGCATCACACTGTTCTACGGCCTGTACAACGCCGTGAAAGAGTTCTGCCTCCGGCACCA- G CTGGATCCCCCCCTGCTGAGACACCTGGACAAGTACTACGCCGGCCTGCCCCCAGAGCTGAAGCAGACCAGAGT- G AACCTGCCCGCCCACAGCAGATATGGCCCTCAGGCCGTGGACGCCAGATGATAAGCGGCCGCATACAGCAGCAA- T 
TGGCAAGCTGCTTACATAGAACTCGCGGCGATTGGCATGCCGCCTTAAAATTTTTATTTTATTTTTCTTTTCTT- T TCCGAATCGGATTTTGTTTTTAATATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGTCGGCAT- G GCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGAAGGAGGACGCACGTCCACTCGGATGGCTAAGGGAGA- G CCACGTTTAAACACGTGATATCTGGCCTCATGGGCCTTCCTTTCACTGCCCGCTTTCCAGTCGGGAAACCTGTC- G TGCCAGCTGCATTAACATGGTCATAGCTGTTTCCTTGCGTATTGGGCGCTCTCCGCTTCCTCGCTCACTGACTC- G CTGCGCTCGGTCGTTCGGGTAAAGCCTGGGGTGCCTAATGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAA- A AGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAG- A GGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTT- C CGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGC- T GTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGAC- C GCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCC- A CTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGC- T ACACTAGAAGAACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCT- T GATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAA- G GATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATT- T TGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAA- A GTATATATGAGTAAACTTGGTCTGACAGTTATTAGAAAAATTCATCCAGCAGACGATAAAACGCAATACGCTGG- C TATCCGGTGCCGCAATGCCATACAGCACCAGAAAACGATCCGCCCATTCGCCGCCCAGTTCTTCCGCAATATCA- C GGGTGGCCAGCGCAATATCCTGATAACGATCCGCCACGCCCAGACGGCCGCAATCAATAAAGCCGCTAAAACGG- C CATTTTCCACCATAATGTTCGGCAGGCACGCATCACCATGGGTCACCACCAGATCTTCGCCATCCGGCATGCTC- G CTTTCAGACGCGCAAACAGCTCTGCCGGTGCCAGGCCCTGATGTTCTTCATCCAGATCATCCTGATCCACCAGG- C CCGCTTCCATACGGGTACGCGCACGTTCAATACGATGTTTCGCCTGATGATCAAACGGACAGGTCGCCGGGTCC- A GGGTATGCAGACGACGCATGGCATCCGCCATAATGCTCACTTTTTCTGCCGGCGCCAGATGGCTAGACAGCAGA- T CCTGACCCGGCACTTCGCCCAGCAGCAGCCAATCACGGCCCGCTTCGGTCACCACATCCAGCACCGCCGCACAC- G GAACACCGGTGGTGGCCAGCCAGCTCAGACGCGCCGCTTCATCCTGCAGCTCGTTCAGCGCACCGCTCAGATCG- G 
TTTTCACAAACAGCACCGGACGACCCTGCGCGCTCAGACGAAACACCGCCGCATCAGAGCAGCCAATGGTCTGC- T GCGCCCAATCATAGCCAAACAGACGTTCCACCCACGCTGCCGGGCTACCCGCATGCAGGCCATCCTGTTCAATC- A TACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGT- A TTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATA- T TTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCC- C TTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGGCCGCTACAGGGCGCTCCCATTCGCCATTCAGGCTG- C GCAACTGTTGGGAAGGGCGTTTCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGC- A AGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACACGCGTAATACGACTCACTATAG A534 Vector: SGP-gL-EV71-gH (SEQ ID NO: 61): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C 
GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTCGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G 
TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G
AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T 
TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGTGCAGAAGGC- C CGACTGCGGCTTCAGCTTCAGCCCTGGACCCGTGATCCTGCTGTGGTGCTGCCTGCTGCTGCCTATCGTGTCCT- C TGCCGCCGTGTCTGTGGCCCCTACAGCCGCCGAGAAGGTGCCAGCCGAGTGCCCCGAGCTGACCAGAAGATGCC- T GCTGGGCGAGGTGTTCGAGGGCGACAAGTACGAGAGCTGGCTGCGGCCCCTGGTCAACGTGACCGGCAGAGATG- G CCCCCTGAGCCAGCTGATCCGGTACAGACCCGTGACCCCCGAGGCCGCCAATAGCGTGCTGCTGGACGAGGCCT- T CCTGGATACCCTGGCCCTGCTGTACAACAACCCCGACCAGCTGAGAGCCCTGCTGACCCTGCTGTCCAGCGACA- C CGCCCCCAGATGGATGACCGTGATGCGGGGCTACAGCGAGTGTGGAGATGGCAGCCCTGCCGTGTACACCTGCG- T GGACGACCTGTGCAGAGGCTACGACCTGACCAGACTGAGCTACGGCCGGTCCATCTTCACAGAGCACGTGCTGG- G CTTCGAGCTGGTGCCCCCCAGCCTGTTCAACGTGGTGGTGGCCATCCGGAACGAGGCCACCAGAACCAACAGAG- C CGTGCGGCTGCCTGTGTCTACAGCCGCTGCACCTGAGGGCATCACACTGTTCTACGGCCTGTACAACGCCGTGA- A AGAGTTCTGCCTCCGGCACCAGCTGGATCCCCCCCTGCTGAGACACCTGGACAAGTACTACGCCGGCCTGCCCC- C AGAGCTGAAGCAGACCAGAGTGAACCTGCCCGCCCACAGCAGATATGGCCCTCAGGCCGTGGACGCCAGATGAT- A ATCTAGATTAAAACAGCTGTGGGTTGTTCCCACCCACAGGGCCCACTGGGCGCTAGCACTCTGATTTTACGAAA- T CCTTGTGCGCCTGTTTTATATCCCTTCCCTAATTCGAAACGTAGAAGCAATGCGCACCACTGATCAATAGTAGG- C GTAACGCGCCAGTTACGTCATGATCAAGCATATCTGTTCCCCCGGACTGAGTATCAATAGACTGCTTACGCGGT- T GAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGAGAAGCCCAGTAACACCATGGAAGCTGCAGGGTGTTT- C GCTCAGCACTTCCCCCGTGTAGATCAGGTCGATGAGCCACTGCAATCCCCACAGGTGACTGTGGCAGTGGCTGC- G 
TTGGCGGCCTGCCTATGGGGAGACCCATAGGACGCTCTAATGTGGACATGGTGCGAAGAGCCTATTGAGCTAGT- T AGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCTAACTGCGGAGCACATGCCTTCAACCCAGAGGGTAGTGTGT- C GTAATGGGCAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCTTTTTATTCTTATATTGGCTGCTT- A TGGTGACAATTACAGAATTGTTACCATATAGCTATTGGATTGGCCATCCGGTGTGTAATAGAGCTGTTATATAC- C TATTTGTTGGCTTTGTACCACTAACTTTAAAATCTATAACTACCCTCAACTTTATATTAACCCTCAATACAGTT- G AACATGAGGCCTGGCCTGCCCTCCTACCTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAG- A TACGGCGCCGAGGCCGTGAGCGAGCCCCTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCAT- C CGGTTTCTGCGGGAGAACACCACCCAGTGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAA- C GCCATCAGCTTCAACTTTTTCCAGAGCTACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGG- C CCTCTGGCCGAGCAGTTCCTGAACCAGGTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATAC- C TACGCCCTGGTGTCCAAGGACCTGGCCAGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGG- C GAGCAGCCTACCACCGTGCCCCCTCCCATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCC- T CACGGCTGGACCGAGAGCCACACCACCTCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGA- C GGCCACGACCTGCTGTTTAGCACCGTGACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATA- C GTGAAGATCACCCTGACCGAGGATTTCTTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGAT- C TTCGGCCACCTGCCCAGAGTGCTGTTCAAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAA- G CACGAGCTGCTGGTGCTGGTCAAGAAGGACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGA- C GCCGCCCTGGACTTCAACTACCTGGACCTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGT- G CTGAAGTCCGGACGGTGCCAGATGCTCGATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTT- C GCCGCTGCCAGACAGGAAGAGGCTGGCGCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCT- G CAGATCCAGGAATTCATGATCACCTGCCTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGC- C GTGGATCTGGCCAAGAGGGCCCTGTGGACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTA- C ATCCTGAGCAAGCAGAACCAGCAGCACCTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAA- G CTGCACAAGACCCATCTGGCCAGCTTTCTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGT- C CACAGCATGCTGGTGCATACCACCGAGCGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGA- G 
CTGTCCCACTTTACCCAGCTGCTGGCCCACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAG- C AGCGGCAGACGGGACCACAGCCTGGAACGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCC- T GCCGCCCTGTCCATCCTGTCCACCATGCAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCT- G GGCGAGAGCTTTAGCGCCCTGACCGTGTCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGG- C ATCAGCTACCCCGTGTCCACCACAGTCGTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTG- C GAGCTGACCCGGAACATGCACACCACACACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTT- C TGTCAGTCTGCCCTGCTGGAATACGACGATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGA- C GTGCTGTTCGCCCTGGACCCCTACAACGAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAA- G AACGGCACCGTGCTGGAAGTGACCGACGTGGTGGTGGACGCCACCGACTGATAAGCGGCCGCATACAGCAGCAA- T TGGCAAGCTGCTTACATAGAACTCGCGGCGATTGGCATGCCGCCTTAAAATTTTTATTTTATTTTTCTTTTCTT- T TCCGAATCGGATTTTGTTTTTAATATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGTCGGCAT- G GCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGAAGGAGGACGCACGTCCACTCGGATGGCTAAGGGAGA- G CCACGTTTAAACACGTGATATCTGGCCTCATGGGCCTTCCTTTCACTGCCCGCTTTCCAGTCGGGAAACCTGTC- G TGCCAGCTGCATTAACATGGTCATAGCTGTTTCCTTGCGTATTGGGCGCTCTCCGCTTCCTCGCTCACTGACTC- G CTGCGCTCGGTCGTTCGGGTAAAGCCTGGGGTGCCTAATGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAA- A AGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAG- A GGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTT- C CGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGC- T GTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGAC- C GCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCC- A CTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGC- T ACACTAGAAGAACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCT- T GATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAA- G GATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATT- T TGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAA- A 
GTATATATGAGTAAACTTGGTCTGACAGTTATTAGAAAAATTCATCCAGCAGACGATAAAACGCAATACGCTGG- C TATCCGGTGCCGCAATGCCATACAGCACCAGAAAACGATCCGCCCATTCGCCGCCCAGTTCTTCCGCAATATCA- C GGGTGGCCAGCGCAATATCCTGATAACGATCCGCCACGCCCAGACGGCCGCAATCAATAAAGCCGCTAAAACGG- C CATTTTCCACCATAATGTTCGGCAGGCACGCATCACCATGGGTCACCACCAGATCTTCGCCATCCGGCATGCTC- G CTTTCAGACGCGCAAACAGCTCTGCCGGTGCCAGGCCCTGATGTTCTTCATCCAGATCATCCTGATCCACCAGG- C CCGCTTCCATACGGGTACGCGCACGTTCAATACGATGTTTCGCCTGATGATCAAACGGACAGGTCGCCGGGTCC- A GGGTATGCAGACGACGCATGGCATCCGCCATAATGCTCACTTTTTCTGCCGGCGCCAGATGGCTAGACAGCAGA- T CCTGACCCGGCACTTCGCCCAGCAGCAGCCAATCACGGCCCGCTTCGGTCACCACATCCAGCACCGCCGCACAC- G GAACACCGGTGGTGGCCAGCCAGCTCAGACGCGCCGCTTCATCCTGCAGCTCGTTCAGCGCACCGCTCAGATCG- G TTTTCACAAACAGCACCGGACGACCCTGCGCGCTCAGACGAAACACCGCCGCATCAGAGCAGCCAATGGTCTGC- T GCGCCCAATCATAGCCAAACAGACGTTCCACCCACGCTGCCGGGCTACCCGCATGCAGGCCATCCTGTTCAATC- A TACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGT- A TTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATA- T TTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCC- C TTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGGCCGCTACAGGGCGCTCCCATTCGCCATTCAGGCTG- C GCAACTGTTGGGAAGGGCGTTTCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGC- A AGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACACGCGTAATACGACTCACTATAG A535 Vector: SGP-342-EV71-gHso1-2A-gL (SEQ ID NO: 62): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A
AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTCGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C 
TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C 
CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C 
ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGCTATTCCAGAAGT- A GTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGTATATCCATTTTCGGATCTGATCAAGAGAC- A GGATGAGGATCGTTTCGCATGATTGAATAAGATGGATTGCACGTAGGTTCTCCGGCCGCTTGGGTGGAGAGGCT- A TTCGGCTATGACTGGGCACAACTGACAATCGGCTGCTCTGATGCCGCCGTGATCCGGTTGTCAGCGCAGGGGCG- C CCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGAAGGACGAGGCAGCGCGGCTATCGTG- G CTGGCCACGACGGGCGTTCCTTGCGCAGTCTAGACTGGCGCGCCAAACCTGCAGGTTAAAACAGCTGTGGGTTG- T TCCCACCCACAGGGCCCACTGGGCGCTAGCACTCTGATTTTACGAAATCCTTGTGCGCCTGTTTTATATCCCTT- C CCTAATTCGAAACGTAGAAGCAATGCGCACCACTGATCAATAGTAGGCGTAACGCGCCAGTTACGTCATGATCA- A 
GCATATCTGTTCCCCCGGACTGAGTATCAATAGACTGCTTACGCGGTTGAAGGAGAAAACGTTCGTTATCCGGC- T AACTACTTCGAGAAGCCCAGTAACACCATGGAAGCTGCAGGGTGTTTCGCTCAGCACTTCCCCCGTGTAGATCA- G GTCGATGAGCCACTGCAATCCCCACAGGTGACTGTGGCAGTGGCTGCGTTGGCGGCCTGCCTATGGGGAGACCC- A TAGGACGCTCTAATGTGGACATGGTGCGAAGAGCCTATTGAGCTAGTTAGTAGTCCTCCGGCCCCTGAATGCGG- C TAATCCTAACTGCGGAGCACATGCCTTCAACCCAGAGGGTAGTGTGTCGTAATGGGCAACTCTGCAGCGGAACC- G ACTACTTTGGGTGTCCGTGTTTCTTTTTATTCTTATATTGGCTGCTTATGGTGACAATTACAGAATTGTTACCA- T ATAGCTATTGGATTGGCCATCCGGTGTGTAATAGAGCTGTTATATACCTATTTGTTGGCTTTGTACCACTAACT- T TAAAATCTATAACTACCCTCAACTTTATATTAACCCTCAATACAGTTGAACATGAGGCCTGGCCTGCCCTCCTA- C CTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCGTGAGCGAGCC- C CTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGAACACCACCCA- G TGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACTTTTTCCAGAG- C TACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGTTCCTGAACCA- G GTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCAAGGACCTGGC- C AGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCGTGCCCCCTCC- C ATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGAGCCACACCAC- C TCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGTTTAGCACCGT- G ACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGACCGAGGATTT- C TTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCAGAGTGCTGTT- C AAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGCTGGTCAAGAA- G GACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCAACTACCTGGA- C CTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGTGCCAGATGCT- C GATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGGAAGAGGCTGG- C GCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCATGATCACCTG- C CTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGAGGGCCCTGTG- G ACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGAACCAGCAGCA- C 
CTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATCTGGCCAGCTT-
T CTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGCATACCACCGA- G CGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCCAGCTGCTGGC- C CACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACCACAGCCTGGA- A CGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCCTGTCCACCAT- G CAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCGCCCTGACCGT- G TCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGTCCACCACAGT- C GTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACATGCACACCAC- A CACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGCTGGAATACGA- C GATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGGACCCCTACAA- C GAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGGAAGTGACCGA- C GTGGTGGTGGACGCCACCGACCTGTTGAATTTTGACCTTCTTAAGCTTGCGGGAGACGTCGAGTCCAACCCCGG- G CCCATGTGCAGAAGGCCCGACTGCGGCTTCAGCTTCAGCCCTGGACCCGTGATCCTGCTGTGGTGCTGCCTGCT- G CTGCCTATCGTGTCCTCTGCCGCCGTGTCTGTGGCCCCTACAGCCGCCGAGAAGGTGCCAGCCGAGTGCCCCGA- G CTGACCAGAAGATGCCTGCTGGGCGAGGTGTTCGAGGGCGACAAGTACGAGAGCTGGCTGCGGCCCCTGGTCAA- C GTGACCGGCAGAGATGGCCCCCTGAGCCAGCTGATCCGGTACAGACCCGTGACCCCCGAGGCCGCCAATAGCGT- G CTGCTGGACGAGGCCTTCCTGGATACCCTGGCCCTGCTGTACAACAACCCCGACCAGCTGAGAGCCCTGCTGAC- C CTGCTGTCCAGCGACACCGCCCCCAGATGGATGACCGTGATGCGGGGCTACAGCGAGTGTGGAGATGGCAGCCC- T GCCGTGTACACCTGCGTGGACGACCTGTGCAGAGGCTACGACCTGACCAGACTGAGCTACGGCCGGTCCATCTT- C ACAGAGCACGTGCTGGGCTTCGAGCTGGTGCCCCCCAGCCTGTTCAACGTGGTGGTGGCCATCCGGAACGAGGC- C ACCAGAACCAACAGAGCCGTGCGGCTGCCTGTGTCTACAGCCGCTGCACCTGAGGGCATCACACTGTTCTACGG- C CTGTACAACGCCGTGAAAGAGTTCTGCCTCCGGCACCAGCTGGATCCCCCCCTGCTGAGACACCTGGACAAGTA- C TACGCCGGCCTGCCCCCAGAGCTGAAGCAGACCAGAGTGAACCTGCCCGCCCACAGCAGATATGGCCCTCAGGC- C GTGGACGCCAGATGATAAGCGGCCGCATACAGCAGCAATTGGCAAGCTGCTTACATAGAACTCGCGGCGATTGG- C ATGCCGCCTTAAAATTTTTATTTTATTTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATATTTCAAAAAA- A AAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATC- C 
GAAGGAGGACGCACGTCCACTCGGATGGCTAAGGGAGAGCCACGTTTAAACACGTGATATCTGGCCTCATGGGC- C TTCCTTTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAACATGGTCATAGCTGTTTCCT- T GCGTATTGGGCGCTCTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGGTAAAGCCTGGGGTGCC- T AATGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGC- C CCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAG- G CGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTT- C TCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCC- A AGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCC- A ACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGC- G GTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGAACAGTATTTGGTATCTGCGCTCTG- C TGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGT- T TTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGG- T CTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAG- A TCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTATTAG- A AAAATTCATCCAGCAGACGATAAAACGCAATACGCTGGCTATCCGGTGCCGCAATGCCATACAGCACCAGAAAA- C GATCCGCCCATTCGCCGCCCAGTTCTTCCGCAATATCACGGGTGGCCAGCGCAATATCCTGATAACGATCCGCC- A CGCCCAGACGGCCGCAATCAATAAAGCCGCTAAAACGGCCATTTTCCACCATAATGTTCGGCAGGCACGCATCA- C CATGGGTCACCACCAGATCTTCGCCATCCGGCATGCTCGCTTTCAGACGCGCAAACAGCTCTGCCGGTGCCAGG- C CCTGATGTTCTTCATCCAGATCATCCTGATCCACCAGGCCCGCTTCCATACGGGTACGCGCACGTTCAATACGA- T GTTTCGCCTGATGATCAAACGGACAGGTCGCCGGGTCCAGGGTATGCAGACGACGCATGGCATCCGCCATAATG- C TCACTTTTTCTGCCGGCGCCAGATGGCTAGACAGCAGATCCTGACCCGGCACTTCGCCCAGCAGCAGCCAATCA- C GGCCCGCTTCGGTCACCACATCCAGCACCGCCGCACACGGAACACCGGTGGTGGCCAGCCAGCTCAGACGCGCC- G CTTCATCCTGCAGCTCGTTCAGCGCACCGCTCAGATCGGTTTTCACAAACAGCACCGGACGACCCTGCGCGCTC- A GACGAAACACCGCCGCATCAGAGCAGCCAATGGTCTGCTGCGCCCAATCATAGCCAAACAGACGTTCCACCCAC- G CTGCCGGGCTACCCGCATGCAGGCCATCCTGTTCAATCATACTCTTCCTTTTTCAATATTATTGAAGCATTTAT- C 
AGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACA- T TTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCA- G CTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGA- G TGGCCGCTACAGGGCGCTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGTTTCGGTGCGGGCCTC- T TCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCA- G TCACACGCGTAATACGACTCACTATAG A536 Vector: SGP-342-EV71-gHso1-EMCV-gL (SEQ ID NO: 63): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G 
AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTCGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C 
CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC-
A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T 
TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGCTATTCCAGAAGT- A GTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGTATATCCATTTTCGGATCTGATCAAGAGAC- A GGATGAGGATCGTTTCGCATGATTGAATAAGATGGATTGCACGTAGGTTCTCCGGCCGCTTGGGTGGAGAGGCT- A TTCGGCTATGACTGGGCACAACTGACAATCGGCTGCTCTGATGCCGCCGTGATCCGGTTGTCAGCGCAGGGGCG- C CCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGAAGGACGAGGCAGCGCGGCTATCGTG- G CTGGCCACGACGGGCGTTCCTTGCGCAGTCTAGACTGGCGCGCCAAACCTGCAGGTTAAAACAGCTGTGGGTTG- T TCCCACCCACAGGGCCCACTGGGCGCTAGCACTCTGATTTTACGAAATCCTTGTGCGCCTGTTTTATATCCCTT- C CCTAATTCGAAACGTAGAAGCAATGCGCACCACTGATCAATAGTAGGCGTAACGCGCCAGTTACGTCATGATCA- A GCATATCTGTTCCCCCGGACTGAGTATCAATAGACTGCTTACGCGGTTGAAGGAGAAAACGTTCGTTATCCGGC- T AACTACTTCGAGAAGCCCAGTAACACCATGGAAGCTGCAGGGTGTTTCGCTCAGCACTTCCCCCGTGTAGATCA- G GTCGATGAGCCACTGCAATCCCCACAGGTGACTGTGGCAGTGGCTGCGTTGGCGGCCTGCCTATGGGGAGACCC- A TAGGACGCTCTAATGTGGACATGGTGCGAAGAGCCTATTGAGCTAGTTAGTAGTCCTCCGGCCCCTGAATGCGG- C TAATCCTAACTGCGGAGCACATGCCTTCAACCCAGAGGGTAGTGTGTCGTAATGGGCAACTCTGCAGCGGAACC- G ACTACTTTGGGTGTCCGTGTTTCTTTTTATTCTTATATTGGCTGCTTATGGTGACAATTACAGAATTGTTACCA- T ATAGCTATTGGATTGGCCATCCGGTGTGTAATAGAGCTGTTATATACCTATTTGTTGGCTTTGTACCACTAACT- T TAAAATCTATAACTACCCTCAACTTTATATTAACCCTCAATACAGTTGAACATGAGGCCTGGCCTGCCCTCCTA- C CTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCGTGAGCGAGCC- C CTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGAACACCACCCA- G TGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACTTTTTCCAGAG- C 
TACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGTTCCTGAACCA- G GTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCAAGGACCTGGC- C AGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCGTGCCCCCTCC- C ATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGAGCCACACCAC- C TCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGTTTAGCACCGT- G ACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGACCGAGGATTT- C TTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCAGAGTGCTGTT- C AAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGCTGGTCAAGAA- G GACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCAACTACCTGGA- C CTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGTGCCAGATGCT- C GATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGGAAGAGGCTGG- C GCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCATGATCACCTG- C CTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGAGGGCCCTGTG- G ACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGAACCAGCAGCA- C CTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATCTGGCCAGCTT- T CTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGCATACCACCGA- G CGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCCAGCTGCTGGC- C CACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACCACAGCCTGGA- A CGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCCTGTCCACCAT- G CAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCGCCCTGACCGT- G TCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGTCCACCACAGT- C GTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACATGCACACCAC- A CACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGCTGGAATACGA- C GATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGGACCCCTACAA- C GAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGGAAGTGACCGA- C 
GTGGTGGTGGACGCCACCGACTGATAACGCCGGCGCCCCCCCCTAACGTTACTGGCCGAAGCCGCTTGGAATAA- G GCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCT- G GCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTC- G TGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAAC- C CCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCC- C AGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTG- A AGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAG- T CGAGGTTAAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATAATAT- G TGCAGAAGGCCCGACTGCGGCTTCAGCTTCAGCCCTGGACCCGTGATCCTGCTGTGGTGCTGCCTGCTGCTGCC- T ATCGTGTCCTCTGCCGCCGTGTCTGTGGCCCCTACAGCCGCCGAGAAGGTGCCAGCCGAGTGCCCCGAGCTGAC- C AGAAGATGCCTGCTGGGCGAGGTGTTCGAGGGCGACAAGTACGAGAGCTGGCTGCGGCCCCTGGTCAACGTGAC- C GGCAGAGATGGCCCCCTGAGCCAGCTGATCCGGTACAGACCCGTGACCCCCGAGGCCGCCAATAGCGTGCTGCT- G GACGAGGCCTTCCTGGATACCCTGGCCCTGCTGTACAACAACCCCGACCAGCTGAGAGCCCTGCTGACCCTGCT- G TCCAGCGACACCGCCCCCAGATGGATGACCGTGATGCGGGGCTACAGCGAGTGTGGAGATGGCAGCCCTGCCGT- G TACACCTGCGTGGACGACCTGTGCAGAGGCTACGACCTGACCAGACTGAGCTACGGCCGGTCCATCTTCACAGA- G CACGTGCTGGGCTTCGAGCTGGTGCCCCCCAGCCTGTTCAACGTGGTGGTGGCCATCCGGAACGAGGCCACCAG- A ACCAACAGAGCCGTGCGGCTGCCTGTGTCTACAGCCGCTGCACCTGAGGGCATCACACTGTTCTACGGCCTGTA- C AACGCCGTGAAAGAGTTCTGCCTCCGGCACCAGCTGGATCCCCCCCTGCTGAGACACCTGGACAAGTACTACGC- C GGCCTGCCCCCAGAGCTGAAGCAGACCAGAGTGAACCTGCCCGCCCACAGCAGATATGGCCCTCAGGCCGTGGA- C GCCAGATGATAAGCGGCCGCATACAGCAGCAATTGGCAAGCTGCTTACATAGAACTCGCGGCGATTGGCATGCC- G CCTTAAAATTTTTATTTTATTTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATATTTCAAAAAAAAAAAA- A AAAAAAAAAAAAAAAAAAAAAAGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGAAGG- A GGACGCACGTCCACTCGGATGGCTAAGGGAGAGCCACGTTTAAACACGTGATATCTGGCCTCATGGGCCTTCCT- T TCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAACATGGTCATAGCTGTTTCCTTGCGTA- T TGGGCGCTCTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGGTAAAGCCTGGGGTGCCTAATGA- G 
CAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCT- G ACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTT- C CCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCT- T CGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTG- G GCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCG- G TAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCT- A CAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGAACAGTATTTGGTATCTGCGCTCTGCTGAAG- C CAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTT- G TTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGAC- G CTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTT- T TAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTATTAGAAAAAT- T CATCCAGCAGACGATAAAACGCAATACGCTGGCTATCCGGTGCCGCAATGCCATACAGCACCAGAAAACGATCC- G CCCATTCGCCGCCCAGTTCTTCCGCAATATCACGGGTGGCCAGCGCAATATCCTGATAACGATCCGCCACGCCC- A GACGGCCGCAATCAATAAAGCCGCTAAAACGGCCATTTTCCACCATAATGTTCGGCAGGCACGCATCACCATGG- G TCACCACCAGATCTTCGCCATCCGGCATGCTCGCTTTCAGACGCGCAAACAGCTCTGCCGGTGCCAGGCCCTGA- T GTTCTTCATCCAGATCATCCTGATCCACCAGGCCCGCTTCCATACGGGTACGCGCACGTTCAATACGATGTTTC- G CCTGATGATCAAACGGACAGGTCGCCGGGTCCAGGGTATGCAGACGACGCATGGCATCCGCCATAATGCTCACT- T TTTCTGCCGGCGCCAGATGGCTAGACAGCAGATCCTGACCCGGCACTTCGCCCAGCAGCAGCCAATCACGGCCC- G CTTCGGTCACCACATCCAGCACCGCCGCACACGGAACACCGGTGGTGGCCAGCCAGCTCAGACGCGCCGCTTCA- T CCTGCAGCTCGTTCAGCGCACCGCTCAGATCGGTTTTCACAAACAGCACCGGACGACCCTGCGCGCTCAGACGA- A ACACCGCCGCATCAGAGCAGCCAATGGTCTGCTGCGCCCAATCATAGCCAAACAGACGTTCCACCCACGCTGCC- G GGCTACCCGCATGCAGGCCATCCTGTTCAATCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGT- T ATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCC- C GAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTCAT- T TTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGGCC- G
CTACAGGGCGCTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGTTTCGGTGCGGGCCTCTTCGCT- A TTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACA- C GCGTAATACGACTCACTATAG A537 Vector: SGP-342-EV71-gL-EMCV-gHso1 (SEQ ID NO: 64): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTCGACTTGATGTTACAAGAGGCTGGGGCC- G 
GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G 
TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C 
GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C 
TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGCTATTCCAGAAGT- A GTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGTATATCCATTTTCGGATCTGATCAAGAGAC- A GGATGAGGATCGTTTCGCATGATTGAATAAGATGGATTGCACGTAGGTTCTCCGGCCGCTTGGGTGGAGAGGCT- A TTCGGCTATGACTGGGCACAACTGACAATCGGCTGCTCTGATGCCGCCGTGATCCGGTTGTCAGCGCAGGGGCG- C CCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGAAGGACGAGGCAGCGCGGCTATCGTG- G CTGGCCACGACGGGCGTTCCTTGCGCAGTCTAGACTGGCGCGCCAAACCTGCAGGTTAAAACAGCTGTGGGTTG- T TCCCACCCACAGGGCCCACTGGGCGCTAGCACTCTGATTTTACGAAATCCTTGTGCGCCTGTTTTATATCCCTT- C CCTAATTCGAAACGTAGAAGCAATGCGCACCACTGATCAATAGTAGGCGTAACGCGCCAGTTACGTCATGATCA- A GCATATCTGTTCCCCCGGACTGAGTATCAATAGACTGCTTACGCGGTTGAAGGAGAAAACGTTCGTTATCCGGC- T AACTACTTCGAGAAGCCCAGTAACACCATGGAAGCTGCAGGGTGTTTCGCTCAGCACTTCCCCCGTGTAGATCA- G GTCGATGAGCCACTGCAATCCCCACAGGTGACTGTGGCAGTGGCTGCGTTGGCGGCCTGCCTATGGGGAGACCC- A TAGGACGCTCTAATGTGGACATGGTGCGAAGAGCCTATTGAGCTAGTTAGTAGTCCTCCGGCCCCTGAATGCGG- C TAATCCTAACTGCGGAGCACATGCCTTCAACCCAGAGGGTAGTGTGTCGTAATGGGCAACTCTGCAGCGGAACC- G ACTACTTTGGGTGTCCGTGTTTCTTTTTATTCTTATATTGGCTGCTTATGGTGACAATTACAGAATTGTTACCA- T ATAGCTATTGGATTGGCCATCCGGTGTGTAATAGAGCTGTTATATACCTATTTGTTGGCTTTGTACCACTAACT- T TAAAATCTATAACTACCCTCAACTTTATATTAACCCTCAATACAGTTGAACATGTGCAGAAGGCCCGACTGCGG- C TTCAGCTTCAGCCCTGGACCCGTGATCCTGCTGTGGTGCTGCCTGCTGCTGCCTATCGTGTCCTCTGCCGCCGT- G TCTGTGGCCCCTACAGCCGCCGAGAAGGTGCCAGCCGAGTGCCCCGAGCTGACCAGAAGATGCCTGCTGGGCGA- G GTGTTCGAGGGCGACAAGTACGAGAGCTGGCTGCGGCCCCTGGTCAACGTGACCGGCAGAGATGGCCCCCTGAG- C CAGCTGATCCGGTACAGACCCGTGACCCCCGAGGCCGCCAATAGCGTGCTGCTGGACGAGGCCTTCCTGGATAC- C CTGGCCCTGCTGTACAACAACCCCGACCAGCTGAGAGCCCTGCTGACCCTGCTGTCCAGCGACACCGCCCCCAG- A TGGATGACCGTGATGCGGGGCTACAGCGAGTGTGGAGATGGCAGCCCTGCCGTGTACACCTGCGTGGACGACCT- G
TGCAGAGGCTACGACCTGACCAGACTGAGCTACGGCCGGTCCATCTTCACAGAGCACGTGCTGGGCTTCGAGCT- G GTGCCCCCCAGCCTGTTCAACGTGGTGGTGGCCATCCGGAACGAGGCCACCAGAACCAACAGAGCCGTGCGGCT- G CCTGTGTCTACAGCCGCTGCACCTGAGGGCATCACACTGTTCTACGGCCTGTACAACGCCGTGAAAGAGTTCTG- C CTCCGGCACCAGCTGGATCCCCCCCTGCTGAGACACCTGGACAAGTACTACGCCGGCCTGCCCCCAGAGCTGAA- G CAGACCAGAGTGAACCTGCCCGCCCACAGCAGATATGGCCCTCAGGCCGTGGACGCCAGATGATAACGCCGGCG- C CCCCCCCTAACGTTACTGGCCGAAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACC- A TATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTT- T CCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGA- C AAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAG- C CACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGA- G TCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCT- G ATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAAACGTCTAGGCCCCCCGAACCAC- G GGGACGTGGTTTTCCTTTGAAAAACACGATAATAATATGAGGCCTGGCCTGCCCTCCTACCTGATCATCCTGGC- C GTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCGTGAGCGAGCCCCTGGACAAGGCTTT- C CACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGAACACCACCCAGTGCACCTACAACAG- C AGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACTTTTTCCAGAGCTACAACCAGTACTA- C GTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGTTCCTGAACCAGGTGGACCTGACCGA- G ACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCAAGGACCTGGCCAGCTACCGGTCCTT- T AGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCGTGCCCCCTCCCATCGACCTGAGCAT- C CCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGAGCCACACCACCTCCGGCCTGCACAG- A CCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGTTTAGCACCGTGACCCCCTGCCTGCA- C CAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGACCGAGGATTTCTTCGTGGTCACCGT- G TCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCAGAGTGCTGTTCAAGGCCCCCTACCA- G CGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGCTGGTCAAGAAGGACCAGCTGAACCG- G CACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCAACTACCTGGACCTGAGCGCCCTGCT- G 
AGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGTGCCAGATGCTCGATCGGCGGACCGT- G GAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGGAAGAGGCTGGCGCCCAGGTGTCAGT- G CCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCATGATCACCTGCCTGAGCCAGACCCC- C CCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGAGGGCCCTGTGGACCCCCAACCAGAT- C ACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGAACCAGCAGCACCTGATCCCCCAGTG- G GCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATCTGGCCAGCTTTCTGAGCGCCTTCGC- C AGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGCATACCACCGAGCGGCGGGAGATCTT- C ATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCCAGCTGCTGGCCCACCCTCACCACGA- G TACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACCACAGCCTGGAACGGCTGACCAGACT- G TTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCCTGTCCACCATGCAGCCCAGCACCCT- G GAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCGCCCTGACCGTGTCCGAGCACGTGTC- C TACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGTCCACCACAGTCGTGGGCCAGAGCCT- G ATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACATGCACACCACACACAGCATCACCGT- G GCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGCTGGAATACGACGATACCCAGGGCGT- G ATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGGACCCCTACAACGAGGTGGTGGTGTC- C AGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGGAAGTGACCGACGTGGTGGTGGACGC- C ACCGACTGATAAGCGGCCGCATACAGCAGCAATTGGCAAGCTGCTTACATAGAACTCGCGGCGATTGGCATGCC- G CCTTAAAATTTTTATTTTATTTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATATTTCAAAAAAAAAAAA- A AAAAAAAAAAAAAAAAAAAAAAGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGAAGG- A GGACGCACGTCCACTCGGATGGCTAAGGGAGAGCCACGTTTAAACACGTGATATCTGGCCTCATGGGCCTTCCT- T TCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAACATGGTCATAGCTGTTTCCTTGCGTA- T TGGGCGCTCTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGGTAAAGCCTGGGGTGCCTAATGA- G CAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCT- G ACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTT- C CCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCT- T 
CGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTG- G GCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCG- G TAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCT- A CAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGAACAGTATTTGGTATCTGCGCTCTGCTGAAG- C CAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTT- G TTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGAC- G CTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTT- T TAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTATTAGAAAAAT- T CATCCAGCAGACGATAAAACGCAATACGCTGGCTATCCGGTGCCGCAATGCCATACAGCACCAGAAAACGATCC- G CCCATTCGCCGCCCAGTTCTTCCGCAATATCACGGGTGGCCAGCGCAATATCCTGATAACGATCCGCCACGCCC- A GACGGCCGCAATCAATAAAGCCGCTAAAACGGCCATTTTCCACCATAATGTTCGGCAGGCACGCATCACCATGG- G TCACCACCAGATCTTCGCCATCCGGCATGCTCGCTTTCAGACGCGCAAACAGCTCTGCCGGTGCCAGGCCCTGA- T GTTCTTCATCCAGATCATCCTGATCCACCAGGCCCGCTTCCATACGGGTACGCGCACGTTCAATACGATGTTTC- G CCTGATGATCAAACGGACAGGTCGCCGGGTCCAGGGTATGCAGACGACGCATGGCATCCGCCATAATGCTCACT- T TTTCTGCCGGCGCCAGATGGCTAGACAGCAGATCCTGACCCGGCACTTCGCCCAGCAGCAGCCAATCACGGCCC- G CTTCGGTCACCACATCCAGCACCGCCGCACACGGAACACCGGTGGTGGCCAGCCAGCTCAGACGCGCCGCTTCA- T CCTGCAGCTCGTTCAGCGCACCGCTCAGATCGGTTTTCACAAACAGCACCGGACGACCCTGCGCGCTCAGACGA- A ACACCGCCGCATCAGAGCAGCCAATGGTCTGCTGCGCCCAATCATAGCCAAACAGACGTTCCACCCACGCTGCC- G GGCTACCCGCATGCAGGCCATCCTGTTCAATCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGT- T ATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCC- C GAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTCAT- T TTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGGCC- G CTACAGGGCGCTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGTTTCGGTGCGGGCCTCTTCGCT- A TTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACA- C GCGTAATACGACTCACTATAG A554 Vector: SGP-gH-SGP-gL-SGP-UL128-SGP-UL130-SGP-UL131 (SEQ ID NO: 65) 
ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTAGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G 
CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A
GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G 
GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G 
GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGGCCTGGCC- T GCCCTCCTACCTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCG- T GAGCGAGCCCCTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGA- A CACCACCCAGTGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACT- T TTTCCAGAGCTACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGT- T CCTGAACCAGGTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCA- A GGACCTGGCCAGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCG- T GCCCCCTCCCATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGA- G CCACACCACCTCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGT- T TAGCACCGTGACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGA- C CGAGGATTTCTTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCA- G AGTGCTGTTCAAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGC- T GGTCAAGAAGGACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCA- A CTACCTGGACCTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGT- G CCAGATGCTCGATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGG- A AGAGGCTGGCGCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCA- T GATCACCTGCCTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGA- G GGCCCTGTGGACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGA- A CCAGCAGCACCTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATC- T GGCCAGCTTTCTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGC- A TACCACCGAGCGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCC- A GCTGCTGGCCCACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACC- A CAGCCTGGAACGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCC- T GTCCACCATGCAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCG- C CCTGACCGTGTCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGT- C 
CACCACAGTCGTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACA- T GCACACCACACACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGC- T GGAATACGACGATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGG- A CCCCTACAACGAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGG- A AGTGACCGACGTGGTGGTGGACGCCACCGACAGCAGACTGCTGATGATGAGCGTGTACGCCCTGAGCGCCATCA- T CGGCATCTACCTGCTGTACCGGATGCTGAAAACCTGCTGATAATCTAGAGGCCCCTATAACTCTCTACGGCTAA- C CTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGTGCAGAAGGCCCGACTGCGGCTTCAGCTTCAGCCCTG- G ACCCGTGATCCTGCTGTGGTGCTGCCTGCTGCTGCCTATCGTGTCCTCTGCCGCCGTGTCTGTGGCCCCTACAG- C CGCCGAGAAGGTGCCAGCCGAGTGCCCCGAGCTGACCAGAAGATGCCTGCTGGGCGAGGTGTTCGAGGGCGACA- A GTACGAGAGCTGGCTGCGGCCCCTGGTCAACGTGACCGGCAGAGATGGCCCCCTGAGCCAGCTGATCCGGTACA- G ACCCGTGACCCCCGAGGCCGCCAATAGCGTGCTGCTGGACGAGGCCTTCCTGGATACCCTGGCCCTGCTGTACA- A CAACCCCGACCAGCTGAGAGCCCTGCTGACCCTGCTGTCCAGCGACACCGCCCCCAGATGGATGACCGTGATGC- G GGGCTACAGCGAGTGTGGAGATGGCAGCCCTGCCGTGTACACCTGCGTGGACGACCTGTGCAGAGGCTACGACC- T GACCAGACTGAGCTACGGCCGGTCCATCTTCACAGAGCACGTGCTGGGCTTCGAGCTGGTGCCCCCCAGCCTGT- T CAACGTGGTGGTGGCCATCCGGAACGAGGCCACCAGAACCAACAGAGCCGTGCGGCTGCCTGTGTCTACAGCCG- C TGCACCTGAGGGCATCACACTGTTCTACGGCCTGTACAACGCCGTGAAAGAGTTCTGCCTCCGGCACCAGCTGG- A TCCCCCCCTGCTGAGACACCTGGACAAGTACTACGCCGGCCTGCCCCCAGAGCTGAAGCAGACCAGAGTGAACC- T GCCCGCCCACAGCAGATATGGCCCTCAGGCCGTGGACGCCAGATGATAACGCCGGCGGCCCCTATAACTCTCTA- C GGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGCCCCAAGGACCTGACCCCCTTCCTGACA- A CCCTGTGGCTGCTCCTGGGCCATAGCAGAGTGCCTAGAGTGCGGGCCGAGGAATGCTGCGAGTTCATCAACGTG- A ACCACCCCCCCGAGCGGTGCTACGACTTCAAGATGTGCAACCGGTTCACCGTGGCCCTGAGATGCCCCGACGGC- G AAGTGTGCTACAGCCCCGAGAAAACCGCCGAGATCCGGGGCATCGTGACCACCATGACCCACAGCCTGACCCGG- C AGGTGGTGCACAACAAGCTGACCAGCTGCAACTACAACCCCCTGTACCTGGAAGCCGACGGCCGGATCAGATGC- G GCAAAGTGAACGACAAGGCCCAGTACCTGCTGGGAGCCGCCGGAAGCGTGCCCTACCGGTGGATCAACCTGGAA- T ACGACAAGATCACCCGGATCGTGGGCCTGGACCAGTACCTGGAAAGCGTGAAGAAGCACAAGCGGCTGGACGTG- T 
GCAGAGCCAAGATGGGCTACATGCTGCAGTGATAAGGCGCGCCGCCCCTATAACTCTCTACGGCTAACCTGAAT- G GACTACGACATAGTCTAGTCCGCCAAGATGCTGCGGCTGCTGCTGAGACACCACTTCCACTGCCTGCTGCTGTG- T GCCGTGTGGGCCACCCCTTGTCTGGCCAGCCCTTGGAGCACCCTGACCGCCAACCAGAACCCTAGCCCCCCTTG- G TCCAAGCTGACCTACAGCAAGCCCCACGACGCCGCCACCTTCTACTGCCCCTTTCTGTACCCCAGCCCTCCCAG- A AGCCCCCTGCAGTTCAGCGGCTTCCAGAGAGTGTCCACCGGCCCTGAGTGCCGGAACGAGACACTGTACCTGCT- G TACAACCGGGAGGGCCAGACACTGGTGGAGCGGAGCAGCACCTGGGTGAAAAAAGTGATCTGGTATCTGAGCGG- C CGGAACCAGACCATCCTGCAGCGGATGCCCAGAACCGCCAGCAAGCCCAGCGACGGCAACGTGCAGATCAGCGT- G GAGGACGCCAAAATCTTCGGAGCCCACATGGTGCCCAAGCAGACCAAGCTGCTGAGATTCGTGGTCAACGACGG- C ACCAGATATCAGATGTGCGTGATGAAGCTGGAAAGCTGGGCCCACGTGTTCCGGGACTACTCCGTGAGCTTCCA- G GTCCGGCTGACCTTCACCGAGGCCAACAACCAGACCTACACCTTCTGCACCCACCCCAACCTGATCGTGTGATA- A GCGGCCGCGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGCGGC- T GTGCAGAGTGTGGCTGTCCGTGTGCCTGTGTGCCGTGGTGCTGGGCCAGTGCCAGAGAGAGACAGCCGAGAAGA- A CGACTACTACCGGGTGCCCCACTACTGGGATGCCTGCAGCAGAGCCCTGCCCGACCAGACCCGGTACAAATACG- T GGAGCAGCTCGTGGACCTGACCCTGAACTACCACTACGACGCCAGCCACGGCCTGGACAACTTCGACGTGCTGA- A GCGGATCAACGTGACCGAGGTGTCCCTGCTGATCAGCGACTTCCGGCGGCAGAACAGAAGAGGCGGCACCAACA- A GCGGACCACCTTCAACGCCGCTGGCTCTCTGGCCCCTCACGCCAGATCCCTGGAATTCAGCGTGCGGCTGTTCG- C CAACTGATAACGTTGCATCCTGCAGGATACAGCAGCAATTGGCAAGCTGCTTACATAGAACTCGCGGCGATTGG- C ATGCCGCCTTAAAATTTTTATTTTATTTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATATTTCAAAAAA- A AAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATC- C GAAGGAGGACGCACGTCCACTCGGATGGCTAAGGGAGAGCCACGTTTAAACGCTAGAGCAAGACGTTTCCCGTT- G AATATGGCTCATAACACCCCTTGTATTACTGTTTATGTAAGCAGACAGTTTTATTGTTCATGATGATATATTTT- T ATCTTGTGCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTGTTGAATAAATCGAACTTTTGCTGAGT- T GAAGGATCAGATCACGCATCTTCCCGACAACGCAGACCGTTCCGTGGCAAAGCAAAAGTTCAAAATCACCAACT- G GTCCACCTACAACAAAGCTCTCATCAACCGTGGCTCCCTCACTTTCTGGCTGGATGATGGGGCGATTCAGGCCT- G GTATGAGTCAGCAACACCTTCTTCACGAGGCAGACCTCAGCGCTAGCGGAGTGTATACTGGCTTACTATGTTGG- C 
ACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATG-
T GATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATG- G CTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAA- A GCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACC- C GACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTT- T ACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTC- G CTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTG- A GTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTG- A AGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGG- T TCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAG- A TTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCA- C GTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTT- A AATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTATTAGAAAAATTCATCCAGCAGACGATAAAAC- G CAATACGCTGGCTATCCGGTGCCGCAATGCCATACAGCACCAGAAAACGATCCGCCCATTCGCCGCCCAGTTCT- T CCGCAATATCACGGGTGGCCAGCGCAATATCCTGATAACGATCCGCCACGCCCAGACGGCCGCAATCAATAAAG- C CGCTAAAACGGCCATTTTCCACCATAATGTTCGGCAGGCACGCATCACCATGGGTCACCACCAGATCTTCGCCA- T CCGGCATGCTCGCTTTCAGACGCGCAAACAGCTCTGCCGGTGCCAGGCCCTGATGTTCTTCATCCAGATCATCC- T GATCCACCAGGCCCGCTTCCATACGGGTACGCGCACGTTCAATACGATGTTTCGCCTGATGATCAAACGGACAG- G TCGCCGGGTCCAGGGTATGCAGACGACGCATGGCATCCGCCATAATGCTCACTTTTTCTGCCGGCGCCAGATGG- C TAGACAGCAGATCCTGACCCGGCACTTCGCCCAGCAGCAGCCAATCACGGCCCGCTTCGGTCACCACATCCAGC- A CCGCCGCACACGGAACACCGGTGGTGGCCAGCCAGCTCAGACGCGCCGCTTCATCCTGCAGCTCGTTCAGCGCA- C CGCTCAGATCGGTTTTCACAAACAGCACCGGACGACCCTGCGCGCTCAGACGAAACACCGCCGCATCAGAGCAG- C CAATGGTCTGCTGCGCCCAATCATAGCCAAACAGACGTTCCACCCACGCTGCCGGGCTACCCGCATGCAGGCCA- T CCTGTTCAATCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATAC- A TATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTG- T AAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAA- T 
CGGCAAAATCCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGGCCGCTACAGGGCGCTCCCATTCG- C CATTCAGGCTGCGCAACTGTTGGGAAGGGCGTTTCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGG- G GGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACACGCGTAATACGACTCACTATA- G A555 Vector: SGP-gHso1-SGP-gL-SGP-UL128-SGP-UL130-SGP-UL131 (SEQ ID NO: 66): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C 
CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTAGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C 
CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G 
GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG-
A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGGCCTGGCC- T GCCCTCCTACCTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCG- T GAGCGAGCCCCTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGA- A CACCACCCAGTGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACT- T TTTCCAGAGCTACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGT- T CCTGAACCAGGTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCA- A GGACCTGGCCAGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCG- T GCCCCCTCCCATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGA- G CCACACCACCTCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGT- T TAGCACCGTGACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGA- C CGAGGATTTCTTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCA- G AGTGCTGTTCAAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGC- T GGTCAAGAAGGACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCA- A CTACCTGGACCTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGT- G CCAGATGCTCGATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGG- A AGAGGCTGGCGCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCA- T GATCACCTGCCTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGA- G GGCCCTGTGGACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGA- A CCAGCAGCACCTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATC- T GGCCAGCTTTCTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGC- A TACCACCGAGCGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCC- A 
GCTGCTGGCCCACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACC- A CAGCCTGGAACGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCC- T GTCCACCATGCAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCG- C CCTGACCGTGTCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGT- C CACCACAGTCGTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACA- T GCACACCACACACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGC- T GGAATACGACGATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGG- A CCCCTACAACGAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGG- A AGTGACCGACGTGGTGGTGGACGCCACCGACTGATAATCTAGAGGCCCCTATAACTCTCTACGGCTAACCTGAA- T GGACTACGACATAGTCTAGTCCGCCAAGATGTGCAGAAGGCCCGACTGCGGCTTCAGCTTCAGCCCTGGACCCG- T GATCCTGCTGTGGTGCTGCCTGCTGCTGCCTATCGTGTCCTCTGCCGCCGTGTCTGTGGCCCCTACAGCCGCCG- A GAAGGTGCCAGCCGAGTGCCCCGAGCTGACCAGAAGATGCCTGCTGGGCGAGGTGTTCGAGGGCGACAAGTACG- A GAGCTGGCTGCGGCCCCTGGTCAACGTGACCGGCAGAGATGGCCCCCTGAGCCAGCTGATCCGGTACAGACCCG- T GACCCCCGAGGCCGCCAATAGCGTGCTGCTGGACGAGGCCTTCCTGGATACCCTGGCCCTGCTGTACAACAACC- C CGACCAGCTGAGAGCCCTGCTGACCCTGCTGTCCAGCGACACCGCCCCCAGATGGATGACCGTGATGCGGGGCT- A CAGCGAGTGTGGAGATGGCAGCCCTGCCGTGTACACCTGCGTGGACGACCTGTGCAGAGGCTACGACCTGACCA- G ACTGAGCTACGGCCGGTCCATCTTCACAGAGCACGTGCTGGGCTTCGAGCTGGTGCCCCCCAGCCTGTTCAACG- T GGTGGTGGCCATCCGGAACGAGGCCACCAGAACCAACAGAGCCGTGCGGCTGCCTGTGTCTACAGCCGCTGCAC- C TGAGGGCATCACACTGTTCTACGGCCTGTACAACGCCGTGAAAGAGTTCTGCCTCCGGCACCAGCTGGATCCCC- C CCTGCTGAGACACCTGGACAAGTACTACGCCGGCCTGCCCCCAGAGCTGAAGCAGACCAGAGTGAACCTGCCCG- C CCACAGCAGATATGGCCCTCAGGCCGTGGACGCCAGATGATAACGCCGGCGGCCCCTATAACTCTCTACGGCTA- A CCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGCCCCAAGGACCTGACCCCCTTCCTGACAACCCTG- T GGCTGCTCCTGGGCCATAGCAGAGTGCCTAGAGTGCGGGCCGAGGAATGCTGCGAGTTCATCAACGTGAACCAC- C CCCCCGAGCGGTGCTACGACTTCAAGATGTGCAACCGGTTCACCGTGGCCCTGAGATGCCCCGACGGCGAAGTG- T GCTACAGCCCCGAGAAAACCGCCGAGATCCGGGGCATCGTGACCACCATGACCCACAGCCTGACCCGGCAGGTG- G 
TGCACAACAAGCTGACCAGCTGCAACTACAACCCCCTGTACCTGGAAGCCGACGGCCGGATCAGATGCGGCAAA- G TGAACGACAAGGCCCAGTACCTGCTGGGAGCCGCCGGAAGCGTGCCCTACCGGTGGATCAACCTGGAATACGAC- A AGATCACCCGGATCGTGGGCCTGGACCAGTACCTGGAAAGCGTGAAGAAGCACAAGCGGCTGGACGTGTGCAGA- G CCAAGATGGGCTACATGCTGCAGTGATAAGGCGCGCCAACGTTACTGGCCGAAGCCGCTTGGAATAAGGCCGGT- G TGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTG- T CTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGG- A AGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCAC- C TGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCC- A CGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGAAGGATG- C CCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGT- T AAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATATGCTGCGGCTG- C TGCTGAGACACCACTTCCACTGCCTGCTGCTGTGTGCCGTGTGGGCCACCCCTTGTCTGGCCAGCCCTTGGAGC- A CCCTGACCGCCAACCAGAACCCTAGCCCCCCTTGGTCCAAGCTGACCTACAGCAAGCCCCACGACGCCGCCACC- T TCTACTGCCCCTTTCTGTACCCCAGCCCTCCCAGAAGCCCCCTGCAGTTCAGCGGCTTCCAGAGAGTGTCCACC- G GCCCTGAGTGCCGGAACGAGACACTGTACCTGCTGTACAACCGGGAGGGCCAGACACTGGTGGAGCGGAGCAGC- A CCTGGGTGAAAAAAGTGATCTGGTATCTGAGCGGCCGGAACCAGACCATCCTGCAGCGGATGCCCAGAACCGCC- A GCAAGCCCAGCGACGGCAACGTGCAGATCAGCGTGGAGGACGCCAAAATCTTCGGAGCCCACATGGTGCCCAAG- C AGACCAAGCTGCTGAGATTCGTGGTCAACGACGGCACCAGATATCAGATGTGCGTGATGAAGCTGGAAAGCTGG- G CCCACGTGTTCCGGGACTACTCCGTGAGCTTCCAGGTCCGGCTGACCTTCACCGAGGCCAACAACCAGACCTAC- A CCTTCTGCACCCACCCCAACCTGATCGTGTGATAAGTACCTTTGTACGCCTGTTTTATACCCCCTCCCTGATTT- G CAACTTAGAAGCAACGCAAACCAGATCAATAGTAGGTGTGACATACCAGTCGCATCTTGATCAAGCACTTCTGT- A TCCCCGGACCGAGTATCAATAGACTGTGCACACGGTTGAAGGAGAAAACGTCCGTTACCCGGCTAACTACTTCG- A GAAGCCTAGTAACGCCATTGAAGTTGCAGAGTGTTTCGCTCAGCACTCCCCCCGTGTAGATCAGGTCGATGAGT- C ACCGCATTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCTATGGGGTAACCCATAGGACGCTC- T AATACGGACATGGCGTGAAGAGTCTATTGAGCTAGTTAGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCTAAC- T 
GCGGAGCACATACCCTTAATCCAAAGGGCAGTGTGTCGTAACGGGCAACTCTGCAGCGGAACCGACTACTTTGG- G TGTCCGTGTTTCTTTTTATTCTTGTATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTG- G ATTGGCCATCCAGTGTCAAACAGAGCTATTGTATATCTCTTTGTTGGATTCACACCTCTCACTCTTGAAACGTT- A CACACCCTCAATTACATTATACTGCTGAACACGAAGCGCATATGCGGCTGTGCAGAGTGTGGCTGTCCGTGTGC- C TGTGTGCCGTGGTGCTGGGCCAGTGCCAGAGAGAGACAGCCGAGAAGAACGACTACTACCGGGTGCCCCACTAC- T GGGATGCCTGCAGCAGAGCCCTGCCCGACCAGACCCGGTACAAATACGTGGAGCAGCTCGTGGACCTGACCCTG- A ACTACCACTACGACGCCAGCCACGGCCTGGACAACTTCGACGTGCTGAAGCGGATCAACGTGACCGAGGTGTCC- C TGCTGATCAGCGACTTCCGGCGGCAGAACAGAAGAGGCGGCACCAACAAGCGGACCACCTTCAACGCCGCTGGC- T CTCTGGCCCCTCACGCCAGATCCCTGGAATTCAGCGTGCGGCTGTTCGCCAACTGATAACGTTGCATCCTGCAG- G ATACAGCAGCAATTGGCAAGCTGCTTACATAGAACTCGCGGCGATTGGCATGCCGCCTTAAAATTTTTATTTTA- T TTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA- A AAGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGAAGGAGGACGCACGTCCACTCGGA- T GGCTAAGGGAGAGCCACGTTTAAACGCTAGAGCAAGACGTTTCCCGTTGAATATGGCTCATAACACCCCTTGTA- T TACTGTTTATGTAAGCAGACAGTTTTATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCAGAGA- T TTTGAGACACAACGTGGCTTTGTTGAATAAATCGAACTTTTGCTGAGTTGAAGGATCAGATCACGCATCTTCCC- G ACAACGCAGACCGTTCCGTGGCAAAGCAAAAGTTCAAAATCACCAACTGGTCCACCTACAACAAAGCTCTCATC- A ACCGTGGCTCCCTCACTTTCTGGCTGGATGATGGGGCGATTCAGGCCTGGTATGAGTCAGCAACACCTTCTTCA- C GAGGCAGACCTCAGCGCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGC- T TCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTC- G CTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCT- G GAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCC- C CTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCG- T TTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGC- C GCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAA- C CCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAA- G 
CACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAAC- T GAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAAC- C TTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCA- A GAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATT- A TCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTA- A ACTTGGTCTGACAGTTATTAGAAAAATTCATCCAGCAGACGATAAAACGCAATACGCTGGCTATCCGGTGCCGC- A ATGCCATACAGCACCAGAAAACGATCCGCCCATTCGCCGCCCAGTTCTTCCGCAATATCACGGGTGGCCAGCGC- A ATATCCTGATAACGATCCGCCACGCCCAGACGGCCGCAATCAATAAAGCCGCTAAAACGGCCATTTTCCACCAT- A ATGTTCGGCAGGCACGCATCACCATGGGTCACCACCAGATCTTCGCCATCCGGCATGCTCGCTTTCAGACGCGC- A AACAGCTCTGCCGGTGCCAGGCCCTGATGTTCTTCATCCAGATCATCCTGATCCACCAGGCCCGCTTCCATACG- G GTACGCGCACGTTCAATACGATGTTTCGCCTGATGATCAAACGGACAGGTCGCCGGGTCCAGGGTATGCAGACG- A CGCATGGCATCCGCCATAATGCTCACTTTTTCTGCCGGCGCCAGATGGCTAGACAGCAGATCCTGACCCGGCAC- T TCGCCCAGCAGCAGCCAATCACGGCCCGCTTCGGTCACCACATCCAGCACCGCCGCACACGGAACACCGGTGGT- G GCCAGCCAGCTCAGACGCGCCGCTTCATCCTGCAGCTCGTTCAGCGCACCGCTCAGATCGGTTTTCACAAACAG- C ACCGGACGACCCTGCGCGCTCAGACGAAACACCGCCGCATCAGAGCAGCCAATGGTCTGCTGCGCCCAATCATA- G CCAAACAGACGTTCCACCCACGCTGCCGGGCTACCCGCATGCAGGCCATCCTGTTCAATCATACTCTTCCTTTT- T CAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAA- A CAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTTAAAATTC- G CGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAA- G AATAGACCGAGATAGGGTTGAGTGGCCGCTACAGGGCGCTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGA- A GGGCGTTTCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTT- G GGTAACGCCAGGGTTTTCCCAGTCACACGCGTAATACGACTCACTATAG A556 Vector: SGP-gHso16His-SGP-gL-SGP-UL128-SGP-UL130-SGP-UL131 (SEQ ID NO: 67): ATAGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAA- G ACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAAT- G
ACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACG- A TCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGA- T GTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAG- G AATTGGACAAGAAAATGAAGGAGCTCGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTC- C ACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACA- A GTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTT- A AGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATA- G GCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCA- T CCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTG- C CGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTAC- G TCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAG- G GATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCA- G CTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTT- G GGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCC- G TAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTA- C GAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCG- G ATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTG- G AGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAG- G ACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTA- C CACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTAGACTTGATGTTACAAGAGGCTGGGGCC- G GCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCT- G TGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTG- A TAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACAT- G CAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAAC- A GGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCC- A 
GCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTA- G GGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCT- C CTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTC- A CCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAA- G GGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTAT- A TTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTG- C TCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGC- A CACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTAC- G ACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAG- C AGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATA- A TGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCT- C TGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTA- G CCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAA- G CAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAAC- G TGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACT- G TGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGA- C TCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCC- C CGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGG- G CAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTA- C CTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTC- G TCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGG- T TGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGAC- A TAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTT- A GCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCT- G ACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCC- T 
CACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATCCTTACAAG- C TTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTG- G TGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGA- G GGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGA- C TGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGT- G ACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATT- C CACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCT- T TAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTG- G CTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTG- A GGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTG- G AAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCC- A ATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCG- G AAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTA- A AAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAG- A AGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTG- G AAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCA- C CACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGC- A TAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTA- T CTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAG- G GAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCG- C GACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTT- G CACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAG- G AGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCA- G GCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGT- G CATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAA- G 
TGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGC- A AGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCC- A TAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACC- C TGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGT- A ACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGAC- A TGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAA- C ACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCA- G CTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTG- G AATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAA- G AAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTG- A ATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACA- A AACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGA- A TCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCT- G AAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTT- G ATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTG- T TGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGA- G CCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTG- T TGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCG- G ACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAG- A AAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCC- C TAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTG- C ATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAA- A CCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGA- G GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGAGGCCTGGCC- T GCCCTCCTACCTGATCATCCTGGCCGTGTGCCTGTTCAGCCACCTGCTGTCCAGCAGATACGGCGCCGAGGCCG- T 
GAGCGAGCCCCTGGACAAGGCTTTCCACCTGCTGCTGAACACCTACGGCAGACCCATCCGGTTTCTGCGGGAGA- A CACCACCCAGTGCACCTACAACAGCAGCCTGCGGAACAGCACCGTCGTGAGAGAGAACGCCATCAGCTTCAACT- T TTTCCAGAGCTACAACCAGTACTACGTGTTCCACATGCCCAGATGCCTGTTTGCCGGCCCTCTGGCCGAGCAGT- T CCTGAACCAGGTGGACCTGACCGAGACACTGGAAAGATACCAGCAGCGGCTGAATACCTACGCCCTGGTGTCCA- A GGACCTGGCCAGCTACCGGTCCTTTAGCCAGCAGCTCAAGGCTCAGGATAGCCTCGGCGAGCAGCCTACCACCG- T GCCCCCTCCCATCGACCTGAGCATCCCCCACGTGTGGATGCCTCCCCAGACCACCCCTCACGGCTGGACCGAGA- G CCACACCACCTCCGGCCTGCACAGACCCCACTTCAACCAGACCTGCATCCTGTTCGACGGCCACGACCTGCTGT- T TAGCACCGTGACCCCCTGCCTGCACCAGGGCTTCTACCTGATCGACGAGCTGAGATACGTGAAGATCACCCTGA- C CGAGGATTTCTTCGTGGTCACCGTGTCCATCGACGACGACACCCCCATGCTGCTGATCTTCGGCCACCTGCCCA- G AGTGCTGTTCAAGGCCCCCTACCAGCGGGACAACTTCATCCTGCGGCAGACCGAGAAGCACGAGCTGCTGGTGC- T GGTCAAGAAGGACCAGCTGAACCGGCACTCCTACCTGAAGGACCCCGACTTCCTGGACGCCGCCCTGGACTTCA- A CTACCTGGACCTGAGCGCCCTGCTGAGAAACAGCTTCCACAGATACGCCGTGGACGTGCTGAAGTCCGGACGGT- G CCAGATGCTCGATCGGCGGACCGTGGAGATGGCCTTCGCCTATGCCCTCGCCCTGTTCGCCGCTGCCAGACAGG- A AGAGGCTGGCGCCCAGGTGTCAGTGCCCAGAGCCCTGGATAGACAGGCCGCCCTGCTGCAGATCCAGGAATTCA- T GATCACCTGCCTGAGCCAGACCCCCCCTAGAACCACCCTGCTGCTGTACCCCACAGCCGTGGATCTGGCCAAGA- G GGCCCTGTGGACCCCCAACCAGATCACCGACATCACAAGCCTCGTGCGGCTCGTGTACATCCTGAGCAAGCAGA- A CCAGCAGCACCTGATCCCCCAGTGGGCCCTGAGACAGATCGCCGACTTCGCCCTGAAGCTGCACAAGACCCATC- T GGCCAGCTTTCTGAGCGCCTTCGCCAGGCAGGAACTGTACCTGATGGGCAGCCTGGTCCACAGCATGCTGGTGC- A TACCACCGAGCGGCGGGAGATCTTCATCGTGGAGACAGGCCTGTGTAGCCTGGCCGAGCTGTCCCACTTTACCC- A GCTGCTGGCCCACCCTCACCACGAGTACCTGAGCGACCTGTACACCCCCTGCAGCAGCAGCGGCAGACGGGACC- A CAGCCTGGAACGGCTGACCAGACTGTTCCCCGATGCCACCGTGCCTGCTACAGTGCCTGCCGCCCTGTCCATCC- T GTCCACCATGCAGCCCAGCACCCTGGAAACCTTCCCCGACCTGTTCTGCCTGCCCCTGGGCGAGAGCTTTAGCG- C CCTGACCGTGTCCGAGCACGTGTCCTACATCGTGACCAATCAGTACCTGATCAAGGGCATCAGCTACCCCGTGT- C CACCACAGTCGTGGGCCAGAGCCTGATCATCACCCAGACCGACAGCCAGACCAAGTGCGAGCTGACCCGGAACA- T GCACACCACACACAGCATCACCGTGGCCCTGAACATCAGCCTGGAAAACTGCGCTTTCTGTCAGTCTGCCCTGC- T 
GGAATACGACGATACCCAGGGCGTGATCAACATCATGTACATGCACGACAGCGACGACGTGCTGTTCGCCCTGG-
A CCCCTACAACGAGGTGGTGGTGTCCAGCCCCCGGACCCACTACCTGATGCTGCTGAAGAACGGCACCGTGCTGG- A AGTGACCGACGTGGTGGTGGACGCCACCGACGGCAGCGGATCTGGGTCCCACCATCACCATCACCATTGATAAT- C TAGAGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATGTGCAGAA- G GCCCGACTGCGGCTTCAGCTTCAGCCCTGGACCCGTGATCCTGCTGTGGTGCTGCCTGCTGCTGCCTATCGTGT- C CTCTGCCGCCGTGTCTGTGGCCCCTACAGCCGCCGAGAAGGTGCCAGCCGAGTGCCCCGAGCTGACCAGAAGAT- G CCTGCTGGGCGAGGTGTTCGAGGGCGACAAGTACGAGAGCTGGCTGCGGCCCCTGGTCAACGTGACCGGCAGAG- A TGGCCCCCTGAGCCAGCTGATCCGGTACAGACCCGTGACCCCCGAGGCCGCCAATAGCGTGCTGCTGGACGAGG- C CTTCCTGGATACCCTGGCCCTGCTGTACAACAACCCCGACCAGCTGAGAGCCCTGCTGACCCTGCTGTCCAGCG- A CACCGCCCCCAGATGGATGACCGTGATGCGGGGCTACAGCGAGTGTGGAGATGGCAGCCCTGCCGTGTACACCT- G CGTGGACGACCTGTGCAGAGGCTACGACCTGACCAGACTGAGCTACGGCCGGTCCATCTTCACAGAGCACGTGC- T GGGCTTCGAGCTGGTGCCCCCCAGCCTGTTCAACGTGGTGGTGGCCATCCGGAACGAGGCCACCAGAACCAACA- G AGCCGTGCGGCTGCCTGTGTCTACAGCCGCTGCACCTGAGGGCATCACACTGTTCTACGGCCTGTACAACGCCG- T GAAAGAGTTCTGCCTCCGGCACCAGCTGGATCCCCCCCTGCTGAGACACCTGGACAAGTACTACGCCGGCCTGC- C CCCAGAGCTGAAGCAGACCAGAGTGAACCTGCCCGCCCACAGCAGATATGGCCCTCAGGCCGTGGACGCCAGAT- G ATAACGCCGGCGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGATG- A GCCCCAAGGACCTGACCCCCTTCCTGACAACCCTGTGGCTGCTCCTGGGCCATAGCAGAGTGCCTAGAGTGCGG- G CCGAGGAATGCTGCGAGTTCATCAACGTGAACCACCCCCCCGAGCGGTGCTACGACTTCAAGATGTGCAACCGG- T TCACCGTGGCCCTGAGATGCCCCGACGGCGAAGTGTGCTACAGCCCCGAGAAAACCGCCGAGATCCGGGGCATC- G TGACCACCATGACCCACAGCCTGACCCGGCAGGTGGTGCACAACAAGCTGACCAGCTGCAACTACAACCCCCTG- T ACCTGGAAGCCGACGGCCGGATCAGATGCGGCAAAGTGAACGACAAGGCCCAGTACCTGCTGGGAGCCGCCGGA- A GCGTGCCCTACCGGTGGATCAACCTGGAATACGACAAGATCACCCGGATCGTGGGCCTGGACCAGTACCTGGAA- A GCGTGAAGAAGCACAAGCGGCTGGACGTGTGCAGAGCCAAGATGGGCTACATGCTGCAGTGATAAGGCGCGCCA- A CGTTACTGGCCGAAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGT- C TTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCG- C CAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGT- C 
TGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTAT- A AGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGC- T CTCCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGC- C TCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGG- T TTTCCTTTGAAAAACACGATAATATGCTGCGGCTGCTGCTGAGACACCACTTCCACTGCCTGCTGCTGTGTGCC- G TGTGGGCCACCCCTTGTCTGGCCAGCCCTTGGAGCACCCTGACCGCCAACCAGAACCCTAGCCCCCCTTGGTCC- A AGCTGACCTACAGCAAGCCCCACGACGCCGCCACCTTCTACTGCCCCTTTCTGTACCCCAGCCCTCCCAGAAGC- C CCCTGCAGTTCAGCGGCTTCCAGAGAGTGTCCACCGGCCCTGAGTGCCGGAACGAGACACTGTACCTGCTGTAC- A ACCGGGAGGGCCAGACACTGGTGGAGCGGAGCAGCACCTGGGTGAAAAAAGTGATCTGGTATCTGAGCGGCCGG- A ACCAGACCATCCTGCAGCGGATGCCCAGAACCGCCAGCAAGCCCAGCGACGGCAACGTGCAGATCAGCGTGGAG- G ACGCCAAAATCTTCGGAGCCCACATGGTGCCCAAGCAGACCAAGCTGCTGAGATTCGTGGTCAACGACGGCACC- A GATATCAGATGTGCGTGATGAAGCTGGAAAGCTGGGCCCACGTGTTCCGGGACTACTCCGTGAGCTTCCAGGTC- C GGCTGACCTTCACCGAGGCCAACAACCAGACCTACACCTTCTGCACCCACCCCAACCTGATCGTGTGATAAGTA- C CTTTGTACGCCTGTTTTATACCCCCTCCCTGATTTGCAACTTAGAAGCAACGCAAACCAGATCAATAGTAGGTG- T GACATACCAGTCGCATCTTGATCAAGCACTTCTGTATCCCCGGACCGAGTATCAATAGACTGTGCACACGGTTG- A AGGAGAAAACGTCCGTTACCCGGCTAACTACTTCGAGAAGCCTAGTAACGCCATTGAAGTTGCAGAGTGTTTCG- C TCAGCACTCCCCCCGTGTAGATCAGGTCGATGAGTCACCGCATTCCCCACGGGCGACCGTGGCGGTGGCTGCGT- T GGCGGCCTGCCTATGGGGTAACCCATAGGACGCTCTAATACGGACATGGCGTGAAGAGTCTATTGAGCTAGTTA- G TAGTCCTCCGGCCCCTGAATGCGGCTAATCCTAACTGCGGAGCACATACCCTTAATCCAAAGGGCAGTGTGTCG- T AACGGGCAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCTTTTTATTCTTGTATTGGCTGCTTAT- G GTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTGGCCATCCAGTGTCAAACAGAGCTATTGTATATCT- C TTTGTTGGATTCACACCTCTCACTCTTGAAACGTTACACACCCTCAATTACATTATACTGCTGAACACGAAGCG- C ATATGCGGCTGTGCAGAGTGTGGCTGTCCGTGTGCCTGTGTGCCGTGGTGCTGGGCCAGTGCCAGAGAGAGACA- G CCGAGAAGAACGACTACTACCGGGTGCCCCACTACTGGGATGCCTGCAGCAGAGCCCTGCCCGACCAGACCCGG- T ACAAATACGTGGAGCAGCTCGTGGACCTGACCCTGAACTACCACTACGACGCCAGCCACGGCCTGGACAACTTC- G 
ACGTGCTGAAGCGGATCAACGTGACCGAGGTGTCCCTGCTGATCAGCGACTTCCGGCGGCAGAACAGAAGAGGC- G GCACCAACAAGCGGACCACCTTCAACGCCGCTGGCTCTCTGGCCCCTCACGCCAGATCCCTGGAATTCAGCGTG- C GGCTGTTCGCCAACTGATAACGTTGCATCCTGCAGGATACAGCAGCAATTGGCAAGCTGCTTACATAGAACTCG- C GGCGATTGGCATGCCGCCTTAAAATTTTTATTTTATTTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATA- T TTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGA- C CTGGGCATCCGAAGGAGGACGCACGTCCACTCGGATGGCTAAGGGAGAGCCACGTTTAAACGCTAGAGCAAGAC- G TTTCCCGTTGAATATGGCTCATAACACCCCTTGTATTACTGTTTATGTAAGCAGACAGTTTTATTGTTCATGAT- G ATATATTTTTATCTTGTGCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTGTTGAATAAATCGAACT- T TTGCTGAGTTGAAGGATCAGATCACGCATCTTCCCGACAACGCAGACCGTTCCGTGGCAAAGCAAAAGTTCAAA- A TCACCAACTGGTCCACCTACAACAAAGCTCTCATCAACCGTGGCTCCCTCACTTTCTGGCTGGATGATGGGGCG- A TTCAGGCCTGGTATGAGTCAGCAACACCTTCTTCACGAGGCAGACCTCAGCGCTAGCGGAGTGTATACTGGCTT- A CTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCA- G CAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCG- A GCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGG- G CCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGG- T GGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTG- C CTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGG- T AGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAAC- T ATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGA- G TTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCA- G TTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTC- A GAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAA- C GAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAA- A TGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTATTAGAAAAATTCATCCAGCAG- A CGATAAAACGCAATACGCTGGCTATCCGGTGCCGCAATGCCATACAGCACCAGAAAACGATCCGCCCATTCGCC- G 
CCCAGTTCTTCCGCAATATCACGGGTGGCCAGCGCAATATCCTGATAACGATCCGCCACGCCCAGACGGCCGCA- A TCAATAAAGCCGCTAAAACGGCCATTTTCCACCATAATGTTCGGCAGGCACGCATCACCATGGGTCACCACCAG- A TCTTCGCCATCCGGCATGCTCGCTTTCAGACGCGCAAACAGCTCTGCCGGTGCCAGGCCCTGATGTTCTTCATC- C AGATCATCCTGATCCACCAGGCCCGCTTCCATACGGGTACGCGCACGTTCAATACGATGTTTCGCCTGATGATC- A AACGGACAGGTCGCCGGGTCCAGGGTATGCAGACGACGCATGGCATCCGCCATAATGCTCACTTTTTCTGCCGG- C GCCAGATGGCTAGACAGCAGATCCTGACCCGGCACTTCGCCCAGCAGCAGCCAATCACGGCCCGCTTCGGTCAC- C ACATCCAGCACCGCCGCACACGGAACACCGGTGGTGGCCAGCCAGCTCAGACGCGCCGCTTCATCCTGCAGCTC- G TTCAGCGCACCGCTCAGATCGGTTTTCACAAACAGCACCGGACGACCCTGCGCGCTCAGACGAAACACCGCCGC- A TCAGAGCAGCCAATGGTCTGCTGCGCCCAATCATAGCCAAACAGACGTTCCACCCACGCTGCCGGGCTACCCGC- A TGCAGGCCATCCTGTTCAATCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCAT- G AGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCC- A CCTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAA- T AGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGGCCGCTACAGGGCG- C TCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGTTTCGGTGCGGGCCTCTTCGCTATTACGCCAGC- T GGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACACGCGTAATACG- A CTCACTATAG VZV gB (SEQ ID NO: 68): MFVTAVVSVSPSSFYESLQVEPTQSEDITRSAHLGDGDEIREAIHKSQDAETKPTFYVCPPP TGSTIVRLEPPRTCPDYHLGKNFTEGIAVVYKENIAAYKFKATVYYKDVIVSTAWAGSSYTQ ITNRYADRVPIPVSEITDTIDKFGKCSSKATYVRNNHKVEAFNEDKNPQDMPLIASKYNSVG SKAWHTTNDTYMVAGTPGTYRTGTSVNCIIEEVEARSIFPYDSFGLSTGDIIYMSPFFGLRD GAYREHSNYAMDRFHQFEGYRQRDLDTRALLEPAARNFLVTPHLTVGWNWKPKRTEVCSLVK WREVEDVVRDEYAHNFRFTMKTLSTTFISETNEFNLNQIHLSQCVKEEARAIINRIYTTRYN SSHVRTGDIQTYLARGGFVVVFQPLLSNSLARLYLQELVRENTNHSPQKHPTRNTRSRRSVP VELRANRTITTTSSVEFAMLQFTYDHIQEHVNEMLARISSSWCQLQNRERALWSGLFPINPS ALASTILDQRVKARILGDVISVSNCPELGSDTRIILQNSMRVSGSTTRCYSRPLISIVSLNG SGTVEGQLGTDNELIMSRDLLEPCVANHKRYFLFGHHYVYYEDYRYVREIAVHDVGMISTYV DLNLTLLKDREFMPLQVYTRDELRDTGLLDYSEIQRRNQMHSLRFYDIDKVVQYDSGTAIMQ GMAQFFQGLGTAGQAVGHVVLGATGALLSTVHGFTTFLSNPFGALAVGLLVLAGLVAAFFAY 
RYVLKLKTSPMKALYPLTTKGLKQLPEGMDPFAEKPNATDTPIEEIGDSQNTEPSVNSGFDP DKFREAQEMIKYMTLVSAAERQESKARKKNKTSALLTSRLTGLALRNRRGYSRVRTENVTGV VZV gH (SEQ ID NO: 69): MFALVLAVVILPLWTTANKSYVTPTPATRSIGHMSALLREYSDRNMSLKLEAFYPTGFDEEL IKSLHWGNDRKHVFLVIVKVNPTTHEGDVGLVIFPKYLLSPYHFKAEHRAPFPAGRFGFLSH PVTPDVSFFDSSFAPYLTTQHLVAFTTFPPNPLVWHLERAETAATAERPFGVSLLPARPTVP KNTILEHKAHFATWDALARHTFFSAEAIITNSTLRIHVPLFGSVWPIRYWATGSVLLTSDSG RVEVNIGVGFMSSLISLSSGLPIELIVVPHTVKLNAVTSDTTWFQLNPPGPDPGPSYRVYLL GRGLDMNFSKHATVDICAYPEESLDYRYHLSMAHTEALRMTTKADQHDINEESYYHIAARIA TSIFALSEMGRTTEYFLLDEIVDVQYQLKFLNYILMRIGAGAHPNTISGTSDLIFADPSQLH DELSLLFGQVKPANVDYFISYDEARDQLKTAYALSRGQDHVNALSLARRVIMSIYKGLLVKQ NLNATERQALFFASMILLNFREGLENSSRVLDGRTTLLLMTSMCTAAHATQAALNIQEGLAY LNPSKHMFTIPNVYSPCMGSLRTDLTEEIHVMNLLSAIPTRPGLNEVLHTQLDESEIFDAAF KTMMIFTTWTAKDLHILHTHVPEVFTCQDAAARNGEYVLILPAVQGHSYVITRNKPQRGLVY SLADVDVYNPISVVYLSKDTCVSEHGVIETVALPHPDNLKECLYCGSVFLRYLTTGAIMDII IIDSKDTERQLAAMGNSTIPPFNPDMHGDDSKAVLLFPNGTVVTLLGFERRQAIRMSGQYLG ASLGGAFLAVVGFGIIGWMLCGNSRLREYNKIPLT VZV gL (SEQ ID NO: 70): MASHKWLLQMIVFLKTITIAYCLHLQDDTPLFFGAKPLSDVSLIITEPCVSSVYEAWDYAAP PVSNLSEALSGIVVKTKCPVPEVILWFKDKQMAYWTNPYVTLKGLTQSVGEEHKSGDIRDAL LDALSGVWVDSTPSSTNIPENGCVWGADRLFQRVCQ VZV gI (SEQ ID NO: 71): MFLIQCLISAVIFYIQVTNALIFKGDHVSLQVNSSLTSILIPMQNDNYTEIKGQLVFIGEQL PTGTNYSGTLELLYADTVAFCFRSVQVIRYDGCPRIRTSAFISCRYKHSWHYGNSTDRISTE PDAGVMLKITKPGINDAGVYVLLVRLDHSRSTDGFILGVNVYTAGSHHNIHGVIYTSPSLQN GYSTRALFQQARLCDLPATPKGSGTSLFQHMLDLRAGKSLEDNPWLHEDVVTTETKSVVKEG IENHVYPTDMSTLPEKSLNDPPENLLIIIPIVASVMILTAMVIVIVISVKRRRIKKHPIYRP NTKTRRGIQNATPESDVMLEAAIAQLATIREESPPHSVVNPFVK VZV gE (SEQ ID NO: 72): MGTVNKPVVGVLMGFGIITGTLRITNPVRASVLRYDDFHIDEDKLDTNSVYEPYYHSDHAES SWVNRGESSRKAYDHNSPYIWPRNDYDGFLENAHEHHGVYNQGRGIDSGERLMQPTQMSAQE DLGDDTGIHVIPTLNGDDRHKIVNVDQRQYGDVFKGDLNPKPQGQRLIEVSVEENHPFTLRA PIQRIYGVRYTETWSFLPSLTCTGDAAPAIQHICLKHTTCFQDVVVDVDCAENTKEDQLAEI SYRFQGKKEADQPWIVVNTSTLFDELELDPPEIEPGVLKVLRTEKQYLGVYIWNMRGSDGTS TYATFLVTWKGDEKTRNPTPAVTPQPRGAEFHMWNYHSHVFSVGDTFSLAMHLQYKIHEAPF 
DLLLEWLYVPIDPTCQPMRLYSTCLYHPNAPQCLSHMNSGCTFTSPHLAQRVASTVYQNCEH ADNYTAYCLGISHMEPSFGLILHDGGTTLKFVDTPESLSGLYVFVVYFNGHVEAVAYTVVST VDHFVNAIEERGFPPTAGQPPATTKPKEITPVNPGTSPLLRYAAWTGGLAAVVLLCLVIFLI CTAKRMRVKAYRVDKSPYNQSMYYAGLPVDDFEDSESTDTEEEFGNAIGGSHGGSSYTVYID KTR VZV VEERep.SGPgB (SEQ ID NO: 73): 1_ ataggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcacgttgacatc gaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttgaggtagaagccaagca ggtcactgataatgaccatgctaatgccagagcgttttcgcatctggcttcaaaactgatcgaaacgg aggtggacccatccgacacgatccttgacattggaagtgcgcccgcccgcagaatgtattctaagcac aagtatcattgtatctgtccgatgagatgtgcggaagatccggacagattgtataagtatgcaactaa gctgaagaaaaactgtaaggaaataactgataaggaattggacaagaaaatgaaggagctcgccgccg tcatgagcgaccctgacctggaaactgagactatgtgcctccacgacgacgagtcgtgtcgctacgaa gggcaagtcgctgtttaccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaa
taagggagttagagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctg gagcatatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggccta tgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatttgaaacc atccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggacttactgaggagct ggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatgtcggtgtgagactatagtt agttgcgacgggtacgtcgttaaaagaatagctatcagtccaggcctgtatgggaagccttcaggcta tgctgctacgatgcaccgcgagggattcttgtgctgcaaagtgacagacacattgaacggggagaggg tctcttttcccgtgtgcacgtatgtgccagctacattgtgtgaccaaatgactggcatactggcaaca gatgtcagtgcggacgacgcgcaaaaactgctggttgggctcaaccagcgtatagtcgtcaacggtcg cacccagagaaacaccaataccatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggt gggcaaaggaatataaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtc atggggtgttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaac catcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacattggaga tcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctctcattaccgcc gaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgtgaagccgaggagttgcg cgcagctctaccacctttggcagctgatgttgaggagcccactctggaagccgatgtagacttgatgt tacaagaggctggggccggctcagtggagacacctcgtggcttgataaaggttaccagctacgatggc gaggacaagatcggctcttacgctgtgctttctccgcaggctgtactcaagagtgaaaaattatcttg catccaccctctcgctgaacaagtcatagtgataacacactctggccgaaaagggcgttatgccgtgg aaccataccatggtaaagtagtggtgccagagggacatgcaatacccgtccaggactttcaagctctg agtgaaagtgccaccattgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccac acatggaggagcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcg aatacctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctcaca ggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgaccagccgctcc ttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctggcatcattaaaagcg cagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgtgcagaaattataagggacgtc aagaaaatgaaagggctggacgtcaatgccagaactgtggactcagtgctcttgaatggatgcaaaca ccccgtagagaccctgtatattgacgaagcttttgcttgtcatgcaggtactctcagagcgctcatag 
ccattataagacctaaaaaggcagtgctctgcggggatcccaaacagtgcggtttttttaacatgatg tgcctgaaagtgcattttaaccacgagatttgcacacaagtcttccacaaaagcatctctcgccgttg cactaaatctgtgacttcggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccga aagagactaagattgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcact tgtttcagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctgc ctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcctctgtacg cacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgtgtggaaaacacta gccggcgacccatggataaaaacactgactgccaagtaccctgggaatttcactgccacgatagagga gtggcaagcagagcatgatgccatcatgaggcacatcttggagagaccggaccctaccgacgtcttcc agaataaggcaaacgtgtgttgggccaaggctttagtgccggtgctgaagaccgctggcatagacatg accactgaacaatggaacactgtggattattttgaaacggacaaagctcactcagcagagatagtatt gaaccaactatgcgtgaggttctttggactcgatctggactccggtctattttctgcacccactgttc cgttatccattaggaataatcactgggataactccccgtcgcctaacatgtacgggctgaataaagaa gtggtccgtcagctctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatga catgaacactggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgc ctcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagcaaattg aagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggttgactggttgtc agaccggcctgaggctaccttcagagctcggctggatttaggcatcccaggtgatgtgcccaaatatg acataatatttgttaatgtgaggaccccatataaataccatcactatcagcagtgtgaagaccatgcc attaagcttagcatgttgaccaagaaagcttgtctgcatctgaatcccggcggaacctgtgtcagcat aggttatggttacgctgacagggccagcgaaagcatcattggtgctatagcgcggcagttcaagtttt cccgggtatgcaaaccgaaatcctcacttgaagagacggaagttctgtttgtattcattgggtacgat cgcaaggcccgtacgcacaatccttacaagctttcatcaaccttgaccaacatttatacaggttccag actccacgaagccggatgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaag gagtgattataaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataag aaattcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgcagc taaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtgacaaacagt tggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaagtcagtagcgattcca 
ctgttgtccaccggcatcttttccgggaacaaagatcgactaacccaatcattgaaccatttgctgac agctttagacaccactgatgcagatgtagccatatactgcagggacaagaaatgggaaatgactctca aggaagcagtggctaggagagaagcagtggaggagatatgcatatccgacgactcttcagtgacagaa cctgatgcagagctggtgagggtgcatccgaagagttctttggctggaaggaagggctacagcacaag cgatggcaaaactttctcatatttggaagggaccaagtttcaccaggcggccaaggatatagcagaaa ttaatgccatgtggcccgttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagc atgagcagtattaggtcgaaatgccccgtcgaagagtcggaagcctccacaccacctagcacgctgcc ttgcttgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaaa ttactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatccaatgctcc cagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatctcgtggaaacacc accggtagacgagactccggagccatcggcagagaaccaatccacagaggggacacctgaacaaccac cacttataaccgaggatgagaccaggactagaacgcctgagccgatcatcatcgaagaggaagaagag gatagcataagtttgctgtcagatggcccgacccaccaggtgctgcaagtcgaggcagacattcacgg gccgccctctgtatctagctcatcctggtccattcctcatgcatccgactttgatgtggacagtttat ccatacttgacaccctggagggagctagcgtgaccagcggggcaacgtcagccgagactaactcttac ttcgcaaagagtatggagtttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctcc acatcccgctccgcgcacaagaacaccgtcacttgcacccagcagggcctgctcgagaaccagcctag tttccaccccgccaggcgtgaatagggtgatcactagagaggagctcgaggcgcttaccccgtcacgc actcctagcaggtcggtctcgagaaccagcctggtctccaacccgccaggcgtaaatagggtgattac aagagaggagtttgaggcgttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatctttt cctccgacaccggtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtg ttggagaggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtggagaaca tgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggcagaaggaaaagtg gagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaaccgtgccttttcaagccc caaggtcgcagtggaagcctgtaacgccatgttgaaagagaactttccgactgtggcttcttactgta ttattccagagtacgatgcctatttggacatggttgacggagcttcatgctgcttagacactgccagt ttttgccctgcaaagctgcgcagctttccaaagaaacactcctatttggaacccacaatacgatcggc 
agtgccttcagcgatccagaacacgctccagaacgtcctggcagctgccacaaaaagaaattgcaatg tcacgcaaatgagagaattgcccgtattggattcggcggcctttaatgtggaatgcttcaagaaatat gcgtgtaataatgaatattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggt aaattacattaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaata tgttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactccagga acaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctagcaacagcgta tctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgcttccgaacattcatacac tgtttgatatgtcggctgaagactttgacgctattatagccgagcacttccagcctggggattgtgtt ctggaaactgacatcgcgtcgtttgataaaagtgaggacgacgccatggctctgaccgcgttaatgat tctggaagacttaggtgtggacgcagagctgttgacgctgattgaggcggctttcggcgaaatttcat caatacatttgcccactaaaactaaatttaaattcggagccatgatgaaatctggaatgttcctcaca ctgtttgtgaacacagtcattaacattgtaatcgcaagcagagtgttgagagaacggctaaccggatc accatgtgcagcattcattggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcag acaggtgcgccacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgcct tatttctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagaccccct aaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacaggagaaggg cattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgtgcaaggcagtagaa tcaaggtatgaaaccgtaggaacttccatcatagttatggccatgactactctagctagcagtgttaa atcattcagctacctgagaggggcccctataactctctacggctaacctgaatggactacgacatagt ctagtcgagtctagtcgacgccaccatgttcgtgaccgccgtggtgtccgtgtcccccagcagctttt acgagagcctgcaggtcgagcccacccagagcgaggacatcacaagatctgcccacctgggcgacggc gacgagatcagagaggccatccacaagagccaggacgccgagacaaagcccaccttctacgtgtgccc cccacctaccggctctacaattgtgcggctggaaccccccagaacctgccctgattaccacctgggca agaacttcaccgagggaattgccgtggtgtacaaagagaatatcgccgcctacaagttcaaggccacc gtgtactacaaggacgtgatcgtgtccaccgcctgggccggcagcagctacacccagatcaccaacag atacgccgaccgggtgcccatccccgtgtctgagatcaccgacaccatcgacaagttcggcaagtgca gcagcaaggccacctacgtgcggaacaaccacaaggtggaagccttcaacgaggacaagaacccccag gacatgcccctgatcgccagcaagtacaacagcgtgggctccaaggcctggcacaccaccaacgacac 
ctacatggtggccggcacccccggcacatacagaacaggcaccagcgtgaactgcatcatcgaggaag tggaagcccggtccatcttcccatacgacagcttcggcctgagcaccggcgacattatctacatgagc cctttcttcggcctgcgggacggcgcctacagagagcacagcaactacgccatggaccggttccacca gttcgagggctacagacagcgggacctggacacaagagccctgctggaacctgccgccagaaacttcc tggtcacccctcacctgaccgtgggctggaactggaagcccaagcggaccgaagtgtgcagcctggtc aagtggcgcgaggtggaagatgtcgtgcgggatgagtacgcccacaacttccggttcaccatgaagac cctgagcaccaccttcatcagcgagacaaacgagttcaacctgaaccagatccacctgagccagtgcg tgaaagaggaagccagagccatcatcaaccggatctacaccacccggtacaacagcagccacgtgcgg accggcgatatccagacctatctggctagaggcggcttcgtggtggtgtttcagcccctgctgagcaa cagcctggctagactgtacctgcaggaactcgtcagagagaacaccaaccacagcccccagaagcacc ccacccggaataccagatccagacgcagcgtgcccgtggaactgagagccaaccggaccatcaccacc accagcagcgtggaattcgccatgctgcagttcacctacgaccacatccaggaacacgtgaacgagat gctggcccggatcagcagcagttggtgccagctgcagaatcgggaaagggccctgtggtccggcctgt tccccatcaatccaagcgccctggccagcaccatcctggaccagagagtgaaggccagaatcctgggg gacgtgatcagcgtgtccaactgtcctgagctgggcagcgacacccggatcatcctgcagaacagcat gcgggtgtccggcagcaccaccagatgctacagcagacccctgatcagcatcgtgtccctgaacggca gcggcacagtggaaggccagctgggcaccgataacgagctgatcatgagccgggacctgctcgaaccc tgcgtggccaatcacaagcggtactttctgttcggccaccactacgtgtactatgaggactacagata cgtgcgcgagatcgccgtgcacgacgtgggcatgatcagcacctacgtggacctgaacctgaccctgc tgaaggaccgcgagttcatgccactgcaggtctacacccgggacgagctgagagataccggcctgctg gactacagcgagatccagcggcggaaccagatgcactccctgcggttctacgacatcgacaaggtggt gcagtacgacagcggcaccgccatcatgcagggcatggcccagttctttcagggcctgggaacagccg gacaggccgtgggacatgtggtgctgggagctacaggcgccctgctgtctaccgtgcacggcttcacc acctttctgagcaaccccttcggagccctggctgtgggactgctggtcctggctggactggtggccgc cttctttgcctaccgctacgtgctgaagctgaaaaccagccccatgaaggccctgtaccccctgacca ccaagggcctgaagcagctgcctgagggcatggaccccttcgccgagaagcccaatgccaccgacacc cccatcgaggaaatcggcgacagccagaacaccgagccctccgtgaacagcggcttcgaccccgacaa gtttcgcgaggcccaggaaatgatcaagtacatgaccctggtgtctgctgccgagcggcaggaaagca 
aggcccggaagaagaacaagacctccgccctgctgaccagcagactgacaggactggccctgcggaac agacggggctatagcagagtgcggaccgagaatgtgaccggcgtgtaatctagacgcggccgcataca gcagcaattggcaagctgcttacatagaactcgcggcgattggcatgccgccttaaaatttttatttt atttttcttttcttttccgaatcggattttgtttttaatatttcaaaaaaaaaaaaaaaaaaaaaaaa aaaaaaaaaaagggtcggcatggcatctccacctcctcgcggtccgacctgggcatccgaaggaggac gcacgtccactcggatggctaagggagagccacgtttaaaccagctccaattcgccctatagtgagtc gtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaac ttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgc ccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcggc gggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctt tcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctcccttta gggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtag tgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggac tcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataagggattttg ccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaat attaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttattttt ctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaa aaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgcctt cctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagt gggttacatcgaactggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttc caatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagag caactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagca tcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcgg ccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggat catgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacac cacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagctt cccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggccctt ccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcggtatcattgcagc 
actggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatgg atgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaa gtttactcatatatactttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagat cctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccg tagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaa aaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaac tggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttaggccaccacttca agaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggc gataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctg aacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacctacagc gtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagg gtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgg gtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaa acgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcacatgttctttcct gcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcag ccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctc tccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagt gagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgctccc ggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgatt acgccaagcgcgcaattaaccctcactaaagggaacaaaagctgggtaccgggcccacgcgtaatacg actcactatag_13339 VZV VEERep.SGPgH (SEQ ID NO: 74): 1_ ataggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcacgttgacatc gaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttgaggtagaagccaagca ggtcactgataatgaccatgctaatgccagagcgttttcgcatctggcttcaaaactgatcgaaacgg aggtggacccatccgacacgatccttgacattggaagtgcgcccgcccgcagaatgtattctaagcac aagtatcattgtatctgtccgatgagatgtgcggaagatccggacagattgtataagtatgcaactaa gctgaagaaaaactgtaaggaaataactgataaggaattggacaagaaaatgaaggagctcgccgccg tcatgagcgaccctgacctggaaactgagactatgtgcctccacgacgacgagtcgtgtcgctacgaa gggcaagtcgctgtttaccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaa 
taagggagttagagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctg gagcatatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggccta tgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatttgaaacc atccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggacttactgaggagct ggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatgtcggtgtgagactatagtt agttgcgacgggtacgtcgttaaaagaatagctatcagtccaggcctgtatgggaagccttcaggcta tgctgctacgatgcaccgcgagggattcttgtgctgcaaagtgacagacacattgaacggggagaggg tctcttttcccgtgtgcacgtatgtgccagctacattgtgtgaccaaatgactggcatactggcaaca gatgtcagtgcggacgacgcgcaaaaactgctggttgggctcaaccagcgtatagtcgtcaacggtcg cacccagagaaacaccaataccatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggt gggcaaaggaatataaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtc atggggtgttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaac catcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacattggaga tcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctctcattaccgcc gaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgtgaagccgaggagttgcg cgcagctctaccacctttggcagctgatgttgaggagcccactctggaagccgatgtagacttgatgt tacaagaggctggggccggctcagtggagacacctcgtggcttgataaaggttaccagctacgatggc gaggacaagatcggctcttacgctgtgctttctccgcaggctgtactcaagagtgaaaaattatcttg catccaccctctcgctgaacaagtcatagtgataacacactctggccgaaaagggcgttatgccgtgg aaccataccatggtaaagtagtggtgccagagggacatgcaatacccgtccaggactttcaagctctg agtgaaagtgccaccattgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccac acatggaggagcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcg aatacctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctcaca ggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgaccagccgctcc ttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctggcatcattaaaagcg cagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgtgcagaaattataagggacgtc aagaaaatgaaagggctggacgtcaatgccagaactgtggactcagtgctcttgaatggatgcaaaca ccccgtagagaccctgtatattgacgaagcttttgcttgtcatgcaggtactctcagagcgctcatag 
ccattataagacctaaaaaggcagtgctctgcggggatcccaaacagtgcggtttttttaacatgatg tgcctgaaagtgcattttaaccacgagatttgcacacaagtcttccacaaaagcatctctcgccgttg cactaaatctgtgacttcggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccga aagagactaagattgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcact tgtttcagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctgc ctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcctctgtacg cacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgtgtggaaaacacta gccggcgacccatggataaaaacactgactgccaagtaccctgggaatttcactgccacgatagagga gtggcaagcagagcatgatgccatcatgaggcacatcttggagagaccggaccctaccgacgtcttcc agaataaggcaaacgtgtgttgggccaaggctttagtgccggtgctgaagaccgctggcatagacatg accactgaacaatggaacactgtggattattttgaaacggacaaagctcactcagcagagatagtatt gaaccaactatgcgtgaggttctttggactcgatctggactccggtctattttctgcacccactgttc cgttatccattaggaataatcactgggataactccccgtcgcctaacatgtacgggctgaataaagaa gtggtccgtcagctctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatga catgaacactggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgc ctcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagcaaattg aagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggttgactggttgtc agaccggcctgaggctaccttcagagctcggctggatttaggcatcccaggtgatgtgcccaaatatg acataatatttgttaatgtgaggaccccatataaataccatcactatcagcagtgtgaagaccatgcc attaagcttagcatgttgaccaagaaagcttgtctgcatctgaatcccggcggaacctgtgtcagcat aggttatggttacgctgacagggccagcgaaagcatcattggtgctatagcgcggcagttcaagtttt cccgggtatgcaaaccgaaatcctcacttgaagagacggaagttctgtttgtattcattgggtacgat cgcaaggcccgtacgcacaatccttacaagctttcatcaaccttgaccaacatttatacaggttccag
actccacgaagccggatgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaag gagtgattataaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataag aaattcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgcagc taaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtgacaaacagt tggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaagtcagtagcgattcca ctgttgtccaccggcatcttttccgggaacaaagatcgactaacccaatcattgaaccatttgctgac agctttagacaccactgatgcagatgtagccatatactgcagggacaagaaatgggaaatgactctca aggaagcagtggctaggagagaagcagtggaggagatatgcatatccgacgactcttcagtgacagaa cctgatgcagagctggtgagggtgcatccgaagagttctttggctggaaggaagggctacagcacaag cgatggcaaaactttctcatatttggaagggaccaagtttcaccaggcggccaaggatatagcagaaa ttaatgccatgtggcccgttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagc atgagcagtattaggtcgaaatgccccgtcgaagagtcggaagcctccacaccacctagcacgctgcc ttgcttgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaaa ttactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatccaatgctcc cagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatctcgtggaaacacc accggtagacgagactccggagccatcggcagagaaccaatccacagaggggacacctgaacaaccac cacttataaccgaggatgagaccaggactagaacgcctgagccgatcatcatcgaagaggaagaagag gatagcataagtttgctgtcagatggcccgacccaccaggtgctgcaagtcgaggcagacattcacgg gccgccctctgtatctagctcatcctggtccattcctcatgcatccgactttgatgtggacagtttat ccatacttgacaccctggagggagctagcgtgaccagcggggcaacgtcagccgagactaactcttac ttcgcaaagagtatggagtttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctcc acatcccgctccgcgcacaagaacaccgtcacttgcacccagcagggcctgctcgagaaccagcctag tttccaccccgccaggcgtgaatagggtgatcactagagaggagctcgaggcgcttaccccgtcacgc actcctagcaggtcggtctcgagaaccagcctggtctccaacccgccaggcgtaaatagggtgattac aagagaggagtttgaggcgttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatctttt cctccgacaccggtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtg ttggagaggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtggagaaca 
tgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggcagaaggaaaagtg gagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaaccgtgccttttcaagccc caaggtcgcagtggaagcctgtaacgccatgttgaaagagaactttccgactgtggcttcttactgta ttattccagagtacgatgcctatttggacatggttgacggagcttcatgctgcttagacactgccagt ttttgccctgcaaagctgcgcagctttccaaagaaacactcctatttggaacccacaatacgatcggc agtgccttcagcgatccagaacacgctccagaacgtcctggcagctgccacaaaaagaaattgcaatg tcacgcaaatgagagaattgcccgtattggattcggcggcctttaatgtggaatgcttcaagaaatat gcgtgtaataatgaatattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggt aaattacattaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaata tgttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactccagga acaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctagcaacagcgta tctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgcttccgaacattcatacac tgtttgatatgtcggctgaagactttgacgctattatagccgagcacttccagcctggggattgtgtt ctggaaactgacatcgcgtcgtttgataaaagtgaggacgacgccatggctctgaccgcgttaatgat tctggaagacttaggtgtggacgcagagctgttgacgctgattgaggcggctttcggcgaaatttcat caatacatttgcccactaaaactaaatttaaattcggagccatgatgaaatctggaatgttcctcaca ctgtttgtgaacacagtcattaacattgtaatcgcaagcagagtgttgagagaacggctaaccggatc accatgtgcagcattcattggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcag acaggtgcgccacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgcct tatttctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagaccccct aaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacaggagaaggg cattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgtgcaaggcagtagaa tcaaggtatgaaaccgtaggaacttccatcatagttatggccatgactactctagctagcagtgttaa atcattcagctacctgagaggggcccctataactctctacggctaacctgaatggactacgacatagt ctagtcgagtctagtcgacgccaccatgttcgccctggtgctggccgtggtcatcctgcctctgtgga ccaccgccaacaagagctacgtgacccccacacccgccaccagatccatcggacacatgagcgccctg ctgagagagtacagcgaccggaacatgagcctgaagctggaagccttctaccccaccggcttcgacga ggaactgatcaagagcctgcactggggcaacgaccggaagcacgtgttcctcgtgatcgtgaaagtga 
accccaccacccacgagggcgacgtcggcctggtcatcttccccaagtacctgctgagcccctaccac ttcaaggccgagcacagagcccccttccctgctggccgctttggctttctgagccaccctgtgacccc cgacgtgtcattcttcgacagcagcttcgccccctacctgaccacacagcacctggtggccttcacca ccttcccccccaatcctctcgtgtggcacctggaaagagccgagacagccgccaccgccgaaagacct tttggcgtgtccctgctgcccgccagacctaccgtgcccaagaacaccatcctggaacacaaggccca cttcgccacctgggatgccctggccagacacaccttctttagcgccgaggccatcatcaccaacagca ccctgagaatccacgtgcccctgttcggcagcgtgtggcccatcagatactgggccacaggcagcgtg ctgctgaccagcgatagcggcagagtggaagtgaacatcggcgtgggcttcatgagcagcctgatcag cctgagcagcggcctgcccatcgagctgattgtggtgccccacaccgtgaagctgaacgccgtgacca gcgacaccacctggttccagctgaacccccctggccctgatcctggccctagttacagagtgtacctg ctgggcagaggcctggacatgaacttcagcaagcacgccaccgtggacatctgcgcctaccctgagga aagcctggactacagataccacctgagcatggcccacaccgaggccctgagaatgaccaccaaggccg accagcacgacatcaacgaggaaagctactaccacattgccgccagaatcgccaccagcatcttcgcc ctgagcgagatgggccggaccaccgagtactttctgctggacgagatcgtggacgtgcagtaccagct gaagttcctgaactacatcctgatgcggatcggcgctggcgcccaccctaataccatcagcggcacca gcgacctgatcttcgccgatcctagccagctgcacgacgagctgagcctgctgttcggccaggtcaaa cccgccaacgtggactacttcatcagctacgacgaggcccgggaccagctgaaaacagcctacgccct gtccagaggccaggatcatgtgaacgccctgtccctggccaggcgcgtgatcatgagcatctacaagg gcctgctggtcaagcagaacctgaacgccaccgagcggcaggccctgttcttcgccagcatgatcctg ctgaacttcagagagggcctggaaaacagcagccgggtgctggatggcagaaccaccctgctgctgat gaccagcatgtgcacagccgcccatgccacacaggccgccctgaatatccaggaaggcctggcttacc tgaaccccagcaagcacatgttcaccatccccaacgtgtacagcccctgcatgggcagcctgagaacc gacctgaccgaagagatccacgtgatgaacctgctgtccgccatccccaccagacccggactgaatga ggtgctgcacacccagctggacgagtccgagatcttcgacgccgccttcaagaccatgatgatcttta ccacctggaccgccaaggacctgcacatcctgcacacacacgtgcccgaggtgttcacatgccaagat gccgccgctcggaacggcgagtatgtgctgattctgcctgccgtgcagggccacagctacgtgatcac ccggaacaagccccagcggggcctggtgtatagcctggctgacgtggacgtgtacaaccccatcagcg tggtgtacctgagcaaggatacctgcgtgtccgagcacggcgtgatcgaaacagtggccctgccccac 
cccgacaacctgaaagagtgcctgtactgcggctccgtgttcctgcggtatctgaccaccggcgccat catggacatcatcatcatcgacagcaaggacaccgagagacagctggccgccatgggcaacagcacca tcccccccttcaaccccgacatgcacggcgacgatagcaaggccgtgctgctgttccccaacggcacc gtggtcacactgctgggcttcgagcggagacaggccatcagaatgagcggccagtacctgggcgcctc tctgggtggtgcctttctggccgtcgtgggctttggcatcatcggctggatgctgtgcggcaacagca gactgcgcgagtacaacaagatccccctgacctaatctagacgcggccgcatacagcagcaattggca agctgcttacatagaactcgcggcgattggcatgccgccttaaaatttttattttatttttcttttct tttccgaatcggattttgtttttaatatttcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaagg gtcggcatggcatctccacctcctcgcggtccgacctgggcatccgaaggaggacgcacgtccactcg gatggctaagggagagccacgtttaaaccagctccaattcgccctatagtgagtcgtattacgcgcgc tcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgc agcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagt tgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggtt acgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttcccttcctt tctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgattta gtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggccatcgccc tgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaac tggaacaacactcaaccctatctcggtctattcttttgatttataagggattttgccgatttcggcct attggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatattaacgcttaca atttaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattca aatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtat gagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgtttttgctc acccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaa ctggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcac ttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgcc gcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggc atgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttct gacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgcc 
ttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgta gcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaatt aatagactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggt ttattgctgataaatctggagccggtgagcgtgggtctcgcggtatcattgcagcactggggccagat ggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaatag acagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatata tactttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataat ctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaa aggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctac cagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcaga gcgcagataccaaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagc accgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtc ttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcg tgcacacagcccagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagctatgaga aagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggag agcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctc tgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgc ggcctttttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctg attctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgag cgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttg gccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaa ttaatgtgagttagctcactcattaggcaccccaggctttacactttatgctcccggctcgtatgttg tgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagcgcgc aattaaccctcactaaagggaacaaaagctgggtaccgggcccacgcgtaatacgactcactatag_13258
VZV VEERep.SGPgL (SEQ ID NO: 75):
1_ ataggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcacgttgacatc gaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttgaggtagaagccaagca ggtcactgataatgaccatgctaatgccagagcgttttcgcatctggcttcaaaactgatcgaaacgg aggtggacccatccgacacgatccttgacattggaagtgcgcccgcccgcagaatgtattctaagcac 
aagtatcattgtatctgtccgatgagatgtgcggaagatccggacagattgtataagtatgcaactaa gctgaagaaaaactgtaaggaaataactgataaggaattggacaagaaaatgaaggagctcgccgccg tcatgagcgaccctgacctggaaactgagactatgtgcctccacgacgacgagtcgtgtcgctacgaa gggcaagtcgctgtttaccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaa taagggagttagagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctg gagcatatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggccta tgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatttgaaacc atccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggacttactgaggagct ggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatgtcggtgtgagactatagtt agttgcgacgggtacgtcgttaaaagaatagctatcagtccaggcctgtatgggaagccttcaggcta tgctgctacgatgcaccgcgagggattcttgtgctgcaaagtgacagacacattgaacggggagaggg tctcttttcccgtgtgcacgtatgtgccagctacattgtgtgaccaaatgactggcatactggcaaca gatgtcagtgcggacgacgcgcaaaaactgctggttgggctcaaccagcgtatagtcgtcaacggtcg cacccagagaaacaccaataccatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggt gggcaaaggaatataaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtc atggggtgttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaac catcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacattggaga tcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctctcattaccgcc gaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgtgaagccgaggagttgcg cgcagctctaccacctttggcagctgatgttgaggagcccactctggaagccgatgtagacttgatgt tacaagaggctggggccggctcagtggagacacctcgtggcttgataaaggttaccagctacgatggc gaggacaagatcggctcttacgctgtgctttctccgcaggctgtactcaagagtgaaaaattatcttg catccaccctctcgctgaacaagtcatagtgataacacactctggccgaaaagggcgttatgccgtgg aaccataccatggtaaagtagtggtgccagagggacatgcaatacccgtccaggactttcaagctctg agtgaaagtgccaccattgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccac acatggaggagcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcg aatacctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctcaca ggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgaccagccgctcc 
ttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctggcatcattaaaagcg cagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgtgcagaaattataagggacgtc aagaaaatgaaagggctggacgtcaatgccagaactgtggactcagtgctcttgaatggatgcaaaca ccccgtagagaccctgtatattgacgaagcttttgcttgtcatgcaggtactctcagagcgctcatag ccattataagacctaaaaaggcagtgctctgcggggatcccaaacagtgcggtttttttaacatgatg tgcctgaaagtgcattttaaccacgagatttgcacacaagtcttccacaaaagcatctctcgccgttg cactaaatctgtgacttcggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccga aagagactaagattgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcact tgtttcagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctgc ctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcctctgtacg cacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgtgtggaaaacacta gccggcgacccatggataaaaacactgactgccaagtaccctgggaatttcactgccacgatagagga gtggcaagcagagcatgatgccatcatgaggcacatcttggagagaccggaccctaccgacgtcttcc agaataaggcaaacgtgtgttgggccaaggctttagtgccggtgctgaagaccgctggcatagacatg accactgaacaatggaacactgtggattattttgaaacggacaaagctcactcagcagagatagtatt gaaccaactatgcgtgaggttctttggactcgatctggactccggtctattttctgcacccactgttc cgttatccattaggaataatcactgggataactccccgtcgcctaacatgtacgggctgaataaagaa gtggtccgtcagctctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatga catgaacactggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgc ctcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagcaaattg aagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggttgactggttgtc agaccggcctgaggctaccttcagagctcggctggatttaggcatcccaggtgatgtgcccaaatatg acataatatttgttaatgtgaggaccccatataaataccatcactatcagcagtgtgaagaccatgcc attaagcttagcatgttgaccaagaaagcttgtctgcatctgaatcccggcggaacctgtgtcagcat aggttatggttacgctgacagggccagcgaaagcatcattggtgctatagcgcggcagttcaagtttt cccgggtatgcaaaccgaaatcctcacttgaagagacggaagttctgtttgtattcattgggtacgat cgcaaggcccgtacgcacaatccttacaagctttcatcaaccttgaccaacatttatacaggttccag actccacgaagccggatgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaag 
gagtgattataaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataag aaattcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgcagc taaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtgacaaacagt tggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaagtcagtagcgattcca ctgttgtccaccggcatcttttccgggaacaaagatcgactaacccaatcattgaaccatttgctgac agctttagacaccactgatgcagatgtagccatatactgcagggacaagaaatgggaaatgactctca aggaagcagtggctaggagagaagcagtggaggagatatgcatatccgacgactcttcagtgacagaa cctgatgcagagctggtgagggtgcatccgaagagttctttggctggaaggaagggctacagcacaag cgatggcaaaactttctcatatttggaagggaccaagtttcaccaggcggccaaggatatagcagaaa ttaatgccatgtggcccgttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagc atgagcagtattaggtcgaaatgccccgtcgaagagtcggaagcctccacaccacctagcacgctgcc ttgcttgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaaa ttactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatccaatgctcc cagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatctcgtggaaacacc accggtagacgagactccggagccatcggcagagaaccaatccacagaggggacacctgaacaaccac cacttataaccgaggatgagaccaggactagaacgcctgagccgatcatcatcgaagaggaagaagag gatagcataagtttgctgtcagatggcccgacccaccaggtgctgcaagtcgaggcagacattcacgg gccgccctctgtatctagctcatcctggtccattcctcatgcatccgactttgatgtggacagtttat ccatacttgacaccctggagggagctagcgtgaccagcggggcaacgtcagccgagactaactcttac ttcgcaaagagtatggagtttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctcc acatcccgctccgcgcacaagaacaccgtcacttgcacccagcagggcctgctcgagaaccagcctag tttccaccccgccaggcgtgaatagggtgatcactagagaggagctcgaggcgcttaccccgtcacgc actcctagcaggtcggtctcgagaaccagcctggtctccaacccgccaggcgtaaatagggtgattac aagagaggagtttgaggcgttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatctttt cctccgacaccggtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtg ttggagaggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtggagaaca tgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggcagaaggaaaagtg 
gagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaaccgtgccttttcaagccc caaggtcgcagtggaagcctgtaacgccatgttgaaagagaactttccgactgtggcttcttactgta ttattccagagtacgatgcctatttggacatggttgacggagcttcatgctgcttagacactgccagt ttttgccctgcaaagctgcgcagctttccaaagaaacactcctatttggaacccacaatacgatcggc agtgccttcagcgatccagaacacgctccagaacgtcctggcagctgccacaaaaagaaattgcaatg tcacgcaaatgagagaattgcccgtattggattcggcggcctttaatgtggaatgcttcaagaaatat gcgtgtaataatgaatattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggt aaattacattaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaata tgttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactccagga acaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctagcaacagcgta tctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgcttccgaacattcatacac tgtttgatatgtcggctgaagactttgacgctattatagccgagcacttccagcctggggattgtgtt ctggaaactgacatcgcgtcgtttgataaaagtgaggacgacgccatggctctgaccgcgttaatgat tctggaagacttaggtgtggacgcagagctgttgacgctgattgaggcggctttcggcgaaatttcat caatacatttgcccactaaaactaaatttaaattcggagccatgatgaaatctggaatgttcctcaca ctgtttgtgaacacagtcattaacattgtaatcgcaagcagagtgttgagagaacggctaaccggatc accatgtgcagcattcattggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcag acaggtgcgccacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgcct tatttctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagaccccct aaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacaggagaaggg cattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgtgcaaggcagtagaa tcaaggtatgaaaccgtaggaacttccatcatagttatggccatgactactctagctagcagtgttaa atcattcagctacctgagaggggcccctataactctctacggctaacctgaatggactacgacatagt
ctagtcgagtctagtcgacgccaccatggccagccacaagtggctgctgcagatgatcgtgttcctga aaaccatcacaatcgcctactgcctgcatctgcaggacgacacccctctgttcttcggcgccaagcct ctgagcgacgtgtccctgatcatcaccgagccttgcgtgtccagcgtgtacgaggcctgggattatgc cgcccctcccgtgtccaatctgagcgaagccctgagcggcatcgtggtcaagaccaagtgccccgtgc ccgaagtgatcctgtggttcaaggacaagcagatggcctactggaccaacccttacgtgaccctgaag ggcctgacccagagcgtgggcgaggaacacaagagcggcgacatcagagatgccctgctggatgccct gtccggtgtctgggtggacagcacaccctccagcaccaacatccccgagaacggctgtgtgtggggag ccgaccggctgttccagagagtgtgtcagtaatctagacgcggccgcatacagcagcaattggcaagc tgcttacatagaactcgcggcgattggcatgccgccttaaaatttttattttatttttcttttctttt ccgaatcggattttgtttttaatatttcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaagggtc ggcatggcatctccacctcctcgcggtccgacctgggcatccgaaggaggacgcacgtccactcggat ggctaagggagagccacgtttaaaccagctccaattcgccctatagtgagtcgtattacgcgcgctca ctggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagc acatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgc gcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacg cgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttcccttcctttct cgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtg ctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggccatcgccctga tagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactgg aacaacactcaaccctatctcggtctattcttttgatttataagggattttgccgatttcggcctatt ggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatattaacgcttacaatt taggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaat atgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgag tattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgtttttgctcacc cagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactg gatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttt taaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgca tacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatg 
acagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgac aacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttg atcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagca atggcaacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaat agactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggttta ttgctgataaatctggagccggtgagcgtgggtctcgcggtatcattgcagcactggggccagatggt aagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaatagaca gatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatac tttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctc atgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaagg atcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccag cggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcg cagataccaaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcacc gcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtctta ccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgc acacagcccagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagctatgagaaag cgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagc gcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctga cttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggc ctttttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctgatt ctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgc agcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggcc gattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaatta atgtgagttagctcactcattaggcaccccaggctttacactttatgctcccggctcgtatgttgtgt ggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagcgcgcaat taaccctcactaaagggaacaaaagctgggtaccgggcccacgcgtaatacgactcactatag_11215
VZV VEERep.SGPgH-SGPgL (SEQ ID NO: 76):
1_ ataggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcacgttgacatc gaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttgaggtagaagccaagca 
ggtcactgataatgaccatgctaatgccagagcgttttcgcatctggcttcaaaactgatcgaaacgg aggtggacccatccgacacgatccttgacattggaagtgcgcccgcccgcagaatgtattctaagcac aagtatcattgtatctgtccgatgagatgtgcggaagatccggacagattgtataagtatgcaactaa gctgaagaaaaactgtaaggaaataactgataaggaattggacaagaaaatgaaggagctcgccgccg tcatgagcgaccctgacctggaaactgagactatgtgcctccacgacgacgagtcgtgtcgctacgaa gggcaagtcgctgtttaccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaa taagggagttagagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctg gagcatatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggccta tgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatttgaaacc atccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggacttactgaggagct ggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatgtcggtgtgagactatagtt agttgcgacgggtacgtcgttaaaagaatagctatcagtccaggcctgtatgggaagccttcaggcta tgctgctacgatgcaccgcgagggattcttgtgctgcaaagtgacagacacattgaacggggagaggg tctcttttcccgtgtgcacgtatgtgccagctacattgtgtgaccaaatgactggcatactggcaaca gatgtcagtgcggacgacgcgcaaaaactgctggttgggctcaaccagcgtatagtcgtcaacggtcg cacccagagaaacaccaataccatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggt gggcaaaggaatataaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtc atggggtgttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaac catcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacattggaga tcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctctcattaccgcc gaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgtgaagccgaggagttgcg cgcagctctaccacctttggcagctgatgttgaggagcccactctggaagccgatgtagacttgatgt tacaagaggctggggccggctcagtggagacacctcgtggcttgataaaggttaccagctacgatggc gaggacaagatcggctcttacgctgtgctttctccgcaggctgtactcaagagtgaaaaattatcttg catccaccctctcgctgaacaagtcatagtgataacacactctggccgaaaagggcgttatgccgtgg aaccataccatggtaaagtagtggtgccagagggacatgcaatacccgtccaggactttcaagctctg agtgaaagtgccaccattgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccac acatggaggagcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcg 
aatacctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctcaca ggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgaccagccgctcc ttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctggcatcattaaaagcg cagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgtgcagaaattataagggacgtc aagaaaatgaaagggctggacgtcaatgccagaactgtggactcagtgctcttgaatggatgcaaaca ccccgtagagaccctgtatattgacgaagcttttgcttgtcatgcaggtactctcagagcgctcatag ccattataagacctaaaaaggcagtgctctgcggggatcccaaacagtgcggtttttttaacatgatg tgcctgaaagtgcattttaaccacgagatttgcacacaagtcttccacaaaagcatctctcgccgttg cactaaatctgtgacttcggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccga aagagactaagattgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcact tgtttcagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctgc ctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcctctgtacg cacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgtgtggaaaacacta gccggcgacccatggataaaaacactgactgccaagtaccctgggaatttcactgccacgatagagga gtggcaagcagagcatgatgccatcatgaggcacatcttggagagaccggaccctaccgacgtcttcc agaataaggcaaacgtgtgttgggccaaggctttagtgccggtgctgaagaccgctggcatagacatg accactgaacaatggaacactgtggattattttgaaacggacaaagctcactcagcagagatagtatt gaaccaactatgcgtgaggttctttggactcgatctggactccggtctattttctgcacccactgttc cgttatccattaggaataatcactgggataactccccgtcgcctaacatgtacgggctgaataaagaa gtggtccgtcagctctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatga catgaacactggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgc ctcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagcaaattg aagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggttgactggttgtc agaccggcctgaggctaccttcagagctcggctggatttaggcatcccaggtgatgtgcccaaatatg acataatatttgttaatgtgaggaccccatataaataccatcactatcagcagtgtgaagaccatgcc attaagcttagcatgttgaccaagaaagcttgtctgcatctgaatcccggcggaacctgtgtcagcat aggttatggttacgctgacagggccagcgaaagcatcattggtgctatagcgcggcagttcaagtttt cccgggtatgcaaaccgaaatcctcacttgaagagacggaagttctgtttgtattcattgggtacgat 
cgcaaggcccgtacgcacaatccttacaagctttcatcaaccttgaccaacatttatacaggttccag actccacgaagccggatgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaag gagtgattataaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataag aaattcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgcagc taaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtgacaaacagt tggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaagtcagtagcgattcca ctgttgtccaccggcatcttttccgggaacaaagatcgactaacccaatcattgaaccatttgctgac agctttagacaccactgatgcagatgtagccatatactgcagggacaagaaatgggaaatgactctca aggaagcagtggctaggagagaagcagtggaggagatatgcatatccgacgactcttcagtgacagaa cctgatgcagagctggtgagggtgcatccgaagagttctttggctggaaggaagggctacagcacaag cgatggcaaaactttctcatatttggaagggaccaagtttcaccaggcggccaaggatatagcagaaa ttaatgccatgtggcccgttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagc atgagcagtattaggtcgaaatgccccgtcgaagagtcggaagcctccacaccacctagcacgctgcc ttgcttgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaaa ttactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatccaatgctcc cagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatctcgtggaaacacc accggtagacgagactccggagccatcggcagagaaccaatccacagaggggacacctgaacaaccac cacttataaccgaggatgagaccaggactagaacgcctgagccgatcatcatcgaagaggaagaagag gatagcataagtttgctgtcagatggcccgacccaccaggtgctgcaagtcgaggcagacattcacgg gccgccctctgtatctagctcatcctggtccattcctcatgcatccgactttgatgtggacagtttat ccatacttgacaccctggagggagctagcgtgaccagcggggcaacgtcagccgagactaactcttac ttcgcaaagagtatggagtttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctcc acatcccgctccgcgcacaagaacaccgtcacttgcacccagcagggcctgctcgagaaccagcctag tttccaccccgccaggcgtgaatagggtgatcactagagaggagctcgaggcgcttaccccgtcacgc actcctagcaggtcggtctcgagaaccagcctggtctccaacccgccaggcgtaaatagggtgattac aagagaggagtttgaggcgttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatctttt cctccgacaccggtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtg ttggagaggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg 
caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtggagaaca tgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggcagaaggaaaagtg gagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaaccgtgccttttcaagccc caaggtcgcagtggaagcctgtaacgccatgttgaaagagaactttccgactgtggcttcttactgta ttattccagagtacgatgcctatttggacatggttgacggagcttcatgctgcttagacactgccagt ttttgccctgcaaagctgcgcagctttccaaagaaacactcctatttggaacccacaatacgatcggc agtgccttcagcgatccagaacacgctccagaacgtcctggcagctgccacaaaaagaaattgcaatg tcacgcaaatgagagaattgcccgtattggattcggcggcctttaatgtggaatgcttcaagaaatat gcgtgtaataatgaatattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggt aaattacattaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaata tgttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactccagga acaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctagcaacagcgta tctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgcttccgaacattcatacac tgtttgatatgtcggctgaagactttgacgctattatagccgagcacttccagcctggggattgtgtt ctggaaactgacatcgcgtcgtttgataaaagtgaggacgacgccatggctctgaccgcgttaatgat tctggaagacttaggtgtggacgcagagctgttgacgctgattgaggcggctttcggcgaaatttcat caatacatttgcccactaaaactaaatttaaattcggagccatgatgaaatctggaatgttcctcaca ctgtttgtgaacacagtcattaacattgtaatcgcaagcagagtgttgagagaacggctaaccggatc accatgtgcagcattcattggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcag acaggtgcgccacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgcct tatttctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagaccccct aaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacaggagaaggg cattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgtgcaaggcagtagaa tcaaggtatgaaaccgtaggaacttccatcatagttatggccatgactactctagctagcagtgttaa atcattcagctacctgagaggggcccctataactctctacggctaacctgaatggactacgacatagt ctagtcgagtctagtcgacgccaccatgttcgccctggtgctggccgtggtcatcctgcctctgtgga ccaccgccaacaagagctacgtgacccccacacccgccaccagatccatcggacacatgagcgccctg ctgagagagtacagcgaccggaacatgagcctgaagctggaagccttctaccccaccggcttcgacga 
ggaactgatcaagagcctgcactggggcaacgaccggaagcacgtgttcctcgtgatcgtgaaagtga accccaccacccacgagggcgacgtcggcctggtcatcttccccaagtacctgctgagcccctaccac ttcaaggccgagcacagagcccccttccctgctggccgctttggctttctgagccaccctgtgacccc cgacgtgtcattcttcgacagcagcttcgccccctacctgaccacacagcacctggtggccttcacca ccttcccccccaatcctctcgtgtggcacctggaaagagccgagacagccgccaccgccgaaagacct tttggcgtgtccctgctgcccgccagacctaccgtgcccaagaacaccatcctggaacacaaggccca cttcgccacctgggatgccctggccagacacaccttctttagcgccgaggccatcatcaccaacagca ccctgagaatccacgtgcccctgttcggcagcgtgtggcccatcagatactgggccacaggcagcgtg ctgctgaccagcgatagcggcagagtggaagtgaacatcggcgtgggcttcatgagcagcctgatcag cctgagcagcggcctgcccatcgagctgattgtggtgccccacaccgtgaagctgaacgccgtgacca gcgacaccacctggttccagctgaacccccctggccctgatcctggccctagttacagagtgtacctg ctgggcagaggcctggacatgaacttcagcaagcacgccaccgtggacatctgcgcctaccctgagga aagcctggactacagataccacctgagcatggcccacaccgaggccctgagaatgaccaccaaggccg accagcacgacatcaacgaggaaagctactaccacattgccgccagaatcgccaccagcatcttcgcc ctgagcgagatgggccggaccaccgagtactttctgctggacgagatcgtggacgtgcagtaccagct gaagttcctgaactacatcctgatgcggatcggcgctggcgcccaccctaataccatcagcggcacca gcgacctgatcttcgccgatcctagccagctgcacgacgagctgagcctgctgttcggccaggtcaaa cccgccaacgtggactacttcatcagctacgacgaggcccgggaccagctgaaaacagcctacgccct gtccagaggccaggatcatgtgaacgccctgtccctggccaggcgcgtgatcatgagcatctacaagg gcctgctggtcaagcagaacctgaacgccaccgagcggcaggccctgttcttcgccagcatgatcctg ctgaacttcagagagggcctggaaaacagcagccgggtgctggatggcagaaccaccctgctgctgat gaccagcatgtgcacagccgcccatgccacacaggccgccctgaatatccaggaaggcctggcttacc tgaaccccagcaagcacatgttcaccatccccaacgtgtacagcccctgcatgggcagcctgagaacc gacctgaccgaagagatccacgtgatgaacctgctgtccgccatccccaccagacccggactgaatga ggtgctgcacacccagctggacgagtccgagatcttcgacgccgccttcaagaccatgatgatcttta ccacctggaccgccaaggacctgcacatcctgcacacacacgtgcccgaggtgttcacatgccaagat gccgccgctcggaacggcgagtatgtgctgattctgcctgccgtgcagggccacagctacgtgatcac ccggaacaagccccagcggggcctggtgtatagcctggctgacgtggacgtgtacaaccccatcagcg 
tggtgtacctgagcaaggatacctgcgtgtccgagcacggcgtgatcgaaacagtggccctgccccac cccgacaacctgaaagagtgcctgtactgcggctccgtgttcctgcggtatctgaccaccggcgccat catggacatcatcatcatcgacagcaaggacaccgagagacagctggccgccatgggcaacagcacca tcccccccttcaaccccgacatgcacggcgacgatagcaaggccgtgctgctgttccccaacggcacc gtggtcacactgctgggcttcgagcggagacaggccatcagaatgagcggccagtacctgggcgcctc tctgggtggtgcctttctggccgtcgtgggctttggcatcatcggctggatgctgtgcggcaacagca gactgcgcgagtacaacaagatccccctgacctaatctagacgtcgcgaccacccaggatccgcctat aactctctacggctaacctgaatggactacgacatagtctagtcgacgccaccatggccagccacaag tggctgctgcagatgatcgtgttcctgaaaaccatcacaatcgcctactgcctgcatctgcaggacga cacccctctgttcttcggcgccaagcctctgagcgacgtgtccctgatcatcaccgagccttgcgtgt ccagcgtgtacgaggcctgggattatgccgcccctcccgtgtccaatctgagcgaagccctgagcggc atcgtggtcaagaccaagtgccccgtgcccgaagtgatcctgtggttcaaggacaagcagatggccta ctggaccaacccttacgtgaccctgaagggcctgacccagagcgtgggcgaggaacacaagagcggcg acatcagagatgccctgctggatgccctgtccggtgtctgggtggacagcacaccctccagcaccaac atccccgagaacggctgtgtgtggggagccgaccggctgttccagagagtgtgtcagtaatctagacg cggccgcatacagcagcaattggcaagctgcttacatagaactcgcggcgattggcatgccgccttaa aatttttattttatttttcttttcttttccgaatcggattttgtttttaatatttcaaaaaaaaaaaa aaaaaaaaaaaaaaaaaaaaaaagggtcggcatggcatctccacctcctcgcggtccgacctgggcat ccgaaggaggacgcacgtccactcggatggctaagggagagccacgtttaaaccagctccaattcgcc ctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctg gcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcc cgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgc attaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccg ctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgg gggctccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtga tggttcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttct ttaatagtggactcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgattta taagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaa 
ttttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacccctat ttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttc aataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcg gcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagtt gggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgagagttttcgccccg aagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgac gccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagt cacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtg ataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcac aacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacga cgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactac ttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctg cgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcgg tatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtc aggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaa ctgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaaaggat ctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgag cgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgc ttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttt tccgaaggtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttag gccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggct gctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgca gcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactga
gatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccg gtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatcttta tagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcgga gcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcac atgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgatac cgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatac gcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactgg aaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttaca ctttatgctcccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagct atgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctgggtaccgggccc acgcgtaatacgactcactatag_13827
VZV VEERep.SGPgE (SEQ ID NO: 77):
1_ ataggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcacgttgacatc gaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttgaggtagaagccaagca ggtcactgataatgaccatgctaatgccagagcgttttcgcatctggcttcaaaactgatcgaaacgg aggtggacccatccgacacgatccttgacattggaagtgcgcccgcccgcagaatgtattctaagcac aagtatcattgtatctgtccgatgagatgtgcggaagatccggacagattgtataagtatgcaactaa gctgaagaaaaactgtaaggaaataactgataaggaattggacaagaaaatgaaggagctcgccgccg tcatgagcgaccctgacctggaaactgagactatgtgcctccacgacgacgagtcgtgtcgctacgaa gggcaagtcgctgtttaccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaa taagggagttagagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctg gagcatatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggccta tgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatttgaaacc atccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggacttactgaggagct ggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatgtcggtgtgagactatagtt agttgcgacgggtacgtcgttaaaagaatagctatcagtccaggcctgtatgggaagccttcaggcta tgctgctacgatgcaccgcgagggattcttgtgctgcaaagtgacagacacattgaacggggagaggg tctcttttcccgtgtgcacgtatgtgccagctacattgtgtgaccaaatgactggcatactggcaaca gatgtcagtgcggacgacgcgcaaaaactgctggttgggctcaaccagcgtatagtcgtcaacggtcg cacccagagaaacaccaataccatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggt 
gggcaaaggaatataaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtc atggggtgttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaac catcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacattggaga tcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctctcattaccgcc gaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgtgaagccgaggagttgcg cgcagctctaccacctttggcagctgatgttgaggagcccactctggaagccgatgtagacttgatgt tacaagaggctggggccggctcagtggagacacctcgtggcttgataaaggttaccagctacgatggc gaggacaagatcggctcttacgctgtgctttctccgcaggctgtactcaagagtgaaaaattatcttg catccaccctctcgctgaacaagtcatagtgataacacactctggccgaaaagggcgttatgccgtgg aaccataccatggtaaagtagtggtgccagagggacatgcaatacccgtccaggactttcaagctctg agtgaaagtgccaccattgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccac acatggaggagcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcg aatacctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctcaca ggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgaccagccgctcc ttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctggcatcattaaaagcg cagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgtgcagaaattataagggacgtc aagaaaatgaaagggctggacgtcaatgccagaactgtggactcagtgctcttgaatggatgcaaaca ccccgtagagaccctgtatattgacgaagcttttgcttgtcatgcaggtactctcagagcgctcatag ccattataagacctaaaaaggcagtgctctgcggggatcccaaacagtgcggtttttttaacatgatg tgcctgaaagtgcattttaaccacgagatttgcacacaagtcttccacaaaagcatctctcgccgttg cactaaatctgtgacttcggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccga aagagactaagattgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcact tgtttcagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctgc ctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcctctgtacg cacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgtgtggaaaacacta gccggcgacccatggataaaaacactgactgccaagtaccctgggaatttcactgccacgatagagga gtggcaagcagagcatgatgccatcatgaggcacatcttggagagaccggaccctaccgacgtcttcc agaataaggcaaacgtgtgttgggccaaggctttagtgccggtgctgaagaccgctggcatagacatg 
accactgaacaatggaacactgtggattattttgaaacggacaaagctcactcagcagagatagtatt gaaccaactatgcgtgaggttctttggactcgatctggactccggtctattttctgcacccactgttc cgttatccattaggaataatcactgggataactccccgtcgcctaacatgtacgggctgaataaagaa gtggtccgtcagctctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatga catgaacactggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgc ctcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagcaaattg aagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggttgactggttgtc agaccggcctgaggctaccttcagagctcggctggatttaggcatcccaggtgatgtgcccaaatatg acataatatttgttaatgtgaggaccccatataaataccatcactatcagcagtgtgaagaccatgcc attaagcttagcatgttgaccaagaaagcttgtctgcatctgaatcccggcggaacctgtgtcagcat aggttatggttacgctgacagggccagcgaaagcatcattggtgctatagcgcggcagttcaagtttt cccgggtatgcaaaccgaaatcctcacttgaagagacggaagttctgtttgtattcattgggtacgat cgcaaggcccgtacgcacaatccttacaagctttcatcaaccttgaccaacatttatacaggttccag actccacgaagccggatgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaag gagtgattataaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataag aaattcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgcagc taaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtgacaaacagt tggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaagtcagtagcgattcca ctgttgtccaccggcatcttttccgggaacaaagatcgactaacccaatcattgaaccatttgctgac agctttagacaccactgatgcagatgtagccatatactgcagggacaagaaatgggaaatgactctca aggaagcagtggctaggagagaagcagtggaggagatatgcatatccgacgactcttcagtgacagaa cctgatgcagagctggtgagggtgcatccgaagagttctttggctggaaggaagggctacagcacaag cgatggcaaaactttctcatatttggaagggaccaagtttcaccaggcggccaaggatatagcagaaa ttaatgccatgtggcccgttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagc atgagcagtattaggtcgaaatgccccgtcgaagagtcggaagcctccacaccacctagcacgctgcc ttgcttgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaaa ttactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatccaatgctcc cagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatctcgtggaaacacc 
accggtagacgagactccggagccatcggcagagaaccaatccacagaggggacacctgaacaaccac cacttataaccgaggatgagaccaggactagaacgcctgagccgatcatcatcgaagaggaagaagag gatagcataagtttgctgtcagatggcccgacccaccaggtgctgcaagtcgaggcagacattcacgg gccgccctctgtatctagctcatcctggtccattcctcatgcatccgactttgatgtggacagtttat ccatacttgacaccctggagggagctagcgtgaccagcggggcaacgtcagccgagactaactcttac ttcgcaaagagtatggagtttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctcc acatcccgctccgcgcacaagaacaccgtcacttgcacccagcagggcctgctcgagaaccagcctag tttccaccccgccaggcgtgaatagggtgatcactagagaggagctcgaggcgcttaccccgtcacgc actcctagcaggtcggtctcgagaaccagcctggtctccaacccgccaggcgtaaatagggtgattac aagagaggagtttgaggcgttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatctttt cctccgacaccggtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtg ttggagaggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtggagaaca tgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggcagaaggaaaagtg gagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaaccgtgccttttcaagccc caaggtcgcagtggaagcctgtaacgccatgttgaaagagaactttccgactgtggcttcttactgta ttattccagagtacgatgcctatttggacatggttgacggagcttcatgctgcttagacactgccagt ttttgccctgcaaagctgcgcagctttccaaagaaacactcctatttggaacccacaatacgatcggc agtgccttcagcgatccagaacacgctccagaacgtcctggcagctgccacaaaaagaaattgcaatg tcacgcaaatgagagaattgcccgtattggattcggcggcctttaatgtggaatgcttcaagaaatat gcgtgtaataatgaatattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggt aaattacattaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaata tgttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactccagga acaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctagcaacagcgta tctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgcttccgaacattcatacac tgtttgatatgtcggctgaagactttgacgctattatagccgagcacttccagcctggggattgtgtt ctggaaactgacatcgcgtcgtttgataaaagtgaggacgacgccatggctctgaccgcgttaatgat tctggaagacttaggtgtggacgcagagctgttgacgctgattgaggcggctttcggcgaaatttcat 
caatacatttgcccactaaaactaaatttaaattcggagccatgatgaaatctggaatgttcctcaca ctgtttgtgaacacagtcattaacattgtaatcgcaagcagagtgttgagagaacggctaaccggatc accatgtgcagcattcattggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcag acaggtgcgccacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgcct tatttctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagaccccct aaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacaggagaaggg cattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgtgcaaggcagtagaa tcaaggtatgaaaccgtaggaacttccatcatagttatggccatgactactctagctagcagtgttaa atcattcagctacctgagaggggcccctataactctctacggctaacctgaatggactacgacatagt ctagtcgagtctagtcgacgccaccatgggcaccgtgaacaagcctgtcgtgggcgtgctgatgggct tcggcatcatcaccggcaccctgagaatcaccaaccctgtgcgggccagcgtgctgagatacgacgac ttccacatcgacgaggacaagctggacaccaacagcgtgtacgagccctactaccacagcgaccacgc cgagagcagctgggtcaacagaggcgagagcagccggaaggcctacgaccacaacagcccctacatct ggccccggaacgactacgacggcttcctggaaaacgcccacgagcaccacggcgtgtacaatcagggc agaggcatcgacagcggcgagagactgatgcagcccacacagatgagcgcccaggaagatctgggcga cgacacaggcatccacgtgatccccaccctgaacggcgacgaccggcacaagatcgtgaacgtggacc agcggcagtacggcgacgtgttcaagggcgacctgaaccctaagccccagggccagagactgatcgag gtgtccgtggaagagaaccaccccttcaccctgagagcccccatccagagaatctacggcgtgcggta taccgagacttggagcttcctgcccagcctgacctgtacaggcgacgccgctcctgccatccagcaca tctgcctgaagcacaccacctgtttccaggacgtggtggtggacgtggactgcgccgagaacaccaaa gaggaccagctggccgagatcagctaccggttccagggcaagaaagaggccgaccagccctggatcgt ggtcaataccagcaccctgttcgacgagctggaactggacccccccgagattgaacccggcgtgctga aggtgctgcggaccgagaagcagtacctgggcgtgtacatctggaacatgcggggctccgacggcacc tctacctacgccaccttcctggtcacatggaagggcgacgagaaaacccggaaccctacccctgccgt gacccctcagcctagaggcgccgagttccatatgtggaattaccactcccacgtgttcagcgtgggcg acaccttcagcctggccatgcatctgcagtacaagatccacgaggcccccttcgacctgctgctggaa tggctgtacgtgcccatcgaccctacctgccagcccatgcggctgtacagcacctgtctgtaccaccc caacgcccctcagtgcctgagccacatgaacagcggctgcaccttcaccagccctcacctggctcaga 
gggtggccagcaccgtgtaccagaattgcgagcacgccgacaactacaccgcctactgcctgggcatc agccacatggaacccagcttcggcctgatcctgcacgatggcggcaccaccctgaagttcgtggacac acccgagagcctgagcggcctgtacgtgttcgtggtgtacttcaacggccacgtggaagccgtggcct acaccgtggtgtccaccgtggaccacttcgtgaacgccatcgaggaaagaggcttcccacccacagcc ggacagcctccagccaccaccaagcccaaagaaatcacccccgtgaaccccggcaccagccccctgct gagatatgctgcttggacaggcggactggccgctgtggtgctgctgtgcctggtcatcttcctgatct gcaccgccaagcggatgagagtgaaggcctaccgggtggacaagtccccctacaaccagagcatgtac tacgccggcctgcccgtggacgatttcgaggatagcgagagcaccgacaccgaggaagagttcggcaa cgccatcggcggatctcacggcggcagcagctacaccgtgtacatcgacaagaccagataatctagac gcggccgcatacagcagcaattggcaagctgcttacatagaactcgcggcgattggcatgccgcctta aaatttttattttatttttcttttcttttccgaatcggattttgtttttaatatttcaaaaaaaaaaa aaaaaaaaaaaaaaaaaaaaaaaagggtcggcatggcatctccacctcctcgcggtccgacctgggca tccgaaggaggacgcacgtccactcggatggctaagggagagccacgtttaaaccagctccaattcgc cctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccct ggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggc ccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcg cattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgccc gctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcg ggggctccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtg atggttcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttc tttaatagtggactcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgattt ataagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcga attttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaaccccta tttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgctt caataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgc ggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagt tgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgagagttttcgcccc gaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattga 
cgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccag tcacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagt gataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgca caacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacg acgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaacta cttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttct gcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcg gtatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagt caggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggta actgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaaagga tctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactga gcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctg cttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactcttt ttccgaaggtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagtta ggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggc tgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgc agcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactg agatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatcc ggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatcttt atagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcgg agcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctca catgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgata ccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaata cgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactg gaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttac actttatgctcccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagc tatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctgggtaccgggcc cacgcgtaatacgactcactatag_12604 VZV VEERep.SGPgI (SEQ ID NO: 78) 1 ataggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcacgttgacatc 
gaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttgaggtagaagccaagca ggtcactgataatgaccatgctaatgccagagcgttttcgcatctggcttcaaaactgatcgaaacgg aggtggacccatccgacacgatccttgacattggaagtgcgcccgcccgcagaatgtattctaagcac aagtatcattgtatctgtccgatgagatgtgcggaagatccggacagattgtataagtatgcaactaa gctgaagaaaaactgtaaggaaataactgataaggaattggacaagaaaatgaaggagctcgccgccg tcatgagcgaccctgacctggaaactgagactatgtgcctccacgacgacgagtcgtgtcgctacgaa gggcaagtcgctgtttaccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaa taagggagttagagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctg gagcatatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggccta tgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatttgaaacc atccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggacttactgaggagct ggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatgtcggtgtgagactatagtt agttgcgacgggtacgtcgttaaaagaatagctatcagtccaggcctgtatgggaagccttcaggcta tgctgctacgatgcaccgcgagggattcttgtgctgcaaagtgacagacacattgaacggggagaggg tctcttttcccgtgtgcacgtatgtgccagctacattgtgtgaccaaatgactggcatactggcaaca gatgtcagtgcggacgacgcgcaaaaactgctggttgggctcaaccagcgtatagtcgtcaacggtcg cacccagagaaacaccaataccatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggt gggcaaaggaatataaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtc atggggtgttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaac catcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacattggaga tcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctctcattaccgcc gaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgtgaagccgaggagttgcg cgcagctctaccacctttggcagctgatgttgaggagcccactctggaagccgatgtagacttgatgt tacaagaggctggggccggctcagtggagacacctcgtggcttgataaaggttaccagctacgatggc gaggacaagatcggctcttacgctgtgctttctccgcaggctgtactcaagagtgaaaaattatcttg catccaccctctcgctgaacaagtcatagtgataacacactctggccgaaaagggcgttatgccgtgg aaccataccatggtaaagtagtggtgccagagggacatgcaatacccgtccaggactttcaagctctg agtgaaagtgccaccattgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccac 
acatggaggagcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcg aatacctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctcaca ggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgaccagccgctcc ttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctggcatcattaaaagcg cagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgtgcagaaattataagggacgtc aagaaaatgaaagggctggacgtcaatgccagaactgtggactcagtgctcttgaatggatgcaaaca ccccgtagagaccctgtatattgacgaagcttttgcttgtcatgcaggtactctcagagcgctcatag ccattataagacctaaaaaggcagtgctctgcggggatcccaaacagtgcggtttttttaacatgatg tgcctgaaagtgcattttaaccacgagatttgcacacaagtcttccacaaaagcatctctcgccgttg cactaaatctgtgacttcggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccga aagagactaagattgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcact tgtttcagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctgc ctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcctctgtacg cacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgtgtggaaaacacta gccggcgacccatggataaaaacactgactgccaagtaccctgggaatttcactgccacgatagagga gtggcaagcagagcatgatgccatcatgaggcacatcttggagagaccggaccctaccgacgtcttcc agaataaggcaaacgtgtgttgggccaaggctttagtgccggtgctgaagaccgctggcatagacatg accactgaacaatggaacactgtggattattttgaaacggacaaagctcactcagcagagatagtatt gaaccaactatgcgtgaggttctttggactcgatctggactccggtctattttctgcacccactgttc
cgttatccattaggaataatcactgggataactccccgtcgcctaacatgtacgggctgaataaagaa gtggtccgtcagctctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatga catgaacactggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgc ctcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagcaaattg aagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggttgactggttgtc agaccggcctgaggctaccttcagagctcggctggatttaggcatcccaggtgatgtgcccaaatatg acataatatttgttaatgtgaggaccccatataaataccatcactatcagcagtgtgaagaccatgcc attaagcttagcatgttgaccaagaaagcttgtctgcatctgaatcccggcggaacctgtgtcagcat aggttatggttacgctgacagggccagcgaaagcatcattggtgctatagcgcggcagttcaagtttt cccgggtatgcaaaccgaaatcctcacttgaagagacggaagttctgtttgtattcattgggtacgat cgcaaggcccgtacgcacaatccttacaagctttcatcaaccttgaccaacatttatacaggttccag actccacgaagccggatgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaag gagtgattataaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataag aaattcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgcagc taaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtgacaaacagt tggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaagtcagtagcgattcca ctgttgtccaccggcatcttttccgggaacaaagatcgactaacccaatcattgaaccatttgctgac agctttagacaccactgatgcagatgtagccatatactgcagggacaagaaatgggaaatgactctca aggaagcagtggctaggagagaagcagtggaggagatatgcatatccgacgactcttcagtgacagaa cctgatgcagagctggtgagggtgcatccgaagagttctttggctggaaggaagggctacagcacaag cgatggcaaaactttctcatatttggaagggaccaagtttcaccaggcggccaaggatatagcagaaa ttaatgccatgtggcccgttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagc atgagcagtattaggtcgaaatgccccgtcgaagagtcggaagcctccacaccacctagcacgctgcc ttgcttgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaaa ttactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatccaatgctcc cagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatctcgtggaaacacc accggtagacgagactccggagccatcggcagagaaccaatccacagaggggacacctgaacaaccac cacttataaccgaggatgagaccaggactagaacgcctgagccgatcatcatcgaagaggaagaagag 
gatagcataagtttgctgtcagatggcccgacccaccaggtgctgcaagtcgaggcagacattcacgg gccgccctctgtatctagctcatcctggtccattcctcatgcatccgactttgatgtggacagtttat ccatacttgacaccctggagggagctagcgtgaccagcggggcaacgtcagccgagactaactcttac ttcgcaaagagtatggagtttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctcc acatcccgctccgcgcacaagaacaccgtcacttgcacccagcagggcctgctcgagaaccagcctag tttccaccccgccaggcgtgaatagggtgatcactagagaggagctcgaggcgcttaccccgtcacgc actcctagcaggtcggtctcgagaaccagcctggtctccaacccgccaggcgtaaatagggtgattac aagagaggagtttgaggcgttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatctttt cctccgacaccggtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtg ttggagaggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtggagaaca tgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggcagaaggaaaagtg gagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaaccgtgccttttcaagccc caaggtcgcagtggaagcctgtaacgccatgttgaaagagaactttccgactgtggcttcttactgta ttattccagagtacgatgcctatttggacatggttgacggagcttcatgctgcttagacactgccagt ttttgccctgcaaagctgcgcagctttccaaagaaacactcctatttggaacccacaatacgatcggc agtgccttcagcgatccagaacacgctccagaacgtcctggcagctgccacaaaaagaaattgcaatg tcacgcaaatgagagaattgcccgtattggattcggcggcctttaatgtggaatgcttcaagaaatat gcgtgtaataatgaatattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggt aaattacattaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaata tgttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactccagga acaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctagcaacagcgta tctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgcttccgaacattcatacac tgtttgatatgtcggctgaagactttgacgctattatagccgagcacttccagcctggggattgtgtt ctggaaactgacatcgcgtcgtttgataaaagtgaggacgacgccatggctctgaccgcgttaatgat tctggaagacttaggtgtggacgcagagctgttgacgctgattgaggcggctttcggcgaaatttcat caatacatttgcccactaaaactaaatttaaattcggagccatgatgaaatctggaatgttcctcaca ctgtttgtgaacacagtcattaacattgtaatcgcaagcagagtgttgagagaacggctaaccggatc 
accatgtgcagcattcattggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcag acaggtgcgccacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgcct tatttctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagaccccct aaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacaggagaaggg cattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgtgcaaggcagtagaa tcaaggtatgaaaccgtaggaacttccatcatagttatggccatgactactctagctagcagtgttaa atcattcagctacctgagaggggcccctataactctctacggctaacctgaatggactacgacatagt ctagtcgagtctagtcgacgccaccatgtttctgatccagtgcctgatcagcgccgtgatcttctata ttcaagtcacaaacgccctgatctttaagggcgaccacgtgtcactgcaggtcaacagcagcctgacc agcatcctgatccccatgcagaacgacaattacaccgagatcaagggccagctggtgttcatcggcga gcagctgcccaccggcaccaattacagcggcaccctggaactgctgtacgccgataccgtggccttct gcttcagaagcgtgcaggtcatcagatacgacggctgcccccggatcagaaccagcgccttcatcagc tgccggtacaagcacagctggcactacggcaacagcaccgaccggatcagcaccgaacctgatgccgg cgtgatgctgaagatcaccaagcccggcatcaacgacgccggcgtgtacgtgctgctcgtgcggctgg atcacagcagaagcaccgacggcttcatcctgggcgtgaacgtgtacaccgccggcagccaccacaac atccacggcgtgatctacaccagccccagcctgcagaacggctacagcaccagagccctgttccagca ggccagactgtgcgatctgcccgccacacctaagggcagcggcacaagcctgtttcagcacatgctgg acctgagagccggcaagagcctggaagataacccctggctgcacgaggacgtggtcaccaccgagaca aagagcgtggtcaaagagggcatcgagaaccacgtgtaccccaccgacatgagcaccctgcccgagaa gtccctgaacgacccccctgagaacctgctgatcatcatccccatcgtggccagcgtgatgatcctga ccgccatggtcatcgtgatcgtgatcagcgtgaagcggcggagaatcaagaagcaccccatctaccgg cccaacaccaagaccagacggggcatccagaacgccacccctgagtccgacgtgatgctggaagccgc cattgcccagctggccaccatcagagaggaaagcccccctcacagcgtcgtgaaccccttcgtgaagt aatctagacgcggccgcatacagcagcaattggcaagctgcttacatagaactcgcggcgattggcat gccgccttaaaatttttattttatttttcttttcttttccgaatcggattttgtttttaatatttcaa aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaagggtcggcatggcatctccacctcctcgcggtccg acctgggcatccgaaggaggacgcacgtccactcggatggctaagggagagccacgtttaaaccagct ccaattcgccctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgg 
gaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatag cgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccct gtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgcc ctagcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagc tctaaatcgggggctccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttg attagggtgatggttcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggag tccacgttctttaatagtggactcttgttccaaactggaacaacactcaaccctatctcggtctattc ttttgatttataagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaat ttaacgcgaattttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcg gaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctga taaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcc cttttttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctg aagatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgagagt tttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatc ccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagt actcaccagtcacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccata accatgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgc ttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagcca taccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaact ggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcagg accacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtg ggtctcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacg acggggagtcaggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaa gcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaat ttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcg ttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgt aatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctac caactctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtag 
ccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgtt accagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccgg ataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctac accgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcgga caggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcct ggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtca ggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggcc ttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagt gagctgataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagag cgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacgacaggtt tcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcacccc aggctttacactttatgctcccggctcgtatgttgtgtggaattgtgagcggataacaatttcacaca ggaaacagctatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctggg taccgggcccacgcgtaatacgactcactatag_11797 VZV VEErep.SGPgE-SGPgI (SEQ ID NO: 79): 1_ ataggcggcgcatgagagaagcccagaccaattacctacccaaaatggagaaagttcacgttgacatc gaggaagacagcccattcctcagagctttgcagcggagcttcccgcagtttgaggtagaagccaagca ggtcactgataatgaccatgctaatgccagagcgttttcgcatctggcttcaaaactgatcgaaacgg aggtggacccatccgacacgatccttgacattggaagtgcgcccgcccgcagaatgtattctaagcac aagtatcattgtatctgtccgatgagatgtgcggaagatccggacagattgtataagtatgcaactaa gctgaagaaaaactgtaaggaaataactgataaggaattggacaagaaaatgaaggagctcgccgccg tcatgagcgaccctgacctggaaactgagactatgtgcctccacgacgacgagtcgtgtcgctacgaa gggcaagtcgctgtttaccaggatgtatacgcggttgacggaccgacaagtctctatcaccaagccaa taagggagttagagtcgcctactggataggctttgacaccaccccttttatgtttaagaacttggctg gagcatatccatcatactctaccaactgggccgacgaaaccgtgttaacggctcgtaacataggccta tgcagctctgacgttatggagcggtcacgtagagggatgtccattcttagaaagaagtatttgaaacc atccaacaatgttctattctctgttggctcgaccatctaccacgagaagagggacttactgaggagct ggcacctgccgtctgtatttcacttacgtggcaagcaaaattacacatgtcggtgtgagactatagtt agttgcgacgggtacgtcgttaaaagaatagctatcagtccaggcctgtatgggaagccttcaggcta 
tgctgctacgatgcaccgcgagggattcttgtgctgcaaagtgacagacacattgaacggggagaggg tctcttttcccgtgtgcacgtatgtgccagctacattgtgtgaccaaatgactggcatactggcaaca gatgtcagtgcggacgacgcgcaaaaactgctggttgggctcaaccagcgtatagtcgtcaacggtcg cacccagagaaacaccaataccatgaaaaattaccttttgcccgtagtggcccaggcatttgctaggt gggcaaaggaatataaggaagatcaagaagatgaaaggccactaggactacgagatagacagttagtc atggggtgttgttgggcttttagaaggcacaagataacatctatttataagcgcccggatacccaaac catcatcaaagtgaacagcgatttccactcattcgtgctgcccaggataggcagtaacacattggaga tcgggctgagaacaagaatcaggaaaatgttagaggagcacaaggagccgtcacctctcattaccgcc gaggacgtacaagaagctaagtgcgcagccgatgaggctaaggaggtgcgtgaagccgaggagttgcg cgcagctctaccacctttggcagctgatgttgaggagcccactctggaagccgatgtagacttgatgt tacaagaggctggggccggctcagtggagacacctcgtggcttgataaaggttaccagctacgatggc gaggacaagatcggctcttacgctgtgctttctccgcaggctgtactcaagagtgaaaaattatcttg catccaccctctcgctgaacaagtcatagtgataacacactctggccgaaaagggcgttatgccgtgg aaccataccatggtaaagtagtggtgccagagggacatgcaatacccgtccaggactttcaagctctg agtgaaagtgccaccattgtgtacaacgaacgtgagttcgtaaacaggtacctgcaccatattgccac acatggaggagcgctgaacactgatgaagaatattacaaaactgtcaagcccagcgagcacgacggcg aatacctgtacgacatcgacaggaaacagtgcgtcaagaaagaactagtcactgggctagggctcaca ggcgagctggtggatcctcccttccatgaattcgcctacgagagtctgagaacacgaccagccgctcc ttaccaagtaccaaccataggggtgtatggcgtgccaggatcaggcaagtctggcatcattaaaagcg cagtcaccaaaaaagatctagtggtgagcgccaagaaagaaaactgtgcagaaattataagggacgtc aagaaaatgaaagggctggacgtcaatgccagaactgtggactcagtgctcttgaatggatgcaaaca ccccgtagagaccctgtatattgacgaagcttttgcttgtcatgcaggtactctcagagcgctcatag ccattataagacctaaaaaggcagtgctctgcggggatcccaaacagtgcggtttttttaacatgatg tgcctgaaagtgcattttaaccacgagatttgcacacaagtcttccacaaaagcatctctcgccgttg cactaaatctgtgacttcggtcgtctcaaccttgttttacgacaaaaaaatgagaacgacgaatccga aagagactaagattgtgattgacactaccggcagtaccaaacctaagcaggacgatctcattctcact tgtttcagagggtgggtgaagcagttgcaaatagattacaaaggcaacgaaataatgacggcagctgc ctctcaagggctgacccgtaaaggtgtgtatgccgttcggtacaaggtgaatgaaaatcctctgtacg 
cacccacctcagaacatgtgaacgtcctactgacccgcacggaggaccgcatcgtgtggaaaacacta gccggcgacccatggataaaaacactgactgccaagtaccctgggaatttcactgccacgatagagga gtggcaagcagagcatgatgccatcatgaggcacatcttggagagaccggaccctaccgacgtcttcc agaataaggcaaacgtgtgttgggccaaggctttagtgccggtgctgaagaccgctggcatagacatg accactgaacaatggaacactgtggattattttgaaacggacaaagctcactcagcagagatagtatt gaaccaactatgcgtgaggttctttggactcgatctggactccggtctattttctgcacccactgttc cgttatccattaggaataatcactgggataactccccgtcgcctaacatgtacgggctgaataaagaa gtggtccgtcagctctctcgcaggtacccacaactgcctcgggcagttgccactggaagagtctatga catgaacactggtacactgcgcaattatgatccgcgcataaacctagtacctgtaaacagaagactgc ctcatgctttagtcctccaccataatgaacacccacagagtgacttttcttcattcgtcagcaaattg aagggcagaactgtcctggtggtcggggaaaagttgtccgtcccaggcaaaatggttgactggttgtc agaccggcctgaggctaccttcagagctcggctggatttaggcatcccaggtgatgtgcccaaatatg acataatatttgttaatgtgaggaccccatataaataccatcactatcagcagtgtgaagaccatgcc attaagcttagcatgttgaccaagaaagcttgtctgcatctgaatcccggcggaacctgtgtcagcat aggttatggttacgctgacagggccagcgaaagcatcattggtgctatagcgcggcagttcaagtttt cccgggtatgcaaaccgaaatcctcacttgaagagacggaagttctgtttgtattcattgggtacgat cgcaaggcccgtacgcacaatccttacaagctttcatcaaccttgaccaacatttatacaggttccag actccacgaagccggatgtgcaccctcatatcatgtggtgcgaggggatattgccacggccaccgaag gagtgattataaatgctgctaacagcaaaggacaacctggcggaggggtgtgcggagcgctgtataag aaattcccggaaagcttcgatttacagccgatcgaagtaggaaaagcgcgactggtcaaaggtgcagc taaacatatcattcatgccgtaggaccaaacttcaacaaagtttcggaggttgaaggtgacaaacagt tggcagaggcttatgagtccatcgctaagattgtcaacgataacaattacaagtcagtagcgattcca ctgttgtccaccggcatcttttccgggaacaaagatcgactaacccaatcattgaaccatttgctgac agctttagacaccactgatgcagatgtagccatatactgcagggacaagaaatgggaaatgactctca aggaagcagtggctaggagagaagcagtggaggagatatgcatatccgacgactcttcagtgacagaa cctgatgcagagctggtgagggtgcatccgaagagttctttggctggaaggaagggctacagcacaag cgatggcaaaactttctcatatttggaagggaccaagtttcaccaggcggccaaggatatagcagaaa ttaatgccatgtggcccgttgcaacggaggccaatgagcaggtatgcatgtatatcctcggagaaagc 
atgagcagtattaggtcgaaatgccccgtcgaagagtcggaagcctccacaccacctagcacgctgcc ttgcttgtgcatccatgccatgactccagaaagagtacagcgcctaaaagcctcacgtccagaacaaa ttactgtgtgctcatcctttccattgccgaagtatagaatcactggtgtgcagaagatccaatgctcc cagcctatattgttctcaccgaaagtgcctgcgtatattcatccaaggaagtatctcgtggaaacacc accggtagacgagactccggagccatcggcagagaaccaatccacagaggggacacctgaacaaccac cacttataaccgaggatgagaccaggactagaacgcctgagccgatcatcatcgaagaggaagaagag gatagcataagtttgctgtcagatggcccgacccaccaggtgctgcaagtcgaggcagacattcacgg gccgccctctgtatctagctcatcctggtccattcctcatgcatccgactttgatgtggacagtttat ccatacttgacaccctggagggagctagcgtgaccagcggggcaacgtcagccgagactaactcttac ttcgcaaagagtatggagtttctggcgcgaccggtgcctgcgcctcgaacagtattcaggaaccctcc acatcccgctccgcgcacaagaacaccgtcacttgcacccagcagggcctgctcgagaaccagcctag tttccaccccgccaggcgtgaatagggtgatcactagagaggagctcgaggcgcttaccccgtcacgc actcctagcaggtcggtctcgagaaccagcctggtctccaacccgccaggcgtaaatagggtgattac aagagaggagtttgaggcgttcgtagcacaacaacaatgacggtttgatgcgggtgcatacatctttt cctccgacaccggtcaagggcatttacaacaaaaatcagtaaggcaaacggtgctatccgaagtggtg ttggagaggaccgaattggagatttcgtatgccccgcgcctcgaccaagaaaaagaagaattactacg caagaaattacagttaaatcccacacctgctaacagaagcagataccagtccaggaaggtggagaaca tgaaagccataacagctagacgtattctgcaaggcctagggcattatttgaaggcagaaggaaaagtg gagtgctaccgaaccctgcatcctgttcctttgtattcatctagtgtgaaccgtgccttttcaagccc caaggtcgcagtggaagcctgtaacgccatgttgaaagagaactttccgactgtggcttcttactgta ttattccagagtacgatgcctatttggacatggttgacggagcttcatgctgcttagacactgccagt ttttgccctgcaaagctgcgcagctttccaaagaaacactcctatttggaacccacaatacgatcggc agtgccttcagcgatccagaacacgctccagaacgtcctggcagctgccacaaaaagaaattgcaatg tcacgcaaatgagagaattgcccgtattggattcggcggcctttaatgtggaatgcttcaagaaatat gcgtgtaataatgaatattgggaaacgtttaaagaaaaccccatcaggcttactgaagaaaacgtggt aaattacattaccaaattaaaaggaccaaaagctgctgctctttttgcgaagacacataatttgaata tgttgcaggacataccaatggacaggtttgtaatggacttaaagagagacgtgaaagtgactccagga acaaaacatactgaagaacggcccaaggtacaggtgatccaggctgccgatccgctagcaacagcgta 
tctgtgcggaatccaccgagagctggttaggagattaaatgcggtcctgcttccgaacattcatacac tgtttgatatgtcggctgaagactttgacgctattatagccgagcacttccagcctggggattgtgtt ctggaaactgacatcgcgtcgtttgataaaagtgaggacgacgccatggctctgaccgcgttaatgat tctggaagacttaggtgtggacgcagagctgttgacgctgattgaggcggctttcggcgaaatttcat caatacatttgcccactaaaactaaatttaaattcggagccatgatgaaatctggaatgttcctcaca ctgtttgtgaacacagtcattaacattgtaatcgcaagcagagtgttgagagaacggctaaccggatc accatgtgcagcattcattggagatgacaatatcgtgaaaggagtcaaatcggacaaattaatggcag acaggtgcgccacctggttgaatatggaagtcaagattatagatgctgtggtgggcgagaaagcgcct tatttctgtggagggtttattttgtgtgactccgtgaccggcacagcgtgccgtgtggcagaccccct aaaaaggctgtttaagcttggcaaacctctggcagcagacgatgaacatgatgatgacaggagaaggg cattgcatgaagagtcaacacgctggaaccgagtgggtattctttcagagctgtgcaaggcagtagaa tcaaggtatgaaaccgtaggaacttccatcatagttatggccatgactactctagctagcagtgttaa atcattcagctacctgagaggggcccctataactctctacggctaacctgaatggactacgacatagt ctagtcgagtctagtcgacgccaccatgggcaccgtgaacaagcctgtcgtgggcgtgctgatgggct tcggcatcatcaccggcaccctgagaatcaccaaccctgtgcgggccagcgtgctgagatacgacgac ttccacatcgacgaggacaagctggacaccaacagcgtgtacgagccctactaccacagcgaccacgc cgagagcagctgggtcaacagaggcgagagcagccggaaggcctacgaccacaacagcccctacatct ggccccggaacgactacgacggcttcctggaaaacgcccacgagcaccacggcgtgtacaatcagggc agaggcatcgacagcggcgagagactgatgcagcccacacagatgagcgcccaggaagatctgggcga cgacacaggcatccacgtgatccccaccctgaacggcgacgaccggcacaagatcgtgaacgtggacc agcggcagtacggcgacgtgttcaagggcgacctgaaccctaagccccagggccagagactgatcgag gtgtccgtggaagagaaccaccccttcaccctgagagcccccatccagagaatctacggcgtgcggta taccgagacttggagcttcctgcccagcctgacctgtacaggcgacgccgctcctgccatccagcaca tctgcctgaagcacaccacctgtttccaggacgtggtggtggacgtggactgcgccgagaacaccaaa
gaggaccagctggccgagatcagctaccggttccagggcaagaaagaggccgaccagccctggatcgt ggtcaataccagcaccctgttcgacgagctggaactggacccccccgagattgaacccggcgtgctga aggtgctgcggaccgagaagcagtacctgggcgtgtacatctggaacatgcggggctccgacggcacc tctacctacgccaccttcctggtcacatggaagggcgacgagaaaacccggaaccctacccctgccgt gacccctcagcctagaggcgccgagttccatatgtggaattaccactcccacgtgttcagcgtgggcg acaccttcagcctggccatgcatctgcagtacaagatccacgaggcccccttcgacctgctgctggaa tggctgtacgtgcccatcgaccctacctgccagcccatgcggctgtacagcacctgtctgtaccaccc caacgcccctcagtgcctgagccacatgaacagcggctgcaccttcaccagccctcacctggctcaga gggtggccagcaccgtgtaccagaattgcgagcacgccgacaactacaccgcctactgcctgggcatc agccacatggaacccagcttcggcctgatcctgcacgatggcggcaccaccctgaagttcgtggacac acccgagagcctgagcggcctgtacgtgttcgtggtgtacttcaacggccacgtggaagccgtggcct acaccgtggtgtccaccgtggaccacttcgtgaacgccatcgaggaaagaggcttcccacccacagcc ggacagcctccagccaccaccaagcccaaagaaatcacccccgtgaaccccggcaccagccccctgct gagatatgctgcttggacaggcggactggccgctgtggtgctgctgtgcctggtcatcttcctgatct gcaccgccaagcggatgagagtgaaggcctaccgggtggacaagtccccctacaaccagagcatgtac tacgccggcctgcccgtggacgatttcgaggatagcgagagcaccgacaccgaggaagagttcggcaa cgccatcggcggatctcacggcggcagcagctacaccgtgtacatcgacaagaccagataatctagac gtcgcgaccacccaggatccgcctataactctctacggctaacctgaatggactacgacatagtctag tcgacgccaccatgtttctgatccagtgcctgatcagcgccgtgatcttctatattcaagtcacaaac gccctgatctttaagggcgaccacgtgtcactgcaggtcaacagcagcctgaccagcatcctgatccc catgcagaacgacaattacaccgagatcaagggccagctggtgttcatcggcgagcagctgcccaccg gcaccaattacagcggcaccctggaactgctgtacgccgataccgtggccttctgcttcagaagcgtg caggtcatcagatacgacggctgcccccggatcagaaccagcgccttcatcagctgccggtacaagca cagctggcactacggcaacagcaccgaccggatcagcaccgaacctgatgccggcgtgatgctgaaga tcaccaagcccggcatcaacgacgccggcgtgtacgtgctgctcgtgcggctggatcacagcagaagc accgacggcttcatcctgggcgtgaacgtgtacaccgccggcagccaccacaacatccacggcgtgat ctacaccagccccagcctgcagaacggctacagcaccagagccctgttccagcaggccagactgtgcg atctgcccgccacacctaagggcagcggcacaagcctgtttcagcacatgctggacctgagagccggc 
aagagcctggaagataacccctggctgcacgaggacgtggtcaccaccgagacaaagagcgtggtcaa agagggcatcgagaaccacgtgtaccccaccgacatgagcaccctgcccgagaagtccctgaacgacc cccctgagaacctgctgatcatcatccccatcgtggccagcgtgatgatcctgaccgccatggtcatc gtgatcgtgatcagcgtgaagcggcggagaatcaagaagcaccccatctaccggcccaacaccaagac cagacggggcatccagaacgccacccctgagtccgacgtgatgctggaagccgccattgcccagctgg ccaccatcagagaggaaagcccccctcacagcgtcgtgaaccccttcgtgaagtaatctagacgcggc cgcatacagcagcaattggcaagctgcttacatagaactcgcggcgattggcatgccgccttaaaatt tttattttatttttcttttcttttccgaatcggattttgtttttaatatttcaaaaaaaaaaaaaaaa aaaaaaaaaaaaaaaaaaagggtcggcatggcatctccacctcctcgcggtccgacctgggcatccga aggaggacgcacgtccactcggatggctaagggagagccacgtttaaaccagctccaattcgccctat agtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgt tacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgca ccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcatta agcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcc tttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggc tccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggt tcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaa tagtggactcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataag ggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaatttt aacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacccctatttgt ttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaata atattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggcat tttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggt gcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgagagttttcgccccgaaga acgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccg ggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcaca gaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataa cactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaaca 
tgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacgag cgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttac tctagcttcccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgct cggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcggtatc attgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggc aactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgt cagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaaaggatctag gtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtc agaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgc aaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccg aaggtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttaggcca ccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctg ccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcgg tcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagata cctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaa gcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagt cctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcct atggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcacatgt tctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgct cgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaa accgcctctccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaag cgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttacacttt atgctcccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatga ccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctgggtaccgggcccacgc gtaatacgactcactatag_13775
VEE-based replicon encoding eGFP (SEQ ID NO: 80)
nsP1 ~~~~~~~~~~~~~~~~~ 1 ATAGGCGGCG CATGAGAGAA GCCCAGACCA ATTACCTACC CAAAATGGAG AAAGTTCACG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 61 TTGACATCGA GGAAGACAGC CCATTCCTCA GAGCTTTGCA GCGGAGCTTC CCGCAGTTTG nsP1 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 121 AGGTAGAAGC CAAGCAGGTC ACTGATAATG ACCATGCTAA TGCCAGAGCG TTTTCGCATC nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 TGGCTTCAAA ACTGATCGAA ACGGAGGTGG ACCCATCCGA CACGATCCTT GACATTGGAA nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 GTGCGCCCGC CCGCAGAATG TATTCTAAGC ACAAGTATCA TTGTATCTGT CCGATGAGAT nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 GTGCGGAAGA TCCGGACAGA TTGTATAAGT ATGCAACTAA GCTGAAGAAA AACTGTAAGG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 AAATAACTGA TAAGGAATTG GACAAGAAAA TGAAGGAGCT CGCCGCCGTC ATGAGCGACC nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 CTGACCTGGA AACTGAGACT ATGTGCCTCC ACGACGACGA GTCGTGTCGC TACGAAGGGC nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 AAGTCGCTGT TTACCAGGAT GTATACGCGG TTGACGGACC GACAAGTCTC TATCACCAAG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 541 CCAATAAGGG AGTTAGAGTC GCCTACTGGA TAGGCTTTGA CACCACCCCT TTTATGTTTA nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 601 AGAACTTGGC TGGAGCATAT CCATCATACT CTACCAACTG GGCCGACGAA ACCGTGTTAA nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 661 CGGCTCGTAA CATAGGCCTA TGCAGCTCTG ACGTTATGGA GCGGTCACGT AGAGGGATGT nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 721 CCATTCTTAG AAAGAAGTAT TTGAAACCAT CCAACAATGT TCTATTCTCT GTTGGCTCGA nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 781 CCATCTACCA CGAGAAGAGG GACTTACTGA GGAGCTGGCA CCTGCCGTCT GTATTTCACT nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 841 TACGTGGCAA GCAAAATTAC ACATGTCGGT GTGAGACTAT AGTTAGTTGC GACGGGTACG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 901 TCGTTAAAAG AATAGCTATC AGTCCAGGCC TGTATGGGAA GCCTTCAGGC TATGCTGCTA nsP1 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 961 CGATGCACCG CGAGGGATTC TTGTGCTGCA AAGTGACAGA CACATTGAAC GGGGAGAGGG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1021 TCTCTTTTCC CGTGTGCACG TATGTGCCAG CTACATTGTG TGACCAAATG ACTGGCATAC nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1081 TGGCAACAGA TGTCAGTGCG GACGACGCGC AAAAACTGCT GGTTGGGCTC AACCAGCGTA nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1141 TAGTCGTCAA CGGTCGCACC CAGAGAAACA CCAATACCAT GAAAAATTAC CTTTTGCCCG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1201 TAGTGGCCCA GGCATTTGCT AGGTGGGCAA AGGAATATAA GGAAGATCAA GAAGATGAAA nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1261 GGCCACTAGG ACTACGAGAT AGACAGTTAG TCATGGGGTG TTGTTGGGCT TTTAGAAGGC nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1321 ACAAGATAAC ATCTATTTAT AAGCGCCCGG ATACCCAAAC CATCATCAAA GTGAACAGCG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1381 ATTTCCACTC ATTCGTGCTG CCCAGGATAG GCAGTAACAC ATTGGAGATC GGGCTGAGAA nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1441 CAAGAATCAG GAAAATGTTA GAGGAGCACA AGGAGCCGTC ACCTCTCATT ACCGCCGAGG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1501 ACGTACAAGA AGCTAAGTGC GCAGCCGATG AGGCTAAGGA GGTGCGTGAA GCCGAGGAGT nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1561 TGCGCGCAGC TCTACCACCT TTGGCAGCTG ATGTTGAGGA GCCCACTCTG GAAGCCGATG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1621 TAGACTTGAT GTTACAAGAG GCTGGGGCCG GCTCAGTGGA GACACCTCGT GGCTTGATAA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1681 AGGTTACCAG CTACGATGGC GAGGACAAGA TCGGCTCTTA CGCTGTGCTT TCTCCGCAGG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1741 CTGTACTCAA GAGTGAAAAA TTATCTTGCA TCCACCCTCT CGCTGAACAA GTCATAGTGA nsP2 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1801 TAACACACTC TGGCCGAAAA GGGCGTTATG CCGTGGAACC ATACCATGGT AAAGTAGTGG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1861 TGCCAGAGGG ACATGCAATA CCCGTCCAGG ACTTTCAAGC TCTGAGTGAA AGTGCCACCA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1921 TTGTGTACAA CGAACGTGAG TTCGTAAACA GGTACCTGCA CCATATTGCC ACACATGGAG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1981 GAGCGCTGAA CACTGATGAA GAATATTACA AAACTGTCAA GCCCAGCGAG CACGACGGCG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2041 AATACCTGTA CGACATCGAC AGGAAACAGT GCGTCAAGAA AGAACTAGTC ACTGGGCTAG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2101 GGCTCACAGG CGAGCTGGTG GATCCTCCCT TCCATGAATT CGCCTACGAG AGTCTGAGAA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2161 CACGACCAGC CGCTCCTTAC CAAGTACCAA CCATAGGGGT GTATGGCGTG CCAGGATCAG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2221 GCAAGTCTGG CATCATTAAA AGCGCAGTCA CCAAAAAAGA TCTAGTGGTG AGCGCCAAGA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2281 AAGAAAACTG TGCAGAAATT ATAAGGGACG TCAAGAAAAT GAAAGGGCTG GACGTCAATG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2341 CCAGAACTGT GGACTCAGTG CTCTTGAATG GATGCAAACA CCCCGTAGAG ACCCTGTATA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2401 TTGACGAAGC TTTTGCTTGT CATGCAGGTA CTCTCAGAGC GCTCATAGCC ATTATAAGAC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
2461 CTAAAAAGGC AGTGCTCTGC GGGGATCCCA AACAGTGCGG TTTTTTTAAC ATGATGTGCC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2521 TGAAAGTGCA TTTTAACCAC GAGATTTGCA CACAAGTCTT CCACAAAAGC ATCTCTCGCC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2581 GTTGCACTAA ATCTGTGACT TCGGTCGTCT CAACCTTGTT TTACGACAAA AAAATGAGAA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2641 CGACGAATCC GAAAGAGACT AAGATTGTGA TTGACACTAC CGGCAGTACC AAACCTAAGC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2701 AGGACGATCT CATTCTCACT TGTTTCAGAG GGTGGGTGAA GCAGTTGCAA ATAGATTACA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2761 AAGGCAACGA AATAATGACG GCAGCTGCCT CTCAAGGGCT GACCCGTAAA GGTGTGTATG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2821 CCGTTCGGTA CAAGGTGAAT GAAAATCCTC TGTACGCACC CACCTCAGAA CATGTGAACG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2881 TCCTACTGAC CCGCACGGAG GACCGCATCG TGTGGAAAAC ACTAGCCGGC GACCCATGGA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2941 TAAAAACACT GACTGCCAAG TACCCTGGGA ATTTCACTGC CACGATAGAG GAGTGGCAAG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3001 CAGAGCATGA TGCCATCATG AGGCACATCT TGGAGAGACC GGACCCTACC GACGTCTTCC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3061 AGAATAAGGC AAACGTGTGT TGGGCCAAGG CTTTAGTGCC GGTGCTGAAG ACCGCTGGCA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3121 TAGACATGAC CACTGAACAA TGGAACACTG TGGATTATTT TGAAACGGAC AAAGCTCACT nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3181 CAGCAGAGAT AGTATTGAAC CAACTATGCG TGAGGTTCTT TGGACTCGAT CTGGACTCCG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3241 GTCTATTTTC TGCACCCACT GTTCCGTTAT CCATTAGGAA TAATCACTGG GATAACTCCC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3301 
CGTCGCCTAA CATGTACGGG CTGAATAAAG AAGTGGTCCG TCAGCTCTCT CGCAGGTACC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3361 CACAACTGCC TCGGGCAGTT GCCACTGGAA GAGTCTATGA CATGAACACT GGTACACTGC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3421 GCAATTATGA TCCGCGCATA AACCTAGTAC CTGTAAACAG AAGACTGCCT CATGCTTTAG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3481 TCCTCCACCA TAATGAACAC CCACAGAGTG ACTTTTCTTC ATTCGTCAGC AAATTGAAGG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3541 GCAGAACTGT CCTGGTGGTC GGGGAAAAGT TGTCCGTCCC AGGCAAAATG GTTGACTGGT nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3601 TGTCAGACCG GCCTGAGGCT ACCTTCAGAG CTCGGCTGGA TTTAGGCATC CCAGGTGATG nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3661 TGCCCAAATA TGACATAATA TTTGTTAATG TGAGGACCCC ATATAAATAC CATCACTATC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3721 AGCAGTGTGA AGACCATGCC ATTAAGCTTA GCATGTTGAC CAAGAAAGCT TGTCTGCATC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3781 TGAATCCCGG CGGAACCTGT GTCAGCATAG GTTATGGTTA CGCTGACAGG GCCAGCGAAA nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3841 GCATCATTGG TGCTATAGCG CGGCAGTTCA AGTTTTCCCG GGTATGCAAA CCGAAATCCT nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3901 CACTTGAAGA GACGGAAGTT CTGTTTGTAT TCATTGGGTA CGATCGCAAG GCCCGTACGC nsP2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3961 ACAATCCTTA CAAGCTTTCA TCAACCTTGA CCAACATTTA TACAGGTTCC AGACTCCACG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ nsP2 ~~~~~~~~~~~~ 4021 AAGCCGGATG TGCACCCTCA TATCATGTGG TGCGAGGGGA TATTGCCACG GCCACCGAAG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4081 GAGTGATTAT AAATGCTGCT AACAGCAAAG GACAACCTGG CGGAGGGGTG TGCGGAGCGC nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4141 
TGTATAAGAA ATTCCCGGAA AGCTTCGATT TACAGCCGAT CGAAGTAGGA AAAGCGCGAC nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4201 TGGTCAAAGG TGCAGCTAAA CATATCATTC ATGCCGTAGG ACCAAACTTC AACAAAGTTT nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4261 CGGAGGTTGA AGGTGACAAA CAGTTGGCAG AGGCTTATGA GTCCATCGCT AAGATTGTCA nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4321 ACGATAACAA TTACAAGTCA GTAGCGATTC CACTGTTGTC CACCGGCATC TTTTCCGGGA nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4381 ACAAAGATCG ACTAACCCAA TCATTGAACC ATTTGCTGAC AGCTTTAGAC ACCACTGATG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4441 CAGATGTAGC CATATACTGC AGGGACAAGA AATGGGAAAT GACTCTCAAG GAAGCAGTGG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4501 CTAGGAGAGA AGCAGTGGAG GAGATATGCA TATCCGACGA CTCTTCAGTG ACAGAACCTG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4561 ATGCAGAGCT GGTGAGGGTG CATCCGAAGA GTTCTTTGGC TGGAAGGAAG GGCTACAGCA nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4621 CAAGCGATGG CAAAACTTTC TCATATTTGG AAGGGACCAA GTTTCACCAG GCGGCCAAGG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4681 ATATAGCAGA AATTAATGCC ATGTGGCCCG TTGCAACGGA GGCCAATGAG CAGGTATGCA nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4741 TGTATATCCT CGGAGAAAGC ATGAGCAGTA TTAGGTCGAA ATGCCCCGTC GAAGAGTCGG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4801 AAGCCTCCAC ACCACCTAGC ACGCTGCCTT GCTTGTGCAT CCATGCCATG ACTCCAGAAA nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4861 GAGTACAGCG CCTAAAAGCC TCACGTCCAG AACAAATTAC TGTGTGCTCA TCCTTTCCAT nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4921 TGCCGAAGTA TAGAATCACT GGTGTGCAGA AGATCCAATG CTCCCAGCCT ATATTGTTCT nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4981 CACCGAAAGT 
GCCTGCGTAT ATTCATCCAA GGAAGTATCT CGTGGAAACA CCACCGGTAG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5041 ACGAGACTCC GGAGCCATCG GCAGAGAACC AATCCACAGA GGGGACACCT GAACAACCAC nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5101 CACTTATAAC CGAGGATGAG ACCAGGACTA GAACGCCTGA GCCGATCATC ATCGAAGAGG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5161 AAGAAGAGGA TAGCATAAGT TTGCTGTCAG ATGGCCCGAC CCACCAGGTG CTGCAAGTCG nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5221 AGGCAGACAT TCACGGGCCG CCCTCTGTAT CTAGCTCATC CTGGTCCATT CCTCATGCAT nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5281 CCGACTTTGA TGTGGACAGT TTATCCATAC TTGACACCCT GGAGGGAGCT AGCGTGACCA nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5341 GCGGGGCAAC GTCAGCCGAG ACTAACTCTT ACTTCGCAAA GAGTATGGAG TTTCTGGCGC nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5401 GACCGGTGCC TGCGCCTCGA ACAGTATTCA GGAACCCTCC ACATCCCGCT CCGCGCACAA nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5461 GAACACCGTC ACTTGCACCC AGCAGGGCCT GCTCGAGAAC CAGCCTAGTT TCCACCCCGC nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5521 CAGGCGTGAA TAGGGTGATC ACTAGAGAGG AGCTCGAGGC GCTTACCCCG TCACGCACTC nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5581 CTAGCAGGTC GGTCTCGAGA ACCAGCCTGG TCTCCAACCC GCCAGGCGTA AATAGGGTGA nsP4 ~~~~~~~~~~~~~~~~~~~~ nsP3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5641 TTACAAGAGA GGAGTTTGAG GCGTTCGTAG CACAACAACA ATGACGGTTT GATGCGGGTG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5701 CATACATCTT TTCCTCCGAC ACCGGTCAAG GGCATTTACA ACAAAAATCA GTAAGGCAAA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5761 CGGTGCTATC CGAAGTGGTG TTGGAGAGGA CCGAATTGGA GATTTCGTAT GCCCCGCGCC nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5821 TCGACCAAGA 
AAAAGAAGAA TTACTACGCA AGAAATTACA GTTAAATCCC ACACCTGCTA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5881 ACAGAAGCAG ATACCAGTCC AGGAAGGTGG AGAACATGAA AGCCATAACA GCTAGACGTA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5941 TTCTGCAAGG CCTAGGGCAT TATTTGAAGG CAGAAGGAAA AGTGGAGTGC TACCGAACCC nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6001 TGCATCCTGT TCCTTTGTAT TCATCTAGTG TGAACCGTGC CTTTTCAAGC CCCAAGGTCG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6061 CAGTGGAAGC CTGTAACGCC ATGTTGAAAG AGAACTTTCC GACTGTGGCT TCTTACTGTA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6121 TTATTCCAGA GTACGATGCC TATTTGGACA TGGTTGACGG AGCTTCATGC TGCTTAGACA nsP4
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6181 CTGCCAGTTT TTGCCCTGCA AAGCTGCGCA GCTTTCCAAA GAAACACTCC TATTTGGAAC nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6241 CCACAATACG ATCGGCAGTG CCTTCAGCGA TCCAGAACAC GCTCCAGAAC GTCCTGGCAG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6301 CTGCCACAAA AAGAAATTGC AATGTCACGC AAATGAGAGA ATTGCCCGTA TTGGATTCGG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6361 CGGCCTTTAA TGTGGAATGC TTCAAGAAAT ATGCGTGTAA TAATGAATAT TGGGAAACGT nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6421 TTAAAGAAAA CCCCATCAGG CTTACTGAAG AAAACGTGGT AAATTACATT ACCAAATTAA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6481 AAGGACCAAA AGCTGCTGCT CTTTTTGCGA AGACACATAA TTTGAATATG TTGCAGGACA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6541 TACCAATGGA CAGGTTTGTA ATGGACTTAA AGAGAGACGT GAAAGTGACT CCAGGAACAA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6601 AACATACTGA AGAACGGCCC AAGGTACAGG TGATCCAGGC TGCCGATCCG CTAGCAACAG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6661 CGTATCTGTG CGGAATCCAC CGAGAGCTGG TTAGGAGATT AAATGCGGTC CTGCTTCCGA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6721 ACATTCATAC ACTGTTTGAT ATGTCGGCTG AAGACTTTGA CGCTATTATA GCCGAGCACT nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6781 TCCAGCCTGG GGATTGTGTT CTGGAAACTG ACATCGCGTC GTTTGATAAA AGTGAGGACG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6841 ACGCCATGGC TCTGACCGCG TTAATGATTC TGGAAGACTT AGGTGTGGAC GCAGAGCTGT nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6901 TGACGCTGAT TGAGGCGGCT TTCGGCGAAA TTTCATCAAT ACATTTGCCC ACTAAAACTA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 6961 AATTTAAATT CGGAGCCATG ATGAAATCTG GAATGTTCCT CACACTGTTT GTGAACACAG nsP4 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7021 TCATTAACAT TGTAATCGCA AGCAGAGTGT TGAGAGAACG GCTAACCGGA TCACCATGTG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7081 CAGCATTCAT TGGAGATGAC AATATCGTGA AAGGAGTCAA ATCGGACAAA TTAATGGCAG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7141 ACAGGTGCGC CACCTGGTTG AATATGGAAG TCAAGATTAT AGATGCTGTG GTGGGCGAGA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7201 AAGCGCCTTA TTTCTGTGGA GGGTTTATTT TGTGTGACTC CGTGACCGGC ACAGCGTGCC nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7261 GTGTGGCAGA CCCCCTAAAA AGGCTGTTTA AGCTTGGCAA ACCTCTGGCA GCAGACGATG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7321 AACATGATGA TGACAGGAGA AGGGCATTGC ATGAAGAGTC AACACGCTGG AACCGAGTGG nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7381 GTATTCTTTC AGAGCTGTGC AAGGCAGTAG AATCAAGGTA TGAAACCGTA GGAACTTCCA nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7441 TCATAGTTAT GGCCATGACT ACTCTAGCTA GCAGTGTTAA ATCATTCAGC TACCTGAGAG subgenomic promoter ~~~~~~~~~~~~~~~~~~~~~~~~~ nsP4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7501 GGGCCCCTAT AACTCTCTAC GGCTAACCTG AATGGACTAC GACATAGTCT AGTCGACGCC eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7561 ACCATGGTGA GCAAGGGCGA GGAGCTGTTC ACCGGGGTGG TGCCCATCCT GGTCGAGCTG eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7621 GACGGCGACG TAAACGGCCA CAAGTTCAGC GTGTCCGGCG AGGGCGAGGG CGATGCCACC eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7681 TACGGCAAGC TGACCCTGAA GTTCATCTGC ACCACCGGCA AGCTGCCCGT GCCCTGGCCC eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7741 ACCCTCGTGA CCACCCTGAC CTACGGCGTG CAGTGCTTCA GCCGCTACCC CGACCACATG eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7801 AAGCAGCACG ACTTCTTCAA GTCCGCCATG CCCGAAGGCT ACGTCCAGGA GCGCACCATC eGFP 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7861 TTCTTCAAGG ACGACGGCAA CTACAAGACC CGCGCCGAGG TGAAGTTCGA GGGCGACACC eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7921 CTGGTGAACC GCATCGAGCT GAAGGGCATC GACTTCAAGG AGGACGGCAA CATCCTGGGG eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 7981 CACAAGCTGG AGTACAACTA CAACAGCCAC AACGTCTATA TCATGGCCGA CAAGCAGAAG eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8041 AACGGCATCA AGGTGAACTT CAAGATCCGC CACAACATCG AGGACGGCAG CGTGCAGCTC eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8101 GCCGACCACT ACCAGCAGAA CACCCCCATC GGCGACGGCC CCGTGCTGCT GCCCGACAAC eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8161 CACTACCTGA GCACCCAGTC CGCCCTGAGC AAAGACCCCA ACGAGAAGCG CGATCACATG eGFP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8221 GTCCTGCTGG AGTTCGTGAC CGCCGCCGGG ATCACTCTCG GCATGGACGA GCTGTACAAG eGFP 3'UTR ~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8281 TGATAATCTA GACGGCGCGC CCACCCAGCG GCCGCATACA GCAGCAATTG GCAAGCTGCT 3'UTR ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8341 TACATAGAAC TCGCGGCGAT TGGCATGCCG CCTTAAAATT TTTATTTTAT TTTTCTTTTC 3'UTR ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8401 TTTTCCGAAT CGGATTTTGT TTTTAATATT TCAAAAAAAA AAAAAAAAAA AAAAAAAAAA HDV ribozyme ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8461 AAAAAAAGGG TCGGCATGGC ATCTCCACCT CCTCGCGGTC CGACCTGGGC ATCCGAAGGA HDV ribozyme ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 8521 GGACGCACGT CCACTCGGAT GGCTAAGGGA GAGCCACGTT TAAACCAGCT CCAATTCGCC 8581 CTATAGTGAG TCGTATTACG CGCGCTCACT GGCCGTCGTT TTACAACGTC GTGACTGGGA 8641 AAACCCTGGC GTTACCCAAC TTAATCGCCT TGCAGCACAT CCCCCTTTCG CCAGCTGGCG 8701 TAATAGCGAA GAGGCCCGCA CCGATCGCCC TTCCCAACAG TTGCGCAGCC TGAATGGCGA 8761 ATGGGACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT 8821 GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC CTTCCTTTCT 8881 
CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG GGGCTCCCTT TAGGGTTCCG 8941 ATTTAGTGCT TTACGGCACC TCGACCCCAA AAAACTTGAT TAGGGTGATG GTTCACGTAG 9001 TGGGCCATCG CCCTGATAGA CGGTTTTTCG CCCTTTGACG TTGGAGTCCA CGTTCTTTAA 9061 TAGTGGACTC TTGTTCCAAA CTGGAACAAC ACTCAACCCT ATCTCGGTCT ATTCTTTTGA 9121 TTTATAAGGG ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA AATGAGCTGA TTTAACAAAA 9181 ATTTAACGCG AATTTTAACA AAATATTAAC GCTTACAATT TAGGTGGCAC TTTTCGGGGA 9241 AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC bla ~~~~~~~~~ 9301 ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG TATGAGTATT bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9361 CAACATTTCC GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9421 CACCCAGAAA CGCTGGTGAA AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9481 TACATCGAAC TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC CGAAGAACGT bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9541 TTTCCAATGA TGAGCACTTT TAAAGTTCTG CTATGTGGCG CGGTATTATC CCGTATTGAC bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9601 GCCGGGCAAG AGCAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT GGTTGAGTAC bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9661 TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTGCT bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9721 GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9781 AAGGAGCTAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9841 GAACCGGAGC TGAATGAAGC CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGTAGCA bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9901 ATGGCAACAA CGTTGCGCAA ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA bla 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 9961 CAATTAATAG ACTGGATGGA GGCGGATAAA GTTGCAGGAC CACTTCTGCG CTCGGCCCTT bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 10021 CCGGCTGGCT GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTC TCGCGGTATC bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 10081 ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA CACGACGGGG bla ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 10141 AGTCAGGCAA CTATGGATGA ACGAAATAGA CAGATCGCTG AGATAGGTGC CTCACTGATT bla ~~~~~~~~~ 10201 AAGCATTGGT AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT 10261 CATTTTTAAT TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT GACCAAAATC
10321 CCTTAACGTG AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT 10381 TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACCGCTA 10441 CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC 10501 TTCAGCAGAG CGCAGATACC AAATACTGTT CTTCTAGTGT AGCCGTAGTT AGGCCACCAC 10561 TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT 10621 GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT 10681 AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAGCGAACG 10741 ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA 10801 GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA GCGCACGAGG 10861 GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA 10921 CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC 10981 AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT GTTCTTTCCT 11041 GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT 11101 CGCCGCAGCC GAACGACCGA GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCCA 11161 ATACGCAAAC CGCCTCTCCC CGCGCGTTGG CCGATTCATT AATGCAGCTG GCACGACAGG 11221 TTTCCCGACT GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAGTTA GCTCACTCAT 11281 TAGGCACCCC AGGCTTTACA CTTTATGCTC CCGGCTCGTA TGTTGTGTGG AATTGTGAGC 11341 GGATAACAAT TTCACACAGG AAACAGCTAT GACCATGATT ACGCCAAGCG CGCAATTAAC 11401 CCTCACTAAA GGGAACAAAA GCTGGGTACC GGGCCCACGC GTAATACGAC TCACTATAG
VEE cap helper
5'UTR ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ nsP1 ~~~~~~~~~~~~~~~~~ 1 ATAGGCGGCG CATGAGAGAA GCCCAGACCA ATTACCTACC CAAATAGGAG AAAGTTCACG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 61 TTGACATCGA GGAAGACAGC CCATTCCTCA GAGCTTTGCA GCGGAGCTTC CCGCAGTTTG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 121 AGGTAGAAGC CAAGCAGGTC ACTGATAATG ACCATGCTAA TGCCAGAGCG TTTTCGCATC nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 TGGCTTCAAA ACTGATCGAA ACGGAGGTGG ACCCATCCGA CACGATCCTT GACATTGGAC VEECAP 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 GGACCGACCA TGTTCCCGTT CCAGCCAATG TATCCGATGC AGCCAATGCC CTATCGCAAC VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 CCGTTCGCGG CCCCGCGCAG GCCCTGGTTC CCCAGAACCG ACCCTTTTCT GGCGATGCAG VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 GTGCAGGAAT TAACCCGCTC GATGGCTAAC CTGACGTTCA AGCAACGCCG GGACGCGCCA VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 CCTGAGGGGC CATCCGCTAA GAAACCGAAG AAGGAGGCCT CGCAAAAACA GAAAGGGGGA VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 GGCCAAGGGA AGAAGAAGAA GAACCAAGGG AAGAAGAAGG CTAAGACAGG GCCGCCTAAT VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 541 CCGAAGGCAC AGAATGGAAA CAAGAAGAAG ACCAACAAGA AACCAGGCAA GAGACAGCGC VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 601 ATGGTCATGA AATTGGAATC TGACAAGACG TTCCCAATCA TGTTGGAAGG GAAGATAAAC VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ H152G ~~~ 661 GGCTACGCTT GTGTGGTCGG AGGGAAGTTA TTCAGGCCGA TGGGTGTGGA AGGCAAGATC VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 721 GACAACGACG TTCTGGCCGC GCTTAAGACG AAGAAAGCAT CCAAATACGA TCTTGAGTAT VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 781 GCAGATGTGC CACAGAACAT GCGGGCCGAT ACATTCAAAT ACACCCATGA GAAACCCCAA VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 841 GGCTATTACA GCTGGCATCA TGGAGCAGTC CAATATGAAA ATGGGCGTTT CACGGTGCCG VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 901 AAAGGAGTTG GGGCCAAGGG AGACAGCGGA CGACCCATTC TGGATAACCA GGGACGGGTG VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 961 GTCGCTATTG TGCTGGGAGG TGTGAATGAA GGATCTAGGA CAGCCCTTTC AGTCGTCATG VEECAP ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1021 TGGAACGAGA AGGGAGTTAC CGTGAAGTAT ACTCCGGAGA ACTGCGAGCA ATGGTAATAG 
VEECAP 3'UTR ~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1081 TAAGCGGCCG CATACAGCAG CAATTGGCAA GCTGCTTACA TAGAACTCGC GGCGATTGGC 3'UTR ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1141 ATGCCGCCTT AAAATTTTTA TTTTATTTTT CTTTTCTTTT CCGAATCGGA TTTTGTTTTT 3'UTR HDV ribozyme ~~~~~~~~~ 1201 AATATTTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAGGGTCGG CATGGCATCT HDV ribozyme ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1261 CCACCTCCTC GCGGTCCGAC CTGGGCATCC GAAGGAGGAC GCACGTCCAC TCGGATGGCT HDV ribozyme ~~~~~~~~~~~~~~ 1321 AAGGGAGAGC CACGTTTAAA CACGTGATAT CTGGCCTCAT GGGCCTTCCT TTCACTGCCC 1381 GCTTTCCAGT CGGGAAACCT GTCGTGCCAG CTGCATTAAC ATGGTCATAG CTGTTTCCTT 1441 GCGTATTGGG CGCTCTCCGC TTCCTCGCTC ACTGACTCGC TGCGCTCGGT CGTTCGGGTA colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1501 AAGCCTGGGG TGCCTAATGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT AAAAAGGCCG colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1561 CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA AATCGACGCT colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1621 CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT CCCCCTGGAA colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1681 GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC CGGATACCTG TCCGCCTTTC colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1741 TCCCTTCGGG AAGCGTGGCG CTTTCTCATA GCTCACGCTG TAGGTATCTC AGTTCGGTGT colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1801 AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC GACCGCTGCG colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1861 CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA TCGCCACTGG colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1921 CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT AGGCGGTGCT ACAGAGTTCT colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1981 TGAAGTGGTG 
GCCTAACTAC GGCTACACTA GAAGAACAGT ATTTGGTATC TGCGCTCTGC colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2041 TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2101 CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2161 AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA GTGGAACGAA AACTCACGTT 2221 AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC CTAGATCCTT TTAAATTAAA 2281 AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTGAC AGTTATTAGA ~~~ KanR 2341 AAAATTCATC CAGCAGACGA TAAAACGCAA TACGCTGGCT ATCCGGTGCC GCAATGCCAT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2401 ACAGCACCAG AAAACGATCC GCCCATTCGC CGCCCAGTTC TTCCGCAATA TCACGGGTGG ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2461 CCAGCGCAAT ATCCTGATAA CGATCCGCCA CGCCCAGACG GCCGCAATCA ATAAAGCCGC ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2521 TAAAACGGCC ATTTTCCACC ATAATGTTCG GCAGGCACGC ATCACCATGG GTCACCACCA ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2581 GATCTTCGCC ATCCGGCATG CTCGCTTTCA GACGCGCAAA CAGCTCTGCC GGTGCCAGGC ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2641 CCTGATGTTC TTCATCCAGA TCATCCTGAT CCACCAGGCC CGCTTCCATA CGGGTACGCG ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2701 CACGTTCAAT ACGATGTTTC GCCTGATGAT CAAACGGACA GGTCGCCGGG TCCAGGGTAT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2761 GCAGACGACG CATGGCATCC GCCATAATGC TCACTTTTTC TGCCGGCGCC AGATGGCTAG ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2821 ACAGCAGATC CTGACCCGGC ACTTCGCCCA GCAGCAGCCA ATCACGGCCC GCTTCGGTCA ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2881 CCACATCCAG CACCGCCGCA CACGGAACAC CGGTGGTGGC CAGCCAGCTC AGACGCGCCG 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 2941 CTTCATCCTG CAGCTCGTTC AGCGCACCGC TCAGATCGGT TTTCACAAAC AGCACCGGAC ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 3001 GACCCTGCGC GCTCAGACGA AACACCGCCG CATCAGAGCA GCCAATGGTC TGCTGCGCCC ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 3061 AATCATAGCC AAACAGACGT TCCACCCACG CTGCCGGGCT ACCCGCATGC AGGCCATCCT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 3121 GTTCAATCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA ~~~~~~~~~~~ KanR 3181 TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT
CCGCGCACAT 3241 TTCCCCGAAA AGTGCCACCT AAATTGTAAG CGTTAATATT TTGTTAAAAT TCGCGTTAAA 3301 TTTTTGTTAA ATCAGCTCAT TTTTTAACCA ATAGGCCGAA ATCGGCAAAA TCCCTTATAA 3361 ATCAAAAGAA TAGACCGAGA TAGGGTTGAG TGGCCGCTAC AGGGCGCTCC CATTCGCCAT 3421 TCAGGCTGCG CAACTGTTGG GAAGGGCGTT TCGGTGCGGG CCTCTTCGCT ATTACGCCAG 3481 CTGGCGAAAG GGGGATGTGC TGCAAGGCGA TTAAGTTGGG TAACGCCAGG GTTTTCCCAG T7 promoter ~~~~~~~~~~~~~~~~~~~~ 3541 TCACACGCGT AATACGACTC ACTATAG VEE gly helper 5'UTR ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ nsP1 ~~~~~~~~~~~~~~~~~ 1 ATAGGCGGCG CATGAGAGAA GCCCAGACCA ATTACCTACC CAAATAGGAG AAAGTTCACG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 61 TTGACATCGA GGAAGACAGC CCATTCCTCA GAGCTTTGCA GCGGAGCTTC CCGCAGTTTG nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 121 AGGTAGAAGC CAAGCAGGTC ACTGATAATG ACCATGCTAA TGCCAGAGCG TTTTCGCATC nsP1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 TGGCTTCAAA ACTGATCGAA ACGGAGGTGG ACCCATCCGA CACGATCCTT GACATTGGAC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 GGACCGACCA TGTCACTAGT GACCACCATG TGTCTGCTCG CCAATGTGAC GTTCCCATGT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 GCTCAACCAC CAATTTGCTA CGACAGAAAA CCAGCAGAGA CTTTGGCCAT GCTCAGCGTT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 AACGTTGACA ACCCGGGCTA CGATGAGCTG CTGGAAGCAG CTGTTAAGTG CCCCGGAAGG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 AAAAGGAGAT CCACCGAGGA GCTGTTTAAT GAGTATAAGC TAACGCGCCC TTACATGGCC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 AGATGCATCA GATGTGCAGT TGGGAGCTGC CATAGTCCAA TAGCAATCGA GGCAGTAAAG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 541 AGCGACGGGC ACGACGGTTA TGTTAGACTT CAGACTTCCT CGCAGTATGG CCTGGATTCC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 601 TCCGGCAACT TAAAGGGCAG GACCATGCGG TATGACATGC 
ACGGGACCAT TAAAGAGATA VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 661 CCACTACATC AAGTGTCACT CTATACATCT CGCCCGTGTC ACATTGTGGA TGGGCACGGT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 721 TATTTCCTGC TTGCCAGGTG CCCGGCAGGG GACTCCATCA CCATGGAATT TAAGAAAGAT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 781 TCCGTCAGAC ACTCCTGCTC GGTGCCGTAT GAAGTGAAAT TTAATCCTGT AGGCAGAGAA VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 841 CTCTATACTC ATCCCCCAGA ACACGGAGTA GAGCAAGCGT GCCAAGTCTA CGCACATGAT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 901 GCACAGAACA GAGGAGCTTA TGTCGAGATG CACCTCCCGG GCTCAGAAGT GGACAGCAGT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 961 TTGGTTTCCT TGAGCGGCAG TTCAGTCACC GTGACACCTC CTGATGGGAC TAGCGCCCTG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1021 GTGGAATGCG AGTGTGGCGG CACAAAGATC TCCGAGACCA TCAACAAGAC AAAACAGTTC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1081 AGCCAGTGCA CAAAGAAGGA GCAGTGCAGA GCATATCGGC TGCAGAACGA TAAGTGGGTG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1141 TATAATTCTG ACAAACTGCC CAAAGCAGCG GGAGCCACCT TAAAAGGAAA ACTGCATGTC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1201 CCATTCTTGC TGGCAGACGG CAAATGCACC GTGCCTCTAG CACCAGAACC TATGATAACC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1261 TTCGGTTTCA GATCAGTGTC ACTGAAACTG CACCCTAAGA ATCCCACATA TCTAATCACC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1321 CGCCAACTTG CTGATGAGCC TCACTACACG CACGAGCTCA TATCTGAACC AGCTGTTAGG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1381 AATTTTACCG TCACCGAAAA AGGGTGGGAG TTTGTATGGG GAAACCACCC GCCGAAAAGG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1441 TTTTGGGCAC 
AGGAAACAGC ACCCGGAAAT CCACATGGGC TACCGCACGA GGTGATAACT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1501 CATTATTACC ACAGATACCC TATGTCCACC ATCCTGGGTT TGTCAATTTG TGCCGCCATT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1561 GCAACCGTTT CCGTTGCAGC GTCTACCTGG CTGTTTTGCA GATCTAGAGT TGCGTGCCTA VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1621 ACTCCTTACC GGCTAACACC TAACGCTAGG ATACCATTTT GTCTGGCTGT GCTTTGCTGC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1681 GCCCGCACTG CCCGGGCCGA GACCACCTGG GAGTCCTTGG ATCACCTATG GAACAATAAC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1741 CAACAGATGT TCTGGATTCA ATTGCTGATC CCTCTGGCCG CCTTGATCGT AGTGACTCGC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1801 CTGCTCAGGT GCGTGTGCTG TGTCGTGCCT TTTTTAGTCA TGGCCGGCGC CGCAGGCGCC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1861 GGCGCCTACG AGCACGCGAC CACGATGCCG AGCCAAGCGG GAATCTCGTA TAACACTATA VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1921 GTCAACAGAG CAGGCTACGC ACCACTCCCT ATCAGCATAA CACCAACAAA GATCAAGCTG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1981 ATACCTACAG TGAACTTGGA GTACGTCACC TGCCACTACA AAACAGGAAT GGATTCACCA VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2041 GCCATCAAAT GCTGCGGATC TCAGGAATGC ACTCCAACTT ACAGGCCTGA TGAACAGTGC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2101 AAAGTCTTCA CAGGGGTTTA CCCGTTCATG TGGGGTGGTG CATATTGCTT TTGCGACACT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2161 GAGAACACCC AAGTCAGCAA GGCCTACGTA ATGAAATCTG ACGACTGCCT TGCGGATCAT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2221 GCTGAAGCAT ATAAAGCGCA CACAGCCTCA GTGCAGGCGT TCCTCAACAT CACAGTGGGA VEE GLY 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2281 GAACACTCTA TTGTGACTAC CGTGTATGTG AATGGAGAAA CTCCTGTGAA TTTCAATGGG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2341 GTCAAAATAA CTGCAGGTCC GCTTTCCACA GCTTGGACAC CCTTTGATCG CAAAATCGTG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2401 CAGTATGCCG GGGAGATCTA TAATTATGAT TTTCCTGAGT ATGGGGCAGG ACAACCAGGA VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2461 GCATTTGGAG ATATACAATC CAGAACAGTC TCAAGCTCTG ATCTGTATGC CAATACCAAC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2521 CTAGTGCTGC AGAGACCCAA AGCAGGAGCG ATCCACGTGC CATACACTCA GGCACCTTCG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2581 GGTTTTGAGC AATGGAAGAA AGATAAAGCT CCATCATTGA AATTTACCGC CCCTTTCGGA VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2641 TGCGAAATAT ATACAAACCC CATTCGCGCC GAAAACTGTG CTGTAGGGTC AATTCCATTA VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2701 GCCTTTGACA TTCCCGACGC CTTGTTCACC AGGGTGTCAG AAACACCGAC ACTTTCAGCG VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2761 GCCGAATGCA CTCTTAACGA GTGCGTGTAT TCTTCCGACT TTGGTGGGAT CGCCACGGTC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2821 AAGTACTCGG CCAGCAAGTC AGGCAAGTGC GCAGTCCATG TGCCATCAGG GACTGCTACC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2881 CTAAAAGAAG CAGCAGTCGA GCTAACCGAG CAAGGGTCGG CGACTATCCA TTTCTCGACC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2941 GCAAATATCC ACCCGGAGTT CAGGCTCCAA ATATGCACAT CATATGTTAC GTGCAAAGGT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3001 GATTGTCACC CCCCGAAAGA CCATATTGTG ACACACCCTC AGTATCACGC CCAAACATTT VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3061 ACAGCCGCGG TGTCAAAAAC CGCGTGGACG TGGTTAACAT 
CCCTGCTGGG AGGATCAGCC VEE GLY ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3121 GTAATTATTA TAATTGGCTT GGTGCTGGCT ACTATTGTGG CCATGTACGT GCTGACCAAC VEE GLY 3'UTR ~~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3181 CAGAAACATA ATTAATAGTA AGCGGCCGCA TACAGCAGCA ATTGGCAAGC TGCTTACATA 3'UTR ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3241 GAACTCGCGG CGATTGGCAT GCCGCCTTAA AATTTTTATT TTATTTTTCT TTTCTTTTCC 3'UTR ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3301 GAATCGGATT TTGTTTTTAA TATTTCAAAA AAAAAAAAAA AAAA AAAAAAAAAA HDV ribozyme ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3361 AGGGTCGGCA TGGCATCTCC ACCTCCTCGC GGTCCGACCT GGGCATCCGA AGGAGGACGC HDV ribozyme ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3421 ACGTCCACTC GGATGGCTAA GGGAGAGCCA CGTTTAAACA CGTGATATCT GGCCTCATGG 3481 GCCTTCCTTT CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT GCATTAACAT
3541 GGTCATAGCT GTTTCCTTGC GTATTGGGCG CTCTCCGCTT CCTCGCTCAC TGACTCGCTG colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3601 CGCTCGGTCG TTCGGGTAAA GCCTGGGGTG CCTAATGAGC AAAAGGCCAG CAAAAGGCCA colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3661 GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3721 ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACAGGACTA TAAAGATACC colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3781 AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3841 GATACCTGTC CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCATAGC TCACGCTGTA colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3901 GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 3961 TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC CCGGTAAGAC colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4021 ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT TAGCAGAGCG AGGTATGTAG colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4081 GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG CTACACTAGA AGAACAGTAT colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4141 TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4201 CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC colE1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 4261 GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT GACGCTCAGT 4321 GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT ATCAAAAAGG ATCTTCACCT 4381 AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATCTA AAGTATATAT GAGTAAACTT 4441 GGTCTGACAG TTATTAGAAA AATTCATCCA GCAGACGATA AAACGCAATA CGCTGGCTAT 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 4501 CCGGTGCCGC AATGCCATAC AGCACCAGAA AACGATCCGC CCATTCGCCG CCCAGTTCTT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 4561 CCGCAATATC ACGGGTGGCC AGCGCAATAT CCTGATAACG ATCCGCCACG CCCAGACGGC ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 4621 CGCAATCAAT AAAGCCGCTA AAACGGCCAT TTTCCACCAT AATGTTCGGC AGGCACGCAT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 4681 CACCATGGGT CACCACCAGA TCTTCGCCAT CCGGCATGCT CGCTTTCAGA CGCGCAAACA ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 4741 GCTCTGCCGG TGCCAGGCCC TGATGTTCTT CATCCAGATC ATCCTGATCC ACCAGGCCCG ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 4801 CTTCCATACG GGTACGCGCA CGTTCAATAC GATGTTTCGC CTGATGATCA AACGGACAGG ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 4861 TCGCCGGGTC CAGGGTATGC AGACGACGCA TGGCATCCGC CATAATGCTC ACTTTTTCTG ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 4921 CCGGCGCCAG ATGGCTAGAC AGCAGATCCT GACCCGGCAC TTCGCCCAGC AGCAGCCAAT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 4981 CACGGCCCGC TTCGGTCACC ACATCCAGCA CCGCCGCACA CGGAACACCG GTGGTGGCCA ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 5041 GCCAGCTCAG ACGCGCCGCT TCATCCTGCA GCTCGTTCAG CGCACCGCTC AGATCGGTTT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 5101 TCACAAACAG CACCGGACGA CCCTGCGCGC TCAGACGAAA CACCGCCGCA TCAGAGCAGC ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 5161 CAATGGTCTG CTGCGCCCAA TCATAGCCAA ACAGACGTTC CACCCACGCT GCCGGGCTAC ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 5221 CCGCATGCAG GCCATCCTGT TCAATCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ KanR 5281 ATCAGGGTTA TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA 5341 TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTAA 
ATTGTAAGCG TTAATATTTT 5401 GTTAAAATTC GCGTTAAATT TTTGTTAAAT CAGCTCATTT TTTAACCAAT AGGCCGAAAT 5461 CGGCAAAATC CCTTATAAAT CAAAAGAATA GACCGAGATA GGGTTGAGTG GCCGCTACAG 5521 GGCGCTCCCA TTCGCCATTC AGGCTGCGCA ACTGTTGGGA AGGGCGTTTC GGTGCGGGCC 5581 TCTTCGCTAT TACGCCAGCT GGCGAAAGGG GGATGTGCTG CAAGGCGATT AAGTTGGGTA T7 promoter ~~~~~~~~~~~~~~~~~~~~ 5641 ACGCCAGGGT TTTCCCAGTC ACACGCGTAA TACGACTCAC TATAG
SEQUENCE LISTING
Number of SEQ ID NOs: 85

SEQ ID NO: 1
Length: 24; Type: DNA; Organism: Venezuelan equine encephalitis virus
ctctctacgg ctaacctgaa tgga

SEQ ID NO: 2
Length: 8; Type: PRT; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic consensus sequence"
VARIANT (2)..(2) /replace="Ile"
MOD_RES (4)..(4) Any amino acid
Asp Val Glu Xaa Asn Pro Gly Pro

SEQ ID NO: 3
Length: 8; Type: PRT; Organism: Foot and mouth disease virus
Asp Val Glu Ser Asn Pro Gly Pro

SEQ ID NO: 4
Length: 100; Type: PRT; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polypeptide"
misc_feature (1)..(100) /note="This sequence may encompass 3 to 100 'Lys' residues"
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys

SEQ ID NO: 5
Length: 3; Type: PRT; Organism: Unknown
source /note="Description of Unknown integrin receptor-binding moiety peptide"
Arg Gly Asp

SEQ ID NO: 6
Length: 6; Type: PRT; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic 6xHis tag"
His His His His His His

SEQ ID NO: 7
Length: 6; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
gggccc

SEQ ID NO: 8
Length: 6; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
ggcgcc

SEQ ID NO: 9
Length: 28; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
ctcgatgtac ttccgaggaa ctgatgtg

SEQ ID NO: 10
Length: 28; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
atcgatgtac ttccgaggaa ctcacgtg

SEQ ID NO: 11
Length: 6; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
gtcgac

SEQ ID NO: 12
Length: 6; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
gtctac

SEQ ID NO: 13
Length: 6; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
tctaga

SEQ ID NO: 14
Length: 6; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
tcaaga

SEQ ID NO: 15
Length: 14; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
ctggatatct gcag

SEQ ID NO: 16
Length: 14; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
atcgatatcc gcgg

SEQ ID NO: 17
Length: 78; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
ctataactct ctacggctaa cctgaatgga ctacgacata gtctagtcga ccaagcctct
agacggcgcg cccaccca

SEQ ID NO: 18
Length: 38; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic primer"
ataagaatgc ggccgcctat aactctctac ggctaacc

SEQ ID NO: 19
Length: 28; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic primer"
ccatcgattg ggtgggcgcg ccgtctag

SEQ ID NO: 20
Length: 30; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic primer"
ccatcgatct ataactctct acggctaacc

SEQ ID NO: 21
Length: 29; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic primer"
tccccgcggt gggtgggcgc gccgtctag

SEQ ID NO: 22
Length: 12; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
ccactgtgat cg

SEQ ID NO: 23
Length: 6; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
cacgtg

SEQ ID NO: 24
Length: 6; Type: DNA; Organism: Artificial Sequence
source /note="Description of Artificial Sequence Synthetic oligonucleotide"
actgtg

SEQ ID NO: 25
Length: 2727; Type: DNA; Organism: Human herpesvirus 5
atggaaagcc ggatctggtg cctggtcgtg tgcgtgaacc tgtgcatcgt
gtgcctggga 60gccgccgtga gcagcagcag caccagaggc accagcgcca cacacagcca
ccacagcagc 120cacaccacct ctgccgccca cagcagatcc ggcagcgtgt cccagagagt
gaccagcagc 180cagaccgtgt cccacggcgt gaacgagaca atctacaaca ccaccctgaa
gtacggcgac 240gtcgtgggcg tgaataccac caagtacccc tacagagtgt gcagcatggc
ccagggcacc 300gacctgatca gattcgagcg gaacatcgtg tgcaccagca tgaagcccat
caacgaggac 360ctggacgagg gcatcatggt ggtgtacaag agaaacatcg tggcccacac
cttcaaagtg 420cgggtgtacc agaaggtgct gaccttccgg cggagctacg cctacatcca
caccacatac 480ctgctgggca gcaacaccga gtacgtggcc cctcccatgt gggagatcca
ccacatcaac 540agccacagcc agtgctacag cagctacagc cgcgtgatcg ccggcacagt
gttcgtggcc 600taccaccggg acagctacga gaacaagacc atgcagctga tgcccgacga
ctacagcaac 660acccacagca ccagatacgt gaccgtgaag gaccagtggc acagcagagg
cagcacctgg 720ctgtaccggg agacatgcaa cctgaactgc atggtcacca tcaccaccgc
cagaagcaag 780tacccttacc acttcttcgc cacctccacc ggcgacgtgg tggacatcag
ccccttctac 840aacggcacca accggaacgc cagctacttc ggcgagaacg ccgacaagtt
cttcatcttc 900cccaactaca ccatcgtgtc cgacttcggc agacccaaca gcgctctgga
aacccacaga 960ctggtggcct ttctggaacg ggccgacagc gtgatcagct gggacatcca
ggacgagaag 1020aacgtgacct gccagctgac cttctgggag gcctctgaga gaaccatcag
aagcgaggcc 1080gaggacagct accacttcag cagcgccaag atgaccgcca ccttcctgag
caagaaacag 1140gaagtgaaca tgagcgactc cgccctggac tgcgtgaggg acgaggccat
caacaagctg 1200cagcagatct tcaacaccag ctacaaccag acctacgaga agtatggcaa
tgtgtccgtg 1260ttcgagacaa caggcggcct ggtggtgttc tggcagggca tcaagcagaa
aagcctggtg 1320gagctggaac ggctcgccaa ccggtccagc ctgaacctga cccacaaccg
gaccaagcgg 1380agcaccgacg gcaacaacgc aacccacctg tccaacatgg aaagcgtgca
caacctggtg 1440tacgcacagc tgcagttcac ctacgacacc ctgcggggct acatcaacag
agccctggcc 1500cagatcgccg aggcttggtg cgtggaccag cggcggaccc tggaagtgtt
caaagagctg 1560tccaagatca accccagcgc catcctgagc gccatctaca acaagcctat
cgccgccaga 1620ttcatgggcg acgtgctggg cctggccagc tgcgtgacca tcaaccagac
cagcgtgaag 1680gtgctgcggg acatgaacgt gaaagagagc ccaggccgct gctactccag
acccgtggtc 1740atcttcaact tcgccaacag ctcctacgtg cagtacggcc agctgggcga
ggacaacgag 1800atcctgctgg ggaaccaccg gaccgaggaa tgccagctgc ccagcctgaa
gatctttatc 1860gccggcaaca gcgcctacga gtatgtggac tacctgttca agcggatgat
cgacctgagc 1920agcatctcca ccgtggacag catgatcgcc ctggacatcg accccctgga
aaacaccgac 1980ttccgggtgc tggaactgta cagccagaaa gagctgcgga gcagcaacgt
gttcgacctg 2040gaagagatca tgcgggagtt caacagctac aagcagcgcg tgaaatacgt
ggaggacaag 2100gtggtggacc ccctgcctcc ttacctgaag ggcctggacg acctgatgag
cggactgggc 2160gctgccggaa aagccgtggg agtggccatt ggagctgtgg gcggagctgt
ggcctctgtc 2220gtggaaggcg tcgccacctt tctgaagaac cccttcggcg ccttcaccat
catcctggtg 2280gccattgccg tcgtgatcat cacctacctg atctacaccc ggcagcggag
actgtgtacc 2340cagcccctgc agaacctgtt cccctacctg gtgtccgccg atggcaccac
agtgaccagc 2400ggctccacca aggataccag cctgcaggcc ccacccagct acgaagagag
cgtgtacaac 2460agcggcagaa agggccctgg ccctcccagc tctgatgcca gcacagccgc
ccctccctac 2520accaacgagc aggcctacca gatgctgctg gccctggcta gactggatgc
cgagcagagg 2580gcccagcaga acggcaccga cagcctggat ggcagaaccg gcacccagga
caagggccag 2640aagcccaacc tgctggaccg gctgcggcac cggaagaacg gctaccggca
cctgaaggac 2700agcgacgagg aagagaacgt ctgataa
2727

SEQ ID NO: 26
Length: 907; Type: PRT; Organism: Human herpesvirus 5
Met Glu Ser Arg Ile Trp Cys
Leu Val Val Cys Val Asn Leu Cys Ile1 5 10
15Val Cys Leu Gly Ala Ala Val Ser Ser Ser Ser Thr Arg
Gly Thr Ser 20 25 30Ala Thr
His Ser His His Ser Ser His Thr Thr Ser Ala Ala His Ser 35
40 45Arg Ser Gly Ser Val Ser Gln Arg Val Thr
Ser Ser Gln Thr Val Ser 50 55 60His
Gly Val Asn Glu Thr Ile Tyr Asn Thr Thr Leu Lys Tyr Gly Asp65
70 75 80Val Val Gly Val Asn Thr
Thr Lys Tyr Pro Tyr Arg Val Cys Ser Met 85
90 95Ala Gln Gly Thr Asp Leu Ile Arg Phe Glu Arg Asn
Ile Val Cys Thr 100 105 110Ser
Met Lys Pro Ile Asn Glu Asp Leu Asp Glu Gly Ile Met Val Val 115
120 125Tyr Lys Arg Asn Ile Val Ala His Thr
Phe Lys Val Arg Val Tyr Gln 130 135
140Lys Val Leu Thr Phe Arg Arg Ser Tyr Ala Tyr Ile His Thr Thr Tyr145
150 155 160Leu Leu Gly Ser
Asn Thr Glu Tyr Val Ala Pro Pro Met Trp Glu Ile 165
170 175His His Ile Asn Ser His Ser Gln Cys Tyr
Ser Ser Tyr Ser Arg Val 180 185
190Ile Ala Gly Thr Val Phe Val Ala Tyr His Arg Asp Ser Tyr Glu Asn
195 200 205Lys Thr Met Gln Leu Met Pro
Asp Asp Tyr Ser Asn Thr His Ser Thr 210 215
220Arg Tyr Val Thr Val Lys Asp Gln Trp His Ser Arg Gly Ser Thr
Trp225 230 235 240Leu Tyr
Arg Glu Thr Cys Asn Leu Asn Cys Met Val Thr Ile Thr Thr
245 250 255Ala Arg Ser Lys Tyr Pro Tyr
His Phe Phe Ala Thr Ser Thr Gly Asp 260 265
270Val Val Asp Ile Ser Pro Phe Tyr Asn Gly Thr Asn Arg Asn
Ala Ser 275 280 285Tyr Phe Gly Glu
Asn Ala Asp Lys Phe Phe Ile Phe Pro Asn Tyr Thr 290
295 300Ile Val Ser Asp Phe Gly Arg Pro Asn Ser Ala Leu
Glu Thr His Arg305 310 315
320Leu Val Ala Phe Leu Glu Arg Ala Asp Ser Val Ile Ser Trp Asp Ile
325 330 335Gln Asp Glu Lys Asn
Val Thr Cys Gln Leu Thr Phe Trp Glu Ala Ser 340
345 350Glu Arg Thr Ile Arg Ser Glu Ala Glu Asp Ser Tyr
His Phe Ser Ser 355 360 365Ala Lys
Met Thr Ala Thr Phe Leu Ser Lys Lys Gln Glu Val Asn Met 370
375 380Ser Asp Ser Ala Leu Asp Cys Val Arg Asp Glu
Ala Ile Asn Lys Leu385 390 395
400Gln Gln Ile Phe Asn Thr Ser Tyr Asn Gln Thr Tyr Glu Lys Tyr Gly
405 410 415Asn Val Ser Val
Phe Glu Thr Thr Gly Gly Leu Val Val Phe Trp Gln 420
425 430Gly Ile Lys Gln Lys Ser Leu Val Glu Leu Glu
Arg Leu Ala Asn Arg 435 440 445Ser
Ser Leu Asn Leu Thr His Asn Arg Thr Lys Arg Ser Thr Asp Gly 450
455 460Asn Asn Ala Thr His Leu Ser Asn Met Glu
Ser Val His Asn Leu Val465 470 475
480Tyr Ala Gln Leu Gln Phe Thr Tyr Asp Thr Leu Arg Gly Tyr Ile
Asn 485 490 495Arg Ala Leu
Ala Gln Ile Ala Glu Ala Trp Cys Val Asp Gln Arg Arg 500
505 510Thr Leu Glu Val Phe Lys Glu Leu Ser Lys
Ile Asn Pro Ser Ala Ile 515 520
525Leu Ser Ala Ile Tyr Asn Lys Pro Ile Ala Ala Arg Phe Met Gly Asp 530
535 540Val Leu Gly Leu Ala Ser Cys Val
Thr Ile Asn Gln Thr Ser Val Lys545 550
555 560Val Leu Arg Asp Met Asn Val Lys Glu Ser Pro Gly
Arg Cys Tyr Ser 565 570
575Arg Pro Val Val Ile Phe Asn Phe Ala Asn Ser Ser Tyr Val Gln Tyr
580 585 590Gly Gln Leu Gly Glu Asp
Asn Glu Ile Leu Leu Gly Asn His Arg Thr 595 600
605Glu Glu Cys Gln Leu Pro Ser Leu Lys Ile Phe Ile Ala Gly
Asn Ser 610 615 620Ala Tyr Glu Tyr Val
Asp Tyr Leu Phe Lys Arg Met Ile Asp Leu Ser625 630
635 640Ser Ile Ser Thr Val Asp Ser Met Ile Ala
Leu Asp Ile Asp Pro Leu 645 650
655Glu Asn Thr Asp Phe Arg Val Leu Glu Leu Tyr Ser Gln Lys Glu Leu
660 665 670Arg Ser Ser Asn Val
Phe Asp Leu Glu Glu Ile Met Arg Glu Phe Asn 675
680 685Ser Tyr Lys Gln Arg Val Lys Tyr Val Glu Asp Lys
Val Val Asp Pro 690 695 700Leu Pro Pro
Tyr Leu Lys Gly Leu Asp Asp Leu Met Ser Gly Leu Gly705
710 715 720Ala Ala Gly Lys Ala Val Gly
Val Ala Ile Gly Ala Val Gly Gly Ala 725
730 735Val Ala Ser Val Val Glu Gly Val Ala Thr Phe Leu
Lys Asn Pro Phe 740 745 750Gly
Ala Phe Thr Ile Ile Leu Val Ala Ile Ala Val Val Ile Ile Thr 755
760 765Tyr Leu Ile Tyr Thr Arg Gln Arg Arg
Leu Cys Thr Gln Pro Leu Gln 770 775
780Asn Leu Phe Pro Tyr Leu Val Ser Ala Asp Gly Thr Thr Val Thr Ser785
790 795 800Gly Ser Thr Lys
Asp Thr Ser Leu Gln Ala Pro Pro Ser Tyr Glu Glu 805
810 815Ser Val Tyr Asn Ser Gly Arg Lys Gly Pro
Gly Pro Pro Ser Ser Asp 820 825
830Ala Ser Thr Ala Ala Pro Pro Tyr Thr Asn Glu Gln Ala Tyr Gln Met
835 840 845Leu Leu Ala Leu Ala Arg Leu
Asp Ala Glu Gln Arg Ala Gln Gln Asn 850 855
860Gly Thr Asp Ser Leu Asp Gly Arg Thr Gly Thr Gln Asp Lys Gly
Gln865 870 875 880Lys Pro
Asn Leu Leu Asp Arg Leu Arg His Arg Lys Asn Gly Tyr Arg
885 890 895His Leu Lys Asp Ser Asp Glu
Glu Glu Asn Val 900 905

SEQ ID NO: 27
Length: 2256; Type: DNA; Organism: Human herpesvirus 5
atggaaagcc ggatctggtg cctggtcgtg tgcgtgaacc tgtgcatcgt
gtgcctggga 60gccgccgtga gcagcagcag caccagaggc accagcgcca cacacagcca
ccacagcagc 120cacaccacct ctgccgccca cagcagatcc ggcagcgtgt cccagagagt
gaccagcagc 180cagaccgtgt cccacggcgt gaacgagaca atctacaaca ccaccctgaa
gtacggcgac 240gtcgtgggcg tgaataccac caagtacccc tacagagtgt gcagcatggc
ccagggcacc 300gacctgatca gattcgagcg gaacatcgtg tgcaccagca tgaagcccat
caacgaggac 360ctggacgagg gcatcatggt ggtgtacaag agaaacatcg tggcccacac
cttcaaagtg 420cgggtgtacc agaaggtgct gaccttccgg cggagctacg cctacatcca
caccacatac 480ctgctgggca gcaacaccga gtacgtggcc cctcccatgt gggagatcca
ccacatcaac 540agccacagcc agtgctacag cagctacagc cgcgtgatcg ccggcacagt
gttcgtggcc 600taccaccggg acagctacga gaacaagacc atgcagctga tgcccgacga
ctacagcaac 660acccacagca ccagatacgt gaccgtgaag gaccagtggc acagcagagg
cagcacctgg 720ctgtaccggg agacatgcaa cctgaactgc atggtcacca tcaccaccgc
cagaagcaag 780tacccttacc acttcttcgc cacctccacc ggcgacgtgg tggacatcag
ccccttctac 840aacggcacca accggaacgc cagctacttc ggcgagaacg ccgacaagtt
cttcatcttc 900cccaactaca ccatcgtgtc cgacttcggc agacccaaca gcgctctgga
aacccacaga 960ctggtggcct ttctggaacg ggccgacagc gtgatcagct gggacatcca
ggacgagaag 1020aacgtgacct gccagctgac cttctgggag gcctctgaga gaaccatcag
aagcgaggcc 1080gaggacagct accacttcag cagcgccaag atgaccgcca ccttcctgag
caagaaacag 1140gaagtgaaca tgagcgactc cgccctggac tgcgtgaggg acgaggccat
caacaagctg 1200cagcagatct tcaacaccag ctacaaccag acctacgaga agtatggcaa
tgtgtccgtg 1260ttcgagacaa caggcggcct ggtggtgttc tggcagggca tcaagcagaa
aagcctggtg 1320gagctggaac ggctcgccaa ccggtccagc ctgaacctga cccacaaccg
gaccaagcgg 1380agcaccgacg gcaacaacgc aacccacctg tccaacatgg aaagcgtgca
caacctggtg 1440tacgcacagc tgcagttcac ctacgacacc ctgcggggct acatcaacag
agccctggcc 1500cagatcgccg aggcttggtg cgtggaccag cggcggaccc tggaagtgtt
caaagagctg 1560tccaagatca accccagcgc catcctgagc gccatctaca acaagcctat
cgccgccaga 1620ttcatgggcg acgtgctggg cctggccagc tgcgtgacca tcaaccagac
cagcgtgaag 1680gtgctgcggg acatgaacgt gaaagagagc ccaggccgct gctactccag
acccgtggtc 1740atcttcaact tcgccaacag ctcctacgtg cagtacggcc agctgggcga
ggacaacgag 1800atcctgctgg ggaaccaccg gaccgaggaa tgccagctgc ccagcctgaa
gatctttatc 1860gccggcaaca gcgcctacga gtatgtggac tacctgttca agcggatgat
cgacctgagc 1920agcatctcca ccgtggacag catgatcgcc ctggacatcg accccctgga
aaacaccgac 1980ttccgggtgc tggaactgta cagccagaaa gagctgcgga gcagcaacgt
gttcgacctg 2040gaagagatca tgcgggagtt caacagctac aagcagcgcg tgaaatacgt
ggaggacaag 2100gtggtggacc ccctgcctcc ttacctgaag ggcctggacg acctgatgag
cggactgggc 2160gctgccggaa aagccgtggg agtggccatt ggagctgtgg gcggagctgt
ggcctctgtc 2220gtggaaggcg tcgccacctt tctgaagaac tgataa
2256

SEQ ID NO: 28
Length: 750; Type: PRT; Organism: Human herpesvirus 5
Met Glu Ser Arg Ile Trp Cys
Leu Val Val Cys Val Asn Leu Cys Ile1 5 10
15Val Cys Leu Gly Ala Ala Val Ser Ser Ser Ser Thr Arg
Gly Thr Ser 20 25 30Ala Thr
His Ser His His Ser Ser His Thr Thr Ser Ala Ala His Ser 35
40 45Arg Ser Gly Ser Val Ser Gln Arg Val Thr
Ser Ser Gln Thr Val Ser 50 55 60His
Gly Val Asn Glu Thr Ile Tyr Asn Thr Thr Leu Lys Tyr Gly Asp65
70 75 80Val Val Gly Val Asn Thr
Thr Lys Tyr Pro Tyr Arg Val Cys Ser Met 85
90 95Ala Gln Gly Thr Asp Leu Ile Arg Phe Glu Arg Asn
Ile Val Cys Thr 100 105 110Ser
Met Lys Pro Ile Asn Glu Asp Leu Asp Glu Gly Ile Met Val Val 115
120 125Tyr Lys Arg Asn Ile Val Ala His Thr
Phe Lys Val Arg Val Tyr Gln 130 135
140Lys Val Leu Thr Phe Arg Arg Ser Tyr Ala Tyr Ile His Thr Thr Tyr145
150 155 160Leu Leu Gly Ser
Asn Thr Glu Tyr Val Ala Pro Pro Met Trp Glu Ile 165
170 175His His Ile Asn Ser His Ser Gln Cys Tyr
Ser Ser Tyr Ser Arg Val 180 185
190Ile Ala Gly Thr Val Phe Val Ala Tyr His Arg Asp Ser Tyr Glu Asn
195 200 205Lys Thr Met Gln Leu Met Pro
Asp Asp Tyr Ser Asn Thr His Ser Thr 210 215
220Arg Tyr Val Thr Val Lys Asp Gln Trp His Ser Arg Gly Ser Thr
Trp225 230 235 240Leu Tyr
Arg Glu Thr Cys Asn Leu Asn Cys Met Val Thr Ile Thr Thr
245 250 255Ala Arg Ser Lys Tyr Pro Tyr
His Phe Phe Ala Thr Ser Thr Gly Asp 260 265
270Val Val Asp Ile Ser Pro Phe Tyr Asn Gly Thr Asn Arg Asn
Ala Ser 275 280 285Tyr Phe Gly Glu
Asn Ala Asp Lys Phe Phe Ile Phe Pro Asn Tyr Thr 290
295 300Ile Val Ser Asp Phe Gly Arg Pro Asn Ser Ala Leu
Glu Thr His Arg305 310 315
320Leu Val Ala Phe Leu Glu Arg Ala Asp Ser Val Ile Ser Trp Asp Ile
325 330 335Gln Asp Glu Lys Asn
Val Thr Cys Gln Leu Thr Phe Trp Glu Ala Ser 340
345 350Glu Arg Thr Ile Arg Ser Glu Ala Glu Asp Ser Tyr
His Phe Ser Ser 355 360 365Ala Lys
Met Thr Ala Thr Phe Leu Ser Lys Lys Gln Glu Val Asn Met 370
375 380Ser Asp Ser Ala Leu Asp Cys Val Arg Asp Glu
Ala Ile Asn Lys Leu385 390 395
400Gln Gln Ile Phe Asn Thr Ser Tyr Asn Gln Thr Tyr Glu Lys Tyr Gly
405 410 415Asn Val Ser Val
Phe Glu Thr Thr Gly Gly Leu Val Val Phe Trp Gln 420
425 430Gly Ile Lys Gln Lys Ser Leu Val Glu Leu Glu
Arg Leu Ala Asn Arg 435 440 445Ser
Ser Leu Asn Leu Thr His Asn Arg Thr Lys Arg Ser Thr Asp Gly 450
455 460Asn Asn Ala Thr His Leu Ser Asn Met Glu
Ser Val His Asn Leu Val465 470 475
480Tyr Ala Gln Leu Gln Phe Thr Tyr Asp Thr Leu Arg Gly Tyr Ile
Asn 485 490 495Arg Ala Leu
Ala Gln Ile Ala Glu Ala Trp Cys Val Asp Gln Arg Arg 500
505 510Thr Leu Glu Val Phe Lys Glu Leu Ser Lys
Ile Asn Pro Ser Ala Ile 515 520
525Leu Ser Ala Ile Tyr Asn Lys Pro Ile Ala Ala Arg Phe Met Gly Asp 530
535 540Val Leu Gly Leu Ala Ser Cys Val
Thr Ile Asn Gln Thr Ser Val Lys545 550
555 560Val Leu Arg Asp Met Asn Val Lys Glu Ser Pro Gly
Arg Cys Tyr Ser 565 570
575Arg Pro Val Val Ile Phe Asn Phe Ala Asn Ser Ser Tyr Val Gln Tyr
580 585 590Gly Gln Leu Gly Glu Asp
Asn Glu Ile Leu Leu Gly Asn His Arg Thr 595 600
605Glu Glu Cys Gln Leu Pro Ser Leu Lys Ile Phe Ile Ala Gly
Asn Ser 610 615 620Ala Tyr Glu Tyr Val
Asp Tyr Leu Phe Lys Arg Met Ile Asp Leu Ser625 630
635 640Ser Ile Ser Thr Val Asp Ser Met Ile Ala
Leu Asp Ile Asp Pro Leu 645 650
655Glu Asn Thr Asp Phe Arg Val Leu Glu Leu Tyr Ser Gln Lys Glu Leu
660 665 670Arg Ser Ser Asn Val
Phe Asp Leu Glu Glu Ile Met Arg Glu Phe Asn 675
680 685Ser Tyr Lys Gln Arg Val Lys Tyr Val Glu Asp Lys
Val Val Asp Pro 690 695 700Leu Pro Pro
Tyr Leu Lys Gly Leu Asp Asp Leu Met Ser Gly Leu Gly705
710 715 720Ala Ala Gly Lys Ala Val Gly
Val Ala Ile Gly Ala Val Gly Gly Ala 725
730 735Val Ala Ser Val Val Glu Gly Val Ala Thr Phe Leu
Lys Asn 740 745
750

<210> SEQ ID NO 29
<211> LENGTH: 2082
<212> TYPE: DNA
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 29
atggaaagcc ggatctggtg cctggtcgtg
tgcgtgaacc tgtgcatcgt gtgcctggga 60gccgccgtga gcagcagcag caccagaggc
accagcgcca cacacagcca ccacagcagc 120cacaccacct ctgccgccca cagcagatcc
ggcagcgtgt cccagagagt gaccagcagc 180cagaccgtgt cccacggcgt gaacgagaca
atctacaaca ccaccctgaa gtacggcgac 240gtcgtgggcg tgaataccac caagtacccc
tacagagtgt gcagcatggc ccagggcacc 300gacctgatca gattcgagcg gaacatcgtg
tgcaccagca tgaagcccat caacgaggac 360ctggacgagg gcatcatggt ggtgtacaag
agaaacatcg tggcccacac cttcaaagtg 420cgggtgtacc agaaggtgct gaccttccgg
cggagctacg cctacatcca caccacatac 480ctgctgggca gcaacaccga gtacgtggcc
cctcccatgt gggagatcca ccacatcaac 540agccacagcc agtgctacag cagctacagc
cgcgtgatcg ccggcacagt gttcgtggcc 600taccaccggg acagctacga gaacaagacc
atgcagctga tgcccgacga ctacagcaac 660acccacagca ccagatacgt gaccgtgaag
gaccagtggc acagcagagg cagcacctgg 720ctgtaccggg agacatgcaa cctgaactgc
atggtcacca tcaccaccgc cagaagcaag 780tacccttacc acttcttcgc cacctccacc
ggcgacgtgg tggacatcag ccccttctac 840aacggcacca accggaacgc cagctacttc
ggcgagaacg ccgacaagtt cttcatcttc 900cccaactaca ccatcgtgtc cgacttcggc
agacccaaca gcgctctgga aacccacaga 960ctggtggcct ttctggaacg ggccgacagc
gtgatcagct gggacatcca ggacgagaag 1020aacgtgacct gccagctgac cttctgggag
gcctctgaga gaaccatcag aagcgaggcc 1080gaggacagct accacttcag cagcgccaag
atgaccgcca ccttcctgag caagaaacag 1140gaagtgaaca tgagcgactc cgccctggac
tgcgtgaggg acgaggccat caacaagctg 1200cagcagatct tcaacaccag ctacaaccag
acctacgaga agtatggcaa tgtgtccgtg 1260ttcgagacaa caggcggcct ggtggtgttc
tggcagggca tcaagcagaa aagcctggtg 1320gagctggaac ggctcgccaa ccggtccagc
ctgaacctga cccacaaccg gaccaagcgg 1380agcaccgacg gcaacaacgc aacccacctg
tccaacatgg aaagcgtgca caacctggtg 1440tacgcacagc tgcagttcac ctacgacacc
ctgcggggct acatcaacag agccctggcc 1500cagatcgccg aggcttggtg cgtggaccag
cggcggaccc tggaagtgtt caaagagctg 1560tccaagatca accccagcgc catcctgagc
gccatctaca acaagcctat cgccgccaga 1620ttcatgggcg acgtgctggg cctggccagc
tgcgtgacca tcaaccagac cagcgtgaag 1680gtgctgcggg acatgaacgt gaaagagagc
ccaggccgct gctactccag acccgtggtc 1740atcttcaact tcgccaacag ctcctacgtg
cagtacggcc agctgggcga ggacaacgag 1800atcctgctgg ggaaccaccg gaccgaggaa
tgccagctgc ccagcctgaa gatctttatc 1860gccggcaaca gcgcctacga gtatgtggac
tacctgttca agcggatgat cgacctgagc 1920agcatctcca ccgtggacag catgatcgcc
ctggacatcg accccctgga aaacaccgac 1980ttccgggtgc tggaactgta cagccagaaa
gagctgcgga gcagcaacgt gttcgacctg 2040gaagagatca tgcgggagtt caacagctac
aagcagtgat aa 2082

<210> SEQ ID NO 30
<211> LENGTH: 692
<212> TYPE: PRT
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 30
Met
Glu Ser Arg Ile Trp Cys Leu Val Val Cys Val Asn Leu Cys Ile1
5 10 15Val Cys Leu Gly Ala Ala Val
Ser Ser Ser Ser Thr Arg Gly Thr Ser 20 25
30Ala Thr His Ser His His Ser Ser His Thr Thr Ser Ala Ala
His Ser 35 40 45Arg Ser Gly Ser
Val Ser Gln Arg Val Thr Ser Ser Gln Thr Val Ser 50 55
60His Gly Val Asn Glu Thr Ile Tyr Asn Thr Thr Leu Lys
Tyr Gly Asp65 70 75
80Val Val Gly Val Asn Thr Thr Lys Tyr Pro Tyr Arg Val Cys Ser Met
85 90 95Ala Gln Gly Thr Asp Leu
Ile Arg Phe Glu Arg Asn Ile Val Cys Thr 100
105 110Ser Met Lys Pro Ile Asn Glu Asp Leu Asp Glu Gly
Ile Met Val Val 115 120 125Tyr Lys
Arg Asn Ile Val Ala His Thr Phe Lys Val Arg Val Tyr Gln 130
135 140Lys Val Leu Thr Phe Arg Arg Ser Tyr Ala Tyr
Ile His Thr Thr Tyr145 150 155
160Leu Leu Gly Ser Asn Thr Glu Tyr Val Ala Pro Pro Met Trp Glu Ile
165 170 175His His Ile Asn
Ser His Ser Gln Cys Tyr Ser Ser Tyr Ser Arg Val 180
185 190Ile Ala Gly Thr Val Phe Val Ala Tyr His Arg
Asp Ser Tyr Glu Asn 195 200 205Lys
Thr Met Gln Leu Met Pro Asp Asp Tyr Ser Asn Thr His Ser Thr 210
215 220Arg Tyr Val Thr Val Lys Asp Gln Trp His
Ser Arg Gly Ser Thr Trp225 230 235
240Leu Tyr Arg Glu Thr Cys Asn Leu Asn Cys Met Val Thr Ile Thr
Thr 245 250 255Ala Arg Ser
Lys Tyr Pro Tyr His Phe Phe Ala Thr Ser Thr Gly Asp 260
265 270Val Val Asp Ile Ser Pro Phe Tyr Asn Gly
Thr Asn Arg Asn Ala Ser 275 280
285Tyr Phe Gly Glu Asn Ala Asp Lys Phe Phe Ile Phe Pro Asn Tyr Thr 290
295 300Ile Val Ser Asp Phe Gly Arg Pro
Asn Ser Ala Leu Glu Thr His Arg305 310
315 320Leu Val Ala Phe Leu Glu Arg Ala Asp Ser Val Ile
Ser Trp Asp Ile 325 330
335Gln Asp Glu Lys Asn Val Thr Cys Gln Leu Thr Phe Trp Glu Ala Ser
340 345 350Glu Arg Thr Ile Arg Ser
Glu Ala Glu Asp Ser Tyr His Phe Ser Ser 355 360
365Ala Lys Met Thr Ala Thr Phe Leu Ser Lys Lys Gln Glu Val
Asn Met 370 375 380Ser Asp Ser Ala Leu
Asp Cys Val Arg Asp Glu Ala Ile Asn Lys Leu385 390
395 400Gln Gln Ile Phe Asn Thr Ser Tyr Asn Gln
Thr Tyr Glu Lys Tyr Gly 405 410
415Asn Val Ser Val Phe Glu Thr Thr Gly Gly Leu Val Val Phe Trp Gln
420 425 430Gly Ile Lys Gln Lys
Ser Leu Val Glu Leu Glu Arg Leu Ala Asn Arg 435
440 445Ser Ser Leu Asn Leu Thr His Asn Arg Thr Lys Arg
Ser Thr Asp Gly 450 455 460Asn Asn Ala
Thr His Leu Ser Asn Met Glu Ser Val His Asn Leu Val465
470 475 480Tyr Ala Gln Leu Gln Phe Thr
Tyr Asp Thr Leu Arg Gly Tyr Ile Asn 485
490 495Arg Ala Leu Ala Gln Ile Ala Glu Ala Trp Cys Val
Asp Gln Arg Arg 500 505 510Thr
Leu Glu Val Phe Lys Glu Leu Ser Lys Ile Asn Pro Ser Ala Ile 515
520 525Leu Ser Ala Ile Tyr Asn Lys Pro Ile
Ala Ala Arg Phe Met Gly Asp 530 535
540Val Leu Gly Leu Ala Ser Cys Val Thr Ile Asn Gln Thr Ser Val Lys545
550 555 560Val Leu Arg Asp
Met Asn Val Lys Glu Ser Pro Gly Arg Cys Tyr Ser 565
570 575Arg Pro Val Val Ile Phe Asn Phe Ala Asn
Ser Ser Tyr Val Gln Tyr 580 585
590Gly Gln Leu Gly Glu Asp Asn Glu Ile Leu Leu Gly Asn His Arg Thr
595 600 605Glu Glu Cys Gln Leu Pro Ser
Leu Lys Ile Phe Ile Ala Gly Asn Ser 610 615
620Ala Tyr Glu Tyr Val Asp Tyr Leu Phe Lys Arg Met Ile Asp Leu
Ser625 630 635 640Ser Ile
Ser Thr Val Asp Ser Met Ile Ala Leu Asp Ile Asp Pro Leu
645 650 655Glu Asn Thr Asp Phe Arg Val
Leu Glu Leu Tyr Ser Gln Lys Glu Leu 660 665
670Arg Ser Ser Asn Val Phe Asp Leu Glu Glu Ile Met Arg Glu
Phe Asn 675 680 685Ser Tyr Lys Gln
690

<210> SEQ ID NO 31
<211> LENGTH: 2232
<212> TYPE: DNA
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 31
atgaggcctg gcctgccctc ctacctgatc
atcctggccg tgtgcctgtt cagccacctg 60ctgtccagca gatacggcgc cgaggccgtg
agcgagcccc tggacaaggc tttccacctg 120ctgctgaaca cctacggcag acccatccgg
tttctgcggg agaacaccac ccagtgcacc 180tacaacagca gcctgcggaa cagcaccgtc
gtgagagaga acgccatcag cttcaacttt 240ttccagagct acaaccagta ctacgtgttc
cacatgccca gatgcctgtt tgccggccct 300ctggccgagc agttcctgaa ccaggtggac
ctgaccgaga cactggaaag ataccagcag 360cggctgaata cctacgccct ggtgtccaag
gacctggcca gctaccggtc ctttagccag 420cagctcaagg ctcaggatag cctcggcgag
cagcctacca ccgtgccccc tcccatcgac 480ctgagcatcc cccacgtgtg gatgcctccc
cagaccaccc ctcacggctg gaccgagagc 540cacaccacct ccggcctgca cagaccccac
ttcaaccaga cctgcatcct gttcgacggc 600cacgacctgc tgtttagcac cgtgaccccc
tgcctgcacc agggcttcta cctgatcgac 660gagctgagat acgtgaagat caccctgacc
gaggatttct tcgtggtcac cgtgtccatc 720gacgacgaca cccccatgct gctgatcttc
ggccacctgc ccagagtgct gttcaaggcc 780ccctaccagc gggacaactt catcctgcgg
cagaccgaga agcacgagct gctggtgctg 840gtcaagaagg accagctgaa ccggcactcc
tacctgaagg accccgactt cctggacgcc 900gccctggact tcaactacct ggacctgagc
gccctgctga gaaacagctt ccacagatac 960gccgtggacg tgctgaagtc cggacggtgc
cagatgctcg atcggcggac cgtggagatg 1020gccttcgcct atgccctcgc cctgttcgcc
gctgccagac aggaagaggc tggcgcccag 1080gtgtcagtgc ccagagccct ggatagacag
gccgccctgc tgcagatcca ggaattcatg 1140atcacctgcc tgagccagac cccccctaga
accaccctgc tgctgtaccc cacagccgtg 1200gatctggcca agagggccct gtggaccccc
aaccagatca ccgacatcac aagcctcgtg 1260cggctcgtgt acatcctgag caagcagaac
cagcagcacc tgatccccca gtgggccctg 1320agacagatcg ccgacttcgc cctgaagctg
cacaagaccc atctggccag ctttctgagc 1380gccttcgcca ggcaggaact gtacctgatg
ggcagcctgg tccacagcat gctggtgcat 1440accaccgagc ggcgggagat cttcatcgtg
gagacaggcc tgtgtagcct ggccgagctg 1500tcccacttta cccagctgct ggcccaccct
caccacgagt acctgagcga cctgtacacc 1560ccctgcagca gcagcggcag acgggaccac
agcctggaac ggctgaccag actgttcccc 1620gatgccaccg tgcctgctac agtgcctgcc
gccctgtcca tcctgtccac catgcagccc 1680agcaccctgg aaaccttccc cgacctgttc
tgcctgcccc tgggcgagag ctttagcgcc 1740ctgaccgtgt ccgagcacgt gtcctacatc
gtgaccaatc agtacctgat caagggcatc 1800agctaccccg tgtccaccac agtcgtgggc
cagagcctga tcatcaccca gaccgacagc 1860cagaccaagt gcgagctgac ccggaacatg
cacaccacac acagcatcac cgtggccctg 1920aacatcagcc tggaaaactg cgctttctgt
cagtctgccc tgctggaata cgacgatacc 1980cagggcgtga tcaacatcat gtacatgcac
gacagcgacg acgtgctgtt cgccctggac 2040ccctacaacg aggtggtggt gtccagcccc
cggacccact acctgatgct gctgaagaac 2100ggcaccgtgc tggaagtgac cgacgtggtg
gtggacgcca ccgacagcag actgctgatg 2160atgagcgtgt acgccctgag cgccatcatc
ggcatctacc tgctgtaccg gatgctgaaa 2220acctgctgat aa
2232

<210> SEQ ID NO 32
<211> LENGTH: 742
<212> TYPE: PRT
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 32
Met
Arg Pro Gly Leu Pro Ser Tyr Leu Ile Ile Leu Ala Val Cys Leu1
5 10 15Phe Ser His Leu Leu Ser Ser
Arg Tyr Gly Ala Glu Ala Val Ser Glu 20 25
30Pro Leu Asp Lys Ala Phe His Leu Leu Leu Asn Thr Tyr Gly
Arg Pro 35 40 45Ile Arg Phe Leu
Arg Glu Asn Thr Thr Gln Cys Thr Tyr Asn Ser Ser 50 55
60Leu Arg Asn Ser Thr Val Val Arg Glu Asn Ala Ile Ser
Phe Asn Phe65 70 75
80Phe Gln Ser Tyr Asn Gln Tyr Tyr Val Phe His Met Pro Arg Cys Leu
85 90 95Phe Ala Gly Pro Leu Ala
Glu Gln Phe Leu Asn Gln Val Asp Leu Thr 100
105 110Glu Thr Leu Glu Arg Tyr Gln Gln Arg Leu Asn Thr
Tyr Ala Leu Val 115 120 125Ser Lys
Asp Leu Ala Ser Tyr Arg Ser Phe Ser Gln Gln Leu Lys Ala 130
135 140Gln Asp Ser Leu Gly Glu Gln Pro Thr Thr Val
Pro Pro Pro Ile Asp145 150 155
160Leu Ser Ile Pro His Val Trp Met Pro Pro Gln Thr Thr Pro His Gly
165 170 175Trp Thr Glu Ser
His Thr Thr Ser Gly Leu His Arg Pro His Phe Asn 180
185 190Gln Thr Cys Ile Leu Phe Asp Gly His Asp Leu
Leu Phe Ser Thr Val 195 200 205Thr
Pro Cys Leu His Gln Gly Phe Tyr Leu Ile Asp Glu Leu Arg Tyr 210
215 220Val Lys Ile Thr Leu Thr Glu Asp Phe Phe
Val Val Thr Val Ser Ile225 230 235
240Asp Asp Asp Thr Pro Met Leu Leu Ile Phe Gly His Leu Pro Arg
Val 245 250 255Leu Phe Lys
Ala Pro Tyr Gln Arg Asp Asn Phe Ile Leu Arg Gln Thr 260
265 270Glu Lys His Glu Leu Leu Val Leu Val Lys
Lys Asp Gln Leu Asn Arg 275 280
285His Ser Tyr Leu Lys Asp Pro Asp Phe Leu Asp Ala Ala Leu Asp Phe 290
295 300Asn Tyr Leu Asp Leu Ser Ala Leu
Leu Arg Asn Ser Phe His Arg Tyr305 310
315 320Ala Val Asp Val Leu Lys Ser Gly Arg Cys Gln Met
Leu Asp Arg Arg 325 330
335Thr Val Glu Met Ala Phe Ala Tyr Ala Leu Ala Leu Phe Ala Ala Ala
340 345 350Arg Gln Glu Glu Ala Gly
Ala Gln Val Ser Val Pro Arg Ala Leu Asp 355 360
365Arg Gln Ala Ala Leu Leu Gln Ile Gln Glu Phe Met Ile Thr
Cys Leu 370 375 380Ser Gln Thr Pro Pro
Arg Thr Thr Leu Leu Leu Tyr Pro Thr Ala Val385 390
395 400Asp Leu Ala Lys Arg Ala Leu Trp Thr Pro
Asn Gln Ile Thr Asp Ile 405 410
415Thr Ser Leu Val Arg Leu Val Tyr Ile Leu Ser Lys Gln Asn Gln Gln
420 425 430His Leu Ile Pro Gln
Trp Ala Leu Arg Gln Ile Ala Asp Phe Ala Leu 435
440 445Lys Leu His Lys Thr His Leu Ala Ser Phe Leu Ser
Ala Phe Ala Arg 450 455 460Gln Glu Leu
Tyr Leu Met Gly Ser Leu Val His Ser Met Leu Val His465
470 475 480Thr Thr Glu Arg Arg Glu Ile
Phe Ile Val Glu Thr Gly Leu Cys Ser 485
490 495Leu Ala Glu Leu Ser His Phe Thr Gln Leu Leu Ala
His Pro His His 500 505 510Glu
Tyr Leu Ser Asp Leu Tyr Thr Pro Cys Ser Ser Ser Gly Arg Arg 515
520 525Asp His Ser Leu Glu Arg Leu Thr Arg
Leu Phe Pro Asp Ala Thr Val 530 535
540Pro Ala Thr Val Pro Ala Ala Leu Ser Ile Leu Ser Thr Met Gln Pro545
550 555 560Ser Thr Leu Glu
Thr Phe Pro Asp Leu Phe Cys Leu Pro Leu Gly Glu 565
570 575Ser Phe Ser Ala Leu Thr Val Ser Glu His
Val Ser Tyr Ile Val Thr 580 585
590Asn Gln Tyr Leu Ile Lys Gly Ile Ser Tyr Pro Val Ser Thr Thr Val
595 600 605Val Gly Gln Ser Leu Ile Ile
Thr Gln Thr Asp Ser Gln Thr Lys Cys 610 615
620Glu Leu Thr Arg Asn Met His Thr Thr His Ser Ile Thr Val Ala
Leu625 630 635 640Asn Ile
Ser Leu Glu Asn Cys Ala Phe Cys Gln Ser Ala Leu Leu Glu
645 650 655Tyr Asp Asp Thr Gln Gly Val
Ile Asn Ile Met Tyr Met His Asp Ser 660 665
670Asp Asp Val Leu Phe Ala Leu Asp Pro Tyr Asn Glu Val Val
Val Ser 675 680 685Ser Pro Arg Thr
His Tyr Leu Met Leu Leu Lys Asn Gly Thr Val Leu 690
695 700Glu Val Thr Asp Val Val Val Asp Ala Thr Asp Ser
Arg Leu Leu Met705 710 715
720Met Ser Val Tyr Ala Leu Ser Ala Ile Ile Gly Ile Tyr Leu Leu Tyr
725 730 735Arg Met Leu Lys Thr
Cys 740

<210> SEQ ID NO 33
<211> LENGTH: 2151
<212> TYPE: DNA
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 33
atgaggcctg gcctgccctc
ctacctgatc atcctggccg tgtgcctgtt cagccacctg 60ctgtccagca gatacggcgc
cgaggccgtg agcgagcccc tggacaaggc tttccacctg 120ctgctgaaca cctacggcag
acccatccgg tttctgcggg agaacaccac ccagtgcacc 180tacaacagca gcctgcggaa
cagcaccgtc gtgagagaga acgccatcag cttcaacttt 240ttccagagct acaaccagta
ctacgtgttc cacatgccca gatgcctgtt tgccggccct 300ctggccgagc agttcctgaa
ccaggtggac ctgaccgaga cactggaaag ataccagcag 360cggctgaata cctacgccct
ggtgtccaag gacctggcca gctaccggtc ctttagccag 420cagctcaagg ctcaggatag
cctcggcgag cagcctacca ccgtgccccc tcccatcgac 480ctgagcatcc cccacgtgtg
gatgcctccc cagaccaccc ctcacggctg gaccgagagc 540cacaccacct ccggcctgca
cagaccccac ttcaaccaga cctgcatcct gttcgacggc 600cacgacctgc tgtttagcac
cgtgaccccc tgcctgcacc agggcttcta cctgatcgac 660gagctgagat acgtgaagat
caccctgacc gaggatttct tcgtggtcac cgtgtccatc 720gacgacgaca cccccatgct
gctgatcttc ggccacctgc ccagagtgct gttcaaggcc 780ccctaccagc gggacaactt
catcctgcgg cagaccgaga agcacgagct gctggtgctg 840gtcaagaagg accagctgaa
ccggcactcc tacctgaagg accccgactt cctggacgcc 900gccctggact tcaactacct
ggacctgagc gccctgctga gaaacagctt ccacagatac 960gccgtggacg tgctgaagtc
cggacggtgc cagatgctcg atcggcggac cgtggagatg 1020gccttcgcct atgccctcgc
cctgttcgcc gctgccagac aggaagaggc tggcgcccag 1080gtgtcagtgc ccagagccct
ggatagacag gccgccctgc tgcagatcca ggaattcatg 1140atcacctgcc tgagccagac
cccccctaga accaccctgc tgctgtaccc cacagccgtg 1200gatctggcca agagggccct
gtggaccccc aaccagatca ccgacatcac aagcctcgtg 1260cggctcgtgt acatcctgag
caagcagaac cagcagcacc tgatccccca gtgggccctg 1320agacagatcg ccgacttcgc
cctgaagctg cacaagaccc atctggccag ctttctgagc 1380gccttcgcca ggcaggaact
gtacctgatg ggcagcctgg tccacagcat gctggtgcat 1440accaccgagc ggcgggagat
cttcatcgtg gagacaggcc tgtgtagcct ggccgagctg 1500tcccacttta cccagctgct
ggcccaccct caccacgagt acctgagcga cctgtacacc 1560ccctgcagca gcagcggcag
acgggaccac agcctggaac ggctgaccag actgttcccc 1620gatgccaccg tgcctgctac
agtgcctgcc gccctgtcca tcctgtccac catgcagccc 1680agcaccctgg aaaccttccc
cgacctgttc tgcctgcccc tgggcgagag ctttagcgcc 1740ctgaccgtgt ccgagcacgt
gtcctacatc gtgaccaatc agtacctgat caagggcatc 1800agctaccccg tgtccaccac
agtcgtgggc cagagcctga tcatcaccca gaccgacagc 1860cagaccaagt gcgagctgac
ccggaacatg cacaccacac acagcatcac cgtggccctg 1920aacatcagcc tggaaaactg
cgctttctgt cagtctgccc tgctggaata cgacgatacc 1980cagggcgtga tcaacatcat
gtacatgcac gacagcgacg acgtgctgtt cgccctggac 2040ccctacaacg aggtggtggt
gtccagcccc cggacccact acctgatgct gctgaagaac 2100ggcaccgtgc tggaagtgac
cgacgtggtg gtggacgcca ccgactgata a 2151

<210> SEQ ID NO 34
<211> LENGTH: 715
<212> TYPE: PRT
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 34
Met Arg Pro Gly Leu Pro Ser Tyr Leu Ile Ile Leu Ala Val
Cys Leu1 5 10 15Phe Ser
His Leu Leu Ser Ser Arg Tyr Gly Ala Glu Ala Val Ser Glu 20
25 30Pro Leu Asp Lys Ala Phe His Leu Leu
Leu Asn Thr Tyr Gly Arg Pro 35 40
45Ile Arg Phe Leu Arg Glu Asn Thr Thr Gln Cys Thr Tyr Asn Ser Ser 50
55 60Leu Arg Asn Ser Thr Val Val Arg Glu
Asn Ala Ile Ser Phe Asn Phe65 70 75
80Phe Gln Ser Tyr Asn Gln Tyr Tyr Val Phe His Met Pro Arg
Cys Leu 85 90 95Phe Ala
Gly Pro Leu Ala Glu Gln Phe Leu Asn Gln Val Asp Leu Thr 100
105 110Glu Thr Leu Glu Arg Tyr Gln Gln Arg
Leu Asn Thr Tyr Ala Leu Val 115 120
125Ser Lys Asp Leu Ala Ser Tyr Arg Ser Phe Ser Gln Gln Leu Lys Ala
130 135 140Gln Asp Ser Leu Gly Glu Gln
Pro Thr Thr Val Pro Pro Pro Ile Asp145 150
155 160Leu Ser Ile Pro His Val Trp Met Pro Pro Gln Thr
Thr Pro His Gly 165 170
175Trp Thr Glu Ser His Thr Thr Ser Gly Leu His Arg Pro His Phe Asn
180 185 190Gln Thr Cys Ile Leu Phe
Asp Gly His Asp Leu Leu Phe Ser Thr Val 195 200
205Thr Pro Cys Leu His Gln Gly Phe Tyr Leu Ile Asp Glu Leu
Arg Tyr 210 215 220Val Lys Ile Thr Leu
Thr Glu Asp Phe Phe Val Val Thr Val Ser Ile225 230
235 240Asp Asp Asp Thr Pro Met Leu Leu Ile Phe
Gly His Leu Pro Arg Val 245 250
255Leu Phe Lys Ala Pro Tyr Gln Arg Asp Asn Phe Ile Leu Arg Gln Thr
260 265 270Glu Lys His Glu Leu
Leu Val Leu Val Lys Lys Asp Gln Leu Asn Arg 275
280 285His Ser Tyr Leu Lys Asp Pro Asp Phe Leu Asp Ala
Ala Leu Asp Phe 290 295 300Asn Tyr Leu
Asp Leu Ser Ala Leu Leu Arg Asn Ser Phe His Arg Tyr305
310 315 320Ala Val Asp Val Leu Lys Ser
Gly Arg Cys Gln Met Leu Asp Arg Arg 325
330 335Thr Val Glu Met Ala Phe Ala Tyr Ala Leu Ala Leu
Phe Ala Ala Ala 340 345 350Arg
Gln Glu Glu Ala Gly Ala Gln Val Ser Val Pro Arg Ala Leu Asp 355
360 365Arg Gln Ala Ala Leu Leu Gln Ile Gln
Glu Phe Met Ile Thr Cys Leu 370 375
380Ser Gln Thr Pro Pro Arg Thr Thr Leu Leu Leu Tyr Pro Thr Ala Val385
390 395 400Asp Leu Ala Lys
Arg Ala Leu Trp Thr Pro Asn Gln Ile Thr Asp Ile 405
410 415Thr Ser Leu Val Arg Leu Val Tyr Ile Leu
Ser Lys Gln Asn Gln Gln 420 425
430His Leu Ile Pro Gln Trp Ala Leu Arg Gln Ile Ala Asp Phe Ala Leu
435 440 445Lys Leu His Lys Thr His Leu
Ala Ser Phe Leu Ser Ala Phe Ala Arg 450 455
460Gln Glu Leu Tyr Leu Met Gly Ser Leu Val His Ser Met Leu Val
His465 470 475 480Thr Thr
Glu Arg Arg Glu Ile Phe Ile Val Glu Thr Gly Leu Cys Ser
485 490 495Leu Ala Glu Leu Ser His Phe
Thr Gln Leu Leu Ala His Pro His His 500 505
510Glu Tyr Leu Ser Asp Leu Tyr Thr Pro Cys Ser Ser Ser Gly
Arg Arg 515 520 525Asp His Ser Leu
Glu Arg Leu Thr Arg Leu Phe Pro Asp Ala Thr Val 530
535 540Pro Ala Thr Val Pro Ala Ala Leu Ser Ile Leu Ser
Thr Met Gln Pro545 550 555
560Ser Thr Leu Glu Thr Phe Pro Asp Leu Phe Cys Leu Pro Leu Gly Glu
565 570 575Ser Phe Ser Ala Leu
Thr Val Ser Glu His Val Ser Tyr Ile Val Thr 580
585 590Asn Gln Tyr Leu Ile Lys Gly Ile Ser Tyr Pro Val
Ser Thr Thr Val 595 600 605Val Gly
Gln Ser Leu Ile Ile Thr Gln Thr Asp Ser Gln Thr Lys Cys 610
615 620Glu Leu Thr Arg Asn Met His Thr Thr His Ser
Ile Thr Val Ala Leu625 630 635
640Asn Ile Ser Leu Glu Asn Cys Ala Phe Cys Gln Ser Ala Leu Leu Glu
645 650 655Tyr Asp Asp Thr
Gln Gly Val Ile Asn Ile Met Tyr Met His Asp Ser 660
665 670Asp Asp Val Leu Phe Ala Leu Asp Pro Tyr Asn
Glu Val Val Val Ser 675 680 685Ser
Pro Arg Thr His Tyr Leu Met Leu Leu Lys Asn Gly Thr Val Leu 690
695 700Glu Val Thr Asp Val Val Val Asp Ala Thr
Asp705 710 715

<210> SEQ ID NO 35
<211> LENGTH: 840
<212> TYPE: DNA
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 35
atgtgcagaa ggcccgactg cggcttcagc ttcagccctg gacccgtgat cctgctgtgg
60tgctgcctgc tgctgcctat cgtgtcctct gccgccgtgt ctgtggcccc tacagccgcc
120gagaaggtgc cagccgagtg ccccgagctg accagaagat gcctgctggg cgaggtgttc
180gagggcgaca agtacgagag ctggctgcgg cccctggtca acgtgaccgg cagagatggc
240cccctgagcc agctgatccg gtacagaccc gtgacccccg aggccgccaa tagcgtgctg
300ctggacgagg ccttcctgga taccctggcc ctgctgtaca acaaccccga ccagctgaga
360gccctgctga ccctgctgtc cagcgacacc gcccccagat ggatgaccgt gatgcggggc
420tacagcgagt gtggagatgg cagccctgcc gtgtacacct gcgtggacga cctgtgcaga
480ggctacgacc tgaccagact gagctacggc cggtccatct tcacagagca cgtgctgggc
540ttcgagctgg tgccccccag cctgttcaac gtggtggtgg ccatccggaa cgaggccacc
600agaaccaaca gagccgtgcg gctgcctgtg tctacagccg ctgcacctga gggcatcaca
660ctgttctacg gcctgtacaa cgccgtgaaa gagttctgcc tccggcacca gctggatccc
720cccctgctga gacacctgga caagtactac gccggcctgc ccccagagct gaagcagacc
780agagtgaacc tgcccgccca cagcagatat ggccctcagg ccgtggacgc cagatgataa
840

<210> SEQ ID NO 36
<211> LENGTH: 278
<212> TYPE: PRT
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 36
Met Cys Arg Arg Pro Asp Cys Gly Phe Ser
Phe Ser Pro Gly Pro Val1 5 10
15Ile Leu Leu Trp Cys Cys Leu Leu Leu Pro Ile Val Ser Ser Ala Ala
20 25 30Val Ser Val Ala Pro Thr
Ala Ala Glu Lys Val Pro Ala Glu Cys Pro 35 40
45Glu Leu Thr Arg Arg Cys Leu Leu Gly Glu Val Phe Glu Gly
Asp Lys 50 55 60Tyr Glu Ser Trp Leu
Arg Pro Leu Val Asn Val Thr Gly Arg Asp Gly65 70
75 80Pro Leu Ser Gln Leu Ile Arg Tyr Arg Pro
Val Thr Pro Glu Ala Ala 85 90
95Asn Ser Val Leu Leu Asp Glu Ala Phe Leu Asp Thr Leu Ala Leu Leu
100 105 110Tyr Asn Asn Pro Asp
Gln Leu Arg Ala Leu Leu Thr Leu Leu Ser Ser 115
120 125Asp Thr Ala Pro Arg Trp Met Thr Val Met Arg Gly
Tyr Ser Glu Cys 130 135 140Gly Asp Gly
Ser Pro Ala Val Tyr Thr Cys Val Asp Asp Leu Cys Arg145
150 155 160Gly Tyr Asp Leu Thr Arg Leu
Ser Tyr Gly Arg Ser Ile Phe Thr Glu 165
170 175His Val Leu Gly Phe Glu Leu Val Pro Pro Ser Leu
Phe Asn Val Val 180 185 190Val
Ala Ile Arg Asn Glu Ala Thr Arg Thr Asn Arg Ala Val Arg Leu 195
200 205Pro Val Ser Thr Ala Ala Ala Pro Glu
Gly Ile Thr Leu Phe Tyr Gly 210 215
220Leu Tyr Asn Ala Val Lys Glu Phe Cys Leu Arg His Gln Leu Asp Pro225
230 235 240Pro Leu Leu Arg
His Leu Asp Lys Tyr Tyr Ala Gly Leu Pro Pro Glu 245
250 255Leu Lys Gln Thr Arg Val Asn Leu Pro Ala
His Ser Arg Tyr Gly Pro 260 265
270Gln Ala Val Asp Ala Arg 275

<210> SEQ ID NO 37
<211> LENGTH: 1119
<212> TYPE: DNA
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 37
atggccccca gccacgtgga caaagtgaac acccggactt ggagcgccag catcgtgttc
60atggtgctga ccttcgtgaa cgtgtccgtg cacctggtgc tgtccaactt cccccacctg
120ggctacccct gcgtgtacta ccacgtggtg gacttcgagc ggctgaacat gagcgcctac
180aacgtgatgc acctgcacac ccccatgctg tttctggaca gcgtgcagct cgtgtgctac
240gccgtgttca tgcagctggt gtttctggcc gtgaccatct actacctcgt gtgctggatc
300aagatcagca tgcggaagga caagggcatg agcctgaacc agagcacccg ggacatcagc
360tacatgggcg acagcctgac cgccttcctg ttcatcctga gcatggacac cttccagctg
420ttcaccctga ccatgagctt ccggctgccc agcatgatcg ccttcatggc cgccgtgcac
480tttttctgtc tgaccatctt caacgtgtcc atggtcaccc agtaccggtc ctacaagcgg
540agcctgttct tcttctcccg gctgcacccc aagctgaagg gcaccgtgca gttccggacc
600ctgatcgtga acctggtgga ggtggccctg ggcttcaata ccaccgtggt ggctatggcc
660ctgtgctacg gcttcggcaa caacttcttc gtgcggaccg gccatatggt gctggccgtg
720ttcgtggtgt acgccatcat cagcatcatc tactttctgc tgatcgaggc cgtgttcttc
780cagtacgtga aggtgcagtt cggctaccat ctgggcgcct ttttcggcct gtgcggcctg
840atctacccca tcgtgcagta cgacaccttc ctgagcaacg agtaccggac cggcatcagc
900tggtccttcg gaatgctgtt cttcatctgg gccatgttca ccacctgcag agccgtgcgg
960tacttcagag gcagaggcag cggctccgtg aagtaccagg ccctggccac agcctctggc
1020gaagaggtgg ccgccctgag ccaccacgac agcctggaaa gcagacggct gcgggaggaa
1080gaggacgacg acgacgagga cttcgaggac gcctgataa
1119

<210> SEQ ID NO 38
<211> LENGTH: 371
<212> TYPE: PRT
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 38
Met Ala Pro Ser His Val Asp Lys Val Asn
Thr Arg Thr Trp Ser Ala1 5 10
15Ser Ile Val Phe Met Val Leu Thr Phe Val Asn Val Ser Val His Leu
20 25 30Val Leu Ser Asn Phe Pro
His Leu Gly Tyr Pro Cys Val Tyr Tyr His 35 40
45Val Val Asp Phe Glu Arg Leu Asn Met Ser Ala Tyr Asn Val
Met His 50 55 60Leu His Thr Pro Met
Leu Phe Leu Asp Ser Val Gln Leu Val Cys Tyr65 70
75 80Ala Val Phe Met Gln Leu Val Phe Leu Ala
Val Thr Ile Tyr Tyr Leu 85 90
95Val Cys Trp Ile Lys Ile Ser Met Arg Lys Asp Lys Gly Met Ser Leu
100 105 110Asn Gln Ser Thr Arg
Asp Ile Ser Tyr Met Gly Asp Ser Leu Thr Ala 115
120 125Phe Leu Phe Ile Leu Ser Met Asp Thr Phe Gln Leu
Phe Thr Leu Thr 130 135 140Met Ser Phe
Arg Leu Pro Ser Met Ile Ala Phe Met Ala Ala Val His145
150 155 160Phe Phe Cys Leu Thr Ile Phe
Asn Val Ser Met Val Thr Gln Tyr Arg 165
170 175Ser Tyr Lys Arg Ser Leu Phe Phe Phe Ser Arg Leu
His Pro Lys Leu 180 185 190Lys
Gly Thr Val Gln Phe Arg Thr Leu Ile Val Asn Leu Val Glu Val 195
200 205Ala Leu Gly Phe Asn Thr Thr Val Val
Ala Met Ala Leu Cys Tyr Gly 210 215
220Phe Gly Asn Asn Phe Phe Val Arg Thr Gly His Met Val Leu Ala Val225
230 235 240Phe Val Val Tyr
Ala Ile Ile Ser Ile Ile Tyr Phe Leu Leu Ile Glu 245
250 255Ala Val Phe Phe Gln Tyr Val Lys Val Gln
Phe Gly Tyr His Leu Gly 260 265
270Ala Phe Phe Gly Leu Cys Gly Leu Ile Tyr Pro Ile Val Gln Tyr Asp
275 280 285Thr Phe Leu Ser Asn Glu Tyr
Arg Thr Gly Ile Ser Trp Ser Phe Gly 290 295
300Met Leu Phe Phe Ile Trp Ala Met Phe Thr Thr Cys Arg Ala Val
Arg305 310 315 320Tyr Phe
Arg Gly Arg Gly Ser Gly Ser Val Lys Tyr Gln Ala Leu Ala
325 330 335Thr Ala Ser Gly Glu Glu Val
Ala Ala Leu Ser His His Asp Ser Leu 340 345
350Glu Ser Arg Arg Leu Arg Glu Glu Glu Asp Asp Asp Asp Glu
Asp Phe 355 360 365Glu Asp Ala
370

<210> SEQ ID NO 39
<211> LENGTH: 411
<212> TYPE: DNA
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 39
atggaatgga acaccctggt cctgggcctg
ctggtgctgt ctgtcgtggc cagcagcaac 60aacacatcca cagccagcac ccctagacct
agcagcagca cccacgccag cactaccgtg 120aaggctacca ccgtggccac cacaagcacc
accactgcta ccagcaccag ctccaccacc 180tctgccaagc ctggctctac cacacacgac
cccaacgtga tgaggcccca cgcccacaac 240gacttctaca acgctcactg caccagccac
atgtacgagc tgtccctgag cagctttgcc 300gcctggtgga ccatgctgaa cgccctgatc
ctgatgggcg ccttctgcat cgtgctgcgg 360cactgctgct tccagaactt caccgccacc
accaccaagg gctactgata a 411

<210> SEQ ID NO 40
<211> LENGTH: 135
<212> TYPE: PRT
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 40
Met
Glu Trp Asn Thr Leu Val Leu Gly Leu Leu Val Leu Ser Val Val1
5 10 15Ala Ser Ser Asn Asn Thr Ser
Thr Ala Ser Thr Pro Arg Pro Ser Ser 20 25
30Ser Thr His Ala Ser Thr Thr Val Lys Ala Thr Thr Val Ala
Thr Thr 35 40 45Ser Thr Thr Thr
Ala Thr Ser Thr Ser Ser Thr Thr Ser Ala Lys Pro 50 55
60Gly Ser Thr Thr His Asp Pro Asn Val Met Arg Pro His
Ala His Asn65 70 75
80Asp Phe Tyr Asn Ala His Cys Thr Ser His Met Tyr Glu Leu Ser Leu
85 90 95Ser Ser Phe Ala Ala Trp
Trp Thr Met Leu Asn Ala Leu Ile Leu Met 100
105 110Gly Ala Phe Cys Ile Val Leu Arg His Cys Cys Phe
Gln Asn Phe Thr 115 120 125Ala Thr
Thr Thr Lys Gly Tyr 130 135

<210> SEQ ID NO 41
<211> LENGTH: 1422
<212> TYPE: DNA
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 41
atgggcaaga aagaaatgat catggtcaag ggcatcccca agatcatgct gctgattagc
60atcacctttc tgctgctgtc cctgatcaac tgcaacgtgc tggtcaacag ccggggcacc
120agaagatcct ggccctacac cgtgctgtcc taccggggca aagagatcct gaagaagcag
180aaagaggaca tcctgaagcg gctgatgagc accagcagcg acggctaccg gttcctgatg
240taccccagcc agcagaaatt ccacgccatc gtgatcagca tggacaagtt cccccaggac
300tacatcctgg ccggacccat ccggaacgac agcatcaccc acatgtggtt cgacttctac
360agcacccagc tgcggaagcc cgccaaatac gtgtacagcg agtacaacca caccgcccac
420aagatcaccc tgaggcctcc cccttgtggc accgtgccca gcatgaactg cctgagcgag
480atgctgaacg tgtccaagcg gaacgacacc ggcgagaagg gctgcggcaa cttcaccacc
540ttcaacccca tgttcttcaa cgtgccccgg tggaacacca agctgtacat cggcagcaac
600aaagtgaacg tggacagcca gaccatctac tttctgggcc tgaccgccct gctgctgaga
660tacgcccagc ggaactgcac ccggtccttc tacctggtca acgccatgag ccggaacctg
720ttccgggtgc ccaagtacat caacggcacc aagctgaaga acaccatgcg gaagctgaag
780cggaagcagg ccctggtcaa agagcagccc cagaagaaga acaagaagtc ccagagcacc
840accaccccct acctgagcta caccacctcc accgccttca acgtgaccac caacgtgacc
900tacagcgcca cagccgccgt gaccagagtg gccacaagca ccaccggcta ccggcccgac
960agcaacttta tgaagtccat catggccacc cagctgagag atctggccac ctgggtgtac
1020accaccctgc ggtacagaaa cgagcccttc tgcaagcccg accggaacag aaccgccgtg
1080agcgagttca tgaagaatac ccacgtgctg atcagaaacg agacacccta caccatctac
1140ggcaccctgg acatgagcag cctgtactac aacgagacaa tgagcgtgga gaacgagaca
1200gccagcgaca acaacgaaac cacccccacc tcccccagca cccggttcca gcggaccttc
1260atcgaccccc tgtgggacta cctggacagc ctgctgttcc tggacaagat ccggaacttc
1320agcctgcagc tgcccgccta cggcaatctg accccccctg agcacagaag ggccgccaac
1380ctgagcaccc tgaacagcct gtggtggtgg agccagtgat aa
1422

<210> SEQ ID NO 42
<211> LENGTH: 472
<212> TYPE: PRT
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 42
Met Gly Lys Lys Glu Met Ile Met Val Lys
Gly Ile Pro Lys Ile Met1 5 10
15Leu Leu Ile Ser Ile Thr Phe Leu Leu Leu Ser Leu Ile Asn Cys Asn
20 25 30Val Leu Val Asn Ser Arg
Gly Thr Arg Arg Ser Trp Pro Tyr Thr Val 35 40
45Leu Ser Tyr Arg Gly Lys Glu Ile Leu Lys Lys Gln Lys Glu
Asp Ile 50 55 60Leu Lys Arg Leu Met
Ser Thr Ser Ser Asp Gly Tyr Arg Phe Leu Met65 70
75 80Tyr Pro Ser Gln Gln Lys Phe His Ala Ile
Val Ile Ser Met Asp Lys 85 90
95Phe Pro Gln Asp Tyr Ile Leu Ala Gly Pro Ile Arg Asn Asp Ser Ile
100 105 110Thr His Met Trp Phe
Asp Phe Tyr Ser Thr Gln Leu Arg Lys Pro Ala 115
120 125Lys Tyr Val Tyr Ser Glu Tyr Asn His Thr Ala His
Lys Ile Thr Leu 130 135 140Arg Pro Pro
Pro Cys Gly Thr Val Pro Ser Met Asn Cys Leu Ser Glu145
150 155 160Met Leu Asn Val Ser Lys Arg
Asn Asp Thr Gly Glu Lys Gly Cys Gly 165
170 175Asn Phe Thr Thr Phe Asn Pro Met Phe Phe Asn Val
Pro Arg Trp Asn 180 185 190Thr
Lys Leu Tyr Ile Gly Ser Asn Lys Val Asn Val Asp Ser Gln Thr 195
200 205Ile Tyr Phe Leu Gly Leu Thr Ala Leu
Leu Leu Arg Tyr Ala Gln Arg 210 215
220Asn Cys Thr Arg Ser Phe Tyr Leu Val Asn Ala Met Ser Arg Asn Leu225
230 235 240Phe Arg Val Pro
Lys Tyr Ile Asn Gly Thr Lys Leu Lys Asn Thr Met 245
250 255Arg Lys Leu Lys Arg Lys Gln Ala Leu Val
Lys Glu Gln Pro Gln Lys 260 265
270Lys Asn Lys Lys Ser Gln Ser Thr Thr Thr Pro Tyr Leu Ser Tyr Thr
275 280 285Thr Ser Thr Ala Phe Asn Val
Thr Thr Asn Val Thr Tyr Ser Ala Thr 290 295
300Ala Ala Val Thr Arg Val Ala Thr Ser Thr Thr Gly Tyr Arg Pro
Asp305 310 315 320Ser Asn
Phe Met Lys Ser Ile Met Ala Thr Gln Leu Arg Asp Leu Ala
325 330 335Thr Trp Val Tyr Thr Thr Leu
Arg Tyr Arg Asn Glu Pro Phe Cys Lys 340 345
350Pro Asp Arg Asn Arg Thr Ala Val Ser Glu Phe Met Lys Asn
Thr His 355 360 365Val Leu Ile Arg
Asn Glu Thr Pro Tyr Thr Ile Tyr Gly Thr Leu Asp 370
375 380Met Ser Ser Leu Tyr Tyr Asn Glu Thr Met Ser Val
Glu Asn Glu Thr385 390 395
400Ala Ser Asp Asn Asn Glu Thr Thr Pro Thr Ser Pro Ser Thr Arg Phe
405 410 415Gln Arg Thr Phe Ile
Asp Pro Leu Trp Asp Tyr Leu Asp Ser Leu Leu 420
425 430Phe Leu Asp Lys Ile Arg Asn Phe Ser Leu Gln Leu
Pro Ala Tyr Gly 435 440 445Asn Leu
Thr Pro Pro Glu His Arg Arg Ala Ala Asn Leu Ser Thr Leu 450
455 460Asn Ser Leu Trp Trp Trp Ser Gln465
470

<210> SEQ ID NO 43
<211> LENGTH: 519
<212> TYPE: DNA
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 43
atgagcccca aggacctgac ccccttcctg
acaaccctgt ggctgctcct gggccatagc 60agagtgccta gagtgcgggc cgaggaatgc
tgcgagttca tcaacgtgaa ccaccccccc 120gagcggtgct acgacttcaa gatgtgcaac
cggttcaccg tggccctgag atgccccgac 180ggcgaagtgt gctacagccc cgagaaaacc
gccgagatcc ggggcatcgt gaccaccatg 240acccacagcc tgacccggca ggtggtgcac
aacaagctga ccagctgcaa ctacaacccc 300ctgtacctgg aagccgacgg ccggatcaga
tgcggcaaag tgaacgacaa ggcccagtac 360ctgctgggag ccgccggaag cgtgccctac
cggtggatca acctggaata cgacaagatc 420acccggatcg tgggcctgga ccagtacctg
gaaagcgtga agaagcacaa gcggctggac 480gtgtgcagag ccaagatggg ctacatgctg
cagtgataa 519

<210> SEQ ID NO 44
<211> LENGTH: 171
<212> TYPE: PRT
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 44
Met
Ser Pro Lys Asp Leu Thr Pro Phe Leu Thr Thr Leu Trp Leu Leu1
5 10 15Leu Gly His Ser Arg Val Pro
Arg Val Arg Ala Glu Glu Cys Cys Glu 20 25
30Phe Ile Asn Val Asn His Pro Pro Glu Arg Cys Tyr Asp Phe
Lys Met 35 40 45Cys Asn Arg Phe
Thr Val Ala Leu Arg Cys Pro Asp Gly Glu Val Cys 50 55
60Tyr Ser Pro Glu Lys Thr Ala Glu Ile Arg Gly Ile Val
Thr Thr Met65 70 75
80Thr His Ser Leu Thr Arg Gln Val Val His Asn Lys Leu Thr Ser Cys
85 90 95Asn Tyr Asn Pro Leu Tyr
Leu Glu Ala Asp Gly Arg Ile Arg Cys Gly 100
105 110Lys Val Asn Asp Lys Ala Gln Tyr Leu Leu Gly Ala
Ala Gly Ser Val 115 120 125Pro Tyr
Arg Trp Ile Asn Leu Glu Tyr Asp Lys Ile Thr Arg Ile Val 130
135 140Gly Leu Asp Gln Tyr Leu Glu Ser Val Lys Lys
His Lys Arg Leu Asp145 150 155
160Val Cys Arg Ala Lys Met Gly Tyr Met Leu Gln 165
170

<210> SEQ ID NO 45
<211> LENGTH: 648
<212> TYPE: DNA
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 45
atgctgcggc tgctgctgag
acaccacttc cactgcctgc tgctgtgtgc cgtgtgggcc 60accccttgtc tggccagccc
ttggagcacc ctgaccgcca accagaaccc tagcccccct 120tggtccaagc tgacctacag
caagccccac gacgccgcca ccttctactg cccctttctg 180taccccagcc ctcccagaag
ccccctgcag ttcagcggct tccagagagt gtccaccggc 240cctgagtgcc ggaacgagac
actgtacctg ctgtacaacc gggagggcca gacactggtg 300gagcggagca gcacctgggt
gaaaaaagtg atctggtatc tgagcggccg gaaccagacc 360atcctgcagc ggatgcccag
aaccgccagc aagcccagcg acggcaacgt gcagatcagc 420gtggaggacg ccaaaatctt
cggcgcccac atggtgccca agcagaccaa gctgctgaga 480ttcgtggtca acgacggcac
cagatatcag atgtgcgtga tgaagctgga aagctgggcc 540cacgtgttcc gggactactc
cgtgagcttc caggtccggc tgaccttcac cgaggccaac 600aaccagacct acaccttctg
cacccacccc aacctgatcg tgtgataa 648

<210> SEQ ID NO 46
<211> LENGTH: 214
<212> TYPE: PRT
<213> ORGANISM: Human herpesvirus 5
<400> SEQUENCE: 46
Met Leu Arg Leu Leu Leu Arg His His Phe His Cys Leu Leu
Leu Cys1 5 10 15Ala Val
Trp Ala Thr Pro Cys Leu Ala Ser Pro Trp Ser Thr Leu Thr 20
25 30Ala Asn Gln Asn Pro Ser Pro Pro Trp
Ser Lys Leu Thr Tyr Ser Lys 35 40
45Pro His Asp Ala Ala Thr Phe Tyr Cys Pro Phe Leu Tyr Pro Ser Pro 50
55 60Pro Arg Ser Pro Leu Gln Phe Ser Gly
Phe Gln Arg Val Ser Thr Gly65 70 75
80Pro Glu Cys Arg Asn Glu Thr Leu Tyr Leu Leu Tyr Asn Arg
Glu Gly 85 90 95Gln Thr
Leu Val Glu Arg Ser Ser Thr Trp Val Lys Lys Val Ile Trp 100
105 110Tyr Leu Ser Gly Arg Asn Gln Thr Ile
Leu Gln Arg Met Pro Arg Thr 115 120
125Ala Ser Lys Pro Ser Asp Gly Asn Val Gln Ile Ser Val Glu Asp Ala
130 135 140Lys Ile Phe Gly Ala His Met
Val Pro Lys Gln Thr Lys Leu Leu Arg145 150
155 160Phe Val Val Asn Asp Gly Thr Arg Tyr Gln Met Cys
Val Met Lys Leu 165 170
175Glu Ser Trp Ala His Val Phe Arg Asp Tyr Ser Val Ser Phe Gln Val
180 185 190Arg Leu Thr Phe Thr Glu
Ala Asn Asn Gln Thr Tyr Thr Phe Cys Thr 195 200
205His Pro Asn Leu Ile Val 210
47 393 DNA Human herpesvirus 5
47
atgcggctgt gcagagtgtg gctgtccgtg tgcctgtgtg ccgtggtgct gggccagtgc
60cagagagaga cagccgagaa gaacgactac taccgggtgc cccactactg ggatgcctgc
120agcagagccc tgcccgacca gacccggtac aaatacgtgg agcagctcgt ggacctgacc
180ctgaactacc actacgacgc cagccacggc ctggacaact tcgacgtgct gaagcggatc
240aacgtgaccg aggtgtccct gctgatcagc gacttccggc ggcagaacag aagaggcggc
300accaacaagc ggaccacctt caacgccgct ggctctctgg cccctcacgc cagatccctg
360gaattcagcg tgcggctgtt cgccaactga taa
393
48 129 PRT Human herpesvirus 5
48
Met Arg Leu Cys Arg Val Trp Leu Ser Val
Cys Leu Cys Ala Val Val1 5 10
15Leu Gly Gln Cys Gln Arg Glu Thr Ala Glu Lys Asn Asp Tyr Tyr Arg
20 25 30Val Pro His Tyr Trp Asp
Ala Cys Ser Arg Ala Leu Pro Asp Gln Thr 35 40
45Arg Tyr Lys Tyr Val Glu Gln Leu Val Asp Leu Thr Leu Asn
Tyr His 50 55 60Tyr Asp Ala Ser His
Gly Leu Asp Asn Phe Asp Val Leu Lys Arg Ile65 70
75 80Asn Val Thr Glu Val Ser Leu Leu Ile Ser
Asp Phe Arg Arg Gln Asn 85 90
95Arg Arg Gly Gly Thr Asn Lys Arg Thr Thr Phe Asn Ala Ala Gly Ser
100 105 110Leu Ala Pro His Ala
Arg Ser Leu Glu Phe Ser Val Arg Leu Phe Ala 115
120 125Asn
49 550 DNA Encephalomyocarditis virus
49
aacgttactg
gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 60tccaccatat
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 120acgagcattc
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 180gtgaaggaag
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 240tgcaggcagc
ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 300taagatacac
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 360gaaagagtca
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 420gtaccccatt
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 480tcgaggttaa
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa 540acacgataat
550
50 678 DNA Human enterovirus 71
50
gtacctttgt acgcctgttt tataccccct ccctgatttg caacttagaa
gcaacgcaaa 60ccagatcaat agtaggtgtg acataccagt cgcatcttga tcaagcactt
ctgtatcccc 120ggaccgagta tcaatagact gtgcacacgg ttgaaggaga aaacgtccgt
tacccggcta 180actacttcga gaagcctagt aacgccattg aagttgcaga gtgtttcgct
cagcactccc 240cccgtgtaga tcaggtcgat gagtcaccgc attccccacg ggcgaccgtg
gcggtggctg 300cgttggcggc ctgcctatgg ggtaacccat aggacgctct aatacggaca
tggcgtgaag 360agtctattga gctagttagt agtcctccgg cccctgaatg cggctaatcc
taactgcgga 420gcacataccc ttaatccaaa gggcagtgtg tcgtaacggg caactctgca
gcggaaccga 480ctactttggg tgtccgtgtt tctttttatt cttgtattgg ctgcttatgg
tgacaattaa 540agaattgtta ccatatagct attggattgg ccatccagtg tcaaacagag
ctattgtata 600tctctttgtt ggattcacac ctctcactct tgaaacgtta cacaccctca
attacattat 660actgctgaac acgaagcg
678
51 24 DNA Venezuelan equine encephalitis virus
51
ctctctacgg
ctaacctgaa tgga
24
52 14071 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide"
52
cgcgtcggct acaattaata
cataacctta tgtatcatac acatacgatt taggtgacac 60tatagatggg cggcgcatga
gagaagccca gaccaattac ctacccaaaa tggagaaagt 120tcacgttgac atcgaggaag
acagcccatt cctcagagct ttgcagcgga gcttcccgca 180gtttgaggta gaagccaagc
aggtcactga taatgaccat gctaatgcca gagcgttttc 240gcatctggct tcaaaactga
tcgaaacgga ggtggaccca tccgacacga tccttgacat 300tggaagtgcg cccgcccgca
gaatgtattc taagcacaag tatcattgta tctgtccgat 360gagatgtgcg gaagatccgg
acagattgta taagtatgca actaagctga agaaaaactg 420taaggaaata actgataagg
aattggacaa gaaaatgaag gagctcgccg ccgtcatgag 480cgaccctgac ctggaaactg
agactatgtg cctccacgac gacgagtcgt gtcgctacga 540agggcaagtc gctgtttacc
aggatgtata cgcggttgac ggaccgacaa gtctctatca 600ccaagccaat aagggagtta
gagtcgccta ctggataggc tttgacacca ccccttttat 660gtttaagaac ttggctggag
catatccatc atactctacc aactgggccg acgaaaccgt 720gttaacggct cgtaacatag
gcctatgcag ctctgacgtt atggagcggt cacgtagagg 780gatgtccatt cttagaaaga
agtatttgaa accatccaac aatgttctat tctctgttgg 840ctcgaccatc taccacgaga
agagggactt actgaggagc tggcacctgc cgtctgtatt 900tcacttacgt ggcaagcaaa
attacacatg tcggtgtgag actatagtta gttgcgacgg 960gtacgtcgtt aaaagaatag
ctatcagtcc aggcctgtat gggaagcctt caggctatgc 1020tgctacgatg caccgcgagg
gattcttgtg ctgcaaagtg acagacacat tgaacgggga 1080gagggtctct tttcccgtgt
gcacgtatgt gccagctaca ttgtgtgacc aaatgactgg 1140catactggca acagatgtca
gtgcggacga cgcgcaaaaa ctgctggttg ggctcaacca 1200gcgtatagtc gtcaacggtc
gcacccagag aaacaccaat accatgaaaa attacctttt 1260gcccgtagtg gcccaggcat
ttgctaggtg ggcaaaggaa tataaggaag atcaagaaga 1320tgaaaggcca ctaggactac
gagatagaca gttagtcatg gggtgttgtt gggcttttag 1380aaggcacaag ataacatcta
tttataagcg cccggatacc caaaccatca tcaaagtgaa 1440cagcgatttc cactcattcg
tgctgcccag gataggcagt aacacattgg agatcgggct 1500gagaacaaga atcaggaaaa
tgttagagga gcacaaggag ccgtcacctc tcattaccgc 1560cgaggacgta caagaagcta
agtgcgcagc cgatgaggct aaggaggtgc gtgaagccga 1620ggagttgcgc gcagctctac
cacctttggc agctgatgtt gaggagccca ctctggaagc 1680cgatgtagac ttgatgttac
aagaggctgg ggccggctca gtggagacac ctcgtggctt 1740gataaaggtt accagctacg
ctggcgagga caagatcggc tcttacgctg tgctttctcc 1800gcaggctgta ctcaagagtg
aaaaattatc ttgcatccac cctctcgctg aacaagtcat 1860agtgataaca cactctggcc
gaaaagggcg ttatgccgtg gaaccatacc atggtaaagt 1920agtggtgcca gagggacatg
caatacccgt ccaggacttt caagctctga gtgaaagtgc 1980caccattgtg tacaacgaac
gtgagttcgt aaacaggtac ctgcaccata ttgccacaca 2040tggaggagcg ctgaacactg
atgaagaata ttacaaaact gtcaagccca gcgagcacga 2100cggcgaatac ctgtacgaca
tcgacaggaa acagtgcgtc aagaaagaac tagtcactgg 2160gctagggctc acaggcgagc
tggtggatcc tcccttccat gaattcgcct acgagagtct 2220gagaacacga ccagccgctc
cttaccaagt accaaccata ggggtgtatg gcgtgccagg 2280atcaggcaag tctggcatca
ttaaaagcgc agtcaccaaa aaagatctag tggtgagcgc 2340caagaaagaa aactgtgcag
aaattataag ggacgtcaag aaaatgaaag ggctggacgt 2400caatgccaga actgtggact
cagtgctctt gaatggatgc aaacaccccg tagagaccct 2460gtatattgac gaagcttttg
cttgtcatgc aggtactctc agagcgctca tagccattat 2520aagacctaaa aaggcagtgc
tctgcgggga tcccaaacag tgcggttttt ttaacatgat 2580gtgcctgaaa gtgcatttta
accacgagat ttgcacacaa gtcttccaca aaagcatctc 2640tcgccgttgc actaaatctg
tgacttcggt cgtctcaacc ttgttttacg acaaaaaaat 2700gagaacgacg aatccgaaag
agactaagat tgtgattgac actaccggca gtaccaaacc 2760taagcaggac gatctcattc
tcacttgttt cagagggtgg gtgaagcagt tgcaaataga 2820ttacaaaggc aacgaaataa
tgacggcagc tgcctctcaa gggctgaccc gtaaaggtgt 2880gtatgccgtt cggtacaagg
tgaatgaaaa tcctctgtac gcacccacct cagaacatgt 2940gaacgtccta ctgacccgca
cggaggaccg catcgtgtgg aaaacactag ccggcgaccc 3000atggataaaa acactgactg
ccaagtaccc tgggaatttc actgccacga tagaggagtg 3060gcaagcagag catgatgcca
tcatgaggca catcttggag agaccggacc ctaccgacgt 3120cttccagaat aaggcaaacg
tgtgttgggc caaggcttta gtgccggtgc tgaagaccgc 3180tggcatagac atgaccactg
aacaatggaa cactgtggat tattttgaaa cggacaaagc 3240tcactcagca gagatagtat
tgaaccaact atgcgtgagg ttctttggac tcgatctgga 3300ctccggtcta ttttctgcac
ccactgttcc gttatccatt aggaataatc actgggataa 3360ctccccgtcg cctaacatgt
acgggctgaa taaagaagtg gtccgtcagc tctctcgcag 3420gtacccacaa ctgcctcggg
cagttgccac tggaagagtc tatgacatga acactggtac 3480actgcgcaat tatgatccgc
gcataaacct agtacctgta aacagaagac tgcctcatgc 3540tttagtcctc caccataatg
aacacccaca gagtgacttt tcttcattcg tcagcaaatt 3600gaagggcaga actgtcctgg
tggtcgggga aaagttgtcc gtcccaggca aaatggttga 3660ctggttgtca gaccggcctg
aggctacctt cagagctcgg ctggatttag gcatcccagg 3720tgatgtgccc aaatatgaca
taatatttgt taatgtgagg accccatata aataccatca 3780ctatcagcag tgtgaagacc
atgccattaa gcttagcatg ttgaccaaga aagcttgtct 3840gcatctgaat cccggcggaa
cctgtgtcag cataggttat ggttacgctg acagggccag 3900cgaaagcatc attggtgcta
tagcgcggca gttcaagttt tcccgggtat gcaaaccgaa 3960atcctcactt gaagagacgg
aagttctgtt tgtattcatt gggtacgatc gcaaggcccg 4020tacgcacaat ccttacaagc
tttcatcaac cttgaccaac atttatacag gttccagact 4080ccacgaagcc ggatgtgcac
cctcatatca tgtggtgcga ggggatattg ccacggccac 4140cgaaggagtg attataaatg
ctgctaacag caaaggacaa cctggcggag gggtgtgcgg 4200agcgctgtat aagaaattcc
cggaaagctt cgatttacag ccgatcgaag taggaaaagc 4260gcgactggtc aaaggtgcag
ctaaacatat cattcatgcc gtaggaccaa acttcaacaa 4320agtttcggag gttgaaggtg
acaaacagtt ggcagaggct tatgagtcca tcgctaagat 4380tgtcaacgat aacaattaca
agtcagtagc gattccactg ttgtccaccg gcatcttttc 4440cgggaacaaa gatcgactaa
cccaatcatt gaaccatttg ctgacagctt tagacaccac 4500tgatgcagat gtagccatat
actgcaggga caagaaatgg gaaatgactc tcaaggaagc 4560agtggctagg agagaagcag
tggaggagat atgcatatcc gacgactctt cagtgacaga 4620acctgatgca gagctggtga
gggtgcatcc gaagagttct ttggctggaa ggaagggcta 4680cagcacaagc gatggcaaaa
ctttctcata tttggaaggg accaagtttc accaggcggc 4740caaggatata gcagaaatta
atgccatgtg gcccgttgca acggaggcca atgagcaggt 4800atgcatgtat atcctcggag
aaagcatgag cagtattagg tcgaaatgcc ccgtcgaaga 4860gtcggaagcc tcctcaccac
ctagcacgct gccttgcttg tgcatccatg ccatgactcc 4920agaaagagta cagcgcctaa
aagcctcacg tccagaacaa attactgtgt gctcatcctt 4980tccattgccg aagtatagaa
tcactggtgt gcagaagatc caatgctccc agcctatatt 5040gttctcaccg aaagtgcctg
cgtatattca tccaaggaag tatctcgtgg aaacaccacc 5100ggtagacgag actccggagc
catcggcaga gaaccaatcc acagagggga cacctgaaca 5160accaccactt ataaccgagg
atgagaccag gactagaacg cctgagccga tcatcatcga 5220agaggaagaa gaggatagca
taagtttgct gtcagatggc ccgacccacc aggtgctgca 5280agtcgaggca gacattcacg
ggccgccctc tgtatctagc tcatcctggt ccattcctca 5340tgcatccgac tttgatgtgg
acagtttatc catacttgac accctggagg gagctagcgt 5400gaccagcggg gcaacgtcag
ccgagactaa ctcttacttc gcaaagagta tggagtttct 5460ggcgcgaccg gtgcctgcgc
ctcgaacagt attcaggaac cctccacatc ccgctccgcg 5520cacaagaaca ccgtcacttg
cacccagcag ggcctgctcg agagggatca cgggagaaac 5580cgtgggatac gcggttacac
acaatagcga gggcttcttg ctatgcaaag ttactgacac 5640agtaaaagga gaacgggtat
cgttccctgt gtgcacgtac atcccggcca ccataaactc 5700gagaaccagc ctggtctcca
acccgccagg cgtaaatagg gtgattacaa gagaggagtt 5760tgaggcgttc gtagcacaac
aacaatgacg gtttgatgcg ggtgcataca tcttttcctc 5820cgacaccggt caagggcatt
tacaacaaaa atcagtaagg caaacggtgc tatccgaagt 5880ggtgttggag aggaccgaat
tggagatttc gtatgccccg cgcctcgacc aagaaaaaga 5940agaattacta cgcaagaaat
tacagttaaa tcccacacct gctaacagaa gcagatacca 6000gtccaggaag gtggagaaca
tgaaagccat aacagctaga cgtattctgc aaggcctagg 6060gcattatttg aaggcagaag
gaaaagtgga gtgctaccga accctgcatc ctgttccttt 6120gtattcatct agtgtgaacc
gtgccttttc aagccccaag gtcgcagtgg aagcctgtaa 6180cgccatgttg aaagagaact
ttccgactgt ggcttcttac tgtattattc cagagtacga 6240tgcctatttg gacatggttg
acggagcttc atgctgctta gacactgcca gtttttgccc 6300tgcaaagctg cgcagctttc
caaagaaaca ctcctatttg gaacccacaa tacgatcggc 6360agtgccttca gcgatccaga
acacgctcca gaacgtcctg gcagctgcca caaaaagaaa 6420ttgcaatgtc acgcaaatga
gagaattgcc cgtattggat tcggcggcct ttaatgtgga 6480atgcttcaag aaatatgcgt
gtaataatga atattgggaa acgtttaaag aaaaccccat 6540caggcttact gaagaaaacg
tggtaaatta cattaccaaa ttaaaaggac caaaagctgc 6600tgctcttttt gcgaagacac
ataatttgaa tatgttgcag gacataccaa tggacaggtt 6660tgtaatggac ttaaagagag
acgtgaaagt gactccagga acaaaacata ctgaagaacg 6720gcccaaggta caggtgatcc
aggctgccga tccgctagca acagcgtatc tgtgcggaat 6780ccaccgagag ctggttagga
gattaaatgc ggtcctgctt ccgaacattc atacactgtt 6840tgatatgtcg gctgaagact
ttgacgctat tatagccgag cacttccagc ctggggattg 6900tgttctggaa actgacatcg
cgtcgtttga taaaagtgag gacgacgcca tggctctgac 6960cgcgttaatg attctggaag
acttaggtgt ggacgcagag ctgttgacgc tgattgaggc 7020ggctttcggc gaaatttcat
caatacattt gcccactaaa actaaattta aattcggagc 7080catgatgaaa tctggaatgt
tcctcacact gtttgtgaac acagtcatta acattgtaat 7140cgcaagcaga gtgttgagag
aacggctaac cggatcacca tgtgcagcat tcattggaga 7200tgacaatatc gtgaaaggag
tcaaatcgga caaattaatg gcagacaggt gcgccacctg 7260gttgaatatg gaagtcaaga
ttatagatgc tgtggtgggc gagaaagcgc cttatttctg 7320tggagggttt attttgtgtg
actccgtgac cggcacagcg tgccgtgtgg cagaccccct 7380aaaaaggctg tttaagcttg
gcaaacctct ggcagcagac gatgaacatg atgatgacag 7440gagaagggca ttgcatgaag
agtcaacacg ctggaaccga gtgggtattc tttcagagct 7500gtgcaaggca gtagaatcaa
ggtatgaaac cgtaggaact tccatcatag ttatggccat 7560gactactcta gctagcagtg
ttaaatcatt cagctacctg agaggggccc ctataactct 7620ctacggctaa cctgaatgga
ctacgacata gtctagtcga cgccaccatg aggcctggcc 7680tgccctccta cctgatcatc
ctggccgtgt gcctgttcag ccacctgctg tccagcagat 7740acggcgccga ggccgtgagc
gagcccctgg acaaggcttt ccacctgctg ctgaacacct 7800acggcagacc catccggttt
ctgcgggaga acaccaccca gtgcacctac aacagcagcc 7860tgcggaacag caccgtcgtg
agagagaacg ccatcagctt caactttttc cagagctaca 7920accagtacta cgtgttccac
atgcccagat gcctgtttgc cggccctctg gccgagcagt 7980tcctgaacca ggtggacctg
accgagacac tggaaagata ccagcagcgg ctgaatacct 8040acgccctggt gtccaaggac
ctggccagct accggtcctt tagccagcag ctcaaggctc 8100aggatagcct cggcgagcag
cctaccaccg tgccccctcc catcgacctg agcatccccc 8160acgtgtggat gcctccccag
accacccctc acggctggac cgagagccac accacctccg 8220gcctgcacag accccacttc
aaccagacct gcatcctgtt cgacggccac gacctgctgt 8280ttagcaccgt gaccccctgc
ctgcaccagg gcttctacct gatcgacgag ctgagatacg 8340tgaagatcac cctgaccgag
gatttcttcg tggtcaccgt gtccatcgac gacgacaccc 8400ccatgctgct gatcttcggc
cacctgccca gagtgctgtt caaggccccc taccagcggg 8460acaacttcat cctgcggcag
accgagaagc acgagctgct ggtgctggtc aagaaggacc 8520agctgaaccg gcactcctac
ctgaaggacc ccgacttcct ggacgccgcc ctggacttca 8580actacctgga cctgagcgcc
ctgctgagaa acagcttcca cagatacgcc gtggacgtgc 8640tgaagtccgg acggtgccag
atgctcgatc ggcggaccgt ggagatggcc ttcgcctatg 8700ccctcgccct gttcgccgct
gccagacagg aagaggctgg cgcccaggtg tcagtgccca 8760gagccctgga tagacaggcc
gccctgctgc agatccagga attcatgatc acctgcctga 8820gccagacccc ccctagaacc
accctgctgc tgtaccccac agccgtggat ctggccaaga 8880gggccctgtg gacccccaac
cagatcaccg acatcacaag cctcgtgcgg ctcgtgtaca 8940tcctgagcaa gcagaaccag
cagcacctga tcccccagtg ggccctgaga cagatcgccg 9000acttcgccct gaagctgcac
aagacccatc tggccagctt tctgagcgcc ttcgccaggc 9060aggaactgta cctgatgggc
agcctggtcc acagcatgct ggtgcatacc accgagcggc 9120gggagatctt catcgtggag
acaggcctgt gtagcctggc cgagctgtcc cactttaccc 9180agctgctggc ccaccctcac
cacgagtacc tgagcgacct gtacaccccc tgcagcagca 9240gcggcagacg ggaccacagc
ctggaacggc tgaccagact gttccccgat gccaccgtgc 9300ctgctacagt gcctgccgcc
ctgtccatcc tgtccaccat gcagcccagc accctggaaa 9360ccttccccga cctgttctgc
ctgcccctgg gcgagagctt tagcgccctg accgtgtccg 9420agcacgtgtc ctacatcgtg
accaatcagt acctgatcaa gggcatcagc taccccgtgt 9480ccaccacagt cgtgggccag
agcctgatca tcacccagac cgacagccag accaagtgcg 9540agctgacccg gaacatgcac
accacacaca gcatcaccgt ggccctgaac atcagcctgg 9600aaaactgcgc tttctgtcag
tctgccctgc tggaatacga cgatacccag ggcgtgatca 9660acatcatgta catgcacgac
agcgacgacg tgctgttcgc cctggacccc tacaacgagg 9720tggtggtgtc cagcccccgg
acccactacc tgatgctgct gaagaacggc accgtgctgg 9780aagtgaccga cgtggtggtg
gacgccaccg actgataatc tagacggcgc gcccacccag 9840cggccgccta taactctcta
cggctaacct gaatggacta cgacatagtc tagtcgacgc 9900caccatgtgc agaaggcccg
actgcggctt cagcttcagc cctggacccg tgatcctgct 9960gtggtgctgc ctgctgctgc
ctatcgtgtc ctctgccgcc gtgtctgtgg cccctacagc 10020cgccgagaag gtgccagccg
agtgccccga gctgaccaga agatgcctgc tgggcgaggt 10080gttcgagggc gacaagtacg
agagctggct gcggcccctg gtcaacgtga ccggcagaga 10140tggccccctg agccagctga
tccggtacag acccgtgacc cccgaggccg ccaatagcgt 10200gctgctggac gaggccttcc
tggataccct ggccctgctg tacaacaacc ccgaccagct 10260gagagccctg ctgaccctgc
tgtccagcga caccgccccc agatggatga ccgtgatgcg 10320gggctacagc gagtgtggag
atggcagccc tgccgtgtac acctgcgtgg acgacctgtg 10380cagaggctac gacctgacca
gactgagcta cggccggtcc atcttcacag agcacgtgct 10440gggcttcgag ctggtgcccc
ccagcctgtt caacgtggtg gtggccatcc ggaacgaggc 10500caccagaacc aacagagccg
tgcggctgcc tgtgtctaca gccgctgcac ctgagggcat 10560cacactgttc tacggcctgt
acaacgccgt gaaagagttc tgcctccggc accagctgga 10620tccccccctg ctgagacacc
tggacaagta ctacgccggc ctgcccccag agctgaagca 10680gaccagagtg aacctgcccg
cccacagcag atatggccct caggccgtgg acgccagatg 10740ataatctaga cggcgcgccc
acccaatcga tgtacttccg aggaactcac gtgcataatg 10800catcaggctg gtacattaga
tccccgctta ccgcgggcaa tatagcaaca ctaaaaactc 10860gatgtacttc cgaggaagcg
cagtgcataa tgctgcgcag tgttgccaca taaccactat 10920attaaccatt tatctagcgg
acgccaaaaa ctcaatgtat ttctgaggaa gcgtggtgca 10980taatgccacg cagcgtctgc
ataactttta ttatttcttt tattaatcaa caaaattttg 11040tttttaacat ttcaaaaaaa
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaagggtcgg 11100catggcatct ccacctcctc
gcggtccgac ctgggcatcc gaaggaggac gcacgtccac 11160tcggatggct aagggagagc
cacgagctcc tgtttaaacc agctccaatt cgccctatag 11220tgagtcgtat tacgcgcgct
cactggccgt cgttttacaa cgtcgtgact gggaaaaccc 11280tggcgttacc caacttaatc
gccttgcagc acatccccct ttcgccagct ggcgtaatag 11340cgaagaggcc cgcaccgatc
gcccttccca acagttgcgc agcctgaatg gcgaatggga 11400cgcgccctgt agcggcgcat
taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc 11460tacacttgcc agcgccctag
cgcccgctcc tttcgctttc ttcccttcct ttctcgccac 11520gttcgccggc tttccccgtc
aagctctaaa tcgggggctc cctttagggt tccgatttag 11580tgctttacgg cacctcgacc
ccaaaaaact tgattagggt gatggttcac gtagtgggcc 11640atcgccctga tagacggttt
ttcgcccttt gacgttggag tccacgttct ttaatagtgg 11700actcttgttc caaactggaa
caacactcaa ccctatctcg gtctattctt ttgatttata 11760agggattttg ccgatttcgg
cctattggtt aaaaaatgag ctgatttaac aaaaatttaa 11820cgcgaatttt aacaaaatat
taacgcttac aatttaggtg gcacttttcg gggaaatgtg 11880cgcggaaccc ctatttgttt
atttttctaa atacattcaa atatgtatcc gctcatgaga 11940caataaccct gataaatgct
tcaataatat tgaaaaagga agagtatgag tattcaacat 12000ttccgtgtcg cccttattcc
cttttttgcg gcattttgcc ttcctgtttt tgctcaccca 12060gaaacgctgg tgaaagtaaa
agatgctgaa gatcagttgg gtgcacgagt gggttacatc 12120gaactggatc tcaacagcgg
taagatcctt gagagttttc gccccgaaga acgttttcca 12180atgatgagca cttttaaagt
tctgctatgt ggcgcggtat tatcccgtat tgacgccggg 12240caagagcaac tcggtcgccg
catacactat tctcagaatg acttggttga gtactcacca 12300gtcacagaaa agcatcttac
ggatggcatg acagtaagag aattatgcag tgctgccata 12360accatgagtg ataacactgc
ggccaactta cttctgacaa cgatcggagg accgaaggag 12420ctaaccgctt ttttgcacaa
catgggggat catgtaactc gccttgatcg ttgggaaccg 12480gagctgaatg aagccatacc
aaacgacgag cgtgacacca cgatgcctgt agcaatggca 12540acaacgttgc gcaaactatt
aactggcgaa ctacttactc tagcttcccg gcaacaatta 12600atagactgga tggaggcgga
taaagttgca ggaccacttc tgcgctcggc ccttccggct 12660ggctggttta ttgctgataa
atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca 12720gcactggggc cagatggtaa
gccctcccgt atcgtagtta tctacacgac ggggagtcag 12780gcaactatgg atgaacgaaa
tagacagatc gctgagatag gtgcctcact gattaagcat 12840tggtaactgt cagaccaagt
ttactcatat atactttaga ttgatttaaa acttcatttt 12900taatttaaaa ggatctaggt
gaagatcctt tttgataatc tcatgaccaa aatcccttaa 12960cgtgagtttt cgttccactg
agcgtcagac cccgtagaaa agatcaaagg atcttcttga 13020gatccttttt ttctgcgcgt
aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 13080gtggtttgtt tgccggatca
agagctacca actctttttc cgaaggtaac tggcttcagc 13140agagcgcaga taccaaatac
tgttcttcta gtgtagccgt agttaggcca ccacttcaag 13200aactctgtag caccgcctac
atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 13260agtggcgata agtcgtgtct
taccgggttg gactcaagac gatagttacc ggataaggcg 13320cagcggtcgg gctgaacggg
gggttcgtgc acacagccca gcttggagcg aacgacctac 13380accgaactga gatacctaca
gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 13440aaggcggaca ggtatccggt
aagcggcagg gtcggaacag gagagcgcac gagggagctt 13500ccagggggaa acgcctggta
tctttatagt cctgtcgggt ttcgccacct ctgacttgag 13560cgtcgatttt tgtgatgctc
gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 13620gcctttttac ggttcctggc
cttttgctgg ccttttgctc acatgttctt tcctgcgtta 13680tcccctgatt ctgtggataa
ccgtattacc gcctttgagt gagctgatac cgctcgccgc 13740agccgaacga ccgagcgcag
cgagtcagtg agcgaggaag cggaagagcg cccaatacgc 13800aaaccgcctc tccccgcgcg
ttggccgatt cattaatgca gctggcacga caggtttccc 13860gactggaaag cgggcagtga
gcgcaacgca attaatgtga gttagctcac tcattaggca 13920ccccaggctt tacactttat
gctcccggct cgtatgttgt gtggaattgt gagcggataa 13980caatttcaca caggaaacag
ctatgaccat gattacgcca agcgcgcaat taaccctcac 14040taaagggaac aaaagctggg
taccggcgcc a 14071
53 14152 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide"
53
cgcgtcggct acaattaata cataacctta tgtatcatac acatacgatt
taggtgacac 60tatagatggg cggcgcatga gagaagccca gaccaattac ctacccaaaa
tggagaaagt 120tcacgttgac atcgaggaag acagcccatt cctcagagct ttgcagcgga
gcttcccgca 180gtttgaggta gaagccaagc aggtcactga taatgaccat gctaatgcca
gagcgttttc 240gcatctggct tcaaaactga tcgaaacgga ggtggaccca tccgacacga
tccttgacat 300tggaagtgcg cccgcccgca gaatgtattc taagcacaag tatcattgta
tctgtccgat 360gagatgtgcg gaagatccgg acagattgta taagtatgca actaagctga
agaaaaactg 420taaggaaata actgataagg aattggacaa gaaaatgaag gagctcgccg
ccgtcatgag 480cgaccctgac ctggaaactg agactatgtg cctccacgac gacgagtcgt
gtcgctacga 540agggcaagtc gctgtttacc aggatgtata cgcggttgac ggaccgacaa
gtctctatca 600ccaagccaat aagggagtta gagtcgccta ctggataggc tttgacacca
ccccttttat 660gtttaagaac ttggctggag catatccatc atactctacc aactgggccg
acgaaaccgt 720gttaacggct cgtaacatag gcctatgcag ctctgacgtt atggagcggt
cacgtagagg 780gatgtccatt cttagaaaga agtatttgaa accatccaac aatgttctat
tctctgttgg 840ctcgaccatc taccacgaga agagggactt actgaggagc tggcacctgc
cgtctgtatt 900tcacttacgt ggcaagcaaa attacacatg tcggtgtgag actatagtta
gttgcgacgg 960gtacgtcgtt aaaagaatag ctatcagtcc aggcctgtat gggaagcctt
caggctatgc 1020tgctacgatg caccgcgagg gattcttgtg ctgcaaagtg acagacacat
tgaacgggga 1080gagggtctct tttcccgtgt gcacgtatgt gccagctaca ttgtgtgacc
aaatgactgg 1140catactggca acagatgtca gtgcggacga cgcgcaaaaa ctgctggttg
ggctcaacca 1200gcgtatagtc gtcaacggtc gcacccagag aaacaccaat accatgaaaa
attacctttt 1260gcccgtagtg gcccaggcat ttgctaggtg ggcaaaggaa tataaggaag
atcaagaaga 1320tgaaaggcca ctaggactac gagatagaca gttagtcatg gggtgttgtt
gggcttttag 1380aaggcacaag ataacatcta tttataagcg cccggatacc caaaccatca
tcaaagtgaa 1440cagcgatttc cactcattcg tgctgcccag gataggcagt aacacattgg
agatcgggct 1500gagaacaaga atcaggaaaa tgttagagga gcacaaggag ccgtcacctc
tcattaccgc 1560cgaggacgta caagaagcta agtgcgcagc cgatgaggct aaggaggtgc
gtgaagccga 1620ggagttgcgc gcagctctac cacctttggc agctgatgtt gaggagccca
ctctggaagc 1680cgatgtagac ttgatgttac aagaggctgg ggccggctca gtggagacac
ctcgtggctt 1740gataaaggtt accagctacg ctggcgagga caagatcggc tcttacgctg
tgctttctcc 1800gcaggctgta ctcaagagtg aaaaattatc ttgcatccac cctctcgctg
aacaagtcat 1860agtgataaca cactctggcc gaaaagggcg ttatgccgtg gaaccatacc
atggtaaagt 1920agtggtgcca gagggacatg caatacccgt ccaggacttt caagctctga
gtgaaagtgc 1980caccattgtg tacaacgaac gtgagttcgt aaacaggtac ctgcaccata
ttgccacaca 2040tggaggagcg ctgaacactg atgaagaata ttacaaaact gtcaagccca
gcgagcacga 2100cggcgaatac ctgtacgaca tcgacaggaa acagtgcgtc aagaaagaac
tagtcactgg 2160gctagggctc acaggcgagc tggtggatcc tcccttccat gaattcgcct
acgagagtct 2220gagaacacga ccagccgctc cttaccaagt accaaccata ggggtgtatg
gcgtgccagg 2280atcaggcaag tctggcatca ttaaaagcgc agtcaccaaa aaagatctag
tggtgagcgc 2340caagaaagaa aactgtgcag aaattataag ggacgtcaag aaaatgaaag
ggctggacgt 2400caatgccaga actgtggact cagtgctctt gaatggatgc aaacaccccg
tagagaccct 2460gtatattgac gaagcttttg cttgtcatgc aggtactctc agagcgctca
tagccattat 2520aagacctaaa aaggcagtgc tctgcgggga tcccaaacag tgcggttttt
ttaacatgat 2580gtgcctgaaa gtgcatttta accacgagat ttgcacacaa gtcttccaca
aaagcatctc 2640tcgccgttgc actaaatctg tgacttcggt cgtctcaacc ttgttttacg
acaaaaaaat 2700gagaacgacg aatccgaaag agactaagat tgtgattgac actaccggca
gtaccaaacc 2760taagcaggac gatctcattc tcacttgttt cagagggtgg gtgaagcagt
tgcaaataga 2820ttacaaaggc aacgaaataa tgacggcagc tgcctctcaa gggctgaccc
gtaaaggtgt 2880gtatgccgtt cggtacaagg tgaatgaaaa tcctctgtac gcacccacct
cagaacatgt 2940gaacgtccta ctgacccgca cggaggaccg catcgtgtgg aaaacactag
ccggcgaccc 3000atggataaaa acactgactg ccaagtaccc tgggaatttc actgccacga
tagaggagtg 3060gcaagcagag catgatgcca tcatgaggca catcttggag agaccggacc
ctaccgacgt 3120cttccagaat aaggcaaacg tgtgttgggc caaggcttta gtgccggtgc
tgaagaccgc 3180tggcatagac atgaccactg aacaatggaa cactgtggat tattttgaaa
cggacaaagc 3240tcactcagca gagatagtat tgaaccaact atgcgtgagg ttctttggac
tcgatctgga 3300ctccggtcta ttttctgcac ccactgttcc gttatccatt aggaataatc
actgggataa 3360ctccccgtcg cctaacatgt acgggctgaa taaagaagtg gtccgtcagc
tctctcgcag 3420gtacccacaa ctgcctcggg cagttgccac tggaagagtc tatgacatga
acactggtac 3480actgcgcaat tatgatccgc gcataaacct agtacctgta aacagaagac
tgcctcatgc 3540tttagtcctc caccataatg aacacccaca gagtgacttt tcttcattcg
tcagcaaatt 3600gaagggcaga actgtcctgg tggtcgggga aaagttgtcc gtcccaggca
aaatggttga 3660ctggttgtca gaccggcctg aggctacctt cagagctcgg ctggatttag
gcatcccagg 3720tgatgtgccc aaatatgaca taatatttgt taatgtgagg accccatata
aataccatca 3780ctatcagcag tgtgaagacc atgccattaa gcttagcatg ttgaccaaga
aagcttgtct 3840gcatctgaat cccggcggaa cctgtgtcag cataggttat ggttacgctg
acagggccag 3900cgaaagcatc attggtgcta tagcgcggca gttcaagttt tcccgggtat
gcaaaccgaa 3960atcctcactt gaagagacgg aagttctgtt tgtattcatt gggtacgatc
gcaaggcccg 4020tacgcacaat ccttacaagc tttcatcaac cttgaccaac atttatacag
gttccagact 4080ccacgaagcc ggatgtgcac cctcatatca tgtggtgcga ggggatattg
ccacggccac 4140cgaaggagtg attataaatg ctgctaacag caaaggacaa cctggcggag
gggtgtgcgg 4200agcgctgtat aagaaattcc cggaaagctt cgatttacag ccgatcgaag
taggaaaagc 4260gcgactggtc aaaggtgcag ctaaacatat cattcatgcc gtaggaccaa
acttcaacaa 4320agtttcggag gttgaaggtg acaaacagtt ggcagaggct tatgagtcca
tcgctaagat 4380tgtcaacgat aacaattaca agtcagtagc gattccactg ttgtccaccg
gcatcttttc 4440cgggaacaaa gatcgactaa cccaatcatt gaaccatttg ctgacagctt
tagacaccac 4500tgatgcagat gtagccatat actgcaggga caagaaatgg gaaatgactc
tcaaggaagc 4560agtggctagg agagaagcag tggaggagat atgcatatcc gacgactctt
cagtgacaga 4620acctgatgca gagctggtga gggtgcatcc gaagagttct ttggctggaa
ggaagggcta 4680cagcacaagc gatggcaaaa ctttctcata tttggaaggg accaagtttc
accaggcggc 4740caaggatata gcagaaatta atgccatgtg gcccgttgca acggaggcca
atgagcaggt 4800atgcatgtat atcctcggag aaagcatgag cagtattagg tcgaaatgcc
ccgtcgaaga 4860gtcggaagcc tcctcaccac ctagcacgct gccttgcttg tgcatccatg
ccatgactcc 4920agaaagagta cagcgcctaa aagcctcacg tccagaacaa attactgtgt
gctcatcctt 4980tccattgccg aagtatagaa tcactggtgt gcagaagatc caatgctccc
agcctatatt 5040gttctcaccg aaagtgcctg cgtatattca tccaaggaag tatctcgtgg
aaacaccacc 5100ggtagacgag actccggagc catcggcaga gaaccaatcc acagagggga
cacctgaaca 5160accaccactt ataaccgagg atgagaccag gactagaacg cctgagccga
tcatcatcga 5220agaggaagaa gaggatagca taagtttgct gtcagatggc ccgacccacc
aggtgctgca 5280agtcgaggca gacattcacg ggccgccctc tgtatctagc tcatcctggt
ccattcctca 5340tgcatccgac tttgatgtgg acagtttatc catacttgac accctggagg
gagctagcgt 5400gaccagcggg gcaacgtcag ccgagactaa ctcttacttc gcaaagagta
tggagtttct 5460ggcgcgaccg gtgcctgcgc ctcgaacagt attcaggaac cctccacatc
ccgctccgcg 5520cacaagaaca ccgtcacttg cacccagcag ggcctgctcg agagggatca
cgggagaaac 5580cgtgggatac gcggttacac acaatagcga gggcttcttg ctatgcaaag
ttactgacac 5640agtaaaagga gaacgggtat cgttccctgt gtgcacgtac atcccggcca
ccataaactc 5700gagaaccagc ctggtctcca acccgccagg cgtaaatagg gtgattacaa
gagaggagtt 5760tgaggcgttc gtagcacaac aacaatgacg gtttgatgcg ggtgcataca
tcttttcctc 5820cgacaccggt caagggcatt tacaacaaaa atcagtaagg caaacggtgc
tatccgaagt 5880ggtgttggag aggaccgaat tggagatttc gtatgccccg cgcctcgacc
aagaaaaaga 5940agaattacta cgcaagaaat tacagttaaa tcccacacct gctaacagaa
gcagatacca 6000gtccaggaag gtggagaaca tgaaagccat aacagctaga cgtattctgc
aaggcctagg 6060gcattatttg aaggcagaag gaaaagtgga gtgctaccga accctgcatc
ctgttccttt 6120gtattcatct agtgtgaacc gtgccttttc aagccccaag gtcgcagtgg
aagcctgtaa 6180cgccatgttg aaagagaact ttccgactgt ggcttcttac tgtattattc
cagagtacga 6240tgcctatttg gacatggttg acggagcttc atgctgctta gacactgcca
gtttttgccc 6300tgcaaagctg cgcagctttc caaagaaaca ctcctatttg gaacccacaa
tacgatcggc 6360agtgccttca gcgatccaga acacgctcca gaacgtcctg gcagctgcca
caaaaagaaa 6420ttgcaatgtc acgcaaatga gagaattgcc cgtattggat tcggcggcct
ttaatgtgga 6480atgcttcaag aaatatgcgt gtaataatga atattgggaa acgtttaaag
aaaaccccat 6540caggcttact gaagaaaacg tggtaaatta cattaccaaa ttaaaaggac
caaaagctgc 6600tgctcttttt gcgaagacac ataatttgaa tatgttgcag gacataccaa
tggacaggtt 6660tgtaatggac ttaaagagag acgtgaaagt gactccagga acaaaacata
ctgaagaacg 6720gcccaaggta caggtgatcc aggctgccga tccgctagca acagcgtatc
tgtgcggaat 6780ccaccgagag ctggttagga gattaaatgc ggtcctgctt ccgaacattc
atacactgtt 6840tgatatgtcg gctgaagact ttgacgctat tatagccgag cacttccagc
ctggggattg 6900tgttctggaa actgacatcg cgtcgtttga taaaagtgag gacgacgcca
tggctctgac 6960cgcgttaatg attctggaag acttaggtgt ggacgcagag ctgttgacgc
tgattgaggc 7020ggctttcggc gaaatttcat caatacattt gcccactaaa actaaattta
aattcggagc 7080catgatgaaa tctggaatgt tcctcacact gtttgtgaac acagtcatta
acattgtaat 7140cgcaagcaga gtgttgagag aacggctaac cggatcacca tgtgcagcat
tcattggaga 7200tgacaatatc gtgaaaggag tcaaatcgga caaattaatg gcagacaggt
gcgccacctg 7260gttgaatatg gaagtcaaga ttatagatgc tgtggtgggc gagaaagcgc
cttatttctg 7320tggagggttt attttgtgtg actccgtgac cggcacagcg tgccgtgtgg
cagaccccct 7380aaaaaggctg tttaagcttg gcaaacctct ggcagcagac gatgaacatg
atgatgacag 7440gagaagggca ttgcatgaag agtcaacacg ctggaaccga gtgggtattc
tttcagagct 7500gtgcaaggca gtagaatcaa ggtatgaaac cgtaggaact tccatcatag
ttatggccat 7560gactactcta gctagcagtg ttaaatcatt cagctacctg agaggggccc
ctataactct 7620ctacggctaa cctgaatgga ctacgacata gtctagtcga cgccaccatg
aggcctggcc 7680tgccctccta cctgatcatc ctggccgtgt gcctgttcag ccacctgctg
tccagcagat 7740acggcgccga ggccgtgagc gagcccctgg acaaggcttt ccacctgctg
ctgaacacct 7800acggcagacc catccggttt ctgcgggaga acaccaccca gtgcacctac
aacagcagcc 7860tgcggaacag caccgtcgtg agagagaacg ccatcagctt caactttttc
cagagctaca 7920accagtacta cgtgttccac atgcccagat gcctgtttgc cggccctctg
gccgagcagt 7980tcctgaacca ggtggacctg accgagacac tggaaagata ccagcagcgg
ctgaatacct 8040acgccctggt gtccaaggac ctggccagct accggtcctt tagccagcag
ctcaaggctc 8100aggatagcct cggcgagcag cctaccaccg tgccccctcc catcgacctg
agcatccccc 8160acgtgtggat gcctccccag accacccctc acggctggac cgagagccac
accacctccg 8220gcctgcacag accccacttc aaccagacct gcatcctgtt cgacggccac
gacctgctgt 8280ttagcaccgt gaccccctgc ctgcaccagg gcttctacct gatcgacgag
ctgagatacg 8340tgaagatcac cctgaccgag gatttcttcg tggtcaccgt gtccatcgac
gacgacaccc 8400ccatgctgct gatcttcggc cacctgccca gagtgctgtt caaggccccc
taccagcggg 8460acaacttcat cctgcggcag accgagaagc acgagctgct ggtgctggtc
aagaaggacc 8520agctgaaccg gcactcctac ctgaaggacc ccgacttcct ggacgccgcc
ctggacttca 8580actacctgga cctgagcgcc ctgctgagaa acagcttcca cagatacgcc
gtggacgtgc 8640tgaagtccgg acggtgccag atgctcgatc ggcggaccgt ggagatggcc
ttcgcctatg 8700ccctcgccct gttcgccgct gccagacagg aagaggctgg cgcccaggtg
tcagtgccca 8760gagccctgga tagacaggcc gccctgctgc agatccagga attcatgatc
acctgcctga 8820gccagacccc ccctagaacc accctgctgc tgtaccccac agccgtggat
ctggccaaga 8880gggccctgtg gacccccaac cagatcaccg acatcacaag cctcgtgcgg
ctcgtgtaca 8940tcctgagcaa gcagaaccag cagcacctga tcccccagtg ggccctgaga
cagatcgccg 9000acttcgccct gaagctgcac aagacccatc tggccagctt tctgagcgcc
ttcgccaggc 9060aggaactgta cctgatgggc agcctggtcc acagcatgct ggtgcatacc
accgagcggc 9120gggagatctt catcgtggag acaggcctgt gtagcctggc cgagctgtcc
cactttaccc 9180agctgctggc ccaccctcac cacgagtacc tgagcgacct gtacaccccc
tgcagcagca 9240gcggcagacg ggaccacagc ctggaacggc tgaccagact gttccccgat
gccaccgtgc 9300ctgctacagt gcctgccgcc ctgtccatcc tgtccaccat gcagcccagc
accctggaaa 9360ccttccccga cctgttctgc ctgcccctgg gcgagagctt tagcgccctg
accgtgtccg 9420agcacgtgtc ctacatcgtg accaatcagt acctgatcaa gggcatcagc
taccccgtgt 9480ccaccacagt cgtgggccag agcctgatca tcacccagac cgacagccag
accaagtgcg 9540agctgacccg gaacatgcac accacacaca gcatcaccgt ggccctgaac
atcagcctgg 9600aaaactgcgc tttctgtcag tctgccctgc tggaatacga cgatacccag
ggcgtgatca 9660acatcatgta catgcacgac agcgacgacg tgctgttcgc cctggacccc
tacaacgagg 9720tggtggtgtc cagcccccgg acccactacc tgatgctgct gaagaacggc
accgtgctgg 9780aagtgaccga cgtggtggtg gacgccaccg acagcagact gctgatgatg
agcgtgtacg 9840ccctgagcgc catcatcggc atctacctgc tgtaccggat gctgaaaacc
tgctgataat 9900ctagacggcg cgcccaccca gcggccgcct ataactctct acggctaacc
tgaatggact 9960acgacatagt ctagtcgacg ccaccatgtg cagaaggccc gactgcggct
tcagcttcag 10020ccctggaccc gtgatcctgc tgtggtgctg cctgctgctg cctatcgtgt
cctctgccgc 10080cgtgtctgtg gcccctacag ccgccgagaa ggtgccagcc gagtgccccg
agctgaccag 10140aagatgcctg ctgggcgagg tgttcgaggg cgacaagtac gagagctggc
tgcggcccct 10200ggtcaacgtg accggcagag atggccccct gagccagctg atccggtaca
gacccgtgac 10260ccccgaggcc gccaatagcg tgctgctgga cgaggccttc ctggataccc
tggccctgct 10320gtacaacaac cccgaccagc tgagagccct gctgaccctg ctgtccagcg
acaccgcccc 10380cagatggatg accgtgatgc ggggctacag cgagtgtgga gatggcagcc
ctgccgtgta 10440cacctgcgtg gacgacctgt gcagaggcta cgacctgacc agactgagct
acggccggtc 10500catcttcaca gagcacgtgc tgggcttcga gctggtgccc cccagcctgt
tcaacgtggt 10560ggtggccatc cggaacgagg ccaccagaac caacagagcc gtgcggctgc
ctgtgtctac 10620agccgctgca cctgagggca tcacactgtt ctacggcctg tacaacgccg
tgaaagagtt 10680ctgcctccgg caccagctgg atccccccct gctgagacac ctggacaagt
actacgccgg 10740cctgccccca gagctgaagc agaccagagt gaacctgccc gcccacagca
gatatggccc 10800tcaggccgtg gacgccagat gataatctag acggcgcgcc cacccaatcg
atgtacttcc 10860gaggaactca cgtgcataat gcatcaggct ggtacattag atccccgctt
accgcgggca 10920atatagcaac actaaaaact cgatgtactt ccgaggaagc gcagtgcata
atgctgcgca 10980gtgttgccac ataaccacta tattaaccat ttatctagcg gacgccaaaa
actcaatgta 11040tttctgagga agcgtggtgc ataatgccac gcagcgtctg cataactttt
attatttctt 11100ttattaatca acaaaatttt gtttttaaca tttcaaaaaa aaaaaaaaaa
aaaaaaaaaa 11160aaaaaaaaaa aaaagggtcg gcatggcatc tccacctcct cgcggtccga
cctgggcatc 11220cgaaggagga cgcacgtcca ctcggatggc taagggagag ccacgagctc
ctgtttaaac 11280cagctccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg
tcgttttaca 11340acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag
cacatccccc 11400tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc
aacagttgcg 11460cagcctgaat ggcgaatggg acgcgccctg tagcggcgca ttaagcgcgg
cgggtgtggt 11520ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc
ctttcgcttt 11580cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa
atcgggggct 11640ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac
ttgattaggg 11700tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt
tgacgttgga 11760gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca
accctatctc 11820ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt
taaaaaatga 11880gctgatttaa caaaaattta acgcgaattt taacaaaata ttaacgctta
caatttaggt 11940ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta
aatacattca 12000aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata
ttgaaaaagg 12060aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc
ggcattttgc 12120cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga
agatcagttg 12180ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct
tgagagtttt 12240cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg
tggcgcggta 12300ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta
ttctcagaat 12360gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat
gacagtaaga 12420gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt
acttctgaca 12480acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga
tcatgtaact 12540cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga
gcgtgacacc 12600acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga
actacttact 12660ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc
aggaccactt 12720ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc
cggtgagcgt 12780gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg
tatcgtagtt 12840atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat
cgctgagata 12900ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata
tatactttag 12960attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct
ttttgataat 13020ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga
ccccgtagaa 13080aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg
cttgcaaaca 13140aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc
aactcttttt 13200ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgttcttct
agtgtagccg 13260tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc
tctgctaatc 13320ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt
ggactcaaga 13380cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg
cacacagccc 13440agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagct
atgagaaagc 13500gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag
ggtcggaaca 13560ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag
tcctgtcggg 13620tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg
gcggagccta 13680tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg
gccttttgct 13740cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac
cgcctttgag 13800tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt
gagcgaggaa 13860gcggaagagc gcccaatacg caaaccgcct ctccccgcgc gttggccgat
tcattaatgc 13920agctggcacg acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc
aattaatgtg 13980agttagctca ctcattaggc accccaggct ttacacttta tgctcccggc
tcgtatgttg 14040tgtggaattg tgagcggata acaatttcac acaggaaaca gctatgacca
tgattacgcc 14100aagcgcgcaa ttaaccctca ctaaagggaa caaaagctgg gtaccggcgc
ca 14152
54 15512 DNA Artificial Sequence source/note="Description of Artificial Sequence Synthetic polynucleotide"
54 cgcgtcggct
acaattaata cataacctta tgtatcatac acatacgatt taggtgacac 60tatagatggg
cggcgcatga gagaagccca gaccaattac ctacccaaaa tggagaaagt 120tcacgttgac
atcgaggaag acagcccatt cctcagagct ttgcagcgga gcttcccgca 180gtttgaggta
gaagccaagc aggtcactga taatgaccat gctaatgcca gagcgttttc 240gcatctggct
tcaaaactga tcgaaacgga ggtggaccca tccgacacga tccttgacat 300tggaagtgcg
cccgcccgca gaatgtattc taagcacaag tatcattgta tctgtccgat 360gagatgtgcg
gaagatccgg acagattgta taagtatgca actaagctga agaaaaactg 420taaggaaata
actgataagg aattggacaa gaaaatgaag gagctcgccg ccgtcatgag 480cgaccctgac
ctggaaactg agactatgtg cctccacgac gacgagtcgt gtcgctacga 540agggcaagtc
gctgtttacc aggatgtata cgcggttgac ggaccgacaa gtctctatca 600ccaagccaat
aagggagtta gagtcgccta ctggataggc tttgacacca ccccttttat 660gtttaagaac
ttggctggag catatccatc atactctacc aactgggccg acgaaaccgt 720gttaacggct
cgtaacatag gcctatgcag ctctgacgtt atggagcggt cacgtagagg 780gatgtccatt
cttagaaaga agtatttgaa accatccaac aatgttctat tctctgttgg 840ctcgaccatc
taccacgaga agagggactt actgaggagc tggcacctgc cgtctgtatt 900tcacttacgt
ggcaagcaaa attacacatg tcggtgtgag actatagtta gttgcgacgg 960gtacgtcgtt
aaaagaatag ctatcagtcc aggcctgtat gggaagcctt caggctatgc 1020tgctacgatg
caccgcgagg gattcttgtg ctgcaaagtg acagacacat tgaacgggga 1080gagggtctct
tttcccgtgt gcacgtatgt gccagctaca ttgtgtgacc aaatgactgg 1140catactggca
acagatgtca gtgcggacga cgcgcaaaaa ctgctggttg ggctcaacca 1200gcgtatagtc
gtcaacggtc gcacccagag aaacaccaat accatgaaaa attacctttt 1260gcccgtagtg
gcccaggcat ttgctaggtg ggcaaaggaa tataaggaag atcaagaaga 1320tgaaaggcca
ctaggactac gagatagaca gttagtcatg gggtgttgtt gggcttttag 1380aaggcacaag
ataacatcta tttataagcg cccggatacc caaaccatca tcaaagtgaa 1440cagcgatttc
cactcattcg tgctgcccag gataggcagt aacacattgg agatcgggct 1500gagaacaaga
atcaggaaaa tgttagagga gcacaaggag ccgtcacctc tcattaccgc 1560cgaggacgta
caagaagcta agtgcgcagc cgatgaggct aaggaggtgc gtgaagccga 1620ggagttgcgc
gcagctctac cacctttggc agctgatgtt gaggagccca ctctggaagc 1680cgatgtagac
ttgatgttac aagaggctgg ggccggctca gtggagacac ctcgtggctt 1740gataaaggtt
accagctacg ctggcgagga caagatcggc tcttacgctg tgctttctcc 1800gcaggctgta
ctcaagagtg aaaaattatc ttgcatccac cctctcgctg aacaagtcat 1860agtgataaca
cactctggcc gaaaagggcg ttatgccgtg gaaccatacc atggtaaagt 1920agtggtgcca
gagggacatg caatacccgt ccaggacttt caagctctga gtgaaagtgc 1980caccattgtg
tacaacgaac gtgagttcgt aaacaggtac ctgcaccata ttgccacaca 2040tggaggagcg
ctgaacactg atgaagaata ttacaaaact gtcaagccca gcgagcacga 2100cggcgaatac
ctgtacgaca tcgacaggaa acagtgcgtc aagaaagaac tagtcactgg 2160gctagggctc
acaggcgagc tggtggatcc tcccttccat gaattcgcct acgagagtct 2220gagaacacga
ccagccgctc cttaccaagt accaaccata ggggtgtatg gcgtgccagg 2280atcaggcaag
tctggcatca ttaaaagcgc agtcaccaaa aaagatctag tggtgagcgc 2340caagaaagaa
aactgtgcag aaattataag ggacgtcaag aaaatgaaag ggctggacgt 2400caatgccaga
actgtggact cagtgctctt gaatggatgc aaacaccccg tagagaccct 2460gtatattgac
gaagcttttg cttgtcatgc aggtactctc agagcgctca tagccattat 2520aagacctaaa
aaggcagtgc tctgcgggga tcccaaacag tgcggttttt ttaacatgat 2580gtgcctgaaa
gtgcatttta accacgagat ttgcacacaa gtcttccaca aaagcatctc 2640tcgccgttgc
actaaatctg tgacttcggt cgtctcaacc ttgttttacg acaaaaaaat 2700gagaacgacg
aatccgaaag agactaagat tgtgattgac actaccggca gtaccaaacc 2760taagcaggac
gatctcattc tcacttgttt cagagggtgg gtgaagcagt tgcaaataga 2820ttacaaaggc
aacgaaataa tgacggcagc tgcctctcaa gggctgaccc gtaaaggtgt 2880gtatgccgtt
cggtacaagg tgaatgaaaa tcctctgtac gcacccacct cagaacatgt 2940gaacgtccta
ctgacccgca cggaggaccg catcgtgtgg aaaacactag ccggcgaccc 3000atggataaaa
acactgactg ccaagtaccc tgggaatttc actgccacga tagaggagtg 3060gcaagcagag
catgatgcca tcatgaggca catcttggag agaccggacc ctaccgacgt 3120cttccagaat
aaggcaaacg tgtgttgggc caaggcttta gtgccggtgc tgaagaccgc 3180tggcatagac
atgaccactg aacaatggaa cactgtggat tattttgaaa cggacaaagc 3240tcactcagca
gagatagtat tgaaccaact atgcgtgagg ttctttggac tcgatctgga 3300ctccggtcta
ttttctgcac ccactgttcc gttatccatt aggaataatc actgggataa 3360ctccccgtcg
cctaacatgt acgggctgaa taaagaagtg gtccgtcagc tctctcgcag 3420gtacccacaa
ctgcctcggg cagttgccac tggaagagtc tatgacatga acactggtac 3480actgcgcaat
tatgatccgc gcataaacct agtacctgta aacagaagac tgcctcatgc 3540tttagtcctc
caccataatg aacacccaca gagtgacttt tcttcattcg tcagcaaatt 3600gaagggcaga
actgtcctgg tggtcgggga aaagttgtcc gtcccaggca aaatggttga 3660ctggttgtca
gaccggcctg aggctacctt cagagctcgg ctggatttag gcatcccagg 3720tgatgtgccc
aaatatgaca taatatttgt taatgtgagg accccatata aataccatca 3780ctatcagcag
tgtgaagacc atgccattaa gcttagcatg ttgaccaaga aagcttgtct 3840gcatctgaat
cccggcggaa cctgtgtcag cataggttat ggttacgctg acagggccag 3900cgaaagcatc
attggtgcta tagcgcggca gttcaagttt tcccgggtat gcaaaccgaa 3960atcctcactt
gaagagacgg aagttctgtt tgtattcatt gggtacgatc gcaaggcccg 4020tacgcacaat
ccttacaagc tttcatcaac cttgaccaac atttatacag gttccagact 4080ccacgaagcc
ggatgtgcac cctcatatca tgtggtgcga ggggatattg ccacggccac 4140cgaaggagtg
attataaatg ctgctaacag caaaggacaa cctggcggag gggtgtgcgg 4200agcgctgtat
aagaaattcc cggaaagctt cgatttacag ccgatcgaag taggaaaagc 4260gcgactggtc
aaaggtgcag ctaaacatat cattcatgcc gtaggaccaa acttcaacaa 4320agtttcggag
gttgaaggtg acaaacagtt ggcagaggct tatgagtcca tcgctaagat 4380tgtcaacgat
aacaattaca agtcagtagc gattccactg ttgtccaccg gcatcttttc 4440cgggaacaaa
gatcgactaa cccaatcatt gaaccatttg ctgacagctt tagacaccac 4500tgatgcagat
gtagccatat actgcaggga caagaaatgg gaaatgactc tcaaggaagc 4560agtggctagg
agagaagcag tggaggagat atgcatatcc gacgactctt cagtgacaga 4620acctgatgca
gagctggtga gggtgcatcc gaagagttct ttggctggaa ggaagggcta 4680cagcacaagc
gatggcaaaa ctttctcata tttggaaggg accaagtttc accaggcggc 4740caaggatata
gcagaaatta atgccatgtg gcccgttgca acggaggcca atgagcaggt 4800atgcatgtat
atcctcggag aaagcatgag cagtattagg tcgaaatgcc ccgtcgaaga 4860gtcggaagcc
tcctcaccac ctagcacgct gccttgcttg tgcatccatg ccatgactcc 4920agaaagagta
cagcgcctaa aagcctcacg tccagaacaa attactgtgt gctcatcctt 4980tccattgccg
aagtatagaa tcactggtgt gcagaagatc caatgctccc agcctatatt 5040gttctcaccg
aaagtgcctg cgtatattca tccaaggaag tatctcgtgg aaacaccacc 5100ggtagacgag
actccggagc catcggcaga gaaccaatcc acagagggga cacctgaaca 5160accaccactt
ataaccgagg atgagaccag gactagaacg cctgagccga tcatcatcga 5220agaggaagaa
gaggatagca taagtttgct gtcagatggc ccgacccacc aggtgctgca 5280agtcgaggca
gacattcacg ggccgccctc tgtatctagc tcatcctggt ccattcctca 5340tgcatccgac
tttgatgtgg acagtttatc catacttgac accctggagg gagctagcgt 5400gaccagcggg
gcaacgtcag ccgagactaa ctcttacttc gcaaagagta tggagtttct 5460ggcgcgaccg
gtgcctgcgc ctcgaacagt attcaggaac cctccacatc ccgctccgcg 5520cacaagaaca
ccgtcacttg cacccagcag ggcctgctcg agagggatca cgggagaaac 5580cgtgggatac
gcggttacac acaatagcga gggcttcttg ctatgcaaag ttactgacac 5640agtaaaagga
gaacgggtat cgttccctgt gtgcacgtac atcccggcca ccataaactc 5700gagaaccagc
ctggtctcca acccgccagg cgtaaatagg gtgattacaa gagaggagtt 5760tgaggcgttc
gtagcacaac aacaatgacg gtttgatgcg ggtgcataca tcttttcctc 5820cgacaccggt
caagggcatt tacaacaaaa atcagtaagg caaacggtgc tatccgaagt 5880ggtgttggag
aggaccgaat tggagatttc gtatgccccg cgcctcgacc aagaaaaaga 5940agaattacta
cgcaagaaat tacagttaaa tcccacacct gctaacagaa gcagatacca 6000gtccaggaag
gtggagaaca tgaaagccat aacagctaga cgtattctgc aaggcctagg 6060gcattatttg
aaggcagaag gaaaagtgga gtgctaccga accctgcatc ctgttccttt 6120gtattcatct
agtgtgaacc gtgccttttc aagccccaag gtcgcagtgg aagcctgtaa 6180cgccatgttg
aaagagaact ttccgactgt ggcttcttac tgtattattc cagagtacga 6240tgcctatttg
gacatggttg acggagcttc atgctgctta gacactgcca gtttttgccc 6300tgcaaagctg
cgcagctttc caaagaaaca ctcctatttg gaacccacaa tacgatcggc 6360agtgccttca
gcgatccaga acacgctcca gaacgtcctg gcagctgcca caaaaagaaa 6420ttgcaatgtc
acgcaaatga gagaattgcc cgtattggat tcggcggcct ttaatgtgga 6480atgcttcaag
aaatatgcgt gtaataatga atattgggaa acgtttaaag aaaaccccat 6540caggcttact
gaagaaaacg tggtaaatta cattaccaaa ttaaaaggac caaaagctgc 6600tgctcttttt
gcgaagacac ataatttgaa tatgttgcag gacataccaa tggacaggtt 6660tgtaatggac
ttaaagagag acgtgaaagt gactccagga acaaaacata ctgaagaacg 6720gcccaaggta
caggtgatcc aggctgccga tccgctagca acagcgtatc tgtgcggaat 6780ccaccgagag
ctggttagga gattaaatgc ggtcctgctt ccgaacattc atacactgtt 6840tgatatgtcg
gctgaagact ttgacgctat tatagccgag cacttccagc ctggggattg 6900tgttctggaa
actgacatcg cgtcgtttga taaaagtgag gacgacgcca tggctctgac 6960cgcgttaatg
attctggaag acttaggtgt ggacgcagag ctgttgacgc tgattgaggc 7020ggctttcggc
gaaatttcat caatacattt gcccactaaa actaaattta aattcggagc 7080catgatgaaa
tctggaatgt tcctcacact gtttgtgaac acagtcatta acattgtaat 7140cgcaagcaga
gtgttgagag aacggctaac cggatcacca tgtgcagcat tcattggaga 7200tgacaatatc
gtgaaaggag tcaaatcgga caaattaatg gcagacaggt gcgccacctg 7260gttgaatatg
gaagtcaaga ttatagatgc tgtggtgggc gagaaagcgc cttatttctg 7320tggagggttt
attttgtgtg actccgtgac cggcacagcg tgccgtgtgg cagaccccct 7380aaaaaggctg
tttaagcttg gcaaacctct ggcagcagac gatgaacatg atgatgacag 7440gagaagggca
ttgcatgaag agtcaacacg ctggaaccga gtgggtattc tttcagagct 7500gtgcaaggca
gtagaatcaa ggtatgaaac cgtaggaact tccatcatag ttatggccat 7560gactactcta
gctagcagtg ttaaatcatt cagctacctg agaggggccc ctataactct 7620ctacggctaa
cctgaatgga ctacgacata gtctagtcga cgccaccatg aggcctggcc 7680tgccctccta
cctgatcatc ctggccgtgt gcctgttcag ccacctgctg tccagcagat 7740acggcgccga
ggccgtgagc gagcccctgg acaaggcttt ccacctgctg ctgaacacct 7800acggcagacc
catccggttt ctgcgggaga acaccaccca gtgcacctac aacagcagcc 7860tgcggaacag
caccgtcgtg agagagaacg ccatcagctt caactttttc cagagctaca 7920accagtacta
cgtgttccac atgcccagat gcctgtttgc cggccctctg gccgagcagt 7980tcctgaacca
ggtggacctg accgagacac tggaaagata ccagcagcgg ctgaatacct 8040acgccctggt
gtccaaggac ctggccagct accggtcctt tagccagcag ctcaaggctc 8100aggatagcct
cggcgagcag cctaccaccg tgccccctcc catcgacctg agcatccccc 8160acgtgtggat
gcctccccag accacccctc acggctggac cgagagccac accacctccg 8220gcctgcacag
accccacttc aaccagacct gcatcctgtt cgacggccac gacctgctgt 8280ttagcaccgt
gaccccctgc ctgcaccagg gcttctacct gatcgacgag ctgagatacg 8340tgaagatcac
cctgaccgag gatttcttcg tggtcaccgt gtccatcgac gacgacaccc 8400ccatgctgct
gatcttcggc cacctgccca gagtgctgtt caaggccccc taccagcggg 8460acaacttcat
cctgcggcag accgagaagc acgagctgct ggtgctggtc aagaaggacc 8520agctgaaccg
gcactcctac ctgaaggacc ccgacttcct ggacgccgcc ctggacttca 8580actacctgga
cctgagcgcc ctgctgagaa acagcttcca cagatacgcc gtggacgtgc 8640tgaagtccgg
acggtgccag atgctcgatc ggcggaccgt ggagatggcc ttcgcctatg 8700ccctcgccct
gttcgccgct gccagacagg aagaggctgg cgcccaggtg tcagtgccca 8760gagccctgga
tagacaggcc gccctgctgc agatccagga attcatgatc acctgcctga 8820gccagacccc
ccctagaacc accctgctgc tgtaccccac agccgtggat ctggccaaga 8880gggccctgtg
gacccccaac cagatcaccg acatcacaag cctcgtgcgg ctcgtgtaca 8940tcctgagcaa
gcagaaccag cagcacctga tcccccagtg ggccctgaga cagatcgccg 9000acttcgccct
gaagctgcac aagacccatc tggccagctt tctgagcgcc ttcgccaggc 9060aggaactgta
cctgatgggc agcctggtcc acagcatgct ggtgcatacc accgagcggc 9120gggagatctt
catcgtggag acaggcctgt gtagcctggc cgagctgtcc cactttaccc 9180agctgctggc
ccaccctcac cacgagtacc tgagcgacct gtacaccccc tgcagcagca 9240gcggcagacg
ggaccacagc ctggaacggc tgaccagact gttccccgat gccaccgtgc 9300ctgctacagt
gcctgccgcc ctgtccatcc tgtccaccat gcagcccagc accctggaaa 9360ccttccccga
cctgttctgc ctgcccctgg gcgagagctt tagcgccctg accgtgtccg 9420agcacgtgtc
ctacatcgtg accaatcagt acctgatcaa gggcatcagc taccccgtgt 9480ccaccacagt
cgtgggccag agcctgatca tcacccagac cgacagccag accaagtgcg 9540agctgacccg
gaacatgcac accacacaca gcatcaccgt ggccctgaac atcagcctgg 9600aaaactgcgc
tttctgtcag tctgccctgc tggaatacga cgatacccag ggcgtgatca 9660acatcatgta
catgcacgac agcgacgacg tgctgttcgc cctggacccc tacaacgagg 9720tggtggtgtc
cagcccccgg acccactacc tgatgctgct gaagaacggc accgtgctgg 9780aagtgaccga
cgtggtggtg gacgccaccg actgataatc tagacggcgc gcccacccag 9840cggccgccta
taactctcta cggctaacct gaatggacta cgacatagtc tagtcgacgc 9900caccatgtgc
agaaggcccg actgcggctt cagcttcagc cctggacccg tgatcctgct 9960gtggtgctgc
ctgctgctgc ctatcgtgtc ctctgccgcc gtgtctgtgg cccctacagc 10020cgccgagaag
gtgccagccg agtgccccga gctgaccaga agatgcctgc tgggcgaggt 10080gttcgagggc
gacaagtacg agagctggct gcggcccctg gtcaacgtga ccggcagaga 10140tggccccctg
agccagctga tccggtacag acccgtgacc cccgaggccg ccaatagcgt 10200gctgctggac
gaggccttcc tggataccct ggccctgctg tacaacaacc ccgaccagct 10260gagagccctg
ctgaccctgc tgtccagcga caccgccccc agatggatga ccgtgatgcg 10320gggctacagc
gagtgtggag atggcagccc tgccgtgtac acctgcgtgg acgacctgtg 10380cagaggctac
gacctgacca gactgagcta cggccggtcc atcttcacag agcacgtgct 10440gggcttcgag
ctggtgcccc ccagcctgtt caacgtggtg gtggccatcc ggaacgaggc 10500caccagaacc
aacagagccg tgcggctgcc tgtgtctaca gccgctgcac ctgagggcat 10560cacactgttc
tacggcctgt acaacgccgt gaaagagttc tgcctccggc accagctgga 10620tccccccctg
ctgagacacc tggacaagta ctacgccggc ctgcccccag agctgaagca 10680gaccagagtg
aacctgcccg cccacagcag atatggccct caggccgtgg acgccagatg 10740ataatctaga
cggcgcgccc acccaatcga tctataactc tctacggcta acctgaatgg 10800actacgacat
agtctagtcg acgccaccat gggcaagaaa gaaatgatca tggtcaaggg 10860catccccaag
atcatgctgc tgattagcat cacctttctg ctgctgtccc tgatcaactg 10920caacgtgctg
gtcaacagcc ggggcaccag aagatcctgg ccctacaccg tgctgtccta 10980ccggggcaaa
gagatcctga agaagcagaa agaggacatc ctgaagcggc tgatgagcac 11040cagcagcgac
ggctaccggt tcctgatgta ccccagccag cagaaattcc acgccatcgt 11100gatcagcatg
gacaagttcc cccaggacta catcctggcc ggacccatcc ggaacgacag 11160catcacccac
atgtggttcg acttctacag cacccagctg cggaagcccg ccaaatacgt 11220gtacagcgag
tacaaccaca ccgcccacaa gatcaccctg aggcctcccc cttgtggcac 11280cgtgcccagc
atgaactgcc tgagcgagat gctgaacgtg tccaagcgga acgacaccgg 11340cgagaagggc
tgcggcaact tcaccacctt caaccccatg ttcttcaacg tgccccggtg 11400gaacaccaag
ctgtacatcg gcagcaacaa agtgaacgtg gacagccaga ccatctactt 11460tctgggcctg
accgccctgc tgctgagata cgcccagcgg aactgcaccc ggtccttcta 11520cctggtcaac
gccatgagcc ggaacctgtt ccgggtgccc aagtacatca acggcaccaa 11580gctgaagaac
accatgcgga agctgaagcg gaagcaggcc ctggtcaaag agcagcccca 11640gaagaagaac
aagaagtccc agagcaccac caccccctac ctgagctaca ccacctccac 11700cgccttcaac
gtgaccacca acgtgaccta cagcgccaca gccgccgtga ccagagtggc 11760cacaagcacc
accggctacc ggcccgacag caactttatg aagtccatca tggccaccca 11820gctgagagat
ctggccacct gggtgtacac caccctgcgg tacagaaacg agcccttctg 11880caagcccgac
cggaacagaa ccgccgtgag cgagttcatg aagaataccc acgtgctgat 11940cagaaacgag
acaccctaca ccatctacgg caccctggac atgagcagcc tgtactacaa 12000cgagacaatg
agcgtggaga acgagacagc cagcgacaac aacgaaacca cccccacctc 12060ccccagcacc
cggttccagc ggaccttcat cgaccccctg tgggactacc tggacagcct 12120gctgttcctg
gacaagatcc ggaacttcag cctgcagctg cccgcctacg gcaatctgac 12180cccccctgag
cacagaaggg ccgccaacct gagcaccctg aacagcctgt ggtggtggag 12240ccagtgataa
tctagacggc gcgcccaccc accgcgggca atatagcaac actaaaaact 12300cgatgtactt
ccgaggaagc gcagtgcata atgctgcgca gtgttgccac ataaccacta 12360tattaaccat
ttatctagcg gacgccaaaa actcaatgta tttctgagga agcgtggtgc 12420ataatgccac
gcagcgtctg cataactttt attatttctt ttattaatca acaaaatttt 12480gtttttaaca
tttcaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaagggtcg 12540gcatggcatc
tccacctcct cgcggtccga cctgggcatc cgaaggagga cgcacgtcca 12600ctcggatggc
taagggagag ccacgagctc ctgtttaaac cagctccaat tcgccctata 12660gtgagtcgta
ttacgcgcgc tcactggccg tcgttttaca acgtcgtgac tgggaaaacc 12720ctggcgttac
ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 12780gcgaagaggc
ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggg 12840acgcgccctg
tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 12900ctacacttgc
cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 12960cgttcgccgg
ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 13020gtgctttacg
gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 13080catcgccctg
atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 13140gactcttgtt
ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 13200aagggatttt
gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 13260acgcgaattt
taacaaaata ttaacgctta caatttaggt ggcacttttc ggggaaatgt 13320gcgcggaacc
cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 13380acaataaccc
tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 13440tttccgtgtc
gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 13500agaaacgctg
gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 13560cgaactggat
ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 13620aatgatgagc
acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 13680gcaagagcaa
ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 13740agtcacagaa
aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 13800aaccatgagt
gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 13860gctaaccgct
tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 13920ggagctgaat
gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 13980aacaacgttg
cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 14040aatagactgg
atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc 14100tggctggttt
attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 14160agcactgggg
ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 14220ggcaactatg
gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 14280ttggtaactg
tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 14340ttaatttaaa
aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 14400acgtgagttt
tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 14460agatcctttt
tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 14520ggtggtttgt
ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 14580cagagcgcag
ataccaaata ctgttcttct agtgtagccg tagttaggcc accacttcaa 14640gaactctgta
gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 14700cagtggcgat
aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 14760gcagcggtcg
ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 14820caccgaactg
agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag 14880aaaggcggac
aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 14940tccaggggga
aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 15000gcgtcgattt
ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 15060ggccttttta
cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 15120atcccctgat
tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 15180cagccgaacg
accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 15240caaaccgcct
ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 15300cgactggaaa
gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 15360accccaggct
ttacacttta tgctcccggc tcgtatgttg tgtggaattg tgagcggata 15420acaatttcac
acaggaaaca gctatgacca tgattacgcc aagcgcgcaa ttaaccctca 15480ctaaagggaa
caaaagctgg gtaccggcgc ca
15512
55 15593 DNA Artificial Sequence source/note="Description of Artificial Sequence Synthetic polynucleotide"
55 cgcgtcggct acaattaata
cataacctta tgtatcatac acatacgatt taggtgacac 60tatagatggg cggcgcatga
gagaagccca gaccaattac ctacccaaaa tggagaaagt 120tcacgttgac atcgaggaag
acagcccatt cctcagagct ttgcagcgga gcttcccgca 180gtttgaggta gaagccaagc
aggtcactga taatgaccat gctaatgcca gagcgttttc 240gcatctggct tcaaaactga
tcgaaacgga ggtggaccca tccgacacga tccttgacat 300tggaagtgcg cccgcccgca
gaatgtattc taagcacaag tatcattgta tctgtccgat 360gagatgtgcg gaagatccgg
acagattgta taagtatgca actaagctga agaaaaactg 420taaggaaata actgataagg
aattggacaa gaaaatgaag gagctcgccg ccgtcatgag 480cgaccctgac ctggaaactg
agactatgtg cctccacgac gacgagtcgt gtcgctacga 540agggcaagtc gctgtttacc
aggatgtata cgcggttgac ggaccgacaa gtctctatca 600ccaagccaat aagggagtta
gagtcgccta ctggataggc tttgacacca ccccttttat 660gtttaagaac ttggctggag
catatccatc atactctacc aactgggccg acgaaaccgt 720gttaacggct cgtaacatag
gcctatgcag ctctgacgtt atggagcggt cacgtagagg 780gatgtccatt cttagaaaga
agtatttgaa accatccaac aatgttctat tctctgttgg 840ctcgaccatc taccacgaga
agagggactt actgaggagc tggcacctgc cgtctgtatt 900tcacttacgt ggcaagcaaa
attacacatg tcggtgtgag actatagtta gttgcgacgg 960gtacgtcgtt aaaagaatag
ctatcagtcc aggcctgtat gggaagcctt caggctatgc 1020tgctacgatg caccgcgagg
gattcttgtg ctgcaaagtg acagacacat tgaacgggga 1080gagggtctct tttcccgtgt
gcacgtatgt gccagctaca ttgtgtgacc aaatgactgg 1140catactggca acagatgtca
gtgcggacga cgcgcaaaaa ctgctggttg ggctcaacca 1200gcgtatagtc gtcaacggtc
gcacccagag aaacaccaat accatgaaaa attacctttt 1260gcccgtagtg gcccaggcat
ttgctaggtg ggcaaaggaa tataaggaag atcaagaaga 1320tgaaaggcca ctaggactac
gagatagaca gttagtcatg gggtgttgtt gggcttttag 1380aaggcacaag ataacatcta
tttataagcg cccggatacc caaaccatca tcaaagtgaa 1440cagcgatttc cactcattcg
tgctgcccag gataggcagt aacacattgg agatcgggct 1500gagaacaaga atcaggaaaa
tgttagagga gcacaaggag ccgtcacctc tcattaccgc 1560cgaggacgta caagaagcta
agtgcgcagc cgatgaggct aaggaggtgc gtgaagccga 1620ggagttgcgc gcagctctac
cacctttggc agctgatgtt gaggagccca ctctggaagc 1680cgatgtagac ttgatgttac
aagaggctgg ggccggctca gtggagacac ctcgtggctt 1740gataaaggtt accagctacg
ctggcgagga caagatcggc tcttacgctg tgctttctcc 1800gcaggctgta ctcaagagtg
aaaaattatc ttgcatccac cctctcgctg aacaagtcat 1860agtgataaca cactctggcc
gaaaagggcg ttatgccgtg gaaccatacc atggtaaagt 1920agtggtgcca gagggacatg
caatacccgt ccaggacttt caagctctga gtgaaagtgc 1980caccattgtg tacaacgaac
gtgagttcgt aaacaggtac ctgcaccata ttgccacaca 2040tggaggagcg ctgaacactg
atgaagaata ttacaaaact gtcaagccca gcgagcacga 2100cggcgaatac ctgtacgaca
tcgacaggaa acagtgcgtc aagaaagaac tagtcactgg 2160gctagggctc acaggcgagc
tggtggatcc tcccttccat gaattcgcct acgagagtct 2220gagaacacga ccagccgctc
cttaccaagt accaaccata ggggtgtatg gcgtgccagg 2280atcaggcaag tctggcatca
ttaaaagcgc agtcaccaaa aaagatctag tggtgagcgc 2340caagaaagaa aactgtgcag
aaattataag ggacgtcaag aaaatgaaag ggctggacgt 2400caatgccaga actgtggact
cagtgctctt gaatggatgc aaacaccccg tagagaccct 2460gtatattgac gaagcttttg
cttgtcatgc aggtactctc agagcgctca tagccattat 2520aagacctaaa aaggcagtgc
tctgcgggga tcccaaacag tgcggttttt ttaacatgat 2580gtgcctgaaa gtgcatttta
accacgagat ttgcacacaa gtcttccaca aaagcatctc 2640tcgccgttgc actaaatctg
tgacttcggt cgtctcaacc ttgttttacg acaaaaaaat 2700gagaacgacg aatccgaaag
agactaagat tgtgattgac actaccggca gtaccaaacc 2760taagcaggac gatctcattc
tcacttgttt cagagggtgg gtgaagcagt tgcaaataga 2820ttacaaaggc aacgaaataa
tgacggcagc tgcctctcaa gggctgaccc gtaaaggtgt 2880gtatgccgtt cggtacaagg
tgaatgaaaa tcctctgtac gcacccacct cagaacatgt 2940gaacgtccta ctgacccgca
cggaggaccg catcgtgtgg aaaacactag ccggcgaccc 3000atggataaaa acactgactg
ccaagtaccc tgggaatttc actgccacga tagaggagtg 3060gcaagcagag catgatgcca
tcatgaggca catcttggag agaccggacc ctaccgacgt 3120cttccagaat aaggcaaacg
tgtgttgggc caaggcttta gtgccggtgc tgaagaccgc 3180tggcatagac atgaccactg
aacaatggaa cactgtggat tattttgaaa cggacaaagc 3240tcactcagca gagatagtat
tgaaccaact atgcgtgagg ttctttggac tcgatctgga 3300ctccggtcta ttttctgcac
ccactgttcc gttatccatt aggaataatc actgggataa 3360ctccccgtcg cctaacatgt
acgggctgaa taaagaagtg gtccgtcagc tctctcgcag 3420gtacccacaa ctgcctcggg
cagttgccac tggaagagtc tatgacatga acactggtac 3480actgcgcaat tatgatccgc
gcataaacct agtacctgta aacagaagac tgcctcatgc 3540tttagtcctc caccataatg
aacacccaca gagtgacttt tcttcattcg tcagcaaatt 3600gaagggcaga actgtcctgg
tggtcgggga aaagttgtcc gtcccaggca aaatggttga 3660ctggttgtca gaccggcctg
aggctacctt cagagctcgg ctggatttag gcatcccagg 3720tgatgtgccc aaatatgaca
taatatttgt taatgtgagg accccatata aataccatca 3780ctatcagcag tgtgaagacc
atgccattaa gcttagcatg ttgaccaaga aagcttgtct 3840gcatctgaat cccggcggaa
cctgtgtcag cataggttat ggttacgctg acagggccag 3900cgaaagcatc attggtgcta
tagcgcggca gttcaagttt tcccgggtat gcaaaccgaa 3960atcctcactt gaagagacgg
aagttctgtt tgtattcatt gggtacgatc gcaaggcccg 4020tacgcacaat ccttacaagc
tttcatcaac cttgaccaac atttatacag gttccagact 4080ccacgaagcc ggatgtgcac
cctcatatca tgtggtgcga ggggatattg ccacggccac 4140cgaaggagtg attataaatg
ctgctaacag caaaggacaa cctggcggag gggtgtgcgg 4200agcgctgtat aagaaattcc
cggaaagctt cgatttacag ccgatcgaag taggaaaagc 4260gcgactggtc aaaggtgcag
ctaaacatat cattcatgcc gtaggaccaa acttcaacaa 4320agtttcggag gttgaaggtg
acaaacagtt ggcagaggct tatgagtcca tcgctaagat 4380tgtcaacgat aacaattaca
agtcagtagc gattccactg ttgtccaccg gcatcttttc 4440cgggaacaaa gatcgactaa
cccaatcatt gaaccatttg ctgacagctt tagacaccac 4500tgatgcagat gtagccatat
actgcaggga caagaaatgg gaaatgactc tcaaggaagc 4560agtggctagg agagaagcag
tggaggagat atgcatatcc gacgactctt cagtgacaga 4620acctgatgca gagctggtga
gggtgcatcc gaagagttct ttggctggaa ggaagggcta 4680cagcacaagc gatggcaaaa
ctttctcata tttggaaggg accaagtttc accaggcggc 4740caaggatata gcagaaatta
atgccatgtg gcccgttgca acggaggcca atgagcaggt 4800atgcatgtat atcctcggag
aaagcatgag cagtattagg tcgaaatgcc ccgtcgaaga 4860gtcggaagcc tcctcaccac
ctagcacgct gccttgcttg tgcatccatg ccatgactcc 4920agaaagagta cagcgcctaa
aagcctcacg tccagaacaa attactgtgt gctcatcctt 4980tccattgccg aagtatagaa
tcactggtgt gcagaagatc caatgctccc agcctatatt 5040gttctcaccg aaagtgcctg
cgtatattca tccaaggaag tatctcgtgg aaacaccacc 5100ggtagacgag actccggagc
catcggcaga gaaccaatcc acagagggga cacctgaaca 5160accaccactt ataaccgagg
atgagaccag gactagaacg cctgagccga tcatcatcga 5220agaggaagaa gaggatagca
taagtttgct gtcagatggc ccgacccacc aggtgctgca 5280agtcgaggca gacattcacg
ggccgccctc tgtatctagc tcatcctggt ccattcctca 5340tgcatccgac tttgatgtgg
acagtttatc catacttgac accctggagg gagctagcgt 5400gaccagcggg gcaacgtcag
ccgagactaa ctcttacttc gcaaagagta tggagtttct 5460ggcgcgaccg gtgcctgcgc
ctcgaacagt attcaggaac cctccacatc ccgctccgcg 5520cacaagaaca ccgtcacttg
cacccagcag ggcctgctcg agagggatca cgggagaaac 5580cgtgggatac gcggttacac
acaatagcga gggcttcttg ctatgcaaag ttactgacac 5640agtaaaagga gaacgggtat
cgttccctgt gtgcacgtac atcccggcca ccataaactc 5700gagaaccagc ctggtctcca
acccgccagg cgtaaatagg gtgattacaa gagaggagtt 5760tgaggcgttc gtagcacaac
aacaatgacg gtttgatgcg ggtgcataca tcttttcctc 5820cgacaccggt caagggcatt
tacaacaaaa atcagtaagg caaacggtgc tatccgaagt 5880ggtgttggag aggaccgaat
tggagatttc gtatgccccg cgcctcgacc aagaaaaaga 5940agaattacta cgcaagaaat
tacagttaaa tcccacacct gctaacagaa gcagatacca 6000gtccaggaag gtggagaaca
tgaaagccat aacagctaga cgtattctgc aaggcctagg 6060gcattatttg aaggcagaag
gaaaagtgga gtgctaccga accctgcatc ctgttccttt 6120gtattcatct agtgtgaacc
gtgccttttc aagccccaag gtcgcagtgg aagcctgtaa 6180cgccatgttg aaagagaact
ttccgactgt ggcttcttac tgtattattc cagagtacga 6240tgcctatttg gacatggttg
acggagcttc atgctgctta gacactgcca gtttttgccc 6300tgcaaagctg cgcagctttc
caaagaaaca ctcctatttg gaacccacaa tacgatcggc 6360agtgccttca gcgatccaga
acacgctcca gaacgtcctg gcagctgcca caaaaagaaa 6420ttgcaatgtc acgcaaatga
gagaattgcc cgtattggat tcggcggcct ttaatgtgga 6480atgcttcaag aaatatgcgt
gtaataatga atattgggaa acgtttaaag aaaaccccat 6540caggcttact gaagaaaacg
tggtaaatta cattaccaaa ttaaaaggac caaaagctgc 6600tgctcttttt gcgaagacac
ataatttgaa tatgttgcag gacataccaa tggacaggtt 6660tgtaatggac ttaaagagag
acgtgaaagt gactccagga acaaaacata ctgaagaacg 6720gcccaaggta caggtgatcc
aggctgccga tccgctagca acagcgtatc tgtgcggaat 6780ccaccgagag ctggttagga
gattaaatgc ggtcctgctt ccgaacattc atacactgtt 6840tgatatgtcg gctgaagact
ttgacgctat tatagccgag cacttccagc ctggggattg 6900tgttctggaa actgacatcg
cgtcgtttga taaaagtgag gacgacgcca tggctctgac 6960cgcgttaatg attctggaag
acttaggtgt ggacgcagag ctgttgacgc tgattgaggc 7020ggctttcggc gaaatttcat
caatacattt gcccactaaa actaaattta aattcggagc 7080catgatgaaa tctggaatgt
tcctcacact gtttgtgaac acagtcatta acattgtaat 7140cgcaagcaga gtgttgagag
aacggctaac cggatcacca tgtgcagcat tcattggaga 7200tgacaatatc gtgaaaggag
tcaaatcgga caaattaatg gcagacaggt gcgccacctg 7260gttgaatatg gaagtcaaga
ttatagatgc tgtggtgggc gagaaagcgc cttatttctg 7320tggagggttt attttgtgtg
actccgtgac cggcacagcg tgccgtgtgg cagaccccct 7380aaaaaggctg tttaagcttg
gcaaacctct ggcagcagac gatgaacatg atgatgacag 7440gagaagggca ttgcatgaag
agtcaacacg ctggaaccga gtgggtattc tttcagagct 7500gtgcaaggca gtagaatcaa
ggtatgaaac cgtaggaact tccatcatag ttatggccat 7560gactactcta gctagcagtg
ttaaatcatt cagctacctg agaggggccc ctataactct 7620ctacggctaa cctgaatgga
ctacgacata gtctagtcga cgccaccatg aggcctggcc 7680tgccctccta cctgatcatc
ctggccgtgt gcctgttcag ccacctgctg tccagcagat 7740acggcgccga ggccgtgagc
gagcccctgg acaaggcttt ccacctgctg ctgaacacct 7800acggcagacc catccggttt
ctgcgggaga acaccaccca gtgcacctac aacagcagcc 7860tgcggaacag caccgtcgtg
agagagaacg ccatcagctt caactttttc cagagctaca 7920accagtacta cgtgttccac
atgcccagat gcctgtttgc cggccctctg gccgagcagt 7980tcctgaacca ggtggacctg
accgagacac tggaaagata ccagcagcgg ctgaatacct 8040acgccctggt gtccaaggac
ctggccagct accggtcctt tagccagcag ctcaaggctc 8100aggatagcct cggcgagcag
cctaccaccg tgccccctcc catcgacctg agcatccccc 8160acgtgtggat gcctccccag
accacccctc acggctggac cgagagccac accacctccg 8220gcctgcacag accccacttc
aaccagacct gcatcctgtt cgacggccac gacctgctgt 8280ttagcaccgt gaccccctgc
ctgcaccagg gcttctacct gatcgacgag ctgagatacg 8340tgaagatcac cctgaccgag
gatttcttcg tggtcaccgt gtccatcgac gacgacaccc 8400ccatgctgct gatcttcggc
cacctgccca gagtgctgtt caaggccccc taccagcggg 8460acaacttcat cctgcggcag
accgagaagc acgagctgct ggtgctggtc aagaaggacc 8520agctgaaccg gcactcctac
ctgaaggacc ccgacttcct ggacgccgcc ctggacttca 8580actacctgga cctgagcgcc
ctgctgagaa acagcttcca cagatacgcc gtggacgtgc 8640tgaagtccgg acggtgccag
atgctcgatc ggcggaccgt ggagatggcc ttcgcctatg 8700ccctcgccct gttcgccgct
gccagacagg aagaggctgg cgcccaggtg tcagtgccca 8760gagccctgga tagacaggcc
gccctgctgc agatccagga attcatgatc acctgcctga 8820gccagacccc ccctagaacc
accctgctgc tgtaccccac agccgtggat ctggccaaga 8880gggccctgtg gacccccaac
cagatcaccg acatcacaag cctcgtgcgg ctcgtgtaca 8940tcctgagcaa gcagaaccag
cagcacctga tcccccagtg ggccctgaga cagatcgccg 9000acttcgccct gaagctgcac
aagacccatc tggccagctt tctgagcgcc ttcgccaggc 9060aggaactgta cctgatgggc
agcctggtcc acagcatgct ggtgcatacc accgagcggc 9120gggagatctt catcgtggag
acaggcctgt gtagcctggc cgagctgtcc cactttaccc 9180agctgctggc ccaccctcac
cacgagtacc tgagcgacct gtacaccccc tgcagcagca 9240gcggcagacg ggaccacagc
ctggaacggc tgaccagact gttccccgat gccaccgtgc 9300ctgctacagt gcctgccgcc
ctgtccatcc tgtccaccat gcagcccagc accctggaaa 9360ccttccccga cctgttctgc
ctgcccctgg gcgagagctt tagcgccctg accgtgtccg 9420agcacgtgtc ctacatcgtg
accaatcagt acctgatcaa gggcatcagc taccccgtgt 9480ccaccacagt cgtgggccag
agcctgatca tcacccagac cgacagccag accaagtgcg 9540agctgacccg gaacatgcac
accacacaca gcatcaccgt ggccctgaac atcagcctgg 9600aaaactgcgc tttctgtcag
tctgccctgc tggaatacga cgatacccag ggcgtgatca 9660acatcatgta catgcacgac
agcgacgacg tgctgttcgc cctggacccc tacaacgagg 9720tggtggtgtc cagcccccgg
acccactacc tgatgctgct gaagaacggc accgtgctgg 9780aagtgaccga cgtggtggtg
gacgccaccg acagcagact gctgatgatg agcgtgtacg 9840ccctgagcgc catcatcggc
atctacctgc tgtaccggat gctgaaaacc tgctgataat 9900ctagacggcg cgcccaccca
gcggccgcct ataactctct acggctaacc tgaatggact 9960acgacatagt ctagtcgacg
ccaccatgtg cagaaggccc gactgcggct tcagcttcag 10020ccctggaccc gtgatcctgc
tgtggtgctg cctgctgctg cctatcgtgt cctctgccgc 10080cgtgtctgtg gcccctacag
ccgccgagaa ggtgccagcc gagtgccccg agctgaccag 10140aagatgcctg ctgggcgagg
tgttcgaggg cgacaagtac gagagctggc tgcggcccct 10200ggtcaacgtg accggcagag
atggccccct gagccagctg atccggtaca gacccgtgac 10260ccccgaggcc gccaatagcg
tgctgctgga cgaggccttc ctggataccc tggccctgct 10320gtacaacaac cccgaccagc
tgagagccct gctgaccctg ctgtccagcg acaccgcccc 10380cagatggatg accgtgatgc
ggggctacag cgagtgtgga gatggcagcc ctgccgtgta 10440cacctgcgtg gacgacctgt
gcagaggcta cgacctgacc agactgagct acggccggtc 10500catcttcaca gagcacgtgc
tgggcttcga gctggtgccc cccagcctgt tcaacgtggt 10560ggtggccatc cggaacgagg
ccaccagaac caacagagcc gtgcggctgc ctgtgtctac 10620agccgctgca cctgagggca
tcacactgtt ctacggcctg tacaacgccg tgaaagagtt 10680ctgcctccgg caccagctgg
atccccccct gctgagacac ctggacaagt actacgccgg 10740cctgccccca gagctgaagc
agaccagagt gaacctgccc gcccacagca gatatggccc 10800tcaggccgtg gacgccagat
gataatctag acggcgcgcc cacccaatcg atctataact 10860ctctacggct aacctgaatg
gactacgaca tagtctagtc gacgccacca tgggcaagaa 10920agaaatgatc atggtcaagg
gcatccccaa gatcatgctg ctgattagca tcacctttct 10980gctgctgtcc ctgatcaact
gcaacgtgct ggtcaacagc cggggcacca gaagatcctg 11040gccctacacc gtgctgtcct
accggggcaa agagatcctg aagaagcaga aagaggacat 11100cctgaagcgg ctgatgagca
ccagcagcga cggctaccgg ttcctgatgt accccagcca 11160gcagaaattc cacgccatcg
tgatcagcat ggacaagttc ccccaggact acatcctggc 11220cggacccatc cggaacgaca
gcatcaccca catgtggttc gacttctaca gcacccagct 11280gcggaagccc gccaaatacg
tgtacagcga gtacaaccac accgcccaca agatcaccct 11340gaggcctccc ccttgtggca
ccgtgcccag catgaactgc ctgagcgaga tgctgaacgt 11400gtccaagcgg aacgacaccg
gcgagaaggg ctgcggcaac ttcaccacct tcaaccccat 11460gttcttcaac gtgccccggt
ggaacaccaa gctgtacatc ggcagcaaca aagtgaacgt 11520ggacagccag accatctact
ttctgggcct gaccgccctg ctgctgagat acgcccagcg 11580gaactgcacc cggtccttct
acctggtcaa cgccatgagc cggaacctgt tccgggtgcc 11640caagtacatc aacggcacca
agctgaagaa caccatgcgg aagctgaagc ggaagcaggc 11700cctggtcaaa gagcagcccc
agaagaagaa caagaagtcc cagagcacca ccacccccta 11760cctgagctac accacctcca
ccgccttcaa cgtgaccacc aacgtgacct acagcgccac 11820agccgccgtg accagagtgg
ccacaagcac caccggctac cggcccgaca gcaactttat 11880gaagtccatc atggccaccc
agctgagaga tctggccacc tgggtgtaca ccaccctgcg 11940gtacagaaac gagcccttct
gcaagcccga ccggaacaga accgccgtga gcgagttcat 12000gaagaatacc cacgtgctga
tcagaaacga gacaccctac accatctacg gcaccctgga 12060catgagcagc ctgtactaca
acgagacaat gagcgtggag aacgagacag ccagcgacaa 12120caacgaaacc acccccacct
cccccagcac ccggttccag cggaccttca tcgaccccct 12180gtgggactac ctggacagcc
tgctgttcct ggacaagatc cggaacttca gcctgcagct 12240gcccgcctac ggcaatctga
ccccccctga gcacagaagg gccgccaacc tgagcaccct 12300gaacagcctg tggtggtgga
gccagtgata atctagacgg cgcgcccacc caccgcgggc 12360aatatagcaa cactaaaaac
tcgatgtact tccgaggaag cgcagtgcat aatgctgcgc 12420agtgttgcca cataaccact
atattaacca tttatctagc ggacgccaaa aactcaatgt 12480atttctgagg aagcgtggtg
cataatgcca cgcagcgtct gcataacttt tattatttct 12540tttattaatc aacaaaattt
tgtttttaac atttcaaaaa aaaaaaaaaa aaaaaaaaaa 12600aaaaaaaaaa aaaaagggtc
ggcatggcat ctccacctcc tcgcggtccg acctgggcat 12660ccgaaggagg acgcacgtcc
actcggatgg ctaagggaga gccacgagct cctgtttaaa 12720ccagctccaa ttcgccctat
agtgagtcgt attacgcgcg ctcactggcc gtcgttttac 12780aacgtcgtga ctgggaaaac
cctggcgtta cccaacttaa tcgccttgca gcacatcccc 12840ctttcgccag ctggcgtaat
agcgaagagg cccgcaccga tcgcccttcc caacagttgc 12900gcagcctgaa tggcgaatgg
gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg 12960tggttacgcg cagcgtgacc
gctacacttg ccagcgccct agcgcccgct cctttcgctt 13020tcttcccttc ctttctcgcc
acgttcgccg gctttccccg tcaagctcta aatcgggggc 13080tccctttagg gttccgattt
agtgctttac ggcacctcga ccccaaaaaa cttgattagg 13140gtgatggttc acgtagtggg
ccatcgccct gatagacggt ttttcgccct ttgacgttgg 13200agtccacgtt ctttaatagt
ggactcttgt tccaaactgg aacaacactc aaccctatct 13260cggtctattc ttttgattta
taagggattt tgccgatttc ggcctattgg ttaaaaaatg 13320agctgattta acaaaaattt
aacgcgaatt ttaacaaaat attaacgctt acaatttagg 13380tggcactttt cggggaaatg
tgcgcggaac ccctatttgt ttatttttct aaatacattc 13440aaatatgtat ccgctcatga
gacaataacc ctgataaatg cttcaataat attgaaaaag 13500gaagagtatg agtattcaac
atttccgtgt cgcccttatt cccttttttg cggcattttg 13560ccttcctgtt tttgctcacc
cagaaacgct ggtgaaagta aaagatgctg aagatcagtt 13620gggtgcacga gtgggttaca
tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 13680tcgccccgaa gaacgttttc
caatgatgag cacttttaaa gttctgctat gtggcgcggt 13740attatcccgt attgacgccg
ggcaagagca actcggtcgc cgcatacact attctcagaa 13800tgacttggtt gagtactcac
cagtcacaga aaagcatctt acggatggca tgacagtaag 13860agaattatgc agtgctgcca
taaccatgag tgataacact gcggccaact tacttctgac 13920aacgatcgga ggaccgaagg
agctaaccgc ttttttgcac aacatggggg atcatgtaac 13980tcgccttgat cgttgggaac
cggagctgaa tgaagccata ccaaacgacg agcgtgacac 14040cacgatgcct gtagcaatgg
caacaacgtt gcgcaaacta ttaactggcg aactacttac 14100tctagcttcc cggcaacaat
taatagactg gatggaggcg gataaagttg caggaccact 14160tctgcgctcg gcccttccgg
ctggctggtt tattgctgat aaatctggag ccggtgagcg 14220tgggtctcgc ggtatcattg
cagcactggg gccagatggt aagccctccc gtatcgtagt 14280tatctacacg acggggagtc
aggcaactat ggatgaacga aatagacaga tcgctgagat 14340aggtgcctca ctgattaagc
attggtaact gtcagaccaa gtttactcat atatacttta 14400gattgattta aaacttcatt
tttaatttaa aaggatctag gtgaagatcc tttttgataa 14460tctcatgacc aaaatccctt
aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 14520aaagatcaaa ggatcttctt
gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 14580aaaaaaacca ccgctaccag
cggtggtttg tttgccggat caagagctac caactctttt 14640tccgaaggta actggcttca
gcagagcgca gataccaaat actgttcttc tagtgtagcc 14700gtagttaggc caccacttca
agaactctgt agcaccgcct acatacctcg ctctgctaat 14760cctgttacca gtggctgctg
ccagtggcga taagtcgtgt cttaccgggt tggactcaag 14820acgatagtta ccggataagg
cgcagcggtc gggctgaacg gggggttcgt gcacacagcc 14880cagcttggag cgaacgacct
acaccgaact gagataccta cagcgtgagc tatgagaaag 14940cgccacgctt cccgaaggga
gaaaggcgga caggtatccg gtaagcggca gggtcggaac 15000aggagagcgc acgagggagc
ttccaggggg aaacgcctgg tatctttata gtcctgtcgg 15060gtttcgccac ctctgacttg
agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct 15120atggaaaaac gccagcaacg
cggccttttt acggttcctg gccttttgct ggccttttgc 15180tcacatgttc tttcctgcgt
tatcccctga ttctgtggat aaccgtatta ccgcctttga 15240gtgagctgat accgctcgcc
gcagccgaac gaccgagcgc agcgagtcag tgagcgagga 15300agcggaagag cgcccaatac
gcaaaccgcc tctccccgcg cgttggccga ttcattaatg 15360cagctggcac gacaggtttc
ccgactggaa agcgggcagt gagcgcaacg caattaatgt 15420gagttagctc actcattagg
caccccaggc tttacacttt atgctcccgg ctcgtatgtt 15480gtgtggaatt gtgagcggat
aacaatttca cacaggaaac agctatgacc atgattacgc 15540caagcgcgca attaaccctc
actaaaggga acaaaagctg ggtaccggcg cca 15593
56 15271 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide"
56
ataggcggcg catgagagaa gcccagacca attacctacc caaaatggag
aaagttcacg 60ttgacatcga ggaagacagc ccattcctca gagctttgca gcggagcttc
ccgcagtttg 120aggtagaagc caagcaggtc actgataatg accatgctaa tgccagagcg
ttttcgcatc 180tggcttcaaa actgatcgaa acggaggtgg acccatccga cacgatcctt
gacattggaa 240gtgcgcccgc ccgcagaatg tattctaagc acaagtatca ttgtatctgt
ccgatgagat 300gtgcggaaga tccggacaga ttgtataagt atgcaactaa gctgaagaaa
aactgtaagg 360aaataactga taaggaattg gacaagaaaa tgaaggagct cgccgccgtc
atgagcgacc 420ctgacctgga aactgagact atgtgcctcc acgacgacga gtcgtgtcgc
tacgaagggc 480aagtcgctgt ttaccaggat gtatacgcgg ttgacggacc gacaagtctc
tatcaccaag 540ccaataaggg agttagagtc gcctactgga taggctttga caccacccct
tttatgttta 600agaacttggc tggagcatat ccatcatact ctaccaactg ggccgacgaa
accgtgttaa 660cggctcgtaa cataggccta tgcagctctg acgttatgga gcggtcacgt
agagggatgt 720ccattcttag aaagaagtat ttgaaaccat ccaacaatgt tctattctct
gttggctcga 780ccatctacca cgagaagagg gacttactga ggagctggca cctgccgtct
gtatttcact 840tacgtggcaa gcaaaattac acatgtcggt gtgagactat agttagttgc
gacgggtacg 900tcgttaaaag aatagctatc agtccaggcc tgtatgggaa gccttcaggc
tatgctgcta 960cgatgcaccg cgagggattc ttgtgctgca aagtgacaga cacattgaac
ggggagaggg 1020tctcttttcc cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg
actggcatac 1080tggcaacaga tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc
aaccagcgta 1140tagtcgtcaa cggtcgcacc cagagaaaca ccaataccat gaaaaattac
cttttgcccg 1200tagtggccca ggcatttgct aggtgggcaa aggaatataa ggaagatcaa
gaagatgaaa 1260ggccactagg actacgagat agacagttag tcatggggtg ttgttgggct
tttagaaggc 1320acaagataac atctatttat aagcgcccgg atacccaaac catcatcaaa
gtgaacagcg 1380atttccactc attcgtgctg cccaggatag gcagtaacac attggagatc
gggctgagaa 1440caagaatcag gaaaatgtta gaggagcaca aggagccgtc acctctcatt
accgccgagg 1500acgtacaaga agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa
gccgaggagt 1560tgcgcgcagc tctaccacct ttggcagctg atgttgagga gcccactctg
gaagccgatg 1620tagacttgat gttacaagag gctggggccg gctcagtgga gacacctcgt
ggcttgataa 1680aggttaccag ctacgatggc gaggacaaga tcggctctta cgctgtgctt
tctccgcagg 1740ctgtactcaa gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa
gtcatagtga 1800taacacactc tggccgaaaa gggcgttatg ccgtggaacc ataccatggt
aaagtagtgg 1860tgccagaggg acatgcaata cccgtccagg actttcaagc tctgagtgaa
agtgccacca 1920ttgtgtacaa cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc
acacatggag 1980gagcgctgaa cactgatgaa gaatattaca aaactgtcaa gcccagcgag
cacgacggcg 2040aatacctgta cgacatcgac aggaaacagt gcgtcaagaa agaactagtc
actgggctag 2100ggctcacagg cgagctggtg gatcctccct tccatgaatt cgcctacgag
agtctgagaa 2160cacgaccagc cgctccttac caagtaccaa ccataggggt gtatggcgtg
ccaggatcag 2220gcaagtctgg catcattaaa agcgcagtca ccaaaaaaga tctagtggtg
agcgccaaga 2280aagaaaactg tgcagaaatt ataagggacg tcaagaaaat gaaagggctg
gacgtcaatg 2340ccagaactgt ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag
accctgtata 2400ttgacgaagc ttttgcttgt catgcaggta ctctcagagc gctcatagcc
attataagac 2460ctaaaaaggc agtgctctgc ggggatccca aacagtgcgg tttttttaac
atgatgtgcc 2520tgaaagtgca ttttaaccac gagatttgca cacaagtctt ccacaaaagc
atctctcgcc 2580gttgcactaa atctgtgact tcggtcgtct caaccttgtt ttacgacaaa
aaaatgagaa 2640cgacgaatcc gaaagagact aagattgtga ttgacactac cggcagtacc
aaacctaagc 2700aggacgatct cattctcact tgtttcagag ggtgggtgaa gcagttgcaa
atagattaca 2760aaggcaacga aataatgacg gcagctgcct ctcaagggct gacccgtaaa
ggtgtgtatg 2820ccgttcggta caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa
catgtgaacg 2880tcctactgac ccgcacggag gaccgcatcg tgtggaaaac actagccggc
gacccatgga 2940taaaaacact gactgccaag taccctggga atttcactgc cacgatagag
gagtggcaag 3000cagagcatga tgccatcatg aggcacatct tggagagacc ggaccctacc
gacgtcttcc 3060agaataaggc aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag
accgctggca 3120tagacatgac cactgaacaa tggaacactg tggattattt tgaaacggac
aaagctcact 3180cagcagagat agtattgaac caactatgcg tgaggttctt tggactcgat
ctggactccg 3240gtctattttc tgcacccact gttccgttat ccattaggaa taatcactgg
gataactccc 3300cgtcgcctaa catgtacggg ctgaataaag aagtggtccg tcagctctct
cgcaggtacc 3360cacaactgcc tcgggcagtt gccactggaa gagtctatga catgaacact
ggtacactgc 3420gcaattatga tccgcgcata aacctagtac ctgtaaacag aagactgcct
catgctttag 3480tcctccacca taatgaacac ccacagagtg acttttcttc attcgtcagc
aaattgaagg 3540gcagaactgt cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg
gttgactggt 3600tgtcagaccg gcctgaggct accttcagag ctcggctgga tttaggcatc
ccaggtgatg 3660tgcccaaata tgacataata tttgttaatg tgaggacccc atataaatac
catcactatc 3720agcagtgtga agaccatgcc attaagctta gcatgttgac caagaaagct
tgtctgcatc 3780tgaatcccgg cggaacctgt gtcagcatag gttatggtta cgctgacagg
gccagcgaaa 3840gcatcattgg tgctatagcg cggcagttca agttttcccg ggtatgcaaa
ccgaaatcct 3900cacttgaaga gacggaagtt ctgtttgtat tcattgggta cgatcgcaag
gcccgtacgc 3960acaatcctta caagctttca tcaaccttga ccaacattta tacaggttcc
agactccacg 4020aagccggatg tgcaccctca tatcatgtgg tgcgagggga tattgccacg
gccaccgaag 4080gagtgattat aaatgctgct aacagcaaag gacaacctgg cggaggggtg
tgcggagcgc 4140tgtataagaa attcccggaa agcttcgatt tacagccgat cgaagtagga
aaagcgcgac 4200tggtcaaagg tgcagctaaa catatcattc atgccgtagg accaaacttc
aacaaagttt 4260cggaggttga aggtgacaaa cagttggcag aggcttatga gtccatcgct
aagattgtca 4320acgataacaa ttacaagtca gtagcgattc cactgttgtc caccggcatc
ttttccggga 4380acaaagatcg actaacccaa tcattgaacc atttgctgac agctttagac
accactgatg 4440cagatgtagc catatactgc agggacaaga aatgggaaat gactctcaag
gaagcagtgg 4500ctaggagaga agcagtggag gagatatgca tatccgacga ctcttcagtg
acagaacctg 4560atgcagagct ggtgagggtg catccgaaga gttctttggc tggaaggaag
ggctacagca 4620caagcgatgg caaaactttc tcatatttgg aagggaccaa gtttcaccag
gcggccaagg 4680atatagcaga aattaatgcc atgtggcccg ttgcaacgga ggccaatgag
caggtatgca 4740tgtatatcct cggagaaagc atgagcagta ttaggtcgaa atgccccgtc
gaagagtcgg 4800aagcctccac accacctagc acgctgcctt gcttgtgcat ccatgccatg
actccagaaa 4860gagtacagcg cctaaaagcc tcacgtccag aacaaattac tgtgtgctca
tcctttccat 4920tgccgaagta tagaatcact ggtgtgcaga agatccaatg ctcccagcct
atattgttct 4980caccgaaagt gcctgcgtat attcatccaa ggaagtatct cgtggaaaca
ccaccggtag 5040acgagactcc ggagccatcg gcagagaacc aatccacaga ggggacacct
gaacaaccac 5100cacttataac cgaggatgag accaggacta gaacgcctga gccgatcatc
atcgaagagg 5160aagaagagga tagcataagt ttgctgtcag atggcccgac ccaccaggtg
ctgcaagtcg 5220aggcagacat tcacgggccg ccctctgtat ctagctcatc ctggtccatt
cctcatgcat 5280ccgactttga tgtggacagt ttatccatac ttgacaccct ggagggagct
agcgtgacca 5340gcggggcaac gtcagccgag actaactctt acttcgcaaa gagtatggag
tttctggcgc 5400gaccggtgcc tgcgcctcga acagtattca ggaaccctcc acatcccgct
ccgcgcacaa 5460gaacaccgtc acttgcaccc agcagggcct gctcgagaac cagcctagtt
tccaccccgc 5520caggcgtgaa tagggtgatc actagagagg agctcgaggc gcttaccccg
tcacgcactc 5580ctagcaggtc ggtctcgaga accagcctgg tctccaaccc gccaggcgta
aatagggtga 5640ttacaagaga ggagtttgag gcgttcgtag cacaacaaca atgacggttt
gatgcgggtg 5700catacatctt ttcctccgac accggtcaag ggcatttaca acaaaaatca
gtaaggcaaa 5760cggtgctatc cgaagtggtg ttggagagga ccgaattgga gatttcgtat
gccccgcgcc 5820tcgaccaaga aaaagaagaa ttactacgca agaaattaca gttaaatccc
acacctgcta 5880acagaagcag ataccagtcc aggaaggtgg agaacatgaa agccataaca
gctagacgta 5940ttctgcaagg cctagggcat tatttgaagg cagaaggaaa agtggagtgc
taccgaaccc 6000tgcatcctgt tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc
cccaaggtcg 6060cagtggaagc ctgtaacgcc atgttgaaag agaactttcc gactgtggct
tcttactgta 6120ttattccaga gtacgatgcc tatttggaca tggttgacgg agcttcatgc
tgcttagaca 6180ctgccagttt ttgccctgca aagctgcgca gctttccaaa gaaacactcc
tatttggaac 6240ccacaatacg atcggcagtg ccttcagcga tccagaacac gctccagaac
gtcctggcag 6300ctgccacaaa aagaaattgc aatgtcacgc aaatgagaga attgcccgta
ttggattcgg 6360cggcctttaa tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat
tgggaaacgt 6420ttaaagaaaa ccccatcagg cttactgaag aaaacgtggt aaattacatt
accaaattaa 6480aaggaccaaa agctgctgct ctttttgcga agacacataa tttgaatatg
ttgcaggaca 6540taccaatgga caggtttgta atggacttaa agagagacgt gaaagtgact
ccaggaacaa 6600aacatactga agaacggccc aaggtacagg tgatccaggc tgccgatccg
ctagcaacag 6660cgtatctgtg cggaatccac cgagagctgg ttaggagatt aaatgcggtc
ctgcttccga 6720acattcatac actgtttgat atgtcggctg aagactttga cgctattata
gccgagcact 6780tccagcctgg ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa
agtgaggacg 6840acgccatggc tctgaccgcg ttaatgattc tggaagactt aggtgtggac
gcagagctgt 6900tgacgctgat tgaggcggct ttcggcgaaa tttcatcaat acatttgccc
actaaaacta 6960aatttaaatt cggagccatg atgaaatctg gaatgttcct cacactgttt
gtgaacacag 7020tcattaacat tgtaatcgca agcagagtgt tgagagaacg gctaaccgga
tcaccatgtg 7080cagcattcat tggagatgac aatatcgtga aaggagtcaa atcggacaaa
ttaatggcag 7140acaggtgcgc cacctggttg aatatggaag tcaagattat agatgctgtg
gtgggcgaga 7200aagcgcctta tttctgtgga gggtttattt tgtgtgactc cgtgaccggc
acagcgtgcc 7260gtgtggcaga ccccctaaaa aggctgttta agcttggcaa acctctggca
gcagacgatg 7320aacatgatga tgacaggaga agggcattgc atgaagagtc aacacgctgg
aaccgagtgg 7380gtattctttc agagctgtgc aaggcagtag aatcaaggta tgaaaccgta
ggaacttcca 7440tcatagttat ggccatgact actctagcta gcagtgttaa atcattcagc
tacctgagag 7500gggcccctat aactctctac ggctaacctg aatggactac gacatagtct
agtccgccaa 7560gatgaggcct ggcctgccct cctacctgat catcctggcc gtgtgcctgt
tcagccacct 7620gctgtccagc agatacggcg ccgaggccgt gagcgagccc ctggacaagg
ctttccacct 7680gctgctgaac acctacggca gacccatccg gtttctgcgg gagaacacca
cccagtgcac 7740ctacaacagc agcctgcgga acagcaccgt cgtgagagag aacgccatca
gcttcaactt 7800tttccagagc tacaaccagt actacgtgtt ccacatgccc agatgcctgt
ttgccggccc 7860tctggccgag cagttcctga accaggtgga cctgaccgag acactggaaa
gataccagca 7920gcggctgaat acctacgccc tggtgtccaa ggacctggcc agctaccggt
cctttagcca 7980gcagctcaag gctcaggata gcctcggcga gcagcctacc accgtgcccc
ctcccatcga 8040cctgagcatc ccccacgtgt ggatgcctcc ccagaccacc cctcacggct
ggaccgagag 8100ccacaccacc tccggcctgc acagacccca cttcaaccag acctgcatcc
tgttcgacgg 8160ccacgacctg ctgtttagca ccgtgacccc ctgcctgcac cagggcttct
acctgatcga 8220cgagctgaga tacgtgaaga tcaccctgac cgaggatttc ttcgtggtca
ccgtgtccat 8280cgacgacgac acccccatgc tgctgatctt cggccacctg cccagagtgc
tgttcaaggc 8340cccctaccag cgggacaact tcatcctgcg gcagaccgag aagcacgagc
tgctggtgct 8400ggtcaagaag gaccagctga accggcactc ctacctgaag gaccccgact
tcctggacgc 8460cgccctggac ttcaactacc tggacctgag cgccctgctg agaaacagct
tccacagata 8520cgccgtggac gtgctgaagt ccggacggtg ccagatgctc gatcggcgga
ccgtggagat 8580ggccttcgcc tatgccctcg ccctgttcgc cgctgccaga caggaagagg
ctggcgccca 8640ggtgtcagtg cccagagccc tggatagaca ggccgccctg ctgcagatcc
aggaattcat 8700gatcacctgc ctgagccaga ccccccctag aaccaccctg ctgctgtacc
ccacagccgt 8760ggatctggcc aagagggccc tgtggacccc caaccagatc accgacatca
caagcctcgt 8820gcggctcgtg tacatcctga gcaagcagaa ccagcagcac ctgatccccc
agtgggccct 8880gagacagatc gccgacttcg ccctgaagct gcacaagacc catctggcca
gctttctgag 8940cgccttcgcc aggcaggaac tgtacctgat gggcagcctg gtccacagca
tgctggtgca 9000taccaccgag cggcgggaga tcttcatcgt ggagacaggc ctgtgtagcc
tggccgagct 9060gtcccacttt acccagctgc tggcccaccc tcaccacgag tacctgagcg
acctgtacac 9120cccctgcagc agcagcggca gacgggacca cagcctggaa cggctgacca
gactgttccc 9180cgatgccacc gtgcctgcta cagtgcctgc cgccctgtcc atcctgtcca
ccatgcagcc 9240cagcaccctg gaaaccttcc ccgacctgtt ctgcctgccc ctgggcgaga
gctttagcgc 9300cctgaccgtg tccgagcacg tgtcctacat cgtgaccaat cagtacctga
tcaagggcat 9360cagctacccc gtgtccacca cagtcgtggg ccagagcctg atcatcaccc
agaccgacag 9420ccagaccaag tgcgagctga cccggaacat gcacaccaca cacagcatca
ccgtggccct 9480gaacatcagc ctggaaaact gcgctttctg tcagtctgcc ctgctggaat
acgacgatac 9540ccagggcgtg atcaacatca tgtacatgca cgacagcgac gacgtgctgt
tcgccctgga 9600cccctacaac gaggtggtgg tgtccagccc ccggacccac tacctgatgc
tgctgaagaa 9660cggcaccgtg ctggaagtga ccgacgtggt ggtggacgcc accgacagca
gactgctgat 9720gatgagcgtg tacgccctga gcgccatcat cggcatctac ctgctgtacc
ggatgctgaa 9780aacctgctga taatctagag gcccctataa ctctctacgg ctaacctgaa
tggactacga 9840catagtctag tccgccaaga tgtgcagaag gcccgactgc ggcttcagct
tcagccctgg 9900acccgtgatc ctgctgtggt gctgcctgct gctgcctatc gtgtcctctg
ccgccgtgtc 9960tgtggcccct acagccgccg agaaggtgcc agccgagtgc cccgagctga
ccagaagatg 10020cctgctgggc gaggtgttcg agggcgacaa gtacgagagc tggctgcggc
ccctggtcaa 10080cgtgaccggc agagatggcc ccctgagcca gctgatccgg tacagacccg
tgacccccga 10140ggccgccaat agcgtgctgc tggacgaggc cttcctggat accctggccc
tgctgtacaa 10200caaccccgac cagctgagag ccctgctgac cctgctgtcc agcgacaccg
cccccagatg 10260gatgaccgtg atgcggggct acagcgagtg tggagatggc agccctgccg
tgtacacctg 10320cgtggacgac ctgtgcagag gctacgacct gaccagactg agctacggcc
ggtccatctt 10380cacagagcac gtgctgggct tcgagctggt gccccccagc ctgttcaacg
tggtggtggc 10440catccggaac gaggccacca gaaccaacag agccgtgcgg ctgcctgtgt
ctacagccgc 10500tgcacctgag ggcatcacac tgttctacgg cctgtacaac gccgtgaaag
agttctgcct 10560ccggcaccag ctggatcccc ccctgctgag acacctggac aagtactacg
ccggcctgcc 10620cccagagctg aagcagacca gagtgaacct gcccgcccac agcagatatg
gccctcaggc 10680cgtggacgcc agatgataac gccggcggcc cctataactc tctacggcta
acctgaatgg 10740actacgacat agtctagtcc gccaagatga gccccaagga cctgaccccc
ttcctgacaa 10800ccctgtggct gctcctgggc catagcagag tgcctagagt gcgggccgag
gaatgctgcg 10860agttcatcaa cgtgaaccac ccccccgagc ggtgctacga cttcaagatg
tgcaaccggt 10920tcaccgtggc cctgagatgc cccgacggcg aagtgtgcta cagccccgag
aaaaccgccg 10980agatccgggg catcgtgacc accatgaccc acagcctgac ccggcaggtg
gtgcacaaca 11040agctgaccag ctgcaactac aaccccctgt acctggaagc cgacggccgg
atcagatgcg 11100gcaaagtgaa cgacaaggcc cagtacctgc tgggagccgc cggaagcgtg
ccctaccggt 11160ggatcaacct ggaatacgac aagatcaccc ggatcgtggg cctggaccag
tacctggaaa 11220gcgtgaagaa gcacaagcgg ctggacgtgt gcagagccaa gatgggctac
atgctgcagc 11280tgttgaattt tgaccttctt aagcttgcgg gagacgtcga gtccaacccc
gggcccatgc 11340tgcggctgct gctgagacac cacttccact gcctgctgct gtgtgccgtg
tgggccaccc 11400cttgtctggc cagcccttgg agcaccctga ccgccaacca gaaccctagc
cccccttggt 11460ccaagctgac ctacagcaag ccccacgacg ccgccacctt ctactgcccc
tttctgtacc 11520ccagccctcc cagaagcccc ctgcagttca gcggcttcca gagagtgtcc
accggccctg 11580agtgccggaa cgagacactg tacctgctgt acaaccggga gggccagaca
ctggtggagc 11640ggagcagcac ctgggtgaaa aaagtgatct ggtatctgag cggccggaac
cagaccatcc 11700tgcagcggat gcccagaacc gccagcaagc ccagcgacgg caacgtgcag
atcagcgtgg 11760aggacgccaa aatcttcggc gcccacatgg tgcccaagca gaccaagctg
ctgagattcg 11820tggtcaacga cggcaccaga tatcagatgt gcgtgatgaa gctggaaagc
tgggcccacg 11880tgttccggga ctactccgtg agcttccagg tccggctgac cttcaccgag
gccaacaacc 11940agacctacac cttctgcacc caccccaacc tgatcgtgct gctgaacttc
gacctgctga 12000agctggccgg cgacgtggag agcaaccccg gcccccatat gcggctgtgc
agagtgtggc 12060tgtccgtgtg cctgtgtgcc gtggtgctgg gccagtgcca gagagagaca
gccgagaaga 12120acgactacta ccgggtgccc cactactggg atgcctgcag cagagccctg
cccgaccaga 12180cccggtacaa atacgtggag cagctcgtgg acctgaccct gaactaccac
tacgacgcca 12240gccacggcct ggacaacttc gacgtgctga agcggatcaa cgtgaccgag
gtgtccctgc 12300tgatcagcga cttccggcgg cagaacagaa gaggcggcac caacaagcgg
accaccttca 12360acgccgctgg ctctctggcc cctcacgcca gatccctgga attcagcgtg
cggctgttcg 12420ccaactgata acgttgcatc ctgcaggata cagcagcaat tggcaagctg
cttacataga 12480actcgcggcg attggcatgc cgccttaaaa tttttatttt atttttcttt
tcttttccga 12540atcggatttt gtttttaata tttcaaaaaa aaaaaaaaaa aaaaaaaaaa
aaaaaaaaag 12600ggtcggcatg gcatctccac ctcctcgcgg tccgacctgg gcatccgaag
gaggacgcac 12660gtccactcgg atggctaagg gagagccacg tttaaacgct agagcaagac
gtttcccgtt 12720gaatatggct cataacaccc cttgtattac tgtttatgta agcagacagt
tttattgttc 12780atgatgatat atttttatct tgtgcaatgt aacatcagag attttgagac
acaacgtggc 12840tttgttgaat aaatcgaact tttgctgagt tgaaggatca gatcacgcat
cttcccgaca 12900acgcagaccg ttccgtggca aagcaaaagt tcaaaatcac caactggtcc
acctacaaca 12960aagctctcat caaccgtggc tccctcactt tctggctgga tgatggggcg
attcaggcct 13020ggtatgagtc agcaacacct tcttcacgag gcagacctca gcgctagcgg
agtgtatact 13080ggcttactat gttggcactg atgagggtgt cagtgaagtg cttcatgtgg
caggagaaaa 13140aaggctgcac cggtgcgtca gcagaatatg tgatacagga tatattccgc
ttcctcgctc 13200actgactcgc tacgctcggt cgttcgactg cggcgagcgg aaatggctta
cgaacggggc 13260ggagatttcc tggaagatgc caggaagata cttaacaggg aagtgagagg
gccgcggcaa 13320agccgttttt ccataggctc cgcccccctg acaagcatca cgaaatctga
cgctcaaatc 13380agtggtggcg aaacccgaca ggactataaa gataccaggc gtttcccctg
gcggctccct 13440cgtgcgctct cctgttcctg cctttcggtt taccggtgtc attccgctgt
tatggccgcg 13500tttgtctcat tccacgcctg acactcagtt ccgggtaggc agttcgctcc
aagctggact 13560gtatgcacga accccccgtt cagtccgacc gctgcgcctt atccggtaac
tatcgtcttg 13620agtccaaccc ggaaagacat gcaaaagcac cactggcagc agccactggt
aattgattta 13680gaggagttag tcttgaagtc atgcgccggt taaggctaaa ctgaaaggac
aagttttggt 13740gactgcgctc ctccaagcca gttacctcgg ttcaaagagt tggtagctca
gagaaccttc 13800gaaaaaccgc cctgcaaggc ggttttttcg ttttcagagc aagagattac
gcgcagacca 13860aaacgatctc aagaagatca tcttattaag gggtctgacg ctcagtggaa
cgaaaactca 13920cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat
ccttttaaat 13980taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc
tgacagttat 14040tagaaaaatt catccagcag acgataaaac gcaatacgct ggctatccgg
tgccgcaatg 14100ccatacagca ccagaaaacg atccgcccat tcgccgccca gttcttccgc
aatatcacgg 14160gtggccagcg caatatcctg ataacgatcc gccacgccca gacggccgca
atcaataaag 14220ccgctaaaac ggccattttc caccataatg ttcggcaggc acgcatcacc
atgggtcacc 14280accagatctt cgccatccgg catgctcgct ttcagacgcg caaacagctc
tgccggtgcc 14340aggccctgat gttcttcatc cagatcatcc tgatccacca ggcccgcttc
catacgggta 14400cgcgcacgtt caatacgatg tttcgcctga tgatcaaacg gacaggtcgc
cgggtccagg 14460gtatgcagac gacgcatggc atccgccata atgctcactt tttctgccgg
cgccagatgg 14520ctagacagca gatcctgacc cggcacttcg cccagcagca gccaatcacg
gcccgcttcg 14580gtcaccacat ccagcaccgc cgcacacgga acaccggtgg tggccagcca
gctcagacgc 14640gccgcttcat cctgcagctc gttcagcgca ccgctcagat cggttttcac
aaacagcacc 14700ggacgaccct gcgcgctcag acgaaacacc gccgcatcag agcagccaat
ggtctgctgc 14760gcccaatcat agccaaacag acgttccacc cacgctgccg ggctacccgc
atgcaggcca 14820tcctgttcaa tcatactctt cctttttcaa tattattgaa gcatttatca
gggttattgt 14880ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg
ggttccgcgc 14940acatttcccc gaaaagtgcc acctaaattg taagcgttaa tattttgtta
aaattcgcgt 15000taaatttttg ttaaatcagc tcatttttta accaataggc cgaaatcggc
aaaatccctt 15060ataaatcaaa agaatagacc gagatagggt tgagtggccg ctacagggcg
ctcccattcg 15120ccattcaggc tgcgcaactg ttgggaaggg cgtttcggtg cgggcctctt
cgctattacg 15180ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc
cagggttttc 15240ccagtcacac gcgtaatacg actcactata g 15271
57 16405 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide"
57
ataggcggcg
catgagagaa gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga
ggaagacagc ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc
caagcaggtc actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa
actgatcgaa acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc
ccgcagaatg tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga
tccggacaga ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga
taaggaattg gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga
aactgagact atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt
ttaccaggat gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg
agttagagtc gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc
tggagcatat ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa
cataggccta tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag
aaagaagtat ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca
cgagaagagg gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa
gcaaaattac acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag
aatagctatc agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg
cgagggattc ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc
cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga
tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa
cggtcgcacc cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca
ggcatttgct aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg
actacgagat agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac
atctatttat aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc
attcgtgctg cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag
gaaaatgtta gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga
agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc
tctaccacct ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat
gttacaagag gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag
ctacgatggc gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa
gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc
tggccgaaaa gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg
acatgcaata cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa
cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa
cactgatgaa gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta
cgacatcgac aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg
cgagctggtg gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc
cgctccttac caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg
catcattaaa agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg
tgcagaaatt ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt
ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc
ttttgcttgt catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc
agtgctctgc ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca
ttttaaccac gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa
atctgtgact tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc
gaaagagact aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct
cattctcact tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga
aataatgacg gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta
caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac
ccgcacggag gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact
gactgccaag taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga
tgccatcatg aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc
aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac
cactgaacaa tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat
agtattgaac caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc
tgcacccact gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa
catgtacggg ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc
tcgggcagtt gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga
tccgcgcata aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca
taatgaacac ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt
cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg
gcctgaggct accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata
tgacataata tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga
agaccatgcc attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg
cggaacctgt gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg
tgctatagcg cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga
gacggaagtt ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta
caagctttca tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg
tgcaccctca tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat
aaatgctgct aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa
attcccggaa agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg
tgcagctaaa catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga
aggtgacaaa cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa
ttacaagtca gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg
actaacccaa tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc
catatactgc agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga
agcagtggag gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct
ggtgagggtg catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg
caaaactttc tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga
aattaatgcc atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct
cggagaaagc atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac
accacctagc acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg
cctaaaagcc tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta
tagaatcact ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt
gcctgcgtat attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc
ggagccatcg gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac
cgaggatgag accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga
tagcataagt ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat
tcacgggccg ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga
tgtggacagt ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac
gtcagccgag actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc
tgcgcctcga acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc
acttgcaccc agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa
tagggtgatc actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc
ggtctcgaga accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga
ggagtttgag gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt
ttcctccgac accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc
cgaagtggtg ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga
aaaagaagaa ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag
ataccagtcc aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg
cctagggcat tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt
tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc
ctgtaacgcc atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga
gtacgatgcc tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt
ttgccctgca aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg
atcggcagtg ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa
aagaaattgc aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa
tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa
ccccatcagg cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa
agctgctgct ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga
caggtttgta atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga
agaacggccc aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg
cggaatccac cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac
actgtttgat atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg
ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc
tctgaccgcg ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat
tgaggcggct ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt
cggagccatg atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat
tgtaatcgca agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat
tggagatgac aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc
cacctggttg aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta
tttctgtgga gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga
ccccctaaaa aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga
tgacaggaga agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc
agagctgtgc aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat
ggccatgact actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat
aactctctac ggctaacctg aatggactac gacatagtct agtccgccaa 7560gatgaggcct
ggcctgccct cctacctgat catcctggcc gtgtgcctgt tcagccacct 7620gctgtccagc
agatacggcg ccgaggccgt gagcgagccc ctggacaagg ctttccacct 7680gctgctgaac
acctacggca gacccatccg gtttctgcgg gagaacacca cccagtgcac 7740ctacaacagc
agcctgcgga acagcaccgt cgtgagagag aacgccatca gcttcaactt 7800tttccagagc
tacaaccagt actacgtgtt ccacatgccc agatgcctgt ttgccggccc 7860tctggccgag
cagttcctga accaggtgga cctgaccgag acactggaaa gataccagca 7920gcggctgaat
acctacgccc tggtgtccaa ggacctggcc agctaccggt cctttagcca 7980gcagctcaag
gctcaggata gcctcggcga gcagcctacc accgtgcccc ctcccatcga 8040cctgagcatc
ccccacgtgt ggatgcctcc ccagaccacc cctcacggct ggaccgagag 8100ccacaccacc
tccggcctgc acagacccca cttcaaccag acctgcatcc tgttcgacgg 8160ccacgacctg
ctgtttagca ccgtgacccc ctgcctgcac cagggcttct acctgatcga 8220cgagctgaga
tacgtgaaga tcaccctgac cgaggatttc ttcgtggtca ccgtgtccat 8280cgacgacgac
acccccatgc tgctgatctt cggccacctg cccagagtgc tgttcaaggc 8340cccctaccag
cgggacaact tcatcctgcg gcagaccgag aagcacgagc tgctggtgct 8400ggtcaagaag
gaccagctga accggcactc ctacctgaag gaccccgact tcctggacgc 8460cgccctggac
ttcaactacc tggacctgag cgccctgctg agaaacagct tccacagata 8520cgccgtggac
gtgctgaagt ccggacggtg ccagatgctc gatcggcgga ccgtggagat 8580ggccttcgcc
tatgccctcg ccctgttcgc cgctgccaga caggaagagg ctggcgccca 8640ggtgtcagtg
cccagagccc tggatagaca ggccgccctg ctgcagatcc aggaattcat 8700gatcacctgc
ctgagccaga ccccccctag aaccaccctg ctgctgtacc ccacagccgt 8760ggatctggcc
aagagggccc tgtggacccc caaccagatc accgacatca caagcctcgt 8820gcggctcgtg
tacatcctga gcaagcagaa ccagcagcac ctgatccccc agtgggccct 8880gagacagatc
gccgacttcg ccctgaagct gcacaagacc catctggcca gctttctgag 8940cgccttcgcc
aggcaggaac tgtacctgat gggcagcctg gtccacagca tgctggtgca 9000taccaccgag
cggcgggaga tcttcatcgt ggagacaggc ctgtgtagcc tggccgagct 9060gtcccacttt
acccagctgc tggcccaccc tcaccacgag tacctgagcg acctgtacac 9120cccctgcagc
agcagcggca gacgggacca cagcctggaa cggctgacca gactgttccc 9180cgatgccacc
gtgcctgcta cagtgcctgc cgccctgtcc atcctgtcca ccatgcagcc 9240cagcaccctg
gaaaccttcc ccgacctgtt ctgcctgccc ctgggcgaga gctttagcgc 9300cctgaccgtg
tccgagcacg tgtcctacat cgtgaccaat cagtacctga tcaagggcat 9360cagctacccc
gtgtccacca cagtcgtggg ccagagcctg atcatcaccc agaccgacag 9420ccagaccaag
tgcgagctga cccggaacat gcacaccaca cacagcatca ccgtggccct 9480gaacatcagc
ctggaaaact gcgctttctg tcagtctgcc ctgctggaat acgacgatac 9540ccagggcgtg
atcaacatca tgtacatgca cgacagcgac gacgtgctgt tcgccctgga 9600cccctacaac
gaggtggtgg tgtccagccc ccggacccac tacctgatgc tgctgaagaa 9660cggcaccgtg
ctggaagtga ccgacgtggt ggtggacgcc accgacagca gactgctgat 9720gatgagcgtg
tacgccctga gcgccatcat cggcatctac ctgctgtacc ggatgctgaa 9780aacctgctga
taatctagag gcccctataa ctctctacgg ctaacctgaa tggactacga 9840catagtctag
tccgccaaga tgtgcagaag gcccgactgc ggcttcagct tcagccctgg 9900acccgtgatc
ctgctgtggt gctgcctgct gctgcctatc gtgtcctctg ccgccgtgtc 9960tgtggcccct
acagccgccg agaaggtgcc agccgagtgc cccgagctga ccagaagatg 10020cctgctgggc
gaggtgttcg agggcgacaa gtacgagagc tggctgcggc ccctggtcaa 10080cgtgaccggc
agagatggcc ccctgagcca gctgatccgg tacagacccg tgacccccga 10140ggccgccaat
agcgtgctgc tggacgaggc cttcctggat accctggccc tgctgtacaa 10200caaccccgac
cagctgagag ccctgctgac cctgctgtcc agcgacaccg cccccagatg 10260gatgaccgtg
atgcggggct acagcgagtg tggagatggc agccctgccg tgtacacctg 10320cgtggacgac
ctgtgcagag gctacgacct gaccagactg agctacggcc ggtccatctt 10380cacagagcac
gtgctgggct tcgagctggt gccccccagc ctgttcaacg tggtggtggc 10440catccggaac
gaggccacca gaaccaacag agccgtgcgg ctgcctgtgt ctacagccgc 10500tgcacctgag
ggcatcacac tgttctacgg cctgtacaac gccgtgaaag agttctgcct 10560ccggcaccag
ctggatcccc ccctgctgag acacctggac aagtactacg ccggcctgcc 10620cccagagctg
aagcagacca gagtgaacct gcccgcccac agcagatatg gccctcaggc 10680cgtggacgcc
agatgataac gccggcggcc cctataactc tctacggcta acctgaatgg 10740actacgacat
agtctagtcc gccaagatga gccccaagga cctgaccccc ttcctgacaa 10800ccctgtggct
gctcctgggc catagcagag tgcctagagt gcgggccgag gaatgctgcg 10860agttcatcaa
cgtgaaccac ccccccgagc ggtgctacga cttcaagatg tgcaaccggt 10920tcaccgtggc
cctgagatgc cccgacggcg aagtgtgcta cagccccgag aaaaccgccg 10980agatccgggg
catcgtgacc accatgaccc acagcctgac ccggcaggtg gtgcacaaca 11040agctgaccag
ctgcaactac aaccccctgt acctggaagc cgacggccgg atcagatgcg 11100gcaaagtgaa
cgacaaggcc cagtacctgc tgggagccgc cggaagcgtg ccctaccggt 11160ggatcaacct
ggaatacgac aagatcaccc ggatcgtggg cctggaccag tacctggaaa 11220gcgtgaagaa
gcacaagcgg ctggacgtgt gcagagccaa gatgggctac atgctgcagt 11280gataaggcgc
gccaacgtta ctggccgaag ccgcttggaa taaggccggt gtgcgtttgt 11340ctatatgtta
ttttccacca tattgccgtc ttttggcaat gtgagggccc ggaaacctgg 11400ccctgtcttc
ttgacgagca ttcctagggg tctttcccct ctcgccaaag gaatgcaagg 11460tctgttgaat
gtcgtgaagg aagcagttcc tctggaagct tcttgaagac aaacaacgtc 11520tgtagcgacc
ctttgcaggc agcggaaccc cccacctggc gacaggtgcc tctgcggcca 11580aaagccacgt
gtataagata cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag 11640ttggatagtt
gtggaaagag tcaaatggct ctcctcaagc gtattcaaca aggggctgaa 11700ggatgcccag
aaggtacccc attgtatggg atctgatctg gggcctcggt gcacatgctt 11760tacatgtgtt
tagtcgaggt taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt 11820tttcctttga
aaaacacgat aatatgctgc ggctgctgct gagacaccac ttccactgcc 11880tgctgctgtg
tgccgtgtgg gccacccctt gtctggccag cccttggagc accctgaccg 11940ccaaccagaa
ccctagcccc ccttggtcca agctgaccta cagcaagccc cacgacgccg 12000ccaccttcta
ctgccccttt ctgtacccca gccctcccag aagccccctg cagttcagcg 12060gcttccagag
agtgtccacc ggccctgagt gccggaacga gacactgtac ctgctgtaca 12120accgggaggg
ccagacactg gtggagcgga gcagcacctg ggtgaaaaaa gtgatctggt 12180atctgagcgg
ccggaaccag accatcctgc agcggatgcc cagaaccgcc agcaagccca 12240gcgacggcaa
cgtgcagatc agcgtggagg acgccaaaat cttcggagcc cacatggtgc 12300ccaagcagac
caagctgctg agattcgtgg tcaacgacgg caccagatat cagatgtgcg 12360tgatgaagct
ggaaagctgg gcccacgtgt tccgggacta ctccgtgagc ttccaggtcc 12420ggctgacctt
caccgaggcc aacaaccaga cctacacctt ctgcacccac cccaacctga 12480tcgtgtgata
agtacctttg tacgcctgtt ttataccccc tccctgattt gcaacttaga 12540agcaacgcaa
accagatcaa tagtaggtgt gacataccag tcgcatcttg atcaagcact 12600tctgtatccc
cggaccgagt atcaatagac tgtgcacacg gttgaaggag aaaacgtccg 12660ttacccggct
aactacttcg agaagcctag taacgccatt gaagttgcag agtgtttcgc 12720tcagcactcc
ccccgtgtag atcaggtcga tgagtcaccg cattccccac gggcgaccgt 12780ggcggtggct
gcgttggcgg cctgcctatg gggtaaccca taggacgctc taatacggac 12840atggcgtgaa
gagtctattg agctagttag tagtcctccg gcccctgaat gcggctaatc 12900ctaactgcgg
agcacatacc cttaatccaa agggcagtgt gtcgtaacgg gcaactctgc 12960agcggaaccg
actactttgg gtgtccgtgt ttctttttat tcttgtattg gctgcttatg 13020gtgacaatta
aagaattgtt accatatagc tattggattg gccatccagt gtcaaacaga 13080gctattgtat
atctctttgt tggattcaca cctctcactc ttgaaacgtt acacaccctc 13140aattacatta
tactgctgaa cacgaagcgc atatgcggct gtgcagagtg tggctgtccg 13200tgtgcctgtg
tgccgtggtg ctgggccagt gccagagaga gacagccgag aagaacgact 13260actaccgggt
gccccactac tgggatgcct gcagcagagc cctgcccgac cagacccggt 13320acaaatacgt
ggagcagctc gtggacctga ccctgaacta ccactacgac gccagccacg 13380gcctggacaa
cttcgacgtg ctgaagcgga tcaacgtgac cgaggtgtcc ctgctgatca 13440gcgacttccg
gcggcagaac agaagaggcg gcaccaacaa gcggaccacc ttcaacgccg 13500ctggctctct
ggcccctcac gccagatccc tggaattcag cgtgcggctg ttcgccaact 13560gataacgttg
catcctgcag gatacagcag caattggcaa gctgcttaca tagaactcgc 13620ggcgattggc
atgccgcctt aaaattttta ttttattttt cttttctttt ccgaatcgga 13680ttttgttttt
aatatttcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaagggtcgg 13740catggcatct
ccacctcctc gcggtccgac ctgggcatcc gaaggaggac gcacgtccac 13800tcggatggct
aagggagagc cacgtttaaa cgctagagca agacgtttcc cgttgaatat 13860ggctcataac
accccttgta ttactgttta tgtaagcaga cagttttatt gttcatgatg 13920atatattttt
atcttgtgca atgtaacatc agagattttg agacacaacg tggctttgtt 13980gaataaatcg
aacttttgct gagttgaagg atcagatcac gcatcttccc gacaacgcag 14040accgttccgt
ggcaaagcaa aagttcaaaa tcaccaactg gtccacctac aacaaagctc 14100tcatcaaccg
tggctccctc actttctggc tggatgatgg ggcgattcag gcctggtatg 14160agtcagcaac
accttcttca cgaggcagac ctcagcgcta gcggagtgta tactggctta 14220ctatgttggc
actgatgagg gtgtcagtga agtgcttcat gtggcaggag aaaaaaggct 14280gcaccggtgc
gtcagcagaa tatgtgatac aggatatatt ccgcttcctc gctcactgac 14340tcgctacgct
cggtcgttcg actgcggcga gcggaaatgg cttacgaacg gggcggagat 14400ttcctggaag
atgccaggaa gatacttaac agggaagtga gagggccgcg gcaaagccgt 14460ttttccatag
gctccgcccc cctgacaagc atcacgaaat ctgacgctca aatcagtggt 14520ggcgaaaccc
gacaggacta taaagatacc aggcgtttcc cctggcggct ccctcgtgcg 14580ctctcctgtt
cctgcctttc ggtttaccgg tgtcattccg ctgttatggc cgcgtttgtc 14640tcattccacg
cctgacactc agttccgggt aggcagttcg ctccaagctg gactgtatgc 14700acgaaccccc
cgttcagtcc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 14760acccggaaag
acatgcaaaa gcaccactgg cagcagccac tggtaattga tttagaggag 14820ttagtcttga
agtcatgcgc cggttaaggc taaactgaaa ggacaagttt tggtgactgc 14880gctcctccaa
gccagttacc tcggttcaaa gagttggtag ctcagagaac cttcgaaaaa 14940ccgccctgca
aggcggtttt ttcgttttca gagcaagaga ttacgcgcag accaaaacga 15000tctcaagaag
atcatcttat taaggggtct gacgctcagt ggaacgaaaa ctcacgttaa 15060gggattttgg
tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa 15120tgaagtttta
aatcaatcta aagtatatat gagtaaactt ggtctgacag ttattagaaa 15180aattcatcca
gcagacgata aaacgcaata cgctggctat ccggtgccgc aatgccatac 15240agcaccagaa
aacgatccgc ccattcgccg cccagttctt ccgcaatatc acgggtggcc 15300agcgcaatat
cctgataacg atccgccacg cccagacggc cgcaatcaat aaagccgcta 15360aaacggccat
tttccaccat aatgttcggc aggcacgcat caccatgggt caccaccaga 15420tcttcgccat
ccggcatgct cgctttcaga cgcgcaaaca gctctgccgg tgccaggccc 15480tgatgttctt
catccagatc atcctgatcc accaggcccg cttccatacg ggtacgcgca 15540cgttcaatac
gatgtttcgc ctgatgatca aacggacagg tcgccgggtc cagggtatgc 15600agacgacgca
tggcatccgc cataatgctc actttttctg ccggcgccag atggctagac 15660agcagatcct
gacccggcac ttcgcccagc agcagccaat cacggcccgc ttcggtcacc 15720acatccagca
ccgccgcaca cggaacaccg gtggtggcca gccagctcag acgcgccgct 15780tcatcctgca
gctcgttcag cgcaccgctc agatcggttt tcacaaacag caccggacga 15840ccctgcgcgc
tcagacgaaa caccgccgca tcagagcagc caatggtctg ctgcgcccaa 15900tcatagccaa
acagacgttc cacccacgct gccgggctac ccgcatgcag gccatcctgt 15960tcaatcatac
tcttcctttt tcaatattat tgaagcattt atcagggtta ttgtctcatg 16020agcggataca
tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt 16080ccccgaaaag
tgccacctaa attgtaagcg ttaatatttt gttaaaattc gcgttaaatt 16140tttgttaaat
cagctcattt tttaaccaat aggccgaaat cggcaaaatc ccttataaat 16200caaaagaata
gaccgagata gggttgagtg gccgctacag ggcgctccca ttcgccattc 16260aggctgcgca
actgttggga agggcgtttc ggtgcgggcc tcttcgctat tacgccagct 16320ggcgaaaggg
ggatgtgctg caaggcgatt aagttgggta acgccagggt tttcccagtc 16380acacgcgtaa
tacgactcac tatag 16405
SEQ ID NO: 58
Length: 13102
Type: DNA
Organism: Artificial Sequence
source/note="Description of Artificial Sequence Synthetic polynucleotide"
ataggcggcg catgagagaa
gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga ggaagacagc
ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc caagcaggtc
actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa actgatcgaa
acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc ccgcagaatg
tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga tccggacaga
ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga taaggaattg
gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga aactgagact
atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt ttaccaggat
gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg agttagagtc
gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc tggagcatat
ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa cataggccta
tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag aaagaagtat
ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca cgagaagagg
gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa gcaaaattac
acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag aatagctatc
agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg cgagggattc
ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc cgtgtgcacg
tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga tgtcagtgcg
gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa cggtcgcacc
cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca ggcatttgct
aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg actacgagat
agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac atctatttat
aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc attcgtgctg
cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag gaaaatgtta
gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga agctaagtgc
gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc tctaccacct
ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tcgacttgat gttacaagag
gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag ctacgatggc
gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa gagtgaaaaa
ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc tggccgaaaa
gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg acatgcaata
cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa cgaacgtgag
ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa cactgatgaa
gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta cgacatcgac
aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg cgagctggtg
gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc cgctccttac
caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg catcattaaa
agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg tgcagaaatt
ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt ggactcagtg
ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc ttttgcttgt
catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc agtgctctgc
ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca ttttaaccac
gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa atctgtgact
tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc gaaagagact
aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct cattctcact
tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga aataatgacg
gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta caaggtgaat
gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac ccgcacggag
gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact gactgccaag
taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga tgccatcatg
aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc aaacgtgtgt
tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac cactgaacaa
tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat agtattgaac
caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc tgcacccact
gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa catgtacggg
ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc tcgggcagtt
gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga tccgcgcata
aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca taatgaacac
ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt cctggtggtc
ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg gcctgaggct
accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata tgacataata
tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga agaccatgcc
attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg cggaacctgt
gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg tgctatagcg
cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga gacggaagtt
ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta caagctttca
tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg tgcaccctca
tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat aaatgctgct
aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa attcccggaa
agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg tgcagctaaa
catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga aggtgacaaa
cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa ttacaagtca
gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg actaacccaa
tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc catatactgc
agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga agcagtggag
gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct ggtgagggtg
catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg caaaactttc
tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga aattaatgcc
atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct cggagaaagc
atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac accacctagc
acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg cctaaaagcc
tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta tagaatcact
ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt gcctgcgtat
attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc ggagccatcg
gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac cgaggatgag
accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga tagcataagt
ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat tcacgggccg
ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga tgtggacagt
ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac gtcagccgag
actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc tgcgcctcga
acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc acttgcaccc
agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa tagggtgatc
actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc ggtctcgaga
accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga ggagtttgag
gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt ttcctccgac
accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc cgaagtggtg
ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga aaaagaagaa
ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag ataccagtcc
aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg cctagggcat
tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt tcctttgtat
tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc ctgtaacgcc
atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga gtacgatgcc
tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt ttgccctgca
aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg atcggcagtg
ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa aagaaattgc
aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa tgtggaatgc
ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa ccccatcagg
cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa agctgctgct
ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga caggtttgta
atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga agaacggccc
aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg cggaatccac
cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac actgtttgat
atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg ggattgtgtt
ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc tctgaccgcg
ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat tgaggcggct
ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt cggagccatg
atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat tgtaatcgca
agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat tggagatgac
aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc cacctggttg
aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta tttctgtgga
gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga ccccctaaaa
aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga tgacaggaga
agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc agagctgtgc
aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat ggccatgact
actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat aactctctac
ggctaacctg aatggactac gacatagtct agtccgccaa 7560gatgaggcct ggcctgccct
cctacctgat catcctggcc gtgtgcctgt tcagccacct 7620gctgtccagc agatacggcg
ccgaggccgt gagcgagccc ctggacaagg ctttccacct 7680gctgctgaac acctacggca
gacccatccg gtttctgcgg gagaacacca cccagtgcac 7740ctacaacagc agcctgcgga
acagcaccgt cgtgagagag aacgccatca gcttcaactt 7800tttccagagc tacaaccagt
actacgtgtt ccacatgccc agatgcctgt ttgccggccc 7860tctggccgag cagttcctga
accaggtgga cctgaccgag acactggaaa gataccagca 7920gcggctgaat acctacgccc
tggtgtccaa ggacctggcc agctaccggt cctttagcca 7980gcagctcaag gctcaggata
gcctcggcga gcagcctacc accgtgcccc ctcccatcga 8040cctgagcatc ccccacgtgt
ggatgcctcc ccagaccacc cctcacggct ggaccgagag 8100ccacaccacc tccggcctgc
acagacccca cttcaaccag acctgcatcc tgttcgacgg 8160ccacgacctg ctgtttagca
ccgtgacccc ctgcctgcac cagggcttct acctgatcga 8220cgagctgaga tacgtgaaga
tcaccctgac cgaggatttc ttcgtggtca ccgtgtccat 8280cgacgacgac acccccatgc
tgctgatctt cggccacctg cccagagtgc tgttcaaggc 8340cccctaccag cgggacaact
tcatcctgcg gcagaccgag aagcacgagc tgctggtgct 8400ggtcaagaag gaccagctga
accggcactc ctacctgaag gaccccgact tcctggacgc 8460cgccctggac ttcaactacc
tggacctgag cgccctgctg agaaacagct tccacagata 8520cgccgtggac gtgctgaagt
ccggacggtg ccagatgctc gatcggcgga ccgtggagat 8580ggccttcgcc tatgccctcg
ccctgttcgc cgctgccaga caggaagagg ctggcgccca 8640ggtgtcagtg cccagagccc
tggatagaca ggccgccctg ctgcagatcc aggaattcat 8700gatcacctgc ctgagccaga
ccccccctag aaccaccctg ctgctgtacc ccacagccgt 8760ggatctggcc aagagggccc
tgtggacccc caaccagatc accgacatca caagcctcgt 8820gcggctcgtg tacatcctga
gcaagcagaa ccagcagcac ctgatccccc agtgggccct 8880gagacagatc gccgacttcg
ccctgaagct gcacaagacc catctggcca gctttctgag 8940cgccttcgcc aggcaggaac
tgtacctgat gggcagcctg gtccacagca tgctggtgca 9000taccaccgag cggcgggaga
tcttcatcgt ggagacaggc ctgtgtagcc tggccgagct 9060gtcccacttt acccagctgc
tggcccaccc tcaccacgag tacctgagcg acctgtacac 9120cccctgcagc agcagcggca
gacgggacca cagcctggaa cggctgacca gactgttccc 9180cgatgccacc gtgcctgcta
cagtgcctgc cgccctgtcc atcctgtcca ccatgcagcc 9240cagcaccctg gaaaccttcc
ccgacctgtt ctgcctgccc ctgggcgaga gctttagcgc 9300cctgaccgtg tccgagcacg
tgtcctacat cgtgaccaat cagtacctga tcaagggcat 9360cagctacccc gtgtccacca
cagtcgtggg ccagagcctg atcatcaccc agaccgacag 9420ccagaccaag tgcgagctga
cccggaacat gcacaccaca cacagcatca ccgtggccct 9480gaacatcagc ctggaaaact
gcgctttctg tcagtctgcc ctgctggaat acgacgatac 9540ccagggcgtg atcaacatca
tgtacatgca cgacagcgac gacgtgctgt tcgccctgga 9600cccctacaac gaggtggtgg
tgtccagccc ccggacccac tacctgatgc tgctgaagaa 9660cggcaccgtg ctggaagtga
ccgacgtggt ggtggacgcc accgactgat aatctagagg 9720cccctataac tctctacggc
taacctgaat ggactacgac atagtctagt ccgccaagat 9780gtgcagaagg cccgactgcg
gcttcagctt cagccctgga cccgtgatcc tgctgtggtg 9840ctgcctgctg ctgcctatcg
tgtcctctgc cgccgtgtct gtggccccta cagccgccga 9900gaaggtgcca gccgagtgcc
ccgagctgac cagaagatgc ctgctgggcg aggtgttcga 9960gggcgacaag tacgagagct
ggctgcggcc cctggtcaac gtgaccggca gagatggccc 10020cctgagccag ctgatccggt
acagacccgt gacccccgag gccgccaata gcgtgctgct 10080ggacgaggcc ttcctggata
ccctggccct gctgtacaac aaccccgacc agctgagagc 10140cctgctgacc ctgctgtcca
gcgacaccgc ccccagatgg atgaccgtga tgcggggcta 10200cagcgagtgt ggagatggca
gccctgccgt gtacacctgc gtggacgacc tgtgcagagg 10260ctacgacctg accagactga
gctacggccg gtccatcttc acagagcacg tgctgggctt 10320cgagctggtg ccccccagcc
tgttcaacgt ggtggtggcc atccggaacg aggccaccag 10380aaccaacaga gccgtgcggc
tgcctgtgtc tacagccgct gcacctgagg gcatcacact 10440gttctacggc ctgtacaacg
ccgtgaaaga gttctgcctc cggcaccagc tggatccccc 10500cctgctgaga cacctggaca
agtactacgc cggcctgccc ccagagctga agcagaccag 10560agtgaacctg cccgcccaca
gcagatatgg ccctcaggcc gtggacgcca gatgataagc 10620ggccgcatac agcagcaatt
ggcaagctgc ttacatagaa ctcgcggcga ttggcatgcc 10680gccttaaaat ttttatttta
tttttctttt cttttccgaa tcggattttg tttttaatat 10740ttcaaaaaaa aaaaaaaaaa
aaaaaaaaaa aaaaaaaagg gtcggcatgg catctccacc 10800tcctcgcggt ccgacctggg
catccgaagg aggacgcacg tccactcgga tggctaaggg 10860agagccacgt ttaaacacgt
gatatctggc ctcatgggcc ttcctttcac tgcccgcttt 10920ccagtcggga aacctgtcgt
gccagctgca ttaacatggt catagctgtt tccttgcgta 10980ttgggcgctc tccgcttcct
cgctcactga ctcgctgcgc tcggtcgttc gggtaaagcc 11040tggggtgcct aatgagcaaa
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg 11100ctggcgtttt tccataggct
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt 11160cagaggtggc gaaacccgac
aggactataa agataccagg cgtttccccc tggaagctcc 11220ctcgtgcgct ctcctgttcc
gaccctgccg cttaccggat acctgtccgc ctttctccct 11280tcgggaagcg tggcgctttc
tcatagctca cgctgtaggt atctcagttc ggtgtaggtc 11340gttcgctcca agctgggctg
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta 11400tccggtaact atcgtcttga
gtccaacccg gtaagacacg acttatcgcc actggcagca 11460gccactggta acaggattag
cagagcgagg tatgtaggcg gtgctacaga gttcttgaag 11520tggtggccta actacggcta
cactagaaga acagtatttg gtatctgcgc tctgctgaag 11580ccagttacct tcggaaaaag
agttggtagc tcttgatccg gcaaacaaac caccgctggt 11640agcggtggtt tttttgtttg
caagcagcag attacgcgca gaaaaaaagg atctcaagaa 11700gatcctttga tcttttctac
ggggtctgac gctcagtgga acgaaaactc acgttaaggg 11760attttggtca tgagattatc
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga 11820agttttaaat caatctaaag
tatatatgag taaacttggt ctgacagtta ttagaaaaat 11880tcatccagca gacgataaaa
cgcaatacgc tggctatccg gtgccgcaat gccatacagc 11940accagaaaac gatccgccca
ttcgccgccc agttcttccg caatatcacg ggtggccagc 12000gcaatatcct gataacgatc
cgccacgccc agacggccgc aatcaataaa gccgctaaaa 12060cggccatttt ccaccataat
gttcggcagg cacgcatcac catgggtcac caccagatct 12120tcgccatccg gcatgctcgc
tttcagacgc gcaaacagct ctgccggtgc caggccctga 12180tgttcttcat ccagatcatc
ctgatccacc aggcccgctt ccatacgggt acgcgcacgt 12240tcaatacgat gtttcgcctg
atgatcaaac ggacaggtcg ccgggtccag ggtatgcaga 12300cgacgcatgg catccgccat
aatgctcact ttttctgccg gcgccagatg gctagacagc 12360agatcctgac ccggcacttc
gcccagcagc agccaatcac ggcccgcttc ggtcaccaca 12420tccagcaccg ccgcacacgg
aacaccggtg gtggccagcc agctcagacg cgccgcttca 12480tcctgcagct cgttcagcgc
accgctcaga tcggttttca caaacagcac cggacgaccc 12540tgcgcgctca gacgaaacac
cgccgcatca gagcagccaa tggtctgctg cgcccaatca 12600tagccaaaca gacgttccac
ccacgctgcc gggctacccg catgcaggcc atcctgttca 12660atcatactct tcctttttca
atattattga agcatttatc agggttattg tctcatgagc 12720ggatacatat ttgaatgtat
ttagaaaaat aaacaaatag gggttccgcg cacatttccc 12780cgaaaagtgc cacctaaatt
gtaagcgtta atattttgtt aaaattcgcg ttaaattttt 12840gttaaatcag ctcatttttt
aaccaatagg ccgaaatcgg caaaatccct tataaatcaa 12900aagaatagac cgagataggg
ttgagtggcc gctacagggc gctcccattc gccattcagg 12960ctgcgcaact gttgggaagg
gcgtttcggt gcgggcctct tcgctattac gccagctggc 13020gaaaggggga tgtgctgcaa
ggcgattaag ttgggtaacg ccagggtttt cccagtcaca 13080cgcgtaatac gactcactat
ag 13102
SEQ ID NO: 59
Length: 13087
Type: DNA
Organism: Artificial Sequence
source/note="Description of Artificial Sequence Synthetic polynucleotide"
ataggcggcg catgagagaa gcccagacca attacctacc caaaatggag
aaagttcacg 60ttgacatcga ggaagacagc ccattcctca gagctttgca gcggagcttc
ccgcagtttg 120aggtagaagc caagcaggtc actgataatg accatgctaa tgccagagcg
ttttcgcatc 180tggcttcaaa actgatcgaa acggaggtgg acccatccga cacgatcctt
gacattggaa 240gtgcgcccgc ccgcagaatg tattctaagc acaagtatca ttgtatctgt
ccgatgagat 300gtgcggaaga tccggacaga ttgtataagt atgcaactaa gctgaagaaa
aactgtaagg 360aaataactga taaggaattg gacaagaaaa tgaaggagct cgccgccgtc
atgagcgacc 420ctgacctgga aactgagact atgtgcctcc acgacgacga gtcgtgtcgc
tacgaagggc 480aagtcgctgt ttaccaggat gtatacgcgg ttgacggacc gacaagtctc
tatcaccaag 540ccaataaggg agttagagtc gcctactgga taggctttga caccacccct
tttatgttta 600agaacttggc tggagcatat ccatcatact ctaccaactg ggccgacgaa
accgtgttaa 660cggctcgtaa cataggccta tgcagctctg acgttatgga gcggtcacgt
agagggatgt 720ccattcttag aaagaagtat ttgaaaccat ccaacaatgt tctattctct
gttggctcga 780ccatctacca cgagaagagg gacttactga ggagctggca cctgccgtct
gtatttcact 840tacgtggcaa gcaaaattac acatgtcggt gtgagactat agttagttgc
gacgggtacg 900tcgttaaaag aatagctatc agtccaggcc tgtatgggaa gccttcaggc
tatgctgcta 960cgatgcaccg cgagggattc ttgtgctgca aagtgacaga cacattgaac
ggggagaggg 1020tctcttttcc cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg
actggcatac 1080tggcaacaga tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc
aaccagcgta 1140tagtcgtcaa cggtcgcacc cagagaaaca ccaataccat gaaaaattac
cttttgcccg 1200tagtggccca ggcatttgct aggtgggcaa aggaatataa ggaagatcaa
gaagatgaaa 1260ggccactagg actacgagat agacagttag tcatggggtg ttgttgggct
tttagaaggc 1320acaagataac atctatttat aagcgcccgg atacccaaac catcatcaaa
gtgaacagcg 1380atttccactc attcgtgctg cccaggatag gcagtaacac attggagatc
gggctgagaa 1440caagaatcag gaaaatgtta gaggagcaca aggagccgtc acctctcatt
accgccgagg 1500acgtacaaga agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa
gccgaggagt 1560tgcgcgcagc tctaccacct ttggcagctg atgttgagga gcccactctg
gaagccgatg 1620tcgacttgat gttacaagag gctggggccg gctcagtgga gacacctcgt
ggcttgataa 1680aggttaccag ctacgatggc gaggacaaga tcggctctta cgctgtgctt
tctccgcagg 1740ctgtactcaa gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa
gtcatagtga 1800taacacactc tggccgaaaa gggcgttatg ccgtggaacc ataccatggt
aaagtagtgg 1860tgccagaggg acatgcaata cccgtccagg actttcaagc tctgagtgaa
agtgccacca 1920ttgtgtacaa cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc
acacatggag 1980gagcgctgaa cactgatgaa gaatattaca aaactgtcaa gcccagcgag
cacgacggcg 2040aatacctgta cgacatcgac aggaaacagt gcgtcaagaa agaactagtc
actgggctag 2100ggctcacagg cgagctggtg gatcctccct tccatgaatt cgcctacgag
agtctgagaa 2160cacgaccagc cgctccttac caagtaccaa ccataggggt gtatggcgtg
ccaggatcag 2220gcaagtctgg catcattaaa agcgcagtca ccaaaaaaga tctagtggtg
agcgccaaga 2280aagaaaactg tgcagaaatt ataagggacg tcaagaaaat gaaagggctg
gacgtcaatg 2340ccagaactgt ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag
accctgtata 2400ttgacgaagc ttttgcttgt catgcaggta ctctcagagc gctcatagcc
attataagac 2460ctaaaaaggc agtgctctgc ggggatccca aacagtgcgg tttttttaac
atgatgtgcc 2520tgaaagtgca ttttaaccac gagatttgca cacaagtctt ccacaaaagc
atctctcgcc 2580gttgcactaa atctgtgact tcggtcgtct caaccttgtt ttacgacaaa
aaaatgagaa 2640cgacgaatcc gaaagagact aagattgtga ttgacactac cggcagtacc
aaacctaagc 2700aggacgatct cattctcact tgtttcagag ggtgggtgaa gcagttgcaa
atagattaca 2760aaggcaacga aataatgacg gcagctgcct ctcaagggct gacccgtaaa
ggtgtgtatg 2820ccgttcggta caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa
catgtgaacg 2880tcctactgac ccgcacggag gaccgcatcg tgtggaaaac actagccggc
gacccatgga 2940taaaaacact gactgccaag taccctggga atttcactgc cacgatagag
gagtggcaag 3000cagagcatga tgccatcatg aggcacatct tggagagacc ggaccctacc
gacgtcttcc 3060agaataaggc aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag
accgctggca 3120tagacatgac cactgaacaa tggaacactg tggattattt tgaaacggac
aaagctcact 3180cagcagagat agtattgaac caactatgcg tgaggttctt tggactcgat
ctggactccg 3240gtctattttc tgcacccact gttccgttat ccattaggaa taatcactgg
gataactccc 3300cgtcgcctaa catgtacggg ctgaataaag aagtggtccg tcagctctct
cgcaggtacc 3360cacaactgcc tcgggcagtt gccactggaa gagtctatga catgaacact
ggtacactgc 3420gcaattatga tccgcgcata aacctagtac ctgtaaacag aagactgcct
catgctttag 3480tcctccacca taatgaacac ccacagagtg acttttcttc attcgtcagc
aaattgaagg 3540gcagaactgt cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg
gttgactggt 3600tgtcagaccg gcctgaggct accttcagag ctcggctgga tttaggcatc
ccaggtgatg 3660tgcccaaata tgacataata tttgttaatg tgaggacccc atataaatac
catcactatc 3720agcagtgtga agaccatgcc attaagctta gcatgttgac caagaaagct
tgtctgcatc 3780tgaatcccgg cggaacctgt gtcagcatag gttatggtta cgctgacagg
gccagcgaaa 3840gcatcattgg tgctatagcg cggcagttca agttttcccg ggtatgcaaa
ccgaaatcct 3900cacttgaaga gacggaagtt ctgtttgtat tcattgggta cgatcgcaag
gcccgtacgc 3960acaatcctta caagctttca tcaaccttga ccaacattta tacaggttcc
agactccacg 4020aagccggatg tgcaccctca tatcatgtgg tgcgagggga tattgccacg
gccaccgaag 4080gagtgattat aaatgctgct aacagcaaag gacaacctgg cggaggggtg
tgcggagcgc 4140tgtataagaa attcccggaa agcttcgatt tacagccgat cgaagtagga
aaagcgcgac 4200tggtcaaagg tgcagctaaa catatcattc atgccgtagg accaaacttc
aacaaagttt 4260cggaggttga aggtgacaaa cagttggcag aggcttatga gtccatcgct
aagattgtca 4320acgataacaa ttacaagtca gtagcgattc cactgttgtc caccggcatc
ttttccggga 4380acaaagatcg actaacccaa tcattgaacc atttgctgac agctttagac
accactgatg 4440cagatgtagc catatactgc agggacaaga aatgggaaat gactctcaag
gaagcagtgg 4500ctaggagaga agcagtggag gagatatgca tatccgacga ctcttcagtg
acagaacctg 4560atgcagagct ggtgagggtg catccgaaga gttctttggc tggaaggaag
ggctacagca 4620caagcgatgg caaaactttc tcatatttgg aagggaccaa gtttcaccag
gcggccaagg 4680atatagcaga aattaatgcc atgtggcccg ttgcaacgga ggccaatgag
caggtatgca 4740tgtatatcct cggagaaagc atgagcagta ttaggtcgaa atgccccgtc
gaagagtcgg 4800aagcctccac accacctagc acgctgcctt gcttgtgcat ccatgccatg
actccagaaa 4860gagtacagcg cctaaaagcc tcacgtccag aacaaattac tgtgtgctca
tcctttccat 4920tgccgaagta tagaatcact ggtgtgcaga agatccaatg ctcccagcct
atattgttct 4980caccgaaagt gcctgcgtat attcatccaa ggaagtatct cgtggaaaca
ccaccggtag 5040acgagactcc ggagccatcg gcagagaacc aatccacaga ggggacacct
gaacaaccac 5100cacttataac cgaggatgag accaggacta gaacgcctga gccgatcatc
atcgaagagg 5160aagaagagga tagcataagt ttgctgtcag atggcccgac ccaccaggtg
ctgcaagtcg 5220aggcagacat tcacgggccg ccctctgtat ctagctcatc ctggtccatt
cctcatgcat 5280ccgactttga tgtggacagt ttatccatac ttgacaccct ggagggagct
agcgtgacca 5340gcggggcaac gtcagccgag actaactctt acttcgcaaa gagtatggag
tttctggcgc 5400gaccggtgcc tgcgcctcga acagtattca ggaaccctcc acatcccgct
ccgcgcacaa 5460gaacaccgtc acttgcaccc agcagggcct gctcgagaac cagcctagtt
tccaccccgc 5520caggcgtgaa tagggtgatc actagagagg agctcgaggc gcttaccccg
tcacgcactc 5580ctagcaggtc ggtctcgaga accagcctgg tctccaaccc gccaggcgta
aatagggtga 5640ttacaagaga ggagtttgag gcgttcgtag cacaacaaca atgacggttt
gatgcgggtg 5700catacatctt ttcctccgac accggtcaag ggcatttaca acaaaaatca
gtaaggcaaa 5760cggtgctatc cgaagtggtg ttggagagga ccgaattgga gatttcgtat
gccccgcgcc 5820tcgaccaaga aaaagaagaa ttactacgca agaaattaca gttaaatccc
acacctgcta 5880acagaagcag ataccagtcc aggaaggtgg agaacatgaa agccataaca
gctagacgta 5940ttctgcaagg cctagggcat tatttgaagg cagaaggaaa agtggagtgc
taccgaaccc 6000tgcatcctgt tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc
cccaaggtcg 6060cagtggaagc ctgtaacgcc atgttgaaag agaactttcc gactgtggct
tcttactgta 6120ttattccaga gtacgatgcc tatttggaca tggttgacgg agcttcatgc
tgcttagaca 6180ctgccagttt ttgccctgca aagctgcgca gctttccaaa gaaacactcc
tatttggaac 6240ccacaatacg atcggcagtg ccttcagcga tccagaacac gctccagaac
gtcctggcag 6300ctgccacaaa aagaaattgc aatgtcacgc aaatgagaga attgcccgta
ttggattcgg 6360cggcctttaa tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat
tgggaaacgt 6420ttaaagaaaa ccccatcagg cttactgaag aaaacgtggt aaattacatt
accaaattaa 6480aaggaccaaa agctgctgct ctttttgcga agacacataa tttgaatatg
ttgcaggaca 6540taccaatgga caggtttgta atggacttaa agagagacgt gaaagtgact
ccaggaacaa 6600aacatactga agaacggccc aaggtacagg tgatccaggc tgccgatccg
ctagcaacag 6660cgtatctgtg cggaatccac cgagagctgg ttaggagatt aaatgcggtc
ctgcttccga 6720acattcatac actgtttgat atgtcggctg aagactttga cgctattata
gccgagcact 6780tccagcctgg ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa
agtgaggacg 6840acgccatggc tctgaccgcg ttaatgattc tggaagactt aggtgtggac
gcagagctgt 6900tgacgctgat tgaggcggct ttcggcgaaa tttcatcaat acatttgccc
actaaaacta 6960aatttaaatt cggagccatg atgaaatctg gaatgttcct cacactgttt
gtgaacacag 7020tcattaacat tgtaatcgca agcagagtgt tgagagaacg gctaaccgga
tcaccatgtg 7080cagcattcat tggagatgac aatatcgtga aaggagtcaa atcggacaaa
ttaatggcag 7140acaggtgcgc cacctggttg aatatggaag tcaagattat agatgctgtg
gtgggcgaga 7200aagcgcctta tttctgtgga gggtttattt tgtgtgactc cgtgaccggc
acagcgtgcc 7260gtgtggcaga ccccctaaaa aggctgttta agcttggcaa acctctggca
gcagacgatg 7320aacatgatga tgacaggaga agggcattgc atgaagagtc aacacgctgg
aaccgagtgg 7380gtattctttc agagctgtgc aaggcagtag aatcaaggta tgaaaccgta
ggaacttcca 7440tcatagttat ggccatgact actctagcta gcagtgttaa atcattcagc
tacctgagag 7500gggcccctat aactctctac ggctaacctg aatggactac gacatagtct
agtccgccaa 7560gatgaggcct ggcctgccct cctacctgat catcctggcc gtgtgcctgt
tcagccacct 7620gctgtccagc agatacggcg ccgaggccgt gagcgagccc ctggacaagg
ctttccacct 7680gctgctgaac acctacggca gacccatccg gtttctgcgg gagaacacca
cccagtgcac 7740ctacaacagc agcctgcgga acagcaccgt cgtgagagag aacgccatca
gcttcaactt 7800tttccagagc tacaaccagt actacgtgtt ccacatgccc agatgcctgt
ttgccggccc 7860tctggccgag cagttcctga accaggtgga cctgaccgag acactggaaa
gataccagca 7920gcggctgaat acctacgccc tggtgtccaa ggacctggcc agctaccggt
cctttagcca 7980gcagctcaag gctcaggata gcctcggcga gcagcctacc accgtgcccc
ctcccatcga 8040cctgagcatc ccccacgtgt ggatgcctcc ccagaccacc cctcacggct
ggaccgagag 8100ccacaccacc tccggcctgc acagacccca cttcaaccag acctgcatcc
tgttcgacgg 8160ccacgacctg ctgtttagca ccgtgacccc ctgcctgcac cagggcttct
acctgatcga 8220cgagctgaga tacgtgaaga tcaccctgac cgaggatttc ttcgtggtca
ccgtgtccat 8280cgacgacgac acccccatgc tgctgatctt cggccacctg cccagagtgc
tgttcaaggc 8340cccctaccag cgggacaact tcatcctgcg gcagaccgag aagcacgagc
tgctggtgct 8400ggtcaagaag gaccagctga accggcactc ctacctgaag gaccccgact
tcctggacgc 8460cgccctggac ttcaactacc tggacctgag cgccctgctg agaaacagct
tccacagata 8520cgccgtggac gtgctgaagt ccggacggtg ccagatgctc gatcggcgga
ccgtggagat 8580ggccttcgcc tatgccctcg ccctgttcgc cgctgccaga caggaagagg
ctggcgccca 8640ggtgtcagtg cccagagccc tggatagaca ggccgccctg ctgcagatcc
aggaattcat 8700gatcacctgc ctgagccaga ccccccctag aaccaccctg ctgctgtacc
ccacagccgt 8760ggatctggcc aagagggccc tgtggacccc caaccagatc accgacatca
caagcctcgt 8820gcggctcgtg tacatcctga gcaagcagaa ccagcagcac ctgatccccc
agtgggccct 8880gagacagatc gccgacttcg ccctgaagct gcacaagacc catctggcca
gctttctgag 8940cgccttcgcc aggcaggaac tgtacctgat gggcagcctg gtccacagca
tgctggtgca 9000taccaccgag cggcgggaga tcttcatcgt ggagacaggc ctgtgtagcc
tggccgagct 9060gtcccacttt acccagctgc tggcccaccc tcaccacgag tacctgagcg
acctgtacac 9120cccctgcagc agcagcggca gacgggacca cagcctggaa cggctgacca
gactgttccc 9180cgatgccacc gtgcctgcta cagtgcctgc cgccctgtcc atcctgtcca
ccatgcagcc 9240cagcaccctg gaaaccttcc ccgacctgtt ctgcctgccc ctgggcgaga
gctttagcgc 9300cctgaccgtg tccgagcacg tgtcctacat cgtgaccaat cagtacctga
tcaagggcat 9360cagctacccc gtgtccacca cagtcgtggg ccagagcctg atcatcaccc
agaccgacag 9420ccagaccaag tgcgagctga cccggaacat gcacaccaca cacagcatca
ccgtggccct 9480gaacatcagc ctggaaaact gcgctttctg tcagtctgcc ctgctggaat
acgacgatac 9540ccagggcgtg atcaacatca tgtacatgca cgacagcgac gacgtgctgt
tcgccctgga 9600cccctacaac gaggtggtgg tgtccagccc ccggacccac tacctgatgc
tgctgaagaa 9660cggcaccgtg ctggaagtga ccgacgtggt ggtggacgcc accgacctgt
tgaattttga 9720ccttcttaag cttgcgggag acgtcgagtc caaccccggg cccatgtgca
gaaggcccga 9780ctgcggcttc agcttcagcc ctggacccgt gatcctgctg tggtgctgcc
tgctgctgcc 9840tatcgtgtcc tctgccgccg tgtctgtggc ccctacagcc gccgagaagg
tgccagccga 9900gtgccccgag ctgaccagaa gatgcctgct gggcgaggtg ttcgagggcg
acaagtacga 9960gagctggctg cggcccctgg tcaacgtgac cggcagagat ggccccctga
gccagctgat 10020ccggtacaga cccgtgaccc ccgaggccgc caatagcgtg ctgctggacg
aggccttcct 10080ggataccctg gccctgctgt acaacaaccc cgaccagctg agagccctgc
tgaccctgct 10140gtccagcgac accgccccca gatggatgac cgtgatgcgg ggctacagcg
agtgtggaga 10200tggcagccct gccgtgtaca cctgcgtgga cgacctgtgc agaggctacg
acctgaccag 10260actgagctac ggccggtcca tcttcacaga gcacgtgctg ggcttcgagc
tggtgccccc 10320cagcctgttc aacgtggtgg tggccatccg gaacgaggcc accagaacca
acagagccgt 10380gcggctgcct gtgtctacag ccgctgcacc tgagggcatc acactgttct
acggcctgta 10440caacgccgtg aaagagttct gcctccggca ccagctggat ccccccctgc
tgagacacct 10500ggacaagtac tacgccggcc tgcccccaga gctgaagcag accagagtga
acctgcccgc 10560ccacagcaga tatggccctc aggccgtgga cgccagatga taagcggccg
catacagcag 10620caattggcaa gctgcttaca tagaactcgc ggcgattggc atgccgcctt
aaaattttta 10680ttttattttt cttttctttt ccgaatcgga ttttgttttt aatatttcaa
aaaaaaaaaa 10740aaaaaaaaaa aaaaaaaaaa aaagggtcgg catggcatct ccacctcctc
gcggtccgac 10800ctgggcatcc gaaggaggac gcacgtccac tcggatggct aagggagagc
cacgtttaaa 10860cacgtgatat ctggcctcat gggccttcct ttcactgccc gctttccagt
cgggaaacct 10920gtcgtgccag ctgcattaac atggtcatag ctgtttcctt gcgtattggg
cgctctccgc 10980ttcctcgctc actgactcgc tgcgctcggt cgttcgggta aagcctgggg
tgcctaatga 11040gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc
gtttttccat 11100aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag
gtggcgaaac 11160ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt
gcgctctcct 11220gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg
aagcgtggcg 11280ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg
ctccaagctg 11340ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg
taactatcgt 11400cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac
tggtaacagg 11460attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg
gcctaactac 11520ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt
taccttcgga 11580aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg
tggttttttt 11640gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc
tttgatcttt 11700tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt
ggtcatgaga 11760ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt
taaatcaatc 11820taaagtatat atgagtaaac ttggtctgac agttattaga aaaattcatc
cagcagacga 11880taaaacgcaa tacgctggct atccggtgcc gcaatgccat acagcaccag
aaaacgatcc 11940gcccattcgc cgcccagttc ttccgcaata tcacgggtgg ccagcgcaat
atcctgataa 12000cgatccgcca cgcccagacg gccgcaatca ataaagccgc taaaacggcc
attttccacc 12060ataatgttcg gcaggcacgc atcaccatgg gtcaccacca gatcttcgcc
atccggcatg 12120ctcgctttca gacgcgcaaa cagctctgcc ggtgccaggc cctgatgttc
ttcatccaga 12180tcatcctgat ccaccaggcc cgcttccata cgggtacgcg cacgttcaat
acgatgtttc 12240gcctgatgat caaacggaca ggtcgccggg tccagggtat gcagacgacg
catggcatcc 12300gccataatgc tcactttttc tgccggcgcc agatggctag acagcagatc
ctgacccggc 12360acttcgccca gcagcagcca atcacggccc gcttcggtca ccacatccag
caccgccgca 12420cacggaacac cggtggtggc cagccagctc agacgcgccg cttcatcctg
cagctcgttc 12480agcgcaccgc tcagatcggt tttcacaaac agcaccggac gaccctgcgc
gctcagacga 12540aacaccgccg catcagagca gccaatggtc tgctgcgccc aatcatagcc
aaacagacgt 12600tccacccacg ctgccgggct acccgcatgc aggccatcct gttcaatcat
actcttcctt 12660tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata
catatttgaa 12720tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa
agtgccacct 12780aaattgtaag cgttaatatt ttgttaaaat tcgcgttaaa tttttgttaa
atcagctcat 12840tttttaacca ataggccgaa atcggcaaaa tcccttataa atcaaaagaa
tagaccgaga 12900tagggttgag tggccgctac agggcgctcc cattcgccat tcaggctgcg
caactgttgg 12960gaagggcgtt tcggtgcggg cctcttcgct attacgccag ctggcgaaag
ggggatgtgc 13020tgcaaggcga ttaagttggg taacgccagg gttttcccag tcacacgcgt
aatacgactc 13080actatag
13087
60 13788 DNA Artificial Sequence source/note="Description of Artificial Sequence Synthetic polynucleotide" 60
ataggcggcg
catgagagaa gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga
ggaagacagc ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc
caagcaggtc actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa
actgatcgaa acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc
ccgcagaatg tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga
tccggacaga ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga
taaggaattg gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga
aactgagact atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt
ttaccaggat gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg
agttagagtc gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc
tggagcatat ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa
cataggccta tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag
aaagaagtat ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca
cgagaagagg gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa
gcaaaattac acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag
aatagctatc agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg
cgagggattc ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc
cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga
tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa
cggtcgcacc cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca
ggcatttgct aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg
actacgagat agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac
atctatttat aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc
attcgtgctg cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag
gaaaatgtta gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga
agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc
tctaccacct ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tcgacttgat
gttacaagag gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag
ctacgatggc gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa
gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc
tggccgaaaa gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg
acatgcaata cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa
cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa
cactgatgaa gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta
cgacatcgac aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg
cgagctggtg gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc
cgctccttac caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg
catcattaaa agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg
tgcagaaatt ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt
ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc
ttttgcttgt catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc
agtgctctgc ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca
ttttaaccac gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa
atctgtgact tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc
gaaagagact aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct
cattctcact tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga
aataatgacg gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta
caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac
ccgcacggag gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact
gactgccaag taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga
tgccatcatg aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc
aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac
cactgaacaa tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat
agtattgaac caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc
tgcacccact gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa
catgtacggg ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc
tcgggcagtt gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga
tccgcgcata aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca
taatgaacac ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt
cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg
gcctgaggct accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata
tgacataata tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga
agaccatgcc attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg
cggaacctgt gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg
tgctatagcg cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga
gacggaagtt ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta
caagctttca tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg
tgcaccctca tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat
aaatgctgct aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa
attcccggaa agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg
tgcagctaaa catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga
aggtgacaaa cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa
ttacaagtca gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg
actaacccaa tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc
catatactgc agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga
agcagtggag gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct
ggtgagggtg catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg
caaaactttc tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga
aattaatgcc atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct
cggagaaagc atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac
accacctagc acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg
cctaaaagcc tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta
tagaatcact ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt
gcctgcgtat attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc
ggagccatcg gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac
cgaggatgag accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga
tagcataagt ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat
tcacgggccg ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga
tgtggacagt ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac
gtcagccgag actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc
tgcgcctcga acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc
acttgcaccc agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa
tagggtgatc actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc
ggtctcgaga accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga
ggagtttgag gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt
ttcctccgac accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc
cgaagtggtg ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga
aaaagaagaa ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag
ataccagtcc aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg
cctagggcat tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt
tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc
ctgtaacgcc atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga
gtacgatgcc tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt
ttgccctgca aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg
atcggcagtg ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa
aagaaattgc aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa
tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa
ccccatcagg cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa
agctgctgct ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga
caggtttgta atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga
agaacggccc aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg
cggaatccac cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac
actgtttgat atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg
ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc
tctgaccgcg ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat
tgaggcggct ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt
cggagccatg atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat
tgtaatcgca agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat
tggagatgac aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc
cacctggttg aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta
tttctgtgga gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga
ccccctaaaa aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga
tgacaggaga agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc
agagctgtgc aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat
ggccatgact actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat
aactctctac ggctaacctg aatggactac gacatagtct agtccgccaa 7560gatgaggcct
ggcctgccct cctacctgat catcctggcc gtgtgcctgt tcagccacct 7620gctgtccagc
agatacggcg ccgaggccgt gagcgagccc ctggacaagg ctttccacct 7680gctgctgaac
acctacggca gacccatccg gtttctgcgg gagaacacca cccagtgcac 7740ctacaacagc
agcctgcgga acagcaccgt cgtgagagag aacgccatca gcttcaactt 7800tttccagagc
tacaaccagt actacgtgtt ccacatgccc agatgcctgt ttgccggccc 7860tctggccgag
cagttcctga accaggtgga cctgaccgag acactggaaa gataccagca 7920gcggctgaat
acctacgccc tggtgtccaa ggacctggcc agctaccggt cctttagcca 7980gcagctcaag
gctcaggata gcctcggcga gcagcctacc accgtgcccc ctcccatcga 8040cctgagcatc
ccccacgtgt ggatgcctcc ccagaccacc cctcacggct ggaccgagag 8100ccacaccacc
tccggcctgc acagacccca cttcaaccag acctgcatcc tgttcgacgg 8160ccacgacctg
ctgtttagca ccgtgacccc ctgcctgcac cagggcttct acctgatcga 8220cgagctgaga
tacgtgaaga tcaccctgac cgaggatttc ttcgtggtca ccgtgtccat 8280cgacgacgac
acccccatgc tgctgatctt cggccacctg cccagagtgc tgttcaaggc 8340cccctaccag
cgggacaact tcatcctgcg gcagaccgag aagcacgagc tgctggtgct 8400ggtcaagaag
gaccagctga accggcactc ctacctgaag gaccccgact tcctggacgc 8460cgccctggac
ttcaactacc tggacctgag cgccctgctg agaaacagct tccacagata 8520cgccgtggac
gtgctgaagt ccggacggtg ccagatgctc gatcggcgga ccgtggagat 8580ggccttcgcc
tatgccctcg ccctgttcgc cgctgccaga caggaagagg ctggcgccca 8640ggtgtcagtg
cccagagccc tggatagaca ggccgccctg ctgcagatcc aggaattcat 8700gatcacctgc
ctgagccaga ccccccctag aaccaccctg ctgctgtacc ccacagccgt 8760ggatctggcc
aagagggccc tgtggacccc caaccagatc accgacatca caagcctcgt 8820gcggctcgtg
tacatcctga gcaagcagaa ccagcagcac ctgatccccc agtgggccct 8880gagacagatc
gccgacttcg ccctgaagct gcacaagacc catctggcca gctttctgag 8940cgccttcgcc
aggcaggaac tgtacctgat gggcagcctg gtccacagca tgctggtgca 9000taccaccgag
cggcgggaga tcttcatcgt ggagacaggc ctgtgtagcc tggccgagct 9060gtcccacttt
acccagctgc tggcccaccc tcaccacgag tacctgagcg acctgtacac 9120cccctgcagc
agcagcggca gacgggacca cagcctggaa cggctgacca gactgttccc 9180cgatgccacc
gtgcctgcta cagtgcctgc cgccctgtcc atcctgtcca ccatgcagcc 9240cagcaccctg
gaaaccttcc ccgacctgtt ctgcctgccc ctgggcgaga gctttagcgc 9300cctgaccgtg
tccgagcacg tgtcctacat cgtgaccaat cagtacctga tcaagggcat 9360cagctacccc
gtgtccacca cagtcgtggg ccagagcctg atcatcaccc agaccgacag 9420ccagaccaag
tgcgagctga cccggaacat gcacaccaca cacagcatca ccgtggccct 9480gaacatcagc
ctggaaaact gcgctttctg tcagtctgcc ctgctggaat acgacgatac 9540ccagggcgtg
atcaacatca tgtacatgca cgacagcgac gacgtgctgt tcgccctgga 9600cccctacaac
gaggtggtgg tgtccagccc ccggacccac tacctgatgc tgctgaagaa 9660cggcaccgtg
ctggaagtga ccgacgtggt ggtggacgcc accgactgat aatctagatt 9720aaaacagctg
tgggttgttc ccacccacag ggcccactgg gcgctagcac tctgatttta 9780cgaaatcctt
gtgcgcctgt tttatatccc ttccctaatt cgaaacgtag aagcaatgcg 9840caccactgat
caatagtagg cgtaacgcgc cagttacgtc atgatcaagc atatctgttc 9900ccccggactg
agtatcaata gactgcttac gcggttgaag gagaaaacgt tcgttatccg 9960gctaactact
tcgagaagcc cagtaacacc atggaagctg cagggtgttt cgctcagcac 10020ttcccccgtg
tagatcaggt cgatgagcca ctgcaatccc cacaggtgac tgtggcagtg 10080gctgcgttgg
cggcctgcct atggggagac ccataggacg ctctaatgtg gacatggtgc 10140gaagagccta
ttgagctagt tagtagtcct ccggcccctg aatgcggcta atcctaactg 10200cggagcacat
gccttcaacc cagagggtag tgtgtcgtaa tgggcaactc tgcagcggaa 10260ccgactactt
tgggtgtccg tgtttctttt tattcttata ttggctgctt atggtgacaa 10320ttacagaatt
gttaccatat agctattgga ttggccatcc ggtgtgtaat agagctgtta 10380tatacctatt
tgttggcttt gtaccactaa ctttaaaatc tataactacc ctcaacttta 10440tattaaccct
caatacagtt gaacatgtgc agaaggcccg actgcggctt cagcttcagc 10500cctggacccg
tgatcctgct gtggtgctgc ctgctgctgc ctatcgtgtc ctctgccgcc 10560gtgtctgtgg
cccctacagc cgccgagaag gtgccagccg agtgccccga gctgaccaga 10620agatgcctgc
tgggcgaggt gttcgagggc gacaagtacg agagctggct gcggcccctg 10680gtcaacgtga
ccggcagaga tggccccctg agccagctga tccggtacag acccgtgacc 10740cccgaggccg
ccaatagcgt gctgctggac gaggccttcc tggataccct ggccctgctg 10800tacaacaacc
ccgaccagct gagagccctg ctgaccctgc tgtccagcga caccgccccc 10860agatggatga
ccgtgatgcg gggctacagc gagtgtggag atggcagccc tgccgtgtac 10920acctgcgtgg
acgacctgtg cagaggctac gacctgacca gactgagcta cggccggtcc 10980atcttcacag
agcacgtgct gggcttcgag ctggtgcccc ccagcctgtt caacgtggtg 11040gtggccatcc
ggaacgaggc caccagaacc aacagagccg tgcggctgcc tgtgtctaca 11100gccgctgcac
ctgagggcat cacactgttc tacggcctgt acaacgccgt gaaagagttc 11160tgcctccggc
accagctgga tccccccctg ctgagacacc tggacaagta ctacgccggc 11220ctgcccccag
agctgaagca gaccagagtg aacctgcccg cccacagcag atatggccct 11280caggccgtgg
acgccagatg ataagcggcc gcatacagca gcaattggca agctgcttac 11340atagaactcg
cggcgattgg catgccgcct taaaattttt attttatttt tcttttcttt 11400tccgaatcgg
attttgtttt taatatttca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 11460aaaagggtcg
gcatggcatc tccacctcct cgcggtccga cctgggcatc cgaaggagga 11520cgcacgtcca
ctcggatggc taagggagag ccacgtttaa acacgtgata tctggcctca 11580tgggccttcc
tttcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 11640catggtcata
gctgtttcct tgcgtattgg gcgctctccg cttcctcgct cactgactcg 11700ctgcgctcgg
tcgttcgggt aaagcctggg gtgcctaatg agcaaaaggc cagcaaaagg 11760ccaggaaccg
taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 11820agcatcacaa
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 11880accaggcgtt
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 11940ccggatacct
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 12000gtaggtatct
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 12060ccgttcagcc
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 12120gacacgactt
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 12180taggcggtgc
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 12240tatttggtat
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 12300gatccggcaa
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 12360cgcgcagaaa
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 12420agtggaacga
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 12480cctagatcct
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 12540cttggtctga
cagttattag aaaaattcat ccagcagacg ataaaacgca atacgctggc 12600tatccggtgc
cgcaatgcca tacagcacca gaaaacgatc cgcccattcg ccgcccagtt 12660cttccgcaat
atcacgggtg gccagcgcaa tatcctgata acgatccgcc acgcccagac 12720ggccgcaatc
aataaagccg ctaaaacggc cattttccac cataatgttc ggcaggcacg 12780catcaccatg
ggtcaccacc agatcttcgc catccggcat gctcgctttc agacgcgcaa 12840acagctctgc
cggtgccagg ccctgatgtt cttcatccag atcatcctga tccaccaggc 12900ccgcttccat
acgggtacgc gcacgttcaa tacgatgttt cgcctgatga tcaaacggac 12960aggtcgccgg
gtccagggta tgcagacgac gcatggcatc cgccataatg ctcacttttt 13020ctgccggcgc
cagatggcta gacagcagat cctgacccgg cacttcgccc agcagcagcc 13080aatcacggcc
cgcttcggtc accacatcca gcaccgccgc acacggaaca ccggtggtgg 13140ccagccagct
cagacgcgcc gcttcatcct gcagctcgtt cagcgcaccg ctcagatcgg 13200ttttcacaaa
cagcaccgga cgaccctgcg cgctcagacg aaacaccgcc gcatcagagc 13260agccaatggt
ctgctgcgcc caatcatagc caaacagacg ttccacccac gctgccgggc 13320tacccgcatg
caggccatcc tgttcaatca tactcttcct ttttcaatat tattgaagca 13380tttatcaggg
ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 13440aaataggggt
tccgcgcaca tttccccgaa aagtgccacc taaattgtaa gcgttaatat 13500tttgttaaaa
ttcgcgttaa atttttgtta aatcagctca ttttttaacc aataggccga 13560aatcggcaaa
atcccttata aatcaaaaga atagaccgag atagggttga gtggccgcta 13620cagggcgctc
ccattcgcca ttcaggctgc gcaactgttg ggaagggcgt ttcggtgcgg 13680gcctcttcgc
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg 13740gtaacgccag
ggttttccca gtcacacgcg taatacgact cactatag
13788
61 13788 DNA Artificial Sequence source/note="Description of Artificial Sequence Synthetic polynucleotide" 61
ataggcggcg catgagagaa
gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga ggaagacagc
ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc caagcaggtc
actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa actgatcgaa
acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc ccgcagaatg
tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga tccggacaga
ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga taaggaattg
gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga aactgagact
atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt ttaccaggat
gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg agttagagtc
gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc tggagcatat
ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa cataggccta
tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag aaagaagtat
ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca cgagaagagg
gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa gcaaaattac
acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag aatagctatc
agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg cgagggattc
ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc cgtgtgcacg
tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga tgtcagtgcg
gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa cggtcgcacc
cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca ggcatttgct
aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg actacgagat
agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac atctatttat
aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc attcgtgctg
cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag gaaaatgtta
gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga agctaagtgc
gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc tctaccacct
ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tcgacttgat gttacaagag
gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag ctacgatggc
gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa gagtgaaaaa
ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc tggccgaaaa
gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg acatgcaata
cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa cgaacgtgag
ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa cactgatgaa
gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta cgacatcgac
aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg cgagctggtg
gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc cgctccttac
caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg catcattaaa
agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg tgcagaaatt
ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt ggactcagtg
ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc ttttgcttgt
catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc agtgctctgc
ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca ttttaaccac
gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa atctgtgact
tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc gaaagagact
aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct cattctcact
tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga aataatgacg
gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta caaggtgaat
gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac ccgcacggag
gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact gactgccaag
taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga tgccatcatg
aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc aaacgtgtgt
tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac cactgaacaa
tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat agtattgaac
caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc tgcacccact
gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa catgtacggg
ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc tcgggcagtt
gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga tccgcgcata
aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca taatgaacac
ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt cctggtggtc
ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg gcctgaggct
accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata tgacataata
tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga agaccatgcc
attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg cggaacctgt
gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg tgctatagcg
cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga gacggaagtt
ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta caagctttca
tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg tgcaccctca
tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat aaatgctgct
aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa attcccggaa
agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg tgcagctaaa
catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga aggtgacaaa
cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa ttacaagtca
gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg actaacccaa
tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc catatactgc
agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga agcagtggag
gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct ggtgagggtg
catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg caaaactttc
tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga aattaatgcc
atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct cggagaaagc
atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac accacctagc
acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg cctaaaagcc
tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta tagaatcact
ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt gcctgcgtat
attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc ggagccatcg
gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac cgaggatgag
accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga tagcataagt
ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat tcacgggccg
ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga tgtggacagt
ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac gtcagccgag
actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc tgcgcctcga
acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc acttgcaccc
agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa tagggtgatc
actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc ggtctcgaga
accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga ggagtttgag
gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt ttcctccgac
accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc cgaagtggtg
ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga aaaagaagaa
ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag ataccagtcc
aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg cctagggcat
tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt tcctttgtat
tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc ctgtaacgcc
atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga gtacgatgcc
tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt ttgccctgca
aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg atcggcagtg
ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa aagaaattgc
aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa tgtggaatgc
ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa ccccatcagg
cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa agctgctgct
ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga caggtttgta
atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga agaacggccc
aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg cggaatccac
cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac actgtttgat
atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg ggattgtgtt
ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc tctgaccgcg
ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat tgaggcggct
ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt cggagccatg
atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat tgtaatcgca
agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat tggagatgac
aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc cacctggttg
aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta tttctgtgga
gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga ccccctaaaa
aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga tgacaggaga
agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc agagctgtgc
aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat ggccatgact
actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat aactctctac
ggctaacctg aatggactac gacatagtct agtccgccaa 7560gatgtgcaga aggcccgact
gcggcttcag cttcagccct ggacccgtga tcctgctgtg 7620gtgctgcctg ctgctgccta
tcgtgtcctc tgccgccgtg tctgtggccc ctacagccgc 7680cgagaaggtg ccagccgagt
gccccgagct gaccagaaga tgcctgctgg gcgaggtgtt 7740cgagggcgac aagtacgaga
gctggctgcg gcccctggtc aacgtgaccg gcagagatgg 7800ccccctgagc cagctgatcc
ggtacagacc cgtgaccccc gaggccgcca atagcgtgct 7860gctggacgag gccttcctgg
ataccctggc cctgctgtac aacaaccccg accagctgag 7920agccctgctg accctgctgt
ccagcgacac cgcccccaga tggatgaccg tgatgcgggg 7980ctacagcgag tgtggagatg
gcagccctgc cgtgtacacc tgcgtggacg acctgtgcag 8040aggctacgac ctgaccagac
tgagctacgg ccggtccatc ttcacagagc acgtgctggg 8100cttcgagctg gtgcccccca
gcctgttcaa cgtggtggtg gccatccgga acgaggccac 8160cagaaccaac agagccgtgc
ggctgcctgt gtctacagcc gctgcacctg agggcatcac 8220actgttctac ggcctgtaca
acgccgtgaa agagttctgc ctccggcacc agctggatcc 8280ccccctgctg agacacctgg
acaagtacta cgccggcctg cccccagagc tgaagcagac 8340cagagtgaac ctgcccgccc
acagcagata tggccctcag gccgtggacg ccagatgata 8400atctagatta aaacagctgt
gggttgttcc cacccacagg gcccactggg cgctagcact 8460ctgattttac gaaatccttg
tgcgcctgtt ttatatccct tccctaattc gaaacgtaga 8520agcaatgcgc accactgatc
aatagtaggc gtaacgcgcc agttacgtca tgatcaagca 8580tatctgttcc cccggactga
gtatcaatag actgcttacg cggttgaagg agaaaacgtt 8640cgttatccgg ctaactactt
cgagaagccc agtaacacca tggaagctgc agggtgtttc 8700gctcagcact tcccccgtgt
agatcaggtc gatgagccac tgcaatcccc acaggtgact 8760gtggcagtgg ctgcgttggc
ggcctgccta tggggagacc cataggacgc tctaatgtgg 8820acatggtgcg aagagcctat
tgagctagtt agtagtcctc cggcccctga atgcggctaa 8880tcctaactgc ggagcacatg
ccttcaaccc agagggtagt gtgtcgtaat gggcaactct 8940gcagcggaac cgactacttt
gggtgtccgt gtttcttttt attcttatat tggctgctta 9000tggtgacaat tacagaattg
ttaccatata gctattggat tggccatccg gtgtgtaata 9060gagctgttat atacctattt
gttggctttg taccactaac tttaaaatct ataactaccc 9120tcaactttat attaaccctc
aatacagttg aacatgaggc ctggcctgcc ctcctacctg 9180atcatcctgg ccgtgtgcct
gttcagccac ctgctgtcca gcagatacgg cgccgaggcc 9240gtgagcgagc ccctggacaa
ggctttccac ctgctgctga acacctacgg cagacccatc 9300cggtttctgc gggagaacac
cacccagtgc acctacaaca gcagcctgcg gaacagcacc 9360gtcgtgagag agaacgccat
cagcttcaac tttttccaga gctacaacca gtactacgtg 9420ttccacatgc ccagatgcct
gtttgccggc cctctggccg agcagttcct gaaccaggtg 9480gacctgaccg agacactgga
aagataccag cagcggctga atacctacgc cctggtgtcc 9540aaggacctgg ccagctaccg
gtcctttagc cagcagctca aggctcagga tagcctcggc 9600gagcagccta ccaccgtgcc
ccctcccatc gacctgagca tcccccacgt gtggatgcct 9660ccccagacca cccctcacgg
ctggaccgag agccacacca cctccggcct gcacagaccc 9720cacttcaacc agacctgcat
cctgttcgac ggccacgacc tgctgtttag caccgtgacc 9780ccctgcctgc accagggctt
ctacctgatc gacgagctga gatacgtgaa gatcaccctg 9840accgaggatt tcttcgtggt
caccgtgtcc atcgacgacg acacccccat gctgctgatc 9900ttcggccacc tgcccagagt
gctgttcaag gccccctacc agcgggacaa cttcatcctg 9960cggcagaccg agaagcacga
gctgctggtg ctggtcaaga aggaccagct gaaccggcac 10020tcctacctga aggaccccga
cttcctggac gccgccctgg acttcaacta cctggacctg 10080agcgccctgc tgagaaacag
cttccacaga tacgccgtgg acgtgctgaa gtccggacgg 10140tgccagatgc tcgatcggcg
gaccgtggag atggccttcg cctatgccct cgccctgttc 10200gccgctgcca gacaggaaga
ggctggcgcc caggtgtcag tgcccagagc cctggataga 10260caggccgccc tgctgcagat
ccaggaattc atgatcacct gcctgagcca gaccccccct 10320agaaccaccc tgctgctgta
ccccacagcc gtggatctgg ccaagagggc cctgtggacc 10380cccaaccaga tcaccgacat
cacaagcctc gtgcggctcg tgtacatcct gagcaagcag 10440aaccagcagc acctgatccc
ccagtgggcc ctgagacaga tcgccgactt cgccctgaag 10500ctgcacaaga cccatctggc
cagctttctg agcgccttcg ccaggcagga actgtacctg 10560atgggcagcc tggtccacag
catgctggtg cataccaccg agcggcggga gatcttcatc 10620gtggagacag gcctgtgtag
cctggccgag ctgtcccact ttacccagct gctggcccac 10680cctcaccacg agtacctgag
cgacctgtac accccctgca gcagcagcgg cagacgggac 10740cacagcctgg aacggctgac
cagactgttc cccgatgcca ccgtgcctgc tacagtgcct 10800gccgccctgt ccatcctgtc
caccatgcag cccagcaccc tggaaacctt ccccgacctg 10860ttctgcctgc ccctgggcga
gagctttagc gccctgaccg tgtccgagca cgtgtcctac 10920atcgtgacca atcagtacct
gatcaagggc atcagctacc ccgtgtccac cacagtcgtg 10980ggccagagcc tgatcatcac
ccagaccgac agccagacca agtgcgagct gacccggaac 11040atgcacacca cacacagcat
caccgtggcc ctgaacatca gcctggaaaa ctgcgctttc 11100tgtcagtctg ccctgctgga
atacgacgat acccagggcg tgatcaacat catgtacatg 11160cacgacagcg acgacgtgct
gttcgccctg gacccctaca acgaggtggt ggtgtccagc 11220ccccggaccc actacctgat
gctgctgaag aacggcaccg tgctggaagt gaccgacgtg 11280gtggtggacg ccaccgactg
ataagcggcc gcatacagca gcaattggca agctgcttac 11340atagaactcg cggcgattgg
catgccgcct taaaattttt attttatttt tcttttcttt 11400tccgaatcgg attttgtttt
taatatttca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 11460aaaagggtcg gcatggcatc
tccacctcct cgcggtccga cctgggcatc cgaaggagga 11520cgcacgtcca ctcggatggc
taagggagag ccacgtttaa acacgtgata tctggcctca 11580tgggccttcc tttcactgcc
cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 11640catggtcata gctgtttcct
tgcgtattgg gcgctctccg cttcctcgct cactgactcg 11700ctgcgctcgg tcgttcgggt
aaagcctggg gtgcctaatg agcaaaaggc cagcaaaagg 11760ccaggaaccg taaaaaggcc
gcgttgctgg cgtttttcca taggctccgc ccccctgacg 11820agcatcacaa aaatcgacgc
tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 11880accaggcgtt tccccctgga
agctccctcg tgcgctctcc tgttccgacc ctgccgctta 11940ccggatacct gtccgccttt
ctcccttcgg gaagcgtggc gctttctcat agctcacgct 12000gtaggtatct cagttcggtg
taggtcgttc gctccaagct gggctgtgtg cacgaacccc 12060ccgttcagcc cgaccgctgc
gccttatccg gtaactatcg tcttgagtcc aacccggtaa 12120gacacgactt atcgccactg
gcagcagcca ctggtaacag gattagcaga gcgaggtatg 12180taggcggtgc tacagagttc
ttgaagtggt ggcctaacta cggctacact agaagaacag 12240tatttggtat ctgcgctctg
ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 12300gatccggcaa acaaaccacc
gctggtagcg gtggtttttt tgtttgcaag cagcagatta 12360cgcgcagaaa aaaaggatct
caagaagatc ctttgatctt ttctacgggg tctgacgctc 12420agtggaacga aaactcacgt
taagggattt tggtcatgag attatcaaaa aggatcttca 12480cctagatcct tttaaattaa
aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 12540cttggtctga cagttattag
aaaaattcat ccagcagacg ataaaacgca atacgctggc 12600tatccggtgc cgcaatgcca
tacagcacca gaaaacgatc cgcccattcg ccgcccagtt 12660cttccgcaat atcacgggtg
gccagcgcaa tatcctgata acgatccgcc acgcccagac 12720ggccgcaatc aataaagccg
ctaaaacggc cattttccac cataatgttc ggcaggcacg 12780catcaccatg ggtcaccacc
agatcttcgc catccggcat gctcgctttc agacgcgcaa 12840acagctctgc cggtgccagg
ccctgatgtt cttcatccag atcatcctga tccaccaggc 12900ccgcttccat acgggtacgc
gcacgttcaa tacgatgttt cgcctgatga tcaaacggac 12960aggtcgccgg gtccagggta
tgcagacgac gcatggcatc cgccataatg ctcacttttt 13020ctgccggcgc cagatggcta
gacagcagat cctgacccgg cacttcgccc agcagcagcc 13080aatcacggcc cgcttcggtc
accacatcca gcaccgccgc acacggaaca ccggtggtgg 13140ccagccagct cagacgcgcc
gcttcatcct gcagctcgtt cagcgcaccg ctcagatcgg 13200ttttcacaaa cagcaccgga
cgaccctgcg cgctcagacg aaacaccgcc gcatcagagc 13260agccaatggt ctgctgcgcc
caatcatagc caaacagacg ttccacccac gctgccgggc 13320tacccgcatg caggccatcc
tgttcaatca tactcttcct ttttcaatat tattgaagca 13380tttatcaggg ttattgtctc
atgagcggat acatatttga atgtatttag aaaaataaac 13440aaataggggt tccgcgcaca
tttccccgaa aagtgccacc taaattgtaa gcgttaatat 13500tttgttaaaa ttcgcgttaa
atttttgtta aatcagctca ttttttaacc aataggccga 13560aatcggcaaa atcccttata
aatcaaaaga atagaccgag atagggttga gtggccgcta 13620cagggcgctc ccattcgcca
ttcaggctgc gcaactgttg ggaagggcgt ttcggtgcgg 13680gcctcttcgc tattacgcca
gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg 13740gtaacgccag ggttttccca
gtcacacgcg taatacgact cactatag 13788
62 14202 DNA Artificial Sequence
source/note="Description of Artificial Sequence Synthetic polynucleotide"
62
ataggcggcg catgagagaa gcccagacca attacctacc caaaatggag
aaagttcacg 60ttgacatcga ggaagacagc ccattcctca gagctttgca gcggagcttc
ccgcagtttg 120aggtagaagc caagcaggtc actgataatg accatgctaa tgccagagcg
ttttcgcatc 180tggcttcaaa actgatcgaa acggaggtgg acccatccga cacgatcctt
gacattggaa 240gtgcgcccgc ccgcagaatg tattctaagc acaagtatca ttgtatctgt
ccgatgagat 300gtgcggaaga tccggacaga ttgtataagt atgcaactaa gctgaagaaa
aactgtaagg 360aaataactga taaggaattg gacaagaaaa tgaaggagct cgccgccgtc
atgagcgacc 420ctgacctgga aactgagact atgtgcctcc acgacgacga gtcgtgtcgc
tacgaagggc 480aagtcgctgt ttaccaggat gtatacgcgg ttgacggacc gacaagtctc
tatcaccaag 540ccaataaggg agttagagtc gcctactgga taggctttga caccacccct
tttatgttta 600agaacttggc tggagcatat ccatcatact ctaccaactg ggccgacgaa
accgtgttaa 660cggctcgtaa cataggccta tgcagctctg acgttatgga gcggtcacgt
agagggatgt 720ccattcttag aaagaagtat ttgaaaccat ccaacaatgt tctattctct
gttggctcga 780ccatctacca cgagaagagg gacttactga ggagctggca cctgccgtct
gtatttcact 840tacgtggcaa gcaaaattac acatgtcggt gtgagactat agttagttgc
gacgggtacg 900tcgttaaaag aatagctatc agtccaggcc tgtatgggaa gccttcaggc
tatgctgcta 960cgatgcaccg cgagggattc ttgtgctgca aagtgacaga cacattgaac
ggggagaggg 1020tctcttttcc cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg
actggcatac 1080tggcaacaga tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc
aaccagcgta 1140tagtcgtcaa cggtcgcacc cagagaaaca ccaataccat gaaaaattac
cttttgcccg 1200tagtggccca ggcatttgct aggtgggcaa aggaatataa ggaagatcaa
gaagatgaaa 1260ggccactagg actacgagat agacagttag tcatggggtg ttgttgggct
tttagaaggc 1320acaagataac atctatttat aagcgcccgg atacccaaac catcatcaaa
gtgaacagcg 1380atttccactc attcgtgctg cccaggatag gcagtaacac attggagatc
gggctgagaa 1440caagaatcag gaaaatgtta gaggagcaca aggagccgtc acctctcatt
accgccgagg 1500acgtacaaga agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa
gccgaggagt 1560tgcgcgcagc tctaccacct ttggcagctg atgttgagga gcccactctg
gaagccgatg 1620tcgacttgat gttacaagag gctggggccg gctcagtgga gacacctcgt
ggcttgataa 1680aggttaccag ctacgatggc gaggacaaga tcggctctta cgctgtgctt
tctccgcagg 1740ctgtactcaa gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa
gtcatagtga 1800taacacactc tggccgaaaa gggcgttatg ccgtggaacc ataccatggt
aaagtagtgg 1860tgccagaggg acatgcaata cccgtccagg actttcaagc tctgagtgaa
agtgccacca 1920ttgtgtacaa cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc
acacatggag 1980gagcgctgaa cactgatgaa gaatattaca aaactgtcaa gcccagcgag
cacgacggcg 2040aatacctgta cgacatcgac aggaaacagt gcgtcaagaa agaactagtc
actgggctag 2100ggctcacagg cgagctggtg gatcctccct tccatgaatt cgcctacgag
agtctgagaa 2160cacgaccagc cgctccttac caagtaccaa ccataggggt gtatggcgtg
ccaggatcag 2220gcaagtctgg catcattaaa agcgcagtca ccaaaaaaga tctagtggtg
agcgccaaga 2280aagaaaactg tgcagaaatt ataagggacg tcaagaaaat gaaagggctg
gacgtcaatg 2340ccagaactgt ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag
accctgtata 2400ttgacgaagc ttttgcttgt catgcaggta ctctcagagc gctcatagcc
attataagac 2460ctaaaaaggc agtgctctgc ggggatccca aacagtgcgg tttttttaac
atgatgtgcc 2520tgaaagtgca ttttaaccac gagatttgca cacaagtctt ccacaaaagc
atctctcgcc 2580gttgcactaa atctgtgact tcggtcgtct caaccttgtt ttacgacaaa
aaaatgagaa 2640cgacgaatcc gaaagagact aagattgtga ttgacactac cggcagtacc
aaacctaagc 2700aggacgatct cattctcact tgtttcagag ggtgggtgaa gcagttgcaa
atagattaca 2760aaggcaacga aataatgacg gcagctgcct ctcaagggct gacccgtaaa
ggtgtgtatg 2820ccgttcggta caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa
catgtgaacg 2880tcctactgac ccgcacggag gaccgcatcg tgtggaaaac actagccggc
gacccatgga 2940taaaaacact gactgccaag taccctggga atttcactgc cacgatagag
gagtggcaag 3000cagagcatga tgccatcatg aggcacatct tggagagacc ggaccctacc
gacgtcttcc 3060agaataaggc aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag
accgctggca 3120tagacatgac cactgaacaa tggaacactg tggattattt tgaaacggac
aaagctcact 3180cagcagagat agtattgaac caactatgcg tgaggttctt tggactcgat
ctggactccg 3240gtctattttc tgcacccact gttccgttat ccattaggaa taatcactgg
gataactccc 3300cgtcgcctaa catgtacggg ctgaataaag aagtggtccg tcagctctct
cgcaggtacc 3360cacaactgcc tcgggcagtt gccactggaa gagtctatga catgaacact
ggtacactgc 3420gcaattatga tccgcgcata aacctagtac ctgtaaacag aagactgcct
catgctttag 3480tcctccacca taatgaacac ccacagagtg acttttcttc attcgtcagc
aaattgaagg 3540gcagaactgt cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg
gttgactggt 3600tgtcagaccg gcctgaggct accttcagag ctcggctgga tttaggcatc
ccaggtgatg 3660tgcccaaata tgacataata tttgttaatg tgaggacccc atataaatac
catcactatc 3720agcagtgtga agaccatgcc attaagctta gcatgttgac caagaaagct
tgtctgcatc 3780tgaatcccgg cggaacctgt gtcagcatag gttatggtta cgctgacagg
gccagcgaaa 3840gcatcattgg tgctatagcg cggcagttca agttttcccg ggtatgcaaa
ccgaaatcct 3900cacttgaaga gacggaagtt ctgtttgtat tcattgggta cgatcgcaag
gcccgtacgc 3960acaatcctta caagctttca tcaaccttga ccaacattta tacaggttcc
agactccacg 4020aagccggatg tgcaccctca tatcatgtgg tgcgagggga tattgccacg
gccaccgaag 4080gagtgattat aaatgctgct aacagcaaag gacaacctgg cggaggggtg
tgcggagcgc 4140tgtataagaa attcccggaa agcttcgatt tacagccgat cgaagtagga
aaagcgcgac 4200tggtcaaagg tgcagctaaa catatcattc atgccgtagg accaaacttc
aacaaagttt 4260cggaggttga aggtgacaaa cagttggcag aggcttatga gtccatcgct
aagattgtca 4320acgataacaa ttacaagtca gtagcgattc cactgttgtc caccggcatc
ttttccggga 4380acaaagatcg actaacccaa tcattgaacc atttgctgac agctttagac
accactgatg 4440cagatgtagc catatactgc agggacaaga aatgggaaat gactctcaag
gaagcagtgg 4500ctaggagaga agcagtggag gagatatgca tatccgacga ctcttcagtg
acagaacctg 4560atgcagagct ggtgagggtg catccgaaga gttctttggc tggaaggaag
ggctacagca 4620caagcgatgg caaaactttc tcatatttgg aagggaccaa gtttcaccag
gcggccaagg 4680atatagcaga aattaatgcc atgtggcccg ttgcaacgga ggccaatgag
caggtatgca 4740tgtatatcct cggagaaagc atgagcagta ttaggtcgaa atgccccgtc
gaagagtcgg 4800aagcctccac accacctagc acgctgcctt gcttgtgcat ccatgccatg
actccagaaa 4860gagtacagcg cctaaaagcc tcacgtccag aacaaattac tgtgtgctca
tcctttccat 4920tgccgaagta tagaatcact ggtgtgcaga agatccaatg ctcccagcct
atattgttct 4980caccgaaagt gcctgcgtat attcatccaa ggaagtatct cgtggaaaca
ccaccggtag 5040acgagactcc ggagccatcg gcagagaacc aatccacaga ggggacacct
gaacaaccac 5100cacttataac cgaggatgag accaggacta gaacgcctga gccgatcatc
atcgaagagg 5160aagaagagga tagcataagt ttgctgtcag atggcccgac ccaccaggtg
ctgcaagtcg 5220aggcagacat tcacgggccg ccctctgtat ctagctcatc ctggtccatt
cctcatgcat 5280ccgactttga tgtggacagt ttatccatac ttgacaccct ggagggagct
agcgtgacca 5340gcggggcaac gtcagccgag actaactctt acttcgcaaa gagtatggag
tttctggcgc 5400gaccggtgcc tgcgcctcga acagtattca ggaaccctcc acatcccgct
ccgcgcacaa 5460gaacaccgtc acttgcaccc agcagggcct gctcgagaac cagcctagtt
tccaccccgc 5520caggcgtgaa tagggtgatc actagagagg agctcgaggc gcttaccccg
tcacgcactc 5580ctagcaggtc ggtctcgaga accagcctgg tctccaaccc gccaggcgta
aatagggtga 5640ttacaagaga ggagtttgag gcgttcgtag cacaacaaca atgacggttt
gatgcgggtg 5700catacatctt ttcctccgac accggtcaag ggcatttaca acaaaaatca
gtaaggcaaa 5760cggtgctatc cgaagtggtg ttggagagga ccgaattgga gatttcgtat
gccccgcgcc 5820tcgaccaaga aaaagaagaa ttactacgca agaaattaca gttaaatccc
acacctgcta 5880acagaagcag ataccagtcc aggaaggtgg agaacatgaa agccataaca
gctagacgta 5940ttctgcaagg cctagggcat tatttgaagg cagaaggaaa agtggagtgc
taccgaaccc 6000tgcatcctgt tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc
cccaaggtcg 6060cagtggaagc ctgtaacgcc atgttgaaag agaactttcc gactgtggct
tcttactgta 6120ttattccaga gtacgatgcc tatttggaca tggttgacgg agcttcatgc
tgcttagaca 6180ctgccagttt ttgccctgca aagctgcgca gctttccaaa gaaacactcc
tatttggaac 6240ccacaatacg atcggcagtg ccttcagcga tccagaacac gctccagaac
gtcctggcag 6300ctgccacaaa aagaaattgc aatgtcacgc aaatgagaga attgcccgta
ttggattcgg 6360cggcctttaa tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat
tgggaaacgt 6420ttaaagaaaa ccccatcagg cttactgaag aaaacgtggt aaattacatt
accaaattaa 6480aaggaccaaa agctgctgct ctttttgcga agacacataa tttgaatatg
ttgcaggaca 6540taccaatgga caggtttgta atggacttaa agagagacgt gaaagtgact
ccaggaacaa 6600aacatactga agaacggccc aaggtacagg tgatccaggc tgccgatccg
ctagcaacag 6660cgtatctgtg cggaatccac cgagagctgg ttaggagatt aaatgcggtc
ctgcttccga 6720acattcatac actgtttgat atgtcggctg aagactttga cgctattata
gccgagcact 6780tccagcctgg ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa
agtgaggacg 6840acgccatggc tctgaccgcg ttaatgattc tggaagactt aggtgtggac
gcagagctgt 6900tgacgctgat tgaggcggct ttcggcgaaa tttcatcaat acatttgccc
actaaaacta 6960aatttaaatt cggagccatg atgaaatctg gaatgttcct cacactgttt
gtgaacacag 7020tcattaacat tgtaatcgca agcagagtgt tgagagaacg gctaaccgga
tcaccatgtg 7080cagcattcat tggagatgac aatatcgtga aaggagtcaa atcggacaaa
ttaatggcag 7140acaggtgcgc cacctggttg aatatggaag tcaagattat agatgctgtg
gtgggcgaga 7200aagcgcctta tttctgtgga gggtttattt tgtgtgactc cgtgaccggc
acagcgtgcc 7260gtgtggcaga ccccctaaaa aggctgttta agcttggcaa acctctggca
gcagacgatg 7320aacatgatga tgacaggaga agggcattgc atgaagagtc aacacgctgg
aaccgagtgg 7380gtattctttc agagctgtgc aaggcagtag aatcaaggta tgaaaccgta
ggaacttcca 7440tcatagttat ggccatgact actctagcta gcagtgttaa atcattcagc
tacctgagag 7500gggcccctat aactctctac ggctaacctg aatggactac gacatagtct
agtccgccaa 7560gctattccag aagtagtgag gaggcttttt tggaggccta ggcttttgca
aaaagcttgt 7620atatccattt tcggatctga tcaagagaca ggatgaggat cgtttcgcat
gattgaataa 7680gatggattgc acgtaggttc tccggccgct tgggtggaga ggctattcgg
ctatgactgg 7740gcacaactga caatcggctg ctctgatgcc gccgtgatcc ggttgtcagc
gcaggggcgc 7800ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga atgaactgaa
ggacgaggca 7860gcgcggctat cgtggctggc cacgacgggc gttccttgcg cagtctagac
tggcgcgcca 7920aacctgcagg ttaaaacagc tgtgggttgt tcccacccac agggcccact
gggcgctagc 7980actctgattt tacgaaatcc ttgtgcgcct gttttatatc ccttccctaa
ttcgaaacgt 8040agaagcaatg cgcaccactg atcaatagta ggcgtaacgc gccagttacg
tcatgatcaa 8100gcatatctgt tcccccggac tgagtatcaa tagactgctt acgcggttga
aggagaaaac 8160gttcgttatc cggctaacta cttcgagaag cccagtaaca ccatggaagc
tgcagggtgt 8220ttcgctcagc acttcccccg tgtagatcag gtcgatgagc cactgcaatc
cccacaggtg 8280actgtggcag tggctgcgtt ggcggcctgc ctatggggag acccatagga
cgctctaatg 8340tggacatggt gcgaagagcc tattgagcta gttagtagtc ctccggcccc
tgaatgcggc 8400taatcctaac tgcggagcac atgccttcaa cccagagggt agtgtgtcgt
aatgggcaac 8460tctgcagcgg aaccgactac tttgggtgtc cgtgtttctt tttattctta
tattggctgc 8520ttatggtgac aattacagaa ttgttaccat atagctattg gattggccat
ccggtgtgta 8580atagagctgt tatataccta tttgttggct ttgtaccact aactttaaaa
tctataacta 8640ccctcaactt tatattaacc ctcaatacag ttgaacatga ggcctggcct
gccctcctac 8700ctgatcatcc tggccgtgtg cctgttcagc cacctgctgt ccagcagata
cggcgccgag 8760gccgtgagcg agcccctgga caaggctttc cacctgctgc tgaacaccta
cggcagaccc 8820atccggtttc tgcgggagaa caccacccag tgcacctaca acagcagcct
gcggaacagc 8880accgtcgtga gagagaacgc catcagcttc aactttttcc agagctacaa
ccagtactac 8940gtgttccaca tgcccagatg cctgtttgcc ggccctctgg ccgagcagtt
cctgaaccag 9000gtggacctga ccgagacact ggaaagatac cagcagcggc tgaataccta
cgccctggtg 9060tccaaggacc tggccagcta ccggtccttt agccagcagc tcaaggctca
ggatagcctc 9120ggcgagcagc ctaccaccgt gccccctccc atcgacctga gcatccccca
cgtgtggatg 9180cctccccaga ccacccctca cggctggacc gagagccaca ccacctccgg
cctgcacaga 9240ccccacttca accagacctg catcctgttc gacggccacg acctgctgtt
tagcaccgtg 9300accccctgcc tgcaccaggg cttctacctg atcgacgagc tgagatacgt
gaagatcacc 9360ctgaccgagg atttcttcgt ggtcaccgtg tccatcgacg acgacacccc
catgctgctg 9420atcttcggcc acctgcccag agtgctgttc aaggccccct accagcggga
caacttcatc 9480ctgcggcaga ccgagaagca cgagctgctg gtgctggtca agaaggacca
gctgaaccgg 9540cactcctacc tgaaggaccc cgacttcctg gacgccgccc tggacttcaa
ctacctggac 9600ctgagcgccc tgctgagaaa cagcttccac agatacgccg tggacgtgct
gaagtccgga 9660cggtgccaga tgctcgatcg gcggaccgtg gagatggcct tcgcctatgc
cctcgccctg 9720ttcgccgctg ccagacagga agaggctggc gcccaggtgt cagtgcccag
agccctggat 9780agacaggccg ccctgctgca gatccaggaa ttcatgatca cctgcctgag
ccagaccccc 9840cctagaacca ccctgctgct gtaccccaca gccgtggatc tggccaagag
ggccctgtgg 9900acccccaacc agatcaccga catcacaagc ctcgtgcggc tcgtgtacat
cctgagcaag 9960cagaaccagc agcacctgat cccccagtgg gccctgagac agatcgccga
cttcgccctg 10020aagctgcaca agacccatct ggccagcttt ctgagcgcct tcgccaggca
ggaactgtac 10080ctgatgggca gcctggtcca cagcatgctg gtgcatacca ccgagcggcg
ggagatcttc 10140atcgtggaga caggcctgtg tagcctggcc gagctgtccc actttaccca
gctgctggcc 10200caccctcacc acgagtacct gagcgacctg tacaccccct gcagcagcag
cggcagacgg 10260gaccacagcc tggaacggct gaccagactg ttccccgatg ccaccgtgcc
tgctacagtg 10320cctgccgccc tgtccatcct gtccaccatg cagcccagca ccctggaaac
cttccccgac 10380ctgttctgcc tgcccctggg cgagagcttt agcgccctga ccgtgtccga
gcacgtgtcc 10440tacatcgtga ccaatcagta cctgatcaag ggcatcagct accccgtgtc
caccacagtc 10500gtgggccaga gcctgatcat cacccagacc gacagccaga ccaagtgcga
gctgacccgg 10560aacatgcaca ccacacacag catcaccgtg gccctgaaca tcagcctgga
aaactgcgct 10620ttctgtcagt ctgccctgct ggaatacgac gatacccagg gcgtgatcaa
catcatgtac 10680atgcacgaca gcgacgacgt gctgttcgcc ctggacccct acaacgaggt
ggtggtgtcc 10740agcccccgga cccactacct gatgctgctg aagaacggca ccgtgctgga
agtgaccgac 10800gtggtggtgg acgccaccga cctgttgaat tttgaccttc ttaagcttgc
gggagacgtc 10860gagtccaacc ccgggcccat gtgcagaagg cccgactgcg gcttcagctt
cagccctgga 10920cccgtgatcc tgctgtggtg ctgcctgctg ctgcctatcg tgtcctctgc
cgccgtgtct 10980gtggccccta cagccgccga gaaggtgcca gccgagtgcc ccgagctgac
cagaagatgc 11040ctgctgggcg aggtgttcga gggcgacaag tacgagagct ggctgcggcc
cctggtcaac 11100gtgaccggca gagatggccc cctgagccag ctgatccggt acagacccgt
gacccccgag 11160gccgccaata gcgtgctgct ggacgaggcc ttcctggata ccctggccct
gctgtacaac 11220aaccccgacc agctgagagc cctgctgacc ctgctgtcca gcgacaccgc
ccccagatgg 11280atgaccgtga tgcggggcta cagcgagtgt ggagatggca gccctgccgt
gtacacctgc 11340gtggacgacc tgtgcagagg ctacgacctg accagactga gctacggccg
gtccatcttc 11400acagagcacg tgctgggctt cgagctggtg ccccccagcc tgttcaacgt
ggtggtggcc 11460atccggaacg aggccaccag aaccaacaga gccgtgcggc tgcctgtgtc
tacagccgct 11520gcacctgagg gcatcacact gttctacggc ctgtacaacg ccgtgaaaga
gttctgcctc 11580cggcaccagc tggatccccc cctgctgaga cacctggaca agtactacgc
cggcctgccc 11640ccagagctga agcagaccag agtgaacctg cccgcccaca gcagatatgg
ccctcaggcc 11700gtggacgcca gatgataagc ggccgcatac agcagcaatt ggcaagctgc
ttacatagaa 11760ctcgcggcga ttggcatgcc gccttaaaat ttttatttta tttttctttt
cttttccgaa 11820tcggattttg tttttaatat ttcaaaaaaa aaaaaaaaaa aaaaaaaaaa
aaaaaaaagg 11880gtcggcatgg catctccacc tcctcgcggt ccgacctggg catccgaagg
aggacgcacg 11940tccactcgga tggctaaggg agagccacgt ttaaacacgt gatatctggc
ctcatgggcc 12000ttcctttcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca
ttaacatggt 12060catagctgtt tccttgcgta ttgggcgctc tccgcttcct cgctcactga
ctcgctgcgc 12120tcggtcgttc gggtaaagcc tggggtgcct aatgagcaaa aggccagcaa
aaggccagga 12180accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct
gacgagcatc 12240acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa
agataccagg 12300cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg
cttaccggat 12360acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca
cgctgtaggt 12420atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa
ccccccgttc 12480agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg
gtaagacacg 12540acttatcgcc actggcagca gccactggta acaggattag cagagcgagg
tatgtaggcg 12600gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga
acagtatttg 12660gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc
tcttgatccg 12720gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag
attacgcgca 12780gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac
gctcagtgga 12840acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc
ttcacctaga 12900tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag
taaacttggt 12960ctgacagtta ttagaaaaat tcatccagca gacgataaaa cgcaatacgc
tggctatccg 13020gtgccgcaat gccatacagc accagaaaac gatccgccca ttcgccgccc
agttcttccg 13080caatatcacg ggtggccagc gcaatatcct gataacgatc cgccacgccc
agacggccgc 13140aatcaataaa gccgctaaaa cggccatttt ccaccataat gttcggcagg
cacgcatcac 13200catgggtcac caccagatct tcgccatccg gcatgctcgc tttcagacgc
gcaaacagct 13260ctgccggtgc caggccctga tgttcttcat ccagatcatc ctgatccacc
aggcccgctt 13320ccatacgggt acgcgcacgt tcaatacgat gtttcgcctg atgatcaaac
ggacaggtcg 13380ccgggtccag ggtatgcaga cgacgcatgg catccgccat aatgctcact
ttttctgccg 13440gcgccagatg gctagacagc agatcctgac ccggcacttc gcccagcagc
agccaatcac 13500ggcccgcttc ggtcaccaca tccagcaccg ccgcacacgg aacaccggtg
gtggccagcc 13560agctcagacg cgccgcttca tcctgcagct cgttcagcgc accgctcaga
tcggttttca 13620caaacagcac cggacgaccc tgcgcgctca gacgaaacac cgccgcatca
gagcagccaa 13680tggtctgctg cgcccaatca tagccaaaca gacgttccac ccacgctgcc
gggctacccg 13740catgcaggcc atcctgttca atcatactct tcctttttca atattattga
agcatttatc 13800agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat
aaacaaatag 13860gggttccgcg cacatttccc cgaaaagtgc cacctaaatt gtaagcgtta
atattttgtt 13920aaaattcgcg ttaaattttt gttaaatcag ctcatttttt aaccaatagg
ccgaaatcgg 13980caaaatccct tataaatcaa aagaatagac cgagataggg ttgagtggcc
gctacagggc 14040gctcccattc gccattcagg ctgcgcaact gttgggaagg gcgtttcggt
gcgggcctct 14100tcgctattac gccagctggc gaaaggggga tgtgctgcaa ggcgattaag
ttgggtaacg 14160ccagggtttt cccagtcaca cgcgtaatac gactcactat ag
14202
63 14721 DNA Artificial Sequence
source/note="Description of Artificial Sequence Synthetic polynucleotide"
63
ataggcggcg
catgagagaa gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga
ggaagacagc ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc
caagcaggtc actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa
actgatcgaa acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc
ccgcagaatg tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga
tccggacaga ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga
taaggaattg gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga
aactgagact atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt
ttaccaggat gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg
agttagagtc gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc
tggagcatat ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa
cataggccta tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag
aaagaagtat ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca
cgagaagagg gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa
gcaaaattac acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag
aatagctatc agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg
cgagggattc ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc
cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga
tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa
cggtcgcacc cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca
ggcatttgct aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg
actacgagat agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac
atctatttat aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc
attcgtgctg cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag
gaaaatgtta gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga
agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc
tctaccacct ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tcgacttgat
gttacaagag gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag
ctacgatggc gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa
gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc
tggccgaaaa gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg
acatgcaata cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa
cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa
cactgatgaa gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta
cgacatcgac aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg
cgagctggtg gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc
cgctccttac caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg
catcattaaa agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg
tgcagaaatt ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt
ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc
ttttgcttgt catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc
agtgctctgc ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca
ttttaaccac gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa
atctgtgact tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc
gaaagagact aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct
cattctcact tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga
aataatgacg gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta
caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac
ccgcacggag gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact
gactgccaag taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga
tgccatcatg aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc
aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac
cactgaacaa tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat
agtattgaac caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc
tgcacccact gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa
catgtacggg ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc
tcgggcagtt gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga
tccgcgcata aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca
taatgaacac ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt
cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg
gcctgaggct accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata
tgacataata tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga
agaccatgcc attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg
cggaacctgt gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg
tgctatagcg cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga
gacggaagtt ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta
caagctttca tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg
tgcaccctca tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat
aaatgctgct aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa
attcccggaa agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg
tgcagctaaa catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga
aggtgacaaa cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa
ttacaagtca gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg
actaacccaa tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc
catatactgc agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga
agcagtggag gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct
ggtgagggtg catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg
caaaactttc tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga
aattaatgcc atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct
cggagaaagc atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac
accacctagc acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg
cctaaaagcc tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta
tagaatcact ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt
gcctgcgtat attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc
ggagccatcg gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac
cgaggatgag accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga
tagcataagt ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat
tcacgggccg ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga
tgtggacagt ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac
gtcagccgag actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc
tgcgcctcga acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc
acttgcaccc agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa
tagggtgatc actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc
ggtctcgaga accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga
ggagtttgag gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt
ttcctccgac accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc
cgaagtggtg ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga
aaaagaagaa ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag
ataccagtcc aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg
cctagggcat tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt
tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc
ctgtaacgcc atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga
gtacgatgcc tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt
ttgccctgca aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg
atcggcagtg ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa
aagaaattgc aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa
tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa
ccccatcagg cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa
agctgctgct ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga
caggtttgta atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga
agaacggccc aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg
cggaatccac cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac
actgtttgat atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg
ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc
tctgaccgcg ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat
tgaggcggct ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt
cggagccatg atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat
tgtaatcgca agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat
tggagatgac aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc
cacctggttg aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta
tttctgtgga gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga
ccccctaaaa aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga
tgacaggaga agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc
agagctgtgc aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat
ggccatgact actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat
aactctctac ggctaacctg aatggactac gacatagtct agtccgccaa 7560gctattccag
aagtagtgag gaggcttttt tggaggccta ggcttttgca aaaagcttgt 7620atatccattt
tcggatctga tcaagagaca ggatgaggat cgtttcgcat gattgaataa 7680gatggattgc
acgtaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg 7740gcacaactga
caatcggctg ctctgatgcc gccgtgatcc ggttgtcagc gcaggggcgc 7800ccggttcttt
ttgtcaagac cgacctgtcc ggtgccctga atgaactgaa ggacgaggca 7860gcgcggctat
cgtggctggc cacgacgggc gttccttgcg cagtctagac tggcgcgcca 7920aacctgcagg
ttaaaacagc tgtgggttgt tcccacccac agggcccact gggcgctagc 7980actctgattt
tacgaaatcc ttgtgcgcct gttttatatc ccttccctaa ttcgaaacgt 8040agaagcaatg
cgcaccactg atcaatagta ggcgtaacgc gccagttacg tcatgatcaa 8100gcatatctgt
tcccccggac tgagtatcaa tagactgctt acgcggttga aggagaaaac 8160gttcgttatc
cggctaacta cttcgagaag cccagtaaca ccatggaagc tgcagggtgt 8220ttcgctcagc
acttcccccg tgtagatcag gtcgatgagc cactgcaatc cccacaggtg 8280actgtggcag
tggctgcgtt ggcggcctgc ctatggggag acccatagga cgctctaatg 8340tggacatggt
gcgaagagcc tattgagcta gttagtagtc ctccggcccc tgaatgcggc 8400taatcctaac
tgcggagcac atgccttcaa cccagagggt agtgtgtcgt aatgggcaac 8460tctgcagcgg
aaccgactac tttgggtgtc cgtgtttctt tttattctta tattggctgc 8520ttatggtgac
aattacagaa ttgttaccat atagctattg gattggccat ccggtgtgta 8580atagagctgt
tatataccta tttgttggct ttgtaccact aactttaaaa tctataacta 8640ccctcaactt
tatattaacc ctcaatacag ttgaacatga ggcctggcct gccctcctac 8700ctgatcatcc
tggccgtgtg cctgttcagc cacctgctgt ccagcagata cggcgccgag 8760gccgtgagcg
agcccctgga caaggctttc cacctgctgc tgaacaccta cggcagaccc 8820atccggtttc
tgcgggagaa caccacccag tgcacctaca acagcagcct gcggaacagc 8880accgtcgtga
gagagaacgc catcagcttc aactttttcc agagctacaa ccagtactac 8940gtgttccaca
tgcccagatg cctgtttgcc ggccctctgg ccgagcagtt cctgaaccag 9000gtggacctga
ccgagacact ggaaagatac cagcagcggc tgaataccta cgccctggtg 9060tccaaggacc
tggccagcta ccggtccttt agccagcagc tcaaggctca ggatagcctc 9120ggcgagcagc
ctaccaccgt gccccctccc atcgacctga gcatccccca cgtgtggatg 9180cctccccaga
ccacccctca cggctggacc gagagccaca ccacctccgg cctgcacaga 9240ccccacttca
accagacctg catcctgttc gacggccacg acctgctgtt tagcaccgtg 9300accccctgcc
tgcaccaggg cttctacctg atcgacgagc tgagatacgt gaagatcacc 9360ctgaccgagg
atttcttcgt ggtcaccgtg tccatcgacg acgacacccc catgctgctg 9420atcttcggcc
acctgcccag agtgctgttc aaggccccct accagcggga caacttcatc 9480ctgcggcaga
ccgagaagca cgagctgctg gtgctggtca agaaggacca gctgaaccgg 9540cactcctacc
tgaaggaccc cgacttcctg gacgccgccc tggacttcaa ctacctggac 9600ctgagcgccc
tgctgagaaa cagcttccac agatacgccg tggacgtgct gaagtccgga 9660cggtgccaga
tgctcgatcg gcggaccgtg gagatggcct tcgcctatgc cctcgccctg 9720ttcgccgctg
ccagacagga agaggctggc gcccaggtgt cagtgcccag agccctggat 9780agacaggccg
ccctgctgca gatccaggaa ttcatgatca cctgcctgag ccagaccccc 9840cctagaacca
ccctgctgct gtaccccaca gccgtggatc tggccaagag ggccctgtgg 9900acccccaacc
agatcaccga catcacaagc ctcgtgcggc tcgtgtacat cctgagcaag 9960cagaaccagc
agcacctgat cccccagtgg gccctgagac agatcgccga cttcgccctg 10020aagctgcaca
agacccatct ggccagcttt ctgagcgcct tcgccaggca ggaactgtac 10080ctgatgggca
gcctggtcca cagcatgctg gtgcatacca ccgagcggcg ggagatcttc 10140atcgtggaga
caggcctgtg tagcctggcc gagctgtccc actttaccca gctgctggcc 10200caccctcacc
acgagtacct gagcgacctg tacaccccct gcagcagcag cggcagacgg 10260gaccacagcc
tggaacggct gaccagactg ttccccgatg ccaccgtgcc tgctacagtg 10320cctgccgccc
tgtccatcct gtccaccatg cagcccagca ccctggaaac cttccccgac 10380ctgttctgcc
tgcccctggg cgagagcttt agcgccctga ccgtgtccga gcacgtgtcc 10440tacatcgtga
ccaatcagta cctgatcaag ggcatcagct accccgtgtc caccacagtc 10500gtgggccaga
gcctgatcat cacccagacc gacagccaga ccaagtgcga gctgacccgg 10560aacatgcaca
ccacacacag catcaccgtg gccctgaaca tcagcctgga aaactgcgct 10620ttctgtcagt
ctgccctgct ggaatacgac gatacccagg gcgtgatcaa catcatgtac 10680atgcacgaca
gcgacgacgt gctgttcgcc ctggacccct acaacgaggt ggtggtgtcc 10740agcccccgga
cccactacct gatgctgctg aagaacggca ccgtgctgga agtgaccgac 10800gtggtggtgg
acgccaccga ctgataacgc cggcgccccc ccctaacgtt actggccgaa 10860gccgcttgga
ataaggccgg tgtgcgtttg tctatatgtt attttccacc atattgccgt 10920cttttggcaa
tgtgagggcc cggaaacctg gccctgtctt cttgacgagc attcctaggg 10980gtctttcccc
tctcgccaaa ggaatgcaag gtctgttgaa tgtcgtgaag gaagcagttc 11040ctctggaagc
ttcttgaaga caaacaacgt ctgtagcgac cctttgcagg cagcggaacc 11100ccccacctgg
cgacaggtgc ctctgcggcc aaaagccacg tgtataagat acacctgcaa 11160aggcggcaca
accccagtgc cacgttgtga gttggatagt tgtggaaaga gtcaaatggc 11220tctcctcaag
cgtattcaac aaggggctga aggatgccca gaaggtaccc cattgtatgg 11280gatctgatct
ggggcctcgg tgcacatgct ttacatgtgt ttagtcgagg ttaaaaaaac 11340gtctaggccc
cccgaaccac ggggacgtgg ttttcctttg aaaaacacga taataatatg 11400tgcagaaggc
ccgactgcgg cttcagcttc agccctggac ccgtgatcct gctgtggtgc 11460tgcctgctgc
tgcctatcgt gtcctctgcc gccgtgtctg tggcccctac agccgccgag 11520aaggtgccag
ccgagtgccc cgagctgacc agaagatgcc tgctgggcga ggtgttcgag 11580ggcgacaagt
acgagagctg gctgcggccc ctggtcaacg tgaccggcag agatggcccc 11640ctgagccagc
tgatccggta cagacccgtg acccccgagg ccgccaatag cgtgctgctg 11700gacgaggcct
tcctggatac cctggccctg ctgtacaaca accccgacca gctgagagcc 11760ctgctgaccc
tgctgtccag cgacaccgcc cccagatgga tgaccgtgat gcggggctac 11820agcgagtgtg
gagatggcag ccctgccgtg tacacctgcg tggacgacct gtgcagaggc 11880tacgacctga
ccagactgag ctacggccgg tccatcttca cagagcacgt gctgggcttc 11940gagctggtgc
cccccagcct gttcaacgtg gtggtggcca tccggaacga ggccaccaga 12000accaacagag
ccgtgcggct gcctgtgtct acagccgctg cacctgaggg catcacactg 12060ttctacggcc
tgtacaacgc cgtgaaagag ttctgcctcc ggcaccagct ggatcccccc 12120ctgctgagac
acctggacaa gtactacgcc ggcctgcccc cagagctgaa gcagaccaga 12180gtgaacctgc
ccgcccacag cagatatggc cctcaggccg tggacgccag atgataagcg 12240gccgcataca
gcagcaattg gcaagctgct tacatagaac tcgcggcgat tggcatgccg 12300ccttaaaatt
tttattttat ttttcttttc ttttccgaat cggattttgt ttttaatatt 12360tcaaaaaaaa
aaaaaaaaaa aaaaaaaaaa aaaaaaaggg tcggcatggc atctccacct 12420cctcgcggtc
cgacctgggc atccgaagga ggacgcacgt ccactcggat ggctaaggga 12480gagccacgtt
taaacacgtg atatctggcc tcatgggcct tcctttcact gcccgctttc 12540cagtcgggaa
acctgtcgtg ccagctgcat taacatggtc atagctgttt ccttgcgtat 12600tgggcgctct
ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg ggtaaagcct 12660ggggtgccta
atgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 12720tggcgttttt
ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 12780agaggtggcg
aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 12840tcgtgcgctc
tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 12900cgggaagcgt
ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 12960ttcgctccaa
gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 13020ccggtaacta
tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 13080ccactggtaa
caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 13140ggtggcctaa
ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc 13200cagttacctt
cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 13260gcggtggttt
ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 13320atcctttgat
cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 13380ttttggtcat
gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 13440gttttaaatc
aatctaaagt atatatgagt aaacttggtc tgacagttat tagaaaaatt 13500catccagcag
acgataaaac gcaatacgct ggctatccgg tgccgcaatg ccatacagca 13560ccagaaaacg
atccgcccat tcgccgccca gttcttccgc aatatcacgg gtggccagcg 13620caatatcctg
ataacgatcc gccacgccca gacggccgca atcaataaag ccgctaaaac 13680ggccattttc
caccataatg ttcggcaggc acgcatcacc atgggtcacc accagatctt 13740cgccatccgg
catgctcgct ttcagacgcg caaacagctc tgccggtgcc aggccctgat 13800gttcttcatc
cagatcatcc tgatccacca ggcccgcttc catacgggta cgcgcacgtt 13860caatacgatg
tttcgcctga tgatcaaacg gacaggtcgc cgggtccagg gtatgcagac 13920gacgcatggc
atccgccata atgctcactt tttctgccgg cgccagatgg ctagacagca 13980gatcctgacc
cggcacttcg cccagcagca gccaatcacg gcccgcttcg gtcaccacat 14040ccagcaccgc
cgcacacgga acaccggtgg tggccagcca gctcagacgc gccgcttcat 14100cctgcagctc
gttcagcgca ccgctcagat cggttttcac aaacagcacc ggacgaccct 14160gcgcgctcag
acgaaacacc gccgcatcag agcagccaat ggtctgctgc gcccaatcat 14220agccaaacag
acgttccacc cacgctgccg ggctacccgc atgcaggcca tcctgttcaa 14280tcatactctt
cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 14340gatacatatt
tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 14400gaaaagtgcc
acctaaattg taagcgttaa tattttgtta aaattcgcgt taaatttttg 14460ttaaatcagc
tcatttttta accaataggc cgaaatcggc aaaatccctt ataaatcaaa 14520agaatagacc
gagatagggt tgagtggccg ctacagggcg ctcccattcg ccattcaggc 14580tgcgcaactg
ttgggaaggg cgtttcggtg cgggcctctt cgctattacg ccagctggcg 14640aaagggggat
gtgctgcaag gcgattaagt tgggtaacgc cagggttttc ccagtcacac 14700gcgtaatacg
actcactata g
14721
SEQ ID NO: 64; length: 14721; type: DNA; organism: Artificial Sequence
source/note="Description of Artificial Sequence Synthetic polynucleotide"
Sequence 64:
ataggcggcg catgagagaa
gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga ggaagacagc
ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc caagcaggtc
actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa actgatcgaa
acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc ccgcagaatg
tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga tccggacaga
ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga taaggaattg
gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga aactgagact
atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt ttaccaggat
gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg agttagagtc
gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc tggagcatat
ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa cataggccta
tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag aaagaagtat
ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca cgagaagagg
gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa gcaaaattac
acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag aatagctatc
agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg cgagggattc
ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc cgtgtgcacg
tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga tgtcagtgcg
gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa cggtcgcacc
cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca ggcatttgct
aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg actacgagat
agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac atctatttat
aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc attcgtgctg
cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag gaaaatgtta
gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga agctaagtgc
gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc tctaccacct
ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tcgacttgat gttacaagag
gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag ctacgatggc
gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa gagtgaaaaa
ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc tggccgaaaa
gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg acatgcaata
cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa cgaacgtgag
ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa cactgatgaa
gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta cgacatcgac
aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg cgagctggtg
gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc cgctccttac
caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg catcattaaa
agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg tgcagaaatt
ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt ggactcagtg
ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc ttttgcttgt
catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc agtgctctgc
ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca ttttaaccac
gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa atctgtgact
tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc gaaagagact
aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct cattctcact
tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga aataatgacg
gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta caaggtgaat
gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac ccgcacggag
gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact gactgccaag
taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga tgccatcatg
aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc aaacgtgtgt
tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac cactgaacaa
tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat agtattgaac
caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc tgcacccact
gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa catgtacggg
ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc tcgggcagtt
gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga tccgcgcata
aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca taatgaacac
ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt cctggtggtc
ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg gcctgaggct
accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata tgacataata
tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga agaccatgcc
attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg cggaacctgt
gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg tgctatagcg
cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga gacggaagtt
ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta caagctttca
tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg tgcaccctca
tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat aaatgctgct
aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa attcccggaa
agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg tgcagctaaa
catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga aggtgacaaa
cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa ttacaagtca
gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg actaacccaa
tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc catatactgc
agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga agcagtggag
gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct ggtgagggtg
catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg caaaactttc
tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga aattaatgcc
atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct cggagaaagc
atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac accacctagc
acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg cctaaaagcc
tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta tagaatcact
ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt gcctgcgtat
attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc ggagccatcg
gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac cgaggatgag
accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga tagcataagt
ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat tcacgggccg
ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga tgtggacagt
ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac gtcagccgag
actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc tgcgcctcga
acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc acttgcaccc
agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa tagggtgatc
actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc ggtctcgaga
accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga ggagtttgag
gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt ttcctccgac
accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc cgaagtggtg
ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga aaaagaagaa
ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag ataccagtcc
aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg cctagggcat
tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt tcctttgtat
tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc ctgtaacgcc
atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga gtacgatgcc
tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt ttgccctgca
aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg atcggcagtg
ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa aagaaattgc
aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa tgtggaatgc
ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa ccccatcagg
cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa agctgctgct
ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga caggtttgta
atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga agaacggccc
aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg cggaatccac
cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac actgtttgat
atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg ggattgtgtt
ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc tctgaccgcg
ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat tgaggcggct
ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt cggagccatg
atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat tgtaatcgca
agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat tggagatgac
aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc cacctggttg
aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta tttctgtgga
gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga ccccctaaaa
aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga tgacaggaga
agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc agagctgtgc
aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat ggccatgact
actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat aactctctac
ggctaacctg aatggactac gacatagtct agtccgccaa 7560gctattccag aagtagtgag
gaggcttttt tggaggccta ggcttttgca aaaagcttgt 7620atatccattt tcggatctga
tcaagagaca ggatgaggat cgtttcgcat gattgaataa 7680gatggattgc acgtaggttc
tccggccgct tgggtggaga ggctattcgg ctatgactgg 7740gcacaactga caatcggctg
ctctgatgcc gccgtgatcc ggttgtcagc gcaggggcgc 7800ccggttcttt ttgtcaagac
cgacctgtcc ggtgccctga atgaactgaa ggacgaggca 7860gcgcggctat cgtggctggc
cacgacgggc gttccttgcg cagtctagac tggcgcgcca 7920aacctgcagg ttaaaacagc
tgtgggttgt tcccacccac agggcccact gggcgctagc 7980actctgattt tacgaaatcc
ttgtgcgcct gttttatatc ccttccctaa ttcgaaacgt 8040agaagcaatg cgcaccactg
atcaatagta ggcgtaacgc gccagttacg tcatgatcaa 8100gcatatctgt tcccccggac
tgagtatcaa tagactgctt acgcggttga aggagaaaac 8160gttcgttatc cggctaacta
cttcgagaag cccagtaaca ccatggaagc tgcagggtgt 8220ttcgctcagc acttcccccg
tgtagatcag gtcgatgagc cactgcaatc cccacaggtg 8280actgtggcag tggctgcgtt
ggcggcctgc ctatggggag acccatagga cgctctaatg 8340tggacatggt gcgaagagcc
tattgagcta gttagtagtc ctccggcccc tgaatgcggc 8400taatcctaac tgcggagcac
atgccttcaa cccagagggt agtgtgtcgt aatgggcaac 8460tctgcagcgg aaccgactac
tttgggtgtc cgtgtttctt tttattctta tattggctgc 8520ttatggtgac aattacagaa
ttgttaccat atagctattg gattggccat ccggtgtgta 8580atagagctgt tatataccta
tttgttggct ttgtaccact aactttaaaa tctataacta 8640ccctcaactt tatattaacc
ctcaatacag ttgaacatgt gcagaaggcc cgactgcggc 8700ttcagcttca gccctggacc
cgtgatcctg ctgtggtgct gcctgctgct gcctatcgtg 8760tcctctgccg ccgtgtctgt
ggcccctaca gccgccgaga aggtgccagc cgagtgcccc 8820gagctgacca gaagatgcct
gctgggcgag gtgttcgagg gcgacaagta cgagagctgg 8880ctgcggcccc tggtcaacgt
gaccggcaga gatggccccc tgagccagct gatccggtac 8940agacccgtga cccccgaggc
cgccaatagc gtgctgctgg acgaggcctt cctggatacc 9000ctggccctgc tgtacaacaa
ccccgaccag ctgagagccc tgctgaccct gctgtccagc 9060gacaccgccc ccagatggat
gaccgtgatg cggggctaca gcgagtgtgg agatggcagc 9120cctgccgtgt acacctgcgt
ggacgacctg tgcagaggct acgacctgac cagactgagc 9180tacggccggt ccatcttcac
agagcacgtg ctgggcttcg agctggtgcc ccccagcctg 9240ttcaacgtgg tggtggccat
ccggaacgag gccaccagaa ccaacagagc cgtgcggctg 9300cctgtgtcta cagccgctgc
acctgagggc atcacactgt tctacggcct gtacaacgcc 9360gtgaaagagt tctgcctccg
gcaccagctg gatccccccc tgctgagaca cctggacaag 9420tactacgccg gcctgccccc
agagctgaag cagaccagag tgaacctgcc cgcccacagc 9480agatatggcc ctcaggccgt
ggacgccaga tgataacgcc ggcgcccccc cctaacgtta 9540ctggccgaag ccgcttggaa
taaggccggt gtgcgtttgt ctatatgtta ttttccacca 9600tattgccgtc ttttggcaat
gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca 9660ttcctagggg tctttcccct
ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg 9720aagcagttcc tctggaagct
tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc 9780agcggaaccc cccacctggc
gacaggtgcc tctgcggcca aaagccacgt gtataagata 9840cacctgcaaa ggcggcacaa
ccccagtgcc acgttgtgag ttggatagtt gtggaaagag 9900tcaaatggct ctcctcaagc
gtattcaaca aggggctgaa ggatgcccag aaggtacccc 9960attgtatggg atctgatctg
gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt 10020taaaaaaacg tctaggcccc
ccgaaccacg gggacgtggt tttcctttga aaaacacgat 10080aataatatga ggcctggcct
gccctcctac ctgatcatcc tggccgtgtg cctgttcagc 10140cacctgctgt ccagcagata
cggcgccgag gccgtgagcg agcccctgga caaggctttc 10200cacctgctgc tgaacaccta
cggcagaccc atccggtttc tgcgggagaa caccacccag 10260tgcacctaca acagcagcct
gcggaacagc accgtcgtga gagagaacgc catcagcttc 10320aactttttcc agagctacaa
ccagtactac gtgttccaca tgcccagatg cctgtttgcc 10380ggccctctgg ccgagcagtt
cctgaaccag gtggacctga ccgagacact ggaaagatac 10440cagcagcggc tgaataccta
cgccctggtg tccaaggacc tggccagcta ccggtccttt 10500agccagcagc tcaaggctca
ggatagcctc ggcgagcagc ctaccaccgt gccccctccc 10560atcgacctga gcatccccca
cgtgtggatg cctccccaga ccacccctca cggctggacc 10620gagagccaca ccacctccgg
cctgcacaga ccccacttca accagacctg catcctgttc 10680gacggccacg acctgctgtt
tagcaccgtg accccctgcc tgcaccaggg cttctacctg 10740atcgacgagc tgagatacgt
gaagatcacc ctgaccgagg atttcttcgt ggtcaccgtg 10800tccatcgacg acgacacccc
catgctgctg atcttcggcc acctgcccag agtgctgttc 10860aaggccccct accagcggga
caacttcatc ctgcggcaga ccgagaagca cgagctgctg 10920gtgctggtca agaaggacca
gctgaaccgg cactcctacc tgaaggaccc cgacttcctg 10980gacgccgccc tggacttcaa
ctacctggac ctgagcgccc tgctgagaaa cagcttccac 11040agatacgccg tggacgtgct
gaagtccgga cggtgccaga tgctcgatcg gcggaccgtg 11100gagatggcct tcgcctatgc
cctcgccctg ttcgccgctg ccagacagga agaggctggc 11160gcccaggtgt cagtgcccag
agccctggat agacaggccg ccctgctgca gatccaggaa 11220ttcatgatca cctgcctgag
ccagaccccc cctagaacca ccctgctgct gtaccccaca 11280gccgtggatc tggccaagag
ggccctgtgg acccccaacc agatcaccga catcacaagc 11340ctcgtgcggc tcgtgtacat
cctgagcaag cagaaccagc agcacctgat cccccagtgg 11400gccctgagac agatcgccga
cttcgccctg aagctgcaca agacccatct ggccagcttt 11460ctgagcgcct tcgccaggca
ggaactgtac ctgatgggca gcctggtcca cagcatgctg 11520gtgcatacca ccgagcggcg
ggagatcttc atcgtggaga caggcctgtg tagcctggcc 11580gagctgtccc actttaccca
gctgctggcc caccctcacc acgagtacct gagcgacctg 11640tacaccccct gcagcagcag
cggcagacgg gaccacagcc tggaacggct gaccagactg 11700ttccccgatg ccaccgtgcc
tgctacagtg cctgccgccc tgtccatcct gtccaccatg 11760cagcccagca ccctggaaac
cttccccgac ctgttctgcc tgcccctggg cgagagcttt 11820agcgccctga ccgtgtccga
gcacgtgtcc tacatcgtga ccaatcagta cctgatcaag 11880ggcatcagct accccgtgtc
caccacagtc gtgggccaga gcctgatcat cacccagacc 11940gacagccaga ccaagtgcga
gctgacccgg aacatgcaca ccacacacag catcaccgtg 12000gccctgaaca tcagcctgga
aaactgcgct ttctgtcagt ctgccctgct ggaatacgac 12060gatacccagg gcgtgatcaa
catcatgtac atgcacgaca gcgacgacgt gctgttcgcc 12120ctggacccct acaacgaggt
ggtggtgtcc agcccccgga cccactacct gatgctgctg 12180aagaacggca ccgtgctgga
agtgaccgac gtggtggtgg acgccaccga ctgataagcg 12240gccgcataca gcagcaattg
gcaagctgct tacatagaac tcgcggcgat tggcatgccg 12300ccttaaaatt tttattttat
ttttcttttc ttttccgaat cggattttgt ttttaatatt 12360tcaaaaaaaa aaaaaaaaaa
aaaaaaaaaa aaaaaaaggg tcggcatggc atctccacct 12420cctcgcggtc cgacctgggc
atccgaagga ggacgcacgt ccactcggat ggctaaggga 12480gagccacgtt taaacacgtg
atatctggcc tcatgggcct tcctttcact gcccgctttc 12540cagtcgggaa acctgtcgtg
ccagctgcat taacatggtc atagctgttt ccttgcgtat 12600tgggcgctct ccgcttcctc
gctcactgac tcgctgcgct cggtcgttcg ggtaaagcct 12660ggggtgccta atgagcaaaa
ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 12720tggcgttttt ccataggctc
cgcccccctg acgagcatca caaaaatcga cgctcaagtc 12780agaggtggcg aaacccgaca
ggactataaa gataccaggc gtttccccct ggaagctccc 12840tcgtgcgctc tcctgttccg
accctgccgc ttaccggata cctgtccgcc tttctccctt 12900cgggaagcgt ggcgctttct
catagctcac gctgtaggta tctcagttcg gtgtaggtcg 12960ttcgctccaa gctgggctgt
gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 13020ccggtaacta tcgtcttgag
tccaacccgg taagacacga cttatcgcca ctggcagcag 13080ccactggtaa caggattagc
agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 13140ggtggcctaa ctacggctac
actagaagaa cagtatttgg tatctgcgct ctgctgaagc 13200cagttacctt cggaaaaaga
gttggtagct cttgatccgg caaacaaacc accgctggta 13260gcggtggttt ttttgtttgc
aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 13320atcctttgat cttttctacg
gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 13380ttttggtcat gagattatca
aaaaggatct tcacctagat ccttttaaat taaaaatgaa 13440gttttaaatc aatctaaagt
atatatgagt aaacttggtc tgacagttat tagaaaaatt 13500catccagcag acgataaaac
gcaatacgct ggctatccgg tgccgcaatg ccatacagca 13560ccagaaaacg atccgcccat
tcgccgccca gttcttccgc aatatcacgg gtggccagcg 13620caatatcctg ataacgatcc
gccacgccca gacggccgca atcaataaag ccgctaaaac 13680ggccattttc caccataatg
ttcggcaggc acgcatcacc atgggtcacc accagatctt 13740cgccatccgg catgctcgct
ttcagacgcg caaacagctc tgccggtgcc aggccctgat 13800gttcttcatc cagatcatcc
tgatccacca ggcccgcttc catacgggta cgcgcacgtt 13860caatacgatg tttcgcctga
tgatcaaacg gacaggtcgc cgggtccagg gtatgcagac 13920gacgcatggc atccgccata
atgctcactt tttctgccgg cgccagatgg ctagacagca 13980gatcctgacc cggcacttcg
cccagcagca gccaatcacg gcccgcttcg gtcaccacat 14040ccagcaccgc cgcacacgga
acaccggtgg tggccagcca gctcagacgc gccgcttcat 14100cctgcagctc gttcagcgca
ccgctcagat cggttttcac aaacagcacc ggacgaccct 14160gcgcgctcag acgaaacacc
gccgcatcag agcagccaat ggtctgctgc gcccaatcat 14220agccaaacag acgttccacc
cacgctgccg ggctacccgc atgcaggcca tcctgttcaa 14280tcatactctt cctttttcaa
tattattgaa gcatttatca gggttattgt ctcatgagcg 14340gatacatatt tgaatgtatt
tagaaaaata aacaaatagg ggttccgcgc acatttcccc 14400gaaaagtgcc acctaaattg
taagcgttaa tattttgtta aaattcgcgt taaatttttg 14460ttaaatcagc tcatttttta
accaataggc cgaaatcggc aaaatccctt ataaatcaaa 14520agaatagacc gagatagggt
tgagtggccg ctacagggcg ctcccattcg ccattcaggc 14580tgcgcaactg ttgggaaggg
cgtttcggtg cgggcctctt cgctattacg ccagctggcg 14640aaagggggat gtgctgcaag
gcgattaagt tgggtaacgc cagggttttc ccagtcacac 14700gcgtaatacg actcactata g
14721
65 15300 DNA Artificial Sequence source /note="Description of Artificial Sequence Synthetic polynucleotide" 65
ataggcggcg catgagagaa gcccagacca attacctacc caaaatggag
aaagttcacg 60ttgacatcga ggaagacagc ccattcctca gagctttgca gcggagcttc
ccgcagtttg 120aggtagaagc caagcaggtc actgataatg accatgctaa tgccagagcg
ttttcgcatc 180tggcttcaaa actgatcgaa acggaggtgg acccatccga cacgatcctt
gacattggaa 240gtgcgcccgc ccgcagaatg tattctaagc acaagtatca ttgtatctgt
ccgatgagat 300gtgcggaaga tccggacaga ttgtataagt atgcaactaa gctgaagaaa
aactgtaagg 360aaataactga taaggaattg gacaagaaaa tgaaggagct cgccgccgtc
atgagcgacc 420ctgacctgga aactgagact atgtgcctcc acgacgacga gtcgtgtcgc
tacgaagggc 480aagtcgctgt ttaccaggat gtatacgcgg ttgacggacc gacaagtctc
tatcaccaag 540ccaataaggg agttagagtc gcctactgga taggctttga caccacccct
tttatgttta 600agaacttggc tggagcatat ccatcatact ctaccaactg ggccgacgaa
accgtgttaa 660cggctcgtaa cataggccta tgcagctctg acgttatgga gcggtcacgt
agagggatgt 720ccattcttag aaagaagtat ttgaaaccat ccaacaatgt tctattctct
gttggctcga 780ccatctacca cgagaagagg gacttactga ggagctggca cctgccgtct
gtatttcact 840tacgtggcaa gcaaaattac acatgtcggt gtgagactat agttagttgc
gacgggtacg 900tcgttaaaag aatagctatc agtccaggcc tgtatgggaa gccttcaggc
tatgctgcta 960cgatgcaccg cgagggattc ttgtgctgca aagtgacaga cacattgaac
ggggagaggg 1020tctcttttcc cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg
actggcatac 1080tggcaacaga tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc
aaccagcgta 1140tagtcgtcaa cggtcgcacc cagagaaaca ccaataccat gaaaaattac
cttttgcccg 1200tagtggccca ggcatttgct aggtgggcaa aggaatataa ggaagatcaa
gaagatgaaa 1260ggccactagg actacgagat agacagttag tcatggggtg ttgttgggct
tttagaaggc 1320acaagataac atctatttat aagcgcccgg atacccaaac catcatcaaa
gtgaacagcg 1380atttccactc attcgtgctg cccaggatag gcagtaacac attggagatc
gggctgagaa 1440caagaatcag gaaaatgtta gaggagcaca aggagccgtc acctctcatt
accgccgagg 1500acgtacaaga agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa
gccgaggagt 1560tgcgcgcagc tctaccacct ttggcagctg atgttgagga gcccactctg
gaagccgatg 1620tagacttgat gttacaagag gctggggccg gctcagtgga gacacctcgt
ggcttgataa 1680aggttaccag ctacgatggc gaggacaaga tcggctctta cgctgtgctt
tctccgcagg 1740ctgtactcaa gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa
gtcatagtga 1800taacacactc tggccgaaaa gggcgttatg ccgtggaacc ataccatggt
aaagtagtgg 1860tgccagaggg acatgcaata cccgtccagg actttcaagc tctgagtgaa
agtgccacca 1920ttgtgtacaa cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc
acacatggag 1980gagcgctgaa cactgatgaa gaatattaca aaactgtcaa gcccagcgag
cacgacggcg 2040aatacctgta cgacatcgac aggaaacagt gcgtcaagaa agaactagtc
actgggctag 2100ggctcacagg cgagctggtg gatcctccct tccatgaatt cgcctacgag
agtctgagaa 2160cacgaccagc cgctccttac caagtaccaa ccataggggt gtatggcgtg
ccaggatcag 2220gcaagtctgg catcattaaa agcgcagtca ccaaaaaaga tctagtggtg
agcgccaaga 2280aagaaaactg tgcagaaatt ataagggacg tcaagaaaat gaaagggctg
gacgtcaatg 2340ccagaactgt ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag
accctgtata 2400ttgacgaagc ttttgcttgt catgcaggta ctctcagagc gctcatagcc
attataagac 2460ctaaaaaggc agtgctctgc ggggatccca aacagtgcgg tttttttaac
atgatgtgcc 2520tgaaagtgca ttttaaccac gagatttgca cacaagtctt ccacaaaagc
atctctcgcc 2580gttgcactaa atctgtgact tcggtcgtct caaccttgtt ttacgacaaa
aaaatgagaa 2640cgacgaatcc gaaagagact aagattgtga ttgacactac cggcagtacc
aaacctaagc 2700aggacgatct cattctcact tgtttcagag ggtgggtgaa gcagttgcaa
atagattaca 2760aaggcaacga aataatgacg gcagctgcct ctcaagggct gacccgtaaa
ggtgtgtatg 2820ccgttcggta caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa
catgtgaacg 2880tcctactgac ccgcacggag gaccgcatcg tgtggaaaac actagccggc
gacccatgga 2940taaaaacact gactgccaag taccctggga atttcactgc cacgatagag
gagtggcaag 3000cagagcatga tgccatcatg aggcacatct tggagagacc ggaccctacc
gacgtcttcc 3060agaataaggc aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag
accgctggca 3120tagacatgac cactgaacaa tggaacactg tggattattt tgaaacggac
aaagctcact 3180cagcagagat agtattgaac caactatgcg tgaggttctt tggactcgat
ctggactccg 3240gtctattttc tgcacccact gttccgttat ccattaggaa taatcactgg
gataactccc 3300cgtcgcctaa catgtacggg ctgaataaag aagtggtccg tcagctctct
cgcaggtacc 3360cacaactgcc tcgggcagtt gccactggaa gagtctatga catgaacact
ggtacactgc 3420gcaattatga tccgcgcata aacctagtac ctgtaaacag aagactgcct
catgctttag 3480tcctccacca taatgaacac ccacagagtg acttttcttc attcgtcagc
aaattgaagg 3540gcagaactgt cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg
gttgactggt 3600tgtcagaccg gcctgaggct accttcagag ctcggctgga tttaggcatc
ccaggtgatg 3660tgcccaaata tgacataata tttgttaatg tgaggacccc atataaatac
catcactatc 3720agcagtgtga agaccatgcc attaagctta gcatgttgac caagaaagct
tgtctgcatc 3780tgaatcccgg cggaacctgt gtcagcatag gttatggtta cgctgacagg
gccagcgaaa 3840gcatcattgg tgctatagcg cggcagttca agttttcccg ggtatgcaaa
ccgaaatcct 3900cacttgaaga gacggaagtt ctgtttgtat tcattgggta cgatcgcaag
gcccgtacgc 3960acaatcctta caagctttca tcaaccttga ccaacattta tacaggttcc
agactccacg 4020aagccggatg tgcaccctca tatcatgtgg tgcgagggga tattgccacg
gccaccgaag 4080gagtgattat aaatgctgct aacagcaaag gacaacctgg cggaggggtg
tgcggagcgc 4140tgtataagaa attcccggaa agcttcgatt tacagccgat cgaagtagga
aaagcgcgac 4200tggtcaaagg tgcagctaaa catatcattc atgccgtagg accaaacttc
aacaaagttt 4260cggaggttga aggtgacaaa cagttggcag aggcttatga gtccatcgct
aagattgtca 4320acgataacaa ttacaagtca gtagcgattc cactgttgtc caccggcatc
ttttccggga 4380acaaagatcg actaacccaa tcattgaacc atttgctgac agctttagac
accactgatg 4440cagatgtagc catatactgc agggacaaga aatgggaaat gactctcaag
gaagcagtgg 4500ctaggagaga agcagtggag gagatatgca tatccgacga ctcttcagtg
acagaacctg 4560atgcagagct ggtgagggtg catccgaaga gttctttggc tggaaggaag
ggctacagca 4620caagcgatgg caaaactttc tcatatttgg aagggaccaa gtttcaccag
gcggccaagg 4680atatagcaga aattaatgcc atgtggcccg ttgcaacgga ggccaatgag
caggtatgca 4740tgtatatcct cggagaaagc atgagcagta ttaggtcgaa atgccccgtc
gaagagtcgg 4800aagcctccac accacctagc acgctgcctt gcttgtgcat ccatgccatg
actccagaaa 4860gagtacagcg cctaaaagcc tcacgtccag aacaaattac tgtgtgctca
tcctttccat 4920tgccgaagta tagaatcact ggtgtgcaga agatccaatg ctcccagcct
atattgttct 4980caccgaaagt gcctgcgtat attcatccaa ggaagtatct cgtggaaaca
ccaccggtag 5040acgagactcc ggagccatcg gcagagaacc aatccacaga ggggacacct
gaacaaccac 5100cacttataac cgaggatgag accaggacta gaacgcctga gccgatcatc
atcgaagagg 5160aagaagagga tagcataagt ttgctgtcag atggcccgac ccaccaggtg
ctgcaagtcg 5220aggcagacat tcacgggccg ccctctgtat ctagctcatc ctggtccatt
cctcatgcat 5280ccgactttga tgtggacagt ttatccatac ttgacaccct ggagggagct
agcgtgacca 5340gcggggcaac gtcagccgag actaactctt acttcgcaaa gagtatggag
tttctggcgc 5400gaccggtgcc tgcgcctcga acagtattca ggaaccctcc acatcccgct
ccgcgcacaa 5460gaacaccgtc acttgcaccc agcagggcct gctcgagaac cagcctagtt
tccaccccgc 5520caggcgtgaa tagggtgatc actagagagg agctcgaggc gcttaccccg
tcacgcactc 5580ctagcaggtc ggtctcgaga accagcctgg tctccaaccc gccaggcgta
aatagggtga 5640ttacaagaga ggagtttgag gcgttcgtag cacaacaaca atgacggttt
gatgcgggtg 5700catacatctt ttcctccgac accggtcaag ggcatttaca acaaaaatca
gtaaggcaaa 5760cggtgctatc cgaagtggtg ttggagagga ccgaattgga gatttcgtat
gccccgcgcc 5820tcgaccaaga aaaagaagaa ttactacgca agaaattaca gttaaatccc
acacctgcta 5880acagaagcag ataccagtcc aggaaggtgg agaacatgaa agccataaca
gctagacgta 5940ttctgcaagg cctagggcat tatttgaagg cagaaggaaa agtggagtgc
taccgaaccc 6000tgcatcctgt tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc
cccaaggtcg 6060cagtggaagc ctgtaacgcc atgttgaaag agaactttcc gactgtggct
tcttactgta 6120ttattccaga gtacgatgcc tatttggaca tggttgacgg agcttcatgc
tgcttagaca 6180ctgccagttt ttgccctgca aagctgcgca gctttccaaa gaaacactcc
tatttggaac 6240ccacaatacg atcggcagtg ccttcagcga tccagaacac gctccagaac
gtcctggcag 6300ctgccacaaa aagaaattgc aatgtcacgc aaatgagaga attgcccgta
ttggattcgg 6360cggcctttaa tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat
tgggaaacgt 6420ttaaagaaaa ccccatcagg cttactgaag aaaacgtggt aaattacatt
accaaattaa 6480aaggaccaaa agctgctgct ctttttgcga agacacataa tttgaatatg
ttgcaggaca 6540taccaatgga caggtttgta atggacttaa agagagacgt gaaagtgact
ccaggaacaa 6600aacatactga agaacggccc aaggtacagg tgatccaggc tgccgatccg
ctagcaacag 6660cgtatctgtg cggaatccac cgagagctgg ttaggagatt aaatgcggtc
ctgcttccga 6720acattcatac actgtttgat atgtcggctg aagactttga cgctattata
gccgagcact 6780tccagcctgg ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa
agtgaggacg 6840acgccatggc tctgaccgcg ttaatgattc tggaagactt aggtgtggac
gcagagctgt 6900tgacgctgat tgaggcggct ttcggcgaaa tttcatcaat acatttgccc
actaaaacta 6960aatttaaatt cggagccatg atgaaatctg gaatgttcct cacactgttt
gtgaacacag 7020tcattaacat tgtaatcgca agcagagtgt tgagagaacg gctaaccgga
tcaccatgtg 7080cagcattcat tggagatgac aatatcgtga aaggagtcaa atcggacaaa
ttaatggcag 7140acaggtgcgc cacctggttg aatatggaag tcaagattat agatgctgtg
gtgggcgaga 7200aagcgcctta tttctgtgga gggtttattt tgtgtgactc cgtgaccggc
acagcgtgcc 7260gtgtggcaga ccccctaaaa aggctgttta agcttggcaa acctctggca
gcagacgatg 7320aacatgatga tgacaggaga agggcattgc atgaagagtc aacacgctgg
aaccgagtgg 7380gtattctttc agagctgtgc aaggcagtag aatcaaggta tgaaaccgta
ggaacttcca 7440tcatagttat ggccatgact actctagcta gcagtgttaa atcattcagc
tacctgagag 7500gggcccctat aactctctac ggctaacctg aatggactac gacatagtct
agtccgccaa 7560gatgaggcct ggcctgccct cctacctgat catcctggcc gtgtgcctgt
tcagccacct 7620gctgtccagc agatacggcg ccgaggccgt gagcgagccc ctggacaagg
ctttccacct 7680gctgctgaac acctacggca gacccatccg gtttctgcgg gagaacacca
cccagtgcac 7740ctacaacagc agcctgcgga acagcaccgt cgtgagagag aacgccatca
gcttcaactt 7800tttccagagc tacaaccagt actacgtgtt ccacatgccc agatgcctgt
ttgccggccc 7860tctggccgag cagttcctga accaggtgga cctgaccgag acactggaaa
gataccagca 7920gcggctgaat acctacgccc tggtgtccaa ggacctggcc agctaccggt
cctttagcca 7980gcagctcaag gctcaggata gcctcggcga gcagcctacc accgtgcccc
ctcccatcga 8040cctgagcatc ccccacgtgt ggatgcctcc ccagaccacc cctcacggct
ggaccgagag 8100ccacaccacc tccggcctgc acagacccca cttcaaccag acctgcatcc
tgttcgacgg 8160ccacgacctg ctgtttagca ccgtgacccc ctgcctgcac cagggcttct
acctgatcga 8220cgagctgaga tacgtgaaga tcaccctgac cgaggatttc ttcgtggtca
ccgtgtccat 8280cgacgacgac acccccatgc tgctgatctt cggccacctg cccagagtgc
tgttcaaggc 8340cccctaccag cgggacaact tcatcctgcg gcagaccgag aagcacgagc
tgctggtgct 8400ggtcaagaag gaccagctga accggcactc ctacctgaag gaccccgact
tcctggacgc 8460cgccctggac ttcaactacc tggacctgag cgccctgctg agaaacagct
tccacagata 8520cgccgtggac gtgctgaagt ccggacggtg ccagatgctc gatcggcgga
ccgtggagat 8580ggccttcgcc tatgccctcg ccctgttcgc cgctgccaga caggaagagg
ctggcgccca 8640ggtgtcagtg cccagagccc tggatagaca ggccgccctg ctgcagatcc
aggaattcat 8700gatcacctgc ctgagccaga ccccccctag aaccaccctg ctgctgtacc
ccacagccgt 8760ggatctggcc aagagggccc tgtggacccc caaccagatc accgacatca
caagcctcgt 8820gcggctcgtg tacatcctga gcaagcagaa ccagcagcac ctgatccccc
agtgggccct 8880gagacagatc gccgacttcg ccctgaagct gcacaagacc catctggcca
gctttctgag 8940cgccttcgcc aggcaggaac tgtacctgat gggcagcctg gtccacagca
tgctggtgca 9000taccaccgag cggcgggaga tcttcatcgt ggagacaggc ctgtgtagcc
tggccgagct 9060gtcccacttt acccagctgc tggcccaccc tcaccacgag tacctgagcg
acctgtacac 9120cccctgcagc agcagcggca gacgggacca cagcctggaa cggctgacca
gactgttccc 9180cgatgccacc gtgcctgcta cagtgcctgc cgccctgtcc atcctgtcca
ccatgcagcc 9240cagcaccctg gaaaccttcc ccgacctgtt ctgcctgccc ctgggcgaga
gctttagcgc 9300cctgaccgtg tccgagcacg tgtcctacat cgtgaccaat cagtacctga
tcaagggcat 9360cagctacccc gtgtccacca cagtcgtggg ccagagcctg atcatcaccc
agaccgacag 9420ccagaccaag tgcgagctga cccggaacat gcacaccaca cacagcatca
ccgtggccct 9480gaacatcagc ctggaaaact gcgctttctg tcagtctgcc ctgctggaat
acgacgatac 9540ccagggcgtg atcaacatca tgtacatgca cgacagcgac gacgtgctgt
tcgccctgga 9600cccctacaac gaggtggtgg tgtccagccc ccggacccac tacctgatgc
tgctgaagaa 9660cggcaccgtg ctggaagtga ccgacgtggt ggtggacgcc accgacagca
gactgctgat 9720gatgagcgtg tacgccctga gcgccatcat cggcatctac ctgctgtacc
ggatgctgaa 9780aacctgctga taatctagag gcccctataa ctctctacgg ctaacctgaa
tggactacga 9840catagtctag tccgccaaga tgtgcagaag gcccgactgc ggcttcagct
tcagccctgg 9900acccgtgatc ctgctgtggt gctgcctgct gctgcctatc gtgtcctctg
ccgccgtgtc 9960tgtggcccct acagccgccg agaaggtgcc agccgagtgc cccgagctga
ccagaagatg 10020cctgctgggc gaggtgttcg agggcgacaa gtacgagagc tggctgcggc
ccctggtcaa 10080cgtgaccggc agagatggcc ccctgagcca gctgatccgg tacagacccg
tgacccccga 10140ggccgccaat agcgtgctgc tggacgaggc cttcctggat accctggccc
tgctgtacaa 10200caaccccgac cagctgagag ccctgctgac cctgctgtcc agcgacaccg
cccccagatg 10260gatgaccgtg atgcggggct acagcgagtg tggagatggc agccctgccg
tgtacacctg 10320cgtggacgac ctgtgcagag gctacgacct gaccagactg agctacggcc
ggtccatctt 10380cacagagcac gtgctgggct tcgagctggt gccccccagc ctgttcaacg
tggtggtggc 10440catccggaac gaggccacca gaaccaacag agccgtgcgg ctgcctgtgt
ctacagccgc 10500tgcacctgag ggcatcacac tgttctacgg cctgtacaac gccgtgaaag
agttctgcct 10560ccggcaccag ctggatcccc ccctgctgag acacctggac aagtactacg
ccggcctgcc 10620cccagagctg aagcagacca gagtgaacct gcccgcccac agcagatatg
gccctcaggc 10680cgtggacgcc agatgataac gccggcggcc cctataactc tctacggcta
acctgaatgg 10740actacgacat agtctagtcc gccaagatga gccccaagga cctgaccccc
ttcctgacaa 10800ccctgtggct gctcctgggc catagcagag tgcctagagt gcgggccgag
gaatgctgcg 10860agttcatcaa cgtgaaccac ccccccgagc ggtgctacga cttcaagatg
tgcaaccggt 10920tcaccgtggc cctgagatgc cccgacggcg aagtgtgcta cagccccgag
aaaaccgccg 10980agatccgggg catcgtgacc accatgaccc acagcctgac ccggcaggtg
gtgcacaaca 11040agctgaccag ctgcaactac aaccccctgt acctggaagc cgacggccgg
atcagatgcg 11100gcaaagtgaa cgacaaggcc cagtacctgc tgggagccgc cggaagcgtg
ccctaccggt 11160ggatcaacct ggaatacgac aagatcaccc ggatcgtggg cctggaccag
tacctggaaa 11220gcgtgaagaa gcacaagcgg ctggacgtgt gcagagccaa gatgggctac
atgctgcagt 11280gataaggcgc gccgccccta taactctcta cggctaacct gaatggacta
cgacatagtc 11340tagtccgcca agatgctgcg gctgctgctg agacaccact tccactgcct
gctgctgtgt 11400gccgtgtggg ccaccccttg tctggccagc ccttggagca ccctgaccgc
caaccagaac 11460cctagccccc cttggtccaa gctgacctac agcaagcccc acgacgccgc
caccttctac 11520tgcccctttc tgtaccccag ccctcccaga agccccctgc agttcagcgg
cttccagaga 11580gtgtccaccg gccctgagtg ccggaacgag acactgtacc tgctgtacaa
ccgggagggc 11640cagacactgg tggagcggag cagcacctgg gtgaaaaaag tgatctggta
tctgagcggc 11700cggaaccaga ccatcctgca gcggatgccc agaaccgcca gcaagcccag
cgacggcaac 11760gtgcagatca gcgtggagga cgccaaaatc ttcggagccc acatggtgcc
caagcagacc 11820aagctgctga gattcgtggt caacgacggc accagatatc agatgtgcgt
gatgaagctg 11880gaaagctggg cccacgtgtt ccgggactac tccgtgagct tccaggtccg
gctgaccttc 11940accgaggcca acaaccagac ctacaccttc tgcacccacc ccaacctgat
cgtgtgataa 12000gcggccgcgc ccctataact ctctacggct aacctgaatg gactacgaca
tagtctagtc 12060cgccaagatg cggctgtgca gagtgtggct gtccgtgtgc ctgtgtgccg
tggtgctggg 12120ccagtgccag agagagacag ccgagaagaa cgactactac cgggtgcccc
actactggga 12180tgcctgcagc agagccctgc ccgaccagac ccggtacaaa tacgtggagc
agctcgtgga 12240cctgaccctg aactaccact acgacgccag ccacggcctg gacaacttcg
acgtgctgaa 12300gcggatcaac gtgaccgagg tgtccctgct gatcagcgac ttccggcggc
agaacagaag 12360aggcggcacc aacaagcgga ccaccttcaa cgccgctggc tctctggccc
ctcacgccag 12420atccctggaa ttcagcgtgc ggctgttcgc caactgataa cgttgcatcc
tgcaggatac 12480agcagcaatt ggcaagctgc ttacatagaa ctcgcggcga ttggcatgcc
gccttaaaat 12540ttttatttta tttttctttt cttttccgaa tcggattttg tttttaatat
ttcaaaaaaa 12600aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gtcggcatgg catctccacc
tcctcgcggt 12660ccgacctggg catccgaagg aggacgcacg tccactcgga tggctaaggg
agagccacgt 12720ttaaacgcta gagcaagacg tttcccgttg aatatggctc ataacacccc
ttgtattact 12780gtttatgtaa gcagacagtt ttattgttca tgatgatata tttttatctt
gtgcaatgta 12840acatcagaga ttttgagaca caacgtggct ttgttgaata aatcgaactt
ttgctgagtt 12900gaaggatcag atcacgcatc ttcccgacaa cgcagaccgt tccgtggcaa
agcaaaagtt 12960caaaatcacc aactggtcca cctacaacaa agctctcatc aaccgtggct
ccctcacttt 13020ctggctggat gatggggcga ttcaggcctg gtatgagtca gcaacacctt
cttcacgagg 13080cagacctcag cgctagcgga gtgtatactg gcttactatg ttggcactga
tgagggtgtc 13140agtgaagtgc ttcatgtggc aggagaaaaa aggctgcacc ggtgcgtcag
cagaatatgt 13200gatacaggat atattccgct tcctcgctca ctgactcgct acgctcggtc
gttcgactgc 13260ggcgagcgga aatggcttac gaacggggcg gagatttcct ggaagatgcc
aggaagatac 13320ttaacaggga agtgagaggg ccgcggcaaa gccgtttttc cataggctcc
gcccccctga 13380caagcatcac gaaatctgac gctcaaatca gtggtggcga aacccgacag
gactataaag 13440ataccaggcg tttcccctgg cggctccctc gtgcgctctc ctgttcctgc
ctttcggttt 13500accggtgtca ttccgctgtt atggccgcgt ttgtctcatt ccacgcctga
cactcagttc 13560cgggtaggca gttcgctcca agctggactg tatgcacgaa ccccccgttc
agtccgaccg 13620ctgcgcctta tccggtaact atcgtcttga gtccaacccg gaaagacatg
caaaagcacc 13680actggcagca gccactggta attgatttag aggagttagt cttgaagtca
tgcgccggtt 13740aaggctaaac tgaaaggaca agttttggtg actgcgctcc tccaagccag
ttacctcggt 13800tcaaagagtt ggtagctcag agaaccttcg aaaaaccgcc ctgcaaggcg
gttttttcgt 13860tttcagagca agagattacg cgcagaccaa aacgatctca agaagatcat
cttattaagg 13920ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg
agattatcaa 13980aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca
atctaaagta 14040tatatgagta aacttggtct gacagttatt agaaaaattc atccagcaga
cgataaaacg 14100caatacgctg gctatccggt gccgcaatgc catacagcac cagaaaacga
tccgcccatt 14160cgccgcccag ttcttccgca atatcacggg tggccagcgc aatatcctga
taacgatccg 14220ccacgcccag acggccgcaa tcaataaagc cgctaaaacg gccattttcc
accataatgt 14280tcggcaggca cgcatcacca tgggtcacca ccagatcttc gccatccggc
atgctcgctt 14340tcagacgcgc aaacagctct gccggtgcca ggccctgatg ttcttcatcc
agatcatcct 14400gatccaccag gcccgcttcc atacgggtac gcgcacgttc aatacgatgt
ttcgcctgat 14460gatcaaacgg acaggtcgcc gggtccaggg tatgcagacg acgcatggca
tccgccataa 14520tgctcacttt ttctgccggc gccagatggc tagacagcag atcctgaccc
ggcacttcgc 14580ccagcagcag ccaatcacgg cccgcttcgg tcaccacatc cagcaccgcc
gcacacggaa 14640caccggtggt ggccagccag ctcagacgcg ccgcttcatc ctgcagctcg
ttcagcgcac 14700cgctcagatc ggttttcaca aacagcaccg gacgaccctg cgcgctcaga
cgaaacaccg 14760ccgcatcaga gcagccaatg gtctgctgcg cccaatcata gccaaacaga
cgttccaccc 14820acgctgccgg gctacccgca tgcaggccat cctgttcaat catactcttc
ctttttcaat 14880attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt
gaatgtattt 14940agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca
cctaaattgt 15000aagcgttaat attttgttaa aattcgcgtt aaatttttgt taaatcagct
cattttttaa 15060ccaataggcc gaaatcggca aaatccctta taaatcaaaa gaatagaccg
agatagggtt 15120gagtggccgc tacagggcgc tcccattcgc cattcaggct gcgcaactgt
tgggaagggc 15180gtttcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg
tgctgcaagg 15240cgattaagtt gggtaacgcc agggttttcc cagtcacacg cgtaatacga
ctcactatag 15300
66 16324 DNA Artificial Sequence source /note="Description of Artificial Sequence Synthetic polynucleotide" 66
ataggcggcg
catgagagaa gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga
ggaagacagc ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc
caagcaggtc actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa
actgatcgaa acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc
ccgcagaatg tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga
tccggacaga ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga
taaggaattg gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga
aactgagact atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt
ttaccaggat gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg
agttagagtc gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc
tggagcatat ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa
cataggccta tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag
aaagaagtat ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca
cgagaagagg gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa
gcaaaattac acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag
aatagctatc agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg
cgagggattc ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc
cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga
tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa
cggtcgcacc cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca
ggcatttgct aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg
actacgagat agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac
atctatttat aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc
attcgtgctg cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag
gaaaatgtta gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga
agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc
tctaccacct ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat
gttacaagag gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag
ctacgatggc gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa
gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc
tggccgaaaa gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg
acatgcaata cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa
cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa
cactgatgaa gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta
cgacatcgac aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg
cgagctggtg gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc
cgctccttac caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg
catcattaaa agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg
tgcagaaatt ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt
ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc
ttttgcttgt catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc
agtgctctgc ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca
ttttaaccac gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa
atctgtgact tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc
gaaagagact aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct
cattctcact tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga
aataatgacg gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta
caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac
ccgcacggag gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact
gactgccaag taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga
tgccatcatg aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc
aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac
cactgaacaa tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat
agtattgaac caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc
tgcacccact gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa
catgtacggg ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc
tcgggcagtt gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga
tccgcgcata aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca
taatgaacac ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt
cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg
gcctgaggct accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata
tgacataata tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga
agaccatgcc attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg
cggaacctgt gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg
tgctatagcg cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga
gacggaagtt ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta
caagctttca tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg
tgcaccctca tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat
aaatgctgct aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa
attcccggaa agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg
tgcagctaaa catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga
aggtgacaaa cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa
ttacaagtca gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg
actaacccaa tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc
catatactgc agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga
agcagtggag gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct
ggtgagggtg catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg
caaaactttc tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga
aattaatgcc atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct
cggagaaagc atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac
accacctagc acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg
cctaaaagcc tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta
tagaatcact ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt
gcctgcgtat attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc
ggagccatcg gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac
cgaggatgag accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga
tagcataagt ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat
tcacgggccg ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga
tgtggacagt ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac
gtcagccgag actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc
tgcgcctcga acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc
acttgcaccc agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa
tagggtgatc actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc
ggtctcgaga accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga
ggagtttgag gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt
ttcctccgac accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc
cgaagtggtg ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga
aaaagaagaa ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag
ataccagtcc aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg
cctagggcat tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt
tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc
ctgtaacgcc atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga
gtacgatgcc tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt
ttgccctgca aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg
atcggcagtg ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa
aagaaattgc aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa
tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa
ccccatcagg cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa
agctgctgct ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga
caggtttgta atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga
agaacggccc aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg
cggaatccac cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac
actgtttgat atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg
ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc
tctgaccgcg ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat
tgaggcggct ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt
cggagccatg atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat
tgtaatcgca agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat
tggagatgac aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc
cacctggttg aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta
tttctgtgga gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga
ccccctaaaa aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga
tgacaggaga agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc
agagctgtgc aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat
ggccatgact actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat
aactctctac ggctaacctg aatggactac gacatagtct agtccgccaa 7560gatgaggcct
ggcctgccct cctacctgat catcctggcc gtgtgcctgt tcagccacct 7620gctgtccagc
agatacggcg ccgaggccgt gagcgagccc ctggacaagg ctttccacct 7680gctgctgaac
acctacggca gacccatccg gtttctgcgg gagaacacca cccagtgcac 7740ctacaacagc
agcctgcgga acagcaccgt cgtgagagag aacgccatca gcttcaactt 7800tttccagagc
tacaaccagt actacgtgtt ccacatgccc agatgcctgt ttgccggccc 7860tctggccgag
cagttcctga accaggtgga cctgaccgag acactggaaa gataccagca 7920gcggctgaat
acctacgccc tggtgtccaa ggacctggcc agctaccggt cctttagcca 7980gcagctcaag
gctcaggata gcctcggcga gcagcctacc accgtgcccc ctcccatcga 8040cctgagcatc
ccccacgtgt ggatgcctcc ccagaccacc cctcacggct ggaccgagag 8100ccacaccacc
tccggcctgc acagacccca cttcaaccag acctgcatcc tgttcgacgg 8160ccacgacctg
ctgtttagca ccgtgacccc ctgcctgcac cagggcttct acctgatcga 8220cgagctgaga
tacgtgaaga tcaccctgac cgaggatttc ttcgtggtca ccgtgtccat 8280cgacgacgac
acccccatgc tgctgatctt cggccacctg cccagagtgc tgttcaaggc 8340cccctaccag
cgggacaact tcatcctgcg gcagaccgag aagcacgagc tgctggtgct 8400ggtcaagaag
gaccagctga accggcactc ctacctgaag gaccccgact tcctggacgc 8460cgccctggac
ttcaactacc tggacctgag cgccctgctg agaaacagct tccacagata 8520cgccgtggac
gtgctgaagt ccggacggtg ccagatgctc gatcggcgga ccgtggagat 8580ggccttcgcc
tatgccctcg ccctgttcgc cgctgccaga caggaagagg ctggcgccca 8640ggtgtcagtg
cccagagccc tggatagaca ggccgccctg ctgcagatcc aggaattcat 8700gatcacctgc
ctgagccaga ccccccctag aaccaccctg ctgctgtacc ccacagccgt 8760ggatctggcc
aagagggccc tgtggacccc caaccagatc accgacatca caagcctcgt 8820gcggctcgtg
tacatcctga gcaagcagaa ccagcagcac ctgatccccc agtgggccct 8880gagacagatc
gccgacttcg ccctgaagct gcacaagacc catctggcca gctttctgag 8940cgccttcgcc
aggcaggaac tgtacctgat gggcagcctg gtccacagca tgctggtgca 9000taccaccgag
cggcgggaga tcttcatcgt ggagacaggc ctgtgtagcc tggccgagct 9060gtcccacttt
acccagctgc tggcccaccc tcaccacgag tacctgagcg acctgtacac 9120cccctgcagc
agcagcggca gacgggacca cagcctggaa cggctgacca gactgttccc 9180cgatgccacc
gtgcctgcta cagtgcctgc cgccctgtcc atcctgtcca ccatgcagcc 9240cagcaccctg
gaaaccttcc ccgacctgtt ctgcctgccc ctgggcgaga gctttagcgc 9300cctgaccgtg
tccgagcacg tgtcctacat cgtgaccaat cagtacctga tcaagggcat 9360cagctacccc
gtgtccacca cagtcgtggg ccagagcctg atcatcaccc agaccgacag 9420ccagaccaag
tgcgagctga cccggaacat gcacaccaca cacagcatca ccgtggccct 9480gaacatcagc
ctggaaaact gcgctttctg tcagtctgcc ctgctggaat acgacgatac 9540ccagggcgtg
atcaacatca tgtacatgca cgacagcgac gacgtgctgt tcgccctgga 9600cccctacaac
gaggtggtgg tgtccagccc ccggacccac tacctgatgc tgctgaagaa 9660cggcaccgtg
ctggaagtga ccgacgtggt ggtggacgcc accgactgat aatctagagg 9720cccctataac
tctctacggc taacctgaat ggactacgac atagtctagt ccgccaagat 9780gtgcagaagg
cccgactgcg gcttcagctt cagccctgga cccgtgatcc tgctgtggtg 9840ctgcctgctg
ctgcctatcg tgtcctctgc cgccgtgtct gtggccccta cagccgccga 9900gaaggtgcca
gccgagtgcc ccgagctgac cagaagatgc ctgctgggcg aggtgttcga 9960gggcgacaag
tacgagagct ggctgcggcc cctggtcaac gtgaccggca gagatggccc 10020cctgagccag
ctgatccggt acagacccgt gacccccgag gccgccaata gcgtgctgct 10080ggacgaggcc
ttcctggata ccctggccct gctgtacaac aaccccgacc agctgagagc 10140cctgctgacc
ctgctgtcca gcgacaccgc ccccagatgg atgaccgtga tgcggggcta 10200cagcgagtgt
ggagatggca gccctgccgt gtacacctgc gtggacgacc tgtgcagagg 10260ctacgacctg
accagactga gctacggccg gtccatcttc acagagcacg tgctgggctt 10320cgagctggtg
ccccccagcc tgttcaacgt ggtggtggcc atccggaacg aggccaccag 10380aaccaacaga
gccgtgcggc tgcctgtgtc tacagccgct gcacctgagg gcatcacact 10440gttctacggc
ctgtacaacg ccgtgaaaga gttctgcctc cggcaccagc tggatccccc 10500cctgctgaga
cacctggaca agtactacgc cggcctgccc ccagagctga agcagaccag 10560agtgaacctg
cccgcccaca gcagatatgg ccctcaggcc gtggacgcca gatgataacg 10620ccggcggccc
ctataactct ctacggctaa cctgaatgga ctacgacata gtctagtccg 10680ccaagatgag
ccccaaggac ctgaccccct tcctgacaac cctgtggctg ctcctgggcc 10740atagcagagt
gcctagagtg cgggccgagg aatgctgcga gttcatcaac gtgaaccacc 10800cccccgagcg
gtgctacgac ttcaagatgt gcaaccggtt caccgtggcc ctgagatgcc 10860ccgacggcga
agtgtgctac agccccgaga aaaccgccga gatccggggc atcgtgacca 10920ccatgaccca
cagcctgacc cggcaggtgg tgcacaacaa gctgaccagc tgcaactaca 10980accccctgta
cctggaagcc gacggccgga tcagatgcgg caaagtgaac gacaaggccc 11040agtacctgct
gggagccgcc ggaagcgtgc cctaccggtg gatcaacctg gaatacgaca 11100agatcacccg
gatcgtgggc ctggaccagt acctggaaag cgtgaagaag cacaagcggc 11160tggacgtgtg
cagagccaag atgggctaca tgctgcagtg ataaggcgcg ccaacgttac 11220tggccgaagc
cgcttggaat aaggccggtg tgcgtttgtc tatatgttat tttccaccat 11280attgccgtct
tttggcaatg tgagggcccg gaaacctggc cctgtcttct tgacgagcat 11340tcctaggggt
ctttcccctc tcgccaaagg aatgcaaggt ctgttgaatg tcgtgaagga 11400agcagttcct
ctggaagctt cttgaagaca aacaacgtct gtagcgaccc tttgcaggca 11460gcggaacccc
ccacctggcg acaggtgcct ctgcggccaa aagccacgtg tataagatac 11520acctgcaaag
gcggcacaac cccagtgcca cgttgtgagt tggatagttg tggaaagagt 11580caaatggctc
tcctcaagcg tattcaacaa ggggctgaag gatgcccaga aggtacccca 11640ttgtatggga
tctgatctgg ggcctcggtg cacatgcttt acatgtgttt agtcgaggtt 11700aaaaaaacgt
ctaggccccc cgaaccacgg ggacgtggtt ttcctttgaa aaacacgata 11760atatgctgcg
gctgctgctg agacaccact tccactgcct gctgctgtgt gccgtgtggg 11820ccaccccttg
tctggccagc ccttggagca ccctgaccgc caaccagaac cctagccccc 11880cttggtccaa
gctgacctac agcaagcccc acgacgccgc caccttctac tgcccctttc 11940tgtaccccag
ccctcccaga agccccctgc agttcagcgg cttccagaga gtgtccaccg 12000gccctgagtg
ccggaacgag acactgtacc tgctgtacaa ccgggagggc cagacactgg 12060tggagcggag
cagcacctgg gtgaaaaaag tgatctggta tctgagcggc cggaaccaga 12120ccatcctgca
gcggatgccc agaaccgcca gcaagcccag cgacggcaac gtgcagatca 12180gcgtggagga
cgccaaaatc ttcggagccc acatggtgcc caagcagacc aagctgctga 12240gattcgtggt
caacgacggc accagatatc agatgtgcgt gatgaagctg gaaagctggg 12300cccacgtgtt
ccgggactac tccgtgagct tccaggtccg gctgaccttc accgaggcca 12360acaaccagac
ctacaccttc tgcacccacc ccaacctgat cgtgtgataa gtacctttgt 12420acgcctgttt
tataccccct ccctgatttg caacttagaa gcaacgcaaa ccagatcaat 12480agtaggtgtg
acataccagt cgcatcttga tcaagcactt ctgtatcccc ggaccgagta 12540tcaatagact
gtgcacacgg ttgaaggaga aaacgtccgt tacccggcta actacttcga 12600gaagcctagt
aacgccattg aagttgcaga gtgtttcgct cagcactccc cccgtgtaga 12660tcaggtcgat
gagtcaccgc attccccacg ggcgaccgtg gcggtggctg cgttggcggc 12720ctgcctatgg
ggtaacccat aggacgctct aatacggaca tggcgtgaag agtctattga 12780gctagttagt
agtcctccgg cccctgaatg cggctaatcc taactgcgga gcacataccc 12840ttaatccaaa
gggcagtgtg tcgtaacggg caactctgca gcggaaccga ctactttggg 12900tgtccgtgtt
tctttttatt cttgtattgg ctgcttatgg tgacaattaa agaattgtta 12960ccatatagct
attggattgg ccatccagtg tcaaacagag ctattgtata tctctttgtt 13020ggattcacac
ctctcactct tgaaacgtta cacaccctca attacattat actgctgaac 13080acgaagcgca
tatgcggctg tgcagagtgt ggctgtccgt gtgcctgtgt gccgtggtgc 13140tgggccagtg
ccagagagag acagccgaga agaacgacta ctaccgggtg ccccactact 13200gggatgcctg
cagcagagcc ctgcccgacc agacccggta caaatacgtg gagcagctcg 13260tggacctgac
cctgaactac cactacgacg ccagccacgg cctggacaac ttcgacgtgc 13320tgaagcggat
caacgtgacc gaggtgtccc tgctgatcag cgacttccgg cggcagaaca 13380gaagaggcgg
caccaacaag cggaccacct tcaacgccgc tggctctctg gcccctcacg 13440ccagatccct
ggaattcagc gtgcggctgt tcgccaactg ataacgttgc atcctgcagg 13500atacagcagc
aattggcaag ctgcttacat agaactcgcg gcgattggca tgccgcctta 13560aaatttttat
tttatttttc ttttcttttc cgaatcggat tttgttttta atatttcaaa 13620aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa aagggtcggc atggcatctc cacctcctcg 13680cggtccgacc
tgggcatccg aaggaggacg cacgtccact cggatggcta agggagagcc 13740acgtttaaac
gctagagcaa gacgtttccc gttgaatatg gctcataaca ccccttgtat 13800tactgtttat
gtaagcagac agttttattg ttcatgatga tatattttta tcttgtgcaa 13860tgtaacatca
gagattttga gacacaacgt ggctttgttg aataaatcga acttttgctg 13920agttgaagga
tcagatcacg catcttcccg acaacgcaga ccgttccgtg gcaaagcaaa 13980agttcaaaat
caccaactgg tccacctaca acaaagctct catcaaccgt ggctccctca 14040ctttctggct
ggatgatggg gcgattcagg cctggtatga gtcagcaaca ccttcttcac 14100gaggcagacc
tcagcgctag cggagtgtat actggcttac tatgttggca ctgatgaggg 14160tgtcagtgaa
gtgcttcatg tggcaggaga aaaaaggctg caccggtgcg tcagcagaat 14220atgtgataca
ggatatattc cgcttcctcg ctcactgact cgctacgctc ggtcgttcga 14280ctgcggcgag
cggaaatggc ttacgaacgg ggcggagatt tcctggaaga tgccaggaag 14340atacttaaca
gggaagtgag agggccgcgg caaagccgtt tttccatagg ctccgccccc 14400ctgacaagca
tcacgaaatc tgacgctcaa atcagtggtg gcgaaacccg acaggactat 14460aaagatacca
ggcgtttccc ctggcggctc cctcgtgcgc tctcctgttc ctgcctttcg 14520gtttaccggt
gtcattccgc tgttatggcc gcgtttgtct cattccacgc ctgacactca 14580gttccgggta
ggcagttcgc tccaagctgg actgtatgca cgaacccccc gttcagtccg 14640accgctgcgc
cttatccggt aactatcgtc ttgagtccaa cccggaaaga catgcaaaag 14700caccactggc
agcagccact ggtaattgat ttagaggagt tagtcttgaa gtcatgcgcc 14760ggttaaggct
aaactgaaag gacaagtttt ggtgactgcg ctcctccaag ccagttacct 14820cggttcaaag
agttggtagc tcagagaacc ttcgaaaaac cgccctgcaa ggcggttttt 14880tcgttttcag
agcaagagat tacgcgcaga ccaaaacgat ctcaagaaga tcatcttatt 14940aaggggtctg
acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 15000tcaaaaagga
tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 15060agtatatatg
agtaaacttg gtctgacagt tattagaaaa attcatccag cagacgataa 15120aacgcaatac
gctggctatc cggtgccgca atgccataca gcaccagaaa acgatccgcc 15180cattcgccgc
ccagttcttc cgcaatatca cgggtggcca gcgcaatatc ctgataacga 15240tccgccacgc
ccagacggcc gcaatcaata aagccgctaa aacggccatt ttccaccata 15300atgttcggca
ggcacgcatc accatgggtc accaccagat cttcgccatc cggcatgctc 15360gctttcagac
gcgcaaacag ctctgccggt gccaggccct gatgttcttc atccagatca 15420tcctgatcca
ccaggcccgc ttccatacgg gtacgcgcac gttcaatacg atgtttcgcc 15480tgatgatcaa
acggacaggt cgccgggtcc agggtatgca gacgacgcat ggcatccgcc 15540ataatgctca
ctttttctgc cggcgccaga tggctagaca gcagatcctg acccggcact 15600tcgcccagca
gcagccaatc acggcccgct tcggtcacca catccagcac cgccgcacac 15660ggaacaccgg
tggtggccag ccagctcaga cgcgccgctt catcctgcag ctcgttcagc 15720gcaccgctca
gatcggtttt cacaaacagc accggacgac cctgcgcgct cagacgaaac 15780accgccgcat
cagagcagcc aatggtctgc tgcgcccaat catagccaaa cagacgttcc 15840acccacgctg
ccgggctacc cgcatgcagg ccatcctgtt caatcatact cttccttttt 15900caatattatt
gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 15960atttagaaaa
ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctaaa 16020ttgtaagcgt
taatattttg ttaaaattcg cgttaaattt ttgttaaatc agctcatttt 16080ttaaccaata
ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag 16140ggttgagtgg
ccgctacagg gcgctcccat tcgccattca ggctgcgcaa ctgttgggaa 16200gggcgtttcg
gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc 16260aaggcgatta
agttgggtaa cgccagggtt ttcccagtca cacgcgtaat acgactcact 16320atag
16324
67 16360 DNA Artificial Sequence source /note="Description of Artificial Sequence Synthetic polynucleotide"
67 ataggcggcg catgagagaa
gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga ggaagacagc
ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc caagcaggtc
actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa actgatcgaa
acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc ccgcagaatg
tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga tccggacaga
ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga taaggaattg
gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga aactgagact
atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt ttaccaggat
gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg agttagagtc
gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc tggagcatat
ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa cataggccta
tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag aaagaagtat
ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca cgagaagagg
gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa gcaaaattac
acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag aatagctatc
agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg cgagggattc
ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc cgtgtgcacg
tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga tgtcagtgcg
gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa cggtcgcacc
cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca ggcatttgct
aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg actacgagat
agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac atctatttat
aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc attcgtgctg
cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag gaaaatgtta
gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga agctaagtgc
gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc tctaccacct
ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat gttacaagag
gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag ctacgatggc
gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa gagtgaaaaa
ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc tggccgaaaa
gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg acatgcaata
cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa cgaacgtgag
ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa cactgatgaa
gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta cgacatcgac
aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg cgagctggtg
gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc cgctccttac
caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg catcattaaa
agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg tgcagaaatt
ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt ggactcagtg
ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc ttttgcttgt
catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc agtgctctgc
ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca ttttaaccac
gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa atctgtgact
tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc gaaagagact
aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct cattctcact
tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga aataatgacg
gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta caaggtgaat
gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac ccgcacggag
gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact gactgccaag
taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga tgccatcatg
aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc aaacgtgtgt
tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac cactgaacaa
tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat agtattgaac
caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc tgcacccact
gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa catgtacggg
ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc tcgggcagtt
gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga tccgcgcata
aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca taatgaacac
ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt cctggtggtc
ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg gcctgaggct
accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata tgacataata
tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga agaccatgcc
attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg cggaacctgt
gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg tgctatagcg
cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga gacggaagtt
ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta caagctttca
tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg tgcaccctca
tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat aaatgctgct
aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa attcccggaa
agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg tgcagctaaa
catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga aggtgacaaa
cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa ttacaagtca
gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg actaacccaa
tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc catatactgc
agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga agcagtggag
gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct ggtgagggtg
catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg caaaactttc
tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga aattaatgcc
atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct cggagaaagc
atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac accacctagc
acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg cctaaaagcc
tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta tagaatcact
ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt gcctgcgtat
attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc ggagccatcg
gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac cgaggatgag
accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga tagcataagt
ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat tcacgggccg
ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga tgtggacagt
ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac gtcagccgag
actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc tgcgcctcga
acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc acttgcaccc
agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa tagggtgatc
actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc ggtctcgaga
accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga ggagtttgag
gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt ttcctccgac
accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc cgaagtggtg
ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga aaaagaagaa
ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag ataccagtcc
aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg cctagggcat
tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt tcctttgtat
tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc ctgtaacgcc
atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga gtacgatgcc
tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt ttgccctgca
aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg atcggcagtg
ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa aagaaattgc
aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa tgtggaatgc
ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa ccccatcagg
cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa agctgctgct
ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga caggtttgta
atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga agaacggccc
aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg cggaatccac
cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac actgtttgat
atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg ggattgtgtt
ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc tctgaccgcg
ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat tgaggcggct
ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt cggagccatg
atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat tgtaatcgca
agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat tggagatgac
aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc cacctggttg
aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta tttctgtgga
gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga ccccctaaaa
aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga tgacaggaga
agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc agagctgtgc
aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat ggccatgact
actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat aactctctac
ggctaacctg aatggactac gacatagtct agtccgccaa 7560gatgaggcct ggcctgccct
cctacctgat catcctggcc gtgtgcctgt tcagccacct 7620gctgtccagc agatacggcg
ccgaggccgt gagcgagccc ctggacaagg ctttccacct 7680gctgctgaac acctacggca
gacccatccg gtttctgcgg gagaacacca cccagtgcac 7740ctacaacagc agcctgcgga
acagcaccgt cgtgagagag aacgccatca gcttcaactt 7800tttccagagc tacaaccagt
actacgtgtt ccacatgccc agatgcctgt ttgccggccc 7860tctggccgag cagttcctga
accaggtgga cctgaccgag acactggaaa gataccagca 7920gcggctgaat acctacgccc
tggtgtccaa ggacctggcc agctaccggt cctttagcca 7980gcagctcaag gctcaggata
gcctcggcga gcagcctacc accgtgcccc ctcccatcga 8040cctgagcatc ccccacgtgt
ggatgcctcc ccagaccacc cctcacggct ggaccgagag 8100ccacaccacc tccggcctgc
acagacccca cttcaaccag acctgcatcc tgttcgacgg 8160ccacgacctg ctgtttagca
ccgtgacccc ctgcctgcac cagggcttct acctgatcga 8220cgagctgaga tacgtgaaga
tcaccctgac cgaggatttc ttcgtggtca ccgtgtccat 8280cgacgacgac acccccatgc
tgctgatctt cggccacctg cccagagtgc tgttcaaggc 8340cccctaccag cgggacaact
tcatcctgcg gcagaccgag aagcacgagc tgctggtgct 8400ggtcaagaag gaccagctga
accggcactc ctacctgaag gaccccgact tcctggacgc 8460cgccctggac ttcaactacc
tggacctgag cgccctgctg agaaacagct tccacagata 8520cgccgtggac gtgctgaagt
ccggacggtg ccagatgctc gatcggcgga ccgtggagat 8580ggccttcgcc tatgccctcg
ccctgttcgc cgctgccaga caggaagagg ctggcgccca 8640ggtgtcagtg cccagagccc
tggatagaca ggccgccctg ctgcagatcc aggaattcat 8700gatcacctgc ctgagccaga
ccccccctag aaccaccctg ctgctgtacc ccacagccgt 8760ggatctggcc aagagggccc
tgtggacccc caaccagatc accgacatca caagcctcgt 8820gcggctcgtg tacatcctga
gcaagcagaa ccagcagcac ctgatccccc agtgggccct 8880gagacagatc gccgacttcg
ccctgaagct gcacaagacc catctggcca gctttctgag 8940cgccttcgcc aggcaggaac
tgtacctgat gggcagcctg gtccacagca tgctggtgca 9000taccaccgag cggcgggaga
tcttcatcgt ggagacaggc ctgtgtagcc tggccgagct 9060gtcccacttt acccagctgc
tggcccaccc tcaccacgag tacctgagcg acctgtacac 9120cccctgcagc agcagcggca
gacgggacca cagcctggaa cggctgacca gactgttccc 9180cgatgccacc gtgcctgcta
cagtgcctgc cgccctgtcc atcctgtcca ccatgcagcc 9240cagcaccctg gaaaccttcc
ccgacctgtt ctgcctgccc ctgggcgaga gctttagcgc 9300cctgaccgtg tccgagcacg
tgtcctacat cgtgaccaat cagtacctga tcaagggcat 9360cagctacccc gtgtccacca
cagtcgtggg ccagagcctg atcatcaccc agaccgacag 9420ccagaccaag tgcgagctga
cccggaacat gcacaccaca cacagcatca ccgtggccct 9480gaacatcagc ctggaaaact
gcgctttctg tcagtctgcc ctgctggaat acgacgatac 9540ccagggcgtg atcaacatca
tgtacatgca cgacagcgac gacgtgctgt tcgccctgga 9600cccctacaac gaggtggtgg
tgtccagccc ccggacccac tacctgatgc tgctgaagaa 9660cggcaccgtg ctggaagtga
ccgacgtggt ggtggacgcc accgacggca gcggatctgg 9720gtcccaccat caccatcacc
attgataatc tagaggcccc tataactctc tacggctaac 9780ctgaatggac tacgacatag
tctagtccgc caagatgtgc agaaggcccg actgcggctt 9840cagcttcagc cctggacccg
tgatcctgct gtggtgctgc ctgctgctgc ctatcgtgtc 9900ctctgccgcc gtgtctgtgg
cccctacagc cgccgagaag gtgccagccg agtgccccga 9960gctgaccaga agatgcctgc
tgggcgaggt gttcgagggc gacaagtacg agagctggct 10020gcggcccctg gtcaacgtga
ccggcagaga tggccccctg agccagctga tccggtacag 10080acccgtgacc cccgaggccg
ccaatagcgt gctgctggac gaggccttcc tggataccct 10140ggccctgctg tacaacaacc
ccgaccagct gagagccctg ctgaccctgc tgtccagcga 10200caccgccccc agatggatga
ccgtgatgcg gggctacagc gagtgtggag atggcagccc 10260tgccgtgtac acctgcgtgg
acgacctgtg cagaggctac gacctgacca gactgagcta 10320cggccggtcc atcttcacag
agcacgtgct gggcttcgag ctggtgcccc ccagcctgtt 10380caacgtggtg gtggccatcc
ggaacgaggc caccagaacc aacagagccg tgcggctgcc 10440tgtgtctaca gccgctgcac
ctgagggcat cacactgttc tacggcctgt acaacgccgt 10500gaaagagttc tgcctccggc
accagctgga tccccccctg ctgagacacc tggacaagta 10560ctacgccggc ctgcccccag
agctgaagca gaccagagtg aacctgcccg cccacagcag 10620atatggccct caggccgtgg
acgccagatg ataacgccgg cggcccctat aactctctac 10680ggctaacctg aatggactac
gacatagtct agtccgccaa gatgagcccc aaggacctga 10740cccccttcct gacaaccctg
tggctgctcc tgggccatag cagagtgcct agagtgcggg 10800ccgaggaatg ctgcgagttc
atcaacgtga accacccccc cgagcggtgc tacgacttca 10860agatgtgcaa ccggttcacc
gtggccctga gatgccccga cggcgaagtg tgctacagcc 10920ccgagaaaac cgccgagatc
cggggcatcg tgaccaccat gacccacagc ctgacccggc 10980aggtggtgca caacaagctg
accagctgca actacaaccc cctgtacctg gaagccgacg 11040gccggatcag atgcggcaaa
gtgaacgaca aggcccagta cctgctggga gccgccggaa 11100gcgtgcccta ccggtggatc
aacctggaat acgacaagat cacccggatc gtgggcctgg 11160accagtacct ggaaagcgtg
aagaagcaca agcggctgga cgtgtgcaga gccaagatgg 11220gctacatgct gcagtgataa
ggcgcgccaa cgttactggc cgaagccgct tggaataagg 11280ccggtgtgcg tttgtctata
tgttattttc caccatattg ccgtcttttg gcaatgtgag 11340ggcccggaaa cctggccctg
tcttcttgac gagcattcct aggggtcttt cccctctcgc 11400caaaggaatg caaggtctgt
tgaatgtcgt gaaggaagca gttcctctgg aagcttcttg 11460aagacaaaca acgtctgtag
cgaccctttg caggcagcgg aaccccccac ctggcgacag 11520gtgcctctgc ggccaaaagc
cacgtgtata agatacacct gcaaaggcgg cacaacccca 11580gtgccacgtt gtgagttgga
tagttgtgga aagagtcaaa tggctctcct caagcgtatt 11640caacaagggg ctgaaggatg
cccagaaggt accccattgt atgggatctg atctggggcc 11700tcggtgcaca tgctttacat
gtgtttagtc gaggttaaaa aaacgtctag gccccccgaa 11760ccacggggac gtggttttcc
tttgaaaaac acgataatat gctgcggctg ctgctgagac 11820accacttcca ctgcctgctg
ctgtgtgccg tgtgggccac cccttgtctg gccagccctt 11880ggagcaccct gaccgccaac
cagaacccta gccccccttg gtccaagctg acctacagca 11940agccccacga cgccgccacc
ttctactgcc cctttctgta ccccagccct cccagaagcc 12000ccctgcagtt cagcggcttc
cagagagtgt ccaccggccc tgagtgccgg aacgagacac 12060tgtacctgct gtacaaccgg
gagggccaga cactggtgga gcggagcagc acctgggtga 12120aaaaagtgat ctggtatctg
agcggccgga accagaccat cctgcagcgg atgcccagaa 12180ccgccagcaa gcccagcgac
ggcaacgtgc agatcagcgt ggaggacgcc aaaatcttcg 12240gagcccacat ggtgcccaag
cagaccaagc tgctgagatt cgtggtcaac gacggcacca 12300gatatcagat gtgcgtgatg
aagctggaaa gctgggccca cgtgttccgg gactactccg 12360tgagcttcca ggtccggctg
accttcaccg aggccaacaa ccagacctac accttctgca 12420cccaccccaa cctgatcgtg
tgataagtac ctttgtacgc ctgttttata ccccctccct 12480gatttgcaac ttagaagcaa
cgcaaaccag atcaatagta ggtgtgacat accagtcgca 12540tcttgatcaa gcacttctgt
atccccggac cgagtatcaa tagactgtgc acacggttga 12600aggagaaaac gtccgttacc
cggctaacta cttcgagaag cctagtaacg ccattgaagt 12660tgcagagtgt ttcgctcagc
actccccccg tgtagatcag gtcgatgagt caccgcattc 12720cccacgggcg accgtggcgg
tggctgcgtt ggcggcctgc ctatggggta acccatagga 12780cgctctaata cggacatggc
gtgaagagtc tattgagcta gttagtagtc ctccggcccc 12840tgaatgcggc taatcctaac
tgcggagcac atacccttaa tccaaagggc agtgtgtcgt 12900aacgggcaac tctgcagcgg
aaccgactac tttgggtgtc cgtgtttctt tttattcttg 12960tattggctgc ttatggtgac
aattaaagaa ttgttaccat atagctattg gattggccat 13020ccagtgtcaa acagagctat
tgtatatctc tttgttggat tcacacctct cactcttgaa 13080acgttacaca ccctcaatta
cattatactg ctgaacacga agcgcatatg cggctgtgca 13140gagtgtggct gtccgtgtgc
ctgtgtgccg tggtgctggg ccagtgccag agagagacag 13200ccgagaagaa cgactactac
cgggtgcccc actactggga tgcctgcagc agagccctgc 13260ccgaccagac ccggtacaaa
tacgtggagc agctcgtgga cctgaccctg aactaccact 13320acgacgccag ccacggcctg
gacaacttcg acgtgctgaa gcggatcaac gtgaccgagg 13380tgtccctgct gatcagcgac
ttccggcggc agaacagaag aggcggcacc aacaagcgga 13440ccaccttcaa cgccgctggc
tctctggccc ctcacgccag atccctggaa ttcagcgtgc 13500ggctgttcgc caactgataa
cgttgcatcc tgcaggatac agcagcaatt ggcaagctgc 13560ttacatagaa ctcgcggcga
ttggcatgcc gccttaaaat ttttatttta tttttctttt 13620cttttccgaa tcggattttg
tttttaatat ttcaaaaaaa aaaaaaaaaa aaaaaaaaaa 13680aaaaaaaagg gtcggcatgg
catctccacc tcctcgcggt ccgacctggg catccgaagg 13740aggacgcacg tccactcgga
tggctaaggg agagccacgt ttaaacgcta gagcaagacg 13800tttcccgttg aatatggctc
ataacacccc ttgtattact gtttatgtaa gcagacagtt 13860ttattgttca tgatgatata
tttttatctt gtgcaatgta acatcagaga ttttgagaca 13920caacgtggct ttgttgaata
aatcgaactt ttgctgagtt gaaggatcag atcacgcatc 13980ttcccgacaa cgcagaccgt
tccgtggcaa agcaaaagtt caaaatcacc aactggtcca 14040cctacaacaa agctctcatc
aaccgtggct ccctcacttt ctggctggat gatggggcga 14100ttcaggcctg gtatgagtca
gcaacacctt cttcacgagg cagacctcag cgctagcgga 14160gtgtatactg gcttactatg
ttggcactga tgagggtgtc agtgaagtgc ttcatgtggc 14220aggagaaaaa aggctgcacc
ggtgcgtcag cagaatatgt gatacaggat atattccgct 14280tcctcgctca ctgactcgct
acgctcggtc gttcgactgc ggcgagcgga aatggcttac 14340gaacggggcg gagatttcct
ggaagatgcc aggaagatac ttaacaggga agtgagaggg 14400ccgcggcaaa gccgtttttc
cataggctcc gcccccctga caagcatcac gaaatctgac 14460gctcaaatca gtggtggcga
aacccgacag gactataaag ataccaggcg tttcccctgg 14520cggctccctc gtgcgctctc
ctgttcctgc ctttcggttt accggtgtca ttccgctgtt 14580atggccgcgt ttgtctcatt
ccacgcctga cactcagttc cgggtaggca gttcgctcca 14640agctggactg tatgcacgaa
ccccccgttc agtccgaccg ctgcgcctta tccggtaact 14700atcgtcttga gtccaacccg
gaaagacatg caaaagcacc actggcagca gccactggta 14760attgatttag aggagttagt
cttgaagtca tgcgccggtt aaggctaaac tgaaaggaca 14820agttttggtg actgcgctcc
tccaagccag ttacctcggt tcaaagagtt ggtagctcag 14880agaaccttcg aaaaaccgcc
ctgcaaggcg gttttttcgt tttcagagca agagattacg 14940cgcagaccaa aacgatctca
agaagatcat cttattaagg ggtctgacgc tcagtggaac 15000gaaaactcac gttaagggat
tttggtcatg agattatcaa aaaggatctt cacctagatc 15060cttttaaatt aaaaatgaag
ttttaaatca atctaaagta tatatgagta aacttggtct 15120gacagttatt agaaaaattc
atccagcaga cgataaaacg caatacgctg gctatccggt 15180gccgcaatgc catacagcac
cagaaaacga tccgcccatt cgccgcccag ttcttccgca 15240atatcacggg tggccagcgc
aatatcctga taacgatccg ccacgcccag acggccgcaa 15300tcaataaagc cgctaaaacg
gccattttcc accataatgt tcggcaggca cgcatcacca 15360tgggtcacca ccagatcttc
gccatccggc atgctcgctt tcagacgcgc aaacagctct 15420gccggtgcca ggccctgatg
ttcttcatcc agatcatcct gatccaccag gcccgcttcc 15480atacgggtac gcgcacgttc
aatacgatgt ttcgcctgat gatcaaacgg acaggtcgcc 15540gggtccaggg tatgcagacg
acgcatggca tccgccataa tgctcacttt ttctgccggc 15600gccagatggc tagacagcag
atcctgaccc ggcacttcgc ccagcagcag ccaatcacgg 15660cccgcttcgg tcaccacatc
cagcaccgcc gcacacggaa caccggtggt ggccagccag 15720ctcagacgcg ccgcttcatc
ctgcagctcg ttcagcgcac cgctcagatc ggttttcaca 15780aacagcaccg gacgaccctg
cgcgctcaga cgaaacaccg ccgcatcaga gcagccaatg 15840gtctgctgcg cccaatcata
gccaaacaga cgttccaccc acgctgccgg gctacccgca 15900tgcaggccat cctgttcaat
catactcttc ctttttcaat attattgaag catttatcag 15960ggttattgtc tcatgagcgg
atacatattt gaatgtattt agaaaaataa acaaataggg 16020gttccgcgca catttccccg
aaaagtgcca cctaaattgt aagcgttaat attttgttaa 16080aattcgcgtt aaatttttgt
taaatcagct cattttttaa ccaataggcc gaaatcggca 16140aaatccctta taaatcaaaa
gaatagaccg agatagggtt gagtggccgc tacagggcgc 16200tcccattcgc cattcaggct
gcgcaactgt tgggaagggc gtttcggtgc gggcctcttc 16260gctattacgc cagctggcga
aagggggatg tgctgcaagg cgattaagtt gggtaacgcc 16320agggttttcc cagtcacacg
cgtaatacga ctcactatag 16360
68 868 PRT Varicella zoster virus
68 Met Phe Val Thr Ala Val Val Ser Val Ser Pro Ser Ser Phe
Tyr Glu1 5 10 15Ser Leu
Gln Val Glu Pro Thr Gln Ser Glu Asp Ile Thr Arg Ser Ala 20
25 30His Leu Gly Asp Gly Asp Glu Ile Arg
Glu Ala Ile His Lys Ser Gln 35 40
45Asp Ala Glu Thr Lys Pro Thr Phe Tyr Val Cys Pro Pro Pro Thr Gly 50
55 60Ser Thr Ile Val Arg Leu Glu Pro Pro
Arg Thr Cys Pro Asp Tyr His65 70 75
80Leu Gly Lys Asn Phe Thr Glu Gly Ile Ala Val Val Tyr Lys
Glu Asn 85 90 95Ile Ala
Ala Tyr Lys Phe Lys Ala Thr Val Tyr Tyr Lys Asp Val Ile 100
105 110Val Ser Thr Ala Trp Ala Gly Ser Ser
Tyr Thr Gln Ile Thr Asn Arg 115 120
125Tyr Ala Asp Arg Val Pro Ile Pro Val Ser Glu Ile Thr Asp Thr Ile
130 135 140Asp Lys Phe Gly Lys Cys Ser
Ser Lys Ala Thr Tyr Val Arg Asn Asn145 150
155 160His Lys Val Glu Ala Phe Asn Glu Asp Lys Asn Pro
Gln Asp Met Pro 165 170
175Leu Ile Ala Ser Lys Tyr Asn Ser Val Gly Ser Lys Ala Trp His Thr
180 185 190Thr Asn Asp Thr Tyr Met
Val Ala Gly Thr Pro Gly Thr Tyr Arg Thr 195 200
205Gly Thr Ser Val Asn Cys Ile Ile Glu Glu Val Glu Ala Arg
Ser Ile 210 215 220Phe Pro Tyr Asp Ser
Phe Gly Leu Ser Thr Gly Asp Ile Ile Tyr Met225 230
235 240Ser Pro Phe Phe Gly Leu Arg Asp Gly Ala
Tyr Arg Glu His Ser Asn 245 250
255Tyr Ala Met Asp Arg Phe His Gln Phe Glu Gly Tyr Arg Gln Arg Asp
260 265 270Leu Asp Thr Arg Ala
Leu Leu Glu Pro Ala Ala Arg Asn Phe Leu Val 275
280 285Thr Pro His Leu Thr Val Gly Trp Asn Trp Lys Pro
Lys Arg Thr Glu 290 295 300Val Cys Ser
Leu Val Lys Trp Arg Glu Val Glu Asp Val Val Arg Asp305
310 315 320Glu Tyr Ala His Asn Phe Arg
Phe Thr Met Lys Thr Leu Ser Thr Thr 325
330 335Phe Ile Ser Glu Thr Asn Glu Phe Asn Leu Asn Gln
Ile His Leu Ser 340 345 350Gln
Cys Val Lys Glu Glu Ala Arg Ala Ile Ile Asn Arg Ile Tyr Thr 355
360 365Thr Arg Tyr Asn Ser Ser His Val Arg
Thr Gly Asp Ile Gln Thr Tyr 370 375
380Leu Ala Arg Gly Gly Phe Val Val Val Phe Gln Pro Leu Leu Ser Asn385
390 395 400Ser Leu Ala Arg
Leu Tyr Leu Gln Glu Leu Val Arg Glu Asn Thr Asn 405
410 415His Ser Pro Gln Lys His Pro Thr Arg Asn
Thr Arg Ser Arg Arg Ser 420 425
430Val Pro Val Glu Leu Arg Ala Asn Arg Thr Ile Thr Thr Thr Ser Ser
435 440 445Val Glu Phe Ala Met Leu Gln
Phe Thr Tyr Asp His Ile Gln Glu His 450 455
460Val Asn Glu Met Leu Ala Arg Ile Ser Ser Ser Trp Cys Gln Leu
Gln465 470 475 480Asn Arg
Glu Arg Ala Leu Trp Ser Gly Leu Phe Pro Ile Asn Pro Ser
485 490 495Ala Leu Ala Ser Thr Ile Leu
Asp Gln Arg Val Lys Ala Arg Ile Leu 500 505
510Gly Asp Val Ile Ser Val Ser Asn Cys Pro Glu Leu Gly Ser
Asp Thr 515 520 525Arg Ile Ile Leu
Gln Asn Ser Met Arg Val Ser Gly Ser Thr Thr Arg 530
535 540Cys Tyr Ser Arg Pro Leu Ile Ser Ile Val Ser Leu
Asn Gly Ser Gly545 550 555
560Thr Val Glu Gly Gln Leu Gly Thr Asp Asn Glu Leu Ile Met Ser Arg
565 570 575Asp Leu Leu Glu Pro
Cys Val Ala Asn His Lys Arg Tyr Phe Leu Phe 580
585 590Gly His His Tyr Val Tyr Tyr Glu Asp Tyr Arg Tyr
Val Arg Glu Ile 595 600 605Ala Val
His Asp Val Gly Met Ile Ser Thr Tyr Val Asp Leu Asn Leu 610
615 620Thr Leu Leu Lys Asp Arg Glu Phe Met Pro Leu
Gln Val Tyr Thr Arg625 630 635
640Asp Glu Leu Arg Asp Thr Gly Leu Leu Asp Tyr Ser Glu Ile Gln Arg
645 650 655Arg Asn Gln Met
His Ser Leu Arg Phe Tyr Asp Ile Asp Lys Val Val 660
665 670Gln Tyr Asp Ser Gly Thr Ala Ile Met Gln Gly
Met Ala Gln Phe Phe 675 680 685Gln
Gly Leu Gly Thr Ala Gly Gln Ala Val Gly His Val Val Leu Gly 690
695 700Ala Thr Gly Ala Leu Leu Ser Thr Val His
Gly Phe Thr Thr Phe Leu705 710 715
720Ser Asn Pro Phe Gly Ala Leu Ala Val Gly Leu Leu Val Leu Ala
Gly 725 730 735Leu Val Ala
Ala Phe Phe Ala Tyr Arg Tyr Val Leu Lys Leu Lys Thr 740
745 750Ser Pro Met Lys Ala Leu Tyr Pro Leu Thr
Thr Lys Gly Leu Lys Gln 755 760
765Leu Pro Glu Gly Met Asp Pro Phe Ala Glu Lys Pro Asn Ala Thr Asp 770
775 780Thr Pro Ile Glu Glu Ile Gly Asp
Ser Gln Asn Thr Glu Pro Ser Val785 790
795 800Asn Ser Gly Phe Asp Pro Asp Lys Phe Arg Glu Ala
Gln Glu Met Ile 805 810
815Lys Tyr Met Thr Leu Val Ser Ala Ala Glu Arg Gln Glu Ser Lys Ala
820 825 830Arg Lys Lys Asn Lys Thr
Ser Ala Leu Leu Thr Ser Arg Leu Thr Gly 835 840
845Leu Ala Leu Arg Asn Arg Arg Gly Tyr Ser Arg Val Arg Thr
Glu Asn 850 855 860Val Thr Gly
Val 865
69 841 PRT Varicella zoster virus
69 Met Phe Ala Leu Val Leu Ala Val
Val Ile Leu Pro Leu Trp Thr Thr1 5 10
15Ala Asn Lys Ser Tyr Val Thr Pro Thr Pro Ala Thr Arg Ser
Ile Gly 20 25 30His Met Ser
Ala Leu Leu Arg Glu Tyr Ser Asp Arg Asn Met Ser Leu 35
40 45Lys Leu Glu Ala Phe Tyr Pro Thr Gly Phe Asp
Glu Glu Leu Ile Lys 50 55 60Ser Leu
His Trp Gly Asn Asp Arg Lys His Val Phe Leu Val Ile Val65
70 75 80Lys Val Asn Pro Thr Thr His
Glu Gly Asp Val Gly Leu Val Ile Phe 85 90
95Pro Lys Tyr Leu Leu Ser Pro Tyr His Phe Lys Ala Glu
His Arg Ala 100 105 110Pro Phe
Pro Ala Gly Arg Phe Gly Phe Leu Ser His Pro Val Thr Pro 115
120 125Asp Val Ser Phe Phe Asp Ser Ser Phe Ala
Pro Tyr Leu Thr Thr Gln 130 135 140His
Leu Val Ala Phe Thr Thr Phe Pro Pro Asn Pro Leu Val Trp His145
150 155 160Leu Glu Arg Ala Glu Thr
Ala Ala Thr Ala Glu Arg Pro Phe Gly Val 165
170 175Ser Leu Leu Pro Ala Arg Pro Thr Val Pro Lys Asn
Thr Ile Leu Glu 180 185 190His
Lys Ala His Phe Ala Thr Trp Asp Ala Leu Ala Arg His Thr Phe 195
200 205Phe Ser Ala Glu Ala Ile Ile Thr Asn
Ser Thr Leu Arg Ile His Val 210 215
220Pro Leu Phe Gly Ser Val Trp Pro Ile Arg Tyr Trp Ala Thr Gly Ser225
230 235 240Val Leu Leu Thr
Ser Asp Ser Gly Arg Val Glu Val Asn Ile Gly Val 245
250 255Gly Phe Met Ser Ser Leu Ile Ser Leu Ser
Ser Gly Leu Pro Ile Glu 260 265
270Leu Ile Val Val Pro His Thr Val Lys Leu Asn Ala Val Thr Ser Asp
275 280 285Thr Thr Trp Phe Gln Leu Asn
Pro Pro Gly Pro Asp Pro Gly Pro Ser 290 295
300Tyr Arg Val Tyr Leu Leu Gly Arg Gly Leu Asp Met Asn Phe Ser
Lys305 310 315 320His Ala
Thr Val Asp Ile Cys Ala Tyr Pro Glu Glu Ser Leu Asp Tyr
325 330 335Arg Tyr His Leu Ser Met Ala
His Thr Glu Ala Leu Arg Met Thr Thr 340 345
350Lys Ala Asp Gln His Asp Ile Asn Glu Glu Ser Tyr Tyr His
Ile Ala 355 360 365Ala Arg Ile Ala
Thr Ser Ile Phe Ala Leu Ser Glu Met Gly Arg Thr 370
375 380Thr Glu Tyr Phe Leu Leu Asp Glu Ile Val Asp Val
Gln Tyr Gln Leu385 390 395
400Lys Phe Leu Asn Tyr Ile Leu Met Arg Ile Gly Ala Gly Ala His Pro
405 410 415Asn Thr Ile Ser Gly
Thr Ser Asp Leu Ile Phe Ala Asp Pro Ser Gln 420
425 430Leu His Asp Glu Leu Ser Leu Leu Phe Gly Gln Val
Lys Pro Ala Asn 435 440 445Val Asp
Tyr Phe Ile Ser Tyr Asp Glu Ala Arg Asp Gln Leu Lys Thr 450
455 460Ala Tyr Ala Leu Ser Arg Gly Gln Asp His Val
Asn Ala Leu Ser Leu465 470 475
480Ala Arg Arg Val Ile Met Ser Ile Tyr Lys Gly Leu Leu Val Lys Gln
485 490 495Asn Leu Asn Ala
Thr Glu Arg Gln Ala Leu Phe Phe Ala Ser Met Ile 500
505 510Leu Leu Asn Phe Arg Glu Gly Leu Glu Asn Ser
Ser Arg Val Leu Asp 515 520 525Gly
Arg Thr Thr Leu Leu Leu Met Thr Ser Met Cys Thr Ala Ala His 530
535 540Ala Thr Gln Ala Ala Leu Asn Ile Gln Glu
Gly Leu Ala Tyr Leu Asn545 550 555
560Pro Ser Lys His Met Phe Thr Ile Pro Asn Val Tyr Ser Pro Cys
Met 565 570 575Gly Ser Leu
Arg Thr Asp Leu Thr Glu Glu Ile His Val Met Asn Leu 580
585 590Leu Ser Ala Ile Pro Thr Arg Pro Gly Leu
Asn Glu Val Leu His Thr 595 600
605Gln Leu Asp Glu Ser Glu Ile Phe Asp Ala Ala Phe Lys Thr Met Met 610
615 620Ile Phe Thr Thr Trp Thr Ala Lys
Asp Leu His Ile Leu His Thr His625 630
635 640Val Pro Glu Val Phe Thr Cys Gln Asp Ala Ala Ala
Arg Asn Gly Glu 645 650
655Tyr Val Leu Ile Leu Pro Ala Val Gln Gly His Ser Tyr Val Ile Thr
660 665 670Arg Asn Lys Pro Gln Arg
Gly Leu Val Tyr Ser Leu Ala Asp Val Asp 675 680
685Val Tyr Asn Pro Ile Ser Val Val Tyr Leu Ser Lys Asp Thr
Cys Val 690 695 700Ser Glu His Gly Val
Ile Glu Thr Val Ala Leu Pro His Pro Asp Asn705 710
715 720Leu Lys Glu Cys Leu Tyr Cys Gly Ser Val
Phe Leu Arg Tyr Leu Thr 725 730
735Thr Gly Ala Ile Met Asp Ile Ile Ile Ile Asp Ser Lys Asp Thr Glu
740 745 750Arg Gln Leu Ala Ala
Met Gly Asn Ser Thr Ile Pro Pro Phe Asn Pro 755
760 765Asp Met His Gly Asp Asp Ser Lys Ala Val Leu Leu
Phe Pro Asn Gly 770 775 780Thr Val Val
Thr Leu Leu Gly Phe Glu Arg Arg Gln Ala Ile Arg Met785
790 795 800Ser Gly Gln Tyr Leu Gly Ala
Ser Leu Gly Gly Ala Phe Leu Ala Val 805
810 815Val Gly Phe Gly Ile Ile Gly Trp Met Leu Cys Gly
Asn Ser Arg Leu 820 825 830Arg
Glu Tyr Asn Lys Ile Pro Leu Thr 835
840
70 160 PRT Varicella zoster virus 70
Met Ala Ser His Lys Trp Leu Leu Gln
Met Ile Val Phe Leu Lys Thr1 5 10
15Ile Thr Ile Ala Tyr Cys Leu His Leu Gln Asp Asp Thr Pro Leu
Phe 20 25 30Phe Gly Ala Lys
Pro Leu Ser Asp Val Ser Leu Ile Ile Thr Glu Pro 35
40 45Cys Val Ser Ser Val Tyr Glu Ala Trp Asp Tyr Ala
Ala Pro Pro Val 50 55 60Ser Asn Leu
Ser Glu Ala Leu Ser Gly Ile Val Val Lys Thr Lys Cys65 70
75 80Pro Val Pro Glu Val Ile Leu Trp
Phe Lys Asp Lys Gln Met Ala Tyr 85 90
95Trp Thr Asn Pro Tyr Val Thr Leu Lys Gly Leu Thr Gln Ser
Val Gly 100 105 110Glu Glu His
Lys Ser Gly Asp Ile Arg Asp Ala Leu Leu Asp Ala Leu 115
120 125Ser Gly Val Trp Val Asp Ser Thr Pro Ser Ser
Thr Asn Ile Pro Glu 130 135 140Asn Gly
Cys Val Trp Gly Ala Asp Arg Leu Phe Gln Arg Val Cys Gln145
150 155 160
71 354 PRT Varicella zoster virus 71
Met Phe Leu Ile Gln Cys Leu Ile Ser Ala Val Ile Phe Tyr Ile Gln1
5 10 15Val Thr Asn Ala
Leu Ile Phe Lys Gly Asp His Val Ser Leu Gln Val 20
25 30Asn Ser Ser Leu Thr Ser Ile Leu Ile Pro Met
Gln Asn Asp Asn Tyr 35 40 45Thr
Glu Ile Lys Gly Gln Leu Val Phe Ile Gly Glu Gln Leu Pro Thr 50
55 60Gly Thr Asn Tyr Ser Gly Thr Leu Glu Leu
Leu Tyr Ala Asp Thr Val65 70 75
80Ala Phe Cys Phe Arg Ser Val Gln Val Ile Arg Tyr Asp Gly Cys
Pro 85 90 95Arg Ile Arg
Thr Ser Ala Phe Ile Ser Cys Arg Tyr Lys His Ser Trp 100
105 110His Tyr Gly Asn Ser Thr Asp Arg Ile Ser
Thr Glu Pro Asp Ala Gly 115 120
125Val Met Leu Lys Ile Thr Lys Pro Gly Ile Asn Asp Ala Gly Val Tyr 130
135 140Val Leu Leu Val Arg Leu Asp His
Ser Arg Ser Thr Asp Gly Phe Ile145 150
155 160Leu Gly Val Asn Val Tyr Thr Ala Gly Ser His His
Asn Ile His Gly 165 170
175Val Ile Tyr Thr Ser Pro Ser Leu Gln Asn Gly Tyr Ser Thr Arg Ala
180 185 190Leu Phe Gln Gln Ala Arg
Leu Cys Asp Leu Pro Ala Thr Pro Lys Gly 195 200
205Ser Gly Thr Ser Leu Phe Gln His Met Leu Asp Leu Arg Ala
Gly Lys 210 215 220Ser Leu Glu Asp Asn
Pro Trp Leu His Glu Asp Val Val Thr Thr Glu225 230
235 240Thr Lys Ser Val Val Lys Glu Gly Ile Glu
Asn His Val Tyr Pro Thr 245 250
255Asp Met Ser Thr Leu Pro Glu Lys Ser Leu Asn Asp Pro Pro Glu Asn
260 265 270Leu Leu Ile Ile Ile
Pro Ile Val Ala Ser Val Met Ile Leu Thr Ala 275
280 285Met Val Ile Val Ile Val Ile Ser Val Lys Arg Arg
Arg Ile Lys Lys 290 295 300His Pro Ile
Tyr Arg Pro Asn Thr Lys Thr Arg Arg Gly Ile Gln Asn305
310 315 320Ala Thr Pro Glu Ser Asp Val
Met Leu Glu Ala Ala Ile Ala Gln Leu 325
330 335Ala Thr Ile Arg Glu Glu Ser Pro Pro His Ser Val
Val Asn Pro Phe 340 345 350Val
Lys
72 623 PRT Varicella zoster virus 72
Met Gly Thr Val Asn Lys Pro Val Val
Gly Val Leu Met Gly Phe Gly1 5 10
15Ile Ile Thr Gly Thr Leu Arg Ile Thr Asn Pro Val Arg Ala Ser
Val 20 25 30Leu Arg Tyr Asp
Asp Phe His Ile Asp Glu Asp Lys Leu Asp Thr Asn 35
40 45Ser Val Tyr Glu Pro Tyr Tyr His Ser Asp His Ala
Glu Ser Ser Trp 50 55 60Val Asn Arg
Gly Glu Ser Ser Arg Lys Ala Tyr Asp His Asn Ser Pro65 70
75 80Tyr Ile Trp Pro Arg Asn Asp Tyr
Asp Gly Phe Leu Glu Asn Ala His 85 90
95Glu His His Gly Val Tyr Asn Gln Gly Arg Gly Ile Asp Ser
Gly Glu 100 105 110Arg Leu Met
Gln Pro Thr Gln Met Ser Ala Gln Glu Asp Leu Gly Asp 115
120 125Asp Thr Gly Ile His Val Ile Pro Thr Leu Asn
Gly Asp Asp Arg His 130 135 140Lys Ile
Val Asn Val Asp Gln Arg Gln Tyr Gly Asp Val Phe Lys Gly145
150 155 160Asp Leu Asn Pro Lys Pro Gln
Gly Gln Arg Leu Ile Glu Val Ser Val 165
170 175Glu Glu Asn His Pro Phe Thr Leu Arg Ala Pro Ile
Gln Arg Ile Tyr 180 185 190Gly
Val Arg Tyr Thr Glu Thr Trp Ser Phe Leu Pro Ser Leu Thr Cys 195
200 205Thr Gly Asp Ala Ala Pro Ala Ile Gln
His Ile Cys Leu Lys His Thr 210 215
220Thr Cys Phe Gln Asp Val Val Val Asp Val Asp Cys Ala Glu Asn Thr225
230 235 240Lys Glu Asp Gln
Leu Ala Glu Ile Ser Tyr Arg Phe Gln Gly Lys Lys 245
250 255Glu Ala Asp Gln Pro Trp Ile Val Val Asn
Thr Ser Thr Leu Phe Asp 260 265
270Glu Leu Glu Leu Asp Pro Pro Glu Ile Glu Pro Gly Val Leu Lys Val
275 280 285Leu Arg Thr Glu Lys Gln Tyr
Leu Gly Val Tyr Ile Trp Asn Met Arg 290 295
300Gly Ser Asp Gly Thr Ser Thr Tyr Ala Thr Phe Leu Val Thr Trp
Lys305 310 315 320Gly Asp
Glu Lys Thr Arg Asn Pro Thr Pro Ala Val Thr Pro Gln Pro
325 330 335Arg Gly Ala Glu Phe His Met
Trp Asn Tyr His Ser His Val Phe Ser 340 345
350Val Gly Asp Thr Phe Ser Leu Ala Met His Leu Gln Tyr Lys
Ile His 355 360 365Glu Ala Pro Phe
Asp Leu Leu Leu Glu Trp Leu Tyr Val Pro Ile Asp 370
375 380Pro Thr Cys Gln Pro Met Arg Leu Tyr Ser Thr Cys
Leu Tyr His Pro385 390 395
400Asn Ala Pro Gln Cys Leu Ser His Met Asn Ser Gly Cys Thr Phe Thr
405 410 415Ser Pro His Leu Ala
Gln Arg Val Ala Ser Thr Val Tyr Gln Asn Cys 420
425 430Glu His Ala Asp Asn Tyr Thr Ala Tyr Cys Leu Gly
Ile Ser His Met 435 440 445Glu Pro
Ser Phe Gly Leu Ile Leu His Asp Gly Gly Thr Thr Leu Lys 450
455 460Phe Val Asp Thr Pro Glu Ser Leu Ser Gly Leu
Tyr Val Phe Val Val465 470 475
480Tyr Phe Asn Gly His Val Glu Ala Val Ala Tyr Thr Val Val Ser Thr
485 490 495Val Asp His Phe
Val Asn Ala Ile Glu Glu Arg Gly Phe Pro Pro Thr 500
505 510Ala Gly Gln Pro Pro Ala Thr Thr Lys Pro Lys
Glu Ile Thr Pro Val 515 520 525Asn
Pro Gly Thr Ser Pro Leu Leu Arg Tyr Ala Ala Trp Thr Gly Gly 530
535 540Leu Ala Ala Val Val Leu Leu Cys Leu Val
Ile Phe Leu Ile Cys Thr545 550 555
560Ala Lys Arg Met Arg Val Lys Ala Tyr Arg Val Asp Lys Ser Pro
Tyr 565 570 575Asn Gln Ser
Met Tyr Tyr Ala Gly Leu Pro Val Asp Asp Phe Glu Asp 580
585 590Ser Glu Ser Thr Asp Thr Glu Glu Glu Phe
Gly Asn Ala Ile Gly Gly 595 600
605Ser His Gly Gly Ser Ser Tyr Thr Val Tyr Ile Asp Lys Thr Arg 610
615 620
73 13339 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide" 73
ataggcggcg catgagagaa gcccagacca attacctacc caaaatggag
aaagttcacg 60ttgacatcga ggaagacagc ccattcctca gagctttgca gcggagcttc
ccgcagtttg 120aggtagaagc caagcaggtc actgataatg accatgctaa tgccagagcg
ttttcgcatc 180tggcttcaaa actgatcgaa acggaggtgg acccatccga cacgatcctt
gacattggaa 240gtgcgcccgc ccgcagaatg tattctaagc acaagtatca ttgtatctgt
ccgatgagat 300gtgcggaaga tccggacaga ttgtataagt atgcaactaa gctgaagaaa
aactgtaagg 360aaataactga taaggaattg gacaagaaaa tgaaggagct cgccgccgtc
atgagcgacc 420ctgacctgga aactgagact atgtgcctcc acgacgacga gtcgtgtcgc
tacgaagggc 480aagtcgctgt ttaccaggat gtatacgcgg ttgacggacc gacaagtctc
tatcaccaag 540ccaataaggg agttagagtc gcctactgga taggctttga caccacccct
tttatgttta 600agaacttggc tggagcatat ccatcatact ctaccaactg ggccgacgaa
accgtgttaa 660cggctcgtaa cataggccta tgcagctctg acgttatgga gcggtcacgt
agagggatgt 720ccattcttag aaagaagtat ttgaaaccat ccaacaatgt tctattctct
gttggctcga 780ccatctacca cgagaagagg gacttactga ggagctggca cctgccgtct
gtatttcact 840tacgtggcaa gcaaaattac acatgtcggt gtgagactat agttagttgc
gacgggtacg 900tcgttaaaag aatagctatc agtccaggcc tgtatgggaa gccttcaggc
tatgctgcta 960cgatgcaccg cgagggattc ttgtgctgca aagtgacaga cacattgaac
ggggagaggg 1020tctcttttcc cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg
actggcatac 1080tggcaacaga tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc
aaccagcgta 1140tagtcgtcaa cggtcgcacc cagagaaaca ccaataccat gaaaaattac
cttttgcccg 1200tagtggccca ggcatttgct aggtgggcaa aggaatataa ggaagatcaa
gaagatgaaa 1260ggccactagg actacgagat agacagttag tcatggggtg ttgttgggct
tttagaaggc 1320acaagataac atctatttat aagcgcccgg atacccaaac catcatcaaa
gtgaacagcg 1380atttccactc attcgtgctg cccaggatag gcagtaacac attggagatc
gggctgagaa 1440caagaatcag gaaaatgtta gaggagcaca aggagccgtc acctctcatt
accgccgagg 1500acgtacaaga agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa
gccgaggagt 1560tgcgcgcagc tctaccacct ttggcagctg atgttgagga gcccactctg
gaagccgatg 1620tagacttgat gttacaagag gctggggccg gctcagtgga gacacctcgt
ggcttgataa 1680aggttaccag ctacgatggc gaggacaaga tcggctctta cgctgtgctt
tctccgcagg 1740ctgtactcaa gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa
gtcatagtga 1800taacacactc tggccgaaaa gggcgttatg ccgtggaacc ataccatggt
aaagtagtgg 1860tgccagaggg acatgcaata cccgtccagg actttcaagc tctgagtgaa
agtgccacca 1920ttgtgtacaa cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc
acacatggag 1980gagcgctgaa cactgatgaa gaatattaca aaactgtcaa gcccagcgag
cacgacggcg 2040aatacctgta cgacatcgac aggaaacagt gcgtcaagaa agaactagtc
actgggctag 2100ggctcacagg cgagctggtg gatcctccct tccatgaatt cgcctacgag
agtctgagaa 2160cacgaccagc cgctccttac caagtaccaa ccataggggt gtatggcgtg
ccaggatcag 2220gcaagtctgg catcattaaa agcgcagtca ccaaaaaaga tctagtggtg
agcgccaaga 2280aagaaaactg tgcagaaatt ataagggacg tcaagaaaat gaaagggctg
gacgtcaatg 2340ccagaactgt ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag
accctgtata 2400ttgacgaagc ttttgcttgt catgcaggta ctctcagagc gctcatagcc
attataagac 2460ctaaaaaggc agtgctctgc ggggatccca aacagtgcgg tttttttaac
atgatgtgcc 2520tgaaagtgca ttttaaccac gagatttgca cacaagtctt ccacaaaagc
atctctcgcc 2580gttgcactaa atctgtgact tcggtcgtct caaccttgtt ttacgacaaa
aaaatgagaa 2640cgacgaatcc gaaagagact aagattgtga ttgacactac cggcagtacc
aaacctaagc 2700aggacgatct cattctcact tgtttcagag ggtgggtgaa gcagttgcaa
atagattaca 2760aaggcaacga aataatgacg gcagctgcct ctcaagggct gacccgtaaa
ggtgtgtatg 2820ccgttcggta caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa
catgtgaacg 2880tcctactgac ccgcacggag gaccgcatcg tgtggaaaac actagccggc
gacccatgga 2940taaaaacact gactgccaag taccctggga atttcactgc cacgatagag
gagtggcaag 3000cagagcatga tgccatcatg aggcacatct tggagagacc ggaccctacc
gacgtcttcc 3060agaataaggc aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag
accgctggca 3120tagacatgac cactgaacaa tggaacactg tggattattt tgaaacggac
aaagctcact 3180cagcagagat agtattgaac caactatgcg tgaggttctt tggactcgat
ctggactccg 3240gtctattttc tgcacccact gttccgttat ccattaggaa taatcactgg
gataactccc 3300cgtcgcctaa catgtacggg ctgaataaag aagtggtccg tcagctctct
cgcaggtacc 3360cacaactgcc tcgggcagtt gccactggaa gagtctatga catgaacact
ggtacactgc 3420gcaattatga tccgcgcata aacctagtac ctgtaaacag aagactgcct
catgctttag 3480tcctccacca taatgaacac ccacagagtg acttttcttc attcgtcagc
aaattgaagg 3540gcagaactgt cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg
gttgactggt 3600tgtcagaccg gcctgaggct accttcagag ctcggctgga tttaggcatc
ccaggtgatg 3660tgcccaaata tgacataata tttgttaatg tgaggacccc atataaatac
catcactatc 3720agcagtgtga agaccatgcc attaagctta gcatgttgac caagaaagct
tgtctgcatc 3780tgaatcccgg cggaacctgt gtcagcatag gttatggtta cgctgacagg
gccagcgaaa 3840gcatcattgg tgctatagcg cggcagttca agttttcccg ggtatgcaaa
ccgaaatcct 3900cacttgaaga gacggaagtt ctgtttgtat tcattgggta cgatcgcaag
gcccgtacgc 3960acaatcctta caagctttca tcaaccttga ccaacattta tacaggttcc
agactccacg 4020aagccggatg tgcaccctca tatcatgtgg tgcgagggga tattgccacg
gccaccgaag 4080gagtgattat aaatgctgct aacagcaaag gacaacctgg cggaggggtg
tgcggagcgc 4140tgtataagaa attcccggaa agcttcgatt tacagccgat cgaagtagga
aaagcgcgac 4200tggtcaaagg tgcagctaaa catatcattc atgccgtagg accaaacttc
aacaaagttt 4260cggaggttga aggtgacaaa cagttggcag aggcttatga gtccatcgct
aagattgtca 4320acgataacaa ttacaagtca gtagcgattc cactgttgtc caccggcatc
ttttccggga 4380acaaagatcg actaacccaa tcattgaacc atttgctgac agctttagac
accactgatg 4440cagatgtagc catatactgc agggacaaga aatgggaaat gactctcaag
gaagcagtgg 4500ctaggagaga agcagtggag gagatatgca tatccgacga ctcttcagtg
acagaacctg 4560atgcagagct ggtgagggtg catccgaaga gttctttggc tggaaggaag
ggctacagca 4620caagcgatgg caaaactttc tcatatttgg aagggaccaa gtttcaccag
gcggccaagg 4680atatagcaga aattaatgcc atgtggcccg ttgcaacgga ggccaatgag
caggtatgca 4740tgtatatcct cggagaaagc atgagcagta ttaggtcgaa atgccccgtc
gaagagtcgg 4800aagcctccac accacctagc acgctgcctt gcttgtgcat ccatgccatg
actccagaaa 4860gagtacagcg cctaaaagcc tcacgtccag aacaaattac tgtgtgctca
tcctttccat 4920tgccgaagta tagaatcact ggtgtgcaga agatccaatg ctcccagcct
atattgttct 4980caccgaaagt gcctgcgtat attcatccaa ggaagtatct cgtggaaaca
ccaccggtag 5040acgagactcc ggagccatcg gcagagaacc aatccacaga ggggacacct
gaacaaccac 5100cacttataac cgaggatgag accaggacta gaacgcctga gccgatcatc
atcgaagagg 5160aagaagagga tagcataagt ttgctgtcag atggcccgac ccaccaggtg
ctgcaagtcg 5220aggcagacat tcacgggccg ccctctgtat ctagctcatc ctggtccatt
cctcatgcat 5280ccgactttga tgtggacagt ttatccatac ttgacaccct ggagggagct
agcgtgacca 5340gcggggcaac gtcagccgag actaactctt acttcgcaaa gagtatggag
tttctggcgc 5400gaccggtgcc tgcgcctcga acagtattca ggaaccctcc acatcccgct
ccgcgcacaa 5460gaacaccgtc acttgcaccc agcagggcct gctcgagaac cagcctagtt
tccaccccgc 5520caggcgtgaa tagggtgatc actagagagg agctcgaggc gcttaccccg
tcacgcactc 5580ctagcaggtc ggtctcgaga accagcctgg tctccaaccc gccaggcgta
aatagggtga 5640ttacaagaga ggagtttgag gcgttcgtag cacaacaaca atgacggttt
gatgcgggtg 5700catacatctt ttcctccgac accggtcaag ggcatttaca acaaaaatca
gtaaggcaaa 5760cggtgctatc cgaagtggtg ttggagagga ccgaattgga gatttcgtat
gccccgcgcc 5820tcgaccaaga aaaagaagaa ttactacgca agaaattaca gttaaatccc
acacctgcta 5880acagaagcag ataccagtcc aggaaggtgg agaacatgaa agccataaca
gctagacgta 5940ttctgcaagg cctagggcat tatttgaagg cagaaggaaa agtggagtgc
taccgaaccc 6000tgcatcctgt tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc
cccaaggtcg 6060cagtggaagc ctgtaacgcc atgttgaaag agaactttcc gactgtggct
tcttactgta 6120ttattccaga gtacgatgcc tatttggaca tggttgacgg agcttcatgc
tgcttagaca 6180ctgccagttt ttgccctgca aagctgcgca gctttccaaa gaaacactcc
tatttggaac 6240ccacaatacg atcggcagtg ccttcagcga tccagaacac gctccagaac
gtcctggcag 6300ctgccacaaa aagaaattgc aatgtcacgc aaatgagaga attgcccgta
ttggattcgg 6360cggcctttaa tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat
tgggaaacgt 6420ttaaagaaaa ccccatcagg cttactgaag aaaacgtggt aaattacatt
accaaattaa 6480aaggaccaaa agctgctgct ctttttgcga agacacataa tttgaatatg
ttgcaggaca 6540taccaatgga caggtttgta atggacttaa agagagacgt gaaagtgact
ccaggaacaa 6600aacatactga agaacggccc aaggtacagg tgatccaggc tgccgatccg
ctagcaacag 6660cgtatctgtg cggaatccac cgagagctgg ttaggagatt aaatgcggtc
ctgcttccga 6720acattcatac actgtttgat atgtcggctg aagactttga cgctattata
gccgagcact 6780tccagcctgg ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa
agtgaggacg 6840acgccatggc tctgaccgcg ttaatgattc tggaagactt aggtgtggac
gcagagctgt 6900tgacgctgat tgaggcggct ttcggcgaaa tttcatcaat acatttgccc
actaaaacta 6960aatttaaatt cggagccatg atgaaatctg gaatgttcct cacactgttt
gtgaacacag 7020tcattaacat tgtaatcgca agcagagtgt tgagagaacg gctaaccgga
tcaccatgtg 7080cagcattcat tggagatgac aatatcgtga aaggagtcaa atcggacaaa
ttaatggcag 7140acaggtgcgc cacctggttg aatatggaag tcaagattat agatgctgtg
gtgggcgaga 7200aagcgcctta tttctgtgga gggtttattt tgtgtgactc cgtgaccggc
acagcgtgcc 7260gtgtggcaga ccccctaaaa aggctgttta agcttggcaa acctctggca
gcagacgatg 7320aacatgatga tgacaggaga agggcattgc atgaagagtc aacacgctgg
aaccgagtgg 7380gtattctttc agagctgtgc aaggcagtag aatcaaggta tgaaaccgta
ggaacttcca 7440tcatagttat ggccatgact actctagcta gcagtgttaa atcattcagc
tacctgagag 7500gggcccctat aactctctac ggctaacctg aatggactac gacatagtct
agtcgagtct 7560agtcgacgcc accatgttcg tgaccgccgt ggtgtccgtg tcccccagca
gcttttacga 7620gagcctgcag gtcgagccca cccagagcga ggacatcaca agatctgccc
acctgggcga 7680cggcgacgag atcagagagg ccatccacaa gagccaggac gccgagacaa
agcccacctt 7740ctacgtgtgc cccccaccta ccggctctac aattgtgcgg ctggaacccc
ccagaacctg 7800ccctgattac cacctgggca agaacttcac cgagggaatt gccgtggtgt
acaaagagaa 7860tatcgccgcc tacaagttca aggccaccgt gtactacaag gacgtgatcg
tgtccaccgc 7920ctgggccggc agcagctaca cccagatcac caacagatac gccgaccggg
tgcccatccc 7980cgtgtctgag atcaccgaca ccatcgacaa gttcggcaag tgcagcagca
aggccaccta 8040cgtgcggaac aaccacaagg tggaagcctt caacgaggac aagaaccccc
aggacatgcc 8100cctgatcgcc agcaagtaca acagcgtggg ctccaaggcc tggcacacca
ccaacgacac 8160ctacatggtg gccggcaccc ccggcacata cagaacaggc accagcgtga
actgcatcat 8220cgaggaagtg gaagcccggt ccatcttccc atacgacagc ttcggcctga
gcaccggcga 8280cattatctac atgagccctt tcttcggcct gcgggacggc gcctacagag
agcacagcaa 8340ctacgccatg gaccggttcc accagttcga gggctacaga cagcgggacc
tggacacaag 8400agccctgctg gaacctgccg ccagaaactt cctggtcacc cctcacctga
ccgtgggctg 8460gaactggaag cccaagcgga ccgaagtgtg cagcctggtc aagtggcgcg
aggtggaaga 8520tgtcgtgcgg gatgagtacg cccacaactt ccggttcacc atgaagaccc
tgagcaccac 8580cttcatcagc gagacaaacg agttcaacct gaaccagatc cacctgagcc
agtgcgtgaa 8640agaggaagcc agagccatca tcaaccggat ctacaccacc cggtacaaca
gcagccacgt 8700gcggaccggc gatatccaga cctatctggc tagaggcggc ttcgtggtgg
tgtttcagcc 8760cctgctgagc aacagcctgg ctagactgta cctgcaggaa ctcgtcagag
agaacaccaa 8820ccacagcccc cagaagcacc ccacccggaa taccagatcc agacgcagcg
tgcccgtgga 8880actgagagcc aaccggacca tcaccaccac cagcagcgtg gaattcgcca
tgctgcagtt 8940cacctacgac cacatccagg aacacgtgaa cgagatgctg gcccggatca
gcagcagttg 9000gtgccagctg cagaatcggg aaagggccct gtggtccggc ctgttcccca
tcaatccaag 9060cgccctggcc agcaccatcc tggaccagag agtgaaggcc agaatcctgg
gggacgtgat 9120cagcgtgtcc aactgtcctg agctgggcag cgacacccgg atcatcctgc
agaacagcat 9180gcgggtgtcc ggcagcacca ccagatgcta cagcagaccc ctgatcagca
tcgtgtccct 9240gaacggcagc ggcacagtgg aaggccagct gggcaccgat aacgagctga
tcatgagccg 9300ggacctgctc gaaccctgcg tggccaatca caagcggtac tttctgttcg
gccaccacta 9360cgtgtactat gaggactaca gatacgtgcg cgagatcgcc gtgcacgacg
tgggcatgat 9420cagcacctac gtggacctga acctgaccct gctgaaggac cgcgagttca
tgccactgca 9480ggtctacacc cgggacgagc tgagagatac cggcctgctg gactacagcg
agatccagcg 9540gcggaaccag atgcactccc tgcggttcta cgacatcgac aaggtggtgc
agtacgacag 9600cggcaccgcc atcatgcagg gcatggccca gttctttcag ggcctgggaa
cagccggaca 9660ggccgtggga catgtggtgc tgggagctac aggcgccctg ctgtctaccg
tgcacggctt 9720caccaccttt ctgagcaacc ccttcggagc cctggctgtg ggactgctgg
tcctggctgg 9780actggtggcc gccttctttg cctaccgcta cgtgctgaag ctgaaaacca
gccccatgaa 9840ggccctgtac cccctgacca ccaagggcct gaagcagctg cctgagggca
tggacccctt 9900cgccgagaag cccaatgcca ccgacacccc catcgaggaa atcggcgaca
gccagaacac 9960cgagccctcc gtgaacagcg gcttcgaccc cgacaagttt cgcgaggccc
aggaaatgat 10020caagtacatg accctggtgt ctgctgccga gcggcaggaa agcaaggccc
ggaagaagaa 10080caagacctcc gccctgctga ccagcagact gacaggactg gccctgcgga
acagacgggg 10140ctatagcaga gtgcggaccg agaatgtgac cggcgtgtaa tctagacgcg
gccgcataca 10200gcagcaattg gcaagctgct tacatagaac tcgcggcgat tggcatgccg
ccttaaaatt 10260tttattttat ttttcttttc ttttccgaat cggattttgt ttttaatatt
tcaaaaaaaa 10320aaaaaaaaaa aaaaaaaaaa aaaaaaaggg tcggcatggc atctccacct
cctcgcggtc 10380cgacctgggc atccgaagga ggacgcacgt ccactcggat ggctaaggga
gagccacgtt 10440taaaccagct ccaattcgcc ctatagtgag tcgtattacg cgcgctcact
ggccgtcgtt 10500ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgcct
tgcagcacat 10560ccccctttcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc
ttcccaacag 10620ttgcgcagcc tgaatggcga atgggacgcg ccctgtagcg gcgcattaag
cgcggcgggt 10680gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc
cgctcctttc 10740gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc
tctaaatcgg 10800gggctccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa
aaaacttgat 10860tagggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg
ccctttgacg 10920ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac
actcaaccct 10980atctcggtct attcttttga tttataaggg attttgccga tttcggccta
ttggttaaaa 11040aatgagctga tttaacaaaa atttaacgcg aattttaaca aaatattaac
gcttacaatt 11100taggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt
ttctaaatac 11160attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa
taatattgaa 11220aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt
tttgcggcat 11280tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat
gctgaagatc 11340agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag
atccttgaga 11400gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg
ctatgtggcg 11460cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata
cactattctc 11520agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat
ggcatgacag 11580taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc
aacttacttc 11640tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg
ggggatcatg 11700taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac
gacgagcgtg 11760acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact
ggcgaactac 11820ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa
gttgcaggac 11880cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct
ggagccggtg 11940agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc
tcccgtatcg 12000tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga
cagatcgctg 12060agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac
tcatatatac 12120tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag
atcctttttg 12180ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg
tcagaccccg 12240tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc
tgctgcttgc 12300aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag
ctaccaactc 12360tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtt
cttctagtgt 12420agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac
ctcgctctgc 12480taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc
gggttggact 12540caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt
tcgtgcacac 12600agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt
gagctatgag 12660aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc
ggcagggtcg 12720gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt
tatagtcctg 12780tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca
ggggggcgga 12840gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt
tgctggcctt 12900ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt
attaccgcct 12960ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag
tcagtgagcg 13020aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg
ccgattcatt 13080aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc
aacgcaatta 13140atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctc
ccggctcgta 13200tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat
gaccatgatt 13260acgccaagcg cgcaattaac cctcactaaa gggaacaaaa gctgggtacc
gggcccacgc 13320gtaatacgac tcactatag
13339
74 13258 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide" 74
ataggcggcg
catgagagaa gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga
ggaagacagc ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc
caagcaggtc actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa
actgatcgaa acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc
ccgcagaatg tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga
tccggacaga ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga
taaggaattg gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga
aactgagact atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt
ttaccaggat gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg
agttagagtc gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc
tggagcatat ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa
cataggccta tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag
aaagaagtat ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca
cgagaagagg gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa
gcaaaattac acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag
aatagctatc agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg
cgagggattc ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc
cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga
tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa
cggtcgcacc cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca
ggcatttgct aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg
actacgagat agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac
atctatttat aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc
attcgtgctg cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag
gaaaatgtta gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga
agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc
tctaccacct ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat
gttacaagag gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag
ctacgatggc gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa
gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc
tggccgaaaa gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg
acatgcaata cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa
cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa
cactgatgaa gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta
cgacatcgac aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg
cgagctggtg gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc
cgctccttac caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg
catcattaaa agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg
tgcagaaatt ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt
ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc
ttttgcttgt catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc
agtgctctgc ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca
ttttaaccac gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa
atctgtgact tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc
gaaagagact aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct
cattctcact tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga
aataatgacg gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta
caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac
ccgcacggag gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact
gactgccaag taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga
tgccatcatg aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc
aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac
cactgaacaa tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat
agtattgaac caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc
tgcacccact gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa
catgtacggg ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc
tcgggcagtt gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga
tccgcgcata aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca
taatgaacac ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt
cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg
gcctgaggct accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata
tgacataata tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga
agaccatgcc attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg
cggaacctgt gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg
tgctatagcg cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga
gacggaagtt ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta
caagctttca tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg
tgcaccctca tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat
aaatgctgct aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa
attcccggaa agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg
tgcagctaaa catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga
aggtgacaaa cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa
ttacaagtca gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg
actaacccaa tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc
catatactgc agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga
agcagtggag gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct
ggtgagggtg catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg
caaaactttc tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga
aattaatgcc atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct
cggagaaagc atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac
accacctagc acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg
cctaaaagcc tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta
tagaatcact ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt
gcctgcgtat attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc
ggagccatcg gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac
cgaggatgag accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga
tagcataagt ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat
tcacgggccg ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga
tgtggacagt ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac
gtcagccgag actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc
tgcgcctcga acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc
acttgcaccc agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa
tagggtgatc actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc
ggtctcgaga accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga
ggagtttgag gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt
ttcctccgac accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc
cgaagtggtg ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga
aaaagaagaa ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag
ataccagtcc aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg
cctagggcat tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt
tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc
ctgtaacgcc atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga
gtacgatgcc tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt
ttgccctgca aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg
atcggcagtg ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa
aagaaattgc aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa
tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa
ccccatcagg cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa
agctgctgct ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga
caggtttgta atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga
agaacggccc aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg
cggaatccac cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac
actgtttgat atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg
ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc
tctgaccgcg ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat
tgaggcggct ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt
cggagccatg atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat
tgtaatcgca agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat
tggagatgac aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc
cacctggttg aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta
tttctgtgga gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga
ccccctaaaa aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga
tgacaggaga agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc
agagctgtgc aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat
ggccatgact actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat
aactctctac ggctaacctg aatggactac gacatagtct agtcgagtct 7560agtcgacgcc
accatgttcg ccctggtgct ggccgtggtc atcctgcctc tgtggaccac 7620cgccaacaag
agctacgtga cccccacacc cgccaccaga tccatcggac acatgagcgc 7680cctgctgaga
gagtacagcg accggaacat gagcctgaag ctggaagcct tctaccccac 7740cggcttcgac
gaggaactga tcaagagcct gcactggggc aacgaccgga agcacgtgtt 7800cctcgtgatc
gtgaaagtga accccaccac ccacgagggc gacgtcggcc tggtcatctt 7860ccccaagtac
ctgctgagcc cctaccactt caaggccgag cacagagccc ccttccctgc 7920tggccgcttt
ggctttctga gccaccctgt gacccccgac gtgtcattct tcgacagcag 7980cttcgccccc
tacctgacca cacagcacct ggtggccttc accaccttcc cccccaatcc 8040tctcgtgtgg
cacctggaaa gagccgagac agccgccacc gccgaaagac cttttggcgt 8100gtccctgctg
cccgccagac ctaccgtgcc caagaacacc atcctggaac acaaggccca 8160cttcgccacc
tgggatgccc tggccagaca caccttcttt agcgccgagg ccatcatcac 8220caacagcacc
ctgagaatcc acgtgcccct gttcggcagc gtgtggccca tcagatactg 8280ggccacaggc
agcgtgctgc tgaccagcga tagcggcaga gtggaagtga acatcggcgt 8340gggcttcatg
agcagcctga tcagcctgag cagcggcctg cccatcgagc tgattgtggt 8400gccccacacc
gtgaagctga acgccgtgac cagcgacacc acctggttcc agctgaaccc 8460ccctggccct
gatcctggcc ctagttacag agtgtacctg ctgggcagag gcctggacat 8520gaacttcagc
aagcacgcca ccgtggacat ctgcgcctac cctgaggaaa gcctggacta 8580cagataccac
ctgagcatgg cccacaccga ggccctgaga atgaccacca aggccgacca 8640gcacgacatc
aacgaggaaa gctactacca cattgccgcc agaatcgcca ccagcatctt 8700cgccctgagc
gagatgggcc ggaccaccga gtactttctg ctggacgaga tcgtggacgt 8760gcagtaccag
ctgaagttcc tgaactacat cctgatgcgg atcggcgctg gcgcccaccc 8820taataccatc
agcggcacca gcgacctgat cttcgccgat cctagccagc tgcacgacga 8880gctgagcctg
ctgttcggcc aggtcaaacc cgccaacgtg gactacttca tcagctacga 8940cgaggcccgg
gaccagctga aaacagccta cgccctgtcc agaggccagg atcatgtgaa 9000cgccctgtcc
ctggccaggc gcgtgatcat gagcatctac aagggcctgc tggtcaagca 9060gaacctgaac
gccaccgagc ggcaggccct gttcttcgcc agcatgatcc tgctgaactt 9120cagagagggc
ctggaaaaca gcagccgggt gctggatggc agaaccaccc tgctgctgat 9180gaccagcatg
tgcacagccg cccatgccac acaggccgcc ctgaatatcc aggaaggcct 9240ggcttacctg
aaccccagca agcacatgtt caccatcccc aacgtgtaca gcccctgcat 9300gggcagcctg
agaaccgacc tgaccgaaga gatccacgtg atgaacctgc tgtccgccat 9360ccccaccaga
cccggactga atgaggtgct gcacacccag ctggacgagt ccgagatctt 9420cgacgccgcc
ttcaagacca tgatgatctt taccacctgg accgccaagg acctgcacat 9480cctgcacaca
cacgtgcccg aggtgttcac atgccaagat gccgccgctc ggaacggcga 9540gtatgtgctg
attctgcctg ccgtgcaggg ccacagctac gtgatcaccc ggaacaagcc 9600ccagcggggc
ctggtgtata gcctggctga cgtggacgtg tacaacccca tcagcgtggt 9660gtacctgagc
aaggatacct gcgtgtccga gcacggcgtg atcgaaacag tggccctgcc 9720ccaccccgac
aacctgaaag agtgcctgta ctgcggctcc gtgttcctgc ggtatctgac 9780caccggcgcc
atcatggaca tcatcatcat cgacagcaag gacaccgaga gacagctggc 9840cgccatgggc
aacagcacca tccccccctt caaccccgac atgcacggcg acgatagcaa 9900ggccgtgctg
ctgttcccca acggcaccgt ggtcacactg ctgggcttcg agcggagaca 9960ggccatcaga
atgagcggcc agtacctggg cgcctctctg ggtggtgcct ttctggccgt 10020cgtgggcttt
ggcatcatcg gctggatgct gtgcggcaac agcagactgc gcgagtacaa 10080caagatcccc
ctgacctaat ctagacgcgg ccgcatacag cagcaattgg caagctgctt 10140acatagaact
cgcggcgatt ggcatgccgc cttaaaattt ttattttatt tttcttttct 10200tttccgaatc
ggattttgtt tttaatattt caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 10260aaaaaagggt
cggcatggca tctccacctc ctcgcggtcc gacctgggca tccgaaggag 10320gacgcacgtc
cactcggatg gctaagggag agccacgttt aaaccagctc caattcgccc 10380tatagtgagt
cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa 10440aaccctggcg
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 10500aatagcgaag
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 10560tgggacgcgc
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg 10620accgctacac
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc 10680gccacgttcg
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 10740tttagtgctt
tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt 10800gggccatcgc
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 10860agtggactct
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat 10920ttataaggga
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa 10980tttaacgcga
attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa 11040atgtgcgcgg
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca 11100tgagacaata
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 11160aacatttccg
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 11220acccagaaac
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 11280acatcgaact
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 11340ttccaatgat
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg 11400ccgggcaaga
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 11460caccagtcac
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 11520ccataaccat
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 11580aggagctaac
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg 11640aaccggagct
gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 11700tggcaacaac
gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 11760aattaataga
ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 11820cggctggctg
gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 11880ttgcagcact
ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga 11940gtcaggcaac
tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 12000agcattggta
actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 12060atttttaatt
taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 12120cttaacgtga
gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 12180cttgagatcc
tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 12240cagcggtggt
ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 12300tcagcagagc
gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact 12360tcaagaactc
tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 12420ctgccagtgg
cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 12480aggcgcagcg
gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 12540cctacaccga
actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag 12600ggagaaaggc
ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 12660agcttccagg
gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac 12720ttgagcgtcg
atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca 12780acgcggcctt
tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg 12840cgttatcccc
tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 12900gccgcagccg
aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 12960tacgcaaacc
gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 13020ttcccgactg
gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt 13080aggcacccca
ggctttacac tttatgctcc cggctcgtat gttgtgtgga attgtgagcg 13140gataacaatt
tcacacagga aacagctatg accatgatta cgccaagcgc gcaattaacc 13200ctcactaaag
ggaacaaaag ctgggtaccg ggcccacgcg taatacgact cactatag 13258

SEQ ID NO: 75; length: 11215; type: DNA; organism: Artificial Sequence
source/note="Description of Artificial Sequence Synthetic polynucleotide"
Sequence 75:
ataggcggcg catgagagaa
gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga ggaagacagc
ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc caagcaggtc
actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa actgatcgaa
acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc ccgcagaatg
tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga tccggacaga
ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga taaggaattg
gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga aactgagact
atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt ttaccaggat
gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg agttagagtc
gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc tggagcatat
ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa cataggccta
tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag aaagaagtat
ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca cgagaagagg
gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa gcaaaattac
acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag aatagctatc
agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg cgagggattc
ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc cgtgtgcacg
tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga tgtcagtgcg
gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa cggtcgcacc
cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca ggcatttgct
aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg actacgagat
agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac atctatttat
aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc attcgtgctg
cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag gaaaatgtta
gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga agctaagtgc
gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc tctaccacct
ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat gttacaagag
gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag ctacgatggc
gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa gagtgaaaaa
ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc tggccgaaaa
gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg acatgcaata
cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa cgaacgtgag
ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa cactgatgaa
gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta cgacatcgac
aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg cgagctggtg
gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc cgctccttac
caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg catcattaaa
agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg tgcagaaatt
ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt ggactcagtg
ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc ttttgcttgt
catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc agtgctctgc
ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca ttttaaccac
gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa atctgtgact
tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc gaaagagact
aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct cattctcact
tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga aataatgacg
gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta caaggtgaat
gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac ccgcacggag
gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact gactgccaag
taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga tgccatcatg
aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc aaacgtgtgt
tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac cactgaacaa
tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat agtattgaac
caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc tgcacccact
gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa catgtacggg
ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc tcgggcagtt
gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga tccgcgcata
aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca taatgaacac
ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt cctggtggtc
ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg gcctgaggct
accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata tgacataata
tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga agaccatgcc
attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg cggaacctgt
gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg tgctatagcg
cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga gacggaagtt
ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta caagctttca
tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg tgcaccctca
tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat aaatgctgct
aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa attcccggaa
agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg tgcagctaaa
catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga aggtgacaaa
cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa ttacaagtca
gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg actaacccaa
tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc catatactgc
agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga agcagtggag
gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct ggtgagggtg
catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg caaaactttc
tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga aattaatgcc
atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct cggagaaagc
atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac accacctagc
acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg cctaaaagcc
tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta tagaatcact
ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt gcctgcgtat
attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc ggagccatcg
gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac cgaggatgag
accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga tagcataagt
ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat tcacgggccg
ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga tgtggacagt
ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac gtcagccgag
actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc tgcgcctcga
acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc acttgcaccc
agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa tagggtgatc
actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc ggtctcgaga
accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga ggagtttgag
gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt ttcctccgac
accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc cgaagtggtg
ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga aaaagaagaa
ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag ataccagtcc
aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg cctagggcat
tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt tcctttgtat
tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc ctgtaacgcc
atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga gtacgatgcc
tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt ttgccctgca
aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg atcggcagtg
ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa aagaaattgc
aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa tgtggaatgc
ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa ccccatcagg
cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa agctgctgct
ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga caggtttgta
atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga agaacggccc
aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg cggaatccac
cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac actgtttgat
atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg ggattgtgtt
ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc tctgaccgcg
ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat tgaggcggct
ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt cggagccatg
atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat tgtaatcgca
agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat tggagatgac
aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc cacctggttg
aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta tttctgtgga
gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga ccccctaaaa
aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga tgacaggaga
agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc agagctgtgc
aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat ggccatgact
actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat aactctctac
ggctaacctg aatggactac gacatagtct agtcgagtct 7560agtcgacgcc accatggcca
gccacaagtg gctgctgcag atgatcgtgt tcctgaaaac 7620catcacaatc gcctactgcc
tgcatctgca ggacgacacc cctctgttct tcggcgccaa 7680gcctctgagc gacgtgtccc
tgatcatcac cgagccttgc gtgtccagcg tgtacgaggc 7740ctgggattat gccgcccctc
ccgtgtccaa tctgagcgaa gccctgagcg gcatcgtggt 7800caagaccaag tgccccgtgc
ccgaagtgat cctgtggttc aaggacaagc agatggccta 7860ctggaccaac ccttacgtga
ccctgaaggg cctgacccag agcgtgggcg aggaacacaa 7920gagcggcgac atcagagatg
ccctgctgga tgccctgtcc ggtgtctggg tggacagcac 7980accctccagc accaacatcc
ccgagaacgg ctgtgtgtgg ggagccgacc ggctgttcca 8040gagagtgtgt cagtaatcta
gacgcggccg catacagcag caattggcaa gctgcttaca 8100tagaactcgc ggcgattggc
atgccgcctt aaaattttta ttttattttt cttttctttt 8160ccgaatcgga ttttgttttt
aatatttcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 8220aaagggtcgg catggcatct
ccacctcctc gcggtccgac ctgggcatcc gaaggaggac 8280gcacgtccac tcggatggct
aagggagagc cacgtttaaa ccagctccaa ttcgccctat 8340agtgagtcgt attacgcgcg
ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac 8400cctggcgtta cccaacttaa
tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 8460agcgaagagg cccgcaccga
tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 8520gacgcgccct gtagcggcgc
attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc 8580gctacacttg ccagcgccct
agcgcccgct cctttcgctt tcttcccttc ctttctcgcc 8640acgttcgccg gctttccccg
tcaagctcta aatcgggggc tccctttagg gttccgattt 8700agtgctttac ggcacctcga
ccccaaaaaa cttgattagg gtgatggttc acgtagtggg 8760ccatcgccct gatagacggt
ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 8820ggactcttgt tccaaactgg
aacaacactc aaccctatct cggtctattc ttttgattta 8880taagggattt tgccgatttc
ggcctattgg ttaaaaaatg agctgattta acaaaaattt 8940aacgcgaatt ttaacaaaat
attaacgctt acaatttagg tggcactttt cggggaaatg 9000tgcgcggaac ccctatttgt
ttatttttct aaatacattc aaatatgtat ccgctcatga 9060gacaataacc ctgataaatg
cttcaataat attgaaaaag gaagagtatg agtattcaac 9120atttccgtgt cgcccttatt
cccttttttg cggcattttg ccttcctgtt tttgctcacc 9180cagaaacgct ggtgaaagta
aaagatgctg aagatcagtt gggtgcacga gtgggttaca 9240tcgaactgga tctcaacagc
ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 9300caatgatgag cacttttaaa
gttctgctat gtggcgcggt attatcccgt attgacgccg 9360ggcaagagca actcggtcgc
cgcatacact attctcagaa tgacttggtt gagtactcac 9420cagtcacaga aaagcatctt
acggatggca tgacagtaag agaattatgc agtgctgcca 9480taaccatgag tgataacact
gcggccaact tacttctgac aacgatcgga ggaccgaagg 9540agctaaccgc ttttttgcac
aacatggggg atcatgtaac tcgccttgat cgttgggaac 9600cggagctgaa tgaagccata
ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 9660caacaacgtt gcgcaaacta
ttaactggcg aactacttac tctagcttcc cggcaacaat 9720taatagactg gatggaggcg
gataaagttg caggaccact tctgcgctcg gcccttccgg 9780ctggctggtt tattgctgat
aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 9840cagcactggg gccagatggt
aagccctccc gtatcgtagt tatctacacg acggggagtc 9900aggcaactat ggatgaacga
aatagacaga tcgctgagat aggtgcctca ctgattaagc 9960attggtaact gtcagaccaa
gtttactcat atatacttta gattgattta aaacttcatt 10020tttaatttaa aaggatctag
gtgaagatcc tttttgataa tctcatgacc aaaatccctt 10080aacgtgagtt ttcgttccac
tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 10140gagatccttt ttttctgcgc
gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 10200cggtggtttg tttgccggat
caagagctac caactctttt tccgaaggta actggcttca 10260gcagagcgca gataccaaat
actgttcttc tagtgtagcc gtagttaggc caccacttca 10320agaactctgt agcaccgcct
acatacctcg ctctgctaat cctgttacca gtggctgctg 10380ccagtggcga taagtcgtgt
cttaccgggt tggactcaag acgatagtta ccggataagg 10440cgcagcggtc gggctgaacg
gggggttcgt gcacacagcc cagcttggag cgaacgacct 10500acaccgaact gagataccta
cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 10560gaaaggcgga caggtatccg
gtaagcggca gggtcggaac aggagagcgc acgagggagc 10620ttccaggggg aaacgcctgg
tatctttata gtcctgtcgg gtttcgccac ctctgacttg 10680agcgtcgatt tttgtgatgc
tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 10740cggccttttt acggttcctg
gccttttgct ggccttttgc tcacatgttc tttcctgcgt 10800tatcccctga ttctgtggat
aaccgtatta ccgcctttga gtgagctgat accgctcgcc 10860gcagccgaac gaccgagcgc
agcgagtcag tgagcgagga agcggaagag cgcccaatac 10920gcaaaccgcc tctccccgcg
cgttggccga ttcattaatg cagctggcac gacaggtttc 10980ccgactggaa agcgggcagt
gagcgcaacg caattaatgt gagttagctc actcattagg 11040caccccaggc tttacacttt
atgctcccgg ctcgtatgtt gtgtggaatt gtgagcggat 11100aacaatttca cacaggaaac
agctatgacc atgattacgc caagcgcgca attaaccctc 11160actaaaggga acaaaagctg
ggtaccgggc ccacgcgtaa tacgactcac tatag 11215

SEQ ID NO: 76; length: 13827; type: DNA; organism: Artificial Sequence
source/note="Description of Artificial Sequence Synthetic polynucleotide"
Sequence 76:
ataggcggcg catgagagaa gcccagacca attacctacc caaaatggag
aaagttcacg 60ttgacatcga ggaagacagc ccattcctca gagctttgca gcggagcttc
ccgcagtttg 120aggtagaagc caagcaggtc actgataatg accatgctaa tgccagagcg
ttttcgcatc 180tggcttcaaa actgatcgaa acggaggtgg acccatccga cacgatcctt
gacattggaa 240gtgcgcccgc ccgcagaatg tattctaagc acaagtatca ttgtatctgt
ccgatgagat 300gtgcggaaga tccggacaga ttgtataagt atgcaactaa gctgaagaaa
aactgtaagg 360aaataactga taaggaattg gacaagaaaa tgaaggagct cgccgccgtc
atgagcgacc 420ctgacctgga aactgagact atgtgcctcc acgacgacga gtcgtgtcgc
tacgaagggc 480aagtcgctgt ttaccaggat gtatacgcgg ttgacggacc gacaagtctc
tatcaccaag 540ccaataaggg agttagagtc gcctactgga taggctttga caccacccct
tttatgttta 600agaacttggc tggagcatat ccatcatact ctaccaactg ggccgacgaa
accgtgttaa 660cggctcgtaa cataggccta tgcagctctg acgttatgga gcggtcacgt
agagggatgt 720ccattcttag aaagaagtat ttgaaaccat ccaacaatgt tctattctct
gttggctcga 780ccatctacca cgagaagagg gacttactga ggagctggca cctgccgtct
gtatttcact 840tacgtggcaa gcaaaattac acatgtcggt gtgagactat agttagttgc
gacgggtacg 900tcgttaaaag aatagctatc agtccaggcc tgtatgggaa gccttcaggc
tatgctgcta 960cgatgcaccg cgagggattc ttgtgctgca aagtgacaga cacattgaac
ggggagaggg 1020tctcttttcc cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg
actggcatac 1080tggcaacaga tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc
aaccagcgta 1140tagtcgtcaa cggtcgcacc cagagaaaca ccaataccat gaaaaattac
cttttgcccg 1200tagtggccca ggcatttgct aggtgggcaa aggaatataa ggaagatcaa
gaagatgaaa 1260ggccactagg actacgagat agacagttag tcatggggtg ttgttgggct
tttagaaggc 1320acaagataac atctatttat aagcgcccgg atacccaaac catcatcaaa
gtgaacagcg 1380atttccactc attcgtgctg cccaggatag gcagtaacac attggagatc
gggctgagaa 1440caagaatcag gaaaatgtta gaggagcaca aggagccgtc acctctcatt
accgccgagg 1500acgtacaaga agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa
gccgaggagt 1560tgcgcgcagc tctaccacct ttggcagctg atgttgagga gcccactctg
gaagccgatg 1620tagacttgat gttacaagag gctggggccg gctcagtgga gacacctcgt
ggcttgataa 1680aggttaccag ctacgatggc gaggacaaga tcggctctta cgctgtgctt
tctccgcagg 1740ctgtactcaa gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa
gtcatagtga 1800taacacactc tggccgaaaa gggcgttatg ccgtggaacc ataccatggt
aaagtagtgg 1860tgccagaggg acatgcaata cccgtccagg actttcaagc tctgagtgaa
agtgccacca 1920ttgtgtacaa cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc
acacatggag 1980gagcgctgaa cactgatgaa gaatattaca aaactgtcaa gcccagcgag
cacgacggcg 2040aatacctgta cgacatcgac aggaaacagt gcgtcaagaa agaactagtc
actgggctag 2100ggctcacagg cgagctggtg gatcctccct tccatgaatt cgcctacgag
agtctgagaa 2160cacgaccagc cgctccttac caagtaccaa ccataggggt gtatggcgtg
ccaggatcag 2220gcaagtctgg catcattaaa agcgcagtca ccaaaaaaga tctagtggtg
agcgccaaga 2280aagaaaactg tgcagaaatt ataagggacg tcaagaaaat gaaagggctg
gacgtcaatg 2340ccagaactgt ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag
accctgtata 2400ttgacgaagc ttttgcttgt catgcaggta ctctcagagc gctcatagcc
attataagac 2460ctaaaaaggc agtgctctgc ggggatccca aacagtgcgg tttttttaac
atgatgtgcc 2520tgaaagtgca ttttaaccac gagatttgca cacaagtctt ccacaaaagc
atctctcgcc 2580gttgcactaa atctgtgact tcggtcgtct caaccttgtt ttacgacaaa
aaaatgagaa 2640cgacgaatcc gaaagagact aagattgtga ttgacactac cggcagtacc
aaacctaagc 2700aggacgatct cattctcact tgtttcagag ggtgggtgaa gcagttgcaa
atagattaca 2760aaggcaacga aataatgacg gcagctgcct ctcaagggct gacccgtaaa
ggtgtgtatg 2820ccgttcggta caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa
catgtgaacg 2880tcctactgac ccgcacggag gaccgcatcg tgtggaaaac actagccggc
gacccatgga 2940taaaaacact gactgccaag taccctggga atttcactgc cacgatagag
gagtggcaag 3000cagagcatga tgccatcatg aggcacatct tggagagacc ggaccctacc
gacgtcttcc 3060agaataaggc aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag
accgctggca 3120tagacatgac cactgaacaa tggaacactg tggattattt tgaaacggac
aaagctcact 3180cagcagagat agtattgaac caactatgcg tgaggttctt tggactcgat
ctggactccg 3240gtctattttc tgcacccact gttccgttat ccattaggaa taatcactgg
gataactccc 3300cgtcgcctaa catgtacggg ctgaataaag aagtggtccg tcagctctct
cgcaggtacc 3360cacaactgcc tcgggcagtt gccactggaa gagtctatga catgaacact
ggtacactgc 3420gcaattatga tccgcgcata aacctagtac ctgtaaacag aagactgcct
catgctttag 3480tcctccacca taatgaacac ccacagagtg acttttcttc attcgtcagc
aaattgaagg 3540gcagaactgt cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg
gttgactggt 3600tgtcagaccg gcctgaggct accttcagag ctcggctgga tttaggcatc
ccaggtgatg 3660tgcccaaata tgacataata tttgttaatg tgaggacccc atataaatac
catcactatc 3720agcagtgtga agaccatgcc attaagctta gcatgttgac caagaaagct
tgtctgcatc 3780tgaatcccgg cggaacctgt gtcagcatag gttatggtta cgctgacagg
gccagcgaaa 3840gcatcattgg tgctatagcg cggcagttca agttttcccg ggtatgcaaa
ccgaaatcct 3900cacttgaaga gacggaagtt ctgtttgtat tcattgggta cgatcgcaag
gcccgtacgc 3960acaatcctta caagctttca tcaaccttga ccaacattta tacaggttcc
agactccacg 4020aagccggatg tgcaccctca tatcatgtgg tgcgagggga tattgccacg
gccaccgaag 4080gagtgattat aaatgctgct aacagcaaag gacaacctgg cggaggggtg
tgcggagcgc 4140tgtataagaa attcccggaa agcttcgatt tacagccgat cgaagtagga
aaagcgcgac 4200tggtcaaagg tgcagctaaa catatcattc atgccgtagg accaaacttc
aacaaagttt 4260cggaggttga aggtgacaaa cagttggcag aggcttatga gtccatcgct
aagattgtca 4320acgataacaa ttacaagtca gtagcgattc cactgttgtc caccggcatc
ttttccggga 4380acaaagatcg actaacccaa tcattgaacc atttgctgac agctttagac
accactgatg 4440cagatgtagc catatactgc agggacaaga aatgggaaat gactctcaag
gaagcagtgg 4500ctaggagaga agcagtggag gagatatgca tatccgacga ctcttcagtg
acagaacctg 4560atgcagagct ggtgagggtg catccgaaga gttctttggc tggaaggaag
ggctacagca 4620caagcgatgg caaaactttc tcatatttgg aagggaccaa gtttcaccag
gcggccaagg 4680atatagcaga aattaatgcc atgtggcccg ttgcaacgga ggccaatgag
caggtatgca 4740tgtatatcct cggagaaagc atgagcagta ttaggtcgaa atgccccgtc
gaagagtcgg 4800aagcctccac accacctagc acgctgcctt gcttgtgcat ccatgccatg
actccagaaa 4860gagtacagcg cctaaaagcc tcacgtccag aacaaattac tgtgtgctca
tcctttccat 4920tgccgaagta tagaatcact ggtgtgcaga agatccaatg ctcccagcct
atattgttct 4980caccgaaagt gcctgcgtat attcatccaa ggaagtatct cgtggaaaca
ccaccggtag 5040acgagactcc ggagccatcg gcagagaacc aatccacaga ggggacacct
gaacaaccac 5100cacttataac cgaggatgag accaggacta gaacgcctga gccgatcatc
atcgaagagg 5160aagaagagga tagcataagt ttgctgtcag atggcccgac ccaccaggtg
ctgcaagtcg 5220aggcagacat tcacgggccg ccctctgtat ctagctcatc ctggtccatt
cctcatgcat 5280ccgactttga tgtggacagt ttatccatac ttgacaccct ggagggagct
agcgtgacca 5340gcggggcaac gtcagccgag actaactctt acttcgcaaa gagtatggag
tttctggcgc 5400gaccggtgcc tgcgcctcga acagtattca ggaaccctcc acatcccgct
ccgcgcacaa 5460gaacaccgtc acttgcaccc agcagggcct gctcgagaac cagcctagtt
tccaccccgc 5520caggcgtgaa tagggtgatc actagagagg agctcgaggc gcttaccccg
tcacgcactc 5580ctagcaggtc ggtctcgaga accagcctgg tctccaaccc gccaggcgta
aatagggtga 5640ttacaagaga ggagtttgag gcgttcgtag cacaacaaca atgacggttt
gatgcgggtg 5700catacatctt ttcctccgac accggtcaag ggcatttaca acaaaaatca
gtaaggcaaa 5760cggtgctatc cgaagtggtg ttggagagga ccgaattgga gatttcgtat
gccccgcgcc 5820tcgaccaaga aaaagaagaa ttactacgca agaaattaca gttaaatccc
acacctgcta 5880acagaagcag ataccagtcc aggaaggtgg agaacatgaa agccataaca
gctagacgta 5940ttctgcaagg cctagggcat tatttgaagg cagaaggaaa agtggagtgc
taccgaaccc 6000tgcatcctgt tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc
cccaaggtcg 6060cagtggaagc ctgtaacgcc atgttgaaag agaactttcc gactgtggct
tcttactgta 6120ttattccaga gtacgatgcc tatttggaca tggttgacgg agcttcatgc
tgcttagaca 6180ctgccagttt ttgccctgca aagctgcgca gctttccaaa gaaacactcc
tatttggaac 6240ccacaatacg atcggcagtg ccttcagcga tccagaacac gctccagaac
gtcctggcag 6300ctgccacaaa aagaaattgc aatgtcacgc aaatgagaga attgcccgta
ttggattcgg 6360cggcctttaa tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat
tgggaaacgt 6420ttaaagaaaa ccccatcagg cttactgaag aaaacgtggt aaattacatt
accaaattaa 6480aaggaccaaa agctgctgct ctttttgcga agacacataa tttgaatatg
ttgcaggaca 6540taccaatgga caggtttgta atggacttaa agagagacgt gaaagtgact
ccaggaacaa 6600aacatactga agaacggccc aaggtacagg tgatccaggc tgccgatccg
ctagcaacag 6660cgtatctgtg cggaatccac cgagagctgg ttaggagatt aaatgcggtc
ctgcttccga 6720acattcatac actgtttgat atgtcggctg aagactttga cgctattata
gccgagcact 6780tccagcctgg ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa
agtgaggacg 6840acgccatggc tctgaccgcg ttaatgattc tggaagactt aggtgtggac
gcagagctgt 6900tgacgctgat tgaggcggct ttcggcgaaa tttcatcaat acatttgccc
actaaaacta 6960aatttaaatt cggagccatg atgaaatctg gaatgttcct cacactgttt
gtgaacacag 7020tcattaacat tgtaatcgca agcagagtgt tgagagaacg gctaaccgga
tcaccatgtg 7080cagcattcat tggagatgac aatatcgtga aaggagtcaa atcggacaaa
ttaatggcag 7140acaggtgcgc cacctggttg aatatggaag tcaagattat agatgctgtg
gtgggcgaga 7200aagcgcctta tttctgtgga gggtttattt tgtgtgactc cgtgaccggc
acagcgtgcc 7260gtgtggcaga ccccctaaaa aggctgttta agcttggcaa acctctggca
gcagacgatg 7320aacatgatga tgacaggaga agggcattgc atgaagagtc aacacgctgg
aaccgagtgg 7380gtattctttc agagctgtgc aaggcagtag aatcaaggta tgaaaccgta
ggaacttcca 7440tcatagttat ggccatgact actctagcta gcagtgttaa atcattcagc
tacctgagag 7500gggcccctat aactctctac ggctaacctg aatggactac gacatagtct
agtcgagtct 7560agtcgacgcc accatgttcg ccctggtgct ggccgtggtc atcctgcctc
tgtggaccac 7620cgccaacaag agctacgtga cccccacacc cgccaccaga tccatcggac
acatgagcgc 7680cctgctgaga gagtacagcg accggaacat gagcctgaag ctggaagcct
tctaccccac 7740cggcttcgac gaggaactga tcaagagcct gcactggggc aacgaccgga
agcacgtgtt 7800cctcgtgatc gtgaaagtga accccaccac ccacgagggc gacgtcggcc
tggtcatctt 7860ccccaagtac ctgctgagcc cctaccactt caaggccgag cacagagccc
ccttccctgc 7920tggccgcttt ggctttctga gccaccctgt gacccccgac gtgtcattct
tcgacagcag 7980cttcgccccc tacctgacca cacagcacct ggtggccttc accaccttcc
cccccaatcc 8040tctcgtgtgg cacctggaaa gagccgagac agccgccacc gccgaaagac
cttttggcgt 8100gtccctgctg cccgccagac ctaccgtgcc caagaacacc atcctggaac
acaaggccca 8160cttcgccacc tgggatgccc tggccagaca caccttcttt agcgccgagg
ccatcatcac 8220caacagcacc ctgagaatcc acgtgcccct gttcggcagc gtgtggccca
tcagatactg 8280ggccacaggc agcgtgctgc tgaccagcga tagcggcaga gtggaagtga
acatcggcgt 8340gggcttcatg agcagcctga tcagcctgag cagcggcctg cccatcgagc
tgattgtggt 8400gccccacacc gtgaagctga acgccgtgac cagcgacacc acctggttcc
agctgaaccc 8460ccctggccct gatcctggcc ctagttacag agtgtacctg ctgggcagag
gcctggacat 8520gaacttcagc aagcacgcca ccgtggacat ctgcgcctac cctgaggaaa
gcctggacta 8580cagataccac ctgagcatgg cccacaccga ggccctgaga atgaccacca
aggccgacca 8640gcacgacatc aacgaggaaa gctactacca cattgccgcc agaatcgcca
ccagcatctt 8700cgccctgagc gagatgggcc ggaccaccga gtactttctg ctggacgaga
tcgtggacgt 8760gcagtaccag ctgaagttcc tgaactacat cctgatgcgg atcggcgctg
gcgcccaccc 8820taataccatc agcggcacca gcgacctgat cttcgccgat cctagccagc
tgcacgacga 8880gctgagcctg ctgttcggcc aggtcaaacc cgccaacgtg gactacttca
tcagctacga 8940cgaggcccgg gaccagctga aaacagccta cgccctgtcc agaggccagg
atcatgtgaa 9000cgccctgtcc ctggccaggc gcgtgatcat gagcatctac aagggcctgc
tggtcaagca 9060gaacctgaac gccaccgagc ggcaggccct gttcttcgcc agcatgatcc
tgctgaactt 9120cagagagggc ctggaaaaca gcagccgggt gctggatggc agaaccaccc
tgctgctgat 9180gaccagcatg tgcacagccg cccatgccac acaggccgcc ctgaatatcc
aggaaggcct 9240ggcttacctg aaccccagca agcacatgtt caccatcccc aacgtgtaca
gcccctgcat 9300gggcagcctg agaaccgacc tgaccgaaga gatccacgtg atgaacctgc
tgtccgccat 9360ccccaccaga cccggactga atgaggtgct gcacacccag ctggacgagt
ccgagatctt 9420cgacgccgcc ttcaagacca tgatgatctt taccacctgg accgccaagg
acctgcacat 9480cctgcacaca cacgtgcccg aggtgttcac atgccaagat gccgccgctc
ggaacggcga 9540gtatgtgctg attctgcctg ccgtgcaggg ccacagctac gtgatcaccc
ggaacaagcc 9600ccagcggggc ctggtgtata gcctggctga cgtggacgtg tacaacccca
tcagcgtggt 9660gtacctgagc aaggatacct gcgtgtccga gcacggcgtg atcgaaacag
tggccctgcc 9720ccaccccgac aacctgaaag agtgcctgta ctgcggctcc gtgttcctgc
ggtatctgac 9780caccggcgcc atcatggaca tcatcatcat cgacagcaag gacaccgaga
gacagctggc 9840cgccatgggc aacagcacca tccccccctt caaccccgac atgcacggcg
acgatagcaa 9900ggccgtgctg ctgttcccca acggcaccgt ggtcacactg ctgggcttcg
agcggagaca 9960ggccatcaga atgagcggcc agtacctggg cgcctctctg ggtggtgcct
ttctggccgt 10020cgtgggcttt ggcatcatcg gctggatgct gtgcggcaac agcagactgc
gcgagtacaa 10080caagatcccc ctgacctaat ctagacgtcg cgaccaccca ggatccgcct
ataactctct 10140acggctaacc tgaatggact acgacatagt ctagtcgacg ccaccatggc
cagccacaag 10200tggctgctgc agatgatcgt gttcctgaaa accatcacaa tcgcctactg
cctgcatctg 10260caggacgaca cccctctgtt cttcggcgcc aagcctctga gcgacgtgtc
cctgatcatc 10320accgagcctt gcgtgtccag cgtgtacgag gcctgggatt atgccgcccc
tcccgtgtcc 10380aatctgagcg aagccctgag cggcatcgtg gtcaagacca agtgccccgt
gcccgaagtg 10440atcctgtggt tcaaggacaa gcagatggcc tactggacca acccttacgt
gaccctgaag 10500ggcctgaccc agagcgtggg cgaggaacac aagagcggcg acatcagaga
tgccctgctg 10560gatgccctgt ccggtgtctg ggtggacagc acaccctcca gcaccaacat
ccccgagaac 10620ggctgtgtgt ggggagccga ccggctgttc cagagagtgt gtcagtaatc
tagacgcggc 10680cgcatacagc agcaattggc aagctgctta catagaactc gcggcgattg
gcatgccgcc 10740ttaaaatttt tattttattt ttcttttctt ttccgaatcg gattttgttt
ttaatatttc 10800aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaagggtc ggcatggcat
ctccacctcc 10860tcgcggtccg acctgggcat ccgaaggagg acgcacgtcc actcggatgg
ctaagggaga 10920gccacgttta aaccagctcc aattcgccct atagtgagtc gtattacgcg
cgctcactgg 10980ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt
aatcgccttg 11040cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc
gatcgccctt 11100cccaacagtt gcgcagcctg aatggcgaat gggacgcgcc ctgtagcggc
gcattaagcg 11160cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc
ctagcgcccg 11220ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc
cgtcaagctc 11280taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc
gaccccaaaa 11340aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg
gtttttcgcc 11400ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact
ggaacaacac 11460tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt
tcggcctatt 11520ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa
atattaacgc 11580ttacaattta ggtggcactt ttcggggaaa tgtgcgcgga acccctattt
gtttattttt 11640ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa
tgcttcaata 11700atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta
ttcccttttt 11760tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag
taaaagatgc 11820tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca
gcggtaagat 11880ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta
aagttctgct 11940atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc
gccgcataca 12000ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc
ttacggatgg 12060catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca
ctgcggccaa 12120cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc
acaacatggg 12180ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca
taccaaacga 12240cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac
tattaactgg 12300cgaactactt actctagctt cccggcaaca attaatagac tggatggagg
cggataaagt 12360tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg
ataaatctgg 12420agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg
gtaagccctc 12480ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac
gaaatagaca 12540gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc
aagtttactc 12600atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct
aggtgaagat 12660cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc
actgagcgtc 12720agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc
gcgtaatctg 12780ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg
atcaagagct 12840accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa
atactgttct 12900tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc
ctacatacct 12960cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt
gtcttaccgg 13020gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa
cggggggttc 13080gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc
tacagcgtga 13140gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc
cggtaagcgg 13200cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct
ggtatcttta 13260tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat
gctcgtcagg 13320ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc
tggccttttg 13380ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg
ataaccgtat 13440taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc
gcagcgagtc 13500agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg
cgcgttggcc 13560gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca
gtgagcgcaa 13620cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact
ttatgctccc 13680ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa
acagctatga 13740ccatgattac gccaagcgcg caattaaccc tcactaaagg gaacaaaagc
tgggtaccgg 13800gcccacgcgt aatacgactc actatag
13827
77 12604 DNA Artificial Sequence source /note="Description of Artificial Sequence Synthetic polynucleotide"
77 ataggcggcg
catgagagaa gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga
ggaagacagc ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc
caagcaggtc actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa
actgatcgaa acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc
ccgcagaatg tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga
tccggacaga ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga
taaggaattg gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga
aactgagact atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt
ttaccaggat gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg
agttagagtc gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc
tggagcatat ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa
cataggccta tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag
aaagaagtat ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca
cgagaagagg gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa
gcaaaattac acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag
aatagctatc agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg
cgagggattc ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc
cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga
tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa
cggtcgcacc cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca
ggcatttgct aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg
actacgagat agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac
atctatttat aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc
attcgtgctg cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag
gaaaatgtta gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga
agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc
tctaccacct ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat
gttacaagag gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag
ctacgatggc gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa
gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc
tggccgaaaa gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg
acatgcaata cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa
cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa
cactgatgaa gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta
cgacatcgac aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg
cgagctggtg gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc
cgctccttac caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg
catcattaaa agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg
tgcagaaatt ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt
ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc
ttttgcttgt catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc
agtgctctgc ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca
ttttaaccac gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa
atctgtgact tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc
gaaagagact aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct
cattctcact tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga
aataatgacg gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta
caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac
ccgcacggag gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact
gactgccaag taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga
tgccatcatg aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc
aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac
cactgaacaa tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat
agtattgaac caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc
tgcacccact gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa
catgtacggg ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc
tcgggcagtt gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga
tccgcgcata aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca
taatgaacac ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt
cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg
gcctgaggct accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata
tgacataata tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga
agaccatgcc attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg
cggaacctgt gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg
tgctatagcg cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga
gacggaagtt ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta
caagctttca tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg
tgcaccctca tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat
aaatgctgct aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa
attcccggaa agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg
tgcagctaaa catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga
aggtgacaaa cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa
ttacaagtca gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg
actaacccaa tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc
catatactgc agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga
agcagtggag gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct
ggtgagggtg catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg
caaaactttc tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga
aattaatgcc atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct
cggagaaagc atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac
accacctagc acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg
cctaaaagcc tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta
tagaatcact ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt
gcctgcgtat attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc
ggagccatcg gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac
cgaggatgag accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga
tagcataagt ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat
tcacgggccg ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga
tgtggacagt ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac
gtcagccgag actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc
tgcgcctcga acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc
acttgcaccc agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa
tagggtgatc actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc
ggtctcgaga accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga
ggagtttgag gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt
ttcctccgac accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc
cgaagtggtg ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga
aaaagaagaa ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag
ataccagtcc aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg
cctagggcat tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt
tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc
ctgtaacgcc atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga
gtacgatgcc tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt
ttgccctgca aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg
atcggcagtg ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa
aagaaattgc aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa
tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa
ccccatcagg cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa
agctgctgct ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga
caggtttgta atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga
agaacggccc aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg
cggaatccac cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac
actgtttgat atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg
ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc
tctgaccgcg ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat
tgaggcggct ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt
cggagccatg atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat
tgtaatcgca agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat
tggagatgac aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc
cacctggttg aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta
tttctgtgga gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga
ccccctaaaa aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga
tgacaggaga agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc
agagctgtgc aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat
ggccatgact actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat
aactctctac ggctaacctg aatggactac gacatagtct agtcgagtct 7560agtcgacgcc
accatgggca ccgtgaacaa gcctgtcgtg ggcgtgctga tgggcttcgg 7620catcatcacc
ggcaccctga gaatcaccaa ccctgtgcgg gccagcgtgc tgagatacga 7680cgacttccac
atcgacgagg acaagctgga caccaacagc gtgtacgagc cctactacca 7740cagcgaccac
gccgagagca gctgggtcaa cagaggcgag agcagccgga aggcctacga 7800ccacaacagc
ccctacatct ggccccggaa cgactacgac ggcttcctgg aaaacgccca 7860cgagcaccac
ggcgtgtaca atcagggcag aggcatcgac agcggcgaga gactgatgca 7920gcccacacag
atgagcgccc aggaagatct gggcgacgac acaggcatcc acgtgatccc 7980caccctgaac
ggcgacgacc ggcacaagat cgtgaacgtg gaccagcggc agtacggcga 8040cgtgttcaag
ggcgacctga accctaagcc ccagggccag agactgatcg aggtgtccgt 8100ggaagagaac
caccccttca ccctgagagc ccccatccag agaatctacg gcgtgcggta 8160taccgagact
tggagcttcc tgcccagcct gacctgtaca ggcgacgccg ctcctgccat 8220ccagcacatc
tgcctgaagc acaccacctg tttccaggac gtggtggtgg acgtggactg 8280cgccgagaac
accaaagagg accagctggc cgagatcagc taccggttcc agggcaagaa 8340agaggccgac
cagccctgga tcgtggtcaa taccagcacc ctgttcgacg agctggaact 8400ggaccccccc
gagattgaac ccggcgtgct gaaggtgctg cggaccgaga agcagtacct 8460gggcgtgtac
atctggaaca tgcggggctc cgacggcacc tctacctacg ccaccttcct 8520ggtcacatgg
aagggcgacg agaaaacccg gaaccctacc cctgccgtga cccctcagcc 8580tagaggcgcc
gagttccata tgtggaatta ccactcccac gtgttcagcg tgggcgacac 8640cttcagcctg
gccatgcatc tgcagtacaa gatccacgag gcccccttcg acctgctgct 8700ggaatggctg
tacgtgccca tcgaccctac ctgccagccc atgcggctgt acagcacctg 8760tctgtaccac
cccaacgccc ctcagtgcct gagccacatg aacagcggct gcaccttcac 8820cagccctcac
ctggctcaga gggtggccag caccgtgtac cagaattgcg agcacgccga 8880caactacacc
gcctactgcc tgggcatcag ccacatggaa cccagcttcg gcctgatcct 8940gcacgatggc
ggcaccaccc tgaagttcgt ggacacaccc gagagcctga gcggcctgta 9000cgtgttcgtg
gtgtacttca acggccacgt ggaagccgtg gcctacaccg tggtgtccac 9060cgtggaccac
ttcgtgaacg ccatcgagga aagaggcttc ccacccacag ccggacagcc 9120tccagccacc
accaagccca aagaaatcac ccccgtgaac cccggcacca gccccctgct 9180gagatatgct
gcttggacag gcggactggc cgctgtggtg ctgctgtgcc tggtcatctt 9240cctgatctgc
accgccaagc ggatgagagt gaaggcctac cgggtggaca agtcccccta 9300caaccagagc
atgtactacg ccggcctgcc cgtggacgat ttcgaggata gcgagagcac 9360cgacaccgag
gaagagttcg gcaacgccat cggcggatct cacggcggca gcagctacac 9420cgtgtacatc
gacaagacca gataatctag acgcggccgc atacagcagc aattggcaag 9480ctgcttacat
agaactcgcg gcgattggca tgccgcctta aaatttttat tttatttttc 9540ttttcttttc
cgaatcggat tttgttttta atatttcaaa aaaaaaaaaa aaaaaaaaaa 9600aaaaaaaaaa
aagggtcggc atggcatctc cacctcctcg cggtccgacc tgggcatccg 9660aaggaggacg
cacgtccact cggatggcta agggagagcc acgtttaaac cagctccaat 9720tcgccctata
gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca acgtcgtgac 9780tgggaaaacc
ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc 9840tggcgtaata
gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat 9900ggcgaatggg
acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc 9960agcgtgaccg
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc 10020tttctcgcca
cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg 10080ttccgattta
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca 10140cgtagtgggc
catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc 10200tttaatagtg
gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct 10260tttgatttat
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa 10320caaaaattta
acgcgaattt taacaaaata ttaacgctta caatttaggt ggcacttttc 10380ggggaaatgt
gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc 10440cgctcatgag
acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga 10500gtattcaaca
tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt 10560ttgctcaccc
agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag 10620tgggttacat
cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 10680aacgttttcc
aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta 10740ttgacgccgg
gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 10800agtactcacc
agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 10860gtgctgccat
aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 10920gaccgaagga
gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 10980gttgggaacc
ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 11040tagcaatggc
aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 11100ggcaacaatt
aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 11160cccttccggc
tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 11220gtatcattgc
agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 11280cggggagtca
ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 11340tgattaagca
ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 11400aacttcattt
ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 11460aaatccctta
acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 11520gatcttcttg
agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 11580cgctaccagc
ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 11640ctggcttcag
cagagcgcag ataccaaata ctgttcttct agtgtagccg tagttaggcc 11700accacttcaa
gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 11760tggctgctgc
cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 11820cggataaggc
gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 11880gaacgaccta
caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc 11940ccgaagggag
aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 12000cgagggagct
tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc 12060tctgacttga
gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg 12120ccagcaacgc
ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct 12180ttcctgcgtt
atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata 12240ccgctcgccg
cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc 12300gcccaatacg
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg 12360acaggtttcc
cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca 12420ctcattaggc
accccaggct ttacacttta tgctcccggc tcgtatgttg tgtggaattg 12480tgagcggata
acaatttcac acaggaaaca gctatgacca tgattacgcc aagcgcgcaa 12540ttaaccctca
ctaaagggaa caaaagctgg gtaccgggcc cacgcgtaat acgactcact 12600atag
12604
78 11797 DNA Artificial Sequence source /note="Description of Artificial Sequence Synthetic polynucleotide"
78 ataggcggcg catgagagaa
gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga ggaagacagc
ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc caagcaggtc
actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa actgatcgaa
acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc ccgcagaatg
tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga tccggacaga
ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga taaggaattg
gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga aactgagact
atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt ttaccaggat
gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg agttagagtc
gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc tggagcatat
ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa cataggccta
tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag aaagaagtat
ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca cgagaagagg
gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa gcaaaattac
acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag aatagctatc
agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg cgagggattc
ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc cgtgtgcacg
tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga tgtcagtgcg
gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa cggtcgcacc
cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca ggcatttgct
aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg actacgagat
agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac atctatttat
aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc attcgtgctg
cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag gaaaatgtta
gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga agctaagtgc
gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc tctaccacct
ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat gttacaagag
gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag ctacgatggc
gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa gagtgaaaaa
ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc tggccgaaaa
gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg acatgcaata
cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa cgaacgtgag
ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa cactgatgaa
gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta cgacatcgac
aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg cgagctggtg
gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc cgctccttac
caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg catcattaaa
agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg tgcagaaatt
ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt ggactcagtg
ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc ttttgcttgt
catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc agtgctctgc
ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca ttttaaccac
gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa atctgtgact
tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc gaaagagact
aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct cattctcact
tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga aataatgacg
gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta caaggtgaat
gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac ccgcacggag
gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact gactgccaag
taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga tgccatcatg
aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc aaacgtgtgt
tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac cactgaacaa
tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat agtattgaac
caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc tgcacccact
gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa catgtacggg
ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc tcgggcagtt
gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga tccgcgcata
aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca taatgaacac
ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt cctggtggtc
ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg gcctgaggct
accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata tgacataata
tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga agaccatgcc
attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg cggaacctgt
gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg tgctatagcg
cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga gacggaagtt
ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta caagctttca
tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg tgcaccctca
tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat aaatgctgct
aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa attcccggaa
agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg tgcagctaaa
catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga aggtgacaaa
cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa ttacaagtca
gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg actaacccaa
tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc catatactgc
agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga agcagtggag
gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct ggtgagggtg
catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg caaaactttc
tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga aattaatgcc
atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct cggagaaagc
atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac accacctagc
acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg cctaaaagcc
tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta tagaatcact
ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt gcctgcgtat
attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc ggagccatcg
gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac cgaggatgag
accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga tagcataagt
ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat tcacgggccg
ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga tgtggacagt
ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac gtcagccgag
actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc tgcgcctcga
acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc acttgcaccc
agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa tagggtgatc
actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc ggtctcgaga
accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga ggagtttgag
gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt ttcctccgac
accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc cgaagtggtg
ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga aaaagaagaa
ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag ataccagtcc
aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg cctagggcat
tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt tcctttgtat
tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc ctgtaacgcc
atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga gtacgatgcc
tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt ttgccctgca
aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg atcggcagtg
ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa aagaaattgc
aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa tgtggaatgc
ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa ccccatcagg
cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa agctgctgct
ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga caggtttgta
atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga agaacggccc
aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg cggaatccac
cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac actgtttgat
atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg ggattgtgtt
ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc tctgaccgcg
ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat tgaggcggct
ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt cggagccatg
atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat tgtaatcgca
agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat tggagatgac
aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc cacctggttg
aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta tttctgtgga
gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga ccccctaaaa
aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga tgacaggaga
agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc agagctgtgc
aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat ggccatgact
actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat aactctctac
ggctaacctg aatggactac gacatagtct agtcgagtct 7560agtcgacgcc accatgtttc
tgatccagtg cctgatcagc gccgtgatct tctatattca 7620agtcacaaac gccctgatct
ttaagggcga ccacgtgtca ctgcaggtca acagcagcct 7680gaccagcatc ctgatcccca
tgcagaacga caattacacc gagatcaagg gccagctggt 7740gttcatcggc gagcagctgc
ccaccggcac caattacagc ggcaccctgg aactgctgta 7800cgccgatacc gtggccttct
gcttcagaag cgtgcaggtc atcagatacg acggctgccc 7860ccggatcaga accagcgcct
tcatcagctg ccggtacaag cacagctggc actacggcaa 7920cagcaccgac cggatcagca
ccgaacctga tgccggcgtg atgctgaaga tcaccaagcc 7980cggcatcaac gacgccggcg
tgtacgtgct gctcgtgcgg ctggatcaca gcagaagcac 8040cgacggcttc atcctgggcg
tgaacgtgta caccgccggc agccaccaca acatccacgg 8100cgtgatctac accagcccca
gcctgcagaa cggctacagc accagagccc tgttccagca 8160ggccagactg tgcgatctgc
ccgccacacc taagggcagc ggcacaagcc tgtttcagca 8220catgctggac ctgagagccg
gcaagagcct ggaagataac ccctggctgc acgaggacgt 8280ggtcaccacc gagacaaaga
gcgtggtcaa agagggcatc gagaaccacg tgtaccccac 8340cgacatgagc accctgcccg
agaagtccct gaacgacccc cctgagaacc tgctgatcat 8400catccccatc gtggccagcg
tgatgatcct gaccgccatg gtcatcgtga tcgtgatcag 8460cgtgaagcgg cggagaatca
agaagcaccc catctaccgg cccaacacca agaccagacg 8520gggcatccag aacgccaccc
ctgagtccga cgtgatgctg gaagccgcca ttgcccagct 8580ggccaccatc agagaggaaa
gcccccctca cagcgtcgtg aaccccttcg tgaagtaatc 8640tagacgcggc cgcatacagc
agcaattggc aagctgctta catagaactc gcggcgattg 8700gcatgccgcc ttaaaatttt
tattttattt ttcttttctt ttccgaatcg gattttgttt 8760ttaatatttc aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa aaaaagggtc ggcatggcat 8820ctccacctcc tcgcggtccg
acctgggcat ccgaaggagg acgcacgtcc actcggatgg 8880ctaagggaga gccacgttta
aaccagctcc aattcgccct atagtgagtc gtattacgcg 8940cgctcactgg ccgtcgtttt
acaacgtcgt gactgggaaa accctggcgt tacccaactt 9000aatcgccttg cagcacatcc
ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 9060gatcgccctt cccaacagtt
gcgcagcctg aatggcgaat gggacgcgcc ctgtagcggc 9120gcattaagcg cggcgggtgt
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc 9180ctagcgcccg ctcctttcgc
tttcttccct tcctttctcg ccacgttcgc cggctttccc 9240cgtcaagctc taaatcgggg
gctcccttta gggttccgat ttagtgcttt acggcacctc 9300gaccccaaaa aacttgatta
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg 9360gtttttcgcc ctttgacgtt
ggagtccacg ttctttaata gtggactctt gttccaaact 9420ggaacaacac tcaaccctat
ctcggtctat tcttttgatt tataagggat tttgccgatt 9480tcggcctatt ggttaaaaaa
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa 9540atattaacgc ttacaattta
ggtggcactt ttcggggaaa tgtgcgcgga acccctattt 9600gtttattttt ctaaatacat
tcaaatatgt atccgctcat gagacaataa ccctgataaa 9660tgcttcaata atattgaaaa
aggaagagta tgagtattca acatttccgt gtcgccctta 9720ttcccttttt tgcggcattt
tgccttcctg tttttgctca cccagaaacg ctggtgaaag 9780taaaagatgc tgaagatcag
ttgggtgcac gagtgggtta catcgaactg gatctcaaca 9840gcggtaagat ccttgagagt
tttcgccccg aagaacgttt tccaatgatg agcactttta 9900aagttctgct atgtggcgcg
gtattatccc gtattgacgc cgggcaagag caactcggtc 9960gccgcataca ctattctcag
aatgacttgg ttgagtactc accagtcaca gaaaagcatc 10020ttacggatgg catgacagta
agagaattat gcagtgctgc cataaccatg agtgataaca 10080ctgcggccaa cttacttctg
acaacgatcg gaggaccgaa ggagctaacc gcttttttgc 10140acaacatggg ggatcatgta
actcgccttg atcgttggga accggagctg aatgaagcca 10200taccaaacga cgagcgtgac
accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac 10260tattaactgg cgaactactt
actctagctt cccggcaaca attaatagac tggatggagg 10320cggataaagt tgcaggacca
cttctgcgct cggcccttcc ggctggctgg tttattgctg 10380ataaatctgg agccggtgag
cgtgggtctc gcggtatcat tgcagcactg gggccagatg 10440gtaagccctc ccgtatcgta
gttatctaca cgacggggag tcaggcaact atggatgaac 10500gaaatagaca gatcgctgag
ataggtgcct cactgattaa gcattggtaa ctgtcagacc 10560aagtttactc atatatactt
tagattgatt taaaacttca tttttaattt aaaaggatct 10620aggtgaagat cctttttgat
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc 10680actgagcgtc agaccccgta
gaaaagatca aaggatcttc ttgagatcct ttttttctgc 10740gcgtaatctg ctgcttgcaa
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg 10800atcaagagct accaactctt
tttccgaagg taactggctt cagcagagcg cagataccaa 10860atactgttct tctagtgtag
ccgtagttag gccaccactt caagaactct gtagcaccgc 10920ctacatacct cgctctgcta
atcctgttac cagtggctgc tgccagtggc gataagtcgt 10980gtcttaccgg gttggactca
agacgatagt taccggataa ggcgcagcgg tcgggctgaa 11040cggggggttc gtgcacacag
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc 11100tacagcgtga gctatgagaa
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc 11160cggtaagcgg cagggtcgga
acaggagagc gcacgaggga gcttccaggg ggaaacgcct 11220ggtatcttta tagtcctgtc
gggtttcgcc acctctgact tgagcgtcga tttttgtgat 11280gctcgtcagg ggggcggagc
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc 11340tggccttttg ctggcctttt
gctcacatgt tctttcctgc gttatcccct gattctgtgg 11400ataaccgtat taccgccttt
gagtgagctg ataccgctcg ccgcagccga acgaccgagc 11460gcagcgagtc agtgagcgag
gaagcggaag agcgcccaat acgcaaaccg cctctccccg 11520cgcgttggcc gattcattaa
tgcagctggc acgacaggtt tcccgactgg aaagcgggca 11580gtgagcgcaa cgcaattaat
gtgagttagc tcactcatta ggcaccccag gctttacact 11640ttatgctccc ggctcgtatg
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa 11700acagctatga ccatgattac
gccaagcgcg caattaaccc tcactaaagg gaacaaaagc 11760tgggtaccgg gcccacgcgt
aatacgactc actatag 11797
79 13755 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide"
79 ataggcggcg catgagagaa gcccagacca attacctacc caaaatggag
aaagttcacg 60ttgacatcga ggaagacagc ccattcctca gagctttgca gcggagcttc
ccgcagtttg 120aggtagaagc caagcaggtc actgataatg accatgctaa tgccagagcg
ttttcgcatc 180tggcttcaaa actgatcgaa acggaggtgg acccatccga cacgatcctt
gacattggaa 240gtgcgcccgc ccgcagaatg tattctaagc acaagtatca ttgtatctgt
ccgatgagat 300gtgcggaaga tccggacaga ttgtataagt atgcaactaa gctgaagaaa
aactgtaagg 360aaataactga taaggaattg gacaagaaaa tgaaggagct cgccgccgtc
atgagcgacc 420ctgacctgga aactgagact atgtgcctcc acgacgacga gtcgtgtcgc
tacgaagggc 480aagtcgctgt ttaccaggat gtatacgcgg ttgacggacc gacaagtctc
tatcaccaag 540ccaataaggg agttagagtc gcctactgga taggctttga caccacccct
tttatgttta 600agaacttggc tggagcatat ccatcatact ctaccaactg ggccgacgaa
accgtgttaa 660cggctcgtaa cataggccta tgcagctctg acgttatgga gcggtcacgt
agagggatgt 720ccattcttag aaagaagtat ttgaaaccat ccaacaatgt tctattctct
gttggctcga 780ccatctacca cgagaagagg gacttactga ggagctggca cctgccgtct
gtatttcact 840tacgtggcaa gcaaaattac acatgtcggt gtgagactat agttagttgc
gacgggtacg 900tcgttaaaag aatagctatc agtccaggcc tgtatgggaa gccttcaggc
tatgctgcta 960cgatgcaccg cgagggattc ttgtgctgca aagtgacaga cacattgaac
ggggagaggg 1020tctcttttcc cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg
actggcatac 1080tggcaacaga tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc
aaccagcgta 1140tagtcgtcaa cggtcgcacc cagagaaaca ccaataccat gaaaaattac
cttttgcccg 1200tagtggccca ggcatttgct aggtgggcaa aggaatataa ggaagatcaa
gaagatgaaa 1260ggccactagg actacgagat agacagttag tcatggggtg ttgttgggct
tttagaaggc 1320acaagataac atctatttat aagcgcccgg atacccaaac catcatcaaa
gtgaacagcg 1380atttccactc attcgtgctg cccaggatag gcagtaacac attggagatc
gggctgagaa 1440caagaatcag gaaaatgtta gaggagcaca aggagccgtc acctctcatt
accgccgagg 1500acgtacaaga agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa
gccgaggagt 1560tgcgcgcagc tctaccacct ttggcagctg atgttgagga gcccactctg
gaagccgatg 1620tagacttgat gttacaagag gctggggccg gctcagtgga gacacctcgt
ggcttgataa 1680aggttaccag ctacgatggc gaggacaaga tcggctctta cgctgtgctt
tctccgcagg 1740ctgtactcaa gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa
gtcatagtga 1800taacacactc tggccgaaaa gggcgttatg ccgtggaacc ataccatggt
aaagtagtgg 1860tgccagaggg acatgcaata cccgtccagg actttcaagc tctgagtgaa
agtgccacca 1920ttgtgtacaa cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc
acacatggag 1980gagcgctgaa cactgatgaa gaatattaca aaactgtcaa gcccagcgag
cacgacggcg 2040aatacctgta cgacatcgac aggaaacagt gcgtcaagaa agaactagtc
actgggctag 2100ggctcacagg cgagctggtg gatcctccct tccatgaatt cgcctacgag
agtctgagaa 2160cacgaccagc cgctccttac caagtaccaa ccataggggt gtatggcgtg
ccaggatcag 2220gcaagtctgg catcattaaa agcgcagtca ccaaaaaaga tctagtggtg
agcgccaaga 2280aagaaaactg tgcagaaatt ataagggacg tcaagaaaat gaaagggctg
gacgtcaatg 2340ccagaactgt ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag
accctgtata 2400ttgacgaagc ttttgcttgt catgcaggta ctctcagagc gctcatagcc
attataagac 2460ctaaaaaggc agtgctctgc ggggatccca aacagtgcgg tttttttaac
atgatgtgcc 2520tgaaagtgca ttttaaccac gagatttgca cacaagtctt ccacaaaagc
atctctcgcc 2580gttgcactaa atctgtgact tcggtcgtct caaccttgtt ttacgacaaa
aaaatgagaa 2640cgacgaatcc gaaagagact aagattgtga ttgacactac cggcagtacc
aaacctaagc 2700aggacgatct cattctcact tgtttcagag ggtgggtgaa gcagttgcaa
atagattaca 2760aaggcaacga aataatgacg gcagctgcct ctcaagggct gacccgtaaa
ggtgtgtatg 2820ccgttcggta caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa
catgtgaacg 2880tcctactgac ccgcacggag gaccgcatcg tgtggaaaac actagccggc
gacccatgga 2940taaaaacact gactgccaag taccctggga atttcactgc cacgatagag
gagtggcaag 3000cagagcatga tgccatcatg aggcacatct tggagagacc ggaccctacc
gacgtcttcc 3060agaataaggc aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag
accgctggca 3120tagacatgac cactgaacaa tggaacactg tggattattt tgaaacggac
aaagctcact 3180cagcagagat agtattgaac caactatgcg tgaggttctt tggactcgat
ctggactccg 3240gtctattttc tgcacccact gttccgttat ccattaggaa taatcactgg
gataactccc 3300cgtcgcctaa catgtacggg ctgaataaag aagtggtccg tcagctctct
cgcaggtacc 3360cacaactgcc tcgggcagtt gccactggaa gagtctatga catgaacact
ggtacactgc 3420gcaattatga tccgcgcata aacctagtac ctgtaaacag aagactgcct
catgctttag 3480tcctccacca taatgaacac ccacagagtg acttttcttc attcgtcagc
aaattgaagg 3540gcagaactgt cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg
gttgactggt 3600tgtcagaccg gcctgaggct accttcagag ctcggctgga tttaggcatc
ccaggtgatg 3660tgcccaaata tgacataata tttgttaatg tgaggacccc atataaatac
catcactatc 3720agcagtgtga agaccatgcc attaagctta gcatgttgac caagaaagct
tgtctgcatc 3780tgaatcccgg cggaacctgt gtcagcatag gttatggtta cgctgacagg
gccagcgaaa 3840gcatcattgg tgctatagcg cggcagttca agttttcccg ggtatgcaaa
ccgaaatcct 3900cacttgaaga gacggaagtt ctgtttgtat tcattgggta cgatcgcaag
gcccgtacgc 3960acaatcctta caagctttca tcaaccttga ccaacattta tacaggttcc
agactccacg 4020aagccggatg tgcaccctca tatcatgtgg tgcgagggga tattgccacg
gccaccgaag 4080gagtgattat aaatgctgct aacagcaaag gacaacctgg cggaggggtg
tgcggagcgc 4140tgtataagaa attcccggaa agcttcgatt tacagccgat cgaagtagga
aaagcgcgac 4200tggtcaaagg tgcagctaaa catatcattc atgccgtagg accaaacttc
aacaaagttt 4260cggaggttga aggtgacaaa cagttggcag aggcttatga gtccatcgct
aagattgtca 4320acgataacaa ttacaagtca gtagcgattc cactgttgtc caccggcatc
ttttccggga 4380acaaagatcg actaacccaa tcattgaacc atttgctgac agctttagac
accactgatg 4440cagatgtagc catatactgc agggacaaga aatgggaaat gactctcaag
gaagcagtgg 4500ctaggagaga agcagtggag gagatatgca tatccgacga ctcttcagtg
acagaacctg 4560atgcagagct ggtgagggtg catccgaaga gttctttggc tggaaggaag
ggctacagca 4620caagcgatgg caaaactttc tcatatttgg aagggaccaa gtttcaccag
gcggccaagg 4680atatagcaga aattaatgcc atgtggcccg ttgcaacgga ggccaatgag
caggtatgca 4740tgtatatcct cggagaaagc atgagcagta ttaggtcgaa atgccccgtc
gaagagtcgg 4800aagcctccac accacctagc acgctgcctt gcttgtgcat ccatgccatg
actccagaaa 4860gagtacagcg cctaaaagcc tcacgtccag aacaaattac tgtgtgctca
tcctttccat 4920tgccgaagta tagaatcact ggtgtgcaga agatccaatg ctcccagcct
atattgttct 4980caccgaaagt gcctgcgtat attcatccaa ggaagtatct cgtggaaaca
ccaccggtag 5040acgagactcc ggagccatcg gcagagaacc aatccacaga ggggacacct
gaacaaccac 5100cacttataac cgaggatgag accaggacta gaacgcctga gccgatcatc
atcgaagagg 5160aagaagagga tagcataagt ttgctgtcag atggcccgac ccaccaggtg
ctgcaagtcg 5220aggcagacat tcacgggccg ccctctgtat ctagctcatc ctggtccatt
cctcatgcat 5280ccgactttga tgtggacagt ttatccatac ttgacaccct ggagggagct
agcgtgacca 5340gcggggcaac gtcagccgag actaactctt acttcgcaaa gagtatggag
tttctggcgc 5400gaccggtgcc tgcgcctcga acagtattca ggaaccctcc acatcccgct
ccgcgcacaa 5460gaacaccgtc acttgcaccc agcagggcct gctcgagaac cagcctagtt
tccaccccgc 5520caggcgtgaa tagggtgatc actagagagg agctcgaggc gcttaccccg
tcacgcactc 5580ctagcaggtc ggtctcgaga accagcctgg tctccaaccc gccaggcgta
aatagggtga 5640ttacaagaga ggagtttgag gcgttcgtag cacaacaaca atgacggttt
gatgcgggtg 5700catacatctt ttcctccgac accggtcaag ggcatttaca acaaaaatca
gtaaggcaaa 5760cggtgctatc cgaagtggtg ttggagagga ccgaattgga gatttcgtat
gccccgcgcc 5820tcgaccaaga aaaagaagaa ttactacgca agaaattaca gttaaatccc
acacctgcta 5880acagaagcag ataccagtcc aggaaggtgg agaacatgaa agccataaca
gctagacgta 5940ttctgcaagg cctagggcat tatttgaagg cagaaggaaa agtggagtgc
taccgaaccc 6000tgcatcctgt tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc
cccaaggtcg 6060cagtggaagc ctgtaacgcc atgttgaaag agaactttcc gactgtggct
tcttactgta 6120ttattccaga gtacgatgcc tatttggaca tggttgacgg agcttcatgc
tgcttagaca 6180ctgccagttt ttgccctgca aagctgcgca gctttccaaa gaaacactcc
tatttggaac 6240ccacaatacg atcggcagtg ccttcagcga tccagaacac gctccagaac
gtcctggcag 6300ctgccacaaa aagaaattgc aatgtcacgc aaatgagaga attgcccgta
ttggattcgg 6360cggcctttaa tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat
tgggaaacgt 6420ttaaagaaaa ccccatcagg cttactgaag aaaacgtggt aaattacatt
accaaattaa 6480aaggaccaaa agctgctgct ctttttgcga agacacataa tttgaatatg
ttgcaggaca 6540taccaatgga caggtttgta atggacttaa agagagacgt gaaagtgact
ccaggaacaa 6600aacatactga agaacggccc aaggtacagg tgatccaggc tgccgatccg
ctagcaacag 6660cgtatctgtg cggaatccac cgagagctgg ttaggagatt aaatgcggtc
ctgcttccga 6720acattcatac actgtttgat atgtcggctg aagactttga cgctattata
gccgagcact 6780tccagcctgg ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa
agtgaggacg 6840acgccatggc tctgaccgcg ttaatgattc tggaagactt aggtgtggac
gcagagctgt 6900tgacgctgat tgaggcggct ttcggcgaaa tttcatcaat acatttgccc
actaaaacta 6960aatttaaatt cggagccatg atgaaatctg gaatgttcct cacactgttt
gtgaacacag 7020tcattaacat tgtaatcgca agcagagtgt tgagagaacg gctaaccgga
tcaccatgtg 7080cagcattcat tggagatgac aatatcgtga aaggagtcaa atcggacaaa
ttaatggcag 7140acaggtgcgc cacctggttg aatatggaag tcaagattat agatgctgtg
gtgggcgaga 7200aagcgcctta tttctgtgga gggtttattt tgtgtgactc cgtgaccggc
acagcgtgcc 7260gtgtggcaga ccccctaaaa aggctgttta agcttggcaa acctctggca
gcagacgatg 7320aacatgatga tgacaggaga agggcattgc atgaagagtc aacacgctgg
aaccgagtgg 7380gtattctttc agagctgtgc aaggcagtag aatcaaggta tgaaaccgta
ggaacttcca 7440tcatagttat ggccatgact actctagcta gcagtgttaa atcattcagc
tacctgagag 7500gggcccctat aactctctac ggctaacctg aatggactac gacatagtct
agtcgagtct 7560agtcgacgcc accatgggca ccgtgaacaa gcctgtcgtg ggcgtgctga
tgggcttcgg 7620catcatcacc ggcaccctga gaatcaccaa ccctgtgcgg gccagcgtgc
tgagatacga 7680cgacttccac atcgacgagg acaagctgga caccaacagc gtgtacgagc
cctactacca 7740cagcgaccac gccgagagca gctgggtcaa cagaggcgag agcagccgga
aggcctacga 7800ccacaacagc ccctacatct ggccccggaa cgactacgac ggcttcctgg
aaaacgccca 7860cgagcaccac ggcgtgtaca atcagggcag aggcatcgac agcggcgaga
gactgatgca 7920gcccacacag atgagcgccc aggaagatct gggcgacgac acaggcatcc
acgtgatccc 7980caccctgaac ggcgacgacc ggcacaagat cgtgaacgtg gaccagcggc
agtacggcga 8040cgtgttcaag ggcgacctga accctaagcc ccagggccag agactgatcg
aggtgtccgt 8100ggaagagaac caccccttca ccctgagagc ccccatccag agaatctacg
gcgtgcggta 8160taccgagact tggagcttcc tgcccagcct gacctgtaca ggcgacgccg
ctcctgccat 8220ccagcacatc tgcctgaagc acaccacctg tttccaggac gtggtggtgg
acgtggactg 8280cgccgagaac accaaagagg accagctggc cgagatcagc taccggttcc
agggcaagaa 8340agaggccgac cagccctgga tcgtggtcaa taccagcacc ctgttcgacg
agctggaact 8400ggaccccccc gagattgaac ccggcgtgct gaaggtgctg cggaccgaga
agcagtacct 8460gggcgtgtac atctggaaca tgcggggctc cgacggcacc tctacctacg
ccaccttcct 8520ggtcacatgg aagggcgacg agaaaacccg gaaccctacc cctgccgtga
cccctcagcc 8580tagaggcgcc gagttccata tgtggaatta ccactcccac gtgttcagcg
tgggcgacac 8640cttcagcctg gccatgcatc tgcagtacaa gatccacgag gcccccttcg
acctgctgct 8700ggaatggctg tacgtgccca tcgaccctac ctgccagccc atgcggctgt
acagcacctg 8760tctgtaccac cccaacgccc ctcagtgcct gagccacatg aacagcggct
gcaccttcac 8820cagccctcac ctggctcaga gggtggccag caccgtgtac cagaattgcg
agcacgccga 8880caactacacc gcctactgcc tgggcatcag ccacatggaa cccagcttcg
gcctgatcct 8940gcacgatggc ggcaccaccc tgaagttcgt ggacacaccc gagagcctga
gcggcctgta 9000cgtgttcgtg gtgtacttca acggccacgt ggaagccgtg gcctacaccg
tggtgtccac 9060cgtggaccac ttcgtgaacg ccatcgagga aagaggcttc ccacccacag
ccggacagcc 9120tccagccacc accaagccca aagaaatcac ccccgtgaac cccggcacca
gccccctgct 9180gagatatgct gcttggacag gcggactggc cgctgtggtg ctgctgtgcc
tggtcatctt 9240cctgatctgc accgccaagc ggatgagagt gaaggcctac cgggtggaca
agtcccccta 9300caaccagagc atgtactacg ccggcctgcc cgtggacgat ttcgaggata
gcgagagcac 9360cgacaccgag gaagagttcg gcaacgccat cggcggatct cacggcggca
gcagctacac 9420cgtgtacatc gacaagacca gataatctag acgtcgcgac cacccaggat
ccgcctataa 9480ctctctacgg ctaacctgaa tggactacga catagtctag tcgacgccac
catgtttctg 9540atccagtgcc tgatcagcgc cgtgatcttc tatattcaag tcacaaacgc
cctgatcttt 9600aagggcgacc acgtgtcact gcaggtcaac agcagcctga ccagcatcct
gatccccatg 9660cagaacgaca attacaccga gatcaagggc cagctggtgt tcatcggcga
gcagctgccc 9720accggcacca attacagcgg caccctggaa ctgctgtacg ccgataccgt
ggccttctgc 9780ttcagaagcg tgcaggtcat cagatacgac ggctgccccc ggatcagaac
cagcgccttc 9840atcagctgcc ggtacaagca cagctggcac tacggcaaca gcaccgaccg
gatcagcacc 9900gaacctgatg ccggcgtgat gctgaagatc accaagcccg gcatcaacga
cgccggcgtg 9960tacgtgctgc tcgtgcggct ggatcacagc agaagcaccg acggcttcat
cctgggcgtg 10020aacgtgtaca ccgccggcag ccaccacaac atccacggcg tgatctacac
cagccccagc 10080ctgcagaacg gctacagcac cagagccctg ttccagcagg ccagactgtg
cgatctgccc 10140gccacaccta agggcagcgg cacaagcctg tttcagcaca tgctggacct
gagagccggc 10200aagagcctgg aagataaccc ctggctgcac gaggacgtgg tcaccaccga
gacaaagagc 10260gtggtcaaag agggcatcga gaaccacgtg taccccaccg acatgagcac
cctgcccgag 10320aagtccctga acgacccccc tgagaacctg ctgatcatca tccccatcgt
ggccagcgtg 10380atgatcctga ccgccatggt catcgtgatc gtgatcagcg tgaagcggcg
gagaatcaag 10440aagcacccca tctaccggcc caacaccaag accagacggg gcatccagaa
cgccacccct 10500gagtccgacg tgatgctgga agccgccatt gcccagctgg ccaccatcag
agaggaaagc 10560ccccctcaca gcgtcgtgaa ccccttcgtg aagtaatcta gacgcggccg
catacagcag 10620caattggcaa gctgcttaca tagaactcgc ggcgattggc atgccgcctt
aaaattttta 10680ttttattttt cttttctttt ccgaatcgga ttttgttttt aatatttcaa
aaaaaaaaaa 10740aaaaaaaaaa aaaaaaaaaa aaagggtcgg catggcatct ccacctcctc
gcggtccgac 10800ctgggcatcc gaaggaggac gcacgtccac tcggatggct aagggagagc
cacgtttaaa 10860ccagctccaa ttcgccctat agtgagtcgt attacgcgcg ctcactggcc
gtcgttttac 10920aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca
gcacatcccc 10980ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc
caacagttgc 11040gcagcctgaa tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg
gcgggtgtgg 11100tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct
cctttcgctt 11160tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta
aatcgggggc 11220tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa
cttgattagg 11280gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct
ttgacgttgg 11340agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc
aaccctatct 11400cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg
ttaaaaaatg 11460agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgctt
acaatttagg 11520tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct
aaatacattc 11580aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat
attgaaaaag 11640gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg
cggcattttg 11700ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg
aagatcagtt 11760gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc
ttgagagttt 11820tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat
gtggcgcggt 11880attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact
attctcagaa 11940tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca
tgacagtaag 12000agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact
tacttctgac 12060aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg
atcatgtaac 12120tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg
agcgtgacac 12180cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg
aactacttac 12240tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg
caggaccact 12300tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag
ccggtgagcg 12360tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc
gtatcgtagt 12420tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga
tcgctgagat 12480aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat
atatacttta 12540gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc
tttttgataa 12600tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag
accccgtaga 12660aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct
gcttgcaaac 12720aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac
caactctttt 12780tccgaaggta actggcttca gcagagcgca gataccaaat actgttcttc
tagtgtagcc 12840gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg
ctctgctaat 12900cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt
tggactcaag 12960acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt
gcacacagcc 13020cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc
tatgagaaag 13080cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca
gggtcggaac 13140aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata
gtcctgtcgg 13200gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg
ggcggagcct 13260atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct
ggccttttgc 13320tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta
ccgcctttga 13380gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag
tgagcgagga 13440agcggaagag cgcccaatac gcaaaccgcc tctccccgcg cgttggccga
ttcattaatg 13500cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg
caattaatgt 13560gagttagctc actcattagg caccccaggc tttacacttt atgctcccgg
ctcgtatgtt 13620gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgacc
atgattacgc 13680caagcgcgca attaaccctc actaaaggga acaaaagctg ggtaccgggc
ccacgcgtaa 13740tacgactcac tatag
13755
80 11459 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide"
80 ataggcggcg
catgagagaa gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga
ggaagacagc ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc
caagcaggtc actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa
actgatcgaa acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc
ccgcagaatg tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga
tccggacaga ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga
taaggaattg gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga
aactgagact atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt
ttaccaggat gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg
agttagagtc gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc
tggagcatat ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa
cataggccta tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag
aaagaagtat ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca
cgagaagagg gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa
gcaaaattac acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag
aatagctatc agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg
cgagggattc ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc
cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga
tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa
cggtcgcacc cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca
ggcatttgct aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg
actacgagat agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac
atctatttat aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc
attcgtgctg cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag
gaaaatgtta gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga
agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc
tctaccacct ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat
gttacaagag gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag
ctacgatggc gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa
gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc
tggccgaaaa gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg
acatgcaata cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa
cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa
cactgatgaa gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta
cgacatcgac aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg
cgagctggtg gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc
cgctccttac caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg
catcattaaa agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg
tgcagaaatt ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt
ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc
ttttgcttgt catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc
agtgctctgc ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca
ttttaaccac gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa
atctgtgact tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc
gaaagagact aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct
cattctcact tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga
aataatgacg gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta
caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac
ccgcacggag gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact
gactgccaag taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga
tgccatcatg aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc
aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac
cactgaacaa tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat
agtattgaac caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc
tgcacccact gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa
catgtacggg ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc
tcgggcagtt gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga
tccgcgcata aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca
taatgaacac ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt
cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg
gcctgaggct accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata
tgacataata tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga
agaccatgcc attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg
cggaacctgt gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg
tgctatagcg cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga
gacggaagtt ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta
caagctttca tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg
tgcaccctca tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat
aaatgctgct aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa
attcccggaa agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg
tgcagctaaa catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga
aggtgacaaa cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa
ttacaagtca gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg
actaacccaa tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc
catatactgc agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga
agcagtggag gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct
ggtgagggtg catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg
caaaactttc tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga
aattaatgcc atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct
cggagaaagc atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac
accacctagc acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg
cctaaaagcc tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta
tagaatcact ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt
gcctgcgtat attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc
ggagccatcg gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac
cgaggatgag accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga
tagcataagt ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat
tcacgggccg ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga
tgtggacagt ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac
gtcagccgag actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc
tgcgcctcga acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc
acttgcaccc agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa
tagggtgatc actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc
ggtctcgaga accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga
ggagtttgag gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt
ttcctccgac accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc
cgaagtggtg ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga
aaaagaagaa ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag
ataccagtcc aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg
cctagggcat tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt
tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc
ctgtaacgcc atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga
gtacgatgcc tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt
ttgccctgca aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg
atcggcagtg ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa
aagaaattgc aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa
tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa
ccccatcagg cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa
agctgctgct ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga
caggtttgta atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga
agaacggccc aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg
cggaatccac cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac
actgtttgat atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg
ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc
tctgaccgcg ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat
tgaggcggct ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt
cggagccatg atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat
tgtaatcgca agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat
tggagatgac aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc
cacctggttg aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta
tttctgtgga gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga
ccccctaaaa aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga
tgacaggaga agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc
agagctgtgc aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat
ggccatgact actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat
aactctctac ggctaacctg aatggactac gacatagtct agtcgacgcc 7560accatggtga
gcaagggcga ggagctgttc accggggtgg tgcccatcct ggtcgagctg 7620gacggcgacg
taaacggcca caagttcagc gtgtccggcg agggcgaggg cgatgccacc 7680tacggcaagc
tgaccctgaa gttcatctgc accaccggca agctgcccgt gccctggccc 7740accctcgtga
ccaccctgac ctacggcgtg cagtgcttca gccgctaccc cgaccacatg 7800aagcagcacg
acttcttcaa gtccgccatg cccgaaggct acgtccagga gcgcaccatc 7860ttcttcaagg
acgacggcaa ctacaagacc cgcgccgagg tgaagttcga gggcgacacc 7920ctggtgaacc
gcatcgagct gaagggcatc gacttcaagg aggacggcaa catcctgggg 7980cacaagctgg
agtacaacta caacagccac aacgtctata tcatggccga caagcagaag 8040aacggcatca
aggtgaactt caagatccgc cacaacatcg aggacggcag cgtgcagctc 8100gccgaccact
accagcagaa cacccccatc ggcgacggcc ccgtgctgct gcccgacaac 8160cactacctga
gcacccagtc cgccctgagc aaagacccca acgagaagcg cgatcacatg 8220gtcctgctgg
agttcgtgac cgccgccggg atcactctcg gcatggacga gctgtacaag 8280tgataatcta
gacggcgcgc ccacccagcg gccgcataca gcagcaattg gcaagctgct 8340tacatagaac
tcgcggcgat tggcatgccg ccttaaaatt tttattttat ttttcttttc 8400ttttccgaat
cggattttgt ttttaatatt tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa 8460aaaaaaaggg
tcggcatggc atctccacct cctcgcggtc cgacctgggc atccgaagga 8520ggacgcacgt
ccactcggat ggctaaggga gagccacgtt taaaccagct ccaattcgcc 8580ctatagtgag
tcgtattacg cgcgctcact ggccgtcgtt ttacaacgtc gtgactggga 8640aaaccctggc
gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 8700taatagcgaa
gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 8760atgggacgcg
ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 8820gaccgctaca
cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 8880cgccacgttc
gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 8940atttagtgct
ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag 9000tgggccatcg
ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 9060tagtggactc
ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga 9120tttataaggg
attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 9180atttaacgcg
aattttaaca aaatattaac gcttacaatt taggtggcac ttttcgggga 9240aatgtgcgcg
gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc 9300atgagacaat
aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt 9360caacatttcc
gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct 9420cacccagaaa
cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt 9480tacatcgaac
tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt 9540tttccaatga
tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac 9600gccgggcaag
agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac 9660tcaccagtca
cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct 9720gccataacca
tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg 9780aaggagctaa
ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg 9840gaaccggagc
tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca 9900atggcaacaa
cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa 9960caattaatag
actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt 10020ccggctggct
ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc 10080attgcagcac
tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg 10140agtcaggcaa
ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt 10200aagcattggt
aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt 10260catttttaat
ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc 10320ccttaacgtg
agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct 10380tcttgagatc
ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta 10440ccagcggtgg
tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc 10500ttcagcagag
cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac 10560ttcaagaact
ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct 10620gctgccagtg
gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat 10680aaggcgcagc
ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg 10740acctacaccg
aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa 10800gggagaaagg
cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg 10860gagcttccag
ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga 10920cttgagcgtc
gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc 10980aacgcggcct
ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct 11040gcgttatccc
ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct 11100cgccgcagcc
gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgccca 11160atacgcaaac
cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg 11220tttcccgact
ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta gctcactcat 11280taggcacccc
aggctttaca ctttatgctc ccggctcgta tgttgtgtgg aattgtgagc 11340ggataacaat
ttcacacagg aaacagctat gaccatgatt acgccaagcg cgcaattaac 11400cctcactaaa
gggaacaaaa gctgggtacc gggcccacgc gtaatacgac tcactatag
11459
81 3567 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide"
81 ataggcggcg catgagagaa
gcccagacca attacctacc caaataggag aaagttcacg 60ttgacatcga ggaagacagc
ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc caagcaggtc
actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa actgatcgaa
acggaggtgg acccatccga cacgatcctt gacattggac 240ggaccgacca tgttcccgtt
ccagccaatg tatccgatgc agccaatgcc ctatcgcaac 300ccgttcgcgg ccccgcgcag
gccctggttc cccagaaccg acccttttct ggcgatgcag 360gtgcaggaat taacccgctc
gatggctaac ctgacgttca agcaacgccg ggacgcgcca 420cctgaggggc catccgctaa
gaaaccgaag aaggaggcct cgcaaaaaca gaaaggggga 480ggccaaggga agaagaagaa
gaaccaaggg aagaagaagg ctaagacagg gccgcctaat 540ccgaaggcac agaatggaaa
caagaagaag accaacaaga aaccaggcaa gagacagcgc 600atggtcatga aattggaatc
tgacaagacg ttcccaatca tgttggaagg gaagataaac 660ggctacgctt gtgtggtcgg
agggaagtta ttcaggccga tgggtgtgga aggcaagatc 720gacaacgacg ttctggccgc
gcttaagacg aagaaagcat ccaaatacga tcttgagtat 780gcagatgtgc cacagaacat
gcgggccgat acattcaaat acacccatga gaaaccccaa 840ggctattaca gctggcatca
tggagcagtc caatatgaaa atgggcgttt cacggtgccg 900aaaggagttg gggccaaggg
agacagcgga cgacccattc tggataacca gggacgggtg 960gtcgctattg tgctgggagg
tgtgaatgaa ggatctagga cagccctttc agtcgtcatg 1020tggaacgaga agggagttac
cgtgaagtat actccggaga actgcgagca atggtaatag 1080taagcggccg catacagcag
caattggcaa gctgcttaca tagaactcgc ggcgattggc 1140atgccgcctt aaaattttta
ttttattttt cttttctttt ccgaatcgga ttttgttttt 1200aatatttcaa aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa aaagggtcgg catggcatct 1260ccacctcctc gcggtccgac
ctgggcatcc gaaggaggac gcacgtccac tcggatggct 1320aagggagagc cacgtttaaa
cacgtgatat ctggcctcat gggccttcct ttcactgccc 1380gctttccagt cgggaaacct
gtcgtgccag ctgcattaac atggtcatag ctgtttcctt 1440gcgtattggg cgctctccgc
ttcctcgctc actgactcgc tgcgctcggt cgttcgggta 1500aagcctgggg tgcctaatga
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 1560cgttgctggc gtttttccat
aggctccgcc cccctgacga gcatcacaaa aatcgacgct 1620caagtcagag gtggcgaaac
ccgacaggac tataaagata ccaggcgttt ccccctggaa 1680gctccctcgt gcgctctcct
gttccgaccc tgccgcttac cggatacctg tccgcctttc 1740tcccttcggg aagcgtggcg
ctttctcata gctcacgctg taggtatctc agttcggtgt 1800aggtcgttcg ctccaagctg
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 1860ccttatccgg taactatcgt
cttgagtcca acccggtaag acacgactta tcgccactgg 1920cagcagccac tggtaacagg
attagcagag cgaggtatgt aggcggtgct acagagttct 1980tgaagtggtg gcctaactac
ggctacacta gaagaacagt atttggtatc tgcgctctgc 2040tgaagccagt taccttcgga
aaaagagttg gtagctcttg atccggcaaa caaaccaccg 2100ctggtagcgg tggttttttt
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 2160aagaagatcc tttgatcttt
tctacggggt ctgacgctca gtggaacgaa aactcacgtt 2220aagggatttt ggtcatgaga
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 2280aatgaagttt taaatcaatc
taaagtatat atgagtaaac ttggtctgac agttattaga 2340aaaattcatc cagcagacga
taaaacgcaa tacgctggct atccggtgcc gcaatgccat 2400acagcaccag aaaacgatcc
gcccattcgc cgcccagttc ttccgcaata tcacgggtgg 2460ccagcgcaat atcctgataa
cgatccgcca cgcccagacg gccgcaatca ataaagccgc 2520taaaacggcc attttccacc
ataatgttcg gcaggcacgc atcaccatgg gtcaccacca 2580gatcttcgcc atccggcatg
ctcgctttca gacgcgcaaa cagctctgcc ggtgccaggc 2640cctgatgttc ttcatccaga
tcatcctgat ccaccaggcc cgcttccata cgggtacgcg 2700cacgttcaat acgatgtttc
gcctgatgat caaacggaca ggtcgccggg tccagggtat 2760gcagacgacg catggcatcc
gccataatgc tcactttttc tgccggcgcc agatggctag 2820acagcagatc ctgacccggc
acttcgccca gcagcagcca atcacggccc gcttcggtca 2880ccacatccag caccgccgca
cacggaacac cggtggtggc cagccagctc agacgcgccg 2940cttcatcctg cagctcgttc
agcgcaccgc tcagatcggt tttcacaaac agcaccggac 3000gaccctgcgc gctcagacga
aacaccgccg catcagagca gccaatggtc tgctgcgccc 3060aatcatagcc aaacagacgt
tccacccacg ctgccgggct acccgcatgc aggccatcct 3120gttcaatcat actcttcctt
tttcaatatt attgaagcat ttatcagggt tattgtctca 3180tgagcggata catatttgaa
tgtatttaga aaaataaaca aataggggtt ccgcgcacat 3240ttccccgaaa agtgccacct
aaattgtaag cgttaatatt ttgttaaaat tcgcgttaaa 3300tttttgttaa atcagctcat
tttttaacca ataggccgaa atcggcaaaa tcccttataa 3360atcaaaagaa tagaccgaga
tagggttgag tggccgctac agggcgctcc cattcgccat 3420tcaggctgcg caactgttgg
gaagggcgtt tcggtgcggg cctcttcgct attacgccag 3480ctggcgaaag ggggatgtgc
tgcaaggcga ttaagttggg taacgccagg gttttcccag 3540tcacacgcgt aatacgactc
actatag 3567
82 5685 DNA Artificial Sequence source/note="Description of Artificial Sequence Synthetic polynucleotide"
82 ataggcggcg catgagagaa gcccagacca attacctacc caaataggag
aaagttcacg 60ttgacatcga ggaagacagc ccattcctca gagctttgca gcggagcttc
ccgcagtttg 120aggtagaagc caagcaggtc actgataatg accatgctaa tgccagagcg
ttttcgcatc 180tggcttcaaa actgatcgaa acggaggtgg acccatccga cacgatcctt
gacattggac 240ggaccgacca tgtcactagt gaccaccatg tgtctgctcg ccaatgtgac
gttcccatgt 300gctcaaccac caatttgcta cgacagaaaa ccagcagaga ctttggccat
gctcagcgtt 360aacgttgaca acccgggcta cgatgagctg ctggaagcag ctgttaagtg
ccccggaagg 420aaaaggagat ccaccgagga gctgtttaat gagtataagc taacgcgccc
ttacatggcc 480agatgcatca gatgtgcagt tgggagctgc catagtccaa tagcaatcga
ggcagtaaag 540agcgacgggc acgacggtta tgttagactt cagacttcct cgcagtatgg
cctggattcc 600tccggcaact taaagggcag gaccatgcgg tatgacatgc acgggaccat
taaagagata 660ccactacatc aagtgtcact ctatacatct cgcccgtgtc acattgtgga
tgggcacggt 720tatttcctgc ttgccaggtg cccggcaggg gactccatca ccatggaatt
taagaaagat 780tccgtcagac actcctgctc ggtgccgtat gaagtgaaat ttaatcctgt
aggcagagaa 840ctctatactc atcccccaga acacggagta gagcaagcgt gccaagtcta
cgcacatgat 900gcacagaaca gaggagctta tgtcgagatg cacctcccgg gctcagaagt
ggacagcagt 960ttggtttcct tgagcggcag ttcagtcacc gtgacacctc ctgatgggac
tagcgccctg 1020gtggaatgcg agtgtggcgg cacaaagatc tccgagacca tcaacaagac
aaaacagttc 1080agccagtgca caaagaagga gcagtgcaga gcatatcggc tgcagaacga
taagtgggtg 1140tataattctg acaaactgcc caaagcagcg ggagccacct taaaaggaaa
actgcatgtc 1200ccattcttgc tggcagacgg caaatgcacc gtgcctctag caccagaacc
tatgataacc 1260ttcggtttca gatcagtgtc actgaaactg caccctaaga atcccacata
tctaatcacc 1320cgccaacttg ctgatgagcc tcactacacg cacgagctca tatctgaacc
agctgttagg 1380aattttaccg tcaccgaaaa agggtgggag tttgtatggg gaaaccaccc
gccgaaaagg 1440ttttgggcac aggaaacagc acccggaaat ccacatgggc taccgcacga
ggtgataact 1500cattattacc acagataccc tatgtccacc atcctgggtt tgtcaatttg
tgccgccatt 1560gcaaccgttt ccgttgcagc gtctacctgg ctgttttgca gatctagagt
tgcgtgccta 1620actccttacc ggctaacacc taacgctagg ataccatttt gtctggctgt
gctttgctgc 1680gcccgcactg cccgggccga gaccacctgg gagtccttgg atcacctatg
gaacaataac 1740caacagatgt tctggattca attgctgatc cctctggccg ccttgatcgt
agtgactcgc 1800ctgctcaggt gcgtgtgctg tgtcgtgcct tttttagtca tggccggcgc
cgcaggcgcc 1860ggcgcctacg agcacgcgac cacgatgccg agccaagcgg gaatctcgta
taacactata 1920gtcaacagag caggctacgc accactccct atcagcataa caccaacaaa
gatcaagctg 1980atacctacag tgaacttgga gtacgtcacc tgccactaca aaacaggaat
ggattcacca 2040gccatcaaat gctgcggatc tcaggaatgc actccaactt acaggcctga
tgaacagtgc 2100aaagtcttca caggggttta cccgttcatg tggggtggtg catattgctt
ttgcgacact 2160gagaacaccc aagtcagcaa ggcctacgta atgaaatctg acgactgcct
tgcggatcat 2220gctgaagcat ataaagcgca cacagcctca gtgcaggcgt tcctcaacat
cacagtggga 2280gaacactcta ttgtgactac cgtgtatgtg aatggagaaa ctcctgtgaa
tttcaatggg 2340gtcaaaataa ctgcaggtcc gctttccaca gcttggacac cctttgatcg
caaaatcgtg 2400cagtatgccg gggagatcta taattatgat tttcctgagt atggggcagg
acaaccagga 2460gcatttggag atatacaatc cagaacagtc tcaagctctg atctgtatgc
caataccaac 2520ctagtgctgc agagacccaa agcaggagcg atccacgtgc catacactca
ggcaccttcg 2580ggttttgagc aatggaagaa agataaagct ccatcattga aatttaccgc
ccctttcgga 2640tgcgaaatat atacaaaccc cattcgcgcc gaaaactgtg ctgtagggtc
aattccatta 2700gcctttgaca ttcccgacgc cttgttcacc agggtgtcag aaacaccgac
actttcagcg 2760gccgaatgca ctcttaacga gtgcgtgtat tcttccgact ttggtgggat
cgccacggtc 2820aagtactcgg ccagcaagtc aggcaagtgc gcagtccatg tgccatcagg
gactgctacc 2880ctaaaagaag cagcagtcga gctaaccgag caagggtcgg cgactatcca
tttctcgacc 2940gcaaatatcc acccggagtt caggctccaa atatgcacat catatgttac
gtgcaaaggt 3000gattgtcacc ccccgaaaga ccatattgtg acacaccctc agtatcacgc
ccaaacattt 3060acagccgcgg tgtcaaaaac cgcgtggacg tggttaacat ccctgctggg
aggatcagcc 3120gtaattatta taattggctt ggtgctggct actattgtgg ccatgtacgt
gctgaccaac 3180cagaaacata attaatagta agcggccgca tacagcagca attggcaagc
tgcttacata 3240gaactcgcgg cgattggcat gccgccttaa aatttttatt ttatttttct
tttcttttcc 3300gaatcggatt ttgtttttaa tatttcaaaa aaaaaaaaaa aaaaaaaaaa
aaaaaaaaaa 3360agggtcggca tggcatctcc acctcctcgc ggtccgacct gggcatccga
aggaggacgc 3420acgtccactc ggatggctaa gggagagcca cgtttaaaca cgtgatatct
ggcctcatgg 3480gccttccttt cactgcccgc tttccagtcg ggaaacctgt cgtgccagct
gcattaacat 3540ggtcatagct gtttccttgc gtattgggcg ctctccgctt cctcgctcac
tgactcgctg 3600cgctcggtcg ttcgggtaaa gcctggggtg cctaatgagc aaaaggccag
caaaaggcca 3660ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc
cctgacgagc 3720atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta
taaagatacc 3780aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg
ccgcttaccg 3840gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc
tcacgctgta 3900ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac
gaaccccccg 3960ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac
ccggtaagac 4020acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg
aggtatgtag 4080gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga
agaacagtat 4140ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt
agctcttgat 4200ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag
cagattacgc 4260gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct
gacgctcagt 4320ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg
atcttcacct 4380agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat
gagtaaactt 4440ggtctgacag ttattagaaa aattcatcca gcagacgata aaacgcaata
cgctggctat 4500ccggtgccgc aatgccatac agcaccagaa aacgatccgc ccattcgccg
cccagttctt 4560ccgcaatatc acgggtggcc agcgcaatat cctgataacg atccgccacg
cccagacggc 4620cgcaatcaat aaagccgcta aaacggccat tttccaccat aatgttcggc
aggcacgcat 4680caccatgggt caccaccaga tcttcgccat ccggcatgct cgctttcaga
cgcgcaaaca 4740gctctgccgg tgccaggccc tgatgttctt catccagatc atcctgatcc
accaggcccg 4800cttccatacg ggtacgcgca cgttcaatac gatgtttcgc ctgatgatca
aacggacagg 4860tcgccgggtc cagggtatgc agacgacgca tggcatccgc cataatgctc
actttttctg 4920ccggcgccag atggctagac agcagatcct gacccggcac ttcgcccagc
agcagccaat 4980cacggcccgc ttcggtcacc acatccagca ccgccgcaca cggaacaccg
gtggtggcca 5040gccagctcag acgcgccgct tcatcctgca gctcgttcag cgcaccgctc
agatcggttt 5100tcacaaacag caccggacga ccctgcgcgc tcagacgaaa caccgccgca
tcagagcagc 5160caatggtctg ctgcgcccaa tcatagccaa acagacgttc cacccacgct
gccgggctac 5220ccgcatgcag gccatcctgt tcaatcatac tcttcctttt tcaatattat
tgaagcattt 5280atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa
aataaacaaa 5340taggggttcc gcgcacattt ccccgaaaag tgccacctaa attgtaagcg
ttaatatttt 5400gttaaaattc gcgttaaatt tttgttaaat cagctcattt tttaaccaat
aggccgaaat 5460cggcaaaatc ccttataaat caaaagaata gaccgagata gggttgagtg
gccgctacag 5520ggcgctccca ttcgccattc aggctgcgca actgttggga agggcgtttc
ggtgcgggcc 5580tcttcgctat tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt
aagttgggta 5640acgccagggt tttcccagtc acacgcgtaa tacgactcac tatag
5685
83 13364 DNA Artificial Sequence source/note="Description of Artificial Sequence Synthetic polynucleotide"
83 ataggcggcg
catgagagaa gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga
ggaagacagc ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc
caagcaggtc actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa
actgatcgaa acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc
ccgcagaatg tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga
tccggacaga ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga
taaggaattg gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga
aactgagact atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt
ttaccaggat gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg
agttagagtc gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc
tggagcatat ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa
cataggccta tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag
aaagaagtat ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca
cgagaagagg gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa
gcaaaattac acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag
aatagctatc agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg
cgagggattc ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc
cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga
tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa
cggtcgcacc cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca
ggcatttgct aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg
actacgagat agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac
atctatttat aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc
attcgtgctg cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag
gaaaatgtta gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga
agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc
tctaccacct ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat
gttacaagag gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag
ctacgatggc gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa
gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc
tggccgaaaa gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg
acatgcaata cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa
cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa
cactgatgaa gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta
cgacatcgac aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg
cgagctggtg gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc
cgctccttac caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg
catcattaaa agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg
tgcagaaatt ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt
ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc
ttttgcttgt catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc
agtgctctgc ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca
ttttaaccac gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa
atctgtgact tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc
gaaagagact aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct
cattctcact tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga
aataatgacg gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta
caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac
ccgcacggag gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact
gactgccaag taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga
tgccatcatg aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc
aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac
cactgaacaa tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat
agtattgaac caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc
tgcacccact gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa
catgtacggg ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc
tcgggcagtt gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga
tccgcgcata aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca
taatgaacac ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt
cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg
gcctgaggct accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata
tgacataata tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga
agaccatgcc attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg
cggaacctgt gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg
tgctatagcg cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga
gacggaagtt ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta
caagctttca tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg
tgcaccctca tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat
aaatgctgct aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa
attcccggaa agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg
tgcagctaaa catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga
aggtgacaaa cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa
ttacaagtca gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg
actaacccaa tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc
catatactgc agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga
agcagtggag gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct
ggtgagggtg catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg
caaaactttc tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga
aattaatgcc atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct
cggagaaagc atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac
accacctagc acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg
cctaaaagcc tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta
tagaatcact ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt
gcctgcgtat attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc
ggagccatcg gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac
cgaggatgag accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga
tagcataagt ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat
tcacgggccg ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga
tgtggacagt ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac
gtcagccgag actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc
tgcgcctcga acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc
acttgcaccc agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa
tagggtgatc actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc
ggtctcgaga accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga
ggagtttgag gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt
ttcctccgac accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc
cgaagtggtg ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga
aaaagaagaa ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag
ataccagtcc aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg
cctagggcat tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt
tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc
ctgtaacgcc atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga
gtacgatgcc tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt
ttgccctgca aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg
atcggcagtg ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa
aagaaattgc aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa
tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa
ccccatcagg cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa
agctgctgct ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga
caggtttgta atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga
agaacggccc aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg
cggaatccac cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac
actgtttgat atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg
ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc
tctgaccgcg ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat
tgaggcggct ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt
cggagccatg atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat
tgtaatcgca agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat
tggagatgac aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc
cacctggttg aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta
tttctgtgga gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga
ccccctaaaa aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga
tgacaggaga agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc
agagctgtgc aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat
ggccatgact actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat
aactctctac ggctaacctg aatggactac gacatagtct agtcgacgcc 7560accatgaggc
ctggcctgcc ctcctacctg atcatcctgg ccgtgtgcct gttcagccac 7620ctgctgtcca
gcagatacgg cgccgaggcc gtgagcgagc ccctggacaa ggctttccac 7680ctgctgctga
acacctacgg cagacccatc cggtttctgc gggagaacac cacccagtgc 7740acctacaaca
gcagcctgcg gaacagcacc gtcgtgagag agaacgccat cagcttcaac 7800tttttccaga
gctacaacca gtactacgtg ttccacatgc ccagatgcct gtttgccggc 7860cctctggccg
agcagttcct gaaccaggtg gacctgaccg agacactgga aagataccag 7920cagcggctga
atacctacgc cctggtgtcc aaggacctgg ccagctaccg gtcctttagc 7980cagcagctca
aggctcagga tagcctcggc gagcagccta ccaccgtgcc ccctcccatc 8040gacctgagca
tcccccacgt gtggatgcct ccccagacca cccctcacgg ctggaccgag 8100agccacacca
cctccggcct gcacagaccc cacttcaacc agacctgcat cctgttcgac 8160ggccacgacc
tgctgtttag caccgtgacc ccctgcctgc accagggctt ctacctgatc 8220gacgagctga
gatacgtgaa gatcaccctg accgaggatt tcttcgtggt caccgtgtcc 8280atcgacgacg
acacccccat gctgctgatc ttcggccacc tgcccagagt gctgttcaag 8340gccccctacc
agcgggacaa cttcatcctg cggcagaccg agaagcacga gctgctggtg 8400ctggtcaaga
aggaccagct gaaccggcac tcctacctga aggaccccga cttcctggac 8460gccgccctgg
acttcaacta cctggacctg agcgccctgc tgagaaacag cttccacaga 8520tacgccgtgg
acgtgctgaa gtccggacgg tgccagatgc tcgatcggcg gaccgtggag 8580atggccttcg
cctatgccct cgccctgttc gccgctgcca gacaggaaga ggctggcgcc 8640caggtgtcag
tgcccagagc cctggataga caggccgccc tgctgcagat ccaggaattc 8700atgatcacct
gcctgagcca gaccccccct agaaccaccc tgctgctgta ccccacagcc 8760gtggatctgg
ccaagagggc cctgtggacc cccaaccaga tcaccgacat cacaagcctc 8820gtgcggctcg
tgtacatcct gagcaagcag aaccagcagc acctgatccc ccagtgggcc 8880ctgagacaga
tcgccgactt cgccctgaag ctgcacaaga cccatctggc cagctttctg 8940agcgccttcg
ccaggcagga actgtacctg atgggcagcc tggtccacag catgctggtg 9000cataccaccg
agcggcggga gatcttcatc gtggagacag gcctgtgtag cctggccgag 9060ctgtcccact
ttacccagct gctggcccac cctcaccacg agtacctgag cgacctgtac 9120accccctgca
gcagcagcgg cagacgggac cacagcctgg aacggctgac cagactgttc 9180cccgatgcca
ccgtgcctgc tacagtgcct gccgccctgt ccatcctgtc caccatgcag 9240cccagcaccc
tggaaacctt ccccgacctg ttctgcctgc ccctgggcga gagctttagc 9300gccctgaccg
tgtccgagca cgtgtcctac atcgtgacca atcagtacct gatcaagggc 9360atcagctacc
ccgtgtccac cacagtcgtg ggccagagcc tgatcatcac ccagaccgac 9420agccagacca
agtgcgagct gacccggaac atgcacacca cacacagcat caccgtggcc 9480ctgaacatca
gcctggaaaa ctgcgctttc tgtcagtctg ccctgctgga atacgacgat 9540acccagggcg
tgatcaacat catgtacatg cacgacagcg acgacgtgct gttcgccctg 9600gacccctaca
acgaggtggt ggtgtccagc ccccggaccc actacctgat gctgctgaag 9660aacggcaccg
tgctggaagt gaccgacgtg gtggtggacg ccaccgacag cagactgctg 9720atgatgagcg
tgtacgccct gagcgccatc atcggcatct acctgctgta ccggatgctg 9780aaaacctgct
gataatctag acggcgcgcc cacccagcgg ccgcctataa ctctctacgg 9840ctaacctgaa
tggactacga catagtctag tcgacgccac catgtgcaga aggcccgact 9900gcggcttcag
cttcagccct ggacccgtga tcctgctgtg gtgctgcctg ctgctgccta 9960tcgtgtcctc
tgccgccgtg tctgtggccc ctacagccgc cgagaaggtg ccagccgagt 10020gccccgagct
gaccagaaga tgcctgctgg gcgaggtgtt cgagggcgac aagtacgaga 10080gctggctgcg
gcccctggtc aacgtgaccg gcagagatgg ccccctgagc cagctgatcc 10140ggtacagacc
cgtgaccccc gaggccgcca atagcgtgct gctggacgag gccttcctgg 10200ataccctggc
cctgctgtac aacaaccccg accagctgag agccctgctg accctgctgt 10260ccagcgacac
cgcccccaga tggatgaccg tgatgcgggg ctacagcgag tgtggagatg 10320gcagccctgc
cgtgtacacc tgcgtggacg acctgtgcag aggctacgac ctgaccagac 10380tgagctacgg
ccggtccatc ttcacagagc acgtgctggg cttcgagctg gtgcccccca 10440gcctgttcaa
cgtggtggtg gccatccgga acgaggccac cagaaccaac agagccgtgc 10500ggctgcctgt
gtctacagcc gctgcacctg agggcatcac actgttctac ggcctgtaca 10560acgccgtgaa
agagttctgc ctccggcacc agctggatcc ccccctgctg agacacctgg 10620acaagtacta
cgccggcctg cccccagagc tgaagcagac cagagtgaac ctgcccgccc 10680acagcagata
tggccctcag gccgtggacg ccagatgata atctagacgg cgcgcccacc 10740cacctgcagg
atacagcagc aattggcaag ctgcttacat agaactcgcg gcgattggca 10800tgccgcctta
aaatttttat tttatttttc ttttcttttc cgaatcggat tttgttttta 10860atatttcaaa
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagggtcggc atggcatctc 10920cacctcctcg
cggtccgacc tgggcatccg aaggaggacg cacgtccact cggatggcta 10980agggagagcc
acgtttaaac gctagagcaa gacgtttccc gttgaatatg gctcataaca 11040ccccttgtat
tactgtttat gtaagcagac agttttattg ttcatgatga tatattttta 11100tcttgtgcaa
tgtaacatca gagattttga gacacaacgt ggctttgttg aataaatcga 11160acttttgctg
agttgaagga tcagatcacg catcttcccg acaacgcaga ccgttccgtg 11220gcaaagcaaa
agttcaaaat caccaactgg tccacctaca acaaagctct catcaaccgt 11280ggctccctca
ctttctggct ggatgatggg gcgattcagg cctggtatga gtcagcaaca 11340ccttcttcac
gaggcagacc tcagcgctag cggagtgtat actggcttac tatgttggca 11400ctgatgaggg
tgtcagtgaa gtgcttcatg tggcaggaga aaaaaggctg caccggtgcg 11460tcagcagaat
atgtgataca ggatatattc cgcttcctcg ctcactgact cgctacgctc 11520ggtcgttcga
ctgcggcgag cggaaatggc ttacgaacgg ggcggagatt tcctggaaga 11580tgccaggaag
atacttaaca gggaagtgag agggccgcgg caaagccgtt tttccatagg 11640ctccgccccc
ctgacaagca tcacgaaatc tgacgctcaa atcagtggtg gcgaaacccg 11700acaggactat
aaagatacca ggcgtttccc ctggcggctc cctcgtgcgc tctcctgttc 11760ctgcctttcg
gtttaccggt gtcattccgc tgttatggcc gcgtttgtct cattccacgc 11820ctgacactca
gttccgggta ggcagttcgc tccaagctgg actgtatgca cgaacccccc 11880gttcagtccg
accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggaaaga 11940catgcaaaag
caccactggc agcagccact ggtaattgat ttagaggagt tagtcttgaa 12000gtcatgcgcc
ggttaaggct aaactgaaag gacaagtttt ggtgactgcg ctcctccaag 12060ccagttacct
cggttcaaag agttggtagc tcagagaacc ttcgaaaaac cgccctgcaa 12120ggcggttttt
tcgttttcag agcaagagat tacgcgcaga ccaaaacgat ctcaagaaga 12180tcatcttatt
aaggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt 12240catgagatta
tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa 12300atcaatctaa
agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga 12360ggcacctatc
tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt 12420gtagataact
acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg 12480agacccacgc
tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga 12540gcgcagaagt
ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga 12600agctagagta
agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctgcagg 12660catcgtggtg
tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc 12720aaggcgagtt
acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc 12780gatcgttgtc
agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca 12840taattctctt
actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac 12900caagtcattc
tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaacacg 12960ggataatacc
gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc 13020ggggcgaaaa
ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg 13080tgcacccaac
tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac 13140aggaaggcaa
aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat 13200actcttcctt
tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata 13260catatttgaa
tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa 13320agtgccacct
gacgtgtcga gacgcgtaat acgactcact atag
13364
84 13283 DNA Artificial Sequence source/note="Description of Artificial Sequence Synthetic polynucleotide"
84 ataggcggcg catgagagaa
gcccagacca attacctacc caaaatggag aaagttcacg 60ttgacatcga ggaagacagc
ccattcctca gagctttgca gcggagcttc ccgcagtttg 120aggtagaagc caagcaggtc
actgataatg accatgctaa tgccagagcg ttttcgcatc 180tggcttcaaa actgatcgaa
acggaggtgg acccatccga cacgatcctt gacattggaa 240gtgcgcccgc ccgcagaatg
tattctaagc acaagtatca ttgtatctgt ccgatgagat 300gtgcggaaga tccggacaga
ttgtataagt atgcaactaa gctgaagaaa aactgtaagg 360aaataactga taaggaattg
gacaagaaaa tgaaggagct cgccgccgtc atgagcgacc 420ctgacctgga aactgagact
atgtgcctcc acgacgacga gtcgtgtcgc tacgaagggc 480aagtcgctgt ttaccaggat
gtatacgcgg ttgacggacc gacaagtctc tatcaccaag 540ccaataaggg agttagagtc
gcctactgga taggctttga caccacccct tttatgttta 600agaacttggc tggagcatat
ccatcatact ctaccaactg ggccgacgaa accgtgttaa 660cggctcgtaa cataggccta
tgcagctctg acgttatgga gcggtcacgt agagggatgt 720ccattcttag aaagaagtat
ttgaaaccat ccaacaatgt tctattctct gttggctcga 780ccatctacca cgagaagagg
gacttactga ggagctggca cctgccgtct gtatttcact 840tacgtggcaa gcaaaattac
acatgtcggt gtgagactat agttagttgc gacgggtacg 900tcgttaaaag aatagctatc
agtccaggcc tgtatgggaa gccttcaggc tatgctgcta 960cgatgcaccg cgagggattc
ttgtgctgca aagtgacaga cacattgaac ggggagaggg 1020tctcttttcc cgtgtgcacg
tatgtgccag ctacattgtg tgaccaaatg actggcatac 1080tggcaacaga tgtcagtgcg
gacgacgcgc aaaaactgct ggttgggctc aaccagcgta 1140tagtcgtcaa cggtcgcacc
cagagaaaca ccaataccat gaaaaattac cttttgcccg 1200tagtggccca ggcatttgct
aggtgggcaa aggaatataa ggaagatcaa gaagatgaaa 1260ggccactagg actacgagat
agacagttag tcatggggtg ttgttgggct tttagaaggc 1320acaagataac atctatttat
aagcgcccgg atacccaaac catcatcaaa gtgaacagcg 1380atttccactc attcgtgctg
cccaggatag gcagtaacac attggagatc gggctgagaa 1440caagaatcag gaaaatgtta
gaggagcaca aggagccgtc acctctcatt accgccgagg 1500acgtacaaga agctaagtgc
gcagccgatg aggctaagga ggtgcgtgaa gccgaggagt 1560tgcgcgcagc tctaccacct
ttggcagctg atgttgagga gcccactctg gaagccgatg 1620tagacttgat gttacaagag
gctggggccg gctcagtgga gacacctcgt ggcttgataa 1680aggttaccag ctacgatggc
gaggacaaga tcggctctta cgctgtgctt tctccgcagg 1740ctgtactcaa gagtgaaaaa
ttatcttgca tccaccctct cgctgaacaa gtcatagtga 1800taacacactc tggccgaaaa
gggcgttatg ccgtggaacc ataccatggt aaagtagtgg 1860tgccagaggg acatgcaata
cccgtccagg actttcaagc tctgagtgaa agtgccacca 1920ttgtgtacaa cgaacgtgag
ttcgtaaaca ggtacctgca ccatattgcc acacatggag 1980gagcgctgaa cactgatgaa
gaatattaca aaactgtcaa gcccagcgag cacgacggcg 2040aatacctgta cgacatcgac
aggaaacagt gcgtcaagaa agaactagtc actgggctag 2100ggctcacagg cgagctggtg
gatcctccct tccatgaatt cgcctacgag agtctgagaa 2160cacgaccagc cgctccttac
caagtaccaa ccataggggt gtatggcgtg ccaggatcag 2220gcaagtctgg catcattaaa
agcgcagtca ccaaaaaaga tctagtggtg agcgccaaga 2280aagaaaactg tgcagaaatt
ataagggacg tcaagaaaat gaaagggctg gacgtcaatg 2340ccagaactgt ggactcagtg
ctcttgaatg gatgcaaaca ccccgtagag accctgtata 2400ttgacgaagc ttttgcttgt
catgcaggta ctctcagagc gctcatagcc attataagac 2460ctaaaaaggc agtgctctgc
ggggatccca aacagtgcgg tttttttaac atgatgtgcc 2520tgaaagtgca ttttaaccac
gagatttgca cacaagtctt ccacaaaagc atctctcgcc 2580gttgcactaa atctgtgact
tcggtcgtct caaccttgtt ttacgacaaa aaaatgagaa 2640cgacgaatcc gaaagagact
aagattgtga ttgacactac cggcagtacc aaacctaagc 2700aggacgatct cattctcact
tgtttcagag ggtgggtgaa gcagttgcaa atagattaca 2760aaggcaacga aataatgacg
gcagctgcct ctcaagggct gacccgtaaa ggtgtgtatg 2820ccgttcggta caaggtgaat
gaaaatcctc tgtacgcacc cacctcagaa catgtgaacg 2880tcctactgac ccgcacggag
gaccgcatcg tgtggaaaac actagccggc gacccatgga 2940taaaaacact gactgccaag
taccctggga atttcactgc cacgatagag gagtggcaag 3000cagagcatga tgccatcatg
aggcacatct tggagagacc ggaccctacc gacgtcttcc 3060agaataaggc aaacgtgtgt
tgggccaagg ctttagtgcc ggtgctgaag accgctggca 3120tagacatgac cactgaacaa
tggaacactg tggattattt tgaaacggac aaagctcact 3180cagcagagat agtattgaac
caactatgcg tgaggttctt tggactcgat ctggactccg 3240gtctattttc tgcacccact
gttccgttat ccattaggaa taatcactgg gataactccc 3300cgtcgcctaa catgtacggg
ctgaataaag aagtggtccg tcagctctct cgcaggtacc 3360cacaactgcc tcgggcagtt
gccactggaa gagtctatga catgaacact ggtacactgc 3420gcaattatga tccgcgcata
aacctagtac ctgtaaacag aagactgcct catgctttag 3480tcctccacca taatgaacac
ccacagagtg acttttcttc attcgtcagc aaattgaagg 3540gcagaactgt cctggtggtc
ggggaaaagt tgtccgtccc aggcaaaatg gttgactggt 3600tgtcagaccg gcctgaggct
accttcagag ctcggctgga tttaggcatc ccaggtgatg 3660tgcccaaata tgacataata
tttgttaatg tgaggacccc atataaatac catcactatc 3720agcagtgtga agaccatgcc
attaagctta gcatgttgac caagaaagct tgtctgcatc 3780tgaatcccgg cggaacctgt
gtcagcatag gttatggtta cgctgacagg gccagcgaaa 3840gcatcattgg tgctatagcg
cggcagttca agttttcccg ggtatgcaaa ccgaaatcct 3900cacttgaaga gacggaagtt
ctgtttgtat tcattgggta cgatcgcaag gcccgtacgc 3960acaatcctta caagctttca
tcaaccttga ccaacattta tacaggttcc agactccacg 4020aagccggatg tgcaccctca
tatcatgtgg tgcgagggga tattgccacg gccaccgaag 4080gagtgattat aaatgctgct
aacagcaaag gacaacctgg cggaggggtg tgcggagcgc 4140tgtataagaa attcccggaa
agcttcgatt tacagccgat cgaagtagga aaagcgcgac 4200tggtcaaagg tgcagctaaa
catatcattc atgccgtagg accaaacttc aacaaagttt 4260cggaggttga aggtgacaaa
cagttggcag aggcttatga gtccatcgct aagattgtca 4320acgataacaa ttacaagtca
gtagcgattc cactgttgtc caccggcatc ttttccggga 4380acaaagatcg actaacccaa
tcattgaacc atttgctgac agctttagac accactgatg 4440cagatgtagc catatactgc
agggacaaga aatgggaaat gactctcaag gaagcagtgg 4500ctaggagaga agcagtggag
gagatatgca tatccgacga ctcttcagtg acagaacctg 4560atgcagagct ggtgagggtg
catccgaaga gttctttggc tggaaggaag ggctacagca 4620caagcgatgg caaaactttc
tcatatttgg aagggaccaa gtttcaccag gcggccaagg 4680atatagcaga aattaatgcc
atgtggcccg ttgcaacgga ggccaatgag caggtatgca 4740tgtatatcct cggagaaagc
atgagcagta ttaggtcgaa atgccccgtc gaagagtcgg 4800aagcctccac accacctagc
acgctgcctt gcttgtgcat ccatgccatg actccagaaa 4860gagtacagcg cctaaaagcc
tcacgtccag aacaaattac tgtgtgctca tcctttccat 4920tgccgaagta tagaatcact
ggtgtgcaga agatccaatg ctcccagcct atattgttct 4980caccgaaagt gcctgcgtat
attcatccaa ggaagtatct cgtggaaaca ccaccggtag 5040acgagactcc ggagccatcg
gcagagaacc aatccacaga ggggacacct gaacaaccac 5100cacttataac cgaggatgag
accaggacta gaacgcctga gccgatcatc atcgaagagg 5160aagaagagga tagcataagt
ttgctgtcag atggcccgac ccaccaggtg ctgcaagtcg 5220aggcagacat tcacgggccg
ccctctgtat ctagctcatc ctggtccatt cctcatgcat 5280ccgactttga tgtggacagt
ttatccatac ttgacaccct ggagggagct agcgtgacca 5340gcggggcaac gtcagccgag
actaactctt acttcgcaaa gagtatggag tttctggcgc 5400gaccggtgcc tgcgcctcga
acagtattca ggaaccctcc acatcccgct ccgcgcacaa 5460gaacaccgtc acttgcaccc
agcagggcct gctcgagaac cagcctagtt tccaccccgc 5520caggcgtgaa tagggtgatc
actagagagg agctcgaggc gcttaccccg tcacgcactc 5580ctagcaggtc ggtctcgaga
accagcctgg tctccaaccc gccaggcgta aatagggtga 5640ttacaagaga ggagtttgag
gcgttcgtag cacaacaaca atgacggttt gatgcgggtg 5700catacatctt ttcctccgac
accggtcaag ggcatttaca acaaaaatca gtaaggcaaa 5760cggtgctatc cgaagtggtg
ttggagagga ccgaattgga gatttcgtat gccccgcgcc 5820tcgaccaaga aaaagaagaa
ttactacgca agaaattaca gttaaatccc acacctgcta 5880acagaagcag ataccagtcc
aggaaggtgg agaacatgaa agccataaca gctagacgta 5940ttctgcaagg cctagggcat
tatttgaagg cagaaggaaa agtggagtgc taccgaaccc 6000tgcatcctgt tcctttgtat
tcatctagtg tgaaccgtgc cttttcaagc cccaaggtcg 6060cagtggaagc ctgtaacgcc
atgttgaaag agaactttcc gactgtggct tcttactgta 6120ttattccaga gtacgatgcc
tatttggaca tggttgacgg agcttcatgc tgcttagaca 6180ctgccagttt ttgccctgca
aagctgcgca gctttccaaa gaaacactcc tatttggaac 6240ccacaatacg atcggcagtg
ccttcagcga tccagaacac gctccagaac gtcctggcag 6300ctgccacaaa aagaaattgc
aatgtcacgc aaatgagaga attgcccgta ttggattcgg 6360cggcctttaa tgtggaatgc
ttcaagaaat atgcgtgtaa taatgaatat tgggaaacgt 6420ttaaagaaaa ccccatcagg
cttactgaag aaaacgtggt aaattacatt accaaattaa 6480aaggaccaaa agctgctgct
ctttttgcga agacacataa tttgaatatg ttgcaggaca 6540taccaatgga caggtttgta
atggacttaa agagagacgt gaaagtgact ccaggaacaa 6600aacatactga agaacggccc
aaggtacagg tgatccaggc tgccgatccg ctagcaacag 6660cgtatctgtg cggaatccac
cgagagctgg ttaggagatt aaatgcggtc ctgcttccga 6720acattcatac actgtttgat
atgtcggctg aagactttga cgctattata gccgagcact 6780tccagcctgg ggattgtgtt
ctggaaactg acatcgcgtc gtttgataaa agtgaggacg 6840acgccatggc tctgaccgcg
ttaatgattc tggaagactt aggtgtggac gcagagctgt 6900tgacgctgat tgaggcggct
ttcggcgaaa tttcatcaat acatttgccc actaaaacta 6960aatttaaatt cggagccatg
atgaaatctg gaatgttcct cacactgttt gtgaacacag 7020tcattaacat tgtaatcgca
agcagagtgt tgagagaacg gctaaccgga tcaccatgtg 7080cagcattcat tggagatgac
aatatcgtga aaggagtcaa atcggacaaa ttaatggcag 7140acaggtgcgc cacctggttg
aatatggaag tcaagattat agatgctgtg gtgggcgaga 7200aagcgcctta tttctgtgga
gggtttattt tgtgtgactc cgtgaccggc acagcgtgcc 7260gtgtggcaga ccccctaaaa
aggctgttta agcttggcaa acctctggca gcagacgatg 7320aacatgatga tgacaggaga
agggcattgc atgaagagtc aacacgctgg aaccgagtgg 7380gtattctttc agagctgtgc
aaggcagtag aatcaaggta tgaaaccgta ggaacttcca 7440tcatagttat ggccatgact
actctagcta gcagtgttaa atcattcagc tacctgagag 7500gggcccctat aactctctac
ggctaacctg aatggactac gacatagtct agtcgacgcc 7560accatgaggc ctggcctgcc
ctcctacctg atcatcctgg ccgtgtgcct gttcagccac 7620ctgctgtcca gcagatacgg
cgccgaggcc gtgagcgagc ccctggacaa ggctttccac 7680ctgctgctga acacctacgg
cagacccatc cggtttctgc gggagaacac cacccagtgc 7740acctacaaca gcagcctgcg
gaacagcacc gtcgtgagag agaacgccat cagcttcaac 7800tttttccaga gctacaacca
gtactacgtg ttccacatgc ccagatgcct gtttgccggc 7860cctctggccg agcagttcct
gaaccaggtg gacctgaccg agacactgga aagataccag 7920cagcggctga atacctacgc
cctggtgtcc aaggacctgg ccagctaccg gtcctttagc 7980cagcagctca aggctcagga
tagcctcggc gagcagccta ccaccgtgcc ccctcccatc 8040gacctgagca tcccccacgt
gtggatgcct ccccagacca cccctcacgg ctggaccgag 8100agccacacca cctccggcct
gcacagaccc cacttcaacc agacctgcat cctgttcgac 8160ggccacgacc tgctgtttag
caccgtgacc ccctgcctgc accagggctt ctacctgatc 8220gacgagctga gatacgtgaa
gatcaccctg accgaggatt tcttcgtggt caccgtgtcc 8280atcgacgacg acacccccat
gctgctgatc ttcggccacc tgcccagagt gctgttcaag 8340gccccctacc agcgggacaa
cttcatcctg cggcagaccg agaagcacga gctgctggtg 8400ctggtcaaga aggaccagct
gaaccggcac tcctacctga aggaccccga cttcctggac 8460gccgccctgg acttcaacta
cctggacctg agcgccctgc tgagaaacag cttccacaga 8520tacgccgtgg acgtgctgaa
gtccggacgg tgccagatgc tcgatcggcg gaccgtggag 8580atggccttcg cctatgccct
cgccctgttc gccgctgcca gacaggaaga ggctggcgcc 8640caggtgtcag tgcccagagc
cctggataga caggccgccc tgctgcagat ccaggaattc 8700atgatcacct gcctgagcca
gaccccccct agaaccaccc tgctgctgta ccccacagcc 8760gtggatctgg ccaagagggc
cctgtggacc cccaaccaga tcaccgacat cacaagcctc 8820gtgcggctcg tgtacatcct
gagcaagcag aaccagcagc acctgatccc ccagtgggcc 8880ctgagacaga tcgccgactt
cgccctgaag ctgcacaaga cccatctggc cagctttctg 8940agcgccttcg ccaggcagga
actgtacctg atgggcagcc tggtccacag catgctggtg 9000cataccaccg agcggcggga
gatcttcatc gtggagacag gcctgtgtag cctggccgag 9060ctgtcccact ttacccagct
gctggcccac cctcaccacg agtacctgag cgacctgtac 9120accccctgca gcagcagcgg
cagacgggac cacagcctgg aacggctgac cagactgttc 9180cccgatgcca ccgtgcctgc
tacagtgcct gccgccctgt ccatcctgtc caccatgcag 9240cccagcaccc tggaaacctt
ccccgacctg ttctgcctgc ccctgggcga gagctttagc 9300gccctgaccg tgtccgagca
cgtgtcctac atcgtgacca atcagtacct gatcaagggc 9360atcagctacc ccgtgtccac
cacagtcgtg ggccagagcc tgatcatcac ccagaccgac 9420agccagacca agtgcgagct
gacccggaac atgcacacca cacacagcat caccgtggcc 9480ctgaacatca gcctggaaaa
ctgcgctttc tgtcagtctg ccctgctgga atacgacgat 9540acccagggcg tgatcaacat
catgtacatg cacgacagcg acgacgtgct gttcgccctg 9600gacccctaca acgaggtggt
ggtgtccagc ccccggaccc actacctgat gctgctgaag 9660aacggcaccg tgctggaagt
gaccgacgtg gtggtggacg ccaccgactg ataatctaga 9720cggcgcgccc acccagcggc
cgcctataac tctctacggc taacctgaat ggactacgac 9780atagtctagt cgacgccacc
atgtgcagaa ggcccgactg cggcttcagc ttcagccctg 9840gacccgtgat cctgctgtgg
tgctgcctgc tgctgcctat cgtgtcctct gccgccgtgt 9900ctgtggcccc tacagccgcc
gagaaggtgc cagccgagtg ccccgagctg accagaagat 9960gcctgctggg cgaggtgttc
gagggcgaca agtacgagag ctggctgcgg cccctggtca 10020acgtgaccgg cagagatggc
cccctgagcc agctgatccg gtacagaccc gtgacccccg 10080aggccgccaa tagcgtgctg
ctggacgagg ccttcctgga taccctggcc ctgctgtaca 10140acaaccccga ccagctgaga
gccctgctga ccctgctgtc cagcgacacc gcccccagat 10200ggatgaccgt gatgcggggc
tacagcgagt gtggagatgg cagccctgcc gtgtacacct 10260gcgtggacga cctgtgcaga
ggctacgacc tgaccagact gagctacggc cggtccatct 10320tcacagagca cgtgctgggc
ttcgagctgg tgccccccag cctgttcaac gtggtggtgg 10380ccatccggaa cgaggccacc
agaaccaaca gagccgtgcg gctgcctgtg tctacagccg 10440ctgcacctga gggcatcaca
ctgttctacg gcctgtacaa cgccgtgaaa gagttctgcc 10500tccggcacca gctggatccc
cccctgctga gacacctgga caagtactac gccggcctgc 10560ccccagagct gaagcagacc
agagtgaacc tgcccgccca cagcagatat ggccctcagg 10620ccgtggacgc cagatgataa
tctagacggc gcgcccaccc acctgcagga tacagcagca 10680attggcaagc tgcttacata
gaactcgcgg cgattggcat gccgccttaa aatttttatt 10740ttatttttct tttcttttcc
gaatcggatt ttgtttttaa tatttcaaaa aaaaaaaaaa 10800aaaaaaaaaa aaaaaaaaaa
agggtcggca tggcatctcc acctcctcgc ggtccgacct 10860gggcatccga aggaggacgc
acgtccactc ggatggctaa gggagagcca cgtttaaacg 10920ctagagcaag acgtttcccg
ttgaatatgg ctcataacac cccttgtatt actgtttatg 10980taagcagaca gttttattgt
tcatgatgat atatttttat cttgtgcaat gtaacatcag 11040agattttgag acacaacgtg
gctttgttga ataaatcgaa cttttgctga gttgaaggat 11100cagatcacgc atcttcccga
caacgcagac cgttccgtgg caaagcaaaa gttcaaaatc 11160accaactggt ccacctacaa
caaagctctc atcaaccgtg gctccctcac tttctggctg 11220gatgatgggg cgattcaggc
ctggtatgag tcagcaacac cttcttcacg aggcagacct 11280cagcgctagc ggagtgtata
ctggcttact atgttggcac tgatgagggt gtcagtgaag 11340tgcttcatgt ggcaggagaa
aaaaggctgc accggtgcgt cagcagaata tgtgatacag 11400gatatattcc gcttcctcgc
tcactgactc gctacgctcg gtcgttcgac tgcggcgagc 11460ggaaatggct tacgaacggg
gcggagattt cctggaagat gccaggaaga tacttaacag 11520ggaagtgaga gggccgcggc
aaagccgttt ttccataggc tccgcccccc tgacaagcat 11580cacgaaatct gacgctcaaa
tcagtggtgg cgaaacccga caggactata aagataccag 11640gcgtttcccc tggcggctcc
ctcgtgcgct ctcctgttcc tgcctttcgg tttaccggtg 11700tcattccgct gttatggccg
cgtttgtctc attccacgcc tgacactcag ttccgggtag 11760gcagttcgct ccaagctgga
ctgtatgcac gaaccccccg ttcagtccga ccgctgcgcc 11820ttatccggta actatcgtct
tgagtccaac ccggaaagac atgcaaaagc accactggca 11880gcagccactg gtaattgatt
tagaggagtt agtcttgaag tcatgcgccg gttaaggcta 11940aactgaaagg acaagttttg
gtgactgcgc tcctccaagc cagttacctc ggttcaaaga 12000gttggtagct cagagaacct
tcgaaaaacc gccctgcaag gcggtttttt cgttttcaga 12060gcaagagatt acgcgcagac
caaaacgatc tcaagaagat catcttatta aggggtctga 12120cgctcagtgg aacgaaaact
cacgttaagg gattttggtc atgagattat caaaaaggat 12180cttcacctag atccttttaa
attaaaaatg aagttttaaa tcaatctaaa gtatatatga 12240gtaaacttgg tctgacagtt
accaatgctt aatcagtgag gcacctatct cagcgatctg 12300tctatttcgt tcatccatag
ttgcctgact ccccgtcgtg tagataacta cgatacggga 12360gggcttacca tctggcccca
gtgctgcaat gataccgcga gacccacgct caccggctcc 12420agatttatca gcaataaacc
agccagccgg aagggccgag cgcagaagtg gtcctgcaac 12480tttatccgcc tccatccagt
ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 12540agttaatagt ttgcgcaacg
ttgttgccat tgctgcaggc atcgtggtgt cacgctcgtc 12600gtttggtatg gcttcattca
gctccggttc ccaacgatca aggcgagtta catgatcccc 12660catgttgtgc aaaaaagcgg
ttagctcctt cggtcctccg atcgttgtca gaagtaagtt 12720ggccgcagtg ttatcactca
tggttatggc agcactgcat aattctctta ctgtcatgcc 12780atccgtaaga tgcttttctg
tgactggtga gtactcaacc aagtcattct gagaatagtg 12840tatgcggcga ccgagttgct
cttgcccggc gtcaacacgg gataataccg cgccacatag 12900cagaacttta aaagtgctca
tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 12960cttaccgctg ttgagatcca
gttcgatgta acccactcgt gcacccaact gatcttcagc 13020atcttttact ttcaccagcg
tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 13080aaagggaata agggcgacac
ggaaatgttg aatactcata ctcttccttt ttcaatatta 13140ttgaagcatt tatcagggtt
attgtctcat gagcggatac atatttgaat gtatttagaa 13200aaataaacaa ataggggttc
cgcgcacatt tccccgaaaa gtgccacctg acgtgtcgag 13260acgcgtaata cgactcacta
tag 13283
85 13463 DNA Artificial Sequence
source /note="Description of Artificial Sequence Synthetic polynucleotide"
85
ataggcggcg catgagagaa gcccagacca attacctacc caaaatggag
aaagttcacg 60ttgacatcga ggaagacagc ccattcctca gagctttgca gcggagcttc
ccgcagtttg 120aggtagaagc caagcaggtc actgataatg accatgctaa tgccagagcg
ttttcgcatc 180tggcttcaaa actgatcgaa acggaggtgg acccatccga cacgatcctt
gacattggaa 240gtgcgcccgc ccgcagaatg tattctaagc acaagtatca ttgtatctgt
ccgatgagat 300gtgcggaaga tccggacaga ttgtataagt atgcaactaa gctgaagaaa
aactgtaagg 360aaataactga taaggaattg gacaagaaaa tgaaggagct cgccgccgtc
atgagcgacc 420ctgacctgga aactgagact atgtgcctcc acgacgacga gtcgtgtcgc
tacgaagggc 480aagtcgctgt ttaccaggat gtatacgcgg ttgacggacc gacaagtctc
tatcaccaag 540ccaataaggg agttagagtc gcctactgga taggctttga caccacccct
tttatgttta 600agaacttggc tggagcatat ccatcatact ctaccaactg ggccgacgaa
accgtgttaa 660cggctcgtaa cataggccta tgcagctctg acgttatgga gcggtcacgt
agagggatgt 720ccattcttag aaagaagtat ttgaaaccat ccaacaatgt tctattctct
gttggctcga 780ccatctacca cgagaagagg gacttactga ggagctggca cctgccgtct
gtatttcact 840tacgtggcaa gcaaaattac acatgtcggt gtgagactat agttagttgc
gacgggtacg 900tcgttaaaag aatagctatc agtccaggcc tgtatgggaa gccttcaggc
tatgctgcta 960cgatgcaccg cgagggattc ttgtgctgca aagtgacaga cacattgaac
ggggagaggg 1020tctcttttcc cgtgtgcacg tatgtgccag ctacattgtg tgaccaaatg
actggcatac 1080tggcaacaga tgtcagtgcg gacgacgcgc aaaaactgct ggttgggctc
aaccagcgta 1140tagtcgtcaa cggtcgcacc cagagaaaca ccaataccat gaaaaattac
cttttgcccg 1200tagtggccca ggcatttgct aggtgggcaa aggaatataa ggaagatcaa
gaagatgaaa 1260ggccactagg actacgagat agacagttag tcatggggtg ttgttgggct
tttagaaggc 1320acaagataac atctatttat aagcgcccgg atacccaaac catcatcaaa
gtgaacagcg 1380atttccactc attcgtgctg cccaggatag gcagtaacac attggagatc
gggctgagaa 1440caagaatcag gaaaatgtta gaggagcaca aggagccgtc acctctcatt
accgccgagg 1500acgtacaaga agctaagtgc gcagccgatg aggctaagga ggtgcgtgaa
gccgaggagt 1560tgcgcgcagc tctaccacct ttggcagctg atgttgagga gcccactctg
gaagccgatg 1620tagacttgat gttacaagag gctggggccg gctcagtgga gacacctcgt
ggcttgataa 1680aggttaccag ctacgatggc gaggacaaga tcggctctta cgctgtgctt
tctccgcagg 1740ctgtactcaa gagtgaaaaa ttatcttgca tccaccctct cgctgaacaa
gtcatagtga 1800taacacactc tggccgaaaa gggcgttatg ccgtggaacc ataccatggt
aaagtagtgg 1860tgccagaggg acatgcaata cccgtccagg actttcaagc tctgagtgaa
agtgccacca 1920ttgtgtacaa cgaacgtgag ttcgtaaaca ggtacctgca ccatattgcc
acacatggag 1980gagcgctgaa cactgatgaa gaatattaca aaactgtcaa gcccagcgag
cacgacggcg 2040aatacctgta cgacatcgac aggaaacagt gcgtcaagaa agaactagtc
actgggctag 2100ggctcacagg cgagctggtg gatcctccct tccatgaatt cgcctacgag
agtctgagaa 2160cacgaccagc cgctccttac caagtaccaa ccataggggt gtatggcgtg
ccaggatcag 2220gcaagtctgg catcattaaa agcgcagtca ccaaaaaaga tctagtggtg
agcgccaaga 2280aagaaaactg tgcagaaatt ataagggacg tcaagaaaat gaaagggctg
gacgtcaatg 2340ccagaactgt ggactcagtg ctcttgaatg gatgcaaaca ccccgtagag
accctgtata 2400ttgacgaagc ttttgcttgt catgcaggta ctctcagagc gctcatagcc
attataagac 2460ctaaaaaggc agtgctctgc ggggatccca aacagtgcgg tttttttaac
atgatgtgcc 2520tgaaagtgca ttttaaccac gagatttgca cacaagtctt ccacaaaagc
atctctcgcc 2580gttgcactaa atctgtgact tcggtcgtct caaccttgtt ttacgacaaa
aaaatgagaa 2640cgacgaatcc gaaagagact aagattgtga ttgacactac cggcagtacc
aaacctaagc 2700aggacgatct cattctcact tgtttcagag ggtgggtgaa gcagttgcaa
atagattaca 2760aaggcaacga aataatgacg gcagctgcct ctcaagggct gacccgtaaa
ggtgtgtatg 2820ccgttcggta caaggtgaat gaaaatcctc tgtacgcacc cacctcagaa
catgtgaacg 2880tcctactgac ccgcacggag gaccgcatcg tgtggaaaac actagccggc
gacccatgga 2940taaaaacact gactgccaag taccctggga atttcactgc cacgatagag
gagtggcaag 3000cagagcatga tgccatcatg aggcacatct tggagagacc ggaccctacc
gacgtcttcc 3060agaataaggc aaacgtgtgt tgggccaagg ctttagtgcc ggtgctgaag
accgctggca 3120tagacatgac cactgaacaa tggaacactg tggattattt tgaaacggac
aaagctcact 3180cagcagagat agtattgaac caactatgcg tgaggttctt tggactcgat
ctggactccg 3240gtctattttc tgcacccact gttccgttat ccattaggaa taatcactgg
gataactccc 3300cgtcgcctaa catgtacggg ctgaataaag aagtggtccg tcagctctct
cgcaggtacc 3360cacaactgcc tcgggcagtt gccactggaa gagtctatga catgaacact
ggtacactgc 3420gcaattatga tccgcgcata aacctagtac ctgtaaacag aagactgcct
catgctttag 3480tcctccacca taatgaacac ccacagagtg acttttcttc attcgtcagc
aaattgaagg 3540gcagaactgt cctggtggtc ggggaaaagt tgtccgtccc aggcaaaatg
gttgactggt 3600tgtcagaccg gcctgaggct accttcagag ctcggctgga tttaggcatc
ccaggtgatg 3660tgcccaaata tgacataata tttgttaatg tgaggacccc atataaatac
catcactatc 3720agcagtgtga agaccatgcc attaagctta gcatgttgac caagaaagct
tgtctgcatc 3780tgaatcccgg cggaacctgt gtcagcatag gttatggtta cgctgacagg
gccagcgaaa 3840gcatcattgg tgctatagcg cggcagttca agttttcccg ggtatgcaaa
ccgaaatcct 3900cacttgaaga gacggaagtt ctgtttgtat tcattgggta cgatcgcaag
gcccgtacgc 3960acaatcctta caagctttca tcaaccttga ccaacattta tacaggttcc
agactccacg 4020aagccggatg tgcaccctca tatcatgtgg tgcgagggga tattgccacg
gccaccgaag 4080gagtgattat aaatgctgct aacagcaaag gacaacctgg cggaggggtg
tgcggagcgc 4140tgtataagaa attcccggaa agcttcgatt tacagccgat cgaagtagga
aaagcgcgac 4200tggtcaaagg tgcagctaaa catatcattc atgccgtagg accaaacttc
aacaaagttt 4260cggaggttga aggtgacaaa cagttggcag aggcttatga gtccatcgct
aagattgtca 4320acgataacaa ttacaagtca gtagcgattc cactgttgtc caccggcatc
ttttccggga 4380acaaagatcg actaacccaa tcattgaacc atttgctgac agctttagac
accactgatg 4440cagatgtagc catatactgc agggacaaga aatgggaaat gactctcaag
gaagcagtgg 4500ctaggagaga agcagtggag gagatatgca tatccgacga ctcttcagtg
acagaacctg 4560atgcagagct ggtgagggtg catccgaaga gttctttggc tggaaggaag
ggctacagca 4620caagcgatgg caaaactttc tcatatttgg aagggaccaa gtttcaccag
gcggccaagg 4680atatagcaga aattaatgcc atgtggcccg ttgcaacgga ggccaatgag
caggtatgca 4740tgtatatcct cggagaaagc atgagcagta ttaggtcgaa atgccccgtc
gaagagtcgg 4800aagcctccac accacctagc acgctgcctt gcttgtgcat ccatgccatg
actccagaaa 4860gagtacagcg cctaaaagcc tcacgtccag aacaaattac tgtgtgctca
tcctttccat 4920tgccgaagta tagaatcact ggtgtgcaga agatccaatg ctcccagcct
atattgttct 4980caccgaaagt gcctgcgtat attcatccaa ggaagtatct cgtggaaaca
ccaccggtag 5040acgagactcc ggagccatcg gcagagaacc aatccacaga ggggacacct
gaacaaccac 5100cacttataac cgaggatgag accaggacta gaacgcctga gccgatcatc
atcgaagagg 5160aagaagagga tagcataagt ttgctgtcag atggcccgac ccaccaggtg
ctgcaagtcg 5220aggcagacat tcacgggccg ccctctgtat ctagctcatc ctggtccatt
cctcatgcat 5280ccgactttga tgtggacagt ttatccatac ttgacaccct ggagggagct
agcgtgacca 5340gcggggcaac gtcagccgag actaactctt acttcgcaaa gagtatggag
tttctggcgc 5400gaccggtgcc tgcgcctcga acagtattca ggaaccctcc acatcccgct
ccgcgcacaa 5460gaacaccgtc acttgcaccc agcagggcct gctcgagaac cagcctagtt
tccaccccgc 5520caggcgtgaa tagggtgatc actagagagg agctcgaggc gcttaccccg
tcacgcactc 5580ctagcaggtc ggtctcgaga accagcctgg tctccaaccc gccaggcgta
aatagggtga 5640ttacaagaga ggagtttgag gcgttcgtag cacaacaaca atgacggttt
gatgcgggtg 5700catacatctt ttcctccgac accggtcaag ggcatttaca acaaaaatca
gtaaggcaaa 5760cggtgctatc cgaagtggtg ttggagagga ccgaattgga gatttcgtat
gccccgcgcc 5820tcgaccaaga aaaagaagaa ttactacgca agaaattaca gttaaatccc
acacctgcta 5880acagaagcag ataccagtcc aggaaggtgg agaacatgaa agccataaca
gctagacgta 5940ttctgcaagg cctagggcat tatttgaagg cagaaggaaa agtggagtgc
taccgaaccc 6000tgcatcctgt tcctttgtat tcatctagtg tgaaccgtgc cttttcaagc
cccaaggtcg 6060cagtggaagc ctgtaacgcc atgttgaaag agaactttcc gactgtggct
tcttactgta 6120ttattccaga gtacgatgcc tatttggaca tggttgacgg agcttcatgc
tgcttagaca 6180ctgccagttt ttgccctgca aagctgcgca gctttccaaa gaaacactcc
tatttggaac 6240ccacaatacg atcggcagtg ccttcagcga tccagaacac gctccagaac
gtcctggcag 6300ctgccacaaa aagaaattgc aatgtcacgc aaatgagaga attgcccgta
ttggattcgg 6360cggcctttaa tgtggaatgc ttcaagaaat atgcgtgtaa taatgaatat
tgggaaacgt 6420ttaaagaaaa ccccatcagg cttactgaag aaaacgtggt aaattacatt
accaaattaa 6480aaggaccaaa agctgctgct ctttttgcga agacacataa tttgaatatg
ttgcaggaca 6540taccaatgga caggtttgta atggacttaa agagagacgt gaaagtgact
ccaggaacaa 6600aacatactga agaacggccc aaggtacagg tgatccaggc tgccgatccg
ctagcaacag 6660cgtatctgtg cggaatccac cgagagctgg ttaggagatt aaatgcggtc
ctgcttccga 6720acattcatac actgtttgat atgtcggctg aagactttga cgctattata
gccgagcact 6780tccagcctgg ggattgtgtt ctggaaactg acatcgcgtc gtttgataaa
agtgaggacg 6840acgccatggc tctgaccgcg ttaatgattc tggaagactt aggtgtggac
gcagagctgt 6900tgacgctgat tgaggcggct ttcggcgaaa tttcatcaat acatttgccc
actaaaacta 6960aatttaaatt cggagccatg atgaaatctg gaatgttcct cacactgttt
gtgaacacag 7020tcattaacat tgtaatcgca agcagagtgt tgagagaacg gctaaccgga
tcaccatgtg 7080cagcattcat tggagatgac aatatcgtga aaggagtcaa atcggacaaa
ttaatggcag 7140acaggtgcgc cacctggttg aatatggaag tcaagattat agatgctgtg
gtgggcgaga 7200aagcgcctta tttctgtgga gggtttattt tgtgtgactc cgtgaccggc
acagcgtgcc 7260gtgtggcaga ccccctaaaa aggctgttta agcttggcaa acctctggca
gcagacgatg 7320aacatgatga tgacaggaga agggcattgc atgaagagtc aacacgctgg
aaccgagtgg 7380gtattctttc agagctgtgc aaggcagtag aatcaaggta tgaaaccgta
ggaacttcca 7440tcatagttat ggccatgact actctagcta gcagtgttaa atcattcagc
tacctgagag 7500gggcccctat aactctctac ggctaacctg aatggactac gacatagtct
agtcgacgcc 7560accatggaaa gccggatctg gtgcctggtc gtgtgcgtga acctgtgcat
cgtgtgcctg 7620ggagccgccg tgagcagcag cagcaccaga ggcaccagcg ccacacacag
ccaccacagc 7680agccacacca cctctgccgc ccacagcaga tccggcagcg tgtcccagag
agtgaccagc 7740agccagaccg tgtcccacgg cgtgaacgag acaatctaca acaccaccct
gaagtacggc 7800gacgtcgtgg gcgtgaatac caccaagtac ccctacagag tgtgcagcat
ggcccagggc 7860accgacctga tcagattcga gcggaacatc gtgtgcacca gcatgaagcc
catcaacgag 7920gacctggacg agggcatcat ggtggtgtac aagagaaaca tcgtggccca
caccttcaaa 7980gtgcgggtgt accagaaggt gctgaccttc cggcggagct acgcctacat
ccacaccaca 8040tacctgctgg gcagcaacac cgagtacgtg gcccctccca tgtgggagat
ccaccacatc 8100aacagccaca gccagtgcta cagcagctac agccgcgtga tcgccggcac
agtgttcgtg 8160gcctaccacc gggacagcta cgagaacaag accatgcagc tgatgcccga
cgactacagc 8220aacacccaca gcaccagata cgtgaccgtg aaggaccagt ggcacagcag
aggcagcacc 8280tggctgtacc gggagacatg caacctgaac tgcatggtca ccatcaccac
cgccagaagc 8340aagtaccctt accacttctt cgccacctcc accggcgacg tggtggacat
cagccccttc 8400tacaacggca ccaaccggaa cgccagctac ttcggcgaga acgccgacaa
gttcttcatc 8460ttccccaact acaccatcgt gtccgacttc ggcagaccca acagcgctct
ggaaacccac 8520agactggtgg cctttctgga acgggccgac agcgtgatca gctgggacat
ccaggacgag 8580aagaacgtga cctgccagct gaccttctgg gaggcctctg agagaaccat
cagaagcgag 8640gccgaggaca gctaccactt cagcagcgcc aagatgaccg ccaccttcct
gagcaagaaa 8700caggaagtga acatgagcga ctccgccctg gactgcgtga gggacgaggc
catcaacaag 8760ctgcagcaga tcttcaacac cagctacaac cagacctacg agaagtatgg
caatgtgtcc 8820gtgttcgaga caacaggcgg cctggtggtg ttctggcagg gcatcaagca
gaaaagcctg 8880gtggagctgg aacggctcgc caaccggtcc agcctgaacc tgacccacaa
ccggaccaag 8940cggagcaccg acggcaacaa cgcaacccac ctgtccaaca tggaaagcgt
gcacaacctg 9000gtgtacgcac agctgcagtt cacctacgac accctgcggg gctacatcaa
cagagccctg 9060gcccagatcg ccgaggcttg gtgcgtggac cagcggcgga ccctggaagt
gttcaaagag 9120ctgtccaaga tcaaccccag cgccatcctg agcgccatct acaacaagcc
tatcgccgcc 9180agattcatgg gcgacgtgct gggcctggcc agctgcgtga ccatcaacca
gaccagcgtg 9240aaggtgctgc gggacatgaa cgtgaaagag agcccaggcc gctgctactc
cagacccgtg 9300gtcatcttca acttcgccaa cagctcctac gtgcagtacg gccagctggg
cgaggacaac 9360gagatcctgc tggggaacca ccggaccgag gaatgccagc tgcccagcct
gaagatcttt 9420atcgccggca acagcgccta cgagtatgtg gactacctgt tcaagcggat
gatcgacctg 9480agcagcatct ccaccgtgga cagcatgatc gccctggaca tcgaccccct
ggaaaacacc 9540gacttccggg tgctggaact gtacagccag aaagagctgc ggagcagcaa
cgtgttcgac 9600ctggaagaga tcatgcggga gttcaacagc tacaagcagc gcgtgaaata
cgtggaggac 9660aaggtggtgg accccctgcc tccttacctg aagggcctgg acgacctgat
gagcggactg 9720ggcgctgccg gaaaagccgt gggagtggcc attggagctg tgggcggagc
tgtggcctct 9780gtcgtggaag gcgtcgccac ctttctgaag aaccccttcg gcgccttcac
catcatcctg 9840gtggccattg ccgtcgtgat catcacctac ctgatctaca cccggcagcg
gagactgtgt 9900acccagcccc tgcagaacct gttcccctac ctggtgtccg ccgatggcac
cacagtgacc 9960agcggctcca ccaaggatac cagcctgcag gccccaccca gctacgaaga
gagcgtgtac 10020aacagcggca gaaagggccc tggccctccc agctctgatg ccagcacagc
cgcccctccc 10080tacaccaacg agcaggccta ccagatgctg ctggccctgg ctagactgga
tgccgagcag 10140agggcccagc agaacggcac cgacagcctg gatggcagaa ccggcaccca
ggacaagggc 10200cagaagccca acctgctgga ccggctgcgg caccggaaga acggctaccg
gcacctgaag 10260gacagcgacg aggaagagaa cgtctgataa tctagacggc gcgcccaccc
agcggccgca 10320tacagcagca attggcaagc tgcttacata gaactcgcgg cgattggcat
gccgccttaa 10380aatttttatt ttatttttct tttcttttcc gaatcggatt ttgtttttaa
tatttcaaaa 10440aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa agggtcggca tggcatctcc
acctcctcgc 10500ggtccgacct gggcatccga aggaggacgc acgtccactc ggatggctaa
gggagagcca 10560cgtttaaacc agctccaatt cgccctatag tgagtcgtat tacgcgcgct
cactggccgt 10620cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc
gccttgcagc 10680acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc
gcccttccca 10740acagttgcgc agcctgaatg gcgaatggga cgcgccctgt agcggcgcat
taagcgcggc 10800gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag
cgcccgctcc 10860tttcgctttc ttcccttcct ttctcgccac gttcgccggc tttccccgtc
aagctctaaa 10920tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc
ccaaaaaact 10980tgattagggt gatggttcac gtagtgggcc atcgccctga tagacggttt
ttcgcccttt 11040gacgttggag tccacgttct ttaatagtgg actcttgttc caaactggaa
caacactcaa 11100ccctatctcg gtctattctt ttgatttata agggattttg ccgatttcgg
cctattggtt 11160aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat
taacgcttac 11220aatttaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt
atttttctaa 11280atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct
tcaataatat 11340tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc
cttttttgcg 11400gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa
agatgctgaa 11460gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg
taagatcctt 11520gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt
tctgctatgt 11580ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg
catacactat 11640tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac
ggatggcatg 11700acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc
ggccaactta 11760cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa
catgggggat 11820catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc
aaacgacgag 11880cgtgacacca cgatgcctgt agcaatggca acaacgttgc gcaaactatt
aactggcgaa 11940ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga
taaagttgca 12000ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa
atctggagcc 12060ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa
gccctcccgt 12120atcgtagtta tctacacgac ggggagtcag gcaactatgg atgaacgaaa
tagacagatc 12180gctgagatag gtgcctcact gattaagcat tggtaactgt cagaccaagt
ttactcatat 12240atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt
gaagatcctt 12300tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg
agcgtcagac 12360cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt
aatctgctgc 12420ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca
agagctacca 12480actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac
tgttcttcta 12540gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac
atacctcgct 12600ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct
taccgggttg 12660gactcaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg
gggttcgtgc 12720acacagccca gcttggagcg aacgacctac accgaactga gatacctaca
gcgtgagcta 12780tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt
aagcggcagg 12840gtcggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta
tctttatagt 12900cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc
gtcagggggg 12960cggagcctat ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc
cttttgctgg 13020ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa
ccgtattacc 13080gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag
cgagtcagtg 13140agcgaggaag cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg
ttggccgatt 13200cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga
gcgcaacgca 13260attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat
gctcccggct 13320cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag
ctatgaccat 13380gattacgcca agcgcgcaat taaccctcac taaagggaac aaaagctggg
taccgggccc 13440acgcgtaata cgactcacta tag
13463