Patent application title: Filovirus vectors and noninfectious filovirus-based particles
Inventors:
Yoshihiro Kawaoka (Madison, WI, US)
Luke D. Jasenosky (Madison, WI, US)
Gabriele Neumann (Madison, WI, US)
Gabriele Neumann (Madison, WI, US)
IPC8 Class: AC12N700FI
USPC Class:
4352351
Class name: Chemistry: molecular biology and microbiology virus or bacteriophage, except for viral vector or bacteriophage vector; composition thereof; preparation or purification thereof; production of viral subunits; media for propagating
Publication date: 2010-10-21
Patent application number: 20100267116
Claims:
1. A method to prepare filovirus, comprising: contacting a cell with a
vector comprising a promoter operably linked to a filovirus genomic cDNA
or a portion thereof linked to a transcription termination sequence, a
vector comprising a promoter operably linked to a DNA segment encoding a
filovirus RNA transcriptase-polymerase, a vector comprising a promoter
operably linked to a DNA segment encoding filovirus NP, a vector
comprising a promoter operably linked to a DNA segment encoding filovirus
VP30, and a vector comprising a promoter operably linked to a DNA segment
encoding filovirus VP35, so as to yield infectious filovirus,wherein the
portion of the cDNA, when transcribed, yields a RNA which is capable of
being packaged into filovirus virions or which is capable of being
replicated in the presence of filovirus proteins.
2. The method of claim 1 wherein the promoter in the vector comprising filovirus genomic cDNA is a RNA polymerase I promoter, RNA polymerase II promoter, RNA polymerase III promoter, T7 RNA polymerase promoter, or T3 RNA polymerase promoter.
3. The method of claim 1 further comprising a vector comprising a promoter operably linked to a DNA fragment of interest.
4. The method of claim 1 wherein the vector comprising the filovirus genomic cDNA further comprises a DNA fragment of interest within the genomic sequence.
5. The method of claim 1 wherein the promoter of the vector comprising the filovirus genomic cDNA is a T7 RNA polymerase promoter.
6. The method of claim 5 further comprising a vector comprising a promoter operably linked to a DNA segment encoding T7 RNA polymerase.
7. The method of claim 1 further comprising isolating the virus.
8. The method of claim 3 or 4 wherein the DNA fragment of interest encodes a detectable marker, a therapeutic protein or an immunogenic polypeptide or peptide of a pathogen or tumor antigen.
9. The method of claim 1 wherein the sequence of the genomic cDNA has one or more nucleotide deletions, insertions or substitutions relative to the sequence of a corresponding wild-type filovirus.
10. A composition comprising a plurality of filovirus vectors, comprising:a) a vector comprising a promoter operably linked to a filovirus genomic cDNA or a portion thereof linked to a transcription termination sequence, wherein the portion of the cDNA, when transcribed, yields a RNA which is capable of being packaged into filovirus virions or which is capable of being replicated in the presence of filovirus proteins; andb) a vector comprising a promoter operably linked to a DNA segment encoding a filovirus RNA transcriptase-polymerase, a vector comprising a promoter operably linked to a DNA segment encoding filovirus NP, a vector comprising a promoter operably linked to a DNA segment encoding filovirus VP30, and a vector comprising a promoter operably linked to a DNA segment encoding filovirus VP35.
11. The composition of claim 10 further comprising a vector comprising a promoter operably linked to a DNA fragment of interest.
12. The composition of claim 10 wherein the vector of a) further comprises a DNA fragment of interest in the same orientation as the genomic cDNA.
13. The composition of claim 11 or 12 wherein the DNA fragment of interest encodes an immunogenic polypeptide or peptide of a pathogen, a tumor antigen, or a therapeutic protein.
14. The composition of claim 10 wherein each vector of b) is on a separate plasmid.
15. The composition of claim 10 further comprising a vector comprising a promoter operably linked to a DNA segment encoding T7 RNA polymerase, wherein the promoter of the vector of a) is a T7 RNA polymerase promoter.
16. The composition of claim 10 wherein each of the vectors of b) further comprise a transcription termination sequence.
17. The composition of claim 10 wherein the cDNA is in the sense orientation.
18. The composition of claim 10 wherein the cDNA is in the antisense orientation.
19. A cell contacted with the composition of claim 10.
20. The cell of claim 19 which is a eukaryotic cell.
21. The cell of claim 19 which is an insect, yeast, or mammalian cell.
22. A vector encoding a mutant filovirus matrix protein having one or more amino acid deletions, insertions or substitutions relative to wild-type filovirus matrix protein, which mutant binds to lipid.
23. The vector of claim 22 which encodes a mutant filovirus matrix protein having residues corresponding to residues 1-276, 1-226, 50-276, 50-326 or 100-326 of the Ebola virus matrix protein.
24. The vector of claim 22 wherein the mutant filovirus matrix protein binds to the cell membrane of eukaryotic cells.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001]This application is a divisional of U.S. patent application Ser. No. 10/353,856, filed Jan. 29, 2003, which claims the benefit of the filing date of U.S. application Ser. No. 60/353,972, filed on Jan. 31, 2002, under 35 U.S.C. §119(e). Which applications are incorporated by reference herein.
BACKGROUND OF THE INVENTION
[0003]Ebola virus, a member of the family Filoviridae and the order Mononegavirales, is an enveloped, nonsegmented negative-strand RNA virus and is one of the most lethal human and nonhuman primate pathogens recognized to date (Feldmann et al., 1998; Vanderzanden et al., 1998). Four subtypes of Ebola virus have been identified, including Zaire, Sudan, Ivory Coast, and Reston (Sanchez et al., 1993). Human infection with subtype Zaire causes a fulminating, febrile, hemorrhagic disease that results in extensive mortality (Feldmann et al., 1993). Thus, Ebola virus infection presents a much-needed model to study virus-induced mechanisms leading to coagulation disorders and vascular instability. However, identification of major determinants of Ebola virus pathogenicity has been hampered by the lack of effective strategies for experimental mutagenesis.
[0004]Ebola virus particles have a filamentous appearance, but its shape may be branched, circular, U- or 6-shaped, or long and straight (Feldmann et al., 1996). Virions show a uniform diameter of approximately 80 nm, but vary greatly in length. Ebola virus particles consist of seven structural proteins. The glycoprotein (GP) of Ebola virus forms spikes of approximately 7 nm, which are spaced at 5- to 10-nm intervals on the virion surface (Feldmann et al., 1996 and Peters et al., 1995). Cleavage of the GP is thought to be an important determinant of viral pathogenicity (Volchkov et al., 1998; Sanchez et al., 1996; Takada et al., 1997; Volchkov et al., 1998a; Volchkov et al., 1998b; Wool-Lewis et al., 1998; Yang et al., 2000). The Ebola virus GP contains a highly conserved consensus motif for the subtilisin-like endoprotease furin, and previous studies demonstrated GP cleavage by this protease (Nina et al., 1991). Nonetheless, studies of murine leukemia virus (Wood-Lewis et al., 1999) or vesicular stomatitis virus (VSV) (Ito et al., 2001) pseudotyped with mutant Ebola virus GPs lacking the furin recognition motif at the cleavage site, showed that GP cleavage by furin was not essential for infectivity of the pseudotyped viruses. In many viruses, GP cleavage by furin and related endoproteases is essential for their infectivity. Thus, the significance of GP cleavage for the Ebola virus life cycle remains in question.
[0005]GP is the only transmembrane protein of Ebola virus, and is responsible for receptor binding and membrane fusion (Takada et al., 1997). Cells infected with recombinant vaccinia virus expressing the GP produced virosomes that varied in shape and diameter but uniformly possessed spike structures on their surface (Volchkov et al., 1998c), although the effects of over 80 vaccinia viral proteins (Moss, 1995) on the formation of particles are unknown. Similar virosomes are also released from Ebola virus-infected cells (Volchkov et al., 1998c). These findings suggest that the GP contributes not only to an early stage of the viral infection cycle but also to viral budding.
[0006]In addition, although recent studies have begun to address the immune response to viral infection (Baize et al., 1999; Basler et al., 2000; Vanderzanden et al., 1998; and Wilson et al., 2000), as well as the functions of the viral proteins involved in the replicative process (VP30, VP35, NP, L) (Basler et al., 2000 and Muhlberger et al., 1999) and GP, little is known about the functions of the viral proteins associated with the membrane, including viral protein 40 (VP40), which appears equivalent to matrix protein of other viruses.
[0007]The matrix proteins of many nonsegmented, negative-sense RNA viruses play a critical role in viral particle formation (virus assembly) and budding (Garoff et al., 1998). Expression of the matrix protein of VSV in insect and mammalian cells results in evagination of matrix protein-containing vesicles from the plasma membrane surface (Justice et al., 1995; and Li et al., 1993). Matrix proteins interact with membranes in a hydrophobic and/or electrostatic manner and electron micrographs of nonsegmented, negative-sense RNA viruses have demonstrated that the matrix protein forms a layer associated with the inner leaflet of the lipid bilayer (Garoff et al., 1998).
[0008]VP40 is the most abundant protein in virions (it represents 38% of the protein in the viral particle) and is located beneath the viral membrane, where is presumably maintains the structural integrity of the particle (Feldmann et al., 1996). VP40 is encoded by the third gene in the linear 3'-5' RNA genome of Ebola virus and is 326 amino acids in length, which includes a number of hydrophobic regions (Elliott et al., 1985 and Sanchez et al., 1996). VP40 contains a PPXY motif (X denotes any amino acid) at amino acids 10-13 (Harty et al., 1996) that is also present at amino acids 16-19 in Marburg virus, strain Popp (Sanchez et al., 1993). This motif has been shown to play an important role in the budding of rabies virus and VSV: when either of the prolines or the tyrosine of this motif is altered in the matrix proteins of these viruses, viral budding is markedly reduced by comparison to findings with wild-type virus (Harty et al., 1996). Mutation of the PPXY motif in the matrix protein of VSV appears to reduce virus yield by pre-empting budding of assembled virions at the plasma membrane (Jayakar et al., 2000). This motif interacts with the WW domain found in many cellular regulatory and signal transduction proteins (Bork et al., 1994 and Chen et al., 1995) and interactions between one or more cellular proteins and the matrix proteins of these viruses are thought to be crucial for efficient virus release from cells (Harty et al., 1999).
[0009]The matrix proteins of many enveloped viruses are thought to interact with the cytoplasmic tails of viral glycoproteins. Such interaction is believed to be important for virus assembly. In influenza viruses, the removal of the cytoplasmic tail of the hemagglutinin or neuraminidase glycoprotein alters virion morphology (Jin et al., 1997; Mitnaul et al., 1996). Although not essential for normal particle formation in rabies virus and VSV, glycoproteins enhance the efficiency of particle formation (Mebatsion et al., 1996; Mebatsion et al., 1999; Schnell et al., 1998).
[0010]Thus, what is needed is a method to readily manipulate the filovirus genome.
SUMMARY OF THE INVENTION
[0011]The invention provides methods to prepare filovirus, e.g., Marburg virus and Ebola virus, from cloned DNA and compositions useful therefor. As described herein, a reverse genetics system was employed to generate filovirus, e.g., Ebola virus, from cloned cDNA. The genomic sequence was prepared by reverse transcription and amplification of viral RNA. The expression of the resulting genomic cDNA, e.g., in host cells, in sense and antisense orientation yields cRNA or vRNA, which in the presence of certain viral proteins, e.g., L, NP, VP30 and VP35, yielded infectious virus. This system was also used to generate a mutant virus with an altered furin cleavage motif in GP. When expressed in cells, the GP of the wild-type, but not of the mutant, virus was cleaved into GP1 and GP2. Although posttranslational furin-mediated cleavage of GP was thought to be an essential step in Ebola virus infection, generation of a viable mutant Ebola virus lacking a furin recognition motif in the GP cleavage site demonstrated that GP cleavage is not essential for replication of Ebola virus in cell culture.
[0012]Thus, the invention provides a composition comprising a plurality of filovirus vectors. The composition comprises a vector comprising a promoter operably linked to a nucleic acid molecule comprising a filovirus genomic cDNA linked to a transcription termination sequence, a vector comprising a promoter operably linked to a nucleic acid molecule, for instance, a DNA segment, encoding a filovirus RNA transcriptase-polymerase, a vector comprising a promoter operably linked to a nucleic acid molecule encoding a filovirus NP, a vector comprising a promoter operably linked to a nucleic acid molecule encoding filovirus VP30, and a vector comprising a promoter operably linked to a nucleic acid molecule encoding filovirus VP35. Preferred promoters for the vector comprising the filovirus cDNA include, but are not limited to, a RNA polymerase I promoter, RNA polymerase II promoter, RNA polymerase III promoter, T7 RNA polymerase promoter, or T3 RNA polymerase promoter, and preferred transcription termination sequences include, but are not limited to, a RNA polymerase I transcription termination sequence, RNA polymerase II transcription termination sequence, RNA polymerase III transcription termination sequence, or a ribozyme. The sequence of the filovirus genomic cDNA may be that of wild-type or may have one or more nucleotide deletions, insertions or substitutions relative to the genomic sequence of a corresponding wild-type filovirus. Virus, either wild-type or mutant, such as a randomly mutagenized sequence or one subjected to directed evolution, prepared from such a cDNA, is useful to screen for antiviral compounds or other desirable properties such as immunogenicity, to prepare a vaccine which results in a protective immune response when administered to animals, e.g., mammals and preferably primates, or to deliver a nucleic acid sequence of interest to cells, e.g., a marker gene, a gene encoding an immunogenic protein from a pathogen including viruses other than a filovirus, bacteria, fungi or yeast, or a therapeutic protein, e.g., ADA, CFTR, factor VIII or factor IX. Further, as the length of a filovirus virion is variable, the nucleic acid sequence of interest may be introduced into cloned filovirus cDNA, as an individual open reading frame, e.g., one encoding a functional protein, or so as to encode a fusion protein with a filovirus protein, or as a replacement (substitution) for one or more coding regions in the filovirus genome. Depending on whether or not virus replication is desirable, a filovirus cDNA which lacks one or more filovirus coding regions but comprises a DNA of interest may be introduced into a cell along with the full-length (genomic) cDNA, optionally with vectors encoding filovirus proteins. In particular, each of the coding regions for genes not associated with filovirus replication, e.g., GP, VP40 and VP24, may be replaced with a DNA of interest. The resulting virus-like particles may be employed to screen for compounds with desirable pharmacological profiles, e.g., antiviral compounds. Alternatively, a filovirus cDNA which lacks one or more viral coding regions, but includes filovirus sequences for encapsidation and/or replication and includes a DNA of interest, may be introduced into a cell along with vectors encoding filovirus proteins, to form virus-like particles.
[0013]The invention thus also provides a method to prepare filovirus. The method comprises contacting a cell with a vector comprising a promoter operably linked to a filovirus genomic cDNA or a portion thereof, e.g., a portion which, when expressed as vRNA is packaged into virions and can be replicated in the presence of filovirus proteins, linked to a transcription termination sequence, a vector comprising a promoter operably linked to a nucleic acid molecule, e.g., a DNA segment, encoding a filovirus RNA transcriptase-polymerase, a vector comprising a promoter operably linked to a nucleic acid molecule encoding filovirus NP, a vector comprising a promoter operably linked to a nucleic acid molecule encoding filovirus VP30, and a vector comprising a promoter operably linked to a nucleic acid molecule encoding filovirus VP35, so as to yield infectious filovirus. A portion of a filovirus cDNA includes portions which, when transcribed, yield a RNA which is capable of being packaged into filovirus virions or which is capable of being replicated in the presence of filovirus proteins. In one embodiment, the genomic cDNA may have been recombinantly manipulated, for example, by introducing one or more nucleotide deletions, insertions or substitutions. The promoters may be recognized by RNA polymerases expressed in the cells to be transfected, transformed or transduced with the vectors of the invention, or may be recognized by a RNA polymerase that is introduced to the cell concurrently or sequentially with the filoviral vectors, e.g., by introduction of the polymerase itself or a vector encoding the polymerase. In one embodiment, the filovirus genomic cDNA may be manipulated to encode a fusion protein, encode a therapeutic protein or a protein useful in a vaccine, e.g., an immunogenic tumor-specific protein or an immunogenic peptide or protein of a pathogen, such as a bacteria, virus, yeast, or fungus. Also provided are cells contacted sequentially or concurrently with a composition, vector or virus of the invention, virus obtained by the methods of the invention, and cells infected with the virus.
[0014]As also described herein, VP40, when expressed apart from other viral proteins in mammalian cells, induced particle formation, which differed in length but with uniform diameters of approximately 65 nm. Efficient particle formation may rely on a conserved N-terminal PPXY motif, as mutation or loss of this motif resulted in markedly reduced particle formation. These findings demonstrate that VP40 alone possesses the information necessary to induce particle formation, and this process most likely requires cellular WW-domain-containing proteins that interact with the PPXY motif of VP40. Flotation gradient analysis indicated that VP40 binds to membranes in a hydrophobic manner, as NaCl at 1 M did not release the protein from the lipid bilayer. Triton X-114 phase-partitioning analysis suggested that VP40 possesses only minor features of an integral membrane protein. Truncation of the C-terminal 50 amino acids of VP40 resulted in decreased association with cellular membranes, and demonstrated that this deletion disrupts hydrophobic interactions of VP40 with the lipid bilayer, as well as abolishing particle formation. Truncation of the C-terminal 150 amino acids or N-terminal 100 amino acids of VP40 enhanced the protein's hydrophobic association with cellular membranes. These mutants may be useful as dominant negatives, to determine targets for antivirals.
[0015]When the Ebola virus GP was expressed in cells, pleomorphic particles were found budding from the plasma membrane. By contrast, when GP was co-expressed with VP40, GP was found on the filamentous particles induced by VP40. These results demonstrated the central role of VP40 in the formation of the filamentous structure of Ebola virions and suggests an interaction between VP40 and GP in morphogenesis.
[0016]Thus, the invention provides a method to prepare lipid encapsulated particles comprising recombinant filovirus matrix protein. The method comprises providing a culture of eukaryotic cells contacted with a vector comprising a promoter operably linked to a nucleic acid, e.g., DNA, encoding a filovirus matrix protein or a portion thereof which is capable of being incorporated into a filovirus particle. Supernatant from the culture which comprises lipid encapsulated particles comprising filovirus matrix protein is then collected. Preferred eukaryotic cells are mammalian cells, including primate cells such as monkey or human cells, although any eukaryotic cell, in which the expression of VP40 results in VP40-containing particles in supernatants, may be employed. The particles prepared by the method are useful as nucleic acid (DNA or RNA) or protein delivery vehicles, e.g., as replication incompetent virus-like particles useful as a vaccine or a tolerogen, e.g., to suppress or inhibit an immune response to an endogenous antigen, e.g., myelin basic protein, collagen, thyroglobulin, acetylcholine receptor, DNA, or islet cell antigens, or an exogenous antigen, e.g., protein antigens of Alternaria alternata (Alt a I), Artemisia vulgaris (Art v II), Aspergillus fumigatus (Asp f II), Dermatophagoides pteron. (Der p I, Der pIII, Der p IV, Der p VI and Der p and domestic animals such as Felis domesticus (Fel d I), cows, pigs, poultry, mice, hamsters, rabbits, rats, guinea pigs, dogs and horses. Common fungal antigens include those of Basidiomycetes such as Ustilago, Ganoderma, Alternaria, Cladosporium, Aspergillus, Sporobolomyces, Penicillium, Epicoccum, Fusarium, Phoma, Borrytis, Helminthosporium, Stemphylium and Cephalosporium; Phycomycetes such as Mucor and Rhizopus; and Ascomycetes such Eurotium and Chaetomium.
[0017]Accordingly, the eukaryotic cell may also express a nucleic acid, e.g., DNA, fragment of interest, including, but not limited to, one which encodes a therapeutic protein or peptide, an immunogenic peptide or protein of a pathogen, a tumor antigen or an immunogenic peptide thereof, a transmembrane protein such as one which specifically binds to a receptor on a particular cell type or tissue, a viral glycoprotein which specifically binds to a receptor on a particular cell type or tissue, or a fusion thereof with a filovirus GP. In one embodiment, cells express filovirus matrix protein and a fusion (chimera) of a filovirus glycoprotein, e.g., the transmembrane domain and intracellular domain of the filovirus glycoprotein and the extracellular portion of a non-filovirus protein, e.g., the extracellular domain of a cellular or viral transmembrane protein, such as influenza virus HA, or a soluble peptide or protein (one which does not comprise a transmembrane domain) which binds to a particular receptor. The resulting lipid encapsulated particles specifically bind to cells having a receptor for the extracellular non-filovirus protein or the soluble peptide or protein. In this manner, the lipid encapsulated particles may be targeted to a specific cell type or tissue in an animal and can deliver the encapsulated content(s) of the particle to the specific cell type or tissue. Also provided are isolated and/or purified lipid encapsulated particles obtained by the method. Such particles may be employed to screen for antiviral compounds, e.g., antivirals for other nonsegmented viruses.
BRIEF DESCRIPTION OF THE FIGURES
[0018]FIG. 1. Generation of Ebola virus entirely from cloned cDNA. (A) Schematic diagram of cDNA plasmids for Ebola virus cRNA or (B) vRNA synthesis and their efficiencies for virus generation. T7 and Rib indicate T7 RNA polymerase promoter and ribozyme sequences, respectively. G designates a guanine nucleotide inserted between the promoter and Ebola virus cDNA. Synthesis of positive-sense Ebola virus cRNA is represented by "Ebola," while the inverse lettering denotes synthesis of negative-sense vRNA.
[0019]FIG. 2. Replication of Ebola virus in cell culture. A) Six days after infection (light microscope). B) Three days after infection (antibody-based staining for virus). Mutant refers to an Ebola virus with an altered furin recognition sequence in GP.
[0020]FIG. 3. Replication kinetics of wild-type Ebola virus and its GP cleavage mutant.
[0021]FIG. 4. Comparison of GP cleavage between wild-type and mutant Ebola virus. Labeled proteins were separated on 8% (A) and 15% (B) sodium dodecyl sulfate-polyacrylamide gels under reducing conditions. M, molecular mass marker; lane 1, mock-infected Vero E6 cells; lane 2, wild-type Ebola virus generated from plasmids; lane 3, GP cleavage mutant virus.
[0022]FIG. 5. (A) Schematic representation of wild-type VP40 and VP40 mutants. Substituted residues are indicated in bold-face type. (B) Kyte-Doolittle hydrophobicity plot of Ebola virus VP40 over a window of 17 amino acids (Justice et al., 1995).
[0023]FIG. 6. Expression of VP40, VP40/M14A and VP40AAXY in 293T cells. The sample for the negative control was prepared from cells transfected with the empty vector (pCAGGS/MCS). Lysates were harvested 24 hours post-transfection, and proteins were separated by SDS-PAGE (12%) and detected by Western blotting.
[0024]FIG. 7. Particle formation by VP40 and its mutants. Lanes represent fractions from a sucrose gradient (numbered from the top) loaded with VP40 (A) or a mutant VP40(B-D) from cells transfected with VP40-encoding constructs. Proteins were separated by SDS-PAGE (12%) and detected by Western blotting.
[0025]FIG. 8. Protease protection analysis of VP40-induced particles. Lane 1: no treatment; lane 2: soybean trypsin inhibitor; lane 3: Triton X-100; lane 4: trypsin; lane 5: Triton X-100 and trypsin; and lane 6: trypsin inhibitor and trypsin. Proteins were separated by SDS-PAGE (12%) and detected by Western blotting.
[0026]FIG. 9. Membrane-association analysis of VP40 and its deletion mutants. Shown are gradients from cells expressing VP40 (A-B), VP40/1-276 (C-D), VP40/1-226 (E-F), VP40/1-176 (G-H), VP40/50-326 (1-J), and VP40/100-326 (K-L). Fractions are numbered from the top to the bottom of the gradient. Proteins were separated by SDS-PAGE (12%) and detected by Western blotting.
[0027]FIG. 10. Membrane-association analysis of VP40 mutants VP40AAXY (A) and VP40M14A (B). Lanes represent fractions collected from the top of a gradient formed with a homogenate in 80% sucrose overlaid with 65% and 10% sucrose layers. Lanes represent fractions collected from the top of a gradient formed with a homogenate in 80% sucrose overlaid with 65% and 10% sucrose layers. Proteins were separated by SDS-PAGE (12%) and detected by Western blotting.
[0028]FIGS. 11A-G. Triton X-114 phase partitioning analysis of VP40 and its deletion mutants. The homogenate was partitioned into aqueous (A) and detergent (D) phases. Proteins were separated by SDS-PAGE (12%) and detected by Western blotting.
[0029]FIG. 12. Budding of GP-associated particles from the plasma membrane. Twenty four hours post-transfection of 293T cells with a GP-expressing plasmid (A). 293T cells transfected with an empty expression vector lack such particle formation (B). Bar, 100 nm.
[0030]FIG. 13. Pleomorphic particles resulting from GP expression. The supernatants of cells expressing GP were centrifuged through 20% sucrose, and the pelleted material was then negatively stained with 2% PTA. Pleomorphic particles with surface spikes were observed (A and B). Pelleted material was also immunolabeled with a mixture of anti-GP monoclonal antibodies conjugated to 15-nm gold particles C) and D). Bar, 100 nm.
[0031]FIG. 14. Morphologic changes in 293T cells expressing VP40. At 24 h post-transfection of 293T cells with a VP40-expressing plasmid, filamentous particles budding from the plasma membrane (A), membrane ruffles and the adhering site of two bilayers (C, arrows), as well as aggregated ribosomes (E, arrows) were apparent. Intracellular electron-dense filamentous structures (F, arrowheads) were also observed. The filamentous particles and membrane ruffles were immunolabeled with an anti-VP40 antibody conjugated with 5-nm gold particles (B and D). M, mitochondrion; mt, microtubule. Bar, 100 nm (A, B, C, D, F) or 200 nm (E).
[0032]FIG. 15. Filamentous particles induced by VP40 expression. The supernatants of cells expressing VP40 were centrifuged through 20% sucrose, and the pelleted material was then negatively stained with 2% PTA. Particles with uniform diameters of approximately 65 nm and varied lengths were observed (A-C). Bar, 100 nm.
[0033]FIG. 16. Filamentous, spiked particles budding from the plasma membrane 24 hours after-transfection of 293T cells with plasmids coexpressing VP40 and GP (A and B). Bar, 100 nm.
[0034]FIG. 17. Ebola virus-like particles produced by coexpression of VP40 and GP. The supernatants of cells coexpressing these two proteins were centrifuged through 20% sucrose, and the pelleted material was then negatively stained with 2% PTA. Filamentous particles with surface spikes and varied lengths were observed (A-C). Pelleted material was also immunolabeled with a mixture of anti-GP monoclonal antibodies conjugated to 15-nm gold particles (D, arrowheads), or treated with 0.03% Triton X-100 at room temperature for 15 minutes, and then immunolabeled with a mixture of anti-GP antibodies conjugated to 15-nm gold particles (E, arrowheads) and an anti-VP40 antibody conjugated to 5-nm gold particles (E, arrows). Bar, 1 μm (A) or 100 nm (B-E).
[0035]FIGS. 18A-18KKK. Representative filovirus sequences (Accession numbers AB050936, NC002549, NC001608, AF086833 and AF272001).
DETAILED DESCRIPTION OF THE INVENTION
[0036]The invention provides isolated and/or purified vectors or plasmids, which encode filovirus proteins, and/or express filovirus genomic RNA. When introduced into a cell, these vectors yield infectious filovirus. Thus, the invention includes isolated and/or purified filovirus prepared by the methods disclosed herein. As also described, the invention provides isolated and/or purified noninfectious lipid encapsulated particles, i.e., the contacting of cells with noninfectious particles does not yield progeny virus. As used herein, the terms "isolated and/or purified" refer to in vitro preparation, isolation and/or purification of a vector, plasmid, virus or lipid encapsulated particle of the invention, so that it is not associated with in vivo substances, or is substantially purified from in vitro substances. As used herein, the term "recombinant nucleic acid" or "recombinant DNA sequence, fragment or segment" refers to a nucleic acid, e.g., to DNA, that has been derived or isolated from a source, that may be subsequently chemically altered in vitro, and includes, but is not limited to, a sequence that is naturally occurring, is not naturally occurring, or corresponds to naturally occurring sequences that are not positioned as they would be positioned in the native genome. An example of DNA "derived" from a source, would be a DNA sequence that is identified as a useful fragment, and which is then chemically synthesized in essentially pure form. An example of such DNA "isolated" from a source would be a useful DNA sequence that is excised or removed from said source by chemical means, e.g., by the use of restriction endonucleases, so that it can be further manipulated, e.g., amplified, for use in the invention, by the methodology of genetic engineering.
[0037]The vectors or plasmids of the invention comprise filovirus cDNA, for example, one or more open reading frames encoding filovirus proteins or portions of the genomic sequence which are capable of being replicated and packaged into virions in the presence of filovirus proteins. Therefore, gene(s) or portions thereof other than those of a filovirus may be employed in the vectors or plasmids, or methods, of the invention. A vector or plasmid of the invention may comprise a gene or open reading frame of interest, e.g., a foreign gene encoding an immunogenic peptide or protein useful as a vaccine or a therapeutic protein. If more than one vector is employed, the vectors may be physically linked or each vector may be present on an individual plasmid or other, e.g., linear, nucleic acid delivery vehicle. The vectors or plasmids may be introduced to any host cell, preferably a eukaryotic cell. Preferred host cells to prepare virus or lipid encapsulated particles of the invention include insect, avian or mammalian host cells such as canine, feline, equine, bovine, ovine, or primate cells including simian or human cells.
[0038]The filovirus genomic cDNA of the invention allows easy manipulation of filovirus, e.g., by the introduction of mutations into the viral genome. The methods of producing virus described herein, which do not require helper virus infection, are useful in viral mutagenesis studies, and in the production of vaccines (e.g., for AIDS, influenza, hepatitis B, hepatitis C, rhinovirus, filoviruses, malaria, herpes, and foot and mouth disease) and gene therapy vectors (e.g., for cancer, AIDS, adenosine deaminase, muscular dystrophy, ornithine transcarbamylase deficiency and central nervous system tumors). In particular, the use of lipid encapsulated particles of the invention which induce strong humoral and cellular immunity may be preferred as vaccine vectors as they are noninfectious and unlikely to give rise to infectious recombinant virus.
[0039]Thus, a virus for use in medical therapy (e.g., for a vaccine or gene therapy) is provided. For example, the invention provides a method to immunize an animal against a pathogen, e.g., a bacteria, virus, or parasite, or a malignant tumor. The method comprises administering to the animal an amount of at least one isolated virus of the invention which encodes and expresses, or comprises, an immunogenic peptide or protein of a pathogen or tumor, optionally in combination with an adjuvant, effective to immunize the animal. Alternatively, a lipid encapsulated particle of the invention may be used for immunization, either by delivering a DNA vaccine, or via expression of the immunogenic protein on the surface of the particle, for instance, the particle comprises a fusion protein comprising the extracellular domain of an immunogenic protein and the transmembrane and cytoplasmic portion of filovirus GP.
[0040]Also provided is a method to augment or increase the expression of an endogenous protein in an animal, e.g., a mammal such as a rodent, nonhuman primate or human, having an indication or disease characterized by a decreased amount or a lack of the endogenous protein. The method comprises administering to the animal an amount of an isolated virus of the invention effective to augment or increase the amount of the endogenous protein in the animal. Alternatively, a lipid encapsulated particle of the invention can be employed to deliver DNA encoding the protein or the protein itself. When the particle is used to deliver protein, optionally the particle comprises a chimeric transmembrane protein comprising an extracellular protein for targeting the particle to a specific tissue or cell type and the transmembrane and cytoplasmic portion of a filovirus GP.
[0041]The invention will be further described by the following non-limiting examples.
Example 1
Generation of Transfectant Ebola Virus
Materials and Methods
[0042]Efficiency of Virus Generation. To determine the efficiency of virus generation, Vero E6 cells were cotransfected with protein expression plasmids and the plasmid for Ebola virus vRNA or cRNA synthesis. Four days after transfection, the efficiency of virus generation was measured by determining the dose required to infect 50% of tissue culture cells (TCID50) per ml of supernatant. The data shown in FIG. 1A are representative results from three independent experiments. Experiments for the generation of Ebola virus as well as the characterization of recombinant Ebola virus were carried out in the BSL4 facility at the Canadian Science Centre for Human and Animal Health, Winnipeg, Canada. Cells were transfected with plasmids for the expression of the Ebola virus NP, L, VP30, and VP35 proteins, and with the plasmid for Ebola virus cRNA or vRNA synthesis, controlled by T7 RNA polymerase promoter and ribozyme sequences. T7 RNA polymerase was provided by cotransfection of cells with pC-T7Pol.
[0043]Immunofluorescence Assay. Vero E6 cells were infected at a multiplicity of infection of 10-2 with either wild-type Ebola virus generated from plasmids or Ebola virus with an altered furin recognition sequence in its GP. Six days later, cells were observed under a light microscope. Three days after infection, cells were permeabilized and stained with antiserum. Three days after infection, cells were fixed with 2% paraformaldehyde in phosphate-buffered saline, followed by inactivation by gamma irradiation (2 Mrads). Cells were permeabilized with 0.1% Triton X-100 in phosphate-buffered saline for 15 minutes, washed three times with phosphate-buffered saline, and incubated for 1 hour at room temperature with an anti-Ebola virus Zaire rabbit antiserum (1:100 dilution in phosphate-buffered saline). After three washes with phosphate-buffered saline, Cy3-labeled anti-rabbit (1:500) conjugate (Rockland, Gilbertsville, Pa.) was added for 1 hour at room temperature. The cells were then washed with phosphate-buffered saline, mounted, and analyzed using an Axioplan 2 microscope (Zeiss).
[0044]Replication Kinetics. Vero E6 cells were infected with the wild-type or cleavage site mutant at a multiplicity of infection of 10-2. Supernatants were harvested at 2, 24, 48, and 72 hours postinfection. The TCID50 was determined by infecting Vero E6 cells with 10-fold dilutions of the supernatants obtained at the above-mentioned time points.
[0045]Labeling of Protein and Immunoprecipitation Analysis. Vero E6 cells were infected at a multiplicity of infection of 10' and incubated until a cytopathic effect was observed. After the medium was removed, the cells were washed once with methionine- and cysteine-free DMEM and labeled for 24 hours in 2 ml of methionine- and cysteine-free Dulbecco's modified Eagle's medium containing 2% dialyzed fetal calf serum and 10 μCi of protein labeling mix (NEN, Mississauga, Canada)/ml. The supernatants were then clarified by centrifugation (1,000×g for 5 minutes at 4° C.). An equal volume of 2×RIPA buffer (2% Triton X-100, 2% sodium deoxycholate, 0.2% sodium dodecyl sulfate, 0.3 M NaCl, 40 mM Tris-HCl [pH 7.7], 20 mM EDTA [pH 8.0], 0.4 U of aprotinin/ml, 2 mM phenylmethylsulfonyl fluoride, 20 mM iodoacetamide) was added to the supernatants, and the solutions were subsequently inactivated by gamma irradiation (2 Mrad). Aliquots of the inactivated labeled material were mixed with an anti-Ebola virus Zaire horse serum and incubated at 4° C. overnight. The immune complexes were mixed with 30 μl of protein G sepharose for 3 hours at 4° C. with rotation. After 3 washes with RIPA buffer, the immunoprecipitated proteins were recovered by boiling them in 1×RIPA buffer.
Results
[0046]To generate a cDNA clone encoding the entire genome of Ebola virus (Zaire species, strain Mayinga), viral RNA was reverse transcribed with ThermoScript Reverse Transcriptase (Gibco/BRL, Rockville, Md.) and amplified by PCR with Pfu Turbo (Stratagene, La Jolla, Calif.). The resulting cDNA fragments were cloned in a Bluescript vector or its derivatives. A consensus sequence was determined and compared to a reference sequence (GenBank accession number AF086833). An A insertion was found between nucleotides 9,744 and 9,745, which was also detected in a partial Ebola virus genomic sequence (GenBank accession number L11365). In addition, an A insertion was found between nucleotides 18,495 and 18,496, and an A-to-T replacement was detected at position 18,226. The latter two changes have also been reported for a functional Ebola virus minigenome (Muhlberger et al., 1999). A full-length Ebola virus cDNA construct was assembled in a modified pTM1 vector (Moss et al., 1990), using conventional cloning techniques. Sequence analysis of the resulting full-length clone proved that no mutations had occurred during cloning procedures in E. coli.
[0047]Negative-sense RNA viruses have been generated from constructs encoding either the negative-sense viral RNA (vRNA) or the positive-sense complementary RNA (cRNA) (Marriott et al., 1999; Nagai et al., 1999; Neumann et al., 1999; Roberts et al., 1999; Schnell et al., 1994), albeit with higher efficiencies from the latter (Durbin et al., 1997; and Kato et al., 1996). cDNA constructs encoding the entire viral genome were generated, flanked by the T7 RNA polymerase promoter and a ribozyme, in both positive-sense and negative-sense orientations (FIG. 1A). To achieve efficient transcription, the wild-type T7 RNA polymerase promoter, which yields transcripts with an additional G at the 5' end, was used to generate pTM-T7G-Ebo-Rib and pTM-Rib-Ebo-GT7 (FIG. 1A).
[0048]The generation of negative-sense RNA viruses requires viral proteins and genomic RNA for replication and transcription. For Ebola virus, the proteins necessary for replication and transcription include the RNA-dependent RNA polymerase L, and the nucleoprotein (NP), and two additional auxiliary proteins (VP30 and VP35) (Muhlberger et al., 1999). To generate constructs for the expression of Ebola viral proteins, the respective cDNA fragments were amplified by PCR, the products sequenced and then cloned into the eukaryotic expression vector pCAGGS/MCS (controlled by the chicken β-actin promoter) (Kobasa et al., 1997; Niwa et al., 1991), resulting in four plasmids (pCEZ-NP, pCEZ-VP30, pCEZ-VP35, and pCEZ-L).
[0049]To generate Ebola virus, 5×105 Vero E6 (African green monkey kidney) cells were transfected with 1 μg of the respective plasmid for Ebola virus vRNA or cRNA synthesis and with the following amounts of protein expression plasmids: 1 μg of pCEZ-NP, 0.3 μg of pCEZ-VP30, 0.5 μg of pCEZ-VP35, and 2 μg of pCEZ-L (FIG. 1B). To drive the transcription of viral RNA from the T7 RNA polymerase promoter, cells were cotransfected with 1 μg of an expression plasmid for T7 RNA polymerase (pC-T7pol). Four days later, supernatants were collected and used to infect fresh Vero E6 cells. When examined at 6 to 8 days postinfection, the cells showed cytopathic effects, indicating the generation of infectious Ebola virus entirely from cloned cDNA. Ebola virus was produced from constructs encoding either negative-sense vRNA or positive-sense cRNA (FIG. 1A). To determine the efficiency of virus generation, supernatants of transfected cells were collected 4 days after transfection, and the titer of virus in the supernatant was determined in Vero E6 cells. The efficiencies of virus generation from negative-sense vRNA or positive-sense cRNA were comparable, resulting in the generation of 102 50% tissue culture infective doses (TCID50) per ml of supernatant.
[0050]A subsequent passage of the virus in Vero E6 cells was performed to confirm the authenticity of the replicating agent. The first signs of a cytopathic effect were observed at 48 hours postinfection and became more prominent during the following days (FIG. 2A). Indirect immunofluorescence assays with a rabbit antiserum to Ebola virus GP/secreted GP (sGP) demonstrated the presence of Ebola virus GPs (FIG. 2B). None of the negative controls (untreated cells or cells transfected with the full-length cDNA construct or the protein expression plasmids alone) showed cytopathic effects or reacted with the anti-GP/sGP antiserum
[0051]The availability of a method for generating Ebola virus mutants greatly increases opportunities to dissect mechanisms of viral pathogenesis. For many viruses, postranslational cleavage of membrane glycoproteins by host proteolytic enzymes, including subtilisin-like proteases such as furin, is a prerequisite for fusion between the viral envelope and cellular membranes and therefore an important step in pathogenesis (Klenk et al., 1994). The Ebola virus GP is cleaved by furin or furin-like proteases at a highly conserved sequence motif (R--X--K/R--R; X, any amino acid) (Volchkov et al., 1998). Since the amino acid sequence of the GP of the Reston species, the least pathogenic of all Ebola virus subtypes in humans, deviates from the optimal furin recognition sequence, GP cleavage has been thought to be an important determinant of Ebola virus pathogenicity (Feldmann et al., 1999).
[0052]The effect of an altered furin recognition motif on Ebola virus replication was studied by modifying pTM-T7G-Ebo-Rib. The multibase furin recognition site (RRTRR at amino acid positions 497 to 501 of the GP) in pTM-T7G-Ebo-Rib was replaced with 497-AGTAA-501. The modified plasmid, designated pTM-T7G-Ebo-Rib-Cl(-), was transfected into Vero E6 cells, together with protein expression plasmids for the NP, VP30, VP35, and L proteins and for T7 RNA polymerase. Fresh Vero E6 cells were subsequently incubated with supernatants derived from the transfected cells. Six days later, cytopathic effects were observed in these cells. Indirect immunofluorescence assays with antiserum to Zaire Ebola virus GP/sGP verified virus replication (FIG. 2). Growth curves in Vero E6 cells demonstrated that although the mutant virus grew slightly more slowly than the wild-type virus (FIG. 3), it reached 1010 TCID50/ml at 3 days postinfection.
[0053]To confirm the presence of mutations in the GP cleavage motif, wild-type and mutant viruses were passaged three times in Vero E6 cells, RNA extracted from virions, and reverse transcriptase PCR performed with primers spanning the altered furin recognition motif. Direct sequencing of the PCR products confirmed the retention of mutations in the GP cleavage site (data not shown).
[0054]FIG. 4 shows the results of experiments testing the cleavability of the Ebola virus mutant GP lacking a furin recognition motif. Virions derived from labeled Vero E6 cells infected with wild-type or mutant virus were lysed, and viral proteins were detected by immunoprecipitation using a horse antiserum to Zaire Ebola virus. For wild-type virus, both cleavage products GP1 (140 kDa) and GP2 (26 kDa) were detected (FIGS. 4A and B, lanes 2). By contrast, alteration of the furin recognition sequence abolished the generation of GP1 and GP2, and only the precursor, GP0, was detected (FIGS. 4A and B, lanes 3), confirming that furin or related proteases are the major host cell proteases for GP cleavage. These results indicate that the furin recognition motif at the Ebola GP cleavage site is dispensable for replication of the virus in cell culture.
Discussion
[0055]Marburg and Ebola viruses have been difficult to study because they must be handled in high-containment facilities, and effective methods of experimental mutagenesis were lacking. These limitations have restricted the development of antiviral drugs and vaccines, although reports of potentially useful experimental vaccines (Hevey et al., 1998; Sullivan et al., 2000; Vanderzanden et al., 1998; Xu et al., 1998) and antibody-mediated treatments (Maruyama et al., 1999; and Wilson et al., 2000) are beginning to emerge. The use of a reverse genetics system, which enables one to generate Ebola virus mutants entirely from cloned cDNA as described herein, opens a new era of filovirus research.
[0056]For many viruses in the Orthomyxoviridae and Paramyxoviridae families, GP cleavage by furin and other host cell proteases is absolutely required for their infectivity and thus determines the extent of viral pathogenicity (Klenk et al., 1994). In contrast, findings with viruses pseudotyped with Ebola GPs as well as the present results demonstrate that GP cleavage is dispensable for replication of Ebola virus, at least in cell culture. The furin cleavage motif is highly conserved among all Ebola GP sequences determined thus far, and its conservation suggests a role in the viral life cycle. Hence, GP cleavage by furin is not critical for Ebola virus replication in the cells tested, but it may be required for Ebola virus replication in vivo and/or in its natural reservoir. Further studies with animal models will be needed to establish the role of GP cleavage in Ebola virus replication and pathogenicity.
[0057]T7 RNA polymerase-based reverse genetics systems rely on the expression of this enzyme within the transfected cells. To this end, two approaches have been explored (reviewed in Marriott et al., 1999; Nagai et al., 1999; and Roberts et al., 1999). T7 RNA polymerase has been provided from recombinant vaccinia virus or from stable cell lines constitutively expressing this enzyme. The former approach leaves investigators with the task of separating the artificially generated recombinant virus from vaccinia virus. On the other hand, cell lines expressing T7 RNA polymerase may produce insufficient amounts to efficiently transcribe the viral genome. In contrast to Volchkov et al. (2001), who used a BHK-21 cell line stably expressing T7 RNA polymerase, an entirely plasmid-based system is described herein which was achieved using T7 RNA polymerase expression under control of the strong chicken β-actin promoter. This approach resulted in 102 PFU of virus per ml of culture supernatant. Expression of T7 RNA polymerase from plasmids may therefore be an alternative for the generation of other nonsegmented, negative-sense RNA viruses, thereby circumventing restraints encountered with the established systems.
[0058]The reverse genetics systems for the generation of Ebola virus can be used to identify key regulatory elements and structure-function relationships in the viral life cycle, and allows the study of mechanisms of filovirus pathogenicity in animal models. The system also promotes the development of new vaccines and the development of replication-deficient viruses.
Example 2
Generation of Noninfectious Ebola Particles
Materials and Methods
[0059]Cells. 293 and 293T human embryonic kidney cells were maintained in DMEM supplemented with 10% fetal calf serum, 2% L-glutamine, and penicillin-streptomycin solution (DMEM-FCS) (Sigma). The cells were grown at 37° C. in 5% CO2.
[0060]Construction of Plasmids. To generate cDNA constructs encoding the VP40 protein, primers were used that bind to the start and stop codons (positions 4479 and 5459 of the positive-sense antigenomic RNA) to reverse transcribe and PCR-amplify purified viral RNA (Titan RT-PCR Kit, Roche). The PCR product was cloned in the pT7Blue vector (Novagen) resulting in pT7EboZVP40. The cloned Ebola VP40 gene was sequenced to ensure that unwanted nucleotide replacements were not present.
[0061]To generate plasmid pETEBoZVP40His for the expression of 6-histidine-tagged VP40 in Escherichia coli, pT7EboZVP40 was used as a template for PCR amplification with the appropriate primers. The PCR product was blunt-end ligated into the SmaI-digested site of vector pM (CLONETECH). This construct was digested with NdeI and EcoRI and the fragment containing VP40 was ligated into the expression vector pET-5a (Promega). To generate plasmids pCEboZVP40, pCEboZVP40AAXY, pCEboZVP40M 14A, pCEboZVP40/1-276, pCEboZVP40/1-226, pCEboZVP40/1-176, pCEboZVP40/50-326, and pCEboZVP40/100-326 (proteins expressed from these plasmids are designated VP40, VP40AAXY, and the like) for expression of VP40 and its mutants in eukaryotic cells, the Ebola Zaire VP40 gene was amplified from pT7EboZVP40 using specific forward primers, each containing an EcoRI site 5' to the start of the coding region, and specific reverse primers, each containing a BglII site 3' to the stop codon for each construct, and blunt-end ligated into the EcoRV-digested site of vector pT7Blue. Each construct was digested with EcoRI and BglII, and the fragment containing the VP40 gene or modified VP40 gene was cloned into the EcoRI and BglII-digested eukaryotic expression vector pCAGGS/MCS (expression controlled by the chicken β-actin promoter) (Kobasa et al., 1997; and Niwa et al., 1991). Eukaryotic expression constructs employed in this study are schematically presented in FIG. 5A.
[0062]Antibody. A polyclonal antibody against Ebola Zaire VP40 was produced as follows: BL21 E. coli cells were transformed with plasmid pETEboZVP40H is. Expression of the 6-His-tagged VP40 protein was induced with 1 mM IPTG for 3 hours. The E. coli cells were lysed and cellular debris was remove by centrifugation. The supernatant was purified over an Ni-NTA agarose column (Qiagen). Expression of VP40 was verified by SDS-PAGE followed by Western blotting using a monoclonal antibody against the histidine tag (Kodak). Rabbits were immunized with approximately 0.5 mg of VP40, and antibody against keratin present in the antiserum was removed with a keratin column (Girault et al., 1989).
[0063]Cell Transfection for Expression of VP40 and its Mutants. 293 or 293T cells (60-mm plates) were transfected with expression vectors with the use of the Trans IT LT-1 liposomal reagent (Panvera) according to the manufacturer's instructions. Briefly, DNA and transfection reagent were mixed (6 μl of Trans IT LT-1 with 3 μg of DNA) in 0.2 ml OPTI-MEM (Gibco-BRL), incubated for 30 minutes at room temperature, and added to the cells. Transfected cells were incubated at 37° C. until harvest of the supernatant and/or cell monolayer.
[0064]Particle Formation Assay. Particles were assayed by the method of Li et al (1993) with some modifications. Forty-eight hours after transfection of 293T cells with pCEboZVP40, pCEboZVP40AAXY, pCEboZVP40M14A, or pCEboZVP40/1-276, the culture medium was removed and placed on ice. The cell monolayer was washed with phosphate-buffered saline (PBS), scraped into lysis buffer (0.25 M Tris-HCl, pH 8.0, 0.5% Triton X-100) and kept at 4° C. The culture medium (2 ml) was centrifuged at 2,000 rpm in a microcentrifuge for 5 minutes to remove cellular debris, layered over 20% sucrose in STE buffer (0.01 M Tris-Cl, pH 7.5, 0.01 M NaCl, 0.001 M EDTA, pH 8.0) (2 ml), and centrifuged at 150,000×g for 2 hours at 4° C. After centrifugation, the supernatant was removed and added to the cell lysate. This mixture was saved for analysis of total protein expression. The pellet was resuspended in 1 ml STE buffer overnight at 4° C. The resuspended pellet was layered over a 10-50% discontinuous sucrose gradient in STE buffer, centrifuged at 150,000×g for 4 hours at 4° C., and fractions (1 ml) were collected from the top of the gradient. Each fraction was mixed with 0.25 ml of 50% trichloroacetic acid (TCA) (10% TCA), the fractions were incubated for 30 minutes on ice, and the precipitated proteins were pelleted by microcentrifugation for 15 minutes. The pellets were washed once with cold acetone, air-dried, and resuspended in 0.05 ml SDS-PAGE sample buffer. Proteins in the mixture of cell lysate and supernatant from centrifugation through 20% sucrose were precipitated with 10% TCA, washed with acetone, and resuspended in 0.5 ml SDS-PAGE sample buffer. Proteins were separated by 12% SDS-PAGE and detected by Western blotting. Fractions are numbered from the top to the bottom of the gradient.
[0065]Protease Protection Assay. 293T cells were transfected with pCEboZVP40 and, at 48 hours post-transfection, the culture medium was removed. The medium was microcentrifuged at 2,000 rpm for 5 minutes to remove cellular debris, layered over a 20% sucrose cushion, and centrifuged at 165,000×g for 1 hour at 4° C. The supernatant was removed and the pellet was resuspended overnight at 4° C. in 0.4 ml STE buffer. This resuspension was divided into six aliquots and treated following a protocol previously described (Mik et al., 1989): Aliquot 1 received no further treatment; aliquot 2 was treated with soybean trypsin inhibitor (Biofluids) to a final concentration of 3 mg/ml; aliquot 3 with triton X-100 to a final concentration of 1%; aliquot 4 with trypsin (Worthington) to a final concentration of 0.1 mg/ml; aliquot 5 with both Triton X-100 to 1% and trypsin to 0.1 mg/ml final concentration; and aliquot 6 with both trypsin inhibitor (3 mg/ml final) and trypsin (0.1 mg/ml final). The samples were incubated at room temperature for 30 minutes, after which an excess of trypsin inhibitor (5 mg/ml) was added to each aliquot. SDS-PAGE sample buffer (6X) was added to each aliquot. Proteins from each aliquot were separated by 12% SDS-PAGE and detected by Western blotting.
[0066]Membrane-Association Assay. The method of Bergmann and Fusco (1988) was used, with some modifications, to determine membrane-association of VP40 and its mutants. Briefly, 48 hours after transfection of 293 cells with pCEboZVP40 or a mutant-VP40 expression plasmid, the culture medium was removed, and the cell monolayer, after a wash with (PBS), was scraped into ice-cold sucrose homogenization buffer (10% wt/wt sucrose, 10 mM Tris-HCl (pH 7.4), 1 mM EDTA, and 10 mM iodoacetamide). Cells were disrupted with 30 strokes of a Dounce homogenizer on ice and microcentrifuged for 3 minutes at 2,000 rpm to remove nuclei. The resulting supernatant was made to 1 M NaCl or left untreated, incubated at room temperature for 20 minutes, made to 80% sucrose (wt/vol), placed at the bottom of a Beckman SW41 centrifuge tube, and overlaid with 5 ml of 65% (wt/vol) sucrose and 2.5 ml of 10% sucrose. The gradient was centrifuged to equilibrium at 150,000×g for 18 hours at 4° C. Fractions (1 ml) were collected from the top of the gradient, diluted 1:1 with TBS-Triton buffer (0.025 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.5% Triton X-100) or, for experiments involving expression of VP40/100-326, precipitated with TCA (as described for the particle formation assay) owing to the weak signal of this deletion construct in Western analysis, and mixed with SDS-PAGE sample buffer. Proteins from each aliquot were separated by 12% SDS-PAGE and detected by Western blotting.
[0067]Triton X-114 Phase Partitioning Analysis. The method used was essentially that of Bordier (1981). Forty-eight hours post-transfection of 293 cells pCEboZY40, pCEboZVP40/1-276, pCEboZVP40/1-226, pCEboZVP40/1-176, pCEboZP40/50-326, pCEboZVP40/100-326, or, as a control, a vector expressing A/WSN/33 (H1N1) influenza virus hemagglutinin (HA), cells were scraped into cold TN buffer (10 mM Tris-HCl, pH 7.4, 150 mM NaCl), disrupted with 30 strokes in a Dounce homogenizer, and subjected to centrifugation at 2,000 rpm for 3 minutes to remove nuclei. Triton X-114 (Sigma) was added to each supernatant to 1%, and the resulting solution was incubated for 15 minutes at 4° C. with agitation. Unsolubilized material was pelleted by centrifugation in a picofuge for 5 minutes at 4° C., and the supernatant was heated to 37° C. for 5 minutes. The supernatant was layered onto a 37° C. sucrose (6%) cushion in TN buffer containing 0.06% Triton X-114 and centrifuged at 2,000 rpm for 3 minutes at room temperature. The detergent (lower) and aqueous (upper) phases were recovered separately, the aqueous phase was extracted a second time, like phases were pooled, and the detergent phase was diluted in TN buffer. Proteins in each phase were precipitated with 50% acetone and resuspended in SDS-PAGE sample buffer. Proteins were separated by 12% SDS-PAGE and analyzed by Western blotting.
[0068]Western Blotting. Samples in sample buffer (10 μl) were incubated at 100° C. for 5 minutes and separated on 12% polyacrylamide gels. Resolved proteins were transferred to Westran polyvinylidine difluoride membranes (Schleicher & Schuell) and blocked overnight at 4° C. with 5% skim milk in PBST (0.05% Tween 20 (Sigma) in PBS). Blots were incubated in primary antibody for 1 hour at room temperature, washed three times with PBST, incubated in biotinylated anti-rabbit secondary antibody (Vector Laboratories) for 30 minutes, washed three times with PBST, incubated in streptavidin-horseradish peroxidase reagent (Vector Laboratories) for 30 minutes and washed three times with PBST. Blots were then incubated in Lumi-Light Western blotting substrate (Boehringer-Mannheim) for 5 minutes and exposed to x-ray film (Kodak).
Results
[0069]Expression of VP40 in Mammalian Cells. To ensure that VP40 is expressed at efficient levels in human embryonic kidney 293T cells, the cell lysate was analyzed 24 hours after transfection with pCEboZVP40 by Western blotting. Two bands reacting with anti-VP40 polyclonal antibody were found, a small distance apart, in the range of 40 kDa (FIG. 6). The lysate from cells transfected with the expression vector alone did not react with the antibody.
[0070]VP40 contains an internal start codon at nucleotides 40-42 (codon 14) that is in frame with the first AUG. To determine whether protein synthesis from this internal start codon was responsible for the faster-migrating band on the gel, a construct was generated, pCEboZVP40M14A, which expresses a mutant VP40 with this second AUG changed to GCG, which encodes alanine, and expressed it as described above. Analysis of the cell lysate revealed a single, larger-sized band (FIG. 6), suggesting that the second AUG is used as a start codon to an appreciable extent in this system.
[0071]To determine whether loss of the PPXY motif at amino acids 10-13 of VP40 affects expression of the protein, 293T cells were transfected with pCEboZVP40AAXY, which expresses a mutant VP40 in which the PPEY sequence at amino acids 10-13 was changed to AAEY. Two bands corresponding to those seen with the expression of wild-type VP40 were detected (FIG. 6). However, in contrast to the results obtained with wild-type VP40 expression, where the slower-migrating band was the predominate product, pCEboZVP40AAXY expressed the two products at similar levels, indicating that loss of the PPXY motif affects either the translation of VP40 or its stability.
[0072]Production of Membrane-Bound Particles. To determine whether VP40-associated vesicles are produced when the protein is expressed in the absence of other viral proteins, 293T cells were transfected with pCEboZVP40 and, after 48 hours, collected the supernatant. After removal of cellular debris, the supernatant was subjected to ultracentrifugation over a 20% sucrose cushion. The pellet was resuspended and centrifuged through a 10-50% discontinuous sucrose gradient, and fractions were analyzed by Western blotting (FIG. 7). Fractions 6-8 contained VP40, with the majority of the protein found in fraction 7. The VP40 in fractions 6-8 was most likely associated with membrane lipids in a particle-like structure, as the sucrose densities in these fractions ranged from 1.11 to 1.13 g/ml, which corresponds to findings for matrix protein-generated particles of other viruses (Giddings et al., 1998; Sandefur et al., 1998). Bands detected below full-length protein in the total protein fraction are likely degradation products. These data indicate that VP40 expressed in the absence of other viral proteins can produce membrane-bound particles.
[0073]Protease Protection Assay. To confirm the ability of VP40 to produce membrane-bound particles when expressed alone, a trypsin protection assay was employed. Culture supernatant from cells transfected with pCEboZVP40 was centrifuged at 165,000×g through 20% sucrose, and the pellet was resuspended in STE buffer and divided into six equal aliquots. Aliquots 1-3 served as controls (untreated, trypsin inhibitor treated, and triton X-100 treated), aliquot 4 was treated with trypsin, aliquot 5 with trypsin and triton X-100, and aliquot 6 with trypsin inhibitor and trypsin. Trypsin degraded VP40 only in the presence of triton X-100 (FIG. 7), indicating that the viral protein does induce the production of fully membrane-bound particles; that is, trypsin digestion of VP40 required disruption of the lipid-bilayer surrounding the protein.
[0074]VP40 Mutants and Membrane-Bound Particle Formation. Does the PPXY motif at amino acids 10-13 of VP40 contribute to particle production? To address this question, VP40AAXY was expressed in 293T cells and assayed for particles as described for wild-type VP40. VP40AAXY was not detected in fractions corresponding to the sucrose densities to which wild-type VP40 particles migrated (FIG. 7). Since VP40AAXY was synthesized at levels similar to wild-type VP40, this finding indicates that mutation of the PPXY motif markedly disrupts VP40-generated vesicle formation.
[0075]FIG. 7 also shows the effect of loss of the second AUG codon on particle formation. A substantial amount of VP40M14A was present in fractions 5-8 in the gradient, and the percentage of total VP40M14A expressed in 293T cells that contributed to membrane-bound particle formation was much greater than the percentage of total wild-type VP40 involved in particle formation. This result is consistent with the finding that the PPXY motif present immediately upstream of the second AUG is critical for VP40-associated particle formation (FIG. 7).
[0076]To determine whether the C-terminus of VP40 is essential for particle formation, a deletion mutant, VP40/1-276, was assayed which lacks the final 50 amino acids of VP40, for particle generation. Since this deletion mutant was not present at the same sucrose densities that characterized the migration of wild-type VP40, it was concluded that the first 276 amino acids of VP40 are not sufficient for particle formation (FIG. 7).
[0077]VP40 Association with Cell Membranes and Structural Requirements for Activity. Flotation analysis was used to determine if VP40 binds cellular membranes efficiently in mammalian cells. In this method, postnuclear membrane fractions in 80% sucrose are loaded at the bottom of a centrifuge tube and overlaid with 65% and 10% sucrose. During centrifugation, cellular membranes and their associated proteins float to the 10-65% sucrose interface, while soluble proteins remain in the dense sucrose fractions at the bottom of the tube.
[0078]A large percentage of wild-type VP40 was found at the 10-65% sucrose interface (fraction 3), while the remaining protein was found in the loading zone (fractions 8-12) (FIG. 9), indicating that VP40 does indeed bind cellular membranes. To clarify the interactions involved in this association, VP40-associated membranes were treated with 1 M NaCl to determine whether electrostatic interactions were required for this association and subjected them to flotation analysis. Salt treatment had a negligible affect on the ability of VP40 to associate with membranes (FIG. 9), suggesting that the protein contains at least one hydrophobic domain able to associate with membranes.
[0079]To elucidate the domain(s) of VP40 important for membrane association, deletion mutants were generated. Constructs expressing amino acids 50-326 (pCEboZVP40/50-326), amino acids 100-326 (pCEboZVP40/100-326), amino acids 1-176 (pCEboZVP40/1-176), amino acids 1-226 (pCEboZVP40/1-226), and amino acids 1-276 (pCEboZVP40/1-276) of VP40 were expressed in 293 cells and their membrane association in the presence or absence of 1 M NaCl was examined. The mutants with the largest truncations, VP40/1-176 and VP40/100-326, showed the highest level of association with the lipid bilayer (FIG. 9). Salt treatment did not affect these interactions. Mutants VP40/1-226 and VP40/50-326 associated with membranes to the extent found with wild-type VP40, and these interactions were also relatively unperturbed by treatment with salt. By contrast, only a small portion of VP40/1-276 associated with the lipid bilayer, and this interaction was eliminated upon treatment with salt. These results indicate that loss of the C-terminal 50 amino acids of VP40 markedly alters the membrane-binding capabilities of VP40, primarily by disrupting hydrophobic interactions. This effect was ameliorated when 50 additional C-terminal amino acids were deleted, and membrane-association was promoted when the protein was further truncated to 176 amino acids. Deletion of the N-terminal 49 amino acids of VP40 did not alter the membrane-binding characteristics of the protein, although truncation of 50 additional N-terminal amino acids did enhance protein-membrane association, as seen with VP40/1-176 (FIG. 9).
[0080]Since particle formation was markedly reduced with VP40AAXY, cells expressing this mutant were subjected to flotation analysis in order to determine whether a decreased ability to bind membranes was involved in this deficiency. As shown in FIG. 10, the loss of the PPXY motif in VP40 did not affect the ability of the protein to bind membranes, indicating that lack of particle production with this mutant was not due to the loss of membrane association.
[0081]Flotation analysis was also used to determine whether the more efficient particle formation induced by VP40M14A, by comparison to wild-type VP40, could be attributed, at least in part, to increased membrane binding by this mutant. The percentage of VP40M14A associated with membranes was only slightly greater than that determined for wild-type VP40 (FIG. 10), indicating that this mutant relies on another mechanism to increase particle formation.
[0082]Triton X-114 Phase Partitioning Analysis. To probe the nature of the VP40-membrane interaction further, Triton X-114 phase partitioning analysis was used as integral membrane proteins and lipid anchored proteins partition in the detergent phase of a protein extraction and peripheral membrane proteins partition in the aqueous phase. FIG. 11 shows the results of this analysis for wild-type VP40, the five deletion mutants of VP40, and influenza virus HA. HA, an integral membrane protein, was found entirely in the detergent phase of the extraction, as expected. Only a small portion of total VP40 was found in the detergent phase, while VP40/1-276 was found almost entirely in the aqueous phase. VP40/1-226 and VP40/50-326 partitioned in the detergent phase in proportions similar to that found for wild-type VP40. By contrast, when VP40/1-176 and VP40/100-326 were expressed, large proportions of each partitioned in the detergent phase. These results indicate that wild-type VP40 possesses only minor traits of an integral membrane protein, and that deletion of its C-terminal 50 amino acids (VP40/1-276) abrogates these features. Further truncation of the C-terminus (VP40/1-226 and VP40/1-176) enhances the integral membrane character of protein. Deletion of the N-terminal 49 amino acids of VP40 (VP40/50-326) does not alter the general structural features of the protein, while deletion of amino acids 1-99 (VP40/100-326) appears to increase the extent of anchoring to lipids.
Discussion
[0083]Thus, VP40 of Ebola virus, when expressed in the absence of other viral proteins, can induce the formation of membrane-encompassed particles, much in the manner of the matrix proteins of VSV, rabies, and simian immunodeficiency virus (Giddings et al., 1998; Harty et al., 1999; Justice et al., 1995; Li et al., 1993). Cellular proteins containing the WW domain are, in all likelihood, crucial for this process, as VP40 containing an altered version of a PPXY motif at amino acids 10-13 induces little or no particle formation. Harty et al. (1999) demonstrated that the matrix proteins of VSV and rabies viruses, which possess this motif at their N-termini, bind the cellular Yes-kinase-associated and Nedd4 proteins via a PPXY motif-WW domain, interaction, and that the loss of this motif results in impaired virus release from infected cells. Jayakar et al. (2000) recently demonstrated that mutation of the PPXY motif in the matrix protein of VSV impedes budding of fully assembled virions at the plasma membrane. The data described herein provides evidence for an important role of the PPXY motif in particle formation induced by VP40, and suggest that cellular proteins are crucial players in this process.
[0084]The efficiency of particle production markedly increased when the second ATG codon of VP40 (codon 14) was changed to GCG (alanine), but the reason for this enhancement remains unclear. This ATG codon immediately follows the PPXY motif. Perhaps the faster-migrating version of VP40, which lacks the PPXY motif, interferes with the assembly or budding of full-length VP40 molecules at the cell surface, or with the interaction between VP40 and a cellular protein. Whether translation from this second ATG occurs in actual viral infection or is an artifact of the system employed in this study is unknown.
[0085]Ruigrok et al. (2000) reported that VP40 expressed in E. coli can bind liposomes in vitro and that this interaction is largely electrostatic. In mammalian cells, a substantial amount of VP40 bound to the cellular membrane, and that this interaction was disrupted negligibly by the presence of 1 M NaCl, indicating that at least one hydrophobic domain is involved in this interaction. A small but appreciable portion of VP40 partitioned with detergent in the manner of an integral membrane or lipid-anchored protein in Triton X-114 phase-partitioning analysis. This result, together with the inability of 1 M NaCl to dissociate VP40 from the lipid bilayer, indicates that the protein has certain properties of an integral membrane protein, as do a number of matrix proteins of negative-stranded RNA viruses (Chong et al., 1993; Zhang et al., 1996), even though Ebola VP40 does not appear to contain a region of significant length and hydrophobicity to span the cell membrane (FIG. 5B). Short hydrophobic stretches of VP40 may be able to penetrate the lipid bilayer to some extent, lending modest integral-membrane character to the protein.
[0086]Ruigrok et al. (2000) also reported that a deletion mutant of VP40 containing amino acids 31-212 failed to bind liposomes efficiently, indicating that the C-terminus of VP40 is absolutely required for membrane binding. To elucidate the domains involved in the association of VP40 with cellular membranes, carboxy and amino-terminal deletion mutants were constructed. VP40 lacking its C-terminal 50 amino acids demonstrated appreciably reduced membrane association. The Kyte-Doolittle hydrophobicity plot (1982) of VP40 (FIG. 5B) indicates that amino acids 277-326 of the protein are primarily hydrophobic, so that deletion of amino acids 277-326 eliminates a substantial hydrophobic region that is likely important for efficient membrane-binding by the full-length protein. This hypothesis is supported by the fact that 1 M NaCl completely disrupted this association, suggesting that affinity of this deletion construct with the lipid bilayer depends primarily on electrostatic interactions.
[0087]When amino acids 227-326 of VP40 were deleted, the resulting truncated protein associated with the lipid bilayer as efficiently as wild-type VP40; moreover, C-terminal deletion of amino acids 177-326 resulted in a protein with much higher affinity for the lipid bilayer than was found for wild-type VP40. Salt treatment did not perturb membrane association of these truncated versions of VP40, indicating the presence of hydrophobic interactions mediated by the N-terminal 176 amino acids of the protein.
[0088]The hydrophobicity plot indicates that amino acids 227-276, and particularly amino acids 177-226, are primarily hydrophilic. Deletion of the hydrophilic residues present in this region of VP40 may allow the truncated protein to fold into a structure capable of strong hydrophobic association with the cell membrane, perhaps by effectively exposing the highly hydrophobic central domain of the protein. These results are consistent with data obtained by Triton X-114 extraction analysis (FIG. 11). Since VP40 lacking its C-terminal 50 amino acids was unable to produce particles (FIG. 7), and these C-terminal residues appear to be required for efficient membrane association of VP40, binding of this highly hydrophobic region to the lipid bilayer may be an essential step in the particle formation process.
[0089]The crystal structure of amino acids 31-326 of Ebola virus was recently elucidated by Dessen et al. (2000). It shows VP40 to be distinct from other viral matrix proteins, in that it consists of two similar domains connected by a flexible linker at amino acids 195-200. Ruigrok et al. (2000) showed that amino acids 31-212 of VP40 form hexamers spontaneously in solution. Dessen and associates postulate that, during the life cycle of Ebola virus, VP40 molecules associate with the lipid bilayer through interactions contributed primarily by their C-termini. After membrane binding, the molecules undergo a conformational change that frees their N-termini for hexamerization. These hexamers then form building blocks for a lattice that underlies the plasma membrane, and subsequently may interact with the cytoplasmic tails of viral glycoproteins and/or the ribonucleoprotein complex. This model is based on data demonstrating the hexamerization of VP40 molecules that lack their N-terminal 30 amino acids as well as their C-terminal 114 amino acids. The PPXY motif that appears crucial for membrane-bound particle formation is located at amino acids 10-13 of VP40, and this motif most likely interacts with a cellular protein that exhibits a WW domain during virus particle assembly or budding. It has not yet been demonstrated that VP40 with a truncated C-terminus can form hexamers when the entire N-terminus is present. If hexamerization does occur during virion morphogenesis, the 18 hexamers that form presumably must leave the PPXY motif accessible to cellular proteins that participate in particle formation and/or budding.
Example 3
Particles Comprising Filovirus Matrix Protein and Glycoprotein
Materials and Methods
[0090]Cells. 293T human embryonic kidney cells were maintained in Dulbecco's modified Eagle medium supplemented with 10% fetal calf serum, L-glutamine and penicillin-streptomycin-gentamicin solution. The cells were grown in an incubator at 37° C. in 5% CO2.
[0091]Plasmids. Full-length cDNAs encoding the Ebola virus (species Zaire) VP40 or GP were cloned separately into a mammalian expression vector, pCAGGS/MCS (Kobasa et al., 1997; Niwa et al., 1991), which contains the chicken β-actin promoter. The resultant constructs were designated pCEboZVP40 and pCEboZGP, respectively.
[0092]Cell Transfection for Expression of VP40 and GP. 293T cells (1×106) were transfected with plasmids using the Trans IT LT-1 reagent (Panvera, Madison, Wis.) according to the manufacturer's instructions. Briefly, 1 μg of DNA in 0.1 ml Opti-MEM (Gibco-BRL) and 3 μl of the transfection reagent were mixed, incubated for 10 minutes at room temperature, and added to the cells. Transfected cells were incubated at 37° C. for 24 or 48 hours.
[0093]Electron Microscopy. Ultrathin section electron microscopy was performed as follows. Twenty-four hours post-transfection of 293T cells with plasmids, the cells were washed with phosphate-buffered saline (PBS) and fixed for 20 minutes with 2.5% glutaraldehyde (GLA) in 0.1 M cacodylate buffer (pH 7.4). They were scraped off the dish, pelleted by low-speed centrifugation and then fixed for 30 minutes with the same fixative. Small pieces of fixed pellet were washed with the same buffer, postfixed with 2% osmium tetroxide in the same buffer for 1 hour at 4° C., dehydrated with a series of ethanol gradients followed by propylene oxide, embedded in Epon 812 Resin mixture (TAAB) and polymerized at 70° C. for 2 days. For immune electron microscopy, cells were fixed with 4% paraformaldehyde and 0.1% GLA, dehydrated and embedded in LR White Resin (London Resin Company Ltd.). Thin sections were stained with uranil acetate and lead citrate, and examined with a JEM-1200EX electron microscope at 80 Kv.
[0094]For negative staining, culture media of 293T cells were collected at 24 hours post-transfection onto a Formvar-coated copper grid, stained with 2% phosphotungstic acid solution (PTA) and examined with a JEM-1200 electron microscope at 80 Kv.
[0095]For immune electron microscopy, the samples were absorbed to Formvar-coated nickel grids and washed with PBS containing 0.5% bovine serum albumin (PBS-BSA). The grids were then treated with mouse anti-GP monoclonal antibody (a mixture of ZGP12, ZGP42, and ZGP133 (31); 1:150 in PBS-BSA) or rabbit anti-VP40 polyclonal antibody (1:300 in PBS-BSA), and rinsed six times with PBS, followed by incubation with a goat antimouse immunoglobulin conjugated to 15-nm gold particles (1:50 dilution; BBlnternational) or a goat antirabbit immunoglobulin conjugated to 5-nm gold particles (1:100 dilution; BBlnternational). After washing, the samples were fixed for 10 min in 2% glutaraldehyde and negatively stained with 2% PTA.
Results
[0096]Pleomorphic Particle Formation by GP. To determine the morphology of vesicles induced by Ebola virus GP expression, GP-expressing cells and their supernatants were analyzed by electron microscopy. The ultrathin sections of these cells showed particle-like structures with surface spikes budding from the plasma membrane (FIG. 12A); no such structures were observed using cells transfected with the expression vector alone (FIG. 12B). As previously observed in the recombinant vaccinia virus system (Volchkov et al., 1998), pleomorphic structures similar to virosomes with a range of diameters were apparent in the supernatants of GP-expressing cells (FIGS. 13 A and B). The spikes on the surface of the vesicles reacted with anti-GP monoclonal antibodies (FIGS. 13 C and D), confirming the GP derivation of the structures.
[0097]VP40 Induces Filamentous Particle Formation. To determine how VP40 protein expressed in 293T cells is released into culture medium (Harty et al., 2000; Timmins et al., 2001; Example 2), the VP40-expressing cells were analyzed by transmission electron microscopy. The ultrathin sections of the cells expressing VP40 showed budding of filamentous structures (approximately 65 nm in diameter) on the cell surface (FIG. 14A). In some cells, the plasma membranes appeared ruffled and to consist of two bilayers (FIG. 14C). Aggregated ribosomes (FIG. 14E, arrows) were occasionally found in the cytoplasm of cells expressing VP40, as were electron-dense filamentous structures (approximately 45 nm in diameter; FIG. 14F, arrowheads), which were never seen in cells transfected with the expression vector alone. The budding particles and membrane ruffles reacted with rabbit anti-VP40 polyclonal antibody (FIGS. 14B and D), confirming that VP40 had contributed to the generation of these structures. In studies to further determine the size and morphology of the VP40 particles released from cells, the supernatants of cells expressing this protein were centrifuged through 20% sucrose, and the pelleted material was negatively stained with 2% PTA and analyzed by electron microscopy. Filamentous particles, which had uniform diameters of approximately 65 nm but varied lengths, were observed (FIGS. 15A-C). These results indicate that VP40 alone can induce the formation of filamentous particles, which bud from the cell surface.
[0098]VP40-GP Interaction in Particle Morphogenesis. To determine how GP expression affects VP40-driven particle formation, 293T cells were transfected with both VP40- and GP-expressing plasmids. In ultrathin sections of the transfected cells, filamentous particle-like structures of 80-nm external diameter were observed that were budding from the plasma membrane (FIGS. 16A and B). The structures possessed spikes of approximately 10 nm on their surface, in contrast to the structures observed in cells expressing VP40 alone (FIG. 14A). Also, unlike the findings with expression of GP alone, few pleomorphic particles were observed. The particle structures were studied in more detail after negative staining of the particles in culture supernatants of cells expressing both VP40 and GP. Filamentous Ebola virus-like particles with surface spikes of approximately 85-nm in external diameter and lengths that ranged to 10 μm were observed (FIGS. 17A-C). The spikes projected from the particle surface at 5- to 10-nm intervals and were morphologically indistinguishable from those on the Ebola virion surface (Feldmann et al., 1996; Peters et al., 1995). Labeling of the spikes with a mixture of anti-GP monoclonal antibodies conjugated with gold particles confirmed their identity as GP (FIG. 17D). Furthermore, when treated with 0.03% Triton X-100 and with both the anti-VP40 antibody conjugated to 5-nm gold particles and a mixture of anti-GP monoclonal antibodies conjugated to 15-nm gold particles, the filamentous particles became labeled with both antibodies, demonstrating that the Ebola vires-like particles contained GP as well as VP40 proteins (FIG. 17E). These results demonstrate GP incorporation into VP40-generated filamentous structures, without affecting filamentous particle formation.
Discussion
[0099]A hallmark of Ebola virus is its filamentous virions as featured in its family name Filoviridae. The shape of enveloped viruses are determined by viral proteins in retroviruses (Campbell et al., 1997; Gay et al., 1998; Joshi et al., 2000) or by both viral RNA length and proteins in VSV (Pattnaik et al., 1991). Because specific interactions among viral components are required for the formation of defined virion shapes, understanding of such interactions can lead to the identification of targets for the development of antiviral compounds.
[0100]As shown herein by electron microscopy, the expression of VP40 in the absence of any other Ebola vires proteins leads to the formation of filamentous particles, which resemble spikeless virions released into the supernatant of cultured Ebola virus-infected cells (Geisbert et al., 1995). Thus, these results suggest that the Ebola virus VP40 possesses structural information necessary and sufficient to induce the formation of filamentous particles, which then bud from the plasma membrane. Interestingly, some filamentous structures were observed in the cytoplasm of cells expressing VP40 as have been found in the cytoplasm of the cells infected with Ebola virus. Similar structures have also been observed in cells expressing the M1 protein of influenza virus or the GAG protein of retrovirus (Delchambre et al., 1989; Gheyson et al., 1989; Gomez-Puertas et al., 2000). However, the tubular structures observed upon expression of influenza virus M1 alone were not seen during normal viral infection or when M1 was coexpressed with other influenza viral proteins. Thus, VP40 may form intracellular filamentous structures by self-aggregation.
[0101]Membrane ruffles containing VP40 protein were observed in some VP40-expressing cells (FIGS. 14C and D). The M protein of VSV induces similar double-layered membranes at the cell surface when expressed from recombinant Sendai virus (Sakaguchi et al, 1999). IpaC protein secreted by Shigella flexneri has also been linked to large-scale membrane extension in macrophages, including lamellipodia and membrane ruffles (Kuwae et al, 2001; Tran Van Nhieu et al., 1999), while Salmonella typhimurium triggers the formation of host cell membrane ruffles in nonphagocytic cells (Ginocchio et al., 1994; Zhou et al., 1999). These membrane ruffles are thought to result from interactions between the bacterial proteins, including IpaC, and the actin cytoskeletons of host cells (Tran Van Nhieu et al., 1999; Zhou et al., 1999). In Ebola virus-infected cells, host cell plasma membranes proliferate extensively at the peak stage of viral budding (Geisbert et al, 1995), as observed in cells expressing VP40 alone. Thus, VP40 may interact with actin filaments during the assembly or budding of Ebola virus at the cell surface.
[0102]The impact of glycoprotein interaction with the matrix protein on virion morphology differs among viruses. For example, deletion of the cytoplasmic tails of the influenza virus hemagglutinin and neuraminidase alters virus morphology (Jin et al., 1997; Mitnaul et al., 1996), while the characteristic morphology of rabies virus and VSV do not depend on glycoprotein-matrix protein interaction (Mebatsion et al, 1996; Mebatsion et al., 1994; Schnell et al., 1998; Takada et al., 1997). The Ebola virus GP, like VSV-G, was incorporated into filamentous particles without affecting the morphology of the particles. However, such interaction may contribute to the efficiency of budding, as demonstrated by research on VSV (Jayakar et al., 2000; Mebatsion et al., 1999).
[0103]In conclusion, VP40 induces VP40 containing-filamentous particle formation and GP spikes are incorporated into VP40 induced-filamentous particles upon coexpression of GP and VP40, resulting in Ebola virus-like particles. This virus-like particle formation system will be useful to further elucidate the mechanism of Ebola virus particle formation, including the functional link among Ebola viral and cellular components.
REFERENCES
[0104]Baize et al., Nat. Med., 5, 423 (1999). [0105]Basler et al., Proc. Natl. Acad. Sci. U.S.A., 97, 12289 (2000). [0106]Bergmann et al., J. Cell Biol., 107, 1707 (1988). [0107]Bordier, J. Biol. Chem., 256, 1604 (1981). [0108]Bork et al., Trends Biochem. Sci., 19, 531 (1994). [0109]Campbell et al., J. Virol., 71, 4425 (1997). [0110]Chen et al., Proc. Natl. Acad. Sci. U.S.A., 92, 7819 (1995). [0111]Chong et al., J. Virol., 67, 407 (1993). [0112]Coronel et al., J. Virol., 73, 7035 (1999). [0113]Delchambre et al., EMBO. J., 8, 2653 (1989). [0114]Dessen et al., EMBO J., 19, 4228 (2000). [0115]Durbin et al., Virology, 235, 323 (1997). [0116]Elliott et al., Virology, 147, 169 (1985). [0117]Feldmann et al., Arch. Virol. Suppl., 15, 159 (1999). [0118]Feldmann et al., Arch. Virol. Suppl., 7, 81 (1993). [0119]Feldmann et al., 1996. Marburg and Ebola viruses. P. 1-52. In Maramorosch, K., Murphy, F. A. and Shatkin, A. J. (ed.), Advances in virus research 47, Academic Press. [0120]Feldmann et al., Filoviruses, p. 651-664, 9th ed. Edward Arnold, London, United Kingdom (1998). [0121]Garoff et al., Microbiol. Mol. Biol. Rev., 62, 1171 (1998). [0122]Gay et al., Virology, 247, 160 (1998). [0123]Geisbert et al., Virus Res., 39, 129 (1995). [0124]Gheysen et al., Cell, 59, 103 (1989). [0125]Giddings et al., Virology, 248, 108 (1998). [0126]Ginocchio et al., Cell, 76, 717 (1994). [0127]Girault et al., Anal. Biochem., 182, 193 (1989). [0128]Gomez-Puertas et al., J. Virol., 74, 11538 (2000). [0129]Haffer et al., J. Virol., 64, 2653 (1990). [0130]Harty et al., J. Virol., 73, 2921 (1999). [0131]Harty et al., Proc. Natl. Acad. Sci. U.S.A., 97, 13871 (2000). [0132]Hevey et al., Virology, 251, 28 (1998). [0133]Ito et al., J. Virol., 75, 1576 (2001). [0134]Jasenosky et al., J. Virol., 75, 5205 (2001). [0135]Jayakar et al., J. Virol., 74, 9818 (2000). [0136]Jin et al., EMBO. J., 16, 1236 (1997). [0137]Joshi et al., J. Virol., 74, 10260 (2000). [0138]Justice et al., J. Virol., 69, 3156 (1995). [0139]Kato et al., Genes Cells, 1, 569 (1996). [0140]Klenk et al., Trends Microbiol., 2, 39 (1994). [0141]Kobasa et al., J. Virol., 71, 6706 (1997). [0142]Kuwae et al., J. Biol. Chem., 276, 32230 (2001). [0143]Kyte et al., J. Mol. Biol., 157, 105 (1982). [0144]Li et al., J. Virol., 67, 4415 (1993). [0145]Marriott et al., Adv. Virus Res., 53, 321 (1999). [0146]Maruyama et al., J. Virol., 73, 6024 (1999). [0147]Mebatsion et al., Cell, 84, 941 (1996). [0148]Mebatsion et al., J. Virol., 73, 242 (1999). [0149]Mitnaul et al., J. Virol., 70, 873 (1996). [0150]Moss et al., Nature, 348, 91 (1990). [0151]Moss, Poxyiridae: The viruses and their replication. P. 2637-2672. In B. N. Fields, D. M. Knipe, and P. M. Howley (ed.), Fields virology, Lippincott-Raven Publishers, Philadelphia, Pa. 1996. [0152]Muhlberger et al., J. Virol., 73, 2333 (1999). [0153]Nagai et al., Microbiol. Immunol., 43, 613 (1999). [0154]Neumann et al., J. Virol., 74, 547 (2000). [0155]Neumann et al., Proc. Natl. Acad. Sci. USA, 96, 9345 (1999). [0156]Niwa et al., Gene, 108, 193 (1991). [0157]Pattnaik et al., Proc. Natl. Acad. Sci. USA, 88, 1379 (1991). [0158]Peters et al., 1995. Filoviridae: Marburg and Ebola viruses. P. 1161-1176. In B. N. Fields, D. M. Knipe, and P. M. Howley (ed.), Fields virology, Lippincott-Raven Publishers, Philadelphia, Pa. [0159]Roberts et al., Adv. Virus Res., 53, 301 (1999). [0160]Ruigrok et al., J. Mol. Biol., 300, 103 (2000). [0161]Sakaguchi et al., Virology, 263, 230 (1999). [0162]Sanchez et al., Proc. Natl. Acad. Sci. U.S.A., 93, 3602 (1996). [0163]Sanchez et al., Virus Res., 29, 215 (1993). [0164]Sandefur et al., J. Virol., 72, 2723 (1998). [0165]Schnell et al., EMBO J., 13, 4195 (1994). [0166]Schnell et al., EMBO. J., 17, 1289 (1998). [0167]Sudol et al., FEBS Lett., 369, 67 (1995). [0168]Sullivan et al., Nature, 408, 605 (2000). [0169]Takada et al., J. Virol., 75, 2324 (2001). [0170]Takada et al., Proc. Natl. Acad. Sci. U.S.A., 94, 14764 (1997). [0171]Takimoto et al., J. Virol., 75, 11384 (2001). [0172]Timmins et al., Virology, 283, 1 (2001). [0173]Tran Van Nhieu et al., EMBO. J., 18, 3249 (1999). [0174]Vanderzanden et al., Virology, 246, 134 (1998). [0175]Volchkov et al., J. Gen. Virol., 80, 355 (1999). [0176]Volchkov et al., Proc. Natl. Acad. Sci. USA, 95, 5762 (1998). [0177]Volchkov et al., Science, 291, 1965 (2001). [0178]Volchkov et al., Virology, 245, 110 (1998). [0179]Wills et al., J. Virol., 63, 4331 (1989). [0180]Wilson et al., Science, 287, 1664 (2000). [0181]Wilson et al., Science, 287, 1664 (2000). [0182]Wool-Lewis et al., J. Virol., 73, 1419 (1999). [0183]Wool-Lewis, et al., J. Virol. 72, 3155 (1998). [0184]Xu et al., Nat. Med., 5, 373 (1998). [0185]Yang et al., Nat. Med., 6, 886 (2000). [0186]Yang et al., Science, 279, 1034 (1998). [0187]Zhang et al., Virology, 225, 255 (1996). [0188]Zhou et al., Proc. Natl. Acad. Sci. USA, 96, 10176 (1999).
[0189]All publications, patents and patent applications are incorporated herein by reference. While in the foregoing specification this invention has been described in relation to certain preferred embodiments thereof, and many details have been set forth for purposes of illustration, it will be apparent to those skilled in the art that the invention is susceptible to additional embodiments and that certain of the details described herein may be varied considerably without departing from the basic principles of the invention.
Sequence CWU
1
581739PRTReston Ebola virus 1Met Asp Arg Gly Thr Arg Arg Ile Trp Val Ser
Gln Asn Gln Gly Asp1 5 10
15Thr Asp Leu Asp Tyr His Lys Ile Leu Thr Ala Gly Leu Thr Val Gln
20 25 30Gln Gly Ile Val Arg Gln Lys
Ile Ile Ser Val Tyr Leu Val Asp Asn 35 40
45Leu Glu Ala Met Cys Gln Leu Val Ile Gln Ala Phe Glu Ala Gly
Ile 50 55 60Asp Phe Gln Glu Asn Ala
Asp Ser Phe Leu Leu Met Leu Cys Leu His65 70
75 80His Ala Tyr Gln Gly Asp Tyr Lys Leu Phe Leu
Glu Ser Asn Ala Val 85 90
95Gln Tyr Leu Glu Gly His Gly Phe Lys Phe Glu Leu Arg Lys Lys Asp
100 105 110Gly Val Asn Arg Leu Glu
Glu Leu Leu Pro Ala Ala Thr Ser Gly Lys 115 120
125Asn Ile Arg Arg Thr Leu Ala Ala Leu Pro Glu Glu Glu Thr
Thr Glu 130 135 140Ala Asn Ala Gly Gln
Phe Leu Ser Phe Ala Ser Leu Phe Leu Pro Lys145 150
155 160Leu Val Val Gly Glu Lys Ala Cys Leu Glu
Lys Val Gln Arg Gln Ile 165 170
175Gln Val His Ala Glu Gln Gly Leu Ile Gln Tyr Pro Thr Ala Trp Gln
180 185 190Ser Val Gly His Met
Met Val Ile Phe Arg Leu Met Arg Thr Asn Phe 195
200 205Leu Ile Lys Tyr Leu Leu Ile His Gln Gly Met His
Met Val Ala Gly 210 215 220His Asp Ala
Asn Asp Ala Val Ile Ala Asn Ser Val Ala Gln Ala Arg225
230 235 240Phe Ser Gly Leu Leu Ile Val
Lys Thr Val Leu Asp His Ile Leu Gln 245
250 255Lys Thr Asp Gln Gly Val Arg Leu His Pro Leu Ala
Arg Thr Ala Lys 260 265 270Val
Arg Asn Glu Val Asn Ala Phe Lys Ala Ala Leu Ser Ser Leu Ala 275
280 285Lys His Gly Glu Tyr Ala Pro Phe Ala
Arg Leu Leu Asn Leu Ser Gly 290 295
300Val Asn Asn Leu Glu His Gly Leu Tyr Pro Gln Leu Ser Ala Ile Ala305
310 315 320Leu Gly Val Ala
Thr Ala His Gly Ser Thr Leu Ala Gly Val Asn Val 325
330 335Gly Glu Gln Tyr Gln Gln Leu Arg Glu Ala
Ala Thr Glu Ala Glu Lys 340 345
350Gln Leu Gln Gln Tyr Ala Glu Ser Arg Glu Leu Asp Ser Leu Gly Leu
355 360 365Asp Asp Gln Glu Arg Arg Ile
Leu Met Asn Phe His Gln Lys Lys Asn 370 375
380Glu Ile Ser Phe Gln Gln Thr Asn Ala Met Val Thr Leu Arg Lys
Glu385 390 395 400Arg Leu
Ala Lys Leu Thr Glu Ala Ile Thr Leu Ala Ser Arg Pro Asn
405 410 415Leu Gly Ser Arg Gln Asp Asp
Asp Asn Glu Ile Pro Phe Pro Gly Pro 420 425
430Ile Ser Asn Asn Pro Asp Gln Asp His Leu Glu Asp Asp Pro
Arg Asp 435 440 445Ser Arg Asp Thr
Ile Ile Pro Asn Ser Ala Ile Asp Pro Glu Asp Gly 450
455 460Asp Phe Glu Asn Tyr Asn Gly Tyr His Asp Asp Glu
Val Gly Thr Ala465 470 475
480Gly Asp Leu Val Leu Phe Asp Leu Asp Asp His Glu Asp Asp Asn Lys
485 490 495Ala Phe Glu Leu Gln
Asp Ser Ser Pro Gln Ser Gln Arg Glu Ile Glu 500
505 510Arg Glu Arg Leu Ile His Pro Pro Pro Gly Asn Asn
Lys Asp Asp Asn 515 520 525Arg Ala
Ser Asp Asn Asn Gln Gln Ser Ala Asp Ser Glu Glu Gln Glu 530
535 540Gly Gln Tyr Asn Arg His Arg Gly Pro Glu Arg
Thr Thr Ala Asn Arg545 550 555
560Arg Leu Ser Pro Val His Glu Glu Asp Thr Pro Ile Asp Gln Gly Asp
565 570 575Asp Asp Pro Ser
Ser Pro Pro Pro Leu Glu Ser Asp Asp Asp Asp Ala 580
585 590Ser Ser Ser Gln Gln Asp Pro Asp Tyr Thr Ala
Val Ala Pro Pro Ala 595 600 605Pro
Val Tyr Arg Ser Ala Glu Ala His Glu Pro Pro His Lys Ser Ser 610
615 620Asn Glu Pro Ala Glu Thr Ser Gln Leu Asn
Glu Asp Pro Asp Ile Gly625 630 635
640Gln Ser Lys Ser Met Gln Lys Leu Gly Glu Thr Tyr His His Leu
Leu 645 650 655Arg Thr Gln
Gly Pro Phe Glu Ala Ile Asn Tyr Tyr His Met Met Lys 660
665 670Asp Glu Pro Val Ile Phe Ser Thr Asp Asp
Gly Lys Glu Tyr Thr Tyr 675 680
685Pro Asp Ser Leu Glu Glu Ala Tyr Pro Pro Trp Leu Thr Glu Lys Glu 690
695 700Arg Leu Asp Asn Glu Asn Arg Tyr
Ile Tyr Ile Asn Asn Gln Gln Phe705 710
715 720Phe Trp Pro Val Met Ser Pro Arg Asp Lys Phe Leu
Ala Ile Leu Gln 725 730
735His His Gln2329PRTReston Ebola virus 2Met Tyr Asn Asp Lys Leu Lys Ile
Cys Ser Gly Pro Glu Thr Thr Gly1 5 10
15Trp Ile Ser Glu Gln Leu Met Thr Gly Lys Ile Pro Val Thr
Asp Ile 20 25 30Phe Ile Asp
Ile Asp Asn Lys Pro Asp Gln Met Glu Val Arg Leu Lys 35
40 45Pro Ser Ser Arg Ser Ser Thr Arg Thr Cys Thr
Ser Ser Ser Gln Thr 50 55 60Glu Val
Asn Tyr Val Pro Leu Leu Lys Lys Val Glu Asp Thr Leu Thr65
70 75 80Met Leu Val Ser Ala Thr Ser
Arg Gln Asn Ala Ala Ile Glu Ala Leu 85 90
95Glu Asn Arg Leu Ser Thr Leu Glu Ser Ser Leu Lys Pro
Ile Gln Asp 100 105 110Met Gly
Lys Val Ile Ser Ser Leu Asn Arg Ser Cys Ala Glu Met Val 115
120 125Ala Lys Tyr Asp Leu Leu Val Met Thr Thr
Gly Arg Ala Thr Ser Thr 130 135 140Ala
Ala Ala Val Asp Ala Tyr Trp Lys Glu His Lys Gln Pro Pro Pro145
150 155 160Gly Pro Ala Leu Tyr Glu
Glu Asn Ala Leu Lys Gly Lys Ile Asp Asp 165
170 175Pro Asn Ser Tyr Val Pro Asp Ala Val Gln Glu Ala
Tyr Lys Asn Leu 180 185 190Asp
Ser Thr Ser Thr Leu Thr Glu Glu Asn Phe Gly Lys Pro Tyr Ile 195
200 205Ser Ala Lys Asp Leu Lys Glu Ile Met
Tyr Asp His Leu Pro Gly Phe 210 215
220Gly Thr Ala Phe His Gln Leu Val Gln Val Ile Cys Lys Ile Gly Lys225
230 235 240Asp Asn Asn Leu
Leu Asp Thr Ile His Ala Glu Phe Gln Ala Ser Leu 245
250 255Ala Asp Gly Asp Ser Pro Gln Cys Ala Leu
Ile Gln Ile Thr Lys Arg 260 265
270Val Pro Ile Phe Gln Asp Val Pro Pro Pro Thr Ile His Ile Arg Ser
275 280 285Arg Gly Asp Ile Pro Arg Ala
Cys Gln Lys Ser Leu Arg Pro Ala Pro 290 295
300Pro Ser Pro Lys Ile Asp Arg Gly Trp Val Cys Leu Phe Lys Met
Gln305 310 315 320Asp Gly
Lys Thr Leu Gly Leu Lys Ile 3253331PRTReston Ebola virus
3Met Arg Arg Gly Val Leu Pro Thr Ala Pro Pro Ala Tyr Asn Asp Ile1
5 10 15Ala Tyr Ser Met Ser Ile
Leu Pro Thr Arg Pro Ser Val Ile Val Asn 20 25
30Glu Thr Lys Ser Asp Val Leu Ala Val Pro Gly Ala Asp
Val Pro Ser 35 40 45Asn Ser Met
Arg Pro Val Ala Asp Asp Asn Ile Asp His Ser Ser His 50
55 60Thr Pro Ser Gly Val Ala Ser Ala Phe Ile Leu Glu
Ala Lys Val Asn65 70 75
80Val Ile Ser Gly Thr Lys Val Leu Met Lys Gln Ile Pro Ile Trp Leu
85 90 95Pro Leu Gly Val Ala Asp
Gln Lys Ile Tyr Ser Phe Asp Ser Thr Thr 100
105 110Ala Ala Ile Met Leu Ala Ser Tyr Thr Val Thr His
Phe Gly Lys Ile 115 120 125Ser Asn
Pro Leu Val Arg Val Asn Arg Leu Gly Pro Gly Ile Pro Asp 130
135 140His Pro Leu Arg Leu Leu Arg Leu Gly Asn Gln
Ala Phe Leu Gln Glu145 150 155
160Phe Val Leu Pro Pro Val Gln Leu Pro Gln Tyr Phe Thr Phe Asp Leu
165 170 175Thr Ala Leu Lys
Leu Ile Thr Gln Pro Leu Pro Ala Ala Thr Trp Thr 180
185 190Asp Glu Thr Pro Ala Gly Ala Val Asn Ala Leu
Arg Pro Gly Leu Ser 195 200 205Leu
His Pro Lys Leu Arg Pro Ile Leu Leu Pro Gly Lys Ile Gly Lys 210
215 220Lys Gly His Ala Ser Asp Leu Thr Ser Pro
Asp Lys Ile Gln Thr Ile225 230 235
240Met Asn Ala Ile Pro Asp Leu Lys Ile Val Pro Ile Asp Pro Ile
Lys 245 250 255Asn Ile Val
Gly Ile Glu Val Pro Glu Leu Leu Val Gln Arg Leu Thr 260
265 270Gly Lys Lys Pro Gln Pro Lys Asn Gly Gln
Pro Ile Ile Pro Val Leu 275 280
285Leu Pro Lys Tyr Val Gly Leu Asp Pro Ile Ser Pro Gly Asp Leu Thr 290
295 300Met Val Ile Thr Gln Asp Cys Asp
Ser Cys His Ser Pro Ala Ser His305 310
315 320Pro Tyr His Met Asp Lys Gln Asp Ser Tyr Gln
325 3304677PRTReston Ebola virus 4Met Gly Ser Gly
Tyr Gln Leu Leu Gln Leu Pro Arg Glu Arg Phe Arg1 5
10 15Lys Thr Ser Phe Leu Val Trp Val Ile Ile
Leu Phe Gln Arg Ala Ile 20 25
30Ser Met Pro Leu Gly Ile Val Thr Asn Ser Thr Leu Lys Ala Thr Glu
35 40 45Ile Asp Gln Leu Val Cys Arg Asp
Lys Leu Ser Ser Thr Ser Gln Leu 50 55
60Lys Ser Val Gly Leu Asn Leu Glu Gly Asn Gly Ile Ala Thr Asp Val65
70 75 80Pro Ser Ala Thr Lys
Arg Trp Gly Phe Arg Ser Gly Val Pro Pro Lys 85
90 95Val Val Ser Tyr Glu Ala Gly Glu Trp Ala Glu
Asn Cys Tyr Asn Leu 100 105
110Glu Ile Lys Lys Ser Asp Gly Ser Glu Cys Leu Pro Leu Pro Pro Asp
115 120 125Gly Val Arg Gly Phe Pro Arg
Cys Arg Tyr Val His Lys Val Gln Gly 130 135
140Thr Gly Pro Cys Pro Gly Asp Leu Ala Phe His Lys Asn Gly Ala
Phe145 150 155 160Phe Leu
Tyr Asp Arg Leu Ala Ser Thr Val Ile Tyr Arg Gly Thr Thr
165 170 175Phe Ala Glu Gly Val Ile Ala
Phe Leu Ile Leu Ser Glu Pro Lys Lys 180 185
190His Phe Trp Lys Ala Thr Pro Ala His Glu Pro Val Asn Thr
Thr Asp 195 200 205Asp Ser Thr Ser
Tyr Tyr Met Thr Leu Thr Leu Ser Tyr Glu Met Ser 210
215 220Asn Phe Gly Gly Glu Glu Ser Asn Thr Leu Phe Lys
Val Asp Asn His225 230 235
240Thr Tyr Val Gln Leu Asp Arg Pro His Thr Pro Gln Phe Leu Val Gln
245 250 255Leu Asn Glu Thr Leu
Arg Arg Asn Asn Arg Leu Ser Asn Ser Thr Gly 260
265 270Arg Leu Thr Trp Thr Val Asp Pro Lys Ile Glu Pro
Asp Val Gly Glu 275 280 285Trp Ala
Phe Trp Glu Thr Lys Lys Asn Phe Ser Gln Gln Leu His Gly 290
295 300Glu Asn Leu His Phe Gln Ile Leu Ser Thr His
Thr Asn Asn Ser Ser305 310 315
320Asp Gln Ser Pro Ala Gly Thr Val Gln Gly Lys Ile Ser Tyr His Pro
325 330 335Pro Thr Asn Asn
Ser Glu Leu Val Pro Thr Asp Ser Pro Pro Val Val 340
345 350Ser Val Leu Thr Ala Gly Arg Thr Glu Glu Met
Ser Thr Gln Gly Leu 355 360 365Thr
Asn Gly Glu Thr Ile Thr Gly Phe Thr Ala Asn Pro Met Thr Thr 370
375 380Thr Ile Ala Pro Ser Pro Thr Met Thr Ser
Glu Val Asp Asn Asn Val385 390 395
400Pro Ser Glu Gln Pro Asn Asn Thr Ala Ser Ile Glu Asp Ser Pro
Pro 405 410 415Ser Ala Ser
Asn Glu Thr Ile Asp His Ser Glu Met Asn Ser Ile Gln 420
425 430Gly Ser Asn Asn Ser Ala Gln Ser Pro Gln
Thr Lys Ala Thr Pro Ala 435 440
445Pro Thr Ala Ser Pro Met Thr Leu Asp Pro Gln Glu Thr Ala Asn Ile 450
455 460Ser Lys Pro Gly Thr Ser Pro Gly
Ser Ala Ala Gly Pro Ser Gln Pro465 470
475 480Gly Leu Thr Ile Asn Thr Ile Ser Lys Val Ala Asp
Ser Leu Ser Pro 485 490
495Thr Arg Lys Gln Lys Arg Ser Val Arg Gln Asn Thr Ala Asn Lys Cys
500 505 510Asn Pro Asp Leu His Tyr
Trp Thr Ala Val Asp Glu Gly Ala Ala Ala 515 520
525Gly Leu Ala Trp Ile Pro Tyr Phe Gly Pro Ala Ala Glu Gly
Ile Tyr 530 535 540Ile Glu Gly Val Met
His Asn Gln Asn Gly Leu Ile Cys Gly Leu Arg545 550
555 560Gln Leu Ala Asn Glu Thr Thr Gln Ala Leu
Gln Leu Phe Leu Arg Ala 565 570
575Thr Thr Glu Leu Arg Thr Tyr Ser Leu Leu Asn Arg Lys Ala Ile Asp
580 585 590Phe Leu Leu Gln Arg
Trp Gly Gly Thr Cys Arg Ile Leu Gly Pro Ser 595
600 605Cys Cys Ile Glu Pro His Asp Trp Thr Lys Asn Ile
Thr Asp Glu Ile 610 615 620Asn Gln Ile
Lys His Asp Phe Ile Asp Asn Pro Leu Pro Asp His Gly625
630 635 640Asp Asp Leu Asn Leu Trp Thr
Gly Trp Arg Gln Trp Ile Pro Ala Gly 645
650 655Ile Gly Ile Ile Gly Val Ile Ile Ala Ile Ile Ala
Leu Leu Cys Ile 660 665 670Cys
Lys Ile Leu Cys 6755367PRTReston Ebola virus 5Met Gly Ser Gly Tyr
Gln Leu Leu Gln Leu Pro Arg Glu Arg Phe Arg1 5
10 15Lys Thr Ser Phe Leu Val Trp Val Ile Ile Leu
Phe Gln Arg Ala Ile 20 25
30Ser Met Pro Leu Gly Ile Val Thr Asn Ser Thr Leu Lys Ala Thr Glu
35 40 45Ile Asp Gln Leu Val Cys Arg Asp
Lys Leu Ser Ser Thr Ser Gln Leu 50 55
60Lys Ser Val Gly Leu Asn Leu Glu Gly Asn Gly Ile Ala Thr Asp Val65
70 75 80Pro Ser Ala Thr Lys
Arg Trp Gly Phe Arg Ser Gly Val Pro Pro Lys 85
90 95Val Val Ser Tyr Glu Ala Gly Glu Trp Ala Glu
Asn Cys Tyr Asn Leu 100 105
110Glu Ile Lys Lys Ser Asp Gly Ser Glu Cys Leu Pro Leu Pro Pro Asp
115 120 125Gly Val Arg Gly Phe Pro Arg
Cys Arg Tyr Val His Lys Val Gln Gly 130 135
140Thr Gly Pro Cys Pro Gly Asp Leu Ala Phe His Lys Asn Gly Ala
Phe145 150 155 160Phe Leu
Tyr Asp Arg Leu Ala Ser Thr Val Ile Tyr Arg Gly Thr Thr
165 170 175Phe Ala Glu Gly Val Ile Ala
Phe Leu Ile Leu Ser Glu Pro Lys Lys 180 185
190His Phe Trp Lys Ala Thr Pro Ala His Glu Pro Val Asn Thr
Thr Asp 195 200 205Asp Ser Thr Ser
Tyr Tyr Met Thr Leu Thr Leu Ser Tyr Glu Met Ser 210
215 220Asn Phe Gly Gly Glu Glu Ser Asn Thr Leu Phe Lys
Val Asp Asn His225 230 235
240Thr Tyr Val Gln Leu Asp Arg Pro His Thr Pro Gln Phe Leu Val Gln
245 250 255Leu Asn Glu Thr Leu
Arg Arg Asn Asn Arg Leu Ser Asn Ser Thr Gly 260
265 270Arg Leu Thr Trp Thr Val Asp Pro Lys Ile Glu Pro
Asp Val Gly Glu 275 280 285Trp Ala
Phe Trp Glu Thr Lys Lys Thr Phe Pro Asn Asn Phe Met Glu 290
295 300Lys Thr Cys Ile Ser Lys Phe Tyr Gln Pro Thr
Pro Thr Thr Pro Gln305 310 315
320Ile Arg Ala Arg Arg Glu Leu Ser Lys Glu Lys Leu Ala Thr Thr His
325 330 335Pro Pro Thr Thr
Pro Ser Trp Phe Gln Arg Ile Pro Leu Gln Trp Phe 340
345 350Gln Cys Ser Leu Gln Asp Gly Gln Arg Lys Cys
Arg Pro Lys Val 355 360
3656287PRTReston Ebola virus 6Met Glu His Ser Arg Glu Arg Gly Arg Ser Ser
Asn Met Arg His Asn1 5 10
15Ser Arg Glu Pro Tyr Glu Asn Pro Ser Arg Ser Arg Ser Leu Ser Arg
20 25 30Asp Pro Asn Gln Val Asp Arg
Arg Gln Pro Arg Ser Ala Ser Gln Ile 35 40
45Arg Val Pro Asn Leu Phe His Arg Lys Lys Thr Asp Ala Leu Ile
Val 50 55 60Pro Pro Ala Pro Lys Asp
Ile Cys Pro Thr Leu Lys Lys Gly Phe Leu65 70
75 80Cys Asp Ser Lys Phe Cys Lys Lys Asp His Gln
Leu Asp Ser Leu Asn 85 90
95Asp His Glu Leu Leu Leu Leu Ile Ala Arg Arg Thr Cys Gly Ile Ile
100 105 110Glu Ser Asn Ser Gln Ile
Thr Ser Pro Lys Asp Met Arg Leu Ala Asn 115 120
125Pro Thr Ala Glu Asp Phe Ser Gln Gly Asn Ser Pro Lys Leu
Thr Leu 130 135 140Ala Val Leu Leu Gln
Ile Ala Glu His Trp Ala Thr Arg Asp Leu Arg145 150
155 160Gln Ile Glu Asp Ser Lys Leu Arg Ala Leu
Leu Thr Leu Cys Ala Val 165 170
175Leu Thr Arg Lys Phe Ser Lys Ser Gln Leu Gly Leu Leu Cys Glu Thr
180 185 190His Leu Arg His Glu
Gly Leu Gly Gln Asp Gln Ala Asp Ser Val Leu 195
200 205Glu Val Tyr Gln Arg Leu His Ser Asp Lys Gly Gly
Asn Phe Glu Ala 210 215 220Ala Leu Trp
Gln Gln Trp Asp Arg Gln Ser Leu Ile Met Phe Ile Ser225
230 235 240Ala Phe Leu Asn Ile Ala Leu
Gln Thr Pro Cys Glu Ser Ser Ser Val 245
250 255Val Val Ser Gly Leu Ala Thr Leu Tyr Pro Ala Gln
Asp Asn Ser Thr 260 265 270Pro
Ser Glu Ala Thr Asn Asp Thr Thr Trp Ser Ser Thr Val Glu 275
280 2857251PRTReston Ebola virus 7Met Ala Lys
Ala Thr Gly Arg Tyr Asn Leu Val Pro Pro Lys Lys Asp1 5
10 15Met Glu Lys Gly Val Ile Phe Ser Asp
Leu Cys Asn Phe Leu Ile Thr 20 25
30Gln Thr Leu Gln Gly Trp Lys Val Tyr Trp Ala Gly Ile Glu Phe Asp
35 40 45Val Ser Gln Lys Gly Met Ala
Leu Leu Thr Arg Leu Lys Thr Asn Asp 50 55
60Phe Ala Pro Ala Trp Ala Met Thr Arg Asn Leu Phe Pro His Leu Phe65
70 75 80Gln Asn Pro Asn
Ser Val Ile Gln Ser Pro Ile Trp Ala Leu Arg Val 85
90 95Ile Leu Ala Ala Gly Leu Gln Asp Gln Leu
Leu Asp His Ser Leu Val 100 105
110Glu Pro Leu Thr Gly Ala Leu Gly Leu Ile Ser Asp Trp Leu Leu Thr
115 120 125Thr Thr Ser Thr His Phe Asn
Leu Arg Thr Arg Ser Val Lys Asp Gln 130 135
140Leu Ser Leu Arg Met Leu Ser Leu Ile Arg Ser Asn Ile Leu Gln
Phe145 150 155 160Ile Asn
Lys Leu Asp Ala Leu His Val Val Asn Tyr Asn Gly Leu Leu
165 170 175Ser Ser Ile Glu Ile Gly Thr
Ser Thr His Thr Ile Ile Ile Thr Arg 180 185
190Thr Asn Met Gly Phe Leu Val Glu Val Gln Glu Pro Asp Lys
Ser Ala 195 200 205Met Asn Ser Lys
Arg Pro Gly Pro Val Lys Phe Ser Leu Leu His Glu 210
215 220Ser Ala Phe Lys Pro Phe Thr Arg Val Pro Gln Ser
Gly Met Gln Ser225 230 235
240Leu Ile Met Glu Phe Asn Ser Leu Leu Ala Ile 245
25082212PRTReston Ebola virus 8Met Ala Thr Gln His Thr Gln Tyr
Pro Asp Ala Arg Leu Ser Ser Pro1 5 10
15Ile Val Leu Asp Gln Cys Asp Leu Val Thr Arg Ala Cys Gly
Leu Tyr 20 25 30Ser Ser Tyr
Ser Leu Asn Pro Gln Leu Arg Gln Cys Lys Leu Pro Lys 35
40 45His Ile Tyr Arg Leu Lys Phe Asp Thr Ile Val
Ser Lys Phe Leu Ser 50 55 60Asp Thr
Pro Val Ala Thr Leu Pro Ile Asp Tyr Leu Val Pro Ile Leu65
70 75 80Leu Arg Ser Leu Thr Gly His
Gly Asp Arg Pro Leu Thr Pro Thr Cys 85 90
95Asn Gln Phe Leu Asp Gly Ile Ile Asn Tyr Thr Leu His
Asp Ala Ala 100 105 110Phe Leu
Asp Tyr Tyr Leu Lys Ala Thr Gly Ala Gln Asp His Leu Thr 115
120 125Asn Ile Thr Thr Arg Glu Lys Leu Lys Asn
Glu Ile Leu Asn Asn Asp 130 135 140Tyr
Val His Gln Leu Phe Phe Trp His Asp Leu Ser Ile Leu Ala Arg145
150 155 160Arg Gly Arg Leu Asn Arg
Gly Asn Asn Arg Ser Thr Trp Phe Val His 165
170 175Asp Glu Phe Ile Asp Ile Leu Gly Tyr Gly Asp Tyr
Ile Phe Trp Lys 180 185 190Ile
Pro Leu Ser Leu Leu Pro Val Thr Ile Asp Gly Val Pro His Ala 195
200 205Ala Thr Asp Trp Tyr Gln Pro Thr Leu
Phe Lys Glu Ser Ile Leu Gly 210 215
220His Ser Gln Ile Leu Ser Val Ser Thr Ala Glu Ile Leu Ile Met Cys225
230 235 240Lys Asp Ile Ile
Thr Cys Arg Phe Asn Thr Ser Leu Ile Ala Ser Ile 245
250 255Ala Lys Leu Glu Asp Val Asp Val Ser Asp
Tyr Pro Asp Pro Ser Asp 260 265
270Ile Leu Lys Ile Tyr Asn Ala Gly Asp Tyr Val Ile Ser Ile Leu Gly
275 280 285Ser Glu Gly Tyr Lys Ile Ile
Lys Tyr Leu Glu Pro Leu Cys Leu Ala 290 295
300Lys Ile Gln Leu Cys Ser Lys Phe Thr Glu Arg Lys Gly Arg Phe
Leu305 310 315 320Thr Gln
Met His Leu Ser Val Ile Asn Asp Leu Arg Glu Leu Ile Ser
325 330 335Asn Arg Arg Leu Lys Asp Tyr
Gln Gln Glu Lys Ile Arg Asp Phe His 340 345
350Lys Ile Leu Leu Gln Leu Gln Leu Ser Pro Gln Gln Phe Cys
Glu Leu 355 360 365Phe Ser Val Gln
Lys His Trp Gly His Pro Ile Leu His Ser Glu Lys 370
375 380Ala Ile Gln Lys Val Lys Arg His Ala Thr Ile Leu
Lys Ala Leu Arg385 390 395
400Pro Asn Val Ile Phe Glu Thr Tyr Cys Val Phe Lys Tyr Asn Ile Ala
405 410 415Lys His Tyr Phe Asp
Ser Gln Gly Thr Trp Tyr Ser Val Ile Ser Asp 420
425 430Arg Asn Leu Thr Pro Gly Leu Asn Ser Phe Ile Lys
Arg Asn His Phe 435 440 445Pro Ser
Leu Pro Met Ile Lys Asp Leu Leu Trp Glu Phe Tyr His Leu 450
455 460Asn His Pro Pro Leu Phe Ser Thr Lys Val Ile
Ser Asp Leu Ser Ile465 470 475
480Phe Ile Lys Asp Arg Ala Thr Ala Val Glu Gln Thr Cys Trp Asp Ala
485 490 495Val Phe Glu Pro
Asn Val Leu Gly Tyr Asn Pro Pro Asn Lys Phe Ser 500
505 510Thr Lys Arg Val Pro Glu Gln Phe Leu Glu Gln
Glu Asp Phe Ser Ile 515 520 525Glu
Ser Val Leu Asn Tyr Ala Gln Glu Leu His Tyr Leu Leu Pro Gln 530
535 540Asn Arg Asn Phe Ser Phe Ser Leu Lys Glu
Lys Glu Leu Asn Ile Gly545 550 555
560Arg Thr Phe Gly Lys Leu Pro Tyr Leu Thr Arg Asn Val Gln Thr
Leu 565 570 575Cys Glu Ala
Leu Leu Ala Asp Gly Leu Ala Lys Ala Phe Pro Ser Asn 580
585 590Met Met Val Val Thr Glu Arg Glu Gln Lys
Glu Ser Leu Leu His Gln 595 600
605Ala Ser Trp His His Thr Ser Asp Asp Phe Gly Glu Asn Ala Thr Val 610
615 620Arg Gly Ser Ser Phe Val Thr Asp
Leu Glu Lys Tyr Asn Leu Ala Phe625 630
635 640Arg Tyr Glu Phe Thr Ala Pro Phe Ile Glu Tyr Cys
Asn His Cys Tyr 645 650
655Gly Val Arg Asn Val Phe Asn Trp Met His Tyr Leu Ile Pro Gln Cys
660 665 670Tyr Met His Val Ser Asp
Tyr Tyr Asn Pro Pro His Asn Val Asn Leu 675 680
685Ser Asn Arg Glu Tyr Pro Pro Glu Gly Pro Ser Ser Tyr Arg
Gly His 690 695 700Leu Gly Gly Ile Glu
Gly Leu Gln Gln Lys Leu Trp Thr Ser Ile Ser705 710
715 720Cys Ala Gln Ile Ser Leu Val Glu Ile Lys
Thr Gly Phe Lys Leu Arg 725 730
735Ser Ala Val Met Gly Asp Asn Gln Cys Ile Thr Val Leu Ser Val Phe
740 745 750Pro Leu Glu Thr Asp
Pro Glu Glu Gln Glu Gln Ser Ala Glu Asp Asn 755 760
765Ala Ala Arg Val Ala Ala Ser Leu Ala Lys Val Thr Ser
Ala Cys Gly 770 775 780Ile Phe Leu Lys
Pro Glu Glu Thr Phe Val His Ser Gly Phe Ile Tyr785 790
795 800Phe Gly Lys Lys Gln Tyr Leu Asn Gly
Val Gln Leu Pro Gln Ser Leu 805 810
815Lys Thr Ala Ala Arg Met Ala Pro Leu Ser Asp Ala Ile Phe Asp
Asp 820 825 830Leu Gln Gly Thr
Leu Ala Ser Ile Gly Thr Ala Phe Glu Arg Ala Ile 835
840 845Ser Glu Thr Arg His Ile Leu Pro Cys Arg Ile Val
Ala Ala Phe His 850 855 860Thr Tyr Phe
Ala Val Arg Ile Leu Gln Tyr His His Leu Gly Phe Asn865
870 875 880Lys Gly Ile Asp Leu Gly Gln
Leu Ser Leu Ser Lys Pro Leu Asp Tyr 885
890 895Gly Thr Ile Thr Leu Thr Leu Ala Val Pro Gln Val
Leu Gly Gly Leu 900 905 910Ser
Phe Leu Asn Pro Glu Lys Cys Phe Tyr Arg Asn Phe Gly Asp Pro 915
920 925Val Thr Ser Gly Leu Phe Gln Leu Arg
Val Tyr Leu Glu Met Val Asn 930 935
940Met Lys Asp Leu Phe Tyr Pro Leu Ile Ser Lys Asn Pro Gly Asn Cys945
950 955 960Ser Ala Ile Asp
Phe Val Leu Asn Pro Ser Gly Leu Asn Val Pro Gly 965
970 975Ser Gln Asp Leu Thr Ser Phe Leu Arg Gln
Ile Val Arg Arg Ser Ile 980 985
990Thr Leu Thr Ala Arg Asn Lys Leu Ile Asn Thr Leu Phe His Ala Ser
995 1000 1005Ala Asp Leu Glu Asp Glu Met
Val Cys Lys Trp Leu Leu Ser Ser Asn 1010 1015
1020Pro Val Met Ser Arg Phe Ala Ala Asp Ile Phe Ser Arg Thr Pro
Ser1025 1030 1035 1040Gly
Lys Arg Leu Gln Ile Leu Gly Tyr Leu Glu Gly Thr Arg Thr Leu
1045 1050 1055Leu Ala Ser Lys Ile Ile Asn
Asn Asn Ser Glu Thr Pro Val Leu Asp 1060 1065
1070Lys Leu Arg Lys Ile Thr Leu Gln Arg Trp Asn Leu Trp Phe
Ser Tyr 1075 1080 1085Leu Asp His
Cys Asp Gln Leu Leu Ala Asp Ala Leu Gln Lys Ile Ser 1090
1095 1100Cys Thr Val Asp Leu Ala Gln Ile Leu Arg Glu Tyr
Thr Trp Ser His1105 1110 1115
1120Ile Leu Glu Gly Arg Pro Leu Ile Gly Ala Thr Leu Pro Cys Met Val
1125 1130 1135Glu Gln Phe Lys Val
Lys Trp Leu Arg Gln Tyr Glu Pro Cys Pro Glu 1140
1145 1150Cys Leu Asn Lys Lys Gly Ser Asn Ala Tyr Val Ser
Val Ala Val Lys 1155 1160 1165Asp
Gln Val Val Ser Ala Trp Pro Asn Thr Ser Arg Ile Ser Trp Thr 1170
1175 1180Ile Gly Ser Gly Val Pro Tyr Ile Gly Ser
Arg Thr Glu Asp Lys Ile1185 1190 1195
1200Gly Gln Pro Ala Ile Lys Pro Arg Cys Pro Ser Ser Ala Leu Lys
Glu 1205 1210 1215Ala Ile
Glu Leu Ala Ser Arg Leu Thr Trp Val Thr Gln Gly Ser Ser 1220
1225 1230Asn Ser Glu Gln Leu Ile Arg Pro Phe
Leu Glu Ala Arg Val Asn Leu 1235 1240
1245Ser Val Ser Glu Val Leu Gln Met Thr Pro Ser His Tyr Ser Gly Asn
1250 1255 1260Ile Val His Arg Tyr Asn Asp
Gln Tyr Ser Pro His Ser Phe Met Ala1265 1270
1275 1280Asn Arg Met Ser Asn Thr Ala Thr Arg Leu Ile Val
Ser Thr Asn Thr 1285 1290
1295Leu Gly Glu Phe Ser Gly Gly Gly Gln Ala Ala Arg Asp Ser Asn Ile
1300 1305 1310Ile Phe Gln Asn Val Ile
Asn Leu Ala Val Ala Leu Tyr Asp Ile Arg 1315 1320
1325Phe Arg Asn Thr Asn Thr Ser Asp Ile Arg His Asn Arg Ala
His Leu 1330 1335 1340His Leu Thr Glu
Cys Cys Thr Lys Glu Val Pro Ala Gln Tyr Leu Thr1345 1350
1355 1360Tyr Thr Ser Ala Leu Asn Leu Asp Leu
Ser Arg Tyr Arg Asp Asn Glu 1365 1370
1375Leu Ile Tyr Asp Ser Asn Pro Leu Arg Gly Gly Leu Asn Cys Asn
Leu 1380 1385 1390Thr Met Asp
Ser Pro Leu Val Lys Gly Pro Arg Leu Asn Met Ile Glu 1395
1400 1405Asp Asp Leu Leu Arg Phe Pro His Leu Ser Gly
Trp Glu Leu Ala Lys 1410 1415 1420Thr
Val Val Gln Ser Ile Ile Ser Asp Asn Ser Asn Ser Ser Thr Asp1425
1430 1435 1440Pro Ile Ser Ser Gly Glu
Thr Arg Ser Phe Thr Thr His Phe Leu Thr 1445
1450 1455Tyr Pro Gln Ile Gly Leu Leu Tyr Ser Phe Gly Ala
Val Leu Cys Phe 1460 1465
1470Tyr Leu Gly Asn Thr Ile Leu Trp Thr Lys Lys Leu Asp Tyr Glu Gln
1475 1480 1485Phe Leu Tyr Tyr Leu His Asn
Gln Leu His Asn Leu Pro His Arg Ala 1490 1495
1500Leu Arg Val Phe Lys Pro Thr Phe Lys His Ala Ser Val Met Ser
Arg1505 1510 1515 1520Leu
Met Glu Ile Asp Ser Asn Phe Ser Ile Tyr Ile Gly Gly Thr Ser
1525 1530 1535Gly Asp Arg Gly Leu Ser Asp
Ala Ala Arg Leu Phe Leu Arg Thr Ala 1540 1545
1550Ile Ala Ser Phe Leu Gln Phe Leu Lys Ser Trp Ile Ile Asp
Arg Gln 1555 1560 1565Lys Ala Ile
Pro Leu Trp Ile Val Tyr Pro Leu Glu Gly Gln Gln Pro 1570
1575 1580Glu Ser Ile Asn Glu Phe Leu His Lys Ile Phe Gly
Leu Leu Lys Gln1585 1590 1595
1600Gly Pro Lys Asn Ile Pro Lys Glu Val Ser Ile Gln Asn Asp Gly His
1605 1610 1615Leu Asp Leu Ala Glu
Asn Asn Tyr Val Tyr Asn Ser Lys Ser Thr Ala 1620
1625 1630Ser Asn Phe Phe His Ala Ser Leu Ala Tyr Trp Arg
Ser Arg Lys Ser 1635 1640 1645Arg
Lys Thr Gln Asp His Asn Asp Phe Ser Arg Gly Asp Gly Thr Leu 1650
1655 1660Thr Glu Pro Val Cys Lys Phe Ser Ser Asn
His Gln Ser Asp Glu Lys1665 1670 1675
1680Tyr Tyr Asn Val Thr Cys Gly Lys Ser Pro Lys Pro Gln Glu Arg
Lys 1685 1690 1695Asp Phe
Ser Gln Tyr Arg Leu Ser Asn Asn Gly Gln Thr Met Ser Asn 1700
1705 1710His Arg Lys Lys Gly Lys Phe His Lys
Trp Asn Pro Cys Lys Val Leu 1715 1720
1725Met Glu Ser Gln Arg Gly Thr Val Leu Lys Glu Gly Asp Tyr Phe Gln
1730 1735 1740Asn Asn Thr Pro Pro Thr Asp
Asp Val Ser Ser Pro His Arg Leu Ile1745 1750
1755 1760Leu Pro Phe Phe Lys Leu Gly Asn His Asn His Ala
His Asp Gln Asp 1765 1770
1775Ala Gln Glu Leu Ile Asn Gln Asn Ile Lys Gln Tyr Leu His Gln Leu
1780 1785 1790Arg Ser Met Leu Asp Thr
Thr Ile Tyr Cys Arg Phe Thr Gly Ile Val 1795 1800
1805Ser Ser Met His Tyr Lys Leu Asp Glu Val Leu Leu Glu Tyr
Asn Ser 1810 1815 1820Phe Asp Ser Ala
Ile Thr Leu Ala Glu Gly Glu Gly Ser Gly Ala Leu1825 1830
1835 1840Leu Leu Leu Gln Lys Tyr Ser Thr Arg
Leu Leu Phe Leu Asn Thr Leu 1845 1850
1855Ala Thr Glu His Ser Ile Glu Ser Glu Val Val Ser Gly Phe Ser
Thr 1860 1865 1870Pro Arg Met
Leu Leu Pro Ile Met Gln Lys Val His Glu Gly Gln Val 1875
1880 1885Thr Val Ile Leu Asn Asn Ser Ala Ser Gln Ile
Thr Asp Ile Thr Ser 1890 1895 1900Ser
Met Trp Leu Ser Asn Gln Lys Tyr Asn Leu Pro Cys Gln Val Glu1905
1910 1915 1920Ile Ile Thr Met Asp Ala
Glu Thr Thr Glu Asn Leu Asn Arg Ser Gln 1925
1930 1935Leu Tyr Arg Ala Val Tyr Asn Leu Ile Leu Asp His
Ile Asp Pro Gln 1940 1945
1950Tyr Leu Lys Val Val Val Leu Lys Val Phe Leu Ser Asp Ile Glu Gly
1955 1960 1965Ile Leu Trp Ile Asn Asp Tyr
Leu Ala Pro Leu Phe Gly Ala Gly Tyr 1970 1975
1980Leu Ile Lys Pro Ile Thr Ser Ser Ala Arg Ser Ser Glu Trp Tyr
Leu1985 1990 1995 2000Cys
Leu Ser Asn Leu Ile Ser Thr Asn Arg Arg Ser Ala His Gln Thr
2005 2010 2015His Lys Ala Cys Leu Gly Val
Ile Arg Asp Ala Leu Gln Ala Gln Val 2020 2025
2030Gln Arg Gly Val Tyr Trp Leu Ser His Ile Ala Gln Tyr Ala
Thr Lys 2035 2040 2045Asn Leu His
Cys Glu Tyr Ile Cys Leu Gly Phe Pro Pro Leu Glu Lys 2050
2055 2060Val Leu Tyr His Arg Tyr Asn Leu Val Asp Thr Gly
Leu Gly Pro Leu2065 2070 2075
2080Ser Ser Val Ile Arg His Leu Thr Asn Leu Gln Ala Glu Ile Arg Asp
2085 2090 2095Leu Val Leu Asp Tyr
Thr Leu Met Arg Glu Ser Arg Thr Gln Thr Tyr 2100
2105 2110His Phe Ile Lys Thr Ala Lys Gly Arg Ile Thr Lys
Leu Val Asn Asp 2115 2120 2125Phe
Leu Lys Phe Ser Leu Ile Val Gln Ala Leu Lys Asn Asn Ser Ser 2130
2135 2140Trp Tyr Thr Glu Leu Lys Lys Leu Pro Glu
Val Ile Asn Val Cys Asn2145 2150 2155
2160Arg Phe Tyr His Thr His Ser Cys Glu Cys Gln Glu Lys Phe Phe
Val 2165 2170 2175Gln Thr
Leu Tyr Leu Gln Arg Leu Arg Asp Ala Glu Ile Lys Leu Ile 2180
2185 2190Glu Arg Leu Thr Gly Leu Met Arg Phe
Tyr Pro Glu Gly Leu Ile Tyr 2195 2200
2205Ser Asn His Thr 2210918890DNAReston Ebola virus 9gggacacaca
aaagaaaaag gttttttaag attttttgtg tgcgagtaac tatgaggaag 60attaacagtt
ttcctcagtt taaggtatac actgaaattg agattgagat tctcctcttt 120gctattctgt
aactttccct ggttgtgaca attgaatcag ttttatctat taccaattac 180catcaacatg
gtatgtctag tgatcttggg actcttcttc atctggtttt tcctagagct 240ctgaatctat
tttgtgagaa gttcatccaa acgacccagt gtctgaaaat acaagaggtt 300cccctttccg
tcaagtttaa ggggttgttt tgattgtgtg tagattttat aatcctagag 360tgccaaggag
ttgcgtgtca tcattaattg ggaagatcaa ggaaacaatt tgttccaata 420atatcgtaca
tcttgactaa gtcgaacaag gggaagtcga tatggatcgt gggaccagaa 480gaatctgggt
gtcgcaaaat caaggtgata ctgatttaga ttatcataaa attttgacag 540ctggccttac
tgttcaacag ggaattgtca ggcagaaaat aatttctgta tatcttgttg 600ataacttgga
ggctatgtgt caattggtaa tacaagcctt tgaggccgga attgatttcc 660aagaaaatgc
cgacagcttc cttctgatgc tttgcctaca tcatgcttac caaggtgact 720ataaattgtt
cttggagagc aatgctgtac agtatttgga aggtcatgga ttcaaatttg 780agctccggaa
gaaggacggt gtcaatcggc tcgaggaatt gcttcctgct gcaacgagtg 840gaaaaaacat
caggcgtacg ttggccgcac tgcctgaaga ggagactaca gaagcaaatg 900cagggcaatt
tctctcattt gcgagtttgt ttcttcccaa actggttgtg ggagagaagg 960cttgcttgga
aaaagtccag cgacaaattc aggttcatgc agaacagggt ttaattcaat 1020atcccactgc
atggcaatca gttggacaca tgatggtaat cttcagattg atgaggacta 1080atttcttgat
taaatattta ctgatccacc agggtatgca tatggtagct ggccacgatg 1140ccaatgatgc
tgtcattgct aattcagttg ctcaggctcg cttttcagga ctcctaattg 1200tcaaaaccgt
tcttgatcat attctgcaga aaaccgacca aggagtaaga cttcaccctt 1260tggcccgaac
agccaaagtg cgtaatgagg ttaatgcatt taaggccgcc ctaagctcac 1320ttgctaagca
tggggagtat gccccttttg ctcgccttct caatctctcg ggagttaaca 1380acctagaaca
tggtctctac ccacagttat cagcaattgc tcttggagtt gccacagcac 1440atggtagcac
ccttgcagga gttaatgttg gtgagcagta tcagcagctt agagaggctg 1500ccactgaagc
tgagaagcaa ctccaacaat atgctgagtc cagagaactc gacagcctag 1560gcctagacga
tcaggaaaga agaatactaa tgaacttcca tcagaagaaa aatgaaatta 1620gtttccagca
gaccaatgca atggtaaccc ttaggaaaga gcgactggct aaattaacag 1680aagctataac
gctggcctca agacctaacc tcgggtctag acaagacgac gacaatgaaa 1740taccgttccc
tgggcctata agcaacaacc cagaccaaga tcatctggag gatgatccta 1800gagactccag
agacactatc attcctaata gtgcaattga ccccgaggat ggtgattttg 1860aaaattacaa
tggctatcat gatgatgaag ttgggacggc aggtgacttg gtcttgttcg 1920atcttgacga
tcatgaggat gacaataaag cttttgagct acaggacagc tcaccacaat 1980cccaaaggga
aatagagaga gaaagattaa ttcatccacc cccaggcaac aacaaggacg 2040acaatcgggc
ctcagacaac aatcaacaat cagcagattc tgaggaacaa gaaggtcaat 2100acaacaggca
ccgaggccca gaacgtacga ccgccaatcg aagactctca ccagtgcacg 2160aagaggacac
ccctatagat caaggcgatg atgatccctc aagcccacct ccgctggaat 2220ctgatgatga
cgatgcatca agtagccaac aagatcccga ttatacagct gttgcccctc 2280ctgctcctgt
ataccgcagt gcagaagccc acgagcctcc ccacaaatcc tcgaacgagc 2340cagctgaaac
atcacaattg aatgaagacc ctgatatcgg tcaatcaaag tctatgcaaa 2400aattaggaga
gacatatcac catctgctga gaactcaagg tccatttgaa gctatcaatt 2460attatcacat
gatgaaggat gagccggtaa tatttagcac tgatgatggg aaggaataca 2520cctacccgga
ttcacttgag gaagcctatc ctccatggct caccgagaaa gaacgactgg 2580acaatgaaaa
tcgatacatt tacataaata atcaacagtt cttctggcct gtcatgagtc 2640ccagagacaa
atttcttgca atcttgcagc accatcagta accacagcac aaagcgcggt 2700ccacttcgta
aagctaaata cacttaaagc ttgaccgatt catctacaaa aactaatcca 2760ttataactta
ttagtgctac ttttctataa gtgattctca atctaaggcc attaagagtt 2820taagcaatat
acatatacac ttacaccggt ctatccaaga tgtggctcaa tgttcttaat 2880ttgaacatag
tcataagggg ataaataata ctttatattt ctgattgtgg actgacccat 2940tctgcttaaa
atgcttcgcc cattaaaaat gtgatctaat agatagccct gactagacca 3000attaagaaaa
acatttgatg aagattaaaa ccttcatcgc cagtaaatga ttatattgtc 3060tgtaggcagg
tgtttactcc accttaaagt cggaaatatc ctaccttagg accattgtta 3120agaggtgcat
aggcattacc atccttgaga acatgtataa tgataaattg aagatatgtt 3180caggcccaga
aacaactgga tggatttctg agcaactaat gacaggtaag attccagtaa 3240ctgatatatt
cattgatatt gataacaagc cagatcaaat ggaagtccgg ctcaaaccat 3300catcaaggag
ctcaaccaga acttgtacaa gtagcagtca gacggaggtc aactatgtac 3360ctctccttaa
aaaggttgag gatacattaa ctatgctagt gagtgcaacc agtcgtcaga 3420atgctgcaat
cgaggccctt gaaaaccgcc tcagcacact tgagagtagc ttaaagccaa 3480tccaagacat
gggtaaagtg atttcatcat tgaatcgcag ttgtgccgaa atggtggcaa 3540aatatgatct
tctagttatg acaactggac gggctacttc aaccgcagct gcagtagatg 3600cgtactggaa
agagcacaaa cagccaccac cagggccagc gttgtatgaa gagaatgcgc 3660ttaaaggaaa
aatcgatgat ccaaacagct atgtaccaga tgctgtgcag gaggcttaca 3720agaaccttga
cagtacatcg accctgaccg aggaaaattt tgggaaacct tatatatctg 3780ctaaagatct
gaaggagatc atgtatgatc atctacctgg ttttgggact gcctttcacc 3840aacttgttca
agtgatttgt aaaataggaa aggataacaa cctcttggac acaatccatg 3900ctgagttcca
ggcaagtcta gcagatggtg actctcccca atgtgcactc atacagataa 3960ccaaaagagt
cccaatcttt caggatgtgc cgcccccgac aatccacatt agatcccgtg 4020gtgatatccc
acgagcatgc caaaagagtc tccgaccagc accaccatca cccaaaattg 4080atcgtggttg
ggtttgtttg tttaagatgc aagatggtaa aacgcttgga cttaagatct 4140aaggatcaag
atttatttaa caaggcaagc cacaacctta gatagaacct cagccagact 4200attgaactat
tgacgctgtt gatgataata tataattaat ggtcatattt gaatatgaca 4260acatcttgct
tcttgttttg ccttgtatct ctttgagttg gaagatcatt ccaaacttac 4320aaacatgcac
aagatgttat ggtttagcaa agaattgata ggagtactgg tatataatgt 4380aaatataaca
agtgatgaag attaagaaaa accagtcggt attttccaga cttggcattt 4440cttatcttca
tcttctaaag tgagatattt tatcatcaaa aaatgagacg cggagtgtta 4500ccaacggctc
ctccagcata taatgatatt gcatactcta tgagcatact cccaacccga 4560ccaagtgtca
tagtcaatga gaccaaatca gatgtactgg cagtgccagg agcagatgtt 4620ccatcaaact
ccatgagacc agtggctgat gataacattg atcactcaag ccatactcca 4680agcggagtag
cttctgcctt tatattggaa gctaaagtga atgtaatttc gggaacaaaa 4740gtcctgatga
agcaaatacc tatttggctt ccactgggtg tagctgatca gaagatatac 4800agctttgatt
caacaacagc cgcaattatg ttggcttcct acacagtgac acacttcggg 4860aagatatcta
acccgctggt acgtgtcaac aggctaggcc caggaatacc cgatcatccg 4920ctacgactcc
taaggttggg caatcaggca ttccttcaag agtttgttct tccaccagtc 4980cagcttcccc
agtatttcac atttgatcta acagctctaa agctcatcac tcaaccattg 5040ccagctgcaa
cctggacaga cgaaactcca gcaggagcag tcaatgctct tcgtcctggg 5100ctctcactcc
atcccaagct tcgtccaatt cttctaccgg ggaagatagg aaagaaaggt 5160catgcttcag
acttaacatc acctgacaaa attcaaacaa tcatgaatgc aataccggac 5220ctcaaaattg
tcccgattga tccaatcaag aacatagttg gaattgaggt tccagaatta 5280ctagttcaaa
ggctgaccgg caaaaaacca caacccaaaa atggccaacc aattattcca 5340gttcttcttc
cgaaatatgt tggacttgat cctatatcgc caggggactt aactatggtt 5400atcacccagg
attgtgattc atgccactct ccagccagcc atccgtatca catggacaag 5460caggatagtt
accaataatt taaattccat tcgagctatt attctgctag taattccgac 5520gggatcaata
gactaaaaat ctgattgtat agaattataa aagaatcaag cagaggcaac 5580agactcacag
cttacgccta gatgactaat attaaggagt tttttaatct aattttccag 5640tcttaagtaa
taatcatttc ttttgtaatt aattatgcat ttgttaactt atcggtgcga 5700gatttccttg
agaacccggc ggggcttcta ctatctgtag taaccagaag agaagttcaa 5760cccagtcaaa
actaaaccaa gcaatattct gaatgctcta tagtctattc taatcagagg 5820tataacaatg
gctaagattt caatgactcg ttaacaatcg ctagtaattt taatctccag 5880attaagaaaa
agatatacga tgaagattaa ggcgacaacg agccgaaact tcatctcttt 5940taaagatcta
acattatctg ttccaaagtc atacaaggac acattcaaat cagggattgt 6000aagctgctat
ttcttacctc cccaaatcac ctatacaaca tggggtcagg atatcaactt 6060ctccaattgc
ctcgggaacg ttttcgtaaa acttcgttct tagtatgggt aatcatcctc 6120ttccagcgag
caatctccat gccgcttggt atagtgacaa atagcactct caaagcaaca 6180gaaattgatc
aattggtttg tcgggacaaa ctgtcatcaa ccagtcagct caagtctgtg 6240gggctgaatc
tggaaggaaa tggaattgca accgatgtcc catcagcaac aaaacgctgg 6300ggattccgtt
caggtgtgcc tcccaaggtg gtcagctatg aagccggaga atgggcagaa 6360aattgctaca
atctggagat caaaaagtca gacggaagtg agtgcctccc tctccctccc 6420gacggtgtac
ggggattccc tagatgtcgc tatgtccaca aagttcaagg aacaggtcct 6480tgtcccggtg
acttagcttt ccataaaaat ggggcttttt tcttgtatga tagattggcc 6540tcaactgtca
tctaccgtgg gacaactttt gctgaaggtg tcatagcttt tttaattctg 6600tcagagccca
agaagcattt ttggaaggct acaccagctc atgaaccggt gaacacaaca 6660gatgattcca
caagctacta catgaccctg acactcagct acgagatgtc aaattttgga 6720ggcgaggaaa
gtaacaccct ttttaaggta gacaaccaca catatgtgca actagatcgt 6780ccacacactc
cgcagttcct tgttcagctc aatgaaacac ttcgaagaaa taatcgcctt 6840agcaacagta
cagggagatt gacttggaca gtggatccca aaattgaacc agatgttggt 6900gagtgggcct
tctgggaaac taaaaaaact tttcccaaca acttcatgga gaaaacttgc 6960atttccaaat
tctatcaacc cacaccaaca actcctcaga tcagagcccg gcgggaactg 7020tccaaggaaa
aattagctac cacccaccca ccaacaactc cgagctggtt ccaacggatt 7080cccctccagt
ggtttcagtg ctcactgcag gacggacaga ggaaatgtcg acccaaggtc 7140taactaacgg
agagacaatc acaggtttca ccgcgaaccc aatgacaacc accattgccc 7200caagtccaac
catgacaagc gaggttgata acaatgtacc aagtgaacaa ccgaacaaca 7260cagcatccat
tgaagactcc cccccatcgg caagcaacga gacaattgac cactccgaaa 7320tgaattcgat
ccaaggctcg aacaactccg cccagagccc acagaccaag gccacgccag 7380cgcccacagc
atccccgatg accctggacc cgcaagagac ggccaacatc agcaaaccag 7440gaaccagccc
aggaagcgca gccggaccaa gtcagcccgg actcactata aatacaataa 7500gtaaggtagc
tgattcactg agtcccacca ggaaacaaaa gcgatcggtt cgacaaaaca 7560ccgctaataa
atgtaaccca gatcttcact attggacagc tgttgatgag ggggcagcag 7620caggattggc
atggattcca tattttggac ctgcagcaga aggcatctac attgagggtg 7680taatgcataa
tcagaatggg cttatttgcg ggctacgtca gctagccaat gaaactaccc 7740aggctcttca
attatttctg cgggccacaa cagaactgag gacttactca cttcttaaca 7800gaaaagctat
tgattttctt cttcaacgat ggggaggtac ctgtcgaatc ctaggaccat 7860cttgttgcat
tgagccacat gattggacaa aaaatattac tgatgaaatt aaccaaatta 7920aacatgactt
tattgacaat cccctaccag accacggaga tgatcttaat ctatggacag 7980gttggagaca
atggatcccg gctggaattg ggattattgg agttataatt gctataatag 8040ccctactttg
tatatgtaag attttgtgtt gatttattct gagatctgag agaaaaaaat 8100ctcagggtta
ctctaaggag aaatattatt tttaaaattt acttaaatgc tgaccactta 8160tcttaaatga
gcaattaata atatgttttt ctgcttcttt gcttgattta caatatgata 8220tttctcttaa
taatgattaa tatattaaga aaaacttatg acgaagatta aaggggagga 8280tcgttaacgg
gaaaatctcc catctcgttc gtcgaagcca cgttggtggt gcttgcagct 8340gagaacaact
ccagagattg taggtagaaa ggaccagcat ttataggtag gggtcagaaa 8400gcaacaatag
ccataaaagg agagcctgac attgctattt aatatcctag aacctgattt 8460ctaggttcta
gttgtacaat ccggatgatg gagcattcaa gagaacgggg tagatctagc 8520aacatgcgac
ataatagccg ggaaccatac gaaaatccat caaggtctcg ctcattatct 8580cgggacccta
atcaggttga tcgtaggcag cctcgaagtg catcccaaat tcgtgttccg 8640aatctgttcc
atcggaaaaa gactgatgca ctcatagttc ctccggctcc caaagatata 8700tgcccaacac
tcaaaaaagg attcctctgc gatagtaaat tttgcaaaaa agatcaccaa 8760ttggatagct
taaatgatca tgaattacta ctgctaattg caagaagaac atgtggaatt 8820atcgagagca
attcgcagat tacatcccca aaagatatgc ggttagcgaa tccaacagct 8880gaagacttct
cacaaggtaa tagtcctaaa ttaacacttg cagtccttct tcaaattgct 8940gaacattggg
caaccagaga cctaaggcaa attgaggact ctaaacttag agctctttta 9000accctttgtg
ccgtattaac aaggaaattt tctaaatccc aactgggtct tctatgtgag 9060acccacctac
ggcatgaggg cctcggacag gaccaagctg attctgtatt agaggtctac 9120caaagactcc
acagtgataa aggagggaat tttgaggctg ccctgtggca acaatgggac 9180cgacagtcgt
taataatgtt catctctgct tttctcaaca ttgctctcca gacaccttgt 9240gaaagttcta
gtgtcgtagt ctcaggtctt gccacattgt acccagcaca agacaattct 9300acaccgtccg
aggcaactaa tgataccacc tggtcaagta cagttgaata gaaaaccact 9360ggagctattt
ttccacgatt gctctcagtc aataaattaa tatagatata atacgacttc 9420ggtgtgcaat
tgtcaagggt tccatttggt aataatgatt cttaaaacaa tctactatcg 9480taattatcga
tggatctacc ctatttgacg gtacatgact tgaatgtaat aaggtaagtt 9540ggtatctgag
gtattttgtc tagagtatac tcaaaatcgt atgtctagca aattatcaat 9600agcaaagtta
aattctccta acctcatatt ttgatcaagt aatcatgatt ttatggtaat 9660tctttgcaga
ttatcggttt aatctttatt aagaaaaaat catgattgta gacaatttac 9720tggtagtccc
tgggtatcca agtttatgaa cagagctaga gagaatttgc tacttccgag 9780gtataacttt
attatttgct acttcgaatg cctaaaacca gtaatgcagg atgaagatta 9840attgcggagg
aatcaggaat tcaactttag ttccttaagg cctcgtctga atcttcatca 9900gttagtaagt
tcttttatag aagtcattag cttctaaggt gattatattt tagtattaaa 9960ttttgttaat
tgcttgctat aaagttgaaa tgtctaatgc ttaaatgaac atttctttga 10020agctgacata
cgaatacatc atatcatatg aaaacatcgc aattagagcg tccttgaagt 10080ctggcattga
cagtcaccag gctgttctca gtagtctgtc cttggaagct cttggggaga 10140caagaagagg
tcccagagag tcccaacagg ttggcataag gtcattaaca ccagcatagt 10200cagctcgatc
aagactgtaa gcgagtcgat tgcaactaaa aagattattt cttgttgttt 10260aaacaaattc
cttttgtgtg agacaccctc aaggcacaag atggctaaag ccacaggccg 10320atacaatctc
gtgcccccaa agaaagatat ggaaaaggga gtgattttta gtgatctttg 10380taatttcttg
attactcaaa ccctgcaagg ttggaaggtt tattgggcag gaattgagtt 10440tgatgtaagt
caaaaaggca tggctcttct gacaagactc aaaacaaatg actttgctcc 10500tgcctgggcg
atgacaagaa atctcttccc acatctgttc caaaacccaa attcggttat 10560tcaatctccc
atctgggctt tgagggtgat tttggcagcc ggattgcagg atcagttgtt 10620agaccattca
ttggttgagc cattgacagg ggctctcggt ctaatttctg attggctcct 10680aactacaacg
tcaacacatt tcaatcttcg tactagaagc gtaaaggacc agcttagtct 10740tcgtatgtta
tctttgatca ggtcaaacat cttgcagttc atcaacaagc ttgacgccct 10800gcatgttgtc
aattacaatg gtttactcag tagtattgag atcgggactt ctacacacac 10860aatcattata
actcgtacaa atatgggttt tctcgtggaa gttcaggagc ctgacaaatc 10920agctatgaat
tctaagcgcc caggaccagt caagttctca ttacttcatg agtctgcctt 10980caaacctttc
actcgtgttc cacaatctgg gatgcaatca ttaataatgg agttcaacag 11040tttgttggca
atttaacaag gtgatcttaa aataagtaca tgaatgagaa ttagttgtgg 11100gtcttaccta
gcattgttga gttagctatc taatctattt tcactaattg cattgagcac 11160tgctagtagg
tttgcaccac gttaaagatt cagagtgtat gaattgtgca gatttaaact 11220tgggttttgc
cttatgcttc acaggtggtc tttttaaaat ggagattatc agcatttctt 11280caatgggagg
agttagcaat cagaaattgg agataaatgg acatcgggat agaacaatgc 11340ctaactattg
ggcggctttc atttttaaat gtgtatataa ccaatctttt cctatctttg 11400cttatattgg
tgtaacttta ctttaataac atgtcaatgc tatactgtta agagaaggtc 11460tgaggaagat
taagaaaaag gtctcgtgtt cacttggttg ccgtcaagta tcctgtggtt 11520tttttctacc
taacttcctc atgccatatg gctacccagc atacccagta cccggatgca 11580cgtttatctt
cacctatagt cctggatcaa tgtgatttgg taactcgagc atgtgggtta 11640tattcatctt
attctctaaa tcctcagcta aggcaatgta aattaccaaa acatatatat 11700cgacttaagt
tcgacacaat agtatccaaa ttcctaagtg atacacctgt agcaacactg 11760ccgatagact
atttagtacc aattctcctg cgttccctaa cggggcacgg tgataggccg 11820ttgaccccga
cttgtaatca attccttgat ggaattatta attacactct tcatgatgca 11880gcctttcttg
attactatct caaggcaaca ggtgcacagg accatttgac aaacattaca 11940actagagaga
agcttaaaaa cgaaattcta aacaatgatt atgtccatca attgttcttc 12000tggcatgacc
tgtctatttt ggctcgacgt gggcgtctga atcgcgggaa caaccgttca 12060acctggtttg
ttcatgatga attcattgat attttaggat atggcgatta tattttttgg 12120aaaatacctt
tatcattatt accagttact atagacgggg tcccacacgc ggcaactgac 12180tggtatcaac
cgactctttt taaagaatcc atcctagggc acagccaaat cctatctgtg 12240tcgacagctg
aaatactaat tatgtgtaaa gatattatca cctgtaggtt taatacatca 12300ctgattgcat
ccattgcaaa attagaggat gtagatgtgt ctgattatcc tgacccgagt 12360gatattctta
agatatacaa tgctggagac tatgtaatat ctattcttgg ctcagaaggt 12420tataagataa
taaagtacct tgaaccactt tgtttggcca aaatccaact ttgctctaaa 12480ttcacagaaa
gaaaaggtcg tttcctcaca cagatgcatt tatcagtaat aaatgatctt 12540cgggagttga
tttctaaccg caggttaaag gactatcagc aagagaagat tagggatttt 12600cacaaaatat
tattacaatt gcaattatct cctcaacagt tttgtgaatt attctctgtt 12660caaaaacatt
gggggcatcc aattttacat agtgagaaag ctatacaaaa agtaaaacgg 12720catgcaacca
tccttaaggc tctcagacct aatgtcattt ttgagacata ttgtgtattc 12780aagtacaata
ttgccaagca ctatttcgac agccaaggaa cttggtacag tgtaatctca 12840gacaggaatt
taactccagg actcaactcc ttcataaaac gtaatcactt tccttcacta 12900cccatgatta
aggatcttct atgggaattc tatcatctta atcaccctcc gttattctct 12960acaaaggtga
ttagtgactt aagtattttc atcaaggata gggccacagc tgttgaacag 13020acatgttggg
atgcagtctt tgaacccaat gtgctaggtt acaatcctcc aaacaaattc 13080tccactaaaa
gggtgccgga acaatttcta gaacaggagg atttttcaat cgaaagtgtc 13140ctgaattatg
cacaggaatt acattattta ttaccacaga ataggaattt ttccttttct 13200cttaaagaaa
aagaattaaa tattggacga acatttggta agctaccata tctcacacgg 13260aatgtccaaa
ctttatgtga ggctctgtta gcagatggac tggctaaggc cttccccagt 13320aacatgatgg
tagtaactga acgtgaacaa aaagagagcc ttcttcatca ggcatcatgg 13380caccacacca
gtgatgattt tggagagaat gctaccgttc gagggagtag ttttgtaact 13440gatttagaga
agtacaatct tgcatttcgc tatgagttca ctgcaccatt tattgagtac 13500tgcaaccatt
gctatggtgt gcgtaatgtc tttaattgga tgcattattt aatcccgcag 13560tgttacatgc
atgtaagtga ttattataat ccgcctcaca atgttaatct tagcaatcga 13620gaatatcctc
ctgaaggccc gagttcgtac cgagggcact taggaggcat agagggatta 13680caacaaaaac
tgtggacgag tatatcctgt gcacaaatct ccttagtgga aattaaaact 13740ggttttaagt
tacgatcagc ggtcatggga gacaatcagt gtataaccgt attgtctgtt 13800tttccacttg
aaacagaccc tgaagagcag gagcaaagcg ccgaagacaa tgctgcaaga 13860gtagcagcaa
gtcttgcaaa agtaaccagt gcatgtggga tctttcttaa accagaagag 13920acattcgtac
actcaggttt catttatttc ggaaaaaaac aatatctcaa tggtgtacaa 13980ttaccgcaat
cactcaaaac agcagcaaga atggcgccac tctctgatgc tatattcgat 14040gatctacaag
gaacacttgc cagtattgga actgccttcg aacgtgctat atcggaaacg 14100cgacatatcc
tcccatgtcg tattgtagca gctttccata cgtatttcgc cgttcggatt 14160ttacaatatc
accatcttgg atttaataaa ggcatcgatt tagggcagtt gtcacttagt 14220aaaccattag
actatgggac tattactcta acattggcgg ttccacaagt ccttggggga 14280ttgtcttttc
taaatccaga aaagtgtttt tatcgaaact tcggagatcc tgtgacttct 14340ggacttttcc
agctacgggt gtacctagaa atggttaaca tgaaagacct attttatcca 14400ttaatatcga
aaaatccagg aaattgtagt gccattgatt ttgtcttaaa tccatccgga 14460ttaaatgttc
caggatcaca agacttgaca tcctttttgc gacagatcgt taggcgtagt 14520attacactaa
ctgcaagaaa taagttaatt aacactctct tccatgcctc tgctgatttg 14580gaagatgaga
tggtttgtaa atggctcctt tcatcaaacc ctgtcatgag tcgctttgca 14640gcggatattt
tttccaggac acctagtggt aaacgtctcc aaatattagg ttatcttgaa 14700gggaccagga
ctctattggc ctccaaaatc ataaacaaca acagtgagac acctgtactt 14760gataagctga
ggaagatcac cctacaaaga tggaatctgt ggttcagtta tttggaccat 14820tgtgaccaat
tactagcaga tgctctacag aaaattagtt gcacggtgga tttggcccag 14880attttgcgtg
agtatacatg gtcacacatc ttagagggta gaccattgat cggagcgaca 14940ttaccatgta
tggtggagca attcaaagtt aagtggctaa gacaatatga accttgtcca 15000gaatgcctca
acaaaaaagg ctcaaatgct tatgtctcag ttgcagtcaa agatcaagtg 15060gtcagtgctt
ggcctaatac ttctcgaata agttggacaa tagggagtgg tgtcccctat 15120atagggtcaa
gaaccgagga taaaatcgga cagcctgcaa tcaagccgcg atgcccttca 15180tctgccctca
aggaggctat agaattagca tcaaggctca cttgggttac acaaggaagt 15240tctaatagtg
aacaattaat ccggcctttc ttagaagcga gagtcaacct tagtgtcagt 15300gaagtcctgc
aaatgacacc atcacattat tcaggaaata ttgtccatcg atataacgac 15360caatatagcc
cgcactcatt tatggcgaat cgcatgagca atactgcgac ccgtctcata 15420gtgtcaacta
atacacttgg agaattttca ggtggagggc aggccgccag ggatagcaat 15480ataattttcc
agaatgttat aaatttagca gttgcccttt atgatattag attccggaat 15540acgaacacct
ctgatataag gcataatagg gctcatcttc acctgacaga gtgctgtact 15600aaagaggtcc
cggcccagta tttgacatat acaagtgcac tcaatctgga tttaagccgt 15660tatcgtgata
atgaactaat atatgactca aatccactga ggggaggatt gaactgcaat 15720ttaacaatgg
atagtccttt agtgaagggt cctaggctta acatgattga agatgatctt 15780ctccgctttc
cacacctttc tggatgggag ttagcgaaaa cggtggtaca atccatcatc 15840tcagacaata
gcaactcatc aacagatcca atcagtagcg gagaaacacg ctctttcaca 15900actcattttc
tcacttaccc tcagattggc cttctttaca gtttcggggc agtattatgc 15960ttttatctag
gcaatactat cctatggact aaaaaacttg attatgaaca gtttctatat 16020tatttgcata
accagctgca caacttacct catcgagcac tccgtgtttt taaaccaaca 16080tttaagcatg
ccagtgtgat gtcccgatta atggaaattg attccaactt ctcaatttat 16140attggcggga
catctggaga tcgagggctg tctgatgctg ctcgactgtt tcttcggaca 16200gcaatcgcga
gttttttaca atttcttaaa agctggatca tcgatcgcca aaaggcaatt 16260cctttatgga
tagtatatcc gcttgaaggt caacagccgg aatccatcaa tgaatttcta 16320cataaaattt
ttggtctgct caaacaaggc cccaaaaata ttccaaagga ggtcagcatt 16380caaaatgatg
gacatttgga tttggcagaa aataattatg tttacaatag taagagcact 16440gctagtaatt
tcttccatgc atccttagct tactggagaa gtaggaaatc tcggaaaact 16500caagaccata
atgatttctc aagaggggat ggaacactta cagaacccgt gtgtaagttc 16560tcaagcaatc
atcagtcaga tgaaaagtac tacaatgtga catgtggaaa gtcaccgaag 16620ccgcaagaac
gcaaagactt ctcgcaatac agactcagca ataacgggca aacaatgagt 16680aatcatcgta
agaaagggaa gttccacaag tggaatccct gcaaagtgtt aatggagagt 16740caaaggggaa
ctgttctaaa agagggtgac tactttcaaa acaatactcc accaacagat 16800gatgtatcaa
gtcctcaccg actcattcta ccatttttta aattgggaaa tcacaaccat 16860gcacatgatc
aagatgccca agaattgata aatcaaaata ttaaacagta cctacatcag 16920ctaaggtcta
tgttggacac cactatatat tgtagattca cagggattgt ctcatccatg 16980cattacaaat
tggacgaagt tcttctagaa tacaatagtt tcgattcagc tatcacatta 17040gctgaaggtg
aggggtcagg ggctctatta cttttgcaga aatatagtac aaggttatta 17100tttttgaaca
cattggcaac agaacacagt atagagtcag aagttgtatc aggtttttct 17160actccgagaa
tgttgttacc aataatgcaa aaggttcatg aaggacaagt cactgttatc 17220ttaaataatt
cagcaagtca gataactgac ataactagct caatgtggtt aagtaatcaa 17280aaatataatc
taccttgtca agttgaaatc attacgatgg atgctgaaac aacagagaac 17340ttaaacaggt
cccaactcta ccgagcagta tataacttaa tacttgatca cattgatccg 17400cagtatctca
aggtggtggt actcaaagta tttctgagtg atatagaagg aatattatgg 17460attaatgatt
acttggctcc attattcggg gctggttact tgattaaacc gattacatca 17520agtgcccggt
caagtgaatg gtacctttgc ttatcaaatt tgatatctac taacaggaga 17580tcggcccatc
agactcacaa ggcatgtctt ggtgttatca gagatgcttt gcaagcacaa 17640gtccagcgag
gcgtgtactg gttgagtcac atcgcacagt atgctacaaa gaatctccat 17700tgtgaataca
tatgccttgg tttcccacct ctagaaaagg tcctatatca caggtataat 17760ctagttgata
ctggactcgg tccattgtcg tcagttatta gacatttaac taacctccag 17820gcagagatac
gagacttagt attagattat accctgatga gagagagtcg cactcaaacg 17880taccatttta
ttaagactgc aaaaggcaga atcacaaagt tagtcaatga ctttctgaag 17940ttttctttaa
ttgtccaggc actcaaaaat aattcttctt ggtatactga gcttaaaaaa 18000ttacctgagg
tgattaatgt gtgtaatcga ttttatcata ctcacagttg cgaatgtcag 18060gaaaaattct
ttgtccagac gctttattta caacgcctac gcgatgcaga aatcaagcta 18120attgaacgcc
ttaccgggtt aatgcgattt tatccagaag ggttaatata ttccaatcac 18180acataggtac
taaatcatca tagtatgagg aataaaataa tgataattcc tgacgacagt 18240tttagttccg
attctaagta tatcggaaga gagtatgcca atcttaatta ttaaaggtaa 18300caagctatta
gttattactt attgataaga ataaacttta tcatagcgta acacatcata 18360actttatagc
gattttgcat ttctaatcct agtatttatt agaatgtact atcagagaaa 18420tgaccccagt
tcctatcttt aaataatgat tgtgtgtatt aaattattag tttattaggt 18480ttatgagttg
gttacacagt gagtattagt aattgaggat tatgtagata ggtaatctaa 18540cactgaatca
cccatctgat gtcaccatat ccaaatattg tgctagtcgc atttaaacat 18600gctatcttca
gttaagtaac atagactgaa aatgctaaga agagattgga gtaaaagtat 18660aaaataaatt
taattaaact tcaaagtgat taaatgataa tgatcttggg aactcgatat 18720gacctcaagt
caaaaataat gtcaatataa ttgtttagta atatgagtta taatgtgaat 18780tttgataact
aactagcttt agtagttaag atcaaatgca aacattctaa gaatgttaag 18840cgcacacaaa
aacattataa aaaaccaatt ttttcctttt tgtgtgtccc
1889010739PRTZaire Ebola virus 10Met Asp Ser Arg Pro Gln Lys Ile Trp Met
Ala Pro Ser Leu Thr Glu1 5 10
15Ser Asp Met Asp Tyr His Lys Ile Leu Thr Ala Gly Leu Ser Val Gln
20 25 30Gln Gly Ile Val Arg Gln
Arg Val Ile Pro Val Tyr Gln Val Asn Asn 35 40
45Leu Glu Glu Ile Cys Gln Leu Ile Ile Gln Ala Phe Glu Ala
Gly Val 50 55 60Asp Phe Gln Glu Ser
Ala Asp Ser Phe Leu Leu Met Leu Cys Leu His65 70
75 80His Ala Tyr Gln Gly Asp Tyr Lys Leu Phe
Leu Glu Ser Gly Ala Val 85 90
95Lys Tyr Leu Glu Gly His Gly Phe Arg Phe Glu Val Lys Lys Arg Asp
100 105 110Gly Val Lys Arg Leu
Glu Glu Leu Leu Pro Ala Val Ser Ser Gly Lys 115
120 125Asn Ile Lys Arg Thr Leu Ala Ala Met Pro Glu Glu
Glu Thr Thr Glu 130 135 140Ala Asn Ala
Gly Gln Phe Leu Ser Phe Ala Ser Leu Phe Leu Pro Lys145
150 155 160Leu Val Val Gly Glu Lys Ala
Cys Leu Glu Lys Val Gln Arg Gln Ile 165
170 175Gln Val His Ala Glu Gln Gly Leu Ile Gln Tyr Pro
Thr Ala Trp Gln 180 185 190Ser
Val Gly His Met Met Val Ile Phe Arg Leu Met Arg Thr Asn Phe 195
200 205Leu Ile Lys Phe Leu Leu Ile His Gln
Gly Met His Met Val Ala Gly 210 215
220His Asp Ala Asn Asp Ala Val Ile Ser Asn Ser Val Ala Gln Ala Arg225
230 235 240Phe Ser Gly Leu
Leu Ile Val Lys Thr Val Leu Asp His Ile Leu Gln 245
250 255Lys Thr Glu Arg Gly Val Arg Leu His Pro
Leu Ala Arg Thr Ala Lys 260 265
270Val Lys Asn Glu Val Asn Ser Phe Lys Ala Ala Leu Ser Ser Leu Ala
275 280 285Lys His Gly Glu Tyr Ala Pro
Phe Ala Arg Leu Leu Asn Leu Ser Gly 290 295
300Val Asn Asn Leu Glu His Gly Leu Phe Pro Gln Leu Ser Ala Ile
Ala305 310 315 320Leu Gly
Val Ala Thr Ala His Gly Ser Thr Leu Ala Gly Val Asn Val
325 330 335Gly Glu Gln Tyr Gln Gln Leu
Arg Glu Ala Ala Thr Glu Ala Glu Lys 340 345
350Gln Leu Gln Gln Tyr Ala Glu Ser Arg Glu Leu Asp His Leu
Gly Leu 355 360 365Asp Asp Gln Glu
Lys Lys Ile Leu Met Asn Phe His Gln Lys Lys Asn 370
375 380Glu Ile Ser Phe Gln Gln Thr Asn Ala Met Val Thr
Leu Arg Lys Glu385 390 395
400Arg Leu Ala Lys Leu Thr Glu Ala Ile Thr Ala Ala Ser Leu Pro Lys
405 410 415Thr Ser Gly His Tyr
Asp Asp Asp Asp Asp Ile Pro Phe Pro Gly Pro 420
425 430Ile Asn Asp Asp Asp Asn Pro Gly His Gln Asp Asp
Asp Pro Thr Asp 435 440 445Ser Gln
Asp Thr Thr Ile Pro Asp Val Val Val Asp Pro Asp Asp Gly 450
455 460Ser Tyr Gly Glu Tyr Gln Ser Tyr Ser Glu Asn
Gly Met Asn Ala Pro465 470 475
480Asp Asp Leu Val Leu Phe Asp Leu Asp Glu Asp Asp Glu Asp Thr Lys
485 490 495Pro Val Pro Asn
Arg Ser Thr Lys Gly Gly Gln Gln Lys Asn Ser Gln 500
505 510Lys Gly Gln His Ile Glu Gly Arg Gln Thr Gln
Ser Arg Pro Ile Gln 515 520 525Asn
Val Pro Gly Pro His Arg Thr Ile His His Ala Ser Ala Pro Leu 530
535 540Thr Asp Asn Asp Arg Arg Asn Glu Pro Ser
Gly Ser Thr Ser Pro Arg545 550 555
560Met Leu Thr Pro Ile Asn Glu Glu Ala Asp Pro Leu Asp Asp Ala
Asp 565 570 575Asp Glu Thr
Ser Ser Leu Pro Pro Leu Glu Ser Asp Asp Glu Glu Gln 580
585 590Asp Arg Asp Gly Thr Ser Asn Arg Thr Pro
Thr Val Ala Pro Pro Ala 595 600
605Pro Val Tyr Arg Asp His Ser Glu Lys Lys Glu Leu Pro Gln Asp Glu 610
615 620Gln Gln Asp Gln Asp His Thr Gln
Glu Ala Arg Asn Gln Asp Ser Asp625 630
635 640Asn Thr Gln Ser Glu His Ser Phe Glu Glu Met Tyr
Arg His Ile Leu 645 650
655Arg Ser Gln Gly Pro Phe Asp Ala Val Leu Tyr Tyr His Met Met Lys
660 665 670Asp Glu Pro Val Val Phe
Ser Thr Ser Asp Gly Lys Glu Tyr Thr Tyr 675 680
685Pro Asp Ser Leu Glu Glu Glu Tyr Pro Pro Trp Leu Thr Glu
Lys Glu 690 695 700Ala Met Asn Glu Glu
Asn Arg Phe Val Thr Leu Asp Gly Gln Gln Phe705 710
715 720Tyr Trp Pro Val Met Asn His Lys Asn Lys
Phe Met Ala Ile Leu Gln 725 730
735His His Gln11340PRTZaire Ebola virus 11Met Thr Thr Arg Thr Lys
Gly Arg Gly His Thr Ala Ala Thr Thr Gln1 5
10 15Asn Asp Arg Met Pro Gly Pro Glu Leu Ser Gly Trp
Ile Ser Glu Gln 20 25 30Leu
Met Thr Gly Arg Ile Pro Val Ser Asp Ile Phe Cys Asp Ile Glu 35
40 45Asn Asn Pro Gly Leu Cys Tyr Ala Ser
Gln Met Gln Gln Thr Lys Pro 50 55
60Asn Pro Lys Thr Arg Asn Ser Gln Thr Gln Thr Asp Pro Ile Cys Asn65
70 75 80His Ser Phe Glu Glu
Val Val Gln Thr Leu Ala Ser Leu Ala Thr Val 85
90 95Val Gln Gln Gln Thr Ile Ala Ser Glu Ser Leu
Glu Gln Arg Ile Thr 100 105
110Ser Leu Glu Asn Gly Leu Lys Pro Val Tyr Asp Met Ala Lys Thr Ile
115 120 125Ser Ser Leu Asn Arg Val Cys
Ala Glu Met Val Ala Lys Tyr Asp Leu 130 135
140Leu Val Met Thr Thr Gly Arg Ala Thr Ala Thr Ala Ala Ala Thr
Glu145 150 155 160Ala Tyr
Trp Ala Glu His Gly Gln Pro Pro Pro Gly Pro Ser Leu Tyr
165 170 175Glu Glu Ser Ala Ile Arg Gly
Lys Ile Glu Ser Arg Asp Glu Thr Val 180 185
190Pro Gln Ser Val Arg Glu Ala Phe Asn Asn Leu Asn Ser Thr
Thr Ser 195 200 205Leu Thr Glu Glu
Asn Phe Gly Lys Pro Asp Ile Ser Ala Lys Asp Leu 210
215 220Arg Asn Ile Met Tyr Asp His Leu Pro Gly Phe Gly
Thr Ala Phe His225 230 235
240Gln Leu Val Gln Val Ile Cys Lys Leu Gly Lys Asp Ser Asn Ser Leu
245 250 255Asp Ile Ile His Ala
Glu Phe Gln Ala Ser Leu Ala Glu Gly Asp Ser 260
265 270Pro Gln Cys Ala Leu Ile Gln Ile Thr Lys Arg Val
Pro Ile Phe Gln 275 280 285Asp Ala
Ala Pro Pro Val Ile His Ile Arg Ser Arg Gly Asp Ile Pro 290
295 300Arg Ala Cys Gln Lys Ser Leu Arg Pro Val Pro
Pro Ser Pro Lys Ile305 310 315
320Asp Arg Gly Trp Val Cys Val Phe Gln Leu Gln Asp Gly Lys Thr Leu
325 330 335Gly Leu Lys Ile
34012326PRTZaire Ebola virus 12Met Arg Arg Val Ile Leu Pro Thr
Ala Pro Pro Glu Tyr Met Glu Ala1 5 10
15Ile Tyr Pro Val Arg Ser Asn Ser Thr Ile Ala Arg Gly Gly
Asn Ser 20 25 30Asn Thr Gly
Phe Leu Thr Pro Glu Ser Val Asn Gly Asp Thr Pro Ser 35
40 45Asn Pro Leu Arg Pro Ile Ala Asp Asp Thr Ile
Asp His Ala Ser His 50 55 60Thr Pro
Gly Ser Val Ser Ser Ala Phe Ile Leu Glu Ala Met Val Asn65
70 75 80Val Ile Ser Gly Pro Lys Val
Leu Met Lys Gln Ile Pro Ile Trp Leu 85 90
95Pro Leu Gly Val Ala Asp Gln Lys Thr Tyr Ser Phe Asp
Ser Thr Thr 100 105 110Ala Ala
Ile Met Leu Ala Ser Tyr Thr Ile Thr His Phe Gly Lys Ala 115
120 125Thr Asn Pro Leu Val Arg Val Asn Arg Leu
Gly Pro Gly Ile Pro Asp 130 135 140His
Pro Leu Arg Leu Leu Arg Ile Gly Asn Gln Ala Phe Leu Gln Glu145
150 155 160Phe Val Leu Pro Pro Val
Gln Leu Pro Gln Tyr Phe Thr Phe Asp Leu 165
170 175Thr Ala Leu Lys Leu Ile Thr Gln Pro Leu Pro Ala
Ala Thr Trp Thr 180 185 190Asp
Asp Thr Pro Thr Gly Ser Asn Gly Ala Leu Arg Pro Gly Ile Ser 195
200 205Phe His Pro Lys Leu Arg Pro Ile Leu
Leu Pro Asn Lys Ser Gly Lys 210 215
220Lys Gly Asn Ser Ala Asp Leu Thr Ser Pro Glu Lys Ile Gln Ala Ile225
230 235 240Met Thr Ser Leu
Gln Asp Phe Lys Ile Val Pro Ile Asp Pro Thr Lys 245
250 255Asn Ile Met Gly Ile Glu Val Pro Glu Thr
Leu Val His Lys Leu Thr 260 265
270Gly Lys Lys Val Thr Ser Lys Asn Gly Gln Pro Ile Ile Pro Val Leu
275 280 285Leu Pro Lys Tyr Ile Gly Leu
Asp Pro Val Ala Pro Gly Asp Leu Thr 290 295
300Met Val Ile Thr Gln Asp Cys Asp Thr Cys His Ser Pro Ala Ser
Leu305 310 315 320Pro Ala
Val Ile Glu Lys 32513676PRTZaire Ebola virus 13Met Gly Val
Thr Gly Ile Leu Gln Leu Pro Arg Asp Arg Phe Lys Arg1 5
10 15Thr Ser Phe Phe Leu Trp Val Ile Ile
Leu Phe Gln Arg Thr Phe Ser 20 25
30Ile Pro Leu Gly Val Ile His Asn Ser Thr Leu Gln Val Ser Asp Val
35 40 45Asp Lys Leu Val Cys Arg Asp
Lys Leu Ser Ser Thr Asn Gln Leu Arg 50 55
60Ser Val Gly Leu Asn Leu Glu Gly Asn Gly Val Ala Thr Asp Val Pro65
70 75 80Ser Ala Thr Lys
Arg Trp Gly Phe Arg Ser Gly Val Pro Pro Lys Val 85
90 95Val Asn Tyr Glu Ala Gly Glu Trp Ala Glu
Asn Cys Tyr Asn Leu Glu 100 105
110Ile Lys Lys Pro Asp Gly Ser Glu Cys Leu Pro Ala Ala Pro Asp Gly
115 120 125Ile Arg Gly Phe Pro Arg Cys
Arg Tyr Val His Lys Val Ser Gly Thr 130 135
140Gly Pro Cys Ala Gly Asp Phe Ala Phe His Lys Glu Gly Ala Phe
Phe145 150 155 160Leu Tyr
Asp Arg Leu Ala Ser Thr Val Ile Tyr Arg Gly Thr Thr Phe
165 170 175Ala Glu Gly Val Val Ala Phe
Leu Ile Leu Pro Gln Ala Lys Lys Asp 180 185
190Phe Phe Ser Ser His Pro Leu Arg Glu Pro Val Asn Ala Thr
Glu Asp 195 200 205Pro Ser Ser Gly
Tyr Tyr Ser Thr Thr Ile Arg Tyr Gln Ala Thr Gly 210
215 220Phe Gly Thr Asn Glu Thr Glu Tyr Leu Phe Glu Val
Asp Asn Leu Thr225 230 235
240Tyr Val Gln Leu Glu Ser Arg Phe Thr Pro Gln Phe Leu Leu Gln Leu
245 250 255Asn Glu Thr Ile Tyr
Thr Ser Gly Lys Arg Ser Asn Thr Thr Gly Lys 260
265 270Leu Ile Trp Lys Val Asn Pro Glu Ile Asp Thr Thr
Ile Gly Glu Trp 275 280 285Ala Phe
Trp Glu Thr Lys Lys Asn Leu Thr Arg Lys Ile Arg Ser Glu 290
295 300Glu Leu Ser Phe Thr Val Val Ser Asn Gly Ala
Lys Asn Ile Ser Gly305 310 315
320Gln Ser Pro Ala Arg Thr Ser Ser Asp Pro Gly Thr Asn Thr Thr Thr
325 330 335Glu Asp His Lys
Ile Met Ala Ser Glu Asn Ser Ser Ala Met Val Gln 340
345 350Val His Ser Gln Gly Arg Glu Ala Ala Val Ser
His Leu Thr Thr Leu 355 360 365Ala
Thr Ile Ser Thr Ser Pro Gln Ser Leu Thr Thr Lys Pro Gly Pro 370
375 380Asp Asn Ser Thr His Asn Thr Pro Val Tyr
Lys Leu Asp Ile Ser Glu385 390 395
400Ala Thr Gln Val Glu Gln His His Arg Arg Thr Asp Asn Asp Ser
Thr 405 410 415Ala Ser Asp
Thr Pro Ser Ala Thr Thr Ala Ala Gly Pro Pro Lys Ala 420
425 430Glu Asn Thr Asn Thr Ser Lys Ser Thr Asp
Phe Leu Asp Pro Ala Thr 435 440
445Thr Thr Ser Pro Gln Asn His Ser Glu Thr Ala Gly Asn Asn Asn Thr 450
455 460His His Gln Asp Thr Gly Glu Glu
Ser Ala Ser Ser Gly Lys Leu Gly465 470
475 480Leu Ile Thr Asn Thr Ile Ala Gly Val Ala Gly Leu
Ile Thr Gly Gly 485 490
495Arg Arg Thr Arg Arg Glu Ala Ile Val Asn Ala Gln Pro Lys Cys Asn
500 505 510Pro Asn Leu His Tyr Trp
Thr Thr Gln Asp Glu Gly Ala Ala Ile Gly 515 520
525Leu Ala Trp Ile Pro Tyr Phe Gly Pro Ala Ala Glu Gly Ile
Tyr Ile 530 535 540Glu Gly Leu Met His
Asn Gln Asp Gly Leu Ile Cys Gly Leu Arg Gln545 550
555 560Leu Ala Asn Glu Thr Thr Gln Ala Leu Gln
Leu Phe Leu Arg Ala Thr 565 570
575Thr Glu Leu Arg Thr Phe Ser Ile Leu Asn Arg Lys Ala Ile Asp Phe
580 585 590Leu Leu Gln Arg Trp
Gly Gly Thr Cys His Ile Leu Gly Pro Asp Cys 595
600 605Cys Ile Glu Pro His Asp Trp Thr Lys Asn Ile Thr
Asp Lys Ile Asp 610 615 620Gln Ile Ile
His Asp Phe Val Asp Lys Thr Leu Pro Asp Gln Gly Asp625
630 635 640Asn Asp Asn Trp Trp Thr Gly
Trp Arg Gln Trp Ile Pro Ala Gly Ile 645
650 655Gly Val Thr Gly Val Ile Ile Ala Val Ile Ala Leu
Phe Cys Ile Cys 660 665 670Lys
Phe Val Phe 67514364PRTZaire Ebola virus 14Met Gly Val Thr Gly Ile
Leu Gln Leu Pro Arg Asp Arg Phe Lys Arg1 5
10 15Thr Ser Phe Phe Leu Trp Val Ile Ile Leu Phe Gln
Arg Thr Phe Ser 20 25 30Ile
Pro Leu Gly Val Ile His Asn Ser Thr Leu Gln Val Ser Asp Val 35
40 45Asp Lys Leu Val Cys Arg Asp Lys Leu
Ser Ser Thr Asn Gln Leu Arg 50 55
60Ser Val Gly Leu Asn Leu Glu Gly Asn Gly Val Ala Thr Asp Val Pro65
70 75 80Ser Ala Thr Lys Arg
Trp Gly Phe Arg Ser Gly Val Pro Pro Lys Val 85
90 95Val Asn Tyr Glu Ala Gly Glu Trp Ala Glu Asn
Cys Tyr Asn Leu Glu 100 105
110Ile Lys Lys Pro Asp Gly Ser Glu Cys Leu Pro Ala Ala Pro Asp Gly
115 120 125Ile Arg Gly Phe Pro Arg Cys
Arg Tyr Val His Lys Val Ser Gly Thr 130 135
140Gly Pro Cys Ala Gly Asp Phe Ala Phe His Lys Glu Gly Ala Phe
Phe145 150 155 160Leu Tyr
Asp Arg Leu Ala Ser Thr Val Ile Tyr Arg Gly Thr Thr Phe
165 170 175Ala Glu Gly Val Val Ala Phe
Leu Ile Leu Pro Gln Ala Lys Lys Asp 180 185
190Phe Phe Ser Ser His Pro Leu Arg Glu Pro Val Asn Ala Thr
Glu Asp 195 200 205Pro Ser Ser Gly
Tyr Tyr Ser Thr Thr Ile Arg Tyr Gln Ala Thr Gly 210
215 220Phe Gly Thr Asn Glu Thr Glu Tyr Leu Phe Glu Val
Asp Asn Leu Thr225 230 235
240Tyr Val Gln Leu Glu Ser Arg Phe Thr Pro Gln Phe Leu Leu Gln Leu
245 250 255Asn Glu Thr Ile Tyr
Thr Ser Gly Lys Arg Ser Asn Thr Thr Gly Lys 260
265 270Leu Ile Trp Lys Val Asn Pro Glu Ile Asp Thr Thr
Ile Gly Glu Trp 275 280 285Ala Phe
Trp Glu Thr Lys Lys Thr Ser Leu Glu Lys Phe Ala Val Lys 290
295 300Ser Cys Leu Ser Gln Leu Tyr Gln Thr Glu Pro
Lys Thr Ser Val Val305 310 315
320Arg Val Arg Arg Glu Leu Leu Pro Thr Gln Gly Pro Thr Gln Gln Leu
325 330 335Lys Thr Thr Lys
Ser Trp Leu Gln Lys Ile Pro Leu Gln Trp Phe Lys 340
345 350Cys Thr Val Lys Glu Gly Lys Leu Gln Cys Arg
Ile 355 36015297PRTZaire Ebola virus 15Met Gly Val
Thr Gly Ile Leu Gln Leu Pro Arg Asp Arg Phe Lys Arg1 5
10 15Thr Ser Phe Phe Leu Trp Val Ile Ile
Leu Phe Gln Arg Thr Phe Ser 20 25
30Ile Pro Leu Gly Val Ile His Asn Ser Thr Leu Gln Val Ser Asp Val
35 40 45Asp Lys Leu Val Cys Arg Asp
Lys Leu Ser Ser Thr Asn Gln Leu Arg 50 55
60Ser Val Gly Leu Asn Leu Glu Gly Asn Gly Val Ala Thr Asp Val Pro65
70 75 80Ser Ala Thr Lys
Arg Trp Gly Phe Arg Ser Gly Val Pro Pro Lys Val 85
90 95Val Asn Tyr Glu Ala Gly Glu Trp Ala Glu
Asn Cys Tyr Asn Leu Glu 100 105
110 Ile Lys Lys Pro Asp Gly Ser Glu Cys Leu Pro Ala Ala Pro Asp Gly
115 120 125Ile Arg Gly Phe Pro Arg Cys
Arg Tyr Val His Lys Val Ser Gly Thr 130 135
140Gly Pro Cys Ala Gly Asp Phe Ala Phe His Lys Glu Gly Ala Phe
Phe145 150 155 160Leu Tyr
Asp Arg Leu Ala Ser Thr Val Ile Tyr Arg Gly Thr Thr Phe
165 170 175Ala Glu Gly Val Val Ala Phe
Leu Ile Leu Pro Gln Ala Lys Lys Asp 180 185
190Phe Phe Ser Ser His Pro Leu Arg Glu Pro Val Asn Ala Thr
Glu Asp 195 200 205Pro Ser Ser Gly
Tyr Tyr Ser Thr Thr Ile Arg Tyr Gln Ala Thr Gly 210
215 220Phe Gly Thr Asn Glu Thr Glu Tyr Leu Phe Glu Val
Asp Asn Leu Thr225 230 235
240Tyr Val Gln Leu Glu Ser Arg Phe Thr Pro Gln Phe Leu Leu Gln Leu
245 250 255Asn Glu Thr Ile Tyr
Thr Ser Gly Lys Arg Ser Asn Thr Thr Gly Lys 260
265 270Leu Ile Trp Lys Val Asn Pro Glu Ile Asp Thr Thr
Ile Gly Glu Trp 275 280 285Ala Phe
Trp Glu Thr Lys Lys Pro His 290 29516288PRTZaire Ebola
virus 16Met Glu Ala Ser Tyr Glu Arg Gly Arg Pro Arg Ala Ala Arg Gln His1
5 10 15Ser Arg Asp Gly
His Asp His His Val Arg Ala Arg Ser Ser Ser Arg 20
25 30Glu Asn Tyr Arg Gly Glu Tyr Arg Gln Ser Arg
Ser Ala Ser Gln Val 35 40 45Arg
Val Pro Thr Val Phe His Lys Lys Arg Val Glu Pro Leu Thr Val 50
55 60Pro Pro Ala Pro Lys Asp Ile Cys Pro Thr
Leu Lys Lys Gly Phe Leu65 70 75
80Cys Asp Ser Ser Phe Cys Lys Lys Asp His Gln Leu Glu Ser Leu
Thr 85 90 95Asp Arg Glu
Leu Leu Leu Leu Ile Ala Arg Lys Thr Cys Gly Ser Val 100
105 110Glu Gln Gln Leu Asn Ile Thr Ala Pro Lys
Asp Ser Arg Leu Ala Asn 115 120
125Pro Thr Ala Asp Asp Phe Gln Gln Glu Glu Gly Pro Lys Ile Thr Leu 130
135 140Leu Thr Leu Ile Lys Thr Ala Glu
His Trp Ala Arg Gln Asp Ile Arg145 150
155 160Thr Ile Glu Asp Ser Lys Leu Arg Ala Leu Leu Thr
Leu Cys Ala Val 165 170
175Met Thr Arg Lys Phe Ser Lys Ser Gln Leu Ser Leu Leu Cys Glu Thr
180 185 190His Leu Arg Arg Glu Gly
Leu Gly Gln Asp Gln Ala Glu Pro Val Leu 195 200
205Glu Val Tyr Gln Arg Leu His Ser Asp Lys Gly Gly Ser Phe
Glu Ala 210 215 220Ala Leu Trp Gln Gln
Trp Asp Arg Gln Ser Leu Ile Met Phe Ile Thr225 230
235 240Ala Phe Leu Asn Ile Ala Leu Gln Leu Pro
Cys Glu Ser Ser Ala Val 245 250
255Val Val Ser Gly Leu Arg Thr Leu Val Pro Gln Ser Asp Asn Glu Glu
260 265 270Ala Ser Thr Asn Pro
Gly Thr Cys Ser Trp Ser Asp Glu Gly Thr Pro 275
280 28517251PRTZaire Ebola virus 17Met Ala Lys Ala Thr
Gly Arg Tyr Asn Leu Ile Ser Pro Lys Lys Asp1 5
10 15Leu Glu Lys Gly Val Val Leu Ser Asp Leu Cys
Asn Phe Leu Val Ser 20 25
30Gln Thr Ile Gln Gly Trp Lys Val Tyr Trp Ala Gly Ile Glu Phe Asp
35 40 45Val Thr His Lys Gly Met Ala Leu
Leu His Arg Leu Lys Thr Asn Asp 50 55
60Phe Ala Pro Ala Trp Ser Met Thr Arg Asn Leu Phe Pro His Leu Phe65
70 75 80Gln Asn Pro Asn Ser
Thr Ile Glu Ser Pro Leu Trp Ala Leu Arg Val 85
90 95Ile Leu Ala Ala Gly Ile Gln Asp Gln Leu Ile
Asp Gln Ser Leu Ile 100 105
110Glu Pro Leu Ala Gly Ala Leu Gly Leu Ile Ser Asp Trp Leu Leu Thr
115 120 125Thr Asn Thr Asn His Phe Asn
Met Arg Thr Gln Arg Val Lys Glu Gln 130 135
140Leu Ser Leu Lys Met Leu Ser Leu Ile Arg Ser Asn Ile Leu Lys
Phe145 150 155 160Ile Asn
Lys Leu Asp Ala Leu His Val Val Asn Tyr Asn Gly Leu Leu
165 170 175Ser Ser Ile Glu Ile Gly Thr
Gln Asn His Thr Ile Ile Ile Thr Arg 180 185
190Thr Asn Met Gly Phe Leu Val Glu Leu Gln Glu Pro Asp Lys
Ser Ala 195 200 205Met Asn Arg Met
Lys Pro Gly Pro Ala Lys Phe Ser Leu Leu His Glu 210
215 220Ser Thr Leu Lys Ala Phe Thr Gln Gly Ser Ser Thr
Arg Met Gln Ser225 230 235
240Leu Ile Leu Glu Phe Asn Ser Ser Leu Ala Ile 245
250182212PRTZaire Ebola virus 18Met Ala Thr Gln His Thr Gln Tyr
Pro Asp Ala Arg Leu Ser Ser Pro1 5 10
15Ile Val Leu Asp Gln Cys Asp Leu Val Thr Arg Ala Cys Gly
Leu Tyr 20 25 30Ser Ser Tyr
Ser Leu Asn Pro Gln Leu Arg Asn Cys Lys Leu Pro Lys 35
40 45His Ile Tyr Arg Leu Lys Tyr Asp Val Thr Val
Thr Lys Phe Leu Ser 50 55 60Asp Val
Pro Val Ala Thr Leu Pro Ile Asp Phe Ile Val Pro Val Leu65
70 75 80Leu Lys Ala Leu Ser Gly Asn
Gly Phe Cys Pro Val Glu Pro Arg Cys 85 90
95Gln Gln Phe Leu Asp Glu Ile Ile Lys Tyr Thr Met Gln
Asp Ala Leu 100 105 110Phe Leu
Lys Tyr Tyr Leu Lys Asn Val Gly Ala Gln Glu Asp Cys Val 115
120 125Asp Glu His Phe Gln Glu Lys Ile Leu Ser
Ser Ile Gln Gly Asn Glu 130 135 140Phe
Leu His Gln Met Phe Phe Trp Tyr Asp Leu Ala Ile Leu Thr Arg145
150 155 160Arg Gly Arg Leu Asn Arg
Gly Asn Ser Arg Ser Thr Trp Phe Val His 165
170 175Asp Asp Leu Ile Asp Ile Leu Gly Tyr Gly Asp Tyr
Val Phe Trp Lys 180 185 190Ile
Pro Ile Ser Met Leu Pro Leu Asn Thr Gln Gly Ile Pro His Ala 195
200 205Ala Met Asp Trp Tyr Gln Ala Ser Val
Phe Lys Glu Ala Val Gln Gly 210 215
220His Thr His Ile Val Ser Val Ser Thr Ala Asp Val Leu Ile Met Cys225
230 235 240Lys Asp Leu Ile
Thr Cys Arg Phe Asn Thr Thr Leu Ile Ser Lys Ile 245
250 255Ala Glu Ile Glu Asp Pro Val Cys Ser Asp
Tyr Pro Asn Phe Lys Ile 260 265
270Val Ser Met Leu Tyr Gln Ser Gly Asp Tyr Leu Leu Ser Ile Leu Gly
275 280 285Ser Asp Gly Tyr Lys Ile Ile
Lys Phe Leu Glu Pro Leu Cys Leu Ala 290 295
300Lys Ile Gln Leu Cys Ser Lys Tyr Thr Glu Arg Lys Gly Arg Phe
Leu305 310 315 320Thr Gln
Met His Leu Ala Val Asn His Thr Leu Glu Glu Ile Thr Glu
325 330 335Met Arg Ala Leu Lys Pro Ser
Gln Ala Gln Lys Ile Arg Glu Phe His 340 345
350Arg Thr Leu Ile Arg Leu Glu Met Thr Pro Gln Gln Leu Cys
Glu Leu 355 360 365Phe Ser Ile Gln
Lys His Trp Gly His Pro Val Leu His Ser Glu Thr 370
375 380Ala Ile Gln Lys Val Lys Lys His Ala Thr Val Leu
Lys Ala Leu Arg385 390 395
400Pro Ile Val Ile Phe Glu Thr Tyr Cys Val Phe Lys Tyr Ser Ile Ala
405 410 415Lys His Tyr Phe Asp
Ser Gln Gly Ser Trp Tyr Ser Val Thr Ser Asp 420
425 430Arg Asn Leu Thr Pro Gly Leu Asn Ser Tyr Ile Lys
Arg Asn Gln Phe 435 440 445Pro Pro
Leu Pro Met Ile Lys Glu Leu Leu Trp Glu Phe Tyr His Leu 450
455 460Asp His Pro Pro Leu Phe Ser Thr Lys Ile Ile
Ser Asp Leu Ser Ile465 470 475
480Phe Ile Lys Asp Arg Ala Thr Ala Val Glu Arg Thr Cys Trp Asp Ala
485 490 495Val Phe Glu Pro
Asn Val Leu Gly Tyr Asn Pro Pro His Lys Phe Ser 500
505 510Thr Lys Arg Val Pro Glu Gln Phe Leu Glu Gln
Glu Asn Phe Ser Ile 515 520 525Glu
Asn Val Leu Ser Tyr Ala Gln Lys Leu Glu Tyr Leu Leu Pro Gln 530
535 540Tyr Arg Asn Phe Ser Phe Ser Leu Lys Glu
Lys Glu Leu Asn Val Gly545 550 555
560Arg Thr Phe Gly Lys Leu Pro Tyr Pro Thr Arg Asn Val Gln Thr
Leu 565 570 575Cys Glu Ala
Leu Leu Ala Asp Gly Leu Ala Lys Ala Phe Pro Ser Asn 580
585 590Met Met Val Val Thr Glu Arg Glu Gln Lys
Glu Ser Leu Leu His Gln 595 600
605Ala Ser Trp His His Thr Ser Asp Asp Phe Gly Glu His Ala Thr Val 610
615 620Arg Gly Ser Ser Phe Val Thr Asp
Leu Glu Lys Tyr Asn Leu Ala Phe625 630
635 640Arg Tyr Glu Phe Thr Ala Pro Phe Ile Glu Tyr Cys
Asn Arg Cys Tyr 645 650
655Gly Val Lys Asn Val Phe Asn Trp Met His Tyr Thr Ile Pro Gln Cys
660 665 670Tyr Met His Val Ser Asp
Tyr Tyr Asn Pro Pro His Asn Leu Thr Leu 675 680
685Glu Asn Arg Asp Asn Pro Pro Glu Gly Pro Ser Ser Tyr Arg
Gly His 690 695 700Met Gly Gly Ile Glu
Gly Leu Gln Gln Lys Leu Trp Thr Ser Ile Ser705 710
715 720Cys Ala Gln Ile Ser Leu Val Glu Ile Lys
Thr Gly Phe Lys Leu Arg 725 730
735Ser Ala Val Met Gly Asp Asn Gln Cys Ile Thr Val Leu Ser Val Phe
740 745 750Pro Leu Glu Thr Asp
Ala Asp Glu Gln Glu Gln Ser Ala Glu Asp Asn 755
760 765Ala Ala Arg Val Ala Ala Ser Leu Ala Lys Val Thr
Ser Ala Cys Gly 770 775 780Ile Phe Leu
Lys Pro Asp Glu Thr Phe Val His Ser Gly Phe Ile Tyr785
790 795 800Phe Gly Lys Lys Gln Tyr Leu
Asn Gly Val Gln Leu Pro Gln Ser Leu 805
810 815Lys Thr Ala Thr Arg Met Ala Pro Leu Ser Asp Ala
Ile Phe Asp Asp 820 825 830Leu
Gln Gly Thr Leu Ala Ser Ile Gly Thr Ala Phe Glu Arg Ser Ile 835
840 845Ser Glu Thr Arg His Ile Phe Pro Cys
Arg Ile Thr Ala Ala Phe His 850 855
860Thr Phe Phe Ser Val Arg Ile Leu Gln Tyr His His Leu Gly Phe Asn865
870 875 880Lys Gly Phe Asp
Leu Gly Gln Leu Thr Leu Gly Lys Pro Leu Asp Phe 885
890 895Gly Thr Ile Ser Leu Ala Leu Ala Val Pro
Gln Val Leu Gly Gly Leu 900 905
910Ser Phe Leu Asn Pro Glu Lys Cys Phe Tyr Arg Asn Leu Gly Asp Pro
915 920 925Val Thr Ser Gly Leu Phe Gln
Leu Lys Thr Tyr Leu Arg Met Ile Glu 930 935
940Met Asp Asp Leu Phe Leu Pro Leu Ile Ala Lys Asn Pro Gly Asn
Cys945 950 955 960Thr Ala
Ile Asp Phe Val Leu Asn Pro Ser Gly Leu Asn Val Pro Gly
965 970 975Ser Gln Asp Leu Thr Ser Phe
Leu Arg Gln Ile Val Arg Arg Thr Ile 980 985
990Thr Leu Ser Ala Lys Asn Lys Leu Ile Asn Thr Leu Phe His
Ala Ser 995 1000 1005Ala Asp Phe
Glu Asp Glu Met Val Cys Lys Trp Leu Leu Ser Ser Thr 1010
1015 1020Pro Val Met Ser Arg Phe Ala Ala Asp Ile Phe Ser
Arg Thr Pro Ser1025 1030 1035
1040Gly Lys Arg Leu Gln Ile Leu Gly Tyr Leu Glu Gly Thr Arg Thr Leu
1045 1050 1055Leu Ala Ser Lys Ile
Ile Asn Asn Asn Thr Glu Thr Pro Val Leu Asp 1060
1065 1070Arg Leu Arg Lys Ile Thr Leu Gln Arg Trp Ser Leu
Trp Phe Ser Tyr 1075 1080 1085Leu
Asp His Cys Asp Asn Ile Leu Ala Glu Ala Leu Thr Gln Ile Thr 1090
1095 1100Cys Thr Val Asp Leu Ala Gln Ile Leu Arg
Glu Tyr Ser Trp Ala His1105 1110 1115
1120Ile Leu Glu Gly Arg Pro Leu Ile Gly Ala Thr Leu Pro Cys Met
Ile 1125 1130 1135Glu Gln
Phe Lys Val Phe Trp Leu Lys Pro Tyr Glu Gln Cys Pro Gln 1140
1145 1150Cys Ser Asn Ala Lys Gln Pro Gly Gly
Lys Pro Phe Val Ser Val Ala 1155 1160
1165Val Lys Lys His Ile Val Ser Ala Trp Pro Asn Ala Ser Arg Ile Ser
1170 1175 1180Trp Thr Ile Gly Asp Gly Ile
Pro Tyr Ile Gly Ser Arg Thr Glu Asp1185 1190
1195 1200Lys Ile Gly Gln Pro Ala Ile Lys Pro Lys Cys Pro
Ser Ala Ala Leu 1205 1210
1215Arg Glu Ala Ile Glu Leu Ala Ser Arg Leu Thr Trp Val Thr Gln Gly
1220 1225 1230Ser Ser Asn Ser Asp Leu
Leu Ile Lys Pro Phe Leu Glu Ala Arg Val 1235 1240
1245Asn Leu Ser Val Gln Glu Ile Leu Gln Met Thr Pro Ser His
Tyr Ser 1250 1255 1260Gly Asn Ile Val
His Arg Tyr Asn Asp Gln Tyr Ser Pro His Ser Phe1265 1270
1275 1280Met Ala Asn Arg Met Ser Asn Ser Ala
Thr Arg Leu Ile Val Ser Thr 1285 1290
1295Asn Thr Leu Gly Glu Phe Ser Gly Gly Gly Gln Ser Ala Arg Asp
Ser 1300 1305 1310Asn Ile Ile
Phe Gln Asn Val Ile Asn Tyr Ala Val Ala Leu Phe Asp 1315
1320 1325Ile Lys Phe Arg Asn Thr Glu Ala Thr Asp Ile
Gln Tyr Asn Arg Ala 1330 1335 1340His
Leu His Leu Thr Lys Cys Cys Thr Arg Glu Val Pro Ala Gln Tyr1345
1350 1355 1360Leu Thr Tyr Thr Ser Thr
Leu Asp Leu Asp Leu Thr Arg Tyr Arg Glu 1365
1370 1375Asn Glu Leu Ile Tyr Asp Ser Asn Pro Leu Lys Gly
Gly Leu Asn Cys 1380 1385
1390Asn Ile Ser Phe Asp Asn Pro Phe Phe Gln Gly Lys Arg Leu Asn Ile
1395 1400 1405Ile Glu Asp Asp Leu Ile Arg
Leu Pro His Leu Ser Gly Trp Glu Leu 1410 1415
1420Ala Lys Thr Ile Met Gln Ser Ile Ile Ser Asp Ser Asn Asn Ser
Ser1425 1430 1435 1440Thr
Asp Pro Ile Ser Ser Gly Glu Thr Arg Ser Phe Thr Thr His Phe
1445 1450 1455Leu Thr Tyr Pro Lys Ile Gly
Leu Leu Tyr Ser Phe Gly Ala Phe Val 1460 1465
1470Ser Tyr Tyr Leu Gly Asn Thr Ile Leu Arg Thr Lys Lys Leu
Thr Leu 1475 1480 1485Asp Asn Phe
Leu Tyr Tyr Leu Thr Thr Gln Ile His Asn Leu Pro His 1490
1495 1500Arg Ser Leu Arg Ile Leu Lys Pro Thr Phe Lys His
Ala Ser Val Met1505 1510 1515
1520Ser Arg Leu Met Ser Ile Asp Pro His Phe Ser Ile Tyr Ile Gly Gly
1525 1530 1535Ala Ala Gly Asp Arg
Gly Leu Ser Asp Ala Ala Arg Leu Phe Leu Arg 1540
1545 1550Thr Ser Ile Ser Ser Phe Leu Thr Phe Val Lys Glu
Trp Ile Ile Asn 1555 1560 1565Arg
Gly Thr Ile Val Pro Leu Trp Ile Val Tyr Pro Leu Glu Gly Gln 1570
1575 1580Asn Pro Thr Pro Val Asn Asn Phe Leu Tyr
Gln Ile Val Glu Leu Leu1585 1590 1595
1600Val His Asp Ser Ser Arg Gln Gln Ala Phe Lys Thr Thr Ile Ser
Asp 1605 1610 1615His Val
His Pro His Asp Asn Leu Val Tyr Thr Cys Lys Ser Thr Ala 1620
1625 1630Ser Asn Phe Phe His Ala Ser Leu Ala
Tyr Trp Arg Ser Arg His Arg 1635 1640
1645Asn Ser Asn Arg Lys Tyr Leu Ala Arg Asp Ser Ser Thr Gly Ser Ser
1650 1655 1660Thr Asn Asn Ser Asp Gly His
Ile Glu Arg Ser Gln Glu Gln Thr Thr1665 1670
1675 1680Arg Asp Pro His Asp Gly Thr Glu Arg Asn Leu Val
Leu Gln Met Ser 1685 1690
1695His Glu Ile Lys Arg Thr Thr Ile Pro Gln Glu Asn Thr His Gln Gly
1700 1705 1710Pro Ser Phe Gln Ser Phe
Leu Ser Asp Ser Ala Cys Gly Thr Ala Asn 1715 1720
1725Pro Lys Leu Asn Phe Asp Arg Ser Arg His Asn Val Lys Phe
Gln Asp 1730 1735 1740His Asn Ser Ala
Ser Lys Arg Glu Gly His Gln Ile Ile Ser His Arg1745 1750
1755 1760Leu Val Leu Pro Phe Phe Thr Leu Ser
Gln Gly Thr Arg Gln Leu Thr 1765 1770
1775Ser Ser Asn Glu Ser Gln Thr Gln Asp Glu Ile Ser Lys Tyr Leu
Arg 1780 1785 1790Gln Leu Arg
Ser Val Ile Asp Thr Thr Val Tyr Cys Arg Phe Thr Gly 1795
1800 1805Ile Val Ser Ser Met His Tyr Lys Leu Asp Glu
Val Leu Trp Glu Ile 1810 1815 1820Glu
Ser Phe Lys Ser Ala Val Thr Leu Ala Glu Gly Glu Gly Ala Gly1825
1830 1835 1840Ala Leu Leu Leu Ile Gln
Lys Tyr Gln Val Lys Thr Leu Phe Phe Asn 1845
1850 1855Thr Leu Ala Thr Glu Ser Ser Ile Glu Ser Glu Ile
Val Ser Gly Met 1860 1865
1870Thr Thr Pro Arg Met Leu Leu Pro Val Met Ser Lys Phe His Asn Asp
1875 1880 1885Gln Ile Glu Ile Ile Leu Asn
Asn Ser Ala Ser Gln Ile Thr Asp Ile 1890 1895
1900Thr Asn Pro Thr Trp Phe Lys Asp Gln Arg Ala Arg Leu Pro Lys
Gln1905 1910 1915 1920Val
Glu Val Ile Thr Met Asp Ala Glu Thr Thr Glu Asn Ile Asn Arg
1925 1930 1935Ser Lys Leu Tyr Glu Ala Val
Tyr Lys Leu Ile Leu His His Ile Asp 1940 1945
1950Pro Ser Val Leu Lys Ala Val Val Leu Lys Val Phe Leu Ser
Asp Thr 1955 1960 1965Glu Gly Met
Leu Trp Leu Asn Asp Asn Leu Ala Pro Phe Phe Ala Thr 1970
1975 1980Gly Tyr Leu Ile Lys Pro Ile Thr Ser Ser Ala Arg
Ser Ser Glu Trp1985 1990 1995
2000Tyr Leu Cys Leu Thr Asn Phe Leu Ser Thr Thr Arg Lys Met Pro His
2005 2010 2015Gln Asn His Leu Ser
Cys Lys Gln Val Ile Leu Thr Ala Leu Gln Leu 2020
2025 2030Gln Ile Gln Arg Ser Pro Tyr Trp Leu Ser His Leu
Thr Gln Tyr Ala 2035 2040 2045Asp
Cys Glu Leu His Leu Ser Tyr Ile Arg Leu Gly Phe Pro Ser Leu 2050
2055 2060Glu Lys Val Leu Tyr His Arg Tyr Asn Leu
Val Asp Ser Lys Arg Gly2065 2070 2075
2080Pro Leu Val Ser Ile Thr Gln His Leu Ala His Leu Arg Ala Glu
Ile 2085 2090 2095Arg Glu
Leu Thr Asn Asp Tyr Asn Gln Gln Arg Gln Ser Arg Thr Gln 2100
2105 2110Thr Tyr His Phe Ile Arg Thr Ala Lys
Gly Arg Ile Thr Lys Leu Val 2115 2120
2125Asn Asp Tyr Leu Lys Phe Phe Leu Ile Val Gln Ala Leu Lys His Asn
2130 2135 2140Gly Thr Trp Gln Ala Glu Phe
Lys Lys Leu Pro Glu Leu Ile Ser Val2145 2150
2155 2160Cys Asn Arg Phe Tyr His Ile Arg Asp Cys Asn Cys
Glu Glu Arg Phe 2165 2170
2175Leu Val Gln Thr Leu Tyr Leu His Arg Met Gln Asp Ser Glu Val Lys
2180 2185 2190Leu Ile Glu Arg Leu Thr
Gly Leu Leu Ser Leu Phe Pro Asp Gly Leu 2195 2200
2205Tyr Arg Phe Asp 22101918959DNAZaire Ebola virus
19cggacacaca aaaagaaaga agaattttta ggatcttttg tgtgcgaata actatgagga
60agattaataa ttttcctctc attgaaattt atatcggaat ttaaattgaa attgttactg
120taatcacacc tggtttgttt cagagccaca tcacaaagat agagaacaac ctaggtctcc
180gaagggagca agggcatcag tgtgctcagt tgaaaatccc ttgtcaacac ctaggtctta
240tcacatcaca agttccacct cagactctgc agggtgatcc aacaacctta atagaaacat
300tattgttaaa ggacagcatt agttcacagt caaacaagca agattgagaa ttaaccttgg
360ttttgaactt gaacacttag gggattgaag attcaacaac cctaaagctt ggggtaaaac
420attggaaata gttaaaagac aaattgctcg gaatcacaaa attccgagta tggattctcg
480tcctcagaaa atctggatgg cgccgagtct cactgaatct gacatggatt accacaagat
540cttgacagca ggtctgtccg ttcaacaggg gattgttcgg caaagagtca tcccagtgta
600tcaagtaaac aatcttgaag aaatttgcca acttatcata caggcctttg aagcaggtgt
660tgattttcaa gagagtgcgg acagtttcct tctcatgctt tgtcttcatc atgcgtacca
720gggagattac aaacttttct tggaaagtgg cgcagtcaag tatttggaag ggcacgggtt
780ccgttttgaa gtcaagaagc gtgatggagt gaagcgcctt gaggaattgc tgccagcagt
840atctagtgga aaaaacatta agagaacact tgctgccatg ccggaagagg agacaactga
900agctaatgcc ggtcagtttc tctcctttgc aagtctattc cttccgaaat tggtagtagg
960agaaaaggct tgccttgaga aggttcaaag gcaaattcaa gtacatgcag agcaaggact
1020gatacaatat ccaacagctt ggcaatcagt aggacacatg atggtgattt tccgtttgat
1080gcgaacaaat tttctgatca aatttctcct aatacaccaa gggatgcaca tggttgccgg
1140gcatgatgcc aacgatgctg tgatttcaaa ttcagtggct caagctcgtt tttcaggctt
1200attgattgtc aaaacagtac ttgatcatat cctacaaaag acagaacgag gagttcgtct
1260ccatcctctt gcaaggaccg ccaaggtaaa aaatgaggtg aactccttta aggctgcact
1320cagctccctg gccaagcatg gagagtatgc tcctttcgcc cgacttttga acctttctgg
1380agtaaataat cttgagcatg gtcttttccc tcaactatcg gcaattgcac tcggagtcgc
1440cacagcacac gggagtaccc tcgcaggagt aaatgttgga gaacagtatc aacaactcag
1500agaggctgcc actgaggctg agaagcaact ccaacaatat gcagagtctc gcgaacttga
1560ccatcttgga cttgatgatc aggaaaagaa aattcttatg aacttccatc agaaaaagaa
1620cgaaatcagc ttccagcaaa caaacgctat ggtaactcta agaaaagagc gcctggccaa
1680gctgacagaa gctatcactg ctgcgtcact gcccaaaaca agtggacatt acgatgatga
1740tgacgacatt ccctttccag gacccatcaa tgatgacgac aatcctggcc atcaagatga
1800tgatccgact gactcacagg atacgaccat tcccgatgtg gtggttgatc ccgatgatgg
1860aagctacggc gaataccaga gttactcgga aaacggcatg aatgcaccag atgacttggt
1920cctattcgat ctagacgagg acgacgagga cactaagcca gtgcctaata gatcgaccaa
1980gggtggacaa cagaagaaca gtcaaaaggg ccagcatata gagggcagac agacacaatc
2040caggccaatt caaaatgtcc caggccctca cagaacaatc caccacgcca gtgcgccact
2100cacggacaat gacagaagaa atgaaccctc cggctcaacc agccctcgca tgctgacacc
2160aattaacgaa gaggcagacc cactggacga tgccgacgac gagacgtcta gccttccgcc
2220cttggagtca gatgatgaag agcaggacag ggacggaact tccaaccgca cacccactgt
2280cgccccaccg gctcccgtat acagagatca ctctgaaaag aaagaactcc cgcaagacga
2340gcaacaagat caggaccaca ctcaagaggc caggaaccag gacagtgaca acacccagtc
2400agaacactct tttgaggaga tgtatcgcca cattctaaga tcacaggggc catttgatgc
2460tgttttgtat tatcatatga tgaaggatga gcctgtagtt ttcagtacca gtgatggcaa
2520agagtacacg tatccagact cccttgaaga ggaatatcca ccatggctca ctgaaaaaga
2580ggctatgaat gaagagaata gatttgttac attggatggt caacaatttt attggccggt
2640gatgaatcac aagaataaat tcatggcaat cctgcaacat catcagtgaa tgagcatgga
2700acaatgggat gattcaaccg acaaatagct aacattaagt agtcaaggaa cgaaaacagg
2760aagaattttt gatgtctaag gtgtgaatta ttatcacaat aaaagtgatt cttatttttg
2820aatttaaagc tagcttatta ttactagccg tttttcaaag ttcaatttga gtcttaatgc
2880aaataggcgt taagccacag ttatagccat aattgtaact caatattcta actagcgatt
2940tatctaaatt aaattacatt atgcttttat aacttaccta ctagcctgcc caacatttac
3000acgatcgttt tataattaag aaaaaactaa tgatgaagat taaaaccttc atcatcctta
3060cgtcaattga attctctagc actcgaagct tattgtcttc aatgtaaaag aaaagctggt
3120ctaacaagat gacaactaga acaaagggca ggggccatac tgcggccacg actcaaaacg
3180acagaatgcc aggccctgag ctttcgggct ggatctctga gcagctaatg accggaagaa
3240ttcctgtaag cgacatcttc tgtgatattg agaacaatcc aggattatgc tacgcatccc
3300aaatgcaaca aacgaagcca aacccgaaga cgcgcaacag tcaaacccaa acggacccaa
3360tttgcaatca tagttttgag gaggtagtac aaacattggc ttcattggct actgttgtgc
3420aacaacaaac catcgcatca gaatcattag aacaacgcat tacgagtctt gagaatggtc
3480taaagccagt ttatgatatg gcaaaaacaa tctcctcatt gaacagggtt tgtgctgaga
3540tggttgcaaa atatgatctt ctggtgatga caaccggtcg ggcaacagca accgctgcgg
3600caactgaggc ttattgggcc gaacatggtc aaccaccacc tggaccatca ctttatgaag
3660aaagtgcgat tcggggtaag attgaatcta gagatgagac cgtccctcaa agtgttaggg
3720aggcattcaa caatctaaac agtaccactt cactaactga ggaaaatttt gggaaacctg
3780acatttcggc aaaggatttg agaaacatta tgtatgatca cttgcctggt tttggaactg
3840ctttccacca attagtacaa gtgatttgta aattgggaaa agatagcaac tcattggaca
3900tcattcatgc tgagttccag gccagcctgg ctgaaggaga ctctcctcaa tgtgccctaa
3960ttcaaattac aaaaagagtt ccaatcttcc aagatgctgc tccacctgtc atccacatcc
4020gctctcgagg tgacattccc cgagcttgcc agaaaagctt gcgtccagtc ccaccatcgc
4080ccaagattga tcgaggttgg gtatgtgttt ttcagcttca agatggtaaa acacttggac
4140tcaaaatttg agccaatctc ccttccctcc gaaagaggcg aataatagca gaggcttcaa
4200ctgctgaact atagggtacg ttacattaat gatacacttg tgagtatcag ccctggataa
4260tataagtcaa ttaaacgacc aagataaaat tgttcatatc tcgctagcag cttaaaatat
4320aaatgtaata ggagctatat ctctgacagt attataatca attgttatta agtaacccaa
4380accaaaagtg atgaagatta agaaaaacct acctcggctg agagagtgtt ttttcattaa
4440ccttcatctt gtaaacgttg agcaaaattg ttaaaaatat gaggcgggtt atattgccta
4500ctgctcctcc tgaatatatg gaggccatat accctgtcag gtcaaattca acaattgcta
4560gaggtggcaa cagcaataca ggcttcctga caccggagtc agtcaatggg gacactccat
4620cgaatccact caggccaatt gccgatgaca ccatcgacca tgccagccac acaccaggca
4680gtgtgtcatc agcattcatc cttgaagcta tggtgaatgt catatcgggc cccaaagtgc
4740taatgaagca aattccaatt tggcttcctc taggtgtcgc tgatcaaaag acctacagct
4800ttgactcaac tacggccgcc atcatgcttg cttcatacac tatcacccat ttcggcaagg
4860caaccaatcc acttgtcaga gtcaatcggc tgggtcctgg aatcccggat catcccctca
4920ggctcctgcg aattggaaac caggctttcc tccaggagtt cgttcttccg ccagtccaac
4980taccccagta tttcaccttt gatttgacag cactcaaact gatcacccaa ccactgcctg
5040ctgcaacatg gaccgatgac actccaacag gatcaaatgg agcgttgcgt ccaggaattt
5100catttcatcc aaaacttcgc cccattcttt tacccaacaa aagtgggaag aaggggaaca
5160gtgccgatct aacatctccg gagaaaatcc aagcaataat gacttcactc caggacttta
5220agatcgttcc aattgatcca accaaaaata tcatgggaat cgaagtgcca gaaactctgg
5280tccacaagct gaccggtaag aaggtgactt ctaaaaatgg acaaccaatc atccctgttc
5340ttttgccaaa gtacattggg ttggacccgg tggctccagg agacctcacc atggtaatca
5400cacaggattg tgacacgtgt cattctcctg caagtcttcc agctgtgatt gagaagtaat
5460tgcaataatt gactcagatc cagttttata gaatcttctc agggatagtg ataacatcta
5520tttagtaatc cgtccattag aggagacact tttaattgat caatatacta aaggtgcttt
5580acaccattgt cttttttctc tcctaaatgt agaacttaac aaaagactca taatatactt
5640gtttttaaag gattgattga tgaaagatca taactaataa cattacaaat aatcctacta
5700taatcaatac ggtgattcaa atgttaatct ttctcattgc acatactttt tgcccttatc
5760ctcaaattgc ctgcatgctt acatctgagg atagccagtg tgacttggat tggaaatgtg
5820gagaaaaaat cgggacccat ttctaggttg ttcacaatcc aagtacagac attgcccttc
5880taattaagaa aaaatcggcg atgaagatta agccgacagt gagcgtaatc ttcatctctc
5940ttagattatt tgttttccag agtaggggtc gtcaggtcct tttcaatcgt gtaaccaaaa
6000taaactccac tagaaggata ttgtggggca acaacacaat gggcgttaca ggaatattgc
6060agttacctcg tgatcgattc aagaggacat cattctttct ttgggtaatt atccttttcc
6120aaagaacatt ttccatccca cttggagtca tccacaatag cacattacag gttagtgatg
6180tcgacaaact agtttgtcgt gacaaactgt catccacaaa tcaattgaga tcagttggac
6240tgaatctcga agggaatgga gtggcaactg acgtgccatc tgcaactaaa agatggggct
6300tcaggtccgg tgtcccacca aaggtggtca attatgaagc tggtgaatgg gctgaaaact
6360gctacaatct tgaaatcaaa aaacctgacg ggagtgagtg tctaccagca gcgccagacg
6420ggattcgggg cttcccccgg tgccggtatg tgcacaaagt atcaggaacg ggaccgtgtg
6480ccggagactt tgccttccat aaagagggtg ctttcttcct gtatgatcga cttgcttcca
6540cagttatcta ccgaggaacg actttcgctg aaggtgtcgt tgcatttctg atactgcccc
6600aagctaagaa ggacttcttc agctcacacc ccttgagaga gccggtcaat gcaacggagg
6660acccgtctag tggctactat tctaccacaa ttagatatca ggctaccggt tttggaacca
6720atgagacaga gtacttgttc gaggttgaca atttgaccta cgtccaactt gaatcaagat
6780tcacaccaca gtttctgctc cagctgaatg agacaatata tacaagtggg aaaaggagca
6840ataccacggg aaaactaatt tggaaggtca accccgaaat tgatacaaca atcggggagt
6900gggccttctg ggaaactaaa aaaacctcac tagaaaaatt cgcagtgaag agttgtcttt
6960cacagttgta tcaaacggag ccaaaaacat cagtggtcag agtccggcgc gaacttcttc
7020cgacccaggg accaacacaa caactgaaga ccacaaaatc atggcttcag aaaattcctc
7080tgcaatggtt caagtgcaca gtcaaggaag ggaagctgca gtgtcgcatc taacaaccct
7140tgccacaatc tccacgagtc cccaatccct cacaaccaaa ccaggtccgg acaacagcac
7200ccataataca cccgtgtata aacttgacat ctctgaggca actcaagttg aacaacatca
7260ccgcagaaca gacaacgaca gcacagcctc cgacactccc tctgccacga ccgcagccgg
7320acccccaaaa gcagagaaca ccaacacgag caagagcact gacttcctgg accccgccac
7380cacaacaagt ccccaaaacc acagcgagac cgctggcaac aacaacactc atcaccaaga
7440taccggagaa gagagtgcca gcagcgggaa gctaggctta attaccaata ctattgctgg
7500agtcgcagga ctgatcacag gcgggagaag aactcgaaga gaagcaattg tcaatgctca
7560acccaaatgc aaccctaatt tacattactg gactactcag gatgaaggtg ctgcaatcgg
7620actggcctgg ataccatatt tcgggccagc agccgaggga atttacatag aggggctaat
7680gcacaatcaa gatggtttaa tctgtgggtt gagacagctg gccaacgaga cgactcaagc
7740tcttcaactg ttcctgagag ccacaactga gctacgcacc ttttcaatcc tcaaccgtaa
7800ggcaattgat ttcttgctgc agcgatgggg cggcacatgc cacattctgg gaccggactg
7860ctgtatcgaa ccacatgatt ggaccaagaa cataacagac aaaattgatc agattattca
7920tgattttgtt gataaaaccc ttccggacca gggggacaat gacaattggt ggacaggatg
7980gagacaatgg ataccggcag gtattggagt tacaggcgtt ataattgcag ttatcgcttt
8040attctgtata tgcaaatttg tcttttagtt tttcttcaga ttgcttcatg gaaaagctca
8100gcctcaaatc aatgaaacca ggatttaatt atatggatta cttgaatcta agattacttg
8160acaaatgata atataataca ctggagcttt aaacatagcc aatgtgattc taactccttt
8220aaactcacag ttaatcataa acaaggtttg acatcaatct agttatctct ttgagaatga
8280taaacttgat gaagattaag aaaaaggtaa tctttcgatt atctttaatc ttcatccttg
8340attctacaat catgacagtt gtctttagtg acaagggaaa gaagcctttt tattaagttg
8400taataatcag atctgcgaac cggtagagtt tagttgcaac ctaacacaca taaagcattg
8460gtcaaaaagt caatagaaat ttaaacagtg agtggagaca acttttaaat ggaagcttca
8520tatgagagag gacgcccacg agctgccaga cagcattcaa gggatggaca cgaccaccat
8580gttcgagcac gatcatcatc cagagagaat tatcgaggtg agtaccgtca atcaaggagc
8640gcctcacaag tgcgcgttcc tactgtattt cataagaaga gagttgaacc attaacagtt
8700cctccagcac ctaaagacat atgtccgacc ttgaaaaaag gatttttgtg tgacagtagt
8760ttttgcaaaa aagatcacca gttggagagt ttaactgata gggaattact cctactaatc
8820gcccgtaaga cttgtggatc agtagaacaa caattaaata taactgcacc caaggactcg
8880cgcttagcaa atccaacggc tgatgatttc cagcaagagg aaggtccaaa aattaccttg
8940ttgacactga tcaagacggc agaacactgg gcgagacaag acatcagaac catagaggat
9000tcaaaattaa gagcattgtt gactctatgt gctgtgatga cgaggaaatt ctcaaaatcc
9060cagctgagtc ttttatgtga gacacaccta aggcgcgagg ggcttgggca agatcaggca
9120gaacccgttc tcgaagtata tcaacgatta cacagtgata aaggaggcag ttttgaagct
9180gcactatggc aacaatggga ccgacaatcc ctaattatgt ttatcactgc attcttgaat
9240attgctctcc agttaccgtg tgaaagttct gctgtcgttg tttcagggtt aagaacattg
9300gttcctcaat cagataatga ggaagcttca accaacccgg ggacatgctc atggtctgat
9360gagggtaccc cttaataagg ctgactaaaa cactatataa ccttctactt gatcacaata
9420ctccgtatac ctatcatcat atatttaatc aagacgatat cctttaaaac ttattcagta
9480ctataatcac tctcgtttca aattaataag atgtgcatga ttgccctaat atatgaagag
9540gtatgataca accctaacag tgatcaaaga aaatcataat ctcgtatcgc tcgtaatata
9600acctgccaag catacctctt gcacaaagtg attcttgtac acaaataatg ttttactcta
9660caggaggtag caacgatcca tcccatcaaa aaataagtat ttcatgactt actaatgatc
9720tcttaaaata ttaagaaaaa ctgacggaac ataaattctt tatgcttcaa gctgtggagg
9780aggtgtttgg tattggctat tgttatatta caatcaataa caagcttgta aaaatattgt
9840tcttgtttca agaggtagat tgtgaccgga aatgctaaac taatgatgaa gattaatgcg
9900gaggtctgat aagaataaac cttattattc agattaggcc ccaagaggca ttcttcatct
9960ccttttagca aagtactatt tcagggtagt ccaattagtg gcacgtcttt tagctgtata
10020tcagtcgccc ctgagatacg ccacaaaagt gtctctaagc taaattggtc tgtacacatc
10080ccatacattg tattaggggc aataatatct aattgaactt agccgtttaa aatttagtgc
10140ataaatctgg gctaacacca ccaggtcaac tccattggct gaaaagaagc ttacctacaa
10200cgaacatcac tttgagcgcc ctcacaatta aaaaatagga acgtcgttcc aacaatcgag
10260cgcaaggttt caaggttgaa ctgagagtgt ctagacaaca aaatattgat actccagaca
10320ccaagcaaga cctgagaaaa aaccatggct aaagctacgg gacgatacaa tctaatatcg
10380cccaaaaagg acctggagaa aggggttgtc ttaagcgacc tctgtaactt cttagttagc
10440caaactattc aggggtggaa ggtttattgg gctggtattg agtttgatgt gactcacaaa
10500ggaatggccc tattgcatag actgaaaact aatgactttg cccctgcatg gtcaatgaca
10560aggaatctct ttcctcattt atttcaaaat ccgaattcca caattgaatc accgctgtgg
10620gcattgagag tcatccttgc agcagggata caggaccagc tgattgacca gtctttgatt
10680gaacccttag caggagccct tggtctgatc tctgattggc tgctaacaac caacactaac
10740catttcaaca tgcgaacaca acgtgtcaag gaacaattga gcctaaaaat gctgtcgttg
10800attcgatcca atattctcaa gtttattaac aaattggatg ctctacatgt cgtgaactac
10860aacggattgt tgagcagtat tgaaattgga actcaaaatc atacaatcat cataactcga
10920actaacatgg gttttctggt ggagctccaa gaacccgaca aatcggcaat gaaccgcatg
10980aagcctgggc cggcgaaatt ttccctcctt catgagtcca cactgaaagc atttacacaa
11040ggatcctcga cacgaatgca aagtttgatt cttgaattta atagctctct tgctatctaa
11100ctaaggtaga atacttcata ttgagctaac tcatatatgc tgactcaata gttatcttga
11160catctctgct ttcataatca gatatataag cataataaat aaatactcat atttcttgat
11220aatttgttta accacagata aatcctcact gtaagccagc ttccaagttg acacccttac
11280aaaaaccagg actcagaatc cctcaaacaa gagattccaa gacaacatca tagaattgct
11340ttattatatg aataagcatt ttatcaccag aaatcctata tactaaatgg ttaattgtaa
11400ctgaacccgc aggtcacatg tgttaggttt cacagattct atatattact aactctatac
11460tcgtaattaa cattagataa gtagattaag aaaaaagcct gaggaagatt aagaaaaact
11520gcttattggg tctttccgtg ttttagatga agcagttgaa attcttcctc ttgatattaa
11580atggctacac aacataccca atacccagac gctaggttat catcaccaat tgtattggac
11640caatgtgacc tagtcactag agcttgcggg ttatattcat catactccct taatccgcaa
11700ctacgcaact gtaaactccc gaaacatatc taccgtttga aatacgatgt aactgttacc
11760aagttcttga gtgatgtacc agtggcgaca ttgcccatag atttcatagt cccagttctt
11820ctcaaggcac tgtcaggcaa tggattctgt cctgttgagc cgcggtgcca acagttctta
11880gatgaaatca ttaagtacac aatgcaagat gctctcttct tgaaatatta tctcaaaaat
11940gtgggtgctc aagaagactg tgttgatgaa cactttcaag agaaaatctt atcttcaatt
12000cagggcaatg aatttttaca tcaaatgttt ttctggtatg atctggctat tttaactcga
12060aggggtagat taaatcgagg aaactctaga tcaacatggt ttgttcatga tgatttaata
12120gacatcttag gctatgggga ctatgttttt tggaagatcc caatttcaat gttaccactg
12180aacacacaag gaatccccca tgctgctatg gactggtatc aggcatcagt attcaaagaa
12240gcggttcaag ggcatacaca cattgtttct gtttctactg ccgacgtctt gataatgtgc
12300aaagatttaa ttacatgtcg attcaacaca actctaatct caaaaatagc agagattgag
12360gatccagttt gttctgatta tcccaatttt aagattgtgt ctatgcttta ccagagcgga
12420gattacttac tctccatatt agggtctgat gggtataaaa ttattaagtt cctcgaacca
12480ttgtgcttgg ccaaaattca attatgctca aagtacactg agaggaaggg ccgattctta
12540acacaaatgc atttagctgt aaatcacacc ctagaagaaa ttacagaaat gcgtgcacta
12600aagccttcac aggctcaaaa gatccgtgaa ttccatagaa cattgataag gctggagatg
12660acgccacaac aactttgtga gctattttcc attcaaaaac actgggggca tcctgtgcta
12720catagtgaaa cagcaatcca aaaagttaaa aaacatgcta cggtgctaaa agcattacgc
12780cctatagtga ttttcgagac atactgtgtt tttaaatata gtattgccaa acattatttt
12840gatagtcaag gatcttggta cagtgttact tcagatagga atctaacacc gggtcttaat
12900tcttatatca aaagaaatca attccctccg ttgccaatga ttaaagaact actatgggaa
12960ttttaccacc ttgaccaccc tccacttttc tcaaccaaaa ttattagtga cttaagtatt
13020tttataaaag acagagctac cgcagtagaa aggacatgct gggatgcagt attcgagcct
13080aatgttctag gatataatcc acctcacaaa tttagtacta aacgtgtacc ggaacaattt
13140ttagagcaag aaaacttttc tattgagaat gttctttcct acgcacaaaa actcgagtat
13200ctactaccac aatatcggaa cttttctttc tcattgaaag agaaagagtt gaatgtaggt
13260agaaccttcg gaaaattgcc ttatccgact cgcaatgttc aaacactttg tgaagctctg
13320ttagctgatg gtcttgctaa agcatttcct agcaatatga tggtagttac ggaacgtgag
13380caaaaagaaa gcttattgca tcaagcatca tggcaccaca caagtgatga ttttggtgaa
13440catgccacag ttagagggag tagctttgta actgatttag agaaatacaa tcttgcattt
13500agatatgagt ttacagcacc ttttatagaa tattgcaacc gttgctatgg tgttaagaat
13560gtttttaatt ggatgcatta tacaatccca cagtgttata tgcatgtcag tgattattat
13620aatccaccac ataacctcac actggagaat cgagacaacc cccccgaagg gcctagttca
13680tacaggggtc atatgggagg gattgaagga ctgcaacaaa aactctggac aagtatttca
13740tgtgctcaaa tttctttagt tgaaattaag actggtttta agttacgctc agctgtgatg
13800ggtgacaatc agtgcattac tgttttatca gtcttcccct tagagactga cgcagacgag
13860caggaacaga gcgccgaaga caatgcagcg agggtggccg ccagcctagc aaaagttaca
13920agtgcctgtg gaatcttttt aaaacctgat gaaacatttg tacattcagg ttttatctat
13980tttggaaaaa aacaatattt gaatggggtc caattgcctc agtcccttaa aacggctaca
14040agaatggcac cattgtctga tgcaattttt gatgatcttc aagggaccct ggctagtata
14100ggcactgctt ttgagcgatc catctctgag acacgacata tctttccttg caggataacc
14160gcagctttcc atacgttttt ttcggtgaga atcttgcaat atcatcatct cgggttcaat
14220aaaggttttg accttggaca gttaacactc ggcaaacctc tggatttcgg aacaatatca
14280ttggcactag cggtaccgca ggtgcttgga gggttatcct tcttgaatcc tgagaaatgt
14340ttctaccgga atctaggaga tccagttacc tcaggcttat tccagttaaa aacttatctc
14400cgaatgattg agatggatga tttattctta cctttaattg cgaagaaccc tgggaactgc
14460actgccattg actttgtgct aaatcctagc ggattaaatg tccctgggtc gcaagactta
14520acttcatttc tgcgccagat tgtacgcagg accatcaccc taagtgcgaa aaacaaactt
14580attaatacct tatttcatgc gtcagctgac ttcgaagacg aaatggtttg taaatggcta
14640ttatcatcaa ctcctgttat gagtcgtttt gcggccgata tcttttcacg cacgccgagc
14700gggaagcgat tgcaaattct aggatacctg gaaggaacac gcacattatt agcctctaag
14760atcatcaaca ataatacaga gacaccggtt ttggacagac tgaggaaaat aacattgcaa
14820aggtggagcc tatggtttag ttatcttgat cattgtgata atatcctggc ggaggcttta
14880acccaaataa cttgcacagt tgatttagca cagattctga gggaatattc atgggctcat
14940attttagagg gaagacctct tattggagcc acactcccat gtatgattga gcaattcaaa
15000gtgttttggc tgaaacccta cgaacaatgt ccgcagtgtt caaatgcaaa gcaaccaggt
15060gggaaaccat tcgtgtcagt ggcagtcaag aaacatattg ttagtgcatg gccgaacgca
15120tcccgaataa gctggactat cggggatgga atcccataca ttggatcaag gacagaagat
15180aagataggac aacctgctat taaaccaaaa tgtccttccg cagccttaag agaggccatt
15240gaattggcgt cccgtttaac atgggtaact caaggcagtt cgaacagtga cttgctaata
15300aaaccatttt tggaagcacg agtaaattta agtgttcaag aaatacttca aatgacccct
15360tcacattact caggaaatat tgttcacagg tacaacgatc aatacagtcc tcattctttc
15420atggccaatc gtatgagtaa ttcagcaacg cgattgattg tttctacaaa cactttaggt
15480gagttttcag gaggtggcca gtctgcacgc gacagcaata ttattttcca gaatgttata
15540aattatgcag ttgcactgtt cgatattaaa tttagaaaca ctgaggctac agatatccaa
15600tataatcgtg ctcaccttca tctaactaag tgttgcaccc gggaagtacc agctcagtat
15660ttaacataca catctacatt ggatttagat ttaacaagat accgagaaaa cgaattgatt
15720tatgacagta atcctctaaa aggaggactc aattgcaata tctcattcga taatccattt
15780ttccaaggta aacggctgaa cattatagaa gatgatctta ttcgactgcc tcacttatct
15840ggatgggagc tagccaagac catcatgcaa tcaattattt cagatagcaa caattcatct
15900acagacccaa ttagcagtgg agaaacaaga tcattcacta cccatttctt aacttatccc
15960aagataggac ttctgtacag ttttggggcc tttgtaagtt attatcttgg caatacaatt
16020cttcggacta agaaattaac acttgacaat tttttatatt acttaactac tcaaattcat
16080aatctaccac atcgctcatt gcgaatactt aagccaacat tcaaacatgc aagcgttatg
16140tcacggttaa tgagtattga tcctcatttt tctatttaca taggcggtgc tgcaggtgac
16200agaggactct cagatgcggc caggttattt ttgagaacgt ccatttcatc ttttcttaca
16260tttgtaaaag aatggataat taatcgcgga acaattgtcc ctttatggat agtatatccg
16320ctagagggtc aaaacccaac acctgtgaat aattttctct atcagatcgt agaactgctg
16380gtgcatgatt catcaagaca acaggctttt aaaactacca taagtgatca tgtacatcct
16440cacgacaatc ttgtttacac atgtaagagt acagccagca atttcttcca tgcatcattg
16500gcgtactgga ggagcagaca cagaaacagc aaccgaaaat acttggcaag agactcttca
16560actggatcaa gcacaaacaa cagtgatggt catattgaga gaagtcaaga acaaaccacc
16620agagatccac atgatggcac tgaacggaat ctagtcctac aaatgagcca tgaaataaaa
16680agaacgacaa ttccacaaga aaacacgcac cagggtccgt cgttccagtc ctttctaagt
16740gactctgctt gtggtacagc aaatccaaaa ctaaatttcg atcgatcgag acacaatgtg
16800aaatttcagg atcataactc ggcatccaag agggaaggtc atcaaataat ctcacaccgt
16860ctagtcctac ctttctttac attatctcaa gggacacgcc aattaacgtc atccaatgag
16920tcacaaaccc aagacgagat atcaaagtac ttacggcaat tgagatccgt cattgatacc
16980acagtttatt gtagatttac cggtatagtc tcgtccatgc attacaaact tgatgaggtc
17040ctttgggaaa tagagagttt caagtcggct gtgacgctag cagagggaga aggtgctggt
17100gccttactat tgattcagaa ataccaagtt aagaccttat ttttcaacac gctagctact
17160gagtccagta tagagtcaga aatagtatca ggaatgacta ctcctaggat gcttctacct
17220gttatgtcaa aattccataa tgaccaaatt gagattattc ttaacaactc agcaagccaa
17280ataacagaca taacaaatcc tacttggttt aaagaccaaa gagcaaggct acctaagcaa
17340gtcgaggtta taaccatgga tgcagagaca acagagaata taaacagatc gaaattgtac
17400gaagctgtat ataaattgat cttacaccat attgatccta gcgtattgaa agcagtggtc
17460cttaaagtct ttctaagtga tactgagggt atgttatggc taaatgataa tttagccccg
17520ttttttgcca ctggttattt aattaagcca ataacgtcaa gtgctagatc tagtgagtgg
17580tatctttgtc tgacgaactt cttatcaact acacgtaaga tgccacacca aaaccatctc
17640agttgtaaac aggtaatact tacggcattg caactgcaaa ttcaacgaag cccatactgg
17700ctaagtcatt taactcagta tgctgactgt gagttacatt taagttatat ccgccttggt
17760tttccatcat tagagaaagt actataccac aggtataacc tcgtcgattc aaaaagaggt
17820ccactagtct ctatcactca gcacttagca catcttagag cagagattcg agaattaact
17880aatgattata atcaacagcg acaaagtcgg actcaaacat atcactttat tcgtactgca
17940aaaggacgaa tcacaaaact agtcaatgat tatttaaaat tctttcttat tgtgcaagca
18000ttaaaacata atgggacatg gcaagctgag tttaagaaat taccagagtt gattagtgtg
18060tgcaataggt tctaccatat tagagattgc aattgtgaag aacgtttctt agttcaaacc
18120ttatatttac atagaatgca ggattctgaa gttaagctta tcgaaaggct gacagggctt
18180ctgagtttat ttccggatgg tctctacagg tttgattgaa ttaccgtgca tagtatcctg
18240atacttgcaa aggttggtta ttaacataca gattataaaa aactcataaa ttgctctcat
18300acatcatatt gatctaatct caataaacaa ctatttaaat aacgaaagga gtccctatat
18360tatatactat atttagcctc tctccctgcg tgataatcaa aaaattcaca atgcagcatg
18420tgtgacatat tactgccgca atgaatttaa cgcaacataa taaactctgc actctttata
18480attaagcttt aacgaaaggt ctgggctcat attgttattg atataataat gttgtatcaa
18540tatcctgtca gatggaatag tgttttggtt gataacacaa cttcttaaaa caaaattgat
18600ctttaagatt aagtttttta taattatcat tactttaatt tgtcgtttta aaaacggtga
18660tagccttaat ctttgtgtaa aataagagat taggtgtaat aaccttaaca tttttgtcta
18720gtaagctact atttcataca gaatgataaa attaaaagaa aaggcaggac tgtaaaatca
18780gaaatacctt ctttacaata tagcagacta gataataatc ttcgtgttaa tgataattaa
18840gacattgacc acgctcatca gaaggctcgc cagaataaac gttgcaaaaa ggattcctgg
18900aaaaatggtc gcacacaaaa atttaaaaat aaatctattt cttctttttt gtgtgtcca
1895920695PRTMarburg virus 20Met Asp Leu His Ser Leu Leu Glu Leu Gly Thr
Lys Pro Thr Ala Pro1 5 10
15His Val Arg Asn Lys Lys Val Ile Leu Phe Asp Thr Asn His Gln Val
20 25 30Ser Ile Cys Asn Gln Ile Ile
Asp Ala Ile Asn Ser Gly Ile Asp Leu 35 40
45Gly Asp Leu Leu Glu Gly Gly Leu Leu Thr Leu Cys Val Glu His
Tyr 50 55 60Tyr Asn Ser Asp Lys Asp
Lys Phe Asn Thr Ser Pro Ile Ala Lys Tyr65 70
75 80Leu Arg Asp Ala Gly Tyr Glu Phe Asp Val Val
Lys Asn Ala Asp Ala 85 90
95Thr Arg Phe Leu Asp Val Ile Pro Asn Glu Pro His Tyr Ser Pro Leu
100 105 110Ile Leu Ala Leu Lys Thr
Leu Glu Ser Thr Glu Ser Gln Arg Gly Arg 115 120
125Ile Gly Leu Phe Leu Ser Phe Cys Ser Leu Phe Leu Pro Lys
Leu Val 130 135 140Val Gly Asp Arg Ala
Ser Ile Glu Lys Ala Leu Arg Gln Val Thr Val145 150
155 160His Gln Glu Gln Gly Ile Val Thr Tyr Pro
Asn His Trp Leu Thr Thr 165 170
175Gly His Met Lys Val Ile Phe Gly Ile Leu Arg Ser Ser Phe Ile Leu
180 185 190Lys Phe Val Leu Ile
His Gln Gly Val Asn Leu Val Thr Gly His Asp 195
200 205Ala Tyr Asp Ser Ile Ile Ser Asn Ser Val Gly Gln
Thr Arg Phe Ser 210 215 220Gly Leu Leu
Ile Val Lys Thr Val Leu Glu Phe Ile Leu Gln Lys Thr225
230 235 240Asp Ser Gly Val Thr Leu His
Pro Leu Val Arg Thr Ser Lys Val Lys 245
250 255Asn Glu Val Ala Ser Phe Lys Gln Ala Leu Ser Asn
Leu Ala Arg His 260 265 270Gly
Glu Tyr Ala Pro Phe Ala Arg Val Leu Asn Leu Ser Gly Ile Asn 275
280 285Asn Leu Glu His Gly Leu Tyr Pro Gln
Leu Ser Ala Ile Ala Leu Gly 290 295
300Val Ala Thr Ala His Gly Ser Thr Leu Ala Gly Val Asn Val Gly Glu305
310 315 320Gln Tyr Gln Gln
Leu Arg Glu Ala Ala His Asp Ala Glu Ile Lys Leu 325
330 335Gln Arg Arg His Glu His Gln Glu Ile Gln
Ala Ile Ala Glu Asp Asp 340 345
350Glu Glu Arg Lys Ile Leu Glu Gln Phe His Leu Gln Lys Thr Glu Ile
355 360 365Thr His Ser Gln Thr Leu Ala
Val Leu Ser Gln Lys Arg Glu Lys Leu 370 375
380Ala Arg Leu Ala Ala Glu Ile Glu Asn Asn Ile Val Glu Asp Gln
Gly385 390 395 400Phe Lys
Gln Ser Gln Asn Gln Val Ser Gln Ser Phe Leu Asn Asp Pro
405 410 415Thr Pro Val Glu Val Thr Val
Gln Ala Arg Pro Ile Asn Arg Pro Thr 420 425
430Ala Leu Pro Pro Pro Val Asp Asn Lys Ile Glu His Glu Ser
Thr Glu 435 440 445Asp Ser Ser Ser
Ser Ser Ser Phe Val Asp Leu Asn Asp Pro Phe Ala 450
455 460Leu Leu Asn Glu Asp Glu Asp Thr Leu Asp Asp Ser
Val Met Ile Pro465 470 475
480Ser Thr Thr Ser Arg Glu Phe Gln Gly Ile Pro Ala Pro Pro Arg Gln
485 490 495Ser Gln Asp Leu Asn
Asn Ser Gln Gly Lys Gln Glu Asp Glu Ser Thr 500
505 510Asn Pro Ile Lys Lys Gln Phe Leu Arg Tyr Gln Glu
Leu Pro Pro Val 515 520 525Gln Glu
Asp Asp Glu Ser Glu Tyr Thr Thr Asp Ser Gln Glu Ser Ile 530
535 540Asp Gln Pro Gly Ser Asp Asn Glu Gln Gly Val
Asp Leu Pro Pro Pro545 550 555
560Pro Leu Tyr Ala Gln Glu Lys Arg Gln Asp Pro Ile Gln His Pro Ala
565 570 575Val Ser Ser Gln
Asp Pro Phe Gly Ser Ile Gly Asp Val Asn Gly Asp 580
585 590Ile Leu Glu Pro Ile Arg Ser Pro Ser Ser Pro
Ser Ala Pro Gln Glu 595 600 605Asp
Thr Arg Ala Arg Glu Ala Tyr Glu Leu Ser Pro Asp Phe Thr Asn 610
615 620Tyr Glu Asp Asn Gln Gln Asn Trp Pro Gln
Arg Val Val Thr Lys Lys625 630 635
640Gly Arg Thr Phe Leu Tyr Pro Asn Asp Leu Leu Gln Thr Asn Pro
Pro 645 650 655Glu Ser Leu
Ile Thr Ala Leu Val Glu Glu Tyr Gln Asn Pro Val Ser 660
665 670Ala Lys Glu Leu Gln Ala Asp Trp Pro Asp
Met Ser Phe Asp Glu Arg 675 680
685Arg His Val Ala Met Asn Leu 690 69521329PRTMarburg
virus 21Met Trp Asp Ser Ser Tyr Met Gln Gln Val Ser Glu Gly Leu Met Thr1
5 10 15Gly Lys Val Pro
Ile Asp Gln Val Phe Gly Ala Asn Pro Ser Glu Lys 20
25 30Leu His Lys Arg Arg Lys Pro Lys Gly Thr Val
Gly Leu Gln Cys Ser 35 40 45Pro
Cys Leu Met Ser Lys Ala Thr Ser Thr Asp Asp Ile Val Trp Asp 50
55 60Gln Leu Ile Val Lys Lys Thr Leu Ala Asp
Leu Leu Ile Pro Ile Asn65 70 75
80Arg Gln Ile Ser Asp Ile Gln Ser Thr Leu Asn Glu Val Thr Thr
Arg 85 90 95Val His Glu
Ile Glu Arg Gln Leu His Glu Ile Thr Pro Val Leu Lys 100
105 110Met Gly Arg Thr Leu Glu Ala Ile Ser Lys
Gly Met Ser Glu Met Leu 115 120
125Ala Lys Tyr Asp His Leu Val Ile Ser Thr Gly Arg Thr Thr Ala Pro 130
135 140Ala Ala Ala Phe Asp Ala Tyr Leu
Asn Glu His Gly Val Pro Pro Pro145 150
155 160Gln Pro Ala Ile Phe Lys Asp Leu Gly Val Ala Gln
Gln Ala Cys Ser 165 170
175Lys Gly Thr Met Val Lys Asn Glu Thr Thr Asp Ala Ala Asp Lys Met
180 185 190Ser Lys Val Leu Glu Leu
Ser Glu Glu Thr Phe Ser Lys Pro Asn Leu 195 200
205Ser Ala Lys Asp Leu Ala Leu Leu Leu Phe Thr His Leu Pro
Gly Asn 210 215 220Asn Thr Pro Phe His
Ile Leu Ala Gln Val Leu Ser Lys Ile Ala Tyr225 230
235 240Lys Ser Gly Lys Ser Gly Ala Phe Leu Asp
Ala Phe His Gln Ile Leu 245 250
255Ser Glu Gly Glu Asn Ala Gln Ala Ala Leu Thr Arg Leu Ser Arg Thr
260 265 270Phe Asp Ala Phe Leu
Gly Val Val Pro Pro Val Ile Arg Val Lys Asn 275
280 285Phe Gln Thr Val Pro Arg Pro Cys Gln Lys Ser Leu
Arg Ala Val Pro 290 295 300Pro Asn Pro
Thr Ile Asp Lys Gly Trp Val Cys Val Tyr Ser Ser Glu305
310 315 320Gln Gly Glu Thr Arg Ala Leu
Lys Ile 32522303PRTMarburg virus 22Met Ala Ser Ser Ser Asn
Tyr Asn Thr Tyr Met Gln Tyr Leu Asn Pro1 5
10 15Pro Pro Tyr Ala Asp His Gly Ala Asn Gln Leu Ile
Pro Ala Asp Gln 20 25 30Leu
Ser Asn Gln Gln Gly Ile Thr Pro Asn Tyr Val Gly Asp Leu Asn 35
40 45Leu Asp Asp Gln Phe Lys Gly Asn Val
Cys His Ala Phe Thr Leu Glu 50 55
60Ala Ile Ile Asp Ile Ser Ala Tyr Asn Glu Pro Thr Val Lys Gly Val65
70 75 80Pro Ala Trp Leu Pro
Leu Gly Ile Met Ser Asn Phe Glu Tyr Pro Leu 85
90 95Ala His Thr Val Ala Ala Leu Leu Thr Gly Ser
Tyr Thr Ile Thr Gln 100 105
110Phe Thr His Asn Gly Gln Lys Phe Val Arg Val Asn Arg Leu Gly Thr
115 120 125Gly Ile Pro Ala His Pro Leu
Arg Met Leu Arg Glu Gly Asn Gln Ala 130 135
140Phe Ile Gln Asn Met Val Ile Pro Arg Asn Phe Ser Thr Asn Gln
Phe145 150 155 160Thr Tyr
Asn Leu Thr Asn Leu Val Leu Ser Val Gln Lys Leu Pro Asp
165 170 175Asp Ala Trp Arg Pro Ser Lys
Asp Lys Leu Ile Gly Asn Thr Met His 180 185
190Pro Ala Val Ser Ile His Pro Asn Leu Pro Pro Ile Val Leu
Pro Thr 195 200 205Val Lys Lys Gln
Ala Tyr Arg Gln His Lys Asn Pro Asn Asn Gly Pro 210
215 220Leu Leu Ala Ile Ser Gly Ile Leu His Gln Leu Arg
Val Glu Lys Val225 230 235
240Pro Glu Lys Thr Ser Leu Phe Arg Ile Ser Leu Pro Ala Asp Met Phe
245 250 255Ser Val Lys Glu Gly
Met Met Lys Lys Arg Gly Glu Asn Ser Pro Val 260
265 270Val Tyr Phe Gln Ala Pro Glu Asn Phe Pro Leu Asn
Gly Phe Asn Asn 275 280 285Arg Gln
Val Val Leu Ala Tyr Ala Asn Pro Thr Leu Ser Ala Val 290
295 30023681PRTMarburg virus 23Met Lys Thr Thr Cys Leu
Phe Ile Ser Leu Ile Leu Ile Gln Gly Ile1 5
10 15Lys Thr Leu Pro Ile Leu Glu Ile Ala Ser Asn Asn
Gln Pro Gln Asn 20 25 30Val
Asp Ser Val Cys Ser Gly Thr Leu Gln Lys Thr Glu Asp Val His 35
40 45Leu Met Gly Phe Thr Leu Ser Gly Gln
Lys Val Ala Asp Ser Pro Leu 50 55
60Glu Ala Ser Lys Arg Trp Ala Phe Arg Thr Gly Val Pro Pro Lys Asn65
70 75 80Val Glu Tyr Thr Glu
Gly Glu Glu Ala Lys Thr Cys Tyr Asn Ile Ser 85
90 95Val Thr Asp Pro Ser Gly Lys Ser Leu Leu Leu
Asp Pro Pro Thr Asn 100 105
110Ile Arg Asp Tyr Pro Lys Cys Lys Thr Ile His His Ile Gln Gly Gln
115 120 125Asn Pro His Ala Gln Gly Ile
Ala Leu His Leu Trp Gly Ala Phe Phe 130 135
140Leu Tyr Asp Arg Ile Ala Ser Thr Thr Met Tyr Arg Gly Arg Val
Phe145 150 155 160Thr Glu
Gly Asn Ile Ala Ala Met Ile Val Asn Lys Thr Val His Lys
165 170 175Met Ile Phe Ser Arg Gln Gly
Gln Gly Tyr Arg His Met Asn Leu Thr 180 185
190Ser Thr Asn Lys Tyr Trp Thr Ser Asn Asn Gly Thr Gln Thr
Asn Asp 195 200 205Thr Gly Cys Phe
Gly Ala Leu Gln Glu Tyr Asn Ser Thr Lys Asn Gln 210
215 220Thr Cys Ala Pro Ser Lys Ile Pro Ser Pro Leu Pro
Thr Ala Arg Pro225 230 235
240Glu Ile Lys Pro Thr Ser Thr Pro Thr Asp Ala Thr Thr Leu Asn Thr
245 250 255Thr Asp Pro Asn Asn
Asp Asp Glu Asp Leu Ile Thr Ser Gly Ser Gly 260
265 270Ser Gly Glu Gln Glu Pro Tyr Thr Thr Ser Asp Ala
Val Thr Lys Gln 275 280 285Gly Leu
Ser Ser Thr Met Pro Pro Thr Pro Ser Pro Gln Pro Ser Thr 290
295 300Pro Gln Gln Glu Gly Asn Asn Thr Asp His Ser
Gln Gly Thr Val Thr305 310 315
320Glu Pro Asn Lys Thr Asn Thr Thr Ala Gln Pro Ser Met Pro Pro His
325 330 335Asn Thr Thr Ala
Ile Ser Thr Asn Asn Thr Ser Lys Asn Asn Phe Ser 340
345 350Thr Leu Ser Val Ser Leu Gln Asn Thr Thr Asn
Tyr Asp Thr Gln Ser 355 360 365Thr
Ala Thr Glu Asn Glu Gln Thr Ser Ala Pro Ser Lys Thr Thr Leu 370
375 380Pro Pro Thr Gly Asn Leu Thr Thr Ala Lys
Ser Thr Asn Asn Thr Lys385 390 395
400Gly Pro Thr Thr Thr Ala Pro Asn Met Thr Asn Gly His Leu Thr
Ser 405 410 415Pro Ser Pro
Thr Pro Asn Pro Thr Thr Gln His Leu Val Tyr Phe Arg 420
425 430Lys Lys Arg Ser Ile Leu Trp Arg Glu Gly
Asp Met Phe Pro Phe Leu 435 440
445Asp Gly Leu Ile Asn Ala Pro Ile Asp Phe Asp Pro Val Pro Asn Thr 450
455 460Lys Thr Ile Phe Asp Glu Ser Ser
Ser Ser Gly Ala Ser Ala Glu Glu465 470
475 480Asp Gln His Ala Ser Pro Asn Ile Ser Leu Thr Leu
Ser Tyr Phe Pro 485 490
495Asn Ile Asn Glu Asn Thr Ala Tyr Ser Gly Glu Asn Glu Asn Asp Cys
500 505 510Asp Ala Glu Leu Arg Ile
Trp Ser Val Gln Glu Asp Asp Leu Ala Ala 515 520
525Gly Leu Ser Trp Ile Pro Phe Phe Gly Pro Gly Ile Glu Gly
Leu Tyr 530 535 540Thr Ala Gly Leu Ile
Lys Asn Gln Asn Asn Leu Val Cys Arg Leu Arg545 550
555 560Arg Leu Ala Asn Gln Thr Ala Lys Ser Leu
Glu Leu Leu Leu Arg Val 565 570
575Thr Thr Glu Glu Arg Thr Phe Ser Leu Ile Asn Arg His Ala Ile Asp
580 585 590Phe Leu Leu Thr Arg
Trp Gly Gly Thr Cys Lys Val Leu Gly Pro Asp 595
600 605Cys Cys Ile Gly Ile Glu Asp Leu Ser Arg Asn Ile
Ser Glu Gln Ile 610 615 620Asp Gln Ile
Lys Lys Asp Glu Gln Lys Glu Gly Thr Gly Trp Gly Leu625
630 635 640Gly Gly Lys Trp Trp Thr Ser
Asp Trp Gly Val Leu Thr Asn Leu Gly 645
650 655Ile Leu Leu Leu Leu Ser Ile Ala Val Leu Ile Ala
Leu Ser Cys Ile 660 665 670Cys
Arg Ile Phe Thr Lys Tyr Ile Gly 675
68024281PRTMarburg virus 24Met Gln Gln Pro Arg Gly Arg Ser Arg Thr Arg
Asn His Gln Thr Ala1 5 10
15Ser Ser Ile Tyr His Glu Thr Gln Leu Pro Ser Lys Pro His Tyr Thr
20 25 30Asn His His Pro Arg Ala Arg
Ser Met Ser Ser Thr Arg Ser Ser Ala 35 40
45Glu Ser Ser Pro Thr Asn His Ile Pro Arg Ala Arg Pro Pro Pro
Thr 50 55 60Phe Asn Leu Ser Lys Pro
Pro Pro Pro Pro Lys Asp Met Cys Arg Asn65 70
75 80Met Lys Ile Gly Leu Pro Cys Thr Asp Pro Thr
Cys Asn Arg Asp His 85 90
95Asp Leu Asp Asn Leu Thr Asn Arg Glu Leu Leu Leu Leu Met Ala Arg
100 105 110Lys Met Leu Pro Asn Thr
Asp Lys Thr Phe Arg Ser Leu Gln Asp Cys 115 120
125Gly Ser Pro Ser Leu Ser Lys Gly Leu Ser Lys Asp Lys Gln
Glu Gln 130 135 140Thr Lys Asp Val Leu
Thr Leu Glu Asn Leu Gly His Ile Leu Asn Tyr145 150
155 160Leu His Arg Ser Asp Ile Gly Lys Leu Asp
Glu Thr Ser Leu Arg Ala 165 170
175Ala Leu Ser Leu Thr Cys Ala Gly Ile Arg Lys Thr Asn Arg Ser Leu
180 185 190Ile Asn Thr Met Thr
Glu Leu His Ile Asn His Glu Asn Leu Pro Gln 195
200 205Asp Gln Asn Gly Val Ile Lys Gln Thr Tyr Thr Gly
Ile His Leu Asp 210 215 220Lys Gly Gly
Gln Phe Glu Ala Ala Leu Trp Gln Gly Trp Asp Lys Arg225
230 235 240Ser Ile Ser Leu Phe Val Gln
Ala Ala Leu Tyr Val Met Asn Asn Ile 245
250 255Pro Cys Glu Ser Ser Thr Ser Val Gln Ala Ser Tyr
Asp His Phe Ile 260 265 270Leu
Pro Gln Ser Gln Ser Lys Gly Gln 275
28025253PRTMarburg virus 25Met Ala Glu Leu Ser Thr Arg Tyr Asn Leu Pro
Ala Asn Val Thr Glu1 5 10
15Lys Ser Ile Asn Leu Asp Leu Asn Ser Thr Ala Arg Trp Ile Lys Glu
20 25 30Pro Ser Val Gly Gly Trp Thr
Val Lys Trp Gly Asn Phe Val Phe His 35 40
45Ile Pro Asn Thr Gly Met Ala Leu Leu His His Leu Lys Ser Asn
Phe 50 55 60Val Val Pro Glu Trp Gln
Gln Thr Arg Asn Leu Phe Ser His Leu Phe65 70
75 80Lys Asn Pro Lys Ser Thr Ile Ile Glu Pro Phe
Leu Ala Leu Arg Ile 85 90
95Leu Leu Gly Val Ala Leu Lys Asp Gln Glu Leu Gln Gln Ser Leu Ile
100 105 110Pro Gly Phe Arg Ser Ile
Val His Met Leu Ser Glu Trp Leu Leu Leu 115 120
125Glu Val Thr Ser Ala Ile His Ile Ser Pro Asn Leu Leu Gly
Ile Tyr 130 135 140Leu Thr Ser Asp Met
Phe Lys Ile Leu Met Ala Gly Val Lys Asn Phe145 150
155 160Phe Asn Lys Met Phe Thr Leu His Val Val
Asn Asp His Gly Lys Pro 165 170
175Ser Ser Ile Glu Ile Lys Leu Thr Gly Gln Gln Ile Ile Ile Thr Arg
180 185 190Val Asn Met Gly Phe
Leu Val Glu Val Arg Arg Ile Asp Ile Glu Pro 195
200 205Cys Cys Gly Glu Thr Val Leu Ser Glu Ser Val Val
Phe Gly Leu Val 210 215 220Ala Glu Ala
Val Leu Arg Glu His Ser Gln Met Glu Lys Gly Gln Pro225
230 235 240Leu Asp Leu Thr Gln Tyr Met
Asn Ser Lys Ile Ala Ile 245
250262331PRTMarburg virus 26 Met Gln His Pro Thr Gln Tyr Pro Asp Ala Arg
Leu Ser Ser Pro Ile1 5 10
15Ile Leu Asp Gln Cys Asp Leu Leu Ala Arg Ser Leu Gly Leu Tyr Ser
20 25 30His Tyr Ser His Asn Pro Lys
Leu Arg Asn Cys Arg Ile Pro His His 35 40
45Ile Tyr Arg Leu Arg Asn Ser Thr Ala Leu Lys Thr Phe Leu Gln
Asn 50 55 60Cys Ser Ile Leu Thr Val
Pro Phe His Ser Ile Trp Asp His Ile Leu65 70
75 80Thr Ser Ile Gln Tyr Asp Ala Ile Asn His Val
Asp Asp Phe Lys Tyr 85 90
95Leu Leu Pro Ser Glu Leu Val Lys Tyr Ala Asn Trp Asp Asn Glu Phe
100 105 110Leu Lys Ala Tyr Leu Asn
Lys Ile Leu Gly Leu Asp His Val Phe Pro 115 120
125Ala Ser Ala Arg Ser Gln Trp Glu Asp Phe Ser Pro Lys Glu
Asn Pro 130 135 140Tyr Tyr Trp Gly Met
Leu Leu Leu Val His Leu Ser Gln Leu Ala Arg145 150
155 160Arg Ile Lys Gly Gln Arg Gly Ser Leu Arg
Ser Asn Trp Lys Phe Ile 165 170
175Gly Thr Asp Leu Glu Leu Phe Gly Ile Ala Asp Phe Ile Ile Phe Lys
180 185 190Val Pro Val Lys Thr
Ile Ile Arg Asn Ala Val Ser Leu Gln Ala Ser 195
200 205Lys Pro Gly Leu Arg Val Trp Tyr Arg Asp Gln Asn
Leu Thr Pro Tyr 210 215 220Leu Cys Asp
Asp Glu Phe Ile Val Ser Val Ala Ser Tyr Glu Cys Phe225
230 235 240Ile Met Ile Lys Asp Val Phe
Ile Glu Arg Tyr Asn Thr Trp Glu Ile 245
250 255Cys Ala Arg Ala Trp Leu Glu Asp Ser Asp Gly Ala
Asp Tyr Leu Pro 260 265 270Leu
Asp Val Leu Gly Glu Leu Tyr Asn Gln Gly Asp Gln Ile Ile Ala 275
280 285Met Tyr Leu Glu Asp Gly Phe Lys Leu
Ile Lys His Leu Glu Pro Leu 290 295
300Cys Val Ser Cys Ile Gln Thr His Gly Ile Phe Thr Pro Gly Lys Tyr305
310 315 320Trp Phe Gln Ser
Gln Arg Ile Glu Ser Tyr Tyr Glu Glu Leu Cys Ser 325
330 335Leu Asn Trp Lys Phe Lys Ile Ser Gly Asn
Lys Ala Glu Cys Ala Gln 340 345
350Asn Phe Ile Lys Thr Ile Ile Gln Gly Lys Leu Thr Pro Gln Gln Tyr
355 360 365Cys Glu Leu Phe Ser Leu Gln
Lys His Trp Gly His Pro Val Leu Tyr 370 375
380Ile Asp Val Ala Leu Asp Lys Val Lys Lys His Ala Gln Ser Val
Lys385 390 395 400Ile Leu
Lys Pro Lys Val Met Phe Glu Thr Phe Cys Val Phe Lys Phe
405 410 415Ile Val Ala Lys Asn His Tyr
His Ser Gln Gly Ser Trp Tyr Lys Thr 420 425
430Thr Met Asp Leu His Leu Thr Pro Tyr Leu Arg Gln His Ile
Val Ser 435 440 445Asn Ser Phe Pro
Ser Gln Ala Glu Ile Tyr Gln His Leu Trp Glu Trp 450
455 460Tyr Phe Val Glu His Glu Pro Leu Phe Ser Thr Lys
Ile Ile Ser Asp465 470 475
480Leu Ser Ile Phe Ile Lys Asp Arg Ala Thr Ala Val Asn Gln Glu Cys
485 490 495Trp Asp Ser Val Phe
Asp Arg Ser Val Leu Gly Tyr Asn Pro Pro Val 500
505 510Arg Phe Gln Ser Lys Arg Val Pro Glu Gln Phe Leu
Gly Gln Ala Asp 515 520 525Phe Ser
Leu Asn Gln Ile Leu Asp Phe Ala Glu Lys Leu Glu Tyr Leu 530
535 540Ala Pro Ser Tyr Arg Asn Phe Ser Phe Ser Leu
Lys Glu Lys Glu Leu545 550 555
560Asn Ile Gly Arg Thr Phe Gly Lys Leu Pro Tyr Arg Val Arg Asn Val
565 570 575Gln Thr Leu Ala
Glu Ala Leu Leu Ala Asp Gly Leu Ala Lys Ala Phe 580
585 590Pro Ser Asn Met Met Val Val Thr Glu Arg Glu
Gln Lys Glu Ala Leu 595 600 605Leu
His Gln Ala Ser Trp His His Asn Ser Ala Ser Ile Gly Glu Asn 610
615 620Ala Ile Val Arg Gly Ala Ser Phe Val Thr
Asp Leu Glu Lys Tyr Asn625 630 635
640Leu Ala Phe Arg Tyr Glu Phe Thr Arg His Phe Ile Asp Tyr Cys
Asn 645 650 655Arg Cys Tyr
Gly Val Lys Asn Leu Phe Asp Trp Met His Phe Leu Ile 660
665 670Pro Leu Cys Tyr Met His Val Ser Asp Phe
Tyr Ser Pro Pro His Cys 675 680
685Val Thr Glu Asp Asn Arg Asn Asn Pro Pro Asp Cys Ala Asn Ala Tyr 690
695 700His Tyr His Leu Gly Gly Ile Glu
Gly Leu Gln Gln Lys Leu Trp Thr705 710
715 720Cys Ile Ser Cys Ala Gln Ile Thr Leu Val Glu Leu
Lys Thr Lys Leu 725 730
735Lys Leu Lys Ser Ser Val Met Gly Asp Asn Gln Cys Ile Thr Thr Leu
740 745 750Ser Leu Phe Pro Ile Asp
Ala Pro Asp Asp Tyr Gln Glu Asn Glu Ala 755 760
765Glu Leu Asn Ala Ala Arg Val Ala Val Glu Leu Ala Ile Thr
Thr Gly 770 775 780Tyr Asp Gly Ile Phe
Leu Lys Pro Glu Glu Thr Phe Val His Ser Gly785 790
795 800Phe Ile Tyr Phe Gly Lys Lys Gln Tyr Leu
Asn Gly Val Gln Leu Pro 805 810
815Gln Ser Leu Lys Thr Met Ala Arg Cys Gly Pro Leu Ser Asp Ser Ile
820 825 830Phe Asp Asp Leu Gln
Gly Ser Leu Ala Ser Ile Gly Thr Ser Phe Glu 835
840 845Arg Gly Thr Ser Glu Thr Arg His Ile Phe Pro Ser
Arg Trp Ile Ala 850 855 860Ser Phe His
Ser Met Leu Ala Ile Asn Leu Leu Asn Gln Asn His Leu865
870 875 880Gly Phe Pro Leu Gly Phe Ser
Ile Asp Ile Ser Cys Phe Lys Lys Pro 885
890 895Leu Thr Phe Ser Glu Lys Leu Ile Ala Leu Ile Thr
Pro Gln Val Leu 900 905 910Gly
Gly Leu Ser Phe Leu Asn Pro Glu Lys Leu Phe Tyr Arg Asn Ile 915
920 925Ser Asp Pro Leu Thr Ser Gly Leu Phe
Gln Leu Lys Asn Ala Leu Glu 930 935
940Phe Leu Glu Lys Glu Glu Leu Phe Tyr Ile Leu Ile Ala Lys Lys Pro945
950 955 960Gly Leu Ala Asp
Ala Ser Asp Phe Val Met Asn Pro Leu Gly Leu Asn 965
970 975Val Pro Gly Ser Arg Glu Ile Ile Thr Phe
Leu Arg Gln Thr Val Arg 980 985
990Glu Asn Ile Thr Ile Thr Ser Gln Asn Arg Ile Ile Asn Ser Leu Phe
995 1000 1005His Ile Gly Ser Asp Leu Glu
Asp Gln Arg Val Cys Glu Trp Leu Leu 1010 1015
1020Ser Ser Asn Pro Val Met Ser Arg Phe Ala Ala Asp Ile Phe Ser
Arg1025 1030 1035 1040Thr
Pro Ser Gly Lys Arg Leu Gln Val Leu Gly Tyr Leu Glu Gly Thr
1045 1050 1055Arg Thr Leu Leu Ala Ser Arg
Thr Ile Ser Leu Thr Thr Glu Gly Thr 1060 1065
1070Met Leu Met Lys Leu Arg Glu Leu Thr Arg Asn Arg Trp Lys
Ser Trp 1075 1080 1085Phe Ser Tyr
Ile Asp Ala Leu Asp Asp Asp Leu Ser Glu Ser Leu Glu 1090
1095 1100Lys Phe Thr Cys Thr Val Asp Ile Ala Asn Phe Leu
Arg Ala Tyr Ser1105 1110 1115
1120Trp Leu Asp Val Leu Lys Gly Lys Arg Leu Ile Gly Ala Thr Leu Pro
1125 1130 1135Cys Leu Leu Glu Gln
Phe Lys Val Lys Trp Ile Asn Leu Ser Glu Asp 1140
1145 1150Leu Arg Glu Gln Phe Asn Met Ser Ser Glu Ser Glu
Ser Thr Ile Asn 1155 1160 1165Leu
Leu Pro Tyr Asp Cys Lys Glu Leu Arg Leu Gly Arg Ser Asn Asp 1170
1175 1180Thr Glu Leu Asn Tyr Val Ser Cys Ala Leu
Asp Arg Lys Val Val Gln1185 1190 1195
1200Lys His Pro Ser Val Asn Arg Leu Ala Trp Thr Ile Gly Asn Arg
Ala 1205 1210 1215Pro Tyr
Ile Gly Ser Arg Thr Glu Asp Lys Ile Gly Tyr Pro Pro Leu 1220
1225 1230Arg Val Asn Cys Pro Ser Ala Ala Leu
Lys Glu Ala Ile Glu Met Val 1235 1240
1245Ser Arg Leu Leu Trp Val Thr Gln Gly Thr Ala Asp Arg Glu Lys Leu
1250 1255 1260Leu Ile Pro Leu Leu Asn Ser
Arg Val Asn Leu Asp Tyr Gln Thr Val1265 1270
1275 1280Leu Asn Phe Leu Pro Thr His Tyr Ser Gly Asn Ile
Val His Arg Tyr 1285 1290
1295Asn Asp Gln Tyr Gly Gln His Ser Phe Met Ala Asn Arg Met Ser Asn
1300 1305 1310Thr Ser Thr Arg Ala Ile
Ile Ser Thr Asn Thr Leu Gly Lys Tyr Ala 1315 1320
1325Gly Gly Gly Gln Ala Ala Val Asp Ser Asn Ile Ile Phe Gln
Asn Thr 1330 1335 1340Ile Asn Leu Gly
Val Ala Val Leu Asp Ile Ala Leu Ser Leu Ala Lys1345 1350
1355 1360Leu Ser Ser Ala Ser Asn Val Thr Phe
Arg Leu Met Leu Asn Lys Cys 1365 1370
1375Cys Thr Arg His Val Pro Ser Glu Tyr Leu Phe Phe Asp Lys Pro
Leu 1380 1385 1390Asp Val Asp
Leu Asn Lys Tyr Met Asp Asn Glu Leu Val Tyr Asp Asn 1395
1400 1405Asp Pro Leu Cys Ser Gly Ile Lys Gly Arg Leu
Gly Arg Val Ser Arg 1410 1415 1420Ser
Thr Leu Ser Leu Ser Leu Asn Val Ser Asp Ile Gly Ser Tyr Asp1425
1430 1435 1440Phe Pro Thr Ile Ala Ala
Trp Thr Leu Gly Glu Thr Ile Val Gly Ser 1445
1450 1455Ile Phe Ser Asp Glu Ser Ser Gln Ser Thr Asp Pro
Ile Ser Ser Gly 1460 1465
1470Cys Thr Lys Thr Phe Val Thr His Phe Leu Val Tyr Pro Val Glu Ser
1475 1480 1485Ile Phe Tyr Ala Phe Gly Ala
Asn Leu Ile Val Glu Ser Leu Ser Leu 1490 1495
1500Ser Arg Ile Lys Ser Ile Lys Asn Leu Ser Asp Leu Thr Phe Leu
Ile1505 1510 1515 1520Ser
Ser Thr Ile Arg Asn Leu Ser His Arg Ser Leu Arg Ile Leu Gln
1525 1530 1535Ser Thr Phe Arg His Glu Leu
Val Leu Thr Arg Leu Ala His His Ile 1540 1545
1550Pro Leu Ile Ser Leu Met Leu Gly Gly Ser Ala Gly Glu Lys
Ser Ser 1555 1560 1565Ser Asp Ala
Val Arg Leu Phe Leu Thr Ala Ser Tyr Gln Asn Phe Ile 1570
1575 1580Asn Asn Phe Ser Cys Leu Met Lys Lys Gly Gln Ser
Ser Leu Pro Val1585 1590 1595
1600Trp Leu Tyr Phe Pro Ser Glu Gly Gln Gln Leu Lys Pro Ile Leu Lys
1605 1610 1615Ile Leu Gln Arg Leu
Ser Asp Leu Leu Ser Pro Asp Lys Val Gln Lys 1620
1625 1630His Gln Ile Leu Ala Asp Thr Cys Cys Pro Ile Asp
Ser Phe Trp Val 1635 1640 1645Tyr
Pro Ser Lys Ser Thr Arg Thr Asn His Tyr Tyr Ala Ser Leu Asn 1650
1655 1660Tyr Trp Arg Asp Lys Ala Asn Lys Val Lys
Asn Thr Pro Phe Ser His1665 1670 1675
1680Leu Ile Asn Cys Ser Phe Leu Glu Leu Ser Ser His Thr Ser Ser
Val 1685 1690 1695Ser Ser
Asn Gln Gln Val Thr Asn Ser Lys Tyr Ile Val His Pro Glu 1700
1705 1710Asn Ile Pro Glu Ile Asn Ala Arg Thr
Lys Leu Ile Asp Tyr Gly Ser 1715 1720
1725Thr Ala Leu Gln Gly Met Asp Ile Lys Met Pro Leu Ser Glu Gln Asn
1730 1735 1740Leu Val Gly Asn Cys Arg Pro
Ser Lys Gly Ile Arg Phe Lys Asp Asn1745 1750
1755 1760Pro Lys Thr Thr Lys His Asp Gln Gly Phe Val Gly
Lys Asp Ser Ser 1765 1770
1775Pro Arg Pro Met Ser Pro Glu Asp Asn Met Gln Thr Pro Ala Tyr Ile
1780 1785 1790His Ser Ser Pro Pro Tyr
Gln Thr Leu Thr Lys Ser Pro Asp Val His 1795 1800
1805Glu Asp Phe Asp Ala Ser Lys Val Ile Leu Asn Ser Glu Ile
Asn Asn 1810 1815 1820Leu Asn Leu Thr
Asp Cys Thr Leu Asn Thr Lys Ser Leu Thr Thr Pro1825 1830
1835 1840Thr Gly Thr Glu Ile Leu Gly Ile Ser
Pro Phe Arg Ser Ser Arg Tyr 1845 1850
1855Ser Ser Thr Ser Arg Glu Arg Ser Arg Leu Ser Arg Glu Gln Ala
Ser 1860 1865 1870Tyr Leu Tyr
Val Asp Cys Ser Asn Ile Pro Ser Ile Ser Leu Asp Pro 1875
1880 1885Gly Phe Gln Asn Met Ser Asp Gln Asn Gln Val
Gln Met Leu Ile Asn 1890 1895 1900Thr
Tyr Lys Arg Asp Leu His Ala Cys Phe Asp Ser Asn Gln Phe Cys1905
1910 1915 1920Arg Phe Thr Gly Val Val
Ser Ser Met His Tyr Lys Leu Tyr Asp Leu 1925
1930 1935Leu Pro Pro Gly Glu Leu Arg Lys Ala Ile Cys Leu
Ala Glu Gly Glu 1940 1945
1950Gly Ser Gly Ala Arg Leu Leu Leu Lys Trp Lys Lys Thr Asp Tyr Leu
1955 1960 1965Phe Phe Asn Thr Leu Ala Thr
Asp Ser Gln Gln Glu Ala Glu Ile Leu 1970 1975
1980Ser Gly Arg Val Ile Pro Arg Met Leu Tyr Asn Ile Asp Arg Leu
Asn1985 1990 1995 2000Ala
Leu Leu Glu Ser Arg Arg Leu Ile Leu Asn Asn Leu Thr Ile Gln
2005 2010 2015Ile Thr Asp Ile Thr Ser Pro
Leu Trp Leu Asp Ser Val Ile Gln Tyr 2020 2025
2030Leu Pro Glu Asp Ser Asp Ile Leu Thr Met Asp Ala Glu Thr
Thr Lys 2035 2040 2045Asp Glu Thr
Arg Glu Gln Leu Tyr Lys Thr Ile Val Asn Ile Trp Thr 2050
2055 2060Arg Thr Ser Pro Asn Ile Pro Lys Ile Ser Ile Ile
Lys Val Phe Leu2065 2070 2075
2080Leu Asp Tyr Glu Gly Thr Leu Phe Leu Met Arg Asn Ala Ile Gln Tyr
2085 2090 2095Tyr Gly Gln Val Gln
Leu Lys Lys Pro Tyr Ser Ser Asn Ala Lys Asn 2100
2105 2110Ser Glu Trp Tyr Leu Cys Cys Gly Lys Arg Arg Ile
Gln Arg Leu Lys 2115 2120 2125Ile
Asp Phe Ser Asp Gln Val Gly Ile Phe Leu Ile Cys Lys Ala Met 2130
2135 2140Ser Arg Gln Arg Gln Ala Ile Pro Tyr Trp
Leu Lys His Ile Glu Lys2145 2150 2155
2160Asn Tyr Pro Ala Ser Leu His Lys Phe Phe Leu Thr Leu Gly Phe
Pro 2165 2170 2175Ser Leu
Glu Ser Ser Phe Cys His Arg Tyr Thr Ile Pro Phe Ser Glu 2180
2185 2190Gly Lys Ala Leu Phe His Lys Val Gln
Ser Tyr Val Arg Gln Gly Lys 2195 2200
2205Gln His Leu His Ser Leu Met Leu Asp Tyr Glu Asn Asn Ser Pro Leu
2210 2215 2220Leu Asp Leu Arg Asn His Phe
Ile Cys Ser Leu Arg Gly Lys Ile Thr2225 2230
2235 2240Lys Tyr Tyr Asn Asp Ile Leu Lys Leu Asn Leu Val
Ile Lys Ala Val 2245 2250
2255Glu Lys Gly Lys Asn Trp Ser Gln Leu Val Glu Thr Leu Pro Asn Met
2260 2265 2270His Ser Val Cys Ile Val
His Val Asp His Glu Cys Phe Gly Cys Glu 2275 2280
2285Lys Arg Leu Leu Leu Lys Leu Asp Phe Ile Arg Asn Thr Lys
Ile Ala 2290 2295 2300Glu Gln Lys Leu
Leu Asn Arg Val Ile Gly Tyr Ile Leu Phe Phe Pro2305 2310
2315 2320Phe Gly Leu Phe Lys Ser Glu Ser Leu
Thr Ala 2325 23302719112DNAMarburg virus 27
gacacacaaa aacaagagat gatgattttg tgtatcatat aaataaagaa gaatattaac
60attgacattg agacttgtca gtctgttaat attcttgaaa agatggattt acatagcttg
120ttagagttgg gtacaaaacc cactgcccct catgttcgta ataagaaggt gatattattt
180gacacaaatc atcaggttag tatctgtaat cagataatag atgcaataaa ctcagggatt
240gatcttggag atcttctaga agggggtttg ctgacgttgt gtgttgaaca ttactataat
300tccgataaag ataaattcaa cacaagtcct atcgcaaaat acttgcgtga tgcgggctat
360gagtttgatg tcgtcaagaa tgcagatgca acccgctttc tggatgtgat tcctaacgaa
420cctcattaca gtcctttaat tttggccctt aagacattgg aaagtactga atctcagagg
480gggagaattg ggctcttttt gtcattttgc agtctttttc tcccgaaact tgttgtcgga
540gatcgggcta gtatcgaaaa ggctttaaga caagtaacag tacatcaaga acaggggatc
600gtcacatacc ctaatcactg gcttactaca ggccatatga aagtaatttt tgggattttg
660aggtctagct ttatcttaaa atttgtgtta attcatcaag gagtaaattt ggtgacaggt
720catgatgcct atgacagtat cattagtaat tcagtaggtc aaactagatt ctcaggactt
780cttattgtga aaacagttct tgagttcatc ttgcaaaaaa ctgattcagg ggtgacacta
840catcctttgg tgcggacctc caaagtaaaa aatgaagttg ctagtttcaa gcaggcgttg
900agcaacctag cccgacatgg agaatacgca ccgttcgcac gggttctgaa tttatcaggg
960attaacaacc tcgaacatgg actctatcct cagctttcgg cgattgcgct gggtgttgca
1020acagcacacg gcagtacatt ggctggtgtc aatgttggcg aacagtatca acagctacga
1080gaggcggcac atgatgcgga aataaaacta caaaggcgac atgaacatca ggaaattcaa
1140gctattgcag aggatgatga ggagaggaag atattagaac aattccacct tcagaaaact
1200gaaatcacac acagtcagac actagccgtc ctcagccaga aacgagaaaa attagctcgt
1260cttgctgcag aaattgaaaa caatattgtg gaagatcagg gatttaaaca atcacagaat
1320caggtgtcac agtcgttttt gaatgaccct acacctgtgg aagtaacggt tcaagccagg
1380cccataaatc gaccaactgc tctgcctccc ccagttgaca acaaaattga gcacgaatct
1440acagaagata gctcttcttc aagcagcttt gttgatctta atgatccatt tgcgctgctg
1500aatgaggacg aagacactct tgacgacagt gtcatgatcc cgagcacaac atcgagagaa
1560tttcaaggga ttccagcacc accaagacaa tctcaggacc tcaacaacag ccaaggaaag
1620caggaagatg aatcaacaaa tccgattaag aaacagtttc tgagatatca agaactgcct
1680ccggttcaag aggatgatga atcggaatac acaaccgact ctcaggagag tatcgaccaa
1740ccaggatctg acaatgaaca aggagttgat cttccacctc ctccattgta cgctcaggaa
1800aaaaggcaag atccaataca gcacccagca gtaagctctc aggatccctt tggcagtatt
1860ggtgatgtaa atggtgatat cttagaaccc ataagatcac cttcttcacc atctgctcct
1920caggaagaca caagggcaag agaagcctat gaattgtcgc ctgatttcac aaattatgag
1980gacaatcagc agaattggcc acaaagagtg gtgacaaaga agggtaggac tttcctttat
2040cctaatgatc ttctgcagac aaatcctcca gaatcactta taacagccct cgtagaggaa
2100taccaaaatc ctgtctcagc taaggagctc caagcagatt ggcccgacat gtcatttgat
2160gaaaggagac atgttgctat gaacttgtag tccagataac acagcacggt tacctactta
2220tctactttga tccgattcgt cctcagatca cagtaatcaa atttatttga atattcaaac
2280tactttttag gatcctatta cttgttacta ttgtgtgaga caacataagc tatcaaataa
2340caatcacggg caagaaccgg gcatactatg gtgatgcgag ggcattattc agtgctacaa
2400attctttttt caattgctat aatgatacaa ctacgaacct ccatacattt gccgcaatac
2460tgtaatcaac actgctgtat ctctccttca agccatctga tttaacttaa taaacatgac
2520ttgattcaga gagtgtgctg aaaatgttat tgattgagct tctcaaatgg tgcactatcc
2580tactgttttg ctcagcctag tatactgtaa catataagtg gactctccac ttctcttctc
2640gagtattccc tataagtgat ttacttgata gaatgtcaag tccactggtt tggagtttcc
2700ttactctaat gattgtaata attaactgtt ggcttagatg ataacagata cgaggttata
2760taattactca tagtataaag tataattctt gcctctgttt cttctgtttt ctctttcctt
2820tgtaatatgc caattaagaa aaactaaaaa tcgaagaata ttaaaggttt tctttaatat
2880tcagaaaagg ttttttattc tcttctttct ttttgcaaac atattgaaat aataattttc
2940acaatgtggg actcatcata tatgcaacaa gtcagtgagg ggttgatgac tggaaaagtt
3000cccatagatc aagtgtttgg tgccaatccc tcagagaagt tacacaagag aaggaaacca
3060aaaggcacag ttggactaca atgcagccct tgtctaatgt caaaggcgac aagcactgat
3120gatattgttt gggaccaact gatcgtgaag aaaacactag ctgatctact tataccgata
3180aataggcaga tatcggacat tcaaagcact ctaaacgaag taacaacaag agtccatgaa
3240attgagcggc aattacatga gataacccca gtgttaaaaa tgggaaggac actggaagca
3300atttccaagg ggatgtcaga aatgttagcc aaatacgacc acctcgtaat ttcaactgga
3360agaaccactg caccagctgc tgcctttgat gcttacttaa atgagcatgg tgtccctccc
3420ccccaacctg cgattttcaa agatcttggg gttgctcaac aagcttgtag taaggggacc
3480atggttaaaa atgaaacaac agatgcagcc gacaagatgt cgaaagttct tgaactcagt
3540gaggagacgt tctccaagcc aaatctttca gctaaggatt tagccctttt gttgtttacc
3600catctacccg gcaacaacac tccattccat atcctagctc aagtcctttc aaaaattgct
3660tacaagtcag gaaagtccgg agcatttttg gatgcatttc accagattct aagtgaagga
3720gagaatgctc aggcagcatt gactcgacta agcagaacat ttgatgcttt cctcggagta
3780gttcctccag tgataagagt caaaaacttc caaacagtcc ctcgcccatg tcaaaaaagt
3840cttcgggctg ttcctcccaa cccaacaatt gacaaaggat gggtctgtgt ttattcatct
3900gagcaaggtg agacacgggc cctgaaaatc taattctcat tgttaacagt tgcaggggga
3960gtgatctttc cgagttgata caaagacact aaacatttca aaagcatata tgtgggcaaa
4020acgtgactag accatcttaa tagaagtagt aatttatttc tgtcttaagt gtgattttca
4080ccttgaaaga gttaaatggt gatagattaa tccttgaagt aactttttta tatattatag
4140aggaactaat attactaaca aaaggggtct acctaacagg tatgactgag tgatcagtat
4200attttataaa ccaagcaatt gacttctcac tttttaagaa tcaactaaca acatagaaaa
4260catatttatc cttgtgtaat tctcggctta gttggaatta acttttgttg caattcaaga
4320cgcttattca tagtagatta tatgattttt tataagttta agatatctta aattataccc
4380acaagagata ctgttttaat taagaaaaac tatgaagaac attaagaaga tctttctctc
4440gtagtgttct tttactggaa ggagtatccc aatctcagct tgttgaatta attgttactt
4500aagtcattct ttttaaaatt aattcacaca aggtagtttg ggtttatatc tagaacaaat
4560tttaatatgg ccagttccag caattacaac acatacatgc aatacttgaa cccccctcct
4620tatgctgatc acggtgcaaa ccagttgatc ccggcggatc agctatcaaa tcagcagggt
4680ataactccaa attatgtggg tgacttaaac ctagatgatc agttcaaagg gaatgtctgc
4740catgctttca ctttagaggc aataattgac atatctgcgt ataatgaacc aacagtcaaa
4800ggtgttccag catggctgcc tctcgggatt atgagcaatt ttgaatatcc tttagctcat
4860actgtggctg cgttgctcac aggcagctat acaatcaccc aatttactca taatgggcaa
4920aaattcgtcc gtgtaaatcg actcggtaca ggaatcccag cacacccact cagaatgttg
4980cgtgaaggaa atcaagcttt tattcagaat atggtgatcc ccagaaattt ttccactaat
5040caattcacct acaatctcac taacttagta ttgagtgtgc aaaagcttcc tgatgatgcc
5100tggcgcccat ccaaggacaa attaattggg aacaccatgc atcccgcagt ctccatacac
5160ccgaatttgc cacccattgt tctaccaaca gtcaagaagc aggcttatcg tcagcataaa
5220aatcccaaca atggaccact gctggccata tctggcatcc ttcaccaact gagggtcgag
5280aaagtcccag agaagacaag cctgtttagg atttcacttc ctgccgatat gttctcagta
5340aaagaaggta tgatgaagaa aaggggagaa aattccccgg tggtttattt tcaagcacct
5400gagaacttcc ctttgaatgg cttcaacaac agacaagttg tactagcgta tgcgaatcca
5460acgctcagtg ccgtttgaaa taatgctcaa atgagacagg agtccatctg cataagaagc
5520atggcctaaa tgggtgtctg ttaagttctc acaagattag tttgtattga tttcaataat
5580gctttaacct tacattgctg ctttaaatgg ttaattaagc tgatcagctt gcaagatgta
5640atctcttttg ggtcatcaga tctataatgg gtttactaga ttatataaaa gaaatagtaa
5700tgttttataa acaattcttg cttagtttta ctttgattta ctaacatata tcattgtgcc
5760cttcattgct aagtaaactc aactgatgat gatattcctt ctgaaatagt aagaaaaact
5820aatgaagaac attaattgcc gggtaagagt gattaagttc tttaaatttg accaaagtaa
5880tgttttgtta gtgaatacat tcttatattg cttgattaaa aacaagaaat tatcctaaca
5940tgaagaccac atgtctcttt atcagtctta tcttaatcca agggataaaa actctcccta
6000ttttagagat agctagtaac aatcaacccc aaaatgtgga ttcggtatgc tccggaactc
6060tccagaagac agaagatgtc catctgatgg gattcacact gagtgggcaa aaagttgctg
6120attccccttt ggaggcatcc aagcgatggg ctttcaggac aggtgtacct cccaagaatg
6180ttgagtatac agaaggggag gaagccaaaa catgctacaa tataagtgta acggatccct
6240ctggaaaatc cttgctgttg gatcctccta ccaacatccg tgactatcct aaatgcaaaa
6300ctatccatca tattcaaggt caaaaccctc atgcgcaagg gatcgccctc catttgtggg
6360gagcattttt cctgtatgat cgcattgcct ccacaacaat gtaccgaggc agagtcttca
6420ctgaagggaa catagcagct atgattgtca ataagacagt gcacaaaatg attttctcga
6480ggcaaggaca ggggtaccgt cacatgaatc tgacttctac taataaatat tggacaagta
6540acaatggaac acaaacgaat gacactggat gcttcggtgc tcttcaagaa tacaactcca
6600cgaagaatca aacatgtgct ccgtccaaaa taccctcacc actgcccaca gcccgtccag
6660agatcaaacc cacaagcacc ccaactgatg ccaccacact caacaccaca gacccaaaca
6720atgatgatga ggacctcata acatccggtt cagggtccgg agaacaggaa ccctatacaa
6780cttcagatgc ggtcactaag caagggcttt catcaacaat gccacccact ccctcaccac
6840aaccaagcac gccacagcaa gaaggaaaca acacagacca ttcccaaggt actgtgactg
6900aacccaacaa aaccaacaca acggcacaac cgtccatgcc cccccacaac accactgcaa
6960tctctactaa caacacctcc aagaacaact tcagcaccct ctctgtatca ctacaaaaca
7020ccaccaatta cgacacacag agcacagcca ctgaaaatga acaaaccagt gccccctcga
7080aaacaaccct gcctccaaca ggaaatctta ccacagcaaa gagcactaac aacacgaaag
7140gccccaccac aacggcacca aatatgacaa atgggcattt aaccagtccc tcccccaccc
7200ccaacccgac cacacaacat cttgtatatt tcagaaagaa acgaagtatc ctctggaggg
7260aaggcgacat gtttcctttt ctggacgggt taataaatgc tccaattgat tttgatccag
7320ttccaaatac aaagacgatc tttgatgaat cttctagttc tggtgcttcg gctgaggaag
7380atcaacatgc ctcccccaat atcagtttaa ctttatccta ttttcctaat ataaatgaaa
7440acactgccta ctctggagaa aatgagaacg attgtgatgc agagttaaga atttggagcg
7500ttcaggagga tgacctggca gcagggctca gttggatacc gttttttggc cctggaatcg
7560aaggacttta tactgctggt ttaattaaaa accaaaacaa tttggtctgc aggttgaggc
7620gtctagccaa tcaaactgcc aaatccttgg aactcttatt aagagtcaca accgaggaaa
7680ggacattttc cttaattaat agacatgcca ttgactttct actcacaagg tggggaggaa
7740catgcaaagt gcttggacct gattgttgca ttggaataga agacttgtcc aggaatattt
7800cggaacaaat tgaccaaatc aaaaaagatg aacaaaaaga ggggactggt tggggtctag
7860gtggtaaatg gtggacatcc gactggggtg ttcttactaa cttgggcatt ttgctactat
7920tatccatagc tgtcttgatt gctctatcct gtatttgtcg tatctttacc aaatatatcg
7980ggtaatatta agtgtgtatt gattaaagct ttaggacaat tgctactgag cccttcttct
8040aatctactga aatcaacttg ggagattttt aagaagctga taatttaatg tgaatcagta
8100gtttacgtat tgttgattgt tatggtttga tattcaattg ttatcatagt caagagtaac
8160cttttctatt tgatgcatta atgttttaaa ctacctctta agcttttgtg gatggtttca
8220atatgtgcgt agaggttaat ttaaagagat ttcttgttgc acagtttttt gtattactta
8280cttgggcttg aagacatagt taagactggc cgaaaatgct ctccagtcaa ctccattccc
8340cctcagaaga gacgtgccgt tcaaagagtc ttgatttata actaaccatt gtaagaatta
8400atttactctt tccgttatac ttatctacat taattccttg aatgtccagc atcattaacg
8460acttgtctta attcaatctt ttggatgcaa accataagga aaaatgagcc actttccctc
8520tactctgaac taaggaaatt tctcttatca gcctaaaatc tgatccgtta ggtcatgggc
8580ccttcataat ctgtttgagc atgaatgttg atcaaatgac caaataatag tgcatttgta
8640tagattcaat tatcctttat taagaaaaag atagacagaa cacaaagaat tgataaaata
8700ttactttgat caattttgcg aggaattata aaaatcttga gggacaaatt attgtaacgt
8760agagtcgaag aacattaagt gttctttgtt agaattattc atccaagttg ttttgagtat
8820actcgcttca atacaacttc ccttcatatt tgattcaaga tttaaaatgc aacaaccccg
8880tggaaggagt cgaactcgca accaccaaac cgcatcatct atatatcatg aaactcagtt
8940gccctccaaa cctcactaca ccaatcatca tccacgtgca agatcgatga gctcaacccg
9000cagtagtgca gaaagcagtc ccaccaatca tattccccgt gctcgaccac ccccaacatt
9060caacttatcg aaaccccctc ctcctccaaa agacatgtgt aggaacatga aaattggatt
9120gccgtgcact gatcccactt gtaatagaga tcatgacctt gataatctaa caaatcgtga
9180acttttgcta ttgatggccc gaaaaatgct ccccaataca gacaagactt ttagaagtct
9240gcaggattgt gggtcaccgt ctctttctaa agggctctca aaagataaac aggagcaaac
9300gaaagatgtg ttgaccttgg aaaatctagg acacattctg aactacctcc acagatcaga
9360tattgggaaa ttggatgaga catcactccg tgcagcatta agtttgacgt gcgctggaat
9420tcgaaagacg aatagatcct tgatcaacac catgaccgaa ttacacatta accatgaaaa
9480tctcccgcaa gaccaaaacg gtgttatcaa acagacatat acaggtattc accttgacaa
9540aggaggtcaa ttcgaagccg ccttatggca aggttgggat aagagatcga tatctttatt
9600cgtacaagca gctttatatg taatgaacaa tatcccttgt gaatcatcaa ccagtgtgca
9660agcctcatac gatcatttta ttcttcctca aagtcaaagt aaaggacaat gattattgtt
9720tgaaagttga caatcaaatc actttcagtt tttagtttca actcttattg cgagacttga
9780acacaattct actaacttca ataagtgacc ccaaattcaa gtttactgaa gactacgacg
9840ataataatca ccaattcatt gtaaattact cgattaaaat attcttaagc tatcttaaac
9900ttgatgatgc agctctgttt cacctttctg ttgatttcaa tgttacagct atatctaagt
9960gtctaattaa caacttgtac ctctaaggaa aatcatgaag aacattaaga aaaaggatgt
10020tcttattttt caactaaact tgcatatcct ttgttgatac ccttgagaga caacttttga
10080cactagatca cggatcaagc atatttcatt caaacacccc aaattttcaa tcatacacat
10140aataaccatt ttagtagcgt tacctttcaa tacaatctag gtgattgtga aaagacttcc
10200aaacatggca gaattatcaa cgcgttacaa cttgcctgca aatgttacgg aaaaaagcat
10260aaatcttgac cttaattcca cagcacgatg gataaaagaa cccagtgttg ggggctggac
10320agtgaagtgg ggaaactttg ttttccacat accaaatact gggatggcat tgttgcatca
10380tttaaagtct aacttcgttg ttccagagtg gcaacaaaca aggaatctat tctcccacct
10440ctttaaaaac ccaaagtcaa caattataga accgttcttg gctttgagga tcttgcttgg
10500agttgctttg aaggatcaag aattacagca atcattaatt cctggattta gatctattgt
10560tcatatgctt tcagaatggt tgctcctaga ggtaacgtcg gcaatccata ttagccccaa
10620tctgttggga atctatttga cctcagacat gtttaagatt ctgatggcag gtgtgaaaaa
10680tttctttaat aagatgttca ctcttcatgt tgtaaatgac cacggaaaac ccagcagtat
10740tgaaataaag ttaactggac aacagatcat tatcactcgt gttaatatgg ggtttctagt
10800ggaagtcagg aggattgata ttgaaccttg ttgtggtgag acagtcctct cagaatcagt
10860tgtttttggg ctagtggctg aggcagttct aagagaacac agtcaaatgg agaagggcca
10920acccctcgat ctgacacaat acatgaacag caaaattgct atataagtgg cttaaattag
10980catggatatt catagtttaa ccacataata atgttggagg cacagtacat tatagttaat
11040tatcctgtat aacaaagaat atacctaccc tgatttatat ttactggtat aaaatagtgg
11100tatcatctta ttaaatagtt gtcatataac aggctgttcc tataatctga ttgtgagatt
11160ataaacttgt agaattaccg tggatcacaa ctgttgcata tcttccaaaa tatatctttt
11220gcaagcgatg tgtgcttgaa tacgtcgata taatacatac taataacgat tgattaagaa
11280aaaccaatga tggatattaa atatccatca agcaggtgtc gcagaatacc aggggtttca
11340tatgctgcca tatttactaa atcttacata ggattatatc attctcttcg atacacgtta
11400tatctttagc aaagtaatga aaatagcctt gtcatgttag acgccagtta tccatcttaa
11460gtgaatcctt tcttcaatat gcagcatcca actcaatatc ctgatgcaag gttgtcctcc
11520cctataatcc tagaccagtg tgacttatta gccagaagtt tagggttgta tagtcattat
11580tcacataatc cgaaattgcg taattgtagg attccacatc atatttaccg tttaaggaat
11640tcgacagcat taaaaacatt tcttcagaac tgttcaatac tcaccgtccc ttttcattca
11700atctgggatc atattttaac ttccattcaa tatgatgcaa ttaatcatgt tgatgatttt
11760aaatacctat tgccctctga gctagtcaag tatgcaaatt gggacaacga gttcttgaag
11820gcatatctta ataagatctt aggacttgac catgtttttc cagcttctgc aaggtcacaa
11880tgggaggatt tttctcctaa ggaaaatcct tattattggg ggatgctgtt actcgtgcat
11940ttatctcaac ttgccaggag gataaaagga caaagagggt cattaagaag taactggaag
12000tttataggaa cagatttaga gctgtttgga atagcagatt ttattatttt taaagttcca
12060gtaaaaacaa taatccgaaa tgctgtaagc ttacaagctt caaaaccagg gttaagagta
12120tggtaccgtg accaaaactt gaccccttat ctatgcgatg atgagtttat tgtaagcgtc
12180gctagttatg aatgttttat catgattaaa gacgtcttca ttgagaggta taacacgtgg
12240gaaatatgtg cccgcgcctg gctcgaagac agtgatggag ctgattatct ccctcttgat
12300gtgttaggtg agttatacaa ccagggagat caaattattg ccatgtactt ggaagacggt
12360ttcaaattga tcaaacactt ggaacccttg tgtgtcagct gtatacaaac acatggcatc
12420tttacaccag gaaaatactg gttccaatca cagaggattg agtcatatta tgaggagctc
12480tgtagtctca attggaaatt taaaatttca ggcaataaag ctgagtgtgc tcaaaacttt
12540attaaaacta taattcaggg gaaattgact cctcaacaat actgtgaatt attctctcta
12600caaaagcatt ggggtcaccc cgttttatac attgatgttg cactagataa ggttaaaaaa
12660catgcgcaat ctgtaaaaat cttaaaacct aaagtcatgt ttgaaacttt ttgtgttttc
12720aaatttatag tagcaaagaa tcattatcat tctcaaggat catggtataa aaccacaatg
12780gatttgcatt taactccata tcttagacaa catattgtgt caaattcatt tccgtcacaa
12840gccgaaattt atcagcatct ttgggagtgg tatttcgtgg agcatgaacc tcttttctca
12900actaaaataa taagtgattt aagtattttt ataaaagaca gggctactgc tgtgaaccag
12960gagtgttggg acagtgtttt cgatagaagt gtattagggt ataaccctcc tgttagattt
13020cagtcaaaga gagtgccaga gcaatttttg ggccaagcag acttttcctt gaatcaaata
13080ttggattttg ctgaaaagtt agaatatttg gctccttctt ataggaattt ttccttctca
13140ttaaaagaaa aagagttgaa tataggaaga acttttggga aattaccata tcgtgtcaga
13200aatgtccaaa cactcgcaga agccttgcta gcagatggac tagcaaaagc attccctagc
13260aacatgatgg ttgttactga gagggaacag aaagaagcat tattgcatca ggcttcttgg
13320caccacaatt cagcaagcat aggggaaaac gctatagtaa ggggtgcaag ttttgttact
13380gatcttgaga aatacaacct tgccttccga tatgaattta cacgacattt catagactac
13440tgtaatcgat gttatggtgt gaagaattta ttcgattgga tgcacttttt aataccacta
13500tgttatatgc atgtcagtga tttttatagc ccaccacatt gcgtaacaga agataaccga
13560aataacccac cggattgtgc taatgcttat cattatcact tagggggtat agagggactt
13620caacagaaat tgtggacatg tatatcatgt gcccagatca cccttgtaga gttaaaaact
13680aaattaaaat taaaatccag tgttatgggt gataatcaat gtataacaac tctaagtctt
13740tttccaattg atgctcccga cgattatcaa gagaacgaag ctgaattaaa tgcggcacga
13800gttgctgtcg aattagctat tactacgggt tatgatggta tatttttgaa gcctgaagaa
13860acatttgtcc attcagggtt catttatttt ggtaaaaagc aatacctcaa cggtgttcaa
13920ctgccacaat cattgaaaac aatggcaaga tgtggaccct tatctgactc tatttttgat
13980gatcttcaag gttccctggc cagtattggt acatcctttg agagaggaac aagtgagaca
14040cggcacattt ttccgagtcg ttggatagct tcatttcatt caatgttagc aataaattta
14100ttaaatcaga atcaccttgg gtttccccta gggttcagta ttgatatttc ttgtttcaaa
14160aagcctctta ccttttcgga aaaattaatt gctcttataa cgccccaagt tctaggaggg
14220ttatcatttt tgaatccgga gaaattgttc taccggaaca taagtgatcc gctcacttcg
14280ggtctatttc aacttaagaa tgcattagaa tttcttgaaa aggaagaatt attctatatc
14340ttgattgcta aaaaacctgg tttagcagat gcctcagatt tcgtcatgaa tccattaggc
14400ttaaatgtac caggatcaag ggaaataata acgttcctta gacaaacagt tcgtgaaaat
14460atcacgatca cgtcacaaaa tagaataata aattcccttt ttcacatagg ttctgattta
14520gaggaccaaa gggtgtgtga gtggctttta tcatcaaacc ccgtaatgag tcgatttgct
14580gctgacatct tttcaagaac gcctagtgga aaacggcttc aggtcttagg ctatctggaa
14640ggaacaagaa cattactagc ttctcggaca ataagtttaa ctacagaagg gacaatgttg
14700atgaaattaa gggaattaac aagaaaccga tggaaaagct ggttttctta tattgatgca
14760ttggacgatg atttatctga gtccttagaa aaattcacat gtactgttga tatagctaat
14820ttcttgaggg catattcatg gctcgacgtc ttaaaaggga aaaggctaat tggtgccaca
14880ttgccatgtt tactagagca atttaaggta aagtggatta atttgtctga ggatttaagg
14940gaacaattta atatgtcttc agaatcagaa tcaactataa atttattgcc gtatgactgc
15000aaggaactgc gacttggaag aagcaatgac acagagttaa actatgtcag ttgtgctctc
15060gaccggaaag ttgtccagaa acatccctct gttaatcgtc tggcttggac aataggaaat
15120cgagcaccgt atataggatc acggacagaa gacaagatcg gttatcctcc cttaagagta
15180aattgtccat cagcggcact taaagaagcc attgagatgg tttctagatt gttgtgggtg
15240actcaaggca ctgcagaccg agaaaaattg cttattcctc tcctcaattc gagggtaaat
15300ctggactatc agacagtgct taacttttta cctacacact actcaggcaa catagttcat
15360agatataatg accaatatgg acaacattcc tttatggcaa acaggatgag taatacatct
15420acacgtgcaa ttatatcaac taacacactg ggcaaatatg ctgggggggg tcaagctgct
15480gttgatagta atataatctt ccaaaatact atcaatttag gagtggcagt tttagatatt
15540gcattatctc ttgctaaatt gtcgtcagca tcaaatgtca ctttccgttt gatgttaaat
15600aagtgctgca cgcggcatgt gccatctgaa tacctatttt ttgataaacc tttagatgtg
15660gatttgaaca agtatatgga caatgagtta gtttatgaca atgaccctct ttgcagtggg
15720attaaaggga gattaggcag agtatcccga tcaacactct cgttgagttt gaatgtcagt
15780gacattggtt cttatgactt tccaactatt gctgcatgga cactaggaga aactatagtc
15840ggaagcattt tttctgatga gtcttctcaa agtacagatc caataagttc aggttgcaca
15900aaaactttcg tcacacattt ccttgtgtat ccagttgaga gtatttttta tgcattcggg
15960gctaacttaa tagtagaaag tttaagtcta agtaggatca aatcaattaa gaacctctca
16020gatttgacat tccttatatc atccacaatc aggaatttat cacatagatc acttcggatt
16080cttcaatcta ccttccgaca tgaattggta ctcacccgac tagcccacca cataccgtta
16140atttctttaa tgttaggggg ttctgcagga gagaaaagtt catcagatgc tgttcggcta
16200tttcttacag caagttacca gaatttcatc aacaacttca gttgtttgat gaaaaagggc
16260cagtcatcac taccggtttg gctttacttt cctagtgaag ggcaacaatt aaaacctata
16320ttaaaaatct tacagagatt atcagacttg ttatcacctg acaaagttca aaagcatcaa
16380atcttagctg acacctgttg tccaattgac agcttttggg tctatccaag caagtccaca
16440aggactaacc actattatgc aagccttaat tattggagag acaaagctaa taaggtcaag
16500aatactcctt tttcgcattt gataaattgt tcatttcttg aactttcttc acacaccagt
16560tcggtctctt ctaatcaaca agtgaccaat tcgaaatata ttgttcatcc agagaatatc
16620cctgaaataa atgcaagaac caaattaata gattatggat caacagctct acaggggatg
16680gatatcaaga tgccactctc ggagcaaaat ctggttggaa attgtcgacc atcaaagggc
16740attagattca aggacaatcc aaaaacaaca aaacatgacc agggatttgt ggggaaggac
16800tcttcaccgc gaccaatgtc ccctgaagac aacatgcaga ctcctgcata catacatagt
16860tcccccccat atcaaaccct tacaaaatca ccagatgtac atgaggactt tgatgcctcg
16920aaggtaatct taaattctga aataaataac cttaacctta cggattgtac gcttaataca
16980aagtcattga caactcctac cgggacagaa atcttaggta taagtccgtt cagatcctct
17040agatattcat caacttccag ggaacggtct cgactatcta gagaacaagc ttcatatttg
17100tatgttgatt gcagtaatat tccctctatc tctctagacc cgggttttca gaatatgtct
17160gatcagaatc aagttcaaat gttaatcaat acctacaaac gtgatttaca tgcttgtttt
17220gatagcaatc aattctgtcg gtttacaggg gtagtctcat caatgcatta caagctttat
17280gatctcttgc ctccaggtga attgagaaag gcaatttgct tggccgaagg agaaggaagt
17340ggtgctcggt tacttttgaa gtggaagaag acggattatt tatttttcaa cactttggct
17400acggattcac agcaagaagc agagatttta agtggccggg taataccgag aatgttatat
17460aacatagata ggttaaatgc tttgcttgaa tcaagaagat taatattgaa caacctaact
17520atccaaatta cagatattac aagtccacta tggctagatt ctgtaataca atacttacct
17580gaagatagcg acattcttac aatggacgca gagaccacta aagatgaaac aagggaacag
17640ctttataaga ctattgtgaa tatttggaca cgtacttctc ctaatattcc aaaaattagc
17700atcatcaagg tatttttatt agactatgaa gggactttgt tcttaatgag gaatgccatt
17760cagtattatg ggcaggttca actcaagaaa ccatatagct caaatgcaaa aaactcagaa
17820tggtacttgt gttgcggtaa acgaagaatt caacgactca aaattgattt ctcagaccag
17880gtaggaattt ttctgatttg taaagcaatg tcgcgccaaa gacaagcaat tccttactgg
17940ttaaaacata tagaaaagaa ttatcctgct tcattacata agtttttcct aactttgggt
18000ttcccttctt tagagtcatc tttctgccat cgttatacta ttccattcag tgaaggaaag
18060gctctttttc ataaggtcca atcttatgtt cgtcaaggca aacaacattt acattctctt
18120atgttggatt atgaaaacaa ttcacctcta ctagacttga gaaatcactt tatttgctca
18180ttgaggggaa agataactaa gtattacaat gatatattaa agttaaatct agttatcaag
18240gcagtagaga aaggtaaaaa ttggtcacaa cttgttgaga cccttcctaa tatgcattca
18300gtctgcatag tacacgtgga tcatgagtgc tttggatgtg agaaacggtt actactcaaa
18360ttggatttta ttagaaacac aaagatcgca gaacaaaaat tacttaatag agtaatcggg
18420tatattttat tctttccgtt cggtctgttt aaatctgaat cattaacagc ataactttaa
18480caaagagaac ttcatttaat tcacgaaaat aatctattta aaaatgaggg ttacattttc
18540tagagtattg tatgagaaat aataaaataa acaagaagaa gaaaaaacta ttagacagct
18600tgctttacac aagataatct tatatcgtct caaaccgtac acaagtaggg aaatcacgcg
18660cacaaattaa cttgtgattg aacgttcggt cacaccagtg gtaacttttc aatgttagtt
18720actcaaatat tattgctcat aattggtatt gatattggta cattgggtga gtccttgagc
18780tttatcctta atataatgta agaaattagg gaaatactga gatatactag ttgaattgag
18840ttatgacata ccatatatca taaatataaa agaacgatct gctgtaatct ataagcatct
18900cttttacata cattggggaa agaactaggt tatcgttgag attaaaaaga ctacgttacg
18960ttttctctga tgacaagtga caaaatttcg tagttaaatt tctagaatgt caatgtgaat
19020gtaaattaag aaaaaccaat atataaaatt aaaaaattaa aaaactttga tataagtaac
19080acaaaacatt cttcatcttt tttgtgtgtc ca
1911228739PRTZaire Ebola virus strain Mayinga 28Met Asp Ser Arg Pro Gln
Lys Ile Trp Met Ala Pro Ser Leu Thr Glu1 5
10 15Ser Asp Met Asp Tyr His Lys Ile Leu Thr Ala Gly
Leu Ser Val Gln 20 25 30Gln
Gly Ile Val Arg Gln Arg Val Ile Pro Val Tyr Gln Val Asn Asn 35
40 45Leu Glu Glu Ile Cys Gln Leu Ile Ile
Gln Ala Phe Glu Ala Gly Val 50 55
60Asp Phe Gln Glu Ser Ala Asp Ser Phe Leu Leu Met Leu Cys Leu His65
70 75 80His Ala Tyr Gln Gly
Asp Tyr Lys Leu Phe Leu Glu Ser Gly Ala Val 85
90 95Lys Tyr Leu Glu Gly His Gly Phe Arg Phe Glu
Val Lys Lys Arg Asp 100 105
110Gly Val Lys Arg Leu Glu Glu Leu Leu Pro Ala Val Ser Ser Gly Lys
115 120 125Asn Ile Lys Arg Thr Leu Ala
Ala Met Pro Glu Glu Glu Thr Thr Glu 130 135
140Ala Asn Ala Gly Gln Phe Leu Ser Phe Ala Ser Leu Phe Leu Pro
Lys145 150 155 160Leu Val
Val Gly Glu Lys Ala Cys Leu Glu Lys Val Gln Arg Gln Ile
165 170 175Gln Val His Ala Glu Gln Gly
Leu Ile Gln Tyr Pro Thr Ala Trp Gln 180 185
190Ser Val Gly His Met Met Val Ile Phe Arg Leu Met Arg Thr
Asn Phe 195 200 205Leu Ile Lys Phe
Leu Leu Ile His Gln Gly Met His Met Val Ala Gly 210
215 220His Asp Ala Asn Asp Ala Val Ile Ser Asn Ser Val
Ala Gln Ala Arg225 230 235
240Phe Ser Gly Leu Leu Ile Val Lys Thr Val Leu Asp His Ile Leu Gln
245 250 255Lys Thr Glu Arg Gly
Val Arg Leu His Pro Leu Ala Arg Thr Ala Lys 260
265 270Val Lys Asn Glu Val Asn Ser Phe Lys Ala Ala Leu
Ser Ser Leu Ala 275 280 285Lys His
Gly Glu Tyr Ala Pro Phe Ala Arg Leu Leu Asn Leu Ser Gly 290
295 300Val Asn Asn Leu Glu His Gly Leu Phe Pro Gln
Leu Ser Ala Ile Ala305 310 315
320Leu Gly Val Ala Thr Ala His Gly Ser Thr Leu Ala Gly Val Asn Val
325 330 335Gly Glu Gln Tyr
Gln Gln Leu Arg Glu Ala Ala Thr Glu Ala Glu Lys 340
345 350Gln Leu Gln Gln Tyr Ala Glu Ser Arg Glu Leu
Asp His Leu Gly Leu 355 360 365Asp
Asp Gln Glu Lys Lys Ile Leu Met Asn Phe His Gln Lys Lys Asn 370
375 380Glu Ile Ser Phe Gln Gln Thr Asn Ala Met
Val Thr Leu Arg Lys Glu385 390 395
400Arg Leu Ala Lys Leu Thr Glu Ala Ile Thr Ala Ala Ser Leu Pro
Lys 405 410 415Thr Ser Gly
His Tyr Asp Asp Asp Asp Asp Ile Pro Phe Pro Gly Pro 420
425 430Ile Asn Asp Asp Asp Asn Pro Gly His Gln
Asp Asp Asp Pro Thr Asp 435 440
445Ser Gln Asp Thr Thr Ile Pro Asp Val Val Val Asp Pro Asp Asp Gly 450
455 460Ser Tyr Gly Glu Tyr Gln Ser Tyr
Ser Glu Asn Gly Met Asn Ala Pro465 470
475 480Asp Asp Leu Val Leu Phe Asp Leu Asp Glu Asp Asp
Glu Asp Thr Lys 485 490
495Pro Val Pro Asn Arg Ser Thr Lys Gly Gly Gln Gln Lys Asn Ser Gln
500 505 510Lys Gly Gln His Ile Glu
Gly Arg Gln Thr Gln Ser Arg Pro Ile Gln 515 520
525Asn Val Pro Gly Pro His Arg Thr Ile His His Ala Ser Ala
Pro Leu 530 535 540Thr Asp Asn Asp Arg
Arg Asn Glu Pro Ser Gly Ser Thr Ser Pro Arg545 550
555 560Met Leu Thr Pro Ile Asn Glu Glu Ala Asp
Pro Leu Asp Asp Ala Asp 565 570
575Asp Glu Thr Ser Ser Leu Pro Pro Leu Glu Ser Asp Asp Glu Glu Gln
580 585 590Asp Arg Asp Gly Thr
Ser Asn Arg Thr Pro Thr Val Ala Pro Pro Ala 595
600 605Pro Val Tyr Arg Asp His Ser Glu Lys Lys Glu Leu
Pro Gln Asp Glu 610 615 620Gln Gln Asp
Gln Asp His Thr Gln Glu Ala Arg Asn Gln Asp Ser Asp625
630 635 640Asn Thr Gln Ser Glu His Ser
Phe Glu Glu Met Tyr Arg His Ile Leu 645
650 655Arg Ser Gln Gly Pro Phe Asp Ala Val Leu Tyr Tyr
His Met Met Lys 660 665 670Asp
Glu Pro Val Val Phe Ser Thr Ser Asp Gly Lys Glu Tyr Thr Tyr 675
680 685Pro Asp Ser Leu Glu Glu Glu Tyr Pro
Pro Trp Leu Thr Glu Lys Glu 690 695
700Ala Met Asn Glu Glu Asn Arg Phe Val Thr Leu Asp Gly Gln Gln Phe705
710 715 720Tyr Trp Pro Val
Met Asn His Lys Asn Lys Phe Met Ala Ile Leu Gln 725
730 735His His Gln29340PRTZaire Ebola virus
strain Mayinga 29Met Thr Thr Arg Thr Lys Gly Arg Gly His Thr Ala Ala Thr
Thr Gln1 5 10 15Asn Asp
Arg Met Pro Gly Pro Glu Leu Ser Gly Trp Ile Ser Glu Gln 20
25 30Leu Met Thr Gly Arg Ile Pro Val Ser
Asp Ile Phe Cys Asp Ile Glu 35 40
45Asn Asn Pro Gly Leu Cys Tyr Ala Ser Gln Met Gln Gln Thr Lys Pro 50
55 60Asn Pro Lys Thr Arg Asn Ser Gln Thr
Gln Thr Asp Pro Ile Cys Asn65 70 75
80His Ser Phe Glu Glu Val Val Gln Thr Leu Ala Ser Leu Ala
Thr Val 85 90 95Val Gln
Gln Gln Thr Ile Ala Ser Glu Ser Leu Glu Gln Arg Ile Thr 100
105 110Ser Leu Glu Asn Gly Leu Lys Pro Val
Tyr Asp Met Ala Lys Thr Ile 115 120
125Ser Ser Leu Asn Arg Val Cys Ala Glu Met Val Ala Lys Tyr Asp Leu
130 135 140Leu Val Met Thr Thr Gly Arg
Ala Thr Ala Thr Ala Ala Ala Thr Glu145 150
155 160Ala Tyr Trp Ala Glu His Gly Gln Pro Pro Pro Gly
Pro Ser Leu Tyr 165 170
175Glu Glu Ser Ala Ile Arg Gly Lys Ile Glu Ser Arg Asp Glu Thr Val
180 185 190Pro Gln Ser Val Arg Glu
Ala Phe Asn Asn Leu Asn Ser Thr Thr Ser 195 200
205Leu Thr Glu Glu Asn Phe Gly Lys Pro Asp Ile Ser Ala Lys
Asp Leu 210 215 220Arg Asn Ile Met Tyr
Asp His Leu Pro Gly Phe Gly Thr Ala Phe His225 230
235 240Gln Leu Val Gln Val Ile Cys Lys Leu Gly
Lys Asp Ser Asn Ser Leu 245 250
255Asp Ile Ile His Ala Glu Phe Gln Ala Ser Leu Ala Glu Gly Asp Ser
260 265 270Pro Gln Cys Ala Leu
Ile Gln Ile Thr Lys Arg Val Pro Ile Phe Gln 275
280 285Asp Ala Ala Pro Pro Val Ile His Ile Arg Ser Arg
Gly Asp Ile Pro 290 295 300Arg Ala Cys
Gln Lys Ser Leu Arg Pro Val Pro Pro Ser Pro Lys Ile305
310 315 320Asp Arg Gly Trp Val Cys Val
Phe Gln Leu Gln Asp Gly Lys Thr Leu 325
330 335Gly Leu Lys Ile 34030326PRTZaire Ebola
virus strain Mayinga 30Met Arg Arg Val Ile Leu Pro Thr Ala Pro Pro Glu
Tyr Met Glu Ala1 5 10
15Ile Tyr Pro Val Arg Ser Asn Ser Thr Ile Ala Arg Gly Gly Asn Ser
20 25 30Asn Thr Gly Phe Leu Thr Pro
Glu Ser Val Asn Gly Asp Thr Pro Ser 35 40
45Asn Pro Leu Arg Pro Ile Ala Asp Asp Thr Ile Asp His Ala Ser
His 50 55 60Thr Pro Gly Ser Val Ser
Ser Ala Phe Ile Leu Glu Ala Met Val Asn65 70
75 80Val Ile Ser Gly Pro Lys Val Leu Met Lys Gln
Ile Pro Ile Trp Leu 85 90
95Pro Leu Gly Val Ala Asp Gln Lys Thr Tyr Ser Phe Asp Ser Thr Thr
100 105 110Ala Ala Ile Met Leu Ala
Ser Tyr Thr Ile Thr His Phe Gly Lys Ala 115 120
125Thr Asn Pro Leu Val Arg Val Asn Arg Leu Gly Pro Gly Ile
Pro Asp 130 135 140His Pro Leu Arg Leu
Leu Arg Ile Gly Asn Gln Ala Phe Leu Gln Glu145 150
155 160Phe Val Leu Pro Pro Val Gln Leu Pro Gln
Tyr Phe Thr Phe Asp Leu 165 170
175Thr Ala Leu Lys Leu Ile Thr Gln Pro Leu Pro Ala Ala Thr Trp Thr
180 185 190Asp Asp Thr Pro Thr
Gly Ser Asn Gly Ala Leu Arg Pro Gly Ile Ser 195
200 205Phe His Pro Lys Leu Arg Pro Ile Leu Leu Pro Asn
Lys Ser Gly Lys 210 215 220Lys Gly Asn
Ser Ala Asp Leu Thr Ser Pro Glu Lys Ile Gln Ala Ile225
230 235 240Met Thr Ser Leu Gln Asp Phe
Lys Ile Val Pro Ile Asp Pro Thr Lys 245
250 255Asn Ile Met Gly Ile Glu Val Pro Glu Thr Leu Val
His Lys Leu Thr 260 265 270Gly
Lys Lys Val Thr Ser Lys Asn Gly Gln Pro Ile Ile Pro Val Leu 275
280 285Leu Pro Lys Tyr Ile Gly Leu Asp Pro
Val Ala Pro Gly Asp Leu Thr 290 295
300Met Val Ile Thr Gln Asp Cys Asp Thr Cys His Ser Pro Ala Ser Leu305
310 315 320Pro Ala Val Ile
Glu Lys 32531676PRTZaire Ebola virus strain Mayinga 31Met
Gly Val Thr Gly Ile Leu Gln Leu Pro Arg Asp Arg Phe Lys Arg1
5 10 15Thr Ser Phe Phe Leu Trp Val
Ile Ile Leu Phe Gln Arg Thr Phe Ser 20 25
30Ile Pro Leu Gly Val Ile His Asn Ser Thr Leu Gln Val Ser
Asp Val 35 40 45Asp Lys Leu Val
Cys Arg Asp Lys Leu Ser Ser Thr Asn Gln Leu Arg 50 55
60Ser Val Gly Leu Asn Leu Glu Gly Asn Gly Val Ala Thr
Asp Val Pro65 70 75
80Ser Ala Thr Lys Arg Trp Gly Phe Arg Ser Gly Val Pro Pro Lys Val
85 90 95Val Asn Tyr Glu Ala Gly
Glu Trp Ala Glu Asn Cys Tyr Asn Leu Glu 100
105 110Ile Lys Lys Pro Asp Gly Ser Glu Cys Leu Pro Ala
Ala Pro Asp Gly 115 120 125Ile Arg
Gly Phe Pro Arg Cys Arg Tyr Val His Lys Val Ser Gly Thr 130
135 140Gly Pro Cys Ala Gly Asp Phe Ala Phe His Lys
Glu Gly Ala Phe Phe145 150 155
160Leu Tyr Asp Arg Leu Ala Ser Thr Val Ile Tyr Arg Gly Thr Thr Phe
165 170 175Ala Glu Gly Val
Val Ala Phe Leu Ile Leu Pro Gln Ala Lys Lys Asp 180
185 190Phe Phe Ser Ser His Pro Leu Arg Glu Pro Val
Asn Ala Thr Glu Asp 195 200 205Pro
Ser Ser Gly Tyr Tyr Ser Thr Thr Ile Arg Tyr Gln Ala Thr Gly 210
215 220Phe Gly Thr Asn Glu Thr Glu Tyr Leu Phe
Glu Val Asp Asn Leu Thr225 230 235
240Tyr Val Gln Leu Glu Ser Arg Phe Thr Pro Gln Phe Leu Leu Gln
Leu 245 250 255Asn Glu Thr
Ile Tyr Thr Ser Gly Lys Arg Ser Asn Thr Thr Gly Lys 260
265 270Leu Ile Trp Lys Val Asn Pro Glu Ile Asp
Thr Thr Ile Gly Glu Trp 275 280
285Ala Phe Trp Glu Thr Lys Lys Asn Leu Thr Arg Lys Ile Arg Ser Glu 290
295 300Glu Leu Ser Phe Thr Val Val Ser
Asn Gly Ala Lys Asn Ile Ser Gly305 310
315 320Gln Ser Pro Ala Arg Thr Ser Ser Asp Pro Gly Thr
Asn Thr Thr Thr 325 330
335Glu Asp His Lys Ile Met Ala Ser Glu Asn Ser Ser Ala Met Val Gln
340 345 350Val His Ser Gln Gly Arg
Glu Ala Ala Val Ser His Leu Thr Thr Leu 355 360
365Ala Thr Ile Ser Thr Ser Pro Gln Ser Leu Thr Thr Lys Pro
Gly Pro 370 375 380Asp Asn Ser Thr His
Asn Thr Pro Val Tyr Lys Leu Asp Ile Ser Glu385 390
395 400Ala Thr Gln Val Glu Gln His His Arg Arg
Thr Asp Asn Asp Ser Thr 405 410
415Ala Ser Asp Thr Pro Ser Ala Thr Thr Ala Ala Gly Pro Pro Lys Ala
420 425 430Glu Asn Thr Asn Thr
Ser Lys Ser Thr Asp Phe Leu Asp Pro Ala Thr 435
440 445Thr Thr Ser Pro Gln Asn His Ser Glu Thr Ala Gly
Asn Asn Asn Thr 450 455 460His His Gln
Asp Thr Gly Glu Glu Ser Ala Ser Ser Gly Lys Leu Gly465
470 475 480Leu Ile Thr Asn Thr Ile Ala
Gly Val Ala Gly Leu Ile Thr Gly Gly 485
490 495Arg Arg Thr Arg Arg Glu Ala Ile Val Asn Ala Gln
Pro Lys Cys Asn 500 505 510Pro
Asn Leu His Tyr Trp Thr Thr Gln Asp Glu Gly Ala Ala Ile Gly 515
520 525Leu Ala Trp Ile Pro Tyr Phe Gly Pro
Ala Ala Glu Gly Ile Tyr Ile 530 535
540Glu Gly Leu Met His Asn Gln Asp Gly Leu Ile Cys Gly Leu Arg Gln545
550 555 560Leu Ala Asn Glu
Thr Thr Gln Ala Leu Gln Leu Phe Leu Arg Ala Thr 565
570 575Thr Glu Leu Arg Thr Phe Ser Ile Leu Asn
Arg Lys Ala Ile Asp Phe 580 585
590Leu Leu Gln Arg Trp Gly Gly Thr Cys His Ile Leu Gly Pro Asp Cys
595 600 605Cys Ile Glu Pro His Asp Trp
Thr Lys Asn Ile Thr Asp Lys Ile Asp 610 615
620Gln Ile Ile His Asp Phe Val Asp Lys Thr Leu Pro Asp Gln Gly
Asp625 630 635 640Asn Asp
Asn Trp Trp Thr Gly Trp Arg Gln Trp Ile Pro Ala Gly Ile
645 650 655Gly Val Thr Gly Val Ile Ile
Ala Val Ile Ala Leu Phe Cys Ile Cys 660 665
670Lys Phe Val Phe 67532364PRTZaire Ebola virus
strain Mayinga 32Met Gly Val Thr Gly Ile Leu Gln Leu Pro Arg Asp Arg Phe
Lys Arg1 5 10 15Thr Ser
Phe Phe Leu Trp Val Ile Ile Leu Phe Gln Arg Thr Phe Ser 20
25 30Ile Pro Leu Gly Val Ile His Asn Ser
Thr Leu Gln Val Ser Asp Val 35 40
45Asp Lys Leu Val Cys Arg Asp Lys Leu Ser Ser Thr Asn Gln Leu Arg 50
55 60Ser Val Gly Leu Asn Leu Glu Gly Asn
Gly Val Ala Thr Asp Val Pro65 70 75
80Ser Ala Thr Lys Arg Trp Gly Phe Arg Ser Gly Val Pro Pro
Lys Val 85 90 95Val Asn
Tyr Glu Ala Gly Glu Trp Ala Glu Asn Cys Tyr Asn Leu Glu 100
105 110Ile Lys Lys Pro Asp Gly Ser Glu Cys
Leu Pro Ala Ala Pro Asp Gly 115 120
125Ile Arg Gly Phe Pro Arg Cys Arg Tyr Val His Lys Val Ser Gly Thr
130 135 140Gly Pro Cys Ala Gly Asp Phe
Ala Phe His Lys Glu Gly Ala Phe Phe145 150
155 160Leu Tyr Asp Arg Leu Ala Ser Thr Val Ile Tyr Arg
Gly Thr Thr Phe 165 170
175Ala Glu Gly Val Val Ala Phe Leu Ile Leu Pro Gln Ala Lys Lys Asp
180 185 190Phe Phe Ser Ser His Pro
Leu Arg Glu Pro Val Asn Ala Thr Glu Asp 195 200
205Pro Ser Ser Gly Tyr Tyr Ser Thr Thr Ile Arg Tyr Gln Ala
Thr Gly 210 215 220Phe Gly Thr Asn Glu
Thr Glu Tyr Leu Phe Glu Val Asp Asn Leu Thr225 230
235 240Tyr Val Gln Leu Glu Ser Arg Phe Thr Pro
Gln Phe Leu Leu Gln Leu 245 250
255Asn Glu Thr Ile Tyr Thr Ser Gly Lys Arg Ser Asn Thr Thr Gly Lys
260 265 270Leu Ile Trp Lys Val
Asn Pro Glu Ile Asp Thr Thr Ile Gly Glu Trp 275
280 285Ala Phe Trp Glu Thr Lys Lys Thr Ser Leu Glu Lys
Phe Ala Val Lys 290 295 300Ser Cys Leu
Ser Gln Leu Tyr Gln Thr Glu Pro Lys Thr Ser Val Val305
310 315 320Arg Val Arg Arg Glu Leu Leu
Pro Thr Gln Gly Pro Thr Gln Gln Leu 325
330 335Lys Thr Thr Lys Ser Trp Leu Gln Lys Ile Pro Leu
Gln Trp Phe Lys 340 345 350Cys
Thr Val Lys Glu Gly Lys Leu Gln Cys Arg Ile 355
36033297PRTZaire Ebola virus strain Mayinga 33Met Gly Val Thr Gly Ile Leu
Gln Leu Pro Arg Asp Arg Phe Lys Arg1 5 10
15Thr Ser Phe Phe Leu Trp Val Ile Ile Leu Phe Gln Arg
Thr Phe Ser 20 25 30Ile Pro
Leu Gly Val Ile His Asn Ser Thr Leu Gln Val Ser Asp Val 35
40 45Asp Lys Leu Val Cys Arg Asp Lys Leu Ser
Ser Thr Asn Gln Leu Arg 50 55 60Ser
Val Gly Leu Asn Leu Glu Gly Asn Gly Val Ala Thr Asp Val Pro65
70 75 80Ser Ala Thr Lys Arg Trp
Gly Phe Arg Ser Gly Val Pro Pro Lys Val 85
90 95Val Asn Tyr Glu Ala Gly Glu Trp Ala Glu Asn Cys
Tyr Asn Leu Glu 100 105 110Ile
Lys Lys Pro Asp Gly Ser Glu Cys Leu Pro Ala Ala Pro Asp Gly 115
120 125Ile Arg Gly Phe Pro Arg Cys Arg Tyr
Val His Lys Val Ser Gly Thr 130 135
140Gly Pro Cys Ala Gly Asp Phe Ala Phe His Lys Glu Gly Ala Phe Phe145
150 155 160Leu Tyr Asp Arg
Leu Ala Ser Thr Val Ile Tyr Arg Gly Thr Thr Phe 165
170 175Ala Glu Gly Val Val Ala Phe Leu Ile Leu
Pro Gln Ala Lys Lys Asp 180 185
190Phe Phe Ser Ser His Pro Leu Arg Glu Pro Val Asn Ala Thr Glu Asp
195 200 205Pro Ser Ser Gly Tyr Tyr Ser
Thr Thr Ile Arg Tyr Gln Ala Thr Gly 210 215
220Phe Gly Thr Asn Glu Thr Glu Tyr Leu Phe Glu Val Asp Asn Leu
Thr225 230 235 240Tyr Val
Gln Leu Glu Ser Arg Phe Thr Pro Gln Phe Leu Leu Gln Leu
245 250 255Asn Glu Thr Ile Tyr Thr Ser
Gly Lys Arg Ser Asn Thr Thr Gly Lys 260 265
270Leu Ile Trp Lys Val Asn Pro Glu Ile Asp Thr Thr Ile Gly
Glu Trp 275 280 285Ala Phe Trp Glu
Thr Lys Lys Pro His 290 29534288PRTZaire Ebola virus
strain Mayinga 34Met Glu Ala Ser Tyr Glu Arg Gly Arg Pro Arg Ala Ala Arg
Gln His1 5 10 15Ser Arg
Asp Gly His Asp His His Val Arg Ala Arg Ser Ser Ser Arg 20
25 30Glu Asn Tyr Arg Gly Glu Tyr Arg Gln
Ser Arg Ser Ala Ser Gln Val 35 40
45Arg Val Pro Thr Val Phe His Lys Lys Arg Val Glu Pro Leu Thr Val 50
55 60Pro Pro Ala Pro Lys Asp Ile Cys Pro
Thr Leu Lys Lys Gly Phe Leu65 70 75
80Cys Asp Ser Ser Phe Cys Lys Lys Asp His Gln Leu Glu Ser
Leu Thr 85 90 95Asp Arg
Glu Leu Leu Leu Leu Ile Ala Arg Lys Thr Cys Gly Ser Val 100
105 110Glu Gln Gln Leu Asn Ile Thr Ala Pro
Lys Asp Ser Arg Leu Ala Asn 115 120
125Pro Thr Ala Asp Asp Phe Gln Gln Glu Glu Gly Pro Lys Ile Thr Leu
130 135 140Leu Thr Leu Ile Lys Thr Ala
Glu His Trp Ala Arg Gln Asp Ile Arg145 150
155 160Thr Ile Glu Asp Ser Lys Leu Arg Ala Leu Leu Thr
Leu Cys Ala Val 165 170
175Met Thr Arg Lys Phe Ser Lys Ser Gln Leu Ser Leu Leu Cys Glu Thr
180 185 190His Leu Arg Arg Glu Gly
Leu Gly Gln Asp Gln Ala Glu Pro Val Leu 195 200
205Glu Val Tyr Gln Arg Leu His Ser Asp Lys Gly Gly Ser Phe
Glu Ala 210 215 220Ala Leu Trp Gln Gln
Trp Asp Arg Gln Ser Leu Ile Met Phe Ile Thr225 230
235 240Ala Phe Leu Asn Ile Ala Leu Gln Leu Pro
Cys Glu Ser Ser Ala Val 245 250
255Val Val Ser Gly Leu Arg Thr Leu Val Pro Gln Ser Asp Asn Glu Glu
260 265 270Ala Ser Thr Asn Pro
Gly Thr Cys Ser Trp Ser Asp Glu Gly Thr Pro 275
280 28535251PRTZaire Ebola virus strain Mayinga 35Met Ala
Lys Ala Thr Gly Arg Tyr Asn Leu Ile Ser Pro Lys Lys Asp1 5
10 15Leu Glu Lys Gly Val Val Leu Ser
Asp Leu Cys Asn Phe Leu Val Ser 20 25
30Gln Thr Ile Gln Gly Trp Lys Val Tyr Trp Ala Gly Ile Glu Phe
Asp 35 40 45Val Thr His Lys Gly
Met Ala Leu Leu His Arg Leu Lys Thr Asn Asp 50 55
60Phe Ala Pro Ala Trp Ser Met Thr Arg Asn Leu Phe Pro His
Leu Phe65 70 75 80Gln
Asn Pro Asn Ser Thr Ile Glu Ser Pro Leu Trp Ala Leu Arg Val
85 90 95Ile Leu Ala Ala Gly Ile Gln
Asp Gln Leu Ile Asp Gln Ser Leu Ile 100 105
110Glu Pro Leu Ala Gly Ala Leu Gly Leu Ile Ser Asp Trp Leu
Leu Thr 115 120 125Thr Asn Thr Asn
His Phe Asn Met Arg Thr Gln Arg Val Lys Glu Gln 130
135 140Leu Ser Leu Lys Met Leu Ser Leu Ile Arg Ser Asn
Ile Leu Lys Phe145 150 155
160Ile Asn Lys Leu Asp Ala Leu His Val Val Asn Tyr Asn Gly Leu Leu
165 170 175Ser Ser Ile Glu Ile
Gly Thr Gln Asn His Thr Ile Ile Ile Thr Arg 180
185 190Thr Asn Met Gly Phe Leu Val Glu Leu Gln Glu Pro
Asp Lys Ser Ala 195 200 205Met Asn
Arg Met Lys Pro Gly Pro Ala Lys Phe Ser Leu Leu His Glu 210
215 220Ser Thr Leu Lys Ala Phe Thr Gln Gly Ser Ser
Thr Arg Met Gln Ser225 230 235
240Leu Ile Leu Glu Phe Asn Ser Ser Leu Ala Ile 245
250362212PRTZaire Ebola virus strain Mayinga 36Met Ala Thr
Gln His Thr Gln Tyr Pro Asp Ala Arg Leu Ser Ser Pro1 5
10 15Ile Val Leu Asp Gln Cys Asp Leu Val Thr
Arg Ala Cys Gly Leu Tyr 20 25
30Ser Ser Tyr Ser Leu Asn Pro Gln Leu Arg Asn Cys Lys Leu Pro Lys
35 40 45His Ile Tyr Arg Leu Lys Tyr Asp
Val Thr Val Thr Lys Phe Leu Ser 50 55
60Asp Val Pro Val Ala Thr Leu Pro Ile Asp Phe Ile Val Pro Val Leu65
70 75 80Leu Lys Ala Leu Ser
Gly Asn Gly Phe Cys Pro Val Glu Pro Arg Cys 85
90 95Gln Gln Phe Leu Asp Glu Ile Ile Lys Tyr Thr
Met Gln Asp Ala Leu 100 105
110Phe Leu Lys Tyr Tyr Leu Lys Asn Val Gly Ala Gln Glu Asp Cys Val
115 120 125Asp Glu His Phe Gln Glu Lys
Ile Leu Ser Ser Ile Gln Gly Asn Glu 130 135
140Phe Leu His Gln Met Phe Phe Trp Tyr Asp Leu Ala Ile Leu Thr
Arg145 150 155 160Arg Gly
Arg Leu Asn Arg Gly Asn Ser Arg Ser Thr Trp Phe Val His
165 170 175Asp Asp Leu Ile Asp Ile Leu
Gly Tyr Gly Asp Tyr Val Phe Trp Lys 180 185
190Ile Pro Ile Ser Met Leu Pro Leu Asn Thr Gln Gly Ile Pro
His Ala 195 200 205Ala Met Asp Trp
Tyr Gln Ala Ser Val Phe Lys Glu Ala Val Gln Gly 210
215 220His Thr His Ile Val Ser Val Ser Thr Ala Asp Val
Leu Ile Met Cys225 230 235
240Lys Asp Leu Ile Thr Cys Arg Phe Asn Thr Thr Leu Ile Ser Lys Ile
245 250 255Ala Glu Ile Glu Asp
Pro Val Cys Ser Asp Tyr Pro Asn Phe Lys Ile 260
265 270Val Ser Met Leu Tyr Gln Ser Gly Asp Tyr Leu Leu
Ser Ile Leu Gly 275 280 285Ser Asp
Gly Tyr Lys Ile Ile Lys Phe Leu Glu Pro Leu Cys Leu Ala 290
295 300Lys Ile Gln Leu Cys Ser Lys Tyr Thr Glu Arg
Lys Gly Arg Phe Leu305 310 315
320Thr Gln Met His Leu Ala Val Asn His Thr Leu Glu Glu Ile Thr Glu
325 330 335Met Arg Ala Leu
Lys Pro Ser Gln Ala Gln Lys Ile Arg Glu Phe His 340
345 350Arg Thr Leu Ile Arg Leu Glu Met Thr Pro Gln
Gln Leu Cys Glu Leu 355 360 365Phe
Ser Ile Gln Lys His Trp Gly His Pro Val Leu His Ser Glu Thr 370
375 380Ala Ile Gln Lys Val Lys Lys His Ala Thr
Val Leu Lys Ala Leu Arg385 390 395
400Pro Ile Val Ile Phe Glu Thr Tyr Cys Val Phe Lys Tyr Ser Ile
Ala 405 410 415Lys His Tyr
Phe Asp Ser Gln Gly Ser Trp Tyr Ser Val Thr Ser Asp 420
425 430Arg Asn Leu Thr Pro Gly Leu Asn Ser Tyr
Ile Lys Arg Asn Gln Phe 435 440
445Pro Pro Leu Pro Met Ile Lys Glu Leu Leu Trp Glu Phe Tyr His Leu 450
455 460Asp His Pro Pro Leu Phe Ser Thr
Lys Ile Ile Ser Asp Leu Ser Ile465 470
475 480Phe Ile Lys Asp Arg Ala Thr Ala Val Glu Arg Thr
Cys Trp Asp Ala 485 490
495Val Phe Glu Pro Asn Val Leu Gly Tyr Asn Pro Pro His Lys Phe Ser
500 505 510Thr Lys Arg Val Pro Glu
Gln Phe Leu Glu Gln Glu Asn Phe Ser Ile 515 520
525Glu Asn Val Leu Ser Tyr Ala Gln Lys Leu Glu Tyr Leu Leu
Pro Gln 530 535 540Tyr Arg Asn Phe Ser
Phe Ser Leu Lys Glu Lys Glu Leu Asn Val Gly545 550
555 560Arg Thr Phe Gly Lys Leu Pro Tyr Pro Thr
Arg Asn Val Gln Thr Leu 565 570
575Cys Glu Ala Leu Leu Ala Asp Gly Leu Ala Lys Ala Phe Pro Ser Asn
580 585 590Met Met Val Val Thr
Glu Arg Glu Gln Lys Glu Ser Leu Leu His Gln 595
600 605Ala Ser Trp His His Thr Ser Asp Asp Phe Gly Glu
His Ala Thr Val 610 615 620Arg Gly Ser
Ser Phe Val Thr Asp Leu Glu Lys Tyr Asn Leu Ala Phe625
630 635 640Arg Tyr Glu Phe Thr Ala Pro
Phe Ile Glu Tyr Cys Asn Arg Cys Tyr 645
650 655Gly Val Lys Asn Val Phe Asn Trp Met His Tyr Thr
Ile Pro Gln Cys 660 665 670Tyr
Met His Val Ser Asp Tyr Tyr Asn Pro Pro His Asn Leu Thr Leu 675
680 685Glu Asn Arg Asp Asn Pro Pro Glu Gly
Pro Ser Ser Tyr Arg Gly His 690 695
700Met Gly Gly Ile Glu Gly Leu Gln Gln Lys Leu Trp Thr Ser Ile Ser705
710 715 720Cys Ala Gln Ile
Ser Leu Val Glu Ile Lys Thr Gly Phe Lys Leu Arg 725
730 735Ser Ala Val Met Gly Asp Asn Gln Cys Ile
Thr Val Leu Ser Val Phe 740 745
750Pro Leu Glu Thr Asp Ala Asp Glu Gln Glu Gln Ser Ala Glu Asp Asn
755 760 765Ala Ala Arg Val Ala Ala Ser
Leu Ala Lys Val Thr Ser Ala Cys Gly 770 775
780Ile Phe Leu Lys Pro Asp Glu Thr Phe Val His Ser Gly Phe Ile
Tyr785 790 795 800Phe Gly
Lys Lys Gln Tyr Leu Asn Gly Val Gln Leu Pro Gln Ser Leu
805 810 815Lys Thr Ala Thr Arg Met Ala
Pro Leu Ser Asp Ala Ile Phe Asp Asp 820 825
830Leu Gln Gly Thr Leu Ala Ser Ile Gly Thr Ala Phe Glu Arg
Ser Ile 835 840 845Ser Glu Thr Arg
His Ile Phe Pro Cys Arg Ile Thr Ala Ala Phe His 850
855 860Thr Phe Phe Ser Val Arg Ile Leu Gln Tyr His His
Leu Gly Phe Asn865 870 875
880Lys Gly Phe Asp Leu Gly Gln Leu Thr Leu Gly Lys Pro Leu Asp Phe
885 890 895Gly Thr Ile Ser Leu
Ala Leu Ala Val Pro Gln Val Leu Gly Gly Leu 900
905 910Ser Phe Leu Asn Pro Glu Lys Cys Phe Tyr Arg Asn
Leu Gly Asp Pro 915 920 925Val Thr
Ser Gly Leu Phe Gln Leu Lys Thr Tyr Leu Arg Met Ile Glu 930
935 940Met Asp Asp Leu Phe Leu Pro Leu Ile Ala Lys
Asn Pro Gly Asn Cys945 950 955
960Thr Ala Ile Asp Phe Val Leu Asn Pro Ser Gly Leu Asn Val Pro Gly
965 970 975Ser Gln Asp Leu
Thr Ser Phe Leu Arg Gln Ile Val Arg Arg Thr Ile 980
985 990Thr Leu Ser Ala Lys Asn Lys Leu Ile Asn Thr
Leu Phe His Ala Ser 995 1000
1005Ala Asp Phe Glu Asp Glu Met Val Cys Lys Trp Leu Leu Ser Ser Thr
1010 1015 1020Pro Val Met Ser Arg Phe Ala
Ala Asp Ile Phe Ser Arg Thr Pro Ser1025 1030
1035 1040Gly Lys Arg Leu Gln Ile Leu Gly Tyr Leu Glu Gly
Thr Arg Thr Leu 1045 1050
1055Leu Ala Ser Lys Ile Ile Asn Asn Asn Thr Glu Thr Pro Val Leu Asp
1060 1065 1070Arg Leu Arg Lys Ile Thr
Leu Gln Arg Trp Ser Leu Trp Phe Ser Tyr 1075 1080
1085Leu Asp His Cys Asp Asn Ile Leu Ala Glu Ala Leu Thr Gln
Ile Thr 1090 1095 1100Cys Thr Val Asp
Leu Ala Gln Ile Leu Arg Glu Tyr Ser Trp Ala His1105 1110
1115 1120Ile Leu Glu Gly Arg Pro Leu Ile Gly
Ala Thr Leu Pro Cys Met Ile 1125 1130
1135Glu Gln Phe Lys Val Phe Trp Leu Lys Pro Tyr Glu Gln Cys Pro
Gln 1140 1145 1150Cys Ser Asn
Ala Lys Gln Pro Gly Gly Lys Pro Phe Val Ser Val Ala 1155
1160 1165Val Lys Lys His Ile Val Ser Ala Trp Pro Asn
Ala Ser Arg Ile Ser 1170 1175 1180Trp
Thr Ile Gly Asp Gly Ile Pro Tyr Ile Gly Ser Arg Thr Glu Asp1185
1190 1195 1200Lys Ile Gly Gln Pro Ala
Ile Lys Pro Lys Cys Pro Ser Ala Ala Leu 1205
1210 1215Arg Glu Ala Ile Glu Leu Ala Ser Arg Leu Thr Trp
Val Thr Gln Gly 1220 1225
1230Ser Ser Asn Ser Asp Leu Leu Ile Lys Pro Phe Leu Glu Ala Arg Val
1235 1240 1245Asn Leu Ser Val Gln Glu Ile
Leu Gln Met Thr Pro Ser His Tyr Ser 1250 1255
1260Gly Asn Ile Val His Arg Tyr Asn Asp Gln Tyr Ser Pro His Ser
Phe1265 1270 1275 1280Met
Ala Asn Arg Met Ser Asn Ser Ala Thr Arg Leu Ile Val Ser Thr
1285 1290 1295Asn Thr Leu Gly Glu Phe Ser
Gly Gly Gly Gln Ser Ala Arg Asp Ser 1300 1305
1310Asn Ile Ile Phe Gln Asn Val Ile Asn Tyr Ala Val Ala Leu
Phe Asp 1315 1320 1325Ile Lys Phe
Arg Asn Thr Glu Ala Thr Asp Ile Gln Tyr Asn Arg Ala 1330
1335 1340His Leu His Leu Thr Lys Cys Cys Thr Arg Glu Val
Pro Ala Gln Tyr1345 1350 1355
1360Leu Thr Tyr Thr Ser Thr Leu Asp Leu Asp Leu Thr Arg Tyr Arg Glu
1365 1370 1375Asn Glu Leu Ile Tyr
Asp Ser Asn Pro Leu Lys Gly Gly Leu Asn Cys 1380
1385 1390Asn Ile Ser Phe Asp Asn Pro Phe Phe Gln Gly Lys
Arg Leu Asn Ile 1395 1400 1405Ile
Glu Asp Asp Leu Ile Arg Leu Pro His Leu Ser Gly Trp Glu Leu 1410
1415 1420Ala Lys Thr Ile Met Gln Ser Ile Ile Ser
Asp Ser Asn Asn Ser Ser1425 1430 1435
1440Thr Asp Pro Ile Ser Ser Gly Glu Thr Arg Ser Phe Thr Thr His
Phe 1445 1450 1455Leu Thr
Tyr Pro Lys Ile Gly Leu Leu Tyr Ser Phe Gly Ala Phe Val 1460
1465 1470Ser Tyr Tyr Leu Gly Asn Thr Ile Leu
Arg Thr Lys Lys Leu Thr Leu 1475 1480
1485Asp Asn Phe Leu Tyr Tyr Leu Thr Thr Gln Ile His Asn Leu Pro His
1490 1495 1500Arg Ser Leu Arg Ile Leu Lys
Pro Thr Phe Lys His Ala Ser Val Met1505 1510
1515 1520Ser Arg Leu Met Ser Ile Asp Pro His Phe Ser Ile
Tyr Ile Gly Gly 1525 1530
1535Ala Ala Gly Asp Arg Gly Leu Ser Asp Ala Ala Arg Leu Phe Leu Arg
1540 1545 1550Thr Ser Ile Ser Ser Phe
Leu Thr Phe Val Lys Glu Trp Ile Ile Asn 1555 1560
1565Arg Gly Thr Ile Val Pro Leu Trp Ile Val Tyr Pro Leu Glu
Gly Gln 1570 1575 1580Asn Pro Thr Pro
Val Asn Asn Phe Leu Tyr Gln Ile Val Glu Leu Leu1585 1590
1595 1600Val His Asp Ser Ser Arg Gln Gln Ala
Phe Lys Thr Thr Ile Ser Asp 1605 1610
1615His Val His Pro His Asp Asn Leu Val Tyr Thr Cys Lys Ser Thr
Ala 1620 1625 1630Ser Asn Phe
Phe His Ala Ser Leu Ala Tyr Trp Arg Ser Arg His Arg 1635
1640 1645Asn Ser Asn Arg Lys Tyr Leu Ala Arg Asp Ser
Ser Thr Gly Ser Ser 1650 1655 1660Thr
Asn Asn Ser Asp Gly His Ile Glu Arg Ser Gln Glu Gln Thr Thr1665
1670 1675 1680Arg Asp Pro His Asp Gly
Thr Glu Arg Asn Leu Val Leu Gln Met Ser 1685
1690 1695His Glu Ile Lys Arg Thr Thr Ile Pro Gln Glu Asn
Thr His Gln Gly 1700 1705
1710Pro Ser Phe Gln Ser Phe Leu Ser Asp Ser Ala Cys Gly Thr Ala Asn
1715 1720 1725Pro Lys Leu Asn Phe Asp Arg
Ser Arg His Asn Val Lys Phe Gln Asp 1730 1735
1740His Asn Ser Ala Ser Lys Arg Glu Gly His Gln Ile Ile Ser His
Arg1745 1750 1755 1760Leu
Val Leu Pro Phe Phe Thr Leu Ser Gln Gly Thr Arg Gln Leu Thr
1765 1770 1775Ser Ser Asn Glu Ser Gln Thr
Gln Asp Glu Ile Ser Lys Tyr Leu Arg 1780 1785
1790Gln Leu Arg Ser Val Ile Asp Thr Thr Val Tyr Cys Arg Phe
Thr Gly 1795 1800 1805Ile Val Ser
Ser Met His Tyr Lys Leu Asp Glu Val Leu Trp Glu Ile 1810
1815 1820Glu Ser Phe Lys Ser Ala Val Thr Leu Ala Glu Gly
Glu Gly Ala Gly1825 1830 1835
1840Ala Leu Leu Leu Ile Gln Lys Tyr Gln Val Lys Thr Leu Phe Phe Asn
1845 1850 1855Thr Leu Ala Thr Glu
Ser Ser Ile Glu Ser Glu Ile Val Ser Gly Met 1860
1865 1870Thr Thr Pro Arg Met Leu Leu Pro Val Met Ser Lys
Phe His Asn Asp 1875 1880 1885Gln
Ile Glu Ile Ile Leu Asn Asn Ser Ala Ser Gln Ile Thr Asp Ile 1890
1895 1900Thr Asn Pro Thr Trp Phe Lys Asp Gln Arg
Ala Arg Leu Pro Lys Gln1905 1910 1915
1920Val Glu Val Ile Thr Met Asp Ala Glu Thr Thr Glu Asn Ile Asn
Arg 1925 1930 1935Ser Lys
Leu Tyr Glu Ala Val Tyr Lys Leu Ile Leu His His Ile Asp 1940
1945 1950Pro Ser Val Leu Lys Ala Val Val Leu
Lys Val Phe Leu Ser Asp Thr 1955 1960
1965Glu Gly Met Leu Trp Leu Asn Asp Asn Leu Ala Pro Phe Phe Ala Thr
1970 1975 1980Gly Tyr Leu Ile Lys Pro Ile
Thr Ser Ser Ala Arg Ser Ser Glu Trp1985 1990
1995 2000Tyr Leu Cys Leu Thr Asn Phe Leu Ser Thr Thr Arg
Lys Met Pro His 2005 2010
2015Gln Asn His Leu Ser Cys Lys Gln Val Ile Leu Thr Ala Leu Gln Leu
2020 2025 2030Gln Ile Gln Arg Ser Pro
Tyr Trp Leu Ser His Leu Thr Gln Tyr Ala 2035 2040
2045Asp Cys Glu Leu His Leu Ser Tyr Ile Arg Leu Gly Phe Pro
Ser Leu 2050 2055 2060Glu Lys Val Leu
Tyr His Arg Tyr Asn Leu Val Asp Ser Lys Arg Gly2065 2070
2075 2080Pro Leu Val Ser Ile Thr Gln His Leu
Ala His Leu Arg Ala Glu Ile 2085 2090
2095Arg Glu Leu Thr Asn Asp Tyr Asn Gln Gln Arg Gln Ser Arg Thr
Gln 2100 2105 2110Thr Tyr His
Phe Ile Arg Thr Ala Lys Gly Arg Ile Thr Lys Leu Val 2115
2120 2125Asn Asp Tyr Leu Lys Phe Phe Leu Ile Val Gln
Ala Leu Lys His Asn 2130 2135 2140Gly
Thr Trp Gln Ala Glu Phe Lys Lys Leu Pro Glu Leu Ile Ser Val2145
2150 2155 2160Cys Asn Arg Phe Tyr His
Ile Arg Asp Cys Asn Cys Glu Glu Arg Phe 2165
2170 2175Leu Val Gln Thr Leu Tyr Leu His Arg Met Gln Asp
Ser Glu Val Lys 2180 2185
2190Leu Ile Glu Arg Leu Thr Gly Leu Leu Ser Leu Phe Pro Asp Gly Leu
2195 2200 2205Tyr Arg Phe Asp
22103718959DNAZaire Ebola virus strain Mayinga 37cggacacaca aaaagaaaga
agaattttta ggatcttttg tgtgcgaata actatgagga 60agattaataa ttttcctctc
attgaaattt atatcggaat ttaaattgaa attgttactg 120taatcacacc tggtttgttt
cagagccaca tcacaaagat agagaacaac ctaggtctcc 180gaagggagca agggcatcag
tgtgctcagt tgaaaatccc ttgtcaacac ctaggtctta 240tcacatcaca agttccacct
cagactctgc agggtgatcc aacaacctta atagaaacat 300tattgttaaa ggacagcatt
agttcacagt caaacaagca agattgagaa ttaaccttgg 360ttttgaactt gaacacttag
gggattgaag attcaacaac cctaaagctt ggggtaaaac 420attggaaata gttaaaagac
aaattgctcg gaatcacaaa attccgagta tggattctcg 480tcctcagaaa atctggatgg
cgccgagtct cactgaatct gacatggatt accacaagat 540cttgacagca ggtctgtccg
ttcaacaggg gattgttcgg caaagagtca tcccagtgta 600tcaagtaaac aatcttgaag
aaatttgcca acttatcata caggcctttg aagcaggtgt 660tgattttcaa gagagtgcgg
acagtttcct tctcatgctt tgtcttcatc atgcgtacca 720gggagattac aaacttttct
tggaaagtgg cgcagtcaag tatttggaag ggcacgggtt 780ccgttttgaa gtcaagaagc
gtgatggagt gaagcgcctt gaggaattgc tgccagcagt 840atctagtgga aaaaacatta
agagaacact tgctgccatg ccggaagagg agacaactga 900agctaatgcc ggtcagtttc
tctcctttgc aagtctattc cttccgaaat tggtagtagg 960agaaaaggct tgccttgaga
aggttcaaag gcaaattcaa gtacatgcag agcaaggact 1020gatacaatat ccaacagctt
ggcaatcagt aggacacatg atggtgattt tccgtttgat 1080gcgaacaaat tttctgatca
aatttctcct aatacaccaa gggatgcaca tggttgccgg 1140gcatgatgcc aacgatgctg
tgatttcaaa ttcagtggct caagctcgtt tttcaggctt 1200attgattgtc aaaacagtac
ttgatcatat cctacaaaag acagaacgag gagttcgtct 1260ccatcctctt gcaaggaccg
ccaaggtaaa aaatgaggtg aactccttta aggctgcact 1320cagctccctg gccaagcatg
gagagtatgc tcctttcgcc cgacttttga acctttctgg 1380agtaaataat cttgagcatg
gtcttttccc tcaactatcg gcaattgcac tcggagtcgc 1440cacagcacac gggagtaccc
tcgcaggagt aaatgttgga gaacagtatc aacaactcag 1500agaggctgcc actgaggctg
agaagcaact ccaacaatat gcagagtctc gcgaacttga 1560ccatcttgga cttgatgatc
aggaaaagaa aattcttatg aacttccatc agaaaaagaa 1620cgaaatcagc ttccagcaaa
caaacgctat ggtaactcta agaaaagagc gcctggccaa 1680gctgacagaa gctatcactg
ctgcgtcact gcccaaaaca agtggacatt acgatgatga 1740tgacgacatt ccctttccag
gacccatcaa tgatgacgac aatcctggcc atcaagatga 1800tgatccgact gactcacagg
atacgaccat tcccgatgtg gtggttgatc ccgatgatgg 1860aagctacggc gaataccaga
gttactcgga aaacggcatg aatgcaccag atgacttggt 1920cctattcgat ctagacgagg
acgacgagga cactaagcca gtgcctaata gatcgaccaa 1980gggtggacaa cagaagaaca
gtcaaaaggg ccagcatata gagggcagac agacacaatc 2040caggccaatt caaaatgtcc
caggccctca cagaacaatc caccacgcca gtgcgccact 2100cacggacaat gacagaagaa
atgaaccctc cggctcaacc agccctcgca tgctgacacc 2160aattaacgaa gaggcagacc
cactggacga tgccgacgac gagacgtcta gccttccgcc 2220cttggagtca gatgatgaag
agcaggacag ggacggaact tccaaccgca cacccactgt 2280cgccccaccg gctcccgtat
acagagatca ctctgaaaag aaagaactcc cgcaagacga 2340gcaacaagat caggaccaca
ctcaagaggc caggaaccag gacagtgaca acacccagtc 2400agaacactct tttgaggaga
tgtatcgcca cattctaaga tcacaggggc catttgatgc 2460tgttttgtat tatcatatga
tgaaggatga gcctgtagtt ttcagtacca gtgatggcaa 2520agagtacacg tatccagact
cccttgaaga ggaatatcca ccatggctca ctgaaaaaga 2580ggctatgaat gaagagaata
gatttgttac attggatggt caacaatttt attggccggt 2640gatgaatcac aagaataaat
tcatggcaat cctgcaacat catcagtgaa tgagcatgga 2700acaatgggat gattcaaccg
acaaatagct aacattaagt agtcaaggaa cgaaaacagg 2760aagaattttt gatgtctaag
gtgtgaatta ttatcacaat aaaagtgatt cttatttttg 2820aatttaaagc tagcttatta
ttactagccg tttttcaaag ttcaatttga gtcttaatgc 2880aaataggcgt taagccacag
ttatagccat aattgtaact caatattcta actagcgatt 2940tatctaaatt aaattacatt
atgcttttat aacttaccta ctagcctgcc caacatttac 3000acgatcgttt tataattaag
aaaaaactaa tgatgaagat taaaaccttc atcatcctta 3060cgtcaattga attctctagc
actcgaagct tattgtcttc aatgtaaaag aaaagctggt 3120ctaacaagat gacaactaga
acaaagggca ggggccatac tgcggccacg actcaaaacg 3180acagaatgcc aggccctgag
ctttcgggct ggatctctga gcagctaatg accggaagaa 3240ttcctgtaag cgacatcttc
tgtgatattg agaacaatcc aggattatgc tacgcatccc 3300aaatgcaaca aacgaagcca
aacccgaaga cgcgcaacag tcaaacccaa acggacccaa 3360tttgcaatca tagttttgag
gaggtagtac aaacattggc ttcattggct actgttgtgc 3420aacaacaaac catcgcatca
gaatcattag aacaacgcat tacgagtctt gagaatggtc 3480taaagccagt ttatgatatg
gcaaaaacaa tctcctcatt gaacagggtt tgtgctgaga 3540tggttgcaaa atatgatctt
ctggtgatga caaccggtcg ggcaacagca accgctgcgg 3600caactgaggc ttattgggcc
gaacatggtc aaccaccacc tggaccatca ctttatgaag 3660aaagtgcgat tcggggtaag
attgaatcta gagatgagac cgtccctcaa agtgttaggg 3720aggcattcaa caatctaaac
agtaccactt cactaactga ggaaaatttt gggaaacctg 3780acatttcggc aaaggatttg
agaaacatta tgtatgatca cttgcctggt tttggaactg 3840ctttccacca attagtacaa
gtgatttgta aattgggaaa agatagcaac tcattggaca 3900tcattcatgc tgagttccag
gccagcctgg ctgaaggaga ctctcctcaa tgtgccctaa 3960ttcaaattac aaaaagagtt
ccaatcttcc aagatgctgc tccacctgtc atccacatcc 4020gctctcgagg tgacattccc
cgagcttgcc agaaaagctt gcgtccagtc ccaccatcgc 4080ccaagattga tcgaggttgg
gtatgtgttt ttcagcttca agatggtaaa acacttggac 4140tcaaaatttg agccaatctc
ccttccctcc gaaagaggcg aataatagca gaggcttcaa 4200ctgctgaact atagggtacg
ttacattaat gatacacttg tgagtatcag ccctggataa 4260tataagtcaa ttaaacgacc
aagataaaat tgttcatatc tcgctagcag cttaaaatat 4320aaatgtaata ggagctatat
ctctgacagt attataatca attgttatta agtaacccaa 4380accaaaagtg atgaagatta
agaaaaacct acctcggctg agagagtgtt ttttcattaa 4440ccttcatctt gtaaacgttg
agcaaaattg ttaaaaatat gaggcgggtt atattgccta 4500ctgctcctcc tgaatatatg
gaggccatat accctgtcag gtcaaattca acaattgcta 4560gaggtggcaa cagcaataca
ggcttcctga caccggagtc agtcaatggg gacactccat 4620cgaatccact caggccaatt
gccgatgaca ccatcgacca tgccagccac acaccaggca 4680gtgtgtcatc agcattcatc
cttgaagcta tggtgaatgt catatcgggc cccaaagtgc 4740taatgaagca aattccaatt
tggcttcctc taggtgtcgc tgatcaaaag acctacagct 4800ttgactcaac tacggccgcc
atcatgcttg cttcatacac tatcacccat ttcggcaagg 4860caaccaatcc acttgtcaga
gtcaatcggc tgggtcctgg aatcccggat catcccctca 4920ggctcctgcg aattggaaac
caggctttcc tccaggagtt cgttcttccg ccagtccaac 4980taccccagta tttcaccttt
gatttgacag cactcaaact gatcacccaa ccactgcctg 5040ctgcaacatg gaccgatgac
actccaacag gatcaaatgg agcgttgcgt ccaggaattt 5100catttcatcc aaaacttcgc
cccattcttt tacccaacaa aagtgggaag aaggggaaca 5160gtgccgatct aacatctccg
gagaaaatcc aagcaataat gacttcactc caggacttta 5220agatcgttcc aattgatcca
accaaaaata tcatgggaat cgaagtgcca gaaactctgg 5280tccacaagct gaccggtaag
aaggtgactt ctaaaaatgg acaaccaatc atccctgttc 5340ttttgccaaa gtacattggg
ttggacccgg tggctccagg agacctcacc atggtaatca 5400cacaggattg tgacacgtgt
cattctcctg caagtcttcc agctgtgatt gagaagtaat 5460tgcaataatt gactcagatc
cagttttata gaatcttctc agggatagtg ataacatcta 5520tttagtaatc cgtccattag
aggagacact tttaattgat caatatacta aaggtgcttt 5580acaccattgt cttttttctc
tcctaaatgt agaacttaac aaaagactca taatatactt 5640gtttttaaag gattgattga
tgaaagatca taactaataa cattacaaat aatcctacta 5700taatcaatac ggtgattcaa
atgttaatct ttctcattgc acatactttt tgcccttatc 5760ctcaaattgc ctgcatgctt
acatctgagg atagccagtg tgacttggat tggaaatgtg 5820gagaaaaaat cgggacccat
ttctaggttg ttcacaatcc aagtacagac attgcccttc 5880taattaagaa aaaatcggcg
atgaagatta agccgacagt gagcgtaatc ttcatctctc 5940ttagattatt tgttttccag
agtaggggtc gtcaggtcct tttcaatcgt gtaaccaaaa 6000taaactccac tagaaggata
ttgtggggca acaacacaat gggcgttaca ggaatattgc 6060agttacctcg tgatcgattc
aagaggacat cattctttct ttgggtaatt atccttttcc 6120aaagaacatt ttccatccca
cttggagtca tccacaatag cacattacag gttagtgatg 6180tcgacaaact agtttgtcgt
gacaaactgt catccacaaa tcaattgaga tcagttggac 6240tgaatctcga agggaatgga
gtggcaactg acgtgccatc tgcaactaaa agatggggct 6300tcaggtccgg tgtcccacca
aaggtggtca attatgaagc tggtgaatgg gctgaaaact 6360gctacaatct tgaaatcaaa
aaacctgacg ggagtgagtg tctaccagca gcgccagacg 6420ggattcgggg cttcccccgg
tgccggtatg tgcacaaagt atcaggaacg ggaccgtgtg 6480ccggagactt tgccttccat
aaagagggtg ctttcttcct gtatgatcga cttgcttcca 6540cagttatcta ccgaggaacg
actttcgctg aaggtgtcgt tgcatttctg atactgcccc 6600aagctaagaa ggacttcttc
agctcacacc ccttgagaga gccggtcaat gcaacggagg 6660acccgtctag tggctactat
tctaccacaa ttagatatca ggctaccggt tttggaacca 6720atgagacaga gtacttgttc
gaggttgaca atttgaccta cgtccaactt gaatcaagat 6780tcacaccaca gtttctgctc
cagctgaatg agacaatata tacaagtggg aaaaggagca 6840ataccacggg aaaactaatt
tggaaggtca accccgaaat tgatacaaca atcggggagt 6900gggccttctg ggaaactaaa
aaaacctcac tagaaaaatt cgcagtgaag agttgtcttt 6960cacagttgta tcaaacggag
ccaaaaacat cagtggtcag agtccggcgc gaacttcttc 7020cgacccaggg accaacacaa
caactgaaga ccacaaaatc atggcttcag aaaattcctc 7080tgcaatggtt caagtgcaca
gtcaaggaag ggaagctgca gtgtcgcatc taacaaccct 7140tgccacaatc tccacgagtc
cccaatccct cacaaccaaa ccaggtccgg acaacagcac 7200ccataataca cccgtgtata
aacttgacat ctctgaggca actcaagttg aacaacatca 7260ccgcagaaca gacaacgaca
gcacagcctc cgacactccc tctgccacga ccgcagccgg 7320acccccaaaa gcagagaaca
ccaacacgag caagagcact gacttcctgg accccgccac 7380cacaacaagt ccccaaaacc
acagcgagac cgctggcaac aacaacactc atcaccaaga 7440taccggagaa gagagtgcca
gcagcgggaa gctaggctta attaccaata ctattgctgg 7500agtcgcagga ctgatcacag
gcgggagaag aactcgaaga gaagcaattg tcaatgctca 7560acccaaatgc aaccctaatt
tacattactg gactactcag gatgaaggtg ctgcaatcgg 7620actggcctgg ataccatatt
tcgggccagc agccgaggga atttacatag aggggctaat 7680gcacaatcaa gatggtttaa
tctgtgggtt gagacagctg gccaacgaga cgactcaagc 7740tcttcaactg ttcctgagag
ccacaactga gctacgcacc ttttcaatcc tcaaccgtaa 7800ggcaattgat ttcttgctgc
agcgatgggg cggcacatgc cacattctgg gaccggactg 7860ctgtatcgaa ccacatgatt
ggaccaagaa cataacagac aaaattgatc agattattca 7920tgattttgtt gataaaaccc
ttccggacca gggggacaat gacaattggt ggacaggatg 7980gagacaatgg ataccggcag
gtattggagt tacaggcgtt ataattgcag ttatcgcttt 8040attctgtata tgcaaatttg
tcttttagtt tttcttcaga ttgcttcatg gaaaagctca 8100gcctcaaatc aatgaaacca
ggatttaatt atatggatta cttgaatcta agattacttg 8160acaaatgata atataataca
ctggagcttt aaacatagcc aatgtgattc taactccttt 8220aaactcacag ttaatcataa
acaaggtttg acatcaatct agttatctct ttgagaatga 8280taaacttgat gaagattaag
aaaaaggtaa tctttcgatt atctttaatc ttcatccttg 8340attctacaat catgacagtt
gtctttagtg acaagggaaa gaagcctttt tattaagttg 8400taataatcag atctgcgaac
cggtagagtt tagttgcaac ctaacacaca taaagcattg 8460gtcaaaaagt caatagaaat
ttaaacagtg agtggagaca acttttaaat ggaagcttca 8520tatgagagag gacgcccacg
agctgccaga cagcattcaa gggatggaca cgaccaccat 8580gttcgagcac gatcatcatc
cagagagaat tatcgaggtg agtaccgtca atcaaggagc 8640gcctcacaag tgcgcgttcc
tactgtattt cataagaaga gagttgaacc attaacagtt 8700cctccagcac ctaaagacat
atgtccgacc ttgaaaaaag gatttttgtg tgacagtagt 8760ttttgcaaaa aagatcacca
gttggagagt ttaactgata gggaattact cctactaatc 8820gcccgtaaga cttgtggatc
agtagaacaa caattaaata taactgcacc caaggactcg 8880cgcttagcaa atccaacggc
tgatgatttc cagcaagagg aaggtccaaa aattaccttg 8940ttgacactga tcaagacggc
agaacactgg gcgagacaag acatcagaac catagaggat 9000tcaaaattaa gagcattgtt
gactctatgt gctgtgatga cgaggaaatt ctcaaaatcc 9060cagctgagtc ttttatgtga
gacacaccta aggcgcgagg ggcttgggca agatcaggca 9120gaacccgttc tcgaagtata
tcaacgatta cacagtgata aaggaggcag ttttgaagct 9180gcactatggc aacaatggga
ccgacaatcc ctaattatgt ttatcactgc attcttgaat 9240attgctctcc agttaccgtg
tgaaagttct gctgtcgttg tttcagggtt aagaacattg 9300gttcctcaat cagataatga
ggaagcttca accaacccgg ggacatgctc atggtctgat 9360gagggtaccc cttaataagg
ctgactaaaa cactatataa ccttctactt gatcacaata 9420ctccgtatac ctatcatcat
atatttaatc aagacgatat cctttaaaac ttattcagta 9480ctataatcac tctcgtttca
aattaataag atgtgcatga ttgccctaat atatgaagag 9540gtatgataca accctaacag
tgatcaaaga aaatcataat ctcgtatcgc tcgtaatata 9600acctgccaag catacctctt
gcacaaagtg attcttgtac acaaataatg ttttactcta 9660caggaggtag caacgatcca
tcccatcaaa aaataagtat ttcatgactt actaatgatc 9720tcttaaaata ttaagaaaaa
ctgacggaac ataaattctt tatgcttcaa gctgtggagg 9780aggtgtttgg tattggctat
tgttatatta caatcaataa caagcttgta aaaatattgt 9840tcttgtttca agaggtagat
tgtgaccgga aatgctaaac taatgatgaa gattaatgcg 9900gaggtctgat aagaataaac
cttattattc agattaggcc ccaagaggca ttcttcatct 9960ccttttagca aagtactatt
tcagggtagt ccaattagtg gcacgtcttt tagctgtata 10020tcagtcgccc ctgagatacg
ccacaaaagt gtctctaagc taaattggtc tgtacacatc 10080ccatacattg tattaggggc
aataatatct aattgaactt agccgtttaa aatttagtgc 10140ataaatctgg gctaacacca
ccaggtcaac tccattggct gaaaagaagc ttacctacaa 10200cgaacatcac tttgagcgcc
ctcacaatta aaaaatagga acgtcgttcc aacaatcgag 10260cgcaaggttt caaggttgaa
ctgagagtgt ctagacaaca aaatattgat actccagaca 10320ccaagcaaga cctgagaaaa
aaccatggct aaagctacgg gacgatacaa tctaatatcg 10380cccaaaaagg acctggagaa
aggggttgtc ttaagcgacc tctgtaactt cttagttagc 10440caaactattc aggggtggaa
ggtttattgg gctggtattg agtttgatgt gactcacaaa 10500ggaatggccc tattgcatag
actgaaaact aatgactttg cccctgcatg gtcaatgaca 10560aggaatctct ttcctcattt
atttcaaaat ccgaattcca caattgaatc accgctgtgg 10620gcattgagag tcatccttgc
agcagggata caggaccagc tgattgacca gtctttgatt 10680gaacccttag caggagccct
tggtctgatc tctgattggc tgctaacaac caacactaac 10740catttcaaca tgcgaacaca
acgtgtcaag gaacaattga gcctaaaaat gctgtcgttg 10800attcgatcca atattctcaa
gtttattaac aaattggatg ctctacatgt cgtgaactac 10860aacggattgt tgagcagtat
tgaaattgga actcaaaatc atacaatcat cataactcga 10920actaacatgg gttttctggt
ggagctccaa gaacccgaca aatcggcaat gaaccgcatg 10980aagcctgggc cggcgaaatt
ttccctcctt catgagtcca cactgaaagc atttacacaa 11040ggatcctcga cacgaatgca
aagtttgatt cttgaattta atagctctct tgctatctaa 11100ctaaggtaga atacttcata
ttgagctaac tcatatatgc tgactcaata gttatcttga 11160catctctgct ttcataatca
gatatataag cataataaat aaatactcat atttcttgat 11220aatttgttta accacagata
aatcctcact gtaagccagc ttccaagttg acacccttac 11280aaaaaccagg actcagaatc
cctcaaacaa gagattccaa gacaacatca tagaattgct 11340ttattatatg aataagcatt
ttatcaccag aaatcctata tactaaatgg ttaattgtaa 11400ctgaacccgc aggtcacatg
tgttaggttt cacagattct atatattact aactctatac 11460tcgtaattaa cattagataa
gtagattaag aaaaaagcct gaggaagatt aagaaaaact 11520gcttattggg tctttccgtg
ttttagatga agcagttgaa attcttcctc ttgatattaa 11580atggctacac aacataccca
atacccagac gctaggttat catcaccaat tgtattggac 11640caatgtgacc tagtcactag
agcttgcggg ttatattcat catactccct taatccgcaa 11700ctacgcaact gtaaactccc
gaaacatatc taccgtttga aatacgatgt aactgttacc 11760aagttcttga gtgatgtacc
agtggcgaca ttgcccatag atttcatagt cccagttctt 11820ctcaaggcac tgtcaggcaa
tggattctgt cctgttgagc cgcggtgcca acagttctta 11880gatgaaatca ttaagtacac
aatgcaagat gctctcttct tgaaatatta tctcaaaaat 11940gtgggtgctc aagaagactg
tgttgatgaa cactttcaag agaaaatctt atcttcaatt 12000cagggcaatg aatttttaca
tcaaatgttt ttctggtatg atctggctat tttaactcga 12060aggggtagat taaatcgagg
aaactctaga tcaacatggt ttgttcatga tgatttaata 12120gacatcttag gctatgggga
ctatgttttt tggaagatcc caatttcaat gttaccactg 12180aacacacaag gaatccccca
tgctgctatg gactggtatc aggcatcagt attcaaagaa 12240gcggttcaag ggcatacaca
cattgtttct gtttctactg ccgacgtctt gataatgtgc 12300aaagatttaa ttacatgtcg
attcaacaca actctaatct caaaaatagc agagattgag 12360gatccagttt gttctgatta
tcccaatttt aagattgtgt ctatgcttta ccagagcgga 12420gattacttac tctccatatt
agggtctgat gggtataaaa ttattaagtt cctcgaacca 12480ttgtgcttgg ccaaaattca
attatgctca aagtacactg agaggaaggg ccgattctta 12540acacaaatgc atttagctgt
aaatcacacc ctagaagaaa ttacagaaat gcgtgcacta 12600aagccttcac aggctcaaaa
gatccgtgaa ttccatagaa cattgataag gctggagatg 12660acgccacaac aactttgtga
gctattttcc attcaaaaac actgggggca tcctgtgcta 12720catagtgaaa cagcaatcca
aaaagttaaa aaacatgcta cggtgctaaa agcattacgc 12780cctatagtga ttttcgagac
atactgtgtt tttaaatata gtattgccaa acattatttt 12840gatagtcaag gatcttggta
cagtgttact tcagatagga atctaacacc gggtcttaat 12900tcttatatca aaagaaatca
attccctccg ttgccaatga ttaaagaact actatgggaa 12960ttttaccacc ttgaccaccc
tccacttttc tcaaccaaaa ttattagtga cttaagtatt 13020tttataaaag acagagctac
cgcagtagaa aggacatgct gggatgcagt attcgagcct 13080aatgttctag gatataatcc
acctcacaaa tttagtacta aacgtgtacc ggaacaattt 13140ttagagcaag aaaacttttc
tattgagaat gttctttcct acgcacaaaa actcgagtat 13200ctactaccac aatatcggaa
cttttctttc tcattgaaag agaaagagtt gaatgtaggt 13260agaaccttcg gaaaattgcc
ttatccgact cgcaatgttc aaacactttg tgaagctctg 13320ttagctgatg gtcttgctaa
agcatttcct agcaatatga tggtagttac ggaacgtgag 13380caaaaagaaa gcttattgca
tcaagcatca tggcaccaca caagtgatga ttttggtgaa 13440catgccacag ttagagggag
tagctttgta actgatttag agaaatacaa tcttgcattt 13500agatatgagt ttacagcacc
ttttatagaa tattgcaacc gttgctatgg tgttaagaat 13560gtttttaatt ggatgcatta
tacaatccca cagtgttata tgcatgtcag tgattattat 13620aatccaccac ataacctcac
actggagaat cgagacaacc cccccgaagg gcctagttca 13680tacaggggtc atatgggagg
gattgaagga ctgcaacaaa aactctggac aagtatttca 13740tgtgctcaaa tttctttagt
tgaaattaag actggtttta agttacgctc agctgtgatg 13800ggtgacaatc agtgcattac
tgttttatca gtcttcccct tagagactga cgcagacgag 13860caggaacaga gcgccgaaga
caatgcagcg agggtggccg ccagcctagc aaaagttaca 13920agtgcctgtg gaatcttttt
aaaacctgat gaaacatttg tacattcagg ttttatctat 13980tttggaaaaa aacaatattt
gaatggggtc caattgcctc agtcccttaa aacggctaca 14040agaatggcac cattgtctga
tgcaattttt gatgatcttc aagggaccct ggctagtata 14100ggcactgctt ttgagcgatc
catctctgag acacgacata tctttccttg caggataacc 14160gcagctttcc atacgttttt
ttcggtgaga atcttgcaat atcatcatct cgggttcaat 14220aaaggttttg accttggaca
gttaacactc ggcaaacctc tggatttcgg aacaatatca 14280ttggcactag cggtaccgca
ggtgcttgga gggttatcct tcttgaatcc tgagaaatgt 14340ttctaccgga atctaggaga
tccagttacc tcaggcttat tccagttaaa aacttatctc 14400cgaatgattg agatggatga
tttattctta cctttaattg cgaagaaccc tgggaactgc 14460actgccattg actttgtgct
aaatcctagc ggattaaatg tccctgggtc gcaagactta 14520acttcatttc tgcgccagat
tgtacgcagg accatcaccc taagtgcgaa aaacaaactt 14580attaatacct tatttcatgc
gtcagctgac ttcgaagacg aaatggtttg taaatggcta 14640ttatcatcaa ctcctgttat
gagtcgtttt gcggccgata tcttttcacg cacgccgagc 14700gggaagcgat tgcaaattct
aggatacctg gaaggaacac gcacattatt agcctctaag 14760atcatcaaca ataatacaga
gacaccggtt ttggacagac tgaggaaaat aacattgcaa 14820aggtggagcc tatggtttag
ttatcttgat cattgtgata atatcctggc ggaggcttta 14880acccaaataa cttgcacagt
tgatttagca cagattctga gggaatattc atgggctcat 14940attttagagg gaagacctct
tattggagcc acactcccat gtatgattga gcaattcaaa 15000gtgttttggc tgaaacccta
cgaacaatgt ccgcagtgtt caaatgcaaa gcaaccaggt 15060gggaaaccat tcgtgtcagt
ggcagtcaag aaacatattg ttagtgcatg gccgaacgca 15120tcccgaataa gctggactat
cggggatgga atcccataca ttggatcaag gacagaagat 15180aagataggac aacctgctat
taaaccaaaa tgtccttccg cagccttaag agaggccatt 15240gaattggcgt cccgtttaac
atgggtaact caaggcagtt cgaacagtga cttgctaata 15300aaaccatttt tggaagcacg
agtaaattta agtgttcaag aaatacttca aatgacccct 15360tcacattact caggaaatat
tgttcacagg tacaacgatc aatacagtcc tcattctttc 15420atggccaatc gtatgagtaa
ttcagcaacg cgattgattg tttctacaaa cactttaggt 15480gagttttcag gaggtggcca
gtctgcacgc gacagcaata ttattttcca gaatgttata 15540aattatgcag ttgcactgtt
cgatattaaa tttagaaaca ctgaggctac agatatccaa 15600tataatcgtg ctcaccttca
tctaactaag tgttgcaccc gggaagtacc agctcagtat 15660ttaacataca catctacatt
ggatttagat ttaacaagat accgagaaaa cgaattgatt 15720tatgacagta atcctctaaa
aggaggactc aattgcaata tctcattcga taatccattt 15780ttccaaggta aacggctgaa
cattatagaa gatgatctta ttcgactgcc tcacttatct 15840ggatgggagc tagccaagac
catcatgcaa tcaattattt cagatagcaa caattcatct 15900acagacccaa ttagcagtgg
agaaacaaga tcattcacta cccatttctt aacttatccc 15960aagataggac ttctgtacag
ttttggggcc tttgtaagtt attatcttgg caatacaatt 16020cttcggacta agaaattaac
acttgacaat tttttatatt acttaactac tcaaattcat 16080aatctaccac atcgctcatt
gcgaatactt aagccaacat tcaaacatgc aagcgttatg 16140tcacggttaa tgagtattga
tcctcatttt tctatttaca taggcggtgc tgcaggtgac 16200agaggactct cagatgcggc
caggttattt ttgagaacgt ccatttcatc ttttcttaca 16260tttgtaaaag aatggataat
taatcgcgga acaattgtcc ctttatggat agtatatccg 16320ctagagggtc aaaacccaac
acctgtgaat aattttctct atcagatcgt agaactgctg 16380gtgcatgatt catcaagaca
acaggctttt aaaactacca taagtgatca tgtacatcct 16440cacgacaatc ttgtttacac
atgtaagagt acagccagca atttcttcca tgcatcattg 16500gcgtactgga ggagcagaca
cagaaacagc aaccgaaaat acttggcaag agactcttca 16560actggatcaa gcacaaacaa
cagtgatggt catattgaga gaagtcaaga acaaaccacc 16620agagatccac atgatggcac
tgaacggaat ctagtcctac aaatgagcca tgaaataaaa 16680agaacgacaa ttccacaaga
aaacacgcac cagggtccgt cgttccagtc ctttctaagt 16740gactctgctt gtggtacagc
aaatccaaaa ctaaatttcg atcgatcgag acacaatgtg 16800aaatttcagg atcataactc
ggcatccaag agggaaggtc atcaaataat ctcacaccgt 16860ctagtcctac ctttctttac
attatctcaa gggacacgcc aattaacgtc atccaatgag 16920tcacaaaccc aagacgagat
atcaaagtac ttacggcaat tgagatccgt cattgatacc 16980acagtttatt gtagatttac
cggtatagtc tcgtccatgc attacaaact tgatgaggtc 17040ctttgggaaa tagagagttt
caagtcggct gtgacgctag cagagggaga aggtgctggt 17100gccttactat tgattcagaa
ataccaagtt aagaccttat ttttcaacac gctagctact 17160gagtccagta tagagtcaga
aatagtatca ggaatgacta ctcctaggat gcttctacct 17220gttatgtcaa aattccataa
tgaccaaatt gagattattc ttaacaactc agcaagccaa 17280ataacagaca taacaaatcc
tacttggttt aaagaccaaa gagcaaggct acctaagcaa 17340gtcgaggtta taaccatgga
tgcagagaca acagagaata taaacagatc gaaattgtac 17400gaagctgtat ataaattgat
cttacaccat attgatccta gcgtattgaa agcagtggtc 17460cttaaagtct ttctaagtga
tactgagggt atgttatggc taaatgataa tttagccccg 17520ttttttgcca ctggttattt
aattaagcca ataacgtcaa gtgctagatc tagtgagtgg 17580tatctttgtc tgacgaactt
cttatcaact acacgtaaga tgccacacca aaaccatctc 17640agttgtaaac aggtaatact
tacggcattg caactgcaaa ttcaacgaag cccatactgg 17700ctaagtcatt taactcagta
tgctgactgt gagttacatt taagttatat ccgccttggt 17760tttccatcat tagagaaagt
actataccac aggtataacc tcgtcgattc aaaaagaggt 17820ccactagtct ctatcactca
gcacttagca catcttagag cagagattcg agaattaact 17880aatgattata atcaacagcg
acaaagtcgg actcaaacat atcactttat tcgtactgca 17940aaaggacgaa tcacaaaact
agtcaatgat tatttaaaat tctttcttat tgtgcaagca 18000ttaaaacata atgggacatg
gcaagctgag tttaagaaat taccagagtt gattagtgtg 18060tgcaataggt tctaccatat
tagagattgc aattgtgaag aacgtttctt agttcaaacc 18120ttatatttac atagaatgca
ggattctgaa gttaagctta tcgaaaggct gacagggctt 18180ctgagtttat ttccggatgg
tctctacagg tttgattgaa ttaccgtgca tagtatcctg 18240atacttgcaa aggttggtta
ttaacataca gattataaaa aactcataaa ttgctctcat 18300acatcatatt gatctaatct
caataaacaa ctatttaaat aacgaaagga gtccctatat 18360tatatactat atttagcctc
tctccctgcg tgataatcaa aaaattcaca atgcagcatg 18420tgtgacatat tactgccgca
atgaatttaa cgcaacataa taaactctgc actctttata 18480attaagcttt aacgaaaggt
ctgggctcat attgttattg atataataat gttgtatcaa 18540tatcctgtca gatggaatag
tgttttggtt gataacacaa cttcttaaaa caaaattgat 18600ctttaagatt aagtttttta
taattatcat tactttaatt tgtcgtttta aaaacggtga 18660tagccttaat ctttgtgtaa
aataagagat taggtgtaat aaccttaaca tttttgtcta 18720gtaagctact atttcataca
gaatgataaa attaaaagaa aaggcaggac tgtaaaatca 18780gaaatacctt ctttacaata
tagcagacta gataataatc ttcgtgttaa tgataattaa 18840gacattgacc acgctcatca
gaaggctcgc cagaataaac gttgcaaaaa ggattcctgg 18900aaaaatggtc gcacacaaaa
atttaaaaat aaatctattt cttctttttt gtgtgtcca 1895938739PRTZaire Ebola
virus strain Mayinga 38Met Asp Ser Arg Pro Gln Lys Ile Trp Met Ala Pro
Ser Leu Thr Glu1 5 10
15Ser Asp Met Asp Tyr His Lys Ile Leu Thr Ala Gly Leu Ser Val Gln
20 25 30Gln Gly Ile Val Arg Gln Arg
Val Ile Pro Val Tyr Gln Val Asn Asn 35 40
45Leu Glu Glu Ile Cys Gln Leu Ile Ile Gln Ala Phe Glu Ala Gly
Val 50 55 60Asp Phe Gln Glu Ser Ala
Asp Ser Phe Leu Leu Met Leu Cys Leu His65 70
75 80His Ala Tyr Gln Gly Asp Tyr Lys Leu Phe Leu
Glu Ser Gly Ala Val 85 90
95Lys Tyr Leu Glu Gly His Gly Phe Arg Phe Glu Val Lys Lys Arg Asp
100 105 110Gly Val Lys Arg Leu Glu
Glu Leu Leu Pro Ala Val Ser Ser Gly Lys 115 120
125Asn Ile Lys Arg Thr Leu Ala Ala Met Pro Glu Glu Glu Thr
Thr Glu 130 135 140Ala Asn Ala Gly Gln
Phe Leu Ser Phe Ala Ser Leu Phe Leu Pro Lys145 150
155 160Leu Val Val Gly Glu Lys Ala Cys Leu Glu
Lys Val Gln Arg Gln Ile 165 170
175Gln Val His Ala Glu Gln Gly Leu Ile Gln Tyr Pro Thr Ala Trp Gln
180 185 190Ser Val Gly His Met
Met Val Ile Phe Arg Leu Met Arg Thr Asn Phe 195
200 205Leu Ile Lys Phe Leu Leu Ile His Gln Gly Met His
Met Val Ala Gly 210 215 220His Asp Ala
Asn Asp Ala Val Ile Ser Asn Ser Val Ala Gln Ala Arg225
230 235 240Phe Ser Gly Leu Leu Ile Val
Lys Thr Val Leu Asp His Ile Leu Gln 245
250 255Lys Thr Glu Arg Gly Val Arg Leu His Pro Leu Ala
Arg Thr Ala Lys 260 265 270Val
Lys Asn Glu Val Asn Ser Phe Lys Ala Ala Leu Ser Ser Leu Ala 275
280 285Lys His Gly Glu Tyr Ala Pro Phe Ala
Arg Leu Leu Asn Leu Ser Gly 290 295
300Val Asn Asn Leu Glu His Gly Leu Phe Pro Gln Leu Ser Ala Ile Ala305
310 315 320Leu Gly Val Ala
Thr Ala His Gly Ser Thr Leu Ala Gly Val Asn Val 325
330 335Gly Glu Gln Tyr Gln Gln Leu Arg Glu Ala
Ala Thr Glu Ala Glu Lys 340 345
350Gln Leu Gln Gln Tyr Ala Glu Ser Arg Glu Leu Asp His Leu Gly Leu
355 360 365Asp Asp Gln Glu Lys Lys Ile
Leu Met Asn Phe His Gln Lys Lys Asn 370 375
380Glu Ile Ser Phe Gln Gln Thr Asn Ala Met Val Thr Leu Arg Lys
Glu385 390 395 400Arg Leu
Ala Lys Leu Thr Glu Ala Ile Thr Ala Ala Ser Leu Pro Lys
405 410 415Thr Ser Gly His Tyr Asp Asp
Asp Asp Asp Ile Pro Phe Pro Gly Pro 420 425
430Ile Asn Asp Asp Asp Asn Pro Gly His Gln Asp Asp Asp Pro
Thr Asp 435 440 445Ser Gln Asp Thr
Thr Ile Pro Asp Val Val Val Asp Pro Asp Asp Gly 450
455 460Ser Tyr Gly Glu Tyr Gln Ser Tyr Ser Glu Asn Gly
Met Asn Ala Pro465 470 475
480Asp Asp Leu Val Leu Phe Asp Leu Asp Glu Asp Asp Glu Asp Thr Lys
485 490 495Pro Val Pro Asn Arg
Ser Thr Lys Gly Gly Gln Gln Lys Asn Ser Gln 500
505 510Lys Gly Gln His Ile Glu Gly Arg Gln Thr Gln Phe
Arg Pro Ile Gln 515 520 525Asn Val
Pro Gly Pro His Arg Thr Ile His His Ala Ser Ala Pro Leu 530
535 540Thr Asp Asn Asp Arg Arg Asn Glu Pro Ser Gly
Ser Thr Ser Pro Arg545 550 555
560Met Leu Thr Pro Ile Asn Glu Glu Ala Asp Pro Leu Asp Asp Ala Asp
565 570 575Asp Glu Thr Ser
Ser Leu Pro Pro Leu Glu Ser Asp Asp Glu Glu Gln 580
585 590Asp Arg Asp Gly Thr Ser Asn Arg Thr Pro Thr
Val Ala Pro Pro Ala 595 600 605Pro
Val Tyr Arg Asp His Ser Glu Lys Lys Glu Leu Pro Gln Asp Glu 610
615 620Gln Gln Asp Gln Asp His Thr Gln Glu Ala
Arg Asn Gln Asp Ser Asp625 630 635
640Asn Thr Gln Ser Glu His Ser Leu Glu Glu Met Tyr Arg His Ile
Leu 645 650 655Arg Ser Gln
Gly Pro Phe Asp Ala Val Leu Tyr Tyr His Met Met Lys 660
665 670Asp Glu Pro Val Val Phe Ser Thr Ser Asp
Gly Lys Glu Tyr Thr Tyr 675 680
685Pro Asp Ser Leu Glu Glu Glu Tyr Pro Pro Trp Leu Thr Glu Lys Glu 690
695 700Ala Met Asn Glu Glu Asn Arg Phe
Val Thr Leu Asp Gly Gln Gln Phe705 710
715 720Tyr Trp Pro Val Met Asn His Lys Asn Lys Phe Met
Ala Ile Leu Gln 725 730
735His His Gln39340PRTZaire Ebola virus strain Mayinga 39Met Thr Thr Arg
Thr Lys Gly Arg Gly His Thr Ala Ala Thr Thr Gln1 5
10 15Asn Asp Arg Met Pro Gly Pro Glu Leu Ser
Gly Trp Ile Ser Glu Gln 20 25
30Leu Met Thr Gly Arg Ile Pro Val Ser Asp Ile Phe Cys Asp Ile Glu
35 40 45Asn Asn Pro Gly Leu Cys Tyr Ala
Ser Gln Met Gln Gln Thr Lys Pro 50 55
60Asn Pro Lys Thr Arg Asn Ser Gln Thr Gln Thr Asp Pro Ile Cys Asn65
70 75 80His Ser Phe Glu Glu
Val Val Gln Thr Leu Ala Ser Leu Ala Thr Val 85
90 95Val Gln Gln Gln Thr Ile Ala Ser Glu Ser Leu
Glu Gln Arg Ile Thr 100 105
110Ser Leu Glu Asn Gly Leu Lys Pro Val Tyr Asp Met Ala Lys Thr Ile
115 120 125Ser Ser Leu Asn Arg Val Cys
Ala Glu Met Val Ala Lys Tyr Asp Leu 130 135
140Leu Val Met Thr Thr Gly Arg Ala Thr Ala Thr Ala Ala Ala Thr
Glu145 150 155 160Ala Tyr
Trp Ala Glu His Gly Gln Pro Pro Pro Gly Pro Ser Leu Tyr
165 170 175Glu Glu Ser Ala Ile Arg Gly
Lys Ile Glu Ser Arg Asp Glu Thr Val 180 185
190Pro Gln Ser Val Arg Glu Ala Phe Asn Asn Leu Asn Ser Thr
Thr Ser 195 200 205Leu Thr Glu Glu
Asn Phe Gly Lys Pro Asp Ile Ser Ala Lys Asp Leu 210
215 220Arg Asn Ile Met Tyr Asp His Leu Pro Gly Phe Gly
Thr Ala Phe His225 230 235
240Gln Leu Val Gln Val Ile Cys Lys Leu Gly Lys Asp Ser Asn Ser Leu
245 250 255Asp Ile Ile His Ala
Glu Phe Gln Ala Ser Leu Ala Glu Gly Asp Ser 260
265 270Pro Gln Cys Ala Leu Ile Gln Ile Thr Lys Arg Val
Pro Ile Phe Gln 275 280 285Asp Ala
Ala Pro Pro Val Ile His Ile Arg Ser Arg Gly Asp Ile Pro 290
295 300Arg Ala Cys Gln Lys Ser Leu Arg Pro Val Pro
Pro Ser Pro Lys Ile305 310 315
320Asp Arg Gly Trp Val Cys Val Phe Gln Leu Gln Asp Gly Lys Thr Leu
325 330 335Gly Leu Lys Ile
34040326PRTZaire Ebola virus strain Mayinga 40Met Arg Arg Val
Ile Leu Pro Thr Ala Pro Pro Glu Tyr Met Glu Ala1 5
10 15Ile Tyr Pro Val Arg Ser Asn Ser Thr Ile
Ala Arg Gly Gly Asn Ser 20 25
30Asn Thr Gly Phe Leu Thr Pro Glu Ser Val Asn Gly Asp Thr Pro Ser
35 40 45Asn Pro Leu Arg Pro Ile Ala Asp
Asp Thr Ile Asp His Ala Ser His 50 55
60Thr Pro Gly Ser Val Ser Ser Ala Phe Ile Leu Glu Ala Met Val Asn65
70 75 80Val Ile Ser Gly Pro
Lys Val Leu Met Lys Gln Ile Pro Ile Trp Leu 85
90 95Pro Leu Gly Val Ala Asp Gln Lys Thr Tyr Ser
Phe Asp Ser Thr Thr 100 105
110Ala Ala Ile Met Leu Ala Ser Tyr Thr Ile Thr His Phe Gly Lys Ala
115 120 125Thr Asn Pro Leu Val Arg Val
Asn Arg Leu Gly Pro Gly Ile Pro Asp 130 135
140His Pro Leu Arg Leu Leu Arg Ile Gly Asn Gln Ala Phe Leu Gln
Glu145 150 155 160Phe Val
Leu Pro Pro Val Gln Leu Pro Gln Tyr Phe Thr Phe Asp Leu
165 170 175Thr Ala Leu Lys Leu Ile Thr
Gln Pro Leu Pro Ala Ala Thr Trp Thr 180 185
190Asp Asp Thr Pro Thr Gly Ser Asn Gly Ala Leu Arg Pro Gly
Ile Ser 195 200 205Phe His Pro Lys
Leu Arg Pro Ile Leu Leu Pro Asn Lys Ser Gly Lys 210
215 220Lys Gly Asn Ser Ala Asp Leu Thr Ser Pro Glu Lys
Ile Gln Ala Ile225 230 235
240Met Thr Ser Leu Gln Asp Phe Lys Ile Val Pro Ile Asp Pro Thr Lys
245 250 255Asn Ile Met Gly Ile
Glu Val Pro Glu Thr Leu Val His Lys Leu Thr 260
265 270Gly Lys Lys Val Thr Ser Lys Asn Gly Gln Pro Ile
Ile Pro Val Leu 275 280 285Leu Pro
Lys Tyr Ile Gly Leu Asp Pro Val Ala Pro Gly Asp Leu Thr 290
295 300Met Val Ile Thr Gln Asp Cys Asp Thr Cys His
Ser Pro Ala Ser Leu305 310 315
320Pro Ala Val Ile Glu Lys 32541676PRTZaire Ebola
virus strain Mayinga 41Met Gly Val Thr Gly Ile Leu Gln Leu Pro Arg Asp
Arg Phe Lys Arg1 5 10
15Thr Ser Phe Phe Leu Trp Val Ile Ile Leu Phe Gln Arg Thr Phe Ser
20 25 30Ile Pro Leu Gly Val Ile His
Asn Ser Thr Leu Gln Val Ser Asp Val 35 40
45Asp Lys Leu Val Cys Arg Asp Lys Leu Ser Ser Thr Asn Gln Leu
Arg 50 55 60Ser Val Gly Leu Asn Leu
Glu Gly Asn Gly Val Ala Thr Asp Val Pro65 70
75 80Ser Ala Thr Lys Arg Trp Gly Phe Arg Ser Gly
Val Pro Pro Lys Val 85 90
95Val Asn Tyr Glu Ala Gly Glu Trp Ala Glu Asn Cys Tyr Asn Leu Glu
100 105 110Ile Lys Lys Pro Asp Gly
Ser Glu Cys Leu Pro Ala Ala Pro Asp Gly 115 120
125Ile Arg Gly Phe Pro Arg Cys Arg Tyr Val His Lys Val Ser
Gly Thr 130 135 140Gly Pro Cys Ala Gly
Asp Phe Ala Phe His Lys Glu Gly Ala Phe Phe145 150
155 160Leu Tyr Asp Arg Leu Ala Ser Thr Val Ile
Tyr Arg Gly Thr Thr Phe 165 170
175Ala Glu Gly Val Val Ala Phe Leu Ile Leu Pro Gln Ala Lys Lys Asp
180 185 190Phe Phe Ser Ser His
Pro Leu Arg Glu Pro Val Asn Ala Thr Glu Asp 195
200 205Pro Ser Ser Gly Tyr Tyr Ser Thr Thr Ile Arg Tyr
Gln Ala Thr Gly 210 215 220Phe Gly Thr
Asn Glu Thr Glu Tyr Leu Phe Glu Val Asp Asn Leu Thr225
230 235 240Tyr Val Gln Leu Glu Ser Arg
Phe Thr Pro Gln Phe Leu Leu Gln Leu 245
250 255Asn Glu Thr Ile Tyr Thr Ser Gly Lys Arg Ser Asn
Thr Thr Gly Lys 260 265 270Leu
Ile Trp Lys Val Asn Pro Glu Ile Asp Thr Thr Ile Gly Glu Trp 275
280 285Ala Phe Trp Glu Thr Lys Lys Asn Leu
Thr Arg Lys Ile Arg Ser Glu 290 295
300Glu Leu Ser Phe Thr Val Val Ser Asn Gly Ala Lys Asn Ile Ser Gly305
310 315 320Gln Ser Pro Ala
Arg Thr Ser Ser Asp Pro Gly Thr Asn Thr Thr Thr 325
330 335Glu Asp His Lys Ile Met Ala Ser Glu Asn
Ser Ser Ala Met Val Gln 340 345
350Val His Ser Gln Gly Arg Glu Ala Ala Val Ser His Leu Thr Thr Leu
355 360 365Ala Thr Ile Ser Thr Ser Pro
Gln Ser Leu Thr Thr Lys Pro Gly Pro 370 375
380Asp Asn Ser Thr His Asn Thr Pro Val Tyr Lys Leu Asp Ile Ser
Glu385 390 395 400Ala Thr
Gln Val Glu Gln His His Arg Arg Thr Asp Asn Asp Ser Thr
405 410 415Ala Ser Asp Thr Pro Ser Ala
Thr Thr Ala Ala Gly Pro Pro Lys Ala 420 425
430Glu Asn Thr Asn Thr Ser Lys Ser Thr Asp Phe Leu Asp Pro
Ala Thr 435 440 445Thr Thr Ser Pro
Gln Asn His Ser Glu Thr Ala Gly Asn Asn Asn Thr 450
455 460His His Gln Asp Thr Gly Glu Glu Ser Ala Ser Ser
Gly Lys Leu Gly465 470 475
480Leu Ile Thr Asn Thr Ile Ala Gly Val Ala Gly Leu Ile Thr Gly Gly
485 490 495Arg Arg Thr Arg Arg
Glu Ala Ile Val Asn Ala Gln Pro Lys Cys Asn 500
505 510Pro Asn Leu His Tyr Trp Thr Thr Gln Asp Glu Gly
Ala Ala Ile Gly 515 520 525Leu Ala
Trp Ile Pro Tyr Phe Gly Pro Ala Ala Glu Gly Ile Tyr Ile 530
535 540Glu Gly Leu Met His Asn Gln Asp Gly Leu Ile
Cys Gly Leu Arg Gln545 550 555
560Leu Ala Asn Glu Thr Thr Gln Ala Leu Gln Leu Phe Leu Arg Ala Thr
565 570 575Thr Glu Leu Arg
Thr Phe Ser Ile Leu Asn Arg Lys Ala Ile Asp Phe 580
585 590Leu Leu Gln Arg Trp Gly Gly Thr Cys His Ile
Leu Gly Pro Asp Cys 595 600 605Cys
Ile Glu Pro His Asp Trp Thr Lys Asn Ile Thr Asp Lys Ile Asp 610
615 620Gln Ile Ile His Asp Phe Val Asp Lys Thr
Leu Pro Asp Gln Gly Asp625 630 635
640Asn Asp Asn Trp Trp Thr Gly Trp Arg Gln Trp Ile Pro Ala Gly
Ile 645 650 655Gly Val Thr
Gly Val Ile Ile Ala Val Ile Ala Leu Phe Cys Ile Cys 660
665 670Lys Phe Val Phe 67542364PRTZaire
Ebola virus strain Mayinga 42Met Gly Val Thr Gly Ile Leu Gln Leu Pro Arg
Asp Arg Phe Lys Arg1 5 10
15Thr Ser Phe Phe Leu Trp Val Ile Ile Leu Phe Gln Arg Thr Phe Ser
20 25 30Ile Pro Leu Gly Val Ile His
Asn Ser Thr Leu Gln Val Ser Asp Val 35 40
45Asp Lys Leu Val Cys Arg Asp Lys Leu Ser Ser Thr Asn Gln Leu
Arg 50 55 60Ser Val Gly Leu Asn Leu
Glu Gly Asn Gly Val Ala Thr Asp Val Pro65 70
75 80Ser Ala Thr Lys Arg Trp Gly Phe Arg Ser Gly
Val Pro Pro Lys Val 85 90
95Val Asn Tyr Glu Ala Gly Glu Trp Ala Glu Asn Cys Tyr Asn Leu Glu
100 105 110Ile Lys Lys Pro Asp Gly
Ser Glu Cys Leu Pro Ala Ala Pro Asp Gly 115 120
125Ile Arg Gly Phe Pro Arg Cys Arg Tyr Val His Lys Val Ser
Gly Thr 130 135 140Gly Pro Cys Ala Gly
Asp Phe Ala Phe His Lys Glu Gly Ala Phe Phe145 150
155 160Leu Tyr Asp Arg Leu Ala Ser Thr Val Ile
Tyr Arg Gly Thr Thr Phe 165 170
175Ala Glu Gly Val Val Ala Phe Leu Ile Leu Pro Gln Ala Lys Lys Asp
180 185 190Phe Phe Ser Ser His
Pro Leu Arg Glu Pro Val Asn Ala Thr Glu Asp 195
200 205Pro Ser Ser Gly Tyr Tyr Ser Thr Thr Ile Arg Tyr
Gln Ala Thr Gly 210 215 220Phe Gly Thr
Asn Glu Thr Glu Tyr Leu Phe Glu Val Asp Asn Leu Thr225
230 235 240Tyr Val Gln Leu Glu Ser Arg
Phe Thr Pro Gln Phe Leu Leu Gln Leu 245
250 255Asn Glu Thr Ile Tyr Thr Ser Gly Lys Arg Ser Asn
Thr Thr Gly Lys 260 265 270Leu
Ile Trp Lys Val Asn Pro Glu Ile Asp Thr Thr Ile Gly Glu Trp 275
280 285Ala Phe Trp Glu Thr Lys Lys Thr Ser
Leu Glu Lys Phe Ala Val Lys 290 295
300Ser Cys Leu Ser Gln Leu Tyr Gln Thr Glu Pro Lys Thr Ser Val Val305
310 315 320Arg Val Arg Arg
Glu Leu Leu Pro Thr Gln Gly Pro Thr Gln Gln Leu 325
330 335Lys Thr Thr Lys Ser Trp Leu Gln Lys Ile
Pro Leu Gln Trp Phe Lys 340 345
350Cys Thr Val Lys Glu Gly Lys Leu Gln Cys Arg Ile 355
36043288PRTZaire Ebola virus strain Mayinga 43Met Glu Ala Ser Tyr
Glu Arg Gly Arg Pro Arg Ala Ala Arg Gln His1 5
10 15Ser Arg Asp Gly His Asp His His Val Arg Ala
Arg Ser Ser Ser Arg 20 25
30Glu Asn Tyr Arg Gly Glu Tyr Arg Gln Ser Arg Ser Ala Ser Gln Val
35 40 45Arg Val Pro Thr Val Phe His Lys
Lys Arg Val Glu Pro Leu Thr Val 50 55
60Pro Pro Ala Pro Lys Asp Ile Cys Pro Thr Leu Lys Lys Gly Phe Leu65
70 75 80Cys Asp Ser Ser Phe
Cys Lys Lys Asp His Gln Leu Glu Ser Leu Thr 85
90 95Asp Arg Glu Leu Leu Leu Leu Ile Ala Arg Lys
Thr Cys Gly Ser Val 100 105
110Glu Gln Gln Leu Asn Ile Thr Ala Pro Lys Asp Ser Arg Leu Ala Asn
115 120 125Pro Thr Ala Asp Asp Phe Gln
Gln Glu Glu Gly Pro Lys Ile Thr Leu 130 135
140Leu Thr Leu Ile Lys Thr Ala Glu His Trp Ala Arg Gln Asp Ile
Arg145 150 155 160Thr Ile
Glu Asp Ser Lys Leu Arg Ala Leu Leu Thr Leu Cys Ala Val
165 170 175Met Thr Arg Lys Phe Ser Lys
Ser Gln Leu Ser Leu Leu Cys Glu Thr 180 185
190His Leu Arg Arg Glu Gly Leu Gly Gln Asp Gln Ala Glu Pro
Val Leu 195 200 205Glu Val Tyr Gln
Arg Leu His Ser Asp Lys Gly Gly Ser Phe Glu Ala 210
215 220Ala Leu Trp Gln Gln Trp Asp Arg Gln Ser Leu Ile
Met Phe Ile Thr225 230 235
240Ala Phe Leu Asn Ile Ala Leu Gln Leu Pro Cys Glu Ser Ser Ala Val
245 250 255Val Val Ser Gly Leu
Arg Thr Leu Val Pro Gln Ser Asp Asn Glu Glu 260
265 270Ala Ser Thr Asn Pro Gly Thr Cys Ser Trp Ser Asp
Glu Gly Thr Pro 275 280
28544251PRTZaire Ebola virus strain Mayinga 44Met Ala Lys Ala Thr Gly Arg
Tyr Asn Leu Ile Ser Pro Lys Lys Asp1 5 10
15Leu Glu Lys Gly Val Val Leu Ser Asp Leu Cys Asn Phe Leu
Val Ser 20 25 30Gln Thr Ile
Gln Gly Trp Lys Val Tyr Trp Ala Gly Ile Glu Phe Asp 35
40 45Val Thr His Lys Gly Met Ala Leu Leu His Arg Leu
Lys Thr Asn Asp 50 55 60Phe Ala Pro
Ala Trp Ser Ile Thr Arg Asn Leu Phe Pro His Leu Phe65 70
75 80Gln Asn Pro Asn Ser Thr Ile Glu
Ser Pro Leu Trp Ala Leu Arg Val 85 90
95Ile Leu Ala Ala Gly Ile Gln Asp Gln Leu Ile Asp Gln Ser Leu
Ile 100 105 110Glu Pro Leu Ala
Gly Ala Leu Gly Leu Ile Ser Asp Trp Leu Leu Thr 115
120 125Thr Asn Thr Asn His Phe Asn Met Arg Thr Gln Arg
Val Lys Glu Gln 130 135 140Leu Ser Pro
Lys Met Leu Ser Leu Ile Arg Ser Asn Ile Leu Lys Phe145
150 155 160Ile Asn Lys Leu Asp Ala Leu
His Val Val Asn Tyr Asn Gly Leu Leu 165
170 175Ser Ser Ile Glu Ile Gly Thr Gln Asn His Ile Ile
Ile Ile Thr Arg 180 185 190Thr
Asn Met Gly Phe Leu Val Glu Leu Gln Glu Pro Asp Lys Ser Ala 195
200 205Met Asn Arg Met Lys Pro Gly Pro Ala
Lys Phe Ser Leu Leu His Glu 210 215
220Ser Thr Leu Lys Ala Phe Thr Gln Gly Ser Ser Thr Arg Met Gln Ser225
230 235 240Leu Ile Leu Glu
Phe Asn Ser Ser Leu Ala Ile 245
250452212PRTZaire Ebola virus strain Mayinga 45Met Ala Thr Gln His Thr
Gln Tyr Pro Asp Ala Arg Leu Ser Ser Pro1 5
10 15Ile Val Leu Asp Gln Cys Asp Leu Val Thr Arg Ala
Cys Gly Leu Tyr 20 25 30Ser
Ser Tyr Ser Leu Asn Pro Gln Leu Arg Asn Cys Lys Leu Pro Lys 35
40 45His Ile Tyr Arg Leu Lys Tyr Asp Val
Thr Val Thr Lys Phe Leu Ser 50 55
60Asp Val Pro Val Ala Thr Leu Pro Ile Asp Phe Ile Val Pro Val Leu65
70 75 80Leu Lys Ala Leu Ser
Gly Asn Gly Phe Cys Pro Val Glu Pro Arg Cys 85
90 95Gln Gln Phe Leu Asp Glu Ile Ile Lys Tyr Thr
Met Gln Asp Ala Leu 100 105
110Phe Leu Lys Tyr Tyr Leu Lys Asn Val Gly Ala Gln Glu Asp Cys Val
115 120 125Asp Glu His Phe Gln Glu Lys
Ile Leu Ser Ser Ile Gln Gly Asn Glu 130 135
140Phe Leu His Gln Met Phe Phe Trp Tyr Asp Leu Ala Ile Leu Thr
Arg145 150 155 160Arg Gly
Arg Leu Asn Arg Gly Asn Ser Arg Ser Thr Trp Phe Val His
165 170 175Asp Asp Leu Ile Asp Ile Leu
Gly Tyr Gly Asp Tyr Val Phe Trp Lys 180 185
190Ile Pro Ile Ser Met Leu Pro Leu Asn Thr Gln Gly Ile Pro
His Ala 195 200 205Ala Met Asp Trp
Tyr Gln Ala Ser Val Phe Lys Glu Ala Val Gln Gly 210
215 220His Thr His Ile Val Ser Val Ser Thr Ala Asp Val
Leu Ile Met Cys225 230 235
240Lys Asp Leu Ile Thr Cys Arg Phe Asn Thr Thr Leu Ile Ser Lys Ile
245 250 255Ala Glu Ile Glu Asp
Pro Val Cys Ser Asp Tyr Pro Asn Phe Lys Ile 260
265 270Val Ser Met Leu Tyr Gln Ser Gly Asp Tyr Leu Leu
Ser Ile Leu Gly 275 280 285Ser Asp
Gly Tyr Lys Ile Ile Lys Phe Leu Glu Pro Leu Cys Leu Ala 290
295 300Lys Ile Gln Leu Cys Ser Lys Tyr Thr Glu Arg
Lys Gly Arg Phe Leu305 310 315
320Thr Gln Met His Leu Ala Val Asn His Thr Leu Glu Glu Ile Thr Glu
325 330 335Met Arg Ala Leu
Lys Pro Ser Gln Ala Gln Lys Ile Arg Glu Phe His 340
345 350Arg Thr Leu Ile Arg Leu Glu Met Thr Pro Gln
Gln Leu Cys Glu Leu 355 360 365Phe
Ser Ile Gln Lys His Trp Gly His Pro Val Leu His Ser Glu Thr 370
375 380Ala Ile Gln Lys Val Lys Lys His Ala Thr
Val Leu Lys Ala Leu Arg385 390 395
400Pro Ile Val Ile Phe Glu Thr Tyr Cys Val Phe Lys Tyr Ser Ile
Ala 405 410 415Lys His Tyr
Phe Asp Ser Gln Gly Ser Trp Tyr Ser Val Thr Ser Asp 420
425 430Arg Asn Leu Thr Pro Gly Leu Asn Ser Tyr
Ile Lys Arg Asn Gln Phe 435 440
445Pro Pro Leu Pro Met Ile Lys Glu Leu Leu Trp Glu Phe Tyr His Leu 450
455 460Asp His Pro Pro Leu Phe Ser Thr
Lys Ile Ile Ser Asp Leu Ser Ile465 470
475 480Phe Ile Lys Asp Arg Ala Thr Ala Val Glu Arg Thr
Cys Trp Asp Ala 485 490
495Val Phe Glu Pro Asn Val Leu Gly Tyr Asn Pro Pro His Lys Phe Ser
500 505 510Thr Lys Arg Val Pro Glu
Gln Phe Leu Glu Gln Glu Asn Phe Ser Ile 515 520
525Glu Asn Val Leu Ser Tyr Ala Gln Lys Leu Glu Tyr Leu Leu
Pro Gln 530 535 540Tyr Arg Asn Phe Ser
Phe Ser Leu Lys Glu Lys Glu Leu Asn Val Gly545 550
555 560Arg Thr Phe Gly Lys Leu Pro Tyr Pro Thr
Arg Asn Val Gln Thr Leu 565 570
575Cys Glu Ala Leu Leu Ala Asp Gly Leu Ala Lys Ala Phe Pro Ser Asn
580 585 590Met Met Val Val Thr
Glu Arg Glu Gln Lys Glu Ser Leu Leu His Gln 595
600 605Ala Ser Trp His His Thr Ser Asp Asp Phe Gly Glu
His Ala Thr Val 610 615 620Arg Gly Ser
Ser Phe Val Thr Asp Leu Glu Lys Tyr Asn Leu Ala Phe625
630 635 640Arg Tyr Glu Phe Thr Ala Pro
Phe Ile Glu Tyr Cys Asn Arg Cys Tyr 645
650 655Gly Val Lys Asn Val Phe Asn Trp Met His Tyr Thr
Ile Pro Gln Cys 660 665 670Tyr
Met His Val Ser Asp Tyr Tyr Asn Pro Pro His Asn Leu Thr Leu 675
680 685Glu Asn Arg Asp Asn Pro Pro Glu Gly
Pro Ser Ser Tyr Arg Gly His 690 695
700Met Gly Gly Ile Glu Gly Leu Gln Gln Lys Leu Trp Thr Ser Ile Ser705
710 715 720Cys Ala Gln Ile
Ser Leu Val Glu Ile Lys Thr Gly Phe Lys Leu Arg 725
730 735Ser Ala Val Met Gly Asp Asn Gln Cys Ile
Thr Val Leu Ser Val Phe 740 745
750Pro Leu Glu Thr Asp Ala Asp Glu Gln Glu Gln Ser Ala Glu Asp Asn
755 760 765Ala Ala Arg Val Ala Ala Ser
Leu Ala Lys Val Thr Ser Ala Cys Gly 770 775
780Ile Phe Leu Lys Pro Asp Glu Thr Phe Val His Ser Gly Phe Ile
Tyr785 790 795 800Phe Gly
Lys Lys Gln Tyr Leu Asn Gly Val Gln Leu Pro Gln Ser Leu
805 810 815Lys Thr Ala Ala Arg Met Ala
Pro Leu Ser Asp Ala Ile Phe Asp Asp 820 825
830Leu Gln Gly Thr Leu Ala Ser Ile Gly Thr Ala Phe Glu Arg
Ser Ile 835 840 845Ser Glu Thr Arg
His Ile Phe Pro Cys Arg Ile Thr Ala Ala Phe His 850
855 860Thr Phe Phe Ser Val Arg Ile Leu Gln Tyr His His
Leu Gly Phe Asn865 870 875
880Lys Gly Phe Asp Leu Gly Gln Leu Thr Leu Gly Lys Pro Leu Asp Phe
885 890 895Gly Thr Ile Ser Leu
Ala Leu Ala Val Pro Gln Val Leu Gly Gly Leu 900
905 910Ser Phe Leu Asn Pro Glu Lys Cys Phe Tyr Arg Asn
Leu Gly Asp Pro 915 920 925Val Thr
Ser Gly Leu Phe Gln Leu Lys Thr Tyr Leu Arg Met Ile Glu 930
935 940Met Asp Asp Leu Phe Leu Pro Leu Ile Ala Lys
Asn Pro Gly Asn Cys945 950 955
960Thr Ala Ile Asp Phe Val Leu Asn Pro Ser Gly Leu Asn Val Pro Gly
965 970 975Ser Gln Asp Leu
Thr Ser Phe Leu Arg Gln Ile Val Arg Arg Thr Ile 980
985 990Thr Leu Ser Ala Lys Asn Lys Leu Ile Asn Thr
Leu Phe His Ala Ser 995 1000
1005Ala Asp Phe Glu Asp Glu Met Val Cys Lys Trp Leu Leu Ser Ser Thr
1010 1015 1020Pro Val Met Ser Arg Phe Ala
Ala Asp Ile Phe Ser Arg Thr Pro Ser1025 1030
1035 1040Gly Lys Arg Leu Gln Ile Leu Gly Tyr Leu Glu Gly
Thr Arg Thr Leu 1045 1050
1055Leu Ala Ser Lys Ile Ile Asn Asn Asn Thr Glu Thr Pro Val Leu Asp
1060 1065 1070Arg Leu Arg Lys Ile Thr
Leu Gln Arg Trp Ser Leu Trp Phe Ser Tyr 1075 1080
1085Leu Asp His Cys Asp Asn Ile Leu Ala Glu Ala Leu Thr Gln
Ile Thr 1090 1095 1100Cys Thr Val Asp
Leu Ala Gln Ile Leu Arg Glu Tyr Ser Trp Ala His1105 1110
1115 1120Ile Leu Glu Gly Arg Pro Leu Ile Gly
Ala Thr Leu Pro Cys Met Ile 1125 1130
1135Glu Gln Phe Lys Val Phe Trp Leu Lys Pro Tyr Glu Gln Cys Pro
Gln 1140 1145 1150Cys Ser Asn
Ala Lys Gln Pro Gly Gly Lys Pro Phe Val Ser Val Ala 1155
1160 1165Val Lys Lys His Ile Val Ser Ala Trp Pro Asn
Ala Ser Arg Ile Ser 1170 1175 1180Trp
Thr Ile Gly Asp Gly Ile Pro Tyr Ile Gly Ser Arg Thr Glu Asp1185
1190 1195 1200Lys Ile Gly Gln Pro Ala
Ile Lys Pro Lys Cys Pro Ser Ala Ala Leu 1205
1210 1215Arg Glu Ala Ile Glu Leu Ala Ser Arg Leu Thr Trp
Val Thr Gln Gly 1220 1225
1230Ser Ser Asn Ser Asp Leu Leu Ile Lys Pro Phe Leu Glu Ala Arg Val
1235 1240 1245Asn Leu Ser Val Gln Glu Ile
Leu Gln Met Thr Pro Ser His Tyr Ser 1250 1255
1260Gly Asn Ile Val His Arg Tyr Asn Asp Gln Tyr Ser Pro His Ser
Phe1265 1270 1275 1280Met
Ala Asn Arg Met Ser Asn Ser Ala Thr Arg Leu Ile Val Ser Thr
1285 1290 1295Asn Thr Leu Gly Glu Phe Ser
Gly Gly Gly Gln Ser Ala Arg Asp Ser 1300 1305
1310Asn Ile Ile Phe Gln Asn Val Ile Asn Tyr Ala Val Ala Leu
Phe Asp 1315 1320 1325Ile Lys Phe
Arg Asn Thr Glu Ala Thr Asp Ile Gln Tyr Asn Arg Ala 1330
1335 1340His Leu His Leu Thr Lys Cys Cys Thr Arg Glu Val
Pro Ala Gln Tyr1345 1350 1355
1360Leu Thr Tyr Thr Ser Thr Leu Asp Leu Asp Leu Thr Arg Tyr Arg Glu
1365 1370 1375Asn Glu Leu Ile Tyr
Asp Ser Asn Pro Leu Lys Gly Gly Leu Asn Cys 1380
1385 1390Asn Ile Ser Phe Asp Asn Pro Phe Phe Gln Gly Lys
Arg Leu Asn Ile 1395 1400 1405Ile
Glu Asp Asp Leu Ile Arg Leu Pro His Leu Ser Gly Trp Glu Leu 1410
1415 1420Ala Lys Thr Ile Met Gln Ser Ile Ile Ser
Asp Ser Asn Asn Ser Ser1425 1430 1435
1440Thr Asp Pro Ile Ser Ser Gly Glu Thr Arg Ser Phe Thr Thr His
Phe 1445 1450 1455Leu Thr
Tyr Pro Lys Ile Gly Leu Leu Tyr Ser Phe Gly Ala Phe Val 1460
1465 1470Ser Tyr Tyr Leu Gly Asn Thr Ile Leu
Arg Thr Lys Lys Leu Thr Leu 1475 1480
1485Asp Asn Phe Leu Tyr Tyr Leu Thr Thr Gln Ile His Asn Leu Pro His
1490 1495 1500Arg Ser Leu Arg Ile Leu Lys
Pro Thr Phe Lys His Ala Ser Val Met1505 1510
1515 1520Ser Arg Leu Met Ser Ile Asp Pro His Phe Ser Ile
Tyr Ile Gly Gly 1525 1530
1535Ala Ala Gly Asp Arg Gly Leu Ser Asp Ala Ala Arg Leu Phe Leu Arg
1540 1545 1550Thr Ser Ile Ser Ser Phe
Leu Thr Phe Val Lys Glu Trp Ile Ile Asn 1555 1560
1565Arg Gly Thr Ile Val Pro Leu Trp Ile Val Tyr Pro Leu Glu
Gly Gln 1570 1575 1580Asn Pro Thr Pro
Val Asn Asn Phe Leu Tyr Gln Ile Val Glu Leu Leu1585 1590
1595 1600Val His Asp Ser Ser Arg Gln Gln Ala
Phe Lys Thr Thr Ile Ser Asp 1605 1610
1615His Val His Pro His Asp Asn Leu Val Tyr Thr Cys Lys Ser Thr
Ala 1620 1625 1630Ser Asn Phe
Phe His Ala Ser Leu Ala Tyr Trp Arg Ser Arg His Arg 1635
1640 1645Asn Ser Asn Arg Lys Tyr Leu Ala Arg Asp Ser
Ser Thr Gly Ser Ser 1650 1655 1660Thr
Asn Asn Ser Asp Gly His Ile Glu Arg Ser Gln Glu Gln Thr Thr1665
1670 1675 1680Arg Asp Pro His Asp Gly
Thr Glu Arg Asn Leu Val Leu Gln Met Ser 1685
1690 1695His Glu Ile Lys Arg Thr Thr Ile Pro Gln Glu Asn
Thr His Gln Gly 1700 1705
1710Pro Ser Phe Gln Ser Phe Leu Ser Asp Ser Ala Cys Gly Thr Ala Asn
1715 1720 1725Pro Lys Leu Asn Phe Asp Arg
Ser Arg His Asn Val Lys Phe Gln Asp 1730 1735
1740His Asn Ser Ala Ser Lys Arg Glu Gly His Gln Ile Ile Ser His
Arg1745 1750 1755 1760Leu
Val Leu Pro Phe Phe Thr Leu Ser Gln Gly Thr Arg Gln Leu Thr
1765 1770 1775Ser Ser Asn Glu Ser Gln Thr
Gln Asp Glu Ile Ser Lys Tyr Leu Arg 1780 1785
1790Gln Leu Arg Ser Val Ile Asp Thr Thr Val Tyr Cys Arg Phe
Thr Gly 1795 1800 1805Ile Val Ser
Ser Met His Tyr Lys Leu Asp Glu Val Leu Trp Glu Ile 1810
1815 1820Glu Ser Phe Lys Ser Ala Val Thr Leu Ala Glu Gly
Glu Gly Ala Gly1825 1830 1835
1840Ala Leu Leu Leu Ile Gln Lys Tyr Gln Val Lys Thr Leu Phe Phe Asn
1845 1850 1855Thr Leu Ala Thr Glu
Ser Ser Ile Glu Ser Glu Ile Val Ser Gly Met 1860
1865 1870Thr Thr Pro Arg Met Leu Leu Pro Val Met Ser Lys
Phe His Asn Asp 1875 1880 1885Gln
Ile Glu Ile Ile Leu Asn Asn Ser Ala Ser Gln Ile Thr Asp Ile 1890
1895 1900Thr Asn Pro Thr Trp Phe Lys Asp Gln Arg
Ala Arg Leu Pro Lys Gln1905 1910 1915
1920Val Glu Val Ile Thr Met Asp Ala Glu Thr Thr Glu Asn Ile Asn
Arg 1925 1930 1935Ser Lys
Leu Tyr Glu Ala Val Tyr Lys Leu Ile Leu His His Ile Asp 1940
1945 1950Pro Ser Val Leu Lys Ala Val Val Leu
Lys Val Phe Leu Ser Asp Thr 1955 1960
1965Glu Gly Met Leu Trp Leu Asn Asp Asn Leu Ala Pro Phe Phe Ala Thr
1970 1975 1980Gly Tyr Leu Ile Lys Pro Ile
Thr Ser Ser Ala Arg Ser Ser Glu Trp1985 1990
1995 2000Tyr Leu Cys Leu Thr Asn Phe Leu Ser Thr Thr Arg
Lys Met Pro His 2005 2010
2015Gln Asn His Leu Ser Cys Lys Gln Val Ile Leu Thr Ala Leu Gln Leu
2020 2025 2030Gln Ile Gln Arg Ser Pro
Tyr Trp Leu Ser His Leu Thr Gln Tyr Ala 2035 2040
2045Asp Cys Glu Leu His Leu Ser Tyr Ile Arg Leu Gly Phe Pro
Ser Leu 2050 2055 2060Glu Lys Val Leu
Tyr His Arg Tyr Asn Leu Val Asp Ser Lys Arg Gly2065 2070
2075 2080Pro Leu Val Ser Ile Thr Gln His Leu
Ala His Leu Arg Ala Glu Ile 2085 2090
2095Arg Glu Leu Thr Asn Asp Tyr Asn Gln Gln Arg Gln Ser Arg Thr
Gln 2100 2105 2110Thr Tyr His
Phe Ile Arg Thr Ala Lys Gly Arg Ile Thr Lys Leu Val 2115
2120 2125Asn Asp Tyr Leu Lys Phe Phe Leu Ile Val Gln
Ala Leu Lys His Asn 2130 2135 2140Gly
Thr Trp Gln Ala Glu Phe Lys Lys Leu Pro Glu Leu Ile Ser Val2145
2150 2155 2160Cys Asn Arg Phe Tyr His
Ile Arg Asp Cys Asn Cys Glu Glu Arg Phe 2165
2170 2175Leu Val Gln Thr Leu Tyr Leu His Arg Met Gln Asp
Ser Glu Val Lys 2180 2185
2190Leu Ile Glu Arg Leu Thr Gly Leu Leu Ser Leu Phe Pro Asp Gly Leu
2195 2200 2205Tyr Arg Phe Asp
22104618959DNAZaire Ebola virus strain Mayinga 46cggacacaca aaaagaaaga
agaattttta ggatcttttg tgtgcgaata actatgagga 60agattaataa ttttcctctc
attgaaattt atatcggaat ttaaattgaa attgttactg 120taatcacacc tggtttgttt
cagagccaca tcacaaagat agagaacaac ctaggtctcc 180gaagggagca agggcatcag
tgtgctcagt tgaaaatccc ttgtcaacac ctaggtctta 240tcacatcaca agttccacct
cagactctgc agggtgatcc aacaacctta atagaaacat 300tattgttaaa ggacagcatt
agttcacagt caaacaagca agattgagaa ttaaccttgg 360ttttgaactt gaacacttag
gggattgaag attcaacaac cctaaagctt ggggtaaaac 420attggaaata gttaaaagac
aaattgctcg gaatcacaaa attccgagta tggattctcg 480tcctcagaaa atctggatgg
cgccgagtct cactgaatct gacatggatt accacaagat 540cttgacagca ggtctgtccg
ttcaacaggg gattgttcgg caaagagtca tcccagtgta 600tcaagtaaac aatcttgaag
aaatttgcca acttatcata caggcctttg aagcaggtgt 660tgattttcaa gagagtgcgg
acagtttcct tctcatgctt tgtcttcatc atgcgtacca 720gggagattac aaacttttct
tggaaagtgg cgcagtcaag tatttggaag ggcacgggtt 780ccgttttgaa gtcaagaagc
gtgatggagt gaagcgcctt gaggaattgc tgccagcagt 840atctagtgga aaaaacatta
agagaacact tgctgccatg ccggaagagg agacaactga 900agctaatgcc ggtcagtttc
tctcctttgc aagtctattc cttccgaaat tggtagtagg 960agaaaaggct tgccttgaga
aggttcaaag gcaaattcaa gtacatgcag agcaaggact 1020gatacaatat ccaacagctt
ggcaatcagt aggacacatg atggtgattt tccgtttgat 1080gcgaacaaat tttctgatca
aatttctcct aatacaccaa gggatgcaca tggttgccgg 1140gcatgatgcc aacgatgctg
tgatttcaaa ttcagtggct caagctcgtt tttcaggctt 1200attgattgtc aaaacagtac
ttgatcatat cctacaaaag acagaacgag gagttcgtct 1260ccatcctctt gcaaggaccg
ccaaggtaaa aaatgaggtg aactccttta aggctgcact 1320cagctccctg gccaagcatg
gagagtatgc tcctttcgcc cgacttttga acctttctgg 1380agtaaataat cttgagcatg
gtcttttccc tcaactatcg gcaattgcac tcggagtcgc 1440cacagcacac gggagtaccc
tcgcaggagt aaatgttgga gaacagtatc aacaactcag 1500agaggctgcc actgaggctg
agaagcaact ccaacaatat gcagagtctc gcgaacttga 1560ccatcttgga cttgatgatc
aggaaaagaa aattcttatg aacttccatc agaaaaagaa 1620cgaaatcagc ttccagcaaa
caaacgctat ggtaactcta agaaaagagc gcctggccaa 1680gctgacagaa gctatcactg
ctgcgtcact gcccaaaaca agtggacatt acgatgatga 1740tgacgacatt ccctttccag
gacccatcaa tgatgacgac aatcctggcc atcaagatga 1800tgatccgact gactcacagg
atacgaccat tcccgatgtg gtggttgatc ctgatgatgg 1860aagctacggc gaataccaga
gttactcgga aaacggcatg aatgcaccag atgacttggt 1920cctattcgat ctagacgagg
acgacgagga cactaagcca gtgcctaata gatcgaccaa 1980gggtggacaa cagaagaaca
gtcaaaaggg ccagcatata gagggcagac agacacaatt 2040caggccaatt caaaatgtcc
caggccctca cagaacaatc caccacgcca gtgcgccact 2100cacggacaat gacagaagaa
atgaaccctc cggctcaacc agccctcgca tgctgacacc 2160aattaacgaa gaggcagacc
cactggacga tgccgacgac gagacgtcta gccttccgcc 2220cttggagtca gatgatgaag
agcaggacag ggacggaact tccaaccgca cacccactgt 2280cgccccaccg gctcccgtat
acagagatca ctctgaaaag aaagaactcc cgcaagacga 2340gcaacaagat caggaccaca
ctcaagaggc caggaaccag gacagtgaca acacccagtc 2400agaacactcc cttgaggaga
tgtatcgcca cattctaaga tcacaggggc catttgatgc 2460tgttttgtat tatcatatga
tgaaggatga gcctgtagtt ttcagtacca gtgatggcaa 2520agagtacacg tatccagact
cccttgaaga ggaatatcca ccatggctca ctgaaaaaga 2580ggctatgaat gaagagaata
gatttgttac attggatggt caacaatttt attggccggt 2640gatgaatcac aagaataaat
tcatggcaat cctgcaacat catcagtgaa tgagcatgga 2700acaatgggat gattcaaccg
acaaatagct aacattaagt agtcaaggaa cgaaaacagg 2760aagaattttt gatgtctaag
gtgtgaatta ttatcacaat aaaagtgatt cttatttttg 2820aatttaaagc tagcttatta
ttactagccg tttttcaaag ttcaatttga gtcttaatgc 2880aaataggcgt taagccacag
ttatagccat aattgtaact caatattcta actagcgatt 2940tatctaaatt aaattacatt
atgcttttat aacttaccta ctagcctgcc caacatttac 3000acgatcgttt tataattaag
aaaaaactaa tgatgaagat taaaaccttc atcatcctta 3060cgtcaattga attctctagc
actcgaagct tattgtcttc aatgtaaaag aaaagctggt 3120ctaacaagat gacaactaga
acaaagggca ggggccatac tgcggccacg actcaaaacg 3180acagaatgcc aggccctgag
ctttcgggct ggatctctga gcagctaatg accggaagaa 3240ttcctgtaag cgacatcttc
tgtgatattg agaacaatcc aggattatgc tacgcatccc 3300aaatgcaaca aacgaagcca
aacccgaaga cgcgcaacag tcaaacccaa acggacccaa 3360tttgcaatca tagttttgag
gaggtagtac aaacattggc ttcattggct actgttgtgc 3420aacaacaaac catcgcatca
gaatcattag aacaacgcat tacgagtctt gagaatggtc 3480taaagccagt ttatgatatg
gcaaaaacaa tctcctcatt gaacagggtt tgtgctgaga 3540tggttgcaaa atatgatctt
ctggtgatga caaccggtcg ggcaacagca accgctgcgg 3600caactgaggc ttattgggcc
gaacatggtc aaccaccacc tggaccatca ctttatgaag 3660aaagtgcgat tcggggtaag
attgaatcta gagatgagac cgtccctcaa agtgttaggg 3720aggcattcaa caatctaaac
agtaccactt cactaactga ggaaaatttt gggaaacctg 3780acatttcggc aaaggatttg
agaaacatta tgtatgatca cttgcctggt tttggaactg 3840ctttccacca attagtacaa
gtgatttgta aattgggaaa agatagcaac tcattggaca 3900tcattcatgc tgagttccag
gccagcctgg ctgaaggaga ctctcctcaa tgtgccctaa 3960ttcaaattac aaaaagagtt
ccaatcttcc aagatgctgc tccacctgtc atccacatcc 4020gctctcgagg tgacattccc
cgagcttgcc agaaaagctt gcgtccagtc ccaccatcgc 4080ccaagattga tcgaggttgg
gtatgtgttt ttcagcttca agatggtaaa acacttggac 4140tcaaaatttg agccaatctc
ccttccctcc gaaagaggcg aataatagca gaggcttcaa 4200ctgctgaact atagggtacg
ttacattaat gatacacttg tgagtatcag ccctggataa 4260tataagtcaa ttaaacgacc
aagataaaat tgttcatatc tcgctagcag cttaaaatat 4320aaatgtaata ggagctatat
ctctgacagt attataatca attgttatta agtaacccaa 4380accaaaagtg atgaagatta
agaaaaacct acctcggctg agagagtgtt ttttcattaa 4440ccttcatctt gtaaacgttg
agcaaaattg ttaaaaatat gaggcgggtt atattgccta 4500ctgctcctcc tgaatatatg
gaggccatat accctgtcag gtcaaattca acaattgcta 4560gaggtggcaa cagcaataca
ggcttcctga caccggagtc agtcaatggg gacactccat 4620cgaatccact caggccaatt
gccgatgaca ccatcgacca tgccagccac acaccaggca 4680gtgtgtcatc agcattcatc
cttgaagcta tggtgaatgt catatcgggc cccaaagtgc 4740taatgaagca aattccaatt
tggcttcctc taggtgtcgc tgatcaaaag acctacagct 4800ttgactcaac tacggccgcc
atcatgcttg cttcatacac tatcacccat ttcggcaagg 4860caaccaatcc acttgtcaga
gtcaatcggc tgggtcctgg aatcccggat catcccctca 4920ggctcctgcg aattggaaac
caggctttcc tccaggagtt cgttcttccg ccagtccaac 4980taccccagta tttcaccttt
gatttgacag cactcaaact gatcacccaa ccactgcctg 5040ctgcaacatg gaccgatgac
actccaacag gatcaaatgg agcgttgcgt ccaggaattt 5100catttcatcc aaaacttcgc
cccattcttt tacccaacaa aagtgggaag aaggggaaca 5160gtgccgatct aacatctccg
gagaaaatcc aagcaataat gacttcactc caggacttta 5220agatcgttcc aattgatcca
accaaaaata tcatgggaat cgaagtgcca gaaactctgg 5280tccacaagct gaccggtaag
aaggtgactt ctaaaaatgg acaaccaatc atccctgttc 5340ttttgccaaa gtacattggg
ttggacccgg tggctccagg agacctcacc atggtaatca 5400cacaggattg tgacacgtgt
cattctcctg caagtcttcc agctgtgatt gagaagtaat 5460tgcaataatt gactcagatc
cagttttata gaatcttctc agggatagtg ataacatcta 5520tttagtaatc cgtccattag
aggagacact tttaattgat caatatacta aaggtgcttt 5580acaccattgt cttttttctc
tcctaaatgt agaacttaac aaaagactca taatatactt 5640gtttttaaag gattgattga
tgaaagatca taactaataa cattacaaat aatcctacta 5700taatcaatac ggtgattcaa
atgttaatct ttctcattgc acatactttt tgcccttatc 5760ctcaaattgc ctgcatgctt
acatctgagg atagccagtg tgacttggat tggaaatgtg 5820gagaaaaaat cgggacccat
ttctaggttg ttcacaatcc aagtacagac attgcccttc 5880taattaagaa aaaatcggcg
atgaagatta agccgacagt gagcgtaatc ttcatctctc 5940ttagattatt tgttttccag
agtaggggtc gtcaggtcct tttcaatcgt gtaaccaaaa 6000taaactccac tagaaggata
ttgtggggca acaacacaat gggcgttaca ggaatattgc 6060agttacctcg tgatcgattc
aagaggacat cattctttct ttgggtaatt atccttttcc 6120aaagaacatt ttccatccca
cttggagtca tccacaatag cacattacag gttagtgatg 6180tcgacaaact agtttgtcgt
gacaaactgt catccacaaa tcaattgaga tcagttggac 6240tgaatctcga agggaatgga
gtggcaactg acgtgccatc tgcaactaaa agatggggct 6300tcaggtccgg tgtcccacca
aaggtggtca attatgaagc tggtgaatgg gctgaaaact 6360gctacaatct tgaaatcaaa
aaacctgacg ggagtgagtg tctaccagca gcgccagacg 6420ggattcgggg cttcccccgg
tgccggtatg tgcacaaagt atcaggaacg ggaccgtgtg 6480ccggagactt tgccttccat
aaagagggtg ctttcttcct gtatgatcga cttgcttcca 6540cagttatcta ccgaggaacg
actttcgctg aaggtgtcgt tgcatttctg atactgcccc 6600aagctaagaa ggacttcttc
agctcacacc ccttgagaga gccggtcaat gcaacggagg 6660acccgtctag tggctactat
tctaccacaa ttagatatca ggctaccggt tttggaacca 6720atgagacaga gtacttgttc
gaggttgaca atttgaccta cgtccaactt gaatcaagat 6780tcacaccaca gtttctgctc
cagctgaatg agacaatata tacaagtggg aaaaggagca 6840ataccacggg aaaactaatt
tggaaggtca accccgaaat tgatacaaca atcggggagt 6900gggccttctg ggaaactaaa
aaaacctcac tagaaaaatt cgcagtgaag agttgtcttt 6960cacagttgta tcaaacggag
ccaaaaacat cagtggtcag agtccggcgc gaacttcttc 7020cgacccaggg accaacacaa
caactgaaga ccacaaaatc atggcttcag aaaattcctc 7080tgcaatggtt caagtgcaca
gtcaaggaag ggaagctgca gtgtcgcatc taacaaccct 7140tgccacaatc tccacgagtc
cccaatccct cacaaccaaa ccaggtccgg acaacagcac 7200ccataataca cccgtgtata
aacttgacat ctctgaggca actcaagttg aacaacatca 7260ccgcagaaca gacaacgaca
gcacagcctc cgacactccc tctgccacga ccgcagccgg 7320acccccaaaa gcagagaaca
ccaacacgag caagagcact gacttcctgg accccgccac 7380cacaacaagt ccccaaaacc
acagcgagac cgctggcaac aacaacactc atcaccaaga 7440taccggagaa gagagtgcca
gcagcgggaa gctaggctta attaccaata ctattgctgg 7500agtcgcagga ctgatcacag
gcgggagaag aactcgaaga gaagcaattg tcaatgctca 7560acccaaatgc aaccctaatt
tacattactg gactactcag gatgaaggtg ctgcaatcgg 7620actggcctgg ataccatatt
tcgggccagc agccgaggga atttacatag aggggctaat 7680gcacaatcaa gatggtttaa
tctgtgggtt gagacagctg gccaacgaga cgactcaagc 7740tcttcaactg ttcctgagag
ccacaactga gctacgcacc ttttcaatcc tcaaccgtaa 7800ggcaattgat ttcttgctgc
agcgatgggg cggcacatgc cacattctgg gaccggactg 7860ctgtatcgaa ccacatgatt
ggaccaagaa cataacagac aaaattgatc agattattca 7920tgattttgtt gataaaaccc
ttccggacca gggggacaat gacaattggt ggacaggatg 7980gagacaatgg ataccggcag
gtattggagt tacaggcgtt ataattgcag ttatcgcttt 8040attctgtata tgcaaatttg
tcttttagtt tttcttcaga ttgcttcatg gaaaagctca 8100gcctcaaatc aatgaaacca
ggatttaatt atatggatta cttgaatcta agattacttg 8160acaaatgata atataataca
ctggagcttt aaacatagcc aatgtgattc taactccttt 8220aaactcacag ttaatcataa
acaaggtttg acatcaatct agttatctct ttgagaatga 8280taaacttgat gaagattaag
aaaaaggtaa tctttcgatt atctttaatc ttcatccttg 8340attctacaat catgacagtt
gtctttagtg acaagggaaa gaagcctttt tattaagttg 8400taataatcag atctgcgaac
cggtagagtt tagttgcaac ctaacacaca taaagcattg 8460gtcaaaaagt caatagaaat
ttaaacagtg agtggagaca acttttaaat ggaagcttca 8520tatgagagag gacgcccacg
agctgccaga cagcattcaa gggatggaca cgaccaccat 8580gttcgagcac gatcatcatc
cagagagaat tatcgaggtg agtaccgtca atcaaggagc 8640gcctcacaag tgcgcgttcc
tactgtattt cataagaaga gagttgaacc attaacagtt 8700cctccagcac ctaaagacat
atgtccgacc ttgaaaaaag gatttttgtg tgacagtagt 8760ttttgcaaaa aagatcacca
gttggagagt ttaactgata gggaattact cctactaatc 8820gcccgtaaga cttgtggatc
agtagaacaa caattaaata taactgcacc caaggactcg 8880cgcttagcaa atccaacggc
tgatgatttc cagcaagagg aaggtccaaa aattaccttg 8940ttgacactga tcaagacggc
agaacactgg gcgagacaag acatcagaac catagaggat 9000tcaaaattaa gagcattgtt
gactctatgt gctgtgatga cgaggaaatt ctcaaaatcc 9060cagctgagtc ttttatgtga
gacacaccta aggcgcgagg ggcttgggca agatcaggca 9120gaacccgttc tcgaagtata
tcaacgatta cacagtgata aaggaggcag ttttgaagct 9180gcactatggc aacaatggga
ccgacaatcc ctaattatgt ttatcactgc attcttgaat 9240attgctctcc agttaccgtg
tgaaagttct gctgtcgttg tttcagggtt aagaacattg 9300gttcctcaat cagataatga
ggaagcttca accaacccgg ggacatgctc atggtctgat 9360gagggtaccc cttaataagg
ctgactaaaa cactatataa ccttctactt gatcacaata 9420ctccgtatac ctatcatcat
atatttaatc aagacgatat cctttaaaac ttattcagta 9480ctataatcac tctcgtttca
aattaataag atgtgcatga ttgccctaat atatgaagag 9540gtatgataca accctaacag
tgatcaaaga aaatcataat ctcgtatcgc tcgaaatata 9600acctgccaag catacctctt
gcacaaagtg attcttgtac acaaataatg ttttactcta 9660caggaggtag caacgatcca
tcccatcaaa aaataagtat ttcatgactt actaatgatc 9720tcttaaaata ttaagaaaaa
ctgacggaac ataaattctt tatgcttcaa gctgtggagg 9780aggtgtttgg tattggctat
tgttatatta caatcaataa caagcttgta aaaatattgt 9840tcttgtttca agaggtagat
tgtgaccgga aatgctaaac taatgatgaa gattaatgcg 9900gaggtctgat aagaataaac
cttattattc agattaggcc ccaagaggca ttcttcatct 9960ccttttagca aagtactatt
tcagggtagt ccaattagtg gcacgtcttt tagctgtata 10020tcagtcgccc ctgagatacg
ccacaaaagt gtctctaagc taaattggtc tgtacacatc 10080ccatacattg tattaggggc
aataatatct aattgaactt agccgtttaa aatttagtgc 10140ataaatctgg gctaacacca
ccaggtcaac tccattggct gaaaagaagc ttacctacaa 10200cgaacatcac tttgagcgcc
ctcacaatta aaaaatagga acgtcgttcc aacaatcgag 10260cgcaaggttt caaggttgaa
ctgagagtgt ctagacaaca aaatattgat actccagaca 10320ccaagcaaga cctgagaaaa
aaccatggct aaagctacgg gacgatacaa tctaatatcg 10380cccaaaaagg acctggagaa
aggggttgtc ttaagcgacc tctgtaactt cttagttagc 10440caaactattc aggggtggaa
ggtttattgg gctggtattg agtttgatgt gactcacaaa 10500ggaatggccc tattgcatag
actgaaaact aatgactttg cccctgcatg gtcaataaca 10560aggaatctct ttcctcattt
atttcaaaat ccgaattcca caattgaatc accgctgtgg 10620gcattgagag tcatccttgc
agcagggata caggaccagc tgattgacca gtctttgatt 10680gaacccttag caggagccct
tggtctgatc tctgattggc tgctaacaac caacactaac 10740catttcaaca tgcgaacaca
acgtgtcaag gaacaattga gcccaaaaat gctgtcgttg 10800attcgatcca atattctcaa
gtttattaac aaattggatg ctctacatgt cgtgaactac 10860aacggattgt tgagcagtat
tgaaattgga actcaaaatc atataatcat cataactcga 10920actaacatgg gttttctggt
ggagctccaa gaacccgaca aatcggcaat gaaccgcatg 10980aagcctgggc cggcgaaatt
ttccctcctt catgagtcca cactgaaagc atttacacaa 11040ggatcctcga cacgaatgca
aagtttgatt cttgaattta atagctctct tgctatctaa 11100ctaaggtaga atacttcata
ttgagctaac tcatatatgc tgactcaata gttatcttga 11160catctctgct ttcataatca
gatatataag cataataaat aaatactcat atttcttgat 11220aatttgttta accacagata
aatcctcact gtaagccagc ttccaagttg acacccttac 11280aaaaaccagg actcagaatc
cctcaaacaa gagattccaa gacaacatca tagaattgct 11340ttattatatg aataagcatt
ttatcaccag aaatcctata tactaaatgg ttaattgtaa 11400ctgaacccgc aggtcacatg
tgttaggttt cacagattct atatattact aactctatac 11460tcgtaattaa cattagataa
gtagattaag aaaaaagcct gaggaagatt aagaaaaact 11520gcttattggg tctttccgtg
ttttagatga agcagttgaa attcttcctc ttgatattaa 11580atggctacac aacataccca
atacccagac gctaggttat catcaccaat tgtattggac 11640caatgtgacc tagtcactag
agcttgcggg ttatattcat catactccct taatccgcaa 11700ctacgcaact gtaaactccc
gaaacatatc taccgtttga aatacgatgt aactgttacc 11760aagttcttga gtgatgtacc
agtggcgaca ttgcccatag atttcatagt cccagttctt 11820ctcaaggcac tgtcaggcaa
tggattctgt cctgttgagc cgcggtgcca acagttctta 11880gatgaaatca ttaagtacac
aatgcaagat gctctcttct tgaaatatta tctcaaaaat 11940gtgggtgctc aagaagactg
tgttgatgaa cactttcaag agaaaatctt atcttcaatt 12000cagggcaatg aatttttaca
tcaaatgttt ttctggtatg atctggctat tttaactcga 12060aggggtagat taaatcgagg
aaactctaga tcaacatggt ttgttcatga tgatttaata 12120gacatcttag gctatgggga
ctatgttttt tggaagatcc caatttcaat gttaccactg 12180aacacacaag gaatccccca
tgctgctatg gactggtatc aggcatcagt attcaaagaa 12240gcggttcaag ggcatacaca
cattgtttct gtttctactg ccgacgtctt gataatgtgc 12300aaagatttaa ttacatgtcg
attcaacaca actctaatct caaaaatagc agagattgag 12360gatccagttt gttctgatta
tcccaatttt aagattgtgt ctatgcttta ccagagcgga 12420gattacttac tctccatatt
agggtctgat gggtataaaa ttattaagtt cctcgaacca 12480ttgtgcttgg ccaaaattca
attatgctca aagtacactg agaggaaggg ccgattctta 12540acacaaatgc atttagctgt
aaatcacacc ctagaagaaa ttacagaaat gcgtgcacta 12600aagccttcac aggctcaaaa
gatccgtgaa ttccatagaa cattgataag gctggagatg 12660acgccacaac aactttgtga
gctattttcc attcaaaaac actgggggca tcctgtgcta 12720catagtgaaa cagcaatcca
aaaagttaaa aaacatgcta cggtgctaaa agcattacgc 12780cctatagtga ttttcgagac
atactgtgtt tttaaatata gtattgccaa acattatttt 12840gatagtcaag gatcttggta
cagtgttact tcagatagga atctaacacc gggtcttaat 12900tcttatatca aaagaaatca
attccctccg ttgccaatga ttaaagaact actatgggaa 12960ttttaccacc ttgaccaccc
tccacttttc tcaaccaaaa ttattagtga cttaagtatt 13020tttataaaag acagagctac
cgcagtagaa aggacatgct gggatgcagt attcgagcct 13080aatgttctag gatataatcc
acctcacaaa tttagtacta aacgtgtacc ggaacaattt 13140ttagagcaag aaaacttttc
tattgagaat gttctttcct acgcacaaaa actcgagtat 13200ctactaccac aatatcggaa
cttttctttc tcattgaaag agaaagagtt gaatgtaggt 13260agaaccttcg gaaaattgcc
ttatccgact cgcaatgttc aaacactttg tgaagctctg 13320ttagctgatg gtcttgctaa
agcatttcct agcaatatga tggtagttac ggaacgtgag 13380caaaaagaaa gcttattgca
tcaagcatca tggcaccaca caagtgatga ttttggtgaa 13440catgccacag ttagagggag
tagctttgta actgatttag agaaatacaa tcttgcattt 13500agatatgagt ttacagcacc
ttttatagaa tattgcaacc gttgctatgg tgttaagaat 13560gtttttaatt ggatgcatta
tacaatccca cagtgttata tgcatgtcag tgattattat 13620aatccaccac ataacctcac
actggagaat cgagacaacc cccccgaagg gcctagttca 13680tacaggggtc atatgggagg
gattgaagga ctgcaacaaa aactctggac aagtatttca 13740tgtgctcaaa tttctttagt
tgaaattaag actggtttta agttacgctc agctgtgatg 13800ggtgacaatc agtgcattac
tgttttatca gtcttcccct tagagactga cgcagacgag 13860caggaacaga gcgccgaaga
caatgcagcg agggtggccg ccagcctagc aaaagttaca 13920agtgcctgtg gaatcttttt
aaaacctgat gaaacatttg tacattcagg ttttatctat 13980tttggaaaaa aacaatattt
gaatggggtc caattgcctc agtcccttaa aacggctgca 14040agaatggcac cattgtctga
tgcaattttt gatgatcttc aagggaccct ggctagtata 14100ggcactgctt ttgagcgatc
catctctgag acacgacata tctttccttg caggataacc 14160gcagctttcc atacgttttt
ttcggtgaga atcttgcaat atcatcatct cgggttcaat 14220aaaggttttg accttggaca
gttaacactc ggcaaacctc tggatttcgg aacaatatca 14280ttggcactag cggtaccgca
ggtgcttgga gggttatcct tcttgaatcc tgagaaatgt 14340ttctaccgga atctaggaga
tccagttacc tcaggcttat tccagttaaa aacttatctc 14400cgaatgattg agatggatga
tttattctta cctttaattg cgaagaaccc tgggaactgc 14460actgccattg actttgtgct
aaatcctagc ggattaaatg tccctgggtc gcaagactta 14520acttcatttc tgcgccagat
tgtacgcagg accatcaccc taagtgcgaa aaacaaactt 14580attaatacct tatttcatgc
gtcagctgac ttcgaagacg aaatggtttg taaatggcta 14640ttatcatcaa ctcctgttat
gagtcgtttt gcggccgata tcttttcacg cacgccgagc 14700gggaagcgat tgcaaattct
aggatacctg gaaggaacac gcacattatt agcctctaag 14760atcatcaaca ataatacaga
gacaccggtt ttggacagac tgaggaaaat aacattgcaa 14820aggtggagcc tatggtttag
ttatcttgat cattgtgata atatcctggc ggaggcttta 14880acccaaataa cttgcacagt
tgatttagca cagattctga gggaatattc atgggctcat 14940attttagagg gaagacctct
tattggagcc acactcccat gtatgattga gcaattcaaa 15000gtgttttggc tgaaacccta
cgaacaatgt ccgcagtgtt caaatgcaaa gcaaccaggt 15060gggaaaccat tcgtgtcagt
ggcagtcaag aaacatattg ttagtgcatg gccgaacgca 15120tcccgaataa gctggactat
cggggatgga atcccataca ttggatcaag gacagaagat 15180aagataggac aacctgctat
taaaccaaaa tgtccttccg cagccttaag agaggccatt 15240gaattggcgt cccgtttaac
atgggtaact caaggcagtt cgaacagtga cttgctaata 15300aaaccatttt tggaagcacg
agtaaattta agtgttcaag aaatacttca aatgacccct 15360tcacattact caggaaatat
tgttcacagg tacaacgatc aatacagtcc tcattctttc 15420atggccaatc gtatgagtaa
ttcagcaacg cgattgattg tttctacaaa cactttaggt 15480gagttttcag gaggtggcca
gtctgcacgc gacagcaata ttattttcca gaatgttata 15540aattatgcag ttgcactgtt
cgatattaaa tttagaaaca ctgaggctac agatatccaa 15600tataatcgtg ctcaccttca
tctaactaag tgttgcaccc gggaagtacc agctcagtat 15660ttaacataca catctacatt
ggatttagat ttaacaagat accgagaaaa cgaattgatt 15720tatgacagta atcctctaaa
aggaggactc aattgcaata tctcattcga taatccattt 15780ttccaaggta aacggctgaa
cattatagaa gatgatctta ttcgactgcc tcacttatct 15840ggatgggagc tagccaagac
catcatgcaa tcaattattt cagatagcaa caattcatct 15900acagacccaa ttagcagtgg
agaaacaaga tcattcacta cccatttctt aacttatccc 15960aagataggac ttctgtacag
ttttggggcc tttgtaagtt attatcttgg caatacaatt 16020cttcggacta agaaattaac
acttgacaat tttttatatt acttaactac tcaaattcat 16080aatctaccac atcgctcatt
gcgaatactt aagccaacat tcaaacatgc aagcgttatg 16140tcacggttaa tgagtattga
tcctcatttt tctatttaca taggcggtgc tgcaggtgac 16200agaggactct cagatgcggc
caggttattt ttgagaacgt ccatttcatc ttttcttaca 16260tttgtaaaag aatggataat
taatcgcgga acaattgtcc ctttatggat agtatatccg 16320ctagagggtc aaaacccaac
acctgtgaat aattttctct atcagatcgt agaactgctg 16380gtgcatgatt catcaagaca
acaggctttt aaaactacca taagtgatca tgtacatcct 16440cacgacaatc ttgtttacac
atgtaagagt acagccagca atttcttcca tgcatcattg 16500gcgtactgga ggagcagaca
cagaaacagc aaccgaaaat acttggcaag agactcttca 16560actggatcaa gcacaaacaa
cagtgatggt catattgaga gaagtcaaga acaaaccacc 16620agagatccac atgatggcac
tgaacggaat ctagtcctac aaatgagcca tgaaataaaa 16680agaacgacaa ttccacaaga
aaacacgcac cagggtccgt cgttccagtc ctttctaagt 16740gactctgctt gtggtacagc
aaatccaaaa ctaaatttcg atcgatcgag acacaatgtg 16800aaatttcagg atcataactc
ggcatccaag agggaaggtc atcaaataat ctcacaccgt 16860ctagtcctac ctttctttac
attatctcaa gggacacgcc aattaacgtc atccaatgag 16920tcacaaaccc aagacgagat
atcaaagtac ttacggcaat tgagatccgt cattgatacc 16980acagtttatt gtagatttac
cggtatagtc tcgtccatgc attacaaact tgatgaggtc 17040ctttgggaaa tagagagttt
caagtcggct gtgacgctag cagagggaga aggtgctggt 17100gccttactat tgattcagaa
ataccaagtt aagaccttat ttttcaacac gctagctact 17160gagtccagta tagagtcaga
aatagtatca ggaatgacta ctcctaggat gcttctacct 17220gttatgtcaa aattccataa
tgaccaaatt gagattattc ttaacaactc agcaagccaa 17280ataacagaca taacaaatcc
tacttggttt aaagaccaaa gagcaaggct acctaagcaa 17340gtcgaggtta taaccatgga
tgcagagaca acagagaata taaacagatc gaaattgtac 17400gaagctgtat ataaattgat
cttacaccat attgatccta gcgtattgaa agcagtggtc 17460cttaaagtct ttctaagtga
tactgagggt atgttatggc taaatgataa tttagccccg 17520ttttttgcca ctggttattt
aattaagcca ataacgtcaa gtgctagatc tagtgagtgg 17580tatctttgtc tgacgaactt
cttatcaact acacgtaaga tgccacacca aaaccatctc 17640agttgtaaac aggtaatact
tacggcattg caactgcaaa ttcaacgaag cccatactgg 17700ctaagtcatt taactcagta
tgctgactgt gagttacatt taagttatat ccgccttggt 17760tttccatcat tagagaaagt
actataccac aggtataacc tcgtcgattc aaaaagaggt 17820ccactagtct ctatcactca
gcacttagca catcttagag cagagattcg agaattaact 17880aatgattata atcaacagcg
acaaagtcgg actcaaacat atcactttat tcgtactgca 17940aaaggacgaa tcacaaaact
agtcaatgat tatttaaaat tctttcttat tgtgcaagca 18000ttaaaacata atgggacatg
gcaagctgag tttaagaaat taccagagtt gattagtgtg 18060tgcaataggt tctaccatat
tagagattgc aattgtgaag aacgtttctt agttcaaacc 18120ttatatttac atagaatgca
ggattctgaa gttaagctta tcgaaaggct gacagggctt 18180ctgagtttat ttccggatgg
tctctacagg tttgattgaa ttaccgtgca tagtatcctg 18240atacttgcaa aggttggtta
ttaacataca gattataaaa aactcataaa ttgctctcat 18300acatcatatt gatctaatct
caataaacaa ctatttaaat aacgaaagga gtccctatat 18360tatatactat atttagcctc
tctccctgcg tgataatcaa aaaattcaca atgcagcatg 18420tgtgacatat tactgccgca
atgaatttaa cgcaacataa taaactctgc actctttata 18480attaagcttt aacgaaaggt
ctgggctcat attgttattg atataataat gttgtatcaa 18540tatcctgtca gatggaatag
tgttttggtt gataacacaa cttcttaaaa caaaattgat 18600ctttaagatt aagtttttta
taattatcat tactttaatt tgtcgtttta aaaacggtga 18660tagccttaat ctttgtgtaa
aataagagat taggtgtaat aaccttaaca tttttgtcta 18720gtaagctact atttcataca
gaatgataaa attaaaagaa aaggcaggac tgtaaaatca 18780gaaatacctt ctttacaata
tagcagacta gataataatc ttcgtgttaa tgataattaa 18840gacattgacc acgctcatca
gaaggctcgc cagaataaac gttgcaaaaa ggattcctgg 18900aaaaatggtc gcacacaaaa
atttaaaaat aaatctattt cttctttttt gtgtgtcca 18959475PRTArtificial
SequenceA peptide 47Pro Pro Glu Tyr Met1 5485PRTArtificial
SequenceA peptide 48Pro Pro Glu Tyr Ala1 5495PRTArtificial
SequenceA peptide 49Ala Ala Glu Tyr Met1 5504PRTArtificial
SequenceA synthetic peptide 50Pro Pro Xaa Tyr1514PRTArtificial SequenceA
synthetic peptide 51Ala Ala Xaa Tyr1524PRTZaire Ebola virus 52Pro Pro Glu
Tyr1534PRTArtificial SequenceA synthetic peptide 53Ala Ala Glu
Tyr1544PRTArtificial SequenceA synthetic peptide 54Arg Xaa Xaa
Arg1555PRTZaire Ebola virus 55Arg Arg Thr Arg Arg1
5564PRTZaire Ebola virus 56Arg Val Arg Arg1574PRTZaire Ebola virus 57Arg
Thr Arg Arg1585PRTArtificial SequenceA synthetic peptide 58Ala Gly Thr
Ala Ala1 5
User Contributions:
Comment about this patent or add new information about this topic:
People who visited this patent also read: | |
Patent application number | Title |
---|---|
20140115207 | HIGH PERFORMANCE INTERCONNECT PHYSICAL LAYER |
20140115206 | METHODS AND SYSTEMS FOR RUNNING NETWORK PROTOCOLS OVER PERIPHERAL COMPONENT INTERCONNECT EXPRESS |
20140115205 | Secure Digital Card Capable of Transmitting Data Over Wireless Network |
20140115204 | SYSTEM AND METHOD OF PROCESSING SEISMIC DATA ON A CO-PROCESSOR DEVICE |
20140115203 | SYSTEM AND METHOD OF PROCESSING SEISMIC DATA ON A CO-PROCESSOR DEVICE |