Patent application title: Water-Absorbing and Quick-Drying Property-Imparting Agent, and Method for Imparting Water-Absorbing and Quick-Drying Properties
Inventors:
Hiroaki Suzumura (Tsuruoka-Shi, Yamagata, JP)
Yui Hirose (Tsuruoka-Shi, Yamagata, JP)
Assignees:
SPIBER INC.
IPC8 Class: AC07K14435FI
USPC Class:
1 1
Class name:
Publication date: 2022-07-28
Patent application number: 20220235100
Abstract:
Provided is a water-absorbing and quick-drying property-imparting agent
capable of easily imparting water-absorbing and quick-drying properties
to various materials or articles in a simple process, and a method
capable of easily imparting water-absorbing and quick-drying properties
to predetermined materials or articles.
A water-absorbing and quick-drying property-imparting agent containing
modified fibroin as an active ingredient, and a method for imparting
water-absorbing and quick-drying properties to an article, the method
including a step of incorporating modified fibroin into the article.Claims:
1-4. (canceled)
5. A method of imparting water-absorbing and quick-drying properties to an article, the method comprising: incorporating a water-absorbing and quick-drying property-imparting agent into the article, wherein the agent comprises modified fibroin as an active ingredient.
6. The method according to claim 5, wherein the modified fibroin contains modified fibroin having an average value of hydropathy indices (average hydropathy index (HI)) of 0 or less.
7. The method according to claim 5, wherein the modified fibroin contains modified spider silk fibroin.
8. The method according to claim 5, wherein the agent is in the form of a fiber.
9. The method according to claim 5, wherein the article is selected from the group consisting of fibers, woven fabrics, knitted fabrics, nonwoven fabrics, cotton, sponges, films, resins, and composite materials.
10. The method according to claim 5, wherein the incorporating the agent into the article is performed by mixing the agent with raw material, by forming the article by combining the agent prepared in the form of formed body with another material, or by forming the article by forming the agent.
11. The method according to claim 5, wherein the content of the modified fibroin in the article is 20 mass % or more.
Description:
TECHNICAL FIELD
[0001] The present invention relates to a water-absorbing and quick-drying property-imparting agent, and a method for imparting water-absorbing and quick-drying properties.
BACKGROUND ART
[0002] In general, clothing such as underwear, innerwear, and sportswear, and bedclothes and the like are required to have so-called water-absorbing and quick-drying properties such as absorbing sweat well and drying quickly. Therefore, many underwear, bedclothes, and the like are produced using cotton spun yarns having relatively high hygroscopic properties.
[0003] Although cotton has high water-absorbing properties, it is difficult to say that cotton has sufficient quick-drying properties because absorbed moisture evaporates slowly. In addition, in sportswear and the like, polyurethane fibers and polyester fibers are often used because followability to movement of a human body and stretchability are emphasized, but these synthetic fibers are inferior in water-absorbing properties.
[0004] Under such circumstances, various techniques for improving water-absorbing and quick-drying properties of various fibers, fabrics, and the like have been proposed. For example, Patent Literature 1 discloses a water-absorptive quick-drying spun yarn composed of a false-twisted spun yarn including a core made of polyester fibers having a heteromorphic cross section and a sheath made of short fibers including cotton fibers, the core being in a non-twisted state and the sheath being in a bound state.
CITATION LIST
Patent Literature
[0005] Patent Literature 1: JU 3213540
SUMMARY OF INVENTION
Technical Problem
[0006] For example, as in the water-absorptive quick-drying spun yarn described in Patent Literature 1, the water-absorbing and quick-drying properties of conventional fibers, fabrics, and the like are achieved by using fibers or the like artificially imparted with a special structure. Therefore, the conventional water-absorptive quick-drying fibers and water-absorptive quick-drying fabrics inevitably require complicated production processes.
[0007] An object of the present invention is to provide a water-absorbing and quick-drying property-imparting agent capable of easily imparting water-absorbing and quick-drying properties to various materials or articles in a simple process, and a method capable of easily imparting water-absorbing and quick-drying properties to predetermined materials or articles.
Solution to Problem
[0008] The present inventors have found that modified fibroin has excellent water-absorbing and quick-drying properties. The present invention is based on this novel finding.
[0009] The present invention relates to, for example, the following inventions.
[0010] [1]
[0011] A water-absorbing and quick-drying property-imparting agent containing modified fibroin as an active ingredient.
[0012] [2]
[0013] The water-absorbing and quick-drying property-imparting agent according to [1], wherein the modified fibroin contains modified fibroin having an average value of hydropathy indices (average hydropathy index (HI)) of 0 or less.
[0014] [3]
[0015] The water-absorbing and quick-drying property-imparting agent according to [1] or [2], wherein the modified fibroin contains modified spider silk fibroin.
[0016] [4]
[0017] The water-absorbing and quick-drying property-imparting agent according to any one of [1] to [3], wherein the water-absorbing and quick-drying property-imparting agent is in the form of a fiber.
[0018] [5]
[0019] A method of imparting water-absorbing and quick-drying properties to an article, the method including:
[0020] a step of incorporating modified fibroin into the article.
Advantageous Effects of Invention
[0021] According to the present invention, it is possible to provide a water-absorbing and quick-drying property-imparting agent capable of easily imparting water-absorbing and quick-drying properties to various materials or articles in a simple process, and a method capable of easily imparting water-absorbing and quick-drying properties to predetermined materials or articles.
[0022] According to the present invention, for example, by mixing the water-absorbing and quick-drying property-imparting agent according to the present invention with a material having biodegradability, water-absorbing and quick-drying properties can be imparted without impairing biodegradability. According to the present invention, by adjusting the amount of modified fibroin (active ingredient) contained in a predetermined article, the degree of water-absorbing and quick-drying properties of the article can also be adjusted.
BRIEF DESCRIPTION OF DRAWINGS
[0023] FIG. 1 is a schematic diagram illustrating an example of a domain sequence of modified fibroin.
[0024] FIG. 2 is a graph illustrating a distribution of values of z/w (%) in naturally derived fibroin.
[0025] FIG. 3 is a graph illustrating a distribution of values of x/y (%) in naturally derived fibroin.
[0026] FIG. 4 is a schematic diagram illustrating an example of a domain sequence of modified fibroin.
[0027] FIG. 5 is a schematic diagram illustrating an example of a domain sequence of modified fibroin.
DESCRIPTION OF EMBODIMENTS
[0028] Hereinafter, embodiments of the present invention will be described in detail. However, the present invention is not limited to the following embodiments.
[0029] [Water-Absorbing and Quick-Drying Property-Imparting Agent]
[0030] The water-absorbing and quick-drying property-imparting agent according to the present embodiment contains modified fibroin as an active ingredient. The water-absorbing and quick-drying properties refer to a property of absorbing moisture such as sweat and quickly drying. In the present specification, the moisture may be liquid water or gaseous water. That is, in the present specification, water absorption includes moisture absorption. The water-absorbing and quick-drying property-imparting agent according to the present embodiment utilizes the property of modified fibroin excellent in water-absorbing and quick-drying properties.
[0031] (Modified Fibroin)
[0032] The modified fibroin according to the present embodiment is a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m or Formula 2: [(A).sub.n motif-REP].sub.m-(A).sub.n motif. An amino acid sequence (N-terminal sequence and C-terminal sequence) may be further added to either or both of the N-terminal side and the C-terminal side of the domain sequence of the modified fibroin. The N-terminal sequence and the C-terminal sequence, although not limited thereto, are typically regions that do not have repetitions of amino acid motifs characteristic of fibroin and consist of amino acids of about 100 residues.
[0033] The term "modified fibroin" in the present specification refers to artificially produced fibroin (artificial fibroin). The modified fibroin may be fibroin in which a domain sequence is different from an amino acid sequence of naturally derived fibroin or may be fibroin in which a domain sequence is the same as an amino acid sequence of naturally derived fibroin. The term "naturally derived fibroin" as used herein is also a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m or Formula 2: [(A).sub.n motif-REP].sub.m-(A).sub.n motif.
[0034] The "modified fibroin" may be fibroin obtained by using an amino acid sequence of naturally derived fibroin as it is, fibroin in which an amino acid sequence is modified based on an amino acid sequence of naturally derived fibroin (for example, fibroin in which an amino acid sequence is modified by modifying a cloned gene sequence of naturally derived fibroin), or fibroin artificially designed and synthesized independently of naturally derived fibroin (for example, fibroin having a desired amino acid sequence by chemically synthesizing a nucleic acid encoding a designed amino acid sequence).
[0035] The term "domain sequence" in the present specification is an amino acid sequence that produces a crystal region (typically, corresponding to the (A).sub.n motif of the amino acid sequence) and an amorphous region (typically, corresponding to REP of the amino acid sequence) specific to fibroin, and means an amino acid sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m or Formula 2: [(A).sub.n motif-REP].sub.m-(A).sub.n motif. Here, the (A).sub.n motif represents an amino acid sequence mainly composed of alanine residues, and the number of amino acid residues therein is 2 to 27. The number of the amino acid residues in the (A).sub.n motif may be an integer of 2 to 20, 4 to 27, 4 to 20, 8 to 20, 10 to 20, 4 to 16, 8 to 16, or 10 to 16. In addition, the proportion of the number of alanine residues in the total number of amino acid residues in the (A).sub.n motif may be 40% or more, or may also be 60% or more, 70% or more, 80% or more, 83% or more, 85% or more, 86% or more, 90% or more, 95% or more, or 100% (meaning that the (A).sub.n motif only consists of alanine residues). At least seven of a plurality of (A).sub.n motifs in the domain sequence may consist of only alanine residues. The REP represents an amino acid sequence consisting of 2 to 200 amino acid residues. The REP may be an amino acid sequence consisting of 10 to 200 amino acid residues. m represents an integer of 2 to 300, and may be an integer of 10 to 300. A plurality of (A).sub.n motifs may be the same amino acid sequences or different amino acid sequences. A plurality of REPs may be the same amino acid sequences or different amino acid sequences.
[0036] The modified fibroin according to the present embodiment can be obtained by, for example, performing modification of an amino acid sequence corresponding to substitution, deletion, insertion, and/or addition of one or a plurality of amino acid residues with respect to a cloned gene sequence of naturally derived fibroin. Substitution, deletion, insertion, and/or addition of the amino acid residues can be performed by methods well known to those skilled in the art, such as site-directed mutagenesis. Specifically, the modification may be performed in accordance with a method described in literatures such as Nucleic Acid Res. 10, 6487 (1982), and Methods in Enzymology, 100, 448 (1983).
[0037] The naturally derived fibroin is a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m or Formula 2: [(A).sub.n motif-REP].sub.m-(A).sub.n motif, and a specific example thereof can include fibroin produced by insects or spiders.
[0038] Examples of the fibroin produced by insects can include silk proteins produced by silkworms such as Bombyx mori, Bombyx mandarina, Antheraea yamamai, Anteraea pernyi, Eriogyna pyretorum, Pilosamia Cynthia ricini, Samia cynthia, Caligura japonica, Antheraea mylitta, and Antheraea assama and a hornet silk protein secreted by larvae of Vespasimillima xanthoptera.
[0039] More specific examples of the fibroin produced by insects can include the silkworm fibroin L chain (GenBank Accession Nos. M76430 (nucleotide sequence) and AAA27840.1 (amino acid sequence)).
[0040] Examples of the fibroin produced by spiders can include spider silk proteins produced by spiders belonging to the order Araneae. More specific examples thereof can include spider silk proteins produced by spiders belonging to the genus Araneus, such as Araneus ventricosus, Araneus diadematus, Araneus pinguis, Araneus pentagrammicus, and Araneus nojimai, spiders belonging to the genus Neoscona, such as Neoscona scylla, Neoscona nautica, Neoscona adianta, and Neoscona scylloides, spiders belonging to the genus Pronus, such as Pronous minutus, spiders belonging to the genus Cyrtarachne, such as Cyrtarachne bufo and Cyrtarachne inaequalis, spiders belonging to the genus Gasteracantha, such as Gasteracantha kuhlii and Gasteracantha mammosa, spiders belonging to the genus Ordgarius, such as Ordgarius hobsoni and Ordgarius sexspinosus, spiders belonging to the genus Argiope, such as Argiope amoena, Argiope minuta, and Argiope bruennichi, spiders belonging to the genus Arachnura, such as Arachnura logio, spiders belonging to the genus Acusilas, such as Acusilas coccineus, spiders belonging to the genus Cytophora, such as Cyrtophora moluccensis, Cyrtophora exanthematica, and Cyrtophora unicolor, spiders belonging to the genus Poltys, such as Poltys illepidus, spiders belonging to the genus Cyclosa, such as Cyclosa octotuberculata, Cyclosa sedeculata, Cyclosa vallata, and Cyclosa atrata, and spiders belonging to the genus Chorizopes, such as Chorizopes nipponicus, and spider silk proteins produced by spiders belonging to the family Tetragnathidae, such as spiders belonging to the genus Tetragnatha, such as Tetragnatha praedonia, Tetragnatha maxillosa, Tetragnatha extensa, and Tetragnatha squamata, spiders belonging to the genus Leucauge, such as Leucauge magnifica, Leucauge blanda, and Leucauge subblanda, spiders belonging to the genus Nephila, such as Nephila clavata and Nephila pilipes, spiders belonging to the genus Menosira, such as Menosira ornata, spiders belonging to the genus Dyschiriognatha, such as Dyschiriognatha tenera, spiders belonging to the genus Latrodectus, such as Latrodectus mactans, Latrodectus hasseltii, Latrodectus geometricus, and Latrodectus tredecimguttatus, and spiders belonging to the genus Euprosthenops. Examples of the spider silk protein can include dragline silk proteins such as MaSps (MaSp1 and MaSp2) and ADFs (ADF3 and ADF4), MiSps (MiSp1 and MiSp2), AcSp, PySp, and Flag.
[0041] More specific examples of the spider silk protein produced by spiders include fibroin-3 (adf-3) [derived from Araneus diadematus] (GenBank Accession No. AAC47010 (amino acid sequence), U47855 (nucleotide sequence)), fibroin-4 (adf-4) [derived from Araneus diadematus] (GenBank Accession No. AAC47011 (amino acid sequence), U47856 (nucleotide sequence)), dragline silk protein spidroin 1 [derived from Nephila clavipes] (GenBank Accession No. AAC04504 (amino acid sequence), U37520 (nucleotide sequence)), major ampullate spidroin 1 [derived from Latrodectus hesperus] (GenBank Accession No. ABR68856 (amino acid sequence), EF595246 (nucleotide sequence)), dragline silk protein spidroin 2 [derived from Nephila clavata] (GenBank Accession No. AAL32472 (amino acid sequence), AF441245 (nucleotide sequence)), major ampullate spidroin 1 [derived from Euprosthenops australis] (GenBank Accession No. CAJ00428 (amino acid sequence), AJ973155 (nucleotide sequence)), and major ampullate spidroin 2 [Euprosthenops australis] (GenBank Accession No. CAM32249.1 (amino acid sequence), AM490169 (nucleotide sequence)), minor ampullate silk protein 1 [Nephila clavipes] (GenBank Accession No. AAC14589.1 (amino acid sequence)), minor ampullate silk protein 2 [Nephila clavipes] (GenBank Accession No. AAC14591.1 (amino acid sequence)), and minor ampullate spidroin-like protein [Nephilengys cruentata] (GenBank Accession No. ABR37278.1 (amino acid sequence).
[0042] More specific examples of the naturally derived fibroin can include fibroin whose sequence information is registered in NCBI GenBank. For example, sequences thereof may be confirmed by extracting sequences in which spidroin, ampullate, fibroin, "silk and polypeptide", or "silk and protein" is described as a keyword in DEFINITION among sequences containing INV as DIVISION among sequence information registered in NCBI GenBank, sequences in which a specific character string of products is described from CDS, or sequences in which a specific character string is described from SOURCE to TISSUE TYPE.
[0043] The modified fibroin according to the present embodiment may be modified silk fibroin (in which an amino acid sequence of a silk protein produced by silkworm is modified), or may be modified spider silk fibroin (in which an amino acid sequence of a spider silk protein produced by spiders is modified).
[0044] Specific examples of the modified fibroin can include modified fibroin derived from a major dragline silk protein produced in a major ampullate gland of a spider (first modified fibroin), modified fibroin having a domain sequence in which the content of glycine residues is reduced (second modified fibroin), modified fibroin having a domain sequence in which the content of an (A).sub.n motif is reduced (third modified fibroin), modified fibroin in which the content of glycine residues and the content of an (A).sub.n motif are reduced (fourth modified fibroin), modified fibroin having a domain sequence including a region locally having a high hydropathy index (fifth modified fibroin), and modified fibroin having a domain sequence in which the content of glutamine residues is reduced (sixth modified fibroin).
[0045] An example of the first modified fibroin can include a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m. In the first modified fibroin, the number of amino acid residues in the (A).sub.n motif is preferably an integer of 3 to 20, more preferably an integer of 4 to 20, still more preferably an integer of 8 to 20, even still more preferably an integer of 10 to 20, still further preferably an integer of 4 to 16, particularly preferably an integer of 8 to 16, and most preferably an integer of 10 to 16. In the first modified fibroin, the number of amino acid residues constituting REP in Formula 1 is preferably 10 to 200 residues, more preferably 10 to 150 residues, and still more preferably 20 to 100 residues, and still even more preferably 20 to 75 residues. In the first modified fibroin, the total number of glycine residues, serine residues, and alanine residues included in the amino acid sequence represented by Formula 1: [(A).sub.n motif-REP], is preferably 40% or more, more preferably 60% or more, and still more preferably 70% or more, relative to the total number of amino acid residues.
[0046] The first modified fibroin may be a polypeptide including an amino acid sequence unit represented by Formula 1: [(A).sub.n motif-REP].sub.m, and including a C-terminal sequence which is an amino acid sequence set forth in any one of SEQ ID NO: 1 to 3 or a C-terminal sequence which is an amino acid sequence having 90% or more homology with the amino acid sequence set forth in any one of SEQ ID NO: 1 to 3.
[0047] The amino acid sequence set forth in SEQ ID NO: 1 is identical to an amino acid sequence consisting of 50 amino acid residues of the C-terminal of an amino acid sequence of ADF3 (GI: 1263287, NCBI). The amino acid sequence set forth in SEQ ID NO: 2 is identical to an amino acid sequence set forth in SEQ ID NO: 1 in which 20 amino acid residues have been removed from the C-terminal. The amino acid sequence set forth in SEQ ID NO: 3 is identical to an amino acid sequence set forth in SEQ ID NO: 1 in which 29 amino acid residues have been removed from the C-terminal.
[0048] A specific example of the first modified fibroin can include modified fibroin including (1-i) the amino acid sequence set forth in SEQ ID NO: 4 (recombinant spider silk protein ADF3KaiLargeNRSH1), or (1-ii) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 4. The sequence identity is preferably 95% or more.
[0049] The amino acid sequence set forth in SEQ ID NO: 4 is obtained by the following mutation: in an amino acid sequence of ADF3 in which an amino acid sequence (SEQ ID NO: 5) consisting of a start codon, a His 10-tag, and an HRV3C protease (Human rhinovirus 3C protease) recognition site is added to the N-terminal, the 1st to 13th repetitive regions are about doubled and the translation ends at the 1154th amino acid residue. The C-terminal amino acid sequence of the amino acid sequence set forth in SEQ ID NO: 4 is identical to the amino acid sequence set forth in SEQ ID NO: 3.
[0050] The modified fibroin of (1-i) may consist of the amino acid sequence set forth in SEQ ID NO: 4.
[0051] The domain sequence of the second modified fibroin has an amino acid sequence in which the content of glycine residues is reduced, as compared with naturally derived fibroin. It can be said that the second modified fibroin has an amino acid sequence corresponding to an amino acid sequence in which at least one or a plurality of glycine residues in REP are substituted with another amino acid residue, as compared with naturally derived fibroin.
[0052] The domain sequence of the second modified fibroin may have an amino acid sequence corresponding to an amino acid sequence in which one glycine residue in at least one or the plurality of motif sequences is substituted with another amino acid residue, in at least one motif sequence selected from GGX and GPGXX (where G represents a glycine residue, P represents a proline residue, and X represents an amino acid residue other than glycine) in REP, as compared with naturally derived fibroin.
[0053] In the second modified fibroin, the proportion of the motif sequence in which the glycine residue has been substituted with another amino acid residue may be 10% or more relative to the entire motif sequence.
[0054] The second modified fibroin includes a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m, and may have an amino acid sequence in which z/w is 30% or more, 40% or more, 50% or more, or 50.9% or more in a case where the total number of amino acid residues in the amino acid sequence consisting of XGX (where X represents an amino acid residue other than glycine) included in all REPs in a sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence is defined as z, and the total number of amino acid residues in the sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence is defined as w. The number of alanine residues with respect to the total number of amino acid residues in the (A).sub.n motif is 83% or more, preferably 86% or more, more preferably 90% or more, still more preferably 95% or more, and even still more preferably 100% (meaning that the (A).sub.n motif consists of only alanine residues).
[0055] The second modified fibroin is preferably one in which the content ratio of the amino acid sequence consisting of XGX is increased by substituting one glycine residue of the GGX motif with another amino acid residue. In the second modified fibroin, the content ratio of the amino acid sequence consisting of GGX in the domain sequence is preferably 30% or less, more preferably 20% or less, still more preferably 10% or less, even still more preferably 6% or less, still further preferably 4% or less, and particularly preferably 2% or less. The content ratio of the amino acid sequence consisting of GGX in the domain sequence can be calculated by the same method as the calculation method of the content ratio (z/w) of the amino acid sequence consisting of XGX described below.
[0056] The method of calculating z/w will be described in more detail. First, the amino acid sequence consisting of XGX is extracted from all REPs included in a sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence in the fibroin including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m (modified fibroin or naturally derived fibroin). The total number of amino acid residues constituting XGX is z. For example, in a case where 50 amino acid sequences consisting of XGX are extracted (there is no overlap), z is 50.times.3=150. Also, for example, in a case where X (central X) included in two XGXs exists as in a case of the amino acid sequence consisting of XGXGX, z is calculated by subtracting the overlapping portion (in a case of XGXGX, it is 5 amino acid residues). w is the total number of amino acid residues included in a sequence excluding the sequence from the (A)n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence. For example, in a case of the domain sequence shown in FIG. 1, w is 4+50+4+100+4+10+4+20+4+30=230 (excluding the (A).sub.n motif located at the most C-terminal side). Next, z/w (%) can be calculated by dividing z by w.
[0057] Here, z/w in naturally derived fibroin will be described. First, as described above, 663 types of fibroins (415 types of fibroins derived from spiders among them) were extracted by confirming fibroins with amino acid sequence information registered in NCBI GenBank by an exemplified method. The values of z/w were calculated by the calculation method described above, from amino acid sequences of naturally derived fibroins which include a domain sequence represented by Formula 1: [(A).sub.n motif-REP]m and in which the content ratio of the amino acid sequence consisting of GGX in the fibroin is 6% or less, among all the extracted fibroins. The results are shown in FIG. 2. In FIG. 2, the horizontal axis represents z/w (%), and the vertical axis represents a frequency. As is clear from FIG. 2, the values of z/w in naturally derived fibroin are all smaller than 50.9% (the largest value is 50.86%).
[0058] In the second modified fibroin, z/w is preferably 50.9% or more, more preferably 56.1% or more, still more preferably 58.7% or more, even still more preferably 70% or more, and still further preferably 80% or more. The upper limit of z/w is not particularly limited, but may be 95% or less, for example.
[0059] The second modified fibroin can be obtained by, for example, substituting and modifying at least a part of a nucleotide sequence encoding a glycine residue from a cloned gene sequence of naturally derived fibroin so as to encode another amino acid residue. In this case, one glycine residue in a GGX motif or a GPGXX motif may be selected as the glycine residue to be modified, and substitution may be performed so that z/w is 50.9% or more. In addition, the second modified fibroin can also be obtained by, for example, designing an amino acid sequence satisfying each of the above aspects from the amino acid sequence of naturally derived fibroin, and chemically synthesizing a nucleic acid encoding the designed amino acid sequence. In any case, in addition to the modification corresponding to substitution of a glycine residue in REP with another amino acid residue from the amino acid sequence of naturally derived fibroin, modification of the amino acid sequence corresponding to substitution, deletion, insertion, and/or addition of one or a plurality of amino acid residues may be performed.
[0060] The above-described another amino acid residue is not particularly limited as long as it is an amino acid residue other than a glycine residue, but it is preferably a hydrophobic amino acid residue such as a valine (V) residue, a leucine (L) residue, an isoleucine (I) residue, a methionine (M) residue, a proline (P) residue, a phenylalanine (F) residue, or a tryptophan (W) residue, or a hydrophilic amino acid residue such as a glutamine (Q) residue, an asparagine (N) residue, a serine (S) residue, a lysine (K) residue, or a glutamic acid (E) residue, more preferably a valine (V) residue, a leucine (L) residue, an isoleucine (I) residue, a phenylalanine (F) residue, or a glutamine (Q) residue, and still more preferably a glutamine (Q) residue.
[0061] A more specific example of the second modified fibroin can include modified fibroin including (2-i) the amino acid sequence set forth in SEQ ID NO: 6 (Met-PRT380), SEQ ID NO: 7 (Met-PRT410), SEQ ID NO: 8 (Met-PRT525), or SEQ ID NO: 9 (Met-PRT799), or (2-ii) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9.
[0062] The modified fibroin of (2-i) will be described. The amino acid sequence set forth in SEQ ID NO: 6 is obtained by substituting all GGXs with GQX in REP of the amino acid sequence set forth in SEQ ID NO: 10 (Met-PRT313) corresponding to naturally derived fibroin. The amino acid sequence set forth in SEQ ID NO: 7 is obtained by deleting every other two (A).sub.n motifs from the N-terminal side to the C-terminal side from the amino acid sequence set forth in SEQ ID NO: 6 and further inserting one [(A).sub.n motif-REP] before the C-terminal sequence. The amino acid sequence set forth in SEQ ID NO: 8 is obtained by inserting two alanine residues on the C-terminal side of each (A).sub.n motif of the amino acid sequence set forth in SEQ ID NO: 7 and further substituting a part of glutamine (Q) residues with a serine (S) residue to delete a part of amino acids on the C-terminal side so as to be almost the same as the molecular weight of SEQ ID NO: 7. The amino acid sequence set forth in SEQ ID NO: 9 is obtained by adding a predetermined hinge sequence and a His tag sequence to the C-terminal of a sequence obtained by repeating a region of 20 domain sequences (where several amino acid residues on the C-terminal side of the region are substituted) present in the amino acid sequence set forth in SEQ ID NO: 7 four times.
[0063] The value of z/w in the amino acid sequence set forth in SEQ ID NO: 10 (corresponding to naturally derived fibroin) is 46.8%. The values of z/w in the amino acid sequence set forth in SEQ ID NO: 6, the amino acid sequence set forth in SEQ ID NO: 7, the amino acid sequence set forth in SEQ ID NO: 8, and the amino acid sequence set forth in SEQ ID NO: 9 are 58.7%, 70.1%, 66.1%, and 70.0%, respectively. In addition, the values of x/y in the amino acid sequences set forth in SEQ ID NO: 10, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, and SEQ ID NO: 9 at a Giza ratio (described below) of 1:1.8 to 11.3 are 15.0%, 15.0%, 93.4%, 92.7%, and 89.8%, respectively.
[0064] The modified fibroin of (2-i) may consist of the amino acid sequence set forth in SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9.
[0065] The modified fibroin of (2-ii) includes an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9. The modified fibroin of (2-ii) is also a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m. The sequence identity is preferably 95% or more.
[0066] It is preferred that the modified fibroin of (2-ii) preferably has 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9, and z/w is 50.9% or more in a case where the total number of amino acid residues in the amino acid sequence consisting of XGX (where X represents an amino acid residue other than glycine) included in REP is defined as z, and the total number of amino acid residues of REP in the domain sequence is defined as w.
[0067] The second modified fibroin may have a tag sequence at either or both of the N-terminal and the C-terminal. This enables the modified fibroin to be isolated, immobilized, detected, and visualized.
[0068] The tag sequence may be, for example, an affinity tag utilizing specific affinity (binding property, affinity) with another molecule. A specific example of the affinity tag includes a histidine tag (His tag). The His tag is a short peptide in which about 4 to 10 histidine residues are arranged and has a property of specifically binding to a metal ion such as nickel. Thus, the His tag can be used for isolation of modified fibroin by chelating metal chromatography. A specific example of the tag sequence can include the amino acid sequence set forth in SEQ ID NO: 11 (amino acid sequence including a His tag sequence and a hinge sequence).
[0069] Also, a tag sequence such as glutathione-S-transferase (GST) that specifically binds to glutathione, and a maltose binding protein (MBP) that specifically binds to maltose can also be utilized.
[0070] Further, an "epitope tag" utilizing an antigen-antibody reaction can also be utilized. Adding a peptide (epitope) exhibiting antigenicity as a tag sequence allows an antibody against the epitope to be bound. Examples of the epitope tag include an HA (peptide sequence of hemagglutinin of influenza virus) tag, a myc tag, and a FLAG tag. The modified fibroin can easily be purified with high specificity by utilizing an epitope tag.
[0071] Moreover, it is possible to use a tag sequence which can be cleaved with a specific protease. The modified fibroin from which the tag sequence has been cleaved can be recovered by treating a protein adsorbed through the tag sequence with protease.
[0072] A more specific example of the modified fibroin including a tag sequence can include modified fibroin including (2-iii) the amino acid sequence set forth in SEQ ID NO: 12 (PRT380), SEQ ID NO: 13 (PRT410), SEQ ID NO: 14 (PRT525), or SEQ ID NO: 15 (PRT799), or (2-iv) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15.
[0073] Each of the amino acid sequences set forth in SEQ ID NO: 16 (PRT313), SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, and SEQ ID NO: 15 is obtained by adding the amino acid sequence set forth in SEQ ID NO: 11 (including a His tag sequence and a hinge sequence) to the N-terminal of each of the amino acid sequences set forth in SEQ ID NO: 10, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, and SEQ ID NO: 9.
[0074] The modified fibroin of (2-iii) may consist of the amino acid sequence set forth in SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15.
[0075] The modified fibroin of (2-iv) includes an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15. The modified fibroin of (2-iv) is also a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m. The sequence identity is preferably 95% or more.
[0076] It is preferred that the modified fibroin of (2-iv) has 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15, and z/w is 50.9% or more in a case where the total number of amino acid residues in the amino acid sequence consisting of XGX (where X represents the amino acid residue other than glycine) in REP is defined as z, and the total number of amino acid residues in REP in the domain sequence is defined as w.
[0077] The second modified fibroin may include a secretory signal for releasing the protein produced in the recombinant protein production system to the outside of a host. The sequence of the secretory signal can be appropriately set depending on the type of the host.
[0078] The domain sequence of the third modified fibroin has an amino acid sequence in which the content of the (A).sub.n motif is reduced, as compared with naturally derived fibroin. It can be said that the domain sequence of the third modified fibroin has an amino acid sequence corresponding to an amino acid sequence in which at least one or a plurality of (A).sub.n motifs are deleted, as compared with naturally derived fibroin.
[0079] The third modified fibroin may have an amino acid sequence corresponding to an amino acid sequence in which 10 to 40% of the (A).sub.n motifs are deleted from naturally derived fibroin.
[0080] The domain sequence of the third modified fibroin may have an amino acid sequence corresponding to an amino acid sequence in which at least one (A).sub.n motif of every one to three (A).sub.n motifs is deleted from the N-terminal side to the C-terminal side, as compared with naturally derived fibroin.
[0081] The third modified fibroin may have an amino acid sequence corresponding to an amino acid sequence in which deletion of at least two consecutive (A).sub.n motifs and deletion of one (A).sub.n motif are repeated in this order from the N-terminal side to the C-terminal side, as compared with naturally derived fibroin.
[0082] The third modified fibroin may have a domain sequence having an amino acid sequence corresponding to an amino acid sequence in which at least (A).sub.n motif every other two positions is deleted from the N-terminal side to the C-terminal side.
[0083] The third modified fibroin includes a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m, and may have an amino acid sequence in which x/y is 20% or more, 30% or more, 40% or more, or 50% or more in a case where the numbers of amino acid residues in REPs of two adjacent [(A).sub.n motif-REP] units are sequentially compared from the N-terminal side to the C-terminal side, and the number of amino acid residues in one REP having a smaller number of amino acid residues is defined as 1, the maximum value of the total value of the number of amino acid residues in the two adjacent [(A).sub.n motif-REP] units where the ratio of the number of amino acid residues in the other REP is 1.8 to 11.3 is defined as x, and the total number of amino acid residues in the domain sequence is defined as y. The number of alanine residues with respect to the total number of amino acid residues in the (A).sub.n motif is 83% or more, preferably 86% or more, more preferably 90% or more, still more preferably 95% or more, and even still more preferably 100% (meaning that the (A).sub.n motif consists of only alanine residues).
[0084] The method of calculating x/y will be described in more detail with reference to FIG. 1. FIG. 1 shows a domain sequence excluding the N-terminal sequence and the C-terminal sequence from the modified fibroin. The domain sequence has a sequence of (A).sub.n motif-first REP (50 amino acid residues)-(A).sub.n motif-second REP (100 amino acid residues)-(A).sub.n motif-third REP (10 amino acid residues)-(A).sub.n motif-fourth REP (20 amino acid residues)-(A).sub.n motif-fifth REP (30 amino acid residues)-(A).sub.n motif from the N-terminal side (left side).
[0085] The two adjacent [(A).sub.n motif-REP] units are sequentially selected from the N-terminal side to the C-terminal side so as not to overlap. At this time, an unselected [(A).sub.n motif-REP] unit may exist. FIG. 1 shows a pattern 1 (a comparison between the first REP and the second REP, and a comparison between the third REP and the fourth REP), a pattern 2 (a comparison between the first REP and the second REP, and a comparison between the fourth REP and the fifth REP), a pattern 3 (a comparison between the second REP and the third REP, and a comparison between the fourth REP and the fifth REP), and a pattern 4 (a comparison between the first REP and the second REP). There are other selection methods besides this.
[0086] Subsequently, the number of amino acid residues of each REP in the selected two adjacent [(A).sub.n motif-REP] units is compared for each pattern. The comparison is performed by determining the ratio of the number of amino acid residues of the other REP in a case where one REP having a smaller number of amino acid residues is defined as 1. For example, in a case of comparing the first REP (50 amino acid residues) and the second REP (100 amino acid residues), the ratio of the number of amino acid residues of the second REP is 100/50=2 in a case where the first REP having a smaller number of amino acid residues is defined as 1. Similarly, in a case of comparing the fourth REP (20 amino acid residues) and the fifth REP (30 amino acid residues), the ratio of the number of amino acid residues of the fifth REP is 30/20=1.5 in a case where the fourth REP having a smaller number of amino acid residues is defined as 1.
[0087] In FIG. 1, a set of [(A).sub.n motif-REP] units in which the ratio of the number of amino acid residues of the other REP is 1.8 to 11.3 in a case where one REP having a smaller number of amino acid residues is defined as 1 is indicated by a solid line. In the present specification, the ratio is referred to as a Giza ratio. A set of [(A).sub.n motif-REP] units in which the ratio of the number of amino acid residues of the other REP is less than 1.8 or more than 11.3 in a case where one REP having a smaller number of amino acid residues is defined as 1 is indicated by a broken line.
[0088] In each pattern, the number of all amino acid residues of two adjacent [(A).sub.n motif-REP] units indicated by solid lines (including not only the number of amino acid residues of REP but also the number of amino acid residues of the (A)n motif) is combined. Then, the total values combined are compared, and the total value of the pattern whose total value is the maximum (the maximum value of the total value) is defined as x. In the example shown in FIG. 1, the total value of the pattern 1 is the maximum.
[0089] Then, x/y (%) can be calculated by dividing x by the total number of amino acid residues y of the domain sequence.
[0090] In the third modified fibroin, x/y is preferably 50% or more, more preferably 60% or more, still more preferably 65% or more, even still more preferably 70% or more, still further preferably 75% or more, and particularly preferably 80% or more. The upper limit of x/y is not particularly limited, but may be, for example, 100% or less. In a case where the Giza ratio is 1:1.9 to 11.3, x/y is preferably 89.6% or more; in a case where the Giza ratio is 1:1.8 to 3.4, x/y is preferably 77.1% or more; in a case where the Giza ratio is 1:1.9 to 8.4, x/y is preferably 75.9% or more; and in a case where the Giza ratio is 1:1.9 to 4.1, x/y is preferably 64.2% or more.
[0091] In a case where the third modified fibroin is modified fibroin in which at least seven of a plurality of (A).sub.n motifs in the domain sequence consist of only alanine residues, x/y is preferably 46.4% or more, more preferably 50% or more, still more preferably 55% or more, even still more preferably 60% or more, still further preferably 70% or more, and particularly preferably 80% or more. The upper limit of x/y is not particularly limited, but is only required to be 100% or less.
[0092] Here, x/y in naturally derived fibroin will be described. First, as described above, 663 types of fibroins (415 types of fibroins derived from spiders among them) were extracted by confirming fibroins with amino acid sequence information registered in NCBI GenBank by an exemplified method. The values of x/y were calculated by the calculation method described above, from amino acid sequences of naturally derived fibroins consisting of a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m, among all the extracted fibroins. The results in a case where the Giza ratio is 1:1.9 to 4.1 are shown in FIG. 3.
[0093] The horizontal axis in FIG. 3 represents x/y (%), and the vertical axis represents a frequency. As is clear from FIG. 3, the values of x/y in naturally derived fibroin are all smaller than 64.2% (the largest value is 64.14%).
[0094] The third modified fibroin can be obtained from, for example, a cloned gene sequence of naturally derived fibroin, by deleting one or a plurality of sequences encoding an (A).sub.n motif so that x/y is 64.2% or more. In addition, for example, the third modified fibroin can also be obtained, from the amino acid sequence of naturally derived fibroin, by designing an amino acid sequence corresponding to an amino acid sequence in which one or a plurality of (A).sub.n motifs are deleted so that x/y is 64.2% or more, and chemically synthesizing a nucleic acid encoding the designed amino acid sequence. In any case, in addition to the modification corresponding to deletion of the (A).sub.n motif from the amino acid sequence of naturally derived fibroin, modification of the amino acid sequence corresponding to substitution, deletion, insertion, and/or addition of one or a plurality of amino acid residues may be performed.
[0095] A more specific example of the third modified fibroin can include modified fibroin including (3-i) the amino acid sequence set forth in SEQ ID NO: 17 (Met-PRT399), SEQ ID NO: 7 (Met-PRT410), SEQ ID NO: 8 (Met-PRT525), or SEQ ID NO: 9 (Met-PRT799), or (3-ii) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 17, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9.
[0096] The modified fibroin of (3-i) will be described. The amino acid sequence set forth in SEQ ID NO: 17 is obtained by deleting every other two (A).sub.n motifs from the N-terminal side to the C-terminal side from the amino acid sequence set forth in SEQ ID NO: 10 (Met-PRT313) corresponding to naturally derived fibroin and further inserting one [(A).sub.n motif-REP] before the C-terminal sequence. The amino acid sequence set forth in SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9 is as described in the second modified fibroin.
[0097] The value of x/y in the amino acid sequence set forth in SEQ ID NO: 10 (corresponding to naturally derived fibroin) at a Giza ratio of 1:1.8 to 11.3 is 15.0%. Both the value of x/y in the amino acid sequence set forth in SEQ ID NO: 17 and the value of x/y in the amino acid sequence set forth in SEQ ID NO: 7 are 93.4%. The value of x/y in the amino acid sequence set forth in SEQ ID NO: 8 is 92.7%. The value of x/y in the amino acid sequence set forth in SEQ ID NO: 9 is 89.8%. The values of z/w in the amino acid sequences set forth in SEQ ID NO: 10, SEQ ID NO: 17, SEQ ID NO: 7, SEQ ID NO: 8, and SEQ ID NO: 9 are 46.8%, 56.2%, 70.1%, 66.1%, and 70.0%, respectively.
[0098] The modified fibroin of (3-i) may consist of the amino acid sequence set forth in SEQ ID NO: 17, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9.
[0099] The modified fibroin of (3-ii) includes an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 17, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9. The modified fibroin of (3-ii) is also a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m. The sequence identity is preferably 95% or more.
[0100] It is preferred that the modified fibroin of (3-ii) has 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 17, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9, and x/y is 64.2% or more in a case where the numbers of amino acid residues in REPs of two adjacent [(A).sub.n motif-REP] units are sequentially compared from the N-terminal side to the C-terminal side, and the number of amino acid residues in one REP having a small number of amino acid residues is defined as 1, the maximum value of the total value of the number of amino acid residues in the two adjacent [(A).sub.n motif-REP] units where the ratio of the number of amino acid residues in the other REP is 1.8 to 11.3 (the Giza ratio is 1:1.8 to 11.3) is defined as x, and the total number of amino acid residues in the domain sequence is defined as y.
[0101] The third modified fibroin may include the above-described tag sequence at either or both of the N-terminal and the C-terminal.
[0102] A more specific example of the modified fibroin including a tag sequence can include modified fibroin including (3-iii) the amino acid sequence set forth in SEQ ID NO: 18 (PRT399), SEQ ID NO: 13 (PRT410), SEQ ID NO: 14 (PRT525), or SEQ ID NO: 15 (PRT799), or (3-iv) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 18, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15.
[0103] Each of the amino acid sequences set forth in SEQ ID NO: 18, SEQ ID NO: 13, SEQ ID NO: 14, and SEQ ID NO: 15 is obtained by adding the amino acid sequence set forth in SEQ ID NO: 11 (including a His tag sequence and a hinge sequence) to the N-terminal of each of the amino acid sequences set forth in SEQ ID NO: 17, SEQ ID NO: 7, SEQ ID NO: 8, and SEQ ID NO: 9.
[0104] The modified fibroin of (3-iii) may consist of the amino acid sequence set forth in SEQ ID NO: 18, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15.
[0105] The modified fibroin of (3-iv) includes an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 18, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15. The modified fibroin of (3-iv) is also a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m. The sequence identity is preferably 95% or more.
[0106] It is preferred that the modified fibroin of (3-iv) has 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 18, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15, and x/y is 64.2% or more in a case where the number of amino acid residues in REPs in two adjacent [(A).sub.n motif-REP] units are sequentially compared from the N-terminal side to the C-terminal side, and the number of amino acid residues in one REP having a small number of amino acid residues is defined as 1, the maximum value of the total value of the number of amino acid residues in the two adjacent [(A).sub.n motif-REP] units where the ratio of the number of amino acid residues in the other REP is 1.8 to 11.3 is defined as x, and the total number of amino acid residues in the domain sequence is defined as y.
[0107] The third modified fibroin may include a secretory signal for releasing the protein produced in the recombinant protein production system to the outside of a host. The sequence of the secretory signal can be appropriately set depending on the type of the host.
[0108] The domain sequence of the fourth modified fibroin has an amino acid sequence in which the content of an (A).sub.n motif and the content of glycine residues are reduced, as compared with naturally derived fibroin. It can be said that the domain sequence of the fourth modified fibroin has an amino acid sequence corresponding to an amino acid sequence in which at least one or a plurality of (A).sub.n motifs are deleted and at least one or a plurality of glycine residues in REP are substituted with another amino acid residue, as compared with naturally derived fibroin. That is, the fourth modified fibroin is modified fibroin having the characteristics of the above-described second modified fibroin and third modified fibroin. Specific aspects thereof, and the like are as in the descriptions for the second modified fibroin and the third modified fibroin.
[0109] A more specific example of the fourth modified fibroin can include modified fibroin including (4-i) the amino acid sequence set forth in SEQ ID NO: 7 (Met-PRT410), SEQ ID NO: 8 (Met-PRT525), SEQ ID NO: 9 (Met-PRT799), SEQ ID NO: 13 (PRT410), SEQ ID NO: 14 (PRT525), or SEQ ID NO: 15 (PRT799), or (4-ii) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15. Specific aspects of the modified fibroin including the amino acid sequence set forth in SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 14, or SEQ ID NO: 15 are as described above.
[0110] The domain sequence of the fifth modified fibroin may have an amino acid sequence including a region locally having a high hydropathy index corresponding to an amino acid sequence in which one or a plurality of amino acid residues in REP are substituted with amino acid residues having a high hydropathy index and/or one or a plurality of amino acid residues having a high hydropathy index are inserted into REP, as compared with naturally derived fibroin.
[0111] The region locally having a high hydropathy index preferably consists of consecutive two to four amino acid residues.
[0112] The above-described amino acid residue having a high hydropathy index is more preferably an amino acid residue selected from isoleucine (I), valine (V), leucine (L), phenylalanine (F), cysteine (C), methionine (M), and alanine (A).
[0113] The fifth modified fibroin may be further subjected to modification of an amino acid sequence corresponding to substitution, deletion, insertion, and/or addition of one or a plurality of amino acid residues as compared with naturally derived fibroin, in addition to modification corresponding to substitution of one or a plurality of amino acid residues in REP with amino acid residues having a high hydropathy index and/or insertion of one or a plurality of amino acid residues having a high hydropathy index into REP, as compared with naturally derived fibroin.
[0114] The fifth modified fibroin can be obtained by, for example, substituting one or a plurality of hydrophilic amino acid residues in REP (for example, amino acid residues having a negative hydropathy index) with hydrophobic amino acid residues (for example, amino acid residues having a positive hydropathy index) from a cloned gene sequence of naturally derived fibroin, and/or inserting one or a plurality of hydrophobic amino acid residues into REP. In addition, the fifth modified fibroin can be obtained by, for example, designing an amino acid sequence corresponding to an amino acid sequence in which one or a plurality of hydrophilic amino acid residues in REP are substituted with hydrophobic amino acid residues from an amino acid sequence of naturally derived fibroin, and/or one or a plurality of hydrophobic amino acid residues are inserted into REP, and chemically synthesizing a nucleic acid encoding the designed amino acid sequence. In any case, in addition to modification corresponding to substitution of one or a plurality of hydrophilic amino acid residues in REP with hydrophobic amino acid residues from an amino acid sequence of naturally derived fibroin, and/or insertion of one or a plurality of hydrophobic amino acid residues into REP, modification of an amino acid sequence corresponding to substitution, deletion, insertion, and/or addition of one or a plurality of amino acid residues may be further performed.
[0115] The fifth modified fibroin includes a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m, and may have an amino acid sequence in which p/q is 6.2% or more in a case where in all REPs included in a sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence, the total number of amino acid residues included in a region where the average value of hydropathy indices of four consecutive amino acid residues is 2.6 or more is defined as p, and the total number of amino acid residues included in the sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence is defined as q.
[0116] A known index (Hydropathy index: Kyte J, & Doolittle R (1982), "A simple method for displaying the hydropathic character of a protein", J. Mol. Biol., 157, pp. 105-132) is used as the hydropathy index of the amino acid residue. Specifically, the hydropathy index (hereinafter, also referred to as "HI") of each amino acid is as shown in Table 1.
TABLE-US-00001 TABLE 1 Amino acid HI Amino acid HI Isoleucine (Ile) 4.5 Tryptophan (Trp) -0.9 Valine (Val) 4.2 Tyrosine (Tyr) -1.3 Leucine (Leu) 3.8 Proline (Pro) -1.6 Phenylalanine (Phe) 2.8 Histidine (His) -3.2 Cysteine (Cys) 2.5 Asparagine (Asn) -3.5 Methionine (Met) 1.9 Aspartic acid (Asp) -3.5 Alanine (Ala) 1.8 Glutamine (Gln) -3.5 Glycine (Gly) -0.4 Glutamine acid (Glu) -3.5 Threonine (Thr) -0.7 Lysine (Lys) -3.9 Serine (Ser) -0.8 Arginine (Arg) -4.5
[0117] The method of calculating p/q will be described in more detail. In the calculation, a sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence represented by Formula 1 [(A).sub.n motif-REP].sub.m (hereinafter also referred to as "sequence A") is used. First, in all REPs included in the sequence A, the average values of hydropathy indices of four consecutive amino acid residues are calculated. The average value of hydropathy indices is determined by dividing the sum of HIs of respective amino acid residues included in the four consecutive amino acid residues by 4 (number of amino acid residues). The average value of hydropathy indices is determined for all of the four consecutive amino acid residues (each of the amino acid residues is used for calculating the average value 1 to 4 times). Then, a region where the average value of hydropathy indices of the four consecutive amino acid residues is 2.6 or more is specified. Even in a case where a certain amino acid residue corresponds to the "four consecutive amino acid residues having an average value of hydropathy indices of 2.6 or more" multiple times, the amino acid residue is included as one amino acid residue in the region. The total number of amino acid residues included in the region is p. Also, the total number of amino acid residues included in the sequence A is q.
[0118] For example, in a case where the "four consecutive amino acid residues having an average value of hydropathy indices of 2.6 or more" are extracted from 20 places (no overlap), in the region where the average value of hydropathy indices of four consecutive amino acid residues is 2.6 or more, 20 of the four consecutive amino acid residues (no overlap) are included, and thus p is 20.times.4=80. Further, for example, in a case where two of the "four consecutive amino acid residues having an average value of hydropathy indices of 2.6 or more" overlap by one amino acid residue, in the region where the average value of hydropathy indices of the four consecutive amino acid residues is 2.6 or more, seven amino acid residues are included (p=2.times.4-1=7. "-1" is the deduction of the overlapping portion). For example, in a case of the domain sequence shown in FIG. 4, seven sets of "four consecutive amino acid residues having an average value of hydropathy indices of 2.6 or more" are present without overlaps, and thus, p is 7.times.4=28. Furthermore, for example, in the case of the domain sequence shown in FIG. 4, q is 4+50+4+40+4+10+4+20+4+30=170 (the (A).sub.n motif located at the end of the C-terminal side is excluded). Next, p/q (%) can be calculated by dividing p by q. In the case of FIG. 4, p/q is 28/170=16.47%.
[0119] In the fifth modified fibroin, p/q is preferably 6.2% or more, more preferably 7% or more, still more preferably 10% or more, even still more preferably 20% or more, and still further preferably 30% or more. The upper limit of p/q is not particularly limited, but may be 45% or less, for example.
[0120] The fifth modified fibroin can be obtained by, for example, substituting one or a plurality of hydrophilic amino acid residues in REP (for example, amino acid residues having a negative hydropathy index) with hydrophobic amino acid residues (for example, amino acid residues having a positive hydropathy index) so that a cloned amino acid sequence of naturally derived fibroin satisfies the condition of p/q, and/or modifying the cloned amino acid sequence of naturally derived fibroin into an amino acid sequence including a region locally having a high hydropathy index by inserting one or a plurality of hydrophobic amino acid residues into REP. In addition, the fifth modified fibroin can also be obtained by, for example, designing an amino acid sequence satisfying the condition of p/q from the amino acid sequence of naturally derived fibroin, and chemically synthesizing a nucleic acid encoding the designed amino acid sequence. In any case, modification corresponding to substitution, deletion, insertion, and/or addition of one or a plurality of amino acid residues may also be performed, in addition to modification corresponding to substitution of one or a plurality of amino acid residues in REP with amino acid residues having a high hydropathy index, and/or insertion of one or a plurality of amino acid residues having a high hydropathy index into REP, as compared with naturally derived fibroin.
[0121] The amino acid residue having a high hydropathy index is not particularly limited, but is preferably isoleucine (I), valine (V), leucine (L), phenylalanine (F), cysteine (C), methionine (M), and alanine (A), and more preferably valine (V), leucine (L), and isoleucine (I).
[0122] A more specific example of the fifth modified fibroin can include modified fibroin including (5-i) the amino acid sequence set forth in SEQ ID NO: 19 (Met-PRT720), SEQ ID NO: 20 (Met-PRT665), or SEQ ID NO: 21 (Met-PRT666), or (5-ii) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 19, SEQ ID NO: 20, or SEQ ID NO: 21.
[0123] The modified fibroin of (5-i) will be described. The amino acid sequence set forth in SEQ ID NO: 19 is obtained by inserting an amino acid sequence consisting of three amino acid residues (VLI) at two sites for each REP into the amino acid sequence set forth in SEQ ID NO: 7 (Met-PRT410), except for the domain sequence at the end on the C-terminal side, and further substituting a part of glutamine (Q) residues with serine (S) residues, and deleting a part of amino acids on the C-terminal side. The amino acid sequence set forth in SEQ ID NO: 20 is obtained by inserting the amino acid sequence consisting of three amino acid residues (VLI) at one site for each REP into the amino acid sequence set forth in SEQ ID NO: 8 (Met-PRT525). The amino acid sequence set forth in SEQ ID NO: 21 is obtained by inserting the amino acid sequence consisting of three amino acid residues (VLI) at two sites for each REP into the amino acid sequence set forth in SEQ ID NO: 8.
[0124] The modified fibroin of (5-i) may consist of the amino acid sequence set forth in SEQ ID NO: 19, SEQ ID NO: 20, or SEQ ID NO: 21.
[0125] The modified fibroin of (5-ii) includes an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 19, SEQ ID NO: 20, or SEQ ID NO: 21. The modified fibroin of (5-ii) is also a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m. The sequence identity is preferably 95% or more.
[0126] It is preferred that the modified fibroin of (5-ii) has 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 19, SEQ ID NO: 20, or SEQ ID NO: 21, and p/q is 6.2% or more in a case where in all REPs included in a sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence, the total number of amino acid residues included in a region where the average value of hydropathy indices of four consecutive amino acid residues is 2.6 or more is defined as p, and the total number of amino acid residues included in the sequence excluding the sequence from the (A).sub.n motif located at the most the C-terminal side to the C-terminal of the domain sequence from the domain sequence is defined as q.
[0127] The fifth modified fibroin may include a tag sequence at either or both of the N-terminal and the C-terminal.
[0128] A more specific example of the modified fibroin including a tag sequence can include modified fibroin including (5-iii) the amino acid sequence set forth in SEQ ID NO: 22 (PRT720), SEQ ID NO: 23 (PRT665), or SEQ ID NO: 24 (PRT666), or (5-iv) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 22, SEQ ID NO: 23, or SEQ ID NO: 24.
[0129] Each of the amino acid sequences set forth in SEQ ID NO: 22, SEQ ID NO: 23, and SEQ ID NO: 24 is obtained by adding the amino acid sequence set forth in SEQ ID NO: 11 (including a His tag sequence and a hinge sequence) to the N-terminal of each of the amino acid sequences set forth in SEQ ID NO: 19, SEQ ID NO: 20, and SEQ ID NO: 21.
[0130] The modified fibroin of (5-iii) may consist of the amino acid sequence set forth in SEQ ID NO: 22, SEQ ID NO: 23, or SEQ ID NO: 24.
[0131] The modified fibroin of (5-iv) includes an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 22, SEQ ID NO: 23, or SEQ ID NO: 24. The modified fibroin of (5-iv) is also a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m. The sequence identity is preferably 95% or more.
[0132] It is preferred that the modified fibroin of (5-iv) has 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 22, SEQ ID NO: 23, or SEQ ID NO: 24, and p/q is 6.2% or more in a case where in all REPs included in a sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence, the total number of amino acid residues included in a region where the average value of hydropathy indices of four consecutive amino acid residues is 2.6 or more is defined as p, and the total number of amino acid residues included in the sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence is defined as q.
[0133] The fifth modified fibroin may include a secretory signal for releasing the protein produced in the recombinant protein production system to the outside of a host. The sequence of the secretory signal can be appropriately set depending on the type of the host.
[0134] The sixth modified fibroin has an amino acid sequence in which the content of glutamine residues is reduced, as compared with naturally derived fibroin.
[0135] In the sixth modified fibroin, at least one motif selected from a GGX motif and a GPGXX motif is preferably included in the amino acid sequence of REP.
[0136] In a case where the sixth modified fibroin has the GPGXX motif in REP, the GPGXX motif content is usually 1% or more, may also be 5% or more, and preferably 10% or more. The upper limit of the GPGXX motif content is not particularly limited, and may be 50% or less, or may also be 30% or less.
[0137] In the present specification, the "GPGXX motif content" is a value calculated by the following method.
[0138] The content rate of the GPGXX motif in fibroin (modified fibroin or naturally derived fibroin) including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m or Formula 2: [(A).sub.n motif-REP].sub.m-(A).sub.n motif is calculated as s/t, in a case where the number obtained by tripling the total number of GPGXX motifs in regions of all REPs included in a sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence (that is, corresponding to the total number of G and P in the GPGXX motifs) is defined as s, and the total number of amino acid residues in all REPs excluding a sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence and further excluding the (A).sub.n motifs is defined as t.
[0139] In the calculation of the content rate of the GPGXX motif, the "sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence" is used to exclude the effect occurring due to the fact that the "sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence" (a sequence corresponding to REP) may include a sequence having a low correlation with the sequence characteristic of fibroin, which influences the calculation result of the content rate of the GPGXX motif in a case where m is small (that is, in a case where the domain sequence is short). Incidentally, in a case where the "GPGXX motif" is located at the C-terminal of REP, even when "XX" is "AA", for example, it is treated as the "GPGXX motif".
[0140] FIG. 5 is a schematic diagram illustrating a domain sequence of modified fibroin. The method for calculating the content rate of the GPGXX motif will be specifically described with reference to FIG. 5. First, in the domain sequence of the modified fibroin shown in FIG. 5 (which is the "[(A).sub.n motif-REP].sub.m-(A).sub.n motif" type), all REPs are included in the "sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence" (in FIG. 5, the sequence indicated as a "region A"), and therefore, the number of the GPGXX motifs for calculating s is 7, and s is 7.times.3=21. Similarly, since all REPs are included in the "sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence" (in FIG. 5, the sequence indicated as the "region A"), the total number t of the amino acid residues in all REPs when the (A).sub.n motifs are further excluded from the sequence is 50+40+10+20+30=150. Next, s/t (%) can be calculated by dividing s by t, and in the case of the modified fibroin of FIG. 5, s/t is 21/150=14.0%.
[0141] In the sixth modified fibroin, the glutamine residue content is preferably 9% or less, more preferably 7% or less, still more preferably 4% or less, and particularly preferably 0%.
[0142] In the present specification, the "glutamine residue content" is a value calculated by the following method.
[0143] The glutamine residue content in fibroin (modified fibroin or naturally derived fibroin) including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m or Formula 2: [(A).sub.n motif-REP].sub.m-(A).sub.n motif is calculated as u/t, in a case where the total number of glutamine residues included in regions of all REPs included in a sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence (a sequence corresponding to the "region A" in FIG. 5) is defined as u, and the total number of amino acid residues in all REPs in the sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence and further excluding the (A).sub.n motifs is defined as t. In the calculation of the glutamine residue content, the reason for targeting the "sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence" is the same as the reason descried above.
[0144] The domain sequence of the sixth modified fibroin may have an amino acid sequence corresponding to an amino acid sequence in which one or a plurality of glutamine residues in REP are deleted, or one or a plurality of glutamine residues are substituted with another amino acid residue, as compared with naturally derived fibroin.
[0145] The "another amino acid residue" may be an amino acid residue other than the glutamine residue, but is preferably an amino acid residue having a higher hydropathy index than that of the glutamine residue. The hydropathy index of the amino acid residue is as shown in Table 1.
[0146] As shown in Table 1, examples of the amino acid residue having a higher hydropathy index than that of the glutamine residue include amino acid residues selected from isoleucine (I), valine (V), leucine (L), phenylalanine (F), cysteine (C), methionine (M), alanine (A), glycine (G), threonine (T), serine (S), tryptophan (W), tyrosine (Y), proline (P), and histidine (H). Among them, the amino acid residue is more preferably an amino acid residue selected from isoleucine (I), valine (V), leucine (L), phenylalanine (F), cysteine (C), methionine (M), and alanine (A), and still more preferably an amino acid residue selected from isoleucine (I), valine (V), leucine (L), and phenylalanine (F).
[0147] In the sixth modified fibroin, the hydrophobicity of REP is preferably -0.8 or more, more preferably -0.7 or more, still more preferably 0 or more, even still more preferably 0.3 or more, and particularly preferably 0.4 or more. The upper limit of the hydrophobicity of REP is not particularly limited, but may be 1.0 or less or 0.7 or less.
[0148] In the present specification, the "hydrophobicity of REP" is a value calculated by the following method.
[0149] The hydrophobicity of REP in fibroin including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m or Formula 2: [(A).sub.n motif-REP].sub.m-(A).sub.n motif (modified fibroin or naturally derived fibroin) is calculated as v/t, in a case where the sum of hydropathy indices of amino acid residues in regions of all REPs included in a sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence (a sequence corresponding to the "region A" in FIG. 5) is defined as v, and the total number of amino acid residues in all REPs in the sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence and further excluding the (A).sub.n motifs is defined as t. In the calculation of the hydrophobicity of REP, the reason for targeting the "sequence excluding the sequence from the (A).sub.n motif located at the most C-terminal side to the C-terminal of the domain sequence from the domain sequence" is the same as the reason descried above.
[0150] The domain sequence of the sixth modified fibroin may be further subjected to modification of an amino acid sequence corresponding to substitution, deletion, insertion, and/or addition of one or a plurality of amino acid residues, in addition to modification corresponding to deletion of one or a plurality of glutamine residues in REP, and/or substitution of one or a plurality of glutamine residues in REP with another amino acid residue, as compared with naturally derived fibroin.
[0151] The sixth modified fibroin can be obtained by, for example, deleting one or a plurality of glutamine residues in REP from a cloned gene sequence of naturally derived fibroin, and/or substituting one or a plurality of glutamine residues in REP with another amino acid residue. In addition, the sixth modified fibroin can be obtained by, for example, designing an amino acid sequence corresponding to an amino acid sequence in which one or a plurality of glutamine residues in REP are deleted from an amino acid sequence of naturally derived fibroin, and/or one or a plurality of glutamine residues in REP are substituted with another amino acid residue, and chemically synthesizing a nucleic acid encoding the designed amino acid sequence.
[0152] A more specific example of the sixth modified fibroin can include modified fibroin including (6-i) the amino acid sequence set forth in SEQ ID NO: 25 (Met-PRT888), SEQ ID NO: 26 (Met-PRT965), SEQ ID NO: 27 (Met-PRT889), SEQ ID NO: 28 (Met-PRT916), SEQ ID NO: 29 (Met-PRT918), SEQ ID NO: 30 (Met-PRT699), SEQ ID NO: 31 (Met-PRT698), SEQ ID NO: 32 (Met-PRT966), SEQ ID NO: 41 (Met-PRT917), or SEQ ID NO: 42 (Met-PRT1028), and modified fibroin including (6-ii) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 41, or SEQ ID NO: 42.
[0153] The modified fibroin of (6-i) will be described. The amino acid sequence set forth in SEQ ID NO: 25 is obtained by substituting all QQs in the amino acid sequence set forth in SEQ ID NO: 7 (Met-PRT410) with VL. The amino acid sequence set forth in SEQ ID NO: 26 is obtained by substituting all QQs in the amino acid sequence set forth in SEQ ID NO: 7 with TS and substituting the remaining Q with A. The amino acid sequence set forth in SEQ ID NO: 27 is obtained by substituting all QQs in the amino acid sequence set forth in SEQ ID NO: 7 with VL and substituting the remaining Q with I. The amino acid sequence set forth in SEQ ID NO: 28 is obtained by substituting all QQs in the amino acid sequence set forth in SEQ ID NO: 7 with VI and substituting the remaining Q with L. The amino acid sequence set forth in SEQ ID NO: 29 is obtained by substituting all QQs in the amino acid sequence set forth in SEQ ID NO: 7 with VF and substituting the remaining Q with I.
[0154] The amino acid sequence set forth in SEQ ID NO: 30 is obtained by substituting all QQs in the amino acid sequence set forth in SEQ ID NO: 8 (Met-PRT525) with VL. The amino acid sequence set forth in SEQ ID NO: 31 is obtained by substituting all QQs in the amino acid sequence set forth in SEQ ID NO: 8 with VL and substituting the remaining Q with I.
[0155] The amino acid sequence set forth in SEQ ID NO: 32 is obtained by substituting, with VF, all QQs in a sequence obtained by repeating a region of 20 domain sequences present in the amino acid sequence set forth in SEQ ID NO: 7 (Met-PRT410) two times and substituting the remaining Q with I.
[0156] The amino acid sequence set forth in SEQ ID NO: 41 (Met-PRT917) is obtained by substituting all QQs in the amino acid sequence set forth in SEQ ID NO: 7 with LI and substituting the remaining Q with V. The amino acid sequence set forth in SEQ ID NO: 42 (Met-PRT1028) is obtained by substituting all QQs in the amino acid sequence set forth in SEQ ID NO: 7 with IF and substituting the remaining Q with T.
[0157] The glutamine residue content in each of the amino acid sequences set forth in SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 41, and SEQ ID NO: 42 is 9% or less (Table 2).
TABLE-US-00002 TABLE 2 Glutamine GPGXX Hydro- residue motif phobicity Modified Fibroin content content of REP Met-PRT410 (SEQ ID NO: 7) 17.7% 27.9% -1.52 Met-PRT888 (SEQ ID NO: 25) 6.3% 27.9% -0.07 Met-PRT965 (SEQ ID NO: 26) 0.0% 27.9% -0.65 Met-PRT889 (SEQ ID NO: 27) 0.0% 27.9% 0.35 Met-PRT916 (SEQ ID NO: 28) 0.0% 27.9% 0.47 Met-PRT918 (SEQ ID NO: 29) 0.0% 27.9% 0.45 Met-PRT699 (SEQ ID NO: 30) 3.6% 26.4% -0.78 Met-PRT698 (SEQ ID NO: 31) 0.0% 26.4% -0.03 Met-PRT966 (SEQ ID NO: 32) 0.0% 28.0% 0.35 Met-PRT917 (SEQ ID NO: 41) 0.0% 27.9% 0.46 Met-PRT1028 (SEQ ID NO: 42) 0.0% 28.1% 0.05
[0158] The modified fibroin of (6-i) may consist of the amino acid sequence set forth in SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 41, or SEQ ID NO: 42.
[0159] The modified fibroin of (6-ii) includes an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 41, or SEQ ID NO: 42. The modified fibroin of (6-ii) is also a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP], or Formula 2: [(A).sub.n motif-REP].sub.m-(A).sub.n motif. The sequence identity is preferably 95% or more.
[0160] In the modified fibroin of (6-ii), the glutamine residue content is preferably 9% or less. In the modified fibroin of (6-ii), the GPGXX motif content is preferably 10% or more.
[0161] The sixth modified fibroin may have a tag sequence at either or both of the N-terminal and the C-terminal. This enables the modified fibroin to be isolated, immobilized, detected, and visualized.
[0162] A more specific example of the modified fibroin including a tag sequence can include modified fibroin including (6-iii) the amino acid sequence set forth in SEQ ID NO: 33 (PRT888), SEQ ID NO: 34 (PRT965), SEQ ID NO: 35 (PRT889), SEQ ID NO: 36 (PRT916), SEQ ID NO: 37 (PRT918), SEQ ID NO: 38 (PRT699), SEQ ID NO: 39 (PRT698), SEQ ID NO: 40 (PRT966), SEQ ID NO: 43 (PRT917), or SEQ ID NO: 44 (PRT1028), or modified fibroin including (6-iv) an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 43, or SEQ ID NO: 44.
[0163] Each of the amino acid sequences set forth in SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 43, and SEQ ID NO: 44 is obtained by adding the amino acid sequence set forth in SEQ ID NO: 11 (including a His tag sequence and a hinge sequence) to the N-terminal of each of the amino acid sequences set forth in SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 41, and SEQ ID NO: 42. Since only the tag sequence is added to the N-terminal, the glutamine residue content is not changed, and the glutamine residue content in each of the amino acid sequences set forth in SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 43, or SEQ ID NO: 44 is 9% or less (Table 3).
TABLE-US-00003 TABLE 3 Glutamine GPGXX Hydro- residue motif phobicity Modified Fibroin content content of REP PRT888 (SEQ ID NO: 33) 6.3% 27.9% -0.07 PRT965 (SEQ ID NO: 34) 0.0% 27.9% -0.65 PRT889 (SEQ ID NO: 35) 0.0% 27.9% 0.35 PRT916 (SEQ ID NO: 36) 0.0% 27.9% 0.47 PRT918 (SEQ ID NO: 37) 0.0% 27.9% 0.45 PRT699 (SEQ ID NO: 38) 3.6% 26.4% -0.78 PRT698 (SEQ ID NO: 39) 0.0% 26.4% -0.03 PRT966 (SEQ ID NO: 40) 0.0% 28.0% 0.35 PRT917 (SEQ ID NO: 43) 0.0% 27.9% 0.46 PRT1028 (SEQ ID NO: 44) 0.0% 28.1% 0.05
[0164] The modified fibroin of (6-iii) may consist of the amino acid sequence set forth in SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 43, or SEQ ID NO: 44.
[0165] The modified fibroin of (6-iv) includes an amino acid sequence having 90% or more sequence identity with the amino acid sequence set forth in SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 43, or SEQ ID NO: 44. The modified fibroin of (6-iv) is also a protein including a domain sequence represented by Formula 1: [(A).sub.n motif-REP].sub.m or Formula 2: [(A).sub.n motif-REP].sub.m-(A).sub.n motif. The sequence identity is preferably 95% or more.
[0166] In the modified fibroin of (6-iv), the glutamine residue content is preferably 9% or less. In the modified fibroin of (6-iv), the GPGXX motif content is preferably 10% or more.
[0167] The sixth modified fibroin may include a secretory signal for releasing the protein produced in the recombinant protein production system to the outside of a host. The sequence of the secretory signal can be appropriately set depending on the type of the host.
[0168] The modified fibroin may also be modified fibroin having at least two or more characteristics among the characteristics of the first modified fibroin, the second modified fibroin, the third modified fibroin, the fourth modified fibroin, the fifth modified fibroin, and the sixth modified fibroin.
[0169] The modified fibroin is preferably a hydrophilic modified fibroin because it is more excellent in water-absorbing and quick-drying properties. In the present specification, the "hydrophilic modified fibroin" is modified fibroin of which a value calculated by obtaining the sum of hydropathy indices (HIs) of all amino acid residues constituting the modified fibroin and then dividing the sum by the total number of amino acid residues (average HI) is 0 or less. The hydropathy index is as shown in Table 1. Modified fibroin having an average HI of more than 0 may also be referred to as hydrophobic modified fibroin.
[0170] Examples of the hydrophilic modified fibroin can include modified fibroin including the amino acid sequence set forth in SEQ ID NO: 4, the amino acid sequence set forth in SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9, the amino acid sequence set forth in SEQ ID NO: 13, SEQ ID NO: 11, SEQ ID NO: 14, or SEQ ID NO: 15, the amino acid sequence set forth in SEQ ID NO: 18, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9, the amino acid sequence set forth in SEQ ID NO: 17, SEQ ID NO: 11, SEQ ID NO: 14, or SEQ ID NO: 15, or the amino acid sequence set forth in SEQ ID NO: 19, SEQ ID NO: 20, or SEQ ID NO: 21.
[0171] Examples of the hydrophobic modified fibroin can include modified fibroin including the amino acid sequence set forth in SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, or SEQ ID NO: 43, or the amino acid sequence set forth in SEQ ID NO: 35, SEQ ID NO: 37, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40, SEQ ID NO: 41, or SEQ ID NO: 44.
[0172] The modified fibroin according to the present embodiment can be produced by an ordinary method using a nucleic acid encoding the modified fibroin. The nucleic acid encoding the modified fibroin may be chemically synthesized based on nucleotide sequence information, or may be synthesized using a PCR method or the like. Isolation and purification of the produced modified fibroin can be performed by a commonly used method.
[0173] (Water-Absorbing and Quick-Drying Property-Imparting Agent)
[0174] The water-absorbing and quick-drying property-imparting agent according to the present embodiment may contain one type of modified fibroin alone or a combination of two or more types thereof.
[0175] The water-absorbing properties of the water-absorbing and quick-drying property-imparting agent according to the present embodiment as evaluated in accordance with JIS L 1907 may be 60 seconds or less, 30 seconds or less, 20 seconds or less, 10 seconds or less, or 5 seconds or less. Water-absorbing properties can be evaluated, for example, by a method described in Examples described later.
[0176] The quick-drying properties of the water-absorbing and quick-drying property-imparting agent according to the present embodiment as evaluated through measurement of the diffusible residual moisture content may be 100 minutes or less, 90 minutes or less, 80 minutes or less, or 70 minutes or less. The diffusible residual moisture content (%) is a value calculated by the following equation.
Diffusible residual moisture content (%)=weight (g) of water at each time/weight (g) of water at the start of measurement.times.100
[0177] The quick-drying properties means a time required until the diffusible residual moisture content reaches 10% or less. The quick-drying properties can be evaluated, for example, by a method described in Examples described later.
[0178] The water-absorbing and quick-drying property-imparting agent according to the present embodiment may further contain other additives (components other than the active ingredient) according to its form, application, and the like. Examples of the additive include a plasticizer, a leveling agent, a crosslinking agent, a nucleating agent, an antioxidant, an ultraviolet absorber, a coloring agent, a filler, and a synthetic resin. The content of the additive may be 50 parts by mass or less with respect to 100 parts by mass of the total amount of the water-absorbing and quick-drying property-imparting agent.
[0179] The water-absorbing and quick-drying property-imparting agent according to the present embodiment may be in any form of, for example, a powder, a paste, and a liquid (for example, a suspension or solution). The water-absorbing and quick-drying property-imparting agent according to the present embodiment may also be in the form of, for example, a fiber, a film, a gel, a porous body, a particle, or the like. The form of the water-absorbing and quick-drying property-imparting agent according to the present embodiment may be appropriately set according to the object to which water-absorbing and quick-drying properties are imparted (water-absorbing and quick-drying property-imparted article) and the application thereof.
[0180] Since the water-absorbing and quick-drying property-imparting agent according to the present embodiment contains modified fibroin as a main component, the water-absorbing and quick-drying property-imparting agent can be formed into any form described above. The formed body may be formed of modified fibroin itself, or may be formed of a combination of modified fibroin and another material.
[0181] When the water-absorbing and quick-drying property-imparting agent according to the present embodiment is prepared in the form of a powder, for example, a protein obtained by the above-described method for producing modified fibroin may be dried into a powder. The protein powder may contain other additives as necessary.
[0182] When the water-absorbing and quick-drying property-imparting agent according to the present embodiment is prepared in the form of a liquid (for example, a solution), for example, a protein obtained by the above-described method for producing modified fibroin may be dissolved in a solvent that can dissolve modified fibroin to obtain a liquid (modified fibroin solution). The modified fibroin solution may contain other additives as necessary. Examples of the solvent that can dissolve modified fibroin can include dimethyl sulfoxide (DMSO), N,N-dimethylformamide (DMF), formic acid, and hexafluoroisopropanol (HFIP). An inorganic salt may be added to the solvent as a dissolution promoter.
[0183] When the water-absorbing and quick-drying property-imparting agent according to the present embodiment is prepared in the form of a fiber, for example, the modified fibroin solution may be used as a dope solution, and the dope solution is spun into fibers (modified fibroin fibers) by a known spinning method such as wet spinning, dry spinning, dry-wet spinning, or melt spinning. The form of the fiber may be a single yarn, may be a composite yarn such as a blended yarn, a combined filament yarn, a union yarn, a mix-weaved yarn, a piled yarn, and a covering yarn, or may be a nonwoven fabric or the like.
[0184] The modified fibroin fiber may be a short fiber or a long fiber. The modified fibroin fiber may also be a modified fibroin fiber alone or may be combined with other fibers. That is, a single yarn composed of only modified fibroin fibers, or a composite yarn composed of a combination of modified fibroin fibers and other fibers may be used singly or in combination. The single yarn and the composite yarn may be spun yarns in which short fibers are twisted, or may be filament yarns in which long fibers are twisted or not twisted. The modified fibroin fiber may be either a short fiber or a long fiber, and may be used as a fiber alone or in combination with another fiber without being processed into a yarn. Examples of the other fiber include synthetic fibers such as nylon and polyester, regenerated fibers such as cupra and rayon, and natural fibers such as cotton and hemp. When the modified fibroin fiber is used in combination with the other fiber, the content of the modified fibroin fiber is preferably 20 mass % or more, more preferably 30 mass % or more, still more preferably 40 mass % or more, and even still more preferably 50 mass % or more, based on the total amount of fibers.
[0185] When the water-absorbing and quick-drying property-imparting agent according to the present embodiment is prepared in the form of a film, a gel, a porous body, a particle, or the like, the water-absorbing and quick-drying property-imparting agent can be produced, for example, in accordance with the methods described in JP 2009-505668 A, JP 2009-505668 A, JP 5678283 B2, JP 4638735 B2, and the like.
[0186] [Method for Imparting Water-Absorbing and Quick-Drying Properties to Article]
[0187] A method for imparting water-absorbing and quick-drying properties to an article according to the present embodiment includes a step of incorporating modified fibroin into the article. Since the modified fibroin according to the present invention is excellent in water-absorbing and quick-drying properties, it is possible to impart water-absorbing and quick-drying properties to an article by incorporating the modified fibroin into the article.
[0188] The article is not particularly limited as long as it should impart water-absorbing and quick-drying properties. Specific examples thereof include fibers, woven fabrics, knitted fabrics, nonwoven fabrics, cotton, sponges, films, resins, composite materials (the form of the water-absorbing and quick-drying property-imparting agent according to the present embodiment is not limited), and various articles produced using these materials.
[0189] The step of incorporating the modified fibroin may be a step of incorporating the modified fibroin by blending the water-absorbing and quick-drying property-imparting agent according to the present invention described above. The method for incorporating the modified fibroin is not particularly limited, and may be a method of mixing the modified fibroin with the material (raw material), or a method of forming an article by combining the water-absorbing and quick-drying property-imparting agent prepared in the form of the above-described formed body with another material (formed body or the like). Alternatively, a method of forming an article (formed body) by forming modified fibroin itself (other additives may be included as necessary) may be used.
[0190] The content of the modified fibroin in the article is preferably 20 mass % or more, more preferably 30 mass % or more, still more preferably 40 mass % or more, and even still more preferably 50 mass % or more, based on the total amount of the article. The upper limit of the content of the modified fibroin may be 100 mass % or 90 mass % or less, based on the total amount of the article. By adjusting the content of the modified fibroin in the article, the water-absorbing and quick-drying properties of the article can be controlled. That is, since the modified fibroin according to the present invention is excellent in water-absorbing and quick-drying properties, the water-absorbing and quick-drying properties of the article can be further enhanced as the content of the modified fibroin in the article is increased.
Examples
[0191] Hereinafter, the present invention will be described more specifically based on Examples and the like. However, the present invention is not limited to the following Examples.
[0192] [Production of Modified Fibroin]
[0193] (1) Production of Expression Vector
[0194] Modified fibroin having the amino acid sequence set forth in SEQ ID NO: 37 (PRT918) and modified fibroin having the amino acid sequence shown in SEQ ID NO: 15 (PRT799) were designed. A nucleic acid encoding the designed modified fibroin was synthesized. In the nucleic acid, an NdeI site was added to the 5' end and an EcoRI site was added downstream of the stop codon. The nucleic acid was cloned into a cloning vector (pUC118). Thereafter, the nucleic acid was enzymatically cleaved by treatment with NdeI and EcoRI, and then recombined into a protein expression vector pET-22b(+) to obtain an expression vector.
[0195] (2) Expression of Protein
[0196] Escherichia coli BLR (DE3) was transformed with the obtained expression vector. The transformed Escherichia coli was cultured in 2 mL of an LB culture medium containing ampicillin for 15 hours. The culture solution was added to a 100 mL seed culture medium containing ampicillin (Table 4) so that OD.sub.600 was 0.005. The temperature of the culture solution was maintained at 30.degree. C., and the flask culture was performed (for about 15 hours) until the OD.sub.600 reached 5, thus obtaining a seed culture solution.
TABLE-US-00004 TABLE 4 Seed culture medium Reagent Concentration (g/L) Glucose 5.0 KH.sub.2PO.sub.4 4.0 K.sub.2HPO.sub.4 9.3 Yeast Extract 6.0 Ampicillin 0.4
[0197] The seed culture solution was added to a jar fermenter to which a 500 mL production culture medium (Table 5) was added so that OD.sub.600 was 0.05. The culture was performed while maintaining the culture solution temperature at 37.degree. C. and constantly controlling the pH to 6.9. Further, the dissolved oxygen concentration in the culture solution was maintained at 20% of the dissolved oxygen saturation concentration.
TABLE-US-00005 TABLE 5 Production culture medium Reagent Concentration (g/L) Glucose 12.0 KH.sub.2PO.sub.4 9.0 MgSO.sub.4.cndot.7H.sub.2O 2.4 Yeast Extract 15 FeSO.sub.4.cndot.7H.sub.2O 0.04 MnSO.sub.4.cndot.5H.sub.2O 0.04 CaCl.sub.2.cndot.2H.sub.2O 0.04 ADEKANOL (Adeka, LG-295S) 0.1 (mL/L)
[0198] Immediately after glucose in the production culture medium was completely consumed, a feed solution (455 g/1 L of glucose, 120 g/1 L of Yeast Extract) was added at a rate of 1 mL/min. The culture was performed while maintaining the culture solution temperature at 37.degree. C. and constantly controlling the pH to 6.9. Further, the dissolved oxygen concentration in the culture solution was maintained at 20% of the dissolved oxygen saturation concentration, and the culture was performed for 20 hours. Thereafter, 1 M isopropyl-.beta.-thiogalactopyranoside (IPTG) was added to the culture solution to a final concentration of 1 mM to induce the expression of the modified fibroin. The culture solution was centrifuged 20 hours after addition of IPTG, and bacterial cells were recovered. SDS-PAGE was conducted using the bacterial cells prepared from the culture solutions obtained before the addition of IPTG and after the addition of IPTG. The expression of the target modified fibroin which depended on the addition of IPTG was confirmed by the appearance of a band of the size of the target modified fibroin.
[0199] (3) Purification of Protein
[0200] The bacterial cells recovered 2 hours after the addition of IPTG were washed with a 20 mM Tris-HCl buffer (pH 7.4).
[0201] The bacterial cells after washing were suspended in a 20 mM Tris-HCl buffer (pH 7.4) containing about 1 mM PMSF and the cells were disrupted with a high-pressure homogenizer (manufactured by GEA Niro Soavi).
[0202] The disrupted cells were centrifuged to obtain a precipitate. The obtained precipitate was washed with a 20 mM Tris-HCl buffer (pH 7.4) until the purity of the precipitate became high. The precipitate after washing was suspended in a 8 M guanidine buffer (8 M guanidine hydrochloride, 10 mM sodium dihydrogen phosphate, 20 mM NaCl, 1 mM Tris-HCl, pH 7.0) so that the concentration of the precipitate was 100 mg/mL, and dissolved by stirring with a stirrer for 30 minutes at 60.degree. C. After dissolution, dialysis was performed with water using a dialysis tube (cellulose tube 36/32, manufactured by Sanko Junyaku Co., Ltd.). A white aggregated protein obtained after dialysis was recovered by centrifugation, and moisture was removed in a lyophilizer to recover lyophilized powder, thereby obtaining modified fibroins (PRT918 and PRT799).
[0203] PRT918 is hydrophobic modified fibroin having an average HI of more than 0. PRT799 is hydrophilic modified fibroin having an average HI of 0 or less.
[0204] [Production of Protein Fiber]
[0205] Dimethyl sulfoxide (DMSO) in which LiCl was dissolved so as to be 4.0 mass % was prepared as a solvent, and a lyophilized powder of the modified fibroin was added thereto so as to have a concentration of 24 mass %, and dissolved for 3 hours using a shaker. Thereafter, insoluble matters and foams were removed to obtain a modified fibroin solution (spinning dope).
[0206] The prepared spinning dope at 60.degree. C. was filtrated with a metal filter having an opening of 5 .mu.m. Thereafter, the filtrate was allowed to stand in a 30 mL-stainless steel syringe to remove foams. The resulting spinning dope was discharged from a solid nozzle having a needle diameter of 0.2 mm into a 100 mass % methanol coagulation bath. The discharge temperature was 60.degree. C. After completion of the coagulation, the obtained original yarn was wound up, and naturally dried to obtain a modified fibroin fiber (raw material fiber).
[0207] For comparison, commercially available silk fibers, cotton fibers, and polyester fibers were prepared as raw material fibers.
[0208] [Production of Knitted Fabric]
[0209] Knitted fabrics were produced by weft-knitting respective raw material fibers by using a weft-knitting machine. The knitted fabric using PRT918 fibers as raw material fibers had a thickness of 1/30N (metric count, single yarn) and a gauge number of 18. The knitted fabric using PRT799 fibers as raw material fibers had a thickness of 1/30N (metric count, single yarn) and a gauge number of 16. The thickness and gauge number of each of the knitted fabrics formed by using other raw material fibers were adjusted so as to have a cover factor approximately the same as those of the knitted fabrics using PRT918 fibers and PRT799 fibers. Details are as follows.
[0210] Silk thickness: 2/60N (single yarn), gauge number: 14
[0211] Cotton thickness: 2/34 N (two ply yarn), gauge number: 14
[0212] Polyester thickness: 1/60N (single yarn), gauge number: 14
[0213] [Evaluation of Water-Absorbing and Quick-Drying Properties]
[0214] (Evaluation of Water-Absorbing Properties)
[0215] The water-absorbing properties were evaluated by a test in accordance with JIS L 1907 (Testing methods for water-absorbing properties of textiles/dropping method). Specifically, under a standard environment (temperature 20.+-.2.degree. C./humidity 65.+-.4% RH), one drop of water was dropped from the burette onto the surface of the knitted fabric prepared above, and the time until the dropped water droplet was absorbed by the knitted fabric (specular reflection disappeared) was measured (maximum measurement time was 60 seconds). The results are shown in Table 6.
[0216] (Evaluation of Quick-Drying Properties)
[0217] The quick-drying properties were evaluated by measuring the diffusible residual moisture content. Specifically, under a standard environment (temperature 20.+-.2.degree. C./humidity 65.+-.4% RH), 0.6 mL of tap water was dropped onto the back side of the knitted fabric, and the weight of water was determined by measuring the weight of the knitted fabric at every lapse of a certain time (every 5 minutes), and the diffusible residual moisture content was calculated by the following equation.
Diffusible residual moisture content (%)=weight (g) of water at each time/weight (g) of water at the start of measurement.times.100
[0218] The measurement was performed until the diffusible residual moisture content reached 10% or less (that is, the weight of water reached 10% of 0.6 mL=60 .mu.L (60 mg)), and the time required until the diffusible residual moisture content reached 10% was determined. The results are shown in Table 6.
TABLE-US-00006 TABLE 6 Water-absorbing Rapid-drying properties (sec) properties (min) Modified fibroin >60 54 (PRT918) Modified fibroin 3 66 (PRT799) Silk >60 134.3 Cotton 1 125.7 Polyester >60 300.8
[0219] It can be understood that the modified fibroins (PRT799 and PRT918) are excellent in water-absorbing and quick-drying properties. In particular, it can be understood that the hydrophilic modified fibroin (PRT799) is excellent in both water-absorbing properties and quick-drying properties.
Sequence CWU
1
1
44150PRTAraneus diadematus 1Ser Gly Cys Asp Val Leu Val Gln Ala Leu Leu
Glu Val Val Ser Ala1 5 10
15Leu Val Ser Ile Leu Gly Ser Ser Ser Ile Gly Gln Ile Asn Tyr Gly
20 25 30Ala Ser Ala Gln Tyr Thr Gln
Met Val Gly Gln Ser Val Ala Gln Ala 35 40
45Leu Ala 50230PRTAraneus diadematus 2Ser Gly Cys Asp Val Leu
Val Gln Ala Leu Leu Glu Val Val Ser Ala1 5
10 15Leu Val Ser Ile Leu Gly Ser Ser Ser Ile Gly Gln
Ile Asn 20 25
30321PRTAraneus diadematus 3Ser Gly Cys Asp Val Leu Val Gln Ala Leu Leu
Glu Val Val Ser Ala1 5 10
15Leu Val Ser Ile Leu 2041154PRTArtificial
Sequencerecombinant spider silk protein ADF3KaiLargeNRSH1 4Met His
His His His His His His His His His Ser Ser Gly Ser Ser1 5
10 15Leu Glu Val Leu Phe Gln Gly Pro
Ala Arg Ala Gly Ser Gly Gln Gln 20 25
30Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln
Gly 35 40 45Pro Tyr Gly Pro Gly
Ala Ser Ala Ala Ala Ala Ala Ala Gly Gly Tyr 50 55
60Gly Pro Gly Ser Gly Gln Gln Gly Pro Ser Gln Gln Gly Pro
Gly Gln65 70 75 80Gln
Gly Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala
85 90 95Ala Ala Ala Ala Gly Gly Tyr
Gly Pro Gly Ser Gly Gln Gln Gly Pro 100 105
110Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala
Ala Ala 115 120 125Ala Gly Gly Asn
Gly Pro Gly Ser Gly Gln Gln Gly Ala Gly Gln Gln 130
135 140Gly Pro Gly Gln Gln Gly Pro Gly Ala Ser Ala Ala
Ala Ala Ala Ala145 150 155
160Gly Gly Tyr Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln Gly
165 170 175Pro Gly Gly Gln Gly
Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala 180
185 190Ala Ala Gly Gly Tyr Gly Pro Gly Ser Gly Gln Gly
Pro Gly Gln Gln 195 200 205Gly Pro
Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala 210
215 220Ala Ala Ala Gly Gly Tyr Gly Pro Gly Ser Gly
Gln Gln Gly Pro Gly225 230 235
240Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Gly Gln Gly Pro Tyr Gly
245 250 255Pro Gly Ala Ser
Ala Ala Ala Ala Ala Ala Gly Gly Tyr Gly Pro Gly 260
265 270Tyr Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro
Gly Gly Gln Gly Pro 275 280 285Tyr
Gly Pro Gly Ala Ser Ala Ala Ser Ala Ala Ser Gly Gly Tyr Gly 290
295 300Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln
Gln Gly Pro Gly Gly Gln305 310 315
320Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala Gly
Gly 325 330 335Tyr Gly Pro
Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly 340
345 350Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly
Gly Gln Gly Pro Tyr Gly 355 360
365Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala Gly Gly Tyr Gly Pro Gly 370
375 380Ser Gly Gln Gln Gly Pro Gly Gln
Gln Gly Pro Gly Gln Gln Gly Pro385 390
395 400Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Gln
Gln Gly Pro Gly 405 410
415Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Gly
420 425 430Gln Gly Ala Tyr Gly Pro
Gly Ala Ser Ala Ala Ala Gly Ala Ala Gly 435 440
445Gly Tyr Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln
Gly Pro 450 455 460Gly Gln Gln Gly Pro
Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly465 470
475 480Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly
Gln Gln Gly Pro Tyr Gly 485 490
495Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala Gly Gly Tyr Gly Pro Gly
500 505 510Ser Gly Gln Gln Gly
Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro 515
520 525Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ala Ser
Ala Ala Val Ser 530 535 540Val Ser Arg
Ala Arg Ala Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln545
550 555 560Gly Pro Gly Gln Gln Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Pro Gly 565
570 575Ala Ser Ala Ala Ala Ala Ala Ala Gly Gly Tyr Gly
Pro Gly Ser Gly 580 585 590Gln
Gln Gly Pro Ser Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Gly 595
600 605Gln Gly Pro Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala Ala Ala Gly 610 615
620Gly Tyr Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gly Gln Gly Pro625
630 635 640Tyr Gly Pro Gly
Ser Ser Ala Ala Ala Ala Ala Ala Gly Gly Asn Gly 645
650 655Pro Gly Ser Gly Gln Gln Gly Ala Gly Gln
Gln Gly Pro Gly Gln Gln 660 665
670Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala Gly Gly Tyr Gly Pro
675 680 685Gly Ser Gly Gln Gln Gly Pro
Gly Gln Gln Gly Pro Gly Gly Gln Gly 690 695
700Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala Gly Gly
Tyr705 710 715 720Gly Pro
Gly Ser Gly Gln Gly Pro Gly Gln Gln Gly Pro Gly Gly Gln
725 730 735Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala Ala Ala Ala Ala Gly Gly 740 745
750Tyr Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln Gly
Pro Gly 755 760 765Gln Gln Gly Pro
Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala 770
775 780Ala Ala Ala Ala Ala Gly Gly Tyr Gly Pro Gly Tyr
Gly Gln Gln Gly785 790 795
800Pro Gly Gln Gln Gly Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala
805 810 815Ser Ala Ala Ser Ala
Ala Ser Gly Gly Tyr Gly Pro Gly Ser Gly Gln 820
825 830Gln Gly Pro Gly Gln Gln Gly Pro Gly Gly Gln Gly
Pro Tyr Gly Pro 835 840 845Gly Ala
Ser Ala Ala Ala Ala Ala Ala Gly Gly Tyr Gly Pro Gly Ser 850
855 860Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly
Gln Gln Gly Pro Gly865 870 875
880Gln Gln Gly Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala
885 890 895Ala Ala Ala Ala
Ala Gly Gly Tyr Gly Pro Gly Ser Gly Gln Gln Gly 900
905 910Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro
Gly Gln Gln Gly Pro 915 920 925Gly
Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly 930
935 940Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly
Gly Gln Gly Ala Tyr Gly945 950 955
960Pro Gly Ala Ser Ala Ala Ala Gly Ala Ala Gly Gly Tyr Gly Pro
Gly 965 970 975Ser Gly Gln
Gln Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro 980
985 990Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro
Gly Gln Gln Gly Pro Gly 995 1000
1005Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser
1010 1015 1020Ala Ala Ala Ala Ala Ala
Gly Gly Tyr Gly Pro Gly Ser Gly Gln 1025 1030
1035Gln Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly
Gly 1040 1045 1050Gln Gly Pro Tyr Gly
Pro Gly Ala Ala Ser Ala Ala Val Ser Val 1055 1060
1065Gly Gly Tyr Gly Pro Gln Ser Ser Ser Val Pro Val Ala
Ser Ala 1070 1075 1080Val Ala Ser Arg
Leu Ser Ser Pro Ala Ala Ser Ser Arg Val Ser 1085
1090 1095Ser Ala Val Ser Ser Leu Val Ser Ser Gly Pro
Thr Lys His Ala 1100 1105 1110Ala Leu
Ser Asn Thr Ile Ser Ser Val Val Ser Gln Val Ser Ala 1115
1120 1125Ser Asn Pro Gly Leu Ser Gly Cys Asp Val
Leu Val Gln Ala Leu 1130 1135 1140Leu
Glu Val Val Ser Ala Leu Val Ser Ile Leu 1145
1150524PRTArtificial SequenceHis tag and start codon 5Met His His His His
His His His His His His Ser Ser Gly Ser Ser1 5
10 15Leu Glu Val Leu Phe Gln Gly Pro
206597PRTArtificial SequenceMet-PRT380 6Met Gly Pro Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Ala Ser Ala Ala1 5 10
15Ala Ala Ala Gly Gln Asn Gly Pro Gly Ser Gly Gln Gln Gly
Pro Gly 20 25 30Gln Ser Ala
Ala Ala Ala Ala Gly Gln Tyr Gly Pro Gly Gln Gln Gly 35
40 45Pro Gly Gln Gln Gly Pro Gly Ser Ser Ala Ala
Ala Ala Ala Gly Pro 50 55 60Gly Gln
Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala Ala65
70 75 80Ala Ala Gly Pro Gly Ser Gly
Gln Gln Gly Pro Gly Ala Ser Ala Ala 85 90
95Ala Ala Ala Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro
Gly Gln Gln 100 105 110Gly Pro
Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly 115
120 125Pro Gly Gln Gln Gly Pro Tyr Gly Ser Ala
Ala Ala Ala Ala Gly Pro 130 135 140Gly
Ser Gly Gln Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala145
150 155 160Ala Ala Ala Ala Gly Pro
Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro 165
170 175Ser Ala Ser Ala Ala Ala Ala Ala Gly Ser Gly Gln
Gln Gly Pro Gly 180 185 190Gln
Tyr Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly 195
200 205Ser Gly Pro Gly Gln Gln Gly Pro Tyr
Gly Pro Gly Gln Ser Ala Ala 210 215
220Ala Ala Ala Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr225
230 235 240Ala Ser Ala Ala
Ala Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly 245
250 255Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly
Gln Tyr Gly Tyr Gly Pro 260 265
270Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
275 280 285Gly Gln Asn Gly Pro Gly Ser
Gly Gln Tyr Gly Pro Gly Gln Gln Gly 290 295
300Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Gln Gly
Pro305 310 315 320Tyr Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Pro
325 330 335Gly Gln Gln Gly Pro Gly Gln
Tyr Gly Pro Gly Ser Ser Ala Ala Ala 340 345
350Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser
Ser Ala 355 360 365Ala Ala Ala Ala
Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly 370
375 380Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gln Gln Gly Pro385 390 395
400Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
405 410 415Gly Pro Gly Gln Gln
Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala 420
425 430Ala Ala Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln
Gly Pro Ser Ala 435 440 445Ser Ala
Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly Pro Gly Gln Tyr 450
455 460Gly Pro Tyr Gly Pro Gly Gln Ser Ala Ala Ala
Ala Ala Gly Pro Gly465 470 475
480Ser Gly Gln Gln Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala
485 490 495Ala Ala Ala Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro 500
505 510Gly Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly
Ser Gly Gln Tyr Gly 515 520 525Pro
Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Asn Gly Pro Gly Ser 530
535 540Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro
Gly Gln Ser Ala Ala Ala545 550 555
560Ala Ala Gly Gln Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr
Gly 565 570 575Pro Gly Ala
Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln 580
585 590Gly Pro Gly Ala Ser
5957590PRTArtificial SequenceMet-PRT410 7Met Gly Pro Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Ala Ser Ala Ala1 5 10
15Ala Ala Ala Gly Gln Asn Gly Pro Gly Ser Gly Gln Gln Gly
Pro Gly 20 25 30Gln Ser Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly 35
40 45Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Pro
Gly Gln Tyr Gly Pro 50 55 60Gly Gln
Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly65
70 75 80Ser Gly Gln Gln Gly Pro Gly
Ala Ser Gly Gln Tyr Gly Pro Gly Gln 85 90
95Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser Ala Ala
Ala Ala Ala 100 105 110Gly Gln
Tyr Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Ser Ala 115
120 125Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln
Tyr Gly Gln Gly Pro Tyr 130 135 140Gly
Pro Gly Ala Ser Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly145
150 155 160Pro Ser Ala Ser Ala Ala
Ala Ala Ala Gly Ser Gly Gln Gln Gly Pro 165
170 175Gly Gln Tyr Gly Pro Tyr Ala Ser Ala Ala Ala Ala
Ala Gly Gln Tyr 180 185 190Gly
Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser Gly 195
200 205Ser Gly Gln Gln Gly Pro Gly Gln Gln
Gly Pro Tyr Ala Ser Ala Ala 210 215
220Ala Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser Ser225
230 235 240Ala Ala Ala Ala
Ala Gly Gln Tyr Gly Tyr Gly Pro Gly Gln Gln Gly 245
250 255Pro Tyr Gly Pro Gly Ala Ser Gly Gln Asn
Gly Pro Gly Ser Gly Gln 260 265
270Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala
275 280 285Gly Pro Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Ala Ser Ala Ala Ala 290 295
300Ala Ala Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Tyr
Gly305 310 315 320Pro Gly
Ser Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser
325 330 335Ser Ala Ala Ala Ala Ala Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro 340 345
350Tyr Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gln Gln 355 360 365Gly Pro Gly Gln
Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly 370
375 380Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala
Ala Ala Ala Gly385 390 395
400Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala
405 410 415Ala Ala Ala Gly Gln
Tyr Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr 420
425 430Gly Pro Gly Gln Ser Gly Pro Gly Ser Gly Gln Gln
Gly Gln Gly Pro 435 440 445Tyr Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Pro 450
455 460Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser
Ala Ala Ala Ala Ala465 470 475
480Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly
485 490 495Pro Gly Ser Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Ser 500
505 510Ala Ala Ala Ala Ala Gly Gln Tyr Gln Gln Gly
Pro Gly Gln Gln Gly 515 520 525Pro
Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly 530
535 540Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly
Pro Gly Gln Ser Gly Ser545 550 555
560Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala
Ala 565 570 575Ala Ala Gly
Pro Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser 580
585 5908565PRTArtificial SequenceMet-PRT525 8Met Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala1 5
10 15Ala Ala Ala Ala Ala Gly Ser Asn Gly
Pro Gly Ser Gly Gln Gln Gly 20 25
30Pro Gly Gln Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln
35 40 45Gln Gly Pro Gly Ser Ser Ala
Ala Ala Ala Ala Ala Ala Gly Pro Gly 50 55
60Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala65
70 75 80Ala Ala Ala Gly
Pro Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser Gly 85
90 95Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly
Gln Gln Gly Pro Gly Ser 100 105
110Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Ser Gly Pro Gly
115 120 125Gln Gln Gly Pro Tyr Gly Ser
Ala Ala Ala Ala Ala Ala Ala Gly Pro 130 135
140Gly Ser Gly Gln Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser
Gly145 150 155 160Pro Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala
165 170 175Ala Ala Ala Ala Ala Gly Ser
Gly Gln Gln Gly Pro Gly Gln Tyr Gly 180 185
190Pro Tyr Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr
Gly Ser 195 200 205Gly Pro Gly Gln
Gln Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser Gly 210
215 220Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Ala Ser
Ala Ala Ala Ala225 230 235
240Ala Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser Ser
245 250 255Ala Ala Ala Ala Ala
Ala Ala Gly Ser Tyr Gly Tyr Gly Pro Gly Gln 260
265 270Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly Gln Asn
Gly Pro Gly Ser 275 280 285Gly Gln
Tyr Gly Pro Gly Gln Gln Gly Pro Gly Pro Ser Ala Ala Ala 290
295 300Ala Ala Ala Ala Gly Pro Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Ala305 310 315
320Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro Gly Gln Gln
325 330 335Gly Pro Gly Gln
Tyr Gly Pro Gly Ser Ser Gly Pro Gly Gln Gln Gly 340
345 350Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala
Ala Ala Ala Gly Ser 355 360 365Tyr
Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Pro Ser Ala Ala 370
375 380Ala Ala Ala Ala Ala Gly Ser Tyr Gln Gln
Gly Pro Gly Gln Gln Gly385 390 395
400Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Gln Gln Gly Pro Tyr
Gly 405 410 415Pro Gly Ala
Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr 420
425 430Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser
Ala Ala Ala Ala Ala Ala 435 440
445Ala Gly Ser Tyr Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr Gly Pro 450
455 460Gly Gln Ser Gly Pro Gly Ser Gly
Gln Gln Gly Gln Gly Pro Tyr Gly465 470
475 480Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly
Ser Tyr Gly Pro 485 490
495Gly Gln Gln Gly Pro Tyr Gly Pro Gly Pro Ser Ala Ala Ala Ala Ala
500 505 510Ala Ala Gly Pro Gly Ser
Gly Gln Tyr Gly Pro Gly Ala Ser Gly Gln 515 520
525Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly
Pro Gly 530 535 540Pro Ser Ala Ala Ala
Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln545 550
555 560Gly Pro Gly Ala Ser
56592364PRTArtificial SequenceMet-PRT799 9Met Gly Pro Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Ala Ser Ala Ala1 5 10
15Ala Ala Ala Gly Gln Asn Gly Pro Gly Ser Gly Gln Gln Gly
Pro Gly 20 25 30Gln Ser Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly 35
40 45Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Pro
Gly Gln Tyr Gly Pro 50 55 60Gly Gln
Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly65
70 75 80Ser Gly Gln Gln Gly Pro Gly
Ala Ser Gly Gln Tyr Gly Pro Gly Gln 85 90
95Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser Ala Ala
Ala Ala Ala 100 105 110Gly Gln
Tyr Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Ser Ala 115
120 125Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln
Tyr Gly Gln Gly Pro Tyr 130 135 140Gly
Pro Gly Ala Ser Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly145
150 155 160Pro Ser Ala Ser Ala Ala
Ala Ala Ala Gly Ser Gly Gln Gln Gly Pro 165
170 175Gly Gln Tyr Gly Pro Tyr Ala Ser Ala Ala Ala Ala
Ala Gly Gln Tyr 180 185 190Gly
Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser Gly 195
200 205Ser Gly Gln Gln Gly Pro Gly Gln Gln
Gly Pro Tyr Ala Ser Ala Ala 210 215
220Ala Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser Ser225
230 235 240Ala Ala Ala Ala
Ala Gly Gln Tyr Gly Tyr Gly Pro Gly Gln Gln Gly 245
250 255Pro Tyr Gly Pro Gly Ala Ser Gly Gln Asn
Gly Pro Gly Ser Gly Gln 260 265
270Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala
275 280 285Gly Pro Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Ala Ser Ala Ala Ala 290 295
300Ala Ala Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Tyr
Gly305 310 315 320Pro Gly
Ser Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser
325 330 335Ser Ala Ala Ala Ala Ala Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro 340 345
350Tyr Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gln Gln 355 360 365Gly Pro Gly Gln
Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly 370
375 380Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala
Ala Ala Ala Gly385 390 395
400Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala
405 410 415Ala Ala Ala Gly Gln
Tyr Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr 420
425 430Gly Pro Gly Gln Ser Gly Pro Gly Ser Gly Gln Gln
Gly Gln Gly Pro 435 440 445Tyr Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Pro 450
455 460Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser
Ala Ala Ala Ala Ala465 470 475
480Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly
485 490 495Pro Gly Ser Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Ser 500
505 510Ala Ala Ala Ala Ala Gly Gln Tyr Gln Gln Gly
Pro Gly Gln Gln Gly 515 520 525Pro
Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly 530
535 540Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly
Pro Gly Gln Ser Gly Ser545 550 555
560Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala
Ala 565 570 575Ala Ala Gly
Pro Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser Gly Gln 580
585 590Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala Gly Gln 595 600
605Asn Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln Ser Gly Gln Tyr 610
615 620Gly Pro Gly Gln Gln Gly Pro Gly
Gln Gln Gly Pro Gly Ser Ser Ala625 630
635 640Ala Ala Ala Ala Gly Pro Gly Gln Tyr Gly Pro Gly
Gln Gln Gly Pro 645 650
655Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln Gly
660 665 670Pro Gly Ala Ser Gly Gln
Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln 675 680
685Gln Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gly Ser 690 695 700Gly Pro Gly Gln Gln
Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Gly705 710
715 720Pro Gly Ser Gly Gln Tyr Gly Gln Gly Pro
Tyr Gly Pro Gly Ala Ser 725 730
735Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala
740 745 750Ala Ala Ala Ala Gly
Ser Gly Gln Gln Gly Pro Gly Gln Tyr Gly Pro 755
760 765Tyr Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly
Ser Gly Pro Gly 770 775 780Gln Gln Gly
Pro Tyr Gly Pro Gly Gln Ser Gly Ser Gly Gln Gln Gly785
790 795 800Pro Gly Gln Gln Gly Pro Tyr
Ala Ser Ala Ala Ala Ala Ala Gly Pro 805
810 815Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser Ser Ala
Ala Ala Ala Ala 820 825 830Gly
Gln Tyr Gly Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly 835
840 845Ala Ser Gly Gln Asn Gly Pro Gly Ser
Gly Gln Tyr Gly Pro Gly Gln 850 855
860Gln Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Gln865
870 875 880Gly Pro Tyr Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr 885
890 895Gly Pro Gly Gln Gln Gly Pro Gly Gln Tyr
Gly Pro Gly Ser Ser Gly 900 905
910Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala
915 920 925Ala Gly Gln Tyr Gly Pro Gly
Gln Gln Gly Pro Tyr Gly Pro Gly Gln 930 935
940Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gln Gln Gly Pro Gly Gln
Gln945 950 955 960Gly Pro
Tyr Gly Pro Gly Ala Ser Gly Pro Gly Gln Gln Gly Pro Tyr
965 970 975Gly Pro Gly Ala Ser Ala Ala
Ala Ala Ala Gly Pro Gly Gln Tyr Gly 980 985
990Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala
Gly Gln 995 1000 1005Tyr Gly Ser
Gly Pro Gly Gln Tyr Gly Pro Tyr Gly Pro Gly Gln 1010
1015 1020Ser Gly Pro Gly Ser Gly Gln Gln Gly Gln Gly
Pro Tyr Gly Pro 1025 1030 1035Gly Ala
Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Pro Gly Gln 1040
1045 1050Gln Gly Pro Tyr Gly Pro Gly Gln Ser Ala
Ala Ala Ala Ala Gly 1055 1060 1065Pro
Gly Ser Gly Gln Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly 1070
1075 1080Pro Gly Ser Gly Gln Tyr Gly Pro Gly
Gln Gln Gly Pro Gly Gln 1085 1090
1095Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gln Gln Gly Pro Gly Gln
1100 1105 1110Gln Gly Pro Tyr Gly Pro
Gly Ala Ser Ala Ala Ala Ala Ala Gly 1115 1120
1125Gln Tyr Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly 1130 1135 1140Gln Ser Gly Ser Gly
Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr 1145 1150
1155Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln
Gln Gly 1160 1165 1170Pro Gly Ala Ser
Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser 1175
1180 1185Ala Ala Ala Ala Ala Gly Gln Asn Gly Pro Gly
Ser Gly Gln Gln 1190 1195 1200Gly Pro
Gly Gln Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro 1205
1210 1215Gly Gln Gln Gly Pro Gly Ser Ser Ala Ala
Ala Ala Ala Gly Pro 1220 1225 1230Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala 1235
1240 1245Ala Ala Ala Gly Pro Gly Ser Gly Gln
Gln Gly Pro Gly Ala Ser 1250 1255
1260Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro
1265 1270 1275Gly Ser Ser Ala Ala Ala
Ala Ala Gly Gln Tyr Gly Ser Gly Pro 1280 1285
1290Gly Gln Gln Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Gly
Pro 1295 1300 1305Gly Ser Gly Gln Tyr
Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser 1310 1315
1320Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser
Ala Ser 1325 1330 1335Ala Ala Ala Ala
Ala Gly Ser Gly Gln Gln Gly Pro Gly Gln Tyr 1340
1345 1350Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly
Gln Tyr Gly Ser 1355 1360 1365Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser 1370
1375 1380Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro
Tyr Ala Ser Ala Ala 1385 1390 1395Ala
Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser 1400
1405 1410Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gly Tyr Gly Pro Gly Gln 1415 1420
1425Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly
1430 1435 1440Ser Gly Gln Tyr Gly Pro
Gly Gln Gln Gly Pro Gly Gln Ser Ala 1445 1450
1455Ala Ala Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly 1460 1465 1470Ala Ser Ala Ala Ala
Ala Ala Gly Gln Tyr Gly Pro Gly Gln Gln 1475 1480
1485Gly Pro Gly Gln Tyr Gly Pro Gly Ser Ser Gly Pro Gly
Gln Gln 1490 1495 1500Gly Pro Tyr Gly
Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln 1505
1510 1515Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly Gln Ser Ala 1520 1525 1530Ala Ala
Ala Ala Gly Gln Tyr Gln Gln Gly Pro Gly Gln Gln Gly 1535
1540 1545Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly
Gln Gln Gly Pro Tyr 1550 1555 1560Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr 1565
1570 1575Gly Pro Gly Gln Gln Gly Pro Ser Ala
Ser Ala Ala Ala Ala Ala 1580 1585
1590Gly Gln Tyr Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr Gly Pro
1595 1600 1605Gly Gln Ser Gly Pro Gly
Ser Gly Gln Gln Gly Gln Gly Pro Tyr 1610 1615
1620Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly
Pro 1625 1630 1635Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Gln Ser Ala Ala Ala Ala 1640 1645
1650Ala Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Ala Ser
Gly Gln 1655 1660 1665Asn Gly Pro Gly
Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro 1670
1675 1680Gly Gln Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gln Gln Gly Pro 1685 1690 1695Gly Gln
Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala 1700
1705 1710Ala Gly Gln Tyr Gly Ser Gly Pro Gly Gln
Gln Gly Pro Tyr Gly 1715 1720 1725Pro
Gly Gln Ser Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln Gly 1730
1735 1740Pro Tyr Ala Ser Ala Ala Ala Ala Ala
Gly Pro Gly Ser Gly Gln 1745 1750
1755Gln Gly Pro Gly Ala Ser Gly Gln Gln Gly Pro Tyr Gly Pro Gly
1760 1765 1770Ala Ser Ala Ala Ala Ala
Ala Gly Gln Asn Gly Pro Gly Ser Gly 1775 1780
1785Gln Gln Gly Pro Gly Gln Ser Gly Gln Tyr Gly Pro Gly Gln
Gln 1790 1795 1800Gly Pro Gly Gln Gln
Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala 1805 1810
1815Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser
Ala Ser 1820 1825 1830Ala Ala Ala Ala
Ala Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly 1835
1840 1845Ala Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly
Pro Gly Gln Gln 1850 1855 1860Gly Pro
Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser 1865
1870 1875Gly Pro Gly Gln Gln Gly Pro Tyr Gly Ser
Ala Ala Ala Ala Ala 1880 1885 1890Gly
Pro Gly Ser Gly Gln Tyr Gly Gln Gly Pro Tyr Gly Pro Gly 1895
1900 1905Ala Ser Gly Pro Gly Gln Tyr Gly Pro
Gly Gln Gln Gly Pro Ser 1910 1915
1920Ala Ser Ala Ala Ala Ala Ala Gly Ser Gly Gln Gln Gly Pro Gly
1925 1930 1935Gln Tyr Gly Pro Tyr Ala
Ser Ala Ala Ala Ala Ala Gly Gln Tyr 1940 1945
1950Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln
Ser 1955 1960 1965Gly Ser Gly Gln Gln
Gly Pro Gly Gln Gln Gly Pro Tyr Ala Ser 1970 1975
1980Ala Ala Ala Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr
Gly Pro 1985 1990 1995Gly Ser Ser Ala
Ala Ala Ala Ala Gly Gln Tyr Gly Tyr Gly Pro 2000
2005 2010Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser
Gly Gln Asn Gly 2015 2020 2025Pro Gly
Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln 2030
2035 2040Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln
Gln Gly Pro Tyr Gly 2045 2050 2055Pro
Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Pro Gly 2060
2065 2070Gln Gln Gly Pro Gly Gln Tyr Gly Pro
Gly Ser Ser Gly Pro Gly 2075 2080
2085Gln Gln Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala
2090 2095 2100Gly Gln Tyr Gly Pro Gly
Gln Gln Gly Pro Tyr Gly Pro Gly Gln 2105 2110
2115Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gln Gln Gly Pro Gly
Gln 2120 2125 2130Gln Gly Pro Tyr Gly
Pro Gly Ala Ser Gly Pro Gly Gln Gln Gly 2135 2140
2145Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly
Pro Gly 2150 2155 2160Gln Tyr Gly Pro
Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala Ala 2165
2170 2175Ala Ala Gly Gln Tyr Gly Ser Gly Pro Gly Gln
Tyr Gly Pro Tyr 2180 2185 2190Gly Pro
Gly Gln Ser Gly Pro Gly Ser Gly Gln Gln Gly Gln Gly 2195
2200 2205Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala
Ala Ala Gly Gln Tyr 2210 2215 2220Gly
Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser Ala Ala 2225
2230 2235Ala Ala Ala Gly Pro Gly Ser Gly Gln
Tyr Gly Pro Gly Ala Ser 2240 2245
2250Gly Gln Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Gln Gln
2255 2260 2265Gly Pro Gly Gln Ser Ala
Ala Ala Ala Ala Gly Gln Tyr Gln Gln 2270 2275
2280Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala
Ala 2285 2290 2295Ala Ala Ala Gly Gln
Tyr Gly Ser Gly Pro Gly Gln Gln Gly Pro 2300 2305
2310Tyr Gly Pro Gly Gln Ser Gly Ser Gly Gln Gln Gly Pro
Gly Gln 2315 2320 2325Gln Gly Pro Tyr
Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser 2330
2335 2340Gly Gln Gln Gly Ser Ser Val Asp Lys Leu Ala
Ala Ala Leu Glu 2345 2350 2355His His
His His His His 236010597PRTArtificial SequenceMet-PRT313 10Met Gly
Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala1 5
10 15Ala Ala Ala Gly Gly Asn Gly Pro
Gly Ser Gly Gln Gln Gly Pro Gly 20 25
30Gly Ser Ala Ala Ala Ala Ala Gly Gly Tyr Gly Pro Gly Gly Gln
Gly 35 40 45Pro Gly Gln Gln Gly
Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Pro 50 55
60Gly Gly Tyr Gly Pro Gly Gly Gln Gly Pro Ser Ala Ser Ala
Ala Ala65 70 75 80Ala
Ala Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser Ala Ala
85 90 95Ala Ala Ala Gly Gly Tyr Gly
Pro Gly Gly Gln Gly Pro Gly Gln Gln 100 105
110Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Gly Tyr Gly
Ser Gly 115 120 125Pro Gly Gln Gln
Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Gly Pro 130
135 140Gly Ser Gly Gly Tyr Gly Gln Gly Pro Tyr Gly Pro
Gly Ala Ser Ala145 150 155
160Ala Ala Ala Ala Gly Pro Gly Gly Tyr Gly Pro Gly Gly Gln Gly Pro
165 170 175Ser Ala Ser Ala Ala
Ala Ala Ala Gly Ser Gly Gln Gln Gly Pro Gly 180
185 190Gly Tyr Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala
Gly Gly Tyr Gly 195 200 205Ser Gly
Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gly Ser Ala Ala 210
215 220Ala Ala Ala Gly Ser Gly Gln Gln Gly Pro Gly
Gln Gln Gly Pro Tyr225 230 235
240Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Gly Gln Gly Pro Tyr Gly
245 250 255Pro Gly Ser Ser
Ala Ala Ala Ala Ala Gly Gly Tyr Gly Tyr Gly Pro 260
265 270Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala Ala 275 280 285Gly
Gly Asn Gly Pro Gly Ser Gly Gly Tyr Gly Pro Gly Gln Gln Gly 290
295 300Pro Gly Gly Ser Ala Ala Ala Ala Ala Gly
Pro Gly Gly Gln Gly Pro305 310 315
320Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gly Tyr Gly
Pro 325 330 335Gly Gly Gln
Gly Pro Gly Gly Tyr Gly Pro Gly Ser Ser Ala Ala Ala 340
345 350Ala Ala Gly Pro Gly Gly Gln Gly Pro Tyr
Gly Pro Gly Ser Ser Ala 355 360
365Ala Ala Ala Ala Gly Gly Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly 370
375 380Pro Gly Gly Ser Ala Ala Ala Ala
Ala Gly Gly Tyr Gln Gln Gly Pro385 390
395 400Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala 405 410
415Gly Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala
420 425 430Ala Ala Gly Pro Gly Gly
Tyr Gly Pro Gly Gly Gln Gly Pro Ser Ala 435 440
445Ser Ala Ala Ala Ala Ala Gly Gly Tyr Gly Ser Gly Pro Gly
Gly Tyr 450 455 460Gly Pro Tyr Gly Pro
Gly Gly Ser Ala Ala Ala Ala Ala Gly Pro Gly465 470
475 480Ser Gly Gln Gln Gly Gln Gly Pro Tyr Gly
Pro Gly Ala Ser Ala Ala 485 490
495Ala Ala Ala Gly Gly Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
500 505 510Gly Gly Ser Ala Ala
Ala Ala Ala Gly Pro Gly Ser Gly Gly Tyr Gly 515
520 525Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gly Asn
Gly Pro Gly Ser 530 535 540Gly Gly Tyr
Gly Pro Gly Gln Gln Gly Pro Gly Gly Ser Ala Ala Ala545
550 555 560Ala Ala Gly Gly Tyr Gln Gln
Gly Pro Gly Gly Gln Gly Pro Tyr Gly 565
570 575Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly
Ser Gly Gln Gln 580 585 590Gly
Pro Gly Ala Ser 5951112PRTArtificial SequenceHisTag 11Met His His
His His His His Ser Ser Gly Ser Ser1 5
1012608PRTArtificial SequencePRT380 12Met His His His His His His Ser Ser
Gly Ser Ser Gly Pro Gly Gln1 5 10
15Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly
Gln 20 25 30Asn Gly Pro Gly
Ser Gly Gln Gln Gly Pro Gly Gln Ser Ala Ala Ala 35
40 45Ala Ala Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro
Gly Gln Gln Gly 50 55 60Pro Gly Ser
Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Gly Pro65 70
75 80Gly Gln Gln Gly Pro Ser Ala Ser
Ala Ala Ala Ala Ala Gly Pro Gly 85 90
95Ser Gly Gln Gln Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
Gly Gln 100 105 110Tyr Gly Pro
Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser 115
120 125Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly
Pro Gly Gln Gln Gly 130 135 140Pro Tyr
Gly Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Tyr145
150 155 160Gly Gln Gly Pro Tyr Gly Pro
Gly Ala Ser Ala Ala Ala Ala Ala Gly 165
170 175Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser
Ala Ser Ala Ala 180 185 190Ala
Ala Ala Gly Ser Gly Gln Gln Gly Pro Gly Gln Tyr Gly Pro Tyr 195
200 205Ala Ser Ala Ala Ala Ala Ala Gly Gln
Tyr Gly Ser Gly Pro Gly Gln 210 215
220Gln Gly Pro Tyr Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Ser225
230 235 240Gly Gln Gln Gly
Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala Ala 245
250 255Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr
Gly Pro Gly Ser Ser Ala 260 265
270Ala Ala Ala Ala Gly Gln Tyr Gly Tyr Gly Pro Gly Gln Gln Gly Pro
275 280 285Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala Gly Gln Asn Gly Pro 290 295
300Gly Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Ser
Ala305 310 315 320Ala Ala
Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala
325 330 335Ser Ala Ala Ala Ala Ala Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro 340 345
350Gly Gln Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly
Pro Gly 355 360 365Gln Gln Gly Pro
Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly 370
375 380Gln Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly Gln Ser Ala385 390 395
400Ala Ala Ala Ala Gly Gln Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro
405 410 415Tyr Gly Pro Gly Ala
Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Gln 420
425 430Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala
Ala Gly Pro Gly 435 440 445Gln Tyr
Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala 450
455 460Ala Gly Gln Tyr Gly Ser Gly Pro Gly Gln Tyr
Gly Pro Tyr Gly Pro465 470 475
480Gly Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln Gly
485 490 495Gln Gly Pro Tyr
Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln 500
505 510Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly Gln Ser Ala Ala 515 520 525Ala
Ala Ala Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Ala Ser Ala 530
535 540Ala Ala Ala Ala Gly Gln Asn Gly Pro Gly
Ser Gly Gln Tyr Gly Pro545 550 555
560Gly Gln Gln Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Gln
Tyr 565 570 575Gln Gln Gly
Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala 580
585 590Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln
Gln Gly Pro Gly Ala Ser 595 600
60513601PRTArtificial SequencePRT410 13Met His His His His His His Ser
Ser Gly Ser Ser Gly Pro Gly Gln1 5 10
15Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
Gly Gln 20 25 30Asn Gly Pro
Gly Ser Gly Gln Gln Gly Pro Gly Gln Ser Gly Gln Tyr 35
40 45Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly
Pro Gly Ser Ser Ala 50 55 60Ala Ala
Ala Ala Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro65
70 75 80Ser Ala Ser Ala Ala Ala Ala
Ala Gly Pro Gly Ser Gly Gln Gln Gly 85 90
95Pro Gly Ala Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly
Pro Gly Gln 100 105 110Gln Gly
Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser 115
120 125Gly Pro Gly Gln Gln Gly Pro Tyr Gly Ser
Ala Ala Ala Ala Ala Gly 130 135 140Pro
Gly Ser Gly Gln Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser145
150 155 160Gly Pro Gly Gln Tyr Gly
Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala 165
170 175Ala Ala Ala Ala Gly Ser Gly Gln Gln Gly Pro Gly
Gln Tyr Gly Pro 180 185 190Tyr
Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly Pro Gly 195
200 205Gln Gln Gly Pro Tyr Gly Pro Gly Gln
Ser Gly Ser Gly Gln Gln Gly 210 215
220Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro225
230 235 240Gly Gln Gln Gly
Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala 245
250 255Gly Gln Tyr Gly Tyr Gly Pro Gly Gln Gln
Gly Pro Tyr Gly Pro Gly 260 265
270Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Gln
275 280 285Gln Gly Pro Gly Gln Ser Ala
Ala Ala Ala Ala Gly Pro Gly Gln Gln 290 295
300Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln
Tyr305 310 315 320Gly Pro
Gly Gln Gln Gly Pro Gly Gln Tyr Gly Pro Gly Ser Ser Gly
325 330 335Pro Gly Gln Gln Gly Pro Tyr
Gly Pro Gly Ser Ser Ala Ala Ala Ala 340 345
350Ala Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly Gln 355 360 365Ser Ala Ala Ala
Ala Ala Gly Gln Tyr Gln Gln Gly Pro Gly Gln Gln 370
375 380Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Gln
Gln Gly Pro Tyr385 390 395
400Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Gly
405 410 415Pro Gly Gln Gln Gly
Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Gln 420
425 430Tyr Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr Gly
Pro Gly Gln Ser 435 440 445Gly Pro
Gly Ser Gly Gln Gln Gly Gln Gly Pro Tyr Gly Pro Gly Ala 450
455 460Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Pro
Gly Gln Gln Gly Pro465 470 475
480Tyr Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly
485 490 495Gln Tyr Gly Pro
Gly Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln 500
505 510Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Ser
Ala Ala Ala Ala Ala 515 520 525Gly
Gln Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly 530
535 540Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gly Ser Gly Pro Gly Gln545 550 555
560Gln Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser Gly Gln Gln Gly
Pro 565 570 575Gly Gln Gln
Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly 580
585 590Ser Gly Gln Gln Gly Pro Gly Ala Ser
595 60014576PRTArtificial SequencePRT525 14Met His His
His His His His Ser Ser Gly Ser Ser Gly Pro Gly Gln1 5
10 15Gln Gly Pro Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala Ala Ala Ala 20 25
30Gly Ser Asn Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln Ser Gly
35 40 45Gln Tyr Gly Pro Gly Gln Gln
Gly Pro Gly Gln Gln Gly Pro Gly Ser 50 55
60Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Gly Pro Gly65
70 75 80Gln Gln Gly Pro
Ser Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro 85
90 95Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser
Gly Gln Tyr Gly Pro Gly 100 105
110Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser Ala Ala Ala Ala
115 120 125Ala Ala Ala Gly Ser Tyr Gly
Ser Gly Pro Gly Gln Gln Gly Pro Tyr 130 135
140Gly Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln
Tyr145 150 155 160Gly Gln
Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Gln Tyr Gly
165 170 175Pro Gly Gln Gln Gly Pro Ser
Ala Ser Ala Ala Ala Ala Ala Ala Ala 180 185
190Gly Ser Gly Gln Gln Gly Pro Gly Gln Tyr Gly Pro Tyr Ala
Ser Ala 195 200 205Ala Ala Ala Ala
Ala Ala Gly Ser Tyr Gly Ser Gly Pro Gly Gln Gln 210
215 220Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser Gly Gln
Gln Gly Pro Gly225 230 235
240Gln Gln Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro
245 250 255Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala 260
265 270Ala Ala Gly Ser Tyr Gly Tyr Gly Pro Gly Gln Gln
Gly Pro Tyr Gly 275 280 285Pro Gly
Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro 290
295 300Gly Gln Gln Gly Pro Gly Pro Ser Ala Ala Ala
Ala Ala Ala Ala Gly305 310 315
320Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala
325 330 335Ala Ala Ala Gly
Ser Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Tyr 340
345 350Gly Pro Gly Ser Ser Gly Pro Gly Gln Gln Gly
Pro Tyr Gly Pro Gly 355 360 365Ser
Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro Gly Gln 370
375 380Gln Gly Pro Tyr Gly Pro Gly Pro Ser Ala
Ala Ala Ala Ala Ala Ala385 390 395
400Gly Ser Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly 405 410 415Ala Ser Gly
Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala 420
425 430Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln
Tyr Gly Pro Gly Gln Gln 435 440
445Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly 450
455 460Ser Gly Pro Gly Gln Tyr Gly Pro
Tyr Gly Pro Gly Gln Ser Gly Pro465 470
475 480Gly Ser Gly Gln Gln Gly Gln Gly Pro Tyr Gly Pro
Gly Ala Ser Ala 485 490
495Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro Gly Gln Gln Gly Pro
500 505 510Tyr Gly Pro Gly Pro Ser
Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly 515 520
525Ser Gly Gln Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro
Gly Ser 530 535 540Gly Gln Tyr Gly Pro
Gly Gln Gln Gly Pro Gly Pro Ser Ala Ala Ala545 550
555 560Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln
Gln Gly Pro Gly Ala Ser 565 570
575152375PRTArtificial SequencePRT799 15Met His His His His His His
Ser Ser Gly Ser Ser Gly Pro Gly Gln1 5 10
15Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala
Ala Gly Gln 20 25 30Asn Gly
Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln Ser Gly Gln Tyr 35
40 45Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln
Gly Pro Gly Ser Ser Ala 50 55 60Ala
Ala Ala Ala Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro65
70 75 80Ser Ala Ser Ala Ala Ala
Ala Ala Gly Pro Gly Ser Gly Gln Gln Gly 85
90 95Pro Gly Ala Ser Gly Gln Tyr Gly Pro Gly Gln Gln
Gly Pro Gly Gln 100 105 110Gln
Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser 115
120 125Gly Pro Gly Gln Gln Gly Pro Tyr Gly
Ser Ala Ala Ala Ala Ala Gly 130 135
140Pro Gly Ser Gly Gln Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser145
150 155 160Gly Pro Gly Gln
Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala 165
170 175Ala Ala Ala Ala Gly Ser Gly Gln Gln Gly
Pro Gly Gln Tyr Gly Pro 180 185
190Tyr Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly Pro Gly
195 200 205Gln Gln Gly Pro Tyr Gly Pro
Gly Gln Ser Gly Ser Gly Gln Gln Gly 210 215
220Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly
Pro225 230 235 240Gly Gln
Gln Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala
245 250 255Gly Gln Tyr Gly Tyr Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Pro Gly 260 265
270Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro
Gly Gln 275 280 285Gln Gly Pro Gly
Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Gln 290
295 300Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala
Ala Gly Gln Tyr305 310 315
320Gly Pro Gly Gln Gln Gly Pro Gly Gln Tyr Gly Pro Gly Ser Ser Gly
325 330 335Pro Gly Gln Gln Gly
Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala 340
345 350Ala Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Tyr
Gly Pro Gly Gln 355 360 365Ser Ala
Ala Ala Ala Ala Gly Gln Tyr Gln Gln Gly Pro Gly Gln Gln 370
375 380Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly
Gln Gln Gly Pro Tyr385 390 395
400Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Gly
405 410 415Pro Gly Gln Gln
Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Gln 420
425 430Tyr Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr
Gly Pro Gly Gln Ser 435 440 445Gly
Pro Gly Ser Gly Gln Gln Gly Gln Gly Pro Tyr Gly Pro Gly Ala 450
455 460Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly
Pro Gly Gln Gln Gly Pro465 470 475
480Tyr Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser
Gly 485 490 495Gln Tyr Gly
Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln 500
505 510Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln
Ser Ala Ala Ala Ala Ala 515 520
525Gly Gln Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly 530
535 540Ala Ser Ala Ala Ala Ala Ala Gly
Gln Tyr Gly Ser Gly Pro Gly Gln545 550
555 560Gln Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser Gly
Gln Gln Gly Pro 565 570
575Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly
580 585 590Ser Gly Gln Gln Gly Pro
Gly Ala Ser Gly Gln Gln Gly Pro Tyr Gly 595 600
605Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Asn Gly Pro
Gly Ser 610 615 620Gly Gln Gln Gly Pro
Gly Gln Ser Gly Gln Tyr Gly Pro Gly Gln Gln625 630
635 640Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser
Ala Ala Ala Ala Ala Gly 645 650
655Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala
660 665 670Ala Ala Ala Gly Pro
Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser Gly 675
680 685Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln
Gly Pro Gly Ser 690 695 700Ser Ala Ala
Ala Ala Ala Gly Gln Tyr Gly Ser Gly Pro Gly Gln Gln705
710 715 720Gly Pro Tyr Gly Ser Ala Ala
Ala Ala Ala Gly Pro Gly Ser Gly Gln 725
730 735Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly
Pro Gly Gln Tyr 740 745 750Gly
Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly 755
760 765Ser Gly Gln Gln Gly Pro Gly Gln Tyr
Gly Pro Tyr Ala Ser Ala Ala 770 775
780Ala Ala Ala Gly Gln Tyr Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr785
790 795 800Gly Pro Gly Gln
Ser Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln Gly 805
810 815Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly
Pro Gly Gln Gln Gly Pro 820 825
830Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Tyr
835 840 845Gly Pro Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Ala Ser Gly Gln Asn 850 855
860Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly
Gln865 870 875 880Ser Ala
Ala Ala Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
885 890 895Gly Ala Ser Ala Ala Ala Ala
Ala Gly Gln Tyr Gly Pro Gly Gln Gln 900 905
910Gly Pro Gly Gln Tyr Gly Pro Gly Ser Ser Gly Pro Gly Gln
Gln Gly 915 920 925Pro Tyr Gly Pro
Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly 930
935 940Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser
Ala Ala Ala Ala945 950 955
960Ala Gly Gln Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
965 970 975Gly Ala Ser Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser 980
985 990Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Gly Pro
Gly Gln Gln Gly 995 1000 1005Pro
Ser Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly 1010
1015 1020Pro Gly Gln Tyr Gly Pro Tyr Gly Pro
Gly Gln Ser Gly Pro Gly 1025 1030
1035Ser Gly Gln Gln Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala
1040 1045 1050Ala Ala Ala Ala Gly Gln
Tyr Gly Pro Gly Gln Gln Gly Pro Tyr 1055 1060
1065Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser
Gly 1070 1075 1080Gln Tyr Gly Pro Gly
Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly 1085 1090
1095Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Ser Ala
Ala Ala 1100 1105 1110Ala Ala Gly Gln
Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr 1115
1120 1125Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly
Gln Tyr Gly Ser 1130 1135 1140Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser 1145
1150 1155Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro
Tyr Ala Ser Ala Ala 1160 1165 1170Ala
Ala Ala Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser 1175
1180 1185Gly Gln Gln Gly Pro Tyr Gly Pro Gly
Ala Ser Ala Ala Ala Ala 1190 1195
1200Ala Gly Gln Asn Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln
1205 1210 1215Ser Gly Gln Tyr Gly Pro
Gly Gln Gln Gly Pro Gly Gln Gln Gly 1220 1225
1230Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr
Gly 1235 1240 1245Pro Gly Gln Gln Gly
Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly 1250 1255
1260Pro Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser Gly Gln
Tyr Gly 1265 1270 1275Pro Gly Gln Gln
Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser Ala 1280
1285 1290Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly Pro
Gly Gln Gln Gly 1295 1300 1305Pro Tyr
Gly Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln 1310
1315 1320Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala
Ser Gly Pro Gly Gln 1325 1330 1335Tyr
Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala 1340
1345 1350Ala Gly Ser Gly Gln Gln Gly Pro Gly
Gln Tyr Gly Pro Tyr Ala 1355 1360
1365Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly Pro Gly Gln
1370 1375 1380Gln Gly Pro Tyr Gly Pro
Gly Gln Ser Gly Ser Gly Gln Gln Gly 1385 1390
1395Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala
Gly 1400 1405 1410Pro Gly Gln Gln Gly
Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala 1415 1420
1425Ala Ala Gly Gln Tyr Gly Tyr Gly Pro Gly Gln Gln Gly
Pro Tyr 1430 1435 1440Gly Pro Gly Ala
Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln Tyr 1445
1450 1455Gly Pro Gly Gln Gln Gly Pro Gly Gln Ser Ala
Ala Ala Ala Ala 1460 1465 1470Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala 1475
1480 1485Ala Ala Ala Gly Gln Tyr Gly Pro Gly Gln
Gln Gly Pro Gly Gln 1490 1495 1500Tyr
Gly Pro Gly Ser Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly 1505
1510 1515Pro Gly Ser Ser Ala Ala Ala Ala Ala
Gly Gln Tyr Gly Pro Gly 1520 1525
1530Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala
1535 1540 1545Gly Gln Tyr Gln Gln Gly
Pro Gly Gln Gln Gly Pro Tyr Gly Pro 1550 1555
1560Gly Ala Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly
Ala 1565 1570 1575Ser Ala Ala Ala Ala
Ala Gly Pro Gly Gln Tyr Gly Pro Gly Gln 1580 1585
1590Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Gln
Tyr Gly 1595 1600 1605Ser Gly Pro Gly
Gln Tyr Gly Pro Tyr Gly Pro Gly Gln Ser Gly 1610
1615 1620Pro Gly Ser Gly Gln Gln Gly Gln Gly Pro Tyr
Gly Pro Gly Ala 1625 1630 1635Ser Ala
Ala Ala Ala Ala Gly Gln Tyr Gly Pro Gly Gln Gln Gly 1640
1645 1650Pro Tyr Gly Pro Gly Gln Ser Ala Ala Ala
Ala Ala Gly Pro Gly 1655 1660 1665Ser
Gly Gln Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly 1670
1675 1680Ser Gly Gln Tyr Gly Pro Gly Gln Gln
Gly Pro Gly Gln Ser Ala 1685 1690
1695Ala Ala Ala Ala Gly Gln Tyr Gln Gln Gly Pro Gly Gln Gln Gly
1700 1705 1710Pro Tyr Gly Pro Gly Ala
Ser Ala Ala Ala Ala Ala Gly Gln Tyr 1715 1720
1725Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln
Ser 1730 1735 1740Gly Ser Gly Gln Gln
Gly Pro Gly Gln Gln Gly Pro Tyr Ala Ser 1745 1750
1755Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln Gly
Pro Gly 1760 1765 1770Ala Ser Gly Gln
Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala 1775
1780 1785Ala Ala Ala Gly Gln Asn Gly Pro Gly Ser Gly
Gln Gln Gly Pro 1790 1795 1800Gly Gln
Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln 1805
1810 1815Gln Gly Pro Gly Ser Ser Ala Ala Ala Ala
Ala Gly Pro Gly Gln 1820 1825 1830Tyr
Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala 1835
1840 1845Ala Gly Pro Gly Ser Gly Gln Gln Gly
Pro Gly Ala Ser Gly Gln 1850 1855
1860Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser
1865 1870 1875Ser Ala Ala Ala Ala Ala
Gly Gln Tyr Gly Ser Gly Pro Gly Gln 1880 1885
1890Gln Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Gly Pro Gly
Ser 1895 1900 1905Gly Gln Tyr Gly Gln
Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro 1910 1915
1920Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser
Ala Ala 1925 1930 1935Ala Ala Ala Gly
Ser Gly Gln Gln Gly Pro Gly Gln Tyr Gly Pro 1940
1945 1950Tyr Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gly Ser Gly Pro 1955 1960 1965Gly Gln
Gln Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser Gly Gln 1970
1975 1980Gln Gly Pro Gly Gln Gln Gly Pro Tyr Ala
Ser Ala Ala Ala Ala 1985 1990 1995Ala
Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser Ser Ala 2000
2005 2010Ala Ala Ala Ala Gly Gln Tyr Gly Tyr
Gly Pro Gly Gln Gln Gly 2015 2020
2025Pro Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly
2030 2035 2040Gln Tyr Gly Pro Gly Gln
Gln Gly Pro Gly Gln Ser Ala Ala Ala 2045 2050
2055Ala Ala Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala
Ser 2060 2065 2070Ala Ala Ala Ala Ala
Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro 2075 2080
2085Gly Gln Tyr Gly Pro Gly Ser Ser Gly Pro Gly Gln Gln
Gly Pro 2090 2095 2100Tyr Gly Pro Gly
Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly 2105
2110 2115Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln
Ser Ala Ala Ala 2120 2125 2130Ala Ala
Gly Gln Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr 2135
2140 2145Gly Pro Gly Ala Ser Gly Pro Gly Gln Gln
Gly Pro Tyr Gly Pro 2150 2155 2160Gly
Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Gly Pro 2165
2170 2175Gly Gln Gln Gly Pro Ser Ala Ser Ala
Ala Ala Ala Ala Gly Gln 2180 2185
2190Tyr Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr Gly Pro Gly Gln
2195 2200 2205Ser Gly Pro Gly Ser Gly
Gln Gln Gly Gln Gly Pro Tyr Gly Pro 2210 2215
2220Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Pro Gly
Gln 2225 2230 2235Gln Gly Pro Tyr Gly
Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly 2240 2245
2250Pro Gly Ser Gly Gln Tyr Gly Pro Gly Ala Ser Gly Gln
Asn Gly 2255 2260 2265Pro Gly Ser Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln 2270
2275 2280Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gln Gln
Gly Pro Gly Gln 2285 2290 2295Gln Gly
Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly 2300
2305 2310Gln Tyr Gly Ser Gly Pro Gly Gln Gln Gly
Pro Tyr Gly Pro Gly 2315 2320 2325Gln
Ser Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr 2330
2335 2340Ala Ser Ala Ala Ala Ala Ala Gly Pro
Gly Ser Gly Gln Gln Gly 2345 2350
2355Ser Ser Val Asp Lys Leu Ala Ala Ala Leu Glu His His His His
2360 2365 2370His His
237516608PRTArtificial SequencePRT313 16Met His His His His His His Ser
Ser Gly Ser Ser Gly Pro Gly Gly1 5 10
15Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
Gly Gly 20 25 30Asn Gly Pro
Gly Ser Gly Gln Gln Gly Pro Gly Gly Ser Ala Ala Ala 35
40 45Ala Ala Gly Gly Tyr Gly Pro Gly Gly Gln Gly
Pro Gly Gln Gln Gly 50 55 60Pro Gly
Ser Ser Ala Ala Ala Ala Ala Gly Pro Gly Gly Tyr Gly Pro65
70 75 80Gly Gly Gln Gly Pro Ser Ala
Ser Ala Ala Ala Ala Ala Gly Pro Gly 85 90
95Ser Gly Gln Gln Gly Pro Gly Ala Ser Ala Ala Ala Ala
Ala Gly Gly 100 105 110Tyr Gly
Pro Gly Gly Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser 115
120 125Ala Ala Ala Ala Ala Gly Gly Tyr Gly Ser
Gly Pro Gly Gln Gln Gly 130 135 140Pro
Tyr Gly Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gly Tyr145
150 155 160Gly Gln Gly Pro Tyr Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly 165
170 175Pro Gly Gly Tyr Gly Pro Gly Gly Gln Gly Pro Ser
Ala Ser Ala Ala 180 185 190Ala
Ala Ala Gly Ser Gly Gln Gln Gly Pro Gly Gly Tyr Gly Pro Tyr 195
200 205Ala Ser Ala Ala Ala Ala Ala Gly Gly
Tyr Gly Ser Gly Pro Gly Gln 210 215
220Gln Gly Pro Tyr Gly Pro Gly Gly Ser Ala Ala Ala Ala Ala Gly Ser225
230 235 240Gly Gln Gln Gly
Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala Ala 245
250 255Ala Ala Gly Pro Gly Gly Gln Gly Pro Tyr
Gly Pro Gly Ser Ser Ala 260 265
270Ala Ala Ala Ala Gly Gly Tyr Gly Tyr Gly Pro Gly Gly Gln Gly Pro
275 280 285Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala Gly Gly Asn Gly Pro 290 295
300Gly Ser Gly Gly Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gly Ser
Ala305 310 315 320Ala Ala
Ala Ala Gly Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala
325 330 335Ser Ala Ala Ala Ala Ala Gly
Gly Tyr Gly Pro Gly Gly Gln Gly Pro 340 345
350Gly Gly Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly
Pro Gly 355 360 365Gly Gln Gly Pro
Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly 370
375 380Gly Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly Gly Ser Ala385 390 395
400Ala Ala Ala Ala Gly Gly Tyr Gln Gln Gly Pro Gly Gly Gln Gly Pro
405 410 415Tyr Gly Pro Gly Ala
Ser Ala Ala Ala Ala Ala Gly Pro Gly Gly Gln 420
425 430Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala
Ala Gly Pro Gly 435 440 445Gly Tyr
Gly Pro Gly Gly Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala 450
455 460Ala Gly Gly Tyr Gly Ser Gly Pro Gly Gly Tyr
Gly Pro Tyr Gly Pro465 470 475
480Gly Gly Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln Gly
485 490 495Gln Gly Pro Tyr
Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gly 500
505 510Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly Gly Ser Ala Ala 515 520 525Ala
Ala Ala Gly Pro Gly Ser Gly Gly Tyr Gly Pro Gly Ala Ser Ala 530
535 540Ala Ala Ala Ala Gly Gly Asn Gly Pro Gly
Ser Gly Gly Tyr Gly Pro545 550 555
560Gly Gln Gln Gly Pro Gly Gly Ser Ala Ala Ala Ala Ala Gly Gly
Tyr 565 570 575Gln Gln Gly
Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala 580
585 590Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln
Gln Gly Pro Gly Ala Ser 595 600
60517590PRTArtificial SequenceMet-PRT399 17Met Gly Pro Gly Gly Gln Gly
Pro Tyr Gly Pro Gly Ala Ser Ala Ala1 5 10
15Ala Ala Ala Gly Gly Asn Gly Pro Gly Ser Gly Gln Gln
Gly Pro Gly 20 25 30Gly Ser
Gly Gly Tyr Gly Pro Gly Gly Gln Gly Pro Gly Gln Gln Gly 35
40 45Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly
Pro Gly Gly Tyr Gly Pro 50 55 60Gly
Gly Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly65
70 75 80Ser Gly Gln Gln Gly Pro
Gly Ala Ser Gly Gly Tyr Gly Pro Gly Gly 85
90 95Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser Ala
Ala Ala Ala Ala 100 105 110Gly
Gly Tyr Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Ser Ala 115
120 125Ala Ala Ala Ala Gly Pro Gly Ser Gly
Gly Tyr Gly Gln Gly Pro Tyr 130 135
140Gly Pro Gly Ala Ser Gly Pro Gly Gly Tyr Gly Pro Gly Gly Gln Gly145
150 155 160Pro Ser Ala Ser
Ala Ala Ala Ala Ala Gly Ser Gly Gln Gln Gly Pro 165
170 175Gly Gly Tyr Gly Pro Tyr Ala Ser Ala Ala
Ala Ala Ala Gly Gly Tyr 180 185
190Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gly Ser Gly
195 200 205Ser Gly Gln Gln Gly Pro Gly
Gln Gln Gly Pro Tyr Ala Ser Ala Ala 210 215
220Ala Ala Ala Gly Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ser
Ser225 230 235 240Ala Ala
Ala Ala Ala Gly Gly Tyr Gly Tyr Gly Pro Gly Gly Gln Gly
245 250 255Pro Tyr Gly Pro Gly Ala Ser
Gly Gly Asn Gly Pro Gly Ser Gly Gly 260 265
270Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gly Ser Ala Ala Ala
Ala Ala 275 280 285Gly Pro Gly Gly
Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala 290
295 300Ala Ala Gly Gly Tyr Gly Pro Gly Gly Gln Gly Pro
Gly Gly Tyr Gly305 310 315
320Pro Gly Ser Ser Gly Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ser
325 330 335Ser Ala Ala Ala Ala
Ala Gly Gly Tyr Gly Pro Gly Gln Gln Gly Pro 340
345 350Tyr Gly Pro Gly Gly Ser Ala Ala Ala Ala Ala Gly
Gly Tyr Gln Gln 355 360 365Gly Pro
Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly 370
375 380Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala Gly385 390 395
400Pro Gly Gly Tyr Gly Pro Gly Gly Gln Gly Pro Ser Ala Ser Ala Ala
405 410 415Ala Ala Ala Gly
Gly Tyr Gly Ser Gly Pro Gly Gly Tyr Gly Pro Tyr 420
425 430Gly Pro Gly Gly Ser Gly Pro Gly Ser Gly Gln
Gln Gly Gln Gly Pro 435 440 445Tyr
Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gly Tyr Gly Pro 450
455 460Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gly
Ser Ala Ala Ala Ala Ala465 470 475
480Gly Pro Gly Ser Gly Gly Tyr Gly Pro Gly Ala Ser Gly Gly Asn
Gly 485 490 495Pro Gly Ser
Gly Gly Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gly Ser 500
505 510Ala Ala Ala Ala Ala Gly Gly Tyr Gln Gln
Gly Pro Gly Gly Gln Gly 515 520
525Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gly Tyr Gly 530
535 540Ser Gly Pro Gly Gln Gln Gly Pro
Tyr Gly Pro Gly Gly Ser Gly Ser545 550
555 560Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Ala
Ser Ala Ala Ala 565 570
575Ala Ala Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser 580
585 59018601PRTArtificial SequencePRT399
18Met His His His His His His Ser Ser Gly Ser Ser Gly Pro Gly Gly1
5 10 15Gln Gly Pro Tyr Gly Pro
Gly Ala Ser Ala Ala Ala Ala Ala Gly Gly 20 25
30Asn Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gly Ser
Gly Gly Tyr 35 40 45Gly Pro Gly
Gly Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser Ala 50
55 60Ala Ala Ala Ala Gly Pro Gly Gly Tyr Gly Pro Gly
Gly Gln Gly Pro65 70 75
80Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln Gly
85 90 95Pro Gly Ala Ser Gly Gly
Tyr Gly Pro Gly Gly Gln Gly Pro Gly Gln 100
105 110Gln Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly
Gly Tyr Gly Ser 115 120 125Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Gly 130
135 140Pro Gly Ser Gly Gly Tyr Gly Gln Gly Pro Tyr
Gly Pro Gly Ala Ser145 150 155
160Gly Pro Gly Gly Tyr Gly Pro Gly Gly Gln Gly Pro Ser Ala Ser Ala
165 170 175Ala Ala Ala Ala
Gly Ser Gly Gln Gln Gly Pro Gly Gly Tyr Gly Pro 180
185 190Tyr Ala Ser Ala Ala Ala Ala Ala Gly Gly Tyr
Gly Ser Gly Pro Gly 195 200 205Gln
Gln Gly Pro Tyr Gly Pro Gly Gly Ser Gly Ser Gly Gln Gln Gly 210
215 220Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala
Ala Ala Ala Ala Gly Pro225 230 235
240Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala
Ala 245 250 255Gly Gly Tyr
Gly Tyr Gly Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly 260
265 270Ala Ser Gly Gly Asn Gly Pro Gly Ser Gly
Gly Tyr Gly Pro Gly Gln 275 280
285Gln Gly Pro Gly Gly Ser Ala Ala Ala Ala Ala Gly Pro Gly Gly Gln 290
295 300Gly Pro Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala Ala Gly Gly Tyr305 310
315 320Gly Pro Gly Gly Gln Gly Pro Gly Gly Tyr Gly Pro
Gly Ser Ser Gly 325 330
335Pro Gly Gly Gln Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala
340 345 350Ala Gly Gly Tyr Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gly 355 360
365Ser Ala Ala Ala Ala Ala Gly Gly Tyr Gln Gln Gly Pro Gly
Gly Gln 370 375 380Gly Pro Tyr Gly Pro
Gly Ala Ser Gly Pro Gly Gly Gln Gly Pro Tyr385 390
395 400Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
Gly Pro Gly Gly Tyr Gly 405 410
415Pro Gly Gly Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Gly
420 425 430Tyr Gly Ser Gly Pro
Gly Gly Tyr Gly Pro Tyr Gly Pro Gly Gly Ser 435
440 445Gly Pro Gly Ser Gly Gln Gln Gly Gln Gly Pro Tyr
Gly Pro Gly Ala 450 455 460Ser Ala Ala
Ala Ala Ala Gly Gly Tyr Gly Pro Gly Gln Gln Gly Pro465
470 475 480Tyr Gly Pro Gly Gly Ser Ala
Ala Ala Ala Ala Gly Pro Gly Ser Gly 485
490 495Gly Tyr Gly Pro Gly Ala Ser Gly Gly Asn Gly Pro
Gly Ser Gly Gly 500 505 510Tyr
Gly Pro Gly Gln Gln Gly Pro Gly Gly Ser Ala Ala Ala Ala Ala 515
520 525Gly Gly Tyr Gln Gln Gly Pro Gly Gly
Gln Gly Pro Tyr Gly Pro Gly 530 535
540Ala Ser Ala Ala Ala Ala Ala Gly Gly Tyr Gly Ser Gly Pro Gly Gln545
550 555 560Gln Gly Pro Tyr
Gly Pro Gly Gly Ser Gly Ser Gly Gln Gln Gly Pro 565
570 575Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala
Ala Ala Ala Gly Pro Gly 580 585
590Ser Gly Gln Gln Gly Pro Gly Ala Ser 595
60019612PRTArtificial SequenceMet-PRT720 19Met Gly Pro Gly Gln Gln Gly
Pro Tyr Gly Pro Gly Ala Ser Ala Ala1 5 10
15Ala Ala Ala Gly Gln Asn Gly Pro Gly Ser Gly Gln Gln
Gly Pro Gly 20 25 30Gln Ser
Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly 35
40 45Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly
Pro Gly Gln Tyr Val Leu 50 55 60Ile
Gly Pro Gly Gln Gln Val Leu Ile Gly Pro Ser Ala Ser Ala Ala65
70 75 80Ala Ala Ala Gly Pro Gly
Ser Gly Gln Gln Gly Pro Gly Ala Ser Gly 85
90 95Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln
Gly Pro Gly Ser 100 105 110Ser
Ala Ala Ala Ala Ala Gly Ser Tyr Gly Ser Val Leu Ile Gly Pro 115
120 125Gly Gln Gln Val Leu Ile Gly Pro Tyr
Gly Ser Ala Ala Ala Ala Ala 130 135
140Gly Pro Gly Ser Gly Gln Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala145
150 155 160Ser Gly Pro Gly
Gln Tyr Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser 165
170 175Ala Ala Ala Ala Ala Gly Ser Gly Gln Gln
Val Leu Ile Gly Pro Gly 180 185
190Gln Tyr Val Leu Ile Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly
195 200 205Gln Tyr Gly Ser Gly Pro Gly
Gln Gln Gly Pro Tyr Gly Pro Gly Gln 210 215
220Ser Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Ala
Ser225 230 235 240Ala Ala
Ala Ala Ala Gly Pro Gly Gln Gln Val Leu Ile Gly Pro Tyr
245 250 255Val Leu Ile Gly Pro Gly Ser
Ser Ala Ala Ala Ala Ala Gly Gln Tyr 260 265
270Gly Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala
Ser Gly 275 280 285Gln Asn Gly Pro
Gly Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro 290
295 300Gly Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln
Gln Val Leu Ile305 310 315
320Gly Pro Tyr Val Leu Ile Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
325 330 335Gly Gln Tyr Gly Pro
Gly Gln Gln Gly Pro Gly Gln Tyr Gly Pro Gly 340
345 350Ser Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro
Gly Ser Ser Ala 355 360 365Ala Ala
Ala Ala Gly Ser Tyr Gly Pro Gly Gln Gln Val Leu Ile Gly 370
375 380Pro Tyr Val Leu Ile Gly Pro Gly Pro Ser Ala
Ala Ala Ala Ala Gly385 390 395
400Gln Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala
405 410 415Ser Gly Pro Gly
Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala 420
425 430Ala Ala Ala Gly Pro Gly Gln Tyr Val Leu Ile
Gly Pro Gly Gln Gln 435 440 445Val
Leu Ile Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr 450
455 460Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr
Gly Pro Gly Gln Ser Gly465 470 475
480Pro Gly Ser Gly Gln Gln Gly Gln Gly Pro Tyr Gly Pro Gly Ala
Ser 485 490 495Ala Ala Ala
Ala Ala Gly Ser Tyr Gly Pro Gly Gln Gln Val Leu Ile 500
505 510Gly Pro Tyr Val Leu Ile Gly Pro Gly Pro
Ser Ala Ala Ala Ala Ala 515 520
525Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly 530
535 540Pro Gly Ser Gly Gln Tyr Gly Pro
Gly Gln Gln Gly Pro Gly Gln Ser545 550
555 560Ala Ala Ala Ala Ala Gly Gln Tyr Gln Gln Val Leu
Ile Gly Pro Gly 565 570
575Gln Gln Gly Pro Tyr Val Leu Ile Gly Pro Gly Ala Ser Ala Ala Ala
580 585 590Ala Ala Gly Pro Gly Ser
Gly Gln Gln Val Leu Ile Gly Pro Gly Ala 595 600
605Ser Val Leu Ile 61020592PRTArtificial
SequenceMet-PRT665 20Met Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala1 5 10 15Ala
Ala Ala Ala Ala Gly Ser Asn Gly Pro Gly Ser Gly Gln Gln Gly 20
25 30Pro Gly Gln Ser Gly Gln Tyr Gly
Pro Gly Gln Gln Gly Pro Gly Gln 35 40
45Gln Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly
50 55 60Gln Tyr Val Leu Ile Gly Pro Gly
Gln Gln Gly Pro Ser Ala Ser Ala65 70 75
80Ala Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln
Gly Pro Gly 85 90 95Ala
Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Gln Gly
100 105 110Pro Gly Ser Ser Ala Ala Ala
Ala Ala Ala Ala Gly Ser Tyr Gly Ser 115 120
125Val Leu Ile Gly Pro Gly Gln Gln Gly Pro Tyr Gly Ser Ala Ala
Ala 130 135 140Ala Ala Ala Ala Gly Pro
Gly Ser Gly Gln Tyr Gly Gln Gly Pro Tyr145 150
155 160Gly Pro Gly Ala Ser Gly Pro Gly Gln Tyr Gly
Pro Gly Gln Gln Gly 165 170
175Pro Ser Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Gly Gln Gln
180 185 190Val Leu Ile Gly Pro Gly
Gln Tyr Gly Pro Tyr Ala Ser Ala Ala Ala 195 200
205Ala Ala Ala Ala Gly Ser Tyr Gly Ser Gly Pro Gly Gln Gln
Gly Pro 210 215 220Tyr Gly Pro Gly Gln
Ser Gly Ser Gly Gln Gln Gly Pro Gly Gln Gln225 230
235 240Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala
Ala Ala Gly Pro Gly Gln 245 250
255Gln Val Leu Ile Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala
260 265 270Ala Ala Ala Gly Ser
Tyr Gly Tyr Gly Pro Gly Gln Gln Gly Pro Tyr 275
280 285Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly Ser
Gly Gln Tyr Gly 290 295 300Pro Gly Gln
Gln Gly Pro Gly Pro Ser Ala Ala Ala Ala Ala Ala Ala305
310 315 320Gly Pro Gly Gln Gln Val Leu
Ile Gly Pro Tyr Gly Pro Gly Ala Ser 325
330 335Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro
Gly Gln Gln Gly 340 345 350Pro
Gly Gln Tyr Gly Pro Gly Ser Ser Gly Pro Gly Gln Gln Gly Pro 355
360 365Tyr Gly Pro Gly Ser Ser Ala Ala Ala
Ala Ala Ala Ala Gly Ser Tyr 370 375
380Gly Pro Gly Gln Gln Val Leu Ile Gly Pro Tyr Gly Pro Gly Pro Ser385
390 395 400Ala Ala Ala Ala
Ala Ala Ala Gly Ser Tyr Gln Gln Gly Pro Gly Gln 405
410 415Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly
Pro Gly Gln Gln Gly Pro 420 425
430Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly
435 440 445Gln Tyr Val Leu Ile Gly Pro
Gly Gln Gln Gly Pro Ser Ala Ser Ala 450 455
460Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Ser Gly Pro Gly Gln
Tyr465 470 475 480Gly Pro
Tyr Gly Pro Gly Gln Ser Gly Pro Gly Ser Gly Gln Gln Gly
485 490 495Gln Gly Pro Tyr Gly Pro Gly
Ala Ser Ala Ala Ala Ala Ala Ala Ala 500 505
510Gly Ser Tyr Gly Pro Gly Gln Gln Val Leu Ile Gly Pro Tyr
Gly Pro 515 520 525Gly Pro Ser Ala
Ala Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln 530
535 540Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly
Ser Gly Gln Tyr545 550 555
560Gly Pro Gly Gln Gln Gly Pro Gly Pro Ser Ala Ala Ala Ala Ala Ala
565 570 575Ala Gly Pro Gly Ser
Gly Gln Gln Gly Pro Gly Ala Ser Val Leu Ile 580
585 59021619PRTArtificial SequenceMet-PRT666 21Met Gly
Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala1 5
10 15Ala Ala Ala Ala Ala Gly Ser Asn
Gly Pro Gly Ser Gly Gln Gln Gly 20 25
30Pro Gly Gln Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly
Gln 35 40 45Gln Gly Pro Gly Ser
Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly 50 55
60Gln Tyr Val Leu Ile Gly Pro Gly Gln Gln Val Leu Ile Gly
Pro Ser65 70 75 80Ala
Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln
85 90 95Gly Pro Gly Ala Ser Gly Gln
Tyr Gly Pro Gly Gln Gln Gly Pro Gly 100 105
110Gln Gln Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Ala Ala
Gly Ser 115 120 125Tyr Gly Ser Val
Leu Ile Gly Pro Gly Gln Gln Val Leu Ile Gly Pro 130
135 140Tyr Gly Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro
Gly Ser Gly Gln145 150 155
160Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Gln Tyr
165 170 175Gly Pro Gly Gln Gln
Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Ala 180
185 190Ala Gly Ser Gly Gln Gln Val Leu Ile Gly Pro Gly
Gln Tyr Val Leu 195 200 205Ile Gly
Pro Tyr Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr 210
215 220Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly
Pro Gly Gln Ser Gly225 230 235
240Ser Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala
245 250 255Ala Ala Ala Ala
Ala Gly Pro Gly Gln Gln Val Leu Ile Gly Pro Tyr 260
265 270Val Leu Ile Gly Pro Gly Ser Ser Ala Ala Ala
Ala Ala Ala Ala Gly 275 280 285Ser
Tyr Gly Tyr Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala 290
295 300Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln
Tyr Gly Pro Gly Gln Gln305 310 315
320Gly Pro Gly Pro Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly
Gln 325 330 335Gln Val Leu
Ile Gly Pro Tyr Val Leu Ile Gly Pro Gly Ala Ser Ala 340
345 350Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly
Pro Gly Gln Gln Gly Pro 355 360
365Gly Gln Tyr Gly Pro Gly Ser Ser Gly Pro Gly Gln Gln Gly Pro Tyr 370
375 380Gly Pro Gly Ser Ser Ala Ala Ala
Ala Ala Ala Ala Gly Ser Tyr Gly385 390
395 400Pro Gly Gln Gln Val Leu Ile Gly Pro Tyr Val Leu
Ile Gly Pro Gly 405 410
415Pro Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gln Gln Gly Pro
420 425 430Gly Gln Gln Gly Pro Tyr
Gly Pro Gly Ala Ser Gly Pro Gly Gln Gln 435 440
445Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala
Ala Gly 450 455 460Pro Gly Gln Tyr Val
Leu Ile Gly Pro Gly Gln Gln Val Leu Ile Gly465 470
475 480Pro Ser Ala Ser Ala Ala Ala Ala Ala Ala
Ala Gly Ser Tyr Gly Ser 485 490
495Gly Pro Gly Gln Tyr Gly Pro Tyr Gly Pro Gly Gln Ser Gly Pro Gly
500 505 510Ser Gly Gln Gln Gly
Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala 515
520 525Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro Gly Gln
Gln Val Leu Ile 530 535 540Gly Pro Tyr
Val Leu Ile Gly Pro Gly Pro Ser Ala Ala Ala Ala Ala545
550 555 560Ala Ala Gly Pro Gly Ser Gly
Gln Tyr Gly Pro Gly Ala Ser Gly Gln 565
570 575Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Gln
Gln Gly Pro Gly 580 585 590Pro
Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Gln 595
600 605Val Leu Ile Gly Pro Gly Ala Ser Val
Leu Ile 610 61522623PRTArtificial SequencePRT720 22Met
His His His His His His Ser Ser Gly Ser Ser Gly Pro Gly Gln1
5 10 15Gln Gly Pro Tyr Gly Pro Gly
Ala Ser Ala Ala Ala Ala Ala Gly Gln 20 25
30Asn Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln Ser Gly
Gln Tyr 35 40 45Gly Pro Gly Gln
Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser Ala 50 55
60Ala Ala Ala Ala Gly Pro Gly Gln Tyr Val Leu Ile Gly
Pro Gly Gln65 70 75
80Gln Val Leu Ile Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro
85 90 95Gly Ser Gly Gln Gln Gly
Pro Gly Ala Ser Gly Gln Tyr Gly Pro Gly 100
105 110Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser
Ala Ala Ala Ala 115 120 125Ala Gly
Ser Tyr Gly Ser Val Leu Ile Gly Pro Gly Gln Gln Val Leu 130
135 140Ile Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala
Gly Pro Gly Ser Gly145 150 155
160Gln Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Gln
165 170 175Tyr Gly Pro Gly
Gln Gln Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala 180
185 190Gly Ser Gly Gln Gln Val Leu Ile Gly Pro Gly
Gln Tyr Val Leu Ile 195 200 205Gly
Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly 210
215 220Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly
Gln Ser Gly Ser Gly Gln225 230 235
240Gln Gly Pro Gly Gln Gln Gly Pro Tyr Ala Ser Ala Ala Ala Ala
Ala 245 250 255Gly Pro Gly
Gln Gln Val Leu Ile Gly Pro Tyr Val Leu Ile Gly Pro 260
265 270Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln
Tyr Gly Tyr Gly Pro Gly 275 280
285Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly 290
295 300Ser Gly Gln Tyr Gly Pro Gly Gln
Gln Gly Pro Gly Gln Ser Ala Ala305 310
315 320Ala Ala Ala Gly Pro Gly Gln Gln Val Leu Ile Gly
Pro Tyr Val Leu 325 330
335Ile Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Pro
340 345 350Gly Gln Gln Gly Pro Gly
Gln Tyr Gly Pro Gly Ser Ser Gly Pro Gly 355 360
365Gln Gln Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala
Ala Gly 370 375 380Ser Tyr Gly Pro Gly
Gln Gln Val Leu Ile Gly Pro Tyr Val Leu Ile385 390
395 400Gly Pro Gly Pro Ser Ala Ala Ala Ala Ala
Gly Gln Tyr Gln Gln Gly 405 410
415Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Gln
420 425 430Gln Gly Pro Tyr Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro 435
440 445Gly Gln Tyr Val Leu Ile Gly Pro Gly Gln Gln Val
Leu Ile Gly Pro 450 455 460Ser Ala Ser
Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly Pro Gly465
470 475 480Gln Tyr Gly Pro Tyr Gly Pro
Gly Gln Ser Gly Pro Gly Ser Gly Gln 485
490 495Gln Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala 500 505 510Gly
Ser Tyr Gly Pro Gly Gln Gln Val Leu Ile Gly Pro Tyr Val Leu 515
520 525Ile Gly Pro Gly Pro Ser Ala Ala Ala
Ala Ala Gly Pro Gly Ser Gly 530 535
540Gln Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln545
550 555 560Tyr Gly Pro Gly
Gln Gln Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala 565
570 575Gly Gln Tyr Gln Gln Val Leu Ile Gly Pro
Gly Gln Gln Gly Pro Tyr 580 585
590Val Leu Ile Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly
595 600 605Ser Gly Gln Gln Val Leu Ile
Gly Pro Gly Ala Ser Val Leu Ile 610 615
62023603PRTArtificial SequencePRT665 23Met His His His His His His Ser
Ser Gly Ser Ser Gly Pro Gly Gln1 5 10
15Gln Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
Ala Ala 20 25 30Gly Ser Asn
Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln Ser Gly 35
40 45Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln
Gln Gly Pro Gly Ser 50 55 60Ser Ala
Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Val Leu Ile65
70 75 80Gly Pro Gly Gln Gln Gly Pro
Ser Ala Ser Ala Ala Ala Ala Ala Ala 85 90
95Ala Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser
Gly Gln Tyr 100 105 110Gly Pro
Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser Ser Ala 115
120 125Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly
Ser Val Leu Ile Gly Pro 130 135 140Gly
Gln Gln Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Ala Ala Gly145
150 155 160Pro Gly Ser Gly Gln Tyr
Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser 165
170 175Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro
Ser Ala Ser Ala 180 185 190Ala
Ala Ala Ala Ala Ala Gly Ser Gly Gln Gln Val Leu Ile Gly Pro 195
200 205Gly Gln Tyr Gly Pro Tyr Ala Ser Ala
Ala Ala Ala Ala Ala Ala Gly 210 215
220Ser Tyr Gly Ser Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Gln225
230 235 240Ser Gly Ser Gly
Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Ala Ser 245
250 255Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly
Gln Gln Val Leu Ile Gly 260 265
270Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser
275 280 285Tyr Gly Tyr Gly Pro Gly Gln
Gln Gly Pro Tyr Gly Pro Gly Ala Ser 290 295
300Gly Gln Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Gln Gln
Gly305 310 315 320Pro Gly
Pro Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln Gln
325 330 335Val Leu Ile Gly Pro Tyr Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala 340 345
350Ala Ala Gly Ser Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln
Tyr Gly 355 360 365Pro Gly Ser Ser
Gly Pro Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser 370
375 380Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly
Pro Gly Gln Gln385 390 395
400Val Leu Ile Gly Pro Tyr Gly Pro Gly Pro Ser Ala Ala Ala Ala Ala
405 410 415Ala Ala Gly Ser Tyr
Gln Gln Gly Pro Gly Gln Gln Gly Pro Tyr Gly 420
425 430Pro Gly Ala Ser Gly Pro Gly Gln Gln Gly Pro Tyr
Gly Pro Gly Ala 435 440 445Ser Ala
Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Val Leu Ile 450
455 460Gly Pro Gly Gln Gln Gly Pro Ser Ala Ser Ala
Ala Ala Ala Ala Ala465 470 475
480Ala Gly Ser Tyr Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr Gly Pro
485 490 495Gly Gln Ser Gly
Pro Gly Ser Gly Gln Gln Gly Gln Gly Pro Tyr Gly 500
505 510Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala Ala
Gly Ser Tyr Gly Pro 515 520 525Gly
Gln Gln Val Leu Ile Gly Pro Tyr Gly Pro Gly Pro Ser Ala Ala 530
535 540Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly
Gln Tyr Gly Pro Gly Ala545 550 555
560Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Gln
Gln 565 570 575Gly Pro Gly
Pro Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Ser 580
585 590Gly Gln Gln Gly Pro Gly Ala Ser Val Leu
Ile 595 60024630PRTArtificial SequencePRT666 24Met
His His His His His His Ser Ser Gly Ser Ser Gly Pro Gly Gln1
5 10 15Gln Gly Pro Tyr Gly Pro Gly
Ala Ser Ala Ala Ala Ala Ala Ala Ala 20 25
30Gly Ser Asn Gly Pro Gly Ser Gly Gln Gln Gly Pro Gly Gln
Ser Gly 35 40 45Gln Tyr Gly Pro
Gly Gln Gln Gly Pro Gly Gln Gln Gly Pro Gly Ser 50 55
60Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr
Val Leu Ile65 70 75
80Gly Pro Gly Gln Gln Val Leu Ile Gly Pro Ser Ala Ser Ala Ala Ala
85 90 95Ala Ala Ala Ala Gly Pro
Gly Ser Gly Gln Gln Gly Pro Gly Ala Ser 100
105 110Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln
Gln Gly Pro Gly 115 120 125Ser Ser
Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Ser Val Leu 130
135 140Ile Gly Pro Gly Gln Gln Val Leu Ile Gly Pro
Tyr Gly Ser Ala Ala145 150 155
160Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Tyr Gly Gln Gly Pro
165 170 175Tyr Gly Pro Gly
Ala Ser Gly Pro Gly Gln Tyr Gly Pro Gly Gln Gln 180
185 190Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Ala
Ala Gly Ser Gly Gln 195 200 205Gln
Val Leu Ile Gly Pro Gly Gln Tyr Val Leu Ile Gly Pro Tyr Ala 210
215 220Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser
Tyr Gly Ser Gly Pro Gly225 230 235
240Gln Gln Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser Gly Gln Gln
Gly 245 250 255Pro Gly Gln
Gln Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Ala Ala 260
265 270Gly Pro Gly Gln Gln Val Leu Ile Gly Pro
Tyr Val Leu Ile Gly Pro 275 280
285Gly Ser Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Tyr Gly 290
295 300Pro Gly Gln Gln Gly Pro Tyr Gly
Pro Gly Ala Ser Gly Gln Asn Gly305 310
315 320Pro Gly Ser Gly Gln Tyr Gly Pro Gly Gln Gln Gly
Pro Gly Pro Ser 325 330
335Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln Gln Val Leu Ile Gly
340 345 350Pro Tyr Val Leu Ile Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala 355 360
365Ala Gly Ser Tyr Gly Pro Gly Gln Gln Gly Pro Gly Gln Tyr
Gly Pro 370 375 380Gly Ser Ser Gly Pro
Gly Gln Gln Gly Pro Tyr Gly Pro Gly Ser Ser385 390
395 400Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr
Gly Pro Gly Gln Gln Val 405 410
415Leu Ile Gly Pro Tyr Val Leu Ile Gly Pro Gly Pro Ser Ala Ala Ala
420 425 430Ala Ala Ala Ala Gly
Ser Tyr Gln Gln Gly Pro Gly Gln Gln Gly Pro 435
440 445Tyr Gly Pro Gly Ala Ser Gly Pro Gly Gln Gln Gly
Pro Tyr Gly Pro 450 455 460Gly Ala Ser
Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Val465
470 475 480Leu Ile Gly Pro Gly Gln Gln
Val Leu Ile Gly Pro Ser Ala Ser Ala 485
490 495Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Ser Gly
Pro Gly Gln Tyr 500 505 510Gly
Pro Tyr Gly Pro Gly Gln Ser Gly Pro Gly Ser Gly Gln Gln Gly 515
520 525Gln Gly Pro Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala Ala Ala Ala 530 535
540Gly Ser Tyr Gly Pro Gly Gln Gln Val Leu Ile Gly Pro Tyr Val Leu545
550 555 560Ile Gly Pro Gly
Pro Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly 565
570 575Ser Gly Gln Tyr Gly Pro Gly Ala Ser Gly
Gln Asn Gly Pro Gly Ser 580 585
590Gly Gln Tyr Gly Pro Gly Gln Gln Gly Pro Gly Pro Ser Ala Ala Ala
595 600 605Ala Ala Ala Ala Gly Pro Gly
Ser Gly Gln Gln Val Leu Ile Gly Pro 610 615
620Gly Ala Ser Val Leu Ile625 63025593PRTArtificial
SequenceMet-PRT888 25Met Gly Ser Ser Gly Pro Gly Val Leu Gly Pro Tyr Gly
Pro Gly Ala1 5 10 15Ser
Ala Ala Ala Ala Ala Gly Gln Asn Gly Pro Gly Ser Gly Val Leu 20
25 30Gly Pro Gly Gln Ser Gly Gln Tyr
Gly Pro Gly Val Leu Gly Pro Gly 35 40
45Val Leu Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln
50 55 60Tyr Gly Pro Gly Val Leu Gly Pro
Ser Ala Ser Ala Ala Ala Ala Ala65 70 75
80Gly Pro Gly Ser Gly Val Leu Gly Pro Gly Ala Ser Gly
Gln Tyr Gly 85 90 95Pro
Gly Val Leu Gly Pro Gly Val Leu Gly Pro Gly Ser Ser Ala Ala
100 105 110Ala Ala Ala Gly Gln Tyr Gly
Ser Gly Pro Gly Val Leu Gly Pro Tyr 115 120
125Gly Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln Tyr Gly
Gln 130 135 140Gly Pro Tyr Gly Pro Gly
Ala Ser Gly Pro Gly Gln Tyr Gly Pro Gly145 150
155 160Val Leu Gly Pro Ser Ala Ser Ala Ala Ala Ala
Ala Gly Ser Gly Val 165 170
175Leu Gly Pro Gly Gln Tyr Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala
180 185 190Gly Gln Tyr Gly Ser Gly
Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly 195 200
205Gln Ser Gly Ser Gly Val Leu Gly Pro Gly Val Leu Gly Pro
Tyr Ala 210 215 220Ser Ala Ala Ala Ala
Ala Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro225 230
235 240Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln
Tyr Gly Tyr Gly Pro Gly 245 250
255Val Leu Gly Pro Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly
260 265 270Ser Gly Gln Tyr Gly
Pro Gly Val Leu Gly Pro Gly Gln Ser Ala Ala 275
280 285Ala Ala Ala Gly Pro Gly Val Leu Gly Pro Tyr Gly
Pro Gly Ala Ser 290 295 300Ala Ala Ala
Ala Ala Gly Gln Tyr Gly Pro Gly Val Leu Gly Pro Gly305
310 315 320Gln Tyr Gly Pro Gly Ser Ser
Gly Pro Gly Val Leu Gly Pro Tyr Gly 325
330 335Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gly Pro Gly Val 340 345 350Leu
Gly Pro Tyr Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Gln 355
360 365Tyr Val Leu Gly Pro Gly Val Leu Gly
Pro Tyr Gly Pro Gly Ala Ser 370 375
380Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala385
390 395 400Ala Ala Gly Pro
Gly Gln Tyr Gly Pro Gly Val Leu Gly Pro Ser Ala 405
410 415Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly
Ser Gly Pro Gly Gln Tyr 420 425
430Gly Pro Tyr Gly Pro Gly Gln Ser Gly Pro Gly Ser Gly Val Leu Gly
435 440 445Gln Gly Pro Tyr Gly Pro Gly
Ala Ser Ala Ala Ala Ala Ala Gly Gln 450 455
460Tyr Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Gln Ser Ala
Ala465 470 475 480Ala Ala
Ala Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Ala Ser Gly
485 490 495Gln Asn Gly Pro Gly Ser Gly
Gln Tyr Gly Pro Gly Val Leu Gly Pro 500 505
510Gly Gln Ser Ala Ala Ala Ala Ala Gly Gln Tyr Val Leu Gly
Pro Gly 515 520 525Val Leu Gly Pro
Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly 530
535 540Gln Tyr Gly Ser Gly Pro Gly Val Leu Gly Pro Tyr
Gly Pro Gly Gln545 550 555
560Ser Gly Ser Gly Val Leu Gly Pro Gly Val Leu Gly Pro Tyr Ala Ser
565 570 575Ala Ala Ala Ala Ala
Gly Pro Gly Ser Gly Val Leu Gly Pro Gly Ala 580
585 590Ser26590PRTArtificial SequenceMet-PRT965 26Met
Gly Pro Gly Thr Ser Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala1
5 10 15Ala Ala Ala Gly Ala Asn Gly
Pro Gly Ser Gly Thr Ser Gly Pro Gly 20 25
30Ala Ser Gly Ala Tyr Gly Pro Gly Thr Ser Gly Pro Gly Thr
Ser Gly 35 40 45Pro Gly Ser Ser
Ala Ala Ala Ala Ala Gly Pro Gly Ala Tyr Gly Pro 50 55
60Gly Thr Ser Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala
Gly Pro Gly65 70 75
80Ser Gly Thr Ser Gly Pro Gly Ala Ser Gly Ala Tyr Gly Pro Gly Thr
85 90 95Ser Gly Pro Gly Thr Ser
Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala 100
105 110Gly Ala Tyr Gly Ser Gly Pro Gly Thr Ser Gly Pro
Tyr Gly Ser Ala 115 120 125Ala Ala
Ala Ala Gly Pro Gly Ser Gly Ala Tyr Gly Ala Gly Pro Tyr 130
135 140Gly Pro Gly Ala Ser Gly Pro Gly Ala Tyr Gly
Pro Gly Thr Ser Gly145 150 155
160Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Ser Gly Thr Ser Gly Pro
165 170 175Gly Ala Tyr Gly
Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Ala Tyr 180
185 190Gly Ser Gly Pro Gly Thr Ser Gly Pro Tyr Gly
Pro Gly Ala Ser Gly 195 200 205Ser
Gly Thr Ser Gly Pro Gly Thr Ser Gly Pro Tyr Ala Ser Ala Ala 210
215 220Ala Ala Ala Gly Pro Gly Thr Ser Gly Pro
Tyr Gly Pro Gly Ser Ser225 230 235
240Ala Ala Ala Ala Ala Gly Ala Tyr Gly Tyr Gly Pro Gly Thr Ser
Gly 245 250 255Pro Tyr Gly
Pro Gly Ala Ser Gly Ala Asn Gly Pro Gly Ser Gly Ala 260
265 270Tyr Gly Pro Gly Thr Ser Gly Pro Gly Ala
Ser Ala Ala Ala Ala Ala 275 280
285Gly Pro Gly Thr Ser Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala 290
295 300Ala Ala Gly Ala Tyr Gly Pro Gly
Thr Ser Gly Pro Gly Ala Tyr Gly305 310
315 320Pro Gly Ser Ser Gly Pro Gly Thr Ser Gly Pro Tyr
Gly Pro Gly Ser 325 330
335Ser Ala Ala Ala Ala Ala Gly Ala Tyr Gly Pro Gly Thr Ser Gly Pro
340 345 350Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala Ala Gly Ala Tyr Thr Ser 355 360
365Gly Pro Gly Thr Ser Gly Pro Tyr Gly Pro Gly Ala Ser Gly
Pro Gly 370 375 380Thr Ser Gly Pro Tyr
Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly385 390
395 400Pro Gly Ala Tyr Gly Pro Gly Thr Ser Gly
Pro Ser Ala Ser Ala Ala 405 410
415Ala Ala Ala Gly Ala Tyr Gly Ser Gly Pro Gly Ala Tyr Gly Pro Tyr
420 425 430Gly Pro Gly Ala Ser
Gly Pro Gly Ser Gly Thr Ser Gly Ala Gly Pro 435
440 445Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly
Ala Tyr Gly Pro 450 455 460Gly Thr Ser
Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala465
470 475 480Gly Pro Gly Ser Gly Ala Tyr
Gly Pro Gly Ala Ser Gly Ala Asn Gly 485
490 495Pro Gly Ser Gly Ala Tyr Gly Pro Gly Thr Ser Gly
Pro Gly Ala Ser 500 505 510Ala
Ala Ala Ala Ala Gly Ala Tyr Thr Ser Gly Pro Gly Thr Ser Gly 515
520 525Pro Tyr Gly Pro Gly Ala Ser Ala Ala
Ala Ala Ala Gly Ala Tyr Gly 530 535
540Ser Gly Pro Gly Thr Ser Gly Pro Tyr Gly Pro Gly Ala Ser Gly Ser545
550 555 560Gly Thr Ser Gly
Pro Gly Thr Ser Gly Pro Tyr Ala Ser Ala Ala Ala 565
570 575Ala Ala Gly Pro Gly Ser Gly Thr Ser Gly
Pro Gly Ala Ser 580 585
59027593PRTArtificial SequenceMet-PRT889 27Met Gly Ser Ser Gly Pro Gly
Val Leu Gly Pro Tyr Gly Pro Gly Ala1 5 10
15Ser Ala Ala Ala Ala Ala Gly Ile Asn Gly Pro Gly Ser
Gly Val Leu 20 25 30Gly Pro
Gly Ile Ser Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro Gly 35
40 45Val Leu Gly Pro Gly Ser Ser Ala Ala Ala
Ala Ala Gly Pro Gly Ile 50 55 60Tyr
Gly Pro Gly Val Leu Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala65
70 75 80Gly Pro Gly Ser Gly Val
Leu Gly Pro Gly Ala Ser Gly Ile Tyr Gly 85
90 95Pro Gly Val Leu Gly Pro Gly Val Leu Gly Pro Gly
Ser Ser Ala Ala 100 105 110Ala
Ala Ala Gly Ile Tyr Gly Ser Gly Pro Gly Val Leu Gly Pro Tyr 115
120 125Gly Ser Ala Ala Ala Ala Ala Gly Pro
Gly Ser Gly Ile Tyr Gly Ile 130 135
140Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Ile Tyr Gly Pro Gly145
150 155 160Val Leu Gly Pro
Ser Ala Ser Ala Ala Ala Ala Ala Gly Ser Gly Val 165
170 175Leu Gly Pro Gly Ile Tyr Gly Pro Tyr Ala
Ser Ala Ala Ala Ala Ala 180 185
190Gly Ile Tyr Gly Ser Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly
195 200 205Ile Ser Gly Ser Gly Val Leu
Gly Pro Gly Val Leu Gly Pro Tyr Ala 210 215
220Ser Ala Ala Ala Ala Ala Gly Pro Gly Val Leu Gly Pro Tyr Gly
Pro225 230 235 240Gly Ser
Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Tyr Gly Pro Gly
245 250 255Val Leu Gly Pro Tyr Gly Pro
Gly Ala Ser Gly Ile Asn Gly Pro Gly 260 265
270Ser Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro Gly Ile Ser
Ala Ala 275 280 285Ala Ala Ala Gly
Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala Ser 290
295 300Ala Ala Ala Ala Ala Gly Ile Tyr Gly Pro Gly Val
Leu Gly Pro Gly305 310 315
320Ile Tyr Gly Pro Gly Ser Ser Gly Pro Gly Val Leu Gly Pro Tyr Gly
325 330 335Pro Gly Ser Ser Ala
Ala Ala Ala Ala Gly Ile Tyr Gly Pro Gly Val 340
345 350Leu Gly Pro Tyr Gly Pro Gly Ile Ser Ala Ala Ala
Ala Ala Gly Ile 355 360 365Tyr Val
Leu Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala Ser 370
375 380Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly
Ala Ser Ala Ala Ala385 390 395
400Ala Ala Gly Pro Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro Ser Ala
405 410 415Ser Ala Ala Ala
Ala Ala Gly Ile Tyr Gly Ser Gly Pro Gly Ile Tyr 420
425 430Gly Pro Tyr Gly Pro Gly Ile Ser Gly Pro Gly
Ser Gly Val Leu Gly 435 440 445Ile
Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile 450
455 460Tyr Gly Pro Gly Val Leu Gly Pro Tyr Gly
Pro Gly Ile Ser Ala Ala465 470 475
480Ala Ala Ala Gly Pro Gly Ser Gly Ile Tyr Gly Pro Gly Ala Ser
Gly 485 490 495Ile Asn Gly
Pro Gly Ser Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro 500
505 510Gly Ile Ser Ala Ala Ala Ala Ala Gly Ile
Tyr Val Leu Gly Pro Gly 515 520
525Val Leu Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly 530
535 540Ile Tyr Gly Ser Gly Pro Gly Val
Leu Gly Pro Tyr Gly Pro Gly Ile545 550
555 560Ser Gly Ser Gly Val Leu Gly Pro Gly Val Leu Gly
Pro Tyr Ala Ser 565 570
575Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Val Leu Gly Pro Gly Ala
580 585 590Ser28590PRTArtificial
SequenceMet-PRT916 28Met Gly Pro Gly Val Ile Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala1 5 10 15Ala
Ala Ala Gly Leu Asn Gly Pro Gly Ser Gly Val Ile Gly Pro Gly 20
25 30Leu Ser Gly Leu Tyr Gly Pro Gly
Val Ile Gly Pro Gly Val Ile Gly 35 40
45Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Pro Gly Leu Tyr Gly Pro
50 55 60Gly Val Ile Gly Pro Ser Ala Ser
Ala Ala Ala Ala Ala Gly Pro Gly65 70 75
80Ser Gly Val Ile Gly Pro Gly Ala Ser Gly Leu Tyr Gly
Pro Gly Val 85 90 95Ile
Gly Pro Gly Val Ile Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala
100 105 110Gly Leu Tyr Gly Ser Gly Pro
Gly Val Ile Gly Pro Tyr Gly Ser Ala 115 120
125Ala Ala Ala Ala Gly Pro Gly Ser Gly Leu Tyr Gly Leu Gly Pro
Tyr 130 135 140Gly Pro Gly Ala Ser Gly
Pro Gly Leu Tyr Gly Pro Gly Val Ile Gly145 150
155 160Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Ser
Gly Val Ile Gly Pro 165 170
175Gly Leu Tyr Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Leu Tyr
180 185 190Gly Ser Gly Pro Gly Val
Ile Gly Pro Tyr Gly Pro Gly Leu Ser Gly 195 200
205Ser Gly Val Ile Gly Pro Gly Val Ile Gly Pro Tyr Ala Ser
Ala Ala 210 215 220Ala Ala Ala Gly Pro
Gly Val Ile Gly Pro Tyr Gly Pro Gly Ser Ser225 230
235 240Ala Ala Ala Ala Ala Gly Leu Tyr Gly Tyr
Gly Pro Gly Val Ile Gly 245 250
255Pro Tyr Gly Pro Gly Ala Ser Gly Leu Asn Gly Pro Gly Ser Gly Leu
260 265 270Tyr Gly Pro Gly Val
Ile Gly Pro Gly Leu Ser Ala Ala Ala Ala Ala 275
280 285Gly Pro Gly Val Ile Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala Ala 290 295 300Ala Ala Gly
Leu Tyr Gly Pro Gly Val Ile Gly Pro Gly Leu Tyr Gly305
310 315 320Pro Gly Ser Ser Gly Pro Gly
Val Ile Gly Pro Tyr Gly Pro Gly Ser 325
330 335Ser Ala Ala Ala Ala Ala Gly Leu Tyr Gly Pro Gly
Val Ile Gly Pro 340 345 350Tyr
Gly Pro Gly Leu Ser Ala Ala Ala Ala Ala Gly Leu Tyr Val Ile 355
360 365Gly Pro Gly Val Ile Gly Pro Tyr Gly
Pro Gly Ala Ser Gly Pro Gly 370 375
380Val Ile Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly385
390 395 400Pro Gly Leu Tyr
Gly Pro Gly Val Ile Gly Pro Ser Ala Ser Ala Ala 405
410 415Ala Ala Ala Gly Leu Tyr Gly Ser Gly Pro
Gly Leu Tyr Gly Pro Tyr 420 425
430Gly Pro Gly Leu Ser Gly Pro Gly Ser Gly Val Ile Gly Leu Gly Pro
435 440 445Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala Gly Leu Tyr Gly Pro 450 455
460Gly Val Ile Gly Pro Tyr Gly Pro Gly Leu Ser Ala Ala Ala Ala
Ala465 470 475 480Gly Pro
Gly Ser Gly Leu Tyr Gly Pro Gly Ala Ser Gly Leu Asn Gly
485 490 495Pro Gly Ser Gly Leu Tyr Gly
Pro Gly Val Ile Gly Pro Gly Leu Ser 500 505
510Ala Ala Ala Ala Ala Gly Leu Tyr Val Ile Gly Pro Gly Val
Ile Gly 515 520 525Pro Tyr Gly Pro
Gly Ala Ser Ala Ala Ala Ala Ala Gly Leu Tyr Gly 530
535 540Ser Gly Pro Gly Val Ile Gly Pro Tyr Gly Pro Gly
Leu Ser Gly Ser545 550 555
560Gly Val Ile Gly Pro Gly Val Ile Gly Pro Tyr Ala Ser Ala Ala Ala
565 570 575Ala Ala Gly Pro Gly
Ser Gly Val Ile Gly Pro Gly Ala Ser 580 585
59029590PRTArtificial SequenceMet-PRT918 29Met Gly Pro Gly
Val Phe Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala1 5
10 15Ala Ala Ala Gly Ile Asn Gly Pro Gly Ser
Gly Val Phe Gly Pro Gly 20 25
30Ile Ser Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Gly Val Phe Gly
35 40 45Pro Gly Ser Ser Ala Ala Ala Ala
Ala Gly Pro Gly Ile Tyr Gly Pro 50 55
60Gly Val Phe Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly65
70 75 80Ser Gly Val Phe Gly
Pro Gly Ala Ser Gly Ile Tyr Gly Pro Gly Val 85
90 95Phe Gly Pro Gly Val Phe Gly Pro Gly Ser Ser
Ala Ala Ala Ala Ala 100 105
110Gly Ile Tyr Gly Ser Gly Pro Gly Val Phe Gly Pro Tyr Gly Ser Ala
115 120 125Ala Ala Ala Ala Gly Pro Gly
Ser Gly Ile Tyr Gly Ile Gly Pro Tyr 130 135
140Gly Pro Gly Ala Ser Gly Pro Gly Ile Tyr Gly Pro Gly Val Phe
Gly145 150 155 160Pro Ser
Ala Ser Ala Ala Ala Ala Ala Gly Ser Gly Val Phe Gly Pro
165 170 175Gly Ile Tyr Gly Pro Tyr Ala
Ser Ala Ala Ala Ala Ala Gly Ile Tyr 180 185
190Gly Ser Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly Ile
Ser Gly 195 200 205Ser Gly Val Phe
Gly Pro Gly Val Phe Gly Pro Tyr Ala Ser Ala Ala 210
215 220Ala Ala Ala Gly Pro Gly Val Phe Gly Pro Tyr Gly
Pro Gly Ser Ser225 230 235
240Ala Ala Ala Ala Ala Gly Ile Tyr Gly Tyr Gly Pro Gly Val Phe Gly
245 250 255Pro Tyr Gly Pro Gly
Ala Ser Gly Ile Asn Gly Pro Gly Ser Gly Ile 260
265 270Tyr Gly Pro Gly Val Phe Gly Pro Gly Ile Ser Ala
Ala Ala Ala Ala 275 280 285Gly Pro
Gly Val Phe Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala 290
295 300Ala Ala Gly Ile Tyr Gly Pro Gly Val Phe Gly
Pro Gly Ile Tyr Gly305 310 315
320Pro Gly Ser Ser Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly Ser
325 330 335Ser Ala Ala Ala
Ala Ala Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro 340
345 350Tyr Gly Pro Gly Ile Ser Ala Ala Ala Ala Ala
Gly Ile Tyr Val Phe 355 360 365Gly
Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly 370
375 380Val Phe Gly Pro Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala Ala Gly385 390 395
400Pro Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Ser Ala Ser Ala
Ala 405 410 415Ala Ala Ala
Gly Ile Tyr Gly Ser Gly Pro Gly Ile Tyr Gly Pro Tyr 420
425 430Gly Pro Gly Ile Ser Gly Pro Gly Ser Gly
Val Phe Gly Ile Gly Pro 435 440
445Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Pro 450
455 460Gly Val Phe Gly Pro Tyr Gly Pro
Gly Ile Ser Ala Ala Ala Ala Ala465 470
475 480Gly Pro Gly Ser Gly Ile Tyr Gly Pro Gly Ala Ser
Gly Ile Asn Gly 485 490
495Pro Gly Ser Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Gly Ile Ser
500 505 510Ala Ala Ala Ala Ala Gly
Ile Tyr Val Phe Gly Pro Gly Val Phe Gly 515 520
525Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile
Tyr Gly 530 535 540Ser Gly Pro Gly Val
Phe Gly Pro Tyr Gly Pro Gly Ile Ser Gly Ser545 550
555 560Gly Val Phe Gly Pro Gly Val Phe Gly Pro
Tyr Ala Ser Ala Ala Ala 565 570
575Ala Ala Gly Pro Gly Ser Gly Val Phe Gly Pro Gly Ala Ser
580 585 59030565PRTArtificial
SequenceMet-PRT699 30Met Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala1 5 10 15Ala
Ala Ala Ala Ala Gly Ser Asn Gly Pro Gly Ser Gly Val Leu Gly 20
25 30Pro Gly Gln Ser Gly Gln Tyr Gly
Pro Gly Val Leu Gly Pro Gly Val 35 40
45Leu Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly
50 55 60Gln Tyr Gly Pro Gly Val Leu Gly
Pro Ser Ala Ser Ala Ala Ala Ala65 70 75
80Ala Ala Ala Gly Pro Gly Ser Gly Val Leu Gly Pro Gly
Ala Ser Gly 85 90 95Gln
Tyr Gly Pro Gly Val Leu Gly Pro Gly Val Leu Gly Pro Gly Ser
100 105 110Ser Ala Ala Ala Ala Ala Ala
Ala Gly Ser Tyr Gly Ser Gly Pro Gly 115 120
125Val Leu Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Ala Ala Gly
Pro 130 135 140Gly Ser Gly Gln Tyr Gly
Gln Gly Pro Tyr Gly Pro Gly Ala Ser Gly145 150
155 160Pro Gly Gln Tyr Gly Pro Gly Val Leu Gly Pro
Ser Ala Ser Ala Ala 165 170
175Ala Ala Ala Ala Ala Gly Ser Gly Val Leu Gly Pro Gly Gln Tyr Gly
180 185 190Pro Tyr Ala Ser Ala Ala
Ala Ala Ala Ala Ala Gly Ser Tyr Gly Ser 195 200
205Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Gln Ser Gly
Ser Gly 210 215 220Val Leu Gly Pro Gly
Val Leu Gly Pro Tyr Ala Ser Ala Ala Ala Ala225 230
235 240Ala Ala Ala Gly Pro Gly Val Leu Gly Pro
Tyr Gly Pro Gly Ser Ser 245 250
255Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Tyr Gly Pro Gly Val
260 265 270Leu Gly Pro Tyr Gly
Pro Gly Ala Ser Gly Gln Asn Gly Pro Gly Ser 275
280 285Gly Gln Tyr Gly Pro Gly Val Leu Gly Pro Gly Pro
Ser Ala Ala Ala 290 295 300Ala Ala Ala
Ala Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala305
310 315 320Ser Ala Ala Ala Ala Ala Ala
Ala Gly Ser Tyr Gly Pro Gly Val Leu 325
330 335Gly Pro Gly Gln Tyr Gly Pro Gly Ser Ser Gly Pro
Gly Val Leu Gly 340 345 350Pro
Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser 355
360 365Tyr Gly Pro Gly Val Leu Gly Pro Tyr
Gly Pro Gly Pro Ser Ala Ala 370 375
380Ala Ala Ala Ala Ala Gly Ser Tyr Val Leu Gly Pro Gly Val Leu Gly385
390 395 400Pro Tyr Gly Pro
Gly Ala Ser Gly Pro Gly Val Leu Gly Pro Tyr Gly 405
410 415Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala
Ala Gly Pro Gly Gln Tyr 420 425
430Gly Pro Gly Val Leu Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Ala
435 440 445Ala Gly Ser Tyr Gly Ser Gly
Pro Gly Gln Tyr Gly Pro Tyr Gly Pro 450 455
460Gly Gln Ser Gly Pro Gly Ser Gly Val Leu Gly Gln Gly Pro Tyr
Gly465 470 475 480Pro Gly
Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro
485 490 495Gly Val Leu Gly Pro Tyr Gly
Pro Gly Pro Ser Ala Ala Ala Ala Ala 500 505
510Ala Ala Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Ala Ser
Gly Gln 515 520 525Asn Gly Pro Gly
Ser Gly Gln Tyr Gly Pro Gly Val Leu Gly Pro Gly 530
535 540Pro Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly
Ser Gly Val Leu545 550 555
560Gly Pro Gly Ala Ser 56531565PRTArtificial
SequenceMet-PRT698 31Met Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala1 5 10 15Ala
Ala Ala Ala Ala Gly Ser Asn Gly Pro Gly Ser Gly Val Leu Gly 20
25 30Pro Gly Ile Ser Gly Ile Tyr Gly
Pro Gly Val Leu Gly Pro Gly Val 35 40
45Leu Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly
50 55 60Ile Tyr Gly Pro Gly Val Leu Gly
Pro Ser Ala Ser Ala Ala Ala Ala65 70 75
80Ala Ala Ala Gly Pro Gly Ser Gly Val Leu Gly Pro Gly
Ala Ser Gly 85 90 95Ile
Tyr Gly Pro Gly Val Leu Gly Pro Gly Val Leu Gly Pro Gly Ser
100 105 110Ser Ala Ala Ala Ala Ala Ala
Ala Gly Ser Tyr Gly Ser Gly Pro Gly 115 120
125Val Leu Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Ala Ala Gly
Pro 130 135 140Gly Ser Gly Ile Tyr Gly
Ile Gly Pro Tyr Gly Pro Gly Ala Ser Gly145 150
155 160Pro Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro
Ser Ala Ser Ala Ala 165 170
175Ala Ala Ala Ala Ala Gly Ser Gly Val Leu Gly Pro Gly Ile Tyr Gly
180 185 190Pro Tyr Ala Ser Ala Ala
Ala Ala Ala Ala Ala Gly Ser Tyr Gly Ser 195 200
205Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ile Ser Gly
Ser Gly 210 215 220Val Leu Gly Pro Gly
Val Leu Gly Pro Tyr Ala Ser Ala Ala Ala Ala225 230
235 240Ala Ala Ala Gly Pro Gly Val Leu Gly Pro
Tyr Gly Pro Gly Ser Ser 245 250
255Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Tyr Gly Pro Gly Val
260 265 270Leu Gly Pro Tyr Gly
Pro Gly Ala Ser Gly Ile Asn Gly Pro Gly Ser 275
280 285Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro Gly Pro
Ser Ala Ala Ala 290 295 300Ala Ala Ala
Ala Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala305
310 315 320Ser Ala Ala Ala Ala Ala Ala
Ala Gly Ser Tyr Gly Pro Gly Val Leu 325
330 335Gly Pro Gly Ile Tyr Gly Pro Gly Ser Ser Gly Pro
Gly Val Leu Gly 340 345 350Pro
Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser 355
360 365Tyr Gly Pro Gly Val Leu Gly Pro Tyr
Gly Pro Gly Pro Ser Ala Ala 370 375
380Ala Ala Ala Ala Ala Gly Ser Tyr Val Leu Gly Pro Gly Val Leu Gly385
390 395 400Pro Tyr Gly Pro
Gly Ala Ser Gly Pro Gly Val Leu Gly Pro Tyr Gly 405
410 415Pro Gly Ala Ser Ala Ala Ala Ala Ala Ala
Ala Gly Pro Gly Ile Tyr 420 425
430Gly Pro Gly Val Leu Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Ala
435 440 445Ala Gly Ser Tyr Gly Ser Gly
Pro Gly Ile Tyr Gly Pro Tyr Gly Pro 450 455
460Gly Ile Ser Gly Pro Gly Ser Gly Val Leu Gly Ile Gly Pro Tyr
Gly465 470 475 480Pro Gly
Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro
485 490 495Gly Val Leu Gly Pro Tyr Gly
Pro Gly Pro Ser Ala Ala Ala Ala Ala 500 505
510Ala Ala Gly Pro Gly Ser Gly Ile Tyr Gly Pro Gly Ala Ser
Gly Ile 515 520 525Asn Gly Pro Gly
Ser Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro Gly 530
535 540Pro Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly
Ser Gly Val Leu545 550 555
560Gly Pro Gly Ala Ser 565321179PRTArtificial
SequenceMet-PRT966 32Met Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala1 5 10 15Ala
Ala Ala Gly Ile Asn Gly Pro Gly Ser Gly Val Phe Gly Pro Gly 20
25 30Ile Ser Gly Ile Tyr Gly Pro Gly
Val Phe Gly Pro Gly Val Phe Gly 35 40
45Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Pro Gly Ile Tyr Gly Pro
50 55 60Gly Val Phe Gly Pro Ser Ala Ser
Ala Ala Ala Ala Ala Gly Pro Gly65 70 75
80Ser Gly Val Phe Gly Pro Gly Ala Ser Gly Ile Tyr Gly
Pro Gly Val 85 90 95Phe
Gly Pro Gly Val Phe Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala
100 105 110Gly Ile Tyr Gly Ser Gly Pro
Gly Val Phe Gly Pro Tyr Gly Ser Ala 115 120
125Ala Ala Ala Ala Gly Pro Gly Ser Gly Ile Tyr Gly Ile Gly Pro
Tyr 130 135 140Gly Pro Gly Ala Ser Gly
Pro Gly Ile Tyr Gly Pro Gly Val Phe Gly145 150
155 160Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Ser
Gly Val Phe Gly Pro 165 170
175Gly Ile Tyr Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr
180 185 190Gly Ser Gly Pro Gly Val
Phe Gly Pro Tyr Gly Pro Gly Ile Ser Gly 195 200
205Ser Gly Val Phe Gly Pro Gly Val Phe Gly Pro Tyr Ala Ser
Ala Ala 210 215 220Ala Ala Ala Gly Pro
Gly Val Phe Gly Pro Tyr Gly Pro Gly Ser Ser225 230
235 240Ala Ala Ala Ala Ala Gly Ile Tyr Gly Tyr
Gly Pro Gly Val Phe Gly 245 250
255Pro Tyr Gly Pro Gly Ala Ser Gly Ile Asn Gly Pro Gly Ser Gly Ile
260 265 270Tyr Gly Pro Gly Val
Phe Gly Pro Gly Ile Ser Ala Ala Ala Ala Ala 275
280 285Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala Ala 290 295 300Ala Ala Gly
Ile Tyr Gly Pro Gly Val Phe Gly Pro Gly Ile Tyr Gly305
310 315 320Pro Gly Ser Ser Gly Pro Gly
Val Phe Gly Pro Tyr Gly Pro Gly Ser 325
330 335Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Pro Gly
Val Phe Gly Pro 340 345 350Tyr
Gly Pro Gly Ile Ser Ala Ala Ala Ala Ala Gly Ile Tyr Val Phe 355
360 365Gly Pro Gly Val Phe Gly Pro Tyr Gly
Pro Gly Ala Ser Gly Pro Gly 370 375
380Val Phe Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly385
390 395 400Pro Gly Ile Tyr
Gly Pro Gly Val Phe Gly Pro Ser Ala Ser Ala Ala 405
410 415Ala Ala Ala Gly Ile Tyr Gly Ser Gly Pro
Gly Ile Tyr Gly Pro Tyr 420 425
430Gly Pro Gly Ile Ser Gly Pro Gly Ser Gly Val Phe Gly Ile Gly Pro
435 440 445Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala Gly Ile Tyr Gly Pro 450 455
460Gly Val Phe Gly Pro Tyr Gly Pro Gly Ile Ser Ala Ala Ala Ala
Ala465 470 475 480Gly Pro
Gly Ser Gly Ile Tyr Gly Pro Gly Ala Ser Gly Ile Asn Gly
485 490 495Pro Gly Ser Gly Ile Tyr Gly
Pro Gly Val Phe Gly Pro Gly Ile Ser 500 505
510Ala Ala Ala Ala Ala Gly Ile Tyr Val Phe Gly Pro Gly Val
Phe Gly 515 520 525Pro Tyr Gly Pro
Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly 530
535 540Ser Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly
Ile Ser Gly Ser545 550 555
560Gly Val Phe Gly Pro Gly Val Phe Gly Pro Tyr Ala Ser Ala Ala Ala
565 570 575Ala Ala Gly Pro Gly
Ser Gly Val Phe Gly Pro Gly Ala Ser Gly Pro 580
585 590Gly Val Phe Gly Pro Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala 595 600 605Gly Ile
Asn Gly Pro Gly Ser Gly Val Phe Gly Pro Gly Ile Ser Gly 610
615 620Ile Tyr Gly Pro Gly Val Phe Gly Pro Gly Val
Phe Gly Pro Gly Ser625 630 635
640Ser Ala Ala Ala Ala Ala Gly Pro Gly Ile Tyr Gly Pro Gly Val Phe
645 650 655Gly Pro Ser Ala
Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Val 660
665 670Phe Gly Pro Gly Ala Ser Gly Ile Tyr Gly Pro
Gly Val Phe Gly Pro 675 680 685Gly
Val Phe Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Ile Tyr 690
695 700Gly Ser Gly Pro Gly Val Phe Gly Pro Tyr
Gly Ser Ala Ala Ala Ala705 710 715
720Ala Gly Pro Gly Ser Gly Ile Tyr Gly Ile Gly Pro Tyr Gly Pro
Gly 725 730 735Ala Ser Gly
Pro Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Ser Ala 740
745 750Ser Ala Ala Ala Ala Ala Gly Ser Gly Val
Phe Gly Pro Gly Ile Tyr 755 760
765Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Ser Gly 770
775 780Pro Gly Val Phe Gly Pro Tyr Gly
Pro Gly Ile Ser Gly Ser Gly Val785 790
795 800Phe Gly Pro Gly Val Phe Gly Pro Tyr Ala Ser Ala
Ala Ala Ala Ala 805 810
815Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala
820 825 830Ala Ala Gly Ile Tyr Gly
Tyr Gly Pro Gly Val Phe Gly Pro Tyr Gly 835 840
845Pro Gly Ala Ser Gly Ile Asn Gly Pro Gly Ser Gly Ile Tyr
Gly Pro 850 855 860Gly Val Phe Gly Pro
Gly Ile Ser Ala Ala Ala Ala Ala Gly Pro Gly865 870
875 880Val Phe Gly Pro Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala Ala Gly 885 890
895Ile Tyr Gly Pro Gly Val Phe Gly Pro Gly Ile Tyr Gly Pro Gly Ser
900 905 910Ser Gly Pro Gly Val
Phe Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala 915
920 925Ala Ala Ala Gly Ile Tyr Gly Pro Gly Val Phe Gly
Pro Tyr Gly Pro 930 935 940Gly Ile Ser
Ala Ala Ala Ala Ala Gly Ile Tyr Val Phe Gly Pro Gly945
950 955 960Val Phe Gly Pro Tyr Gly Pro
Gly Ala Ser Gly Pro Gly Val Phe Gly 965
970 975Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
Gly Pro Gly Ile 980 985 990Tyr
Gly Pro Gly Val Phe Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala 995
1000 1005Gly Ile Tyr Gly Ser Gly Pro Gly
Ile Tyr Gly Pro Tyr Gly Pro 1010 1015
1020Gly Ile Ser Gly Pro Gly Ser Gly Val Phe Gly Ile Gly Pro Tyr
1025 1030 1035Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala Gly Ile Tyr Gly Pro 1040 1045
1050Gly Val Phe Gly Pro Tyr Gly Pro Gly Ile Ser Ala Ala Ala
Ala 1055 1060 1065Ala Gly Pro Gly Ser
Gly Ile Tyr Gly Pro Gly Ala Ser Gly Ile 1070 1075
1080Asn Gly Pro Gly Ser Gly Ile Tyr Gly Pro Gly Val Phe
Gly Pro 1085 1090 1095Gly Ile Ser Ala
Ala Ala Ala Ala Gly Ile Tyr Val Phe Gly Pro 1100
1105 1110Gly Val Phe Gly Pro Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala 1115 1120 1125Ala Gly
Ile Tyr Gly Ser Gly Pro Gly Val Phe Gly Pro Tyr Gly 1130
1135 1140Pro Gly Ile Ser Gly Ser Gly Val Phe Gly
Pro Gly Val Phe Gly 1145 1150 1155Pro
Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Val 1160
1165 1170Phe Gly Pro Gly Ala Ser
117533601PRTArtificial SequencePRT888 33Met His His His His His His Ser
Ser Gly Ser Ser Gly Pro Gly Val1 5 10
15Leu Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
Gly Gln 20 25 30Asn Gly Pro
Gly Ser Gly Val Leu Gly Pro Gly Gln Ser Gly Gln Tyr 35
40 45Gly Pro Gly Val Leu Gly Pro Gly Val Leu Gly
Pro Gly Ser Ser Ala 50 55 60Ala Ala
Ala Ala Gly Pro Gly Gln Tyr Gly Pro Gly Val Leu Gly Pro65
70 75 80Ser Ala Ser Ala Ala Ala Ala
Ala Gly Pro Gly Ser Gly Val Leu Gly 85 90
95Pro Gly Ala Ser Gly Gln Tyr Gly Pro Gly Val Leu Gly
Pro Gly Val 100 105 110Leu Gly
Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser 115
120 125Gly Pro Gly Val Leu Gly Pro Tyr Gly Ser
Ala Ala Ala Ala Ala Gly 130 135 140Pro
Gly Ser Gly Gln Tyr Gly Gln Gly Pro Tyr Gly Pro Gly Ala Ser145
150 155 160Gly Pro Gly Gln Tyr Gly
Pro Gly Val Leu Gly Pro Ser Ala Ser Ala 165
170 175Ala Ala Ala Ala Gly Ser Gly Val Leu Gly Pro Gly
Gln Tyr Gly Pro 180 185 190Tyr
Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Ser Gly Pro Gly 195
200 205Val Leu Gly Pro Tyr Gly Pro Gly Gln
Ser Gly Ser Gly Val Leu Gly 210 215
220Pro Gly Val Leu Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro225
230 235 240Gly Val Leu Gly
Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala 245
250 255Gly Gln Tyr Gly Tyr Gly Pro Gly Val Leu
Gly Pro Tyr Gly Pro Gly 260 265
270Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro Gly Val
275 280 285Leu Gly Pro Gly Gln Ser Ala
Ala Ala Ala Ala Gly Pro Gly Val Leu 290 295
300Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Gln
Tyr305 310 315 320Gly Pro
Gly Val Leu Gly Pro Gly Gln Tyr Gly Pro Gly Ser Ser Gly
325 330 335Pro Gly Val Leu Gly Pro Tyr
Gly Pro Gly Ser Ser Ala Ala Ala Ala 340 345
350Ala Gly Gln Tyr Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro
Gly Gln 355 360 365Ser Ala Ala Ala
Ala Ala Gly Gln Tyr Val Leu Gly Pro Gly Val Leu 370
375 380Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Val
Leu Gly Pro Tyr385 390 395
400Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Gly
405 410 415Pro Gly Val Leu Gly
Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Gln 420
425 430Tyr Gly Ser Gly Pro Gly Gln Tyr Gly Pro Tyr Gly
Pro Gly Gln Ser 435 440 445Gly Pro
Gly Ser Gly Val Leu Gly Gln Gly Pro Tyr Gly Pro Gly Ala 450
455 460Ser Ala Ala Ala Ala Ala Gly Gln Tyr Gly Pro
Gly Val Leu Gly Pro465 470 475
480Tyr Gly Pro Gly Gln Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly
485 490 495Gln Tyr Gly Pro
Gly Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln 500
505 510Tyr Gly Pro Gly Val Leu Gly Pro Gly Gln Ser
Ala Ala Ala Ala Ala 515 520 525Gly
Gln Tyr Val Leu Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly 530
535 540Ala Ser Ala Ala Ala Ala Ala Gly Gln Tyr
Gly Ser Gly Pro Gly Val545 550 555
560Leu Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser Gly Val Leu Gly
Pro 565 570 575Gly Val Leu
Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly 580
585 590Ser Gly Val Leu Gly Pro Gly Ala Ser
595 60034601PRTArtificial SequencePRT965 34Met His His
His His His His Ser Ser Gly Ser Ser Gly Pro Gly Thr1 5
10 15Ser Gly Pro Tyr Gly Pro Gly Ala Ser
Ala Ala Ala Ala Ala Gly Ala 20 25
30Asn Gly Pro Gly Ser Gly Thr Ser Gly Pro Gly Ala Ser Gly Ala Tyr
35 40 45Gly Pro Gly Thr Ser Gly Pro
Gly Thr Ser Gly Pro Gly Ser Ser Ala 50 55
60Ala Ala Ala Ala Gly Pro Gly Ala Tyr Gly Pro Gly Thr Ser Gly Pro65
70 75 80Ser Ala Ser Ala
Ala Ala Ala Ala Gly Pro Gly Ser Gly Thr Ser Gly 85
90 95Pro Gly Ala Ser Gly Ala Tyr Gly Pro Gly
Thr Ser Gly Pro Gly Thr 100 105
110Ser Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Ala Tyr Gly Ser
115 120 125Gly Pro Gly Thr Ser Gly Pro
Tyr Gly Ser Ala Ala Ala Ala Ala Gly 130 135
140Pro Gly Ser Gly Ala Tyr Gly Ala Gly Pro Tyr Gly Pro Gly Ala
Ser145 150 155 160Gly Pro
Gly Ala Tyr Gly Pro Gly Thr Ser Gly Pro Ser Ala Ser Ala
165 170 175Ala Ala Ala Ala Gly Ser Gly
Thr Ser Gly Pro Gly Ala Tyr Gly Pro 180 185
190Tyr Ala Ser Ala Ala Ala Ala Ala Gly Ala Tyr Gly Ser Gly
Pro Gly 195 200 205Thr Ser Gly Pro
Tyr Gly Pro Gly Ala Ser Gly Ser Gly Thr Ser Gly 210
215 220Pro Gly Thr Ser Gly Pro Tyr Ala Ser Ala Ala Ala
Ala Ala Gly Pro225 230 235
240Gly Thr Ser Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala
245 250 255Gly Ala Tyr Gly Tyr
Gly Pro Gly Thr Ser Gly Pro Tyr Gly Pro Gly 260
265 270Ala Ser Gly Ala Asn Gly Pro Gly Ser Gly Ala Tyr
Gly Pro Gly Thr 275 280 285Ser Gly
Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Thr Ser 290
295 300Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala
Ala Ala Gly Ala Tyr305 310 315
320Gly Pro Gly Thr Ser Gly Pro Gly Ala Tyr Gly Pro Gly Ser Ser Gly
325 330 335Pro Gly Thr Ser
Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala 340
345 350Ala Gly Ala Tyr Gly Pro Gly Thr Ser Gly Pro
Tyr Gly Pro Gly Ala 355 360 365Ser
Ala Ala Ala Ala Ala Gly Ala Tyr Thr Ser Gly Pro Gly Thr Ser 370
375 380Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro
Gly Thr Ser Gly Pro Tyr385 390 395
400Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ala Tyr
Gly 405 410 415Pro Gly Thr
Ser Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Ala 420
425 430Tyr Gly Ser Gly Pro Gly Ala Tyr Gly Pro
Tyr Gly Pro Gly Ala Ser 435 440
445Gly Pro Gly Ser Gly Thr Ser Gly Ala Gly Pro Tyr Gly Pro Gly Ala 450
455 460Ser Ala Ala Ala Ala Ala Gly Ala
Tyr Gly Pro Gly Thr Ser Gly Pro465 470
475 480Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly
Pro Gly Ser Gly 485 490
495Ala Tyr Gly Pro Gly Ala Ser Gly Ala Asn Gly Pro Gly Ser Gly Ala
500 505 510Tyr Gly Pro Gly Thr Ser
Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala 515 520
525Gly Ala Tyr Thr Ser Gly Pro Gly Thr Ser Gly Pro Tyr Gly
Pro Gly 530 535 540Ala Ser Ala Ala Ala
Ala Ala Gly Ala Tyr Gly Ser Gly Pro Gly Thr545 550
555 560Ser Gly Pro Tyr Gly Pro Gly Ala Ser Gly
Ser Gly Thr Ser Gly Pro 565 570
575Gly Thr Ser Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly
580 585 590Ser Gly Thr Ser Gly
Pro Gly Ala Ser 595 60035601PRTArtificial
SequencePRT889 35Met His His His His His His Ser Ser Gly Ser Ser Gly Pro
Gly Val1 5 10 15Leu Gly
Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile 20
25 30Asn Gly Pro Gly Ser Gly Val Leu Gly
Pro Gly Ile Ser Gly Ile Tyr 35 40
45Gly Pro Gly Val Leu Gly Pro Gly Val Leu Gly Pro Gly Ser Ser Ala 50
55 60Ala Ala Ala Ala Gly Pro Gly Ile Tyr
Gly Pro Gly Val Leu Gly Pro65 70 75
80Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Val
Leu Gly 85 90 95Pro Gly
Ala Ser Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro Gly Val 100
105 110Leu Gly Pro Gly Ser Ser Ala Ala Ala
Ala Ala Gly Ile Tyr Gly Ser 115 120
125Gly Pro Gly Val Leu Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Gly
130 135 140Pro Gly Ser Gly Ile Tyr Gly
Ile Gly Pro Tyr Gly Pro Gly Ala Ser145 150
155 160Gly Pro Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro
Ser Ala Ser Ala 165 170
175Ala Ala Ala Ala Gly Ser Gly Val Leu Gly Pro Gly Ile Tyr Gly Pro
180 185 190Tyr Ala Ser Ala Ala Ala
Ala Ala Gly Ile Tyr Gly Ser Gly Pro Gly 195 200
205Val Leu Gly Pro Tyr Gly Pro Gly Ile Ser Gly Ser Gly Val
Leu Gly 210 215 220Pro Gly Val Leu Gly
Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro225 230
235 240Gly Val Leu Gly Pro Tyr Gly Pro Gly Ser
Ser Ala Ala Ala Ala Ala 245 250
255Gly Ile Tyr Gly Tyr Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly
260 265 270Ala Ser Gly Ile Asn
Gly Pro Gly Ser Gly Ile Tyr Gly Pro Gly Val 275
280 285Leu Gly Pro Gly Ile Ser Ala Ala Ala Ala Ala Gly
Pro Gly Val Leu 290 295 300Gly Pro Tyr
Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr305
310 315 320Gly Pro Gly Val Leu Gly Pro
Gly Ile Tyr Gly Pro Gly Ser Ser Gly 325
330 335Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ser Ser
Ala Ala Ala Ala 340 345 350Ala
Gly Ile Tyr Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ile 355
360 365Ser Ala Ala Ala Ala Ala Gly Ile Tyr
Val Leu Gly Pro Gly Val Leu 370 375
380Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Val Leu Gly Pro Tyr385
390 395 400Gly Pro Gly Ala
Ser Ala Ala Ala Ala Ala Gly Pro Gly Ile Tyr Gly 405
410 415Pro Gly Val Leu Gly Pro Ser Ala Ser Ala
Ala Ala Ala Ala Gly Ile 420 425
430Tyr Gly Ser Gly Pro Gly Ile Tyr Gly Pro Tyr Gly Pro Gly Ile Ser
435 440 445Gly Pro Gly Ser Gly Val Leu
Gly Ile Gly Pro Tyr Gly Pro Gly Ala 450 455
460Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Pro Gly Val Leu Gly
Pro465 470 475 480Tyr Gly
Pro Gly Ile Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly
485 490 495Ile Tyr Gly Pro Gly Ala Ser
Gly Ile Asn Gly Pro Gly Ser Gly Ile 500 505
510Tyr Gly Pro Gly Val Leu Gly Pro Gly Ile Ser Ala Ala Ala
Ala Ala 515 520 525Gly Ile Tyr Val
Leu Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly 530
535 540Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Ser
Gly Pro Gly Val545 550 555
560Leu Gly Pro Tyr Gly Pro Gly Ile Ser Gly Ser Gly Val Leu Gly Pro
565 570 575Gly Val Leu Gly Pro
Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly 580
585 590Ser Gly Val Leu Gly Pro Gly Ala Ser 595
60036601PRTArtificial SequencePRT916 36Met His His His His
His His Ser Ser Gly Ser Ser Gly Pro Gly Val1 5
10 15Ile Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala
Ala Ala Ala Gly Leu 20 25
30Asn Gly Pro Gly Ser Gly Val Ile Gly Pro Gly Leu Ser Gly Leu Tyr
35 40 45Gly Pro Gly Val Ile Gly Pro Gly
Val Ile Gly Pro Gly Ser Ser Ala 50 55
60Ala Ala Ala Ala Gly Pro Gly Leu Tyr Gly Pro Gly Val Ile Gly Pro65
70 75 80Ser Ala Ser Ala Ala
Ala Ala Ala Gly Pro Gly Ser Gly Val Ile Gly 85
90 95Pro Gly Ala Ser Gly Leu Tyr Gly Pro Gly Val
Ile Gly Pro Gly Val 100 105
110Ile Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Leu Tyr Gly Ser
115 120 125Gly Pro Gly Val Ile Gly Pro
Tyr Gly Ser Ala Ala Ala Ala Ala Gly 130 135
140Pro Gly Ser Gly Leu Tyr Gly Leu Gly Pro Tyr Gly Pro Gly Ala
Ser145 150 155 160Gly Pro
Gly Leu Tyr Gly Pro Gly Val Ile Gly Pro Ser Ala Ser Ala
165 170 175Ala Ala Ala Ala Gly Ser Gly
Val Ile Gly Pro Gly Leu Tyr Gly Pro 180 185
190Tyr Ala Ser Ala Ala Ala Ala Ala Gly Leu Tyr Gly Ser Gly
Pro Gly 195 200 205Val Ile Gly Pro
Tyr Gly Pro Gly Leu Ser Gly Ser Gly Val Ile Gly 210
215 220Pro Gly Val Ile Gly Pro Tyr Ala Ser Ala Ala Ala
Ala Ala Gly Pro225 230 235
240Gly Val Ile Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala
245 250 255Gly Leu Tyr Gly Tyr
Gly Pro Gly Val Ile Gly Pro Tyr Gly Pro Gly 260
265 270Ala Ser Gly Leu Asn Gly Pro Gly Ser Gly Leu Tyr
Gly Pro Gly Val 275 280 285Ile Gly
Pro Gly Leu Ser Ala Ala Ala Ala Ala Gly Pro Gly Val Ile 290
295 300Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala
Ala Ala Gly Leu Tyr305 310 315
320Gly Pro Gly Val Ile Gly Pro Gly Leu Tyr Gly Pro Gly Ser Ser Gly
325 330 335Pro Gly Val Ile
Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala 340
345 350Ala Gly Leu Tyr Gly Pro Gly Val Ile Gly Pro
Tyr Gly Pro Gly Leu 355 360 365Ser
Ala Ala Ala Ala Ala Gly Leu Tyr Val Ile Gly Pro Gly Val Ile 370
375 380Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro
Gly Val Ile Gly Pro Tyr385 390 395
400Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Leu Tyr
Gly 405 410 415Pro Gly Val
Ile Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Leu 420
425 430Tyr Gly Ser Gly Pro Gly Leu Tyr Gly Pro
Tyr Gly Pro Gly Leu Ser 435 440
445Gly Pro Gly Ser Gly Val Ile Gly Leu Gly Pro Tyr Gly Pro Gly Ala 450
455 460Ser Ala Ala Ala Ala Ala Gly Leu
Tyr Gly Pro Gly Val Ile Gly Pro465 470
475 480Tyr Gly Pro Gly Leu Ser Ala Ala Ala Ala Ala Gly
Pro Gly Ser Gly 485 490
495Leu Tyr Gly Pro Gly Ala Ser Gly Leu Asn Gly Pro Gly Ser Gly Leu
500 505 510Tyr Gly Pro Gly Val Ile
Gly Pro Gly Leu Ser Ala Ala Ala Ala Ala 515 520
525Gly Leu Tyr Val Ile Gly Pro Gly Val Ile Gly Pro Tyr Gly
Pro Gly 530 535 540Ala Ser Ala Ala Ala
Ala Ala Gly Leu Tyr Gly Ser Gly Pro Gly Val545 550
555 560Ile Gly Pro Tyr Gly Pro Gly Leu Ser Gly
Ser Gly Val Ile Gly Pro 565 570
575Gly Val Ile Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly
580 585 590Ser Gly Val Ile Gly
Pro Gly Ala Ser 595 60037601PRTArtificial
SequencePRT918 37Met His His His His His His Ser Ser Gly Ser Ser Gly Pro
Gly Val1 5 10 15Phe Gly
Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile 20
25 30Asn Gly Pro Gly Ser Gly Val Phe Gly
Pro Gly Ile Ser Gly Ile Tyr 35 40
45Gly Pro Gly Val Phe Gly Pro Gly Val Phe Gly Pro Gly Ser Ser Ala 50
55 60Ala Ala Ala Ala Gly Pro Gly Ile Tyr
Gly Pro Gly Val Phe Gly Pro65 70 75
80Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Val
Phe Gly 85 90 95Pro Gly
Ala Ser Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Gly Val 100
105 110Phe Gly Pro Gly Ser Ser Ala Ala Ala
Ala Ala Gly Ile Tyr Gly Ser 115 120
125Gly Pro Gly Val Phe Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Gly
130 135 140Pro Gly Ser Gly Ile Tyr Gly
Ile Gly Pro Tyr Gly Pro Gly Ala Ser145 150
155 160Gly Pro Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro
Ser Ala Ser Ala 165 170
175Ala Ala Ala Ala Gly Ser Gly Val Phe Gly Pro Gly Ile Tyr Gly Pro
180 185 190Tyr Ala Ser Ala Ala Ala
Ala Ala Gly Ile Tyr Gly Ser Gly Pro Gly 195 200
205Val Phe Gly Pro Tyr Gly Pro Gly Ile Ser Gly Ser Gly Val
Phe Gly 210 215 220Pro Gly Val Phe Gly
Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro225 230
235 240Gly Val Phe Gly Pro Tyr Gly Pro Gly Ser
Ser Ala Ala Ala Ala Ala 245 250
255Gly Ile Tyr Gly Tyr Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly
260 265 270Ala Ser Gly Ile Asn
Gly Pro Gly Ser Gly Ile Tyr Gly Pro Gly Val 275
280 285Phe Gly Pro Gly Ile Ser Ala Ala Ala Ala Ala Gly
Pro Gly Val Phe 290 295 300Gly Pro Tyr
Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr305
310 315 320Gly Pro Gly Val Phe Gly Pro
Gly Ile Tyr Gly Pro Gly Ser Ser Gly 325
330 335Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly Ser Ser
Ala Ala Ala Ala 340 345 350Ala
Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly Ile 355
360 365Ser Ala Ala Ala Ala Ala Gly Ile Tyr
Val Phe Gly Pro Gly Val Phe 370 375
380Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Val Phe Gly Pro Tyr385
390 395 400Gly Pro Gly Ala
Ser Ala Ala Ala Ala Ala Gly Pro Gly Ile Tyr Gly 405
410 415Pro Gly Val Phe Gly Pro Ser Ala Ser Ala
Ala Ala Ala Ala Gly Ile 420 425
430Tyr Gly Ser Gly Pro Gly Ile Tyr Gly Pro Tyr Gly Pro Gly Ile Ser
435 440 445Gly Pro Gly Ser Gly Val Phe
Gly Ile Gly Pro Tyr Gly Pro Gly Ala 450 455
460Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Pro Gly Val Phe Gly
Pro465 470 475 480Tyr Gly
Pro Gly Ile Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly
485 490 495Ile Tyr Gly Pro Gly Ala Ser
Gly Ile Asn Gly Pro Gly Ser Gly Ile 500 505
510Tyr Gly Pro Gly Val Phe Gly Pro Gly Ile Ser Ala Ala Ala
Ala Ala 515 520 525Gly Ile Tyr Val
Phe Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly 530
535 540Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Ser
Gly Pro Gly Val545 550 555
560Phe Gly Pro Tyr Gly Pro Gly Ile Ser Gly Ser Gly Val Phe Gly Pro
565 570 575Gly Val Phe Gly Pro
Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly 580
585 590Ser Gly Val Phe Gly Pro Gly Ala Ser 595
60038576PRTArtificial SequencePRT699 38Met His His His His
His His Ser Ser Gly Ser Ser Gly Pro Gly Val1 5
10 15Leu Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala
Ala Ala Ala Ala Ala 20 25
30Gly Ser Asn Gly Pro Gly Ser Gly Val Leu Gly Pro Gly Gln Ser Gly
35 40 45Gln Tyr Gly Pro Gly Val Leu Gly
Pro Gly Val Leu Gly Pro Gly Ser 50 55
60Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln Tyr Gly Pro Gly65
70 75 80Val Leu Gly Pro Ser
Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro 85
90 95Gly Ser Gly Val Leu Gly Pro Gly Ala Ser Gly
Gln Tyr Gly Pro Gly 100 105
110Val Leu Gly Pro Gly Val Leu Gly Pro Gly Ser Ser Ala Ala Ala Ala
115 120 125Ala Ala Ala Gly Ser Tyr Gly
Ser Gly Pro Gly Val Leu Gly Pro Tyr 130 135
140Gly Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Gln
Tyr145 150 155 160Gly Gln
Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Gln Tyr Gly
165 170 175Pro Gly Val Leu Gly Pro Ser
Ala Ser Ala Ala Ala Ala Ala Ala Ala 180 185
190Gly Ser Gly Val Leu Gly Pro Gly Gln Tyr Gly Pro Tyr Ala
Ser Ala 195 200 205Ala Ala Ala Ala
Ala Ala Gly Ser Tyr Gly Ser Gly Pro Gly Val Leu 210
215 220Gly Pro Tyr Gly Pro Gly Gln Ser Gly Ser Gly Val
Leu Gly Pro Gly225 230 235
240Val Leu Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro
245 250 255Gly Val Leu Gly Pro
Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala 260
265 270Ala Ala Gly Ser Tyr Gly Tyr Gly Pro Gly Val Leu
Gly Pro Tyr Gly 275 280 285Pro Gly
Ala Ser Gly Gln Asn Gly Pro Gly Ser Gly Gln Tyr Gly Pro 290
295 300Gly Val Leu Gly Pro Gly Pro Ser Ala Ala Ala
Ala Ala Ala Ala Gly305 310 315
320Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala
325 330 335Ala Ala Ala Gly
Ser Tyr Gly Pro Gly Val Leu Gly Pro Gly Gln Tyr 340
345 350Gly Pro Gly Ser Ser Gly Pro Gly Val Leu Gly
Pro Tyr Gly Pro Gly 355 360 365Ser
Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro Gly Val 370
375 380Leu Gly Pro Tyr Gly Pro Gly Pro Ser Ala
Ala Ala Ala Ala Ala Ala385 390 395
400Gly Ser Tyr Val Leu Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro
Gly 405 410 415Ala Ser Gly
Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala Ser Ala 420
425 430Ala Ala Ala Ala Ala Ala Gly Pro Gly Gln
Tyr Gly Pro Gly Val Leu 435 440
445Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly 450
455 460Ser Gly Pro Gly Gln Tyr Gly Pro
Tyr Gly Pro Gly Gln Ser Gly Pro465 470
475 480Gly Ser Gly Val Leu Gly Gln Gly Pro Tyr Gly Pro
Gly Ala Ser Ala 485 490
495Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro Gly Val Leu Gly Pro
500 505 510Tyr Gly Pro Gly Pro Ser
Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly 515 520
525Ser Gly Gln Tyr Gly Pro Gly Ala Ser Gly Gln Asn Gly Pro
Gly Ser 530 535 540Gly Gln Tyr Gly Pro
Gly Val Leu Gly Pro Gly Pro Ser Ala Ala Ala545 550
555 560Ala Ala Ala Ala Gly Pro Gly Ser Gly Val
Leu Gly Pro Gly Ala Ser 565 570
57539576PRTArtificial SequencePRT698 39Met His His His His His His
Ser Ser Gly Ser Ser Gly Pro Gly Val1 5 10
15Leu Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala
Ala Ala Ala 20 25 30Gly Ser
Asn Gly Pro Gly Ser Gly Val Leu Gly Pro Gly Ile Ser Gly 35
40 45Ile Tyr Gly Pro Gly Val Leu Gly Pro Gly
Val Leu Gly Pro Gly Ser 50 55 60Ser
Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Ile Tyr Gly Pro Gly65
70 75 80Val Leu Gly Pro Ser Ala
Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro 85
90 95Gly Ser Gly Val Leu Gly Pro Gly Ala Ser Gly Ile
Tyr Gly Pro Gly 100 105 110Val
Leu Gly Pro Gly Val Leu Gly Pro Gly Ser Ser Ala Ala Ala Ala 115
120 125Ala Ala Ala Gly Ser Tyr Gly Ser Gly
Pro Gly Val Leu Gly Pro Tyr 130 135
140Gly Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Ile Tyr145
150 155 160Gly Ile Gly Pro
Tyr Gly Pro Gly Ala Ser Gly Pro Gly Ile Tyr Gly 165
170 175Pro Gly Val Leu Gly Pro Ser Ala Ser Ala
Ala Ala Ala Ala Ala Ala 180 185
190Gly Ser Gly Val Leu Gly Pro Gly Ile Tyr Gly Pro Tyr Ala Ser Ala
195 200 205Ala Ala Ala Ala Ala Ala Gly
Ser Tyr Gly Ser Gly Pro Gly Val Leu 210 215
220Gly Pro Tyr Gly Pro Gly Ile Ser Gly Ser Gly Val Leu Gly Pro
Gly225 230 235 240Val Leu
Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Pro
245 250 255Gly Val Leu Gly Pro Tyr Gly
Pro Gly Ser Ser Ala Ala Ala Ala Ala 260 265
270Ala Ala Gly Ser Tyr Gly Tyr Gly Pro Gly Val Leu Gly Pro
Tyr Gly 275 280 285Pro Gly Ala Ser
Gly Ile Asn Gly Pro Gly Ser Gly Ile Tyr Gly Pro 290
295 300Gly Val Leu Gly Pro Gly Pro Ser Ala Ala Ala Ala
Ala Ala Ala Gly305 310 315
320Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala
325 330 335Ala Ala Ala Gly Ser
Tyr Gly Pro Gly Val Leu Gly Pro Gly Ile Tyr 340
345 350Gly Pro Gly Ser Ser Gly Pro Gly Val Leu Gly Pro
Tyr Gly Pro Gly 355 360 365Ser Ser
Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly Pro Gly Val 370
375 380Leu Gly Pro Tyr Gly Pro Gly Pro Ser Ala Ala
Ala Ala Ala Ala Ala385 390 395
400Gly Ser Tyr Val Leu Gly Pro Gly Val Leu Gly Pro Tyr Gly Pro Gly
405 410 415Ala Ser Gly Pro
Gly Val Leu Gly Pro Tyr Gly Pro Gly Ala Ser Ala 420
425 430Ala Ala Ala Ala Ala Ala Gly Pro Gly Ile Tyr
Gly Pro Gly Val Leu 435 440 445Gly
Pro Ser Ala Ser Ala Ala Ala Ala Ala Ala Ala Gly Ser Tyr Gly 450
455 460Ser Gly Pro Gly Ile Tyr Gly Pro Tyr Gly
Pro Gly Ile Ser Gly Pro465 470 475
480Gly Ser Gly Val Leu Gly Ile Gly Pro Tyr Gly Pro Gly Ala Ser
Ala 485 490 495Ala Ala Ala
Ala Ala Ala Gly Ser Tyr Gly Pro Gly Val Leu Gly Pro 500
505 510Tyr Gly Pro Gly Pro Ser Ala Ala Ala Ala
Ala Ala Ala Gly Pro Gly 515 520
525Ser Gly Ile Tyr Gly Pro Gly Ala Ser Gly Ile Asn Gly Pro Gly Ser 530
535 540Gly Ile Tyr Gly Pro Gly Val Leu
Gly Pro Gly Pro Ser Ala Ala Ala545 550
555 560Ala Ala Ala Ala Gly Pro Gly Ser Gly Val Leu Gly
Pro Gly Ala Ser 565 570
575401190PRTArtificial SequencePRT966 40Met His His His His His His Ser
Ser Gly Ser Ser Gly Pro Gly Val1 5 10
15Phe Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
Gly Ile 20 25 30Asn Gly Pro
Gly Ser Gly Val Phe Gly Pro Gly Ile Ser Gly Ile Tyr 35
40 45Gly Pro Gly Val Phe Gly Pro Gly Val Phe Gly
Pro Gly Ser Ser Ala 50 55 60Ala Ala
Ala Ala Gly Pro Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro65
70 75 80Ser Ala Ser Ala Ala Ala Ala
Ala Gly Pro Gly Ser Gly Val Phe Gly 85 90
95Pro Gly Ala Ser Gly Ile Tyr Gly Pro Gly Val Phe Gly
Pro Gly Val 100 105 110Phe Gly
Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Ser 115
120 125Gly Pro Gly Val Phe Gly Pro Tyr Gly Ser
Ala Ala Ala Ala Ala Gly 130 135 140Pro
Gly Ser Gly Ile Tyr Gly Ile Gly Pro Tyr Gly Pro Gly Ala Ser145
150 155 160Gly Pro Gly Ile Tyr Gly
Pro Gly Val Phe Gly Pro Ser Ala Ser Ala 165
170 175Ala Ala Ala Ala Gly Ser Gly Val Phe Gly Pro Gly
Ile Tyr Gly Pro 180 185 190Tyr
Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Ser Gly Pro Gly 195
200 205Val Phe Gly Pro Tyr Gly Pro Gly Ile
Ser Gly Ser Gly Val Phe Gly 210 215
220Pro Gly Val Phe Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro225
230 235 240Gly Val Phe Gly
Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala 245
250 255Gly Ile Tyr Gly Tyr Gly Pro Gly Val Phe
Gly Pro Tyr Gly Pro Gly 260 265
270Ala Ser Gly Ile Asn Gly Pro Gly Ser Gly Ile Tyr Gly Pro Gly Val
275 280 285Phe Gly Pro Gly Ile Ser Ala
Ala Ala Ala Ala Gly Pro Gly Val Phe 290 295
300Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile
Tyr305 310 315 320Gly Pro
Gly Val Phe Gly Pro Gly Ile Tyr Gly Pro Gly Ser Ser Gly
325 330 335Pro Gly Val Phe Gly Pro Tyr
Gly Pro Gly Ser Ser Ala Ala Ala Ala 340 345
350Ala Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro
Gly Ile 355 360 365Ser Ala Ala Ala
Ala Ala Gly Ile Tyr Val Phe Gly Pro Gly Val Phe 370
375 380Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Val
Phe Gly Pro Tyr385 390 395
400Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ile Tyr Gly
405 410 415Pro Gly Val Phe Gly
Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Ile 420
425 430Tyr Gly Ser Gly Pro Gly Ile Tyr Gly Pro Tyr Gly
Pro Gly Ile Ser 435 440 445Gly Pro
Gly Ser Gly Val Phe Gly Ile Gly Pro Tyr Gly Pro Gly Ala 450
455 460Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly Pro
Gly Val Phe Gly Pro465 470 475
480Tyr Gly Pro Gly Ile Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly
485 490 495Ile Tyr Gly Pro
Gly Ala Ser Gly Ile Asn Gly Pro Gly Ser Gly Ile 500
505 510Tyr Gly Pro Gly Val Phe Gly Pro Gly Ile Ser
Ala Ala Ala Ala Ala 515 520 525Gly
Ile Tyr Val Phe Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly 530
535 540Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr
Gly Ser Gly Pro Gly Val545 550 555
560Phe Gly Pro Tyr Gly Pro Gly Ile Ser Gly Ser Gly Val Phe Gly
Pro 565 570 575Gly Val Phe
Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly 580
585 590Ser Gly Val Phe Gly Pro Gly Ala Ser Gly
Pro Gly Val Phe Gly Pro 595 600
605Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile Asn Gly Pro 610
615 620Gly Ser Gly Val Phe Gly Pro Gly
Ile Ser Gly Ile Tyr Gly Pro Gly625 630
635 640Val Phe Gly Pro Gly Val Phe Gly Pro Gly Ser Ser
Ala Ala Ala Ala 645 650
655Ala Gly Pro Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Ser Ala Ser
660 665 670Ala Ala Ala Ala Ala Gly
Pro Gly Ser Gly Val Phe Gly Pro Gly Ala 675 680
685Ser Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Gly Val Phe
Gly Pro 690 695 700Gly Ser Ser Ala Ala
Ala Ala Ala Gly Ile Tyr Gly Ser Gly Pro Gly705 710
715 720Val Phe Gly Pro Tyr Gly Ser Ala Ala Ala
Ala Ala Gly Pro Gly Ser 725 730
735Gly Ile Tyr Gly Ile Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly
740 745 750Ile Tyr Gly Pro Gly
Val Phe Gly Pro Ser Ala Ser Ala Ala Ala Ala 755
760 765Ala Gly Ser Gly Val Phe Gly Pro Gly Ile Tyr Gly
Pro Tyr Ala Ser 770 775 780Ala Ala Ala
Ala Ala Gly Ile Tyr Gly Ser Gly Pro Gly Val Phe Gly785
790 795 800Pro Tyr Gly Pro Gly Ile Ser
Gly Ser Gly Val Phe Gly Pro Gly Val 805
810 815Phe Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly
Pro Gly Val Phe 820 825 830Gly
Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Ile Tyr 835
840 845Gly Tyr Gly Pro Gly Val Phe Gly Pro
Tyr Gly Pro Gly Ala Ser Gly 850 855
860Ile Asn Gly Pro Gly Ser Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro865
870 875 880Gly Ile Ser Ala
Ala Ala Ala Ala Gly Pro Gly Val Phe Gly Pro Tyr 885
890 895Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala
Gly Ile Tyr Gly Pro Gly 900 905
910Val Phe Gly Pro Gly Ile Tyr Gly Pro Gly Ser Ser Gly Pro Gly Val
915 920 925Phe Gly Pro Tyr Gly Pro Gly
Ser Ser Ala Ala Ala Ala Ala Gly Ile 930 935
940Tyr Gly Pro Gly Val Phe Gly Pro Tyr Gly Pro Gly Ile Ser Ala
Ala945 950 955 960Ala Ala
Ala Gly Ile Tyr Val Phe Gly Pro Gly Val Phe Gly Pro Tyr
965 970 975Gly Pro Gly Ala Ser Gly Pro
Gly Val Phe Gly Pro Tyr Gly Pro Gly 980 985
990Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ile Tyr Gly Pro
Gly Val 995 1000 1005Phe Gly Pro
Ser Ala Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly 1010
1015 1020Ser Gly Pro Gly Ile Tyr Gly Pro Tyr Gly Pro
Gly Ile Ser Gly 1025 1030 1035Pro Gly
Ser Gly Val Phe Gly Ile Gly Pro Tyr Gly Pro Gly Ala 1040
1045 1050Ser Ala Ala Ala Ala Ala Gly Ile Tyr Gly
Pro Gly Val Phe Gly 1055 1060 1065Pro
Tyr Gly Pro Gly Ile Ser Ala Ala Ala Ala Ala Gly Pro Gly 1070
1075 1080Ser Gly Ile Tyr Gly Pro Gly Ala Ser
Gly Ile Asn Gly Pro Gly 1085 1090
1095Ser Gly Ile Tyr Gly Pro Gly Val Phe Gly Pro Gly Ile Ser Ala
1100 1105 1110Ala Ala Ala Ala Gly Ile
Tyr Val Phe Gly Pro Gly Val Phe Gly 1115 1120
1125Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Ile
Tyr 1130 1135 1140Gly Ser Gly Pro Gly
Val Phe Gly Pro Tyr Gly Pro Gly Ile Ser 1145 1150
1155Gly Ser Gly Val Phe Gly Pro Gly Val Phe Gly Pro Tyr
Ala Ser 1160 1165 1170Ala Ala Ala Ala
Ala Gly Pro Gly Ser Gly Val Phe Gly Pro Gly 1175
1180 1185Ala Ser 119041590PRTArtificial
SequenceMet-PRT917 41Met Gly Pro Gly Leu Ile Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala1 5 10 15Ala
Ala Ala Gly Val Asn Gly Pro Gly Ser Gly Leu Ile Gly Pro Gly 20
25 30Val Ser Gly Val Tyr Gly Pro Gly
Leu Ile Gly Pro Gly Leu Ile Gly 35 40
45Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Pro Gly Val Tyr Gly Pro
50 55 60Gly Leu Ile Gly Pro Ser Ala Ser
Ala Ala Ala Ala Ala Gly Pro Gly65 70 75
80Ser Gly Leu Ile Gly Pro Gly Ala Ser Gly Val Tyr Gly
Pro Gly Leu 85 90 95Ile
Gly Pro Gly Leu Ile Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala
100 105 110Gly Val Tyr Gly Ser Gly Pro
Gly Leu Ile Gly Pro Tyr Gly Ser Ala 115 120
125Ala Ala Ala Ala Gly Pro Gly Ser Gly Val Tyr Gly Val Gly Pro
Tyr 130 135 140Gly Pro Gly Ala Ser Gly
Pro Gly Val Tyr Gly Pro Gly Leu Ile Gly145 150
155 160Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Ser
Gly Leu Ile Gly Pro 165 170
175Gly Val Tyr Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Val Tyr
180 185 190Gly Ser Gly Pro Gly Leu
Ile Gly Pro Tyr Gly Pro Gly Val Ser Gly 195 200
205Ser Gly Leu Ile Gly Pro Gly Leu Ile Gly Pro Tyr Ala Ser
Ala Ala 210 215 220Ala Ala Ala Gly Pro
Gly Leu Ile Gly Pro Tyr Gly Pro Gly Ser Ser225 230
235 240Ala Ala Ala Ala Ala Gly Val Tyr Gly Tyr
Gly Pro Gly Leu Ile Gly 245 250
255Pro Tyr Gly Pro Gly Ala Ser Gly Val Asn Gly Pro Gly Ser Gly Val
260 265 270Tyr Gly Pro Gly Leu
Ile Gly Pro Gly Val Ser Ala Ala Ala Ala Ala 275
280 285Gly Pro Gly Leu Ile Gly Pro Tyr Gly Pro Gly Ala
Ser Ala Ala Ala 290 295 300Ala Ala Gly
Val Tyr Gly Pro Gly Leu Ile Gly Pro Gly Val Tyr Gly305
310 315 320Pro Gly Ser Ser Gly Pro Gly
Leu Ile Gly Pro Tyr Gly Pro Gly Ser 325
330 335Ser Ala Ala Ala Ala Ala Gly Val Tyr Gly Pro Gly
Leu Ile Gly Pro 340 345 350Tyr
Gly Pro Gly Val Ser Ala Ala Ala Ala Ala Gly Val Tyr Leu Ile 355
360 365Gly Pro Gly Leu Ile Gly Pro Tyr Gly
Pro Gly Ala Ser Gly Pro Gly 370 375
380Leu Ile Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly385
390 395 400Pro Gly Val Tyr
Gly Pro Gly Leu Ile Gly Pro Ser Ala Ser Ala Ala 405
410 415Ala Ala Ala Gly Val Tyr Gly Ser Gly Pro
Gly Val Tyr Gly Pro Tyr 420 425
430Gly Pro Gly Val Ser Gly Pro Gly Ser Gly Leu Ile Gly Val Gly Pro
435 440 445Tyr Gly Pro Gly Ala Ser Ala
Ala Ala Ala Ala Gly Val Tyr Gly Pro 450 455
460Gly Leu Ile Gly Pro Tyr Gly Pro Gly Val Ser Ala Ala Ala Ala
Ala465 470 475 480Gly Pro
Gly Ser Gly Val Tyr Gly Pro Gly Ala Ser Gly Val Asn Gly
485 490 495Pro Gly Ser Gly Val Tyr Gly
Pro Gly Leu Ile Gly Pro Gly Val Ser 500 505
510Ala Ala Ala Ala Ala Gly Val Tyr Leu Ile Gly Pro Gly Leu
Ile Gly 515 520 525Pro Tyr Gly Pro
Gly Ala Ser Ala Ala Ala Ala Ala Gly Val Tyr Gly 530
535 540Ser Gly Pro Gly Leu Ile Gly Pro Tyr Gly Pro Gly
Val Ser Gly Ser545 550 555
560Gly Leu Ile Gly Pro Gly Leu Ile Gly Pro Tyr Ala Ser Ala Ala Ala
565 570 575Ala Ala Gly Pro Gly
Ser Gly Leu Ile Gly Pro Gly Ala Ser 580 585
59042587PRTArtificial SequenceMet-PRT1028 42Met Gly Pro Gly
Ile Phe Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala1 5
10 15Ala Ala Ala Gly Thr Gly Pro Gly Ser Gly
Ile Phe Gly Pro Gly Thr 20 25
30Ser Gly Thr Tyr Gly Pro Gly Ile Phe Gly Pro Gly Ile Phe Gly Pro
35 40 45Gly Ser Ser Ala Ala Ala Ala Ala
Gly Pro Gly Thr Tyr Gly Pro Gly 50 55
60Ile Phe Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser65
70 75 80Gly Ile Phe Gly Pro
Gly Ala Ser Gly Thr Tyr Gly Pro Gly Ile Phe 85
90 95Gly Pro Gly Ile Phe Gly Pro Gly Ser Ser Ala
Ala Ala Ala Ala Gly 100 105
110Thr Tyr Gly Ser Gly Pro Gly Ile Phe Gly Pro Tyr Gly Ser Ala Ala
115 120 125Ala Ala Ala Gly Pro Gly Ser
Gly Thr Tyr Gly Thr Gly Pro Tyr Gly 130 135
140Pro Gly Ala Ser Gly Pro Gly Thr Tyr Gly Pro Gly Ile Phe Gly
Pro145 150 155 160Ser Ala
Ser Ala Ala Ala Ala Ala Gly Ser Gly Ile Phe Gly Pro Gly
165 170 175Thr Tyr Gly Pro Tyr Ala Ser
Ala Ala Ala Ala Ala Gly Thr Tyr Gly 180 185
190Ser Gly Pro Gly Ile Phe Gly Pro Tyr Gly Pro Gly Thr Ser
Gly Ser 195 200 205Gly Ile Phe Gly
Pro Gly Ile Phe Gly Pro Tyr Ala Ser Ala Ala Ala 210
215 220Ala Ala Gly Pro Gly Ile Phe Gly Pro Tyr Gly Pro
Gly Ser Ser Ala225 230 235
240Ala Ala Ala Ala Gly Thr Tyr Gly Tyr Gly Pro Gly Ile Phe Gly Pro
245 250 255Tyr Gly Pro Gly Ala
Ser Gly Thr Gly Pro Gly Ser Gly Thr Tyr Gly 260
265 270Pro Gly Ile Phe Gly Pro Gly Thr Ser Ala Ala Ala
Ala Ala Gly Pro 275 280 285Gly Ile
Phe Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala 290
295 300Gly Thr Tyr Gly Pro Gly Ile Phe Gly Pro Gly
Thr Tyr Gly Pro Gly305 310 315
320Ser Ser Gly Pro Gly Ile Phe Gly Pro Tyr Gly Pro Gly Ser Ser Ala
325 330 335Ala Ala Ala Ala
Gly Thr Tyr Gly Pro Gly Ile Phe Gly Pro Tyr Gly 340
345 350Pro Gly Thr Ser Ala Ala Ala Ala Ala Gly Thr
Tyr Ile Phe Gly Pro 355 360 365Gly
Ile Phe Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro Gly Ile Phe 370
375 380Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala
Ala Ala Ala Gly Pro Gly385 390 395
400Thr Tyr Gly Pro Gly Ile Phe Gly Pro Ser Ala Ser Ala Ala Ala
Ala 405 410 415Ala Gly Thr
Tyr Gly Ser Gly Pro Gly Thr Tyr Gly Pro Tyr Gly Pro 420
425 430Gly Thr Ser Gly Pro Gly Ser Gly Ile Phe
Gly Thr Gly Pro Tyr Gly 435 440
445Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Thr Tyr Gly Pro Gly Ile 450
455 460Phe Gly Pro Tyr Gly Pro Gly Thr
Ser Ala Ala Ala Ala Ala Gly Pro465 470
475 480Gly Ser Gly Thr Tyr Gly Pro Gly Ala Ser Gly Thr
Gly Pro Gly Ser 485 490
495Gly Thr Tyr Gly Pro Gly Ile Phe Gly Pro Gly Thr Ser Ala Ala Ala
500 505 510Ala Ala Gly Thr Tyr Ile
Phe Gly Pro Gly Ile Phe Gly Pro Tyr Gly 515 520
525Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Thr Tyr Gly Ser
Gly Pro 530 535 540Gly Ile Phe Gly Pro
Tyr Gly Pro Gly Thr Ser Gly Ser Gly Ile Phe545 550
555 560Gly Pro Gly Ile Phe Gly Pro Tyr Ala Ser
Ala Ala Ala Ala Ala Gly 565 570
575Pro Gly Ser Gly Ile Phe Gly Pro Gly Ala Ser 580
58543601PRTArtificial SequencePRT917 43Met His His His His His
His Ser Ser Gly Ser Ser Gly Pro Gly Leu1 5
10 15Ile Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala
Ala Ala Gly Val 20 25 30Asn
Gly Pro Gly Ser Gly Leu Ile Gly Pro Gly Val Ser Gly Val Tyr 35
40 45Gly Pro Gly Leu Ile Gly Pro Gly Leu
Ile Gly Pro Gly Ser Ser Ala 50 55
60Ala Ala Ala Ala Gly Pro Gly Val Tyr Gly Pro Gly Leu Ile Gly Pro65
70 75 80Ser Ala Ser Ala Ala
Ala Ala Ala Gly Pro Gly Ser Gly Leu Ile Gly 85
90 95Pro Gly Ala Ser Gly Val Tyr Gly Pro Gly Leu
Ile Gly Pro Gly Leu 100 105
110Ile Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala Gly Val Tyr Gly Ser
115 120 125Gly Pro Gly Leu Ile Gly Pro
Tyr Gly Ser Ala Ala Ala Ala Ala Gly 130 135
140Pro Gly Ser Gly Val Tyr Gly Val Gly Pro Tyr Gly Pro Gly Ala
Ser145 150 155 160Gly Pro
Gly Val Tyr Gly Pro Gly Leu Ile Gly Pro Ser Ala Ser Ala
165 170 175Ala Ala Ala Ala Gly Ser Gly
Leu Ile Gly Pro Gly Val Tyr Gly Pro 180 185
190Tyr Ala Ser Ala Ala Ala Ala Ala Gly Val Tyr Gly Ser Gly
Pro Gly 195 200 205Leu Ile Gly Pro
Tyr Gly Pro Gly Val Ser Gly Ser Gly Leu Ile Gly 210
215 220Pro Gly Leu Ile Gly Pro Tyr Ala Ser Ala Ala Ala
Ala Ala Gly Pro225 230 235
240Gly Leu Ile Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala Ala
245 250 255Gly Val Tyr Gly Tyr
Gly Pro Gly Leu Ile Gly Pro Tyr Gly Pro Gly 260
265 270Ala Ser Gly Val Asn Gly Pro Gly Ser Gly Val Tyr
Gly Pro Gly Leu 275 280 285Ile Gly
Pro Gly Val Ser Ala Ala Ala Ala Ala Gly Pro Gly Leu Ile 290
295 300Gly Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala
Ala Ala Gly Val Tyr305 310 315
320Gly Pro Gly Leu Ile Gly Pro Gly Val Tyr Gly Pro Gly Ser Ser Gly
325 330 335Pro Gly Leu Ile
Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala Ala Ala 340
345 350Ala Gly Val Tyr Gly Pro Gly Leu Ile Gly Pro
Tyr Gly Pro Gly Val 355 360 365Ser
Ala Ala Ala Ala Ala Gly Val Tyr Leu Ile Gly Pro Gly Leu Ile 370
375 380Gly Pro Tyr Gly Pro Gly Ala Ser Gly Pro
Gly Leu Ile Gly Pro Tyr385 390 395
400Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Val Tyr
Gly 405 410 415Pro Gly Leu
Ile Gly Pro Ser Ala Ser Ala Ala Ala Ala Ala Gly Val 420
425 430Tyr Gly Ser Gly Pro Gly Val Tyr Gly Pro
Tyr Gly Pro Gly Val Ser 435 440
445Gly Pro Gly Ser Gly Leu Ile Gly Val Gly Pro Tyr Gly Pro Gly Ala 450
455 460Ser Ala Ala Ala Ala Ala Gly Val
Tyr Gly Pro Gly Leu Ile Gly Pro465 470
475 480Tyr Gly Pro Gly Val Ser Ala Ala Ala Ala Ala Gly
Pro Gly Ser Gly 485 490
495Val Tyr Gly Pro Gly Ala Ser Gly Val Asn Gly Pro Gly Ser Gly Val
500 505 510Tyr Gly Pro Gly Leu Ile
Gly Pro Gly Val Ser Ala Ala Ala Ala Ala 515 520
525Gly Val Tyr Leu Ile Gly Pro Gly Leu Ile Gly Pro Tyr Gly
Pro Gly 530 535 540Ala Ser Ala Ala Ala
Ala Ala Gly Val Tyr Gly Ser Gly Pro Gly Leu545 550
555 560Ile Gly Pro Tyr Gly Pro Gly Val Ser Gly
Ser Gly Leu Ile Gly Pro 565 570
575Gly Leu Ile Gly Pro Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly
580 585 590Ser Gly Leu Ile Gly
Pro Gly Ala Ser 595 60044598PRTArtificial
SequencePRT1028 44Met His His His His His His Ser Ser Gly Ser Ser Gly Pro
Gly Ile1 5 10 15Phe Gly
Pro Tyr Gly Pro Gly Ala Ser Ala Ala Ala Ala Ala Gly Thr 20
25 30Gly Pro Gly Ser Gly Ile Phe Gly Pro
Gly Thr Ser Gly Thr Tyr Gly 35 40
45Pro Gly Ile Phe Gly Pro Gly Ile Phe Gly Pro Gly Ser Ser Ala Ala 50
55 60Ala Ala Ala Gly Pro Gly Thr Tyr Gly
Pro Gly Ile Phe Gly Pro Ser65 70 75
80Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Ile Phe
Gly Pro 85 90 95Gly Ala
Ser Gly Thr Tyr Gly Pro Gly Ile Phe Gly Pro Gly Ile Phe 100
105 110Gly Pro Gly Ser Ser Ala Ala Ala Ala
Ala Gly Thr Tyr Gly Ser Gly 115 120
125Pro Gly Ile Phe Gly Pro Tyr Gly Ser Ala Ala Ala Ala Ala Gly Pro
130 135 140Gly Ser Gly Thr Tyr Gly Thr
Gly Pro Tyr Gly Pro Gly Ala Ser Gly145 150
155 160Pro Gly Thr Tyr Gly Pro Gly Ile Phe Gly Pro Ser
Ala Ser Ala Ala 165 170
175Ala Ala Ala Gly Ser Gly Ile Phe Gly Pro Gly Thr Tyr Gly Pro Tyr
180 185 190Ala Ser Ala Ala Ala Ala
Ala Gly Thr Tyr Gly Ser Gly Pro Gly Ile 195 200
205Phe Gly Pro Tyr Gly Pro Gly Thr Ser Gly Ser Gly Ile Phe
Gly Pro 210 215 220Gly Ile Phe Gly Pro
Tyr Ala Ser Ala Ala Ala Ala Ala Gly Pro Gly225 230
235 240Ile Phe Gly Pro Tyr Gly Pro Gly Ser Ser
Ala Ala Ala Ala Ala Gly 245 250
255Thr Tyr Gly Tyr Gly Pro Gly Ile Phe Gly Pro Tyr Gly Pro Gly Ala
260 265 270Ser Gly Thr Gly Pro
Gly Ser Gly Thr Tyr Gly Pro Gly Ile Phe Gly 275
280 285Pro Gly Thr Ser Ala Ala Ala Ala Ala Gly Pro Gly
Ile Phe Gly Pro 290 295 300Tyr Gly Pro
Gly Ala Ser Ala Ala Ala Ala Ala Gly Thr Tyr Gly Pro305
310 315 320Gly Ile Phe Gly Pro Gly Thr
Tyr Gly Pro Gly Ser Ser Gly Pro Gly 325
330 335Ile Phe Gly Pro Tyr Gly Pro Gly Ser Ser Ala Ala
Ala Ala Ala Gly 340 345 350Thr
Tyr Gly Pro Gly Ile Phe Gly Pro Tyr Gly Pro Gly Thr Ser Ala 355
360 365Ala Ala Ala Ala Gly Thr Tyr Ile Phe
Gly Pro Gly Ile Phe Gly Pro 370 375
380Tyr Gly Pro Gly Ala Ser Gly Pro Gly Ile Phe Gly Pro Tyr Gly Pro385
390 395 400Gly Ala Ser Ala
Ala Ala Ala Ala Gly Pro Gly Thr Tyr Gly Pro Gly 405
410 415Ile Phe Gly Pro Ser Ala Ser Ala Ala Ala
Ala Ala Gly Thr Tyr Gly 420 425
430Ser Gly Pro Gly Thr Tyr Gly Pro Tyr Gly Pro Gly Thr Ser Gly Pro
435 440 445Gly Ser Gly Ile Phe Gly Thr
Gly Pro Tyr Gly Pro Gly Ala Ser Ala 450 455
460Ala Ala Ala Ala Gly Thr Tyr Gly Pro Gly Ile Phe Gly Pro Tyr
Gly465 470 475 480Pro Gly
Thr Ser Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Thr Tyr
485 490 495Gly Pro Gly Ala Ser Gly Thr
Gly Pro Gly Ser Gly Thr Tyr Gly Pro 500 505
510Gly Ile Phe Gly Pro Gly Thr Ser Ala Ala Ala Ala Ala Gly
Thr Tyr 515 520 525Ile Phe Gly Pro
Gly Ile Phe Gly Pro Tyr Gly Pro Gly Ala Ser Ala 530
535 540Ala Ala Ala Ala Gly Thr Tyr Gly Ser Gly Pro Gly
Ile Phe Gly Pro545 550 555
560Tyr Gly Pro Gly Thr Ser Gly Ser Gly Ile Phe Gly Pro Gly Ile Phe
565 570 575Gly Pro Tyr Ala Ser
Ala Ala Ala Ala Ala Gly Pro Gly Ser Gly Ile 580
585 590Phe Gly Pro Gly Ala Ser 595
User Contributions:
Comment about this patent or add new information about this topic: