Patent application title: Compositions and Methods for Treating Celiac Sprue Disease
Inventors:
IPC8 Class: AA61K3848FI
USPC Class:
1 1
Class name:
Publication date: 2021-03-25
Patent application number: 20210085761
Abstract:
The invention provides compositions and methods for treating celiac
sprue.Claims:
1. A polypeptide comprising an amino acid sequence at least 75% identical
to an amino acid sequence according to SEQ ID NO:35, wherein (a) the
polypeptide degrades a PQPQLP (SEQ ID NO:34) peptide at pH 4; (b) residue
278 is Ser, residue 78 is Glu, and residue 82 is Asp; and (c) the
polypeptide comprises an amino acid change from SEQ ID NO: 67 at one or
more residues selected from the group consisting of 73, 102, 103, 104,
130, 165, 168, 169, 172, and 179.
2. A polypeptide comprising an amino acid sequence at least 75% identical to an amino acid sequence according to SEQ ID NO:1, wherein (a) the polypeptide degrades a PQPQLP (SEQ ID NO:34) peptide at pH 4; (b) residue 467 is Ser, residue 267 is Glu, and residue 271 is Asp; and (c) the polypeptide comprises an amino acid change from SEQ ID NO: 33 at one or more residues selected from the group consisting of 119, 262, 291, 292, 293, 319, 354, 357, 358, 361, and 368.
3. The polypeptide of claim 1 or 2, comprising an amino acid sequence at least 85% identical to an amino acid sequence according to SEQ ID NO:1 or SEQ ID NO: 35.
4. The polypeptide of claim 1 or 2, comprising an amino acid sequence at least 95% identical to an amino acid sequence according to SEQ ID NO:1 or SEQ ID NO: 35.
5. The polypeptide of claim 1, comprising an amino acid sequence according to SEQ ID NO:35.
6. The polypeptide of claim 2, comprising an amino acid sequence according to SEQ ID NO:1.
7. The polypeptide of any one of claims 1-6, wherein the polypeptide, comprises an amino acid sequence according to any one of SEQ ID NO:2-66.
8. The polypeptide of any one of claims 1-8, wherein the polypeptide comprises an amino acid sequence according to SEQ ID NO:32 or SEQ ID NO: 66.
9. A polypeptide comprising an amino acid sequence according to SEQ ID NO:1, wherein the polypeptide comprises at least one amino acid change from SEQ ID NO: 33.
10. A polypeptide comprising an amino acid sequence according to SEQ ID NO:35, wherein the polypeptide comprises at least one amino acid change from SEQ ID NO: 67.
11. The polypeptide of claim 9 or 10, comprising an amino acid sequence according to any one of SEQ ID NO:2-66.
12. The polypeptide of any one of claims 9-11, wherein the polypeptide comprises an amino acid sequence according to SEQ ID NO:32 or SEQ ID NO:66.
13. A nucleic acid encoding the polypeptide of any one of claims 1-12.
14. A nucleic acid expression vector comprising the isolated nucleic acid of claim 13.
15. A recombinant host cell comprising the nucleic acid expression vector of claim 14.
16. A pharmaceutical composition, comprising the polypeptide of any one of claims 1-12, the nucleic acid of claim 13, the nucleic acid expression vector of claim 14, and/or the recombinant host cell of claim 15, and a pharmaceutically acceptable carrier.
17. A method for treating celiac sprue, comprising administering to an individual with celiac sprue an amount effective to treat the celiac sprue of a polypeptide according to any one of claims 1-12, or a polypeptide comprising an amino acid selected from the group consisting of SEQ ID NO:33 or SEQ ID NO:67.
18. A method for treating celiac sprue, comprising administering to an individual with celiac sprue an amount effective of the pharmaceutical composition of claim 16 to treat the celiac sprue.
19. The method of any one of claims 17-18, wherein the polypeptide or the pharmaceutical composition is administered orally.
20. The method of any one of claims 17-19, wherein the polypeptide comprises an amino acid sequence according to SEQ ID NO:32 or SEQ ID NO:66.
Description:
CROSS REFERENCE
[0001] This application claims priority to U.S. Provisional Application Ser. No. 61/521,899 filed on Aug. 10, 2011, incorporated herein by reference in its entirety.
BACKGROUND
[0003] Celiac sprue is a highly prevalent disease in which dietary proteins found in wheat, barley, and rye products known as `glutens` evoke an immune response in the small intestine of genetically predisposed individuals. The resulting inflammation can lead to the degradation of the villi of the small intestine, impeding the absorption of nutrients. Symptoms can appear in early childhood or later in life, and range widely in severity, from diarrhea, fatigue and weight loss to abdominal distension, anemia, and neurological symptoms. There are currently no effective therapies for this lifelong disease except the total elimination of glutens from the diet. Although celiac sprue remains largely underdiagnosed, its' prevalence in the US and Europe is estimated at 0.5-1.0% of the population. The identification of suitable naturally-occurring enzymes as oral therapeutics for Celiac disease is difficult due to the stringent physical and chemical requirements to specifically and efficiently degrade gluten-derived peptides in the harsh and highly acidic environment of the human digestive tract.
SUMMARY OF THE INVENTION
[0004] In a first aspect, the present invention provides polypeptides comprising an amino acid sequence at least 75% identical to an amino acid sequence according to SEQ ID NO:35, wherein
[0005] (a) the polypeptide degrades a PQPQLP (SEQ ID NO:34) peptide at pH 4;
[0006] (b) residue 278 is Ser, residue 78 is Glu, and residue 82 is Asp; and
[0007] (c) the polypeptide comprises an amino acid change from SEQ ID NO: 67 at one or more residues selected from the group consisting of 73, 102, 103, 104, 130, 165, 168, 169, 172, and 179.
[0008] In a second aspect, the present invention provides polypeptide comprising an amino acid sequence at least 75% identical to an amino acid sequence according to SEQ ID NO:1, wherein
[0009] (a) the polypeptide degrades a PQPQLP (SEQ ID NO:34) peptide at pH 4;
[0010] (b) residue 467 is Ser, residue 267 is Glu, and residue 271 is Asp; and
[0011] (c) the polypeptide comprises an amino acid change from SEQ ID NO: 33 at one or more residues selected from the group consisting of 119, 262, 291, 292, 293, 319, 354, 357, 358, 361, and 368.
[0012] In various embodiments of the first and second aspect, the polypeptide comprises an amino acid sequence at least 85%, 95%, or 100% identical to an amino acid sequence according to SEQ ID NO:1 or SEQ ID NO: 35. In another embodiment, the polypeptide, comprises an amino acid sequence according to any one of SEQ ID NO:2-66.
[0013] In another aspect, the present invention provides polypeptides comprising an amino acid sequence according to SEQ ID NO:1, wherein the polypeptide comprises at least one amino acid change from SEQ ID NO: 33. In another aspect, the present invention provides a polypeptide comprising an amino acid sequence according to SEQ ID NO:35, wherein the polypeptide comprises at least one amino acid change from SEQ ID NO: 67. In various embodiments, the polypeptide, comprises an amino acid sequence according to any one of SEQ ID NO:2-66.
[0014] In a further aspect, the present invention provides nucleic acids encoding the polypeptide of any aspect or embodiment of the invention. In another aspect, the invention provides nucleic acid expression vectors comprising the isolated nucleic acids of the invention. In a further embodiment, the invention provides recombinant host cells comprising the nucleic acid expression vectors of the invention. In another aspect, the invention provides pharmaceutical compositions, comprising the polypeptides, the nucleic acids, the nucleic acid expression vectors and/or the recombinant host cells of the invention, and a pharmaceutically acceptable carrier.
[0015] In another aspect, the invention provides methods for treating celiac sprue, comprising administering to an individual with celiac sprue a polypeptide or pharmaceutical composition according to any embodiment of the invention, or a polypeptide comprising an amino acid selected from the group consisting of SEQ ID NO:33 or SEQ ID NO:67.
BRIEF DESCRIPTION OF THE FIGURES
[0016] FIG. 1. Schematic depicting the role of enzyme therapeutics in the treatment of Celiac disease. Gluten is comprised of many glycoproteins including .alpha.-gliadin. Partial proteolysis of .alpha.-gliadin (SEQ ID NO: 71) results in protease-resistant peptides enriched in a PQ dipeptide motif that can lead to inflammation and disease. The addition of an oral enzyme therapeutic that is functional in the stomach and capable of specifically degrading the immunogenic peptides could potentially act as a therapeutic for this disease.
[0017] FIG. 2. Computational models of the peptide binding sites for KumaWT and KumaMax. A) KumaWT in complex with a PR dipeptide motif. B) KumaMax in complex with the designed PQ dipeptide motif. Computationally designed residues in the active site are labeled and highlighted in sticks. The modeled peptides were based on a bound form of Kumamolisin-AS (PDB ID: 1T1E) and final structures were generated using the Rosetta Molecular Modeling Suite. Images were generated using PyMol v1.5.
[0018] FIG. 3. Protein stability after incubation with pepsin or trypsin. Stability was measured by quantifying the relative remaining fraction of intact protein as observed on an SDS-PAGE gel after 30 minutes of incubation in the presence or absence of pepsin or trypsin at the pH indicated. Each protein was measured in triplicate and the error bars represent the standard deviation. Quantification was performed in ImageJ.TM..
[0019] FIG. 4. An immunogenic .alpha.9-gliadin peptide is degraded by KumaMax. A) Reaction chromatograms measuring the abundance of the M+H ion of the parent .alpha.9-gliadin peptide after 50 minutes of incubation with no enzyme, SC PEP, or KumaMax.TM.. B) The fraction of .alpha.9-gliadin peptide remaining in the presence of KumaMax.TM. as a function of incubation time at pH 4. The data was fit using a standard exponential decay function. The R.sup.2 value was greater than 0.9.
DETAILED DESCRIPTION
[0020] All references cited are herein incorporated by reference in their entirety. Within this application, unless otherwise stated, the techniques utilized may be found in any of several well-known references such as: Molecular Cloning: A Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press), Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by D. Goeddel, 1991. Academic Press, San Diego, Calif.), "Guide to Protein Purification" in Methods in Enzymology (M. P. Deutshcer, ed., (1990) Academic Press, Inc.); PCR Protocols: A Guide to Methods and Applications (Innis, et al. 1990. Academic Press, San Diego, Calif.), Culture of Animal Cells: A Manual of Basic Technique, 2.sup.nd Ed. (R. I. Freshney. 1987. Liss, Inc. New York, N.Y.), Gene Transfer and Expression Protocols, pp. 109-128, ed. E. J. Murray, The Humana Press Inc., Clifton, N.J.), and the Ambion 1998 Catalog (Ambion, Austin, Tex.).
[0021] As used herein, the singular forms "a", "an" and "the" include plural referents unless the context clearly dictates otherwise. "And" as used herein is interchangeably used with "or" unless expressly stated otherwise.
[0022] As used herein, amino acid residues are abbreviated as follows: alanine (Ala; A), asparagine (Asn; N), aspartic acid (Asp; D), arginine (Arg; R), cysteine (Cys; C), glutamic acid (Glu; E), glutamine (Gln; Q), glycine (Gly; G), histidine (His; H), isoleucine (Ile; I), leucine (Leu; L), lysine (Lys; K), methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V).
[0023] All embodiments of any aspect of the invention can be used in combination, unless the context clearly dictates otherwise.
[0024] In a first aspect, the present invention provides polypeptides comprising an amino acid sequence at least 75% identical to an amino acid sequence according to SEQ ID NO:35, wherein
[0025] (a) the polypeptide degrades a PQPQLP (SEQ ID NO:34) peptide at pH 4;
[0026] (b) residue 278 is Ser, residue 78 is Glu, and residue 82 is Asp; and
[0027] (c) the polypeptide comprises an amino acid change from SEQ ID NO: 67 at one or more residues selected from the group consisting of 73, 102, 103, 104, 130, 165, 168, 169, 172, and 179.
[0028] As disclosed in the examples that follow, polypeptides according to this aspect of the invention can be used, for example, in treating celiac sprue. The polypeptides are modified versions of either the processed version of Kumamolisin-As (SEQ ID NO:67) or the preprocessed version of Kumamolisin-As (SEQ ID NO:33), which is known as a member of the sedolisin family of serine-carboxyl peptidases, and utilizes the key catalytic triad Ser.sup.278-Glu.sup.78-Asp.sup.82 in its processed form to hydrolyze its substrate (Ser.sup.467-Glu.sup.267-Asp.sup.271 in the pre-processed form) Its maximal activity is at pH .about.4.0. While the native substrate for Kumamolisin-As is unknown, it has been previously shown to degrade collagen under acidic conditions (4). In addition, this enzyme has been shown to be thermostable, with an ideal temperature at 60.degree. C., but still showing significant activity at 37.degree. C.
[0029] The inventors of the present invention have unexpectedly discovered that Kumamolisin-As is capable of degrading proline (P)- and glutamine (Q)-rich components of gluten known as `gliadins` believed responsible for the bulk of the immune response in most celiac sprue patients. The polypeptides of the invention show improved protease activity at pH 4 against the oligopeptide PQPQLP (a substrate representative of gliadin) compared to wild type Kumamolisin-As.
[0030] The polypeptides of this aspect of the invention degrade a PQPQLP (SEQ ID NO:34) peptide at pH 4. Such degradation occurs under the conditions disclosed in the examples that follow.
[0031] The polypeptides of this aspect comprise one or more amino acid changes from SEQ ID NO: 67 (wild type processed Kumamolisin-As) at one or more residues selected from the group consisting of residues 73, 102, 103, 104, 130, 165, 168, 169, 172, and 179 (numbering based on the wild type processed Kumamolisin-As amino acid sequence). In non-limiting embodiments, the one or more changes relative to the wild type processed Kumamolisin-As amino acid sequence (SEQ ID NO:67) are selected from the group consisting of:
TABLE-US-00001 WT Residue# AA change S73 K, G N102 D T103 S D104 A, T, N G130 S S165 N T168 A D169 N, G Q172 D D179 S, H
[0032] In various further non-limiting embodiments, the one or more changes relative to the wild type processed Kumamolisin-As amino acid sequence include at least N102D. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N102D and D169N or D169G. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N102D, D169G, and D179H. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least S73K, D104T, N102D, G130S, D169G, and D179H.
[0033] As used herein, "at least 75% identical" means that the polypeptide differs in its full length amino acid sequence by less 25% or less (including any amino acid substitutions, deletions, additions, or insertions) from the polypeptide defined by SEQ ID NO:35.
[0034] In various preferred embodiment, the polypeptide s comprise or consist of an amino acid sequence at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence according to SEQ ID NO:35. In a further embodiment the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:35.
[0035] In various further embodiments, the polypeptides comprise or consist of an amino acid sequence at least 75% identical to any one of SEQ ID NOS:36-66. The polypeptides represented by these SEQ ID NOS are specific examples of polypeptides with improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) (a substrate representative of gliadin) compared to wild type Kumamolisin-As. In various preferred embodiment, the polypeptide s comprise or consist of an amino acid sequence at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence according to any one of SEQ ID NOS:36-66. In a further embodiment the polypeptides comprise or consist of an amino acid sequence according to any one of SEQ ID NOS:36-66.
[0036] In a preferred embodiment of this first aspect, the polypeptides comprising an amino acid sequence at least 75% identical to an amino acid sequence according to SEQ ID NO:1 (based on variants of the preprocessed version of Kumamolisin-As), wherein
[0037] (a) the polypeptide degrades a PQPQLP (SEQ ID NO:34) peptide at pH 4;
[0038] (b) residue 467 is Ser, residue 267 is Glu, and residue 271 is Asp; and
[0039] (c) the polypeptide comprises an amino acid change from SEQ ID NO: 33 at one or more residues selected from the group consisting of 119, 262, 291, 292, 293, 319, 354, 357, 358, 361, and 368.
[0040] The polypeptides of this embodiment comprise one or more amino acid changes from SEQ ID NO: 33 (wild type pre-processed Kumamolisin-As) at one or more residues selected from the group consisting of residues 119, 262, 291, 292, 293, 319, 354, 357, 358, 361, and 368 (numbering based on the wild type pre-processed Kumamolisin-As amino acid sequence). In non-limiting embodiments, the one or more changes relative to the wild type Kumamolisin-As amino acid sequence are selected from the group consisting of:
TABLE-US-00002 WT Residue# AA change V119 D S262 K, G N291 D T292 S D293 A, T, N G319 S S354 N T357 A D358 N, G Q361 D D368 S, H
[0041] In various further non-limiting embodiments, the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N291D. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N291D and 358N or 358G. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N291D, 358G, and 368H. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least V119D, S262K, D293T, N291D, G319S, D358G, and D368H.
[0042] As used herein, "at least 75% identical" means that the polypeptide differs in its full length amino acid sequence by less 25% or less (including any amino acid substitutions, deletions, additions, or insertions) from the polypeptide defined by SEQ ID NO:1.
[0043] In various preferred embodiment, the polypeptide s comprise or consist of an amino acid sequence at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence according to SEQ ID NO:1. In a further embodiment the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:1.
[0044] In various further embodiments, the polypeptides comprise or consist of an amino acid sequence at least 75% identical to any one of SEQ ID NOS:2-32. The polypeptides represented by these SEQ ID NOS are specific examples of polypeptides with improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) (a substrate representative of gliadin) compared to wild type Kumamolisin-As. In various preferred embodiment, the polypeptide s comprise or consist of an amino acid sequence at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence according to any one of SEQ ID NOS:2-32. In a further embodiment the polypeptides comprise or consist of an amino acid sequence according to any one of SEQ ID NOS:2-32.
[0045] In a second aspect, the present invention provides polypeptides comprising or consisting of an amino acid sequence according to SEQ ID NO:35, wherein the polypeptide comprises at least one amino acid change from SEQ ID NO: 67. As disclosed in the examples that follow, polypeptides according to this aspect of the invention can be used, for example, in treating celiac sprue. The polypeptides are modified versions of processed Kumamolisin-As (SEQ ID NO:67), that show improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) (a substrate representative of gliadin) compared to wild type Kumamolisin-As. In one embodiment, the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:36. Polypeptides according to SEQ ID NO:36 have a N102D mutation relative to wild-type processed Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 10-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As.
[0046] In another embodiment of this second aspect, the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:37. Polypeptides according to SEQ ID NO:37 have a N102D mutation and a D169N or D169G mutation relative to wild-type processed Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 20-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As.
[0047] In another embodiment of this second aspect, the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:38. Polypeptides according to SEQ ID NO:38 have a N102D mutation, a D169G, and a D179H mutation relative to wild-type processed Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 50-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As.
[0048] In a further embodiment of this second aspect, the polypeptides comprise or consist of an amino acid sequence according to any one of SEQ ID NOS:39-66. Polypeptides according to these embodiments have all been demonstrated to show improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As. In a preferred embodiment, the polypeptide comprises or consists of an amino acid sequence according to SEQ ID NO:66.
[0049] In a preferred embodiment of this second aspect, the present invention provides polypeptides comprising or consisting of an amino acid sequence according to SEQ ID NO:1, wherein the polypeptide comprises at least one amino acid change from SEQ ID NO: 33. As disclosed in the examples that follow, polypeptides according to this aspect of the invention can be used, for example, in treating celiac sprue. The polypeptides are modified versions of preprocessed Kumamolisin-As (SEQ ID NO:33), that show improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) (a substrate representative of gliadin) compared to wild type Kumamolisin-As. In one embodiment, the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:2. Polypeptides according to SEQ ID NO:2 have a N291D mutation relative to preprocessed wild-type Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 10-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As.
[0050] In another embodiment of this second aspect, the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:3. Polypeptides according to SEQ ID NO:3 have a N291D mutation and a D358N or D358G mutation relative to preprocessed wild-type Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 20-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As.
[0051] In another embodiment of this second aspect, the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:4. Polypeptides according to SEQ ID NO:4 have a N291D mutation, a D358G, and a D368H mutation relative to preprocessed wild-type Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 50-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As.
[0052] In a further embodiment of this second aspect, the polypeptides comprise or consist of an amino acid sequence according to any one of SEQ ID NOS:5-32. Polypeptides according to these embodiments have all been demonstrated to show improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As. In a preferred embodiment, the polypeptide comprises or consists of an amino acid sequence according to SEQ ID NO:32; this polypeptide is shown in the examples that follow to possess the most potent protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) of any of the polypeptides tested.
[0053] As used throughout the present application, the term "polypeptide" is used in its broadest sense to refer to a sequence of subunit amino acids, whether naturally occurring or of synthetic origin. The polypeptides of the invention may comprise L-amino acids, D-amino acids (which are resistant to L-amino acid-specific proteases in vivo), or a combination of D- and L-amino acids. The polypeptides described herein may be chemically synthesized or recombinantly expressed. The polypeptides may be linked to other compounds to promote an increased half-life in vivo, such as by PEGylation, HESylation, PASylation, or glycosylation. Such linkage can be covalent or non-covalent as is understood by those of skill in the art. The polypeptides may be linked to any other suitable linkers, including but not limited to any linkers that can be used for purification or detection (such as FLAG or His tags).
[0054] In a third aspect, the present invention provides isolated nucleic acids encoding the polypeptide of any aspect or embodiment of the invention. The isolated nucleic acid sequence may comprise RNA or DNA. As used herein, "isolated nucleic acids" are those that have been removed from their normal surrounding nucleic acid sequences in the genome or in cDNA sequences. Such isolated nucleic acid sequences may comprise additional sequences useful for promoting expression and/or purification of the encoded protein, including but not limited to polyA sequences, modified Kozak sequences, and sequences encoding epitope tags, export signals, and secretory signals, nuclear localization signals, and plasma membrane localization signals. It will be apparent to those of skill in the art, based on the teachings herein, what nucleic acid sequences will encode the polypeptides of the invention.
[0055] In a fourth aspect, the present invention provides nucleic acid expression vectors comprising the isolated nucleic acid of any embodiment of the invention operatively linked to a suitable control sequence. "Recombinant expression vector" includes vectors that operatively link a nucleic acid coding region or gene to any control sequences capable of effecting expression of the gene product. "Control sequences" operably linked to the nucleic acid sequences of the invention are nucleic acid sequences capable of effecting the expression of the nucleic acid molecules. The control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered "operably linked" to the coding sequence. Other such control sequences include, but are not limited to, polyadenylation signals, termination signals, and ribosome binding sites. Such expression vectors can be of any type known in the art, including but not limited plasmid and viral-based expression vectors. The control sequence used to drive expression of the disclosed nucleic acid sequences in a mammalian system may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive). The construction of expression vectors for use in transfecting prokaryotic cells is also well known in the art, and thus can be accomplished via standard techniques. (See, for example, Sambrook, Fritsch, and Maniatis, in: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989; Gene Transfer and Expression Protocols, pp. 109-128, ed. E. J. Murray, The Humana Press Inc., Clifton, N.J.), and the Ambion 1998 Catalog (Ambion, Austin, Tex.). The expression vector must be replicable in the host organisms either as an episome or by integration into host chromosomal DNA. In a preferred embodiment, the expression vector comprises a plasmid. However, the invention is intended to include other expression vectors that serve equivalent functions, such as viral vectors.
[0056] In a fifth aspect, the present invention provides recombinant host cells comprising the nucleic acid expression vectors of the invention. The host cells can be either prokaryotic or eukaryotic. The cells can be transiently or stably transfected or transduced. Such transfection and transduction of expression vectors into prokaryotic and eukaryotic cells can be accomplished via any technique known in the art, including but not limited to standard bacterial transformations, calcium phosphate co-precipitation, electroporation, or liposome mediated-, DEAE dextran mediated-, polycationic mediated-, or viral mediated transfection. (See, for example, Molecular Cloning: A Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press; Culture of Animal Cells: A Manual of Basic Technique, 2.sup.nd Ed. (R. I. Freshney. 1987. Liss, Inc. New York, N.Y.). A method of producing a polypeptide according to the invention is an additional part of the invention. The method comprises the steps of (a) culturing a host according to this aspect of the invention under conditions conducive to the expression of the polypeptide, and (b) optionally, recovering the expressed polypeptide. The expressed polypeptide can be recovered from the cell free extract, cell pellet, or recovered from the culture medium. Methods to purify recombinantly expressed polypeptides are well known to the man skilled in the art.
[0057] In a sixth aspect, the present invention provides pharmaceutical compositions, comprising the polypeptide, nucleic acid, nucleic acid expression vector, and/or the recombinant host cell of any aspect or embodiment of the invention, and a pharmaceutically acceptable carrier. The pharmaceutical compositions of the invention can be used, for example, in the methods of the invention described below. The pharmaceutical composition may comprise in addition to the polypeptides, nucleic acids, etc. of the invention (a) a lyoprotectant; (h) a surfactant; (c) a bulking agent; (d) a tonicity adjusting agent (e) a stabilizer; (f) a preservative and/or (g) a buffer.
[0058] In some embodiments, the buffer in the pharmaceutical composition is a Tris buffer, a histidine buffer, a phosphate buffer, a citrate buffer or an acetate buffer. The pharmaceutical composition may also include a lyoprotectant, e.g. sucrose, sorbitol or trehalose. In certain embodiments, the pharmaceutical composition includes a preservative e.g. benzalkonium chloride, benzethonium, chlorohexidine, phenol, m-cresol, benzyl alcohol, methylparaben, propylparaben, chlorobutanol, o-cresol, p-cresol, chlorocresol, phenylmercuric nitrate, thimerosal, benzoic acid, and various mixtures thereof. In other embodiments, the pharmaceutical composition includes a bulking agent, like glycine. In yet other embodiments, the pharmaceutical composition includes a surfactant e.g., poly sorbate-20, polysorbate-40, polysorbate-60, polysorbate-65, polysorbate-80 polysorbate-85, poloxamer-188, sorbitan monolaurate, sorbitan monopalmitate, sorbitan monostearate, sorbitan monooleate, sorbitan trilaurate, sorbitan tristearate, sorbitan trioleaste, or a combination thereof. The pharmaceutical composition may also include a tonicity adjusting agent, e.g., a compound that renders the formulation substantially isotonic or isoosmotic with human blood. Exemplary tonicity adjusting agents include sucrose, sorbitol, glycine, methionine, mannitol, dextrose, inositol, sodium chloride, arginine and arginine hydrochloride. In other embodiments, the pharmaceutical composition additionally includes a stabilizer, e.g., a molecule which, when combined with a protein of interest substantially prevents or reduces chemical and/or physical instability of the protein of interest in lyophilized or liquid form. Exemplary stabilizers include sucrose, sorbitol, glycine, inositol, sodium chloride, methionine, arginine, and arginine hydrochloride.
[0059] The polypeptides, nucleic acids, etc. of the invention may be the sole active agent in the pharmaceutical composition, or the composition may further comprise one or more other active agents suitable for an intended use.
[0060] The pharmaceutical compositions described herein generally comprise a combination of a compound described herein and a pharmaceutically acceptable carrier, diluent, or excipient. Such compositions are substantially free of non-pharmaceutically acceptable components, i.e., contain amounts of non-pharmaceutically acceptable components lower than permitted by US regulatory requirements at the time of filing this application. In some embodiments of this aspect, if the compound is dissolved or suspended in water, the composition further optionally comprises an additional pharmaceutically acceptable carrier, diluent, or excipient. In other embodiments, the pharmaceutical compositions described herein are solid pharmaceutical compositions (e.g., tablet, capsules, etc.).
[0061] These compositions can be prepared in a manner well known in the pharmaceutical art, and can be administered by any suitable route. In a preferred embodiment, the pharmaceutical compositions and formulations are designed for oral administration. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable.
[0062] The pharmaceutical compositions can be in any suitable form, including but not limited to tablets, pills, powders, lozenges, sachets, cachets, elixirs, suspensions, emulsions, solutions, syrups, aerosols (as a solid or in a liquid medium), ointments containing, for example, up to 10% by weight of the active compound, soft and hard gelatin capsules, sterile injectable solutions, and sterile packaged powders.
[0063] In a seventh aspect, the present invention provides methods for treating celiac sprue, comprising administering to an individual with celiac sprue an amount effective to treat the celiac sprue of one or more polypeptides selected from the group consisting of the polypeptides of the first or second aspects of the invention, SEQ ID NO:33, and SEQ ID NO:67.
[0064] The inventors of the present invention have unexpectedly discovered that Kumamolisin-As is capable of degrading proline (P)- and glutamine (Q)-rich components of gluten known as `gliadins` believed responsible for the bulk of the immune response in most celiac sprue patients. The polypeptides of the invention show improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) (a substrate representative of gliadin) compared to wild type Kumamolisin-As.
[0065] In one embodiment, the one or more polypeptides comprise an amino acid sequence at least 75% identical to an amino acid sequence according to SEQ ID NO:35, wherein
[0066] (a) the polypeptide degrades a PQPQLP (SEQ ID NO:34) peptide at pH 4;
[0067] (b) residue 278 is Ser, residue 78 is Glu, and residue 82 is Asp.
[0068] In further embodiments, the one or more polypeptides comprise one or more amino acid changes from SEQ ID NO: 67 (wild type processed Kumamolisin-As) at one or more residues selected from the group consisting of residues 73, 102, 103, 104, 130, 165, 168, 169, 172, and 179 (numbering based on the wild type processed Kumamolisin-As amino acid sequence). In non-limiting embodiments, the one or more changes relative to the wild type processed Kumamolisin-As amino acid sequence (SEQ ID NO:67) are selected from the group consisting of:
TABLE-US-00003 WT Residue# AA change S73 K, G N102 D T103 S D104 A, T, N G130 S S165 N T168 A D169 N, G Q172 D D179 S, H
[0069] In various further non-limiting embodiments, the one or more changes relative to the wild type processed Kumamolisin-As amino acid sequence include at least N102D. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N102D and D169N or D169G. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N102D, D169G, and D179H. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least S73K, D104T, N102D, G130S, D169G, and D179H.
[0070] In various preferred embodiment, the one or more polypeptide s comprise or consist of an amino acid sequence at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence according to SEQ ID NO:35. In a further embodiment the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:35.
[0071] In various further embodiments, the one or more polypeptides comprise or consist of an amino acid sequence at least 75% identical to any one of SEQ ID NOS:36-66. The polypeptides represented by these SEQ ID NOS are specific examples of polypeptides with improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) (a substrate representative of gliadin) compared to wild type Kumamolisin-As. In various preferred embodiment, the polypeptide s comprise or consist of an amino acid sequence at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence according to any one of SEQ ID NOS:36-66. In a further embodiment the polypeptides comprise or consist of an amino acid sequence according to any one of SEQ ID NOS:36-66.
[0072] In a preferred embodiment, the polypeptides for use in the methods of this aspect of the invention comprise an amino acid according to SEQ ID NO:33 or a polypeptide comprising one or more amino acid changes from SEQ ID NO: 33 (wild type preprocessed Kumamolisin-As) at one or more residues selected from the group consisting of residues 119, 262, 291, 292, 293, 319, 354, 357, 358, 361, and 368 (numbering based on the wild type Kumamolisin-As amino acid sequence). In non-limiting embodiments, the one or more changes relative to the wild type Kumamolisin-As amino acid sequence are selected from the group consisting of:
TABLE-US-00004 WT Residue# AA change V119 D S262 K, G N291 D T292 S D293 A, T, N G319 S S354 N T357 A D358 N, G Q361 D D368 S, H
[0073] In various further non-limiting preferred embodiments, the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N291D. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N291D and 358N or 358G. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least N291D, 358G, and 368H. In another embodiment the one or more changes relative to the wild type Kumamolisin-As amino acid sequence include at least V119D, S262K, D293T, N291D, G319S, D358G, and D368H.
[0074] In various preferred embodiment, the polypeptide s for use in the methods of this aspect of the invention comprise or consist of an amino acid sequence at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence according to SEQ ID NO:1. In a further embodiment the polypeptides comprise or consist of an amino acid sequence according to SEQ ID NO:1.
[0075] In various further embodiments, the polypeptides for use in the methods of this aspect of the invention comprise or consist of an amino acid sequence at least 75% identical to any one of SEQ ID NOS:2-32. The polypeptides represented by these SEQ ID NOS are specific examples of polypeptides with improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) (a substrate representative of gliadin) compared to wild type Kumamolisin-As. In various preferred embodiment, the polypeptides for use in the methods of this aspect of the invention comprise or consist of an amino acid sequence at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to an amino acid sequence according to any one of SEQ ID NOS:2-32. In a further embodiment the polypeptides comprise or consist of an amino acid sequence according to any one of SEQ ID NOS:2-32.
[0076] In an eighth aspect, the present invention provides methods for treating celiac sprue, comprising administering to an individual with celiac sprue a polypeptide comprising an amount effective of amino acid sequence according to any one of SEQ ID NOS: 1-67 to treat the celiac sprue.
[0077] In one embodiment, the polypeptides administered comprise or consist of an amino acid sequence according to SEQ ID NO:2. Polypeptides according to SEQ ID NO:2 have a N291D mutation relative to wild-type Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 10-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As.
[0078] In another embodiment of this second aspect, the polypeptides administered comprise or consist of an amino acid sequence according to SEQ ID NO:3. Polypeptides according to SEQ ID NO:3 have a N291D mutation and a D358N or D358G mutation relative to wild-type Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 20-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As.
[0079] In another embodiment of this second aspect, the polypeptides administered comprise or consist of an amino acid sequence according to SEQ ID NO:4. Polypeptides according to SEQ ID NO:4 have a N291D mutation, a D358G, and a D368H mutation relative to wild-type Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 50-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As.
[0080] In a further embodiment of this second aspect, the polypeptides administered comprise or consist of an amino acid sequence according to any one of SEQ ID NOS:5-32. Polypeptides according to these embodiments have all been demonstrated to show improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As. In a preferred embodiment, the polypeptide administered comprises or consists of an amino acid sequence according to SEQ ID NO:32; this polypeptide is shown in the examples that follow to possess the most potent protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) of any of the polypeptides tested.
[0081] In another embodiment, the one or more polypeptides comprise an amino acid sequence according to SEQ ID NO:36. Polypeptides according to SEQ ID NO:36 have a N102D mutation relative to wild-type processed Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 10-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As. In another embodiment, the one or more polypeptides comprise an amino acid sequence according to SEQ ID NO:37. Polypeptides according to SEQ ID NO:37 have a N102D mutation and a D169N or D169G mutation relative to wild-type processed Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 20-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As. In another embodiment, the one or more polypeptides comprise an amino acid sequence according to SEQ ID NO:38. Polypeptides according to SEQ ID NO:38 have a N102D mutation, a D169G, and a D179H mutation relative to wild-type processed Kumamolisin-As. As shown in the examples that follow, polypeptides containing this mutation have at least 50-fold improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As. In further embodiments, the one or more polypeptides comprise an amino acid sequence according to any one of SEQ ID NOS:39-66. Polypeptides according to these embodiments have all been demonstrated to show improved protease activity at pH 4 against the oligopeptide PQPQLP (SEQ ID NO: 34) compared to wild type Kumamolisin-As. In a preferred embodiment, the polypeptide comprises or consists of an amino acid sequence according to SEQ ID NO:66.
[0082] Celiac sprue (also known as celiac disease or gluten intolerance) is a highly prevalent disease in which dietary proteins found in wheat, barley, and rye products known as `glutens` evoke an immune response in the small intestine of genetically predisposed individuals. The resulting inflammation can lead to the degradation of the villi of the small intestine, impeding the absorption of nutrients. Symptoms can appear in early childhood or later in life, and range widely in severity, from diarrhea, fatigue, weight loss, abdominal pain, bloating, excessive gas, indigestion, constipation, abdominal distension, nausea/vomiting, anemia, bruising easily, depression, anxiety, growth delay in children, hair loss, dermatitis, missed menstrual periods, mouth ulcers, muscle cramps, joint pain, nosebleeds, seizures, tingling or numbness in hands or feet, delayed puberty, defects in tooth enamel, and neurological symptoms such as ataxia or paresthesia. There are currently no effective therapies for this lifelong disease except the total elimination of glutens from the diet. Although celiac sprue remains largely underdiagnosed, its' prevalence in the US and Europe is estimated at 0.5-1.0% of the population.
[0083] As used herein, "treating celiac sprue" means accomplishing one or more of the following: (a) reducing the severity of celiac sprue; (b) limiting or preventing development of symptoms characteristic of celiac sprue; (c) inhibiting worsening of symptoms characteristic of celiac sprue; (d) limiting or preventing recurrence of celiac sprue in patients that have previously had the disorder; (e) limiting or preventing recurrence of symptoms in patients that were previously symptomatic for celiac sprue; and (f) limiting development of celiac sprue in a subject at risk of developing celiac sprue, or not yet showing the clinical effects of celiac sprue.
[0084] The individual to be treated according to the methods of the invention may be any individual suffering from celiac sprue, including human subjects. The individual may be one already suffering from symptoms or one who is asymptomatic.
[0085] As used herein, an "amount effective" refers to an amount of the polypeptide that is effective for treating celiac sprue. The polypeptides are typically formulated as a pharmaceutical composition, such as those disclosed above, and can be administered via any suitable route, including orally, parentally, by inhalation spray, or topically in dosage unit formulations containing conventional pharmaceutically acceptable carriers, adjuvants, and vehicles. In a preferred embodiment, the pharmaceutical compositions and formulations are orally administered, such as by tablets, pills, lozenges, elixirs, suspensions, emulsions, solutions, or syrups.
[0086] Dosage regimens can be adjusted to provide the optimum desired response (e.g., a therapeutic or prophylactic response). A suitable dosage range may, for instance, be 0.1 ug/kg-100 mg/kg body weight; alternatively, it may be 0.5 ug/kg to 50 mg/kg; 1 ug/kg to 25 mg/kg, or 5 ug/kg to 10 mg/kg body weight. The polypeptides can be delivered in a single bolus, or may be administered more than once (e.g., 2, 3, 4, 5, or more times) as determined by an attending physician.
Examples
[0087] Celiac disease is an autoimmune disorder that afflicts approximately 1% of the population.sup.1,2. This disease is characterized by an inflammatory reaction to gluten, the major protein in wheat flour, and to related proteins in barley and rye.sup.2. Gluten is composed of a heterogeneous mixture of the glycoproteins gliadin and glutenin.sup.3. Upon ingestion, .alpha.-gliadin is partially degraded by gastric and intestinal proteases to oligopeptides, which are resistant to further proteolysis due to their unusually high proline and glutamine content.sup.3 (FIG. 1). Immunogenic oligopeptides that result from incomplete proteolysis are enriched in the PQ motif.sup.4,5 (FIG. 1), which stimulate inflammation and injury in the intestine of people with Celiac disease. Currently the only treatment for this disease is complete elimination of gluten from the diet, which is difficult to attain due to the ubiquity of this protein in modern food products.sup.6.
[0088] Oral enzyme therapy (OET) in which orally administered proteases are employed to hydrolyze immunogenic peptides before they are capable of triggering inflammation is currently being explored as a treatment for gluten intolerance. For this purpose, several different proteases have been considered due to their specificity for cleavage after either proline or glutamine residues.sup.4,7-9. However, these enzymes often demonstrate characteristics that hinder their use in OET for gluten degradation. Most of these peptidases exhibit optimal catalytic activity at neutral pH; however, the pH of the human stomach ranges from 2 to 4. These enzymes are therefore most active when they reach the pH-neutral small intestine, which is too late for effective prevention of Celiac disease as this is the site where gluten-derived pathology develops.sup.2,10. Additionally, several of these enzymes demonstrate instability in the low pH of the human stomach, are susceptible to proteolysis by digestive proteases, or require extensive refolding procedures during their purification.sup.7,11, which are all characteristics that hamper efforts for clinical use.
[0089] The ideal protease for the application of OET in the treatment of gluten intolerance would combine the following traits: optimal activity at low pH, easy purification, stability under the conditions of the human stomach, and high specificity for amino acid motifs found in gluten-derived immunogenic oligopeptides. Here we report the engineering of an endopeptidase that demonstrates these traits. We identified a protease that is highly active in acidic conditions, Kumamolisin-As (KumaWT) from the acidophilic bacterium Alicyclobacillus sendaiensis, and used computational modeling tools to engineer it toward the desired oligopeptide specificity. The computationally designed enzyme, designated KumaMax.TM., exhibited over 100-fold increased proteolytic activity and an 800-fold switch in substrate specificity for the targeted PQ motif compared to wild-type KumaWT. In addition, KumaMax.TM. demonstrates resistance to common gastric proteases and is produced at high yields in E. coli without the need for refolding. Thus, this protease and others reported herein represent promising therapeutic candidates for Celiac disease.
Results
Selection and Computational Design of an .alpha.-Gliadin Endopeptidase
[0090] In order to engineer a novel protease that can degrade gluten peptides under gastric conditions, we first focused on identifying an appropriate protease as a starting point for our engineering efforts. Ideally, the template protease would combine stability and activity at low pH with demonstrated specificity for a dipeptide amino acid motif. We identified the enzyme Kumamolisin-As (KumaWT) as a template, since this protease naturally has an optimal activity over the pH range of 2-4.sup.12, which matches the approximate pH ranges in the human stomach before and after a meal is ingested (pH 2 and 4, respectively).sup.13. KumaWT also demonstrates high stability and activity at the physiologically relevant temperature 37.degree. C. In addition, the purification of this enzyme is straightforward and yields significant quantities using standard recombinant protein production methods in E. coli.sup.14, an important property both for screening mutant libraries and for its ultimate generation in large batches for use in OET. Finally, KumaWT naturally recognizes a specific dipeptide motif as opposed to single amino acid specificity.sup.14. This is an important property for an oral protease therapeutic meant to be taken during digestion, since dipeptide specificity should result in reduced competitive inhibition by other food-derived peptides in the stomach.
[0091] An effective OET for Celiac disease would likely demonstrate specificity for Proline-Glutamine (PQ), due to the frequent occurrence of this dipeptide in immunogenic gluten-derived oligopeptides (FIG. 1). KumaWT has a strong specificity for proline at the P2 position of its peptide substrate, matching one of the amino acid residues of interest for the degradation of immunogenic .alpha.-gliadin peptides. In the P1 site, KumaWT has been established to prefer the positively charged amino acids arginine or lysine.sup.14. Despite this preference, KumaWT is also capable of recognizing glutamine at the P1 position, albeit at a significantly decreased level compared to its recognition of arginine or lysine.sup.14. This slight innate proclivity to recognize glutamine at the P1 position suggests that KumaWT may be amenable to re-engineering to prefer glutamine at this position. At the P1' site, KumaWT demonstrates broad specificity, which is desirable since the residue in the position after the PQ motif varies among the different immunogenic peptides, as depicted in FIG. 1.
[0092] Given these characteristics of KumaWT, our primary goal was to computationally redesign the S1 binding pocket of KumaWT such that it would prefer a PQ dipeptide motif over the native PR or PK substrates. Using the Rosetta Molecular Modeling Suite, we modeled the PR dipeptide in the S1 binding pocket of KumaWT using this enzyme's solved crystal structure (PDB ID: 1T1E). This revealed that two negatively-charged amino acids, D358 and D368, likely facilitate binding of the positively charged amino acids in the P1 position (FIG. 2A). The native specificity for proline at P2 appears to be derived in large part from a hydrophobic interaction of this amino acid residue with the aromatic ring of W318 in the S2 pocket of the enzyme. As specificity of the P1 position for proline is desired in our enzyme variant, we maintained this native tryptophan during the design of the S1 pocket.
[0093] To redesign the KumaWT substrate specificity of the S1 pocket to prefer glutamine at the P1 position, we generated theoretical mutations in the KumaWT binding pocket using the Foldit interface to the Rosetta Molecular Modeling Suite. A tetrapeptide that represents a common immunogenic motif found throughout .alpha.-gliadin, PQLP (SEQ ID NO: 68), was modeled into the P2 to P2' active site positions. This structure already contained a polypeptide bound in the active site, so the residues of this polypeptide were mutated using Rosetta to the PQLP (SEQ ID NO: 68) tetrapeptide motif. A total of 75 residues within an 8 .ANG. sphere of the tetrapeptide were randomized to any of the 20 naturally occurring amino acids in order to find mutations that would favor binding of glutamine in the S1 pocket. These mutations were accepted if the overall energy of the new enzyme-PQLP substrate complex was either reduced relative to the native substrate, or was not increased by more than 5 Rosetta energy units. To accommodate the smaller, neutral amino acid glutamine, we focused our computational efforts on 1) removing the negative charge of the S1 pocket during the design process, 2) filling in open space that resulted from the replacement of the large amino acid arginine with glutamine, and 3) providing hydrogen bonds to the amide functional group of the glutamine. This computational modeling yielded 107 novel designs containing from 1 to 7 simultaneous mutations. These designed proteins were then constructed and their catalytic activity against a PQLP (SEQ ID NO: 68) peptide was assessed.
[0094] In order to test the activity of each of these designed proteases against the PQLP (SEQ ID NO: 68) motif, the desired mutations were incorporated into the native nucleotide sequence using site directed mutagenesis, and mutant enzyme variants were produced in E. coli BL21(DE3) cells. These enzyme variants were then screened for enzymatic activity in clarified whole cell lysates at pH 4 using the fluorescently quenched .alpha.-gliadin hexapeptide analogue QXL520-PQPQLP-K(5-FAM)-NH2 (FQ) (SEQ ID NO: 69) as a substrate. Of the 107 enzyme variants tested in this assay, 13% resulted in a loss of enzymatic function, 32% did not demonstrate a significant difference in activity relative to KumaWT, and 55% resulted in an increase in observed activity against this substrate. Twenty-eight of the most promising enzyme variants that exhibited a significant increase in activity in cell lysates were then purified in order to obtain an accurate comparison of enzymatic activity to that of KumaWT. After purification and correction for protein concentration, the activities of these enzymes ranged from 2-fold to 120-fold more active than KumaWT (Table 1). The most active variant, which was named KumaMax.TM., was selected for further characterization.
TABLE-US-00005 TABLE 1 Fold change in hydrolytic activity on PQ motif of all purified and sequenced mutants, relative to wild type Kumamolysin-As. These are the fold-change results (calculated as described in Supplementary Table 1) for all mutants that were purified, sequenced, and tested against wild-type Kumamolysin in the pure protein assay. The assay took place at pH 4, with enzyme final concentration of 0.0125 mg/mL and substrate concentration of 5 .mu.M. Fold Change in Activity of PQ Hydrolysis Rela- Mutations to Wild Type Kumamolysin-As tive to Wild Type (Preprocessed) Kumamolysin-As Wild Type (WT) 1.0 T357A 2.0 G319S, D368S 2.0 D358G 3.0 D293A 3.0 D358N 4.0 G319S, S354N, D358G, D368H 5.0 D358G, D368H 6.0 G319S, D358G, D368H 7.0 N291D, Q361D 7.5 S354N, D358G, D368H 9.0 N291D 10.0 N291D, D293A, Q361D, D358N 14.8 N291D, D293A 15.0 N291D, D293A, D358G, Q361D 15.0 N291D, D358N 18.9 N291D, Q361D, D358G 20.0 N291D, G319S, D358G, Q361D, D368H 23.1 N291D, D293A, D358N 24.0 S262G, T292S, N291D, G319S, D358G, D368H 29.0 N291D, D293A, G319S, D358G, Q361D, D368H 40.9 T292S, N291D, G319S, D358G, D368H 49.0 N291D, G319S, S354N, D358G, Q361D, D368H 50.0 N291D, G319S, S354N, D358G, D368H 54.6 N291D, D293A, G319S, S354N, D358G, Q361D, 58.0 D368H D293T, N291D, G319S, D358G, D368H 58.0 S262K, D293N, N291D, G319S, D358G, D368H 62.0 N291D, G319S, D358G, D368H 93.0 V119D, S262K, D293T, N291D, G319S, D358G, 120.0 D368H
[0095] KumaMax.TM. contains seven mutations from the wild-type amino acid sequence: V119D, S262K, N291D, D293T, G319S, D358G, D368H (FIG. 2B). Of these, the mutations G319S, D358G, and D368H appear to synergistically introduce a new hydrogen bond with the desired glutamine residue at position P1. As modeled, the G319S mutation appears to introduce a hydroxyl group that is located 2.5 .ANG. from the carbonyl oxygen of the glutamine amide, potentially contributing a new hydrogen bond that interacts with glutamine in the P1 pocket. The D368H mutation is predicted to stabilize the serine hydroxyl, and its position in the active site is in turn sterically allowed by the D358G mutation. In addition to providing a novel desired interaction with glutamine as modeled, these three mutations also remove the two acidic residues predicted to stabilize the positively charged arginine residue in the native KumaWT substrate (FIG. 2). V119D, which was unexpectedly incorporated during site directed mutagenesis, is located in the propeptide domain and therefore does not affect catalytic activity of the mature enzyme. The other three mutations do not make direct contacts with residues in the P2-P2' pockets, and therefore likely introduce interactions with other components of the hexapeptide, the fluorophore, or the quencher. It is clear that these mutations are important for the overall catalytic enhancement observed, as the G319S/D358G/D368H triple mutant alone demonstrated only a 7-fold increase in catalytic activity over KumaWT; roughly 17-fold lower than that determined for KumaMax.TM..
Kinetic Characterization and Substrate Specificity
[0096] The catalytic efficiencies for KumaMax.TM. and KumaWT against the FQ immunogenic gluten substrate analogue, as calculated by fitting a velocity versus substrate curve over 6-100 .mu.M substrate, were found to be 568 M.sup.-1s.sup.-1 and 4.9 M.sup.-1s.sup.-1, respectively (Table 2) These values are consistent with the observation that KumaMax.TM. demonstrated a 120-fold increase in enzymatic activity towards the FQ substrate in the initial activity screen mentioned above. Unfortunately, no significant saturation of velocity at these substrate concentrations was observed, and therefore the individual kinetic constants k.sub.cat and K.sub.M could not be determined. This is not surprising since previous analyses of the kinetic constants of KumaWT report a Km of 40 .mu.M. Therefore, no significant saturation would be expected at substrate concentrations less than 100 .mu.M.
TABLE-US-00006 TABLE 2 Kinetic Constants of peptide substrates for KumaMax .TM. and KumaWT. The catalytic efficiency (k.sub.cat/K.sub.M M.sup.-1s.sup.-1) for both KumaWT and KumaMa .TM. for the fluorescently (Fl) quenched (Qu) PQPQLP (SEQ ID NO: 34) peptide was fit to a linear curve as no saturation was observed up to 100 .mu.M substrate. The fluorescence signal was quantified as described in Materials and Methods with a standard curve that accounted for substrate quenching of product fluorescence. The catalytic efficiency for the pNA-linked peptides was determined in a similar manner, and is described in the Materials and Methods. All fits had at least six independently measured rates with an R.sup.2 greater than 0.9. Catalytic Efficiency M.sup.-1s.sup.-1 Qu-PQPQLP-Fl Suc-APQ-pNA Suc-APR-pNA Suc-APE-pNA Suc-AQP-pNA KumaWT 4.9 .+-. 0.2 n.d. 131.8 .+-. 3.8 4.0 .+-. 0.1 n.d. KumaMax .TM. 568.5 .+-. 14.6 6.7 .+-. 0.4 n.d. 1.4 .+-. 0.2 n.d. n.d. not detected.
[0097] While the increased activity of KumaMax.TM. compared to KumaWT against the fluorescently quenched PQPQLP (SEQ ID NO: 34)hexapeptide substrate suggests that KumaMax has increased preference for a PQ dipeptide motif, it does not report directly on substrate specificity. To confirm that the specificity of KumaMax.TM. had indeed been altered to prefer the PQ dipeptide over the native PR dipeptide of KumaWT, four peptides in the form of Succinyl-Alanine-P2-P1-P1' were provided as substrates to both enzymes in order to assess P2 and P1 specificity. These peptides contained the reporter p-nitroaniline (pNA) at the P1' position, which allows for a spectrometric readout of peptide cleavage. The four peptides harbored the following amino acids at the P2 and P1 positions, respectively: proline-glutamine (PQ), proline-arginine (PR), glutamine-proline (QP), and proline-glutamate (PE). Catalytic efficiencies were calculated for each substrate and are reported in Table 2. As in the determination of catalytic activities against the FQ substrate, no saturation of activity on these peptides by KumaWT or KumaMax.TM. was observed. This suggests that the pNA group may partially disrupt binding in the P1' pocket, since substrate concentrations up to 1 mM were tested, well beyond saturation levels previously reported for alternative KumaWT substrates.
[0098] In this specificity assay, KumaMax.TM. demonstrated its highest level of activity on the PQ substrate, the dipeptide that it had been designed to prefer. While KumaMax.TM. was not explicitly designed to demonstrate a decrease in specificity for the PR motif or for other motifs, its increased specificity for PQ could decrease its activity for non-targeted motifs. Indeed, KumaMax.TM. exhibited no significant catalytic activity against the QP or PR substrates in this assay (Table 2). Consistent with previous reports'', KumaWT exhibited its highest level of activity on the PR motif. KumaWT demonstrated significantly lower levels of activity on the three other peptide substrates. While catalytic activity of KumaWT on the PQ dipeptide motif has previously been reported.sup.14, no significant activity on the PQ dipeptide substrate was observed in this assay, which may be due to disruptive effects of pNA on the binding of this peptide to the enzyme active site. Both enzymes demonstrated activity towards the isosteric substrate PE, which is predicted to have neutral charge at pH 4; however, KumaMax.TM. demonstrated a roughly 5-fold decrease in activity on the PE peptide substrate compared to its activity on the PQ substrate, which illustrates its exquisite selectivity for the PQ dipeptide motif.
[0099] As discussed previously, there are several enzymes currently being explored as OET for Celiac disease. Two of these enzymes are engineered forms of the prolyl endopeptidase SC PEP and the glutamine-specific endoprotease EP-B2.sup.15. To compare the catalytic efficiencies of these proteases to that of KumaMax.TM., the native SC PEP and EP-B2 enzymes were expressed in E. coli BL21(DE3) cells, purified, and their catalytic activities assessed. SC-PEP demonstrated a catalytic efficiency of 1.6 M.sup.-1s.sup.-1 on the FQ gluten substrate analogue at pH 4, which represents a roughly 350-fold lower level of activity on this substrate than KumaMax.TM.. At pH 4, SC PEP did not exhibit any significant activity on any of the four pNA linked peptide substrates, including QP. Although previous studies using similar pNA-linked peptides have demonstrated activity of SC PEP on these substrates, those assays were performed at a pH of 4.5 or higher.sup.15. Like other groups, we found that SC PEP demonstrated significant levels of activity on the QP substrate at pH 7, with a catalytic efficiency of 2390 M.sup.-1s.sup.-1, thereby confirming that this recombinant SC PEP was fully functional (data not shown). This is consistent with previous literature reporting that SC PEP has low to negligible levels of catalytic activity in the pH range of the stomach, and is thus only expected to be effective once .alpha.-gliadin peptides have reached the small intestine.sup.15,16.
[0100] For EP-B2, only very low levels of activity were detected on the FQ substrate at pH 4, and no activity on any of the four pNA peptide substrates was observed (data not shown). This is inconsistent with previous reports of EP-B2 activity using comparable substrates.sup.11. EP-B2 is a difficult enzyme to purify, as it forms inclusion bodies in E. coli and requires refolding to obtain active enzyme. We were unable to obtain soluble protein using previously reported methods for the refolding of EP-B2.sup.11,17,18, so we used an on-column refolding process which resulted in soluble protein produced. Although this soluble EP-B2 demonstrated the expected self-processing activity at pH 4.sup.11 (data not shown), the lack of activity of this enzyme suggests that it may not have refolded properly using our methods. This could be due to alternative N and C-terminal tags arising from the use of different protein expression vectors and warrants further investigation.
Protease Stability
[0101] In addition to demonstrating catalytic activity at low pH, any protein therapeutic intended for use in the human digestive tract must exhibit resistance to degradation by digestive proteases. Two of the most abundant proteases in the stomach and small intestine are pepsin and trypsin, respectively. Pepsin demonstrates optimal proteolytic activity at the low pH range of the stomach, while trypsin is primarily active at the more neutral pH of the small intestine. To assess the resistance of KumaMax.TM. to degradation by these proteases, 0.01 or 0.1 mg/mL of KumaMax.TM. were incubated with each protease, in their respective optimal pH ranges, at 0.1 mg/mL, which is a physiologically relevant concentration of both pepsin and trypsin. SC PEP and EP-B2 were included as controls, as EP-B2 has been established to be resistant to pepsin but susceptible to trypsin, and SC PEP demonstrates susceptibility to both proteases.sup.11,15. Each protein was incubated in the presence or absence of the respective protease for 30 minutes, after which the proteins were heat inactivated and the remaining non-proteolyzed fraction determined using an SDS-PAGE gel (FIG. 3).
[0102] In this assay, KumaMax.TM. demonstrated high stability against both pepsin and trypsin, with roughly 90% intact protein remaining after the half hour incubation with either protease (FIG. 3). Consistent with previous reports, SC PEP exhibited susceptibility to both pepsin and trypsin, with less than 20% of the enzyme remaining after incubation with these proteases. As expected, trypsin efficiently proteolyzed EP-B2 with less than 10% remaining after incubation, but no significant degradation of EP-B2 was observed in the presence of pepsin. To confirm that observed protein degradation was due to protease activity and not to enzymatic self-processing, each enzyme was incubated at either pH 4 or 7 and apparent proteolysis was analyzed in the absence of other proteases over the course of an hour (data not shown). KumaMax.TM. and EP-B2, but not SC PEP, demonstrated self-processing from the pro-peptide to the active enzyme form in fewer than 10 minutes at pH 4. All three proteins remained >90% stable over the course of the hour. None of these proteins showed significant levels of self-processing or proteolysis during incubation for one hour at pH 7.
Degradation of an Immunogenic .alpha.9-Gliadin Peptide
[0103] The significant level of catalytic activity exhibited by KumaMax.TM. on immunogenic peptide analogues (Table 2) demonstrates promise for the use of KumaMax.TM. as a therapeutic in OET for gluten intolerance. However, these assays do not directly assess the ability of this enzyme to degrade relevant immunogenic peptides derived from gluten. Therefore, we examined the direct proteolytic activity of KumaMax.TM. towards an immunodominant peptide present in .alpha.9-gliadin, QLQPFPQPQLPY (SEQ ID NO: 70).
[0104] KumaMax was incubated with 500 .mu.M of the .alpha.9-gliadin peptide at 37.degree. C. in pH 4 at roughly a 1:100 enzyme to peptide molar ratio, which represents a physiologically relevant concentration of this peptide in the human stomach. SC PEP was included in this experiment for the sake of comparison, since this enzyme demonstrates significantly less activity against the FQ substrate than KumaMax.TM.. Samples from the incubation were quenched every 10 minutes in 80% acetonitrile to halt the proteolysis reaction. The remaining fraction of intact immunogenic peptide was determined using high-performance liquid-chromatography mass spectroscopy, in which the M+H parent ion of the .alpha.9-gliadin peptide was monitored. KumaMax.TM. demonstrated a high level of activity against the immunogenic peptide in this assay, as over 95% of the immunogenic peptide had been proteolyzed after a 50 minute incubation with KumaMax.TM., while no significant degradation of the peptide was observed in the presence of SC PEP or in the absence of protease (FIG. 4A). The half-life of the peptide in the presence of KumaMax.TM. was determined by plotting the fraction of peptide remaining against the incubation time, and was calculated to be 8.5.+-.0.7 minutes (FIG. 4B).
Discussion
[0105] Enzyme therapy is an attractive method for the treatment of Celiac disease since this form of treatment would not require intravenous injection. However, it is a challenge to identify an appropriate protease for use in OET that demonstrates all the properties necessary to be an effective therapeutic for Celiac disease. Specifically, an ideal protease for use in OET would maintain activity in a pH range from 2-4 at 37.degree. C. and would resist degradation by common digestive proteases. In addition, the protein therapeutic would ideally demonstrate stringent specificity for a common motif found in immunogenic gluten-derived peptides. Finally, the protein should be easily produced using recombinant methods. While it is unlikely that a single natural enzyme will encompass all of these properties, we demonstrate that a protein containing several of these important characteristics can be engineered to demonstrate the lacking qualities through computational analysis, mutagenesis, and screening.
[0106] The engineered protease, KumaMax.TM., demonstrated a high level of activity on, and specificity towards, the desired PQ dipeptide motif (Table 2). The specificity for the PQ motif, as opposed to the native PR motif, potentially derives from the addition of new hydrogen bonds in the 51 pocket of KumaMax that, as modeled, make direct contacts with the glutamine in this dipeptide motif (FIG. 2B). This specificity switch not only directs activity against a motif found commonly throughout gluten, but it also greatly decreases activity against non-targeted substrates (Table 2). The inability for an oral protease to recognize non-targeted substrates is an important characteristic as it reduces competitive inhibition by the large number of other peptides produced in the stomach during digestion of a meal. KumaMax.TM. or KumaWT can potentially act as platforms for engineering greater specificity, as KumaWT has demonstrated some level of selectivity beyond the P2 and P1 sites Using this method, a panel of customized proteases specific for unique immunogenic epitopes could be generated.
Methods
Protein Expression and Purification
[0107] The genes encoding each protein of interest, harbored in the pET29b plasmid, were transformed into Escherichia coli BL21 (DE3) cells. Individual colonies were picked, inoculated into Terrific Broth.TM. with 50 .mu.g/.mu.L Kanamycin (TB+Kan), and incubated overnight at 37.degree. C. 500 uL of the overnight culture was added to 500 mL autoinduction media (5 g tryptone, 2.5 g yeast extract, 465 mL ddH.sub.2O), and shaken at 37.degree. C. for roughly 4 hours, then the autoinduction components were added (500 uL MgSO.sub.4, 500 uL 1000.times. trace metals, 25 mL 20.times.NPS, 10 mL 20.times.5052, 500 uL 50 mg/mL Kan). The cultures were then shaken at 18.degree. C. for 30 hours before being spun down. Pellets were resuspended in 10 mL 1.times.PBS, then lysed via sonication with 5 mL lysis buffer (50 mM HEPES, 500 mM NaCl, 1 mM bME, 2 mg/mL lysozyme, 0.2 mg/mL DNase, ddH.sub.2O) and spun down. The proteins were then purified over 1 mL TALON cobalt affinity columns. KumaMax, KumaWT, and SC Pep were washed three times with 20 mL wash buffer (10 mM imidazole, 50 mM HEPES, 500 mM NaCl, 1 mM bME, ddH.sub.2O), and then eluted in 15 mL of elution buffer (200 mM imidazole, 50 mM HEPES, 500 mM NaCl, 1 mM bME). EP-B2 had to be refolded on the column, so after lysis the pellets were resuspended in 10 mL of EP-B2 buffer, which differs from the wash buffer only in that it is diluted in guanidine hydrochloride instead of ddH.sub.2O to allow for denaturation of the EP-B2 inclusion bodies. This resuspension was pelleted, and the supernatant (containing denatured EP-B2) was filtered with a 0.8 .mu.m filter onto the column. EP-B2 was washed once with 20 mL of the EP-B2 buffer, before being washed twice with 20 mL of the wash buffer to refold the protein on the column. Protein was eluted with 15 ml of the elution buffer. All proteins were concentrated from 15 mL down to .about.500 uL, then dialyzed once in 1 L dialysis buffer (20% glycerol, 50 mM HEPES, 500 mM NaCl, 1 mM bME). Protein concentration was calculated spectrophotometrically with extinction coefficients of 53,985 M.sup.-1 cm.sup.-1 for KumaWT and all KumaWT variants, 152,290 M.sup.-1 cm.sup.-1 for SC Pep, and 58,245 M.sup.-1cm.sup.-1 for EP-B2.
Screening Method
[0108] Kunkel mutagenesis was used to generate mutations to KumaWT. Individual colonies picked from plates were grown up in 96-deep well plates. After lysing the cells with Triton buffer (1% 100.times. Triton, 1 mg/mL lysozyme, 0.5 mg/mL DNase, 1.times.PBS), the supernatant was adjusted to pH 4 with a 100 mM sodium acetate buffer. To crudely screen for activity against the FQ substrate, 10 uL of supernatant was added to 90 uL of 5 .mu.M substrate in a 96-well black plate, and the fluorescence was measured at 30-second intervals for 1 hour.
Purified Enzyme Assay
[0109] The variants of Kumamolisin-As that displayed the most activity on the FQ substrate in the activity screen were sequenced, then purified in small scale. 500 uL of TB+Kan overnight cultures were added to 50 mL TB+Kan and grown at 37.degree. C. until reaching an optical density of 0.5-0.8. IPTG was added to 0.5 mM, and the cultures were expressed at 22.degree. C. for 16-24 hours. The cells were spun down, resuspended in 500 uL of wash buffer (1.times.PBS, 5 mM imidazole, ddH.sub.2O), transferred to a 2 mL Eppendorf tube, and lysed in 1 mL lysis buffer (1.times.PBS, 5 mM imidazole, 2.times. Bug Buster.TM., 2 mg/mL lysozyme, 0.2 mg/mL DNase, ddH.sub.2O). After centrifugation, the supernatant was decanted into a fresh tube. Columns with 200 uL of TALON cobalt resin were placed in Eppendorf tubes, and the supernatant was poured over the columns and rocked for 20 minutes before spinning down and discarding the flow-through. The proteins were washed three times with 500 uL wash buffer, discarding the flow-through between washes. Enzymes were eluted in 200 uL elution buffer (1.times.PBS, 200 mM imidazole, dd H.sub.2O), and concentrations were calculated spectrophotometrically using an extinction coefficient of 53,985 M.sup.-1cm.sup.-1.
[0110] For the assay, the Kumamolisin-As mutants were incubated for 15 minutes in pH 4 100 mM sodium acetate buffer. Enzyme was added to 5 .mu.M substrate so that the final protein concentration was 0.0125 mg/mL. The fluorescence was measured at 30-second intervals for 1 hour.
Kinetic Characterization
[0111] Enzyme variant proclivity for gluten degradation was measured by hydrolysis of the fluorescently quenched .alpha.-gliadin hexapeptide analogue QXL520-PQPQLP-K(5-FAM)-NH2 (FQ) (SEQ ID NO: 69) as a substrate. Each enzyme was incubated at room temperature for 15 minutes in 100 mM pH 4 sodium acetate buffer. After 15 minutes, 50 uL of fluorescent substrate was added ranging in final concentration between 100, 50, 25, 12.5, 6.25, and 0 .mu.M peptide, and maintaining concentrations of 0.05 .mu.M KumaMax.TM., 0.5 .mu.M KumaWT, 0.5 .mu.M SC Pep, and 0.5 .mu.M EP-B2 across all variations in substrate concentration. The plate was read immediately on the spectrophotometer for an hour, using 455 nm wavelength for excitation and reading 485 nm wavelength for emission.
[0112] The enzymes were also tested for specificity to different dipeptide motifs using a variety of chromogenic substrates that release p-nitroaniline (pNA) upon hydrolysis: [Suc-APQ-pNA], [Suc-AQP-pNA], [Suc-APE-pNA], and [Suc-APR-pNA]. Again, each enzyme was incubated at room temperature for 15 minutes in 100 mM pH 4 sodium acetate buffer. After 15 minutes, 20 uL of substrate was added to the enzyme incubation so that the final concentrations of substrate ranged between 1000, 500, 250, 125, 62.5, 31.25, 15.625, and 0 .mu.M, and all enzymes being tested ended in a concentration of 0.5 .mu.M. The plate was read immediately on the spectrophotometer for an hour, monitoring absorption by the reactions at 385 nm.
[0113] The standard curve for the fluorescent peptide involved mixing substrate and product together at varying concentrations in pH 4 buffer. Substrate concentrations were 100, 50, 25, 12.5, 6.25, and 0 .mu.M, and product concentrations were 20, 5, 1.25, 0.3125, 0.078125, 0 .mu.M.
[0114] The standard curve for the absorbent peptide involved product concentrations of 100, 50, 25, 12.5, 6.25, 3.125, 1.5625, 0.78125, 0.390625, 0.1953125, 0.09765625, and 0 .mu.M diluted in pH 4 buffer.
Protease Stability
[0115] Enzyme stability was determined in the presence the digestive proteases, pepsin and trypsin. KumaWT, KumaMax.TM., SC Pep, and EP-B2 were incubated in buffer matching the native pH environment of each digestive protease. pH 3.5 100 mM sodium acetate was used to pre-incubate the enzymes for pepsin digestion assays, and pH 7.5 dialysis buffer (see "Protein Expression and Purification") for the trypsin digestion assays. Each experimental enzyme was incubated at 37.degree. C. for 15 minutes in each buffer, at a concentration of 0.2 mg/mL.
[0116] After pre-incubation in the appropriate buffer, 0.1 mg/mL digestive protease was added. The reactions were done in triplicate, and were incubated at 37.degree. C. for 30 minutes. Adding SDS and boiling for 5 minutes ensured digestive protease inactivation. An SDS-PAGE gel allowed quantification of enzyme degradation, using ImageJ.
[0117] The rate of protein self-proteolysis was determined at pH 4 and 7.5 in the absence of pepsin or trypsin. Each enzyme, at a concentration of 0.2 mg/mL, was incubated in pH 4 100 mM sodium acetate and pH 7.5 dialysis buffer. At 20, 40, and 60 minutes, timepoints were taken. SDS was added, and the aliquots were boiled for 5 minutes to ensure denaturation of the enzymes and inhibition of further self-proteolysis. Again, an SDS-PAGE gel in conjunction with ImageJ allowed quantification of enzyme self-proteolysis.
LCMS Gliadin Degradation Assay
[0118] Enzyme activity on full-length .alpha.9-gliadin was measured using high-performance liquid-chromatography mass spectrometry. For each enzyme, 7 .mu.L of pH 4 1M sodium acetate buffer was added to 28 .mu.L of 5 .mu.M enzyme, and incubated alongside separate tubes of 3 .mu.L gliadin at 37.degree. C. for 15 minutes. Next 27 .mu.L of each enzyme mixture, and 27 .mu.L of dialysis buffer as a control, were added to each tube of gliadin. These were incubated once more at 37.degree. C., and 5 .mu.L samples were taken at 10, 20, 30, 40, and 50 minutes. Each timepoint sample was quenched in 95 .mu.L of 80% acetonitrile with 1% formic acid and approximately 33 .mu.M leupeptin. The samples were analyzed on the HPLC to compare gliadin degradation by the different proteases over time.
REFERENCES
[0119] 1. Armstrong, M. J., Hegade, V. S. & Robins, G. Advances in coeliac disease. Curr Opin Gastroenterol 28, 104-12 (2012).
[0120] 2. Sollid, L. M. Coeliac disease: dissecting a complex inflammatory disorder. Nat Rev Immunol 2, 647-55 (2002).
[0121] 3. Wieser, H. Chemistry of gluten proteins. Food Microbiol 24, 115-9 (2007).
[0122] 4. Shan, L. et al. Structural basis for gluten intolerance in celiac sprue. Science 297, 2275-9 (2002).
[0123] 5. Shan, L. Identification and analysis of multivalent proteolytically resistant peptides from gluten: implicatoins for celiac sprue. Journal of Proteome Research (2005).
[0124] 6. Chand, N. & Mihas, A. A. Celiac disease: current concepts in diagnosis and treatment. J Clin Gastroenterol 40, 3-14 (2006).
[0125] 7. Shan, L., Marti, T., Sollid, L. M., Gray, G. M. & Khosla, C. Comparative biochemical analysis of three bacterial prolyl endopeptidases: implications for coeliac sprue. Biochem J 383, 311-8 (2004).
[0126] 8. Siegel, M. et al. Rational design of combination enzyme therapy for celiac sprue. Chem Biol 13, 649-58 (2006).
[0127] 9. Stepniak, D. et al. Highly efficient gluten degradation with a newly identified prolyl endoprotease: implications for celiac disease. Am J Physiol Gastrointest Liver Physiol 291, G621-9 (2006).
[0128] 10. Ehren, J. et al. A food-grade enzyme preparation with modest gluten detoxification properties. PLoS One 4, e6313 (2009).
[0129] 11. Bethune, M. T., Strop, P., Tang, Y., Sollid, L. M. & Khosla, C. Heterologous expression, purification, refolding, and structural-functional characterization of EP-B2, a self-activating barley cysteine endoprotease. Chem Biol 13, 637-47 (2006).
[0130] 12. Okubo, A. et al. Processing, catalytic activity and crystal structures of kumamolisin-As with an engineered active site. FEBS J 273, 2563-76 (2006).
[0131] 13. Gardner, J. D., Ciociola, A. A. & Robinson, M. Measurement of meal-stimulated gastric acid secretion by in vivo gastric autotitration. J Appl Physiol 92, 427-34 (2002).
[0132] 14. Wlodawer, A. et al. Crystallographic and biochemical investigations of kumamolisin-As, a serine-carboxyl peptidase with collagenase activity. J Biol Chem 279, 21500-10 (2004).
[0133] 15. Ehren, J., Govindarajan, S., Moron, B., Minshull, J. & Khosla, C. Protein engineering of improved prolyl endopeptidases for celiac sprue therapy. Protein Eng Des Sel 21, 699-707 (2008).
[0134] 16. Bethune, M. T. & Khosla, C. Oral enzyme therapy for celiac sprue. Methods Enzymol 502, 241-71 (2012).
[0135] 17. Gass, J., Vora, H., Bethune, M. T., Gray, G. M. & Khosla, C. Effect of barley endoprotease EP-B2 on gluten digestion in the intact rat. J Pharmacol Exp Ther 318, 1178-86 (2006).
[0136] 18. Vora, H., McIntire, J., Kumar, P., Deshpande, M. & Khosla, C. A scaleable manufacturing process for pro-EP-B2, a cysteine protease from barley indicated for celiac sprue. Biotechnol Bioeng 98, 177-85 (2007).
Sequence CWU
1
1
731573PRTArtificial SequenceSyntheticMISC_FEATURE(119)..(119)X can be V or
DMISC_FEATURE(262)..(262)X can be S, K or GMISC_FEATURE(291)..(291)X can
be N or DMISC_FEATURE(292)..(292)X can be T or SMISC_FEATURE(293)..(293)X
can be D, A, T, or NMISC_FEATURE(319)..(319)X can be G or
SMISC_FEATURE(354)..(354)X can be S or NMISC_FEATURE(357)..(357)X can be
T or AMISC_FEATURE(358)..(358)X can be D, N or GMISC_FEATURE(361)..(361)X
can be Q or DMISC_FEATURE(368)..(368)X can be D, S, or H 1Met Ser Asp Met
Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1 5
10 15Val Leu Gln Gly His Ala Arg Ala Gln Ala
Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg
35 40 45Arg Gln Arg Ala Gly Glu Leu Ala
Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser
Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala 85
90 95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn
Val Ala Ala Gly Thr 100 105
110Ala Val Leu Ser Gly Pro Xaa Asp Ala Ile Asn Arg Ala Phe Gly Val
115 120 125Glu Leu Arg His Phe Asp His
Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130 135
140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala
Val145 150 155 160Leu Gly
Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu Gly Gly Phe
Glu Ala Arg Ser Gln Ala Ala Ala Pro 180 185
190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe
Pro Glu 195 200 205Gly Leu Asp Gly
Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala
Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln
245 250 255Pro Thr Gly Asp Pro
Xaa Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe
Ala Val Tyr Phe 275 280 285Ala Pro
Xaa Xaa Xaa Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser
Ile Ser Trp Xaa Gly305 310 315
320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala
Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly 340
345 350Asp Xaa Gly Ser Xaa Xaa Gly Glu Xaa Asp Gly
Leu Tyr His Val Xaa 355 360 365Phe
Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu
Thr Val Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro
Leu 405 410 415Pro Ala Trp
Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu
Ala Gly Asn Ala Asp Pro 435 440
445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu
Phe Ala Ala Leu Val Ala Arg Ile465 470
475 480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn
Pro Thr Leu Tyr 485 490
495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp
500 505 510Ile Ala Asn Arg Ala Gln
Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515 520
525Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln
Ala Leu 530 535 540Leu Pro Ser Ala Ser
Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545 550
555 560Phe Gln Ser Gly Ala Leu Glu His His His
His His His 565 5702573PRTArtificial
SequenceSyntheticMISC_FEATURE(119)..(119)X can be V or
DMISC_FEATURE(262)..(262)X can be S, K or GMISC_FEATURE(291)..(291)All
mutants with more than 10-fold activity have this
substitutionMISC_FEATURE(292)..(292)X can be T or
SMISC_FEATURE(293)..(293)X can be D, A, T or NMISC_FEATURE(319)..(319)X
can be G or SMISC_FEATURE(354)..(354)X can be S or
NMISC_FEATURE(357)..(357)X can be T or AMISC_FEATURE(358)..(358)X can be
D, N or GMISC_FEATURE(361)..(361)X can be Q or DMISC_FEATURE(368)..(368)X
can be D, S or H 2Met Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala
Arg Ala1 5 10 15Val Leu
Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val Asp Lys 20
25 30Gly Pro Val Ala Gly Asp Glu Arg Met
Ala Val Thr Val Val Leu Arg 35 40
45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50
55 60Ile Ala Pro His Ala Arg Glu His Leu
Lys Arg Glu Ala Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg
Phe Ala 85 90 95Asp Ala
His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Xaa Asp Ala
Ile Asn Arg Ala Phe Gly Val 115 120
125Glu Leu Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu
130 135 140Gly Glu Val Thr Val Pro Ala
Ser Ile Ala Pro Met Ile Glu Ala Val145 150
155 160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His
Phe Arg Met Gln 165 170
175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro
180 185 190Thr Ala Tyr Thr Pro Leu
Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195 200
205Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu
Gly Gly 210 215 220Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225 230
235 240Pro Ala Pro Gln Val Val Ser Val Ser Val
Asp Gly Ala Ser Asn Gln 245 250
255Pro Thr Gly Asp Pro Xaa Gly Pro Asp Gly Glu Val Glu Leu Asp Ile
260 265 270Glu Val Ala Gly Ala
Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe 275
280 285Ala Pro Asp Xaa Xaa Ala Gly Phe Leu Asp Ala Ile
Thr Thr Ala Ile 290 295 300His Asp Pro
Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Xaa Gly305
310 315 320Pro Glu Asp Ser Trp Thr Ser
Ala Ala Ile Ala Ala Met Asn Arg Ala 325
330 335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala Ala Ala Gly 340 345 350Asp
Xaa Gly Ser Xaa Xaa Gly Glu Xaa Asp Gly Leu Tyr His Val Xaa 355
360 365Phe Pro Ala Ala Ser Pro Tyr Val Leu
Ala Cys Gly Gly Thr Arg Leu 370 375
380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385
390 395 400Pro Asp Gly Gly
Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu 405
410 415Pro Ala Trp Gln Glu His Ala Asn Val Pro
Pro Ser Ala Asn Pro Gly 420 425
430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro
435 440 445Ala Thr Gly Tyr Glu Val Val
Ile Asp Gly Glu Ala Thr Val Ile Gly 450 455
460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg
Ile465 470 475 480Asn Gln
Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala Asp Val Phe
His Asp Ile Thr Glu Gly Asn Asn Asp 500 505
510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp
Asp Pro 515 520 525Cys Thr Gly Leu
Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr
Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 5703573PRTArtificial
SequenceSyntheticMISC_FEATURE(119)..(119)X can be V or
DMISC_FEATURE(262)..(262)X can be S, K, or GMISC_FEATURE(291)..(291)All
mutants with more than 20-fold activity increase have this
substitution together with 358 substitutionMISC_FEATURE(292)..(292)X is T
or SMISC_FEATURE(293)..(293)X is D, A, T or NMISC_FEATURE(319)..(319)X is
G or SMISC_FEATURE(354)..(354)X is S or NMISC_FEATURE(357)..(357)X is T
or AMISC_FEATURE(358)..(358)X is N or G (most have G at this
position)MISC_FEATURE(361)..(361)X is Q or DMISC_FEATURE(368)..(368)X is
D, S, or H 3Met Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg
Ala1 5 10 15Val Leu Gln
Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val Asp Lys 20
25 30Gly Pro Val Ala Gly Asp Glu Arg Met Ala
Val Thr Val Val Leu Arg 35 40
45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50
55 60Ile Ala Pro His Ala Arg Glu His Leu
Lys Arg Glu Ala Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg
Phe Ala 85 90 95Asp Ala
His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Xaa Asp Ala
Ile Asn Arg Ala Phe Gly Val 115 120
125Glu Leu Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu
130 135 140Gly Glu Val Thr Val Pro Ala
Ser Ile Ala Pro Met Ile Glu Ala Val145 150
155 160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His
Phe Arg Met Gln 165 170
175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro
180 185 190Thr Ala Tyr Thr Pro Leu
Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195 200
205Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu
Gly Gly 210 215 220Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225 230
235 240Pro Ala Pro Gln Val Val Ser Val Ser Val
Asp Gly Ala Ser Asn Gln 245 250
255Pro Thr Gly Asp Pro Xaa Gly Pro Asp Gly Glu Val Glu Leu Asp Ile
260 265 270Glu Val Ala Gly Ala
Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe 275
280 285Ala Pro Asp Xaa Xaa Ala Gly Phe Leu Asp Ala Ile
Thr Thr Ala Ile 290 295 300His Asp Pro
Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Xaa Gly305
310 315 320Pro Glu Asp Ser Trp Thr Ser
Ala Ala Ile Ala Ala Met Asn Arg Ala 325
330 335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala Ala Ala Gly 340 345 350Asp
Xaa Gly Ser Xaa Xaa Gly Glu Xaa Asp Gly Leu Tyr His Val Xaa 355
360 365Phe Pro Ala Ala Ser Pro Tyr Val Leu
Ala Cys Gly Gly Thr Arg Leu 370 375
380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385
390 395 400Pro Asp Gly Gly
Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu 405
410 415Pro Ala Trp Gln Glu His Ala Asn Val Pro
Pro Ser Ala Asn Pro Gly 420 425
430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro
435 440 445Ala Thr Gly Tyr Glu Val Val
Ile Asp Gly Glu Ala Thr Val Ile Gly 450 455
460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg
Ile465 470 475 480Asn Gln
Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala Asp Val Phe
His Asp Ile Thr Glu Gly Asn Asn Asp 500 505
510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp
Asp Pro 515 520 525Cys Thr Gly Leu
Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr
Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 5704573PRTArtificial
SequenceSyntheticMISC_FEATURE(119)..(119)X is V or
DMISC_FEATURE(262)..(262)X is S, K or GMISC_FEATURE(291)..(291)All
mutants with more than 50-fold activity increase have this
substitution together with 319, 358, and 368
substitutionsMISC_FEATURE(292)..(292)X is T or SMISC_FEATURE(293)..(293)X
is D, A, T, or NMISC_FEATURE(354)..(354)X is S or
NMISC_FEATURE(357)..(357)X is T or AMISC_FEATURE(361)..(361)X is Q or D
4Met Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala
Arg Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val
Val Leu Arg 35 40 45Arg Gln Arg
Ala Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50
55 60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu
Ala Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Xaa Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Xaa Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Xaa Xaa Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Ser Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Xaa Gly Ser Xaa Gly
Gly Glu Xaa Asp Gly Leu Tyr His Val His 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
5705573PRTArtificial SequenceSynthetic 5Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asn Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Gly
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Asn Gly Glu Gln Asp Gly Leu Tyr His
Val Asp 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 5706573PRTArtificial SequenceSynthetic 6Met Ser
Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1 5
10 15Val Leu Gln Gly His Ala Arg Ala
Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu
Arg 35 40 45Arg Gln Arg Ala Gly
Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe
Ala Ala65 70 75 80Ser
His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala Leu
Asp Arg Ala Asn Val Ala Ala Gly Thr 100 105
110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe
Gly Val 115 120 125Glu Leu Arg His
Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met
Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu Gly
Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr
Gln Phe Pro Glu 195 200 205Gly Leu
Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe
Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln
245 250 255Pro Thr Gly Asp
Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys
Phe Ala Val Tyr Phe 275 280 285Ala
Pro Asp Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser Val Val
Ser Ile Ser Trp Gly Gly305 310 315
320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg
Ala 325 330 335Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly 340
345 350Asp Ser Gly Ser Thr Asp Gly Glu Gln Asp
Gly Leu Tyr His Val Asp 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg
Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg Gly
Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435 440
445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val
Ile Gly 450 455 460Gly Thr Ser Ala Val
Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465 470
475 480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr
Leu Asn Pro Thr Leu Tyr 485 490
495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp
500 505 510Ile Ala Asn Arg Ala
Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu
Leu Gln Ala Leu 530 535 540Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly Ala Leu Glu
His His His His His His 565
5707573PRTArtificial SequenceSynthetic 7Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asn Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Gly
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr His
Val Asp 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 5708573PRTArtificial SequenceSynthetic 8Met Ser
Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1 5
10 15Val Leu Gln Gly His Ala Arg Ala
Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu
Arg 35 40 45Arg Gln Arg Ala Gly
Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe
Ala Ala65 70 75 80Ser
His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala Leu
Asp Arg Ala Asn Val Ala Ala Gly Thr 100 105
110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe
Gly Val 115 120 125Glu Leu Arg His
Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met
Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu Gly
Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr
Gln Phe Pro Glu 195 200 205Gly Leu
Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe
Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln
245 250 255Pro Thr Gly Asp
Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys
Phe Ala Val Tyr Phe 275 280 285Ala
Pro Asn Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser Val Val
Ser Ile Ser Trp Gly Gly305 310 315
320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg
Ala 325 330 335Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly 340
345 350Asp Ser Gly Ser Ala Asp Gly Glu Gln Asp
Gly Leu Tyr His Val Asp 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg
Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg Gly
Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435 440
445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val
Ile Gly 450 455 460Gly Thr Ser Ala Val
Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465 470
475 480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr
Leu Asn Pro Thr Leu Tyr 485 490
495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp
500 505 510Ile Ala Asn Arg Ala
Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu
Leu Gln Ala Leu 530 535 540Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly Ala Leu Glu
His His His His His His 565
5709573PRTArtificial SequenceSynthetic 9Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asn Thr Ala Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Gly
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Asp Gly Glu Gln Asp Gly Leu Tyr His
Val Asp 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57010573PRTArtificial SequenceSynthetic 10Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asn Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Ser Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Asp
Gly Glu Gln Asp Gly Leu Tyr His Val Ser 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57011573PRTArtificial SequenceSynthetic 11Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asn Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Gly
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr His
Val His 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57012573PRTArtificial SequenceSynthetic 12Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Gly Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Asp
Gly Glu Asp Asp Gly Leu Tyr His Val Asp 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57013573PRTArtificial SequenceSynthetic 13Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asp Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Gly
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Asn Gly Glu Gln Asp Gly Leu Tyr His
Val Asp 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57014573PRTArtificial SequenceSynthetic 14Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Thr Ala Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Gly Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Asp
Gly Glu Gln Asp Gly Leu Tyr His Val Asp 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57015573PRTArtificial SequenceSynthetic 15Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asp Thr Ala Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Gly
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Asn Gly Glu Gln Asp Gly Leu Tyr His
Val Asp 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57016573PRTArtificial SequenceSynthetic 16Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asn Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Ser Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Gly
Gly Glu Gln Asp Gly Leu Tyr His Val His 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57017573PRTArtificial SequenceSynthetic 17Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asn Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Gly
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Asn Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr His
Val His 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57018573PRTArtificial SequenceSynthetic 18Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Gly Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Gly
Gly Glu Asp Asp Gly Leu Tyr His Val Asp 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57019573PRTArtificial SequenceSynthetic 19Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asp Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Ser
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr His
Val His 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57020573PRTArtificial SequenceSynthetic 20Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Thr Ala Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Gly Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Asn
Gly Glu Asp Asp Gly Leu Tyr His Val Asp 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57021573PRTArtificial SequenceSynthetic 21Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asn Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Ser
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Asn Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr His
Val His 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57022573PRTArtificial SequenceSynthetic 22Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Thr Ala Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Gly Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Gly
Gly Glu Asp Asp Gly Leu Tyr His Val Asp 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57023573PRTArtificial SequenceSynthetic 23Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asp Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Ser
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Gly Gly Glu Asp Asp Gly Leu Tyr His
Val His 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57024573PRTArtificial SequenceSynthetic 24Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Ser Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Asn Gly Ser Thr Gly
Gly Glu Gln Asp Gly Leu Tyr His Val His 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57025573PRTArtificial SequenceSynthetic 25Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asp Thr Ala Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Ser
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Gly Gly Glu Asp Asp Gly Leu Tyr His
Val His 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57026573PRTArtificial SequenceSynthetic 26Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Ser Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Asn Gly Ser Thr Gly
Gly Glu Asp Asp Gly Leu Tyr His Val His 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57027573PRTArtificial SequenceSynthetic 27Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asp Thr Ala Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Ser
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Asn Gly Ser Thr Gly Gly Glu Asp Asp Gly Leu Tyr His
Val His 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57028573PRTArtificial SequenceSynthetic 28Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Ser Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Ser Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Ser Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Gly
Gly Glu Gln Asp Gly Leu Tyr His Val His 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57029573PRTArtificial SequenceSynthetic 29Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asp Thr Thr Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Ser
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr His
Val His 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57030573PRTArtificial SequenceSynthetic 30Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Val Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Gly Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Ser Asp Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Ser Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Gly
Gly Glu Gln Asp Gly Leu Tyr His Val His 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57031573PRTArtificial SequenceSynthetic 31Met Ser Asp Met Glu Lys Pro Trp
Lys Glu Gly Glu Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val
Asp Lys 20 25 30Gly Pro Val
Ala Gly Asp Glu Arg Met Ala Val Thr Val Val Leu Arg 35
40 45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val
Glu Arg Gln Ala Ala 50 55 60Ile Ala
Pro His Ala Arg Glu His Leu Lys Arg Glu Ala Phe Ala Ala65
70 75 80Ser His Gly Ala Ser Leu Asp
Asp Phe Ala Glu Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala
Ala Gly Thr 100 105 110Ala Val
Leu Ser Gly Pro Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115
120 125Glu Leu Arg His Phe Asp His Pro Asp Gly
Ser Tyr Arg Ser Tyr Leu 130 135 140Gly
Glu Val Thr Val Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145
150 155 160Leu Gly Leu Asp Thr Arg
Pro Val Ala Arg Pro His Phe Arg Met Gln 165
170 175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln
Ala Ala Ala Pro 180 185 190Thr
Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu Leu Gly Gly 210 215
220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Lys Gly Pro Asp Gly
Glu Val Glu Leu Asp Ile 260 265
270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe
275 280 285Ala Pro Asp Thr Asn Ala Gly
Phe Leu Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Ser
Gly305 310 315 320Pro Glu
Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala
325 330 335Phe Leu Asp Ala Ala Ala Leu
Gly Val Thr Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr His
Val His 355 360 365Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr Arg Leu 370
375 380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val
Trp Asn Asp Gly385 390 395
400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu
His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly 420
425 430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn Ala Asp Pro 435 440 445Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala
Leu Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala
Asp Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly
Pro Gly Trp Asp Pro 515 520 525Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly
Ser Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 57032573PRTArtificial SequenceSynthetic 32Met
Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu Glu Ala Arg Ala1
5 10 15Val Leu Gln Gly His Ala Arg
Ala Gln Ala Pro Gln Ala Val Asp Lys 20 25
30Gly Pro Val Ala Gly Asp Glu Arg Met Ala Val Thr Val Val
Leu Arg 35 40 45Arg Gln Arg Ala
Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala Ala 50 55
60Ile Ala Pro His Ala Arg Glu His Leu Lys Arg Glu Ala
Phe Ala Ala65 70 75
80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu Leu Arg Arg Phe Ala
85 90 95Asp Ala His Gly Leu Ala
Leu Asp Arg Ala Asn Val Ala Ala Gly Thr 100
105 110Ala Val Leu Ser Gly Pro Asp Asp Ala Ile Asn Arg
Ala Phe Gly Val 115 120 125Glu Leu
Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser Tyr Leu 130
135 140Gly Glu Val Thr Val Pro Ala Ser Ile Ala Pro
Met Ile Glu Ala Val145 150 155
160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg Pro His Phe Arg Met Gln
165 170 175Arg Arg Ala Glu
Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro 180
185 190Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala
Tyr Gln Phe Pro Glu 195 200 205Gly
Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu Leu Gly Gly 210
215 220Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser Leu Gly Val225 230 235
240Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala Ser Asn
Gln 245 250 255Pro Thr Gly
Asp Pro Lys Gly Pro Asp Gly Glu Val Glu Leu Asp Ile 260
265 270Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala Val Tyr Phe 275 280
285Ala Pro Asp Thr Thr Ala Gly Phe Leu Asp Ala Ile Thr Thr Ala Ile 290
295 300His Asp Pro Thr Leu Lys Pro Ser
Val Val Ser Ile Ser Trp Ser Gly305 310
315 320Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met Asn Arg Ala 325 330
335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala Ala Ala Gly
340 345 350Asp Ser Gly Ser Thr Gly
Gly Glu Gln Asp Gly Leu Tyr His Val His 355 360
365Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly Thr
Arg Leu 370 375 380Val Ala Ser Gly Gly
Arg Ile Ala Gln Glu Thr Val Trp Asn Asp Gly385 390
395 400Pro Asp Gly Gly Ala Thr Gly Gly Gly Val
Ser Arg Ile Phe Pro Leu 405 410
415Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala Asn Pro Gly
420 425 430Ala Ser Ser Gly Arg
Gly Val Pro Asp Leu Ala Gly Asn Ala Asp Pro 435
440 445Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala
Thr Val Ile Gly 450 455 460Gly Thr Ser
Ala Val Ala Pro Leu Phe Ala Ala Leu Val Ala Arg Ile465
470 475 480Asn Gln Lys Leu Gly Lys Ala
Val Gly Tyr Leu Asn Pro Thr Leu Tyr 485
490 495Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly Asn Asn Asp 500 505 510Ile
Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly Trp Asp Pro 515
520 525Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu Gln Ala Leu 530 535
540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu Asn Leu Tyr545
550 555 560Phe Gln Ser Gly
Ala Leu Glu His His His His His His 565
57033573PRTHomo sapiens 33Met Ser Asp Met Glu Lys Pro Trp Lys Glu Gly Glu
Glu Ala Arg Ala1 5 10
15Val Leu Gln Gly His Ala Arg Ala Gln Ala Pro Gln Ala Val Asp Lys
20 25 30Gly Pro Val Ala Gly Asp Glu
Arg Met Ala Val Thr Val Val Leu Arg 35 40
45Arg Gln Arg Ala Gly Glu Leu Ala Ala His Val Glu Arg Gln Ala
Ala 50 55 60Ile Ala Pro His Ala Arg
Glu His Leu Lys Arg Glu Ala Phe Ala Ala65 70
75 80Ser His Gly Ala Ser Leu Asp Asp Phe Ala Glu
Leu Arg Arg Phe Ala 85 90
95Asp Ala His Gly Leu Ala Leu Asp Arg Ala Asn Val Ala Ala Gly Thr
100 105 110Ala Val Leu Ser Gly Pro
Val Asp Ala Ile Asn Arg Ala Phe Gly Val 115 120
125Glu Leu Arg His Phe Asp His Pro Asp Gly Ser Tyr Arg Ser
Tyr Leu 130 135 140Gly Glu Val Thr Val
Pro Ala Ser Ile Ala Pro Met Ile Glu Ala Val145 150
155 160Leu Gly Leu Asp Thr Arg Pro Val Ala Arg
Pro His Phe Arg Met Gln 165 170
175Arg Arg Ala Glu Gly Gly Phe Glu Ala Arg Ser Gln Ala Ala Ala Pro
180 185 190Thr Ala Tyr Thr Pro
Leu Asp Val Ala Gln Ala Tyr Gln Phe Pro Glu 195
200 205Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile
Glu Leu Gly Gly 210 215 220Gly Tyr Asp
Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser Leu Gly Val225
230 235 240Pro Ala Pro Gln Val Val Ser
Val Ser Val Asp Gly Ala Ser Asn Gln 245
250 255Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val
Glu Leu Asp Ile 260 265 270Glu
Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala Val Tyr Phe 275
280 285Ala Pro Asn Thr Asp Ala Gly Phe Leu
Asp Ala Ile Thr Thr Ala Ile 290 295
300His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser Trp Gly Gly305
310 315 320Pro Glu Asp Ser
Trp Thr Ser Ala Ala Ile Ala Ala Met Asn Arg Ala 325
330 335Phe Leu Asp Ala Ala Ala Leu Gly Val Thr
Val Leu Ala Ala Ala Gly 340 345
350Asp Ser Gly Ser Thr Asp Gly Glu Gln Asp Gly Leu Tyr His Val Asp
355 360 365Phe Pro Ala Ala Ser Pro Tyr
Val Leu Ala Cys Gly Gly Thr Arg Leu 370 375
380Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp Asn Asp
Gly385 390 395 400Pro Asp
Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile Phe Pro Leu
405 410 415Pro Ala Trp Gln Glu His Ala
Asn Val Pro Pro Ser Ala Asn Pro Gly 420 425
430Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn Ala
Asp Pro 435 440 445Ala Thr Gly Tyr
Glu Val Val Ile Asp Gly Glu Ala Thr Val Ile Gly 450
455 460Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu
Val Ala Arg Ile465 470 475
480Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro Thr Leu Tyr
485 490 495Gln Leu Pro Ala Asp
Val Phe His Asp Ile Thr Glu Gly Asn Asn Asp 500
505 510Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro
Gly Trp Asp Pro 515 520 525Cys Thr
Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu Gln Ala Leu 530
535 540Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser
Thr Glu Asn Leu Tyr545 550 555
560Phe Gln Ser Gly Ala Leu Glu His His His His His His
565 570346PRTArtificial SequenceSynthetic 34Pro Gln Pro
Gln Leu Pro1 535384PRTArtificial
SequenceSyntheticMISC_FEATURE(73)..(73)X can be S, K or
GMISC_FEATURE(102)..(102)X can be N or DMISC_FEATURE(103)..(103)X can be
T or SMISC_FEATURE(104)..(104)X can be T or SMISC_FEATURE(130)..(130)X
can be G or SMISC_FEATURE(165)..(165)X can be S or
NMISC_FEATURE(168)..(168)X can be T or AMISC_FEATURE(169)..(169)X can be
D, N or GMISC_FEATURE(172)..(172)X can be Q or DMISC_FEATURE(179)..(179)X
can be D, S, or H 35Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln
Ala Tyr Gln1 5 10 15Phe
Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala
50 55 60Ser Asn Gln Pro Thr Gly Asp Pro
Xaa Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala 85 90 95Val
Tyr Phe Ala Pro Xaa Xaa Xaa Ala Gly Phe Leu Asp Ala Ile Thr
100 105 110Thr Ala Ile His Asp Pro Thr
Leu Lys Pro Ser Val Val Ser Ile Ser 115 120
125Trp Xaa Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met 130 135 140Asn Arg Ala Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Xaa Gly Ser Xaa Xaa Gly Glu
Xaa Asp Gly Leu Tyr 165 170
175His Val Xaa Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
38036384PRTArtificial SequenceSyntheticMISC_FEATURE(73)..(73)X can be S,
K or GMISC_FEATURE(102)..(102)All mutants with more than 10-fold activity
have this substitutionMISC_FEATURE(103)..(103)X can be T or
SMISC_FEATURE(104)..(104)X can be D, A, T or NMISC_FEATURE(130)..(130)X
can be G or SMISC_FEATURE(165)..(165)X can be S or
NMISC_FEATURE(168)..(168)X can be T or AMISC_FEATURE(169)..(169)X can be
D, N, or GMISC_FEATURE(172)..(172)X can be Q or
DMISC_FEATURE(179)..(179)X can be D, S, or H 36Ala Ala Pro Thr Ala Tyr
Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1 5
10 15Phe Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile
Ala Ile Ile Glu 20 25 30Leu
Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser 35
40 45Leu Gly Val Pro Ala Pro Gln Val Val
Ser Val Ser Val Asp Gly Ala 50 55
60Ser Asn Gln Pro Thr Gly Asp Pro Xaa Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu Val
Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala 85
90 95Val Tyr Phe Ala Pro Asp Xaa Xaa Ala Gly Phe
Leu Asp Ala Ile Thr 100 105
110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser
115 120 125Trp Xaa Gly Pro Glu Asp Ser
Trp Thr Ser Ala Ala Ile Ala Ala Met 130 135
140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala145 150 155 160Ala Ala
Gly Asp Xaa Gly Ser Xaa Xaa Gly Glu Xaa Asp Gly Leu Tyr
165 170 175His Val Xaa Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly 180 185
190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr
Val Trp 195 200 205Asn Asp Gly Pro
Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val
Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn
245 250 255Ala Asp Pro Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe
Ala Ala Leu Val 275 280 285Ala Arg
Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His
Asp Ile Thr Glu Gly305 310 315
320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu 340
345 350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln
Pro Gly Ser Thr Glu 355 360 365Asn
Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38037384PRTArtificial
SequenceSyntheticMISC_FEATURE(73)..(73)X is S, K, or
GMISC_FEATURE(102)..(102)All mutants with more than 20-fold activity
increase have this substitution together with 358
substitutionMISC_FEATURE(103)..(103)X is T or SMISC_FEATURE(104)..(104)X
is D, A, T or NMISC_FEATURE(130)..(130)X is G or
SMISC_FEATURE(165)..(165)X is S or NMISC_FEATURE(168)..(168)X is T or
AMISC_FEATURE(169)..(169)X is N or G (most have G at this
position)MISC_FEATURE(172)..(172)X is Q or DMISC_FEATURE(179)..(179)X is
D, S, or H 37Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr
Gln1 5 10 15Phe Pro Glu
Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu
Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Xaa
Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys
Phe Ala 85 90 95Val Tyr
Phe Ala Pro Asp Xaa Xaa Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys
Pro Ser Val Val Ser Ile Ser 115 120
125Trp Xaa Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met
130 135 140Asn Arg Ala Phe Leu Asp Ala
Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Xaa Gly Ser Xaa Xaa Gly Glu Xaa
Asp Gly Leu Tyr 165 170
175His Val Xaa Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
38038384PRTArtificial SequenceSyntheticMISC_FEATURE(73)..(73)X is S, K,
or GMISC_FEATURE(102)..(102)All mutants with more than 50-fold activity
increase have this substitution together with 319, 358, and 368
substitutionsMISC_FEATURE(103)..(103)X is T or SMISC_FEATURE(104)..(104)X
is D, A, T, or NMISC_FEATURE(165)..(165)X is S or
NMISC_FEATURE(168)..(168)X is T or AMISC_FEATURE(172)..(172)X is Q or D
38Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1
5 10 15Phe Pro Glu Gly Leu Asp
Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser 35 40 45Leu Gly Val
Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Xaa Gly Pro Asp
Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala
85 90 95Val Tyr Phe Ala Pro Asp
Xaa Xaa Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val
Val Ser Ile Ser 115 120 125Trp Ser
Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met 130
135 140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly
Val Thr Val Leu Ala145 150 155
160Ala Ala Gly Asp Xaa Gly Ser Xaa Gly Gly Glu Xaa Asp Gly Leu Tyr
165 170 175His Val His Phe
Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly 180
185 190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp 195 200 205Asn
Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala
Asn Val Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn 245 250 255Ala Asp Pro
Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro
Leu Phe Ala Ala Leu Val 275 280
285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp
Val Phe His Asp Ile Thr Glu Gly305 310
315 320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln
Ala Gly Pro Gly 325 330
335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu
340 345 350Gln Ala Leu Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355 360
365Asn Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His
His His 370 375 38039384PRTArtificial
SequenceSynthetic 39Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln
Ala Tyr Gln1 5 10 15Phe
Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala
50 55 60Ser Asn Gln Pro Thr Gly Asp Pro
Ser Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala 85 90 95Val
Tyr Phe Ala Pro Asn Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr
100 105 110Thr Ala Ile His Asp Pro Thr
Leu Lys Pro Ser Val Val Ser Ile Ser 115 120
125Trp Gly Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met 130 135 140Asn Arg Ala Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Ser Gly Ser Thr Asn Gly Glu
Gln Asp Gly Leu Tyr 165 170
175His Val Asp Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
38040384PRTArtificial SequenceSynthetic 40Ala Ala Pro Thr Ala Tyr Thr Pro
Leu Asp Val Ala Gln Ala Tyr Gln1 5 10
15Phe Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile
Ile Glu 20 25 30Leu Gly Gly
Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser 35
40 45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val
Ser Val Asp Gly Ala 50 55 60Ser Asn
Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu Val Ala Gly
Ala Leu Ala Pro Gly Ala Lys Phe Ala 85 90
95Val Tyr Phe Ala Pro Asp Thr Asp Ala Gly Phe Leu Asp
Ala Ile Thr 100 105 110Thr Ala
Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser 115
120 125Trp Gly Gly Pro Glu Asp Ser Trp Thr Ser
Ala Ala Ile Ala Ala Met 130 135 140Asn
Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala145
150 155 160Ala Ala Gly Asp Ser Gly
Ser Thr Asp Gly Glu Gln Asp Gly Leu Tyr 165
170 175His Val Asp Phe Pro Ala Ala Ser Pro Tyr Val Leu
Ala Cys Gly Gly 180 185 190Thr
Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195
200 205Asn Asp Gly Pro Asp Gly Gly Ala Thr
Gly Gly Gly Val Ser Arg Ile 210 215
220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225
230 235 240Asn Pro Gly Ala
Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn 245
250 255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val
Ile Asp Gly Glu Ala Thr 260 265
270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val
275 280 285Ala Arg Ile Asn Gln Lys Leu
Gly Lys Ala Val Gly Tyr Leu Asn Pro 290 295
300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly305 310 315 320Asn Asn
Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys Thr Gly Leu
Gly Ser Pro Ile Gly Val Arg Leu Leu 340 345
350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser
Thr Glu 355 360 365Asn Leu Tyr Phe
Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38041384PRTArtificial SequenceSynthetic 41Ala Ala Pro
Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1 5
10 15Phe Pro Glu Gly Leu Asp Gly Gln Gly
Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser
35 40 45Leu Gly Val Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala 50 55
60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu
Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala 85
90 95Val Tyr Phe Ala Pro Asn Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr 100 105
110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser
115 120 125Trp Gly Gly Pro Glu Asp Ser
Trp Thr Ser Ala Ala Ile Ala Ala Met 130 135
140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala145 150 155 160Ala Ala
Gly Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val Asp Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly 180 185
190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr
Val Trp 195 200 205Asn Asp Gly Pro
Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val
Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn
245 250 255Ala Asp Pro Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe
Ala Ala Leu Val 275 280 285Ala Arg
Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His
Asp Ile Thr Glu Gly305 310 315
320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu 340
345 350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln
Pro Gly Ser Thr Glu 355 360 365Asn
Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38042384PRTArtificial SequenceSynthetic
42Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1
5 10 15Phe Pro Glu Gly Leu Asp
Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser 35 40 45Leu Gly Val
Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp
Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala
85 90 95Val Tyr Phe Ala Pro Asn
Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val
Val Ser Ile Ser 115 120 125Trp Gly
Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met 130
135 140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly
Val Thr Val Leu Ala145 150 155
160Ala Ala Gly Asp Ser Gly Ser Ala Asp Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val Asp Phe
Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly 180
185 190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp 195 200 205Asn
Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala
Asn Val Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn 245 250 255Ala Asp Pro
Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro
Leu Phe Ala Ala Leu Val 275 280
285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp
Val Phe His Asp Ile Thr Glu Gly305 310
315 320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln
Ala Gly Pro Gly 325 330
335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu
340 345 350Gln Ala Leu Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355 360
365Asn Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His
His His 370 375 38043384PRTArtificial
SequenceSynthetic 43Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln
Ala Tyr Gln1 5 10 15Phe
Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala
50 55 60Ser Asn Gln Pro Thr Gly Asp Pro
Ser Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala 85 90 95Val
Tyr Phe Ala Pro Asn Thr Ala Ala Gly Phe Leu Asp Ala Ile Thr
100 105 110Thr Ala Ile His Asp Pro Thr
Leu Lys Pro Ser Val Val Ser Ile Ser 115 120
125Trp Gly Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met 130 135 140Asn Arg Ala Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Ser Gly Ser Thr Asp Gly Glu
Gln Asp Gly Leu Tyr 165 170
175His Val Asp Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
38044384PRTArtificial SequenceSynthetic 44Ala Ala Pro Thr Ala Tyr Thr Pro
Leu Asp Val Ala Gln Ala Tyr Gln1 5 10
15Phe Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile
Ile Glu 20 25 30Leu Gly Gly
Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser 35
40 45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val
Ser Val Asp Gly Ala 50 55 60Ser Asn
Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu Val Ala Gly
Ala Leu Ala Pro Gly Ala Lys Phe Ala 85 90
95Val Tyr Phe Ala Pro Asn Thr Asp Ala Gly Phe Leu Asp
Ala Ile Thr 100 105 110Thr Ala
Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser 115
120 125Trp Ser Gly Pro Glu Asp Ser Trp Thr Ser
Ala Ala Ile Ala Ala Met 130 135 140Asn
Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala145
150 155 160Ala Ala Gly Asp Ser Gly
Ser Thr Asp Gly Glu Gln Asp Gly Leu Tyr 165
170 175His Val Ser Phe Pro Ala Ala Ser Pro Tyr Val Leu
Ala Cys Gly Gly 180 185 190Thr
Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195
200 205Asn Asp Gly Pro Asp Gly Gly Ala Thr
Gly Gly Gly Val Ser Arg Ile 210 215
220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225
230 235 240Asn Pro Gly Ala
Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn 245
250 255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val
Ile Asp Gly Glu Ala Thr 260 265
270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val
275 280 285Ala Arg Ile Asn Gln Lys Leu
Gly Lys Ala Val Gly Tyr Leu Asn Pro 290 295
300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly305 310 315 320Asn Asn
Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys Thr Gly Leu
Gly Ser Pro Ile Gly Val Arg Leu Leu 340 345
350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser
Thr Glu 355 360 365Asn Leu Tyr Phe
Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38045384PRTArtificial SequenceSynthetic 45Ala Ala Pro
Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1 5
10 15Phe Pro Glu Gly Leu Asp Gly Gln Gly
Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser
35 40 45Leu Gly Val Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala 50 55
60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu
Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala 85
90 95Val Tyr Phe Ala Pro Asn Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr 100 105
110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser
115 120 125Trp Gly Gly Pro Glu Asp Ser
Trp Thr Ser Ala Ala Ile Ala Ala Met 130 135
140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala145 150 155 160Ala Ala
Gly Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val His Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly 180 185
190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr
Val Trp 195 200 205Asn Asp Gly Pro
Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val
Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn
245 250 255Ala Asp Pro Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe
Ala Ala Leu Val 275 280 285Ala Arg
Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His
Asp Ile Thr Glu Gly305 310 315
320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu 340
345 350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln
Pro Gly Ser Thr Glu 355 360 365Asn
Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38046384PRTArtificial SequenceSynthetic
46Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1
5 10 15Phe Pro Glu Gly Leu Asp
Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser 35 40 45Leu Gly Val
Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp
Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala
85 90 95Val Tyr Phe Ala Pro Asp
Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val
Val Ser Ile Ser 115 120 125Trp Gly
Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met 130
135 140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly
Val Thr Val Leu Ala145 150 155
160Ala Ala Gly Asp Ser Gly Ser Thr Asp Gly Glu Asp Asp Gly Leu Tyr
165 170 175His Val Asp Phe
Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly 180
185 190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp 195 200 205Asn
Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala
Asn Val Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn 245 250 255Ala Asp Pro
Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro
Leu Phe Ala Ala Leu Val 275 280
285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp
Val Phe His Asp Ile Thr Glu Gly305 310
315 320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln
Ala Gly Pro Gly 325 330
335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu
340 345 350Gln Ala Leu Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355 360
365Asn Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His
His His 370 375 38047384PRTArtificial
SequenceSynthetic 47Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln
Ala Tyr Gln1 5 10 15Phe
Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala
50 55 60Ser Asn Gln Pro Thr Gly Asp Pro
Ser Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala 85 90 95Val
Tyr Phe Ala Pro Asp Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr
100 105 110Thr Ala Ile His Asp Pro Thr
Leu Lys Pro Ser Val Val Ser Ile Ser 115 120
125Trp Gly Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met 130 135 140Asn Arg Ala Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Ser Gly Ser Thr Asn Gly Glu
Gln Asp Gly Leu Tyr 165 170
175His Val Asp Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
38048384PRTArtificial SequenceSynthetic 48Ala Ala Pro Thr Ala Tyr Thr Pro
Leu Asp Val Ala Gln Ala Tyr Gln1 5 10
15Phe Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile
Ile Glu 20 25 30Leu Gly Gly
Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser 35
40 45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val
Ser Val Asp Gly Ala 50 55 60Ser Asn
Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu Val Ala Gly
Ala Leu Ala Pro Gly Ala Lys Phe Ala 85 90
95Val Tyr Phe Ala Pro Asp Thr Ala Ala Gly Phe Leu Asp
Ala Ile Thr 100 105 110Thr Ala
Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser 115
120 125Trp Gly Gly Pro Glu Asp Ser Trp Thr Ser
Ala Ala Ile Ala Ala Met 130 135 140Asn
Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala145
150 155 160Ala Ala Gly Asp Ser Gly
Ser Thr Asp Gly Glu Gln Asp Gly Leu Tyr 165
170 175His Val Asp Phe Pro Ala Ala Ser Pro Tyr Val Leu
Ala Cys Gly Gly 180 185 190Thr
Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195
200 205Asn Asp Gly Pro Asp Gly Gly Ala Thr
Gly Gly Gly Val Ser Arg Ile 210 215
220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225
230 235 240Asn Pro Gly Ala
Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn 245
250 255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val
Ile Asp Gly Glu Ala Thr 260 265
270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val
275 280 285Ala Arg Ile Asn Gln Lys Leu
Gly Lys Ala Val Gly Tyr Leu Asn Pro 290 295
300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly305 310 315 320Asn Asn
Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys Thr Gly Leu
Gly Ser Pro Ile Gly Val Arg Leu Leu 340 345
350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser
Thr Glu 355 360 365Asn Leu Tyr Phe
Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38049384PRTArtificial SequenceSynthetic 49Ala Ala Pro
Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1 5
10 15Phe Pro Glu Gly Leu Asp Gly Gln Gly
Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser
35 40 45Leu Gly Val Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala 50 55
60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu
Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala 85
90 95Val Tyr Phe Ala Pro Asp Thr Ala Ala Gly
Phe Leu Asp Ala Ile Thr 100 105
110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser
115 120 125Trp Gly Gly Pro Glu Asp Ser
Trp Thr Ser Ala Ala Ile Ala Ala Met 130 135
140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala145 150 155 160Ala Ala
Gly Asp Ser Gly Ser Thr Asn Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val Asp Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly 180 185
190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr
Val Trp 195 200 205Asn Asp Gly Pro
Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val
Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn
245 250 255Ala Asp Pro Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe
Ala Ala Leu Val 275 280 285Ala Arg
Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His
Asp Ile Thr Glu Gly305 310 315
320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu 340
345 350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln
Pro Gly Ser Thr Glu 355 360 365Asn
Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38050384PRTArtificial SequenceSynthetic
50Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1
5 10 15Phe Pro Glu Gly Leu Asp
Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser 35 40 45Leu Gly Val
Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp
Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala
85 90 95Val Tyr Phe Ala Pro Asn
Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val
Val Ser Ile Ser 115 120 125Trp Ser
Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met 130
135 140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly
Val Thr Val Leu Ala145 150 155
160Ala Ala Gly Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val His Phe
Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly 180
185 190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp 195 200 205Asn
Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala
Asn Val Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn 245 250 255Ala Asp Pro
Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro
Leu Phe Ala Ala Leu Val 275 280
285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp
Val Phe His Asp Ile Thr Glu Gly305 310
315 320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln
Ala Gly Pro Gly 325 330
335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu
340 345 350Gln Ala Leu Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355 360
365Asn Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His
His His 370 375 38051384PRTArtificial
SequenceSynthetic 51Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln
Ala Tyr Gln1 5 10 15Phe
Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala
50 55 60Ser Asn Gln Pro Thr Gly Asp Pro
Ser Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala 85 90 95Val
Tyr Phe Ala Pro Asn Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr
100 105 110Thr Ala Ile His Asp Pro Thr
Leu Lys Pro Ser Val Val Ser Ile Ser 115 120
125Trp Gly Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met 130 135 140Asn Arg Ala Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Asn Gly Ser Thr Gly Gly Glu
Gln Asp Gly Leu Tyr 165 170
175His Val His Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
38052384PRTArtificial SequenceSynthetic 52Ala Ala Pro Thr Ala Tyr Thr Pro
Leu Asp Val Ala Gln Ala Tyr Gln1 5 10
15Phe Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile
Ile Glu 20 25 30Leu Gly Gly
Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser 35
40 45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val
Ser Val Asp Gly Ala 50 55 60Ser Asn
Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu Val Ala Gly
Ala Leu Ala Pro Gly Ala Lys Phe Ala 85 90
95Val Tyr Phe Ala Pro Asp Thr Asp Ala Gly Phe Leu Asp
Ala Ile Thr 100 105 110Thr Ala
Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser 115
120 125Trp Gly Gly Pro Glu Asp Ser Trp Thr Ser
Ala Ala Ile Ala Ala Met 130 135 140Asn
Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala145
150 155 160Ala Ala Gly Asp Ser Gly
Ser Thr Gly Gly Glu Asp Asp Gly Leu Tyr 165
170 175His Val Asp Phe Pro Ala Ala Ser Pro Tyr Val Leu
Ala Cys Gly Gly 180 185 190Thr
Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195
200 205Asn Asp Gly Pro Asp Gly Gly Ala Thr
Gly Gly Gly Val Ser Arg Ile 210 215
220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225
230 235 240Asn Pro Gly Ala
Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn 245
250 255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val
Ile Asp Gly Glu Ala Thr 260 265
270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val
275 280 285Ala Arg Ile Asn Gln Lys Leu
Gly Lys Ala Val Gly Tyr Leu Asn Pro 290 295
300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly305 310 315 320Asn Asn
Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys Thr Gly Leu
Gly Ser Pro Ile Gly Val Arg Leu Leu 340 345
350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser
Thr Glu 355 360 365Asn Leu Tyr Phe
Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38053384PRTArtificial SequenceSynthetic 53Ala Ala Pro
Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1 5
10 15Phe Pro Glu Gly Leu Asp Gly Gln Gly
Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser
35 40 45Leu Gly Val Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala 50 55
60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu
Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala 85
90 95Val Tyr Phe Ala Pro Asp Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr 100 105
110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser
115 120 125Trp Ser Gly Pro Glu Asp Ser
Trp Thr Ser Ala Ala Ile Ala Ala Met 130 135
140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala145 150 155 160Ala Ala
Gly Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val His Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly 180 185
190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr
Val Trp 195 200 205Asn Asp Gly Pro
Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val
Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn
245 250 255Ala Asp Pro Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe
Ala Ala Leu Val 275 280 285Ala Arg
Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His
Asp Ile Thr Glu Gly305 310 315
320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu 340
345 350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln
Pro Gly Ser Thr Glu 355 360 365Asn
Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38054384PRTArtificial SequenceSynthetic
54Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1
5 10 15Phe Pro Glu Gly Leu Asp
Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser 35 40 45Leu Gly Val
Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp
Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala
85 90 95Val Tyr Phe Ala Pro Asp
Thr Ala Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val
Val Ser Ile Ser 115 120 125Trp Gly
Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met 130
135 140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly
Val Thr Val Leu Ala145 150 155
160Ala Ala Gly Asp Ser Gly Ser Thr Asn Gly Glu Asp Asp Gly Leu Tyr
165 170 175His Val Asp Phe
Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly 180
185 190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp 195 200 205Asn
Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala
Asn Val Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn 245 250 255Ala Asp Pro
Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro
Leu Phe Ala Ala Leu Val 275 280
285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp
Val Phe His Asp Ile Thr Glu Gly305 310
315 320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln
Ala Gly Pro Gly 325 330
335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu
340 345 350Gln Ala Leu Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355 360
365Asn Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His
His His 370 375 38055384PRTArtificial
SequenceSynthetic 55Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln
Ala Tyr Gln1 5 10 15Phe
Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala
50 55 60Ser Asn Gln Pro Thr Gly Asp Pro
Ser Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala 85 90 95Val
Tyr Phe Ala Pro Asn Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr
100 105 110Thr Ala Ile His Asp Pro Thr
Leu Lys Pro Ser Val Val Ser Ile Ser 115 120
125Trp Ser Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met 130 135 140Asn Arg Ala Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Asn Gly Ser Thr Gly Gly Glu
Gln Asp Gly Leu Tyr 165 170
175His Val His Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
38056384PRTArtificial SequenceSynthetic 56Ala Ala Pro Thr Ala Tyr Thr Pro
Leu Asp Val Ala Gln Ala Tyr Gln1 5 10
15Phe Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile
Ile Glu 20 25 30Leu Gly Gly
Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser 35
40 45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val
Ser Val Asp Gly Ala 50 55 60Ser Asn
Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu Val Ala Gly
Ala Leu Ala Pro Gly Ala Lys Phe Ala 85 90
95Val Tyr Phe Ala Pro Asp Thr Ala Ala Gly Phe Leu Asp
Ala Ile Thr 100 105 110Thr Ala
Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser 115
120 125Trp Gly Gly Pro Glu Asp Ser Trp Thr Ser
Ala Ala Ile Ala Ala Met 130 135 140Asn
Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala145
150 155 160Ala Ala Gly Asp Ser Gly
Ser Thr Gly Gly Glu Asp Asp Gly Leu Tyr 165
170 175His Val Asp Phe Pro Ala Ala Ser Pro Tyr Val Leu
Ala Cys Gly Gly 180 185 190Thr
Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195
200 205Asn Asp Gly Pro Asp Gly Gly Ala Thr
Gly Gly Gly Val Ser Arg Ile 210 215
220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225
230 235 240Asn Pro Gly Ala
Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn 245
250 255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val
Ile Asp Gly Glu Ala Thr 260 265
270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val
275 280 285Ala Arg Ile Asn Gln Lys Leu
Gly Lys Ala Val Gly Tyr Leu Asn Pro 290 295
300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly305 310 315 320Asn Asn
Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys Thr Gly Leu
Gly Ser Pro Ile Gly Val Arg Leu Leu 340 345
350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser
Thr Glu 355 360 365Asn Leu Tyr Phe
Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38057384PRTArtificial SequenceSynthetic 57Ala Ala Pro
Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1 5
10 15Phe Pro Glu Gly Leu Asp Gly Gln Gly
Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser
35 40 45Leu Gly Val Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala 50 55
60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu
Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala 85
90 95Val Tyr Phe Ala Pro Asp Thr Asp Ala Gly
Phe Leu Asp Ala Ile Thr 100 105
110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser
115 120 125Trp Ser Gly Pro Glu Asp Ser
Trp Thr Ser Ala Ala Ile Ala Ala Met 130 135
140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala145 150 155 160Ala Ala
Gly Asp Ser Gly Ser Thr Gly Gly Glu Asp Asp Gly Leu Tyr
165 170 175His Val His Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly 180 185
190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr
Val Trp 195 200 205Asn Asp Gly Pro
Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val
Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn
245 250 255Ala Asp Pro Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe
Ala Ala Leu Val 275 280 285Ala Arg
Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His
Asp Ile Thr Glu Gly305 310 315
320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu 340
345 350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln
Pro Gly Ser Thr Glu 355 360 365Asn
Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38058384PRTArtificial SequenceSynthetic
58Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1
5 10 15Phe Pro Glu Gly Leu Asp
Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser 35 40 45Leu Gly Val
Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp
Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala
85 90 95Val Tyr Phe Ala Pro Asp
Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val
Val Ser Ile Ser 115 120 125Trp Ser
Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met 130
135 140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly
Val Thr Val Leu Ala145 150 155
160Ala Ala Gly Asp Asn Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val His Phe
Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly 180
185 190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp 195 200 205Asn
Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala
Asn Val Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn 245 250 255Ala Asp Pro
Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro
Leu Phe Ala Ala Leu Val 275 280
285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp
Val Phe His Asp Ile Thr Glu Gly305 310
315 320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln
Ala Gly Pro Gly 325 330
335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu
340 345 350Gln Ala Leu Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355 360
365Asn Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His
His His 370 375 38059384PRTArtificial
SequenceSynthetic 59Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln
Ala Tyr Gln1 5 10 15Phe
Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala
50 55 60Ser Asn Gln Pro Thr Gly Asp Pro
Ser Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala 85 90 95Val
Tyr Phe Ala Pro Asp Thr Ala Ala Gly Phe Leu Asp Ala Ile Thr
100 105 110Thr Ala Ile His Asp Pro Thr
Leu Lys Pro Ser Val Val Ser Ile Ser 115 120
125Trp Ser Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met 130 135 140Asn Arg Ala Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Ser Gly Ser Thr Gly Gly Glu
Asp Asp Gly Leu Tyr 165 170
175His Val His Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
38060384PRTArtificial SequenceSynthetic 60Ala Ala Pro Thr Ala Tyr Thr Pro
Leu Asp Val Ala Gln Ala Tyr Gln1 5 10
15Phe Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile
Ile Glu 20 25 30Leu Gly Gly
Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser 35
40 45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val
Ser Val Asp Gly Ala 50 55 60Ser Asn
Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu Val Ala Gly
Ala Leu Ala Pro Gly Ala Lys Phe Ala 85 90
95Val Tyr Phe Ala Pro Asp Thr Asp Ala Gly Phe Leu Asp
Ala Ile Thr 100 105 110Thr Ala
Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser 115
120 125Trp Ser Gly Pro Glu Asp Ser Trp Thr Ser
Ala Ala Ile Ala Ala Met 130 135 140Asn
Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala145
150 155 160Ala Ala Gly Asp Asn Gly
Ser Thr Gly Gly Glu Asp Asp Gly Leu Tyr 165
170 175His Val His Phe Pro Ala Ala Ser Pro Tyr Val Leu
Ala Cys Gly Gly 180 185 190Thr
Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195
200 205Asn Asp Gly Pro Asp Gly Gly Ala Thr
Gly Gly Gly Val Ser Arg Ile 210 215
220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225
230 235 240Asn Pro Gly Ala
Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn 245
250 255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val
Ile Asp Gly Glu Ala Thr 260 265
270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val
275 280 285Ala Arg Ile Asn Gln Lys Leu
Gly Lys Ala Val Gly Tyr Leu Asn Pro 290 295
300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly305 310 315 320Asn Asn
Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys Thr Gly Leu
Gly Ser Pro Ile Gly Val Arg Leu Leu 340 345
350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser
Thr Glu 355 360 365Asn Leu Tyr Phe
Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38061384PRTArtificial SequenceSynthetic 61Ala Ala Pro
Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1 5
10 15Phe Pro Glu Gly Leu Asp Gly Gln Gly
Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser
35 40 45Leu Gly Val Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala 50 55
60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu
Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala 85
90 95Val Tyr Phe Ala Pro Asp Thr Ala Ala Gly
Phe Leu Asp Ala Ile Thr 100 105
110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser
115 120 125Trp Ser Gly Pro Glu Asp Ser
Trp Thr Ser Ala Ala Ile Ala Ala Met 130 135
140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala145 150 155 160Ala Ala
Gly Asp Asn Gly Ser Thr Gly Gly Glu Asp Asp Gly Leu Tyr
165 170 175His Val His Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly 180 185
190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr
Val Trp 195 200 205Asn Asp Gly Pro
Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val
Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn
245 250 255Ala Asp Pro Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe
Ala Ala Leu Val 275 280 285Ala Arg
Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His
Asp Ile Thr Glu Gly305 310 315
320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu 340
345 350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln
Pro Gly Ser Thr Glu 355 360 365Asn
Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38062384PRTArtificial SequenceSynthetic
62Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1
5 10 15Phe Pro Glu Gly Leu Asp
Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser 35 40 45Leu Gly Val
Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Ser Gly Pro Asp
Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala
85 90 95Val Tyr Phe Ala Pro Asp
Ser Asp Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val
Val Ser Ile Ser 115 120 125Trp Ser
Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met 130
135 140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly
Val Thr Val Leu Ala145 150 155
160Ala Ala Gly Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val His Phe
Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly 180
185 190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp 195 200 205Asn
Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala
Asn Val Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn 245 250 255Ala Asp Pro
Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro
Leu Phe Ala Ala Leu Val 275 280
285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp
Val Phe His Asp Ile Thr Glu Gly305 310
315 320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln
Ala Gly Pro Gly 325 330
335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu
340 345 350Gln Ala Leu Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355 360
365Asn Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His
His His 370 375 38063384PRTArtificial
SequenceSynthetic 63Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln
Ala Tyr Gln1 5 10 15Phe
Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala
Ser Leu Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala
50 55 60Ser Asn Gln Pro Thr Gly Asp Pro
Ser Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala
Lys Phe Ala 85 90 95Val
Tyr Phe Ala Pro Asp Thr Thr Ala Gly Phe Leu Asp Ala Ile Thr
100 105 110Thr Ala Ile His Asp Pro Thr
Leu Lys Pro Ser Val Val Ser Ile Ser 115 120
125Trp Ser Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala
Met 130 135 140Asn Arg Ala Phe Leu Asp
Ala Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Ser Gly Ser Thr Gly Gly Glu
Gln Asp Gly Leu Tyr 165 170
175His Val His Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
38064384PRTArtificial SequenceSynthetic 64Ala Ala Pro Thr Ala Tyr Thr Pro
Leu Asp Val Ala Gln Ala Tyr Gln1 5 10
15Phe Pro Glu Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile
Ile Glu 20 25 30Leu Gly Gly
Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser 35
40 45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val
Ser Val Asp Gly Ala 50 55 60Ser Asn
Gln Pro Thr Gly Asp Pro Gly Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu Val Ala Gly
Ala Leu Ala Pro Gly Ala Lys Phe Ala 85 90
95Val Tyr Phe Ala Pro Asp Ser Asp Ala Gly Phe Leu Asp
Ala Ile Thr 100 105 110Thr Ala
Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser 115
120 125Trp Ser Gly Pro Glu Asp Ser Trp Thr Ser
Ala Ala Ile Ala Ala Met 130 135 140Asn
Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu Ala145
150 155 160Ala Ala Gly Asp Ser Gly
Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr 165
170 175His Val His Phe Pro Ala Ala Ser Pro Tyr Val Leu
Ala Cys Gly Gly 180 185 190Thr
Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195
200 205Asn Asp Gly Pro Asp Gly Gly Ala Thr
Gly Gly Gly Val Ser Arg Ile 210 215
220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225
230 235 240Asn Pro Gly Ala
Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn 245
250 255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val
Ile Asp Gly Glu Ala Thr 260 265
270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val
275 280 285Ala Arg Ile Asn Gln Lys Leu
Gly Lys Ala Val Gly Tyr Leu Asn Pro 290 295
300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu
Gly305 310 315 320Asn Asn
Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys Thr Gly Leu
Gly Ser Pro Ile Gly Val Arg Leu Leu 340 345
350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser
Thr Glu 355 360 365Asn Leu Tyr Phe
Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38065384PRTArtificial SequenceSynthetic 65Ala Ala Pro
Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1 5
10 15Phe Pro Glu Gly Leu Asp Gly Gln Gly
Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr Phe Ala Ser
35 40 45Leu Gly Val Pro Ala Pro Gln
Val Val Ser Val Ser Val Asp Gly Ala 50 55
60Ser Asn Gln Pro Thr Gly Asp Pro Lys Gly Pro Asp Gly Glu Val Glu65
70 75 80Leu Asp Ile Glu
Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala 85
90 95Val Tyr Phe Ala Pro Asp Thr Asn Ala Gly
Phe Leu Asp Ala Ile Thr 100 105
110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val Val Ser Ile Ser
115 120 125Trp Ser Gly Pro Glu Asp Ser
Trp Thr Ser Ala Ala Ile Ala Ala Met 130 135
140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly Val Thr Val Leu
Ala145 150 155 160Ala Ala
Gly Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val His Phe Pro Ala Ala
Ser Pro Tyr Val Leu Ala Cys Gly Gly 180 185
190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala Gln Glu Thr
Val Trp 195 200 205Asn Asp Gly Pro
Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala Asn Val
Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly Asn
245 250 255Ala Asp Pro Ala Thr
Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro Leu Phe
Ala Ala Leu Val 275 280 285Ala Arg
Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp Val Phe His
Asp Ile Thr Glu Gly305 310 315
320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln Ala Gly Pro Gly
325 330 335Trp Asp Pro Cys
Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu 340
345 350Gln Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln
Pro Gly Ser Thr Glu 355 360 365Asn
Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His His His 370
375 38066384PRTArtificial SequenceSynthetic
66Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr Gln1
5 10 15Phe Pro Glu Gly Leu Asp
Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20 25
30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu Ala Gln Tyr
Phe Ala Ser 35 40 45Leu Gly Val
Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Lys Gly Pro Asp
Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys Phe Ala
85 90 95Val Tyr Phe Ala Pro Asp
Thr Thr Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys Pro Ser Val
Val Ser Ile Ser 115 120 125Trp Ser
Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met 130
135 140Asn Arg Ala Phe Leu Asp Ala Ala Ala Leu Gly
Val Thr Val Leu Ala145 150 155
160Ala Ala Gly Asp Ser Gly Ser Thr Gly Gly Glu Gln Asp Gly Leu Tyr
165 170 175His Val His Phe
Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly 180
185 190Thr Arg Leu Val Ala Ser Gly Gly Arg Ile Ala
Gln Glu Thr Val Trp 195 200 205Asn
Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser Arg Ile 210
215 220Phe Pro Leu Pro Ala Trp Gln Glu His Ala
Asn Val Pro Pro Ser Ala225 230 235
240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val Pro Asp Leu Ala Gly
Asn 245 250 255Ala Asp Pro
Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr 260
265 270Val Ile Gly Gly Thr Ser Ala Val Ala Pro
Leu Phe Ala Ala Leu Val 275 280
285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly Tyr Leu Asn Pro 290
295 300Thr Leu Tyr Gln Leu Pro Ala Asp
Val Phe His Asp Ile Thr Glu Gly305 310
315 320Asn Asn Asp Ile Ala Asn Arg Ala Gln Ile Tyr Gln
Ala Gly Pro Gly 325 330
335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly Val Arg Leu Leu
340 345 350Gln Ala Leu Leu Pro Ser
Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355 360
365Asn Leu Tyr Phe Gln Ser Gly Ala Leu Glu His His His His
His His 370 375 38067384PRTHomo
sapiens 67Ala Ala Pro Thr Ala Tyr Thr Pro Leu Asp Val Ala Gln Ala Tyr
Gln1 5 10 15Phe Pro Glu
Gly Leu Asp Gly Gln Gly Gln Cys Ile Ala Ile Ile Glu 20
25 30Leu Gly Gly Gly Tyr Asp Glu Ala Ser Leu
Ala Gln Tyr Phe Ala Ser 35 40
45Leu Gly Val Pro Ala Pro Gln Val Val Ser Val Ser Val Asp Gly Ala 50
55 60Ser Asn Gln Pro Thr Gly Asp Pro Ser
Gly Pro Asp Gly Glu Val Glu65 70 75
80Leu Asp Ile Glu Val Ala Gly Ala Leu Ala Pro Gly Ala Lys
Phe Ala 85 90 95Val Tyr
Phe Ala Pro Asn Thr Asp Ala Gly Phe Leu Asp Ala Ile Thr 100
105 110Thr Ala Ile His Asp Pro Thr Leu Lys
Pro Ser Val Val Ser Ile Ser 115 120
125Trp Gly Gly Pro Glu Asp Ser Trp Thr Ser Ala Ala Ile Ala Ala Met
130 135 140Asn Arg Ala Phe Leu Asp Ala
Ala Ala Leu Gly Val Thr Val Leu Ala145 150
155 160Ala Ala Gly Asp Ser Gly Ser Thr Asp Gly Glu Gln
Asp Gly Leu Tyr 165 170
175His Val Asp Phe Pro Ala Ala Ser Pro Tyr Val Leu Ala Cys Gly Gly
180 185 190Thr Arg Leu Val Ala Ser
Gly Gly Arg Ile Ala Gln Glu Thr Val Trp 195 200
205Asn Asp Gly Pro Asp Gly Gly Ala Thr Gly Gly Gly Val Ser
Arg Ile 210 215 220Phe Pro Leu Pro Ala
Trp Gln Glu His Ala Asn Val Pro Pro Ser Ala225 230
235 240Asn Pro Gly Ala Ser Ser Gly Arg Gly Val
Pro Asp Leu Ala Gly Asn 245 250
255Ala Asp Pro Ala Thr Gly Tyr Glu Val Val Ile Asp Gly Glu Ala Thr
260 265 270Val Ile Gly Gly Thr
Ser Ala Val Ala Pro Leu Phe Ala Ala Leu Val 275
280 285Ala Arg Ile Asn Gln Lys Leu Gly Lys Ala Val Gly
Tyr Leu Asn Pro 290 295 300Thr Leu Tyr
Gln Leu Pro Ala Asp Val Phe His Asp Ile Thr Glu Gly305
310 315 320Asn Asn Asp Ile Ala Asn Arg
Ala Gln Ile Tyr Gln Ala Gly Pro Gly 325
330 335Trp Asp Pro Cys Thr Gly Leu Gly Ser Pro Ile Gly
Val Arg Leu Leu 340 345 350Gln
Ala Leu Leu Pro Ser Ala Ser Gln Pro Gln Pro Gly Ser Thr Glu 355
360 365Asn Leu Tyr Phe Gln Ser Gly Ala Leu
Glu His His His His His His 370 375
380684PRTArtificial SequenceSynthetic 68Pro Gln Leu Pro1696PRTArtificial
SequenceSyntheticMISC_FEATURE(1)..(1)N-terminal
QXL520MISC_FEATURE(6)..(6)C-terminal K(5-FAM) 69Pro Gln Pro Gln Leu Pro1
57012PRTArtificial SequenceSynthetic 70Gln Leu Gln Pro Phe
Pro Gln Pro Gln Leu Pro Tyr1 5
1071290PRTHomo sapiens 71Met Val Arg Val Pro Val Pro Gln Leu Gln Pro Gln
Asn Pro Ser Gln1 5 10
15Gln Gln Pro Gln Glu Gln Val Pro Leu Val Gln Gln Gln Gln Phe Pro
20 25 30Gly Gln Gln Gln Pro Phe Pro
Pro Gln Gln Pro Tyr Pro Gln Pro Gln 35 40
45Pro Phe Pro Ser Gln Gln Phe Tyr Leu Gln Leu Gln Pro Phe Pro
Gln 50 55 60Pro Gln Leu Pro Tyr Pro
Gln Pro Gln Leu Pro Tyr Pro Gln Pro Gln65 70
75 80Leu Pro Tyr Pro Gln Pro Gln Pro Phe Arg Pro
Gln Gln Pro Tyr Pro 85 90
95Gln Ser Gln Pro Gln Tyr Ser Gln Pro Gln Gln Pro Ile Ser Gln Gln
100 105 110Gln Gln Gln Gln Gln Gln
Gln Gln Gln Gln Lys Gln Gln Gln Gln Gln 115 120
125Gln Gln Gln Ile Leu Gln Gln Ile Leu Gln Gln Gln Leu Ile
Pro Cys 130 135 140Arg Asp Val Val Leu
Gln Gln His Ser Ile Ala Tyr Gly Ser Ser Gln145 150
155 160Val Leu Gln Gln Ser Thr Tyr Gly Leu Val
Gln Gln Leu Cys Cys Gln 165 170
175Gln Leu Trp Gln Ile Pro Glu Gln Ser Arg Cys Gln Ala Ile His Asn
180 185 190Val Val His Ala Ile
Ile Leu His Gln Gln Gln Gln Gln Gln Gln Gln 195
200 205Gln Gln Gln Gln Pro Leu Ser Gln Val Ser Phe Gln
Gln Pro Gln Gln 210 215 220Gln Tyr Pro
Ser Gly Gln Gly Ser Phe Gln Pro Ser Gln Gln Asn Pro225
230 235 240Gln Ala Gln Gly Ser Val Gln
Pro Gln Gln Leu Pro Gln Phe Glu Glu 245
250 255Ile Arg Asn Leu Ala Leu Glu Thr Leu Pro Ala Met
Cys Asn Val Tyr 260 265 270Ile
Pro Pro Tyr Cys Thr Ile Ala Pro Val Gly Ile Phe Gly Thr Asn 275
280 285Tyr Arg 2907233PRTArtificial
SequenceSynthetic 72Leu Gln Leu Gln Pro Phe Pro Gln Pro Gln Leu Pro Tyr
Pro Gln Pro1 5 10 15Gln
Leu Pro Tyr Pro Gln Pro Gln Leu Pro Tyr Pro Gln Pro Gln Pro 20
25 30Phe7326PRTArtificial
SequenceSynthetic 73Phe Leu Gln Pro Gln Gln Pro Phe Pro Pro Gln Gln Pro
Gln Gln Pro1 5 10 15Tyr
Pro Gln Gln Pro Gln Gln Pro Phe Pro 20 25
User Contributions:
Comment about this patent or add new information about this topic:
People who visited this patent also read: | |
Patent application number | Title |
---|---|
20200307792 | MODULAR PERSONAL TRANSPORTATION SYSTEM |
20200307791 | SPREADING APPARATUS, CONTROL METHOD THEREFOR, AND PLANT PROTECTION UNMANNED AERIAL VEHICLE |
20200307790 | LIGHT TWIN ENGINE AIRCRAFT |
20200307789 | AIRCRAFT HAVING EMBEDDED ENGINES |
20200307788 | SYSTEMS AND METHODS FOR AUTOMATIC WATER SURFACE AND SKY DETECTION |