Patent application title: SUPPRESSION OF CANCER
Inventors:
Frederic Madec (Oxfordshire, GB)
Philip Lecane (Oxfordshire, GB)
Philip Mark (Oxfordshire, GB)
Keith Foster (Oxfordshire, GB)
Assignees:
SYNTAXIN LIMITED
IPC8 Class: AA61K3848FI
USPC Class:
424 9463
Class name: Enzyme or coenzyme containing hydrolases (3. ) (e.g., urease, lipase, asparaginase, muramidase, etc.) acting on peptide bonds (3.4) (e.g., urokinease, etc.)
Publication date: 2011-10-27
Patent application number: 20110262423
Abstract:
The present invention relates to polypeptides for use in suppressing
cancer and cancer disorders. The treatment employs use of a non-cytotoxic
protease, which is targeted to the cancer cell, and, when so delivered,
the protease is internalised and inhibits secretion from the cancer cell.Claims:
1. A polypeptide, for use in suppressing a cancer cell that expresses a
SNARE protein responsible for secretion from said cancer cell, wherein
the polypeptide comprises: (i) a non-cytotoxic protease, which protease
is capable of cleaving said SNARE protein expressed in said cancer cell;
(ii) a Targeting Moiety (TM) that is capable of binding to a Binding Site
on a cancer cell, which Binding Site is capable of undergoing endocytosis
to be incorporated into an endosome within the cancer cell; and (iii) a
translocation domain that is capable of translocating the protease from
within an endosome, across the endosomal membrane and into the cytosol of
the cancer cell; wherein the polypeptide lacks the natural binding
function of a clostridial neurotoxin HCC domain which enables the
clostridial neurotoxin to bind to nerve terminals at the neuromuscular
junction; with the proviso that the cancer cell is not a neuroendocrine
tumour cell; and with the proviso that the polypeptide is not a
clostridial neurotoxin (holotoxin).
2. The polypeptide for according to claim 1, wherein the cancer is selected from the group consisting of: lung cancer, renal cancer, brain cancer, breast cancer, pancreatic cancer, colorectal cancer, adrenal cancer, oesophageal cancer, lymphoma, B-cell lymphoma, mantle cell lymphoma, leukaemia, multiple myelona, acute leukaemia, bladder cancer, bone cancer, bowel cancer, cervical cancer, chronic lymphocytic leukaemia, Hodgkin's lymphoma, liver cancer, skin cancer, oropharyngeal cancer, myeloma, prostate cancer, prostate soft tissue sarcoma, gastric cancer, testicular cancer, uterine cancer, and Kaposi sarcoma.
3. The polypeptide according to claim 1, wherein the cancer cell is selected from the group consisting of: a lung cancer cell, a renal cancer cell, a brain cancer cell, a breast cancer cell, a pancreatic cancer cell, a colorectal cancer cell, an adrenal cancer cell, an oesophageal cancer cell, a lymphoma cancer cell, a B-cell lymphoma cancer cell, a mantle cell lymphoma cancer cell, a leukaemia cancer cell, a multiple myeloma cell, an acute leukaemia cancer cell, a bladder cancer cell, a bone cancer cell, a bowel cancer cell, a cervical cancer cell, a chronic lymphocytic leukaemia, cell, a Hodgkin's lymphoma cell, a liver cancer cell, a melanoma skin cancer cell, an oropharyngeal cancer cell, a myeloma cell, a prostate cancer cell, a soft tissue sarcoma cell, a gastric cancer cell, a testicular cancer cell, a uterine cancer cell and a Kaposi sarcoma cancer cell.
4. The polypeptide of claim 1, wherein the TM binds to a receptor selected from the group consisting of: an ErbE receptor; an EGF receptor; a growth hormone-releasing hormone (GHRH) receptor; an insulin-like growth factor receptor (IGFR); a gastrin-releasing peptide (GRP) receptor; a bombesin peptide (BB) receptor; a growth hormone (GH) receptor; a vascular endothelial growth factor (VEGF) receptor; an acetylcholine (ACH) receptor; a somatostatin (SST) receptor; a cortistatin (CST) receptor; a chemokine (C-X-C motif) receptor such as CXCR4; a neuropilin receptor; a gonadotropin-releasing hormone (GnRH) receptor; a VIP-glucagon-GRF-secretin superfamily receptor; a PAC receptor; a VPAC receptor; a FLT receptor; a CHRN receptor; an EPH receptor; an EPHA receptor; an EFN receptor; a DLK1 receptor, a DLL3 receptor, a FGF receptor; a JAG receptor; a LIF receptor; a NMB receptor; a NOTCH receptor; a PDGF receptor; a c-kit receptor; a TGF receptor; an endothelin receptor; a chemokine receptor; and an angiopoietin receptor.
5. The polypeptide of claim 1, wherein the TM is selected from the group consisting of: an Erb B peptide, an EGF peptide, an adrenomedullin (ADM) peptide, an AM peptide, an AMF peptide, an amphiregulin peptide, an APRIL peptide, an artemin peptide, a betaceiluhn peptide, a bombesin peptide, a CTR peptide, an endothelin peptide, an erythropoietin peptide, a FGF peptide, a FSH peptide, a gastrin peptide, a gastrin releasing peptide (GRP), a GDNF peptide, a ghrelin peptide, a GHRH peptide, a G-CSF peptide, a growth hormone peptide, a HB-EGF peptide, a hepatocyte growth factor (HGF) peptide, an IL peptide, a keratinocyte growth factor peptide, a leptin peptide, a LIF peptide, an alpha-melanotropin peptide, a MGSA/GRO peptide, a NRG peptide, an oxytocin peptide, an osteopontin (OPN) peptide, a neuregulin-1 peptide, a VIP or PACAP peptide, a PDGF peptide, a prolactin peptide, a SCF peptide, a somatostatin (SST) peptide, a cortistatin (CST) peptide, a TNF peptide, a TGF peptide, a VEGF peptide, a vasopressin peptide, an angiopoietin peptide, a B-CLL peptide, a BCGF-12KD peptide, a BAFF peptide, a galanin peptide, a GDNF peptide, a GnRH peptide, an IGF-II peptide, a LH peptide, a neurotrophin peptide, a substance P peptide, a TGF-alpha peptide; an IGF peptide, an angiopoietin peptide, a CXC peptide, a CCL peptide, or an antibody or antibody fragment that binds to a receptor as defined in claim 4.
6. The polypeptide of claim 1, wherein the non-cytotoxic protease comprises a clostridial neurotoxin L-chain or an IgA protease.
7. The polypeptide of claim 1, wherein the translocation domain comprises a clostridial neurotoxin translocation domain.
8. A polypeptide comprising: (i) a non-cytotoxic: protease, which protease is capable of cleaving a SNARE protein expressed in a cancer cell, wherein said SNARE protein is responsible for secretion from the cancer cell; (ii) a Targeting Moiety (TM) that is capable of binding to a Binding Site on the cancer cell, which Binding Site is capable of undergoing endocytosis to be incorporated into an endosome within the cancer cell; and (iii) a translocation domain that is capable of translocating the protease from within an endosome, across the endosomal membrane and into the cytosol of the cancer cell; wherein the polypeptide lacks the natural binding function of a clostridial neurotoxin HCC domain which enables the clostridial neurotoxin to bind to nerve terminals at the neuromuscular junction; with the proviso that the cancer cell is not a neuroendocrine tumour cell; and with the proviso that the polypeptide is not a clostridial neurotoxin (holotoxin).
9. The polypeptide according to claim 8, wherein the TM binds to a receptor selected from the group consisting of: an ErbB receptor; an EGF receptor; a growth hormone-releasing hormone (GHRH) receptor; an insulin-like growth factor (IGF) receptor; a gastrin-releasing peptide (GRP) receptor; a bombesin peptide (BB) receptor a growth hormone (GH) receptor; a vascular endothelial growth factor (VFGF) receptor; an acetylcholine (ACH) receptor; a somatostatin (SST) receptor; a cortistatin (CST) receptor; a chemokine (C-X-C motif) receptor; CXCR4; a neuropilin receptor; a gonadotropin-releasing hormone (GnRH) receptor; a VIP-glucagon-GRF-secretin superfamily receptor; a PAC receptor; a VPAC receptor; a FLT receptor; a CHRN receptor; an EPH receptor; an EPHA receptor; an EFN receptor; a DLK1 receptor, a DLL3 receptor, a FGF receptor; a JAG receptor; a LIF receptor; a NMB receptor; a NOTCH receptor; a PDGF receptor; a c-kit receptor; a TGF receptor; an endothelin receptor; a chemokine receptor; and an angiopoietin receptor.
10. The polypeptide according to claim 8, wherein the TM is selected from the group consisting of: an ErbB peptide, an EGF peptide, an adrenomedullin (ADM) peptide, an AM peptide, an autocrine motility factor (AMF) peptide, an amphiregulin peptide, an APRIL peptide, an artemin peptide, autocrine motility factor (AMF) peptide, a betacellulin peptide, a bombesin peptide, a CTR peptide, an endothelin peptide, an erythropoietin peptide, a FGF peptide, a FSH peptide, a gastrin peptide, a gastrin releasing peptide (GRP), a GDNF peptide, a ghrelin peptide, a GHRH peptide, a G-CSF peptide, a growth hormone peptide, a HB-EGF peptide, a hepatocyte growth factor (HGF) peptide, an IL peptide, a keratinocyte growth factor peptide, a leptin peptide, a LIF peptide, an aipha-melanotropin peptide, a MGSA/GRO peptide, a NRG peptide, an oxytocin peptide, an osteopontin (OPN) peptide, a neureguiin-1 peptide, a VIP peptide, a PACAP peptide, a PDGF peptide, a prolactin peptide, a SCF peptide, a somatostatin (SST) peptide, a cortistatin (CST) peptide, a TNF peptide, a TGF peptide, a VEGF peptide, a vasopressin peptide, an angiopoietin peptide, a B-CLL peptide, a BCGF-12KD peptide, a BAFF peptide, a galanin peptide, a GDNF peptide, a GnRH peptide, au IGF-II peptide, a LH peptide, a NT peptide, a substance-P peptide, a TGF-alpha peptide; an IGF peptide, an angiopoietin peptide, a CXC peptide, a CCL peptide, and an antibody or antibody fragment that binds to a receptor as defined in claim 9.
11. The polypeptide according to claim 8, wherein the non-cytotoxic protease comprises a clostridial neurotoxin protease or an IgA protease.
12. The polypeptide according to claim 8, wherein the translocation domain comprises a clostridial neurotoxin translocation domain.
13. The polypeptide according to claim 8, wherein said polypeptide comprises au amino acid sequence having at least 90-92%, or at least 95-97%, or at least 98-99% sequence identity to any one of SEQ ID NOs: 7, 10, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, or 44.
14. A nucleic acid encoding a polypeptide according to claim 8.
15. A nucleic acid encoding a polypeptide according to claim 8, wherein said nucleic acid comprises a nucleic acid sequence haying at least 90-94%, or at least 95-97%, or at least 98-99% sequence identity to any one of SEQ ID NOs: 6, or 9.
16. A method of suppressing a cancer in a patient, comprising administering to the patient an effective amount of a. polypeptide according to claim 8.
17. The method according to claim 16, for suppressing a cancer selected from the group consisting of: lung cancer, renal cancer, brain cancer, breast cancer, pancreatic cancer, colorectal cancer, colon cancer, rectal cancer, adrenal cancer, oesophageal cancer, lymphoma, B-cell lymphoma, mantle cell lymphoma, leukaemia, multiple myeloma, acute leukaemia, bladder cancer, bone cancer, bowel cancer, cervical cancer, chronic lymphocytic leukaemia, Hodgkin's lymphoma, liver cancer, skin cancer, oropharyngeal cancer, myeloma, prostate cancer, prostate soft tissue sarcoma, gastric cancer, testicular cancer, uterine cancer and Kaposi sarcoma.
18. The method according to claim 16, for suppressing secretion from a cancer cell selected from the group consisting of: a lung cancer cell, a renal cancer cell, a brain cancer cell, a breast cancer cell, a pancreatic cancer cell, a colorectal cancer cell, an adrenal cancer cell, an oesophageal cancer cell, a lymphoma cancer cell, a B-cell lymphoma cancer cell, a mantle cell lymphoma cancer cell, a leukaemia cancer cell, a multiple myeloma cancer cell, an acute leukaemia cancer cell, a bladder cancer cell, a bone cancer cell, a bowel cancer cell, a cervical cancer cell, a chronic lymphocytic leukaemia cell, a Hodgkin's lymphoma cell, a liver cancer cell, a skin cancer cell, an oropharyngeal cancer cell, a myeloma cell, a prostate cancer cell, a soft tissue sarcoma cell, a gastric cancer cell, a testicular cancer cell, a uterine cancer cell and a Kaposi sarcoma cell.
19. A method of suppressing a cancer in a patient, comprising administering to the patient an effective amount of a nucleic acid according to claim 14.
20. The polypeptide according to claim 9, wherein the TM is selected from the group consisting of: an ErbB peptide, an EGF peptide, an adrenomedullin (ADM) peptide, an AM peptide, an autocrine motility factor (AMF) peptide, an amphiregulin peptide, an APRIL peptide, an artemin peptide, an autocrine motility factor (AMF) peptide, a betacellulin peptide, a bombesin peptide, a CTR peptide, an endothelin peptide, an erythropoietin peptide, a FGF peptide, a FSH peptide, a gastrin peptide, a gastrin releasing peptide (GRP), a GDNF peptide, a ghrelin peptide, a GHRH peptide, a G-CSF peptide, a growth hormone peptide, a HB-EGF peptide, a hepatocyte growth factor (HGF) peptide, an IL peptide, a keratinocyte growth factor peptide, a leptin peptide, a LIF peptide, an alpha-melanotropin peptide, a MGSA/GRO peptide, a NRG peptide, an oxytocin peptide, an osteopontin (OPN) peptide, a neuregulin-1 peptide, a VIP peptide, a PACAP peptide, a PDGF peptide, a prolactin peptide, a SCF peptide, a somatostatin (SST) peptide, a cortistatin (CST) peptide, a TNF peptide, a TGF peptide, a VEGF peptide, a vasopressin peptide, an angiopoietin peptide, a B-CLL peptide, a BCGF-12KD peptide, a BAFF peptide, a galanin peptide, a GDNF peptide, a GnRH peptide, an IGF-II peptide, a LH peptide, a NT peptide, a substance-P peptide, a TGF-alpha peptide; an IGF peptide, an angiopoietin peptide, a CXC peptide, a CCL peptide, and an antibody or antibody fragment that binds to a receptor as defined in claim 9.
Description:
[0001] The present invention relates to the suppression of cancer.
[0002] Cancers are made up of cells that divide, invade and survive in an aberrant manner in the body. Cancers develop when alterations to the DNA within cells affect the control of cell division and other processes relevant to cell survival. As cancerous tumours grow, they invade the body tissues surrounding them. This is harmful to the body as it damages surrounding normal tissues and has a significant effect on physiological responses. In addition, cancers may spread to other parts of the body and develop into secondary tumours. Assessment of cancer can be undertaken using a variety of tests including CT scan, MRI imaging, PET scans, combined PET-CT scan and ultrasound tests. Generally, the earlier a cancer is detected and confirmed, the better the overall prognosis for the patient.
[0003] A wide range of treatments are available for patients diagnosed with cancer. These include a variety of surgical interventions, radiotherapy, chemotherapy and hormone therapies. These treatments may be used in isolation but most commonly they are used in combination with each other. The use of a combination of therapies is dependent on the type of cancer diagnosed with certain cancers (such as breast cancer) commonly requiring multiple therapeutic interventions. There are a variety of new therapeutic approaches under development utilising biological products including antibodies, immunotherapies and vaccination techniques to treat cancer. Additional treatment approaches including the use of novel therapeutics designed to block angiogenesis are under development. Novel approaches including the use of gene therapy are also being investigated as potential therapeutics. However, despite the significant scientific efforts and huge financial commitment to developing new cancer therapies, the prognosis for many patients remains extremely poor.
[0004] Breast cancer is the most common cancer in the UK and every year more than 44,000 women are diagnosed with this type of cancer. Worldwide, more than a million women are diagnosed with breast cancer every year. In the UK, approximately 80% of patients will survive for 5 years or longer.
[0005] Lung cancer is the 2nd commonest cancer in the UK and over 37,000 people were diagnosed with lung cancer in the UK in 2003. For patients diagnosed with lung cancer, only approximately 20% will survive for at least 1 year after diagnosis and only approximately 6% will live for 5 years or longer after diagnosis.
[0006] Colorectal cancer is the 3rd commonest cancer type overall and it is the 2nd commonest affecting women (after breast cancer). Just over 35,000 people were diagnosed with colorectal cancer in the UK in 2003. Of those diagnosed with bowel and rectal cancer in England and Wales, 46% will survive for at least 5 years after their diagnosis.
[0007] Pancreatic cancer is the fourth most common cause of death from cancer in the Western world and over 7000 cases are diagnosed in the UK each year. Despite therapeutic advances only 3-4% of those diagnosed with pancreatic cancer survive for 5 years or longer and successful therapeutic intervention is limited to surgery that is applicable to only 15% of patients.
[0008] Renal cancer is the 9th commonest cancer type overall with over 7000 new cases diagnosed and greater than 3500 deaths in the UK each year. Around 208,500 new cases of renal cancer are diagnosed in the world each year, accounting for just under 2% of all cancers. It is commoner in men than in women, with a male to female ratio of 1.5:1. Thirty percent of renal cancer patients show signs of advanced renal cell carcinoma at initial diagnosis, with metastases detected in 15-25% of patients at this time.
[0009] Cancer represents a worldwide problem that is predicted to grow significantly. In the year 2000, malignant tumours were responsible for 12 per cent of the nearly 56 million deaths reported worldwide from all causes. In some countries, more than a quarter of deaths could be attributed to cancer. In 2000, 5.3 million men and 4.7 million women developed a malignant tumour and altogether 6.2 million died from the disease. It is predicted that cancer rates could further increase by 50% to 15 million new cases in the year 2020, (WHO World Cancer Report (IARC, Ed B.W. Stewart and P. Kleihues, 2003).
[0010] For a large number of cancers, autocrine signalling (defined as a mode of hormone action in which a hormone binds to receptors on and affects the function of the cell type that produced it) and/or paracrine signalling (defined as a mode of hormone action in which hormone released from endocrine or endocrine-like cells binds to receptors on nearby cells and affects their function), are believed to play a significant role in development of the disease state. For a given cancer, multiple autocrine signalling loops may be implicated in development and maintenance of the cancer state. In addition to providing a stimulus for cell division, autocrine signalling can enhance cell survival acting to protect cells from mechanisms of apoptosis and necrosis and increase the metastatic potential of the cancer cell.
[0011] There is therefore a need in the art for new therapies/therapeutics capable of specifically addressing cancer. This need is addressed by the present invention, which solves one or more of the above-mentioned problems.
[0012] In more detail, a first aspect of the present invention provides a polypeptide for use in suppressing cancer, wherein the polypeptide comprises: (i) a non-cytotoxic protease, which protease is capable of cleaving a SNARE protein in a cancer cell;
[0013] (ii) a Targeting Moiety (TM) that is capable of binding to a Binding Site on a cancer cell, which Binding Site is capable of undergoing endocytosis to be incorporated into an endosome within the cancer cell, and wherein said cancer cell expresses said SNARE protein; and
[0014] (iii) a translocation domain that is capable of translocating the protease from within an endosome, across the endosomal membrane and into the cytosol of the cancer cell;
[0015] and with the proviso that the cancer cell is not a neuroendocrine tumour cell;
[0016] and with the proviso that the polypeptide is not a clostridial neurotoxin (holotoxin) molecule.
[0017] The polypeptide of the present invention is not a naturally-occurring clostridial neurotoxin molecule (also known as clostridial holotoxin). Clostridial holotoxin is one of the most lethal neurotoxins known to man, and, as such, has significant limitations as a therapeutic molecule. Also, in the context of cancer, clostridial holotoxin is associated with undesirable off-site targeting (ie. targeting of non-cancer cells), for example to the neuromuscular junction.
[0018] In use, a polypeptide of the invention binds to a cancer cell. Thereafter, the translocation component effects transport of the protease component into the cytosol of the cancer cell. Finally, once inside, the protease inhibits the exocytic fusion process of the cancer cell by cleaving SNARE protein present in the cytosol of the cancer cell. Thus, by inactivating the exocytic fusion apparatus of the cancer cell, the polypeptide of the invention inhibits secretion (eg. of TNF-alpha, acetylcholine, fibroblastic growth factor, gastrin releasing peptide, interleukin-6, VEGF, and/or autocrine mobility factor) therefrom. Accordingly, where said secretion peptides are, for example, involved in autocrine or paracrine signalling, the polypeptide of the invention reduces the signalling drive that the autocrine/paracrine loop would otherwise provide to the cancer cell. Accordingly, the polypeptides of the present invention are capable of suppressing or treating cancer.
[0019] The first aspect also embraces a corresponding method for suppressing cancer, said method comprising administering a therapeutically effective amount of a polypeptide of the present invention to a patient. The present invention thereby provides an effective means for treating cancer such as renal, breast, lung, pancreatic, and colorectal (eg. colon).
[0020] The polypeptides of the present invention provide a distinct advantage over other therapeutics in that they have the potential to inhibit the secretion of multiple autocrine and paracrine peptides from a targeted cancer cell using a single therapeutic molecule. This reduces significantly the ability of a cancer to adapt to a specific treatment regime by switching to and using alternative autocrine signalling pathways. Thus, the present invention provides a more efficacious therapy by blocking multiple pathways simultaneously.
[0021] The principal target cell of the present invention is a cancer cell, for example a cancer cell that secretes one or more hormones (or other bioactive molecules), which, once secreted, bind to a receptor on and affect the function of said cancer cell. Examples of cancer cells targeted by the present invention include: breast, lung, pancreatic, colorectal (eg. colon), adrenal, oesophageal, lymphoma (eg. B-cell; Mantell cell), leukaemia (eg. multiple myeloma), acute leukaemia, bladder, bone, brain tumours, bowel, cervical, chronic lymphocytic leukaemia, Hodgkin's lymphoma, renal, liver, skin (eg. basal, squamous, melanoma), oropharyngeal, myeloma, prostate (eg. soft tissue sarcoma), gastric, testicular, and uterine.
[0022] In one embodiment, the target cell of the present invention is a cancer cell in which the level of SNARE protein expression is undesirable.
[0023] For example, undesirable SNARE expression may occur when the expression of a SNARE protein is up-regulated when compared with the same (or very similar) cell type when in a non-cancerous state. In one embodiment, the mRNA encoding a SNARE protein is up-regulated in a target cancer cell. In another (or the same) embodiment, the cellular protein concentration of a SNARE protein is (also) increased in the target cancer cell. When measuring SNARE mRNA or protein levels, a value of `one` may be assigned to the SNARE mRNA or protein concentration in a non-cancerous cell of a normal individual, which cell is otherwise the same (or very similar) to the cell type of the cancer cell in question--this value might be described as a `basal expression level`, and is typically an average of expression level values obtained from several cells (of the same or very similar type) and/or from different normal individuals. For example, a lung cell (e.g. a neuroectodermal lung cell, or a lung epithelium cell) in a normal individual may be used for comparison with a small cell lung cancer cell. Thus, up-regulated in the context of the present invention means that the SNARE mRNA or protein level is `greater than 1` when compared with the same cell type in a non-cancerous state in a normal individual. In one embodiment, the SNARE protein or mRNA level is up-regulated by greater than 2- or 3-fold, or by greater than 4- or 5-fold, or by greater than 6- or 7-fold. In another embodiment, the SNARE protein or mRNA level is up-regulated by greater than 8- or 9-fold, or by greater than 10- or 12-fold, or by greater than 15- or 20-fold. In another embodiment, the SNARE protein or mRNA level is up-regulated by greater than 40- or 50-fold, or by greater than 60- or 70-fold, or by greater than 90- or 100-fold.
[0024] Alternatively, undesirable SNARE expression may occur when the level of SNARE mRNA and/or protein expression in a cancer cell is substantially normal (ie. substantially the same) when compared with that of a corresponding identical or very similar non-cancerous cell type. In this scenario, however, for example when a patient is suffering from a [prostate, bladder or cervical carcinoma], even a low or normal level of SNARE protein expression may support a secretion from the cancer cells of this type that is paracrine or autocrine in function, and so in the context of the cancer, the SNARE protein is supporting an undesirable secretion. In this scenario, the polypeptides of the present invention are capable of suppressing secretion from said cancer cells, and thereby addressing this need in the art.
[0025] In this regard, the present inventors have identified undesirable SNARE expression in a range of cancer cell types. These data are presented as Box Plots in FIGS. 3-47. In summary, the present inventors have confirmed that undesirable SNARE expression such as SNAP-25, syntaxin (eg. syntaxin-1a, syntaxin-2), and VAMP (eg. VAMP-1) is observed in small cell lung cancer cells (eg. when compared with normal lung cells); that undesirable SNARE expression such as SNAP-25, syntaxin (eg. syntaxin-2), VAMP (eg. VAMP-1, VAMP-2, VAMP-3) is observed in lung carcinoid cells; that undesirable SNARE expression such as SNAP-25, syntaxin (eg. syntaxin-2), VAMP (eg. VAMP-1, VAMP-2) is observed in lung carcinoid cells (eg. when compared with normal lung cells); that undesirable SNARE expression such as SNAP-25, syntaxin (eg. syntaxin-2, syntaxin-3), and VAMP (eg. VAMP-1, VAMP-3) is observed in bladder carcinoma cells; that undesirable SNARE expression such as SNAP-25, syntaxin (eg. syntaxin-1a), and VAMP (eg. VAMP-2) is observed in breast cancer cells; that undesirable SNARE expression such as SNAP-25, syntaxin (eg. syntaxin-1a, syntaxin-1b, syntaxin-2, syntaxin-3), and VAMP (eg. VAMP-1, VAMP-2, and VAMP-3) is observed in pancreatic adenocarcinoma cells (eg. when compared with normal pancreatic cells); that undesirable SNARE expression such as SNAP-25, syntaxin (eg. syntaxin-1a, syntaxin-1b, syntaxin-3), and VAMP (eg. VAMP-2, VAMP-3) is observed in prostate carcinoma cells (eg. when compared with benign prostate cells); that undesirable SNARE expression such as SNAP-25, syntaxin (eg. syntaxin-2), and VAMP (eg. VAMP-3) is observed in cervix cancer cells (eg. when compared with normal cervix cells); that undesirable SNARE expression such as syntaxin (eg. syntaxin 3) and VAMP (eg. VAMP-1, VAMP-2, VAMP-3) is observed in B-cell leukaemia cells (eg. when compared with normal B-cell lymphocytes); that undesirable SNARE expression such as syntaxin (eg. syntaxin-1a, syntaxin-3) and VAMP (eg. VAMP-1) is observed (eg. up-regulated) in ovarian cancer cells (eg. when compared with normal ovary cells); that undesirable SNARE expression such as SNAP-25 and VAMP (eg. VAMP-1, VAMP-2, VAMP-3) is observed in gastrointestinal (GI) carcinoid cells (eg. when compared with normal GI cells); and that undesirable SNARE expression such as SNAP-25, syntaxin (eg. syntaxin-1a, syntaxin-2) and VAMP (eg. VAMP-3) is observed (eg. up-regulated) in head and neck cancer cells (eg. when compared with normal head and neck cells). In addition, the present inventors have identified that undesirable SNARE expression such as SNAP-25 and VAMP (eg. VAMP-2) is observed in colon cancer cells; that undesirable SNARE expression such as SNAP-25 is observed in adrenal cancer cells; that undesirable SNARE expression such as syntaxin (eg. syntaxin-2) is observed in skin (eg. melanoma) cancer cells; that undesirable SNARE expression such as syntaxin (eg. syntaxin-3) is observed in leukemia (eg. multiple melanoma) cancer cells; that undesirable SNARE expression such as syntaxin (eg. syntaxin-1a) and VAMP (eg. VAMP-1) is observed in lung adenocarcinoma cells; that undesirable SNARE expression such as VAMP (eg. VAMP-1) is observed in liver cancer cells; that undesirable SNARE expression such as VAMP (eg. VAMP-2) is observed in oesophagus cancer cells; and that undesirable SNARE expression such as VAMP (eg. VAMP-3) is observed in lymphoma cancer cells (eg. B-cell lymphoma and Mantell cell).
[0026] The bioactive' component of the polypeptides of the present invention is provided by a non-cytotoxic protease. This distinct group of proteases act by proteolytically-cleaving intracellular transport proteins known as SNARE proteins (e.g. SNAP-25, VAMP, or Syntaxin)--see Gerald K (2002) "Cell and Molecular Biology" (4th edition) John Wiley & Sons, Inc. The acronym SNARE derives from the term Soluble NSF Attachment Receptor, where NSF means N-ethylmaleimide-Sensitive Factor. SNARE proteins are integral to intracellular vesicle formation, and thus to secretion of molecules via vesicle transport from a cell. Accordingly, once delivered to a desired target cell, the non-cytotoxic protease is capable of inhibiting cellular secretion from the target cell.
[0027] Non-cytotoxic proteases are a discrete class of molecules that do not kill cells; instead, they act by inhibiting cellular processes other than protein synthesis.
[0028] Non-cytotoxic proteases are produced as part of a larger toxin molecule by a variety of plants, and by a variety of microorganisms such as Clostridium sp. and Neisseria sp.
[0029] Clostridial neurotoxins represent a major group of non-cytotoxic toxin molecules, and comprise two polypeptide chains joined together by a disulphide bond. The two chains are termed the heavy chain (H-chain), which has a molecular mass of approximately 100 kDa, and the light chain (L-chain), which has a molecular mass of approximately 50 kDa. It is the L-chain, which possesses a protease function and exhibits a high substrate specificity for vesicle and/or plasma membrane associated (SNARE) proteins involved in the exocytic process (eg. synaptobrevin, syntaxin or SNAP-25). These substrates are important components of the neurosecretory machinery.
[0030] Neisseria sp., most importantly from the species N. gonorrhoeae, and Streptococcus sp., most importantly from the species S. pneumoniae, produce functionally similar non-cytotoxic toxin molecules. An example of such a non-cytotoxic protease is IgA protease (see WO99/58571, which is hereby incorporated in its entirety by reference thereto). Thus, the non-cytotoxic protease of the present invention is preferably a clostridial neurotoxin protease or an IgA protease.
[0031] Turning now to the Targeting Moiety (TM) component of the present invention, it is this component that binds the polypeptide of the present invention to a cancer cell. The TM is preferably a peptide.
[0032] The TM binds to a Binding Site on the cancer cell, thereby providing selectivity of the polypeptide to this species of target cell over other cells. By way of example, in gastric cancer, the TM preferably binds to gastric cancer cells in preference to other cells of the stomach, such as those that are not cancerous and/or to other non-cancerous (or cancerous) cells in the body. In this regard, preferred TM embodiments of the present invention include antibodies (eg. monoclonal antibodies, antibody fragments such as Fab, F(ab)'2, Fv, ScFv, etc., and antibody domains peptides), as well as binding scaffolds, which bind to the receptors identified below. Accordingly, the polypeptides of present invention may include commercially available antibodies or binding scaffolds, which have been designed to achieve specific binding to the target cell or receptor in question. Alternatively, preferred TMs include peptide ligands, such as cytokines, growth factors, neuropeptides, and lectins.
[0033] A TM of the present invention binds to a receptor on a cancer cell. By way of example, a TM of the present invention binds to a receptor on a cancer cell selected from the group comprising: a growth hormone-releasing hormone (GHRH, aka GHRF/GRF) receptor; an insulin-like growth factor receptor (e.g. an IGF-1 receptor); a corticotropin releasing factor receptor (e.g. CRHR-2); a gastrin-releasing peptide (GRP) receptor; a bombesin peptide (BB) receptor such as BRS-1, BRS-2, or BRS-3; a growth hormone (GH) receptor; an interleukin receptor (e.g. IL8RA or IL13RA1); a vascular endothelial growth factor (VEGF) receptor; an acetylcholine (ACH) receptor; a somatostatin (SST) receptor, such as SST1, SST2, SST3, SST4 or SST5; a cortistatin (CST) receptor; a chemokine (C-X-C motif) receptor such as CXCR4; a neuropilin receptor; a gonadotropin-releasing hormone (GnRH) receptor; a VIP-glucagon-GRF-secretin superfamily receptor, such as a PAC (eg. PACO receptor or a vasoactive intestinal peptide VPAC receptor (e.g. VPAC-1 or VPAC-2) receptor; or an ErbB family member receptor such as an EGF receptor. All of these receptors are over-expressed in cancer cells.
[0034] In one embodiment of the present invention, the TM binds to a receptor on a cancer cell selected from the group comprising: FLT such as FLT1, FLT4, FLT3, BRS such as BRS3; CHRN such as CHRNA1, CHRND or CHRNE; EPHA such as EPHA7, EPHA4, EPHA5, EPHA3, EPHA1, EPHA2; EFN such as EFNB3, EFNA1, EFNB2, EFNA3, EFNB1; ErbB such as ERBB2, ERBB4; DLK1, DLL3, FGF such as FGFR1, FGFR3, FGFR2; GRPR; GNRHR; GRPR; GnRHR; JAG such as JAG1 (CD339) or JAG2, IFGR such as IGF1R; leukaemia inhibitory factor receptor (LIFR); NMBR; NOTCHR such as NOTCH3 or NOTCH4; VIPR such as VIPR1; VEGFR such as VEGFR2; SSTR such as SSTR1; NMBR; or PDGFR such as PDGFRA or PDGFRB. All of these receptors are over-expressed in cancer cells.
[0035] In one embodiment, the TM is selected from: an adrenomedullin (ADM) peptide, an AM peptide, an autocrine motility factor (AMF) peptide, an amphiregulin peptide, an APRIL (a proliferation-inducing ligand) peptide, an artemin peptide, a betacellulin peptide, a bombesin peptide, a calcitonin receptor (CTR) binding peptide, an ErbB peptide such as an EGF peptide, an endothelin peptide, an erythropoietin peptide (EPO), a fibroblast growth factor (FGF) peptide such as a FGF-5 peptide, an FGF-18 peptide, a bFGF, a FGF8 or a FGF17 peptide, a follicle-stimulating hormone (FSH) peptide, a gastrin peptide, a gastrin releasing peptide (GRP), a glial cell line-defined neurotrophic factor (GDNF) peptide, a ghrelin (GHRL) peptide, a growth hormone-releasing hormone (GHRH) peptide, a granulocyte colony-stimulating factor (G-CSF) peptide, a growth hormone (GH) peptide, a heparin-binding EGF-like growth factor (HB-EGF) peptide, a hepatocyte growth factor/scatter factor (HGF/SF) peptide, an interleukin (IL) such as an IL-1alpha peptide, an IL-6 peptide, an IL-8 peptide or an IL-10 peptide, an IGF-1 peptide, a stromal cell-derived factor-1 (SDF-1) peptide, a keratinocyte growth factor (KGF) peptide, a proepithelin/granulin peptide, a leptin (LEP) peptide, a LIF peptide, an alpha-melanotropin peptide, a MGSA/GRO-alpha, beta or gamma peptide, a NRG-1alpha peptide, an oxytocin peptide, an osteopontin (OPN) peptide, a neuregulin-1 peptide, a PACAP peptide, a PDGF peptide such as a PDGF-alpha peptide, a PDGF-beta peptide, a PDGF-C peptide or a PDGF-D peptide, a prolactin peptide, a SCF peptide, a somatostatin peptide, a tumour necrosis factor (TNF) peptide such as a TNF-alpha peptide, a TGF-beta peptide, a TGF-betal peptide, a VEGF peptide such as a VEGF-C peptide or a VEGF-D peptide, a vasopressin peptide, a VIP peptide, an angiopoietin peptide such as an angiopoietin-1 peptide or angiopoietin-2 peptide, a B-CLL peptide, a BCGF-12KD peptide, a BAFF peptide, a galanin peptide, a GDNF peptide, a GnRH peptide, an IGF-II peptide, a LH peptide, a neurotrophin peptide, a substance P peptide, or a TGF-alpha peptide; as well as truncations and peptide analogues thereof.
[0036] In one embodiment, a TM of the present invention binds to a leptin receptor. Suitable examples of such TMs include: leptin peptides such as a full-length leptin peptide (eg. leptin167), and truncations or peptide analogues thereof such as leptin22-167, leptin70-95, and leptin116-122.
[0037] In another embodiment, a TM of the present invention binds to a ghrelin receptor. Examples of suitable TMs in this regard include: ghrelin peptides such as full-length ghrelin (eg. ghrelin117) and truncations or peptide analogues thereof such as ghrelin24-117, and ghrelin52-117; [Trp3, Arg5]-ghrelin (1-5), des-Gln-Ghrelin, cortistatin-8, His-D-Trp-Ala-Trp-D-Phe-Lys-NH2, growth hormone releasing peptide (e.g. GHRP-6), or hexarelin.
[0038] In one embodiment, a TM of the present invention binds to a somatostatin (SST) receptor. By way of example, suitable TMs include: SST peptides and cortistatin (CST)-peptides, as well as peptide analogues thereof such as D-Phe-Phe-Phe-D-Trp-Lys-Thr-Phe-Thr-NH2 (BIM 23052), D-Phe-Phe-Tyr-D-Trp-Lys-Val-Phe-D-Nal-NH2 (BIM 23056) or c[Cys-Phe-Phe-D-Trp-Lys-Thr-Phe-Cys]-NH2 (BIM-23268). Further examples include the SST peptides SST-14 and SST-28; as well as peptide and peptide analogues such as: octreotide, lanreotide, BIM23027, vapreotide, seglitide, and SOM230. These TMs are preferred TMs for binding to SST receptors, in particular to sst1, sst2, sst3, sst4 and sst5 receptors.
[0039] In another embodiment, a TM of the present invention binds to a gonadotropin-releasing hormone (GnRH) receptor. GnRH is also known as Luteinizing-Hormone Releasing Hormone (LHRH). Examples of suitable
[0040] GnRH receptor TMs include: GnRHI peptides, GnRHII peptides and GnRHIII peptides, for example the full-length 92 amino acid GnRH precursor polypeptide and truncations thereof such as the decapeptide: pyroGlu-His-Trp-Ser-Tyr-Gly-Leu-Arg-Pro-Gly CONH2.
[0041] In one embodiment, a TM of the present invention binds to insulin-like growth factor (IGF) receptor. Suitable examples include, for example IGF-1 peptides and IGF-2 peptides.
[0042] In one embodiment, a TM of the present invention binds to a VIP-glucagon-GRF-secretin superfamily receptor, such as to a PAC (eg. PACO or to a VPAC (e.g. VPAC-1 or VPAC-2) receptor. Suitable examples of such TMs include pituitary adenylate cyclase-activating peptides (PACAP), vasoactive intestinal peptides (VIP), as well as truncations and peptide analogues thereof.
[0043] In one embodiment the TM is a VIP peptide including VIP-1 and VIP-2 peptides, for example VIP(1-28), or a truncation or peptide analogue thereof. These TMs demonstrate a selective binding to VPAC-1. Alternatively, a TM demonstrating a selective binding to VPAC2 may be employed, such as, for example mROM (see Yu et al., Peptides 27 (6) p 1359-66 (2006), which is hereby incorporated by reference thereto). In another embodiment, the TM may be a PACAP peptide, for example PACAP(1-38) or PACAP(1-27), or a truncation of peptide analogue thereof. These TMs are preferred TMs for binding to PAC (eg. PAC-1) receptors.
[0044] In one embodiment, a TM of the present invention binds to an interleukin receptor. Suitable TM examples include: IL-1 peptides (e.g. IL-1α, IL-β, IL-18 peptides) and truncations or peptide analogues thereof, IL-2 peptides (e.g. IL-2, IL-3, IL-12, IL-23 peptides) and truncations or peptide analogues thereof, IL-6 and IL-8 peptides and truncations or peptide analogues thereof, and IL-17 peptides (e.g. II-17A, IL-17C peptides) and truncations or peptide analogues thereof.
[0045] In another embodiment, a TM of the present invention binds to an NGF receptor. Examples of suitable TMs include full-length NGF, and truncations or peptide analogues thereof.
[0046] In one embodiment, a TM of the present invention binds to a vasoactive epidermal growth factor (VEGF) receptor. Examples of suitable TMs include: VEGF peptide (e.g. VEGF-A, VEGF-B, VEGF-C, VEGF-D or VEGF-E and associated splice variants) and truncations or peptide analogues thereof, and placental growth factor (PIGF) and truncations or peptide analogues thereof.
[0047] In another embodiment, a TM of the present invention binds to an ErbB receptor. By way of example, the TM is selected from EGF peptides, transforming growth factor-a (TGF-α) peptides, chimeras of EGF and TGF-α, amphiregulin peptides, betacellulin peptides, epigen peptides, epiregulin peptides, heparin-binding EGF (HB-EGF) peptides, neuregulin (NRG) peptides such as NRG1α, NRG1β, NRG2α, NRG2β, NRG3, NRG4 and neuroregulin splice variants, tomoregulin 1 and 2 peptides, neuroglycan-C peptides, lin-3 peptides, vein peptides, gurken peptides, spitz peptides, or keren peptides; as well as truncations and peptide analogues thereof. There are 4 classes of ErbB receptor (termed ErbB1, erbB2, erbB3 and erbB4), which are also referred to as HER receptors. A number of variants of these receptors exist, which arise from alternate splicing and/or cleavage of the full-length receptor (eg EGFR v1 translation starts at aa543; EGFR v11 deletion of aa521-603; EGFR vlIl deletion of aa 6-273; EGFRvIII/Δ12-13 deletion of aa 6-273 and 409-520; EGFR vIV deletion of aa 959-1030; EGFR vV truncation at residue 958; EGFR TDM/2-7 tandem duplication of 6-273; EGFR TDM/18-25 tandem duplication of 664-1030; EGFR-TDM/18-26 tandem duplication of 664-1014). In addition, there are four ErbB4 receptor isoforms called erbB4 JM-a, erbB4 JM-b, erbB4 CYT-1 and erbB4 CYT-1.
[0048] Preferred TMs bind to ErbB receptors (eg. ErbB1, ErbB2, ErbB3, ErbB4) and splice variants thereof, in particular the ErbB1 receptor. ErbB TMs may also include proteins which contain EGF motifs with a splice site between the fourth and fifth cysteines within the six cysteine EGF-module, where this module is placed in close proximity to the transmembrane region of the potential ligand. For example, interphotoreceptor matrix proteoglycan-2 (IMP-2), meprin (MEP)1α, MEP1β, mucin (MUC)3, MUC4, MUC12. and MUC17, as well as proteins with a T-knot scaffold such as potato carboxypeptidase inhibitor, and antibodies to ErbB receptors such as cetuximab, ABX-EGF, trastuzumab, or EMD72000. Further examples include chimeras generated by swapping domains (loop sequences and/or connecting amino acids) of different ErbB ligands, such as a mammalian erbB receptor ligand in which the B-loop sequence has replaced by those present in the insect (Drosophila) ErbB ligands, an ErbB ligand in which the C-loop sequence of EGF has been replaced by that of TGFα(44-50), EGF ligands in which one or more domain has been replaced by corresponding sequences in TGFα to create EGF/TGFα chimeras (e.g. E3T, T3E, E4T, T4E, T3E4T, T6E and E3T4E, and EGF chimeras in which the N-terminal TGFα sequence (WSHFND) or the neuregulin sequence (SHLVK) has been used to replace the N-terminal EGF sequence C-terminal of the first cysteine residue (NSDSE), T1 E, and Biregulin. Yet further chimeras include EGF in which a domain has been replaced by an EGF-like domain of another protein, such as a blood coagulation, neural development or cell adhesion protein (e.g. Notch 3, Delta 1, EMR1, F4/80, EMR3 and EMR4 receptors).
[0049] In another embodiment, a TM of the present invention binds to a growth-hormone-releasing hormone (GHRH) receptor. GHRH is also known as growth-hormone-releasing factor (GRF or GHRF) or somatocrinin. Suitable TM examples of the present invention include the full-length GHRH (1-44) peptide, and truncations or peptide analogues thereof such as GHRH(1-29), GHRH(1-37), hGHRH(1-40)-OH,
[0050] [MeTyr1,A1a15,22,N1e27]-hGHRH(1-29)-NH2
[0051] [MeTyr1,A1a8,9,15,22,28,Nle 27]-hGHRH(1-29)-NH2
[0052] cyclo(25-29)[MeTyr1,A1a15,DAsp25,N1e27,Orn29+++]-hGHRH(1-29)-NH2.
[0053] (D-Tyr1)-GHRH (1-29)-NH2
[0054] (D-Ala2)-GHRH (1-29)-NH2
[0055] (D-Asp3)-GHRH (1-29)-NH2
[0056] (D-Ala4)-GHRH (1-29)-NH2
[0057] (D-Thr7)-GHRH (1-29)-NH2
[0058] (D-Asn8)-GHRH (1-29)-NH2
[0059] (D-Ser9)-GHRH (1-29)-NH2
[0060] (D-Tyr10)-GHRH (1-29)-NH2
[0061] (Phe4)-GHRH (1-29)-NH2
[0062] (pCl-Phe6)-GHRH (1-29)-NH2
[0063] (N-Ac-Tyr1)-GHRH (1-29)-NH2
[0064] (N-Ac-Tyr1, D-Ala2)-GHRH (1-29)-NH2
[0065] (N-Ac-D-Tyr1, D-Ala2)-GHRH (1-29)-NH2
[0066] (N-Ac-D-Tyr1, D-Ala 2, D-Asp3)-GHRH (1-29)-NH2
[0067] (D-Ala2, NLeu27)-GHRH (1-29)-NH2
[0068] (Nisi, D-Ala2, NLeu27)-GHRH (1-29)-NH2
[0069] (N-Ac-Hist, D-Ala2, N-Leu27)-GHRH (1-29)-NH2
[0070] (Nisi, D-Ala 2, D-Ala 4, Nleu27)-GHRH (1-29)-NH2
[0071] (D-Ala2, D-Asp3, D-Asn8, NLeu27)-GHRH (1-29)-NH2
[0072] (D-Asp3, D-Asn8, NLeu27)-GHRH (1-29)-NH2
[0073] [Nisi, NLeu27]-hGHRH(1-29)-NH2
[0074] [NLeu27]-hGHRH(1-29)-NH2
TABLE-US-00001 H-Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys- Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln- Asp-Ile-Met-Ser-Arg-Gln-Gln-Gly-Glu-Ser-Asn-Gln- Glu-Arg-Gly-Ala-Arg-Ala-Arg-Leu-NH2 H-Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys- Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln- Asp-Ile-Met-Ser-Arg-NH2 H-Tyr-D-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg- Lys-Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu- Gln-Asp-Ile-Met-Ser-Arg-NH2 H-Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys- Ile-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln- Asp-Ile-Met-Asn-Arg-Gln-Gln-Gly-Glu-Arg-Asn-Gln- Glu-Gln-Gly-Ala-Lys-Val-Arg-Leu-NH2 H-Tyr-Ala-Asp-Ala-Ile-Phe-Thr-Asn-Ser-Tyr-Arg-Lys- Val-Leu-Gly-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln- Asp-Ile-Met-Asn-Arg-Gln-Gln-Gly-Glu-Arg-Asn-Gln- Glu-Gln-Gly-Ala-Lys-Val-Arg-Leu-NH2 His-Val-Asp-Ala-Ile-Phe-Thr-Gln-Ser-Tyr-Arg-Lys- Val-Leu-Ala-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln- Asp-Ile-Leu-Asn-Arg His-Val-Asp-Ala-Ile-Phe-Thr-Gln-Ser-Tyr-Arg-Lys- Val-Leu-Ala-Gln-Leu-Ser-Ala-Arg-Lys-Leu-Leu-Gln- Asp-Ile-Leu-Asn-Arg-Gln-Gln-Gly-Glu-Arg-Asn-Gln- Glu-Gln-Gly-Ala.
[0075] In a further embodiment, the TM binds to a bombesin receptor (eg. BRS-1, BRS-2, or BRS-3). TMs for use in the present invention that bind to a bombesin receptor include: bombesin--a 14 amino acid peptide originally isolated from the skin of a frog (pGlu-Gln-Arg-Leu-Gly-Asn-Gin-Trp-Ala-Val-Gly-His-Leu-Met-NH2); and the two known homologues in mammals, namely neuromedin
[0076] B, and gastrin releasing peptide (GRP) such as porcine GRP-Ala-Pro-ValSer-Val-Gly-Gly-Gly-Thr-Val-Leu-Ala-Lys-Met-Tyr-Pro-Arg-Gl- y-Asn-His-Tro-Ala-Val-Gly-His-Leu-Met-NH2, and human GRP-Val-Pro-Leu-Pro-Ala-Gly-Gly-Gly-Thr-Val-Leu-Thr-Lys-Met-Tyr-Pro-Arg-G- ly-Asn-His-Trp-Ala-Val-Gly-His-Leu-Met-NH2, Additional TMs include corresponding bombesin, neuromedin B and GRP truncations as well as peptide analogues thereof.
[0077] In one embodiment, the cancer cell is a lung cancer cell (e.g. a small cell lung cancer cell, a non-small cell lung cancer cell, or a carcinoid lung cancer cell). The present inventors have confirmed that undesirable SNARE expression is observed in lung cancer cells. In this embodiment, the TM ligand binds to a receptor on the lung cancer cell, such as to a receptor selected from: an erythropoietin (EPO) receptor, a VEGF receptor such as a VEGF-3 receptor, an ErbB receptor such as an EGF receptor, an ErbB2 or 3 receptor, an GRP receptor, an ET(A) receptor, a CCK receptor such as a CCK-A or CCK-B receptor, a FLT receptor such a FLT 3 or FLT4 receptor, a CHRND receptor, an EPHA receptor such as an EPHA7, an EPHA4 or an EPHA5 receptor, an EFNA receptor such as an EFNA3 or EFNB3 receptor, a DLK1 receptor, an FGF receptor such as a FGF1 receptor, or a JAG receptor such as a JAG1 (CD339) or a JAG2 receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: erythropoietin (EPO), VEGF such as VEGF-B, VEGF-C, acetylcholine, ephrin peptides such as ephrin-A1, A2, A3, A4 or A5, pro-neuregulin peptides such as pro-neuregulin-2, neuregulin peptides such as neuregulin or NTAK, EGF, TGF such as TGF-beta, GRP, bombesin like peptide, endothelin, PGF, TNF such as TNF-alpha, IL such as IL-6, IL-8, oxytocin, vasopressin, NRG such as NRG-1alpha, bradykinin, HGF, GHRH, FGF such as bFGF, aFGF, FGF-1, FGF-2, NOTCH peptide ligands, FLT3 cytokine, or gastrin; as well as truncations and peptide analogues thereof.
[0078] In another embodiment the cancer cell is a bladder cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in bladder cancer cells. In this embodiment, the TM ligand binds to a receptor on the bladder cancer cell, such as to a receptor selected from: an IGF receptor such as an IGF1 receptor, a G-CSF receptor, a VEGF receptor such as a VEGF-2 receptor, an ErbB receptor such as an EGF receptor, a HGF receptor, or a FGF receptor such as a high/low affinity bFGF receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: G-CSF, VEGF, HB-EGF, IGF such as IGF-1 or IGF-2, amphiregulin, betacellulin, hepatocyte growth factor (HGF), ErbB such as EGF, TGF such as TGF-apha, IL such as IL-6, granulin peptide such as granulin-4, or FGF such as bFGF; as well as truncations and peptide analogues thereof.
[0079] In one embodiment the cancer cell is a breast cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in breast cancer cells. In this embodiment, the TM ligand binds to a receptor on the breast cancer cell, such as to a receptor selected from: an IGF receptor such as an IGF1 receptor, a VIP receptor such as VIPR-1, a GRP receptor, a NMB receptor, a CXC receptor such as CXCR4, a TNF receptor such as TNF receptor 1 or 2, a VEGF receptor such as VEGFR-2 or VEGFR-3, a neuropilin receptor such as NRP-1 or NRP-2, an integrin receptor such as integrin receptor alpha9betal, an OB receptor, an ET receptor such as ET-RA or ET-RB, an erythropoietin receptor (EpoR), a TGF-beta receptor, an AMF receptor, or a prolactin receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: TNF such as TNF-alpha, VEGF such as VEGF- A, VEGF-B, VEGF-C, IL such as IL-6, amphiregulin, leptin, endothelin such as ET-1, ET-2, ET-3, FGF such as FGF-2, GHRH, granulin peptide such as granulin-4, erythropoietin, neuromedin peptides such as GRP neuromedin C, neuromedin B, autocrine motility factor (AMF), prolactin, growth hormone such as hGH, IGF such as IGF-1 or IGF-2, VIP peptides, or PACAP peptides such as PACAP-27 or PACAP-38; as well as truncations and peptide analogues thereof.
[0080] In one embodiment the cancer cell is a pancreatic cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in pancreatic adenocarcinoma cells. In this embodiment, the TM ligand binds to a receptor on the pancreatic cancer cell, such as to a receptor selected from: a VIP receptor such as a VIP1 receptor, a VEGF receptor such as a VEGF-1 or VEGF2 receptor, a SST receptor such as a SST1 receptor, a CHRN receptor such as CHRNG, CHRNE, CHRNA1 or CHRND, an IGF receptor such as IGF1 R, a BRS receptor such as BRS3, a GnRH receptor, a GRP receptor, a NMB receptor, a type-1 growth factor receptor, an ErbB receptor such as EGFR, a CCK receptor such as CCKB or CCKC, a PDGF receptor such as PDGF-alpha, an ADM receptor, a keratinoctye growth factor (KGF) receptor such as FGFR2b, a type I FGF receptor, a GnRH receptor, a GFRalpha3/RET receptor, or a GDNF receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: ErbB such as EGF, TGF such as TGF-alpha, gastrin, PDGF, adrenomedullin, VEGF, KGF, FGF such as FGF-5 or FGF (a/b), VIP, SST peptides such as SST-14 or SST-28, acetylcholine, neuromedin peptides such as GRP neuromedin C, neuromedin B, PACAP peptided such as PACAP-27 or PACAP-38, IL such as IL-1a, GnRH, bombesin, artemin, GDNF, or IGF such as IGF-1 or IGF-2; as well as truncations and peptide analogues thereof.
[0081] In one embodiment the cancer cell is a prostate cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in prostate carcinoma cells. In this embodiment, the TM ligand binds to a receptor on the prostate cancer cell, such as to a receptor selected from: a CHRN such as a CHRNG or CHRNE receptor, an IGF receptor such as an IGF1 receptor, a VIP receptor such as a VIP1 receptor, a CT receptor, an ErbB receptor such as an EGFR (MR2), a [p75(NTR)] or TrkA receptor, a CC receptor such as a CCR2, a FGF receptor such as FGFR-1 to -4, a KGF receptor, a FSH receptor, a GHS receptor such as GHS-R 1 a, a CXC receptor such as CXCR2, a VPAC or PAC receptor such as VPAC1 or PAC1, a CRL receptor, a c-Met/HGF receptor, or a GH receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include:
[0082] calcitonin (CT), TGF such as TGF-alpha, adrenomedullin, acetylcholine, prolactin, IL such as IL-6 or IL-8, NGF, PDF, MCP such as MCP-1, somatostatin, FGF such as FGF1 (acidic FGF), FGF2 (basic FGF), FGF6 or FGF8, EPO, VEGF, KGF, FSH, ghrelin, FGF17, TNF such as TNF-alpha, VIP, PACAP peptides such as PACAP-27 or PACAP-38, AM, HGF, GH, GM-CSF, or IGF such as IGF-1 or IGF-2; as well as truncations and peptide analogues thereof.
[0083] In one embodiment the cancer cell is a cervix or uterine cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in cervix cancer cells. In this embodiment, the TM ligand binds to a receptor on the cervix or uterine cancer cell, such as to a receptor selected from: EPHA receptor such as EPHA5, EPHA7, EPHA4, EPHA3 receptor, a PDGF receptor such as PDGFRA, an EFNA receptor such as EFNA1, an SST receptor such as SSTR1, ab ErbB receptor such as an EGFR, a FLT receptor such as a FLT-1 receptor, a FLK receptor such as a FLK-1 receptor, a VEGF receptor such as VEGFR-3, an ET receptor such as ET(A)R, an IGF receptor such as IGF-1R, or an LIF receptor such as LIFR. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: ErbB such as EGF, TGF such as TGF-alpha, G-CSF, PDGF peptides such as PDGFA or PDGFC, SST peptides such as SST-14 or SST-28, IGF such as IGF-1 or IGF-2, ephrin peptides such as ephrin-A1, A2, A3, A5 or A5, VEGF such as VEGF-C or VEGF-D, granulin peptide such as granulin-4, endothelin such as ET-1, or IL such as IL-6 or IL-1; as well as truncations and peptide analogues thereof.
[0084] In one embodiment the cancer cell is a leukaemia cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in leukaemia cells. In this embodiment, the TM ligand binds to a receptor on the leukaemia cell, such as to a receptor selected from: an FLT receptor such as an FLT1 or 3 receptor, an FGF receptor such as FGFR1, 2 or 3, an EFN receptor such as an EFNB1 or EFNB3 receptor, a JAG receptor such as a JAG1 (cd339) receptor, a PDGF receptor such as PDGFRA or B, EPHAl , EPHA2, a NOTCH receptor such as a NOTCH3 or 4 receptor, an EPHA receptor such as an EPHA7 receptor, a LIF receptor, an EFN receptor such as an EFNA1 or 3 receptor, a DLL receptor such as a DLL3 receptor, an ErbB receptor such as an ErbB2 or 3 receptor, a KDR receptor, a GHS receptor such as GHS-R type 1 a, an IL receptor such as IL-1 receptor type II, or an IGF receptor such as an IGF1R. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: IGF such as IGF-1 or IGF-2, VEGF peptides such as VEGFB, ghrelin, APRIL, pro-neuregulin peptides such as pro-neuregulin-2, neuregulin peptides, NTAK, FGF such as bFGF, aFGF or FGF-1, NOTCH peptides such as NOTCH1, 2 or 3, PGF peptides, PDGF peptides such as PDGFA, PDGFB or PDGFD, ephrin peptides such as ephrin-A (1-5), LIF, GP30 ligand, erythropoietin, JAG peptides such as JAG-1 or JAG-2, Delta-1, IL such as IL1-alpha, or GM-CSF; as well as truncations and peptide analogues thereof.
[0085] In one embodiment the cancer cell is an ovary cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in ovarian cancer cells. In this embodiment, the TM ligand binds to a receptor on the ovary cancer cell, such as to a receptor selected from: an ErbB receptor such as an ErbB2, 3 or 4 receptor, an FGF receptor such as FGFR3, a JAG receptor such as a JAG2 receptor, an EPH receptor such as an EPHA1 or 2 receptor, an IGF receptor such as an IGF1R, a GRP receptor, an EFN receptor such as an EFNB3, Al or B2 receptor, or a GNRH receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: NRG peptides such as NRG-2 or NRG-3, or heparin-binding EGF-like growth factor peptides; neuregulin peptides, GP30 peptide, NOTCH peptides, granulin peptides such as granulin-4, ephrin-A peptides suchas ephrin A1-A5, IGF peptides such as IGF-1 or IGF-2, NTAK, GRP, or GnRH; as well as truncations and peptide analogues thereof.
[0086] In one embodiment the cancer cell is a bone cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in bone cancer cells. In this embodiment, the TM ligand binds to a receptor on the bone cancer cell, such as to a receptor selected from: an IGF receptor such as an IGF-IR, a GHRH receptor, or a FGF receptor such as a bFGF receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: PDGF such as PDGF-alpha or PDGF-beta, TGF such as TGFbetal, IGF such as IGF-1, GHRH, FGF such as bFGF, G-CSF, or IL such as IL-6; as well as truncations and peptide analogues thereof.
[0087] In one embodiment the cancer cell is a brain cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in brain cancer cells. In this embodiment, the TM ligand binds to a receptor on the brain cancer cell, such as to a receptor selected from: an EPO receptor, receptor 78 kDa glycoprotein (gp78), an ErbB receptor such as EGFR, as PDGF receptor such as PDGFR-alpha, a CRL receptor such as CRLR/RAMP2 or CRLR/RAMP3, a neuropilin receptor such as NRP-1 or NRP-2, a CXC receptor such as CXCR4, a VEGFR such as VEGFR-1, an IGFR such as IGF1R, a GDNF receptor such as GDNFR-alpha 1, or a MET receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: erythropoietin (EPO) peptides, IGF peptides, autocrine motility factor (AMF) peptides, GDNF peptides, HB-EGF peptides, TGF peptides such as TGF-alpha, PDGF peptides such as PDGFA, PDGF-B, PDGF-C or PDGF-D, neuregulin peptides such as neuregulin-1, adrenomedullin, IL peptides such as IL-6, scatter factor/hepatocyte growth factor (SF/HGF) peptides, granulin peptides such as granulin-4, VEGF peptides such as VEGF-A, or TGF peptides such as TGF beta 1 or TGF beta 2; as well as truncations and peptide analogues thereof.
[0088] In one embodiment the cancer cell is a colorectal cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in colorectal cancer cells In this embodiment, the TM ligand binds to a receptor on the colorectal cancer cell, such as to a receptor selected from: an IGF receptor such as IGF-R1, an ErbB receptor such as EGFR, a GRP receptor, an IL receptor such as IL6-R, a CCK receptor such as CCK-B receptor, or a prolactin receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: FGF such as FGF18, IGF such as IGF-1, TGF such as TGF-alpha, GRP, IL such as IL6, ErbB such as EGF, gastrin, or prolactin; as well as truncations and peptide analogues thereof.
[0089] In one embodiment the cancer cell is a liver cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in liver cancer cells In this embodiment, the TM ligand binds to a receptor on the liver cancer cell, such as to a receptor selected from: a TrkA (NGF) receptor, an IGF receptor such as IGF-IR, a HGF receptor such as a HGF-Met receptor, a c-met receptor, a gp78 receptor, or an ErbB receptor such as an EGFR. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: NGF, IL such as IL-6 or IL-8, IGF such as IGF-1 or IGF-2, HGF, SF/HGF, hepatopoietin (HPO), AMF, TGF such as TGF-beta or TGF-alpha, LIF or PDGF; as well as truncations and peptide analogues thereof.
[0090] In one embodiment the cancer cell is a skin cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in skin cancer cells In this embodiment, the TM ligand binds to a receptor on the skin (eg. basal, melanoma, squamous) cancer cell, such as to a receptor selected from: a FGF receptor such as FGFR-1, a c-kit receptor, a VEGF receptor such as VEGFR-2, a c-Met receptor, or a melanocortin receptor such as a melanocortin-1 receptor. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: IL such as IL8 or IL-10, FGF such as bFGF, SCF, VEGF such as VEGF-A, ErbB such as EGF, EPO, osteopontin (OPN), TGF such as TGF-beta, MGSA/GRO such as MGSA/GRO alpha, beta or gamma, HGF/SF, alpha-melanotropin, amphiregulin, or AMF; as well as truncations and peptide analogues thereof.
[0091] In one embodiment the cancer cell is a Kaposi sarcoma cancer cell. The present inventors have confirmed that undesirable SNARE expression is observed in Kaposi sarcoma cancer cells In this embodiment, the TM ligand binds to a receptor on the Kaposi sarcoma cell, such as to a receptor selected from: a PDGF receptor such as PDGFRA, a c-kit receptor, a TGF-beta receptor such as TGFR-1, an endothelin receptor such as ETA-R, a chemokine receptor such as CXCR3 or CCRL2, a VEGF receptor such as VEGFR-2, an IGF receptor such as IGF-IR, or an angiopoietin receptor such as TIE2. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: PDGFa, IGF-1, Ang2, IL such as IL6, VEGF such as VEGF-A, endothelin-1, TGF such as TGF-beta, CXCL11, CCL8/14; as well as truncations and peptide analogues thereof.
[0092] In one embodiment the cancer cell is a renal cancer cell. In this embodiment, the TM ligand binds to a receptor on the renal cancer cell, such as to a receptor selected from: an IGF receptor such as IGF-IR, a VEGF receptor, a CXC receptor, or an ErbB receptor such as an EGFR. Preferred TMs in this embodiment include the corresponding natural ligand to said receptors, as well as truncations and peptide analogues thereof. Examples include: IL such as IL-6 or IL-8, IGF such as IGF-1 or IGF-2, TGF such as TGF-beta or TGF-alpha, VEGF such as VEGF-A, chemokines such as CXCL12; as well as truncations and peptide analogues thereof.
[0093] The above embodiments describe particular cancers in which the present inventors have demonstrated undesirable SNARE expression (eg. up-regulated. SNARE expression). The present invention is not, however, limited to said specific cancer types, and embraces all cancer types in which SNARE expression occurs.
[0094] The above-described undesirable SNARE expression (eg. up-regulation) in cancer cells may optionally be linked to an up-regulation of one or more specific receptor types present on the cancer cells in question. In this regard, the present inventors have identified particular receptor types (detailed above), which are up-regulated in the same cells in which undesirable SNARE expression is observed (eg. up-regulated).
[0095] Polypeptide Preparation
[0096] The polypeptides of the present invention comprise 3 principal components: a bioactive' (ie. a non-cytotoxic protease); a TM; and a translocation domain. The general technology associated with the preparation of such fusion proteins is often referred to as re-targeted toxin technology. By way of exemplification, we refer to: WO94/21300; WO96/33273; WO98/07864; WO00/10598; WO01/21213; WO06/059093; WO00/62814; WO00/04926; WO93/15766; WO00/61192; and WO99/58571. All of these publications are herein incorporated by reference thereto.
[0097] In more detail, the TM component of the present invention may be fused to either the protease component or the translocation component of the present invention. Said fusion is preferably by way of a covalent bond, for example either a direct covalent bond or via a spacer/linker molecule. The protease component and the translocation component are preferably linked together via a covalent bond, for example either a direct covalent bond or via a spacer/linker molecule. Suitable spacer/linked molecules are well known in the art, and typically comprise an amino acid-based sequence of between 5 and 40, preferably between 10 and 30 amino acid residues in length.
[0098] In use, the polypeptides have a di-chain conformation, wherein the protease component and the translocation component are linked together, preferably via a disulphide bond.
[0099] The polypeptides of the present invention may be prepared by conventional chemical conjugation techniques, which are well known to a skilled person. By way of example, reference is made to Hermanson, G. T. (1996), Bioconjugate techniques, Academic Press, and to Wong, S. S. (1991), Chemistry of protein conjugation and cross-linking, CRC Press, Nagy et al., PNAS 95 p1794-99 (1998). Further detailed methodologies for attaching synthetic TMs to a polypeptide of the present invention are provided in, for example, EP0257742. The above-mentioned conjugation publications are herein incorporated by reference thereto.
[0100] Alternatively, the polypeptides may be prepared by recombinant preparation of a single polypeptide fusion protein (see, for example, WO98/07864). This technique is based on the in vivo bacterial mechanism by which native clostridial neurotoxin (i.e. holotoxin) is prepared, and results in a fusion protein having the following `simplified` structural arrangement:
NH2-[protease component]-[translocation component]-[TM]-COON
[0101] According to WO98/07864, the TM is placed towards the C-terminal end of the fusion protein. The fusion protein is then activated by treatment with a protease, which cleaves at a site between the protease component and the translocation component. A di-chain protein is thus produced, comprising the protease component as a single polypeptide chain covalently attached (via a disulphide bridge) to another single polypeptide chain containing the translocation component plus TM.
[0102] Alternatively, according to WO06/059093, the TM component of the fusion protein is located towards the middle of the linear fusion protein sequence, between the protease cleavage site and the translocation component. This ensures that the TM is attached to the translocation domain (ie. as occurs with native clostridial holotoxin), though in this case the two components are reversed in order vis-a-vis native holotoxin. Subsequent cleavage at the protease cleavage site exposes the N-terminal portion of the TM, and provides the di-chain polypeptide fusion protein.
[0103] The above-mentioned protease cleavage sequence(s) may be introduced (and/or any inherent cleavage sequence removed) at the DNA level by conventional means, such as by site-directed mutagenesis. Screening to confirm the presence of cleavage sequences may be performed manually or with the assistance of computer software (e.g. the MapDraw program by DNASTAR, Inc.). Whilst any protease cleavage site may be employed (ie. clostridial, or non-clostridial), the following are preferred:
TABLE-US-00002 Enterokinase (DDDDK↓) Factor Xa (IEGR↓/IDGR↓) TEV(Tobacco Etch virus) (ENLYFQ↓G) Thrombin (LVPR↓GS) PreScission (LEVLFQ↓GP).
[0104] Additional protease cleavage sites include recognition sequences that are cleaved by a non-cytotoxic protease, for example by a clostridial neurotoxin. These include the SNARE (eg. SNAP-25, syntaxin, VAMP) protein recognition sequences that are cleaved by non-cytotoxic proteases such as clostridial neurotoxins. Particular examples are provided in US2007/0166332, which is hereby incorporated in its entirety by reference thereto.
[0105] Also embraced by the term protease cleavage site is an intein, which is a self-cleaving sequence. The self-splicing reaction is controllable, for example by varying the concentration of reducing agent present. The above-mentioned `activation` cleavage sites may also be employed as a `destructive` cleavage site (discussed below) should one be incorporated into a polypeptide of the present invention.
[0106] In a preferred embodiment, the fusion protein of the present invention may comprise one or more N-terminal and/or C-terminal located purification tags. Whilst any purification tag may be employed, the following are preferred:
[0107] His-tag (e.g. 6 x histidine), preferably as a C-terminal and/or N-terminal tag
[0108] MBP-tag (maltose binding protein), preferably as an N-terminal tag
[0109] GST-tag (glutathione-S-transferase), preferably as an N-terminal tag
[0110] His-MBP-tag, preferably as an N-terminal tag
[0111] GST-MBP-tag, preferably as an N-terminal tag
[0112] Thioredoxin-tag, preferably as an N-terminal tag
[0113] CBD-tag (Chitin Binding Domain), preferably as an N-terminal tag.
[0114] One or more peptide spacer/linker molecules may be included in the fusion protein. For example, a peptide spacer may be employed between a purification tag and the rest of the fusion protein molecule.
[0115] Thus, a third aspect of the present invention provides a nucleic acid (e.g. DNA) sequence encoding a polypeptide as described above (i.e. the second aspect of the present invention).
[0116] Said nucleic acid may be included in the form of a vector, such as a plasmid, which may optionally include one or more of an origin of replication, a nucleic acid integration site, a promoter, a terminator, and a ribosome binding site.
[0117] The present invention also includes a method for expressing the above-described nucleic acid sequence (i.e. the third aspect of the present invention) in a host cell, in particular in E. coli.
[0118] The present invention also includes a method for activating a polypeptide of the present invention, said method comprising contacting the polypeptide with a protease that cleaves the polypeptide at a recognition site (cleavage site) located between the non-cytotoxic protease component and the translocation component, thereby converting the polypeptide into a di-chain polypeptide wherein the non-cytotoxic protease and translocation components are joined together by a disulphide bond. In a preferred embodiment, the recognition site is not native to a naturally-occurring clostridial neurotoxin and/or to a naturally-occurring IgA protease.
[0119] The polypeptides of the present invention may be further modified to reduce or prevent unwanted side-effects associated with dispersal into non-targeted areas. According to this embodiment, the polypeptide comprises a destructive cleavage site. The destructive cleavage site is distinct from the `activation` site (i.e. di-chain formation), and is cleavable by a second protease and not by the non-cytotoxic protease. Moreover, when so cleaved at the destructive cleavage site by the second protease, the polypeptide has reduced potency (e.g. reduced binding ability to the intended target cell, reduced translocation activity and/or reduced non-cytotoxic protease activity). For completeness, any of the `destructive` cleavage sites of the present invention may be separately employed as an `activation` site in a polypeptide of the present invention.
[0120] Thus, according to this embodiment, the present invention provides a polypeptide that can be controllably inactivated and/or destroyed at an off-site location.
[0121] In a preferred embodiment, the destructive cleavage site is recognised and cleaved by a second protease (i.e. a destructive protease) selected from a circulating protease (e.g. an extracellular protease, such as a serum protease or a protease of the blood clotting cascade), a tissue-associated protease (e.g. a matrix metalloprotease (MMP), such as an MMP of muscle), and an intracellular protease (preferably a protease that is absent from the target cell).
[0122] Thus, in use, should a polypeptide of the present invention become dispersed away from its intended target cell and/or be taken up by a non-target cell, the polypeptide will become inactivated by cleavage of the destructive cleavage site (by the second protease).
[0123] In one embodiment, the destructive cleavage site is recognised and cleaved by a second protease that is present within an off-site cell-type. In this embodiment, the off-site cell and the target cell are preferably different cell types. Alternatively (or in addition), the destructive cleavage site is recognised and cleaved by a second protease that is present at an off-site location (e.g. distal to the target cell). Accordingly, when destructive cleavage occurs extracellularly, the target cell and the off-site cell may be either the same or different cell-types. In this regard, the target cell and the off-site cell may each possess a receptor to which the same polypeptide of the invention binds.
[0124] The destructive cleavage site of the present invention provides for inactivation/destruction of the polypeptide when the polypeptide is in or at an off-site location. In this regard, cleavage at the destructive cleavage site minimises the potency of the polypeptide (when compared with an identical polypeptide lacking the same destructive cleavage site, or possessing the same destructive site but in an uncleaved form). By way of example, reduced potency includes: reduced binding (to a mammalian cell receptor) and/or reduced translocation (across the endosomal membrane of a mammalian cell in the direction of the cytosol), and/or reduced SNARE protein cleavage.
[0125] When selecting destructive cleavage site(s) in the context of the present invention, it is preferred that the destructive cleavage site(s) are not substrates for any proteases that may be separately used for post-translational modification of the polypeptide of the present invention as part of its manufacturing process. In this regard, the non-cytotoxic proteases of the present invention typically employ a protease activation event (via a separate `activation` protease cleavage site, which is structurally distinct from the destructive cleavage site of the present invention). The purpose of the activation cleavage site is to cleave a peptide bond between the non-cytotoxic protease and the translocation or the binding components of the polypeptide of the present invention, thereby providing an `activated` di-chain polypeptide wherein said two components are linked together via a di-sulfide bond.
[0126] Thus, to help ensure that the destructive cleavage site(s) of the polypeptides of the present invention do not adversely affect the `activation` cleavage site and subsequent di-sulfide bond formation, the former are preferably introduced into polypeptide of the present invention at a position of at least 20, at least 30, at least 40, at least 50, and more preferably at least 60, at least 70, at least 80 (contiguous) amino acid residues away from the `activation` cleavage site.
[0127] The destructive cleavage site(s) and the activation cleavage site are preferably exogenous (i.e. engineered/artificial) with regard to the native components of the polypeptide. In other words, said cleavage sites are preferably not inherent to the corresponding native components of the polypeptide. By way of example, a protease or translocation component based on BoNT/A L-chain or H-chain (respectively) may be engineered according to the present invention to include a cleavage site. Said cleavage site would not, however, be present in the corresponding BoNT native L-chain or H-chain. Similarly, when the Targeting Moiety component of the polypeptide is engineered to include a protease cleavage site, said cleavage site would not be present in the corresponding native sequence of the corresponding Targeting Moiety.
[0128] In a preferred embodiment of the present invention, the destructive cleavage site(s) and the `activation` cleavage site are not cleaved by the same protease. In one embodiment, the two cleavage sites differ from one another in that at least one, more preferably at least two, particularly preferably at least three, and most preferably at least four of the tolerated amino acids within the respective recognition sequences is/are different.
[0129] By way of example, in the case of a polypeptide chimera containing a Factor Xa `activation` site between clostridial L-chain and HN components, it is preferred to employ a destructive cleavage site that is a site other than a Factor Xa site, which may be inserted elsewhere in the L-chain and/or HN and/or TM component(s). In this scenario, the polypeptide may be modified to accommodate an alternative `activation` site between the L-chain and HN components (for example, an enterokinase cleavage site), in which case a separate Factor Xa cleavage site may be incorporated elsewhere into the polypeptide as the destructive cleavage site. Alternatively, the existing Factor Xa `activation` site between the L-chain and HN components may be retained, and an alternative cleavage site such as a thrombin cleavage site incorporated as the destructive cleavage site.
[0130] When identifying suitable sites within the primary sequence of any of the components of the present invention for inclusion of cleavage site(s), it is preferable to select a primary sequence that closely matches with the proposed cleavage site that is to be inserted. By doing so, minimal structural changes are introduced into the polypeptide. By way of example, cleavage sites typically comprise at least 3 contiguous amino acid residues. Thus, in a preferred embodiment, a cleavage site is selected that already possesses (in the correct position(s)) at least one, preferably at least two of the amino acid residues that are required in order to introduce the new cleavage site. By way of example, in one embodiment, the Caspase 3 cleavage site (DMQD) may be introduced. In this regard, a preferred insertion position is identified that already includes a primary sequence selected from, for example, Dxxx, xMxx, xxQx, xxxD, DMxx, DxQx, DxxD, xMQx, xMxD, xxQD, DMQx, xMQD, DxQD, and DMxD.
[0131] Similarly, it is preferred to introduce the cleavage sites into surface exposed regions. Within surface exposed regions, existing loop regions are preferred.
[0132] In a preferred embodiment of the present invention, the destructive cleavage site(s) are introduced at one or more of the following position(s), which are based on the primary amino acid sequence of BoNT/A. Whilst the insertion positions are identified (for convenience) by reference to BoNT/A, the primary amino acid sequences of alternative protease domains and/or translocation domains may be readily aligned with said BoNT/A positions.
[0133] For the protease component, one or more of the following positions is preferred: 27-31, 56-63, 73-75, 78-81, 99-105, 120-124, 137-144, 161-165, 169-173, 187-194, 202-214, 237-241, 243-250, 300-304, 323-335, 375-382, 391-400, and 413-423. The above numbering preferably starts from the N-terminus of the protease component of the present invention.
[0134] In a preferred embodiment, the destructive cleavage site(s) are located at a position greater than 8 amino acid residues, preferably greater than 10 amino acid residues, more preferably greater than 25 amino acid residues, particularly preferably greater than 50 amino acid residues from the N-terminus of the protease component. Similarly, in a preferred embodiment, the destructive cleavage site(s) are located at a position greater than 20 amino acid residues, preferably greater than 30 amino acid residues, more preferably greater than 40 amino acid residues, particularly preferably greater than 50 amino acid residues from the C-terminus of the protease component.
[0135] For the translocation component, one or more of the following positions is preferred: 474-479, 483-495, 507-543, 557-567, 576-580, 618-631, 643-650, 669-677, 751-767, 823-834, 845-859. The above numbering preferably acknowledges a starting position of 449 for the N-terminus of the translocation domain component of the present invention, and an ending position of 871 for the C-terminus of the translocation domain component.
[0136] In a preferred embodiment, the destructive cleavage site(s) are located at a position greater than 10 amino acid residues, preferably greater than 25 amino acid residues, more preferably greater than 40 amino acid residues, particularly preferably greater than 50 amino acid residues from the N-terminus of the translocation component. Similarly, in a preferred embodiment, the destructive cleavage site(s) are located at a position greater than 10 amino acid residues, preferably greater than 25 amino acid residues, more preferably greater than 40 amino acid residues, particularly preferably greater than 50 amino acid residues from the C-terminus of the translocation component.
[0137] In a preferred embodiment, the destructive cleavage site(s) are located at a position greater than 10 amino acid residues, preferably greater than 25 amino acid residues, more preferably greater than 40 amino acid residues, particularly preferably greater than 50 amino acid residues from the N-terminus of the TM component. Similarly, in a preferred embodiment, the destructive cleavage site(s) are located at a position greater than 10 amino acid residues, preferably greater than 25 amino acid residues, more preferably greater than 40 amino acid residues, particularly preferably greater than 50 amino acid residues from the C-terminus of the TM component.
[0138] The polypeptide of the present invention may include one or more (e.g. two, three, four, five or more) destructive protease cleavage sites. Where more than one destructive cleavage site is included, each cleavage site may be the same or different. In this regard, use of more than one destructive cleavage site provides improved off-site inactivation. Similarly, use of two or more different destructive cleavage sites provides additional design flexibility.
[0139] The destructive cleavage site(s) may be engineered into any of the following component(s) of the polypeptide: the non-cytotoxic protease component; the translocation component; the Targeting Moiety; or the spacer peptide (if present). In this regard, the destructive cleavage site(s) are chosen to ensure minimal adverse effect on the potency of the polypeptide (for example by having minimal effect on the targeting/binding regions and/or translocation domain, and/or on the non-cytotoxic protease domain) whilst ensuring that the polypeptide is labile away from its target site/target cell.
[0140] Preferred destructive cleavage sites (plus the corresponding second proteases) are listed in the Table immediately below. The listed cleavage sites are purely illustrative and are not intended to be limiting to the present invention.
TABLE-US-00003 Destructive cleavage site Tolerated recognition sequence variance Second recognition P4-P3-P2-P1--P1'-P2'-P3' protease sequence P4 P3 P2 P1 P1' P2' P3' Thrombin LVPRGS A, F, G, I, A, F, G, P R Not D Not -- L, T, V I, L, T, or E D or or M V, W E or A Thrombin GRG G R G Factor Xa IEGR A, F, G, I, D or E G R -- -- -- L, T, V or M ADAM17 PLAQAVRSSS Human SKGRSLIGRV airway trypsin-like protease (HAT) ACE -- -- -- -- Not P Not N/A (peptidyl- D or dipeptidase E A) Elastase MEAVTY M, R E A, V, V, T, H Y -- (leukocyte) H T Furin RXR/KR R X R R or K Granzyme IEPD I E P D -- -- -- Caspase 1 F, W, Y, -- H, D Not -- -- L A, T P, E.D. Q.K or R Caspase 2 DVAD D V A D Not -- -- P, E.D. Q.K or R Caspase 3 DMQD D M Q D Not -- -- P, E.D. Q.K or R Caspase 4 LEVD L E V D Not -- -- P, E.D. Q.K or R Caspase 5 L or W E H D -- -- -- Caspase 6 V E H D Not -- -- or I P, E.D. Q.K or R Caspase 7 DEVD D E V D Not -- -- P, E.D. Q.K or R Caspase 8 I or L E T D Not -- -- P, E.D. Q.K or R Caspase 9 LEHD L E H D -- -- -- Caspase IEHD I E H D -- -- -- 10
[0141] Matrix metalloproteases (MMPs) are a preferred group of destructive proteases in the context of the present invention. Within this group, ADAM17 (EC 3.4.24.86, also known as TACE), is preferred and cleaves a variety of membrane-anchored, cell-surface proteins to "shed" the extracellular domains. Additional, preferred MMPs include adamalysins, serralysins, and astacins.
[0142] Another group of preferred destructive proteases is a mammalian blood protease, such as Thrombin, Coagulation Factor Vila, Coagulation Factor IXa, Coagulation Factor Xa, Coagulation Factor XIa, Coagulation Factor XIIa, Kallikrein, Protein C, and MBP-associated serine protease.
[0143] In one embodiment of the present invention, said destructive cleavage site comprises a recognition sequence having at least 3 or 4, preferably 5 or 6, more preferably 6 or 7, and particularly preferably at least 8 contiguous amino acid residues. In this regard, the longer (in terms of contiguous amino acid residues) the recognition sequence, the less likely non-specific cleavage of the destructive site will occur via an unintended second protease.
[0144] It is preferred that the destructive cleavage site of the present invention is introduced into the protease component and/or the Targeting Moiety and/or into the translocation component and/or into the spacer peptide. Of these four components, the protease component is preferred. Accordingly, the polypeptide may be rapidly inactivated by direct destruction of the non-cytotoxic protease and/or binding and/or translocation components.
[0145] Polypeptide Delivery
[0146] In use, the present invention employs a pharmaceutical composition, comprising a polypeptide, together with at least one component selected from a pharmaceutically acceptable carrier, excipient, adjuvant, propellant and/or salt.
[0147] The polypeptides of the present invention may be formulated for oral, parenteral, continuous infusion, implant, inhalation or topical application. Compositions suitable for injection may be in the form of solutions, suspensions or emulsions, or dry powders which are dissolved or suspended in a suitable vehicle prior to use.
[0148] Local delivery means may include an aerosol, or other spray (eg. a nebuliser). In this regard, an aerosol formulation of a polypeptide enables delivery to the lungs and/or other nasal and/or bronchial or airway passages.
[0149] The preferred route of administration is selected from: systemic, laparoscopic and/or localised injection (e.g. injection directly into the tumour).
[0150] In the case of formulations for injection, it is optional to include a pharmaceutically active substance to assist retention at or reduce removal of the polypeptide from the site of administration. One example of such a pharmaceutically active substance is a vasoconstrictor such as adrenaline. Such a formulation confers the advantage of increasing the residence time of polypeptide following administration and thus increasing and/or enhancing its effect.
[0151] The dosage ranges for administration of the polypeptides of the present invention are those to produce the desired therapeutic effect. It will be appreciated that the dosage range required depends on the precise nature of the polypeptide or composition, the route of administration, the nature of the formulation, the age of the patient, the nature, extent or severity of the patient's condition, contraindications, if any, and the judgement of the attending physician. Variations in these dosage levels can be adjusted using standard empirical routines for optimisation.
[0152] Suitable daily dosages (per kg weight of patient) are in the range 0.0001-1 mg/kg, preferably 0.0001-0.5 mg/kg, more preferably 0.002-0.5 mg/kg, and particularly preferably 0.004-0.5 mg/kg. The unit dosage can vary from less that 1 microgram to 30 mg, but typically will be in the region of 0.01 to 1 mg per dose, which may be administered daily or preferably less frequently, such as weekly or six monthly.
[0153] A particularly preferred dosing regimen is based on 2.5 ng of polypeptide as the 1× dose. In this regard, preferred dosages are in the range 1×-100× (i.e. 2.5-250 ng).
[0154] Fluid dosage forms are typically prepared utilising the polypeptide and a pyrogen-free sterile vehicle. The polypeptide, depending on the vehicle and concentration used, can be either dissolved or suspended in the vehicle. In preparing solutions the polypeptide can be dissolved in the vehicle, the solution being made isotonic if necessary by addition of sodium chloride and sterilised by filtration through a sterile filter using aseptic techniques before filling into suitable sterile vials or ampoules and sealing. Alternatively, if solution stability is adequate, the solution in its sealed containers may be sterilised by autoclaving. Advantageously additives such as buffering, solubilising, stabilising, preservative or bactericidal, suspending or emulsifying agents and or local anaesthetic agents may be dissolved in the vehicle.
[0155] Dry powders, which are dissolved or suspended in a suitable vehicle prior to use, may be prepared by filling pre-sterilised ingredients into a sterile container using aseptic technique in a sterile area. Alternatively the ingredients may be dissolved into suitable containers using aseptic technique in a sterile area. The product is then freeze dried and the containers are sealed aseptically.
[0156] Parenteral suspensions, suitable for intramuscular, subcutaneous or intradermal injection, are prepared in substantially the same manner, except that the sterile components are suspended in the sterile vehicle, instead of being dissolved and sterilisation cannot be accomplished by filtration. The components may be isolated in a sterile state or alternatively it may be sterilised after isolation, e.g. by gamma irradiation.
[0157] Advantageously, a suspending agent for example polyvinylpyrrolidone is included in the composition/s to facilitate uniform distribution of the components.
[0158] Administration in accordance with the present invention may take advantage of a variety of delivery technologies including microparticle encapsulation, viral delivery systems or high-pressure aerosol impingement.
[0159] Definitions Section
[0160] Targeting Moiety (TM) means any chemical structure that functionally interacts with a Binding Site to cause a physical association between the polypeptide of the invention and the surface of a target cell. In the context of the present invention, the target cell is a cancerous cell, for example one in which autocrine stimulation is driving proliferation, survival, metastatic potential or angiogenesis. The term TM embraces any molecule (ie. a naturally occurring molecule, or a chemically/physically modified variant thereof) that is capable of binding to a Binding Site on the target cell, which Binding Site is capable of internalisation (eg. endosome formation)--also referred to as receptor-mediated endocytosis. The TM may possess an endosomal membrane translocation function, in which case separate TM and Translocation Domain components need not be present in an agent of the present invention. Throughout the preceding description, specific TMs have been described. Reference to said TMs is merely exemplary, and the present invention embraces all variants and derivatives thereof, which retain the basic binding (i.e. targeting) ability of the exemplified TMs.
[0161] In one embodiment, the cancer cell target of the present invention may be a cancer cell that does not secrete growth hormone (ie. no significant or detectable growth hormone is secreted therefrom).
[0162] As mentioned previously, preferred TMs include antibodies (eg. antibody fragments) and binding scaffolds; especially commercially avilable antibodies/fragments and scaffolds designed for the purpose of binding (eg. specifically) to cancer cells.
[0163] Protein scaffolds represent a new generation of universal binding frameworks to complement the expanding repertoire of therapeutic monoclonal antibodies and derivatives such as scFvs, Fab molecules, dAbs (single-domain antibodies), diabodies and minibodies, each of which may be employed as a TM of the present invention. Scaffold systems create or modify known protein recognition domains either through creation of novel scaffolds or modification of known protein binding domains. Such scaffolds include but are not limited to:
[0164] (i) protein A based scaffolds--affibodies (Nord, K. et al 1997 "Binding proteins selected from combinatorial libraries of an alpha-helical bacterial receptor domain". Nat Biotechnol 15, 772-777);
[0165] (ii) lipocalin based scaffolds--anticalins (Skerra 2008 "Alternative binding proteins: anticalins--harnessing the structural plasticity of the lipocalin ligand pocket to engineer novel binding activities". FEBS J. 275:2677-83);
[0166] (iii) fibronectin based scaffolds--adnectin (Dineen et al 2008 "The Adnectin CT-322 is a novel VEGF receptor 2 inhibitor that decreases tumor burden in an orthotopic mouse model of pancreatic cancer". BMC Cancer 8:352);
[0167] (iv) avimers (Silverman et al 2005 "Multivalent avimer proteins evolved by exon shuffling of a family of human receptor domains". Nat Biotechnol 23:1556-61);
[0168] (v) ankyrin based scaffolds--darpins (Zahnd et al 2006 "Selection and characterization of Her2 binding-designed ankyrin repeat proteins". J Biol Chem. 281:35167-75); and
[0169] (vi) centyrin scaffolds--based on a protein fold that has significant structural homology to Ig domains with loops that are analogous to CDRs. Ig domains are a common module in human proteins and have been widely applied as alternative scaffold proteins. Each of the above `scaffold` publications is hereby incorporated (in its entirety) by reference thereto.
[0170] Binding scaffolds can be used to target particular cell types via interaction with specific cell surface proteins, receptors or other cell surface epitopes such as sugar groups. Such modified scaffolds can be engineered onto recombinant non-cytotoxic protease based polypeptides of the present invention to target specific cancer cell types of interest.
[0171] The TM of the present invention binds (preferably specifically binds) to the target cell in question. The term "specifically binds" preferably means that a given TM binds to the target cell with a binding affinity (Ka) of 106 M-1 or greater, preferably 107 M-1 or greater, more preferably 108 M-1 or greater, and most preferably, 109 M-1 orgreater.
[0172] Reference to TM in the present specification embraces fragments and variants thereof, which retain the ability to bind to the target cell in question. By way of example, a variant may have at least 80%, preferably at least 90%, more preferably at least 95%, and most preferably at least 97 or at least 99% amino acid sequence homology with the reference TM. Thus, a variant may include one or more analogues of an amino acid (e.g. an unnatural amino acid), or a substituted linkage. Also, by way of example, the term fragment, when used in relation to a TM, means a peptide having at least ten, preferably at least twenty, more preferably at least thirty, and most preferably at least forty amino acid residues of the reference TM. The term fragment also relates to the above-mentioned variants. Thus, by way of example, a fragment of the present invention may comprise a peptide sequence having at least 10, 20, 30 or 40 amino acids, wherein the peptide sequence has at least 80% sequence homology over a corresponding peptide sequence (of contiguous) amino acids of the reference peptide.
[0173] By way of example, ErbB peptide TMs may be modified to generate mutein ErbB ligands with improved properties such as increased stability. By way of example, ErbB TM muteins include ErbB peptides having amino acid modifications such as a valine residue at position 46 or 47 (EGFVal46 or 47), which confers stability to cellular proteases. ErbB TMs may also have amino acids deleted or additional amino acids inserted. This includes but is not limited to EGF having a deletion of the two C-terminal amino acids and a neutral amino acid substitution at position 51 (particularly EGF51Gln51; see US20020098178A1), and EGF with amino acids deleted (e.g. rEGF2-48; rEGF3-48 and rEGF4-48). Fragments of ErbB TMs may include fragments of TGFα which contain predicted (3-turn regions (e.g. a peptide of the sequence Ac-C-H-S-G-Y-V-G-A-R-C-O-OMe), fragments of EGF such as [Ala20]EGF(14-31), and the peptide YHWYGYTPQNVI or GE11. All of the above patent specifications are incorporated herein by reference thereto.
[0174] By way of further example, somatostatin (SST) and cortistatin (CST) have high structural homology, and bind to all known SST receptors. Full-length SST has the amino acid sequence:
TABLE-US-00004 MLSCRLQCALAALSIVLALGCVTGAPSDPRLRQFLQKSLAAAAGKQELAK YFLAELLSEPNQTENDALEPEDLSQAAEQDEMRLELQRSANSNPAMAPRE RKAGCKNFFWKTFTSC
[0175] Full-length CST has the amino acid sequence:
TABLE-US-00005 MYRHKNSWRLGLKYPPSSKEETQVPKTLISGLPGRKSSSRVGEKLQSAHK MPLSPGLLLLLLSGATATAALPLEGGPTGRDSEHMQEAAGIRKSSLLTFL AWWFEWTSQASAGPLIGEEAREVARRQEGAPPQQSARRDRMPCRNFFWKT FSSCK
[0176] Reference to these TMs includes the following fragments (and corresponding variants) thereof:
TABLE-US-00006 NFFWKTF; (R or K)NFFWKTF; C(R or K)NFFWKTF; (P or G)C(R or K)NFFWKTF; NFFWKTF(S or T); NFFWKTF(S or T)S; NFFWKTF(S or T)SC; (R or K)NFFWKTF(S or T); (R or K)NFFWKTF(S or T)S; (R or K)NFFWKTF(S or T)SC; C(R or K)NFFWKTF(S or T); C(R or K)NFFWKTF(S or T)S; C(R or K)NFFWKTF(S or T)SC; (P or G)C(R or K)NFFWKTF(S or T); (P or G)C(R or K)NFFWKTF(S or T)S; or (P or G)C(R or K)NFFWKTF(S or T)C.
[0177] With regard to the above sequences, where a (P or G) alternative is given, a P is preferred in the case of a CST TM, whereas a G is preferred in the case of an SST TM. Where an (R or K) alternative is given, an R is preferred in the case of a CST TM, whereas a K is preferred in the case of an SST TM. Where an (S or T) alternative is given, an S is preferred in the case of a CST TM, whereas a T is preferred in the case of an SST TM.
[0178] Preferred fragments comprise at least 7 or at least 10 amino acid residues, preferably at least 14 or at least 17 amino acid residues, and more preferably at least 28 or 29 amino acid residues. By way of example, preferred sequences include:
TABLE-US-00007 SANSNPAMAPRERKAGCKNFFWKTFTSC (SST-28); AGCKNFFWKTFTSC (SST-14); QEGAPPQQSARRDRMPCRNFFWKTFSSCK (CST-29); QERPPLQQPPHRDKKPCKNFFWKTFSSCK (CST-29); QERPPPQQPPHLDKKPCKNFFWKTFSSCK (CST-29) DRMPCRNFFWKTFSSCK (CST-17); PCRNFFWKTFSSCK (CST-14); and PCKNFFWKTFSSCK (CST-14)
[0179] The TM may comprise a longer amino acid sequence, for example, at least 30 or 35 amino acid residues, or at least 40 or 45 amino acid residues, so long as the TM is able to bind to a normal GH-secreting cell, preferably to an SST or to a CST receptor on a normal GH-secreting cell. In this regard, the TM is preferably a fragment of full-length SST or CST, though including at least the core sequence "NFFWKTF" or one of the above-defined primary amino acid sequences.
[0180] It is routine to confirm that a TM binds to the selected target cell. For example, a simple radioactive displacement experiment may be employed in which tissue or cells representative of a growth hormone-secreting cell are exposed to labelled (eg. tritiated) TM in the presence of an excess of unlabelled TM. In such an experiment, the relative proportions of non-specific and specific binding may be assessed, thereby allowing confirmation that the TM binds to the target cell. Optionally, the assay may include one or more binding antagonists, and the assay may further comprise observing a loss of TM binding. Examples of this type of experiment can be found in Hulme, E. C. (1990), Receptor-binding studies, a brief outline, pp. 303-311, In Receptor biochemistry, A Practical Approach, Ed. E.G. Hulme, Oxford University Press.
[0181] In the context of the present invention, reference to a peptide TM (e.g. GHRH peptide, or leptin peptide) embraces peptide analogues thereof, so long as the analogue binds to the same receptor as the corresponding `reference` TM. Said analogues may include synthetic residues such as:
[0182] β-Nal=β-naphthylalanine
[0183] β-Pal=β-pyridylalanine
[0184] hArg(Bu)=N-guaniclino-(butyl)LLhorroarginine
[0185] hArg(Et)2=N,N'-guanidine-(dimethyl)-homoarginine
[0186] hArg(CH2CF3)2=N,N'-guanidino-bis-(2,2,2,-trifluoroet- hyl)-homoarginine
[0187] hArg(CH3, hexyl)=N,N'-guanidino-(methyl, hexyl)-homoarginine
[0188] Lys(Me)=Ne-methyllysine
[0189] Lys(iPr)=Ne-isopropyllysine
[0190] AmPhe=aminomethylphenyialanine
[0191] AChxAla=aminocyclohexylaianine
[0192] Abu=α-aminobutyric acid
[0193] Tpo=4-thiaproline
[0194] MeLeu=N-methylleucine
[0195] Orn=omithine
[0196] Nie=norleucine
[0197] Nva=norvaline
[0198] Trp(Br)=5-bromo-tryptophan
[0199] Trp(F)=5-fluoro-tryptophan
[0200] Trp(N02)=5-nitro-tryptophan
[0201] Gabe=γ-aminobutyric acid
[0202] Bmp=J-mercaptopropionyl
[0203] Ac=acetyl
[0204] Pen=penciliamine
[0205] By way of example, the above peptide analogue aspect is described in more detail with reference to specific peptide TMs, such as SST peptides, GHRH peptides, bombesin peptides, GnRH peptides, and ghrelin peptides, though the same principle applies to all TMs of the present invention,.
[0206] Somatostatin analogues, which can be used to practice the present invention include, but are not limited to, those described in the following publications, which are hereby incorporated by reference: Van Binst, a et al. Peptide Research 5: 8 (1992); Horvath, A. et al, Abstract, "Conformations of Somatostatin Analogs Having Antitumor Activity", 22nd European peptide Symposium, Sep. 13.-19, 1992, Interlaken, Switzerland; U.S. Pat. No. 5,506,339; EP0363589; U.S. Pat. No. 4,904,842; U.S. Pat. No. 4,871,717; U.S. Pat. No. 4,725,577; U.S. Pat. No. 4,684,620; U.S. Pat. No. 4,650,787; U.S. Pat. No. 4,585,755: U.S. Pat. No. 4,725,577; U.S. Pat. No. 4,522,813; U.S. Pat. No. 4,369,179; U.S. Pat. No. 4,360,516; U.S. Pat. No. 4,328,214; U.S. Pat. No. 4,316,890; U.S. Pat. No. 4,310,518; U.S. Pat. No. 4,291,022; U.S. Pat. No. 4,238,481; U.S. Pat. No. 4,235,886; U.S. Pat. No. 4,211 ,693; U.S. Pat. No. 4,190,648; U.S. Pat. No. 4,146,612; U.S. Pat. No. 4,133,782; U.S. Pat. No. 5,506,339; U.S. Pat. No. 4,261,885; U.S. Pat. No. 4,282,143; U.S. Pat. No. 4,190,575; U.S. Pat. No. 5,552,520; EP0389180; EP0505680; U.S. Pat. No. 4,603,120; EP0030920; U.S. Pat. No. 4,853.371; WO90/12811; WO97/01579; WO91 /18016; WO98/08529 and WO98/08528; WO/0075186 and WO00/06185; WO99/56769; and FR 2,522,655. Each of these publications is incorporated in its entirety by reference thereto.
[0207] Methods for synthesizing analogues are well documented, as illustrated, for example, by the patents cited above. For example, synthesis of H-D-Phe-Phe-Phe-D-Trp-Lys-Thr-Phe-Thr-NH2, can be achieved by following the protocol set forth in Example 1 of EP0395417A1. Similarly, synthesis analogues with a substituted N-terrninus can be achieved, for example, by following the protocol set forth in WO88/02756, EP0329295, and U.S. Pat. No. 5,240,561.
[0208] The use of linear SST analogues are also included within the scope of this invention, for example: H-D-Phe-p-chloro-Phe-Tyr-D-Trp-Lys-Thr-Phe-Thr-NH2; H-D-Phe-p-NO2-Phe-Tyr-D-Trp-Lys-Val-Phe-Thr-NH2; H-D-*Nal-p-chloro-Phe-Tyr-D-Trp-Lys-Val-Phe-Thr-NH2; H-I -Phe-Phe-Phe-D-Trp-Lys-Thr-Phe-Thr-NH2; H-D-Phe-Phe-Tyr-D-Trp-Lys-Val-Phe-Thr-NH2: H-D-Phe-p-chloro-Phe-Tyr-D-Trp-Lys-Val-Phe-Thr-NH2; and H-D-Phe-Ala-Tyr-D-Trp-Lys-Val-Ala-D-beta-Nal-NH2.
[0209] One or more chemical moieties, eg. a sugar derivative, mono or poly-hydroxy (C2-12) alkyl, mono or poly-hydroxy (C2-12) acyl groups, or a piperazine derivative, can be attached to a SST analogue, e g. to the N-terminus amino acid--see WO88/02756, EP0329295, and U.S. Pat. No.5,240,561.
[0210] GHRH peptide analogues date back to the 1990s, and include the `standard antogonist` [Ac-Tyr`, D-Arg2jhGH-RH (1-29) Nha. U.S. Pat. No. 4, 659,693 discloses GH-RH antagonistic analogs which contain certain N,N'-dialkyl-omega-guanidino alpha-amino acyl residues in position 2 of the GH-RH (1-29) sequence. Additional examples are provided in WO91/16923, U.S. Pat. No. 5,550,212, U.S. Pat. No. 5,942,489, U.S. Pat. No. 6,057,422 U.S. Pat. No. 5,942,489, U.S. Pat. No. 6,057,422, WO96/032126, WO96/022782, WO96/016707, WO94/011397, WO94/011396, each of which is herein incorporated by reference thereto.
[0211] Examples of bombesin analogues suitable for use in the present invention include TMs comprising: D-Phe-Gin-Trp-Ala-Val-Gly-His-Leu-Met-NH2 (code named BIM-26218), D-Phe-Gln-Trp-Ala-Val-Gly-His-Leu-Leu-NH2 (code named BIM-26187); D-Cpa-Gin-Trp-Ala-Val-Gly-His-Leu-φ [CH2NH]-Phe-NH2 (code named BlM-26159), and D-Phe-Gln-Trp-Ala-Val-Gly-His-Leu-φ [CH2NH]-Cpa-NH2 (code named BIM-26189); D-Phe-Gin-Trp-Ala-Val-N-methyl-D-Ala-His-Leu-methylester, and D-Fg-Phe-Gin-Trp-Ala-Val-D-Ala-His-Leu-methylester,
[0212] Bombesin analogues include peptides derived from the naturally-occurring, structurally-related peptides, namely, bombesin, neuromedin B, neuromedin C, litorin, and GRP. The relevant amino add sequences of these naturally occurring TM peptides are listed below:
TABLE-US-00008 Bombesin (last 10 aa's): Gly-Asn-Gln-Trp-Ala-Val-Gly-His-Leu-Met-NH2 Neuromedin B: Gly-Asn-Leu-Trp-Ala-Thr-Gly-His-Phe-Met-NH2 Neuromedin C: Gly-Asn-His-Trp-Ala-Val-Gly-His-Leu-Met-NH2 Litorin: pGlu-Gln-Trp-Ala-Val-Gly-His-Phe-Met-NH2 Human GRP (last 10 aa's): Gly-Asn-His-Trp-Ala-Val-Gly-His-Leu-Met-NH2
[0213] Analogs suitable for use in the present invention are described in U.S. Ser. No. 502,438, filed Mar.30, 1990, U.B. Ser. No. 397,169, filed Aug. 21, 1989, U.S. Ser. No. 376,555, filed Jul. 7, 1989, U.S. Ser. No. 394,727, filed Aug. 16, 1989, U.S. Ser. No. 317,941, filed Mar. 2, 1989, U.S. Ser. No. 282,328, filed Dec. 9, 1988, U.S. Ser. No. 257,998, filed Oct. 14, 1988, U.S. Ser. No. 248,771, .filed Sep. 23, 1988, U.S. Ser. No. 207,759, filed Jun. 16, 1988, U.S. Ser. No. 204,171, filed Jun. 8, 1988, U.S. Ser. No. 173,311, filed Mar. 25, 1988, U.S. Ser. No. 100,571, filed Sep. 24, 1987; and U.S. Ser. No. 520,225, filed May 9, 1990, U.S. Ser. No. 440,039, filed Nov. 21, 1989, All these applications are hereby incorporated by reference. Bombesin analogs are also described in Zachary et al., Proc. Nat. Aca. Sci. 82:7616 (1985); Heimbrook et al., "Synthetic Peptides: Approaches to Biological Problems", UCLA Symposium on IViol, and Cell. Biol. New Series, Vol. 86, ed. Tarn and Kaiser; Heinz-Erian et al., Am. J. Physiol. G439 (1986); Martinez et al., J. Med. Chem. 28:1874 (1985); Gargosky et al., Biochem, J. 247;427 (1987); Dubreuil et al. , Drug Design and Delivery, Vol 2:49, Harwood Academic Publishers, GB (1987): Heikkila et al., J. Biol. Chem. 262:16456 (1987); Caranikas et al. , J. Med. Chem. 25:1313 (1982); Saeed et al., Peptides 10;597 (1989); Rosell et al., Trends in Pharmacological Sciences 3:211 (1982); Lundberg et al., Proc. Nat, Ara. Sci. 80:1120, (1983); Engberg et al., Nature 293;222 (1984); Mizrahi et al., Euro, J. Pharma. 82:101 (1982); Leander et al., Nature 294:467 (1981); Woll et al., Biochem. Biophys. Res. Comm. 155:359 (1988); Rider et al., Biochem, 17:1766 (1978); Cuttitta et al., Cancer Surveys 4:707 (1985);
[0214] Aumeias et al., Int. J. Peptide Res. 30;596 (1987); all of which are also hereby incorporated by reference.
[0215] The analogs can be prepared by conventional techniques, such as those described in WO92/20363 and EP0737691. Additional bombesin analogues are described in, for example, WO89/02897, WO91/17181, WO90/03980 and WO91/02746, all of which are herein incorporated by reference thereto.
[0216] Examples of ghrelin analogues suitable for use as a TM of the present invention comprise: Tyr-DTrp-DLys-Trp-DPhe-NH2, Tyr-DTrp-Lys-Trp-DPhe-NH2, His-DTrp-DLys-Trp-DPhe-NH2, His-DTrp-DLys-Phe-DTrp-NH2, His-DTrp-DArg-Trp-DPhe-NH2, His-DTrp-DLys-Trp-DPhe-Lys-NH2, DesaminoTyr-DTrp-Ala-Trp-DPhe-N H2, DesaminoTyr-DTrp-DLys-Trp-DPhe-N H2, DeaminoTyr-DTrp-Ser-Trp-DPhe-Lys-NH2, DesarninoTyr-DTrp-Ser-Trp-DPhe-NH2, His-DTrp-DTrp-Phe-Met-N H2, Tyr-DTrp-DTrp-Phe-Phe-NH2, Glyψ[CH2NH]-DβNal-Ala-Trp-DPhe-Lys-NH2, Glyω[CH2NH]-DbetaNal-DLyS-TrP-DPhe-Lys-NH2, DAla-DbetaNal-DLys-DTrp-Phe-Lys-NH2, His-DbetaNal-DLys-Trp-DPhe-Lys-NH2, Ala-His-DTrp-DLys-Trp-DPhe-Lys-NH2, Alaφ[CH2NH]-DbetaNal-Ala-Trp-DPhe-Lys-NH2, DbetaNal-Ala-Trp-DPhe-Ala-NH2, DAla-DcyclohexylAla-Ala-Phe-DPhe-Nle-NH2, DcyclohexylAia-Ala-Phe-DTrp-Lys-NH2, DAla-DbetaAla-Thr-DThr-Lys-N H2, DcyclohexylAla-Ala-Trp-DPhe-NH2, DAla-DbetaNal-Ala-Ala-DAla-Lys-NH2, DbetaNal-Ala-Trp-DPhe-Leu-NH2, His-13Trp-Phe-Trp-DPhe-Lys-NH2, DAla-DbetaNal-DAla-DTrp-Phe-Lys-NH2, pAla-Trp-DAla-ITrp-Phe-N H2, His-Trp-DAla-DTrp-Phe-LysN H2, DLys-DβNal-Ala-Trp-DPhe-Lys-N H2, DAla-DbetaNal-DLys-DTrp-Phe-Lys-NH2, Tyr-DAla-Phe-Alb-NH2, Tyr-DAla-Sar.NMePhe-NH2, αγAbu-DTrp-DTrp-Ser-NH2, αγAbu-DTrp-DTrp-Lys-NH2, αγAbu-DTrp-DTrp-Orn-NH2, αAbu-DTrp-DTrp-Orn-NH2, DThr-D{acute over (α)}Nal-DTrp-DPro-Arg-NH2, DAla-Ala-DAla-DTrp-Phe-Lys-NH2, Alaω[CH2NH]His-DTrp-Ala-Trp-DPhe-Lys-NH2, Lys-DHis-DTrp-Phe-NH2, γAbu-DTrp-DTrp-Orn-NH2, inip-Trp-Trp-Phe-NH2, Ac-DTrp-Phe-DTrp-Leu-NH2, Ac-DTrp-Phe-DTrp-Lys-N H2, Ac-DTrp-DTrp-Lys-N H2, DLys-Tyr-DTrp-DTrp-Phe-Lys-NH2, Ac-DbetaNal-Leu-Pro-N H2, pAla-Trp-DTrp-DTrp-Orn-NH2, DVal-DαNal-DTrp-Phe-Arg-NH2, DLeu-DαNal-DTrp-Phe-Arg-NH2, NH2, CyclohexylAla-DαNal-DTrp-Phe-Arg-NH2, DTp-DαNal-DTrp-Phe-Arg-NH2, DAla-DβNal-DPro-Phe-Arg-NH2, Ac-DαNal-DTrp-Phe-Arg-NH2, DαNal-DTrp-Phe-Arg-NH2, His-DTrp-DTrp-Lys-N H2, Ac-DpNel-DTrp-N H2, oAlb-DTrp-DcyclohexylAla-NH2, gAlb-DTrp-DAla-cyclohexylAla-N H2, DAla-DcyclohexylAla-Ala-Ale-Phe-DPhe-Nle-NH2, DPhe-Ala-Phe-DPal-NH2, DPhe-Ala-Phe-DPhe-Lys-NH2, DLys-Tyr-DTrp-DTrp-Phe-NH2, Ac-DLys-Tyr-DTrp-DTrp-Phe-NH2, Arg-DTrp-Leu-Tyr-Trp-Pro(cycllc Arg-Pro), Ac-D3Nal-PicLys-iLys-DPhe-NH2, DPal-Phe-DTrp-Phe-Met-NH2, DPhe-Trp-DPhe-Phe-Met-NH2, DPal-Trp-DPhe-Phe-Met-NH2, pAla-Pal-DTrp-DTrp-Om-NH2, αγAbu-Trp-DTrp-DTrp-Orn-NH2, βAla-Trp-DTrp-DTrp-Lys-NH2, γAbu-Trp-DTrp-DTrp-Orn-NH2, Ava-Trp-DTrp-DTrp-Orh-NH2, DLys-Tyr-DTrp-Ala-Trp-DPhe-NH2, His-DTrp-DArg-Trp-DPhe-NH2, <Glu-His-Trp-DSer-DArg-NH2, DPhe-DPhe-DTrp-Met-DLys-NH2, 0-(2-methylallyl)benzophonahe oxime, (R)-2-amino-3-(IH-indol-3-yl)-1-(4-phenylpiperidin-1-Apropan-1-one, N-((R)-1-((R)-1-((S)-3-(IH-indol-3-yl)-1-oxo-1-(4-pheny lpiperidin-1-yl)propan-2-ylemino)-6-amino-1-oxohexan-2-ylamlno)-3-hydroxy- -1-oxopropen-2-yObenzarnide, (S)-N-((S)-3-(1H-indol-3-yl)-1-oxo-1-(4-phehylpiperidin-1-yl)propan-2-yl)- -6-acetarnido-2-((S)-2-amino-3-(benzyloxy)propanamido)hexanamide, (S)-N-((R)-3-(1H-indol-3-yl)-1-oxo-1-(4-phenylpiperldin-1-yl)propan-2-yl)- -2-((S)-2-acetamido-3-(benzyloxy)propahamino-6-amihohexanamide, (R)-N-(3-(1-(4-(2-rnethoxyphenyl)piperidin-1-yl)-1-oxopropan-2-yl)-4-emin- obutanamide, (R)-N-(3-(1H-indol-3-yl)-1-(4-(2-methoxyphehyl)piperidin-1-yl)-1-oxopropa- h-2-yl)-2-amino-2-methylpropanamide, methyl 3-(p-tolylcarbamoyl)-2-naphthoate, ethyl 3-(4-(2-methoxyphehyl)piperidine-1-carbonyl)-2-haphthoate, 3-(2-methoxyphenylcarbamoyl)-2-naphthoate, (S)-2,4-diamino-N-((R)-3-(naphthalen-2-ylmethoxy)-1-oxo-1-(4-phenylpiperi- din-1-yl)propan-2-yl)butanamide, naphthalene-2,3-diylbis((4-(2-methoxyphenyl)piperazin-1-yl)methanone), (R)-2-amino-N-(3-(benzyloxy)-1-oxo-1-(4-phenylpiperazin-1-yl)propan-2-yl)- -2-rnethylpropanamide, or (R)-2-amino-3-(benzyloxy)-1-(4-phenylpiperazin-1-yl)propan-1-one.
[0217] Examples of GnRH analogues suitable for use as a TM in the present invention include those known from, for example, EP171477, WO96/033729, WO92/022322, WO92/013883, and WO91/05563, each of which is herein incorporated by reference thereto. Specific examples comprise: (NAcDQal1,DPtf2,DPAl3,cjsPzACAla5,DPicLys6,DAla.- sup.10)LHRH; (NAcDNal1,DpClPhe2,DPal3,cjsPzACAla5,DNicLys6,IL- ys8,DAla10)LHRH; (NAcDNal1,DpClPhe2,DPal3,Thr4,PicLys5,DPicLys.su- p.6,ILys8,DAla10)LHRH; (NAcDNal1,DpClPhe2,DPal3,PicLys5,DPicLys6,Thr.su- p.7,ILys8,DAla10)LHRH; (NapDThr1, DpClPhe2, DPal3,PicLys5,DPicLys6,ILys8,DAla10)LHRH; (NAcDNal1, DpClPhe2,DPal3,NicLys5,DNicLys6,Thr7,ILys8,- DAla10)LHRH; (NAcDNal1, DpClPhe2,DPal3,Thr4NicLys5,DNicLys6,Thr7,IL- ys8,DAla10)LHRH; (NAcDNal1,DpClPhe2,DPal3,PicLys5,D(PicSar)Lys6,I- Lys8,DAla10)LHRH; (NAcDNal1,DpClPhe2,DPal3,D(PicSar)Lys6,ILys8,DAl- a10)LHRH; (NAcDNal1,DpClPhe2,DPal3,PicLys5,D(6ANic)Lys6,IL- ys8,DAla10)LHRH; (NAcDNal1,DpClPhe2,DPal3,PicLys5,D(6ANic)Orn6,IL- ys8,DAla10)LHRH; (NAcDQal1,DCpa2,DPal3,cisPzACAla5,DPicLys6,NLeu.- sup.7,ILys8,DAla10)LHRH; (NAcDNal1,DCpa2,DPal3)DPicLys5,DAPhe(PicSar)P,ILys.su- p.8,DAlal° LHRH; (NAcDQal1 ,DCpa2,DPal3,PicLys5,DPa16,ILys8,DAla1).deg- ree. LHRH; (NAcDNal1, DCpa2,DPal3,PicLys5,DOrn(ACyp)6,ILys8,DAla1- 0)LHRH; N-acetyl-D-beta-Nal-D-Phe-D-Phe-Ser-Tyr-D-Lys(cyclo-pentyl)-Phe-Ar- g-Pro-D-Ala-NH2; N-acetyl-D-0-Nal-D-Phe-D-Phe-Ser-Tyr-D-Lys(cycloperityl)-Phe-Lys(cyclopen- tyl)-Pro-D-Ala-NH2; N-acetyl-D-beta-Nal-D-Phe-D-Phe-Ser-Tyr-D-Arg-Phe-(isopropyl)D-Lys-Pro-D-- Ala-NH2; N-acetyl -D-beta-Nal-D-Phe-D-Phe-Ser-Tyr-D-Lys(benzyl)-Phe-Arg-Pro-D-Ala-NH2; N-acetyl-D-beta-Nal-D-Phe-D-Phe-Ser-Tyr-D-Lys(Cl-benzyl)-Phe-Arg-Pro-D-Al- a-NH2; N-acetyl-D-beta-Nal-D-Phe-D-Phe-Ser-Tyr-D-Lys(heptyl)-Phe-Arg-Pro-D-Ala-N- H2; N-aoetyl-D-beta-Nal-D-Phe-D-Phe-Ser-Tyr-D-Arg-Phe-Lys-(t-butylmet- hyl)-Pro-D-Ala-NH2; N-acetyl-D-beta-Nal-D-Phe-D-Phe-Ser-Tyr-D-Arg-Phe-Lys-(4-methyl-benzyl)-P- ro-D-Ala-NH2; N-acetyl-D-beta-Nal-D-Phe-D-Phe-Ser-Tyr-D-Arg-Phe-Lys-(benzyl)-Pro-D-Ala-- NH2; N-acetyl-D-beta-Nal-D-p-Cl-Phe-D-Trp-Ser-Tyr-D-p-NH2-Phe-Ph- e-(isopropylps-Pro-D-Ala-NH2; N-acetyl-D-beta-Nal-D-Phe-D-Phe-Ser-Tyr-D-Lys(heptyl)-Phe-Lys-(heptyl)-Pr- o-D-Ala-NH2; N-acetyl-D-3-Nal-D-Phe-D-Phe-Ser-Tyr-D-Lys(1-butylpentyl)-Phe-Lys(1-butyl- pentyl)-Arg-Pro-D-Ala-NH2,
[0218] The polypeptides of the present invention lack a functional HC domain of a clostridial neurotoxin. Accordingly, said polypeptides are not able to bind rat synaptosomal membranes (via a clostridial HC component) in binding assays as described in Shone et al. (1985) Eur. J. Biochem. 151, 75-82. In a preferred embodiment, the polypeptides preferably lack the last 50 C-terminal amino acids of a clostridial neurotoxin holotoxin. In another embodiment, the polypeptides preferably lack the last 100, preferably the last 150, more preferably the last 200, particularly preferably the last 250, and most preferably the last 300 C-terminal amino acid residues of a clostridial neurotoxin holotoxin. Alternatively, the Hc binding activity may be negated/reduced by mutagenesis--by way of example, referring to BoNT/A for convenience, modification of one or two amino acid residue mutations (W1266 to L and Y1267 to F) in the ganglioside binding pocket causes the HC region to lose its receptor binding function. Analogous mutations may be made to non-serotype A clostridial peptide components, e.g. a construct based on botulinum B with mutations (W1262 to L and Y1263 to F) or botulinum E (W1224 to L and Y1225 to F). Other mutations to the active site achieve the same ablation of HC receptor binding activity, e.g. Y1267S in botulinum type A toxin and the corresponding highly conserved residue in the other clostridial neurotoxins. Details of this and other mutations are described in Rummel et al (2004) (Molecular Microbiol. 51:631-634), which is hereby incorporated by reference thereto.
[0219] In another embodiment, the polypeptides of the present invention lack a functional HC domain of a clostridial neurotoxin and also lack any functionally equivalent TM. Accordingly, said polypeptides lack the natural binding function of a clostridial neurotoxin and are not able to bind rat synaptosomal membranes (via a clostridial HC component, or via any functionally equivalent TM) in binding assays as described in Shone et al. (1985) Eur. J. Biochem. 151, 75-82.
[0220] The HC peptide of a native clostridial neurotoxin comprises approximately 400-440 amino acid residues, and consists of two functionally distinct domains of approximately 25kDa each, namely the N-terminal region (commonly referred to as the HCN peptide or domain) and the C-terminal region (commonly referred to as the HCC peptide or domain). This fact is confirmed by the following publications, each of which is herein incorporated in its entirety by reference thereto: Umland T C (1997) Nat. Struct. Biol. 4: 788-792; Herreros J (2000) Biochem. J. 347: 199-204; Halpern J (1993) J. Biol. Chem. 268: 15, pp. 11188-11192; Rummel A (2007) PNAS 104: 359-364; Lacey D B (1998) Nat. Struct. Biol. 5: 898-902; Knapp (1998) Am. Cryst. Assoc. Abstract Papers 25: 90; Swaminathan and Eswaramoorthy (2000) Nat. Struct. Biol. 7: 1751-1759; and Rummel A (2004) Mol. Microbiol. 51(3), 631-643. Moreover, it has been well documented that the C-terminal region (HCC), which constitutes the C-terminal 160-200 amino acid residues, is responsible for binding of a clostridial neurotoxin to its natural cell receptors, namely to nerve terminals at the neuromuscular junction--this fact is also confirmed by the above publications. Thus, reference throughout this specification to a clostridial heavy-chain lacking a functional heavy chain HC peptide (or domain) such that the heavy-chain is incapable of binding to cell surface receptors to which a native clostridial neurotoxin binds means that the clostridial heavy-chain simply lacks a functional HCC peptide. In other words, the HCC peptide region is either partially or wholly deleted, or otherwise modified (e.g. through conventional chemical or proteolytic treatment) to inactivate its native binding ability for nerve terminals at the neuromuscular junction.
[0221] Thus, in one embodiment, a clostridial HN peptide of the present invention lacks part of a C-terminal peptide portion (HCC) of a clostridial neurotoxin and thus lacks the HC binding function of native clostridial neurotoxin. By way of example, in one embodiment, the C-terminally extended clostridial HN peptide lacks the C-terminal 40 amino acid residues, or the C-terminal 60 amino acid residues, or the C-terminal 80 amino acid residues, or the C-terminal 100 amino acid residues, or the C-terminal 120 amino acid residues, or the C-terminal 140 amino acid residues, or the C-terminal 150 amino acid residues, or the C-terminal 160 amino acid residues of a clostridial neurotoxin heavy-chain. In another embodiment, the clostridial HN peptide of the present invention lacks the entire C-terminal peptide portion (HCC) of a clostridial neurotoxin and thus lacks the HC binding function of native clostridial neurotoxin. By way of example, in one embodiment, the clostridial HN peptide lacks the C-terminal 165 amino acid residues, or the C-terminal 170 amino acid residues, or the C-terminal 175 amino acid residues, or the C-terminal 180 amino acid residues, or the C-terminal 185 amino acid residues, or the C-terminal 190 amino acid residues, or the C-terminal 195 amino acid residues of a clostridial neurotoxin heavy-chain. By way of further example, the clostridial HN peptide of the present invention lacks a clostridial HCC reference sequence selected from the group consisting of: [0222] Botulinum type A neurotoxin--amino acid residues (Y1111-L1296) [0223] Botulinum type B neurotoxin--amino acid residues (Y1098-E1291) [0224] Botulinum type C neurotoxin--amino acid residues (Y1112-E1291) [0225] Botulinum type D neurotoxin--amino acid residues (Y1099-E1276) [0226] Botulinum type E neurotoxin--amino acid residues (Y1086-K1252) [0227] Botulinum type F neurotoxin--amino acid residues (Y1106-E1274) [0228] Botulinum type G neurotoxin--amino acid residues (Y1106-E1297) [0229] Tetanus neurotoxin--amino acid residues (Y1128-D1315).
[0230] The above-identified reference sequences should be considered a guide as slight variations may occur according to sub-serotypes.
[0231] The protease of the present invention embraces all non-cytotoxic proteases that are capable of cleaving one or more proteins of the exocytic fusion apparatus in eukaryotic cells.
[0232] The protease of the present invention is preferably a bacterial protease (or fragment thereof). More preferably the bacterial protease is selected from the genera Clostridium or Neisseria/Streptococcus (e.g. a clostridial L-chain, or a neisserial IgA protease preferably from N. gonorrhoeae or S. pneumoniae).
[0233] The present invention also embraces variant non-cytotoxic proteases (ie. variants of naturally-occurring protease molecules), so long as the variant proteases still demonstrate the requisite protease activity. By way of example, a variant may have at least 70%, preferably at least 80%, more preferably at least 90%, and most preferably at least 95 or at least 98% amino acid sequence homology with a reference protease sequence. Thus, the term variant includes non-cytotic proteases having enhanced (or decreased) endopeptidase activity--particular mention here is made to the increased Kcat/Km of BoNT/A mutants Q161A, E54A, and K165L see Ahmed, S. A. (2008) Protein J. DOI 10.1007/s10930-007-9118-8, which is incorporated by reference thereto. The term fragment, when used in relation to a protease, typically means a peptide having at least 150, preferably at least 200, more preferably at least 250, and most preferably at least 300 amino acid residues of the reference protease. As with the TM `fragment` component (discussed above), protease `fragments` of the present invention embrace fragments of variant proteases based on a reference sequence.
[0234] The protease of the present invention preferably demonstrates a serine or metalloprotease activity (e.g. endopeptidase activity). The protease is preferably specific for a SNARE protein (e.g. SNAP-25, synaptobrevin/VAMP, or syntaxin).
[0235] Particular mention is made to the protease domains of neurotoxins, for example the protease domains of bacterial neurotoxins. Thus, the present invention embraces the use of neurotoxin domains, which occur in nature, as well as recombinantly prepared versions of said naturally-occurring neurotoxins.
[0236] Exemplary neurotoxins are produced by clostridia, and the term clostridial neurotoxin embraces neurotoxins produced by C. tetani (TeNT), and by C. botulinum (BoNT) serotypes A-G, as well as the closely related BoNT-like neurotoxins produced by C. baratii and C. butyricum. The above-mentioned abbreviations are used throughout the present specification. For example, the nomenclature BoNT/A denotes the source of neurotoxin as BoNT (serotype A). Corresponding nomenclature applies to other BoNT serotypes.
[0237] BoNTs are the most potent toxins known, with median lethal dose (LD50) values for mice ranging from 0.5 to 5 ng/kg depending on the serotype. BoNTs are adsorbed in the gastrointestinal tract, and, after entering the general circulation, bind to the presynaptic membrane of cholinergic nerve terminals and prevent the release of their neurotransmitter acetylcholine. BoNT/B, BoNT/D, BoNT/F and BoNT/G cleave synaptobrevin/vesicle-associated membrane protein (VAMP); BoNT/C, BoNT/A and BoNT/E cleave the synaptosomal-associated protein of 25 kDa (SNAP-25); and BoNT/C cleaves syntaxin.
[0238] BoNTs share a common structure, being di-chain proteins of ˜150 kDa, consisting of a heavy chain (H-chain) of ˜100 kDa covalently joined by a single disulphide bond to a light chain (L-chain) of ˜50 kDa. The H-chain consists of two domains, each of ˜50 kDa. The C-terminal domain (HC) is required for the high-affinity neuronal binding, whereas the N-terminal domain (HN) is proposed to be involved in membrane translocation. The L-chain is a zinc-dependent metalloprotease responsible for the cleavage of the substrate SNARE protein.
[0239] The term L-chain fragment means a component of the L-chain of a neurotoxin, which fragment demonstrates a metalloprotease activity and is capable of proteolytically cleaving a vesicle and/or plasma membrane associated protein involved in cellular exocytosis.
[0240] Examples of suitable protease (reference) sequences include: [0241] Botulinum type A neurotoxin--amino acid residues (1-448) [0242] Botulinum type B neurotoxin--amino acid residues (1-440) [0243] Botulinum type C neurotoxin--amino acid residues (1-441) [0244] Botulinum type D neurotoxin--amino acid residues (1-445) [0245] Botulinum type E neurotoxin--amino acid residues (1-422) [0246] Botulinum type F neurotoxin--amino acid residues (1-439) [0247] Botulinum type G neurotoxin--amino acid residues (1-441) [0248] Tetanus neurotoxin--amino acid residues (1-457) [0249] IgA protease--amino acid residues (1-959)* [0250] Pohlner, J. et al. (1987). Nature 325, pp. 458-462, which is hereby incorporated by reference thereto.
[0251] The above-identified reference sequence should be considered a guide as slight variations may occur according to sub-serotypes. By way of example, US 2007/0166332 (hereby incorporated by reference thereto) cites slightly different clostridial sequences: [0252] Botulinum type A neurotoxin--amino acid residues (M1-K448) [0253] Botulinum type B neurotoxin--amino acid residues (M1-K441) [0254] Botulinum type C neurotoxin--amino acid residues (M1-K449) [0255] Botulinum type D neurotoxin--amino acid residues (M1-R445) [0256] Botulinum type E neurotoxin--amino acid residues (M1-R422) [0257] Botulinum type F neurotoxin--amino acid residues (M1-K439) [0258] Botulinum type G neurotoxin--amino acid residues (M1-K446) [0259] Tetanus neurotoxin--amino acid residues (M1-A457)
[0260] A variety of clostridial toxin fragments comprising the light chain can be useful in aspects of the present invention with the proviso that these light chain fragments can specifically target the core components of the neurotransmitter release apparatus and thus participate in executing the overall cellular mechanism whereby a clostridial toxin proteolytically cleaves a substrate. The light chains of clostridial toxins are approximately 420-460 amino acids in length and comprise an enzymatic domain. Research has shown that the entire length of a clostridial toxin light chain is not necessary for the enzymatic activity of the enzymatic domain. As a non-limiting example, the first eight amino acids of the BoNT/A light chain are not required for enzymatic activity.
[0261] As another non-limiting example, the first eight amino acids of the TeNT light chain are not required for enzymatic activity. Likewise, the carboxyl-terminus of the light chain is not necessary for activity. As a non-limiting example, the last 32 amino acids of the BoNT/A light chain (residues 417-448) are not required for enzymatic activity. As another non-limiting example, the last 31 amino acids of the TeNT light chain (residues 427-457) are not required for enzymatic activity. Thus, aspects of this embodiment can include clostridial toxin light chains comprising an enzymatic domain having a length of, for example, at least 350 amino acids, at least 375 amino acids, at least 400 amino acids, at least 425 amino acids and at least 450 amino acids. Other aspects of this embodiment can include clostridial toxin light chains comprising an enzymatic domain having a length of, for example, at most 350 amino acids, at most 375 amino acids, at most 400 amino acids, at most 425 amino acids and at most 450 amino acids.
[0262] The polypeptides of the present invention, especially the protease component thereof, may be PEGylated--this may help to increase stability, for example duration of action of the protease component. PEGylation is particularly preferred when the protease comprises a BoNT/A, B or C1 protease. PEGylation preferably includes the addition of PEG to the N-terminus of the protease component. By way of example, the N-terminus of a protease may be extended with one or more amino acid (e.g. cysteine) residues, which may be the same or different. One or more of said amino acid residues may have its own PEG molecule attached (e.g. covalently attached) thereto. An example of this technology is described in WO2007/104567, which is incorporated in its entirety by reference thereto.
[0263] A Translocation Domain is a molecule that enables translocation of a protease into a target cell such that a functional expression of protease activity occurs within the cytosol of the target cell. Whether any molecule (e.g. a protein or peptide) possesses the requisite translocation function of the present invention may be confirmed by any one of a number of conventional assays.
[0264] For example, Shone C. (1987) describes an in vitro assay employing liposomes, which are challenged with a test molecule. Presence of the requisite translocation function is confirmed by release from the liposomes of K.sup.+ and/or labelled NAD, which may be readily monitored [see Shone C. (1987) Eur. J. Biochem; vol. 167(1): pp. 175-180].
[0265] A further example is provided by Blaustein R. (1987), which describes a simple in vitro assay employing planar phospholipid bilayer membranes. The membranes are challenged with a test molecule and the requisite translocation function is confirmed by an increase in conductance across said membranes [see Blaustein (1987) FEBS Letts; vol. 226, no. 1: pp. 115-120].
[0266] Additional methodology to enable assessment of membrane fusion and thus identification of Translocation Domains suitable for use in the present invention are provided by Methods in Enzymology Vol 220 and 221, Membrane Fusion Techniques, Parts A and B, Academic Press 1993.
[0267] The present invention also embraces variant translocation domains, so long as the variant domains still demonstrate the requisite translocation activity. By way of example, a variant may have at least 70%, preferably at least 80%, more preferably at least 90%, and most preferably at least 95% or at least 98% amino acid sequence homology with a reference translocation domain. The term fragment, when used in relation to a translocation domain, means a peptide having at least 20, preferably at least 40, more preferably at least 80, and most preferably at least 100 amino acid residues of the reference translocation domain. In the case of a clostridial translocation domain, the fragment preferably has at least 100, preferably at least 150, more preferably at least 200, and most preferably at least 250 amino acid residues of the reference translocation domain (eg. HN domain). As with the TM `fragment` component (discussed above), translocation `fragments` of the present invention embrace fragments of variant translocation domains based on the reference sequences.
[0268] The Translocation Domain is preferably capable of formation of ion-permeable pores in lipid membranes under conditions of low pH. Preferably it has been found to use only those portions of the protein molecule capable of pore-formation within the endosomal membrane.
[0269] The Translocation Domain may be obtained from a microbial protein source, in particular from a bacterial or viral protein source. Hence, in one embodiment, the Translocation Domain is a translocating domain of an enzyme, such as a bacterial toxin or viral protein.
[0270] It is well documented that certain domains of bacterial toxin molecules are capable of forming such pores. It is also known that certain translocation domains of virally expressed membrane fusion proteins are capable of forming such pores. Such domains may be employed in the present invention.
[0271] The Translocation Domain may be of a clostridial origin, such as the HN domain (or a functional component thereof). HN means a portion or fragment of the H-chain of a clostridial neurotoxin approximately equivalent to the amino-terminal half of the H-chain, or the domain corresponding to that fragment in the intact H-chain. The H-chain lacks the natural binding function of the HC component of the H-chain. In this regard, the HC function may be removed by deletion of the HC amino acid sequence (either at the DNA synthesis level, or at the post-synthesis level by nuclease or protease treatment). Alternatively, the HC function may be inactivated by chemical or biological treatment. Thus, the H-chain is incapable of binding to the Binding Site on a target cell to which native clostridial neurotoxin (i.e. holotoxin) binds.
[0272] Examples of suitable (reference) Translocation Domains include: [0273] Botulinum type A neurotoxin--amino acid residues (449-871) [0274] Botulinum type B neurotoxin--amino acid residues (441-858) [0275] Botulinum type C neurotoxin--amino acid residues (442-866) [0276] Botulinum type D neurotoxin--amino acid residues (446-862) [0277] Botulinum type E neurotoxin--amino acid residues (423-845) [0278] Botulinum type F neurotoxin--amino acid residues (440-864) [0279] Botulinum type G neurotoxin--amino acid residues (442-863) [0280] Tetanus neurotoxin--amino acid residues (458-879)
[0281] The above-identified reference sequence should be considered a guide as slight variations may occur according to sub-serotypes. By way of example,
[0282] US 2007/0166332 (hereby incorporated by reference thereto) cites slightly different clostridial sequences: [0283] Botulinum type A neurotoxin--amino acid residues (A449-K871) [0284] Botulinum type B neurotoxin--amino acid residues (A442-S858) [0285] Botulinum type C neurotoxin--amino acid residues (T450-N866) [0286] Botulinum type D neurotoxin--amino acid residues (D446-N862) [0287] Botulinum type E neurotoxin--amino acid residues (K423-K845) [0288] Botulinum type F neurotoxin--amino acid residues (A440-K864) [0289] Botulinum type G neurotoxin--amino acid residues (S447-S863) [0290] Tetanus neurotoxin--amino acid residues (5458-V879)
[0291] In the context of the present invention, a variety of Clostridial toxin HN regions comprising a translocation domain can be useful in aspects of the present invention with the proviso that these active fragments can facilitate the release of a non-cytotoxic protease (e.g. a clostridial L-chain) from intracellular vesicles into the cytoplasm of the target cell and thus participate in executing the overall cellular mechanism whereby a clostridial toxin proteolytically cleaves a substrate. The HN regions from the heavy chains of Clostridial toxins are approximately 410-430 amino acids in length and comprise a translocation domain. Research has shown that the entire length of a HN region from a Clostridial toxin heavy chain is not necessary for the translocating activity of the translocation domain. Thus, aspects of this embodiment can include clostridial toxin HN regions comprising a translocation domain having a length of, for example, at least 350 amino acids, at least 375 amino acids, at least 400 amino acids and at least 425 amino acids. Other aspects of this embodiment can include clostridial toxin HN regions comprising translocation domain having a length of, for example, at most 350 amino acids, at most 375 amino acids, at most 400 amino acids and at most 425 amino acids.
[0292] For further details on the genetic basis of toxin production in Clostridium botulinum and C. tetani, we refer to Henderson et al (1997) in The Clostridia: Molecular Biology and Pathogenesis, Academic press.
[0293] The term HN embraces naturally-occurring neurotoxin HN portions, and modified HN portions having amino acid sequences that do not occur in nature and/or synthetic amino acid residues, so long as the modified HN portions still demonstrate the above-mentioned translocation function.
[0294] Alternatively, the Translocation Domain may be of a non-clostridial origin. Examples of non-clostridial (reference) Translocation Domain origins include, but not be restricted to, the translocation domain of diphtheria toxin [O=Keefe et al., Proc. Natl. Acad. Sci. USA (1992) 89, 6202-6206; Silverman et al., J. Biol. Chem. (1993) 269, 22524-22532; and London, E. (1992) Biochem. Biophys. Acta., 1112, pp.25-51], the translocation domain of Pseudomonas exotoxin type A [Prior et al. Biochemistry (1992) 31, 3555-3559], the translocation domains of anthrax toxin [Blanke et al. Proc. Natl. Acad. Sci. USA (1996) 93, 8437-8442], a variety of fusogenic or hydrophobic peptides of translocating function [Plank et al. J. Biol. Chem. (1994) 269, 12918-12924; and Wagner et al (1992) PNAS, 89, pp.7934-7938], and amphiphilic peptides [Murata et al (1992) Biochem., 31, pp. 1986-1992]. The Translocation Domain may mirror the Translocation Domain present in a naturally-occurring protein, or may include amino acid variations so long as the variations do not destroy the translocating ability of the Translocation Domain.
[0295] Particular examples of viral (reference) Translocation Domains suitable for use in the present invention include certain translocating domains of virally expressed membrane fusion proteins. For example, Wagner et al. (1992) and Murata et al. (1992) describe the translocation (i.e. membrane fusion and vesiculation) function of a number of fusogenic and amphiphilic peptides derived from the N-terminal region of influenza virus haemagglutinin. Other virally expressed membrane fusion proteins known to have the desired translocating activity are a translocating domain of a fusogenic peptide of Semliki Forest Virus (SFV), a translocating domain of vesicular stomatitis virus (VSV) glycoprotein G, a translocating domain of SER virus F protein and a translocating domain of Foamy virus envelope glycoprotein. Virally encoded Aspike proteins have particular application in the context of the present invention, for example, the El protein of SFV and the G protein of the G protein of VSV.
[0296] Use of the (reference) Translocation Domains listed in Table (below) includes use of sequence variants thereof. A variant may comprise one or more conservative nucleic acid substitutions and/or nucleic acid deletions or insertions, with the proviso that the variant possesses the requisite translocating function. A variant may also comprise one or more amino acid substitutions and/or amino acid deletions or insertions, so long as the variant possesses the requisite translocating function.
TABLE-US-00009 Translocation Amino acid Domain source residues References Diphtheria toxin 194-380 Silverman et al., 1994, J. Biol. Chem. 269, 22524-22532 London E., 1992, Biochem. Biophys. Acta., 1113, 25-51 Domain II of 405-613 Prior et al., 1992, Biochemistry pseudomonas 31, 3555-3559 exotoxin Kihara & Pastan, 1994, Bioconj Chem. 5, 532-538 Influenza virus GLFGAIAGFIENGWE Plank et al., 1994, J. Biol. Chem. haemagglutinin GMIDGWYG, and 269, 12918-12924 Variants thereof Wagner et al., 1992, PNAS, 89, 7934-7938 Murata et al., 1992, Biochemistry 31, 1986-1992 Semliki Forest virus Translocation domain Kielian et al., 1996, J Cell Biol. fusogenic protein 134(4), 863-872 Vesicular Stomatitis 118-139 Yao et al., 2003, Virology 310(2), virus glycoprotein G 319-332 SER virus F protein Translocation domain Seth et al., 2003, J Virol 77(11) 6520-6527 Foamy virus Translocation domain Picard-Maureau et al., 2003, J envelope Virol. 77(8), 4722-4730 glycoprotein
[0297] The polypeptides of the present invention may further comprise a translocation facilitating domain. Said domain facilitates delivery of the non-cytotoxic protease into the cytosol of the target cell and are described, for example, in WO 08/008803 and WO 08/008805, each of which is herein incorporated by reference thereto.
[0298] By way of example, suitable translocation facilitating domains include an enveloped virus fusogenic peptide domain, for example, suitable fusogenic peptide domains include influenzavirus fusogenic peptide domain (e.g. influenza A virus fusogenic peptide domain of 23 amino acids), alphavirus fusogenic peptide domain (e.g. Semliki Forest virus fusogenic peptide domain of 26 amino acids), vesiculovirus fusogenic peptide domain (e.g. vesicular stomatitis virus fusogenic peptide domain of 21 amino acids), respirovirus fusogenic peptide domain (e.g. Sendai virus fusogenic peptide domain of 25 amino acids), morbiliivirus fusogenic peptide domain (e.g. Canine distemper virus fusogenic peptide domain of 25 amino acids), avulavirus fusogenic peptide domain (e.g. Newcastle disease virus fusogenic peptide domain of 25 amino acids), henipavirus fusogenic peptide domain (e.g. Hendra virus fusogenic peptide domain of 25 amino acids), metapneumovirus fusogenic peptide domain (e.g. Human metapneumovirus fusogenic peptide domain of 25 amino acids) or spumavirus fusogenic peptide domain such as simian foamy virus fusogenic peptide domain; or fragments or variants thereof.
[0299] By way of further example, a translocation facilitating domain may comprise a Clostridial toxin HCN domain or a fragment or variant thereof. In more detail, a Clostridial toxin HCN translocation facilitating domain may have a length of at least 200 amino acids, at least 225 amino acids, at least 250 amino acids, at least 275 amino acids. In this regard, a Clostridial toxin HCN translocation facilitating domain preferably has a length of at most 200 amino acids, at most 225 amino acids, at most 250 amino acids, or at most 275 amino acids. Specific (reference) examples include: [0300] Botulinum type A neurotoxin--amino acid residues (872-1110) [0301] Botulinum type B neurotoxin--amino acid residues (859-1097) [0302] Botulinum type C neurotoxin--amino acid residues (867-1111) [0303] Botulinum type D neurotoxin--amino acid residues (863-1098) [0304] Botulinum type E neurotoxin--amino acid residues (846-1085) [0305] Botulinum type F neurotoxin--amino acid residues (865-1105) [0306] Botulinum type G neurotoxin--amino acid residues (864-1105) [0307] Tetanus neurotoxin--amino acid residues (880-1127)
[0308] The above sequence positions may vary a little according to serotype/sub-type, and further examples of suitable (reference) Clostridial toxin HCN domains include: [0309] Botulinum type A neurotoxin--amino acid residues (874-1110) [0310] Botulinum type B neurotoxin--amino acid residues (861-1097) [0311] Botulinum type C neurotoxin--amino acid residues (869-1111) [0312] Botulinum type D neurotoxin--amino acid residues (865-1098) [0313] Botulinum type E neurotoxin--amino acid residues (848-1085) [0314] Botulinum type F neurotoxin--amino acid residues (867-1105) [0315] Botulinum type G neurotoxin--amino acid residues (866-1105) [0316] Tetanus neurotoxin--amino acid residues (882-1127)
[0317] Any of the above-described facilitating domains may be combined with any of the previously described translocation domain peptides that are suitable for use in the present invention. Thus, by way of example, a non-clostridial facilitating domain may be combined with non-clostridial translocation domain peptide or with clostridial translocation domain peptide. Alternatively, a Clostridial toxin HCN translocation facilitating domain may be combined with a non-clostridal translocation domain peptide. Alternatively, a Clostridial toxin HCN facilitating domain may be combined or with a clostridial translocation domain peptide, examples of which include: [0318] Botulinum type A neurotoxin--amino acid residues (449-1110) [0319] Botulinum type B neurotoxin--amino acid residues (442-1097) [0320] Botulinum type C neurotoxin--amino acid residues (450-1111) [0321] Botulinum type D neurotoxin--amino acid residues (446-1098) [0322] Botulinum type E neurotoxin--amino acid residues (423-1085) [0323] Botulinum type F neurotoxin--amino acid residues (440-1105) [0324] Botulinum type G neurotoxin--amino acid residues (447-1105) [0325] Tetanus neurotoxin--amino acid residues (458-1127)
[0326] Sequence Homology:
[0327] Any of a variety of sequence alignment methods can be used to determine percent identity, including, without limitation, global methods, local methods and hybrid methods, such as, e.g., segment approach methods. Protocols to determine percent identity are routine procedures within the scope of one skilled in the. Global methods align sequences from the beginning to the end of the molecule and determine the best alignment by adding up scores of individual residue pairs and by imposing gap penalties. Non-limiting methods include, e.g., CLUSTAL W, see, e.g., Julie D. Thompson et al., CLUSTAL W: Improving the Sensitivity of Progressive Multiple Sequence Alignment Through Sequence Weighting, Position-Specific Gap Penalties and Weight Matrix Choice, 22(22) Nucleic Acids Research 4673-4680 (1994); and iterative refinement, see, e.g., Osamu Gotoh, Significant Improvement in Accuracy of Multiple Protein. Sequence Alignments by Iterative Refinement as Assessed by Reference to Structural Alignments, 264(4) J. Mol. Bid, 823-838 (1996), Local methods align sequences by identifying one or more conserved motifs shared by all of the input sequences, Non-limiting methods include, e.g., Match-box, see, e.g., Eric Depiereux and Ernest Feytmans, Match-Box: A Fundamentally New Algorithm for the Simultaneous Alignment of Several Protein Sequences, 8(5) CABIOS 501-509 (1992); Gibbs sampling, see, e.g., C. E. Lawrence et al., Detecting Subtle Sequence Signals: A Gibbs Sampling Strategy for Multiple Alignment, 262(5131) Science 208-214 (1993); Align-M, see, e.g., Ivo Van WaIle et al., Align-M--A New Algorithm for Multiple Alignment of Highly Divergent Sequences, 20(9) Bioinformatics:1428-1435 (2004).
[0328] Thus, percent sequence identity is determined by conventional methods. See, for example, Altschul et al., Bull. Math. Bio. 48: 603-16, 1986 and Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-19, 1992. Briefly, two amino acid sequences are aligned to optimize the alignment scores using a gap opening penalty of 10, a gap extension penalty of 1, and the "blosum 62" scoring matrix of Henikoff and Henikoff (ibid.) as shown below (amino acids are indicated by the standard one-letter codes).
TABLE-US-00010 Alignment scores for determining sequence identity A R N D C Q E G H I L K M F P S T W Y V A 4 R -1 5 N -2 0 6 D -2 -2 1 6 C 0 -3 -3 -3 9 Q -1 1 0 0 -3 5 E -1 0 0 2 -4 2 5 G 0 -2 0 -1 -3 -2 -2 6 H -2 0 1 -1 -3 0 0 -2 8 I -1 -3 -3 -3 -1 -3 -3 -4 -3 4 L -1 -2 -3 -4 -1 -2 -3 -4 -3 2 4 K -1 2 0 -1 -3 1 1 -2 -1 -3 -2 5 M -1 -1 -2 -3 -1 0 -2 -3 -2 1 2 -1 5 F -2 -3 -3 -3 -2 -3 -3 -3 -1 0 0 -3 0 6 P -1 -2 -2 -1 -3 -1 -1 -2 -2 -3 -3 -1 -2 -4 7 S 1 -1 1 0 -1 0 0 0 -1 -2 -2 0 -1 -2 -1 4 T 0 -1 0 -1 -1 -1 -1 -2 -2 -1 -1 -1 -1 -2 -1 1 5 W -3 -3 -4 -4 -2 -2 -3 -2 -2 -3 -2 -3 -1 1 -4 -3 -2 11 Y -2 -2 -2 -3 -2 -1 -2 -3 2 -1 -1 -2 -1 3 -3 -2 -2 2 7 V 0 -3 -3 -3 -1 -2 -2 -3 -3 3 1 -2 1 -1 -2 -2 0 -3 -1 4
[0329] The percent identity is then calculated as:
Total number of identical matches [ length of the longer sequence plus the number of gaps introduced into the longer sequence in order to align the two sequences ] × 100 ##EQU00001##
[0330] Substantially homologous polypeptides are characterized as having one or more amino acid substitutions, deletions or additions. These changes are preferably of a minor nature, that is conservative amino acid substitutions (see below) and other substitutions that do not significantly affect the folding or activity of the polypeptide; small deletions, typically of one to about 30 amino acids; and small amino- or carboxyl-terminal extensions, such as an amino-terminal methionine residue, a small linker peptide of up to about 20-25 residues, or an affinity tag.
[0331] Conservative Amino Acid Substitutions
[0332] Basic: arginine [0333] lysine [0334] histidine
[0335] Acidic: glutamic acid [0336] aspartic acid
[0337] Polar: glutamine [0338] asparagine
[0339] Hydrophobic: leucine [0340] isoleucine [0341] valine
[0342] Aromatic: phenylalanine [0343] tryptophan [0344] tyrosine
[0345] Small: glycine [0346] alanine [0347] serine [0348] threonine [0349] methionine
[0350] In addition to the 20 standard amino acids, non-standard amino acids (such as 4-hydroxyproline, 6-N-methyl lysine, 2-aminoisobutyric acid, isovaline and α-methyl serine) may be substituted for amino acid residues of the polypeptides of the present invention. A limited number of non-conservative amino acids, amino acids that are not encoded by the genetic code, and unnatural amino acids may be substituted for clostridial polypeptide amino acid residues. The polypeptides of the present invention can also comprise non-naturally occurring amino acid residues.
[0351] Non-naturally occurring amino acids include, without limitation, trans-3-methylproline, 2,4-methano-proline, cis-4-hydroxyproline, trans-4-hydroxy-proline, N-methylglycine, allo-threonine, methyl-threonine, hydroxy-ethylcysteine, hydroxyethylhomo-cysteine, nitro-glutamine, homoglutamine, pipecolic acid, tert-leucine, norvaline, 2-azaphenylalanine, 3-azaphenyl-alanine, 4-azaphenyl-alanine, and 4-fluorophenylalanine. Several methods are known in the art for incorporating non-naturally occurring amino acid residues into proteins. For example, an in vitro system can be employed wherein nonsense mutations are suppressed using chemically aminoacylated suppressor tRNAs. Methods for synthesizing amino acids and aminoacylating tRNA are known in the art. Transcription and translation of plasmids containing nonsense mutations is carried out in a cell free system comprising an E. coli S30 extract and commercially available enzymes and other reagents. Proteins are purified by chromatography. See, for example, Robertson et al., J. Am. Chem. Soc. 113:2722, 1991; Ellman et al., Methods Enzymol. 202:301, 1991; Chung et al., Science 259:806-9, 1993; and Chung et al., Proc. Natl. Acad. Sci. USA 90:10145-9, 1993). In a second method, translation is carried out in Xenopus oocytes by microinjection of mutated mRNA and chemically aminoacylated suppressor tRNAs (Turcatti et al., J. Biol. Chem. 271:19991-8, 1996). Within a third method, E. coli cells are cultured in the absence of a natural amino acid that is to be replaced (e.g., phenylalanine) and in the presence of the desired non-naturally occurring amino acid(s) (e.g., 2-azaphenylalanine, 3-azaphenylalanine, 4-azaphenylalanine, or 4-fluorophenylalanine). The non-naturally occurring amino acid is incorporated into the polypeptide in place of its natural counterpart. See, Koide et al., Biochem. 33:7470-6, 1994. Naturally occurring amino acid residues can be converted to non-naturally occurring species by in vitro chemical modification. Chemical modification can be combined with site-directed mutagenesis to further expand the range of substitutions (Wynn and Richards, Protein Sci. 2:395-403, 1993).
[0352] A limited number of non-conservative amino acids, amino acids that are not encoded by the genetic code, non-naturally occurring amino acids, and unnatural amino acids may be substituted for amino acid residues of polypeptides of the present invention.
[0353] Essential amino acids in the polypeptides of the present invention can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, Science 244: 1081-5, 1989). Sites of biological interaction can also be determined by physical analysis of structure, as determined by such techniques as nuclear magnetic resonance, crystallography, electron diffraction or photoaffinity labeling, in conjunction with mutation of putative contact site amino acids. See, for example, de Vos et al., Science 255:306-12, 1992; Smith et al., J. Mol. Biol. 224:899-904, 1992; Wlodaver et al., FEBS Lett. 309:59-64, 1992. The identities of essential amino acids can also be inferred from analysis of homologies with related components (e.g. the translocation or protease components) of the polypeptides of the present invention.
[0354] Multiple amino acid substitutions can be made and tested using known methods of mutagenesis and screening, such as those disclosed by Reidhaar-Olson and Sauer (Science 241:53-7, 1988) or Bowie and Sauer (Proc. Natl. Acad. Sci. USA 86:2152-6, 1989). Briefly, these authors disclose methods for simultaneously randomizing two or more positions in a polypeptide, selecting for functional polypeptide, and then sequencing the mutagenized polypeptides to determine the spectrum of allowable substitutions at each position. Other methods that can be used include phage display (e.g., Lowman et al., Biochem. 30:10832-7, 1991; Ladner et al., U.S. Pat. No. 5,223,409; Huse, WIPO Publication WO 92/06204) and region-directed mutagenesis (Derbyshire et al., Gene 46:145, 1986; Ner et al., DNA 7:127, 1988).
[0355] Multiple amino acid substitutions can be made and tested using known methods of mutagenesis and screening, such as those disclosed by Reidhaar-Olson and Sauer (Science 241:53-7, 1988) or Bowie and Sauer (Proc. Natl. Acad. Sci. USA 86:2152-6, 1989). Briefly, these authors disclose methods for simultaneously randomizing two or more positions in a polypeptide, selecting for functional polypeptide, and then sequencing the mutagenized polypeptides to determine the spectrum of allowable substitutions at each position. Other methods that can be used include phage display (e.g., Lowman et al., Biochem. 30:10832-7, 1991; Ladner et al., U.S. Pat. No. 5,223,409; Huse, WIPO Publication WO 92/06204) and region-directed mutagenesis (Derbyshire et al., Gene 46:145, 1986; Ner et al., DNA 7:127, 1988).
[0356] There now follows a brief description of the Figures, which illustrate aspects and/or embodiments of the present invention.
[0357] FIG. 1--Purification of LHN/D-CT-CST28 Fusion Protein
[0358] Using the methodology outlined in Example 3, a LHN/D-CT-CST28 fusion protein was purified from E. coli BL21 (DE3) cells. Briefly, the soluble products obtained following cell disruption were applied to a nickel-charged affinity capture column. Bound proteins were eluted with 200 mM imidazole, treated with enterokinase to activate the fusion protein and then re-applied to a second nickel-charged affinity capture column. Samples from the purification procedure were assessed by SDS-PAGE. Lane 1: First nickel chelating Sepharose column eluant, Lane 2: Second nickel chelating Sepharose column eluant under non-reducing conditions, Lane 3: Second nickel chelating Sepharose column eluant under reducing conditions, lane 4: Molecular mass markers (kDa).
[0359] FIG. 2--Purification of LHN/A-CT-SST14 Fusion Protein
[0360] Using the methodology outlined in Example 3, an LHN/A-CT-SST14 fusion protein was purified from E. coli BL21 (DE3) cells. Briefly, the soluble products obtained following cell disruption were applied to a nickel-charged affinity capture column. Bound proteins were eluted with 200 mM imidazole, treated with Factor Xa to activate the fusion protein and then re-applied to a second nickel-charged affinity capture column. Samples from the purification procedure were assessed by SDS-PAGE. Lane 1: First nickel chelating Sepharose column eluant, Lane 2: Molecular mass markers (kDa), Lanes 3-4: Second nickel chelating Sepharose column eluant under non-reducing conditions, Lanes 5-6: Second nickel chelating Sepharose column eluant under reducing conditions.
[0361] FIGS. 3-47--Box Plots Showing Up-Regulation of SNARE Protein mRNA in Cancer Cells
[0362] Statistical analysis of the differences in SNARE expression between normal and primary cancer tissues was completed through use of Oncomine algorithms (Compendia Bioscience, Ann Arbor, Mich., USA) and gene microarray analysis tool. SNARE mRNA expression was compared to the median expression of all other genes in the respective study, for which a Normalised expression value was generated. Only studies with analysis results with P<0.05 are illustrated. In more detail, FIGS. 3-10 illustrate SNAP-25 expression profiles in different cancer versus non-cancer cells. FIGS. 11-16 illustrate syntaxin-1 expression profiles in different cancer versus non-cancer cells. FIGS. 17-20 illustrate syntaxin-2 expression profiles in different cancer versus non-cancer cells. FIGS. 21-24 illustrate syntaxin-3 expression profiles in different cancer versus non-cancer cells. FIGS. 25-32 illustrate VAMP-1 expression profiles in different cancer versus non-cancer cells. FIGS. 33-38 illustrate VAMP-2 expression profiles in different cancer versus non-cancer cells. FIGS. 39-45 illustrate VAMP-3 expression profiles in different cancer versus non-cancer cells. In all cases, the data illustrate up-regulation of SNARE proteins in cancer cells. FIG. 46 illustrates a statistically significant difference in syntaxin-1 mRNA expression level between patients with recurrent invasive ductal breast cancer at 5 years post diagnosis and those patients without recurrence. FIG. 47 illustrates a statistically significant difference in syntaxin-1 mRNA expression level in breast cancer patients in whom a metastatic event has occurred at 5 years post diagnosis versus those without metastases. Data is presented in box-plot form (see FIG. 3 for explanation) and/or as a histogram, where each bar represents the normalised gene expression level in each patient tissue sample.
[0363] FIG. 48--Profiling Cell Surface Receptor and SNARE Protein Expression in Renal Cell Carcinoma Lines by Western Blot Analysis.
[0364] Protein extracts from eight human renal cell carcinoma (RCC) lines grown in vitro were prepared in standard Laemmli sample buffer (lanes 3-10). Protein extracts prepared from primary cultures of rat spinal cord neurons and dorsal root ganglia were included as controls. Proteins were separated on 10% SDS-polyacrylamide gels and electrophoretically transferred to a nitrocellulose membrane. After blocking of the membrane, primary antisera were used to probe for each specific protein and detection was enabled by a peroxidise-conjugated anti-species IgG. The receptor proteins detected were ErbB receptor (epidermal growth factor receptor, EGF) and growth hormone releasing hormone receptor (GHRHR). The SNARE proteins tested were SNAP-25, syntaxin-2 and syntaxin-3. Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was used as a protein loading control.
[0365] FIG. 49--Detection of SNAP-25 Protein Cleavage in a Renal Cell Carcinoma Line by Western Blot Analysis After 24 Hr Treatment with an EGF-LHA Fusion.
[0366] FIG. 49 shows the effect of 24 hour treatment of 786-0 human RCC cells with a number of EGF-liganded fusion protein molecules. In more detail, EGF-LHA, at doses between 1 and 100 nM, generated cleaved SNAP-25 species whereas control molecules, specifically EGF-LHB (non-SNAP-25 targeting), a catalytically inactive form of EGF-LHA, EGF-LH-DE-A (referred to as `EGF-0` from hereon in) and the `free` EGF ligand, did not. Indicated concentrations are in nM.
[0367] FIG. 50--Inhibition of In Vitro Proliferation of a Renal Cell Carcinoma Line by an EGF-LHA Fusion.
[0368] 786-0 cells seeded into a cell culture vessel were counted in a pre-define region at 24 and 48 hours after treatment with 25 nM of EGF-liganded fusion proteins.
[0369] FIG. 51--Inhibition of FGF-2 Secretion From a Renal Cell Carcinoma Line by an EGF-LHA Fusion.
[0370] Culture media from 786-0 cells treated for 24 hours with EGF-liganded fusion proteins were analysed for Fibroblast Growth Factor-2 by standard methods. In more detail, EGF-LHA, at doses between 1 and 50 nM, demonstrated a dose-dependent decrease in FGF-2 levels present in the culture medium whereas control molecules, specifically the catalytically inactive form of EGF-LHA, EGF-0, and `free` EGF ligand, did not. Indicated concentrations are in nM.
[0371] FIG. 52--Inhibition of In Vitro Proliferation of the A498 Renal Cell Carcinoma Line by an EGF-LHA Fusion.
[0372] FIG. 52 shows the effect of an EGF-liganded fusion protein (EGFv3-LHA, 300 nM) on cellular proliferation of A498 cell line after 48 hours. Cells were stained using the tetrazolium salt WST as per standard protocols. The unliganded molecule LHA was included as a control.
[0373] FIG. 53--Inhibition of In Vitro Proliferation of the ACHN Renal Cell Carcinoma Line by an EGF-LHA Fusion.
[0374] FIG. 53 shows the effect of an EGF-liganded fusion protein (EGFv3-LHA, at 300 nM) on cellular proliferation of ACHN cell line after 48 hours. Cells were stained using the tetrazolium salt WST as per standard protocols. The unliganded molecule LHA was included as a control.
[0375] FIG. 54--Inhibition of In Vitro Secretion of Gastrin Releasing Peptide (GRP) from the Small Cell Lung Cancer Cell Line DMS-53 by an EGF-LHA Fusion.
[0376] FIG. 54 shows the effect of an EGF-liganded fusion protein (EGFv3-LHA, at 150 nM) on GRP secretion from the DMS-53 cell line after treatment for 24 hr. Medium removed from the cells were analysed 48 hours after removal of fusion proteins. The catalytically inactive form of EGF-LHA, EGF-0, was included as a control, as was cycloheximide, a general protein synthesis inhibitor.
NOMENCLATURE
[0377] SST somatostatin
[0378] TGF(A) transforming growth factor (alpha)
[0379] GHRL ghrelin
[0380] LEP leptin
[0381] ET(A) endothelin-1
[0382] FLT vascular endothelial growth factor receptor
[0383] CHRN(D) acetylcholine receptor (subunit delta)
[0384] EPHA ephrin type-A receptor
[0385] EFNA ephrin-A
[0386] DLK1 delta-like protein 1
[0387] JAG jagged protein
[0388] NRG neuregulin
[0389] G-CSF granulocyte colony-stimulating factor
[0390] AMF autocrine motility factor
[0391] NMB neuromedin-B
[0392] CCK cholecystokinin
[0393] PDGF platelet-derived growth factor
[0394] ADM adrenomedullin
[0395] GDNF glial cell line-derived neurotrophic factor
[0396] TrkA high affinity nerve growth factor
[0397] FSH follicle-stimulating hormone
[0398] CXCR C-X-C chemokine receptor
[0399] CRLR calcitonin-receptor-like receptor
[0400] PDF prostate differentiation factor
[0401] MCP monocyte chemotactic protein
[0402] KGF keratinocyte growth factor
[0403] FLK1 vascular endothelial growth factor receptor 2
[0404] PDGFR platelet-derived growth factor receptor
[0405] NOTCH neurogenic locus notch homolog protein
[0406] DLL delta-like protein
[0407] GHS growth hormone secretagogue
[0408] c-MET hepatocyte growth factor
[0409] c-kit mast/stem cell growth factor
[0410] MGSA/GRO melanoma growth stimulatory activity/growth related gene
[0411] BCGF B-cell growth factor
[0412] GnRH gonadotropin-releasing hormone receptor
[0413] Ang-2 angiopoietin-2
[0414] FGF fibroblast growth factor
[0415] ErbB epidermal growth factor receptor family member
[0416] VIPR vasoactive intestinal polypeptide receptor
[0417] BRS bombesin receptor subtype
[0418] GRP gastrin releasing peptide
[0419] LIF leukaemia inhibitory factor
[0420] GHRH growth hormone-releasing hormone
[0421] IGF insulin-like growth factor
[0422] CRHR-2 corticotropin releasing factor receptor-2
[0423] BB bombesin
[0424] GH growth hormone
[0425] IL interleukin
[0426] VEGF vascular endothelial growth factor
[0427] ACH acetylcholine
[0428] CST cortistatin
[0429] VPAC vasoactive intestinal peptide receptor
[0430] GRPR gastrin releasing peptide receptor
[0431] CTR calcitonin binding receptor
[0432] EPO erythropoietin
[0433] HB-EGF heparin-binding EGF-like growth factor
[0434] HGF/SF hepatocyte growth factor/scatter factor
[0435] SDF-1 stromal cell-derived factor 1
[0436] CXCL12 chemokine (C-X-C motif) ligand 12 (SDF-1)
[0437] TNF tumour necrosis factor
[0438] PGF placental growth factor
[0439] Gran4 granulin-4
[0440] TIE2 angiopoietin receptor-2
[0441] LH luteinising hormone
[0442] CCL CC chemokine ligand
[0443] NT neurotrophin
[0444] NTAK neuregulin-2
[0445] BAFF B-cell activating factor
[0446] GM-CSF granulocyte-macrophage colony stimulating factor
[0447] NGF nerve growth factor
[0448] PACAP pituitary adenylate cyclase-activating peptide
[0449] OB leptin
[0450] NRP neuropilin receptor
[0451] Summary of Examples
[0452] Example 1 Preparation of a LHA backbone construct
[0453] Example 2 Construction of LHD-CT-CST28
[0454] Example 3 Expression and purification of a LHD-CT-CST28 fusion protein
[0455] Example 4 Construction of LHA-CP-EGF
[0456] Example 5 Expression and purification of a LHA-CP-EGF fusion protein
[0457] Example 6 Chemical conjugation of LHN/A to GnRH TM
[0458] Example 7 Method for treating colorectal cancer
[0459] Example 8 Method for treating breast cancer
[0460] Example 9 Method for treating prostate cancer
[0461] Example 10 Method for treating lung carcinoid tumours
[0462] Example 11 Method for treating bladder cancer
[0463] Example 12 Method for treating small cell lung cancer
[0464] Example 13 Method for treating prostate cancer
[0465] Example 14 Method for treating cervical cancer
[0466] Example 15 Method for treating leukaemia
[0467] Example 16 Method for treating small cell lung cancer
[0468] Example 17 Method for treating pancreatic cancer
[0469] Example 18 Method for treating metastatic bone cancer
[0470] Example 19 Method for treating metastatic small cell lung cancer in the brain
[0471] Example 20 Method for treating bowel cancer
[0472] Example 21 Method for treating chronic lymphocytic leukaemia
[0473] Example 22 Method for treating liver cancer
[0474] Example 23 Method for treating Hodgkin's lymphoma
[0475] Example 24 Method for treating renal cancer
[0476] Example 25 Method for treating skin cancer
[0477] Example 26 Method for treating oropharyngeal cancer
[0478] Example 27 Method for treating myeloma cancer
[0479] Example 28 Method for treating soft tissue sarcoma cancer
[0480] Example 29 Method for treating gastric cancer
[0481] Example 30 Method for treating testicular cancer
[0482] Example 31 Method for treating uterine cancer
[0483] Example 32 Method for treating Karposi sarcoma
[0484] Example 33 Method for treating primary brain cancer
[0485] Example 34 Method for treating rectal cancer
[0486] Example 35 Assessment of proliferation changes, inhibition of cellular secretion and concomitant SNARE cleavage after treatment of in vitro cultured renal cancer cell lines with a polypeptide of the present invention.
[0487] Summary of SEQ ID NOs
[0488] Where an initial Met amino acid residue or a corresponding initial codon is indicated in any of the following SEQ ID NOs, said residue/codon is optional. [0489] 1. DNA sequence of LHN/A [0490] 2. DNA sequence of LHN/B [0491] 3. DNA sequence of LHN/C [0492] 4. DNA sequence of LHN/D [0493] 5. DNA sequence of the CT-CST28 linker [0494] 6. DNA sequence of the LHD-CT-CST28 fusion [0495] 7. Protein sequence of the LHD-CT-CST28 fusion [0496] 8. DNA sequence of the CP-EGF linker [0497] 9. DNA sequence of the LHA-CP-EGF fusion [0498] 10. Protein sequence of the LHA-CP-EGF fusion [0499] 11. Protein sequence of LHN/A [0500] 12. Protein sequence of LHN/B [0501] 13. Protein sequence of LHN/C [0502] 14. Protein sequence of LHN/D [0503] 15. Synthesised GnRH peptide [0504] 16. Protein sequence of the LHB-CT-SST28 fusion [0505] 17. Protein sequence of the LHA-CP-SST28 fusion [0506] 18. Protein sequence of the LHD-CT-EGF fusion [0507] 19. Protein sequence of the LHD-CT-VIP fusion [0508] 20. Protein sequence of the LHC-CT-IGF1 fusion [0509] 21. Protein sequence of the LHD-CT-IGF1 fusion [0510] 22. Protein sequence of the LHC-CT-VIP fusion [0511] 23. Protein sequence of the LHC-CT-GnRH fusion [0512] 24. Protein sequence of the LHD-CT-GnRH fusion [0513] 25. Protein sequence of the LHD-CT-GRP fusion [0514] 26. Protein sequence of the LHB-CT-GRP fusion [0515] 27. Protein sequence of the LHC-CT-LIF fusion [0516] 28. Protein sequence of the LHB-CP-LIF fusion [0517] 29. Protein sequence of the LHC-CT-FGF1 fusion [0518] 30. Protein sequence of the LHA-CP-FGF1 fusion [0519] 31. Protein sequence of the LHA-CT-FGF9 fusion [0520] 32. Protein sequence of the LHC-CP-FGF9 fusion [0521] 33. Protein sequence of the IgA-HNtet-CT-SST14 fusion [0522] 34. Protein sequence of the IgA-HNtet-CP-SST14 fusion [0523] 35. Protein sequence of the LHA-CT-SST14 fusion [0524] 36. Protein sequence of the LHA-CT-EGFv3 fusion [0525] 37. Protein sequence of the LHE-CT-IL6 fusion [0526] 38. Protein sequence of the LHB-CT-IL8 fusion [0527] 39. Protein sequence of the LHF-CP-GRAN4 fusion [0528] 40. Protein sequence of the LHD-CP-TGFa fusion [0529] 41. Protein sequence of the LHD-CP-TGFb fusion [0530] 42. Protein sequence of the LHB-CT-TNFa fusion [0531] 43. Protein sequence of the LHD-CT-SDF1 fusion [0532] 44. Protein sequence of the LHC-CT-VEGF fusion
EXAMPLES
Example 1
Preparation of a LHN/A Backbone Construct
[0533] The following procedure creates a clone for use as an expression backbone for multidomain protein expression. This example is based on preparation of a serotype A based clone (SEQ ID1), though the procedures and methods are equally applicable to all LHN serotypes such as serotype B (SEQ ID2), serotype C (SEQ ID3) and serotype D (SEQ ID4) and other protease or translocation domains by using the appropriate published sequence for synthesis.
Preparation of Cloning and Expression Vectors
[0534] pCR 4 (Invitrogen) is the chosen standard cloning vector chosen due to the lack of restriction sequences within the vector and adjacent sequencing primer sites for easy construct confirmation. The expression vector is based on the pET (Novagen) expression vector which has been modified to contain the multiple cloning site NdeI-BamHI-SaII-PstI-XbaI-HindIII for construct insertion, a fragment of the expression vector has been removed to create a non-mobilisable plasmid, a variety of different fusion tags have been inserted to increase purification options and an existing XbaI site in the vector backbone has been removed to simplify sub-cloning.
Preparation of LC/A
[0535] The DNA sequence is designed by back translation of the LC/A amino acid sequence (obtained from freely available database sources such as GenBank (accession number P10845) using one of a variety of reverse translation software tools (for example Backtranslation tool v2.0 (Entelechon)). BamHI/SaII recognition sequences are incorporated at the 5' and 3' ends respectively of the sequence maintaining the correct reading frame. The DNA sequence is screened (using software such as SeqBuilder, DNASTAR Inc.) for restriction enzyme cleavage sequences incorporated during the back translation. Any cleavage sequences that are found to be common to those required by the cloning system are removed by the Backtranslation tool from the proposed coding sequence ensuring common E. coli codon usage is maintained. E. coli codon usage is assessed by reference to software programs such as Graphical Codon Usage Analyser (GeneArt), and the % GC content and codon usage ratio assessed by reference to published codon usage tables (for example GenBank Release 143, Sep. 13, 2004). This optimised DNA sequence containing the LC/A open reading frame (ORF) is then commercially synthesized (for example by Entelechon, GeneArt or Sigma-Genosys) and is provided in the pCR 4 vector.
Preparation of HN/A Insert
[0536] The DNA sequence is designed by back translation of the HN/A amino acid sequence (obtained from freely available database sources such as GenBank (accession number P10845) using one of a variety of reverse translation software tools (for example Back translation tool v2.0 (Entelechon)). A PstI restriction sequence added to the N-terminus and XbaI-stop codon-HindIII to the C-terminus ensuring the correct reading frame in maintained. The DNA sequence is screened (using software such as SeqBuilder, DNASTAR Inc.) for restriction enzyme cleavage sequences incorporated during the back translation. Any sequences that are found to be common to those required by the cloning system are removed by the Backtranslation tool from the proposed coding sequence ensuring common E. coli codon usage is maintained. E. coli codon usage is assessed by reference to software programs such as Graphical Codon Usage Analyser (GeneArt), and the % GC content and codon usage ratio assessed by reference to published codon usage tables (for example GenBank Release 143, Sep. 13, 2004). This optimised DNA sequence is then commercially synthesized (for example by Entelechon, GeneArt or Sigma-Genosys) and is provided in the pCR 4 vector.
Preparation of the Interdomain (LC-HN Linker)
[0537] The LC-HN linker can be designed from first principle, using the existing sequence information for the linker as the template. For example, the serotype A linker (in this case defined as the inter-domain polypeptide region that exists between the cysteines of the disulphide bridge between LC and HN) has the sequence VRGIIPFKTKSLDEGYNKALNDL. This sequence information is freely available from available database sources such as GenBank (accession number P10845). For generation of a specific protease cleavage site, the native recognition sequence for Factor Xa can be used in the modified sequence VDGIITSKTKSLIEGRNKALNLQ or an enterokinase recognition sequence is inserted into the activation loop to generate the sequence VDGIITSKTKSDDDDKNKALNLQ. Using one of a variety of reverse translation software tools (for example Backtranslation tool v2.0 (Entelechon), the DNA sequence encoding the linker region is determined. BamHI/SaII and PstI/XbaI/stop codon/HindIII restriction enzyme sequences are incorporated at either end, in the correct reading frames. The DNA sequence is screened (using software such as Seqbuilder, DNASTAR Inc.) for restriction enzyme cleavage sequences incorporated during the back translation. Any sequences that are found to be common to those required by the cloning system are removed by the Backtranslation tool from the proposed coding sequence ensuring common E. coli codon usage is maintained. E. coli codon usage is assessed by reference to software programs such as Graphical Codon Usage Analyser (GeneArt), and the % GC content and codon usage ratio assessed by reference to published codon usage tables (for example GenBank Release 143, Sep. 13, 2004). This optimised DNA sequence is then commercially synthesized (for example by Entelechon, GeneArt or Sigma-Genosys) and is provided in the pCR 4 vector.
Assembly and Confirmation of the Backbone Clone
[0538] Due to the small size, the activation linker must be transferred using a two step process. The pCR-4 linker vector is cleaved with BamHI+SaII combination restriction enzymes and the cleaved linker vector then serves as the recipient for BamHI+SaII restriction enzyme cleaved LC DNA. Once the LC encoding DNA is inserted upstream of the linker DNA, the entire LC-linker DNA fragment can then be isolated and transferred to the pET expression vector MCS. The LC-linker is cut out from the pCR 4 cloning vector using BamHI/PstI restriction enzymes digests. The pET expression vector is digested with the same enzymes but is also treated with antarctic phosphatase as an extra precaution to prevent re-circularisation. The LC-linker and the pET vector backbone are gel purified and the purified insert and vector backbone are ligated together using T4 DNA ligase. The product is transformed with TOP10 cells which are then screened for LC-linker using BamHI/PstI restriction digestion. The process is then repeated for the HN insertion into the PstI/HindIII restriction sites of the pET-LC-linker construct.
[0539] Screening with restriction enzymes is sufficient to ensure the final backbone is correct as all components are already sequenced confirmed during synthesis. However, during the sub-cloning of some components into the backbone, where similar size fragments are being removed and inserted, sequencing of a small region to confirm correct insertion is required.
Example 2
Construction of LHN/D-CT-CST28
[0540] The following procedure creates a clone for use as an expression construct for multidomain fusion expression where the targeting moiety (TM) is presented C-terminally to the translocation domain. This example is based on preparation of the LHN/D-CT-CST28 fusion (SEQ ID6), though the procedures and methods are equally applicable to create other protease, translocation and TM fusions, where the TM of C-terminal to the translocation domain. In this example, a flanking 15 amino acid glycine-serine spacer is engineered into the interdomain sequence to ensure accessibility of the ligand to its receptor, but other spacers are applicable.
Preparation of Spacer-CST28 Insert
[0541] For presentation of a CST28 sequence at the C-terminus of the HN domain, a DNA sequence is designed to flank the spacer and targeting moiety (TM) regions allowing incorporation into the backbone clone (SEQ ID4). The DNA sequence can be arranged as BamHI-Sa/I-PstI-XbaI-spacer-CST28-stop codon-HindIII (SEQ ID5). The DNA sequence can be designed using one of a variety of reverse translation software tools (for example EditSeq best E. coli reverse translation (DNASTAR Inc.), or Backtranslation tool v2.0 (Entelechon)). Once the TM DNA is designed, the additional DNA required to encode the preferred spacer is created in silico. It is preferred to ensure the correct reading frame is maintained for the spacer, TM and restriction sequences and that the XbaI sequence is not preceded by the bases TC, which would result in DAM methylation. The DNA sequence is screened for restriction sequences incorporated and any additional sequences are removed manually from the remaining sequence ensuring common E. coli codon usage is maintained. E. coli codon usage is assessed by reference to software programs such as Graphical Codon Usage Analyser (GeneArt), and the % GC content and codon usage ratio assessed by reference to published codon usage tables (for example GenBank Release 143, Sep. 13, 2004). This optimised DNA sequence is then commercially synthesized (for example by Entelechon, GeneArt or Sigma-Genosys) and is provided in the pCR 4 vector.
Assembly and Confirmation of the Backbone Clone
[0542] In order to create a LHN/D-CT-CST28 construct (SEQ ID6) using the backbone construct (SEQ ID4) and the newly synthesised pCR 4-spacer-TM vector encoding the CST28 TM (SEQ ID5), a one or two step method can be used; typically the two step method is used when the TM DNA is less than 100 base pairs. Using the one step method the TM can be inserted directly into the backbone construct buy cutting the pCR 4-spacer-TM vector with XbaI and HindIII restriction enzymes and inserting the TM encoding DNA fragment into a similarly cut pET backbone construct. Using the two-step method the LHN domain is excised from the backbone clone using restriction enzymes BamHI and XbaI and ligated into similarly digested pCR 4-spacer-TM vector. This creates an LHN-spacer-TM ORF in pCR 4 that can be excised from the vector using restriction enzymes BamHI and HindIII for subsequent ligation into the similarly cleaved pET expression construct. The final construct contains the LC-linker-HN-spacer-CST28 DNA (SEQ ID6) which will result in a fusion protein containing the sequence illustrated in SEQ ID7.
Example 3
Expression and Purification of a LHN/D-CT-CST28 Fusion Protein
[0543] This example is based on preparation of an LHN/D protein that incorporates a CST28 TM polypeptide at the carboxyl terminus of the HN domain (SEQ ID7), where the pET expression vector ORF also encodes a histidine purification tag. These procedures and methods are equally applicable to fusion protein sequences shown in SEQ ID 16, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 29, 31, 33, 35, 36, 37, 38, 42, 43 or 44. Where appropriate, the activation enzyme should be selected to be compatible with the protease activation site within each sequence.
Expression of LHN/D-CT-CST28
[0544] Expression of the LHN/D-CT-CST28 protein is achieved using the following protocol. Inoculate 100 ml of modified TB containing 0.2% glucosamine and 30 μg/ml kanamycin in a 250 ml flask with a single colony from the LHN/D-CT-CST28 expression strain. Grow the culture at 37° C., 225 rpm for 16 hours. Inoculate 1 L of modified TB containing 0.2% glucosamine and 30 μg/ml kanamycin in a 2 L flask with 10 ml of overnight culture. Grow cultures at 37° C. until an approximate OD600 nm of 0.5 is reached at which point reduce the temperature to 16° C. After 1 hour induce the cultures with 1 mM IPTG and grow at 16° C. for a further 16 hours.
Purification of LHN/D-CT-CST28 Protein
[0545] Defrost falcon tube containing 35 ml 50 mM HEPES pH 7.2 200 mM NaCl and approximately 10 g of E. coli BL21 (DE3) cell paste. Homogenise the cell paste (20 psi) ensuring the sample remains cool. Spin the lysed cells at 18000 rpm, 4° C. for 30 minutes. Load the supernatant onto a 0.1 M NiSO4 charged Chelating column (20-30 ml column is sufficient) equilibrated with 50 mM HEPES pH 7.2 200 mM NaCl. Using a step gradient of 10, 40 and 100 mM imidazole, wash away the non-specific bound protein and elute the fusion protein with 200 mM imidazole. The eluted fusion protein is dialysed against 5 L of 50 mM HEPES pH 7.2 200 mM NaCl at 4° C. overnight and the OD280 nm measured to establish the protein concentration. Add 3.2 μl enterokinase (New England Biolabs) per mg fusion protein and incubate static overnight at 25° C. Load onto a 0.1 M NiSO4 charged Chelating column (20-30 ml column is sufficient) equilibrated with 50 mM HEPES pH 7.2 200 mM NaCl. Wash column to baseline with 50 mM HEPES pH 7.2 200 mM NaCl. Using a step gradient of 10, 40 and 100 mM imidazole, wash away the non-specific bound protein and elute the fusion protein with 200 mM imidazole. Dialyse the eluted fusion protein against 5 L of 50 mM HEPES pH 7.2 150 mM NaCl at 4° C. overnight and concentrate the fusion to about 2 mg/ml, aliquot sample and freeze at -20° C. Test purified protein using OD280, BCA and purity analysis. FIGS. 1 and 2 demonstrate purification of fusion proteins following this method as analysed by SDS-PAGE.
Example 4
Construction of LHN/A-CP-EGF
[0546] The following procedure creates a clone for use as an expression construct for multidomain fusion expression where the targeting moiety (TM) is presented centrally between the protease and translocation domain. This example is based on preparation of the LHN/A-CP-EGF fusion (SEQ ID9), though the procedures and methods are equally applicable to create other protease, translocation and TM fusions, where the TM is N-terminal to the translocation domain. In this example, a flanking helical spacer is engineered into the interdomain sequence to ensure accessibility of the ligand to its receptor, but other spacers are applicable.
Preparation of Spacer-Human EGF Insert
[0547] The LC-HN inter-domain polypeptide linker region exists between the cysteines of the disulphide bridge between LC and HN. For insertion of a protease cleavage site, spacer and a targeting moiety (TM) region into the activation loop, one of a variety of reverse translation software tools (for example Backtranslation tool v2.0 (Entelechon) are used to determine the DNA sequence encoding the linker region. For central presentation of a TM sequence at the N-terminus of the HN domain, a DNA sequence is designed for the spacer and targeting moiety (TM) regions allowing incorporation into the backbone clone (SEQ ID1). The DNA sequence can be arranged as BamHI-SaII -spacer-protease activation site-EGF-spacer-PstI-XbaI-stop codon-HindIII (SEQ ID8). Once the TM DNA is designed, the additional DNA required to encode the preferred spacer is created in silico. It is preferred to ensure the correct reading frame is maintained for the spacer, TM and restriction sequences and that the XbaI sequence is not preceded by the bases TC, which would result in DAM methylation. The DNA sequence is screened for restriction sequence incorporated and any additional sites are removed manually from the remaining sequence ensuring common E. coli codon usage is maintained. E. coli codon usage is assessed by reference to software programs such as Graphical Codon Usage Analyser (GeneArt), and the % GC content and codon usage ratio assessed by reference to published codon usage tables (for example GenBank Release 143, Sep. 13, 2004). This optimised DNA sequence is then commercially synthesized (for example by Entelechon, GeneArt or Sigma-Genosys) and is provided in the pCR 4 vector.
Assembly and Confirmation of the Backbone Clone
[0548] In order to create a LC-spacer-activation site-EGF-spacer-HN construct (SEQ ID9) using the backbone construct (SEQ ID1) and the newly synthesised pCR 4-spacer-activation site-TM-spacer vector encoding the EGF TM (SEQ ID8), a one or two step method can be used; typically the two step method is used when the TM DNA is less than 100 base pairs. Using the one step method the TM linker region can be inserted directly into the backbone construct buy cutting the pCR 4-spacer-activation site-TM-spacer vector with SaII and PstI restriction enzymes and inserting the TM encoding DNA fragment into a similarly cut pET backbone construct. Using the two-step method the LC domain is excised from the backbone clone using restriction enzymes BamHI and SaII and ligated into similarly digested pCR 4-spacer-activation site-TM-spacer vector. This creates a LC-spacer-activation site-TM-spacer ORF in pCR 4 that can be excised from the vector using restriction enzymes BamHI and PstI for subsequent ligation into similarly pET expression construct. The final construct contains the LC-spacer-activation site-EGF-spacer-HN DNA (SEQ ID9) which will result in a fusion protein containing the sequence illustrated in SEQ ID10.
Example 5
Expression and Purification of a LHN/A-CP-EGF Fusion Protein
[0549] This example is based on preparation of an LHN/A protein that incorporates a EGF TM polypeptide into the interdomain linker region (SEQ ID10), where the pET expression vector ORF also encodes a histidine purification tag. These procedures and methods are equally applicable to the other fusion protein shown in SEQ ID 17, 28, 30, 32, 34, 39, 40 or 41. Where appropriate, the activation enzyme should be selected to be compatible with the protease activation site within each sequence.
Expression of LHN/A-CP-EGF
[0550] Expression of the LHN/A-CP-EGF protein is achieved using the following protocol. Inoculate 100 ml of modified TB containing 0.2% glucosamine and 30 μg/ml kanamycin in a 250 ml flask with a single colony from the LHA-CP-EGF expression strain. Grow the culture at 37° C., 225 rpm for 16 hours. Inoculate 1 L of modified TB containing 0.2% glucosamine and 30 μg/ml kanamycin in a 2 L flask with 10 ml of overnight culture. Grow cultures at 37° C. until an approximate OD600 nm of 0.5 is reached at which point reduce the temperature to 16° C. After 1 hour induce the cultures with 1 mM IPTG and grow at 16° C. for a further 16 hours.
Purification of LHN/A-CP-EGF Protein
[0551] Defrost falcon tube containing 35 ml 50 mM HEPES pH 7.2 200 mM NaCl and approximately 10 g of E. coli BL21 (DE3) cell paste. Homogenise the cell paste (20 psi) ensuring the sample remains cool. Spin the lysed cells at 18000 rpm, 4° C. for 30 minutes. Load the supernatant onto a 0.1 M NiSO4 charged Chelating column (20-30 ml column is sufficient) equilibrated with 50 mM HEPES pH 7.2 200 mM NaCl. Using a step gradient of 10, 40 and 100 mM imidazole, wash away the non-specific bound protein and elute the fusion protein with 200 mM imidazole. The eluted fusion protein is dialysed against 5 L of 50 mM HEPES pH 7.2 200 mM NaCl at 4° C. overnight and the OD280 nm measured to establish the protein concentration. Add 3.2 μl enterokinase (New England Biolabs) per mg fusion protein and incubate static overnight at 25° C. Load onto a 0.1 M NiSO4 charged Chelating column (20-30 ml column is sufficient) equilibrated with 50 mM HEPES pH 7.2 200 mM NaCl. Wash column to baseline with 50 mM HEPES pH 7.2 200 mM NaCl. Using a step gradient of 10, 40 and 100 mM imidazole, wash away the non-specific bound protein and elute the fusion protein with 200 mM imidazole. Dialyse the eluted fusion protein against 5 L of 50 mM HEPES pH 7.2 150 mM NaCl at 4° C. overnight and concentrate the fusion to about 2 mg/ml, aliquot sample and freeze at -20° C. Test purified protein using OD280, BCA and purity analysis.
Example 6
Chemical Conjugation of LHN/A to GnRH TM
[0552] The following procedure creates a chemically conjugated molecule containing the LHN/A amino acid sequence (SEQ ID11), prepared from SEQ ID1 using the production method outlined in example 3, and a GnRH peptide which has been chemically synthesised (SEQ ID15). However, the procedures and methods are equally applicable for the conjugation of other peptides to other protease/translocation domain proteins such as those containing the amino acid sequences SEQ ID12, 13 and 14.
[0553] The LHN/A protein was buffer exchanged from 50 mM Hepes 150 mM salt into PBSE (100 mM 14.2 g NA2HPO4, 100 mM 5.85 g NaCl, 1 mM EDTANa2 pH 7.5 with 1M HCl) using the Bio-rad PD10 column. This was done by washing one column volume of PBSE through the PD10 column, the protein was then added to the column until no more drops exit the end of the PD10 column. 8 mls of PBSE was then added and 0.5 ml fractions are collected. The collected fractions are the measured using the A280 reading and fractions containing protein are pooled. A concentration of 1.55 mg/ml of LHN/A was obtained from the buffer exchange step and this was used to set up the following reactions:
TABLE-US-00011 LHN/A 1.55 mg/ml 20 mM SPDP or Sulfo-LC-SPDP A 200 μl 0 B 200 μl 4 fold increase 0.62 μl C 200 μl 8 fold increase 1.24 μl
[0554] Sample were left to tumble at RT for 3 hours before being passed down another PD10 column to buffer exchange into PBSE and the protein containing fractions pooled. A final concentration of 25 mM DTT was then added to derivatised protein and then the samples left at room temperature for 10 minutes. A280 and A343 readings were then taken to work out the ratio of SPDP:LHN/A interaction and the reaction which resulted in a derivatisation ration of between 1 and 3 was used for the peptide conjugation. The SPDP reagent binds to the primary amines of the LHN/A via an N-hydroxysuccinimide (NHS) ester, leaving the sulphydryl-reactive portion to form a disulphide bond to the free SH group on the free cysteine on the synthesised peptide. In this case the peptide sequence is GnRH which has been synthesised with a cysteine contained within the peptide to allow conjugation whilst leaving the N-terminus and C-terminus of the GnRH to interact with its receptor (SEQ ID15). The SPDP-derivatised LHN/A was mixed with a 4-fold excess of the GnRH ligand and the reaction was then left at RT for 90 minutes whilst tumbling. The excess GnRH was then removed using either a PD10 column leaving LHN/A-GnRH conjugated molecule.
Example 7
Method for Treating Colorectal Cancer
[0555] A 62 year old man presents with a stage II colorectal cancer. To reduce and/or to prevent metastasis he receives a direct injection of a polypeptide of the present invention (e.g. botulinum type A neurotoxin protease and translocation domain and a VIP peptide). Within 4 weeks a significant shrinkage of the tumour is observed without appearance of metastasis elsewhere. The treatment is optionally performed in combination with chemotherapy, and is repeated 2 months later and 4 weeks later no tumour is observable anymore with the usual detection tools (colonoscopy, CT scan, PET scan, etc.) and the level of carcinoembryonic antigen (CEA) returned to the normal.
Example 8
Method for Treating Breast Cancer
[0556] A 61 year old woman presents with a stage II breast cancer. To treat and/or prevent metastasis she receives an IV injection of a polypeptide of the present invention (e.g. a botulinum type C neurotoxin protease, a botulinum type C neurotoxin translocation domain and a GnRH peptide), optionally in combination with chemotherapy. Within 4 weeks a significant shrinkage of the tumour is observed without appearance of metastasis elsewhere. The treatment is repeated 2 months later and 6 weeks later no tumour is observable anymore with the usual detection tools (MRI, ultrasound, breast-specific positron emission tomography, mammography, Scintigraphy, etc.).
Example 9
Method for Treating Prostate Cancer
[0557] A 77 year old man presents with a stage II prostate cancer. To treatment and/or to prevent metastasis he receives a intravenous injection of a polypeptide of the present invention (e.g. a botulinum type C neurotoxin protease, a botulinum type C neurotoxin translocation domain and an IGF-1 peptide), optionally in combination with hormone therapy. Within 4 weeks a significant shrinkage of the tumour is observed without appearance of metastasis elsewhere. The treatment is repeated 2 months later and 8 weeks later no tumour is observable anymore with the usual detection tools (X-ray, ProstaScint scan, MRI, transrectal ultrasonography, CT scan, etc.) and the levels of PSA came back to normal.
Example 10
Method for Treating Lung Carcinoid Tumours
[0558] A 66 year old woman presents with lung carcinoid tumours. To treat and/or to prevent metastasis she receives an IV injection of a polypeptide of the present invention (e.g. a botulinum type A neurotoxin protease, a botulinum type A neurotoxin translocation domain and a bFGF-1 peptide), optionally in combination with chemotherapy. Within 4 weeks a significant decrease in the size of the tumour is observed without appearance of metastasis elsewhere. The treatment is repeated 1 month later and 4 weeks later no tumour is observable anymore with the usual detection tools (X-rays, CT scan, bronchoscopy, etc.) or using the usual blood tests recommended for this cancer.
Example 11
Method for Treating Bladder Cancer
[0559] A 56 year old man presents with a stage II bladder cancer. To treat and/or to prevent metastasis he receives a direct injection of a polypeptide of the present invention (e.g. an IgA protease, a tetanus neurotoxin translocation domain and an IGF-1 peptide), optionally in combination with chemotherapy. Within 2 weeks a significant shrinkage of the tumour is observed without appearance of metastasis elsewhere. The treatment is repeated 2 months later and 4 weeks later no tumour is observable anymore with the usual detection tools (colonoscopy, CT scan, PET scan, etc.).
Example 12
Method for Treating Small Cell Lung Cancer
[0560] A 79 year old man is diagnosed with a stage I small cell lung cancer. To treat and/or to prevent metastasis he receives a injection of a polypeptide of the present invention (e.g. a botulinum type A neurotoxin protease, a botulinum type A neurotoxin translocation domain and a neuregulin ERBB3 peptide), optionally in combination with chemotherapy. Within 3 weeks a significant decrease in size of the tumour is observed without appearance of metastasis elsewhere. The treatment is repeated 2 months later and 5 weeks later no tumour is observable anymore with the usual detection tools (X-rays, CT scan, MRI, PET scanning, Radionuclide imaging, bronchoscopy, etc.) or using the usual blood tests recommended for this cancer.
Example 13
Method for Treating Prostate Cancer
[0561] A 72 year old man is diagnosed with a stage II prostate cancer. To treat and/or to prevent metastasis he receives a intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type C neurotoxin translocation domain, a GS20 linker, and an IGF-1 peptide), optionally in combination with androgen deprivative treatment. Within 10 days a significant shrinkage of the tumour is observed without appearance of metastasis elsewhere. The treatment is repeated 2 months later and 5 weeks later no tumour is observable anymore with the usual detection tools (X-ray, ProstaScint scan, MRI, transrectal ultrasonography, CT scan, etc.) and the levels of PSA came back to normal.
Example 14
Method for Treating Cervical Cancer
[0562] A 60 year old woman diagnosed with cervical cancer at a limited stage is treated with surgery. To improve the effects of the treatment and to prevent metastasis she receives an intravenous injection of a polypeptide of the present invention (e.g. botulinum type D neurotoxin protease, a botulinum type D neurotoxin translocation domain, a GS20 linker, and a somatostatin-14 peptide). Within 6 weeks no re-appearance of the tumour is observed. The treatment is repeated 3 months later and 8 weeks later no tumour is observable anymore with the usual detection tools (X-rays, CT scan, MRI, PET scanning, Radionuclide imaging, bronchoscopy, etc.) or using the usual blood tests recommended for this cancer.
Example 15
Method for Treating Leukaemia
[0563] A 42 year old man is diagnosed with a stage II Hairy cell leukaemia cancer. To treat and/or to prevent metastasis he receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type D neurotoxin translocation domain, a GS20 linker, and a FGF-1 peptide), optionally in combination with chemotherapy. Within 10 days significant reduction in tumour burden is observed without appearance of metastasis. The treatment is repeated 1 month later and 3 weeks later no tumour is observable anymore with the usual detection tools (MRI, complete blood count, etc.).
Example 16
Method for Treating Small Cell Lung Cancer
[0564] A 56 year old man is diagnosed with a small cell lung cancer at an extensive stage. To treat and/or to prevent metastasis elsewhere he receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type C neurotoxin translocation domain and an EGF peptide), optionally in combination with chemotherapy and radiation therapy. Within 3 weeks a significant shrinkage of the tumour and a diminution in size of the metastasis is observed without appearance of new metastasis elsewhere. The treatment is repeated twice after 2 months and 5 months. The patients died 11 months later, 6 months later than expected with this type of treatment and this stage of the disease.
Example 17
Method for Treating Pancreatic Cancer
[0565] A 48 year old woman is diagnosed with pancreatic cancer at an advanced stage. She receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type C neurotoxin translocation domain and an EGF peptide), optionally in combination with the chemotherapeutic agent, Gemcitabine. Within 3 weeks a significant reduction in primary tumour growth and a diminution in size of the metastases are observed without appearance of new metastasis elsewhere. The treatment is repeated twice after 2 months and 5 months. The patient died 12 months later, 6 months later than expected with this type of treatment and this stage of the disease.
Example 18
Method for Treating Metastatic Bone Cancer
[0566] A 71 year old man is diagnosed with a stage IV prostate cancer. To treat metastatic growth in his bone he receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type D neurotoxin translocation domain and a TGF-beta peptide), optionally in combination with external beam radiation plus hormone therapy. Within 4 weeks a significant shrinkage of the tumour at the metastatic sites is observed without appearance of metastasis elsewhere. The treatment is repeated 2 and 4 months later. After 6 months no detectable increase in tumour burden is observable anymore with the usual detection tools (X-ray, ProstaScint scan, MRI, CT scan, etc.).
Example 19
Method for Treating Metastatic Small Cell Lung Cancer in the Brain
[0567] A 65 year old man diagnosed with a small cell lung cancer at an advanced stage also presents with multiple brain metastases. To treat, a polypeptide of the present invention (e.g. a botulinum type A neurotoxin protease, a botulinum type A neurotoxin translocation domain, and a SDF-1 peptide) is delivered into his brain by convection enhanced delivery, optionally in combination with chemotherapy and whole brain radiation. Within 4 weeks significant shrinkage of the metastatic tumour sites is observed without appearance of metastasis elsewhere. The treatment is repeated at 2 months and 8 weeks later no metastatic tumour is observable anymore with the usual detection tools (X-rays, CT scan, MRI, PET scanning, Radionuclide imaging, etc.) or using the usual blood tests recommended for this cancer.
Example 20
Method for Treating Bowel Cancer
[0568] A 76 year old man is diagnosed with a stage II small bowel cancer. To treatment and/or to prevent metastasis he receives a direct injection of a polypeptide of the present invention (e.g. a botulinum type A neurotoxin protease and translocation domain and an EGF peptide), optionally in combination with chemotherapy. Within 4 weeks a significant shrinkage of the tumour is observed without appearance of metastasis elsewhere and surgery is then realized to remove the tumour.
Example 21
Method for Treating Chronic Lymphocytic Leukaemia
[0569] A 54 year old man is diagnosed with relapsed chronic lymphocytic leukaemia. To treat, he receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type B neurotoxin protease, a botulinum type B neurotoxin translocation domain and an LIF-1 peptide), optionally in combination with chemotherapy. Within 2 weeks a significant reduction in tumour cell count in the patient's blood is observed which remains stable for an extended period of time as determined by microscopic examination and flow cytometric analysis of the patient's blood
Example 22
Method for Treating Liver Cancer
[0570] A 55 year old woman is diagnosed with a stage IV hepatic cancer. To treat, she receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type D neurotoxin translocation domain and a TGF-alpha peptide), optionally in combination with chemotherapy. Within 2 weeks a significant shrinkage of the tumour is observed on PET-CT imaging studies along with a reduction of alpha-fetoprotein in the blood, which was elevated prior to treatment.
Example 23
Method for Treating Hodgkin's Lymphoma
[0571] A 24 year old man is diagnosed with stage IVA Hodgkin's lymphoma. To treat, he receives repeated intravenous injections of a polypeptide of the present invention (e.g. a botulinum type B neurotoxin protease, a botulinum type B neurotoxin translocation domain and an VEGF peptide), optionally in combination with chemotherapy. Within 2 weeks a significant reduction in tumour volume is observed and tumour blood flow using the usual detection tools (MRI, CT scans) and leads to complete remission within 2 months.
Example 24
Method for Treating Renal Cancer
[0572] A 54 year old man is diagnosed with advanced renal cell carcinoma. To treat, he receives an intravenous administration of a polypeptide of the present invention (e.g. a botulinum type A neurotoxin protease, a botulinum type C neurotoxin translocation domain and a VEGF peptide), optionally in combination with the anti-angiogenic therapeutic, Sunitinib, a small molecule tyrosine kinase inhibitor (TKI). Within 14 days a significant shrinkage of the tumour over and above that expected with TKI alone is observed using the usual detection tools (CT, MRI scans, blood tests).
Example 25
Method for Treating Skin Cancer
[0573] A 31 year old woman is diagnosed with a facial melanoma. To treat, she receives direct administration of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type C neurotoxin translocation domain, a GS20 linker, and a VEGF peptide), optionally in combination with chemotherapy and immunotherapy. Within 21 days a significant shrinkage of the tumour over and above that expected with chemo- and immunotherapy alone is observed, allowing surgical resection with clear but much reduced margins.
Example 26
Method for Treating Oropharyngeal Cancer
[0574] A 63 year old man is diagnosed with a stage III head and neck cancer. To treat, he receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type D neurotoxin translocation domain and a TGF-alpha peptide), optionally in combination with radiotherapy and chemotherapy. Within 4 weeks a significant improvement in the patient's ability to swallow food is apparent which is maintained longer than would be expected with the combination therapy alone.
Example 27
Method for Treating Myeloma Cancer
[0575] A 73 year old woman diagnosed with multiple myeloma associated osteolytic bone lesions and hypercalcemia is treated with the standard chemotherapy. To improve the effects of the treatments she receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type C neurotoxin translocation domain and an IL-6 peptide). Within 4 weeks a significant reduction in the size of bone lesions and severity of hypercalcemia is observed without appearance of new lesions elsewhere using the usual detection tools (MRI, X-ray, blood tests).
Example 28
Method for Treating Soft Tissue Sarcoma Cancer
[0576] A 51 year old woman diagnosed with a fibrosarcoma of the leg is treated with radiation therapy in an attempt to reduce tumour size prior to surgical resection. To improve the effects of the radiation treatment she receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type C neurotoxin translocation domain, a GS20 linker, and a bFGF peptide). Within 14 days a significant shrinkage of the tumour over and above that expected with radiation therapy alone is observed, allowing surgical resection with clear margins.
Example 29
Method for Treating Gastric Cancer
[0577] A 84 year old man diagnosed with advanced gastric cancer and unable to undergo surgical resection is treated with radiation therapy to relieve tumour associated blockage. To improve the effects of the treatment he receives multiple direct injections of a polypeptide of the present invention (e.g. a botulinum type A neurotoxin protease, a botulinum type A neurotoxin translocation domain, a GS20 linker, and a GRP peptide). Within 5 days a significant shrinkage of the tumour is observed with the usual detection tools (gastroscopic examination and CT scan). The treatment is repeated 1 month later and 4 months later tumour blockage has not recurred.
Example 30
Method for Treating Testicular Cancer
[0578] A 32 year old man is diagnosed with a stage I seminoma cancer. To treat and/or to prevent recurrence he receives a intravenous injection of a polypeptide of the present invention (e.g. a botulinum type D neurotoxin protease, a botulinum type C neurotoxin translocation domain, a GS20 linker, and a VEGF peptide), optionally in combination with chemotherapy. Within 14 days a significant shrinkage of the tumour is observed. The treatment is repeated 1 month later and 6 weeks later no tumour is observable anymore with the usual detection tools (blood tests and CT scans).
Example 31
Method for Treating Uterine Cancer
[0579] A 76 year old woman diagnosed with a stage IIA endometrial cancer is treated with usual chemotherapy and radiotherapy. To improve the effects of the treatment and to prevent metastasis she receives a direct injection of a polypeptide of the present invention (e.g. a botulinum type A neurotoxin protease and translocation domain and an EGF peptide). Within 6 weeks a significant shrinkage of the tumour is observed on direct visualization of the uterine cavity by hysteroscopy without appearance of metastasis elsewhere.
Example 32
Method for Treating Karposi Sarcoma
[0580] A 48 year old man is diagnosed with acquired immunity deficiency syndrome (AIDS) and presents with multiple Karposi sarcoma lesions. To treat, he receives an intravenous injection of a polypeptide of the present invention (e.g. a botulinum type A neurotoxin protease, a botulinum type A neurotoxin translocation domain and an IL-6 peptide), optionally in combination with interferon alpha. Within 3 weeks a significant reduction in lesion size is observed without appearance of new lesions elsewhere. The treatment is repeated twice after 1 month and 3 months which effectively stops the progression of the Kaposi sarcoma.
Example 33
Method for Treating Primary Brain Cancer
[0581] A 45 year old woman is diagnosed with glioblastoma. To treat and/or to prevent further metastasis she receives an intracranial application of a polypeptide of the present invention (e.g. a botulinum type C neurotoxin protease, a botulinum type C neurotoxin translocation domain and a VEGF peptide), optionally in combination with chemotherapy. Within 4 weeks a significant decrease in the size of the tumour is observed without appearance of metastasis elsewhere. The treatment is repeated 1 month later and 4 weeks later no tumour is observable anymore with the usual detection tools (X-rays, CT scan, etc.).
Example 34
Method for Treating Rectal Cancer
[0582] A 77 year old man diagnosed with a stage II rectal cancer is treated with a polypeptide of the present invention (e.g. botulinum type A neurotoxin protease and an anti-EGFR antibody F(ab)'2 fragment), optionally incombination with radiotherapy and surgery. He receives localised injections of polypeptide (e.g. 3 days in advance of a standard regiment of fractionated radiation) and within 2 weeks a significant shrinkage of the tumour is observed which enables complete surgical resection. 12 months later no tumour recurrence is observable with the usual detection tools (colonoscopy, CT scan, PET scan, etc.) and the level of carcinoembryonic antigen (CEA) returned to the normal.
Example 35
Assessment of Proliferation Changes, Inhibition of Cellular Secretion and Concomitant SNARE Cleavage after Treatment of In Vitro Cultured Renal Cancer Cell Lines with a Polypeptide of the Present Invention
[0583] Methods:
[0584] Proliferation Analysis:
[0585] An appropriate volume containing 1000 cells of a suitable renal cancer cell line, for example A498, ACHN or 786-0, are seeded into the wells of a 96-well cell culture plate in a suitable growth medium supplemented with 10% Foetal Bovine Serum and incubated at 37° C. in a humidified atmosphere with 5% CO2. Cells are allowed to adhere overnight after which treatment with an EGF-liganded LHA molecule, such as an LHA molecule with a C-terminal presented EGF ligand, is initiated. After 24 hours, the treatment media are removed, cell monolayers are washed to remove traces of LHA-EGF and fresh medium applied to cells. After a further 24 hours a conventional colorimetric proliferation assay (based on the determination of the cleavage of the tetrazolium salt WST-1 to formazan by cellular enzymes) is performed. Specifically, WST-1 is added to the culture medium for 4 hours after which the optical density at 440 nm is determined for each treatment. FIGS. 50-52 demonstrate inhibition of in vitro proliferation from a renal cell carcinoma line by an EGF-LHA fusion.
[0586] Cellular Secretion Analysis:
[0587] 10,000 cells of a suitable renal cancer cell line (for example 786-0, A498 or ACHN) are seeded in the wells of a 24 well plate in an appropriate culture medium containing 10% foetal bovine serum. Plates are incubated overnight in an incubator at 37° C. in a humidified atmosphere with 5% CO2 to allow cells to adhere. Cell cultures are then treated with an appropriate polypeptide of the present invention which targets a specific receptor on the cells of interest (for the cell lines detailed above an appropriate molecule would be an EGF-liganded LHA as these 3 cell lines all express EGF receptors, FIG. 48). After 24 hours, the treatment medium is removed, cell monolayers refed with fresh culture medium and 24-48 hours later samples of the culture medium are removed to fresh microfuge tubes which are subsequently spun at 1500 rpm for 5 minutes to remove floating cells. The resultant supernatants are removed to fresh tubes and aliquots then used for the quantification of specific analytes (for example VEGF, TNF-alpha) by enzyme-linked immunosorbent assay or ELISA (FIG. 53). An example of analysis of inhibition of a secretion (GRP) from a small cell lung carcinoma cell line is provided in FIG. 54.
[0588] Western Blot Analysis to Detect SNARE Cleavage Due to Cellular Uptake of Polypeptides of the Present Invention:
[0589] Cell cultures are treated with polypeptides as detailed above for the analysis of cellular secretions. After removal of culture medium for subsequent ELISA analysis, cell monolayers in each well are washed three times with phosphate-buffered saline and protein extracts prepared by cellular lysis in standard Laemmli sample buffer. Cellular protein are separated on 12% SDS-polyacrylamide gels and electrophoretically transferred to a nitrocellulose membrane. After blocking of the membrane, primary antisera are used to probe for the SNARE protein of interest and detection of full length and/or cleaved forms is enabled by a peroxidase-conjugated anti-species IgG. In the case of treatment of the renal cell line 786-0 with an LHA-EGF molecule, cleavage of the SNARE SNAP-25 is observed (see FIG. 49).
[0590] SEQ ID NOs
TABLE-US-00012 1. DNA sequence of LHN/A ggatccATGGAGTTCGTTAACAAACAGTTCAACTATAAAGACCCAGTTAA CGGTGTTGACATTGCTTACATCAAAATCCCGAACGCTGGCCAGATGCAGC CGGTAAAGGCATTCAAAATCCACAACAAAATCTGGGTTATCCCGGAACGT GATACCTTTACTAACCCGGAAGAAGGTGACCTGAACCCGCCACCGGAAGC GAAACAGGTGCCGGTATCTTACTATGACTCCACCTACCTGTCTACCGATA ACGAAAAGGACAACTACCTGAAAGGTGTTACTAAACTGTTCGAGCGTATT TACTCCACCGACCTGGGCCGTATGCTGCTGACTAGCATCGTTCGCGGTAT CCCGTTCTGGGGCGGTTCTACCATCGATACCGAACTGAAAGTAATCGACA CTAACTGCATCAACGTTATTCAGCCGGACGGTTCCTATCGTTCCGAAGAA CTGAACCTGGTGATCATCGGCCCGTCTGCTGATATCATCCAGTTCGAGTG TCTGAGCTTTGGTCACGAAGTTCTGAACCTCACCCGTAACGGCTACGGTT CCACTCAGTACATCCGTTTCTCTCCGGACTTCACCTTCGGTTTTGAAGAA TCCCTGGAAGTAGACACGAACCCACTGCTGGGCGCTGGTAAATTCGCAAC TGATCCTGCGGTTACCCTGGCTCACGAACTGATTCATGCAGGCCACCGCC TGTACGGTATCGCCATCAATCCGAACCGTGTCTTCAAAGTTAACACCAAC GCGTATTACGAGATGTCCGGTCTGGAAGTTAGCTTCGAAGAACTGCGTAC TTTTGGCGGTCACGACGCTAAATTCATCGACTCTCTGCAAGAAAACGAGT TCCGTCTGTACTACTATAACAAGTTCAAAGATATCGCATCCACCCTGAAC AAAGCGAAATCCATCGTGGGTACCACTGCTTCTCTCCAGTACATGAAGAA CGTTTTTAAAGAAAAATACCTGCTCAGCGAAGACACCTCCGGCAAATTCT CTGTAGACAAGTTGAAATTCGATAAACTTTACAAAATGCTGACTGAAATT TACACCGAAGACAACTTCGTTAAGTTCTTTAAAGTTCTGAACCGCAAAAC CTATCTGAACTTCGACAAGGCAGTATTCAAAATCAACATCGTGCCGAAAG TTAACTACACTATCTACGATGGTTTCAACCTGCGTAACACCAACCTGGCT GCTAATTTTAACGGCCAGAACACGGAAATCAACAACATGAACTTCACAAA ACTGAAAAACTTCACTGGTCTGTTCGAGTTTTACAAGCTGCTGTGCGTCG ACGGCATCATTACCTCCAAAACTAAATCTGACGATGACGATAAAAACAAA GCGCTGAACCTGCAGTGTATCAAGGTTAACAACTGGGATTTATTCTTCAG CCCGAGTGAAGACAACTTCACCAACGACCTGAACAAAGGTGAAGAAATCA CCTCAGATACTAACATCGAAGCAGCCGAAGAAAACATCTCGCTGGACCTG ATCCAGCAGTACTACCTGACCTTTAATTTCGACAACGAGCCGGAAAACAT TTCTATCGAAAACCTGAGCTCTGATATCATCGGCCAGCTGGAACTGATGC CGAACATCGAACGTTTCCCAAACGGTAAAAAGTACGAGCTGGACAAATAT ACCATGTTCCACTACCTGCGCGCGCAGGAATTTGAACACGGCAAATCCCG TATCGCACTGACTAACTCCGTTAACGAAGCTCTGCTCAACCCGTCCCGTG TATACACCTTCTTCTCTAGCGACTACGTGAAAAAGGTCAACAAAGCGACT GAAGCTGCAATGTTCTTGGGTTGGGTTGAACAGCTTGTTTATGATTTTAC CGACGAGACGTCCGAAGTATCTACTACCGACAAAATTGCGGATATCACTA TCATCATCCCGTACATCGGTCCGGCTCTGAACATTGGCAACATGCTGTAC AAAGACGACTTCGTTGGCGCACTGATCTTCTCCGGTGCGGTGATCCTGCT GGAGTTCATCCCGGAAATCGCCATCCCGGTACTGGGCACCTTTGCTCTGG TTTCTTACATTGCAAACAAGGTTCTGACTGTACAAACCATCGACAACGCG CTGAGCAAACGTAACGAAAAATGGGATGAAGTTTACAAATATATCGTGAC CAACTGGCTGGCTAAGGTTAATACTCAGATCGACCTCATCCGCAAAAAAA TGAAAGAAGCACTGGAAAACCAGGCGGAAGCTACCAAGGCAATCATTAAC TACCAGTACAACCAGTACACCGAGGAAGAAAAAAACAACATCAACTTCAA CATCGACGATCTGTCCTCTAAACTGAACGAATCCATCAACAAAGCTATGA TCAACATCAACAAGTTCCTGAACCAGTGCTCTGTAAGCTATCTGATGAAC TCCATGATCCCGTACGGTGTTAAACGTCTGGAGGACTTCGATGCGTCTCT GAAAGACGCCCTGCTGAAATACATTTACGACAACCGTGGCACTCTGATCG GTCAGGTTGATCGTCTGAAGGACAAAGTGAACAATACCTTATCGACCGAC ATCCCTTTTCAGCTCAGTAAATATGTCGATAACCAACGCCTTTTGTCCAC Tctagaataatgaaagctt 2. DNA sequence of LHN/B GGATCCATGCCGGTTACCATCAACAACTTCAACTACAACGACCCGATCGA CAACAACAACATCATTATGATGGAACCGCCGTTCGCACGTGGTACCGGAC GTTACTACAAGGCTTTTAAGATCACCGACCGTATCTGGATCATCCCGGAA CGTTACACCTTCGGTTACAAACCTGAGGACTTCAACAAGAGTAGCGGGAT TTTCAATCGTGACGTCTGCGAGTACTATGATCCAGATTATCTGAATACCA ACGATAAGAAGAACATATTCCTTCAGACTATGATTAAACTCTTCAACCGT ATCAAAAGCAAACCGCTCGGTGAAAAACTCCTCGAAATGATTATCAACGG TATCCCGTACCTCGGTGACCGTCGTGTCCCGCTTGAAGAGTTCAACACCA ACATCGCAAGCGTCACCGTCAACAAACTCATCAGCAACCCAGGTGAAGTC GAACGTAAAAAAGGTATCTTCGCAAACCTCATCATCTTCGGTCCGGGTCC GGTCCTCAACGAAAACGAAACCATCGACATCGGTATCCAGAACCACTTCG CAAGCCGTGAAGGTTTCGGTGGTATCATGCAGATGAAATTCTGCCCGGAA TACGTCAGTGTCTTCAACAACGTCCAGGAAAACAAAGGTGCAAGCATCTT CAACCGTCGTGGTTACTTCAGCGACCCGGCACTCATCCTCATGCATGAAC TCATCCACGTCCTCCACGGTCTCTACGGTATCAAAGTTGACGACCTCCCG ATCGTCCCGAACGAGAAGAAATTCTTCATGCAGAGCACCGACGCAATCCA GGCTGAGGAACTCTACACCTTCGGTGGCCAAGACCCAAGTATCATAACCC CGTCCACCGACAAAAGCATCTACGACAAAGTCCTCCAGAACTTCAGGGGT ATCGTGGACAGACTCAACAAAGTCCTCGTCTGCATCAGCGACCCGAACAT CAATATCAACATATACAAGAACAAGTTCAAAGACAAGTACAAATTCGTCG AGGACAGCGAAGGCAAATACAGCATCGACGTAGAAAGTTTCGACAAGCTC TACAAAAGCCTCATGTTCGGTTTCACCGAAACCAACATCGCCGAGAACTA CAAGATCAAGACAAGGGCAAGTTACTTCAGCGACAGCCTCCCGCCTGTCA AAATCAAGAACCTCTTAGACAACGAGATTTACACAATTGAAGAGGGCTTC AACATCAGTGACAAAGACATGGAGAAGGAATACAGAGGTCAGAACAAGGC TATCAACAAACAGGCATACGAGGAGATCAGCAAAGAACACCTCGCAGTCT ACAAGATCCAGATGTGCGTCGACGGCATCATTACCTCCAAAACTAAATCT GACGATGACGATAAAAACAAAGCGCTGAACCTGCAGTGCATCGACGTTGA CAACGAAGACCTGTTCTTCATCGCTGACAAAAACAGCTTCAGTGACGACC TGAGCAAAAACGAACGTATCGAATACAACACCCAGAGCAACTACATCGAA AACGACTTCCCGATCAACGAACTGATCCTGGACACCGACCTGATAAGTAA AATCGAACTGCCGAGCGAAAACACCGAAAGTCTGACCGACTTCAACGTTG ACGTTCCGGTTTACGAAAAACAGCCGGCTATCAAGAAAATCTTCACCGAC GAAAACACCATCTTCCAGTACCTGTACAGCCAGACCTTCCCGCTGGACAT CCGTGACATCAGTCTGACCAGCAGTTTCGACGACGCTCTGCTGTTCAGCA ACAAAGTTTACAGTTTCTTCAGCATGGACTACATCAAAACCGCTAACAAA GTTGTTGAAGCAGGGCTGTTCGCTGGTTGGGTTAAACAGATCGTTAACGA CTTCGTTATCGAAGCTAACAAAAGCAACACTATGGACAAAATCGCTGACA TCAGTCTGATCGTTCCGTACATCGGTCTGGCTCTGAACGTTGGTAACGAA ACCGCTAAAGGTAACTTTGAAAACGCTTTCGAGATCGCTGGTGCAAGCAT CCTGCTGGAGTTCATCCCGGAACTGCTGATCCCGGTTGTTGGTGCTTTCC TGCTGGAAAGTTACATCGACAACAAAAACAAGATCATCAAAACCATCGAC AACGCTCTGACCAAACGTAACGAAAAATGGAGTGATATGTACGGTCTGAT CGTTGCTCAGTGGCTGAGCACCGTCAACACCCAGTTCTACACCATCAAAG AAGGTATGTACAAAGCTCTGAACTACCAGGCTCAGGCTCTGGAAGAGATC ATCAAATACCGTTACAACATCTACAGTGAGAAGGAAAAGAGTAACATCAA CATCGACTTCAACGACATCAACAGCAAACTGAACGAAGGTATCAACCAGG CTATCGACAACATCAACAACTTCATCAACGGTTGCAGTGTTAGCTACCTG ATGAAGAAGATGATCCCGCTGGCTGTTGAAAAACTGCTGGACTTCGACAA CACCCTGAAAAAGAACCTGCTGAACTACATCGACGAAAACAAGCTGTACC TGATCGGTAGTGCTGAATACGAAAAAAGTAAAGTGAACAAATACCTGAAG ACCATCATGCCGTTCGACCTGAGTATCTACACCAACGACACCATCCTGAT CGAAATGTTCAACAAATACAACTCtctagaataatgaaagctt 3. DNA sequence of LHN/C ggatccATGCCGATCACCATCAACAACTTCAACTACAGCGATCCGGTGGA TAACAAAAACATCCTGTACCTGGATACCCATCTGAATACCCTGGCGAACG AACCGGAAAAAGCGTTTCGTATCACCGGCAACATTTGGGTTATTCCGGAT CGTTTTAGCCGTAACAGCAACCCGAATCTGAATAAACCGCCGCGTGTTAC CAGCCCGAAAAGCGGTTATTACGATCCGAACTATCTGAGCACCGATAGCG ATAAAGATACCTTCCTGAAAGAAATCATCAAACTGTTCAAACGCATCAAC AGCCGTGAAATTGGCGAAGAACTGATCTATCGCCTGAGCACCGATATTCC GTTTCCGGGCAACAACAACACCCCGATCAACACCTTTGATTTCGATGTGG ATTTCAACAGCGTTGATGTTAAAACCCGCCAGGGTAACAATTGGGTGAAA ACCGGCAGCATTAACCCGAGCGTGATTATTACCGGTCCGCGCGAAAACAT TATTGATCCGGAAACCAGCACCTTTAAACTGACCAACAACACCTTTGCGG CGCAGGAAGGTTTTGGCGCGCTGAGCATTATTAGCATTAGCCCGCGCTTT ATGCTGACCTATAGCAACGCGACCAACGATGTTGGTGAAGGCCGTTTCAG CAAAAGCGAATTTTGCATGGACCCGATCCTGATCCTGATGCATGAACTGA ACCATGCGATGCATAACCTGTATGGCATCGCGATTCCGAACGATCAGACC ATTAGCAGCGTGACCAGCAACATCTTTTACAGCCAGTACAACGTGAAACT GGAATATGCGGAAATCTATGCGTTTGGCGGTCCGACCATTGATCTGATTC CGAAAAGCGCGCGCAAATACTTCGAAGAAAAAGCGCTGGATTACTATCGC
AGCATTGCGAAACGTCTGAACAGCATTACCACCGCGAATCCGAGCAGCTT CAACAAATATATCGGCGAATATAAACAGAAACTGATCCGCAAATATCGCT TTGTGGTGGAAAGCAGCGGCGAAGTTACCGTTAACCGCAATAAATTCGTG GAACTGTACAACGAACTGACCCAGATCTTCACCGAATTTAACTATGCGAA AATCTATAACGTGCAGAACCGTAAAATCTACCTGAGCAACGTGTATACCC CGGTGACCGCGAATATTCTGGATGATAACGTGTACGATATCCAGAACGGC TTTAACATCCCGAAAAGCAACCTGAACGTTCTGTTTATGGGCCAGAACCT GAGCCGTAATCCGGCGCTGCGTAAAGTGAACCCGGAAAACATGCTGTACC TGTTCACCAAATTTTGCGTCGACGCGATTGATGGTCGTAGCCTGTACAAC AAAACCCTGCAGTGTCGTGAACTGCTGGTGAAAAACACCGATCTGCCGTT TATTGGCGATATCAGCGATGTGAAAACCGATATCTTCCTGCGCAAAGATA TCAACGAAGAAACCGAAGTGATCTACTACCCGGATAACGTGAGCGTTGAT CAGGTGATCCTGAGCAAAAACACCAGCGAACATGGTCAGCTGGATCTGCT GTATCCGAGCATTGATAGCGAAAGCGAAATTCTGCCGGGCGAAAACCAGG TGTTTTACGATAACCGTACCCAGAACGTGGATTACCTGAACAGCTATTAC TACCTGGAAAGCCAGAAACTGAGCGATAACGTGGAAGATTTTACCTTTAC CCGCAGCATTGAAGAAGCGCTGGATAACAGCGCGAAAGTTTACACCTATT TTCCGACCCTGGCGAACAAAGTTAATGCGGGTGTTCAGGGCGGTCTGTTT CTGATGTGGGCGAACGATGTGGTGGAAGATTTCACCACCAACATCCTGCG TAAAGATACCCTGGATAAAATCAGCGATGTTAGCGCGATTATTCCGTATA TTGGTCCGGCGCTGAACATTAGCAATAGCGTGCGTCGTGGCAATTTTACC GAAGCGTTTGCGGTTACCGGTGTGACCATTCTGCTGGAAGCGTTTCCGGA ATTTACCATTCCGGCGCTGGGTGCGTTTGTGATCTATAGCAAAGTGCAGG AACGCAACGAAATCATCAAAACCATCGATAACTGCCTGGAACAGCGTATT AAACGCTGGAAAGATAGCTATGAATGGATGATGGGCACCTGGCTGAGCCG TATTATCACCCAGTTCAACAACATCAGCTACCAGATGTACGATAGCCTGA ACTATCAGGCGGGTGCGATTAAAGCGAAAATCGATCTGGAATACAAAAAA TACAGCGGCAGCGATAAAGAAAACATCAAAAGCCAGGTTGAAAACCTGAA AAACAGCCTGGATGTGAAAATTAGCGAAGCGATGAATAACATCAACAAAT TCATCCGCGAATGCAGCGTGACCTACCTGTTCAAAAACATGCTGCCGAAA GTGATCGATGAACTGAACGAATTTGATCGCAACACCAAAGCGAAACTGAT CAACCTGATCGATAGCCACAACATTATTCTGGTGGGCGAAGTGGATAAAC TGAAAGCGAAAGTTAACAACAGCTTCCAGAACACCATCCCGTTTAACATC TTCAGCTATACCAACAACAGCCTGCTGAAAGATATCATCAACGAATACTT CAAtctagaataatgaaagctt 4. DNA sequence of LHN/D ggatccATGACGTGGCCAGTTAAGGATTTCAACTACTCAGATCCTGTAAA TGACAACGATATTCTGTACCTTCGCATTCCACAAAATAAACTGATCACCA CACCAGTCAAAGCATTCATGATTACTCAAAACATTTGGGTCATTCCAGAA CGCTTTTCTAGTGACACAAATCCGAGTTTATCTAAACCTCCGCGTCCGAC GTCCAAATATCAGAGCTATTACGATCCCTCATATCTCAGTACGGACGAAC AAAAAGATACTTTCCTTAAAGGTATCATTAAACTGTTTAAGCGTATTAAT GAGCGCGATATCGGGAAAAAGTTGATTAATTATCTTGTTGTGGGTTCCCC GTTCATGGGCGATAGCTCTACCCCCGAAGACACTTTTGATTTTACCCGTC ATACGACAAACATCGCGGTAGAGAAGTTTGAGAACGGATCGTGGAAAGTC ACAAACATCATTACACCTAGCGTCTTAATTTTTGGTCCGCTGCCAAACAT CTTAGATTATACAGCCAGCCTGACTTTGCAGGGGCAACAGTCGAATCCGA GTTTCGAAGGTTTTGGTACCCTGAGCATTCTGAAAGTTGCCCCGGAATTT CTGCTCACTTTTTCAGATGTCACCAGCAACCAGAGCTCAGCAGTATTAGG AAAGTCAATTTTTTGCATGGACCCGGTTATTGCACTGATGCACGAACTGA CGCACTCTCTGCATCAACTGTATGGGATCAACATCCCCAGTGACAAACGT ATTCGTCCCCAGGTGTCTGAAGGATTTTTCTCACAGGATGGGCCGAACGT CCAGTTCGAAGAGTTGTATACTTTCGGAGGCCTGGACGTAGAGATCATTC CCCAGATTGAGCGCAGTCAGCTGCGTGAGAAGGCATTGGGCCATTATAAG GATATTGCAAAACGCCTGAATAACATTAACAAAACGATTCCATCTTCGTG GATCTCGAATATTGATAAATATAAGAAAATTTTTAGCGAGAAATATAATT TTGATAAAGATAATACAGGTAACTTTGTGGTTAACATTGACAAATTCAAC TCCCTTTACAGTGATTTGACGAATGTAATGAGCGAAGTTGTGTATAGTTC CCAATACAACGTTAAGAATCGTACCCATTACTTCTCTCGTCACTACCTGC CGGTTTTCGCGAACATCCTTGACGATAATATTTACACTATTCGTGACGGC TTTAACTTGACCAACAAGGGCTTCAATATTGAAAATTCAGGCCAGAACAT TGAACGCAACCCGGCCTTGCAGAAACTGTCGAGTGAATCCGTGGTTGACC TGTTTACCAAAGTCTGCGTCGACAAAAGCGAAGAGAAGCTGTACGATGAC GATGACAAAGATCGTTGGGGATCGTCCCTGCAGTGTATTAAAGTGAAAAA CAATCGGCTGCCTTATGTAGCAGATAAAGATAGCATTAGTCAGGAGATTT TCGAAAATAAAATTATCACTGACGAAACCAATGTTCAGAATTATTCAGAT AAATTTTCACTGGACGAAAGCATCTTAGATGGCCAAGTTCCGATTAACCC GGAAATTGTTGATCCGTTACTGCCGAACGTGAATATGGAACCGTTAAACC TCCCTGGCGAAGAGATCGTATTTTATGATGACATTACGAAATATGTGGAC TACCTTAATTCTTATTACTATTTGGAAAGCCAGAAACTGTCCAATAACGT GGAAAACATTACTCTGACCACAAGCGTGGAAGAGGCTTTAGGCTACTCAA ATAAGATTTATACCTTCCTCCCGTCGCTGGCGGAAAAAGTAAATAAAGGT GTGCAGGCTGGTCTGTTCCTCAACTGGGCGAATGAAGTTGTCGAAGACTT TACCACGAATATTATGAAAAAGGATACCCTGGATAAAATCTCCGACGTCT CGGTTATTATCCCATATATTGGCCCTGCGTTAAATATCGGTAATAGTGCG CTGCGGGGGAATTTTAACCAGGCCTTTGCTACCGCGGGCGTCGCGTTCCT CCTGGAGGGCTTTCCTGAATTTACTATCCCGGCGCTCGGTGTTTTTACAT TTTACTCTTCCATCCAGGAGCGTGAGAAAATTATCAAAACCATCGAAAAC TGCCTGGAGCAGCGGGTGAAACGCTGGAAAGATTCTTATCAATGGATGGT GTCAAACTGGTTATCTCGCATCACGACCCAATTCAACCATATTAATTACC AGATGTATGATAGTCTGTCGTACCAAGCTGACGCCATTAAAGCCAAAATT GATCTGGAATATAAAAAGTACTCTGGTAGCGATAAGGAGAACATCAAAAG CCAGGTGGAGAACCTTAAGAATAGTCTGGATGTGAAAATCTCTGAAGCTA TGAATAACATTAACAAATTCATTCGTGAATGTTCGGTGACGTACCTGTTC AAGAATATGCTGCCAAAAGTTATTGATGAACTGAATAAATTTGATCTGCG TACCAAAACCGAACTTATCAACCTCATCGACTCCCACAACATTATCCTTG TGGGCGAAGTGGATCGTCTGAAGGCCAAAGTAAACGAGAGCTTTGAAAAT ACGATGCCGTTTAATATTTTTTCATATACCAATAACTCCTTGCTGAAAGA TATCATCAATGAATATTTCAATctagattaataagctt 5. DNA sequence of the CT-CST28 linker GGATCCGTCGACCTGCAGGGTCTAGAAGGCGGTGGCGGTAGCGGCGGTGG CGGTAGCGGCGGTGGCGGTAGCGGCGGTGGCGGTAGCGCACTAGTGCAGG AAAGACCTCCATTACAACAACCTCCACATCGCGATAAGAAACCATGTAAG AATTTCTTTTGGAAAACATTTAGCAGTTGCAAATGATAAAAGCTT 6. DNA sequence of the LHD-CT-CST28 fusion GGATCCATGACGTGGCCAGTTAAGGATTTCAACTACTCAGATCCTGTAAA TGACAACGATATTCTGTACCTTCGCATTCCACAAAATAAACTGATCACCA CACCAGTCAAAGCATTCATGATTACTCAAAACATTTGGGTCATTCCAGAA CGCTTTTCTAGTGACACAAATCCGAGTTTATCTAAACCTCCGCGTCCGAC GTCCAAATATCAGAGCTATTACGATCCCTCATATCTCAGTACGGACGAAC AAAAAGATACTTTCCTTAAAGGTATCATTAAACTGTTTAAGCGTATTAAT GAGCGCGATATCGGGAAAAAGTTGATTAATTATCTTGTTGTGGGTTCCCC GTTCATGGGCGATAGCTCTACCCCCGAAGACACTTTTGATTTTACCCGTC ATACGACAAACATCGCGGTAGAGAAGTTTGAGAACGGATCGTGGAAAGTC ACAAACATCATTACACCTAGCGTCTTAATTTTTGGTCCGCTGCCAAACAT CTTAGATTATACAGCCAGCCTGACTTTGCAGGGGCAACAGTCGAATCCGA GTTTCGAAGGTTTTGGTACCCTGAGCATTCTGAAAGTTGCCCCGGAATTT CTGCTCACTTTTTCAGATGTCACCAGCAACCAGAGCTCAGCAGTATTAGG AAAGTCAATTTTTTGCATGGACCCGGTTATTGCACTGATGCACGAACTGA CGCACTCTCTGCATCAACTGTATGGGATCAACATCCCCAGTGACAAACGT ATTCGTCCCCAGGTGTCTGAAGGATTTTTCTCACAGGATGGGCCGAACGT CCAGTTCGAAGAGTTGTATACTTTCGGAGGCCTGGACGTAGAGATCATTC CCCAGATTGAGCGCAGTCAGCTGCGTGAGAAGGCATTGGGCCATTATAAG GATATTGCAAAACGCCTGAATAACATTAACAAAACGATTCCATCTTCGTG GATCTCGAATATTGATAAATATAAGAAAATTTTTAGCGAGAAATATAATT TTGATAAAGATAATACAGGTAACTTTGTGGTTAACATTGACAAATTCAAC TCCCTTTACAGTGATTTGACGAATGTAATGAGCGAAGTTGTGTATAGTTC CCAATACAACGTTAAGAATCGTACCCATTACTTCTCTCGTCACTACCTGC CGGTTTTCGCGAACATCCTTGACGATAATATTTACACTATTCGTGACGGC TTTAACTTGACCAACAAGGGCTTCAATATTGAAAATTCAGGCCAGAACAT TGAACGCAACCCGGCCTTGCAGAAACTGTCGAGTGAATCCGTGGTTGACC TGTTTACCAAAGTCTGCGTCGACAAAAGCGAAGAGAAGCTGTACGATGAC GATGACAAAGATCGTTGGGGATCGTCCCTGCAGTGTATTAAAGTGAAAAA CAATCGGCTGCCTTATGTAGCAGATAAAGATAGCATTAGTCAGGAGATTT TCGAAAATAAAATTATCACTGACGAAACCAATGTTCAGAATTATTCAGAT AAATTTTCACTGGACGAAAGCATCTTAGATGGCCAAGTTCCGATTAACCC GGAAATTGTTGATCCGTTACTGCCGAACGTGAATATGGAACCGTTAAACC
TCCCTGGCGAAGAGATCGTATTTTATGATGACATTACGAAATATGTGGAC TACCTTAATTCTTATTACTATTTGGAAAGCCAGAAACTGTCCAATAACGT GGAAAACATTACTCTGACCACAAGCGTGGAAGAGGCTTTAGGCTACTCAA ATAAGATTTATACCTTCCTCCCGTCGCTGGCGGAAAAAGTAAATAAAGGT GTGCAGGCTGGTCTGTTCCTCAACTGGGCGAATGAAGTTGTCGAAGACTT TACCACGAATATTATGAAAAAGGATACCCTGGATAAAATCTCCGACGTCT CGGTTATTATCCCATATATTGGCCCTGCGTTAAATATCGGTAATAGTGCG CTGCGGGGGAATTTTAACCAGGCCTTTGCTACCGCGGGCGTCGCGTTCCT CCTGGAGGGCTTTCCTGAATTTACTATCCCGGCGCTCGGTGTTTTTACAT TTTACTCTTCCATCCAGGAGCGTGAGAAAATTATCAAAACCATCGAAAAC TGCCTGGAGCAGCGGGTGAAACGCTGGAAAGATTCTTATCAATGGATGGT GTCAAACTGGTTATCTCGCATCACGACCCAATTCAACCATATTAATTACC AGATGTATGATAGTCTGTCGTACCAAGCTGACGCCATTAAAGCCAAAATT GATCTGGAATATAAAAAGTACTCTGGTAGCGATAAGGAGAACATCAAAAG CCAGGTGGAGAACCTTAAGAATAGTCTGGATGTGAAAATCTCTGAAGCTA TGAATAACATTAACAAATTCATTCGTGAATGTTCGGTGACGTACCTGTTC AAGAATATGCTGCCAAAAGTTATTGATGAACTGAATAAATTTGATCTGCG TACCAAAACCGAACTTATCAACCTCATCGACTCCCACAACATTATCCTTG TGGGCGAAGTGGATCGTCTGAAGGCCAAAGTAAACGAGAGCTTTGAAAAT ACGATGCCGTTTAATATTTTTTCATATACCAATAACTCCTTGCTGAAAGA TATCATCAATGAATATTTCAATCTAGAAGGCGGTGGCGGTAGCGGCGGTG GCGGTAGCGGCGGTGGCGGTAGCGCACTAGTGCAGGAAAGACCTCCATTA CAACAACCTCCACATCGCGATAAGAAACCATGTAAGAATTTCTTTTGGAA AACATTTAGCAGTTGCAAAtaataagctt 7. Protein sequence of the LHD-CT-CST28 fusion TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDKSEEKLYDDDDK DRWGSSLQCIKVKNNRLPYVADKDSISQEIFENKIITDETNVQNYSDKFS LDESILDGQVPINPEIVDPLLPNVNMEPLNLPGEEIVFYDDITKYVDYLN SYYYLESQKLSNNVENITLTTSVEEALGYSNKIYTFLPSLAEKVNKGVQA GLFLNWANEVVEDFTTNIMKKDTLDKISDVSVIIPYIGPALNIGNSALRG NFNQAFATAGVAFLLEGFPEFTIPALGVFTFYSSIQEREKIIKTIENCLE QRVKRWKDSYQWMVSNWLSRITTQFNHINYQMYDSLSYQADAIKAKIDLE YKKYSGSDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNM LPKVIDELNKFDLRTKTELINLIDSHNIILVGEVDRLKAKVNESFENTMP FNIFSYTNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVQERPPLQQP PHRDKKPCKNFFWKTFSSCK 8. DNA sequence of the CP-EGF linker ggatccGTCGACaacaacaataacaacaacaataacaacaacgacgatga cgataaaAATTCAGATAGCGAATGTCCACTTAGTCACGACGGGTACTGTT TGCATGATGGTGTGTGTATGTATATAGAAGCACTAGACAAATACGCTTGC AATTGCGTAGTTGGCTATATAGGAGAGCGATGCCAATATAGAGATCTGAA GTGGTGGGAGTTAAGGGCAgaagcggcagccaaagaagcagccgctaagg cgctgcagagtctagaataataagctt 9. DNA sequence of the LHA-CP-EGF fusion ggatccATGGAGTTCGTTAACAAACAGTTCAACTATAAAGACCCAGTTAA CGGTGTTGACATTGCTTACATCAAAATCCCGAACGCTGGCCAGATGCAGC CGGTAAAGGCATTCAAAATCCACAACAAAATCTGGGTTATCCCGGAACGT GATACCTTTACTAACCCGGAAGAAGGTGACCTGAACCCGCCACCGGAAGC GAAACAGGTGCCGGTATCTTACTATGACTCCACCTACCTGTCTACCGATA ACGAAAAGGACAACTACCTGAAAGGTGTTACTAAACTGTTCGAGCGTATT TACTCCACCGACCTGGGCCGTATGCTGCTGACTAGCATCGTTCGCGGTAT CCCGTTCTGGGGCGGTTCTACCATCGATACCGAACTGAAAGTAATCGACA CTAACTGCATCAACGTTATTCAGCCGGACGGTTCCTATCGTTCCGAAGAA CTGAACCTGGTGATCATCGGCCCGTCTGCTGATATCATCCAGTTCGAGTG TCTGAGCTTTGGTCACGAAGTTCTGAACCTCACCCGTAACGGCTACGGTT CCACTCAGTACATCCGTTTCTCTCCGGACTTCACCTTCGGTTTTGAAGAA TCCCTGGAAGTAGACACGAACCCACTGCTGGGCGCTGGTAAATTCGCAAC TGATCCTGCGGTTACCCTGGCTCACGAACTGATTCATGCAGGCCACCGCC TGTACGGTATCGCCATCAATCCGAACCGTGTCTTCAAAGTTAACACCAAC GCGTATTACGAGATGTCCGGTCTGGAAGTTAGCTTCGAAGAACTGCGTAC TTTTGGCGGTCACGACGCTAAATTCATCGACTCTCTGCAAGAAAACGAGT TCCGTCTGTACTACTATAACAAGTTCAAAGATATCGCATCCACCCTGAAC AAAGCGAAATCCATCGTGGGTACCACTGCTTCTCTCCAGTACATGAAGAA CGTTTTTAAAGAAAAATACCTGCTCAGCGAAGACACCTCCGGCAAATTCT CTGTAGACAAGTTGAAATTCGATAAACTTTACAAAATGCTGACTGAAATT TACACCGAAGACAACTTCGTTAAGTTCTTTAAAGTTCTGAACCGCAAAAC CTATCTGAACTTCGACAAGGCAGTATTCAAAATCAACATCGTGCCGAAAG TTAACTACACTATCTACGATGGTTTCAACCTGCGTAACACCAACCTGGCT GCTAATTTTAACGGCCAGAACACGGAAATCAACAACATGAACTTCACAAA ACTGAAAAACTTCACTGGTCTGTTCGAGTTTTACAAGCTGCTGTGCGTCG ACaacaacaataacaacaacaataacaacaacgacgatgacgataaaAAT TCAGATAGCGAATGTCCACTTAGTCACGACGGGTACTGTTTGCATGATGG TGTGTGTATGTATATAGAAGCACTAGACAAATACGCTTGCAATTGCGTAG TTGGCTATATAGGAGAGCGATGCCAATATAGAGATCTGAAGTGGTGGGAG TTAAGGGCAgaagcggcagccaaagaagcagccgctaaggcgCTGCAGTG TATCAAGGTTAACAACTGGGATTTATTCTTCAGCCCGAGTGAAGACAACT TCACCAACGACCTGAACAAAGGTGAAGAAATCACCTCAGATACTAACATC GAAGCAGCCGAAGAAAACATCTCGCTGGACCTGATCCAGCAGTACTACCT GACCTTTAATTTCGACAACGAGCCGGAAAACATTTCTATCGAAAACCTGA GCTCTGATATCATCGGCCAGCTGGAACTGATGCCGAACATCGAACGTTTC CCAAACGGTAAAAAGTACGAGCTGGACAAATATACCATGTTCCACTACCT GCGCGCGCAGGAATTTGAACACGGCAAATCCCGTATCGCACTGACTAACT CCGTTAACGAAGCTCTGCTCAACCCGTCCCGTGTATACACCTTCTTCTCT AGCGACTACGTGAAAAAGGTCAACAAAGCGACTGAAGCTGCAATGTTCTT GGGTTGGGTTGAACAGCTTGTTTATGATTTTACCGACGAGACGTCCGAAG TATCTACTACCGACAAAATTGCGGATATCACTATCATCATCCCGTACATC GGTCCGGCTCTGAACATTGGCAACATGCTGTACAAAGACGACTTCGTTGG CGCACTGATCTTCTCCGGTGCGGTGATCCTGCTGGAGTTCATCCCGGAAA TCGCCATCCCGGTACTGGGCACCTTTGCTCTGGTTTCTTACATTGCAAAC AAGGTTCTGACTGTACAAACCATCGACAACGCGCTGAGCAAACGTAACGA AAAATGGGATGAAGTTTACAAATATATCGTGACCAACTGGCTGGCTAAGG TTAATACTCAGATCGACCTCATCCGCAAAAAAATGAAAGAAGCACTGGAA AACCAGGCGGAAGCTACCAAGGCAATCATTAACTACCAGTACAACCAGTA CACCGAGGAAGAAAAAAACAACATCAACTTCAACATCGACGATCTGTCCT CTAAACTGAACGAATCCATCAACAAAGCTATGATCAACATCAACAAGTTC CTGAACCAGTGCTCTGTAAGCTATCTGATGAACTCCATGATCCCGTACGG TGTTAAACGTCTGGAGGACTTCGATGCGTCTCTGAAAGACGCCCTGCTGA AATACATTTACGACAACCGTGGCACTCTGATCGGTCAGGTTGATCGTCTG AAGGACAAAGTGAACAATACCTTATCGACCGACATCCCTTTTCAGCTCAG TAAATATGTCGATAACCAACGCCTTTTGTCCACTctagaataatgaaagc tt 10. Protein sequence of the LHA-CP-EGF fusion EFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTF TNPEEGDLNPPPEAKQVPVSYYDSTYLSTDNEKDNYLKGVTKLFERIYST DLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNL VIIGPSADIIQFECLSFGHEVLNLTRNGYGSTQYIRFSPDFTFGFEESLE VDTNPLLGAGKFATDPAVTLAHELIHAGHRLYGIAINPNRVFKVNTNAYY EMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDIASTLNKAK SIVGTTASLQYMKNVFKEKYLLSEDTSGKFSVDKLKFDKLYKMLTEIYTE DNFVKFFKVLNRKTYLNFDKAVFKINIVPKVNYTIYDGFNLRNTNLAANF NGQNTEINNMNFTKLKNFTGLFEFYKLLCVDNNNNNNNNNNDDDDKNSDS ECPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWELRA EAAAKEAAAKALQCIKVNNWDLFFSPSEDNFTNDLNKGEEITSDTNIEAA EENISLDLIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIERFPNG KKYELDKYTMFHYLRAQEFEHGKSRIALTNSVNEALLNPSRVYTFFSSDY VKKVNKATEAAMFLGWVEQLVYDFTDETSEVSTTDKIADITIIIPYIGPA LNIGNMLYKDDFVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANKVL TVQTIDNALSKRNEKWDEVYKYIVTNWLAKVNTQIDLIRKKMKEALENQA EATKAIINYQYNQYTEEEKNNINFNIDDLSSKLNESINKAMININKFLNQ
CSVSYLMNSMIPYGVKRLEDFDASLKDALLKYIYDNRGTLIGQVDRLKDK VNNTLSTDIPFQLSKYVDNQRLLST 11. Protein sequence of LHN/A EFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTF TNPEEGDLNPPPEAKQVPVSYYDSTYLSTDNEKDNYLKGVTKLFERIYST DLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNL VIIGPSADIIQFECKSFGHEVLNLTRNGYGSTQYIRFSPDFTFGFEESLE VDTNPLLGAGKFATDPAVTLAHELIHAGHRLYGIAINPNRVFKVNTNAYY EMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDIASTLNKAK SIVGTTASLQYMKNVFKEKYLLSEDTSGKFSVDKLKFDKLYKMLTEIYTE DNFVKFFKVLNRKTYLNFDKAVFKINIVPKVNYTIYDGFNLRNTNLAANF NGQNTEINNMNFTKLKNFTGLFEFYKLLCVDGIITSKTKSDDDDKNKALN LQCIKVNNWDLFFSPSEDNFTNDLNKGEEITSDTNIEAAEENISLDLIQQ YYLTFNFDNEPENISIENLSSDIIGQLELMPNIERFPNGKKYELDKYTMF HYLRAQEFEHGKSRIALTNSVNEALLNPSRVYTFFSSDYVKKVNKATEAA MFLGWVEQLVYDFTDETSEVSTTDKIADITIIIPYIGPALNIGNMLYKDD FVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANKVLTVQTIDNALSK RNEKWDEVYKYIVTNWLAKVNTQIDLIRKKMKEALENQAEATKAIINYQY NQYTEEEKNNINFNIDDLSSKLNESINKAMININKFLNQCSVSYLMNSMI PYGVKRLEDFDASLKDALLKYIYDNRGTLIGQVDRLKDKVNNTLSTDIPF QLSKYVDNQRLLST 12. Protein sequence of LHN/B PVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYT FGYKPEDFNKSSGIFNRDVCEYYDPDYLNTNDKKNIFLQTMIKLFNRIKS KPLGEKLLEMIINGIPYLGDRRVPLEEFNTNIASVTVNKLISNPGEVERK KGIFANLIIFGPGPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVS VFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLYGIKVDDLPIVP NEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVD RLNKVLVCISDPNININIYKNKFKDKYKFVEDSEGKYSIDVESFDKLYKS LMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNIS DKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCVDEEKLYDDDDKDR WGSSLQCIDVDNEDLFFIADKNSFSDDLSKNERIEYNTQSNYIENDFPIN ELILDTDLISKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENTIFQ YLYSQTFPLDIRDISLTSSFDDALLFSNKVYSFFSMDYIKTANKVVEAGL FAGWVKQIVNDFVIEANKSNTMDAIADISLIVPYIGLALNVGNETAKGNF ENAFEIAGASILLEFIPELLIPVVGAFLLESYIDNKNKIIKTIDNALTKR NEKWSDMYGLIVAQWLSTVNTQFYTIKEGMYKALNYQAQALEEIIKYRYN IYSEKEKSNINIDFNDINSKLNEGINQAIDNINNFINGCSVSYLMKKMIP LAVEKLLDFDNTLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIMPFD LSIYTNDTILIEMFNKYNS 13. Protein sequence of LHN/C PITINNFNYSDPVDNKNILYLDTHLNTLANEPEKAFRITGNIWVIPDRFS RNSNPNLNKPPRVTSPKSGYYDPNYLSTDSDKDTFLKEIIKLFKRINSRE IGEELIYRLSTDIPFPGNNNTPINTFDFDVDFNSVDVKTRQGNNWVKTGS INPSVIITGPRENIIDPETSTFKLTNNTFAAQEGFGALSIISISPRFMLT YSNATNDVGEGRFSKSEFCMDPILILMHELNHAMHNLYGIAIPNDQTISS VTSNIFYSQYNVKLEYAEIYAFGGPTIDLIPKSARKYFEEKALDYYRSIA KRLNSITTANPSSFNKYIGEYKQKLIRKYRFVVESSGEVTVNRNKFVELY NELTQIFTEFNYAKIYNVQNRKIYLSNVYTPVTANILDDNVYDIQNGFNI PKSNLNVLFMGQNLSRNPALRKVNPENMLYLFTKFCVDAIDGRSLYNKTL QCRELLVKNTDLPFIGDISDVKTDIFLRKDINEETEVIYYPDNVSVDQVI LSKNTSEHGQLDLLYPSIDSESEILPGENQVFYDNRTQNVDYLNSYYYLE SQKLSDNVEDFTFTRSIEEALDNSAKVYTYFPTLANKVNAGVQGGLFLMW ANDVVEDFTTNILRKDTLDKISDVSAIIPYIGPALNISNSVRRGNFTEAF AVTGVTILLEAFPEFTIPALGAFVIYSKVQERNEIIKTIDNCLEQRIKRW KDSYEWMMGTWLSRIITQFNNISYQMYDSLNYQAGAIKAKIDLEYKKYSG SDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNMLPKVID ELNEFDRNTKAKLINLIDSHNIILVGEVDKLKAKVNNSFQNTIPFNIFSY TNNSLLKDIINEYFN 14. Protein sequence of LHN/D TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDKSEEKLYDDDDK DRWGSSLQCIKVKNNRLPYVADKDSISQEIFENKIITDETNVQNYSDKFS LDESILDGQVPINPEIVDPLLPNVNMEPLNLPGEEIVFYDDITKYVDYLN SYYYLESQKLSNNVENITLTTSVEEALGYSNKIYTFLPSLAEKVNKGVQA GLFLNWANEVVEDFTTNIMKKDTLDKISDVSVIIPYIGPALNIGNSALRG NFNQAFATAGVAFLLEGFPEFTIPALGVFTFYSSIQEREKIIKTIENCLE QRVKRWKDSYQWMVSNWLSRITTQFNHINYQMYDSLSYQADAIKAKIDLE YKKYSGSDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNM LPKVIDELNKFDLRTKTELINLIDSHNIILVGEVDRLKAKVNESFENTMP FNIFSYTNNSLLKDIINEYFN 15. Synthesised GnRH peptide pGlu-His-Trp-Ser-Tyr-Gly-Cys-Arg-Pro-Gly-NH2 16. Protein sequence of the LHB-CT-SST28 fusion PVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYT FGYKPEDFNKSSGIFNRDVCEYYDPDYLNTNDKKNIFLQTMIKLFNRIKS KPLGEKLLEMIINGIPYLGDRRVPLEEFNTNIASVTVNKLISNPGEVERK KGIFANLIIFGPGPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVS VFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLYGIKVDDLPIVP NEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVD RLNKVLVCISDPNININIYKNKFKDKYKFVEDSEGKYSIDVESFDKLYKS LMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNIS DKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCVDGIITSKTKSDDD DKNKALNLQCIDVDNEDLFFIADKNSFSDDLSKNERIEYNTQSNYIENDF PINELILDTDLISKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENT IFQYLYSQTFPLDIRDISLTSSFDDALLFSNKVYSFFSMDYIKTANKVVE AGLFAGWVKQIVNDFVIEANKSNTMDKIADISLIVPYIGLALNVGNETAK GNFENAFEIAGASILLEFIPELLIPVVGAFLLESYIDNKNKIIKTIDNAL TKRNEKWSDMYGLIVAQWLSTVNTQFYTIKEGMYKALNYQAQALEEIIKY RYNIYSEKEKSNINIDFNDINSKLNEGINQAIDNINNFINGCSVSYLMKK MIPLAVEKLLDFDNTLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIM PFDLSIYTNDTILIEMFNKYNSLEGGGGSGGGGSGGGGSALDSANSNPAM APRERKAGCKNFFWKTFTSC 17. Protein sequence of the LHA-CP-SST28 fusion EFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTF TNPEEGDLNPPPEAKQVPVSYYDSTYLSTDNEKDNYLKGVTKLFERIYST DLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNL VIIGPSADIIQFECLSFGHEVLNLTRNGYGSTQYIRFSPDFTFGFEESLE VDTNPLLGAGKFATDPAVTLAHELIHAGHRLYGIAINPNRVFKVNTNAYY EMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDIASTLNKAK SIVGTTASLQYMKNVFKEKYLLSEDTSGKFSVDKLKFDKLYKMLTEIYTE DNFVKFFKVLNRKTYLNFDKAVFKINIVPKVNYTIYDGFNLRNTNLAANF NGQNTEINNMNFTKLKNFTGLFEFYKLLCVDGIITSKTKSDDDDKSANSN PAMAPRERKAGCKNFFWKTFTSCALAGGGGSGGGGSGGGGSALVLQCIKV NNWDLFFSPSEDNFTNDLNKGEEITSDTNIEAAEENISLDLIQQYYLTFN FDNEPENISIENLSSDIIGQLELMPNIERFPNGKKYELDKYTMFHYLRAQ EFEHGKSRIALTNSVNEALLNPSRVYTFFSSDYVKKVNKATEAAMFLGWV EQLVYDFTDETSEVSTTDKIADITIIIPYIGPALNIGNMLYKDDFVGALI FSGAVILLEFIPEIAIPVLGTFALVSYIANKVLTVQTIDNALSKRNEKWD EVYKYIVTNWLAKVNTQIDLIRKKMKEALENQAEATKAIINYQYNQYTEE EKNNINFNIDDLSSKLNESINKAMININKFLNQCSVSYLMNSMIPYGVKR LEDFDASLKDALLKYIYDNRGTLIGQVDRLKDKVNNTLSTDIPFQLSKYV DNQRLLST 18. Protein sequence of the LHD-CT-EGF fusion TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDKSEEKLYDDDDK
DRWGSSLQCIKVKNNRLPYVADKDSISQEIFENKIITDETNVQNYSDKFS LDESILDGQVPINPEIVDPLLPNVNMEPLNLPGEEIVFYDDITKYVDYLN SYYYLESQKLSNNVENITLTTSVEEALGYSNKIYTFLPSLAEKVNKGVQA GLFLNWANEVVEDFTTNIMKKDTLDKISDVSVIIPYIGPALNIGNSALRG NFNQAFATAGVAFLLEGFPEFTIPALGVFTFYSSIQEREKIIKTIENCLE QRVKRWKDSYQWMVSNWLSRITTQFNHINYQMYDSLSYQADAIKAKIDLE YKKYSGSDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNM LPKVIDELNKFDLRTKTELINLIDSHNIILVGEVDRLKAKVNESFENTMP FNIFSYTNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVNSDSECPLS HDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWELR 19. Protein sequence of the LHD-CT-VIP fusion TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDKSEEKLYDDDDK DRWGSSLQCIKVKNNRLPYVADKDSISQEIFENKIITDETNVQNYSDKFS LDESILDGQVPINPEIVDPLLPNVNMEPLNLPGEEIVFYDDITKYVDYLN SYYYLESQKLSNNVENITLTTSVEEALGYSNKIYTFLPSLAEKVNKGVQA GLFLNWANEVVEDFTTNIMKKDTLDKISDVSVIIPYIGPALNIGNSALRG NFNQAFATAGVAFLLEGFPEFTIPALGVFTFYSSIQEREKIIKTIENCLE QRVKRWKDSYQWMVSNWLSRITTQFNHINYQMYDSLSYQADAIKAKIDLE YKKYSGSDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNM LPKVIDELNKFDLRTKTELINLIDSHNIILVGEVDRLKAKVNESFENTMP FNIFSYTNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVHSDAVFTDN YTRLRKQMAVKKYLNSILN 20. Protein sequence of the LHC-CT-IGF1 fusion PITINNFNYSDPVDNKNILYLDTHLNTLANEPEKAFRITGNIWVIPDRFS RNSNPNLNKPPRVTSPKSGYYDPNYLSTDSDKDTFLKEIIKLFKRINSRE IGEELIYRLSTDIPFPGNNNTPINTFDFDVDFNSVDVKTRQGNNWVKTGS INPSVIITGPRENIIDPETSTFKLTNNTFAAQEGFGALSIISISPRFMLT YSNATNDVGEGRFSKSEFCMDPILILMHELNHAMHNLYGIAIPNDQTISS VTSNIFYSQYNVKLEYAEIYAFGGPTIDLIPKSARKYFEEKALDYYRSIA KRLNSITTANPSSFNKYIGEYKQKLIRKYRFVVESSGEVTVNRNKFVELY NELTQIFTEFNYAKIYNVQNRKIYLSNVYTPVTANILDDNVYDIQNGFNI PKSNLNVLFMGQNLSRNPALRKVNPENMLYLFTKFCVDAIDGRSLYNKTL QCRELLVKNTDLPFIGDISDVKTDIFLRKDINEETEVIYYPDNVSVDQVI LSKNTSEHGQLDLLYPSIDSESEILPGENQVFYDNRTQNVDYLNSYYYLE SQKLSDNVEDFTFTRSIEEALDNSAKVYTYFPTLANKVNAGVQGGLFLMW ANDVVEDFTTNILRKDTLDKISDVSAIIPYIGPALNISNSVRRGNFTEAF AVTGVTILLEAFPEFTIPALGAFVIYSKVQERNEIIKTIDNCLEQRIKRW KDSYEWMMGTWLSRIITQFNNISYQMYDSLNYQAGAIKAKIDLEYKKYSG SDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNMLPKVID ELNEFDRNTKAKLINLIDSHNIILVGEVDKLKAKVNNSFQNTIPFNIFSY TNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVGPETLCGAELVDALQ FVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDLRRLEMYCAPLK PAKSA 21. Protein sequence of the LHD-CT-IGF1 fusion TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDKSEEKLYDDDDK DRWGSSLQCIKVKNNRLPYVADKDSISQEIFENKIITDETNVQNYSDKFS LDESILDGQVPINPEIVDPLLPNVNMEPLNLPGEEIVFYDDITKYVDYLN SYYYLESQKLSNNVENITLTTSVEEALGYSNKIYTFLPSLAEKVNKGVQA GLFLNWANEVVEDFTTNIMKKDTLDKISDVSVIIPYIGPALNIGNSALRG NFNQAFATAGVAFLLEGFPEFTIPALGVFTFYSSIQEREKIIKTIENCLE QRVKRWKDSYQWMVSNWLSRITTQFNHINYQMYDSLSYQADAIKAKIDLE YKKYSGSDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNM LPKVIDELNKFDLRTKTELINLIDSHNIILVGEVDRLKAKVNESFENTMP FNIFSYTNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVGPETLCGAE LVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDLRRLEM YCAPLKPAKSA 22. Protein sequence of the LHC-CT-VIP fusion PITINNFNYSDPVDNKNILYLDTHLNTLANEPEKAFRITGNIWVIPDRFS RNSNPNLNKPPRVTSPKSGYYDPNYLSTDSDKDTFLKEIIKLFKRINSRE IGEELIYRLSTDIPFPGNNNTPINTFDFDVDFNSVDVKTRQGNNWVKTGS INPSVIITGPRENIIDPETSTFKLTNNTFAAQEGFGALSIISISPRFMLT YSNATNDVGEGRFSKSEFCMDPILILMHELNHAMHNLYGIAIPNDQTISS VTSNIFYSQYNVKLEYAEIYAFGGPTIDLIPKSARKYFEEKALDYYRSIA KRLNSITTANPSSFNKYIGEYKQKLIRKYRFVVESSGEVTVNRNKFVELY NELTQIFTEFNYAKIYNVQNRKIYLSNVYTPVTANILDDNVYDIQNGFNI PKSNLNVLFMGQNLSRNPALRKVNPENMLYLFTKFCVDAIDGRSLYNKTL QCRELLVKNTDLPFIGDISDVKTDIFLRKDINEETEVIYYPDNVSVDQVI LSKNTSEHGQLDLLYPSIDSESEILPGENQVFYDNRTQNVDYLNSYYYLE SQKLSDNVEDFTFTRSIEEALDNSAKVYTYFPTLANKVNAGVQGGLFLMW ANDVVEDFTTNILRKDTLDKISDVSAIIPYIGPALNISNSVRRGNFTEAF AVTGVTILLEAFPEFTIPALGAFVIYSKVQERNEIIKTIDNCLEQRIKRW KDSYEWMMGTWLSRIITQFNNISYQMYDSLNYQAGAIKAKIDLEYKKYSG SDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNMLPKVID ELNEFDRNTKAKLINLIDSHNIILVGEVDKLKAKVNNSFQNTIPFNIFSY TNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVHSDAVFTDNYTRLRK QMAVKKYLNSILN 23. Protein sequence of the LHC-CT-GnRH fusion PITINNFNYSDPVDNKNILYLDTHLNTLANEPEKAFRITGNIWVIPDRFS RNSNPNLNKPPRVTSPKSGYYDPNYLSTDSDKDTFLKEIIKLFKRINSRE IGEELIYRLSTDIPFPGNNNTPINTFDFDVDFNSVDVKTRQGNNWVKTGS INPSVIITGPRENIIDPETSTFKLTNNTFAAQEGFGALSIISISPRFMLT YSNATNDVGEGRFSKSEFCMDPILILMHELNHAMHNLYGIAIPNDQTISS VTSNIFYSQYNVKLEYAEIYAFGGPTIDLIPKSARKYFEEKALDYYRSIA KRLNSITTANPSSFNKYIGEYKQKLIRKYRFVVESSGEVTVNRNKFVELY NELTQIFTEFNYAKIYNVQNRKIYLSNVYTPVTANILDDNVYDIQNGFNI PKSNLNVLFMGQNLSRNPALRKVNPENMLYLFTKFCVDAIDGRSLYNKTL QCRELLVKNTDLPFIGDISDVKTDIFLRKDINEETEVIYYPDNVSVDQVI LSKNTSEHGQLDLLYPSIDSESEILPGENQVFYDNRTQNVDYLNSYYYLE SQKLSDNVEDFTFTRSIEEALDNSAKVYTYFPTLANKVNAGVQGGLFLMW ANDVVEDFTTNILRKDTLDKISDVSAIIPYIGPALNISNSVRRGNFTEAF AVTGVTILLEAFPEFTIPALGAFVIYSKVQERNEIIKTIDNCLEQRIKRW KDSYEWMMGTWLSRIITQFNNISYQMYDSLNYQAGAIKAKIDLEYKKYSG SDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNMLPKVID ELNEFDRNTKAKLINLIDSHNIILVGEVDKLKAKVNNSFQNTIPFNIFSY TNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVMKPIQKLLAGLILLT WCVEGCSSQHWSYGLRPGGKRDAENLIDSFQEIVKEVGQLAETQRFECTT HQPRSPLRDLKGALESLIEEETGQKKI 24. Protein sequence of the LHD-CT-GnRH fusion TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDKSEEKLYDDDDK DRWGSSLQCIKVKNNRLPYVADKDSISQEIFENKIITDETNVQNYSDKFS LDESILDGQVPINPEIVDPLLPNVNMEPLNLPGEEIVFYDDITKYVDYLN SYYYLESQKLSNNVENITLTTSVEEALGYSNKIYTFLPSLAEKVNKGVQA GLFLNWANEVVEDFTTNIMKKDTLDKISDVSVIIPYIGPALNIGNSALRG NFNQAFATAGVAFLLEGFPEFTIPALGVFTFYSSIQEREKIIKTIENCLE
QRVKRWKDSYQWMVSNWLSRITTQFNHINYQMYDSLSYQADAIKAKIDLE YKKYSGSDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNM LPKVIDELNKFDLRTKTELINLIDSHNIILVGEVDRLKAKVNESFENTMP FNIFSYTNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSGGGGSALVMKPI QKLLAGLILLTWCVEGCSSQHWSYGLRPGGKRDAENLIDSFQEIVKEVGQ LAETQRFECTTHQPRSPLRDLKGALESLIEEETGQKKI 25. Protein sequence of the LHD-CT-GRP fusion TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDKSEEKLYDDDDK DRWGSSLQCIKVKNNRLPYVADKDSISQEIFENKIITDETNVQNYSDKFS LDESILDGQVPINPEIVDPLLPNVNMEPLNLPGEEIVFYDDITKYVDYLN SYYYLESQKLSNNVENITLTTSVEEALGYSNKIYTFLPSLAEKVNKGVQA GLFLNWANEVVEDFTTNIMKKDTLDKISDVSVIIPYIGPALNIGNSALRG NFNQAFATAGVAFLLEGFPEFTIPALGVFTFYSSIQEREKIIKTIENCLE QRVKRWKDSYQWMVSNWLSRITTQFNHINYQMYDSLSYQADAIKAKIDLE YKKYSGSDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNM LPKVIDELNKFDLRTKTELINLIDSHNIILVGEVDRLKAKVNESFENTMP FNIFSYTNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVGNHWAVGHL M 26. Protein sequence of the LHB-CT-GRP fusion PVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYT FGYKPEDFNKSSGIFNRDVCEYYDPDYLNTNDKKNIFLQTMIKLFNRIKS KPLGEKLLEMIINGIPYLGDRRVPLEEFNTNIASVTVNKLISNPGEVERK KGIFANLIIFGPGPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVS VFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLYGIKVDDLPIVP NEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVD RLNKVLVCISDPNININIYKNKFKDKYKFVEDSEGKYSIDVESFDKLYKS LMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNIS DKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCVDEEKLYDDDDKDR WGSSLQCIDVDNEDLFFIADKNSFSDDLSKNERIEYNTQSNYIENDFPIN ELILDTDLISKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENTIFQ YLYSQTFPLDIRDISLTSSFDDALLFSNKVYSFFSMDYIKTANKVVEAGL FAGWVKQIVNDFVIEANKSNTMDAIADISLIVPYIGLALNVGNETAKGNF ENAFEIAGASILLEFIPELLIPVVGAFLLESYIDNKNKIIKTIDNALTKR NEKWSDMYGLIVAQWLSTVNTQFYTIKEGMYKALNYQAQALEEIIKYRYN IYSEKEKSNINIDFNDINSKLNEGINQAIDNINNFINGCSVSYLMKKMIP LAVEKLLDFDNTLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIMPFD LSIYTNDTILIEMFNKYNSLEGGGGSGGGGSGGGGSALVGNHWAVGHLM 27. Protein sequence of the LHC-CT-LIF fusion PITINNFNYSDPVDNKNILYLDTHLNTLANEPEKAFRITGNIWVIPDRFS RNSNPNLNKPPRVTSPKSGYYDPNYLSTDSDKDTFLKEIIKLFKRINSRE IGEELIYRLSTDIPFPGNNNTPINTFDFDVDFNSVDVKTRQGNNWVKTGS INPSVIITGPRENIIDPETSTFKLTNNTFAAQEGFGALSIISISPRFMLT YSNATNDVGEGRFSKSEFCMDPILILMHELNHAMHNLYGIAIPNDQTISS VTSNIFYSQYNVKLEYAEIYAFGGPTIDLIPKSARKYFEEKALDYYRSIA KRLNSITTANPSSFNKYIGEYKQKLIRKYRFVVESSGEVTVNRNKFVELY NELTQIFTEFNYAKIYNVQNRKIYLSNVYTPVTANILDDNVYDIQNGFNI PKSNLNVLFMGQNLSRNPALRKVNPENMLYLFTKFCVDAIDGRSLYNKTL QCRELLVKNTDLPFIGDISDVKTDIFLRKDINEETEVIYYPDNVSVDQVI LSKNTSEHGQLDLLYPSIDSESEILPGENQVFYDNRTQNVDYLNSYYYLE SQKLSDNVEDFTFTRSIEEALDNSAKVYTYFPTLANKVNAGVQGGLFLMW ANDVVEDFTTNILRKDTLDKISDVSAIIPYIGPALNISNSVRRGNFTEAF AVTGVTILLEAFPEFTIPALGAFVIYSKVQERNEIIKTIDNCLEQRIKRW KDSYEWMMGTWLSRIITQFNNISYQMYDSLNYQAGAIKAKIDLEYKKYSG SDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNMLPKVID ELNEFDRNTKAKLINLIDSHNIILVGEVDKLKAKVNNSFQNTIPFNIFSY TNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVSPLPITPVNATCAIR HPCHNNLMNQIRSQLAQLNGSANALFILYYTAQGEPFPNNLDKLCGPNVT DFPPFHANGTEKAKLVELYRIVVYLGTSLGNITRDQKILNPSALSLHSKL NATADILRGLLSNVLCRLCSKYHVGHVDVTYGPDTSGKDVFQKKKLGCQL LGKYKQIIAVLAQAF 28. Protein sequence of the LHB-CP-LIF fusion PVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYT FGYKPEDFNKSSGIFNRDVCEYYDPDYLNTNDKKNIFLQTMIKLFNRIKS KPLGEKLLEMIINGIPYLGDRRVPLEEFNTNIASVTVNKLISNPGEVERK KGIFANLIIFGPGPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVS VFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLYGIKVDDLPIVP NEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVD RLNKVLVCISDPNININIYKNKFKDKYKFVEDSEGKYSIDVESFDKLYKS LMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNIS DKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCVDNNNNNNNNNNDD DDKSPLPITPVNATCAIRHPCHNNLMNQIRSQLAQLNGSANALFILYYTA QGEPFPNNLDKLCGPNVTDFPPFHANGTEKAKLVELYRIVVYLGTSLGNI TRDQKILNPSALSLHSKLNATADILRGLLSNVLCRLCSKYHVGHVDVTYG PDTSGKDVFQKKKLGCQLLGKYKQIIAVLAQAFAEAAAKEAAAKALQCID VDNEDLFFIADKNSFSDDLSKNERIEYNTQSNYIENDFPINELILDTDLI SKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENTIFQYLYSQTFPL DIRDISLTSSFDDALLFSNKVYSFFSMDYIKTANKVVEAGLFAGWVKQIV NDFVIEANKSNTMDKIADISLIVPYIGLALNVGNETAKGNFENAFEIAGA SILLEFIPELLIPVVGAFLLESYIDNKNKIIKTIDNALTKRNEKWSDMYG LIVAQWLSTVNTQFYTIKEGMYKALNYQAQALEEIIKYRYNIYSEKEKSN INIDFNDINSKLNEGINQAIDNINNFINGCSVSYLMKKMIPLAVEKLLDF DNTLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIMPFDLSIYTNDTI LIEMFNKYNS 29. Protein sequence of the LHC-CT-FGF1 fusion PITINNFNYSDPVDNKNILYLDTHLNTLANEPEKAFRITGNIWVIPDRFS RNSNPNLNKPPRVTSPKSGYYDPNYLSTDSDKDTFLKEIIKLFKRINSRE IGEELIYRLSTDIPFPGNNNTPINTFDFDVDFNSVDVKTRQGNNWVKTGS INPSVIITGPRENIIDPETSTFKLTNNTFAAQEGFGALSIISISPRFMLT YSNATNDVGEGRFSKSEFCMDPILILMHELNHAMHNLYGIAIPNDQTISS VTSNIFYSQYNVKLEYAEIYAFGGPTIDLIPKSARKYFEEKALDYYRSIA KRLNSITTANPSSFNKYIGEYKQKLIRKYRFVVESSGEVTVNRNKFVELY NELTQIFTEFNYAKIYNVQNRKIYLSNVYTPVTANILDDNVYDIQNGFNI PKSNLNVLFMGQNLSRNPALRKVNPENMLYLFTKFCVDAIDGRSLYNKTL QCRELLVKNTDLPFIGDISDVKTDIFLRKDINEETEVIYYPDNVSVDQVI LSKNTSEHGQLDLLYPSIDSESEILPGENQVFYDNRTQNVDYLNSYYYLE SQKLSDNVEDFTFTRSIEEALDNSAKVYTYFPTLANKVNAGVQGGLFLMW ANDVVEDFTTNILRKDTLDKISDVSAIIPYIGPALNISNSVRRGNFTEAF AVTGVTILLEAFPEFTIPALGAFVIYSKVQERNEIIKTIDNCLEQRIKRW KDSYEWMMGTWLSRIITQFNNISYQMYDSLNYQAGAIKAKIDLEYKKYSG SDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNMLPKVID ELNEFDRNTKAKLINLIDSHNIILVGEVDKLKAKVNNSFQNTIPFNIFSY TNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALVMFNLPPGNYKKPKLL YCSNGGHFLRILPDGTVDGTRDRSDQHIQLQLSAESVGEVYIKSTETGQY LAMDTDGLLYGSQTPNEECLFLERLEENHYNTYISKKHAEKNWFVGLKKN GSCKRGPRTHYGQKAILFLPLPVSSD 30. Protein sequence of the LHA-CP-FGF1 fusion EFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTF TNPEEGDLNPPPEAKQVPVSYYDSTYLSTDNEKDNYLKGVTKLFERIYST DLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNL VIIGPSADIIQFECLSFGHEVLNLTRNGYGSTQYIRFSPDFTFGFEESLE VDTNPLLGAGKFATDPAVTLAHELIHAGHRLYGIAINPNRVFKVNTNAYY EMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDIASTLNKAK SIVGTTASLQYMKNVFKEKYLLSEDTSGKFSVDKLKFDKLYKMLTEIYTE DNFVKFFKVLNRKTYLNFDKAVFKINIVPKVNYTIYDGFNLRNTNLAANF NGQNTEINNMNFTKLKNFTGLFEFYKLLCVDGIITSKTKSDDDDKMFNLP PGNYKKPKLLYCSNGGHFLRILPDGTVDGTRDRSDQHIQLQLSAESVGEV YIKSTETGQYLAMDTDGLLYGSQTPNEECLFLERLEENHYNTYISKKHAE KNWFVGLKKNGSCKRGPRTHYGQKAILFLPLPVSSDGGGGSGGGGSGGGG SALVLQCIKVNNWDLFFSPSEDNFTNDLNKGEEITSDTNIEAAEENISLD LIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIERFPNGKKYELDK YTMFHYLRAQEFEHGKSRIALTNSVNEALLNPSRVYTFFSSDYVKKVNKA
TEAAMFLGWVEQLVYDFTDETSEVSTTDKIADITIIIPYIGPALNIGNML YKDDFVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANKVLTVQTIDN ALSKRNEKWDEVYKYIVTNWLAKVNTQIDLIRKKMKEALENQAEATKAII NYQYNQYTEEEKNNINFNIDDLSSKLNESINKAMININKFLNQCSVSYLM NSMIPYGVKRLEDFDASLKDALLKYIYDNRGTLIGQVDRLKDKVNNTLST DIPFQLSKYVDNQRLLST 31. Protein sequence of the LHA-CT-FGF9 fusion EFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTF TNPEEGDLNPPPEAKQVPVSYYDSTYLSTDNEKDNYLKGVTKLFERIYST DLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNL VIIGPSADIIQFECLSFGHEVLNLTRNGYGSTQYIRFSPDFTFGFEESLE VDTNPLLGAGKFATDPAVTLAHELIHAGHRLYGIAINPNRVFKVNTNAYY EMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDIASTLNKAK SIVGTTASLQYMKNVFKEKYLLSEDTSGKFSVDKLKFDKLYKMLTEIYTE DNFVKFFKVLNRKTYLNFDKAVFKINIVPKVNYTIYDGFNLRNTNLAANF NGQNTEINNMNFTKLKNFTGLFEFYKLLCVDGIITSKTKSDDDDKNKALN LQCIKVNNWDLFFSPSEDNFTNDLNKGEEITSDTNIEAAEENISLDLIQQ YYLTFNFDNEPENISIENLSSDIIGQLELMPNIERFPNGKKYELDKYTMF HYLRAQEFEHGKSRIALTNSVNEALLNPSRVYTFFSSDYVKKVNKATEAA MFLGWVEQLVYDFTDETSEVSTTDKIADITIIIPYIGPALNIGNMLYKDD FVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANKVLTVQTIDNALSK RNEKWDEVYKYIVTNWLAKVNTQIDLIRKKMKEALENQAEATKAIINYQY NQYTEEEKNNINFNIDDLSSKLNESINKAMININKFLNQCSVSYLMNSMI PYGVKRLEDFDASLKDALLKYIYDNRGTLIGQVDRLKDKVNNTLSTDIPF QLSKYVDNQRLLSTLEGGGGSGGGGSGGGGSGGGGSALVDHLGQSEAGGL PRGPAVTDLDHLKGILRRRQLYCRTGFHLEIFPNGTIQGTRKDHSRFGIL EFISIAVGLVSIRGVDSGLYLGMNEKGELYGSEKLTQECVFREQFEENWY NTYSSNLYKHVDTGRRYYVALNKDGTPREGTRTKRHQKFTHFLPRPVDPD KVPELYKDILSQS 32. Protein sequence of the LHC-CP-FGF9 fusion PITINNFNYSDPVDNKNILYLDTHLNTLANEPEKAFRITGNIWVIPDRFS RNSNPNLNKPPRVTSPKSGYYDPNYLSTDSDKDTFLKEIIKLFKRINSRE IGEELIYRLSTDIPFPGNNNTPINTFDFDVDFNSVDVKTRQGNNWVKTGS INPSVIITGPRENIIDPETSTFKLTNNTFAAQEGFGALSIISISPRFMLT YSNATNDVGEGRFSKSEFCMDPILILMHELNHAMHNLYGIAIPNDQTISS VTSNIFYSQYNVKLEYAEIYAFGGPTIDLIPKSARKYFEEKALDYYRSIA KRLNSITTANPSSFNKYIGEYKQKLIRKYRFVVESSGEVTVNRNKFVELY NELTQIFTEFNYAKIYNVQNRKIYLSNVYTPVTANILDDNVYDIQNGFNI PKSNLNVLFMGQNLSRNPALRKVNPENMLYLFTKFCVDNNNNNNNNNNDD DDKDHLGQSEAGGLPRGPAVTDLDHLKGILRRRQLYCRTGFHLEIFPNGT IQGTRKDHSRFGILEFISIAVGLVSIRGVDSGLYLGMNEKGELYGSEKLT QECVFREQFEENWYNTYSSNLYKHVDTGRRYYVALNKDGTPREGTRTKRH QKFTHFLPRPVDPDKVPELYKDILSQSAEAAAKEAAAKALQCRELLVKNT DLPFIGDISDVKTDIFLRKDINEETEVIYYPDNVSVDQVILSKNTSEHGQ LDLLYPSIDSESEILPGENQVFYDNRTQNVDYLNSYYYLESQKLSDNVED FTFTRSIEEALDNSAKVYTYFPTLANKVNAGVQGGLFLMWANDVVEDFTT NILRKDTLDKISDVSAIIPYIGPALNISNSVRRGNFTEAFAVTGVTILLE AFPEFTIPALGAFVIYSKVQERNEIIKTIDNCLEQRIKRWKDSYEWMMGT WLSRIITQFNNISYQMYDSLNYQAGAIKAKIDLEYKKYSGSDKENIKSQV ENLKNSLDVKISEAMNNINKFIRECSVTYLFKNMLPKVIDELNEFDRNTK AKLINLIDSHNIILVGEVDKLKAKVNNSFQNTIPFNIFSYTNNSLLKDII NEYFN 33. Protein sequence of the IgA-HNtet-CT-SST14 Fusion ESNQPEKNGTATKPENSGNTTSENGQTEPEKKLELRNVSDIELYSQTNGT YRQHVSLDGIPENTDTYFVKVKSSAFKDVYIPVASITEEKRNGQSVYKIT AKAEKLQQELENKYVDNFTFYLDKKAKEENTNFTSFSNLVKAINQNPSGT YHLAASLNANEVELGPDERSYIKDTFTGRLIGEKDGKNYAIYNLKKPLFE NLSGATVEKLSLKNVAISGKNDIGSLANEATNGTKIKQVHVDGCVDGIIT SKTKSDDDDKNKALNLQCIKIKNEDLTFIAEKNSFSEEPFQDEIVSYNTK NKPLNFNYSLDKIIVDYNLQSKITLPNDRTTPVTKGIPYAPEYKSNAAST IEIHNIDDNTIYQYLYAQKSPTTLQRITMTNSVDDALINSTKIYSYFPSV ISKVNQGAQGILFLQWVRDIIDDFTNESSQKTTIDKISDVSTIVPYIGPA LNIVKQGYEGNFIGALETTGVVLLLEYIPEITLPVIAALSIAESSTQKEK IIKTIDNFLEKRYEKWIEVYKLVKAKWLGTVNTQFQKRSYQMYRSLEYQV DAIKKIIDYEYKIYSGPDKEQIADEINNLKNKLEEKANKAMININIFMRE SSRSFLVNQMINEAKKQLLEFDTQSKNILMQYIKANSKFIGITELKKLES KINKVFSTPIPFSYSKNLDCWVDNEEDIDVLEGGGGSGGGGSGGGGSALV AGCKNFFWKTFTSC 34. Protein sequence of the IgA-HNtet-CP-SST14 fusion ESNQPEKNGTATKPENSGNTTSENGQTEPEKKLELRNVSDIELYSQTNGT YRQHVSLDGIPENTDTYFVKVKSSAFKDVYIPVASITEEKRNGQSVYKIT AKAEKLQQELENKYVDNFTFYLDKKAKEENTNFTSFSNLVKAINQNPSGT YHLAASLNANEVELGPDERSYIKDTFTGRLIGEKDGKNYAIYNLKKPLFE NLSGATVEKLSLKNVAISGKNDIGSLANEATNGTKIKQVHVDGCVDGIIT SKTKSDDDDKAGCKNFFWKTFTSCALAGGGGSGGGGSGGGGSALALQCIK IKNEDLTFIAEKNSFSEEPFQDEIVSYNTKNKPLNFNYSLDKIIVDYNLQ SKITLPNDRTTPVTKGIPYAPEYKSNAASTIEIHNIDDNTIYQYLYAQKS PTTLQRITMTNSVDDALINSTKIYSYFPSVISKVNQGAQGILFLQWVRDI IDDFTNESSQKTTIDKISDVSTIVPYIGPALNIVKQGYEGNFIGALETTG VVLLLEYIPEITLPVIAALSIAESSTQKEKIIKTIDNFLEKRYEKWIEVY KLVKAKWLGTVNTQFQKRSYQMYRSLEYQVDAIKKIIDYEYKIYSGPDKE QIADEINNLKNKLEEKANKAMININIFMRESSRSFLVNQMINEAKKQLLE FDTQSKNILMQYIKANSKFIGITELKKLESKINKVFSTPIPFSYSKNLDC WVDNEEDIDV 35. Protein sequence of the LHA-CT-SST14 fusion EFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTF TNPEEGDLNPPPEAKQVPVSYYDSTYLSTDNEKDNYLKGVTKLFERIYST DLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNL VIIGPSADIIQFECLSFGHEVLNLTRNGYGSTQYIRFSPDFTFGFEESLE VDTNPLLGAGKFATDPAVTLAHELIHAGHRLYGIAINPNRVFKVNTNAYY EMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDIASTLNKAK SIVGTTASLQYMKNVFKEKYLLSEDTSGKFSVDKLKFDKLYKMLTEIYTE DNFVKFFKVLNRKTYLNFDKAVFKINIVPKVNYTIYDGFNLRNTNLAANF NGQNTEINNMNFTKLKNFTGLFEFYKLLCVDGIITSKTKSDDDDKNKALN LQCIKVNNWDLFFSPSEDNFTNDLNKGEEITSDTNIEAAEENISLDLIQQ YYLTFNFDNEPENISIENLSSDIIGQLELMPNIERFPNGKKYELDKYTMF HYLRAQEFEHGKSRIALTNSVNEALLNPSRVYTFFSSDYVKKVNKATEAA MFLGWVEQLVYDFTDETSEVSTTDKIADITIIIPYIGPALNIGNMLYKDD FVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANKVLTVQTIDNALSK RNEKWDEVYKYIVTNWLAKVNTQIDLIRKKMKEALENQAEATKAIINYQY NQYTEEEKNNINFNIDDLSSKLNESINKAMININKFLNQCSVSYLMNSMI PYGVKRLEDFDASLKDALLKYIYDNRGTLIGQVDRLKDKVNNTLSTDIPF QLSKYVDNQRLLSTLEGGGGSGGGGSGGGGSALVAGCKNFFWKTFTSC 36. Protein sequence of the LHA-CT-EGFv3 fusion EFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTF TNPEEGDLNPPPEAKQVPVSYYDSTYLSTDNEKDNYLKGVTKLFERIYST DLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNL VIIGPSADIIQFECLSFGHEVLNLTRNGYGSTQYIRFSPDFTFGFEESLE VDTNPLLGAGKFATDPAVTLAHELIHAGHRLYGIAINPNRVFKVNTNAYY EMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDIASTLNKAK SIVGTTASLQYMKNVFKEKYLLSEDTSGKFSVDKLKFDKLYKMLTEIYTE DNFVKFFKVLNRKTYLNFDKAVFKINIVPKVNYTIYDGFNLRNTNLAANF NGQNTEINNMNFTKLKNFTGLFEFYKLLCVDGIITSKTKSDDDDKNKALN LQCIKVNNWDLFFSPSEDNFTNDLNKGEEITSDTNIEAAEENISLDLIQQ YYLTFNFDNEPENISIENLSSDIIGQLELMPNIERFPNGKKYELDKYTMF HYLRAQEFEHGKSRIALTNSVNEALLNPSRVYTFFSSDYVKKVNKATEAA MFLGWVEQLVYDFTDETSEVSTTDKIADITIIIPYIGPALNIGNMLYKDD FVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANKVLTVQTIDNALSK RNEKWDEVYKYIVTNWLAKVNTQIDLIRKKMKEALENQAEATKAIINYQY NQYTEEEKNNINFNIDDLSSKLNESINKAMININKFLNQCSVSYLMNSMI PYGVKRLEDFDASLKDALLKYIYDNRGTLIGQVDRLKDKVNNTLSTDIPF QLSKYVDNQRLLSTLEGGGGSGGGGSGGGGSGGGGSALVDNSDPKCPLSH EGYCLNDGVCMYIGTLDRYACNCVVGYVGERCQYRDLKLAELR 37. Protein sequence of the LHE-CT-IL6 fusion PKINSFNYNDPVNDRTILYIKPGGCQEFYKSFNIMKNIWIIPERNVIGTT PQDFHPPTSLKNGDSSYYDPNYLQSDEEKDRFLKIVTKIFNRINNNLSGG ILLEELSKANPYLGNDNTPDNQFHIGDASAVEIKFSNGSQHILLPNVIIM GAEPDLFETNSSNISLRNNYMPSNHGFGSIAIVTFSPEYSFRFNDNSINE
FIQDPALTLMHELIHSLHGLYGAKGITTTCIITQQQNPLITNRKGINIEE FLTFGGNDLNIITVAQYNDIYTNLLNDYRKIASKLSKVQVSNPQLNPYKD IFQEKYGLDKDASGIYSVNINKFDDILKKLYSFTEFDLATKFQVKCRETY IGQYKYFKLSNLLNDSIYNISEGYNINNLKVNFRGQNANLNPRIIKPITG RGLVKKIIRFCVDGIITSKTKSLIEGRNKALNLQCIEINNGELFFVASEN SYNDDNINTPKEIDDTVTSNNNYENDLDQVILNFNSESAPGLSDEKLNLT IQNDAYIPKYDSNGTSDIEQHDVNELNVFFYLDAQKVPEGENNVNLTSSI DTALLEQPKIYTFFSSEFINNVNKPVQAALFVSWIQQVLVDFTTEANQKS TVDKIADISIVVPYIGLALNIGNEAQKGNFKDALELLGAGILLEFEPELL IPTILVFTIKSFLGSSDNKNKVIKAINNALKERDEKWKEVYSFIVSNWMT KINTQFNKRKEQMYQALQNQVNAIKTIIESKYNSYTLEEKNELTNKYDIK QIENELNQKVSIAMNNIDRFLTESSISYLMKIINEVKINKLREYDENVKT YLLNYIIQHGSILGESQQELNSMVTDTLNNSIPFKLSSYTDDKILISYFN KFFKLEGGGGSGGGGSGGGGSGGGGSALVPPGEDSKDVAAPHRQPLTSSE RIDKQIRYILDGISALRKETCNKSNMCESSKEALAENNLNLPKMAEKDGC FQSGFNEETCLVKIITGLLEFEVYLEYLQNRFESSEEQARAVQMSTKVLI QFLQKKAKNLDAITTPDPTTNASLLTKLQAQNQWLQDMTTHLILRSFKEF LQSSLRALRQM 38. Protein sequence of the LHB-CT-IL8 fusion PVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYT FGYKPEDFNKSSGIFNRDVCEYYDPDYLNTNDKKNIFLQTMIKLFNRIKS KPLGEKLLEMIINGIPYLGDRRVPLEEFNTNIASVTVNKLISNPGEVERK KGIFANLIIFGPGPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVS VFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLYGIKVDDLPIVP NEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVD RLNKVLVCISDPNININIYKNKFKDKYKFVEDSEGKYSIDVESFDKLYKS LMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNIS DKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCVDGIITSKTKSLIE GRNKALNLQCIDVDNEDLFFIADKNSFSDDLSKNERIEYNTQSNYIENDF PINELILDTDLISKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENT IFQYLYSQTFPLDIRDISLTSSFDDALLFSNKVYSFFSMDYIKTANKVVE AGLFAGWVKQIVNDFVIEANKSNTMDKIADISLIVPYIGLALNVGNETAK GNFENAFEIAGASILLEFIPELLIPVVGAFLLESYIDNKNKIIKTIDNAL TKRNEKWSDMYGLIVAQWLSTVNTQFYTIKEGMYKALNYQAQALEEIIKY RYNIYSEKEKSNINIDFNDINSKLNEGINQAIDNINNFINGCSVSYLMKK MIPLAVEKLLDFDNTLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIM PFDLSIYTNDTILIEMFNKYNSLEGGGGSGGGGSGGGGSALVAKELRCQC IKTYSKPFHPKFIKELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQ RVVEKFLKRAENS 39. Protein sequence of the LHF-CP-GRAN4 fusion PVAINSFNYNDPVNDDTILYMQIPYEEKSKKYYKAFEIMRNVWIIPERNT IGTNPSDFDPPASLKNGSSAYYDPNYLTTDAEKDRYLKTTIKLFKRINSN PAGKVLLQEISYAKPYLGNDHTPIDEFSPVTRTTSVNIKLSTNVESSMLL NLLVLGAGPDIFESCCYPVRKLIDPDVVYDPSNYGFGSINIVTFSPEYEY TFNDISGGHNSSTESFIADPAISLAHELIHALHGLYGARGVTYEETIEVK QAPLMIAEKPIRLEEFLTFGGQDLNIITSAMKEKIYNNLLANYEKIATRL SEVNSAPPEYDINEYKDYFQWKYGLDKNADGSYTVNENKFNEIYKKLYSF TESDLANKFKVKARNTYFIKYEFLKVPNLLDDDIYTVSEGFNIGNLAVNN RGQSIKLNPKIIDSIPDKGLVEKIVKFAVENNNNNNNNNNLGCVDGIITS KTKSLIEGRDVKCDMEVSCPDGYTCCRLQSGAWGCCPFTQAVCCEDHIHC CPAGFTCDTQKGTCEALAGGGGSGGGGSGGGGSALVLQCIEVNNSELFFV ASESSYNENDINTPKEIDDTTNLNNNYENNLDEVILDYNSQTIPQISNIE NLNTLVQDNSYVPEYDSNGTSEIEEYDVVDFNVFFYLHAQKVPEGETNIS LTSSIDTALLEESKDIFFSSEFIDTINKPVNAALFIDWISKVIRDFTTEA TQKSTVDKIADISLIVPYVGLALNIIIEAEKGNFEEAFELLGVGILLEFV PELTIPVILVFTIKSYIDSYENKNKAIKAINNSLIEREAKWKEIYSWIVS NWLTRINTQFNKRKEQMYQALQNQVDAIKTAIEYKYNNYTSDEKNRLESE YNINNIEEELNKKVSLAMKNIERFMTESSISYLMKLINEAKVGKLKKYDN HVKSDLLNYILDHRSILGEQTNELSDLVTSTLNSSIPFELSSYTNDKILI IYFNRLYKT 40. Protein sequence of the LHD-CP-TGFa fusion TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDGGGGSGGGGSAD DDDKVVSHFNDCPDSHTQFCFHGTCRFLVQEDKPACVCHSGYVGARCEHA DLLALAGGGGSGGGGSGGGGSALALQCIKVKNNRLPYVADKDSISQEIFE NKIITDETNVQNYSDKFSLDESILDGQVPINPEIVDPLLPNVNMEPLNLP GEEIVFYDDITKYVDYLNSYYYLESQKLSNNVENITLTTSVEEALGYSNK IYTFLPSLAEKVNKGVQAGLFLNWANEVVEDFTTNIMKKDTLDKISDVSV IIPYIGPALNIGNSALRGNFNQAFATAGVAFLLEGFPEFTIPALGVFTFY SSIQEREKIIKTIENCLEQRVKRWKDSYQWMVSNWLSRITTQFNHINYQM YDSLSYQADAIKAKIDLEYKKYSGSDKENIKSQVENLKNSLDVKISEAMN NINKFIRECSVTYLFKNMLPKVIDELNKFDLRTKTELINLIDSHNIILVG EVDRLKAKVNESFENTMPFNIFSYTNNSLLKDIINEYFN 41. Protein sequence of the LHD-CP-TGFb fusion TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDGGGGSGGGGSAD DDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLG PCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKP KVEQLSNMIVRSCKCSALAGGGGSGGGGSGGGGSALALQCIKVKNNRLPY VADKDSISQEIFENKIITDETNVQNYSDKFSLDESILDGQVPINPEIVDP LLPNVNMEPLNLPGEEIVFYDDITKYVDYLNSYYYLESQKLSNNVENITL TTSVEEALGYSNKIYTFLPSLAEKVNKGVQAGLFLNWANEVVEDFTTNIM KKDTLDKISDVSVIIPYIGPALNIGNSALRGNFNQAFATAGVAFLLEGFP EFTIPALGVFTFYSSIQEREKIIKTIENCLEQRVKRWKDSYQWMVSNWLS RITTQFNHINYQMYDSLSYQADAIKAKIDLEYKKYSGSDKENIKSQVENL KNSLDVKISEAMNNINKFIRECSVTYLFKNMLPKVIDELNKFDLRTKTEL INLIDSHNIILVGEVDRLKAKVNESFENTMPFNIFSYTNNSLLKDIINEY FN 42. Protein sequence of the LHB-CT-TNFa fusion PVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYT FGYKPEDFNKSSGIFNRDVCEYYDPDYLNTNDKKNIFLQTMIKLFNRIKS KPLGEKLLEMIINGIPYLGDRRVPLEEFNTNIASVTVNKLISNPGEVERK KGIFANLIIFGPGPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVS VFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLYGIKVDDLPIVP NEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVD RLNKVLVCISDPNININIYKNKFKDKYKFVEDSEGKYSIDVESFDKLYKS LMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNIS DKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCVDEEKLYDDDDKDR WGSSLQCIDVDNEDLFFIADKNSFSDDLSKNERIEYNTQSNYIENDFPIN ELILDTDLISKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENTIFQ YLYSQTFPLDIRDISLTSSFDDALLFSNKVYSFFSMDYIKTANKVVEAGL FAGWVKQIVNDFVIEANKSNTMDAIADISLIVPYIGLALNVGNETAKGNF ENAFEIAGASILLEFIPELLIPVVGAFLLESYIDNKNKIIKTIDNALTKR NEKWSDMYGLIVAQWLSTVNTQFYTIKEGMYKALNYQAQALEEIIKYRYN IYSEKEKSNINIDFNDINSKLNEGINQAIDNINNFINGCSVSYLMKKMIP LAVEKLLDFDNTLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIMPFD LSIYTNDTILIEMFNKYNSLEGGGGSGGGGSGGGGSALVRSSSRTPSDKP VAHVVANPQAEGQLQWLNRRANALLANGVELRDNQLVVPSEGLYLIYSQV LFKGQGCPSTHVLLTHTISRIAVSYQTKVNLLSAIKSPCQRETPEGAEAK PWYEPIYLGGVFQLEKGDRLSAEINRPDYLDFAESGQVYFGIIAL 43. Protein sequence of the LHD-CT-SDF1 fusion TWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFS SDTNPSLSKPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERD IGKKLINYLVVGSPFMGDSSTPEDTFDFTRHTTNIAVEKFENGSWKVTNI ITPSVLIFGPLPNILDYTASLTLQGQQSNPSFEGFGTLSILKVAPEFLLT
FSDVTSNQSSAVLGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRP QVSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIA KRLNNINKTIPSSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLY SDLTNVMSEVVYSSQYNVKNRTHYFSRHYLPVFANILDDNIYTIRDGFNL TNKGFNIENSGQNIERNPALQKLSSESVVDLFTKVCVDKSEEKLYDDDDK DRWGSSLQCIKVKNNRLPYVADKDSISQEIFENKIITDETNVQNYSDKFS LDESILDGQVPINPEIVDPLLPNVNMEPLNLPGEEIVFYDDITKYVDYLN SYYYLESQKLSNNVENITLTTSVEEALGYSNKIYTFLPSLAEKVNKGVQA GLFLNWANEVVEDFTTNIMKKDTLDKISDVSVIIPYIGPALNIGNSALRG NFNQAFATAGVAFLLEGFPEFTIPALGVFTFYSSIQEREKIIKTIENCLE QRVKRWKDSYQWMVSNWLSRITTQFNHINYQMYDSLSYQADAIKAKIDLE YKKYSGSDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNM LPKVIDELNKFDLRTKTELINLIDSHNIILVGEVDRLKAKVNESFENTMP FNIFSYTNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALKPVSLSYRCP CRFFESHVARANVKHLKILNTPNCALQIVARLKNNNRQVCIDPKLKWIQE YLEKALNKRFKM 44. Protein sequence of the LHC-CT-VEGF fusion PITINNFNYSDPVDNKNILYLDTHLNTLANEPEKAFRITGNIWVIPDRFS RNSNPNLNKPPRVTSPKSGYYDPNYLSTDSDKDTFLKEIIKLFKRINSRE IGEELIYRLSTDIPFPGNNNTPINTFDFDVDFNSVDVKTRQGNNWVKTGS INPSVIITGPRENIIDPETSTFKLTNNTFAAQEGFGALSIISISPRFMLT YSNATNDVGEGRFSKSEFCMDPILILMHELNHAMHNLYGIAIPNDQTISS VTSNIFYSQYNVKLEYAEIYAFGGPTIDLIPKSARKYFEEKALDYYRSIA KRLNSITTANPSSFNKYIGEYKQKLIRKYRFVVESSGEVTVNRNKFVELY NELTQIFTEFNYAKIYNVQNRKIYLSNVYTPVTANILDDNVYDIQNGFNI PKSNLNVLFMGQNLSRNPALRKVNPENMLYLFTKFCVDAIDGRSLYNKTL QCRELLVKNTDLPFIGDISDVKTDIFLRKDINEETEVIYYPDNVSVDQVI LSKNTSEHGQLDLLYPSIDSESEILPGENQVFYDNRTQNVDYLNSYYYLE SQKLSDNVEDFTFTRSIEEALDNSAKVYTYFPTLANKVNAGVQGGLFLMW ANDVVEDFTTNILRKDTLDKISDVSAIIPYIGPALNISNSVRRGNFTEAF AVTGVTILLEAFPEFTIPALGAFVIYSKVQERNEIIKTIDNCLEQRIKRW KDSYEWMMGTWLSRIITQFNNISYQMYDSLNYQAGAIKAKIDLEYKKYSG SDKENIKSQVENLKNSLDVKISEAMNNINKFIRECSVTYLFKNMLPKVID ELNEFDRNTKAKLINLIDSHNIILVGEVDKLKAKVNNSFQNTIPFNIFSY TNNSLLKDIINEYFNLEGGGGSGGGGSGGGGSALAPMAEGGGQNHHEVVK FMDVYQRSYCHPIETLVDIFQEYPDEIEYIFKPSCVPLMRCGGCCNDEGL ECVPTEESNITMQIMRIKPHQGQHIGEMSFLQHNKCECRPKKDRARQEKK SVRGKGKGQKRKRKKSRYKSWSVYVGARCCLMPWSLPGPHPCGPCSERRK HLFVQDPQTCKCSCKNTDSRCKARQLELNERTCRCDKPRR
Sequence CWU
1
11212619DNAArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polynucleotide" 1ggatccatgg agttcgttaa caaacagttc
aactataaag acccagttaa cggtgttgac 60attgcttaca tcaaaatccc gaacgctggc
cagatgcagc cggtaaaggc attcaaaatc 120cacaacaaaa tctgggttat cccggaacgt
gataccttta ctaacccgga agaaggtgac 180ctgaacccgc caccggaagc gaaacaggtg
ccggtatctt actatgactc cacctacctg 240tctaccgata acgaaaagga caactacctg
aaaggtgtta ctaaactgtt cgagcgtatt 300tactccaccg acctgggccg tatgctgctg
actagcatcg ttcgcggtat cccgttctgg 360ggcggttcta ccatcgatac cgaactgaaa
gtaatcgaca ctaactgcat caacgttatt 420cagccggacg gttcctatcg ttccgaagaa
ctgaacctgg tgatcatcgg cccgtctgct 480gatatcatcc agttcgagtg tctgagcttt
ggtcacgaag ttctgaacct cacccgtaac 540ggctacggtt ccactcagta catccgtttc
tctccggact tcaccttcgg ttttgaagaa 600tccctggaag tagacacgaa cccactgctg
ggcgctggta aattcgcaac tgatcctgcg 660gttaccctgg ctcacgaact gattcatgca
ggccaccgcc tgtacggtat cgccatcaat 720ccgaaccgtg tcttcaaagt taacaccaac
gcgtattacg agatgtccgg tctggaagtt 780agcttcgaag aactgcgtac ttttggcggt
cacgacgcta aattcatcga ctctctgcaa 840gaaaacgagt tccgtctgta ctactataac
aagttcaaag atatcgcatc caccctgaac 900aaagcgaaat ccatcgtggg taccactgct
tctctccagt acatgaagaa cgtttttaaa 960gaaaaatacc tgctcagcga agacacctcc
ggcaaattct ctgtagacaa gttgaaattc 1020gataaacttt acaaaatgct gactgaaatt
tacaccgaag acaacttcgt taagttcttt 1080aaagttctga accgcaaaac ctatctgaac
ttcgacaagg cagtattcaa aatcaacatc 1140gtgccgaaag ttaactacac tatctacgat
ggtttcaacc tgcgtaacac caacctggct 1200gctaatttta acggccagaa cacggaaatc
aacaacatga acttcacaaa actgaaaaac 1260ttcactggtc tgttcgagtt ttacaagctg
ctgtgcgtcg acggcatcat tacctccaaa 1320actaaatctg acgatgacga taaaaacaaa
gcgctgaacc tgcagtgtat caaggttaac 1380aactgggatt tattcttcag cccgagtgaa
gacaacttca ccaacgacct gaacaaaggt 1440gaagaaatca cctcagatac taacatcgaa
gcagccgaag aaaacatctc gctggacctg 1500atccagcagt actacctgac ctttaatttc
gacaacgagc cggaaaacat ttctatcgaa 1560aacctgagct ctgatatcat cggccagctg
gaactgatgc cgaacatcga acgtttccca 1620aacggtaaaa agtacgagct ggacaaatat
accatgttcc actacctgcg cgcgcaggaa 1680tttgaacacg gcaaatcccg tatcgcactg
actaactccg ttaacgaagc tctgctcaac 1740ccgtcccgtg tatacacctt cttctctagc
gactacgtga aaaaggtcaa caaagcgact 1800gaagctgcaa tgttcttggg ttgggttgaa
cagcttgttt atgattttac cgacgagacg 1860tccgaagtat ctactaccga caaaattgcg
gatatcacta tcatcatccc gtacatcggt 1920ccggctctga acattggcaa catgctgtac
aaagacgact tcgttggcgc actgatcttc 1980tccggtgcgg tgatcctgct ggagttcatc
ccggaaatcg ccatcccggt actgggcacc 2040tttgctctgg tttcttacat tgcaaacaag
gttctgactg tacaaaccat cgacaacgcg 2100ctgagcaaac gtaacgaaaa atgggatgaa
gtttacaaat atatcgtgac caactggctg 2160gctaaggtta atactcagat cgacctcatc
cgcaaaaaaa tgaaagaagc actggaaaac 2220caggcggaag ctaccaaggc aatcattaac
taccagtaca accagtacac cgaggaagaa 2280aaaaacaaca tcaacttcaa catcgacgat
ctgtcctcta aactgaacga atccatcaac 2340aaagctatga tcaacatcaa caagttcctg
aaccagtgct ctgtaagcta tctgatgaac 2400tccatgatcc cgtacggtgt taaacgtctg
gaggacttcg atgcgtctct gaaagacgcc 2460ctgctgaaat acatttacga caaccgtggc
actctgatcg gtcaggttga tcgtctgaag 2520gacaaagtga acaatacctt atcgaccgac
atcccttttc agctcagtaa atatgtcgat 2580aaccaacgcc ttttgtccac tctagaataa
tgaaagctt 261922643DNAArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polynucleotide" 2ggatccatgc cggttaccat caacaacttc aactacaacg acccgatcga
caacaacaac 60atcattatga tggaaccgcc gttcgcacgt ggtaccggac gttactacaa
ggcttttaag 120atcaccgacc gtatctggat catcccggaa cgttacacct tcggttacaa
acctgaggac 180ttcaacaaga gtagcgggat tttcaatcgt gacgtctgcg agtactatga
tccagattat 240ctgaatacca acgataagaa gaacatattc cttcagacta tgattaaact
cttcaaccgt 300atcaaaagca aaccgctcgg tgaaaaactc ctcgaaatga ttatcaacgg
tatcccgtac 360ctcggtgacc gtcgtgtccc gcttgaagag ttcaacacca acatcgcaag
cgtcaccgtc 420aacaaactca tcagcaaccc aggtgaagtc gaacgtaaaa aaggtatctt
cgcaaacctc 480atcatcttcg gtccgggtcc ggtcctcaac gaaaacgaaa ccatcgacat
cggtatccag 540aaccacttcg caagccgtga aggtttcggt ggtatcatgc agatgaaatt
ctgcccggaa 600tacgtcagtg tcttcaacaa cgtccaggaa aacaaaggtg caagcatctt
caaccgtcgt 660ggttacttca gcgacccggc actcatcctc atgcatgaac tcatccacgt
cctccacggt 720ctctacggta tcaaagttga cgacctcccg atcgtcccga acgagaagaa
attcttcatg 780cagagcaccg acgcaatcca ggctgaggaa ctctacacct tcggtggcca
agacccaagt 840atcataaccc cgtccaccga caaaagcatc tacgacaaag tcctccagaa
cttcaggggt 900atcgtggaca gactcaacaa agtcctcgtc tgcatcagcg acccgaacat
caatatcaac 960atatacaaga acaagttcaa agacaagtac aaattcgtcg aggacagcga
aggcaaatac 1020agcatcgacg tagaaagttt cgacaagctc tacaaaagcc tcatgttcgg
tttcaccgaa 1080accaacatcg ccgagaacta caagatcaag acaagggcaa gttacttcag
cgacagcctc 1140ccgcctgtca aaatcaagaa cctcttagac aacgagattt acacaattga
agagggcttc 1200aacatcagtg acaaagacat ggagaaggaa tacagaggtc agaacaaggc
tatcaacaaa 1260caggcatacg aggagatcag caaagaacac ctcgcagtct acaagatcca
gatgtgcgtc 1320gacggcatca ttacctccaa aactaaatct gacgatgacg ataaaaacaa
agcgctgaac 1380ctgcagtgca tcgacgttga caacgaagac ctgttcttca tcgctgacaa
aaacagcttc 1440agtgacgacc tgagcaaaaa cgaacgtatc gaatacaaca cccagagcaa
ctacatcgaa 1500aacgacttcc cgatcaacga actgatcctg gacaccgacc tgataagtaa
aatcgaactg 1560ccgagcgaaa acaccgaaag tctgaccgac ttcaacgttg acgttccggt
ttacgaaaaa 1620cagccggcta tcaagaaaat cttcaccgac gaaaacacca tcttccagta
cctgtacagc 1680cagaccttcc cgctggacat ccgtgacatc agtctgacca gcagtttcga
cgacgctctg 1740ctgttcagca acaaagttta cagtttcttc agcatggact acatcaaaac
cgctaacaaa 1800gttgttgaag cagggctgtt cgctggttgg gttaaacaga tcgttaacga
cttcgttatc 1860gaagctaaca aaagcaacac tatggacaaa atcgctgaca tcagtctgat
cgttccgtac 1920atcggtctgg ctctgaacgt tggtaacgaa accgctaaag gtaactttga
aaacgctttc 1980gagatcgctg gtgcaagcat cctgctggag ttcatcccgg aactgctgat
cccggttgtt 2040ggtgctttcc tgctggaaag ttacatcgac aacaaaaaca agatcatcaa
aaccatcgac 2100aacgctctga ccaaacgtaa cgaaaaatgg agtgatatgt acggtctgat
cgttgctcag 2160tggctgagca ccgtcaacac ccagttctac accatcaaag aaggtatgta
caaagctctg 2220aactaccagg ctcaggctct ggaagagatc atcaaatacc gttacaacat
ctacagtgag 2280aaggaaaaga gtaacatcaa catcgacttc aacgacatca acagcaaact
gaacgaaggt 2340atcaaccagg ctatcgacaa catcaacaac ttcatcaacg gttgcagtgt
tagctacctg 2400atgaagaaga tgatcccgct ggctgttgaa aaactgctgg acttcgacaa
caccctgaaa 2460aagaacctgc tgaactacat cgacgaaaac aagctgtacc tgatcggtag
tgctgaatac 2520gaaaaaagta aagtgaacaa atacctgaag accatcatgc cgttcgacct
gagtatctac 2580accaacgaca ccatcctgat cgaaatgttc aacaaataca actctctaga
ataatgaaag 2640ctt
264332622DNAArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polynucleotide" 3ggatccatgc cgatcaccat
caacaacttc aactacagcg atccggtgga taacaaaaac 60atcctgtacc tggataccca
tctgaatacc ctggcgaacg aaccggaaaa agcgtttcgt 120atcaccggca acatttgggt
tattccggat cgttttagcc gtaacagcaa cccgaatctg 180aataaaccgc cgcgtgttac
cagcccgaaa agcggttatt acgatccgaa ctatctgagc 240accgatagcg ataaagatac
cttcctgaaa gaaatcatca aactgttcaa acgcatcaac 300agccgtgaaa ttggcgaaga
actgatctat cgcctgagca ccgatattcc gtttccgggc 360aacaacaaca ccccgatcaa
cacctttgat ttcgatgtgg atttcaacag cgttgatgtt 420aaaacccgcc agggtaacaa
ttgggtgaaa accggcagca ttaacccgag cgtgattatt 480accggtccgc gcgaaaacat
tattgatccg gaaaccagca cctttaaact gaccaacaac 540acctttgcgg cgcaggaagg
ttttggcgcg ctgagcatta ttagcattag cccgcgcttt 600atgctgacct atagcaacgc
gaccaacgat gttggtgaag gccgtttcag caaaagcgaa 660ttttgcatgg acccgatcct
gatcctgatg catgaactga accatgcgat gcataacctg 720tatggcatcg cgattccgaa
cgatcagacc attagcagcg tgaccagcaa catcttttac 780agccagtaca acgtgaaact
ggaatatgcg gaaatctatg cgtttggcgg tccgaccatt 840gatctgattc cgaaaagcgc
gcgcaaatac ttcgaagaaa aagcgctgga ttactatcgc 900agcattgcga aacgtctgaa
cagcattacc accgcgaatc cgagcagctt caacaaatat 960atcggcgaat ataaacagaa
actgatccgc aaatatcgct ttgtggtgga aagcagcggc 1020gaagttaccg ttaaccgcaa
taaattcgtg gaactgtaca acgaactgac ccagatcttc 1080accgaattta actatgcgaa
aatctataac gtgcagaacc gtaaaatcta cctgagcaac 1140gtgtataccc cggtgaccgc
gaatattctg gatgataacg tgtacgatat ccagaacggc 1200tttaacatcc cgaaaagcaa
cctgaacgtt ctgtttatgg gccagaacct gagccgtaat 1260ccggcgctgc gtaaagtgaa
cccggaaaac atgctgtacc tgttcaccaa attttgcgtc 1320gacgcgattg atggtcgtag
cctgtacaac aaaaccctgc agtgtcgtga actgctggtg 1380aaaaacaccg atctgccgtt
tattggcgat atcagcgatg tgaaaaccga tatcttcctg 1440cgcaaagata tcaacgaaga
aaccgaagtg atctactacc cggataacgt gagcgttgat 1500caggtgatcc tgagcaaaaa
caccagcgaa catggtcagc tggatctgct gtatccgagc 1560attgatagcg aaagcgaaat
tctgccgggc gaaaaccagg tgttttacga taaccgtacc 1620cagaacgtgg attacctgaa
cagctattac tacctggaaa gccagaaact gagcgataac 1680gtggaagatt ttacctttac
ccgcagcatt gaagaagcgc tggataacag cgcgaaagtt 1740tacacctatt ttccgaccct
ggcgaacaaa gttaatgcgg gtgttcaggg cggtctgttt 1800ctgatgtggg cgaacgatgt
ggtggaagat ttcaccacca acatcctgcg taaagatacc 1860ctggataaaa tcagcgatgt
tagcgcgatt attccgtata ttggtccggc gctgaacatt 1920agcaatagcg tgcgtcgtgg
caattttacc gaagcgtttg cggttaccgg tgtgaccatt 1980ctgctggaag cgtttccgga
atttaccatt ccggcgctgg gtgcgtttgt gatctatagc 2040aaagtgcagg aacgcaacga
aatcatcaaa accatcgata actgcctgga acagcgtatt 2100aaacgctgga aagatagcta
tgaatggatg atgggcacct ggctgagccg tattatcacc 2160cagttcaaca acatcagcta
ccagatgtac gatagcctga actatcaggc gggtgcgatt 2220aaagcgaaaa tcgatctgga
atacaaaaaa tacagcggca gcgataaaga aaacatcaaa 2280agccaggttg aaaacctgaa
aaacagcctg gatgtgaaaa ttagcgaagc gatgaataac 2340atcaacaaat tcatccgcga
atgcagcgtg acctacctgt tcaaaaacat gctgccgaaa 2400gtgatcgatg aactgaacga
atttgatcgc aacaccaaag cgaaactgat caacctgatc 2460gatagccaca acattattct
ggtgggcgaa gtggataaac tgaaagcgaa agttaacaac 2520agcttccaga acaccatccc
gtttaacatc ttcagctata ccaacaacag cctgctgaaa 2580gatatcatca acgaatactt
caatctagaa taatgaaagc tt 262242638DNAArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polynucleotide" 4ggatccatga cgtggccagt taaggatttc aactactcag atcctgtaaa
tgacaacgat 60attctgtacc ttcgcattcc acaaaataaa ctgatcacca caccagtcaa
agcattcatg 120attactcaaa acatttgggt cattccagaa cgcttttcta gtgacacaaa
tccgagttta 180tctaaacctc cgcgtccgac gtccaaatat cagagctatt acgatccctc
atatctcagt 240acggacgaac aaaaagatac tttccttaaa ggtatcatta aactgtttaa
gcgtattaat 300gagcgcgata tcgggaaaaa gttgattaat tatcttgttg tgggttcccc
gttcatgggc 360gatagctcta cccccgaaga cacttttgat tttacccgtc atacgacaaa
catcgcggta 420gagaagtttg agaacggatc gtggaaagtc acaaacatca ttacacctag
cgtcttaatt 480tttggtccgc tgccaaacat cttagattat acagccagcc tgactttgca
ggggcaacag 540tcgaatccga gtttcgaagg ttttggtacc ctgagcattc tgaaagttgc
cccggaattt 600ctgctcactt tttcagatgt caccagcaac cagagctcag cagtattagg
aaagtcaatt 660ttttgcatgg acccggttat tgcactgatg cacgaactga cgcactctct
gcatcaactg 720tatgggatca acatccccag tgacaaacgt attcgtcccc aggtgtctga
aggatttttc 780tcacaggatg ggccgaacgt ccagttcgaa gagttgtata ctttcggagg
cctggacgta 840gagatcattc cccagattga gcgcagtcag ctgcgtgaga aggcattggg
ccattataag 900gatattgcaa aacgcctgaa taacattaac aaaacgattc catcttcgtg
gatctcgaat 960attgataaat ataagaaaat ttttagcgag aaatataatt ttgataaaga
taatacaggt 1020aactttgtgg ttaacattga caaattcaac tccctttaca gtgatttgac
gaatgtaatg 1080agcgaagttg tgtatagttc ccaatacaac gttaagaatc gtacccatta
cttctctcgt 1140cactacctgc cggttttcgc gaacatcctt gacgataata tttacactat
tcgtgacggc 1200tttaacttga ccaacaaggg cttcaatatt gaaaattcag gccagaacat
tgaacgcaac 1260ccggccttgc agaaactgtc gagtgaatcc gtggttgacc tgtttaccaa
agtctgcgtc 1320gacaaaagcg aagagaagct gtacgatgac gatgacaaag atcgttgggg
atcgtccctg 1380cagtgtatta aagtgaaaaa caatcggctg ccttatgtag cagataaaga
tagcattagt 1440caggagattt tcgaaaataa aattatcact gacgaaacca atgttcagaa
ttattcagat 1500aaattttcac tggacgaaag catcttagat ggccaagttc cgattaaccc
ggaaattgtt 1560gatccgttac tgccgaacgt gaatatggaa ccgttaaacc tccctggcga
agagatcgta 1620ttttatgatg acattacgaa atatgtggac taccttaatt cttattacta
tttggaaagc 1680cagaaactgt ccaataacgt ggaaaacatt actctgacca caagcgtgga
agaggcttta 1740ggctactcaa ataagattta taccttcctc ccgtcgctgg cggaaaaagt
aaataaaggt 1800gtgcaggctg gtctgttcct caactgggcg aatgaagttg tcgaagactt
taccacgaat 1860attatgaaaa aggataccct ggataaaatc tccgacgtct cggttattat
cccatatatt 1920ggccctgcgt taaatatcgg taatagtgcg ctgcggggga attttaacca
ggcctttgct 1980accgcgggcg tcgcgttcct cctggagggc tttcctgaat ttactatccc
ggcgctcggt 2040gtttttacat tttactcttc catccaggag cgtgagaaaa ttatcaaaac
catcgaaaac 2100tgcctggagc agcgggtgaa acgctggaaa gattcttatc aatggatggt
gtcaaactgg 2160ttatctcgca tcacgaccca attcaaccat attaattacc agatgtatga
tagtctgtcg 2220taccaagctg acgccattaa agccaaaatt gatctggaat ataaaaagta
ctctggtagc 2280gataaggaga acatcaaaag ccaggtggag aaccttaaga atagtctgga
tgtgaaaatc 2340tctgaagcta tgaataacat taacaaattc attcgtgaat gttcggtgac
gtacctgttc 2400aagaatatgc tgccaaaagt tattgatgaa ctgaataaat ttgatctgcg
taccaaaacc 2460gaacttatca acctcatcga ctcccacaac attatccttg tgggcgaagt
ggatcgtctg 2520aaggccaaag taaacgagag ctttgaaaat acgatgccgt ttaatatttt
ttcatatacc 2580aataactcct tgctgaaaga tatcatcaat gaatatttca atctagatta
ataagctt 26385195DNAArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polynucleotide" 5ggatccgtcg acctgcaggg
tctagaaggc ggtggcggta gcggcggtgg cggtagcggc 60ggtggcggta gcggcggtgg
cggtagcgca ctagtgcagg aaagacctcc attacaacaa 120cctccacatc gcgataagaa
accatgtaag aatttctttt ggaaaacatt tagcagttgc 180aaatgataaa agctt
19562779DNAArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polynucleotide" 6ggatccatga cgtggccagt taaggatttc aactactcag atcctgtaaa
tgacaacgat 60attctgtacc ttcgcattcc acaaaataaa ctgatcacca caccagtcaa
agcattcatg 120attactcaaa acatttgggt cattccagaa cgcttttcta gtgacacaaa
tccgagttta 180tctaaacctc cgcgtccgac gtccaaatat cagagctatt acgatccctc
atatctcagt 240acggacgaac aaaaagatac tttccttaaa ggtatcatta aactgtttaa
gcgtattaat 300gagcgcgata tcgggaaaaa gttgattaat tatcttgttg tgggttcccc
gttcatgggc 360gatagctcta cccccgaaga cacttttgat tttacccgtc atacgacaaa
catcgcggta 420gagaagtttg agaacggatc gtggaaagtc acaaacatca ttacacctag
cgtcttaatt 480tttggtccgc tgccaaacat cttagattat acagccagcc tgactttgca
ggggcaacag 540tcgaatccga gtttcgaagg ttttggtacc ctgagcattc tgaaagttgc
cccggaattt 600ctgctcactt tttcagatgt caccagcaac cagagctcag cagtattagg
aaagtcaatt 660ttttgcatgg acccggttat tgcactgatg cacgaactga cgcactctct
gcatcaactg 720tatgggatca acatccccag tgacaaacgt attcgtcccc aggtgtctga
aggatttttc 780tcacaggatg ggccgaacgt ccagttcgaa gagttgtata ctttcggagg
cctggacgta 840gagatcattc cccagattga gcgcagtcag ctgcgtgaga aggcattggg
ccattataag 900gatattgcaa aacgcctgaa taacattaac aaaacgattc catcttcgtg
gatctcgaat 960attgataaat ataagaaaat ttttagcgag aaatataatt ttgataaaga
taatacaggt 1020aactttgtgg ttaacattga caaattcaac tccctttaca gtgatttgac
gaatgtaatg 1080agcgaagttg tgtatagttc ccaatacaac gttaagaatc gtacccatta
cttctctcgt 1140cactacctgc cggttttcgc gaacatcctt gacgataata tttacactat
tcgtgacggc 1200tttaacttga ccaacaaggg cttcaatatt gaaaattcag gccagaacat
tgaacgcaac 1260ccggccttgc agaaactgtc gagtgaatcc gtggttgacc tgtttaccaa
agtctgcgtc 1320gacaaaagcg aagagaagct gtacgatgac gatgacaaag atcgttgggg
atcgtccctg 1380cagtgtatta aagtgaaaaa caatcggctg ccttatgtag cagataaaga
tagcattagt 1440caggagattt tcgaaaataa aattatcact gacgaaacca atgttcagaa
ttattcagat 1500aaattttcac tggacgaaag catcttagat ggccaagttc cgattaaccc
ggaaattgtt 1560gatccgttac tgccgaacgt gaatatggaa ccgttaaacc tccctggcga
agagatcgta 1620ttttatgatg acattacgaa atatgtggac taccttaatt cttattacta
tttggaaagc 1680cagaaactgt ccaataacgt ggaaaacatt actctgacca caagcgtgga
agaggcttta 1740ggctactcaa ataagattta taccttcctc ccgtcgctgg cggaaaaagt
aaataaaggt 1800gtgcaggctg gtctgttcct caactgggcg aatgaagttg tcgaagactt
taccacgaat 1860attatgaaaa aggataccct ggataaaatc tccgacgtct cggttattat
cccatatatt 1920ggccctgcgt taaatatcgg taatagtgcg ctgcggggga attttaacca
ggcctttgct 1980accgcgggcg tcgcgttcct cctggagggc tttcctgaat ttactatccc
ggcgctcggt 2040gtttttacat tttactcttc catccaggag cgtgagaaaa ttatcaaaac
catcgaaaac 2100tgcctggagc agcgggtgaa acgctggaaa gattcttatc aatggatggt
gtcaaactgg 2160ttatctcgca tcacgaccca attcaaccat attaattacc agatgtatga
tagtctgtcg 2220taccaagctg acgccattaa agccaaaatt gatctggaat ataaaaagta
ctctggtagc 2280gataaggaga acatcaaaag ccaggtggag aaccttaaga atagtctgga
tgtgaaaatc 2340tctgaagcta tgaataacat taacaaattc attcgtgaat gttcggtgac
gtacctgttc 2400aagaatatgc tgccaaaagt tattgatgaa ctgaataaat ttgatctgcg
taccaaaacc 2460gaacttatca acctcatcga ctcccacaac attatccttg tgggcgaagt
ggatcgtctg 2520aaggccaaag taaacgagag ctttgaaaat acgatgccgt ttaatatttt
ttcatatacc 2580aataactcct tgctgaaaga tatcatcaat gaatatttca atctagaagg
cggtggcggt 2640agcggcggtg gcggtagcgg cggtggcggt agcgcactag tgcaggaaag
acctccatta 2700caacaacctc cacatcgcga taagaaacca tgtaagaatt tcttttggaa
aacatttagc 2760agttgcaaat aataagctt
27797920PRTArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polypeptide" 7Thr Trp Pro Val Lys Asp
Phe Asn Tyr Ser Asp Pro Val Asn Asp Asn1 5
10 15Asp Ile Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu
Ile Thr Thr Pro 20 25 30Val
Lys Ala Phe Met Ile Thr Gln Asn Ile Trp Val Ile Pro Glu Arg 35
40 45Phe Ser Ser Asp Thr Asn Pro Ser Leu
Ser Lys Pro Pro Arg Pro Thr 50 55
60Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp Glu65
70 75 80Gln Lys Asp Thr Phe
Leu Lys Gly Ile Ile Lys Leu Phe Lys Arg Ile 85
90 95Asn Glu Arg Asp Ile Gly Lys Lys Leu Ile Asn
Tyr Leu Val Val Gly 100 105
110Ser Pro Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp Phe
115 120 125Thr Arg His Thr Thr Asn Ile
Ala Val Glu Lys Phe Glu Asn Gly Ser 130 135
140Trp Lys Val Thr Asn Ile Ile Thr Pro Ser Val Leu Ile Phe Gly
Pro145 150 155 160Leu Pro
Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr Leu Gln Gly Gln
165 170 175Gln Ser Asn Pro Ser Phe Glu
Gly Phe Gly Thr Leu Ser Ile Leu Lys 180 185
190Val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser
Asn Gln 195 200 205Ser Ser Ala Val
Leu Gly Lys Ser Ile Phe Cys Met Asp Pro Val Ile 210
215 220Ala Leu Met His Glu Leu Thr His Ser Leu His Gln
Leu Tyr Gly Ile225 230 235
240Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro Gln Val Ser Glu Gly Phe
245 250 255Phe Ser Gln Asp Gly
Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr Phe 260
265 270Gly Gly Leu Asp Val Glu Ile Ile Pro Gln Ile Glu
Arg Ser Gln Leu 275 280 285Arg Glu
Lys Ala Leu Gly His Tyr Lys Asp Ile Ala Lys Arg Leu Asn 290
295 300Asn Ile Asn Lys Thr Ile Pro Ser Ser Trp Ile
Ser Asn Ile Asp Lys305 310 315
320Tyr Lys Lys Ile Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn Thr
325 330 335Gly Asn Phe Val
Val Asn Ile Asp Lys Phe Asn Ser Leu Tyr Ser Asp 340
345 350Leu Thr Asn Val Met Ser Glu Val Val Tyr Ser
Ser Gln Tyr Asn Val 355 360 365Lys
Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe Ala 370
375 380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile
Arg Asp Gly Phe Asn Leu385 390 395
400Thr Asn Lys Gly Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu
Arg 405 410 415Asn Pro Ala
Leu Gln Lys Leu Ser Ser Glu Ser Val Val Asp Leu Phe 420
425 430Thr Lys Val Cys Val Asp Lys Ser Glu Glu
Lys Leu Tyr Asp Asp Asp 435 440
445Asp Lys Asp Arg Trp Gly Ser Ser Leu Gln Cys Ile Lys Val Lys Asn 450
455 460Asn Arg Leu Pro Tyr Val Ala Asp
Lys Asp Ser Ile Ser Gln Glu Ile465 470
475 480Phe Glu Asn Lys Ile Ile Thr Asp Glu Thr Asn Val
Gln Asn Tyr Ser 485 490
495Asp Lys Phe Ser Leu Asp Glu Ser Ile Leu Asp Gly Gln Val Pro Ile
500 505 510Asn Pro Glu Ile Val Asp
Pro Leu Leu Pro Asn Val Asn Met Glu Pro 515 520
525Leu Asn Leu Pro Gly Glu Glu Ile Val Phe Tyr Asp Asp Ile
Thr Lys 530 535 540Tyr Val Asp Tyr Leu
Asn Ser Tyr Tyr Tyr Leu Glu Ser Gln Lys Leu545 550
555 560Ser Asn Asn Val Glu Asn Ile Thr Leu Thr
Thr Ser Val Glu Glu Ala 565 570
575Leu Gly Tyr Ser Asn Lys Ile Tyr Thr Phe Leu Pro Ser Leu Ala Glu
580 585 590Lys Val Asn Lys Gly
Val Gln Ala Gly Leu Phe Leu Asn Trp Ala Asn 595
600 605Glu Val Val Glu Asp Phe Thr Thr Asn Ile Met Lys
Lys Asp Thr Leu 610 615 620Asp Lys Ile
Ser Asp Val Ser Val Ile Ile Pro Tyr Ile Gly Pro Ala625
630 635 640Leu Asn Ile Gly Asn Ser Ala
Leu Arg Gly Asn Phe Asn Gln Ala Phe 645
650 655Ala Thr Ala Gly Val Ala Phe Leu Leu Glu Gly Phe
Pro Glu Phe Thr 660 665 670Ile
Pro Ala Leu Gly Val Phe Thr Phe Tyr Ser Ser Ile Gln Glu Arg 675
680 685Glu Lys Ile Ile Lys Thr Ile Glu Asn
Cys Leu Glu Gln Arg Val Lys 690 695
700Arg Trp Lys Asp Ser Tyr Gln Trp Met Val Ser Asn Trp Leu Ser Arg705
710 715 720Ile Thr Thr Gln
Phe Asn His Ile Asn Tyr Gln Met Tyr Asp Ser Leu 725
730 735Ser Tyr Gln Ala Asp Ala Ile Lys Ala Lys
Ile Asp Leu Glu Tyr Lys 740 745
750Lys Tyr Ser Gly Ser Asp Lys Glu Asn Ile Lys Ser Gln Val Glu Asn
755 760 765Leu Lys Asn Ser Leu Asp Val
Lys Ile Ser Glu Ala Met Asn Asn Ile 770 775
780Asn Lys Phe Ile Arg Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn
Met785 790 795 800Leu Pro
Lys Val Ile Asp Glu Leu Asn Lys Phe Asp Leu Arg Thr Lys
805 810 815Thr Glu Leu Ile Asn Leu Ile
Asp Ser His Asn Ile Ile Leu Val Gly 820 825
830Glu Val Asp Arg Leu Lys Ala Lys Val Asn Glu Ser Phe Glu
Asn Thr 835 840 845Met Pro Phe Asn
Ile Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp 850
855 860Ile Ile Asn Glu Tyr Phe Asn Leu Glu Gly Gly Gly
Gly Ser Gly Gly865 870 875
880Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Gln Glu Arg Pro Pro
885 890 895Leu Gln Gln Pro Pro
His Arg Asp Lys Lys Pro Cys Lys Asn Phe Phe 900
905 910Trp Lys Thr Phe Ser Ser Cys Lys 915
9208277DNAArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polynucleotide" 8ggatccgtcg acaacaacaa
taacaacaac aataacaaca acgacgatga cgataaaaat 60tcagatagcg aatgtccact
tagtcacgac gggtactgtt tgcatgatgg tgtgtgtatg 120tatatagaag cactagacaa
atacgcttgc aattgcgtag ttggctatat aggagagcga 180tgccaatata gagatctgaa
gtggtgggag ttaagggcag aagcggcagc caaagaagca 240gccgctaagg cgctgcagag
tctagaataa taagctt 27792802DNAArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polynucleotide" 9ggatccatgg agttcgttaa caaacagttc aactataaag acccagttaa
cggtgttgac 60attgcttaca tcaaaatccc gaacgctggc cagatgcagc cggtaaaggc
attcaaaatc 120cacaacaaaa tctgggttat cccggaacgt gataccttta ctaacccgga
agaaggtgac 180ctgaacccgc caccggaagc gaaacaggtg ccggtatctt actatgactc
cacctacctg 240tctaccgata acgaaaagga caactacctg aaaggtgtta ctaaactgtt
cgagcgtatt 300tactccaccg acctgggccg tatgctgctg actagcatcg ttcgcggtat
cccgttctgg 360ggcggttcta ccatcgatac cgaactgaaa gtaatcgaca ctaactgcat
caacgttatt 420cagccggacg gttcctatcg ttccgaagaa ctgaacctgg tgatcatcgg
cccgtctgct 480gatatcatcc agttcgagtg tctgagcttt ggtcacgaag ttctgaacct
cacccgtaac 540ggctacggtt ccactcagta catccgtttc tctccggact tcaccttcgg
ttttgaagaa 600tccctggaag tagacacgaa cccactgctg ggcgctggta aattcgcaac
tgatcctgcg 660gttaccctgg ctcacgaact gattcatgca ggccaccgcc tgtacggtat
cgccatcaat 720ccgaaccgtg tcttcaaagt taacaccaac gcgtattacg agatgtccgg
tctggaagtt 780agcttcgaag aactgcgtac ttttggcggt cacgacgcta aattcatcga
ctctctgcaa 840gaaaacgagt tccgtctgta ctactataac aagttcaaag atatcgcatc
caccctgaac 900aaagcgaaat ccatcgtggg taccactgct tctctccagt acatgaagaa
cgtttttaaa 960gaaaaatacc tgctcagcga agacacctcc ggcaaattct ctgtagacaa
gttgaaattc 1020gataaacttt acaaaatgct gactgaaatt tacaccgaag acaacttcgt
taagttcttt 1080aaagttctga accgcaaaac ctatctgaac ttcgacaagg cagtattcaa
aatcaacatc 1140gtgccgaaag ttaactacac tatctacgat ggtttcaacc tgcgtaacac
caacctggct 1200gctaatttta acggccagaa cacggaaatc aacaacatga acttcacaaa
actgaaaaac 1260ttcactggtc tgttcgagtt ttacaagctg ctgtgcgtcg acaacaacaa
taacaacaac 1320aataacaaca acgacgatga cgataaaaat tcagatagcg aatgtccact
tagtcacgac 1380gggtactgtt tgcatgatgg tgtgtgtatg tatatagaag cactagacaa
atacgcttgc 1440aattgcgtag ttggctatat aggagagcga tgccaatata gagatctgaa
gtggtgggag 1500ttaagggcag aagcggcagc caaagaagca gccgctaagg cgctgcagtg
tatcaaggtt 1560aacaactggg atttattctt cagcccgagt gaagacaact tcaccaacga
cctgaacaaa 1620ggtgaagaaa tcacctcaga tactaacatc gaagcagccg aagaaaacat
ctcgctggac 1680ctgatccagc agtactacct gacctttaat ttcgacaacg agccggaaaa
catttctatc 1740gaaaacctga gctctgatat catcggccag ctggaactga tgccgaacat
cgaacgtttc 1800ccaaacggta aaaagtacga gctggacaaa tataccatgt tccactacct
gcgcgcgcag 1860gaatttgaac acggcaaatc ccgtatcgca ctgactaact ccgttaacga
agctctgctc 1920aacccgtccc gtgtatacac cttcttctct agcgactacg tgaaaaaggt
caacaaagcg 1980actgaagctg caatgttctt gggttgggtt gaacagcttg tttatgattt
taccgacgag 2040acgtccgaag tatctactac cgacaaaatt gcggatatca ctatcatcat
cccgtacatc 2100ggtccggctc tgaacattgg caacatgctg tacaaagacg acttcgttgg
cgcactgatc 2160ttctccggtg cggtgatcct gctggagttc atcccggaaa tcgccatccc
ggtactgggc 2220acctttgctc tggtttctta cattgcaaac aaggttctga ctgtacaaac
catcgacaac 2280gcgctgagca aacgtaacga aaaatgggat gaagtttaca aatatatcgt
gaccaactgg 2340ctggctaagg ttaatactca gatcgacctc atccgcaaaa aaatgaaaga
agcactggaa 2400aaccaggcgg aagctaccaa ggcaatcatt aactaccagt acaaccagta
caccgaggaa 2460gaaaaaaaca acatcaactt caacatcgac gatctgtcct ctaaactgaa
cgaatccatc 2520aacaaagcta tgatcaacat caacaagttc ctgaaccagt gctctgtaag
ctatctgatg 2580aactccatga tcccgtacgg tgttaaacgt ctggaggact tcgatgcgtc
tctgaaagac 2640gccctgctga aatacattta cgacaaccgt ggcactctga tcggtcaggt
tgatcgtctg 2700aaggacaaag tgaacaatac cttatcgacc gacatccctt ttcagctcag
taaatatgtc 2760gataaccaac gccttttgtc cactctagaa taatgaaagc tt
280210925PRTArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polypeptide" 10Glu Phe Val Asn Lys Gln
Phe Asn Tyr Lys Asp Pro Val Asn Gly Val1 5
10 15Asp Ile Ala Tyr Ile Lys Ile Pro Asn Ala Gly Gln
Met Gln Pro Val 20 25 30Lys
Ala Phe Lys Ile His Asn Lys Ile Trp Val Ile Pro Glu Arg Asp 35
40 45Thr Phe Thr Asn Pro Glu Glu Gly Asp
Leu Asn Pro Pro Pro Glu Ala 50 55
60Lys Gln Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr Asp65
70 75 80Asn Glu Lys Asp Asn
Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu Arg 85
90 95Ile Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu
Thr Ser Ile Val Arg 100 105
110Gly Ile Pro Phe Trp Gly Gly Ser Thr Ile Asp Thr Glu Leu Lys Val
115 120 125Ile Asp Thr Asn Cys Ile Asn
Val Ile Gln Pro Asp Gly Ser Tyr Arg 130 135
140Ser Glu Glu Leu Asn Leu Val Ile Ile Gly Pro Ser Ala Asp Ile
Ile145 150 155 160Gln Phe
Glu Cys Leu Ser Phe Gly His Glu Val Leu Asn Leu Thr Arg
165 170 175Asn Gly Tyr Gly Ser Thr Gln
Tyr Ile Arg Phe Ser Pro Asp Phe Thr 180 185
190Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu
Leu Gly 195 200 205Ala Gly Lys Phe
Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu Leu 210
215 220Ile His Ala Gly His Arg Leu Tyr Gly Ile Ala Ile
Asn Pro Asn Arg225 230 235
240Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu Glu
245 250 255Val Ser Phe Glu Glu
Leu Arg Thr Phe Gly Gly His Asp Ala Lys Phe 260
265 270Ile Asp Ser Leu Gln Glu Asn Glu Phe Arg Leu Tyr
Tyr Tyr Asn Lys 275 280 285Phe Lys
Asp Ile Ala Ser Thr Leu Asn Lys Ala Lys Ser Ile Val Gly 290
295 300Thr Thr Ala Ser Leu Gln Tyr Met Lys Asn Val
Phe Lys Glu Lys Tyr305 310 315
320Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu Lys
325 330 335Phe Asp Lys Leu
Tyr Lys Met Leu Thr Glu Ile Tyr Thr Glu Asp Asn 340
345 350Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys
Thr Tyr Leu Asn Phe 355 360 365Asp
Lys Ala Val Phe Lys Ile Asn Ile Val Pro Lys Val Asn Tyr Thr 370
375 380Ile Tyr Asp Gly Phe Asn Leu Arg Asn Thr
Asn Leu Ala Ala Asn Phe385 390 395
400Asn Gly Gln Asn Thr Glu Ile Asn Asn Met Asn Phe Thr Lys Leu
Lys 405 410 415Asn Phe Thr
Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp Asn 420
425 430Asn Asn Asn Asn Asn Asn Asn Asn Asn Asp
Asp Asp Asp Lys Asn Ser 435 440
445Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp Gly 450
455 460Val Cys Met Tyr Ile Glu Ala Leu
Asp Lys Tyr Ala Cys Asn Cys Val465 470
475 480Val Gly Tyr Ile Gly Glu Arg Cys Gln Tyr Arg Asp
Leu Lys Trp Trp 485 490
495Glu Leu Arg Ala Glu Ala Ala Ala Lys Glu Ala Ala Ala Lys Ala Leu
500 505 510Gln Cys Ile Lys Val Asn
Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu 515 520
525Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu Ile Thr
Ser Asp 530 535 540Thr Asn Ile Glu Ala
Ala Glu Glu Asn Ile Ser Leu Asp Leu Ile Gln545 550
555 560Gln Tyr Tyr Leu Thr Phe Asn Phe Asp Asn
Glu Pro Glu Asn Ile Ser 565 570
575Ile Glu Asn Leu Ser Ser Asp Ile Ile Gly Gln Leu Glu Leu Met Pro
580 585 590Asn Ile Glu Arg Phe
Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr 595
600 605Thr Met Phe His Tyr Leu Arg Ala Gln Glu Phe Glu
His Gly Lys Ser 610 615 620Arg Ile Ala
Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser625
630 635 640Arg Val Tyr Thr Phe Phe Ser
Ser Asp Tyr Val Lys Lys Val Asn Lys 645
650 655Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu
Gln Leu Val Tyr 660 665 670Asp
Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys Ile Ala 675
680 685Asp Ile Thr Ile Ile Ile Pro Tyr Ile
Gly Pro Ala Leu Asn Ile Gly 690 695
700Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu Ile Phe Ser Gly705
710 715 720Ala Val Ile Leu
Leu Glu Phe Ile Pro Glu Ile Ala Ile Pro Val Leu 725
730 735Gly Thr Phe Ala Leu Val Ser Tyr Ile Ala
Asn Lys Val Leu Thr Val 740 745
750Gln Thr Ile Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu
755 760 765Val Tyr Lys Tyr Ile Val Thr
Asn Trp Leu Ala Lys Val Asn Thr Gln 770 775
780Ile Asp Leu Ile Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gln
Ala785 790 795 800Glu Ala
Thr Lys Ala Ile Ile Asn Tyr Gln Tyr Asn Gln Tyr Thr Glu
805 810 815Glu Glu Lys Asn Asn Ile Asn
Phe Asn Ile Asp Asp Leu Ser Ser Lys 820 825
830Leu Asn Glu Ser Ile Asn Lys Ala Met Ile Asn Ile Asn Lys
Phe Leu 835 840 845Asn Gln Cys Ser
Val Ser Tyr Leu Met Asn Ser Met Ile Pro Tyr Gly 850
855 860Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys
Asp Ala Leu Leu865 870 875
880Lys Tyr Ile Tyr Asp Asn Arg Gly Thr Leu Ile Gly Gln Val Asp Arg
885 890 895Leu Lys Asp Lys Val
Asn Asn Thr Leu Ser Thr Asp Ile Pro Phe Gln 900
905 910Leu Ser Lys Tyr Val Asp Asn Gln Arg Leu Leu Ser
Thr 915 920 92511864PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 11Glu Phe Val Asn Lys Gln Phe Asn Tyr Lys Asp Pro Val Asn
Gly Val1 5 10 15Asp Ile
Ala Tyr Ile Lys Ile Pro Asn Ala Gly Gln Met Gln Pro Val 20
25 30Lys Ala Phe Lys Ile His Asn Lys Ile
Trp Val Ile Pro Glu Arg Asp 35 40
45Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu Ala 50
55 60Lys Gln Val Pro Val Ser Tyr Tyr Asp
Ser Thr Tyr Leu Ser Thr Asp65 70 75
80Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe
Glu Arg 85 90 95Ile Tyr
Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser Ile Val Arg 100
105 110Gly Ile Pro Phe Trp Gly Gly Ser Thr
Ile Asp Thr Glu Leu Lys Val 115 120
125Ile Asp Thr Asn Cys Ile Asn Val Ile Gln Pro Asp Gly Ser Tyr Arg
130 135 140Ser Glu Glu Leu Asn Leu Val
Ile Ile Gly Pro Ser Ala Asp Ile Ile145 150
155 160Gln Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu
Asn Leu Thr Arg 165 170
175Asn Gly Tyr Gly Ser Thr Gln Tyr Ile Arg Phe Ser Pro Asp Phe Thr
180 185 190Phe Gly Phe Glu Glu Ser
Leu Glu Val Asp Thr Asn Pro Leu Leu Gly 195 200
205Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His
Glu Leu 210 215 220Ile His Ala Gly His
Arg Leu Tyr Gly Ile Ala Ile Asn Pro Asn Arg225 230
235 240Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr
Glu Met Ser Gly Leu Glu 245 250
255Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys Phe
260 265 270Ile Asp Ser Leu Gln
Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys 275
280 285Phe Lys Asp Ile Ala Ser Thr Leu Asn Lys Ala Lys
Ser Ile Val Gly 290 295 300Thr Thr Ala
Ser Leu Gln Tyr Met Lys Asn Val Phe Lys Glu Lys Tyr305
310 315 320Leu Leu Ser Glu Asp Thr Ser
Gly Lys Phe Ser Val Asp Lys Leu Lys 325
330 335Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu Ile Tyr
Thr Glu Asp Asn 340 345 350Phe
Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn Phe 355
360 365Asp Lys Ala Val Phe Lys Ile Asn Ile
Val Pro Lys Val Asn Tyr Thr 370 375
380Ile Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn Phe385
390 395 400Asn Gly Gln Asn
Thr Glu Ile Asn Asn Met Asn Phe Thr Lys Leu Lys 405
410 415Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys
Leu Leu Cys Val Asp Gly 420 425
430Ile Ile Thr Ser Lys Thr Lys Ser Asp Asp Asp Asp Lys Asn Lys Ala
435 440 445Leu Asn Leu Gln Cys Ile Lys
Val Asn Asn Trp Asp Leu Phe Phe Ser 450 455
460Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu
Ile465 470 475 480Thr Ser
Asp Thr Asn Ile Glu Ala Ala Glu Glu Asn Ile Ser Leu Asp
485 490 495Leu Ile Gln Gln Tyr Tyr Leu
Thr Phe Asn Phe Asp Asn Glu Pro Glu 500 505
510Asn Ile Ser Ile Glu Asn Leu Ser Ser Asp Ile Ile Gly Gln
Leu Glu 515 520 525Leu Met Pro Asn
Ile Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu 530
535 540Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gln
Glu Phe Glu His545 550 555
560Gly Lys Ser Arg Ile Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu
565 570 575Asn Pro Ser Arg Val
Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys 580
585 590Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly
Trp Val Glu Gln 595 600 605Leu Val
Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp 610
615 620Lys Ile Ala Asp Ile Thr Ile Ile Ile Pro Tyr
Ile Gly Pro Ala Leu625 630 635
640Asn Ile Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu Ile
645 650 655Phe Ser Gly Ala
Val Ile Leu Leu Glu Phe Ile Pro Glu Ile Ala Ile 660
665 670Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr
Ile Ala Asn Lys Val 675 680 685Leu
Thr Val Gln Thr Ile Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys 690
695 700Trp Asp Glu Val Tyr Lys Tyr Ile Val Thr
Asn Trp Leu Ala Lys Val705 710 715
720Asn Thr Gln Ile Asp Leu Ile Arg Lys Lys Met Lys Glu Ala Leu
Glu 725 730 735Asn Gln Ala
Glu Ala Thr Lys Ala Ile Ile Asn Tyr Gln Tyr Asn Gln 740
745 750Tyr Thr Glu Glu Glu Lys Asn Asn Ile Asn
Phe Asn Ile Asp Asp Leu 755 760
765Ser Ser Lys Leu Asn Glu Ser Ile Asn Lys Ala Met Ile Asn Ile Asn 770
775 780Lys Phe Leu Asn Gln Cys Ser Val
Ser Tyr Leu Met Asn Ser Met Ile785 790
795 800Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala
Ser Leu Lys Asp 805 810
815Ala Leu Leu Lys Tyr Ile Tyr Asp Asn Arg Gly Thr Leu Ile Gly Gln
820 825 830Val Asp Arg Leu Lys Asp
Lys Val Asn Asn Thr Leu Ser Thr Asp Ile 835 840
845Pro Phe Gln Leu Ser Lys Tyr Val Asp Asn Gln Arg Leu Leu
Ser Thr 850 855 86012869PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 12Pro Val Thr Ile Asn Asn Phe Asn Tyr Asn Asp Pro Ile Asp
Asn Asn1 5 10 15Asn Ile
Ile Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg Tyr 20
25 30Tyr Lys Ala Phe Lys Ile Thr Asp Arg
Ile Trp Ile Ile Pro Glu Arg 35 40
45Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly Ile 50
55 60Phe Asn Arg Asp Val Cys Glu Tyr Tyr
Asp Pro Asp Tyr Leu Asn Thr65 70 75
80Asn Asp Lys Lys Asn Ile Phe Leu Gln Thr Met Ile Lys Leu
Phe Asn 85 90 95Arg Ile
Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met Ile Ile 100
105 110Asn Gly Ile Pro Tyr Leu Gly Asp Arg
Arg Val Pro Leu Glu Glu Phe 115 120
125Asn Thr Asn Ile Ala Ser Val Thr Val Asn Lys Leu Ile Ser Asn Pro
130 135 140Gly Glu Val Glu Arg Lys Lys
Gly Ile Phe Ala Asn Leu Ile Ile Phe145 150
155 160Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr Ile
Asp Ile Gly Ile 165 170
175Gln Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly Ile Met Gln Met
180 185 190Lys Phe Cys Pro Glu Tyr
Val Ser Val Phe Asn Asn Val Gln Glu Asn 195 200
205Lys Gly Ala Ser Ile Phe Asn Arg Arg Gly Tyr Phe Ser Asp
Pro Ala 210 215 220Leu Ile Leu Met His
Glu Leu Ile His Val Leu His Gly Leu Tyr Gly225 230
235 240Ile Lys Val Asp Asp Leu Pro Ile Val Pro
Asn Glu Lys Lys Phe Phe 245 250
255Met Gln Ser Thr Asp Ala Ile Gln Ala Glu Glu Leu Tyr Thr Phe Gly
260 265 270Gly Gln Asp Pro Ser
Ile Ile Thr Pro Ser Thr Asp Lys Ser Ile Tyr 275
280 285Asp Lys Val Leu Gln Asn Phe Arg Gly Ile Val Asp
Arg Leu Asn Lys 290 295 300Val Leu Val
Cys Ile Ser Asp Pro Asn Ile Asn Ile Asn Ile Tyr Lys305
310 315 320Asn Lys Phe Lys Asp Lys Tyr
Lys Phe Val Glu Asp Ser Glu Gly Lys 325
330 335Tyr Ser Ile Asp Val Glu Ser Phe Asp Lys Leu Tyr
Lys Ser Leu Met 340 345 350Phe
Gly Phe Thr Glu Thr Asn Ile Ala Glu Asn Tyr Lys Ile Lys Thr 355
360 365Arg Ala Ser Tyr Phe Ser Asp Ser Leu
Pro Pro Val Lys Ile Lys Asn 370 375
380Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu Glu Gly Phe Asn Ile Ser385
390 395 400Asp Lys Asp Met
Glu Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile Asn 405
410 415Lys Gln Ala Tyr Glu Glu Ile Ser Lys Glu
His Leu Ala Val Tyr Lys 420 425
430Ile Gln Met Cys Val Asp Glu Glu Lys Leu Tyr Asp Asp Asp Asp Lys
435 440 445Asp Arg Trp Gly Ser Ser Leu
Gln Cys Ile Asp Val Asp Asn Glu Asp 450 455
460Leu Phe Phe Ile Ala Asp Lys Asn Ser Phe Ser Asp Asp Leu Ser
Lys465 470 475 480Asn Glu
Arg Ile Glu Tyr Asn Thr Gln Ser Asn Tyr Ile Glu Asn Asp
485 490 495Phe Pro Ile Asn Glu Leu Ile
Leu Asp Thr Asp Leu Ile Ser Lys Ile 500 505
510Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr Asp Phe Asn
Val Asp 515 520 525Val Pro Val Tyr
Glu Lys Gln Pro Ala Ile Lys Lys Ile Phe Thr Asp 530
535 540Glu Asn Thr Ile Phe Gln Tyr Leu Tyr Ser Gln Thr
Phe Pro Leu Asp545 550 555
560Ile Arg Asp Ile Ser Leu Thr Ser Ser Phe Asp Asp Ala Leu Leu Phe
565 570 575Ser Asn Lys Val Tyr
Ser Phe Phe Ser Met Asp Tyr Ile Lys Thr Ala 580
585 590Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly Trp
Val Lys Gln Ile 595 600 605Val Asn
Asp Phe Val Ile Glu Ala Asn Lys Ser Asn Thr Met Asp Ala 610
615 620Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Ile
Gly Leu Ala Leu Asn625 630 635
640Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu Asn Ala Phe Glu Ile
645 650 655Ala Gly Ala Ser
Ile Leu Leu Glu Phe Ile Pro Glu Leu Leu Ile Pro 660
665 670Val Val Gly Ala Phe Leu Leu Glu Ser Tyr Ile
Asp Asn Lys Asn Lys 675 680 685Ile
Ile Lys Thr Ile Asp Asn Ala Leu Thr Lys Arg Asn Glu Lys Trp 690
695 700Ser Asp Met Tyr Gly Leu Ile Val Ala Gln
Trp Leu Ser Thr Val Asn705 710 715
720Thr Gln Phe Tyr Thr Ile Lys Glu Gly Met Tyr Lys Ala Leu Asn
Tyr 725 730 735Gln Ala Gln
Ala Leu Glu Glu Ile Ile Lys Tyr Arg Tyr Asn Ile Tyr 740
745 750Ser Glu Lys Glu Lys Ser Asn Ile Asn Ile
Asp Phe Asn Asp Ile Asn 755 760
765Ser Lys Leu Asn Glu Gly Ile Asn Gln Ala Ile Asp Asn Ile Asn Asn 770
775 780Phe Ile Asn Gly Cys Ser Val Ser
Tyr Leu Met Lys Lys Met Ile Pro785 790
795 800Leu Ala Val Glu Lys Leu Leu Asp Phe Asp Asn Thr
Leu Lys Lys Asn 805 810
815Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tyr Leu Ile Gly Ser Ala
820 825 830Glu Tyr Glu Lys Ser Lys
Val Asn Lys Tyr Leu Lys Thr Ile Met Pro 835 840
845Phe Asp Leu Ser Ile Tyr Thr Asn Asp Thr Ile Leu Ile Glu
Met Phe 850 855 860Asn Lys Tyr Asn
Ser86513865PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 13Pro Ile Thr Ile Asn Asn Phe Asn
Tyr Ser Asp Pro Val Asp Asn Lys1 5 10
15Asn Ile Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn
Glu Pro 20 25 30Glu Lys Ala
Phe Arg Ile Thr Gly Asn Ile Trp Val Ile Pro Asp Arg 35
40 45Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys
Pro Pro Arg Val Thr 50 55 60Ser Pro
Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp Ser65
70 75 80Asp Lys Asp Thr Phe Leu Lys
Glu Ile Ile Lys Leu Phe Lys Arg Ile 85 90
95Asn Ser Arg Glu Ile Gly Glu Glu Leu Ile Tyr Arg Leu
Ser Thr Asp 100 105 110Ile Pro
Phe Pro Gly Asn Asn Asn Thr Pro Ile Asn Thr Phe Asp Phe 115
120 125Asp Val Asp Phe Asn Ser Val Asp Val Lys
Thr Arg Gln Gly Asn Asn 130 135 140Trp
Val Lys Thr Gly Ser Ile Asn Pro Ser Val Ile Ile Thr Gly Pro145
150 155 160Arg Glu Asn Ile Ile Asp
Pro Glu Thr Ser Thr Phe Lys Leu Thr Asn 165
170 175Asn Thr Phe Ala Ala Gln Glu Gly Phe Gly Ala Leu
Ser Ile Ile Ser 180 185 190Ile
Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp Val 195
200 205Gly Glu Gly Arg Phe Ser Lys Ser Glu
Phe Cys Met Asp Pro Ile Leu 210 215
220Ile Leu Met His Glu Leu Asn His Ala Met His Asn Leu Tyr Gly Ile225
230 235 240Ala Ile Pro Asn
Asp Gln Thr Ile Ser Ser Val Thr Ser Asn Ile Phe 245
250 255Tyr Ser Gln Tyr Asn Val Lys Leu Glu Tyr
Ala Glu Ile Tyr Ala Phe 260 265
270Gly Gly Pro Thr Ile Asp Leu Ile Pro Lys Ser Ala Arg Lys Tyr Phe
275 280 285Glu Glu Lys Ala Leu Asp Tyr
Tyr Arg Ser Ile Ala Lys Arg Leu Asn 290 295
300Ser Ile Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr Ile Gly
Glu305 310 315 320Tyr Lys
Gln Lys Leu Ile Arg Lys Tyr Arg Phe Val Val Glu Ser Ser
325 330 335Gly Glu Val Thr Val Asn Arg
Asn Lys Phe Val Glu Leu Tyr Asn Glu 340 345
350Leu Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala Lys Ile Tyr
Asn Val 355 360 365Gln Asn Arg Lys
Ile Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr Ala 370
375 380Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile Gln Asn
Gly Phe Asn Ile385 390 395
400Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser Arg
405 410 415Asn Pro Ala Leu Arg
Lys Val Asn Pro Glu Asn Met Leu Tyr Leu Phe 420
425 430Thr Lys Phe Cys Val Asp Ala Ile Asp Gly Arg Ser
Leu Tyr Asn Lys 435 440 445Thr Leu
Gln Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe 450
455 460Ile Gly Asp Ile Ser Asp Val Lys Thr Asp Ile
Phe Leu Arg Lys Asp465 470 475
480Ile Asn Glu Glu Thr Glu Val Ile Tyr Tyr Pro Asp Asn Val Ser Val
485 490 495Asp Gln Val Ile
Leu Ser Lys Asn Thr Ser Glu His Gly Gln Leu Asp 500
505 510Leu Leu Tyr Pro Ser Ile Asp Ser Glu Ser Glu
Ile Leu Pro Gly Glu 515 520 525Asn
Gln Val Phe Tyr Asp Asn Arg Thr Gln Asn Val Asp Tyr Leu Asn 530
535 540Ser Tyr Tyr Tyr Leu Glu Ser Gln Lys Leu
Ser Asp Asn Val Glu Asp545 550 555
560Phe Thr Phe Thr Arg Ser Ile Glu Glu Ala Leu Asp Asn Ser Ala
Lys 565 570 575Val Tyr Thr
Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val 580
585 590Gln Gly Gly Leu Phe Leu Met Trp Ala Asn
Asp Val Val Glu Asp Phe 595 600
605Thr Thr Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys Ile Ser Asp Val 610
615 620Ser Ala Ile Ile Pro Tyr Ile Gly
Pro Ala Leu Asn Ile Ser Asn Ser625 630
635 640Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val
Thr Gly Val Thr 645 650
655Ile Leu Leu Glu Ala Phe Pro Glu Phe Thr Ile Pro Ala Leu Gly Ala
660 665 670Phe Val Ile Tyr Ser Lys
Val Gln Glu Arg Asn Glu Ile Ile Lys Thr 675 680
685Ile Asp Asn Cys Leu Glu Gln Arg Ile Lys Arg Trp Lys Asp
Ser Tyr 690 695 700Glu Trp Met Met Gly
Thr Trp Leu Ser Arg Ile Ile Thr Gln Phe Asn705 710
715 720Asn Ile Ser Tyr Gln Met Tyr Asp Ser Leu
Asn Tyr Gln Ala Gly Ala 725 730
735Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp
740 745 750Lys Glu Asn Ile Lys
Ser Gln Val Glu Asn Leu Lys Asn Ser Leu Asp 755
760 765Val Lys Ile Ser Glu Ala Met Asn Asn Ile Asn Lys
Phe Ile Arg Glu 770 775 780Cys Ser Val
Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val Ile Asp785
790 795 800Glu Leu Asn Glu Phe Asp Arg
Asn Thr Lys Ala Lys Leu Ile Asn Leu 805
810 815Ile Asp Ser His Asn Ile Ile Leu Val Gly Glu Val
Asp Lys Leu Lys 820 825 830Ala
Lys Val Asn Asn Ser Phe Gln Asn Thr Ile Pro Phe Asn Ile Phe 835
840 845Ser Tyr Thr Asn Asn Ser Leu Leu Lys
Asp Ile Ile Asn Glu Tyr Phe 850 855
860Asn86514871PRTArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polypeptide" 14Thr Trp Pro Val Lys Asp
Phe Asn Tyr Ser Asp Pro Val Asn Asp Asn1 5
10 15Asp Ile Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu
Ile Thr Thr Pro 20 25 30Val
Lys Ala Phe Met Ile Thr Gln Asn Ile Trp Val Ile Pro Glu Arg 35
40 45Phe Ser Ser Asp Thr Asn Pro Ser Leu
Ser Lys Pro Pro Arg Pro Thr 50 55
60Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp Glu65
70 75 80Gln Lys Asp Thr Phe
Leu Lys Gly Ile Ile Lys Leu Phe Lys Arg Ile 85
90 95Asn Glu Arg Asp Ile Gly Lys Lys Leu Ile Asn
Tyr Leu Val Val Gly 100 105
110Ser Pro Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp Phe
115 120 125Thr Arg His Thr Thr Asn Ile
Ala Val Glu Lys Phe Glu Asn Gly Ser 130 135
140Trp Lys Val Thr Asn Ile Ile Thr Pro Ser Val Leu Ile Phe Gly
Pro145 150 155 160Leu Pro
Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr Leu Gln Gly Gln
165 170 175Gln Ser Asn Pro Ser Phe Glu
Gly Phe Gly Thr Leu Ser Ile Leu Lys 180 185
190Val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser
Asn Gln 195 200 205Ser Ser Ala Val
Leu Gly Lys Ser Ile Phe Cys Met Asp Pro Val Ile 210
215 220Ala Leu Met His Glu Leu Thr His Ser Leu His Gln
Leu Tyr Gly Ile225 230 235
240Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro Gln Val Ser Glu Gly Phe
245 250 255Phe Ser Gln Asp Gly
Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr Phe 260
265 270Gly Gly Leu Asp Val Glu Ile Ile Pro Gln Ile Glu
Arg Ser Gln Leu 275 280 285Arg Glu
Lys Ala Leu Gly His Tyr Lys Asp Ile Ala Lys Arg Leu Asn 290
295 300Asn Ile Asn Lys Thr Ile Pro Ser Ser Trp Ile
Ser Asn Ile Asp Lys305 310 315
320Tyr Lys Lys Ile Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn Thr
325 330 335Gly Asn Phe Val
Val Asn Ile Asp Lys Phe Asn Ser Leu Tyr Ser Asp 340
345 350Leu Thr Asn Val Met Ser Glu Val Val Tyr Ser
Ser Gln Tyr Asn Val 355 360 365Lys
Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe Ala 370
375 380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile
Arg Asp Gly Phe Asn Leu385 390 395
400Thr Asn Lys Gly Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu
Arg 405 410 415Asn Pro Ala
Leu Gln Lys Leu Ser Ser Glu Ser Val Val Asp Leu Phe 420
425 430Thr Lys Val Cys Val Asp Lys Ser Glu Glu
Lys Leu Tyr Asp Asp Asp 435 440
445Asp Lys Asp Arg Trp Gly Ser Ser Leu Gln Cys Ile Lys Val Lys Asn 450
455 460Asn Arg Leu Pro Tyr Val Ala Asp
Lys Asp Ser Ile Ser Gln Glu Ile465 470
475 480Phe Glu Asn Lys Ile Ile Thr Asp Glu Thr Asn Val
Gln Asn Tyr Ser 485 490
495Asp Lys Phe Ser Leu Asp Glu Ser Ile Leu Asp Gly Gln Val Pro Ile
500 505 510Asn Pro Glu Ile Val Asp
Pro Leu Leu Pro Asn Val Asn Met Glu Pro 515 520
525Leu Asn Leu Pro Gly Glu Glu Ile Val Phe Tyr Asp Asp Ile
Thr Lys 530 535 540Tyr Val Asp Tyr Leu
Asn Ser Tyr Tyr Tyr Leu Glu Ser Gln Lys Leu545 550
555 560Ser Asn Asn Val Glu Asn Ile Thr Leu Thr
Thr Ser Val Glu Glu Ala 565 570
575Leu Gly Tyr Ser Asn Lys Ile Tyr Thr Phe Leu Pro Ser Leu Ala Glu
580 585 590Lys Val Asn Lys Gly
Val Gln Ala Gly Leu Phe Leu Asn Trp Ala Asn 595
600 605Glu Val Val Glu Asp Phe Thr Thr Asn Ile Met Lys
Lys Asp Thr Leu 610 615 620Asp Lys Ile
Ser Asp Val Ser Val Ile Ile Pro Tyr Ile Gly Pro Ala625
630 635 640Leu Asn Ile Gly Asn Ser Ala
Leu Arg Gly Asn Phe Asn Gln Ala Phe 645
650 655Ala Thr Ala Gly Val Ala Phe Leu Leu Glu Gly Phe
Pro Glu Phe Thr 660 665 670Ile
Pro Ala Leu Gly Val Phe Thr Phe Tyr Ser Ser Ile Gln Glu Arg 675
680 685Glu Lys Ile Ile Lys Thr Ile Glu Asn
Cys Leu Glu Gln Arg Val Lys 690 695
700Arg Trp Lys Asp Ser Tyr Gln Trp Met Val Ser Asn Trp Leu Ser Arg705
710 715 720Ile Thr Thr Gln
Phe Asn His Ile Asn Tyr Gln Met Tyr Asp Ser Leu 725
730 735Ser Tyr Gln Ala Asp Ala Ile Lys Ala Lys
Ile Asp Leu Glu Tyr Lys 740 745
750Lys Tyr Ser Gly Ser Asp Lys Glu Asn Ile Lys Ser Gln Val Glu Asn
755 760 765Leu Lys Asn Ser Leu Asp Val
Lys Ile Ser Glu Ala Met Asn Asn Ile 770 775
780Asn Lys Phe Ile Arg Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn
Met785 790 795 800Leu Pro
Lys Val Ile Asp Glu Leu Asn Lys Phe Asp Leu Arg Thr Lys
805 810 815Thr Glu Leu Ile Asn Leu Ile
Asp Ser His Asn Ile Ile Leu Val Gly 820 825
830Glu Val Asp Arg Leu Lys Ala Lys Val Asn Glu Ser Phe Glu
Asn Thr 835 840 845Met Pro Phe Asn
Ile Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp 850
855 860Ile Ile Asn Glu Tyr Phe Asn865
8701510PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic peptide" 15Glu His Trp Ser Tyr Gly Cys Arg Pro
Gly1 5 1016920PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 16Pro Val Thr Ile Asn Asn Phe Asn Tyr Asn Asp Pro Ile Asp
Asn Asn1 5 10 15Asn Ile
Ile Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg Tyr 20
25 30Tyr Lys Ala Phe Lys Ile Thr Asp Arg
Ile Trp Ile Ile Pro Glu Arg 35 40
45Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly Ile 50
55 60Phe Asn Arg Asp Val Cys Glu Tyr Tyr
Asp Pro Asp Tyr Leu Asn Thr65 70 75
80Asn Asp Lys Lys Asn Ile Phe Leu Gln Thr Met Ile Lys Leu
Phe Asn 85 90 95Arg Ile
Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met Ile Ile 100
105 110Asn Gly Ile Pro Tyr Leu Gly Asp Arg
Arg Val Pro Leu Glu Glu Phe 115 120
125Asn Thr Asn Ile Ala Ser Val Thr Val Asn Lys Leu Ile Ser Asn Pro
130 135 140Gly Glu Val Glu Arg Lys Lys
Gly Ile Phe Ala Asn Leu Ile Ile Phe145 150
155 160Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr Ile
Asp Ile Gly Ile 165 170
175Gln Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly Ile Met Gln Met
180 185 190Lys Phe Cys Pro Glu Tyr
Val Ser Val Phe Asn Asn Val Gln Glu Asn 195 200
205Lys Gly Ala Ser Ile Phe Asn Arg Arg Gly Tyr Phe Ser Asp
Pro Ala 210 215 220Leu Ile Leu Met His
Glu Leu Ile His Val Leu His Gly Leu Tyr Gly225 230
235 240Ile Lys Val Asp Asp Leu Pro Ile Val Pro
Asn Glu Lys Lys Phe Phe 245 250
255Met Gln Ser Thr Asp Ala Ile Gln Ala Glu Glu Leu Tyr Thr Phe Gly
260 265 270Gly Gln Asp Pro Ser
Ile Ile Thr Pro Ser Thr Asp Lys Ser Ile Tyr 275
280 285Asp Lys Val Leu Gln Asn Phe Arg Gly Ile Val Asp
Arg Leu Asn Lys 290 295 300Val Leu Val
Cys Ile Ser Asp Pro Asn Ile Asn Ile Asn Ile Tyr Lys305
310 315 320Asn Lys Phe Lys Asp Lys Tyr
Lys Phe Val Glu Asp Ser Glu Gly Lys 325
330 335Tyr Ser Ile Asp Val Glu Ser Phe Asp Lys Leu Tyr
Lys Ser Leu Met 340 345 350Phe
Gly Phe Thr Glu Thr Asn Ile Ala Glu Asn Tyr Lys Ile Lys Thr 355
360 365Arg Ala Ser Tyr Phe Ser Asp Ser Leu
Pro Pro Val Lys Ile Lys Asn 370 375
380Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu Glu Gly Phe Asn Ile Ser385
390 395 400Asp Lys Asp Met
Glu Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile Asn 405
410 415Lys Gln Ala Tyr Glu Glu Ile Ser Lys Glu
His Leu Ala Val Tyr Lys 420 425
430Ile Gln Met Cys Val Asp Gly Ile Ile Thr Ser Lys Thr Lys Ser Asp
435 440 445Asp Asp Asp Lys Asn Lys Ala
Leu Asn Leu Gln Cys Ile Asp Val Asp 450 455
460Asn Glu Asp Leu Phe Phe Ile Ala Asp Lys Asn Ser Phe Ser Asp
Asp465 470 475 480Leu Ser
Lys Asn Glu Arg Ile Glu Tyr Asn Thr Gln Ser Asn Tyr Ile
485 490 495Glu Asn Asp Phe Pro Ile Asn
Glu Leu Ile Leu Asp Thr Asp Leu Ile 500 505
510Ser Lys Ile Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr
Asp Phe 515 520 525Asn Val Asp Val
Pro Val Tyr Glu Lys Gln Pro Ala Ile Lys Lys Ile 530
535 540Phe Thr Asp Glu Asn Thr Ile Phe Gln Tyr Leu Tyr
Ser Gln Thr Phe545 550 555
560Pro Leu Asp Ile Arg Asp Ile Ser Leu Thr Ser Ser Phe Asp Asp Ala
565 570 575Leu Leu Phe Ser Asn
Lys Val Tyr Ser Phe Phe Ser Met Asp Tyr Ile 580
585 590Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe
Ala Gly Trp Val 595 600 605Lys Gln
Ile Val Asn Asp Phe Val Ile Glu Ala Asn Lys Ser Asn Thr 610
615 620Met Asp Lys Ile Ala Asp Ile Ser Leu Ile Val
Pro Tyr Ile Gly Leu625 630 635
640Ala Leu Asn Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu Asn Ala
645 650 655Phe Glu Ile Ala
Gly Ala Ser Ile Leu Leu Glu Phe Ile Pro Glu Leu 660
665 670Leu Ile Pro Val Val Gly Ala Phe Leu Leu Glu
Ser Tyr Ile Asp Asn 675 680 685Lys
Asn Lys Ile Ile Lys Thr Ile Asp Asn Ala Leu Thr Lys Arg Asn 690
695 700Glu Lys Trp Ser Asp Met Tyr Gly Leu Ile
Val Ala Gln Trp Leu Ser705 710 715
720Thr Val Asn Thr Gln Phe Tyr Thr Ile Lys Glu Gly Met Tyr Lys
Ala 725 730 735Leu Asn Tyr
Gln Ala Gln Ala Leu Glu Glu Ile Ile Lys Tyr Arg Tyr 740
745 750Asn Ile Tyr Ser Glu Lys Glu Lys Ser Asn
Ile Asn Ile Asp Phe Asn 755 760
765Asp Ile Asn Ser Lys Leu Asn Glu Gly Ile Asn Gln Ala Ile Asp Asn 770
775 780Ile Asn Asn Phe Ile Asn Gly Cys
Ser Val Ser Tyr Leu Met Lys Lys785 790
795 800Met Ile Pro Leu Ala Val Glu Lys Leu Leu Asp Phe
Asp Asn Thr Leu 805 810
815Lys Lys Asn Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tyr Leu Ile
820 825 830Gly Ser Ala Glu Tyr Glu
Lys Ser Lys Val Asn Lys Tyr Leu Lys Thr 835 840
845Ile Met Pro Phe Asp Leu Ser Ile Tyr Thr Asn Asp Thr Ile
Leu Ile 850 855 860Glu Met Phe Asn Lys
Tyr Asn Ser Leu Glu Gly Gly Gly Gly Ser Gly865 870
875 880Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala
Leu Asp Ser Ala Asn Ser 885 890
895Asn Pro Ala Met Ala Pro Arg Glu Arg Lys Ala Gly Cys Lys Asn Phe
900 905 910Phe Trp Lys Thr Phe
Thr Ser Cys 915 92017908PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 17Glu Phe Val Asn Lys Gln Phe Asn Tyr Lys Asp Pro Val Asn
Gly Val1 5 10 15Asp Ile
Ala Tyr Ile Lys Ile Pro Asn Ala Gly Gln Met Gln Pro Val 20
25 30Lys Ala Phe Lys Ile His Asn Lys Ile
Trp Val Ile Pro Glu Arg Asp 35 40
45Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu Ala 50
55 60Lys Gln Val Pro Val Ser Tyr Tyr Asp
Ser Thr Tyr Leu Ser Thr Asp65 70 75
80Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe
Glu Arg 85 90 95Ile Tyr
Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser Ile Val Arg 100
105 110Gly Ile Pro Phe Trp Gly Gly Ser Thr
Ile Asp Thr Glu Leu Lys Val 115 120
125Ile Asp Thr Asn Cys Ile Asn Val Ile Gln Pro Asp Gly Ser Tyr Arg
130 135 140Ser Glu Glu Leu Asn Leu Val
Ile Ile Gly Pro Ser Ala Asp Ile Ile145 150
155 160Gln Phe Glu Cys Leu Ser Phe Gly His Glu Val Leu
Asn Leu Thr Arg 165 170
175Asn Gly Tyr Gly Ser Thr Gln Tyr Ile Arg Phe Ser Pro Asp Phe Thr
180 185 190Phe Gly Phe Glu Glu Ser
Leu Glu Val Asp Thr Asn Pro Leu Leu Gly 195 200
205Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His
Glu Leu 210 215 220Ile His Ala Gly His
Arg Leu Tyr Gly Ile Ala Ile Asn Pro Asn Arg225 230
235 240Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr
Glu Met Ser Gly Leu Glu 245 250
255Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys Phe
260 265 270Ile Asp Ser Leu Gln
Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys 275
280 285Phe Lys Asp Ile Ala Ser Thr Leu Asn Lys Ala Lys
Ser Ile Val Gly 290 295 300Thr Thr Ala
Ser Leu Gln Tyr Met Lys Asn Val Phe Lys Glu Lys Tyr305
310 315 320Leu Leu Ser Glu Asp Thr Ser
Gly Lys Phe Ser Val Asp Lys Leu Lys 325
330 335Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu Ile Tyr
Thr Glu Asp Asn 340 345 350Phe
Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn Phe 355
360 365Asp Lys Ala Val Phe Lys Ile Asn Ile
Val Pro Lys Val Asn Tyr Thr 370 375
380Ile Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn Phe385
390 395 400Asn Gly Gln Asn
Thr Glu Ile Asn Asn Met Asn Phe Thr Lys Leu Lys 405
410 415Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys
Leu Leu Cys Val Asp Gly 420 425
430Ile Ile Thr Ser Lys Thr Lys Ser Asp Asp Asp Asp Lys Ser Ala Asn
435 440 445Ser Asn Pro Ala Met Ala Pro
Arg Glu Arg Lys Ala Gly Cys Lys Asn 450 455
460Phe Phe Trp Lys Thr Phe Thr Ser Cys Ala Leu Ala Gly Gly Gly
Gly465 470 475 480Ser Gly
Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Leu Gln
485 490 495Cys Ile Lys Val Asn Asn Trp
Asp Leu Phe Phe Ser Pro Ser Glu Asp 500 505
510Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu Ile Thr Ser
Asp Thr 515 520 525Asn Ile Glu Ala
Ala Glu Glu Asn Ile Ser Leu Asp Leu Ile Gln Gln 530
535 540Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu
Asn Ile Ser Ile545 550 555
560Glu Asn Leu Ser Ser Asp Ile Ile Gly Gln Leu Glu Leu Met Pro Asn
565 570 575Ile Glu Arg Phe Pro
Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr 580
585 590Met Phe His Tyr Leu Arg Ala Gln Glu Phe Glu His
Gly Lys Ser Arg 595 600 605Ile Ala
Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser Arg 610
615 620Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys
Lys Val Asn Lys Ala625 630 635
640Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gln Leu Val Tyr Asp
645 650 655Phe Thr Asp Glu
Thr Ser Glu Val Ser Thr Thr Asp Lys Ile Ala Asp 660
665 670Ile Thr Ile Ile Ile Pro Tyr Ile Gly Pro Ala
Leu Asn Ile Gly Asn 675 680 685Met
Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu Ile Phe Ser Gly Ala 690
695 700Val Ile Leu Leu Glu Phe Ile Pro Glu Ile
Ala Ile Pro Val Leu Gly705 710 715
720Thr Phe Ala Leu Val Ser Tyr Ile Ala Asn Lys Val Leu Thr Val
Gln 725 730 735Thr Ile Asp
Asn Ala Leu Ser Lys Arg Asn Glu Lys Trp Asp Glu Val 740
745 750Tyr Lys Tyr Ile Val Thr Asn Trp Leu Ala
Lys Val Asn Thr Gln Ile 755 760
765Asp Leu Ile Arg Lys Lys Met Lys Glu Ala Leu Glu Asn Gln Ala Glu 770
775 780Ala Thr Lys Ala Ile Ile Asn Tyr
Gln Tyr Asn Gln Tyr Thr Glu Glu785 790
795 800Glu Lys Asn Asn Ile Asn Phe Asn Ile Asp Asp Leu
Ser Ser Lys Leu 805 810
815Asn Glu Ser Ile Asn Lys Ala Met Ile Asn Ile Asn Lys Phe Leu Asn
820 825 830Gln Cys Ser Val Ser Tyr
Leu Met Asn Ser Met Ile Pro Tyr Gly Val 835 840
845Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu
Leu Lys 850 855 860Tyr Ile Tyr Asp Asn
Arg Gly Thr Leu Ile Gly Gln Val Asp Arg Leu865 870
875 880Lys Asp Lys Val Asn Asn Thr Leu Ser Thr
Asp Ile Pro Phe Gln Leu 885 890
895Ser Lys Tyr Val Asp Asn Gln Arg Leu Leu Ser Thr 900
90518944PRTArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polypeptide" 18Thr Trp Pro Val Lys Asp
Phe Asn Tyr Ser Asp Pro Val Asn Asp Asn1 5
10 15Asp Ile Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu
Ile Thr Thr Pro 20 25 30Val
Lys Ala Phe Met Ile Thr Gln Asn Ile Trp Val Ile Pro Glu Arg 35
40 45Phe Ser Ser Asp Thr Asn Pro Ser Leu
Ser Lys Pro Pro Arg Pro Thr 50 55
60Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp Glu65
70 75 80Gln Lys Asp Thr Phe
Leu Lys Gly Ile Ile Lys Leu Phe Lys Arg Ile 85
90 95Asn Glu Arg Asp Ile Gly Lys Lys Leu Ile Asn
Tyr Leu Val Val Gly 100 105
110Ser Pro Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp Phe
115 120 125Thr Arg His Thr Thr Asn Ile
Ala Val Glu Lys Phe Glu Asn Gly Ser 130 135
140Trp Lys Val Thr Asn Ile Ile Thr Pro Ser Val Leu Ile Phe Gly
Pro145 150 155 160Leu Pro
Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr Leu Gln Gly Gln
165 170 175Gln Ser Asn Pro Ser Phe Glu
Gly Phe Gly Thr Leu Ser Ile Leu Lys 180 185
190Val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser
Asn Gln 195 200 205Ser Ser Ala Val
Leu Gly Lys Ser Ile Phe Cys Met Asp Pro Val Ile 210
215 220Ala Leu Met His Glu Leu Thr His Ser Leu His Gln
Leu Tyr Gly Ile225 230 235
240Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro Gln Val Ser Glu Gly Phe
245 250 255Phe Ser Gln Asp Gly
Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr Phe 260
265 270Gly Gly Leu Asp Val Glu Ile Ile Pro Gln Ile Glu
Arg Ser Gln Leu 275 280 285Arg Glu
Lys Ala Leu Gly His Tyr Lys Asp Ile Ala Lys Arg Leu Asn 290
295 300Asn Ile Asn Lys Thr Ile Pro Ser Ser Trp Ile
Ser Asn Ile Asp Lys305 310 315
320Tyr Lys Lys Ile Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn Thr
325 330 335Gly Asn Phe Val
Val Asn Ile Asp Lys Phe Asn Ser Leu Tyr Ser Asp 340
345 350Leu Thr Asn Val Met Ser Glu Val Val Tyr Ser
Ser Gln Tyr Asn Val 355 360 365Lys
Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe Ala 370
375 380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile
Arg Asp Gly Phe Asn Leu385 390 395
400Thr Asn Lys Gly Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu
Arg 405 410 415Asn Pro Ala
Leu Gln Lys Leu Ser Ser Glu Ser Val Val Asp Leu Phe 420
425 430Thr Lys Val Cys Val Asp Lys Ser Glu Glu
Lys Leu Tyr Asp Asp Asp 435 440
445Asp Lys Asp Arg Trp Gly Ser Ser Leu Gln Cys Ile Lys Val Lys Asn 450
455 460Asn Arg Leu Pro Tyr Val Ala Asp
Lys Asp Ser Ile Ser Gln Glu Ile465 470
475 480Phe Glu Asn Lys Ile Ile Thr Asp Glu Thr Asn Val
Gln Asn Tyr Ser 485 490
495Asp Lys Phe Ser Leu Asp Glu Ser Ile Leu Asp Gly Gln Val Pro Ile
500 505 510Asn Pro Glu Ile Val Asp
Pro Leu Leu Pro Asn Val Asn Met Glu Pro 515 520
525Leu Asn Leu Pro Gly Glu Glu Ile Val Phe Tyr Asp Asp Ile
Thr Lys 530 535 540Tyr Val Asp Tyr Leu
Asn Ser Tyr Tyr Tyr Leu Glu Ser Gln Lys Leu545 550
555 560Ser Asn Asn Val Glu Asn Ile Thr Leu Thr
Thr Ser Val Glu Glu Ala 565 570
575Leu Gly Tyr Ser Asn Lys Ile Tyr Thr Phe Leu Pro Ser Leu Ala Glu
580 585 590Lys Val Asn Lys Gly
Val Gln Ala Gly Leu Phe Leu Asn Trp Ala Asn 595
600 605Glu Val Val Glu Asp Phe Thr Thr Asn Ile Met Lys
Lys Asp Thr Leu 610 615 620Asp Lys Ile
Ser Asp Val Ser Val Ile Ile Pro Tyr Ile Gly Pro Ala625
630 635 640Leu Asn Ile Gly Asn Ser Ala
Leu Arg Gly Asn Phe Asn Gln Ala Phe 645
650 655Ala Thr Ala Gly Val Ala Phe Leu Leu Glu Gly Phe
Pro Glu Phe Thr 660 665 670Ile
Pro Ala Leu Gly Val Phe Thr Phe Tyr Ser Ser Ile Gln Glu Arg 675
680 685Glu Lys Ile Ile Lys Thr Ile Glu Asn
Cys Leu Glu Gln Arg Val Lys 690 695
700Arg Trp Lys Asp Ser Tyr Gln Trp Met Val Ser Asn Trp Leu Ser Arg705
710 715 720Ile Thr Thr Gln
Phe Asn His Ile Asn Tyr Gln Met Tyr Asp Ser Leu 725
730 735Ser Tyr Gln Ala Asp Ala Ile Lys Ala Lys
Ile Asp Leu Glu Tyr Lys 740 745
750Lys Tyr Ser Gly Ser Asp Lys Glu Asn Ile Lys Ser Gln Val Glu Asn
755 760 765Leu Lys Asn Ser Leu Asp Val
Lys Ile Ser Glu Ala Met Asn Asn Ile 770 775
780Asn Lys Phe Ile Arg Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn
Met785 790 795 800Leu Pro
Lys Val Ile Asp Glu Leu Asn Lys Phe Asp Leu Arg Thr Lys
805 810 815Thr Glu Leu Ile Asn Leu Ile
Asp Ser His Asn Ile Ile Leu Val Gly 820 825
830Glu Val Asp Arg Leu Lys Ala Lys Val Asn Glu Ser Phe Glu
Asn Thr 835 840 845Met Pro Phe Asn
Ile Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp 850
855 860Ile Ile Asn Glu Tyr Phe Asn Leu Glu Gly Gly Gly
Gly Ser Gly Gly865 870 875
880Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Asn Ser Asp Ser Glu
885 890 895Cys Pro Leu Ser His
Asp Gly Tyr Cys Leu His Asp Gly Val Cys Met 900
905 910Tyr Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys
Val Val Gly Tyr 915 920 925Ile Gly
Glu Arg Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg 930
935 94019919PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 19Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn
Asp Asn1 5 10 15Asp Ile
Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu Ile Thr Thr Pro 20
25 30Val Lys Ala Phe Met Ile Thr Gln Asn
Ile Trp Val Ile Pro Glu Arg 35 40
45Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro Thr 50
55 60Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro
Ser Tyr Leu Ser Thr Asp Glu65 70 75
80Gln Lys Asp Thr Phe Leu Lys Gly Ile Ile Lys Leu Phe Lys
Arg Ile 85 90 95Asn Glu
Arg Asp Ile Gly Lys Lys Leu Ile Asn Tyr Leu Val Val Gly 100
105 110Ser Pro Phe Met Gly Asp Ser Ser Thr
Pro Glu Asp Thr Phe Asp Phe 115 120
125Thr Arg His Thr Thr Asn Ile Ala Val Glu Lys Phe Glu Asn Gly Ser
130 135 140Trp Lys Val Thr Asn Ile Ile
Thr Pro Ser Val Leu Ile Phe Gly Pro145 150
155 160Leu Pro Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr
Leu Gln Gly Gln 165 170
175Gln Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser Ile Leu Lys
180 185 190Val Ala Pro Glu Phe Leu
Leu Thr Phe Ser Asp Val Thr Ser Asn Gln 195 200
205Ser Ser Ala Val Leu Gly Lys Ser Ile Phe Cys Met Asp Pro
Val Ile 210 215 220Ala Leu Met His Glu
Leu Thr His Ser Leu His Gln Leu Tyr Gly Ile225 230
235 240Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro
Gln Val Ser Glu Gly Phe 245 250
255Phe Ser Gln Asp Gly Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr Phe
260 265 270Gly Gly Leu Asp Val
Glu Ile Ile Pro Gln Ile Glu Arg Ser Gln Leu 275
280 285Arg Glu Lys Ala Leu Gly His Tyr Lys Asp Ile Ala
Lys Arg Leu Asn 290 295 300Asn Ile Asn
Lys Thr Ile Pro Ser Ser Trp Ile Ser Asn Ile Asp Lys305
310 315 320Tyr Lys Lys Ile Phe Ser Glu
Lys Tyr Asn Phe Asp Lys Asp Asn Thr 325
330 335Gly Asn Phe Val Val Asn Ile Asp Lys Phe Asn Ser
Leu Tyr Ser Asp 340 345 350Leu
Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gln Tyr Asn Val 355
360 365Lys Asn Arg Thr His Tyr Phe Ser Arg
His Tyr Leu Pro Val Phe Ala 370 375
380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile Arg Asp Gly Phe Asn Leu385
390 395 400Thr Asn Lys Gly
Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu Arg 405
410 415Asn Pro Ala Leu Gln Lys Leu Ser Ser Glu
Ser Val Val Asp Leu Phe 420 425
430Thr Lys Val Cys Val Asp Lys Ser Glu Glu Lys Leu Tyr Asp Asp Asp
435 440 445Asp Lys Asp Arg Trp Gly Ser
Ser Leu Gln Cys Ile Lys Val Lys Asn 450 455
460Asn Arg Leu Pro Tyr Val Ala Asp Lys Asp Ser Ile Ser Gln Glu
Ile465 470 475 480Phe Glu
Asn Lys Ile Ile Thr Asp Glu Thr Asn Val Gln Asn Tyr Ser
485 490 495Asp Lys Phe Ser Leu Asp Glu
Ser Ile Leu Asp Gly Gln Val Pro Ile 500 505
510Asn Pro Glu Ile Val Asp Pro Leu Leu Pro Asn Val Asn Met
Glu Pro 515 520 525Leu Asn Leu Pro
Gly Glu Glu Ile Val Phe Tyr Asp Asp Ile Thr Lys 530
535 540Tyr Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu
Ser Gln Lys Leu545 550 555
560Ser Asn Asn Val Glu Asn Ile Thr Leu Thr Thr Ser Val Glu Glu Ala
565 570 575Leu Gly Tyr Ser Asn
Lys Ile Tyr Thr Phe Leu Pro Ser Leu Ala Glu 580
585 590Lys Val Asn Lys Gly Val Gln Ala Gly Leu Phe Leu
Asn Trp Ala Asn 595 600 605Glu Val
Val Glu Asp Phe Thr Thr Asn Ile Met Lys Lys Asp Thr Leu 610
615 620Asp Lys Ile Ser Asp Val Ser Val Ile Ile Pro
Tyr Ile Gly Pro Ala625 630 635
640Leu Asn Ile Gly Asn Ser Ala Leu Arg Gly Asn Phe Asn Gln Ala Phe
645 650 655Ala Thr Ala Gly
Val Ala Phe Leu Leu Glu Gly Phe Pro Glu Phe Thr 660
665 670Ile Pro Ala Leu Gly Val Phe Thr Phe Tyr Ser
Ser Ile Gln Glu Arg 675 680 685Glu
Lys Ile Ile Lys Thr Ile Glu Asn Cys Leu Glu Gln Arg Val Lys 690
695 700Arg Trp Lys Asp Ser Tyr Gln Trp Met Val
Ser Asn Trp Leu Ser Arg705 710 715
720Ile Thr Thr Gln Phe Asn His Ile Asn Tyr Gln Met Tyr Asp Ser
Leu 725 730 735Ser Tyr Gln
Ala Asp Ala Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys 740
745 750Lys Tyr Ser Gly Ser Asp Lys Glu Asn Ile
Lys Ser Gln Val Glu Asn 755 760
765Leu Lys Asn Ser Leu Asp Val Lys Ile Ser Glu Ala Met Asn Asn Ile 770
775 780Asn Lys Phe Ile Arg Glu Cys Ser
Val Thr Tyr Leu Phe Lys Asn Met785 790
795 800Leu Pro Lys Val Ile Asp Glu Leu Asn Lys Phe Asp
Leu Arg Thr Lys 805 810
815Thr Glu Leu Ile Asn Leu Ile Asp Ser His Asn Ile Ile Leu Val Gly
820 825 830Glu Val Asp Arg Leu Lys
Ala Lys Val Asn Glu Ser Phe Glu Asn Thr 835 840
845Met Pro Phe Asn Ile Phe Ser Tyr Thr Asn Asn Ser Leu Leu
Lys Asp 850 855 860Ile Ile Asn Glu Tyr
Phe Asn Leu Glu Gly Gly Gly Gly Ser Gly Gly865 870
875 880Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu
Val His Ser Asp Ala Val 885 890
895Phe Thr Asp Asn Tyr Thr Arg Leu Arg Lys Gln Met Ala Val Lys Lys
900 905 910Tyr Leu Asn Ser Ile
Leu Asn 91520955PRTArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polypeptide" 20Pro Ile Thr Ile Asn Asn
Phe Asn Tyr Ser Asp Pro Val Asp Asn Lys1 5
10 15Asn Ile Leu Tyr Leu Asp Thr His Leu Asn Thr Leu
Ala Asn Glu Pro 20 25 30Glu
Lys Ala Phe Arg Ile Thr Gly Asn Ile Trp Val Ile Pro Asp Arg 35
40 45Phe Ser Arg Asn Ser Asn Pro Asn Leu
Asn Lys Pro Pro Arg Val Thr 50 55
60Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp Ser65
70 75 80Asp Lys Asp Thr Phe
Leu Lys Glu Ile Ile Lys Leu Phe Lys Arg Ile 85
90 95Asn Ser Arg Glu Ile Gly Glu Glu Leu Ile Tyr
Arg Leu Ser Thr Asp 100 105
110Ile Pro Phe Pro Gly Asn Asn Asn Thr Pro Ile Asn Thr Phe Asp Phe
115 120 125Asp Val Asp Phe Asn Ser Val
Asp Val Lys Thr Arg Gln Gly Asn Asn 130 135
140Trp Val Lys Thr Gly Ser Ile Asn Pro Ser Val Ile Ile Thr Gly
Pro145 150 155 160Arg Glu
Asn Ile Ile Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr Asn
165 170 175Asn Thr Phe Ala Ala Gln Glu
Gly Phe Gly Ala Leu Ser Ile Ile Ser 180 185
190Ile Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn
Asp Val 195 200 205Gly Glu Gly Arg
Phe Ser Lys Ser Glu Phe Cys Met Asp Pro Ile Leu 210
215 220Ile Leu Met His Glu Leu Asn His Ala Met His Asn
Leu Tyr Gly Ile225 230 235
240Ala Ile Pro Asn Asp Gln Thr Ile Ser Ser Val Thr Ser Asn Ile Phe
245 250 255Tyr Ser Gln Tyr Asn
Val Lys Leu Glu Tyr Ala Glu Ile Tyr Ala Phe 260
265 270Gly Gly Pro Thr Ile Asp Leu Ile Pro Lys Ser Ala
Arg Lys Tyr Phe 275 280 285Glu Glu
Lys Ala Leu Asp Tyr Tyr Arg Ser Ile Ala Lys Arg Leu Asn 290
295 300Ser Ile Thr Thr Ala Asn Pro Ser Ser Phe Asn
Lys Tyr Ile Gly Glu305 310 315
320Tyr Lys Gln Lys Leu Ile Arg Lys Tyr Arg Phe Val Val Glu Ser Ser
325 330 335Gly Glu Val Thr
Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn Glu 340
345 350Leu Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala
Lys Ile Tyr Asn Val 355 360 365Gln
Asn Arg Lys Ile Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr Ala 370
375 380Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile
Gln Asn Gly Phe Asn Ile385 390 395
400Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser
Arg 405 410 415Asn Pro Ala
Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu Phe 420
425 430Thr Lys Phe Cys Val Asp Ala Ile Asp Gly
Arg Ser Leu Tyr Asn Lys 435 440
445Thr Leu Gln Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe 450
455 460Ile Gly Asp Ile Ser Asp Val Lys
Thr Asp Ile Phe Leu Arg Lys Asp465 470
475 480Ile Asn Glu Glu Thr Glu Val Ile Tyr Tyr Pro Asp
Asn Val Ser Val 485 490
495Asp Gln Val Ile Leu Ser Lys Asn Thr Ser Glu His Gly Gln Leu Asp
500 505 510Leu Leu Tyr Pro Ser Ile
Asp Ser Glu Ser Glu Ile Leu Pro Gly Glu 515 520
525Asn Gln Val Phe Tyr Asp Asn Arg Thr Gln Asn Val Asp Tyr
Leu Asn 530 535 540Ser Tyr Tyr Tyr Leu
Glu Ser Gln Lys Leu Ser Asp Asn Val Glu Asp545 550
555 560Phe Thr Phe Thr Arg Ser Ile Glu Glu Ala
Leu Asp Asn Ser Ala Lys 565 570
575Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val
580 585 590Gln Gly Gly Leu Phe
Leu Met Trp Ala Asn Asp Val Val Glu Asp Phe 595
600 605Thr Thr Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys
Ile Ser Asp Val 610 615 620Ser Ala Ile
Ile Pro Tyr Ile Gly Pro Ala Leu Asn Ile Ser Asn Ser625
630 635 640Val Arg Arg Gly Asn Phe Thr
Glu Ala Phe Ala Val Thr Gly Val Thr 645
650 655Ile Leu Leu Glu Ala Phe Pro Glu Phe Thr Ile Pro
Ala Leu Gly Ala 660 665 670Phe
Val Ile Tyr Ser Lys Val Gln Glu Arg Asn Glu Ile Ile Lys Thr 675
680 685Ile Asp Asn Cys Leu Glu Gln Arg Ile
Lys Arg Trp Lys Asp Ser Tyr 690 695
700Glu Trp Met Met Gly Thr Trp Leu Ser Arg Ile Ile Thr Gln Phe Asn705
710 715 720Asn Ile Ser Tyr
Gln Met Tyr Asp Ser Leu Asn Tyr Gln Ala Gly Ala 725
730 735Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys
Lys Tyr Ser Gly Ser Asp 740 745
750Lys Glu Asn Ile Lys Ser Gln Val Glu Asn Leu Lys Asn Ser Leu Asp
755 760 765Val Lys Ile Ser Glu Ala Met
Asn Asn Ile Asn Lys Phe Ile Arg Glu 770 775
780Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val Ile
Asp785 790 795 800Glu Leu
Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu Ile Asn Leu
805 810 815Ile Asp Ser His Asn Ile Ile
Leu Val Gly Glu Val Asp Lys Leu Lys 820 825
830Ala Lys Val Asn Asn Ser Phe Gln Asn Thr Ile Pro Phe Asn
Ile Phe 835 840 845Ser Tyr Thr Asn
Asn Ser Leu Leu Lys Asp Ile Ile Asn Glu Tyr Phe 850
855 860Asn Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly
Ser Gly Gly Gly865 870 875
880Gly Ser Ala Leu Val Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val
885 890 895Asp Ala Leu Gln Phe
Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys 900
905 910Pro Thr Gly Tyr Gly Ser Ser Ser Arg Arg Ala Pro
Gln Thr Gly Ile 915 920 925Val Asp
Glu Cys Cys Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met 930
935 940Tyr Cys Ala Pro Leu Lys Pro Ala Lys Ser
Ala945 950 95521961PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 21Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn
Asp Asn1 5 10 15Asp Ile
Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu Ile Thr Thr Pro 20
25 30Val Lys Ala Phe Met Ile Thr Gln Asn
Ile Trp Val Ile Pro Glu Arg 35 40
45Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro Thr 50
55 60Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro
Ser Tyr Leu Ser Thr Asp Glu65 70 75
80Gln Lys Asp Thr Phe Leu Lys Gly Ile Ile Lys Leu Phe Lys
Arg Ile 85 90 95Asn Glu
Arg Asp Ile Gly Lys Lys Leu Ile Asn Tyr Leu Val Val Gly 100
105 110Ser Pro Phe Met Gly Asp Ser Ser Thr
Pro Glu Asp Thr Phe Asp Phe 115 120
125Thr Arg His Thr Thr Asn Ile Ala Val Glu Lys Phe Glu Asn Gly Ser
130 135 140Trp Lys Val Thr Asn Ile Ile
Thr Pro Ser Val Leu Ile Phe Gly Pro145 150
155 160Leu Pro Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr
Leu Gln Gly Gln 165 170
175Gln Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser Ile Leu Lys
180 185 190Val Ala Pro Glu Phe Leu
Leu Thr Phe Ser Asp Val Thr Ser Asn Gln 195 200
205Ser Ser Ala Val Leu Gly Lys Ser Ile Phe Cys Met Asp Pro
Val Ile 210 215 220Ala Leu Met His Glu
Leu Thr His Ser Leu His Gln Leu Tyr Gly Ile225 230
235 240Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro
Gln Val Ser Glu Gly Phe 245 250
255Phe Ser Gln Asp Gly Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr Phe
260 265 270Gly Gly Leu Asp Val
Glu Ile Ile Pro Gln Ile Glu Arg Ser Gln Leu 275
280 285Arg Glu Lys Ala Leu Gly His Tyr Lys Asp Ile Ala
Lys Arg Leu Asn 290 295 300Asn Ile Asn
Lys Thr Ile Pro Ser Ser Trp Ile Ser Asn Ile Asp Lys305
310 315 320Tyr Lys Lys Ile Phe Ser Glu
Lys Tyr Asn Phe Asp Lys Asp Asn Thr 325
330 335Gly Asn Phe Val Val Asn Ile Asp Lys Phe Asn Ser
Leu Tyr Ser Asp 340 345 350Leu
Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gln Tyr Asn Val 355
360 365Lys Asn Arg Thr His Tyr Phe Ser Arg
His Tyr Leu Pro Val Phe Ala 370 375
380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile Arg Asp Gly Phe Asn Leu385
390 395 400Thr Asn Lys Gly
Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu Arg 405
410 415Asn Pro Ala Leu Gln Lys Leu Ser Ser Glu
Ser Val Val Asp Leu Phe 420 425
430Thr Lys Val Cys Val Asp Lys Ser Glu Glu Lys Leu Tyr Asp Asp Asp
435 440 445Asp Lys Asp Arg Trp Gly Ser
Ser Leu Gln Cys Ile Lys Val Lys Asn 450 455
460Asn Arg Leu Pro Tyr Val Ala Asp Lys Asp Ser Ile Ser Gln Glu
Ile465 470 475 480Phe Glu
Asn Lys Ile Ile Thr Asp Glu Thr Asn Val Gln Asn Tyr Ser
485 490 495Asp Lys Phe Ser Leu Asp Glu
Ser Ile Leu Asp Gly Gln Val Pro Ile 500 505
510Asn Pro Glu Ile Val Asp Pro Leu Leu Pro Asn Val Asn Met
Glu Pro 515 520 525Leu Asn Leu Pro
Gly Glu Glu Ile Val Phe Tyr Asp Asp Ile Thr Lys 530
535 540Tyr Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu
Ser Gln Lys Leu545 550 555
560Ser Asn Asn Val Glu Asn Ile Thr Leu Thr Thr Ser Val Glu Glu Ala
565 570 575Leu Gly Tyr Ser Asn
Lys Ile Tyr Thr Phe Leu Pro Ser Leu Ala Glu 580
585 590Lys Val Asn Lys Gly Val Gln Ala Gly Leu Phe Leu
Asn Trp Ala Asn 595 600 605Glu Val
Val Glu Asp Phe Thr Thr Asn Ile Met Lys Lys Asp Thr Leu 610
615 620Asp Lys Ile Ser Asp Val Ser Val Ile Ile Pro
Tyr Ile Gly Pro Ala625 630 635
640Leu Asn Ile Gly Asn Ser Ala Leu Arg Gly Asn Phe Asn Gln Ala Phe
645 650 655Ala Thr Ala Gly
Val Ala Phe Leu Leu Glu Gly Phe Pro Glu Phe Thr 660
665 670Ile Pro Ala Leu Gly Val Phe Thr Phe Tyr Ser
Ser Ile Gln Glu Arg 675 680 685Glu
Lys Ile Ile Lys Thr Ile Glu Asn Cys Leu Glu Gln Arg Val Lys 690
695 700Arg Trp Lys Asp Ser Tyr Gln Trp Met Val
Ser Asn Trp Leu Ser Arg705 710 715
720Ile Thr Thr Gln Phe Asn His Ile Asn Tyr Gln Met Tyr Asp Ser
Leu 725 730 735Ser Tyr Gln
Ala Asp Ala Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys 740
745 750Lys Tyr Ser Gly Ser Asp Lys Glu Asn Ile
Lys Ser Gln Val Glu Asn 755 760
765Leu Lys Asn Ser Leu Asp Val Lys Ile Ser Glu Ala Met Asn Asn Ile 770
775 780Asn Lys Phe Ile Arg Glu Cys Ser
Val Thr Tyr Leu Phe Lys Asn Met785 790
795 800Leu Pro Lys Val Ile Asp Glu Leu Asn Lys Phe Asp
Leu Arg Thr Lys 805 810
815Thr Glu Leu Ile Asn Leu Ile Asp Ser His Asn Ile Ile Leu Val Gly
820 825 830Glu Val Asp Arg Leu Lys
Ala Lys Val Asn Glu Ser Phe Glu Asn Thr 835 840
845Met Pro Phe Asn Ile Phe Ser Tyr Thr Asn Asn Ser Leu Leu
Lys Asp 850 855 860Ile Ile Asn Glu Tyr
Phe Asn Leu Glu Gly Gly Gly Gly Ser Gly Gly865 870
875 880Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu
Val Gly Pro Glu Thr Leu 885 890
895Cys Gly Ala Glu Leu Val Asp Ala Leu Gln Phe Val Cys Gly Asp Arg
900 905 910Gly Phe Tyr Phe Asn
Lys Pro Thr Gly Tyr Gly Ser Ser Ser Arg Arg 915
920 925Ala Pro Gln Thr Gly Ile Val Asp Glu Cys Cys Phe
Arg Ser Cys Asp 930 935 940Leu Arg Arg
Leu Glu Met Tyr Cys Ala Pro Leu Lys Pro Ala Lys Ser945
950 955 960Ala22913PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 22Pro Ile Thr Ile Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp
Asn Lys1 5 10 15Asn Ile
Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu Pro 20
25 30Glu Lys Ala Phe Arg Ile Thr Gly Asn
Ile Trp Val Ile Pro Asp Arg 35 40
45Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arg Val Thr 50
55 60Ser Pro Lys Ser Gly Tyr Tyr Asp Pro
Asn Tyr Leu Ser Thr Asp Ser65 70 75
80Asp Lys Asp Thr Phe Leu Lys Glu Ile Ile Lys Leu Phe Lys
Arg Ile 85 90 95Asn Ser
Arg Glu Ile Gly Glu Glu Leu Ile Tyr Arg Leu Ser Thr Asp 100
105 110Ile Pro Phe Pro Gly Asn Asn Asn Thr
Pro Ile Asn Thr Phe Asp Phe 115 120
125Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg Gln Gly Asn Asn
130 135 140Trp Val Lys Thr Gly Ser Ile
Asn Pro Ser Val Ile Ile Thr Gly Pro145 150
155 160Arg Glu Asn Ile Ile Asp Pro Glu Thr Ser Thr Phe
Lys Leu Thr Asn 165 170
175Asn Thr Phe Ala Ala Gln Glu Gly Phe Gly Ala Leu Ser Ile Ile Ser
180 185 190Ile Ser Pro Arg Phe Met
Leu Thr Tyr Ser Asn Ala Thr Asn Asp Val 195 200
205Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro
Ile Leu 210 215 220Ile Leu Met His Glu
Leu Asn His Ala Met His Asn Leu Tyr Gly Ile225 230
235 240Ala Ile Pro Asn Asp Gln Thr Ile Ser Ser
Val Thr Ser Asn Ile Phe 245 250
255Tyr Ser Gln Tyr Asn Val Lys Leu Glu Tyr Ala Glu Ile Tyr Ala Phe
260 265 270Gly Gly Pro Thr Ile
Asp Leu Ile Pro Lys Ser Ala Arg Lys Tyr Phe 275
280 285Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser Ile Ala
Lys Arg Leu Asn 290 295 300Ser Ile Thr
Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr Ile Gly Glu305
310 315 320Tyr Lys Gln Lys Leu Ile Arg
Lys Tyr Arg Phe Val Val Glu Ser Ser 325
330 335Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu
Leu Tyr Asn Glu 340 345 350Leu
Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala Lys Ile Tyr Asn Val 355
360 365Gln Asn Arg Lys Ile Tyr Leu Ser Asn
Val Tyr Thr Pro Val Thr Ala 370 375
380Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile Gln Asn Gly Phe Asn Ile385
390 395 400Pro Lys Ser Asn
Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser Arg 405
410 415Asn Pro Ala Leu Arg Lys Val Asn Pro Glu
Asn Met Leu Tyr Leu Phe 420 425
430Thr Lys Phe Cys Val Asp Ala Ile Asp Gly Arg Ser Leu Tyr Asn Lys
435 440 445Thr Leu Gln Cys Arg Glu Leu
Leu Val Lys Asn Thr Asp Leu Pro Phe 450 455
460Ile Gly Asp Ile Ser Asp Val Lys Thr Asp Ile Phe Leu Arg Lys
Asp465 470 475 480Ile Asn
Glu Glu Thr Glu Val Ile Tyr Tyr Pro Asp Asn Val Ser Val
485 490 495Asp Gln Val Ile Leu Ser Lys
Asn Thr Ser Glu His Gly Gln Leu Asp 500 505
510Leu Leu Tyr Pro Ser Ile Asp Ser Glu Ser Glu Ile Leu Pro
Gly Glu 515 520 525Asn Gln Val Phe
Tyr Asp Asn Arg Thr Gln Asn Val Asp Tyr Leu Asn 530
535 540Ser Tyr Tyr Tyr Leu Glu Ser Gln Lys Leu Ser Asp
Asn Val Glu Asp545 550 555
560Phe Thr Phe Thr Arg Ser Ile Glu Glu Ala Leu Asp Asn Ser Ala Lys
565 570 575Val Tyr Thr Tyr Phe
Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val 580
585 590Gln Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val
Val Glu Asp Phe 595 600 605Thr Thr
Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys Ile Ser Asp Val 610
615 620Ser Ala Ile Ile Pro Tyr Ile Gly Pro Ala Leu
Asn Ile Ser Asn Ser625 630 635
640Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val Thr
645 650 655Ile Leu Leu Glu
Ala Phe Pro Glu Phe Thr Ile Pro Ala Leu Gly Ala 660
665 670Phe Val Ile Tyr Ser Lys Val Gln Glu Arg Asn
Glu Ile Ile Lys Thr 675 680 685Ile
Asp Asn Cys Leu Glu Gln Arg Ile Lys Arg Trp Lys Asp Ser Tyr 690
695 700Glu Trp Met Met Gly Thr Trp Leu Ser Arg
Ile Ile Thr Gln Phe Asn705 710 715
720Asn Ile Ser Tyr Gln Met Tyr Asp Ser Leu Asn Tyr Gln Ala Gly
Ala 725 730 735Ile Lys Ala
Lys Ile Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp 740
745 750Lys Glu Asn Ile Lys Ser Gln Val Glu Asn
Leu Lys Asn Ser Leu Asp 755 760
765Val Lys Ile Ser Glu Ala Met Asn Asn Ile Asn Lys Phe Ile Arg Glu 770
775 780Cys Ser Val Thr Tyr Leu Phe Lys
Asn Met Leu Pro Lys Val Ile Asp785 790
795 800Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys
Leu Ile Asn Leu 805 810
815Ile Asp Ser His Asn Ile Ile Leu Val Gly Glu Val Asp Lys Leu Lys
820 825 830Ala Lys Val Asn Asn Ser
Phe Gln Asn Thr Ile Pro Phe Asn Ile Phe 835 840
845Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp Ile Ile Asn Glu
Tyr Phe 850 855 860Asn Leu Glu Gly Gly
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly865 870
875 880Gly Ser Ala Leu Val His Ser Asp Ala Val
Phe Thr Asp Asn Tyr Thr 885 890
895Arg Leu Arg Lys Gln Met Ala Val Lys Lys Tyr Leu Asn Ser Ile Leu
900 905 910Asn
23977PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 23Pro Ile Thr Ile Asn Asn Phe Asn
Tyr Ser Asp Pro Val Asp Asn Lys1 5 10
15Asn Ile Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn
Glu Pro 20 25 30Glu Lys Ala
Phe Arg Ile Thr Gly Asn Ile Trp Val Ile Pro Asp Arg 35
40 45Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys
Pro Pro Arg Val Thr 50 55 60Ser Pro
Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp Ser65
70 75 80Asp Lys Asp Thr Phe Leu Lys
Glu Ile Ile Lys Leu Phe Lys Arg Ile 85 90
95Asn Ser Arg Glu Ile Gly Glu Glu Leu Ile Tyr Arg Leu
Ser Thr Asp 100 105 110Ile Pro
Phe Pro Gly Asn Asn Asn Thr Pro Ile Asn Thr Phe Asp Phe 115
120 125Asp Val Asp Phe Asn Ser Val Asp Val Lys
Thr Arg Gln Gly Asn Asn 130 135 140Trp
Val Lys Thr Gly Ser Ile Asn Pro Ser Val Ile Ile Thr Gly Pro145
150 155 160Arg Glu Asn Ile Ile Asp
Pro Glu Thr Ser Thr Phe Lys Leu Thr Asn 165
170 175Asn Thr Phe Ala Ala Gln Glu Gly Phe Gly Ala Leu
Ser Ile Ile Ser 180 185 190Ile
Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp Val 195
200 205Gly Glu Gly Arg Phe Ser Lys Ser Glu
Phe Cys Met Asp Pro Ile Leu 210 215
220Ile Leu Met His Glu Leu Asn His Ala Met His Asn Leu Tyr Gly Ile225
230 235 240Ala Ile Pro Asn
Asp Gln Thr Ile Ser Ser Val Thr Ser Asn Ile Phe 245
250 255Tyr Ser Gln Tyr Asn Val Lys Leu Glu Tyr
Ala Glu Ile Tyr Ala Phe 260 265
270Gly Gly Pro Thr Ile Asp Leu Ile Pro Lys Ser Ala Arg Lys Tyr Phe
275 280 285Glu Glu Lys Ala Leu Asp Tyr
Tyr Arg Ser Ile Ala Lys Arg Leu Asn 290 295
300Ser Ile Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr Ile Gly
Glu305 310 315 320Tyr Lys
Gln Lys Leu Ile Arg Lys Tyr Arg Phe Val Val Glu Ser Ser
325 330 335Gly Glu Val Thr Val Asn Arg
Asn Lys Phe Val Glu Leu Tyr Asn Glu 340 345
350Leu Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala Lys Ile Tyr
Asn Val 355 360 365Gln Asn Arg Lys
Ile Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr Ala 370
375 380Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile Gln Asn
Gly Phe Asn Ile385 390 395
400Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser Arg
405 410 415Asn Pro Ala Leu Arg
Lys Val Asn Pro Glu Asn Met Leu Tyr Leu Phe 420
425 430Thr Lys Phe Cys Val Asp Ala Ile Asp Gly Arg Ser
Leu Tyr Asn Lys 435 440 445Thr Leu
Gln Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe 450
455 460Ile Gly Asp Ile Ser Asp Val Lys Thr Asp Ile
Phe Leu Arg Lys Asp465 470 475
480Ile Asn Glu Glu Thr Glu Val Ile Tyr Tyr Pro Asp Asn Val Ser Val
485 490 495Asp Gln Val Ile
Leu Ser Lys Asn Thr Ser Glu His Gly Gln Leu Asp 500
505 510Leu Leu Tyr Pro Ser Ile Asp Ser Glu Ser Glu
Ile Leu Pro Gly Glu 515 520 525Asn
Gln Val Phe Tyr Asp Asn Arg Thr Gln Asn Val Asp Tyr Leu Asn 530
535 540Ser Tyr Tyr Tyr Leu Glu Ser Gln Lys Leu
Ser Asp Asn Val Glu Asp545 550 555
560Phe Thr Phe Thr Arg Ser Ile Glu Glu Ala Leu Asp Asn Ser Ala
Lys 565 570 575Val Tyr Thr
Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val 580
585 590Gln Gly Gly Leu Phe Leu Met Trp Ala Asn
Asp Val Val Glu Asp Phe 595 600
605Thr Thr Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys Ile Ser Asp Val 610
615 620Ser Ala Ile Ile Pro Tyr Ile Gly
Pro Ala Leu Asn Ile Ser Asn Ser625 630
635 640Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val
Thr Gly Val Thr 645 650
655Ile Leu Leu Glu Ala Phe Pro Glu Phe Thr Ile Pro Ala Leu Gly Ala
660 665 670Phe Val Ile Tyr Ser Lys
Val Gln Glu Arg Asn Glu Ile Ile Lys Thr 675 680
685Ile Asp Asn Cys Leu Glu Gln Arg Ile Lys Arg Trp Lys Asp
Ser Tyr 690 695 700Glu Trp Met Met Gly
Thr Trp Leu Ser Arg Ile Ile Thr Gln Phe Asn705 710
715 720Asn Ile Ser Tyr Gln Met Tyr Asp Ser Leu
Asn Tyr Gln Ala Gly Ala 725 730
735Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp
740 745 750Lys Glu Asn Ile Lys
Ser Gln Val Glu Asn Leu Lys Asn Ser Leu Asp 755
760 765Val Lys Ile Ser Glu Ala Met Asn Asn Ile Asn Lys
Phe Ile Arg Glu 770 775 780Cys Ser Val
Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val Ile Asp785
790 795 800Glu Leu Asn Glu Phe Asp Arg
Asn Thr Lys Ala Lys Leu Ile Asn Leu 805
810 815Ile Asp Ser His Asn Ile Ile Leu Val Gly Glu Val
Asp Lys Leu Lys 820 825 830Ala
Lys Val Asn Asn Ser Phe Gln Asn Thr Ile Pro Phe Asn Ile Phe 835
840 845Ser Tyr Thr Asn Asn Ser Leu Leu Lys
Asp Ile Ile Asn Glu Tyr Phe 850 855
860Asn Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly865
870 875 880Gly Ser Ala Leu
Val Met Lys Pro Ile Gln Lys Leu Leu Ala Gly Leu 885
890 895Ile Leu Leu Thr Trp Cys Val Glu Gly Cys
Ser Ser Gln His Trp Ser 900 905
910Tyr Gly Leu Arg Pro Gly Gly Lys Arg Asp Ala Glu Asn Leu Ile Asp
915 920 925Ser Phe Gln Glu Ile Val Lys
Glu Val Gly Gln Leu Ala Glu Thr Gln 930 935
940Arg Phe Glu Cys Thr Thr His Gln Pro Arg Ser Pro Leu Arg Asp
Leu945 950 955 960Lys Gly
Ala Leu Glu Ser Leu Ile Glu Glu Glu Thr Gly Gln Lys Lys
965 970 975Ile24988PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 24Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn
Asp Asn1 5 10 15Asp Ile
Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu Ile Thr Thr Pro 20
25 30Val Lys Ala Phe Met Ile Thr Gln Asn
Ile Trp Val Ile Pro Glu Arg 35 40
45Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro Thr 50
55 60Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro
Ser Tyr Leu Ser Thr Asp Glu65 70 75
80Gln Lys Asp Thr Phe Leu Lys Gly Ile Ile Lys Leu Phe Lys
Arg Ile 85 90 95Asn Glu
Arg Asp Ile Gly Lys Lys Leu Ile Asn Tyr Leu Val Val Gly 100
105 110Ser Pro Phe Met Gly Asp Ser Ser Thr
Pro Glu Asp Thr Phe Asp Phe 115 120
125Thr Arg His Thr Thr Asn Ile Ala Val Glu Lys Phe Glu Asn Gly Ser
130 135 140Trp Lys Val Thr Asn Ile Ile
Thr Pro Ser Val Leu Ile Phe Gly Pro145 150
155 160Leu Pro Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr
Leu Gln Gly Gln 165 170
175Gln Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser Ile Leu Lys
180 185 190Val Ala Pro Glu Phe Leu
Leu Thr Phe Ser Asp Val Thr Ser Asn Gln 195 200
205Ser Ser Ala Val Leu Gly Lys Ser Ile Phe Cys Met Asp Pro
Val Ile 210 215 220Ala Leu Met His Glu
Leu Thr His Ser Leu His Gln Leu Tyr Gly Ile225 230
235 240Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro
Gln Val Ser Glu Gly Phe 245 250
255Phe Ser Gln Asp Gly Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr Phe
260 265 270Gly Gly Leu Asp Val
Glu Ile Ile Pro Gln Ile Glu Arg Ser Gln Leu 275
280 285Arg Glu Lys Ala Leu Gly His Tyr Lys Asp Ile Ala
Lys Arg Leu Asn 290 295 300Asn Ile Asn
Lys Thr Ile Pro Ser Ser Trp Ile Ser Asn Ile Asp Lys305
310 315 320Tyr Lys Lys Ile Phe Ser Glu
Lys Tyr Asn Phe Asp Lys Asp Asn Thr 325
330 335Gly Asn Phe Val Val Asn Ile Asp Lys Phe Asn Ser
Leu Tyr Ser Asp 340 345 350Leu
Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gln Tyr Asn Val 355
360 365Lys Asn Arg Thr His Tyr Phe Ser Arg
His Tyr Leu Pro Val Phe Ala 370 375
380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile Arg Asp Gly Phe Asn Leu385
390 395 400Thr Asn Lys Gly
Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu Arg 405
410 415Asn Pro Ala Leu Gln Lys Leu Ser Ser Glu
Ser Val Val Asp Leu Phe 420 425
430Thr Lys Val Cys Val Asp Lys Ser Glu Glu Lys Leu Tyr Asp Asp Asp
435 440 445Asp Lys Asp Arg Trp Gly Ser
Ser Leu Gln Cys Ile Lys Val Lys Asn 450 455
460Asn Arg Leu Pro Tyr Val Ala Asp Lys Asp Ser Ile Ser Gln Glu
Ile465 470 475 480Phe Glu
Asn Lys Ile Ile Thr Asp Glu Thr Asn Val Gln Asn Tyr Ser
485 490 495Asp Lys Phe Ser Leu Asp Glu
Ser Ile Leu Asp Gly Gln Val Pro Ile 500 505
510Asn Pro Glu Ile Val Asp Pro Leu Leu Pro Asn Val Asn Met
Glu Pro 515 520 525Leu Asn Leu Pro
Gly Glu Glu Ile Val Phe Tyr Asp Asp Ile Thr Lys 530
535 540Tyr Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu
Ser Gln Lys Leu545 550 555
560Ser Asn Asn Val Glu Asn Ile Thr Leu Thr Thr Ser Val Glu Glu Ala
565 570 575Leu Gly Tyr Ser Asn
Lys Ile Tyr Thr Phe Leu Pro Ser Leu Ala Glu 580
585 590Lys Val Asn Lys Gly Val Gln Ala Gly Leu Phe Leu
Asn Trp Ala Asn 595 600 605Glu Val
Val Glu Asp Phe Thr Thr Asn Ile Met Lys Lys Asp Thr Leu 610
615 620Asp Lys Ile Ser Asp Val Ser Val Ile Ile Pro
Tyr Ile Gly Pro Ala625 630 635
640Leu Asn Ile Gly Asn Ser Ala Leu Arg Gly Asn Phe Asn Gln Ala Phe
645 650 655Ala Thr Ala Gly
Val Ala Phe Leu Leu Glu Gly Phe Pro Glu Phe Thr 660
665 670Ile Pro Ala Leu Gly Val Phe Thr Phe Tyr Ser
Ser Ile Gln Glu Arg 675 680 685Glu
Lys Ile Ile Lys Thr Ile Glu Asn Cys Leu Glu Gln Arg Val Lys 690
695 700Arg Trp Lys Asp Ser Tyr Gln Trp Met Val
Ser Asn Trp Leu Ser Arg705 710 715
720Ile Thr Thr Gln Phe Asn His Ile Asn Tyr Gln Met Tyr Asp Ser
Leu 725 730 735Ser Tyr Gln
Ala Asp Ala Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys 740
745 750Lys Tyr Ser Gly Ser Asp Lys Glu Asn Ile
Lys Ser Gln Val Glu Asn 755 760
765Leu Lys Asn Ser Leu Asp Val Lys Ile Ser Glu Ala Met Asn Asn Ile 770
775 780Asn Lys Phe Ile Arg Glu Cys Ser
Val Thr Tyr Leu Phe Lys Asn Met785 790
795 800Leu Pro Lys Val Ile Asp Glu Leu Asn Lys Phe Asp
Leu Arg Thr Lys 805 810
815Thr Glu Leu Ile Asn Leu Ile Asp Ser His Asn Ile Ile Leu Val Gly
820 825 830Glu Val Asp Arg Leu Lys
Ala Lys Val Asn Glu Ser Phe Glu Asn Thr 835 840
845Met Pro Phe Asn Ile Phe Ser Tyr Thr Asn Asn Ser Leu Leu
Lys Asp 850 855 860Ile Ile Asn Glu Tyr
Phe Asn Leu Glu Gly Gly Gly Gly Ser Gly Gly865 870
875 880Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
Gly Gly Ser Ala Leu Val 885 890
895Met Lys Pro Ile Gln Lys Leu Leu Ala Gly Leu Ile Leu Leu Thr Trp
900 905 910Cys Val Glu Gly Cys
Ser Ser Gln His Trp Ser Tyr Gly Leu Arg Pro 915
920 925Gly Gly Lys Arg Asp Ala Glu Asn Leu Ile Asp Ser
Phe Gln Glu Ile 930 935 940Val Lys Glu
Val Gly Gln Leu Ala Glu Thr Gln Arg Phe Glu Cys Thr945
950 955 960Thr His Gln Pro Arg Ser Pro
Leu Arg Asp Leu Lys Gly Ala Leu Glu 965
970 975Ser Leu Ile Glu Glu Glu Thr Gly Gln Lys Lys Ile
980 98525901PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 25Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn
Asp Asn1 5 10 15Asp Ile
Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu Ile Thr Thr Pro 20
25 30Val Lys Ala Phe Met Ile Thr Gln Asn
Ile Trp Val Ile Pro Glu Arg 35 40
45Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro Thr 50
55 60Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro
Ser Tyr Leu Ser Thr Asp Glu65 70 75
80Gln Lys Asp Thr Phe Leu Lys Gly Ile Ile Lys Leu Phe Lys
Arg Ile 85 90 95Asn Glu
Arg Asp Ile Gly Lys Lys Leu Ile Asn Tyr Leu Val Val Gly 100
105 110Ser Pro Phe Met Gly Asp Ser Ser Thr
Pro Glu Asp Thr Phe Asp Phe 115 120
125Thr Arg His Thr Thr Asn Ile Ala Val Glu Lys Phe Glu Asn Gly Ser
130 135 140Trp Lys Val Thr Asn Ile Ile
Thr Pro Ser Val Leu Ile Phe Gly Pro145 150
155 160Leu Pro Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr
Leu Gln Gly Gln 165 170
175Gln Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser Ile Leu Lys
180 185 190Val Ala Pro Glu Phe Leu
Leu Thr Phe Ser Asp Val Thr Ser Asn Gln 195 200
205Ser Ser Ala Val Leu Gly Lys Ser Ile Phe Cys Met Asp Pro
Val Ile 210 215 220Ala Leu Met His Glu
Leu Thr His Ser Leu His Gln Leu Tyr Gly Ile225 230
235 240Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro
Gln Val Ser Glu Gly Phe 245 250
255Phe Ser Gln Asp Gly Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr Phe
260 265 270Gly Gly Leu Asp Val
Glu Ile Ile Pro Gln Ile Glu Arg Ser Gln Leu 275
280 285Arg Glu Lys Ala Leu Gly His Tyr Lys Asp Ile Ala
Lys Arg Leu Asn 290 295 300Asn Ile Asn
Lys Thr Ile Pro Ser Ser Trp Ile Ser Asn Ile Asp Lys305
310 315 320Tyr Lys Lys Ile Phe Ser Glu
Lys Tyr Asn Phe Asp Lys Asp Asn Thr 325
330 335Gly Asn Phe Val Val Asn Ile Asp Lys Phe Asn Ser
Leu Tyr Ser Asp 340 345 350Leu
Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gln Tyr Asn Val 355
360 365Lys Asn Arg Thr His Tyr Phe Ser Arg
His Tyr Leu Pro Val Phe Ala 370 375
380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile Arg Asp Gly Phe Asn Leu385
390 395 400Thr Asn Lys Gly
Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu Arg 405
410 415Asn Pro Ala Leu Gln Lys Leu Ser Ser Glu
Ser Val Val Asp Leu Phe 420 425
430Thr Lys Val Cys Val Asp Lys Ser Glu Glu Lys Leu Tyr Asp Asp Asp
435 440 445Asp Lys Asp Arg Trp Gly Ser
Ser Leu Gln Cys Ile Lys Val Lys Asn 450 455
460Asn Arg Leu Pro Tyr Val Ala Asp Lys Asp Ser Ile Ser Gln Glu
Ile465 470 475 480Phe Glu
Asn Lys Ile Ile Thr Asp Glu Thr Asn Val Gln Asn Tyr Ser
485 490 495Asp Lys Phe Ser Leu Asp Glu
Ser Ile Leu Asp Gly Gln Val Pro Ile 500 505
510Asn Pro Glu Ile Val Asp Pro Leu Leu Pro Asn Val Asn Met
Glu Pro 515 520 525Leu Asn Leu Pro
Gly Glu Glu Ile Val Phe Tyr Asp Asp Ile Thr Lys 530
535 540Tyr Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu
Ser Gln Lys Leu545 550 555
560Ser Asn Asn Val Glu Asn Ile Thr Leu Thr Thr Ser Val Glu Glu Ala
565 570 575Leu Gly Tyr Ser Asn
Lys Ile Tyr Thr Phe Leu Pro Ser Leu Ala Glu 580
585 590Lys Val Asn Lys Gly Val Gln Ala Gly Leu Phe Leu
Asn Trp Ala Asn 595 600 605Glu Val
Val Glu Asp Phe Thr Thr Asn Ile Met Lys Lys Asp Thr Leu 610
615 620Asp Lys Ile Ser Asp Val Ser Val Ile Ile Pro
Tyr Ile Gly Pro Ala625 630 635
640Leu Asn Ile Gly Asn Ser Ala Leu Arg Gly Asn Phe Asn Gln Ala Phe
645 650 655Ala Thr Ala Gly
Val Ala Phe Leu Leu Glu Gly Phe Pro Glu Phe Thr 660
665 670Ile Pro Ala Leu Gly Val Phe Thr Phe Tyr Ser
Ser Ile Gln Glu Arg 675 680 685Glu
Lys Ile Ile Lys Thr Ile Glu Asn Cys Leu Glu Gln Arg Val Lys 690
695 700Arg Trp Lys Asp Ser Tyr Gln Trp Met Val
Ser Asn Trp Leu Ser Arg705 710 715
720Ile Thr Thr Gln Phe Asn His Ile Asn Tyr Gln Met Tyr Asp Ser
Leu 725 730 735Ser Tyr Gln
Ala Asp Ala Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys 740
745 750Lys Tyr Ser Gly Ser Asp Lys Glu Asn Ile
Lys Ser Gln Val Glu Asn 755 760
765Leu Lys Asn Ser Leu Asp Val Lys Ile Ser Glu Ala Met Asn Asn Ile 770
775 780Asn Lys Phe Ile Arg Glu Cys Ser
Val Thr Tyr Leu Phe Lys Asn Met785 790
795 800Leu Pro Lys Val Ile Asp Glu Leu Asn Lys Phe Asp
Leu Arg Thr Lys 805 810
815Thr Glu Leu Ile Asn Leu Ile Asp Ser His Asn Ile Ile Leu Val Gly
820 825 830Glu Val Asp Arg Leu Lys
Ala Lys Val Asn Glu Ser Phe Glu Asn Thr 835 840
845Met Pro Phe Asn Ile Phe Ser Tyr Thr Asn Asn Ser Leu Leu
Lys Asp 850 855 860Ile Ile Asn Glu Tyr
Phe Asn Leu Glu Gly Gly Gly Gly Ser Gly Gly865 870
875 880Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu
Val Gly Asn His Trp Ala 885 890
895Val Gly His Leu Met 90026899PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 26Pro Val Thr Ile Asn Asn Phe Asn Tyr Asn Asp Pro Ile Asp
Asn Asn1 5 10 15Asn Ile
Ile Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg Tyr 20
25 30Tyr Lys Ala Phe Lys Ile Thr Asp Arg
Ile Trp Ile Ile Pro Glu Arg 35 40
45Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly Ile 50
55 60Phe Asn Arg Asp Val Cys Glu Tyr Tyr
Asp Pro Asp Tyr Leu Asn Thr65 70 75
80Asn Asp Lys Lys Asn Ile Phe Leu Gln Thr Met Ile Lys Leu
Phe Asn 85 90 95Arg Ile
Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met Ile Ile 100
105 110Asn Gly Ile Pro Tyr Leu Gly Asp Arg
Arg Val Pro Leu Glu Glu Phe 115 120
125Asn Thr Asn Ile Ala Ser Val Thr Val Asn Lys Leu Ile Ser Asn Pro
130 135 140Gly Glu Val Glu Arg Lys Lys
Gly Ile Phe Ala Asn Leu Ile Ile Phe145 150
155 160Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr Ile
Asp Ile Gly Ile 165 170
175Gln Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly Ile Met Gln Met
180 185 190Lys Phe Cys Pro Glu Tyr
Val Ser Val Phe Asn Asn Val Gln Glu Asn 195 200
205Lys Gly Ala Ser Ile Phe Asn Arg Arg Gly Tyr Phe Ser Asp
Pro Ala 210 215 220Leu Ile Leu Met His
Glu Leu Ile His Val Leu His Gly Leu Tyr Gly225 230
235 240Ile Lys Val Asp Asp Leu Pro Ile Val Pro
Asn Glu Lys Lys Phe Phe 245 250
255Met Gln Ser Thr Asp Ala Ile Gln Ala Glu Glu Leu Tyr Thr Phe Gly
260 265 270Gly Gln Asp Pro Ser
Ile Ile Thr Pro Ser Thr Asp Lys Ser Ile Tyr 275
280 285Asp Lys Val Leu Gln Asn Phe Arg Gly Ile Val Asp
Arg Leu Asn Lys 290 295 300Val Leu Val
Cys Ile Ser Asp Pro Asn Ile Asn Ile Asn Ile Tyr Lys305
310 315 320Asn Lys Phe Lys Asp Lys Tyr
Lys Phe Val Glu Asp Ser Glu Gly Lys 325
330 335Tyr Ser Ile Asp Val Glu Ser Phe Asp Lys Leu Tyr
Lys Ser Leu Met 340 345 350Phe
Gly Phe Thr Glu Thr Asn Ile Ala Glu Asn Tyr Lys Ile Lys Thr 355
360 365Arg Ala Ser Tyr Phe Ser Asp Ser Leu
Pro Pro Val Lys Ile Lys Asn 370 375
380Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu Glu Gly Phe Asn Ile Ser385
390 395 400Asp Lys Asp Met
Glu Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile Asn 405
410 415Lys Gln Ala Tyr Glu Glu Ile Ser Lys Glu
His Leu Ala Val Tyr Lys 420 425
430Ile Gln Met Cys Val Asp Glu Glu Lys Leu Tyr Asp Asp Asp Asp Lys
435 440 445Asp Arg Trp Gly Ser Ser Leu
Gln Cys Ile Asp Val Asp Asn Glu Asp 450 455
460Leu Phe Phe Ile Ala Asp Lys Asn Ser Phe Ser Asp Asp Leu Ser
Lys465 470 475 480Asn Glu
Arg Ile Glu Tyr Asn Thr Gln Ser Asn Tyr Ile Glu Asn Asp
485 490 495Phe Pro Ile Asn Glu Leu Ile
Leu Asp Thr Asp Leu Ile Ser Lys Ile 500 505
510Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr Asp Phe Asn
Val Asp 515 520 525Val Pro Val Tyr
Glu Lys Gln Pro Ala Ile Lys Lys Ile Phe Thr Asp 530
535 540Glu Asn Thr Ile Phe Gln Tyr Leu Tyr Ser Gln Thr
Phe Pro Leu Asp545 550 555
560Ile Arg Asp Ile Ser Leu Thr Ser Ser Phe Asp Asp Ala Leu Leu Phe
565 570 575Ser Asn Lys Val Tyr
Ser Phe Phe Ser Met Asp Tyr Ile Lys Thr Ala 580
585 590Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly Trp
Val Lys Gln Ile 595 600 605Val Asn
Asp Phe Val Ile Glu Ala Asn Lys Ser Asn Thr Met Asp Ala 610
615 620Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Ile
Gly Leu Ala Leu Asn625 630 635
640Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu Asn Ala Phe Glu Ile
645 650 655Ala Gly Ala Ser
Ile Leu Leu Glu Phe Ile Pro Glu Leu Leu Ile Pro 660
665 670Val Val Gly Ala Phe Leu Leu Glu Ser Tyr Ile
Asp Asn Lys Asn Lys 675 680 685Ile
Ile Lys Thr Ile Asp Asn Ala Leu Thr Lys Arg Asn Glu Lys Trp 690
695 700Ser Asp Met Tyr Gly Leu Ile Val Ala Gln
Trp Leu Ser Thr Val Asn705 710 715
720Thr Gln Phe Tyr Thr Ile Lys Glu Gly Met Tyr Lys Ala Leu Asn
Tyr 725 730 735Gln Ala Gln
Ala Leu Glu Glu Ile Ile Lys Tyr Arg Tyr Asn Ile Tyr 740
745 750Ser Glu Lys Glu Lys Ser Asn Ile Asn Ile
Asp Phe Asn Asp Ile Asn 755 760
765Ser Lys Leu Asn Glu Gly Ile Asn Gln Ala Ile Asp Asn Ile Asn Asn 770
775 780Phe Ile Asn Gly Cys Ser Val Ser
Tyr Leu Met Lys Lys Met Ile Pro785 790
795 800Leu Ala Val Glu Lys Leu Leu Asp Phe Asp Asn Thr
Leu Lys Lys Asn 805 810
815Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tyr Leu Ile Gly Ser Ala
820 825 830Glu Tyr Glu Lys Ser Lys
Val Asn Lys Tyr Leu Lys Thr Ile Met Pro 835 840
845Phe Asp Leu Ser Ile Tyr Thr Asn Asp Thr Ile Leu Ile Glu
Met Phe 850 855 860Asn Lys Tyr Asn Ser
Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly865 870
875 880Ser Gly Gly Gly Gly Ser Ala Leu Val Gly
Asn His Trp Ala Val Gly 885 890
895His Leu Met271065PRTArtificial Sequencesource/note="Description
of Artificial Sequence Synthetic polypeptide" 27Pro Ile Thr Ile Asn
Asn Phe Asn Tyr Ser Asp Pro Val Asp Asn Lys1 5
10 15Asn Ile Leu Tyr Leu Asp Thr His Leu Asn Thr
Leu Ala Asn Glu Pro 20 25
30Glu Lys Ala Phe Arg Ile Thr Gly Asn Ile Trp Val Ile Pro Asp Arg
35 40 45Phe Ser Arg Asn Ser Asn Pro Asn
Leu Asn Lys Pro Pro Arg Val Thr 50 55
60Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp Ser65
70 75 80Asp Lys Asp Thr Phe
Leu Lys Glu Ile Ile Lys Leu Phe Lys Arg Ile 85
90 95Asn Ser Arg Glu Ile Gly Glu Glu Leu Ile Tyr
Arg Leu Ser Thr Asp 100 105
110Ile Pro Phe Pro Gly Asn Asn Asn Thr Pro Ile Asn Thr Phe Asp Phe
115 120 125Asp Val Asp Phe Asn Ser Val
Asp Val Lys Thr Arg Gln Gly Asn Asn 130 135
140Trp Val Lys Thr Gly Ser Ile Asn Pro Ser Val Ile Ile Thr Gly
Pro145 150 155 160Arg Glu
Asn Ile Ile Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr Asn
165 170 175Asn Thr Phe Ala Ala Gln Glu
Gly Phe Gly Ala Leu Ser Ile Ile Ser 180 185
190Ile Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn
Asp Val 195 200 205Gly Glu Gly Arg
Phe Ser Lys Ser Glu Phe Cys Met Asp Pro Ile Leu 210
215 220Ile Leu Met His Glu Leu Asn His Ala Met His Asn
Leu Tyr Gly Ile225 230 235
240Ala Ile Pro Asn Asp Gln Thr Ile Ser Ser Val Thr Ser Asn Ile Phe
245 250 255Tyr Ser Gln Tyr Asn
Val Lys Leu Glu Tyr Ala Glu Ile Tyr Ala Phe 260
265 270Gly Gly Pro Thr Ile Asp Leu Ile Pro Lys Ser Ala
Arg Lys Tyr Phe 275 280 285Glu Glu
Lys Ala Leu Asp Tyr Tyr Arg Ser Ile Ala Lys Arg Leu Asn 290
295 300Ser Ile Thr Thr Ala Asn Pro Ser Ser Phe Asn
Lys Tyr Ile Gly Glu305 310 315
320Tyr Lys Gln Lys Leu Ile Arg Lys Tyr Arg Phe Val Val Glu Ser Ser
325 330 335Gly Glu Val Thr
Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn Glu 340
345 350Leu Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala
Lys Ile Tyr Asn Val 355 360 365Gln
Asn Arg Lys Ile Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr Ala 370
375 380Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile
Gln Asn Gly Phe Asn Ile385 390 395
400Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser
Arg 405 410 415Asn Pro Ala
Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu Phe 420
425 430Thr Lys Phe Cys Val Asp Ala Ile Asp Gly
Arg Ser Leu Tyr Asn Lys 435 440
445Thr Leu Gln Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe 450
455 460Ile Gly Asp Ile Ser Asp Val Lys
Thr Asp Ile Phe Leu Arg Lys Asp465 470
475 480Ile Asn Glu Glu Thr Glu Val Ile Tyr Tyr Pro Asp
Asn Val Ser Val 485 490
495Asp Gln Val Ile Leu Ser Lys Asn Thr Ser Glu His Gly Gln Leu Asp
500 505 510Leu Leu Tyr Pro Ser Ile
Asp Ser Glu Ser Glu Ile Leu Pro Gly Glu 515 520
525Asn Gln Val Phe Tyr Asp Asn Arg Thr Gln Asn Val Asp Tyr
Leu Asn 530 535 540Ser Tyr Tyr Tyr Leu
Glu Ser Gln Lys Leu Ser Asp Asn Val Glu Asp545 550
555 560Phe Thr Phe Thr Arg Ser Ile Glu Glu Ala
Leu Asp Asn Ser Ala Lys 565 570
575Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val
580 585 590Gln Gly Gly Leu Phe
Leu Met Trp Ala Asn Asp Val Val Glu Asp Phe 595
600 605Thr Thr Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys
Ile Ser Asp Val 610 615 620Ser Ala Ile
Ile Pro Tyr Ile Gly Pro Ala Leu Asn Ile Ser Asn Ser625
630 635 640Val Arg Arg Gly Asn Phe Thr
Glu Ala Phe Ala Val Thr Gly Val Thr 645
650 655Ile Leu Leu Glu Ala Phe Pro Glu Phe Thr Ile Pro
Ala Leu Gly Ala 660 665 670Phe
Val Ile Tyr Ser Lys Val Gln Glu Arg Asn Glu Ile Ile Lys Thr 675
680 685Ile Asp Asn Cys Leu Glu Gln Arg Ile
Lys Arg Trp Lys Asp Ser Tyr 690 695
700Glu Trp Met Met Gly Thr Trp Leu Ser Arg Ile Ile Thr Gln Phe Asn705
710 715 720Asn Ile Ser Tyr
Gln Met Tyr Asp Ser Leu Asn Tyr Gln Ala Gly Ala 725
730 735Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys
Lys Tyr Ser Gly Ser Asp 740 745
750Lys Glu Asn Ile Lys Ser Gln Val Glu Asn Leu Lys Asn Ser Leu Asp
755 760 765Val Lys Ile Ser Glu Ala Met
Asn Asn Ile Asn Lys Phe Ile Arg Glu 770 775
780Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val Ile
Asp785 790 795 800Glu Leu
Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu Ile Asn Leu
805 810 815Ile Asp Ser His Asn Ile Ile
Leu Val Gly Glu Val Asp Lys Leu Lys 820 825
830Ala Lys Val Asn Asn Ser Phe Gln Asn Thr Ile Pro Phe Asn
Ile Phe 835 840 845Ser Tyr Thr Asn
Asn Ser Leu Leu Lys Asp Ile Ile Asn Glu Tyr Phe 850
855 860Asn Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly
Ser Gly Gly Gly865 870 875
880Gly Ser Ala Leu Val Ser Pro Leu Pro Ile Thr Pro Val Asn Ala Thr
885 890 895Cys Ala Ile Arg His
Pro Cys His Asn Asn Leu Met Asn Gln Ile Arg 900
905 910Ser Gln Leu Ala Gln Leu Asn Gly Ser Ala Asn Ala
Leu Phe Ile Leu 915 920 925Tyr Tyr
Thr Ala Gln Gly Glu Pro Phe Pro Asn Asn Leu Asp Lys Leu 930
935 940Cys Gly Pro Asn Val Thr Asp Phe Pro Pro Phe
His Ala Asn Gly Thr945 950 955
960Glu Lys Ala Lys Leu Val Glu Leu Tyr Arg Ile Val Val Tyr Leu Gly
965 970 975Thr Ser Leu Gly
Asn Ile Thr Arg Asp Gln Lys Ile Leu Asn Pro Ser 980
985 990Ala Leu Ser Leu His Ser Lys Leu Asn Ala Thr
Ala Asp Ile Leu Arg 995 1000
1005Gly Leu Leu Ser Asn Val Leu Cys Arg Leu Cys Ser Lys Tyr His
1010 1015 1020Val Gly His Val Asp Val
Thr Tyr Gly Pro Asp Thr Ser Gly Lys 1025 1030
1035Asp Val Phe Gln Lys Lys Lys Leu Gly Cys Gln Leu Leu Gly
Lys 1040 1045 1050Tyr Lys Gln Ile Ile
Ala Val Leu Ala Gln Ala Phe 1055 1060
1065281060PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 28Pro Val Thr Ile Asn Asn Phe Asn
Tyr Asn Asp Pro Ile Asp Asn Asn1 5 10
15Asn Ile Ile Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly
Arg Tyr 20 25 30Tyr Lys Ala
Phe Lys Ile Thr Asp Arg Ile Trp Ile Ile Pro Glu Arg 35
40 45Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn
Lys Ser Ser Gly Ile 50 55 60Phe Asn
Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn Thr65
70 75 80Asn Asp Lys Lys Asn Ile Phe
Leu Gln Thr Met Ile Lys Leu Phe Asn 85 90
95Arg Ile Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu
Met Ile Ile 100 105 110Asn Gly
Ile Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu Phe 115
120 125Asn Thr Asn Ile Ala Ser Val Thr Val Asn
Lys Leu Ile Ser Asn Pro 130 135 140Gly
Glu Val Glu Arg Lys Lys Gly Ile Phe Ala Asn Leu Ile Ile Phe145
150 155 160Gly Pro Gly Pro Val Leu
Asn Glu Asn Glu Thr Ile Asp Ile Gly Ile 165
170 175Gln Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly
Ile Met Gln Met 180 185 190Lys
Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gln Glu Asn 195
200 205Lys Gly Ala Ser Ile Phe Asn Arg Arg
Gly Tyr Phe Ser Asp Pro Ala 210 215
220Leu Ile Leu Met His Glu Leu Ile His Val Leu His Gly Leu Tyr Gly225
230 235 240Ile Lys Val Asp
Asp Leu Pro Ile Val Pro Asn Glu Lys Lys Phe Phe 245
250 255Met Gln Ser Thr Asp Ala Ile Gln Ala Glu
Glu Leu Tyr Thr Phe Gly 260 265
270Gly Gln Asp Pro Ser Ile Ile Thr Pro Ser Thr Asp Lys Ser Ile Tyr
275 280 285Asp Lys Val Leu Gln Asn Phe
Arg Gly Ile Val Asp Arg Leu Asn Lys 290 295
300Val Leu Val Cys Ile Ser Asp Pro Asn Ile Asn Ile Asn Ile Tyr
Lys305 310 315 320Asn Lys
Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly Lys
325 330 335Tyr Ser Ile Asp Val Glu Ser
Phe Asp Lys Leu Tyr Lys Ser Leu Met 340 345
350Phe Gly Phe Thr Glu Thr Asn Ile Ala Glu Asn Tyr Lys Ile
Lys Thr 355 360 365Arg Ala Ser Tyr
Phe Ser Asp Ser Leu Pro Pro Val Lys Ile Lys Asn 370
375 380Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu Glu Gly
Phe Asn Ile Ser385 390 395
400Asp Lys Asp Met Glu Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile Asn
405 410 415Lys Gln Ala Tyr Glu
Glu Ile Ser Lys Glu His Leu Ala Val Tyr Lys 420
425 430Ile Gln Met Cys Val Asp Asn Asn Asn Asn Asn Asn
Asn Asn Asn Asn 435 440 445Asp Asp
Asp Asp Lys Ser Pro Leu Pro Ile Thr Pro Val Asn Ala Thr 450
455 460Cys Ala Ile Arg His Pro Cys His Asn Asn Leu
Met Asn Gln Ile Arg465 470 475
480Ser Gln Leu Ala Gln Leu Asn Gly Ser Ala Asn Ala Leu Phe Ile Leu
485 490 495Tyr Tyr Thr Ala
Gln Gly Glu Pro Phe Pro Asn Asn Leu Asp Lys Leu 500
505 510Cys Gly Pro Asn Val Thr Asp Phe Pro Pro Phe
His Ala Asn Gly Thr 515 520 525Glu
Lys Ala Lys Leu Val Glu Leu Tyr Arg Ile Val Val Tyr Leu Gly 530
535 540Thr Ser Leu Gly Asn Ile Thr Arg Asp Gln
Lys Ile Leu Asn Pro Ser545 550 555
560Ala Leu Ser Leu His Ser Lys Leu Asn Ala Thr Ala Asp Ile Leu
Arg 565 570 575Gly Leu Leu
Ser Asn Val Leu Cys Arg Leu Cys Ser Lys Tyr His Val 580
585 590Gly His Val Asp Val Thr Tyr Gly Pro Asp
Thr Ser Gly Lys Asp Val 595 600
605Phe Gln Lys Lys Lys Leu Gly Cys Gln Leu Leu Gly Lys Tyr Lys Gln 610
615 620Ile Ile Ala Val Leu Ala Gln Ala
Phe Ala Glu Ala Ala Ala Lys Glu625 630
635 640Ala Ala Ala Lys Ala Leu Gln Cys Ile Asp Val Asp
Asn Glu Asp Leu 645 650
655Phe Phe Ile Ala Asp Lys Asn Ser Phe Ser Asp Asp Leu Ser Lys Asn
660 665 670Glu Arg Ile Glu Tyr Asn
Thr Gln Ser Asn Tyr Ile Glu Asn Asp Phe 675 680
685Pro Ile Asn Glu Leu Ile Leu Asp Thr Asp Leu Ile Ser Lys
Ile Glu 690 695 700Leu Pro Ser Glu Asn
Thr Glu Ser Leu Thr Asp Phe Asn Val Asp Val705 710
715 720Pro Val Tyr Glu Lys Gln Pro Ala Ile Lys
Lys Ile Phe Thr Asp Glu 725 730
735Asn Thr Ile Phe Gln Tyr Leu Tyr Ser Gln Thr Phe Pro Leu Asp Ile
740 745 750Arg Asp Ile Ser Leu
Thr Ser Ser Phe Asp Asp Ala Leu Leu Phe Ser 755
760 765Asn Lys Val Tyr Ser Phe Phe Ser Met Asp Tyr Ile
Lys Thr Ala Asn 770 775 780Lys Val Val
Glu Ala Gly Leu Phe Ala Gly Trp Val Lys Gln Ile Val785
790 795 800Asn Asp Phe Val Ile Glu Ala
Asn Lys Ser Asn Thr Met Asp Lys Ile 805
810 815Ala Asp Ile Ser Leu Ile Val Pro Tyr Ile Gly Leu
Ala Leu Asn Val 820 825 830Gly
Asn Glu Thr Ala Lys Gly Asn Phe Glu Asn Ala Phe Glu Ile Ala 835
840 845Gly Ala Ser Ile Leu Leu Glu Phe Ile
Pro Glu Leu Leu Ile Pro Val 850 855
860Val Gly Ala Phe Leu Leu Glu Ser Tyr Ile Asp Asn Lys Asn Lys Ile865
870 875 880Ile Lys Thr Ile
Asp Asn Ala Leu Thr Lys Arg Asn Glu Lys Trp Ser 885
890 895Asp Met Tyr Gly Leu Ile Val Ala Gln Trp
Leu Ser Thr Val Asn Thr 900 905
910Gln Phe Tyr Thr Ile Lys Glu Gly Met Tyr Lys Ala Leu Asn Tyr Gln
915 920 925Ala Gln Ala Leu Glu Glu Ile
Ile Lys Tyr Arg Tyr Asn Ile Tyr Ser 930 935
940Glu Lys Glu Lys Ser Asn Ile Asn Ile Asp Phe Asn Asp Ile Asn
Ser945 950 955 960Lys Leu
Asn Glu Gly Ile Asn Gln Ala Ile Asp Asn Ile Asn Asn Phe
965 970 975Ile Asn Gly Cys Ser Val Ser
Tyr Leu Met Lys Lys Met Ile Pro Leu 980 985
990Ala Val Glu Lys Leu Leu Asp Phe Asp Asn Thr Leu Lys Lys
Asn Leu 995 1000 1005Leu Asn Tyr
Ile Asp Glu Asn Lys Leu Tyr Leu Ile Gly Ser Ala 1010
1015 1020Glu Tyr Glu Lys Ser Lys Val Asn Lys Tyr Leu
Lys Thr Ile Met 1025 1030 1035Pro Phe
Asp Leu Ser Ile Tyr Thr Asn Asp Thr Ile Leu Ile Glu 1040
1045 1050Met Phe Asn Lys Tyr Asn Ser 1055
1060291026PRTArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polypeptide" 29Pro Ile Thr Ile Asn Asn
Phe Asn Tyr Ser Asp Pro Val Asp Asn Lys1 5
10 15Asn Ile Leu Tyr Leu Asp Thr His Leu Asn Thr Leu
Ala Asn Glu Pro 20 25 30Glu
Lys Ala Phe Arg Ile Thr Gly Asn Ile Trp Val Ile Pro Asp Arg 35
40 45Phe Ser Arg Asn Ser Asn Pro Asn Leu
Asn Lys Pro Pro Arg Val Thr 50 55
60Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp Ser65
70 75 80Asp Lys Asp Thr Phe
Leu Lys Glu Ile Ile Lys Leu Phe Lys Arg Ile 85
90 95Asn Ser Arg Glu Ile Gly Glu Glu Leu Ile Tyr
Arg Leu Ser Thr Asp 100 105
110Ile Pro Phe Pro Gly Asn Asn Asn Thr Pro Ile Asn Thr Phe Asp Phe
115 120 125Asp Val Asp Phe Asn Ser Val
Asp Val Lys Thr Arg Gln Gly Asn Asn 130 135
140Trp Val Lys Thr Gly Ser Ile Asn Pro Ser Val Ile Ile Thr Gly
Pro145 150 155 160Arg Glu
Asn Ile Ile Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr Asn
165 170 175Asn Thr Phe Ala Ala Gln Glu
Gly Phe Gly Ala Leu Ser Ile Ile Ser 180 185
190Ile Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn
Asp Val 195 200 205Gly Glu Gly Arg
Phe Ser Lys Ser Glu Phe Cys Met Asp Pro Ile Leu 210
215 220Ile Leu Met His Glu Leu Asn His Ala Met His Asn
Leu Tyr Gly Ile225 230 235
240Ala Ile Pro Asn Asp Gln Thr Ile Ser Ser Val Thr Ser Asn Ile Phe
245 250 255Tyr Ser Gln Tyr Asn
Val Lys Leu Glu Tyr Ala Glu Ile Tyr Ala Phe 260
265 270Gly Gly Pro Thr Ile Asp Leu Ile Pro Lys Ser Ala
Arg Lys Tyr Phe 275 280 285Glu Glu
Lys Ala Leu Asp Tyr Tyr Arg Ser Ile Ala Lys Arg Leu Asn 290
295 300Ser Ile Thr Thr Ala Asn Pro Ser Ser Phe Asn
Lys Tyr Ile Gly Glu305 310 315
320Tyr Lys Gln Lys Leu Ile Arg Lys Tyr Arg Phe Val Val Glu Ser Ser
325 330 335Gly Glu Val Thr
Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn Glu 340
345 350Leu Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala
Lys Ile Tyr Asn Val 355 360 365Gln
Asn Arg Lys Ile Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr Ala 370
375 380Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile
Gln Asn Gly Phe Asn Ile385 390 395
400Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser
Arg 405 410 415Asn Pro Ala
Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu Phe 420
425 430Thr Lys Phe Cys Val Asp Ala Ile Asp Gly
Arg Ser Leu Tyr Asn Lys 435 440
445Thr Leu Gln Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe 450
455 460Ile Gly Asp Ile Ser Asp Val Lys
Thr Asp Ile Phe Leu Arg Lys Asp465 470
475 480Ile Asn Glu Glu Thr Glu Val Ile Tyr Tyr Pro Asp
Asn Val Ser Val 485 490
495Asp Gln Val Ile Leu Ser Lys Asn Thr Ser Glu His Gly Gln Leu Asp
500 505 510Leu Leu Tyr Pro Ser Ile
Asp Ser Glu Ser Glu Ile Leu Pro Gly Glu 515 520
525Asn Gln Val Phe Tyr Asp Asn Arg Thr Gln Asn Val Asp Tyr
Leu Asn 530 535 540Ser Tyr Tyr Tyr Leu
Glu Ser Gln Lys Leu Ser Asp Asn Val Glu Asp545 550
555 560Phe Thr Phe Thr Arg Ser Ile Glu Glu Ala
Leu Asp Asn Ser Ala Lys 565 570
575Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val
580 585 590Gln Gly Gly Leu Phe
Leu Met Trp Ala Asn Asp Val Val Glu Asp Phe 595
600 605Thr Thr Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys
Ile Ser Asp Val 610 615 620Ser Ala Ile
Ile Pro Tyr Ile Gly Pro Ala Leu Asn Ile Ser Asn Ser625
630 635 640Val Arg Arg Gly Asn Phe Thr
Glu Ala Phe Ala Val Thr Gly Val Thr 645
650 655Ile Leu Leu Glu Ala Phe Pro Glu Phe Thr Ile Pro
Ala Leu Gly Ala 660 665 670Phe
Val Ile Tyr Ser Lys Val Gln Glu Arg Asn Glu Ile Ile Lys Thr 675
680 685Ile Asp Asn Cys Leu Glu Gln Arg Ile
Lys Arg Trp Lys Asp Ser Tyr 690 695
700Glu Trp Met Met Gly Thr Trp Leu Ser Arg Ile Ile Thr Gln Phe Asn705
710 715 720Asn Ile Ser Tyr
Gln Met Tyr Asp Ser Leu Asn Tyr Gln Ala Gly Ala 725
730 735Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys
Lys Tyr Ser Gly Ser Asp 740 745
750Lys Glu Asn Ile Lys Ser Gln Val Glu Asn Leu Lys Asn Ser Leu Asp
755 760 765Val Lys Ile Ser Glu Ala Met
Asn Asn Ile Asn Lys Phe Ile Arg Glu 770 775
780Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val Ile
Asp785 790 795 800Glu Leu
Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu Ile Asn Leu
805 810 815Ile Asp Ser His Asn Ile Ile
Leu Val Gly Glu Val Asp Lys Leu Lys 820 825
830Ala Lys Val Asn Asn Ser Phe Gln Asn Thr Ile Pro Phe Asn
Ile Phe 835 840 845Ser Tyr Thr Asn
Asn Ser Leu Leu Lys Asp Ile Ile Asn Glu Tyr Phe 850
855 860Asn Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly
Ser Gly Gly Gly865 870 875
880Gly Ser Ala Leu Val Met Phe Asn Leu Pro Pro Gly Asn Tyr Lys Lys
885 890 895Pro Lys Leu Leu Tyr
Cys Ser Asn Gly Gly His Phe Leu Arg Ile Leu 900
905 910Pro Asp Gly Thr Val Asp Gly Thr Arg Asp Arg Ser
Asp Gln His Ile 915 920 925Gln Leu
Gln Leu Ser Ala Glu Ser Val Gly Glu Val Tyr Ile Lys Ser 930
935 940Thr Glu Thr Gly Gln Tyr Leu Ala Met Asp Thr
Asp Gly Leu Leu Tyr945 950 955
960Gly Ser Gln Thr Pro Asn Glu Glu Cys Leu Phe Leu Glu Arg Leu Glu
965 970 975Glu Asn His Tyr
Asn Thr Tyr Ile Ser Lys Lys His Ala Glu Lys Asn 980
985 990Trp Phe Val Gly Leu Lys Lys Asn Gly Ser Cys
Lys Arg Gly Pro Arg 995 1000
1005Thr His Tyr Gly Gln Lys Ala Ile Leu Phe Leu Pro Leu Pro Val
1010 1015 1020Ser Ser Asp
1025301018PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 30Glu Phe Val Asn Lys Gln Phe Asn
Tyr Lys Asp Pro Val Asn Gly Val1 5 10
15Asp Ile Ala Tyr Ile Lys Ile Pro Asn Ala Gly Gln Met Gln
Pro Val 20 25 30Lys Ala Phe
Lys Ile His Asn Lys Ile Trp Val Ile Pro Glu Arg Asp 35
40 45Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn
Pro Pro Pro Glu Ala 50 55 60Lys Gln
Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr Asp65
70 75 80Asn Glu Lys Asp Asn Tyr Leu
Lys Gly Val Thr Lys Leu Phe Glu Arg 85 90
95Ile Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser
Ile Val Arg 100 105 110Gly Ile
Pro Phe Trp Gly Gly Ser Thr Ile Asp Thr Glu Leu Lys Val 115
120 125Ile Asp Thr Asn Cys Ile Asn Val Ile Gln
Pro Asp Gly Ser Tyr Arg 130 135 140Ser
Glu Glu Leu Asn Leu Val Ile Ile Gly Pro Ser Ala Asp Ile Ile145
150 155 160Gln Phe Glu Cys Leu Ser
Phe Gly His Glu Val Leu Asn Leu Thr Arg 165
170 175Asn Gly Tyr Gly Ser Thr Gln Tyr Ile Arg Phe Ser
Pro Asp Phe Thr 180 185 190Phe
Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu Gly 195
200 205Ala Gly Lys Phe Ala Thr Asp Pro Ala
Val Thr Leu Ala His Glu Leu 210 215
220Ile His Ala Gly His Arg Leu Tyr Gly Ile Ala Ile Asn Pro Asn Arg225
230 235 240Val Phe Lys Val
Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu Glu 245
250 255Val Ser Phe Glu Glu Leu Arg Thr Phe Gly
Gly His Asp Ala Lys Phe 260 265
270Ile Asp Ser Leu Gln Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys
275 280 285Phe Lys Asp Ile Ala Ser Thr
Leu Asn Lys Ala Lys Ser Ile Val Gly 290 295
300Thr Thr Ala Ser Leu Gln Tyr Met Lys Asn Val Phe Lys Glu Lys
Tyr305 310 315 320Leu Leu
Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu Lys
325 330 335Phe Asp Lys Leu Tyr Lys Met
Leu Thr Glu Ile Tyr Thr Glu Asp Asn 340 345
350Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu
Asn Phe 355 360 365Asp Lys Ala Val
Phe Lys Ile Asn Ile Val Pro Lys Val Asn Tyr Thr 370
375 380Ile Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu
Ala Ala Asn Phe385 390 395
400Asn Gly Gln Asn Thr Glu Ile Asn Asn Met Asn Phe Thr Lys Leu Lys
405 410 415Asn Phe Thr Gly Leu
Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp Gly 420
425 430Ile Ile Thr Ser Lys Thr Lys Ser Asp Asp Asp Asp
Lys Met Phe Asn 435 440 445Leu Pro
Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser Asn 450
455 460Gly Gly His Phe Leu Arg Ile Leu Pro Asp Gly
Thr Val Asp Gly Thr465 470 475
480Arg Asp Arg Ser Asp Gln His Ile Gln Leu Gln Leu Ser Ala Glu Ser
485 490 495Val Gly Glu Val
Tyr Ile Lys Ser Thr Glu Thr Gly Gln Tyr Leu Ala 500
505 510Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gln
Thr Pro Asn Glu Glu 515 520 525Cys
Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr Ile 530
535 540Ser Lys Lys His Ala Glu Lys Asn Trp Phe
Val Gly Leu Lys Lys Asn545 550 555
560Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gln Lys Ala
Ile 565 570 575Leu Phe Leu
Pro Leu Pro Val Ser Ser Asp Gly Gly Gly Gly Ser Gly 580
585 590Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala
Leu Val Leu Gln Cys Ile 595 600
605Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn Phe 610
615 620Thr Asn Asp Leu Asn Lys Gly Glu
Glu Ile Thr Ser Asp Thr Asn Ile625 630
635 640Glu Ala Ala Glu Glu Asn Ile Ser Leu Asp Leu Ile
Gln Gln Tyr Tyr 645 650
655Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn Ile Ser Ile Glu Asn
660 665 670Leu Ser Ser Asp Ile Ile
Gly Gln Leu Glu Leu Met Pro Asn Ile Glu 675 680
685Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr
Met Phe 690 695 700His Tyr Leu Arg Ala
Gln Glu Phe Glu His Gly Lys Ser Arg Ile Ala705 710
715 720Leu Thr Asn Ser Val Asn Glu Ala Leu Leu
Asn Pro Ser Arg Val Tyr 725 730
735Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys Ala Thr Glu
740 745 750Ala Ala Met Phe Leu
Gly Trp Val Glu Gln Leu Val Tyr Asp Phe Thr 755
760 765Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys Ile
Ala Asp Ile Thr 770 775 780Ile Ile Ile
Pro Tyr Ile Gly Pro Ala Leu Asn Ile Gly Asn Met Leu785
790 795 800Tyr Lys Asp Asp Phe Val Gly
Ala Leu Ile Phe Ser Gly Ala Val Ile 805
810 815Leu Leu Glu Phe Ile Pro Glu Ile Ala Ile Pro Val
Leu Gly Thr Phe 820 825 830Ala
Leu Val Ser Tyr Ile Ala Asn Lys Val Leu Thr Val Gln Thr Ile 835
840 845Asp Asn Ala Leu Ser Lys Arg Asn Glu
Lys Trp Asp Glu Val Tyr Lys 850 855
860Tyr Ile Val Thr Asn Trp Leu Ala Lys Val Asn Thr Gln Ile Asp Leu865
870 875 880Ile Arg Lys Lys
Met Lys Glu Ala Leu Glu Asn Gln Ala Glu Ala Thr 885
890 895Lys Ala Ile Ile Asn Tyr Gln Tyr Asn Gln
Tyr Thr Glu Glu Glu Lys 900 905
910Asn Asn Ile Asn Phe Asn Ile Asp Asp Leu Ser Ser Lys Leu Asn Glu
915 920 925Ser Ile Asn Lys Ala Met Ile
Asn Ile Asn Lys Phe Leu Asn Gln Cys 930 935
940Ser Val Ser Tyr Leu Met Asn Ser Met Ile Pro Tyr Gly Val Lys
Arg945 950 955 960Leu Glu
Asp Phe Asp Ala Ser Leu Lys Asp Ala Leu Leu Lys Tyr Ile
965 970 975Tyr Asp Asn Arg Gly Thr Leu
Ile Gly Gln Val Asp Arg Leu Lys Asp 980 985
990Lys Val Asn Asn Thr Leu Ser Thr Asp Ile Pro Phe Gln Leu
Ser Lys 995 1000 1005Tyr Val Asp
Asn Gln Arg Leu Leu Ser Thr 1010
1015311063PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 31Glu Phe Val Asn Lys Gln Phe Asn
Tyr Lys Asp Pro Val Asn Gly Val1 5 10
15Asp Ile Ala Tyr Ile Lys Ile Pro Asn Ala Gly Gln Met Gln
Pro Val 20 25 30Lys Ala Phe
Lys Ile His Asn Lys Ile Trp Val Ile Pro Glu Arg Asp 35
40 45Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn
Pro Pro Pro Glu Ala 50 55 60Lys Gln
Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr Asp65
70 75 80Asn Glu Lys Asp Asn Tyr Leu
Lys Gly Val Thr Lys Leu Phe Glu Arg 85 90
95Ile Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser
Ile Val Arg 100 105 110Gly Ile
Pro Phe Trp Gly Gly Ser Thr Ile Asp Thr Glu Leu Lys Val 115
120 125Ile Asp Thr Asn Cys Ile Asn Val Ile Gln
Pro Asp Gly Ser Tyr Arg 130 135 140Ser
Glu Glu Leu Asn Leu Val Ile Ile Gly Pro Ser Ala Asp Ile Ile145
150 155 160Gln Phe Glu Cys Leu Ser
Phe Gly His Glu Val Leu Asn Leu Thr Arg 165
170 175Asn Gly Tyr Gly Ser Thr Gln Tyr Ile Arg Phe Ser
Pro Asp Phe Thr 180 185 190Phe
Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu Gly 195
200 205Ala Gly Lys Phe Ala Thr Asp Pro Ala
Val Thr Leu Ala His Glu Leu 210 215
220Ile His Ala Gly His Arg Leu Tyr Gly Ile Ala Ile Asn Pro Asn Arg225
230 235 240Val Phe Lys Val
Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu Glu 245
250 255Val Ser Phe Glu Glu Leu Arg Thr Phe Gly
Gly His Asp Ala Lys Phe 260 265
270Ile Asp Ser Leu Gln Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys
275 280 285Phe Lys Asp Ile Ala Ser Thr
Leu Asn Lys Ala Lys Ser Ile Val Gly 290 295
300Thr Thr Ala Ser Leu Gln Tyr Met Lys Asn Val Phe Lys Glu Lys
Tyr305 310 315 320Leu Leu
Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu Lys
325 330 335Phe Asp Lys Leu Tyr Lys Met
Leu Thr Glu Ile Tyr Thr Glu Asp Asn 340 345
350Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu
Asn Phe 355 360 365Asp Lys Ala Val
Phe Lys Ile Asn Ile Val Pro Lys Val Asn Tyr Thr 370
375 380Ile Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu
Ala Ala Asn Phe385 390 395
400Asn Gly Gln Asn Thr Glu Ile Asn Asn Met Asn Phe Thr Lys Leu Lys
405 410 415Asn Phe Thr Gly Leu
Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp Gly 420
425 430Ile Ile Thr Ser Lys Thr Lys Ser Asp Asp Asp Asp
Lys Asn Lys Ala 435 440 445Leu Asn
Leu Gln Cys Ile Lys Val Asn Asn Trp Asp Leu Phe Phe Ser 450
455 460Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn
Lys Gly Glu Glu Ile465 470 475
480Thr Ser Asp Thr Asn Ile Glu Ala Ala Glu Glu Asn Ile Ser Leu Asp
485 490 495Leu Ile Gln Gln
Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu 500
505 510Asn Ile Ser Ile Glu Asn Leu Ser Ser Asp Ile
Ile Gly Gln Leu Glu 515 520 525Leu
Met Pro Asn Ile Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu 530
535 540Asp Lys Tyr Thr Met Phe His Tyr Leu Arg
Ala Gln Glu Phe Glu His545 550 555
560Gly Lys Ser Arg Ile Ala Leu Thr Asn Ser Val Asn Glu Ala Leu
Leu 565 570 575Asn Pro Ser
Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys 580
585 590Val Asn Lys Ala Thr Glu Ala Ala Met Phe
Leu Gly Trp Val Glu Gln 595 600
605Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp 610
615 620Lys Ile Ala Asp Ile Thr Ile Ile
Ile Pro Tyr Ile Gly Pro Ala Leu625 630
635 640Asn Ile Gly Asn Met Leu Tyr Lys Asp Asp Phe Val
Gly Ala Leu Ile 645 650
655Phe Ser Gly Ala Val Ile Leu Leu Glu Phe Ile Pro Glu Ile Ala Ile
660 665 670Pro Val Leu Gly Thr Phe
Ala Leu Val Ser Tyr Ile Ala Asn Lys Val 675 680
685Leu Thr Val Gln Thr Ile Asp Asn Ala Leu Ser Lys Arg Asn
Glu Lys 690 695 700Trp Asp Glu Val Tyr
Lys Tyr Ile Val Thr Asn Trp Leu Ala Lys Val705 710
715 720Asn Thr Gln Ile Asp Leu Ile Arg Lys Lys
Met Lys Glu Ala Leu Glu 725 730
735Asn Gln Ala Glu Ala Thr Lys Ala Ile Ile Asn Tyr Gln Tyr Asn Gln
740 745 750Tyr Thr Glu Glu Glu
Lys Asn Asn Ile Asn Phe Asn Ile Asp Asp Leu 755
760 765Ser Ser Lys Leu Asn Glu Ser Ile Asn Lys Ala Met
Ile Asn Ile Asn 770 775 780Lys Phe Leu
Asn Gln Cys Ser Val Ser Tyr Leu Met Asn Ser Met Ile785
790 795 800Pro Tyr Gly Val Lys Arg Leu
Glu Asp Phe Asp Ala Ser Leu Lys Asp 805
810 815Ala Leu Leu Lys Tyr Ile Tyr Asp Asn Arg Gly Thr
Leu Ile Gly Gln 820 825 830Val
Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp Ile 835
840 845Pro Phe Gln Leu Ser Lys Tyr Val Asp
Asn Gln Arg Leu Leu Ser Thr 850 855
860Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly865
870 875 880Ser Gly Gly Gly
Gly Ser Ala Leu Val Asp His Leu Gly Gln Ser Glu 885
890 895Ala Gly Gly Leu Pro Arg Gly Pro Ala Val
Thr Asp Leu Asp His Leu 900 905
910Lys Gly Ile Leu Arg Arg Arg Gln Leu Tyr Cys Arg Thr Gly Phe His
915 920 925Leu Glu Ile Phe Pro Asn Gly
Thr Ile Gln Gly Thr Arg Lys Asp His 930 935
940Ser Arg Phe Gly Ile Leu Glu Phe Ile Ser Ile Ala Val Gly Leu
Val945 950 955 960Ser Ile
Arg Gly Val Asp Ser Gly Leu Tyr Leu Gly Met Asn Glu Lys
965 970 975Gly Glu Leu Tyr Gly Ser Glu
Lys Leu Thr Gln Glu Cys Val Phe Arg 980 985
990Glu Gln Phe Glu Glu Asn Trp Tyr Asn Thr Tyr Ser Ser Asn
Leu Tyr 995 1000 1005Lys His Val
Asp Thr Gly Arg Arg Tyr Tyr Val Ala Leu Asn Lys 1010
1015 1020Asp Gly Thr Pro Arg Glu Gly Thr Arg Thr Lys
Arg His Gln Lys 1025 1030 1035Phe Thr
His Phe Leu Pro Arg Pro Val Asp Pro Asp Lys Val Pro 1040
1045 1050Glu Leu Tyr Lys Asp Ile Leu Ser Gln Ser
1055 1060321055PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 32Pro Ile Thr Ile Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp
Asn Lys1 5 10 15Asn Ile
Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu Pro 20
25 30Glu Lys Ala Phe Arg Ile Thr Gly Asn
Ile Trp Val Ile Pro Asp Arg 35 40
45Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arg Val Thr 50
55 60Ser Pro Lys Ser Gly Tyr Tyr Asp Pro
Asn Tyr Leu Ser Thr Asp Ser65 70 75
80Asp Lys Asp Thr Phe Leu Lys Glu Ile Ile Lys Leu Phe Lys
Arg Ile 85 90 95Asn Ser
Arg Glu Ile Gly Glu Glu Leu Ile Tyr Arg Leu Ser Thr Asp 100
105 110Ile Pro Phe Pro Gly Asn Asn Asn Thr
Pro Ile Asn Thr Phe Asp Phe 115 120
125Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg Gln Gly Asn Asn
130 135 140Trp Val Lys Thr Gly Ser Ile
Asn Pro Ser Val Ile Ile Thr Gly Pro145 150
155 160Arg Glu Asn Ile Ile Asp Pro Glu Thr Ser Thr Phe
Lys Leu Thr Asn 165 170
175Asn Thr Phe Ala Ala Gln Glu Gly Phe Gly Ala Leu Ser Ile Ile Ser
180 185 190Ile Ser Pro Arg Phe Met
Leu Thr Tyr Ser Asn Ala Thr Asn Asp Val 195 200
205Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro
Ile Leu 210 215 220Ile Leu Met His Glu
Leu Asn His Ala Met His Asn Leu Tyr Gly Ile225 230
235 240Ala Ile Pro Asn Asp Gln Thr Ile Ser Ser
Val Thr Ser Asn Ile Phe 245 250
255Tyr Ser Gln Tyr Asn Val Lys Leu Glu Tyr Ala Glu Ile Tyr Ala Phe
260 265 270Gly Gly Pro Thr Ile
Asp Leu Ile Pro Lys Ser Ala Arg Lys Tyr Phe 275
280 285Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser Ile Ala
Lys Arg Leu Asn 290 295 300Ser Ile Thr
Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr Ile Gly Glu305
310 315 320Tyr Lys Gln Lys Leu Ile Arg
Lys Tyr Arg Phe Val Val Glu Ser Ser 325
330 335Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu
Leu Tyr Asn Glu 340 345 350Leu
Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala Lys Ile Tyr Asn Val 355
360 365Gln Asn Arg Lys Ile Tyr Leu Ser Asn
Val Tyr Thr Pro Val Thr Ala 370 375
380Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile Gln Asn Gly Phe Asn Ile385
390 395 400Pro Lys Ser Asn
Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser Arg 405
410 415Asn Pro Ala Leu Arg Lys Val Asn Pro Glu
Asn Met Leu Tyr Leu Phe 420 425
430Thr Lys Phe Cys Val Asp Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn
435 440 445Asp Asp Asp Asp Lys Asp His
Leu Gly Gln Ser Glu Ala Gly Gly Leu 450 455
460Pro Arg Gly Pro Ala Val Thr Asp Leu Asp His Leu Lys Gly Ile
Leu465 470 475 480Arg Arg
Arg Gln Leu Tyr Cys Arg Thr Gly Phe His Leu Glu Ile Phe
485 490 495Pro Asn Gly Thr Ile Gln Gly
Thr Arg Lys Asp His Ser Arg Phe Gly 500 505
510Ile Leu Glu Phe Ile Ser Ile Ala Val Gly Leu Val Ser Ile
Arg Gly 515 520 525Val Asp Ser Gly
Leu Tyr Leu Gly Met Asn Glu Lys Gly Glu Leu Tyr 530
535 540Gly Ser Glu Lys Leu Thr Gln Glu Cys Val Phe Arg
Glu Gln Phe Glu545 550 555
560Glu Asn Trp Tyr Asn Thr Tyr Ser Ser Asn Leu Tyr Lys His Val Asp
565 570 575Thr Gly Arg Arg Tyr
Tyr Val Ala Leu Asn Lys Asp Gly Thr Pro Arg 580
585 590Glu Gly Thr Arg Thr Lys Arg His Gln Lys Phe Thr
His Phe Leu Pro 595 600 605Arg Pro
Val Asp Pro Asp Lys Val Pro Glu Leu Tyr Lys Asp Ile Leu 610
615 620Ser Gln Ser Ala Glu Ala Ala Ala Lys Glu Ala
Ala Ala Lys Ala Leu625 630 635
640Gln Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro Phe Ile Gly
645 650 655Asp Ile Ser Asp
Val Lys Thr Asp Ile Phe Leu Arg Lys Asp Ile Asn 660
665 670Glu Glu Thr Glu Val Ile Tyr Tyr Pro Asp Asn
Val Ser Val Asp Gln 675 680 685Val
Ile Leu Ser Lys Asn Thr Ser Glu His Gly Gln Leu Asp Leu Leu 690
695 700Tyr Pro Ser Ile Asp Ser Glu Ser Glu Ile
Leu Pro Gly Glu Asn Gln705 710 715
720Val Phe Tyr Asp Asn Arg Thr Gln Asn Val Asp Tyr Leu Asn Ser
Tyr 725 730 735Tyr Tyr Leu
Glu Ser Gln Lys Leu Ser Asp Asn Val Glu Asp Phe Thr 740
745 750Phe Thr Arg Ser Ile Glu Glu Ala Leu Asp
Asn Ser Ala Lys Val Tyr 755 760
765Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val Gln Gly 770
775 780Gly Leu Phe Leu Met Trp Ala Asn
Asp Val Val Glu Asp Phe Thr Thr785 790
795 800Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys Ile Ser
Asp Val Ser Ala 805 810
815Ile Ile Pro Tyr Ile Gly Pro Ala Leu Asn Ile Ser Asn Ser Val Arg
820 825 830Arg Gly Asn Phe Thr Glu
Ala Phe Ala Val Thr Gly Val Thr Ile Leu 835 840
845Leu Glu Ala Phe Pro Glu Phe Thr Ile Pro Ala Leu Gly Ala
Phe Val 850 855 860Ile Tyr Ser Lys Val
Gln Glu Arg Asn Glu Ile Ile Lys Thr Ile Asp865 870
875 880Asn Cys Leu Glu Gln Arg Ile Lys Arg Trp
Lys Asp Ser Tyr Glu Trp 885 890
895Met Met Gly Thr Trp Leu Ser Arg Ile Ile Thr Gln Phe Asn Asn Ile
900 905 910Ser Tyr Gln Met Tyr
Asp Ser Leu Asn Tyr Gln Ala Gly Ala Ile Lys 915
920 925Ala Lys Ile Asp Leu Glu Tyr Lys Lys Tyr Ser Gly
Ser Asp Lys Glu 930 935 940Asn Ile Lys
Ser Gln Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys945
950 955 960Ile Ser Glu Ala Met Asn Asn
Ile Asn Lys Phe Ile Arg Glu Cys Ser 965
970 975Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val
Ile Asp Glu Leu 980 985 990Asn
Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu Ile Asn Leu Ile Asp 995
1000 1005Ser His Asn Ile Ile Leu Val Gly
Glu Val Asp Lys Leu Lys Ala 1010 1015
1020Lys Val Asn Asn Ser Phe Gln Asn Thr Ile Pro Phe Asn Ile Phe
1025 1030 1035Ser Tyr Thr Asn Asn Ser
Leu Leu Lys Asp Ile Ile Asn Glu Tyr 1040 1045
1050Phe Asn 105533714PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 33Glu Ser Asn Gln Pro Glu Lys Asn Gly Thr Ala Thr Lys Pro
Glu Asn1 5 10 15Ser Gly
Asn Thr Thr Ser Glu Asn Gly Gln Thr Glu Pro Glu Lys Lys 20
25 30Leu Glu Leu Arg Asn Val Ser Asp Ile
Glu Leu Tyr Ser Gln Thr Asn 35 40
45Gly Thr Tyr Arg Gln His Val Ser Leu Asp Gly Ile Pro Glu Asn Thr 50
55 60Asp Thr Tyr Phe Val Lys Val Lys Ser
Ser Ala Phe Lys Asp Val Tyr65 70 75
80Ile Pro Val Ala Ser Ile Thr Glu Glu Lys Arg Asn Gly Gln
Ser Val 85 90 95Tyr Lys
Ile Thr Ala Lys Ala Glu Lys Leu Gln Gln Glu Leu Glu Asn 100
105 110Lys Tyr Val Asp Asn Phe Thr Phe Tyr
Leu Asp Lys Lys Ala Lys Glu 115 120
125Glu Asn Thr Asn Phe Thr Ser Phe Ser Asn Leu Val Lys Ala Ile Asn
130 135 140Gln Asn Pro Ser Gly Thr Tyr
His Leu Ala Ala Ser Leu Asn Ala Asn145 150
155 160Glu Val Glu Leu Gly Pro Asp Glu Arg Ser Tyr Ile
Lys Asp Thr Phe 165 170
175Thr Gly Arg Leu Ile Gly Glu Lys Asp Gly Lys Asn Tyr Ala Ile Tyr
180 185 190Asn Leu Lys Lys Pro Leu
Phe Glu Asn Leu Ser Gly Ala Thr Val Glu 195 200
205Lys Leu Ser Leu Lys Asn Val Ala Ile Ser Gly Lys Asn Asp
Ile Gly 210 215 220Ser Leu Ala Asn Glu
Ala Thr Asn Gly Thr Lys Ile Lys Gln Val His225 230
235 240Val Asp Gly Cys Val Asp Gly Ile Ile Thr
Ser Lys Thr Lys Ser Asp 245 250
255Asp Asp Asp Lys Asn Lys Ala Leu Asn Leu Gln Cys Ile Lys Ile Lys
260 265 270Asn Glu Asp Leu Thr
Phe Ile Ala Glu Lys Asn Ser Phe Ser Glu Glu 275
280 285Pro Phe Gln Asp Glu Ile Val Ser Tyr Asn Thr Lys
Asn Lys Pro Leu 290 295 300Asn Phe Asn
Tyr Ser Leu Asp Lys Ile Ile Val Asp Tyr Asn Leu Gln305
310 315 320Ser Lys Ile Thr Leu Pro Asn
Asp Arg Thr Thr Pro Val Thr Lys Gly 325
330 335Ile Pro Tyr Ala Pro Glu Tyr Lys Ser Asn Ala Ala
Ser Thr Ile Glu 340 345 350Ile
His Asn Ile Asp Asp Asn Thr Ile Tyr Gln Tyr Leu Tyr Ala Gln 355
360 365Lys Ser Pro Thr Thr Leu Gln Arg Ile
Thr Met Thr Asn Ser Val Asp 370 375
380Asp Ala Leu Ile Asn Ser Thr Lys Ile Tyr Ser Tyr Phe Pro Ser Val385
390 395 400Ile Ser Lys Val
Asn Gln Gly Ala Gln Gly Ile Leu Phe Leu Gln Trp 405
410 415Val Arg Asp Ile Ile Asp Asp Phe Thr Asn
Glu Ser Ser Gln Lys Thr 420 425
430Thr Ile Asp Lys Ile Ser Asp Val Ser Thr Ile Val Pro Tyr Ile Gly
435 440 445Pro Ala Leu Asn Ile Val Lys
Gln Gly Tyr Glu Gly Asn Phe Ile Gly 450 455
460Ala Leu Glu Thr Thr Gly Val Val Leu Leu Leu Glu Tyr Ile Pro
Glu465 470 475 480Ile Thr
Leu Pro Val Ile Ala Ala Leu Ser Ile Ala Glu Ser Ser Thr
485 490 495Gln Lys Glu Lys Ile Ile Lys
Thr Ile Asp Asn Phe Leu Glu Lys Arg 500 505
510Tyr Glu Lys Trp Ile Glu Val Tyr Lys Leu Val Lys Ala Lys
Trp Leu 515 520 525Gly Thr Val Asn
Thr Gln Phe Gln Lys Arg Ser Tyr Gln Met Tyr Arg 530
535 540Ser Leu Glu Tyr Gln Val Asp Ala Ile Lys Lys Ile
Ile Asp Tyr Glu545 550 555
560Tyr Lys Ile Tyr Ser Gly Pro Asp Lys Glu Gln Ile Ala Asp Glu Ile
565 570 575Asn Asn Leu Lys Asn
Lys Leu Glu Glu Lys Ala Asn Lys Ala Met Ile 580
585 590Asn Ile Asn Ile Phe Met Arg Glu Ser Ser Arg Ser
Phe Leu Val Asn 595 600 605Gln Met
Ile Asn Glu Ala Lys Lys Gln Leu Leu Glu Phe Asp Thr Gln 610
615 620Ser Lys Asn Ile Leu Met Gln Tyr Ile Lys Ala
Asn Ser Lys Phe Ile625 630 635
640Gly Ile Thr Glu Leu Lys Lys Leu Glu Ser Lys Ile Asn Lys Val Phe
645 650 655Ser Thr Pro Ile
Pro Phe Ser Tyr Ser Lys Asn Leu Asp Cys Trp Val 660
665 670Asp Asn Glu Glu Asp Ile Asp Val Leu Glu Gly
Gly Gly Gly Ser Gly 675 680 685Gly
Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Ala Gly Cys Lys 690
695 700Asn Phe Phe Trp Lys Thr Phe Thr Ser
Cys705 71034710PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 34Glu Ser Asn Gln Pro Glu Lys Asn Gly Thr Ala Thr Lys Pro
Glu Asn1 5 10 15Ser Gly
Asn Thr Thr Ser Glu Asn Gly Gln Thr Glu Pro Glu Lys Lys 20
25 30Leu Glu Leu Arg Asn Val Ser Asp Ile
Glu Leu Tyr Ser Gln Thr Asn 35 40
45Gly Thr Tyr Arg Gln His Val Ser Leu Asp Gly Ile Pro Glu Asn Thr 50
55 60Asp Thr Tyr Phe Val Lys Val Lys Ser
Ser Ala Phe Lys Asp Val Tyr65 70 75
80Ile Pro Val Ala Ser Ile Thr Glu Glu Lys Arg Asn Gly Gln
Ser Val 85 90 95Tyr Lys
Ile Thr Ala Lys Ala Glu Lys Leu Gln Gln Glu Leu Glu Asn 100
105 110Lys Tyr Val Asp Asn Phe Thr Phe Tyr
Leu Asp Lys Lys Ala Lys Glu 115 120
125Glu Asn Thr Asn Phe Thr Ser Phe Ser Asn Leu Val Lys Ala Ile Asn
130 135 140Gln Asn Pro Ser Gly Thr Tyr
His Leu Ala Ala Ser Leu Asn Ala Asn145 150
155 160Glu Val Glu Leu Gly Pro Asp Glu Arg Ser Tyr Ile
Lys Asp Thr Phe 165 170
175Thr Gly Arg Leu Ile Gly Glu Lys Asp Gly Lys Asn Tyr Ala Ile Tyr
180 185 190Asn Leu Lys Lys Pro Leu
Phe Glu Asn Leu Ser Gly Ala Thr Val Glu 195 200
205Lys Leu Ser Leu Lys Asn Val Ala Ile Ser Gly Lys Asn Asp
Ile Gly 210 215 220Ser Leu Ala Asn Glu
Ala Thr Asn Gly Thr Lys Ile Lys Gln Val His225 230
235 240Val Asp Gly Cys Val Asp Gly Ile Ile Thr
Ser Lys Thr Lys Ser Asp 245 250
255Asp Asp Asp Lys Ala Gly Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr
260 265 270Ser Cys Ala Leu Ala
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 275
280 285Gly Gly Gly Ser Ala Leu Ala Leu Gln Cys Ile Lys
Ile Lys Asn Glu 290 295 300Asp Leu Thr
Phe Ile Ala Glu Lys Asn Ser Phe Ser Glu Glu Pro Phe305
310 315 320Gln Asp Glu Ile Val Ser Tyr
Asn Thr Lys Asn Lys Pro Leu Asn Phe 325
330 335Asn Tyr Ser Leu Asp Lys Ile Ile Val Asp Tyr Asn
Leu Gln Ser Lys 340 345 350Ile
Thr Leu Pro Asn Asp Arg Thr Thr Pro Val Thr Lys Gly Ile Pro 355
360 365Tyr Ala Pro Glu Tyr Lys Ser Asn Ala
Ala Ser Thr Ile Glu Ile His 370 375
380Asn Ile Asp Asp Asn Thr Ile Tyr Gln Tyr Leu Tyr Ala Gln Lys Ser385
390 395 400Pro Thr Thr Leu
Gln Arg Ile Thr Met Thr Asn Ser Val Asp Asp Ala 405
410 415Leu Ile Asn Ser Thr Lys Ile Tyr Ser Tyr
Phe Pro Ser Val Ile Ser 420 425
430Lys Val Asn Gln Gly Ala Gln Gly Ile Leu Phe Leu Gln Trp Val Arg
435 440 445Asp Ile Ile Asp Asp Phe Thr
Asn Glu Ser Ser Gln Lys Thr Thr Ile 450 455
460Asp Lys Ile Ser Asp Val Ser Thr Ile Val Pro Tyr Ile Gly Pro
Ala465 470 475 480Leu Asn
Ile Val Lys Gln Gly Tyr Glu Gly Asn Phe Ile Gly Ala Leu
485 490 495Glu Thr Thr Gly Val Val Leu
Leu Leu Glu Tyr Ile Pro Glu Ile Thr 500 505
510Leu Pro Val Ile Ala Ala Leu Ser Ile Ala Glu Ser Ser Thr
Gln Lys 515 520 525Glu Lys Ile Ile
Lys Thr Ile Asp Asn Phe Leu Glu Lys Arg Tyr Glu 530
535 540Lys Trp Ile Glu Val Tyr Lys Leu Val Lys Ala Lys
Trp Leu Gly Thr545 550 555
560Val Asn Thr Gln Phe Gln Lys Arg Ser Tyr Gln Met Tyr Arg Ser Leu
565 570 575Glu Tyr Gln Val Asp
Ala Ile Lys Lys Ile Ile Asp Tyr Glu Tyr Lys 580
585 590Ile Tyr Ser Gly Pro Asp Lys Glu Gln Ile Ala Asp
Glu Ile Asn Asn 595 600 605Leu Lys
Asn Lys Leu Glu Glu Lys Ala Asn Lys Ala Met Ile Asn Ile 610
615 620Asn Ile Phe Met Arg Glu Ser Ser Arg Ser Phe
Leu Val Asn Gln Met625 630 635
640Ile Asn Glu Ala Lys Lys Gln Leu Leu Glu Phe Asp Thr Gln Ser Lys
645 650 655Asn Ile Leu Met
Gln Tyr Ile Lys Ala Asn Ser Lys Phe Ile Gly Ile 660
665 670Thr Glu Leu Lys Lys Leu Glu Ser Lys Ile Asn
Lys Val Phe Ser Thr 675 680 685Pro
Ile Pro Phe Ser Tyr Ser Lys Asn Leu Asp Cys Trp Val Asp Asn 690
695 700Glu Glu Asp Ile Asp Val705
71035898PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 35Glu Phe Val Asn Lys Gln Phe Asn
Tyr Lys Asp Pro Val Asn Gly Val1 5 10
15Asp Ile Ala Tyr Ile Lys Ile Pro Asn Ala Gly Gln Met Gln
Pro Val 20 25 30Lys Ala Phe
Lys Ile His Asn Lys Ile Trp Val Ile Pro Glu Arg Asp 35
40 45Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn
Pro Pro Pro Glu Ala 50 55 60Lys Gln
Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr Asp65
70 75 80Asn Glu Lys Asp Asn Tyr Leu
Lys Gly Val Thr Lys Leu Phe Glu Arg 85 90
95Ile Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser
Ile Val Arg 100 105 110Gly Ile
Pro Phe Trp Gly Gly Ser Thr Ile Asp Thr Glu Leu Lys Val 115
120 125Ile Asp Thr Asn Cys Ile Asn Val Ile Gln
Pro Asp Gly Ser Tyr Arg 130 135 140Ser
Glu Glu Leu Asn Leu Val Ile Ile Gly Pro Ser Ala Asp Ile Ile145
150 155 160Gln Phe Glu Cys Leu Ser
Phe Gly His Glu Val Leu Asn Leu Thr Arg 165
170 175Asn Gly Tyr Gly Ser Thr Gln Tyr Ile Arg Phe Ser
Pro Asp Phe Thr 180 185 190Phe
Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu Gly 195
200 205Ala Gly Lys Phe Ala Thr Asp Pro Ala
Val Thr Leu Ala His Glu Leu 210 215
220Ile His Ala Gly His Arg Leu Tyr Gly Ile Ala Ile Asn Pro Asn Arg225
230 235 240Val Phe Lys Val
Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu Glu 245
250 255Val Ser Phe Glu Glu Leu Arg Thr Phe Gly
Gly His Asp Ala Lys Phe 260 265
270Ile Asp Ser Leu Gln Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys
275 280 285Phe Lys Asp Ile Ala Ser Thr
Leu Asn Lys Ala Lys Ser Ile Val Gly 290 295
300Thr Thr Ala Ser Leu Gln Tyr Met Lys Asn Val Phe Lys Glu Lys
Tyr305 310 315 320Leu Leu
Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu Lys
325 330 335Phe Asp Lys Leu Tyr Lys Met
Leu Thr Glu Ile Tyr Thr Glu Asp Asn 340 345
350Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu
Asn Phe 355 360 365Asp Lys Ala Val
Phe Lys Ile Asn Ile Val Pro Lys Val Asn Tyr Thr 370
375 380Ile Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu
Ala Ala Asn Phe385 390 395
400Asn Gly Gln Asn Thr Glu Ile Asn Asn Met Asn Phe Thr Lys Leu Lys
405 410 415Asn Phe Thr Gly Leu
Phe Glu Phe Tyr Lys Leu Leu Cys Val Asp Gly 420
425 430Ile Ile Thr Ser Lys Thr Lys Ser Asp Asp Asp Asp
Lys Asn Lys Ala 435 440 445Leu Asn
Leu Gln Cys Ile Lys Val Asn Asn Trp Asp Leu Phe Phe Ser 450
455 460Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn
Lys Gly Glu Glu Ile465 470 475
480Thr Ser Asp Thr Asn Ile Glu Ala Ala Glu Glu Asn Ile Ser Leu Asp
485 490 495Leu Ile Gln Gln
Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu 500
505 510Asn Ile Ser Ile Glu Asn Leu Ser Ser Asp Ile
Ile Gly Gln Leu Glu 515 520 525Leu
Met Pro Asn Ile Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu 530
535 540Asp Lys Tyr Thr Met Phe His Tyr Leu Arg
Ala Gln Glu Phe Glu His545 550 555
560Gly Lys Ser Arg Ile Ala Leu Thr Asn Ser Val Asn Glu Ala Leu
Leu 565 570 575Asn Pro Ser
Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys 580
585 590Val Asn Lys Ala Thr Glu Ala Ala Met Phe
Leu Gly Trp Val Glu Gln 595 600
605Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp 610
615 620Lys Ile Ala Asp Ile Thr Ile Ile
Ile Pro Tyr Ile Gly Pro Ala Leu625 630
635 640Asn Ile Gly Asn Met Leu Tyr Lys Asp Asp Phe Val
Gly Ala Leu Ile 645 650
655Phe Ser Gly Ala Val Ile Leu Leu Glu Phe Ile Pro Glu Ile Ala Ile
660 665 670Pro Val Leu Gly Thr Phe
Ala Leu Val Ser Tyr Ile Ala Asn Lys Val 675 680
685Leu Thr Val Gln Thr Ile Asp Asn Ala Leu Ser Lys Arg Asn
Glu Lys 690 695 700Trp Asp Glu Val Tyr
Lys Tyr Ile Val Thr Asn Trp Leu Ala Lys Val705 710
715 720Asn Thr Gln Ile Asp Leu Ile Arg Lys Lys
Met Lys Glu Ala Leu Glu 725 730
735Asn Gln Ala Glu Ala Thr Lys Ala Ile Ile Asn Tyr Gln Tyr Asn Gln
740 745 750Tyr Thr Glu Glu Glu
Lys Asn Asn Ile Asn Phe Asn Ile Asp Asp Leu 755
760 765Ser Ser Lys Leu Asn Glu Ser Ile Asn Lys Ala Met
Ile Asn Ile Asn 770 775 780Lys Phe Leu
Asn Gln Cys Ser Val Ser Tyr Leu Met Asn Ser Met Ile785
790 795 800Pro Tyr Gly Val Lys Arg Leu
Glu Asp Phe Asp Ala Ser Leu Lys Asp 805
810 815Ala Leu Leu Lys Tyr Ile Tyr Asp Asn Arg Gly Thr
Leu Ile Gly Gln 820 825 830Val
Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp Ile 835
840 845Pro Phe Gln Leu Ser Lys Tyr Val Asp
Asn Gln Arg Leu Leu Ser Thr 850 855
860Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly865
870 875 880Ser Ala Leu Val
Ala Gly Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr 885
890 895Ser Cys36943PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 36Glu Phe Val Asn Lys Gln Phe Asn Tyr Lys Asp Pro Val Asn
Gly Val1 5 10 15Asp Ile
Ala Tyr Ile Lys Ile Pro Asn Ala Gly Gln Met Gln Pro Val 20
25 30Lys Ala Phe Lys Ile His Asn Lys Ile
Trp Val Ile Pro Glu Arg Asp 35 40
45Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu Ala 50
55 60Lys Gln Val Pro Val Ser Tyr Tyr Asp
Ser Thr Tyr Leu Ser Thr Asp65 70 75
80Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe
Glu Arg 85 90 95Ile Tyr
Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser Ile Val Arg 100
105 110Gly Ile Pro Phe Trp Gly Gly Ser Thr
Ile Asp Thr Glu Leu Lys Val 115 120
125Ile Asp Thr Asn Cys Ile Asn Val Ile Gln Pro Asp Gly Ser Tyr Arg
130 135 140Ser Glu Glu Leu Asn Leu Val
Ile Ile Gly Pro Ser Ala Asp Ile Ile145 150
155 160Gln Phe Glu Cys Leu Ser Phe Gly His Glu Val Leu
Asn Leu Thr Arg 165 170
175Asn Gly Tyr Gly Ser Thr Gln Tyr Ile Arg Phe Ser Pro Asp Phe Thr
180 185 190Phe Gly Phe Glu Glu Ser
Leu Glu Val Asp Thr Asn Pro Leu Leu Gly 195 200
205Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His
Glu Leu 210 215 220Ile His Ala Gly His
Arg Leu Tyr Gly Ile Ala Ile Asn Pro Asn Arg225 230
235 240Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr
Glu Met Ser Gly Leu Glu 245 250
255Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys Phe
260 265 270Ile Asp Ser Leu Gln
Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn Lys 275
280 285Phe Lys Asp Ile Ala Ser Thr Leu Asn Lys Ala Lys
Ser Ile Val Gly 290 295 300Thr Thr Ala
Ser Leu Gln Tyr Met Lys Asn Val Phe Lys Glu Lys Tyr305
310 315 320Leu Leu Ser Glu Asp Thr Ser
Gly Lys Phe Ser Val Asp Lys Leu Lys 325
330 335Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu Ile Tyr
Thr Glu Asp Asn 340 345 350Phe
Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn Phe 355
360 365Asp Lys Ala Val Phe Lys Ile Asn Ile
Val Pro Lys Val Asn Tyr Thr 370 375
380Ile Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn Phe385
390 395 400Asn Gly Gln Asn
Thr Glu Ile Asn Asn Met Asn Phe Thr Lys Leu Lys 405
410 415Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys
Leu Leu Cys Val Asp Gly 420 425
430Ile Ile Thr Ser Lys Thr Lys Ser Asp Asp Asp Asp Lys Asn Lys Ala
435 440 445Leu Asn Leu Gln Cys Ile Lys
Val Asn Asn Trp Asp Leu Phe Phe Ser 450 455
460Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu
Ile465 470 475 480Thr Ser
Asp Thr Asn Ile Glu Ala Ala Glu Glu Asn Ile Ser Leu Asp
485 490 495Leu Ile Gln Gln Tyr Tyr Leu
Thr Phe Asn Phe Asp Asn Glu Pro Glu 500 505
510Asn Ile Ser Ile Glu Asn Leu Ser Ser Asp Ile Ile Gly Gln
Leu Glu 515 520 525Leu Met Pro Asn
Ile Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu 530
535 540Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gln
Glu Phe Glu His545 550 555
560Gly Lys Ser Arg Ile Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu
565 570 575Asn Pro Ser Arg Val
Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys 580
585 590Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly
Trp Val Glu Gln 595 600 605Leu Val
Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp 610
615 620Lys Ile Ala Asp Ile Thr Ile Ile Ile Pro Tyr
Ile Gly Pro Ala Leu625 630 635
640Asn Ile Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu Ile
645 650 655Phe Ser Gly Ala
Val Ile Leu Leu Glu Phe Ile Pro Glu Ile Ala Ile 660
665 670Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr
Ile Ala Asn Lys Val 675 680 685Leu
Thr Val Gln Thr Ile Asp Asn Ala Leu Ser Lys Arg Asn Glu Lys 690
695 700Trp Asp Glu Val Tyr Lys Tyr Ile Val Thr
Asn Trp Leu Ala Lys Val705 710 715
720Asn Thr Gln Ile Asp Leu Ile Arg Lys Lys Met Lys Glu Ala Leu
Glu 725 730 735Asn Gln Ala
Glu Ala Thr Lys Ala Ile Ile Asn Tyr Gln Tyr Asn Gln 740
745 750Tyr Thr Glu Glu Glu Lys Asn Asn Ile Asn
Phe Asn Ile Asp Asp Leu 755 760
765Ser Ser Lys Leu Asn Glu Ser Ile Asn Lys Ala Met Ile Asn Ile Asn 770
775 780Lys Phe Leu Asn Gln Cys Ser Val
Ser Tyr Leu Met Asn Ser Met Ile785 790
795 800Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala
Ser Leu Lys Asp 805 810
815Ala Leu Leu Lys Tyr Ile Tyr Asp Asn Arg Gly Thr Leu Ile Gly Gln
820 825 830Val Asp Arg Leu Lys Asp
Lys Val Asn Asn Thr Leu Ser Thr Asp Ile 835 840
845Pro Phe Gln Leu Ser Lys Tyr Val Asp Asn Gln Arg Leu Leu
Ser Thr 850 855 860Leu Glu Gly Gly Gly
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly865 870
875 880Ser Gly Gly Gly Gly Ser Ala Leu Val Asp
Asn Ser Asp Pro Lys Cys 885 890
895Pro Leu Ser His Glu Gly Tyr Cys Leu Asn Asp Gly Val Cys Met Tyr
900 905 910Ile Gly Thr Leu Asp
Arg Tyr Ala Cys Asn Cys Val Val Gly Tyr Val 915
920 925Gly Glu Arg Cys Gln Tyr Arg Asp Leu Lys Leu Ala
Glu Leu Arg 930 935
940371061PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 37Pro Lys Ile Asn Ser Phe Asn Tyr
Asn Asp Pro Val Asn Asp Arg Thr1 5 10
15Ile Leu Tyr Ile Lys Pro Gly Gly Cys Gln Glu Phe Tyr Lys
Ser Phe 20 25 30Asn Ile Met
Lys Asn Ile Trp Ile Ile Pro Glu Arg Asn Val Ile Gly 35
40 45Thr Thr Pro Gln Asp Phe His Pro Pro Thr Ser
Leu Lys Asn Gly Asp 50 55 60Ser Ser
Tyr Tyr Asp Pro Asn Tyr Leu Gln Ser Asp Glu Glu Lys Asp65
70 75 80Arg Phe Leu Lys Ile Val Thr
Lys Ile Phe Asn Arg Ile Asn Asn Asn 85 90
95Leu Ser Gly Gly Ile Leu Leu Glu Glu Leu Ser Lys Ala
Asn Pro Tyr 100 105 110Leu Gly
Asn Asp Asn Thr Pro Asp Asn Gln Phe His Ile Gly Asp Ala 115
120 125Ser Ala Val Glu Ile Lys Phe Ser Asn Gly
Ser Gln His Ile Leu Leu 130 135 140Pro
Asn Val Ile Ile Met Gly Ala Glu Pro Asp Leu Phe Glu Thr Asn145
150 155 160Ser Ser Asn Ile Ser Leu
Arg Asn Asn Tyr Met Pro Ser Asn His Gly 165
170 175Phe Gly Ser Ile Ala Ile Val Thr Phe Ser Pro Glu
Tyr Ser Phe Arg 180 185 190Phe
Asn Asp Asn Ser Ile Asn Glu Phe Ile Gln Asp Pro Ala Leu Thr 195
200 205Leu Met His Glu Leu Ile His Ser Leu
His Gly Leu Tyr Gly Ala Lys 210 215
220Gly Ile Thr Thr Thr Cys Ile Ile Thr Gln Gln Gln Asn Pro Leu Ile225
230 235 240Thr Asn Arg Lys
Gly Ile Asn Ile Glu Glu Phe Leu Thr Phe Gly Gly 245
250 255Asn Asp Leu Asn Ile Ile Thr Val Ala Gln
Tyr Asn Asp Ile Tyr Thr 260 265
270Asn Leu Leu Asn Asp Tyr Arg Lys Ile Ala Ser Lys Leu Ser Lys Val
275 280 285Gln Val Ser Asn Pro Gln Leu
Asn Pro Tyr Lys Asp Ile Phe Gln Glu 290 295
300Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly Ile Tyr Ser Val Asn
Ile305 310 315 320Asn Lys
Phe Asp Asp Ile Leu Lys Lys Leu Tyr Ser Phe Thr Glu Phe
325 330 335Asp Leu Ala Thr Lys Phe Gln
Val Lys Cys Arg Glu Thr Tyr Ile Gly 340 345
350Gln Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser
Ile Tyr 355 360 365Asn Ile Ser Glu
Gly Tyr Asn Ile Asn Asn Leu Lys Val Asn Phe Arg 370
375 380Gly Gln Asn Ala Asn Leu Asn Pro Arg Ile Ile Lys
Pro Ile Thr Gly385 390 395
400Arg Gly Leu Val Lys Lys Ile Ile Arg Phe Cys Val Asp Gly Ile Ile
405 410 415Thr Ser Lys Thr Lys
Ser Leu Ile Glu Gly Arg Asn Lys Ala Leu Asn 420
425 430Leu Gln Cys Ile Glu Ile Asn Asn Gly Glu Leu Phe
Phe Val Ala Ser 435 440 445Glu Asn
Ser Tyr Asn Asp Asp Asn Ile Asn Thr Pro Lys Glu Ile Asp 450
455 460Asp Thr Val Thr Ser Asn Asn Asn Tyr Glu Asn
Asp Leu Asp Gln Val465 470 475
480Ile Leu Asn Phe Asn Ser Glu Ser Ala Pro Gly Leu Ser Asp Glu Lys
485 490 495Leu Asn Leu Thr
Ile Gln Asn Asp Ala Tyr Ile Pro Lys Tyr Asp Ser 500
505 510Asn Gly Thr Ser Asp Ile Glu Gln His Asp Val
Asn Glu Leu Asn Val 515 520 525Phe
Phe Tyr Leu Asp Ala Gln Lys Val Pro Glu Gly Glu Asn Asn Val 530
535 540Asn Leu Thr Ser Ser Ile Asp Thr Ala Leu
Leu Glu Gln Pro Lys Ile545 550 555
560Tyr Thr Phe Phe Ser Ser Glu Phe Ile Asn Asn Val Asn Lys Pro
Val 565 570 575Gln Ala Ala
Leu Phe Val Ser Trp Ile Gln Gln Val Leu Val Asp Phe 580
585 590Thr Thr Glu Ala Asn Gln Lys Ser Thr Val
Asp Lys Ile Ala Asp Ile 595 600
605Ser Ile Val Val Pro Tyr Ile Gly Leu Ala Leu Asn Ile Gly Asn Glu 610
615 620Ala Gln Lys Gly Asn Phe Lys Asp
Ala Leu Glu Leu Leu Gly Ala Gly625 630
635 640Ile Leu Leu Glu Phe Glu Pro Glu Leu Leu Ile Pro
Thr Ile Leu Val 645 650
655Phe Thr Ile Lys Ser Phe Leu Gly Ser Ser Asp Asn Lys Asn Lys Val
660 665 670Ile Lys Ala Ile Asn Asn
Ala Leu Lys Glu Arg Asp Glu Lys Trp Lys 675 680
685Glu Val Tyr Ser Phe Ile Val Ser Asn Trp Met Thr Lys Ile
Asn Thr 690 695 700Gln Phe Asn Lys Arg
Lys Glu Gln Met Tyr Gln Ala Leu Gln Asn Gln705 710
715 720Val Asn Ala Ile Lys Thr Ile Ile Glu Ser
Lys Tyr Asn Ser Tyr Thr 725 730
735Leu Glu Glu Lys Asn Glu Leu Thr Asn Lys Tyr Asp Ile Lys Gln Ile
740 745 750Glu Asn Glu Leu Asn
Gln Lys Val Ser Ile Ala Met Asn Asn Ile Asp 755
760 765Arg Phe Leu Thr Glu Ser Ser Ile Ser Tyr Leu Met
Lys Ile Ile Asn 770 775 780Glu Val Lys
Ile Asn Lys Leu Arg Glu Tyr Asp Glu Asn Val Lys Thr785
790 795 800Tyr Leu Leu Asn Tyr Ile Ile
Gln His Gly Ser Ile Leu Gly Glu Ser 805
810 815Gln Gln Glu Leu Asn Ser Met Val Thr Asp Thr Leu
Asn Asn Ser Ile 820 825 830Pro
Phe Lys Leu Ser Ser Tyr Thr Asp Asp Lys Ile Leu Ile Ser Tyr 835
840 845Phe Asn Lys Phe Phe Lys Leu Glu Gly
Gly Gly Gly Ser Gly Gly Gly 850 855
860Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Pro865
870 875 880Pro Gly Glu Asp
Ser Lys Asp Val Ala Ala Pro His Arg Gln Pro Leu 885
890 895Thr Ser Ser Glu Arg Ile Asp Lys Gln Ile
Arg Tyr Ile Leu Asp Gly 900 905
910Ile Ser Ala Leu Arg Lys Glu Thr Cys Asn Lys Ser Asn Met Cys Glu
915 920 925Ser Ser Lys Glu Ala Leu Ala
Glu Asn Asn Leu Asn Leu Pro Lys Met 930 935
940Ala Glu Lys Asp Gly Cys Phe Gln Ser Gly Phe Asn Glu Glu Thr
Cys945 950 955 960Leu Val
Lys Ile Ile Thr Gly Leu Leu Glu Phe Glu Val Tyr Leu Glu
965 970 975Tyr Leu Gln Asn Arg Phe Glu
Ser Ser Glu Glu Gln Ala Arg Ala Val 980 985
990Gln Met Ser Thr Lys Val Leu Ile Gln Phe Leu Gln Lys Lys
Ala Lys 995 1000 1005Asn Leu Asp
Ala Ile Thr Thr Pro Asp Pro Thr Thr Asn Ala Ser 1010
1015 1020Leu Leu Thr Lys Leu Gln Ala Gln Asn Gln Trp
Leu Gln Asp Met 1025 1030 1035Thr Thr
His Leu Ile Leu Arg Ser Phe Lys Glu Phe Leu Gln Ser 1040
1045 1050Ser Leu Arg Ala Leu Arg Gln Met 1055
106038963PRTArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polypeptide" 38Pro Val Thr Ile Asn Asn
Phe Asn Tyr Asn Asp Pro Ile Asp Asn Asn1 5
10 15Asn Ile Ile Met Met Glu Pro Pro Phe Ala Arg Gly
Thr Gly Arg Tyr 20 25 30Tyr
Lys Ala Phe Lys Ile Thr Asp Arg Ile Trp Ile Ile Pro Glu Arg 35
40 45Tyr Thr Phe Gly Tyr Lys Pro Glu Asp
Phe Asn Lys Ser Ser Gly Ile 50 55
60Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn Thr65
70 75 80Asn Asp Lys Lys Asn
Ile Phe Leu Gln Thr Met Ile Lys Leu Phe Asn 85
90 95Arg Ile Lys Ser Lys Pro Leu Gly Glu Lys Leu
Leu Glu Met Ile Ile 100 105
110Asn Gly Ile Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu Phe
115 120 125Asn Thr Asn Ile Ala Ser Val
Thr Val Asn Lys Leu Ile Ser Asn Pro 130 135
140Gly Glu Val Glu Arg Lys Lys Gly Ile Phe Ala Asn Leu Ile Ile
Phe145 150 155 160Gly Pro
Gly Pro Val Leu Asn Glu Asn Glu Thr Ile Asp Ile Gly Ile
165 170 175Gln Asn His Phe Ala Ser Arg
Glu Gly Phe Gly Gly Ile Met Gln Met 180 185
190Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gln
Glu Asn 195 200 205Lys Gly Ala Ser
Ile Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro Ala 210
215 220Leu Ile Leu Met His Glu Leu Ile His Val Leu His
Gly Leu Tyr Gly225 230 235
240Ile Lys Val Asp Asp Leu Pro Ile Val Pro Asn Glu Lys Lys Phe Phe
245 250 255Met Gln Ser Thr Asp
Ala Ile Gln Ala Glu Glu Leu Tyr Thr Phe Gly 260
265 270Gly Gln Asp Pro Ser Ile Ile Thr Pro Ser Thr Asp
Lys Ser Ile Tyr 275 280 285Asp Lys
Val Leu Gln Asn Phe Arg Gly Ile Val Asp Arg Leu Asn Lys 290
295 300Val Leu Val Cys Ile Ser Asp Pro Asn Ile Asn
Ile Asn Ile Tyr Lys305 310 315
320Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly Lys
325 330 335Tyr Ser Ile Asp
Val Glu Ser Phe Asp Lys Leu Tyr Lys Ser Leu Met 340
345 350Phe Gly Phe Thr Glu Thr Asn Ile Ala Glu Asn
Tyr Lys Ile Lys Thr 355 360 365Arg
Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys Ile Lys Asn 370
375 380Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu
Glu Gly Phe Asn Ile Ser385 390 395
400Asp Lys Asp Met Glu Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile
Asn 405 410 415Lys Gln Ala
Tyr Glu Glu Ile Ser Lys Glu His Leu Ala Val Tyr Lys 420
425 430Ile Gln Met Cys Val Asp Gly Ile Ile Thr
Ser Lys Thr Lys Ser Leu 435 440
445Ile Glu Gly Arg Asn Lys Ala Leu Asn Leu Gln Cys Ile Asp Val Asp 450
455 460Asn Glu Asp Leu Phe Phe Ile Ala
Asp Lys Asn Ser Phe Ser Asp Asp465 470
475 480Leu Ser Lys Asn Glu Arg Ile Glu Tyr Asn Thr Gln
Ser Asn Tyr Ile 485 490
495Glu Asn Asp Phe Pro Ile Asn Glu Leu Ile Leu Asp Thr Asp Leu Ile
500 505 510Ser Lys Ile Glu Leu Pro
Ser Glu Asn Thr Glu Ser Leu Thr Asp Phe 515 520
525Asn Val Asp Val Pro Val Tyr Glu Lys Gln Pro Ala Ile Lys
Lys Ile 530 535 540Phe Thr Asp Glu Asn
Thr Ile Phe Gln Tyr Leu Tyr Ser Gln Thr Phe545 550
555 560Pro Leu Asp Ile Arg Asp Ile Ser Leu Thr
Ser Ser Phe Asp Asp Ala 565 570
575Leu Leu Phe Ser Asn Lys Val Tyr Ser Phe Phe Ser Met Asp Tyr Ile
580 585 590Lys Thr Ala Asn Lys
Val Val Glu Ala Gly Leu Phe Ala Gly Trp Val 595
600 605Lys Gln Ile Val Asn Asp Phe Val Ile Glu Ala Asn
Lys Ser Asn Thr 610 615 620Met Asp Lys
Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Ile Gly Leu625
630 635 640Ala Leu Asn Val Gly Asn Glu
Thr Ala Lys Gly Asn Phe Glu Asn Ala 645
650 655Phe Glu Ile Ala Gly Ala Ser Ile Leu Leu Glu Phe
Ile Pro Glu Leu 660 665 670Leu
Ile Pro Val Val Gly Ala Phe Leu Leu Glu Ser Tyr Ile Asp Asn 675
680 685Lys Asn Lys Ile Ile Lys Thr Ile Asp
Asn Ala Leu Thr Lys Arg Asn 690 695
700Glu Lys Trp Ser Asp Met Tyr Gly Leu Ile Val Ala Gln Trp Leu Ser705
710 715 720Thr Val Asn Thr
Gln Phe Tyr Thr Ile Lys Glu Gly Met Tyr Lys Ala 725
730 735Leu Asn Tyr Gln Ala Gln Ala Leu Glu Glu
Ile Ile Lys Tyr Arg Tyr 740 745
750Asn Ile Tyr Ser Glu Lys Glu Lys Ser Asn Ile Asn Ile Asp Phe Asn
755 760 765Asp Ile Asn Ser Lys Leu Asn
Glu Gly Ile Asn Gln Ala Ile Asp Asn 770 775
780Ile Asn Asn Phe Ile Asn Gly Cys Ser Val Ser Tyr Leu Met Lys
Lys785 790 795 800Met Ile
Pro Leu Ala Val Glu Lys Leu Leu Asp Phe Asp Asn Thr Leu
805 810 815Lys Lys Asn Leu Leu Asn Tyr
Ile Asp Glu Asn Lys Leu Tyr Leu Ile 820 825
830Gly Ser Ala Glu Tyr Glu Lys Ser Lys Val Asn Lys Tyr Leu
Lys Thr 835 840 845Ile Met Pro Phe
Asp Leu Ser Ile Tyr Thr Asn Asp Thr Ile Leu Ile 850
855 860Glu Met Phe Asn Lys Tyr Asn Ser Leu Glu Gly Gly
Gly Gly Ser Gly865 870 875
880Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu Val Ala Lys Glu Leu
885 890 895Arg Cys Gln Cys Ile
Lys Thr Tyr Ser Lys Pro Phe His Pro Lys Phe 900
905 910Ile Lys Glu Leu Arg Val Ile Glu Ser Gly Pro His
Cys Ala Asn Thr 915 920 925Glu Ile
Ile Val Lys Leu Ser Asp Gly Arg Glu Leu Cys Leu Asp Pro 930
935 940Lys Glu Asn Trp Val Gln Arg Val Val Glu Lys
Phe Leu Lys Arg Ala945 950 955
960Glu Asn Ser39959PRTArtificial Sequencesource/note="Description of
Artificial Sequence Synthetic polypeptide" 39Pro Val Ala Ile Asn Ser
Phe Asn Tyr Asn Asp Pro Val Asn Asp Asp1 5
10 15Thr Ile Leu Tyr Met Gln Ile Pro Tyr Glu Glu Lys
Ser Lys Lys Tyr 20 25 30Tyr
Lys Ala Phe Glu Ile Met Arg Asn Val Trp Ile Ile Pro Glu Arg 35
40 45Asn Thr Ile Gly Thr Asn Pro Ser Asp
Phe Asp Pro Pro Ala Ser Leu 50 55
60Lys Asn Gly Ser Ser Ala Tyr Tyr Asp Pro Asn Tyr Leu Thr Thr Asp65
70 75 80Ala Glu Lys Asp Arg
Tyr Leu Lys Thr Thr Ile Lys Leu Phe Lys Arg 85
90 95Ile Asn Ser Asn Pro Ala Gly Lys Val Leu Leu
Gln Glu Ile Ser Tyr 100 105
110Ala Lys Pro Tyr Leu Gly Asn Asp His Thr Pro Ile Asp Glu Phe Ser
115 120 125Pro Val Thr Arg Thr Thr Ser
Val Asn Ile Lys Leu Ser Thr Asn Val 130 135
140Glu Ser Ser Met Leu Leu Asn Leu Leu Val Leu Gly Ala Gly Pro
Asp145 150 155 160Ile Phe
Glu Ser Cys Cys Tyr Pro Val Arg Lys Leu Ile Asp Pro Asp
165 170 175Val Val Tyr Asp Pro Ser Asn
Tyr Gly Phe Gly Ser Ile Asn Ile Val 180 185
190Thr Phe Ser Pro Glu Tyr Glu Tyr Thr Phe Asn Asp Ile Ser
Gly Gly 195 200 205His Asn Ser Ser
Thr Glu Ser Phe Ile Ala Asp Pro Ala Ile Ser Leu 210
215 220Ala His Glu Leu Ile His Ala Leu His Gly Leu Tyr
Gly Ala Arg Gly225 230 235
240Val Thr Tyr Glu Glu Thr Ile Glu Val Lys Gln Ala Pro Leu Met Ile
245 250 255Ala Glu Lys Pro Ile
Arg Leu Glu Glu Phe Leu Thr Phe Gly Gly Gln 260
265 270Asp Leu Asn Ile Ile Thr Ser Ala Met Lys Glu Lys
Ile Tyr Asn Asn 275 280 285Leu Leu
Ala Asn Tyr Glu Lys Ile Ala Thr Arg Leu Ser Glu Val Asn 290
295 300Ser Ala Pro Pro Glu Tyr Asp Ile Asn Glu Tyr
Lys Asp Tyr Phe Gln305 310 315
320Trp Lys Tyr Gly Leu Asp Lys Asn Ala Asp Gly Ser Tyr Thr Val Asn
325 330 335Glu Asn Lys Phe
Asn Glu Ile Tyr Lys Lys Leu Tyr Ser Phe Thr Glu 340
345 350Ser Asp Leu Ala Asn Lys Phe Lys Val Lys Ala
Arg Asn Thr Tyr Phe 355 360 365Ile
Lys Tyr Glu Phe Leu Lys Val Pro Asn Leu Leu Asp Asp Asp Ile 370
375 380Tyr Thr Val Ser Glu Gly Phe Asn Ile Gly
Asn Leu Ala Val Asn Asn385 390 395
400Arg Gly Gln Ser Ile Lys Leu Asn Pro Lys Ile Ile Asp Ser Ile
Pro 405 410 415Asp Lys Gly
Leu Val Glu Lys Ile Val Lys Phe Ala Val Glu Asn Asn 420
425 430Asn Asn Asn Asn Asn Asn Asn Asn Leu Gly
Cys Val Asp Gly Ile Ile 435 440
445Thr Ser Lys Thr Lys Ser Leu Ile Glu Gly Arg Asp Val Lys Cys Asp 450
455 460Met Glu Val Ser Cys Pro Asp Gly
Tyr Thr Cys Cys Arg Leu Gln Ser465 470
475 480Gly Ala Trp Gly Cys Cys Pro Phe Thr Gln Ala Val
Cys Cys Glu Asp 485 490
495His Ile His Cys Cys Pro Ala Gly Phe Thr Cys Asp Thr Gln Lys Gly
500 505 510Thr Cys Glu Ala Leu Ala
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 515 520
525Gly Gly Gly Gly Ser Ala Leu Val Leu Gln Cys Ile Glu Val
Asn Asn 530 535 540Ser Glu Leu Phe Phe
Val Ala Ser Glu Ser Ser Tyr Asn Glu Asn Asp545 550
555 560Ile Asn Thr Pro Lys Glu Ile Asp Asp Thr
Thr Asn Leu Asn Asn Asn 565 570
575Tyr Glu Asn Asn Leu Asp Glu Val Ile Leu Asp Tyr Asn Ser Gln Thr
580 585 590Ile Pro Gln Ile Ser
Asn Ile Glu Asn Leu Asn Thr Leu Val Gln Asp 595
600 605Asn Ser Tyr Val Pro Glu Tyr Asp Ser Asn Gly Thr
Ser Glu Ile Glu 610 615 620Glu Tyr Asp
Val Val Asp Phe Asn Val Phe Phe Tyr Leu His Ala Gln625
630 635 640Lys Val Pro Glu Gly Glu Thr
Asn Ile Ser Leu Thr Ser Ser Ile Asp 645
650 655Thr Ala Leu Leu Glu Glu Ser Lys Asp Ile Phe Phe
Ser Ser Glu Phe 660 665 670Ile
Asp Thr Ile Asn Lys Pro Val Asn Ala Ala Leu Phe Ile Asp Trp 675
680 685Ile Ser Lys Val Ile Arg Asp Phe Thr
Thr Glu Ala Thr Gln Lys Ser 690 695
700Thr Val Asp Lys Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Val Gly705
710 715 720Leu Ala Leu Asn
Ile Ile Ile Glu Ala Glu Lys Gly Asn Phe Glu Glu 725
730 735Ala Phe Glu Leu Leu Gly Val Gly Ile Leu
Leu Glu Phe Val Pro Glu 740 745
750Leu Thr Ile Pro Val Ile Leu Val Phe Thr Ile Lys Ser Tyr Ile Asp
755 760 765Ser Tyr Glu Asn Lys Asn Lys
Ala Ile Lys Ala Ile Asn Asn Ser Leu 770 775
780Ile Glu Arg Glu Ala Lys Trp Lys Glu Ile Tyr Ser Trp Ile Val
Ser785 790 795 800Asn Trp
Leu Thr Arg Ile Asn Thr Gln Phe Asn Lys Arg Lys Glu Gln
805 810 815Met Tyr Gln Ala Leu Gln Asn
Gln Val Asp Ala Ile Lys Thr Ala Ile 820 825
830Glu Tyr Lys Tyr Asn Asn Tyr Thr Ser Asp Glu Lys Asn Arg
Leu Glu 835 840 845Ser Glu Tyr Asn
Ile Asn Asn Ile Glu Glu Glu Leu Asn Lys Lys Val 850
855 860Ser Leu Ala Met Lys Asn Ile Glu Arg Phe Met Thr
Glu Ser Ser Ile865 870 875
880Ser Tyr Leu Met Lys Leu Ile Asn Glu Ala Lys Val Gly Lys Leu Lys
885 890 895Lys Tyr Asp Asn His
Val Lys Ser Asp Leu Leu Asn Tyr Ile Leu Asp 900
905 910His Arg Ser Ile Leu Gly Glu Gln Thr Asn Glu Leu
Ser Asp Leu Val 915 920 925Thr Ser
Thr Leu Asn Ser Ser Ile Pro Phe Glu Leu Ser Ser Tyr Thr 930
935 940Asn Asp Lys Ile Leu Ile Ile Tyr Phe Asn Arg
Leu Tyr Lys Thr945 950
95540939PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 40Thr Trp Pro Val Lys Asp Phe Asn
Tyr Ser Asp Pro Val Asn Asp Asn1 5 10
15Asp Ile Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu Ile Thr
Thr Pro 20 25 30Val Lys Ala
Phe Met Ile Thr Gln Asn Ile Trp Val Ile Pro Glu Arg 35
40 45Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys
Pro Pro Arg Pro Thr 50 55 60Ser Lys
Tyr Gln Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp Glu65
70 75 80Gln Lys Asp Thr Phe Leu Lys
Gly Ile Ile Lys Leu Phe Lys Arg Ile 85 90
95Asn Glu Arg Asp Ile Gly Lys Lys Leu Ile Asn Tyr Leu
Val Val Gly 100 105 110Ser Pro
Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp Phe 115
120 125Thr Arg His Thr Thr Asn Ile Ala Val Glu
Lys Phe Glu Asn Gly Ser 130 135 140Trp
Lys Val Thr Asn Ile Ile Thr Pro Ser Val Leu Ile Phe Gly Pro145
150 155 160Leu Pro Asn Ile Leu Asp
Tyr Thr Ala Ser Leu Thr Leu Gln Gly Gln 165
170 175Gln Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu
Ser Ile Leu Lys 180 185 190Val
Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser Asn Gln 195
200 205Ser Ser Ala Val Leu Gly Lys Ser Ile
Phe Cys Met Asp Pro Val Ile 210 215
220Ala Leu Met His Glu Leu Thr His Ser Leu His Gln Leu Tyr Gly Ile225
230 235 240Asn Ile Pro Ser
Asp Lys Arg Ile Arg Pro Gln Val Ser Glu Gly Phe 245
250 255Phe Ser Gln Asp Gly Pro Asn Val Gln Phe
Glu Glu Leu Tyr Thr Phe 260 265
270Gly Gly Leu Asp Val Glu Ile Ile Pro Gln Ile Glu Arg Ser Gln Leu
275 280 285Arg Glu Lys Ala Leu Gly His
Tyr Lys Asp Ile Ala Lys Arg Leu Asn 290 295
300Asn Ile Asn Lys Thr Ile Pro Ser Ser Trp Ile Ser Asn Ile Asp
Lys305 310 315 320Tyr Lys
Lys Ile Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn Thr
325 330 335Gly Asn Phe Val Val Asn Ile
Asp Lys Phe Asn Ser Leu Tyr Ser Asp 340 345
350Leu Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gln Tyr
Asn Val 355 360 365Lys Asn Arg Thr
His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe Ala 370
375 380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile Arg Asp
Gly Phe Asn Leu385 390 395
400Thr Asn Lys Gly Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu Arg
405 410 415Asn Pro Ala Leu Gln
Lys Leu Ser Ser Glu Ser Val Val Asp Leu Phe 420
425 430Thr Lys Val Cys Val Asp Gly Gly Gly Gly Ser Gly
Gly Gly Gly Ser 435 440 445Ala Asp
Asp Asp Asp Lys Val Val Ser His Phe Asn Asp Cys Pro Asp 450
455 460Ser His Thr Gln Phe Cys Phe His Gly Thr Cys
Arg Phe Leu Val Gln465 470 475
480Glu Asp Lys Pro Ala Cys Val Cys His Ser Gly Tyr Val Gly Ala Arg
485 490 495Cys Glu His Ala
Asp Leu Leu Ala Leu Ala Gly Gly Gly Gly Ser Gly 500
505 510Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu
Ala Leu Gln Cys Ile 515 520 525Lys
Val Lys Asn Asn Arg Leu Pro Tyr Val Ala Asp Lys Asp Ser Ile 530
535 540Ser Gln Glu Ile Phe Glu Asn Lys Ile Ile
Thr Asp Glu Thr Asn Val545 550 555
560Gln Asn Tyr Ser Asp Lys Phe Ser Leu Asp Glu Ser Ile Leu Asp
Gly 565 570 575Gln Val Pro
Ile Asn Pro Glu Ile Val Asp Pro Leu Leu Pro Asn Val 580
585 590Asn Met Glu Pro Leu Asn Leu Pro Gly Glu
Glu Ile Val Phe Tyr Asp 595 600
605Asp Ile Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu 610
615 620Ser Gln Lys Leu Ser Asn Asn Val
Glu Asn Ile Thr Leu Thr Thr Ser625 630
635 640Val Glu Glu Ala Leu Gly Tyr Ser Asn Lys Ile Tyr
Thr Phe Leu Pro 645 650
655Ser Leu Ala Glu Lys Val Asn Lys Gly Val Gln Ala Gly Leu Phe Leu
660 665 670Asn Trp Ala Asn Glu Val
Val Glu Asp Phe Thr Thr Asn Ile Met Lys 675 680
685Lys Asp Thr Leu Asp Lys Ile Ser Asp Val Ser Val Ile Ile
Pro Tyr 690 695 700Ile Gly Pro Ala Leu
Asn Ile Gly Asn Ser Ala Leu Arg Gly Asn Phe705 710
715 720Asn Gln Ala Phe Ala Thr Ala Gly Val Ala
Phe Leu Leu Glu Gly Phe 725 730
735Pro Glu Phe Thr Ile Pro Ala Leu Gly Val Phe Thr Phe Tyr Ser Ser
740 745 750Ile Gln Glu Arg Glu
Lys Ile Ile Lys Thr Ile Glu Asn Cys Leu Glu 755
760 765Gln Arg Val Lys Arg Trp Lys Asp Ser Tyr Gln Trp
Met Val Ser Asn 770 775 780Trp Leu Ser
Arg Ile Thr Thr Gln Phe Asn His Ile Asn Tyr Gln Met785
790 795 800Tyr Asp Ser Leu Ser Tyr Gln
Ala Asp Ala Ile Lys Ala Lys Ile Asp 805
810 815Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys Glu
Asn Ile Lys Ser 820 825 830Gln
Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys Ile Ser Glu Ala 835
840 845Met Asn Asn Ile Asn Lys Phe Ile Arg
Glu Cys Ser Val Thr Tyr Leu 850 855
860Phe Lys Asn Met Leu Pro Lys Val Ile Asp Glu Leu Asn Lys Phe Asp865
870 875 880Leu Arg Thr Lys
Thr Glu Leu Ile Asn Leu Ile Asp Ser His Asn Ile 885
890 895Ile Leu Val Gly Glu Val Asp Arg Leu Lys
Ala Lys Val Asn Glu Ser 900 905
910Phe Glu Asn Thr Met Pro Phe Asn Ile Phe Ser Tyr Thr Asn Asn Ser
915 920 925Leu Leu Lys Asp Ile Ile Asn
Glu Tyr Phe Asn 930 935411002PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 41Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn
Asp Asn1 5 10 15Asp Ile
Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu Ile Thr Thr Pro 20
25 30Val Lys Ala Phe Met Ile Thr Gln Asn
Ile Trp Val Ile Pro Glu Arg 35 40
45Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro Thr 50
55 60Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro
Ser Tyr Leu Ser Thr Asp Glu65 70 75
80Gln Lys Asp Thr Phe Leu Lys Gly Ile Ile Lys Leu Phe Lys
Arg Ile 85 90 95Asn Glu
Arg Asp Ile Gly Lys Lys Leu Ile Asn Tyr Leu Val Val Gly 100
105 110Ser Pro Phe Met Gly Asp Ser Ser Thr
Pro Glu Asp Thr Phe Asp Phe 115 120
125Thr Arg His Thr Thr Asn Ile Ala Val Glu Lys Phe Glu Asn Gly Ser
130 135 140Trp Lys Val Thr Asn Ile Ile
Thr Pro Ser Val Leu Ile Phe Gly Pro145 150
155 160Leu Pro Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr
Leu Gln Gly Gln 165 170
175Gln Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser Ile Leu Lys
180 185 190Val Ala Pro Glu Phe Leu
Leu Thr Phe Ser Asp Val Thr Ser Asn Gln 195 200
205Ser Ser Ala Val Leu Gly Lys Ser Ile Phe Cys Met Asp Pro
Val Ile 210 215 220Ala Leu Met His Glu
Leu Thr His Ser Leu His Gln Leu Tyr Gly Ile225 230
235 240Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro
Gln Val Ser Glu Gly Phe 245 250
255Phe Ser Gln Asp Gly Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr Phe
260 265 270Gly Gly Leu Asp Val
Glu Ile Ile Pro Gln Ile Glu Arg Ser Gln Leu 275
280 285Arg Glu Lys Ala Leu Gly His Tyr Lys Asp Ile Ala
Lys Arg Leu Asn 290 295 300Asn Ile Asn
Lys Thr Ile Pro Ser Ser Trp Ile Ser Asn Ile Asp Lys305
310 315 320Tyr Lys Lys Ile Phe Ser Glu
Lys Tyr Asn Phe Asp Lys Asp Asn Thr 325
330 335Gly Asn Phe Val Val Asn Ile Asp Lys Phe Asn Ser
Leu Tyr Ser Asp 340 345 350Leu
Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gln Tyr Asn Val 355
360 365Lys Asn Arg Thr His Tyr Phe Ser Arg
His Tyr Leu Pro Val Phe Ala 370 375
380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile Arg Asp Gly Phe Asn Leu385
390 395 400Thr Asn Lys Gly
Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu Arg 405
410 415Asn Pro Ala Leu Gln Lys Leu Ser Ser Glu
Ser Val Val Asp Leu Phe 420 425
430Thr Lys Val Cys Val Asp Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
435 440 445Ala Asp Asp Asp Asp Lys Ala
Leu Asp Thr Asn Tyr Cys Phe Ser Ser 450 455
460Thr Glu Lys Asn Cys Cys Val Arg Gln Leu Tyr Ile Asp Phe Arg
Lys465 470 475 480Asp Leu
Gly Trp Lys Trp Ile His Glu Pro Lys Gly Tyr His Ala Asn
485 490 495Phe Cys Leu Gly Pro Cys Pro
Tyr Ile Trp Ser Leu Asp Thr Gln Tyr 500 505
510Ser Lys Val Leu Ala Leu Tyr Asn Gln His Asn Pro Gly Ala
Ser Ala 515 520 525Ala Pro Cys Cys
Val Pro Gln Ala Leu Glu Pro Leu Pro Ile Val Tyr 530
535 540Tyr Val Gly Arg Lys Pro Lys Val Glu Gln Leu Ser
Asn Met Ile Val545 550 555
560Arg Ser Cys Lys Cys Ser Ala Leu Ala Gly Gly Gly Gly Ser Gly Gly
565 570 575Gly Gly Ser Gly Gly
Gly Gly Ser Ala Leu Ala Leu Gln Cys Ile Lys 580
585 590Val Lys Asn Asn Arg Leu Pro Tyr Val Ala Asp Lys
Asp Ser Ile Ser 595 600 605Gln Glu
Ile Phe Glu Asn Lys Ile Ile Thr Asp Glu Thr Asn Val Gln 610
615 620Asn Tyr Ser Asp Lys Phe Ser Leu Asp Glu Ser
Ile Leu Asp Gly Gln625 630 635
640Val Pro Ile Asn Pro Glu Ile Val Asp Pro Leu Leu Pro Asn Val Asn
645 650 655Met Glu Pro Leu
Asn Leu Pro Gly Glu Glu Ile Val Phe Tyr Asp Asp 660
665 670Ile Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tyr
Tyr Tyr Leu Glu Ser 675 680 685Gln
Lys Leu Ser Asn Asn Val Glu Asn Ile Thr Leu Thr Thr Ser Val 690
695 700Glu Glu Ala Leu Gly Tyr Ser Asn Lys Ile
Tyr Thr Phe Leu Pro Ser705 710 715
720Leu Ala Glu Lys Val Asn Lys Gly Val Gln Ala Gly Leu Phe Leu
Asn 725 730 735Trp Ala Asn
Glu Val Val Glu Asp Phe Thr Thr Asn Ile Met Lys Lys 740
745 750Asp Thr Leu Asp Lys Ile Ser Asp Val Ser
Val Ile Ile Pro Tyr Ile 755 760
765Gly Pro Ala Leu Asn Ile Gly Asn Ser Ala Leu Arg Gly Asn Phe Asn 770
775 780Gln Ala Phe Ala Thr Ala Gly Val
Ala Phe Leu Leu Glu Gly Phe Pro785 790
795 800Glu Phe Thr Ile Pro Ala Leu Gly Val Phe Thr Phe
Tyr Ser Ser Ile 805 810
815Gln Glu Arg Glu Lys Ile Ile Lys Thr Ile Glu Asn Cys Leu Glu Gln
820 825 830Arg Val Lys Arg Trp Lys
Asp Ser Tyr Gln Trp Met Val Ser Asn Trp 835 840
845Leu Ser Arg Ile Thr Thr Gln Phe Asn His Ile Asn Tyr Gln
Met Tyr 850 855 860Asp Ser Leu Ser Tyr
Gln Ala Asp Ala Ile Lys Ala Lys Ile Asp Leu865 870
875 880Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys
Glu Asn Ile Lys Ser Gln 885 890
895Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys Ile Ser Glu Ala Met
900 905 910Asn Asn Ile Asn Lys
Phe Ile Arg Glu Cys Ser Val Thr Tyr Leu Phe 915
920 925Lys Asn Met Leu Pro Lys Val Ile Asp Glu Leu Asn
Lys Phe Asp Leu 930 935 940Arg Thr Lys
Thr Glu Leu Ile Asn Leu Ile Asp Ser His Asn Ile Ile945
950 955 960Leu Val Gly Glu Val Asp Arg
Leu Lys Ala Lys Val Asn Glu Ser Phe 965
970 975Glu Asn Thr Met Pro Phe Asn Ile Phe Ser Tyr Thr
Asn Asn Ser Leu 980 985 990Leu
Lys Asp Ile Ile Asn Glu Tyr Phe Asn 995
1000421045PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 42Pro Val Thr Ile Asn Asn Phe Asn
Tyr Asn Asp Pro Ile Asp Asn Asn1 5 10
15Asn Ile Ile Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly
Arg Tyr 20 25 30Tyr Lys Ala
Phe Lys Ile Thr Asp Arg Ile Trp Ile Ile Pro Glu Arg 35
40 45Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn
Lys Ser Ser Gly Ile 50 55 60Phe Asn
Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn Thr65
70 75 80Asn Asp Lys Lys Asn Ile Phe
Leu Gln Thr Met Ile Lys Leu Phe Asn 85 90
95Arg Ile Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu
Met Ile Ile 100 105 110Asn Gly
Ile Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu Phe 115
120 125Asn Thr Asn Ile Ala Ser Val Thr Val Asn
Lys Leu Ile Ser Asn Pro 130 135 140Gly
Glu Val Glu Arg Lys Lys Gly Ile Phe Ala Asn Leu Ile Ile Phe145
150 155 160Gly Pro Gly Pro Val Leu
Asn Glu Asn Glu Thr Ile Asp Ile Gly Ile 165
170 175Gln Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly
Ile Met Gln Met 180 185 190Lys
Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gln Glu Asn 195
200 205Lys Gly Ala Ser Ile Phe Asn Arg Arg
Gly Tyr Phe Ser Asp Pro Ala 210 215
220Leu Ile Leu Met His Glu Leu Ile His Val Leu His Gly Leu Tyr Gly225
230 235 240Ile Lys Val Asp
Asp Leu Pro Ile Val Pro Asn Glu Lys Lys Phe Phe 245
250 255Met Gln Ser Thr Asp Ala Ile Gln Ala Glu
Glu Leu Tyr Thr Phe Gly 260 265
270Gly Gln Asp Pro Ser Ile Ile Thr Pro Ser Thr Asp Lys Ser Ile Tyr
275 280 285Asp Lys Val Leu Gln Asn Phe
Arg Gly Ile Val Asp Arg Leu Asn Lys 290 295
300Val Leu Val Cys Ile Ser Asp Pro Asn Ile Asn Ile Asn Ile Tyr
Lys305 310 315 320Asn Lys
Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly Lys
325 330 335Tyr Ser Ile Asp Val Glu Ser
Phe Asp Lys Leu Tyr Lys Ser Leu Met 340 345
350Phe Gly Phe Thr Glu Thr Asn Ile Ala Glu Asn Tyr Lys Ile
Lys Thr 355 360 365Arg Ala Ser Tyr
Phe Ser Asp Ser Leu Pro Pro Val Lys Ile Lys Asn 370
375 380Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu Glu Gly
Phe Asn Ile Ser385 390 395
400Asp Lys Asp Met Glu Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile Asn
405 410 415Lys Gln Ala Tyr Glu
Glu Ile Ser Lys Glu His Leu Ala Val Tyr Lys 420
425 430Ile Gln Met Cys Val Asp Glu Glu Lys Leu Tyr Asp
Asp Asp Asp Lys 435 440 445Asp Arg
Trp Gly Ser Ser Leu Gln Cys Ile Asp Val Asp Asn Glu Asp 450
455 460Leu Phe Phe Ile Ala Asp Lys Asn Ser Phe Ser
Asp Asp Leu Ser Lys465 470 475
480Asn Glu Arg Ile Glu Tyr Asn Thr Gln Ser Asn Tyr Ile Glu Asn Asp
485 490 495Phe Pro Ile Asn
Glu Leu Ile Leu Asp Thr Asp Leu Ile Ser Lys Ile 500
505 510Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr
Asp Phe Asn Val Asp 515 520 525Val
Pro Val Tyr Glu Lys Gln Pro Ala Ile Lys Lys Ile Phe Thr Asp 530
535 540Glu Asn Thr Ile Phe Gln Tyr Leu Tyr Ser
Gln Thr Phe Pro Leu Asp545 550 555
560Ile Arg Asp Ile Ser Leu Thr Ser Ser Phe Asp Asp Ala Leu Leu
Phe 565 570 575Ser Asn Lys
Val Tyr Ser Phe Phe Ser Met Asp Tyr Ile Lys Thr Ala 580
585 590Asn Lys Val Val Glu Ala Gly Leu Phe Ala
Gly Trp Val Lys Gln Ile 595 600
605Val Asn Asp Phe Val Ile Glu Ala Asn Lys Ser Asn Thr Met Asp Ala 610
615 620Ile Ala Asp Ile Ser Leu Ile Val
Pro Tyr Ile Gly Leu Ala Leu Asn625 630
635 640Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu Asn
Ala Phe Glu Ile 645 650
655Ala Gly Ala Ser Ile Leu Leu Glu Phe Ile Pro Glu Leu Leu Ile Pro
660 665 670Val Val Gly Ala Phe Leu
Leu Glu Ser Tyr Ile Asp Asn Lys Asn Lys 675 680
685Ile Ile Lys Thr Ile Asp Asn Ala Leu Thr Lys Arg Asn Glu
Lys Trp 690 695 700Ser Asp Met Tyr Gly
Leu Ile Val Ala Gln Trp Leu Ser Thr Val Asn705 710
715 720Thr Gln Phe Tyr Thr Ile Lys Glu Gly Met
Tyr Lys Ala Leu Asn Tyr 725 730
735Gln Ala Gln Ala Leu Glu Glu Ile Ile Lys Tyr Arg Tyr Asn Ile Tyr
740 745 750Ser Glu Lys Glu Lys
Ser Asn Ile Asn Ile Asp Phe Asn Asp Ile Asn 755
760 765Ser Lys Leu Asn Glu Gly Ile Asn Gln Ala Ile Asp
Asn Ile Asn Asn 770 775 780Phe Ile Asn
Gly Cys Ser Val Ser Tyr Leu Met Lys Lys Met Ile Pro785
790 795 800Leu Ala Val Glu Lys Leu Leu
Asp Phe Asp Asn Thr Leu Lys Lys Asn 805
810 815Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tyr Leu
Ile Gly Ser Ala 820 825 830Glu
Tyr Glu Lys Ser Lys Val Asn Lys Tyr Leu Lys Thr Ile Met Pro 835
840 845Phe Asp Leu Ser Ile Tyr Thr Asn Asp
Thr Ile Leu Ile Glu Met Phe 850 855
860Asn Lys Tyr Asn Ser Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly865
870 875 880Ser Gly Gly Gly
Gly Ser Ala Leu Val Arg Ser Ser Ser Arg Thr Pro 885
890 895Ser Asp Lys Pro Val Ala His Val Val Ala
Asn Pro Gln Ala Glu Gly 900 905
910Gln Leu Gln Trp Leu Asn Arg Arg Ala Asn Ala Leu Leu Ala Asn Gly
915 920 925Val Glu Leu Arg Asp Asn Gln
Leu Val Val Pro Ser Glu Gly Leu Tyr 930 935
940Leu Ile Tyr Ser Gln Val Leu Phe Lys Gly Gln Gly Cys Pro Ser
Thr945 950 955 960His Val
Leu Leu Thr His Thr Ile Ser Arg Ile Ala Val Ser Tyr Gln
965 970 975Thr Lys Val Asn Leu Leu Ser
Ala Ile Lys Ser Pro Cys Gln Arg Glu 980 985
990Thr Pro Glu Gly Ala Glu Ala Lys Pro Trp Tyr Glu Pro Ile
Tyr Leu 995 1000 1005Gly Gly Val
Phe Gln Leu Glu Lys Gly Asp Arg Leu Ser Ala Glu 1010
1015 1020Ile Asn Arg Pro Asp Tyr Leu Asp Phe Ala Glu
Ser Gly Gln Val 1025 1030 1035Tyr Phe
Gly Ile Ile Ala Leu 1040 104543962PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 43Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn
Asp Asn1 5 10 15Asp Ile
Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu Ile Thr Thr Pro 20
25 30Val Lys Ala Phe Met Ile Thr Gln Asn
Ile Trp Val Ile Pro Glu Arg 35 40
45Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro Thr 50
55 60Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro
Ser Tyr Leu Ser Thr Asp Glu65 70 75
80Gln Lys Asp Thr Phe Leu Lys Gly Ile Ile Lys Leu Phe Lys
Arg Ile 85 90 95Asn Glu
Arg Asp Ile Gly Lys Lys Leu Ile Asn Tyr Leu Val Val Gly 100
105 110Ser Pro Phe Met Gly Asp Ser Ser Thr
Pro Glu Asp Thr Phe Asp Phe 115 120
125Thr Arg His Thr Thr Asn Ile Ala Val Glu Lys Phe Glu Asn Gly Ser
130 135 140Trp Lys Val Thr Asn Ile Ile
Thr Pro Ser Val Leu Ile Phe Gly Pro145 150
155 160Leu Pro Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr
Leu Gln Gly Gln 165 170
175Gln Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser Ile Leu Lys
180 185 190Val Ala Pro Glu Phe Leu
Leu Thr Phe Ser Asp Val Thr Ser Asn Gln 195 200
205Ser Ser Ala Val Leu Gly Lys Ser Ile Phe Cys Met Asp Pro
Val Ile 210 215 220Ala Leu Met His Glu
Leu Thr His Ser Leu His Gln Leu Tyr Gly Ile225 230
235 240Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro
Gln Val Ser Glu Gly Phe 245 250
255Phe Ser Gln Asp Gly Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr Phe
260 265 270Gly Gly Leu Asp Val
Glu Ile Ile Pro Gln Ile Glu Arg Ser Gln Leu 275
280 285Arg Glu Lys Ala Leu Gly His Tyr Lys Asp Ile Ala
Lys Arg Leu Asn 290 295 300Asn Ile Asn
Lys Thr Ile Pro Ser Ser Trp Ile Ser Asn Ile Asp Lys305
310 315 320Tyr Lys Lys Ile Phe Ser Glu
Lys Tyr Asn Phe Asp Lys Asp Asn Thr 325
330 335Gly Asn Phe Val Val Asn Ile Asp Lys Phe Asn Ser
Leu Tyr Ser Asp 340 345 350Leu
Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gln Tyr Asn Val 355
360 365Lys Asn Arg Thr His Tyr Phe Ser Arg
His Tyr Leu Pro Val Phe Ala 370 375
380Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile Arg Asp Gly Phe Asn Leu385
390 395 400Thr Asn Lys Gly
Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu Arg 405
410 415Asn Pro Ala Leu Gln Lys Leu Ser Ser Glu
Ser Val Val Asp Leu Phe 420 425
430Thr Lys Val Cys Val Asp Lys Ser Glu Glu Lys Leu Tyr Asp Asp Asp
435 440 445Asp Lys Asp Arg Trp Gly Ser
Ser Leu Gln Cys Ile Lys Val Lys Asn 450 455
460Asn Arg Leu Pro Tyr Val Ala Asp Lys Asp Ser Ile Ser Gln Glu
Ile465 470 475 480Phe Glu
Asn Lys Ile Ile Thr Asp Glu Thr Asn Val Gln Asn Tyr Ser
485 490 495Asp Lys Phe Ser Leu Asp Glu
Ser Ile Leu Asp Gly Gln Val Pro Ile 500 505
510Asn Pro Glu Ile Val Asp Pro Leu Leu Pro Asn Val Asn Met
Glu Pro 515 520 525Leu Asn Leu Pro
Gly Glu Glu Ile Val Phe Tyr Asp Asp Ile Thr Lys 530
535 540Tyr Val Asp Tyr Leu Asn Ser Tyr Tyr Tyr Leu Glu
Ser Gln Lys Leu545 550 555
560Ser Asn Asn Val Glu Asn Ile Thr Leu Thr Thr Ser Val Glu Glu Ala
565 570 575Leu Gly Tyr Ser Asn
Lys Ile Tyr Thr Phe Leu Pro Ser Leu Ala Glu 580
585 590Lys Val Asn Lys Gly Val Gln Ala Gly Leu Phe Leu
Asn Trp Ala Asn 595 600 605Glu Val
Val Glu Asp Phe Thr Thr Asn Ile Met Lys Lys Asp Thr Leu 610
615 620Asp Lys Ile Ser Asp Val Ser Val Ile Ile Pro
Tyr Ile Gly Pro Ala625 630 635
640Leu Asn Ile Gly Asn Ser Ala Leu Arg Gly Asn Phe Asn Gln Ala Phe
645 650 655Ala Thr Ala Gly
Val Ala Phe Leu Leu Glu Gly Phe Pro Glu Phe Thr 660
665 670Ile Pro Ala Leu Gly Val Phe Thr Phe Tyr Ser
Ser Ile Gln Glu Arg 675 680 685Glu
Lys Ile Ile Lys Thr Ile Glu Asn Cys Leu Glu Gln Arg Val Lys 690
695 700Arg Trp Lys Asp Ser Tyr Gln Trp Met Val
Ser Asn Trp Leu Ser Arg705 710 715
720Ile Thr Thr Gln Phe Asn His Ile Asn Tyr Gln Met Tyr Asp Ser
Leu 725 730 735Ser Tyr Gln
Ala Asp Ala Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys 740
745 750Lys Tyr Ser Gly Ser Asp Lys Glu Asn Ile
Lys Ser Gln Val Glu Asn 755 760
765Leu Lys Asn Ser Leu Asp Val Lys Ile Ser Glu Ala Met Asn Asn Ile 770
775 780Asn Lys Phe Ile Arg Glu Cys Ser
Val Thr Tyr Leu Phe Lys Asn Met785 790
795 800Leu Pro Lys Val Ile Asp Glu Leu Asn Lys Phe Asp
Leu Arg Thr Lys 805 810
815Thr Glu Leu Ile Asn Leu Ile Asp Ser His Asn Ile Ile Leu Val Gly
820 825 830Glu Val Asp Arg Leu Lys
Ala Lys Val Asn Glu Ser Phe Glu Asn Thr 835 840
845Met Pro Phe Asn Ile Phe Ser Tyr Thr Asn Asn Ser Leu Leu
Lys Asp 850 855 860Ile Ile Asn Glu Tyr
Phe Asn Leu Glu Gly Gly Gly Gly Ser Gly Gly865 870
875 880Gly Gly Ser Gly Gly Gly Gly Ser Ala Leu
Lys Pro Val Ser Leu Ser 885 890
895Tyr Arg Cys Pro Cys Arg Phe Phe Glu Ser His Val Ala Arg Ala Asn
900 905 910Val Lys His Leu Lys
Ile Leu Asn Thr Pro Asn Cys Ala Leu Gln Ile 915
920 925Val Ala Arg Leu Lys Asn Asn Asn Arg Gln Val Cys
Ile Asp Pro Lys 930 935 940Leu Lys Trp
Ile Gln Glu Tyr Leu Glu Lys Ala Leu Asn Lys Arg Phe945
950 955 960Lys Met441090PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 44Pro Ile Thr Ile Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp
Asn Lys1 5 10 15Asn Ile
Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu Pro 20
25 30Glu Lys Ala Phe Arg Ile Thr Gly Asn
Ile Trp Val Ile Pro Asp Arg 35 40
45Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arg Val Thr 50
55 60Ser Pro Lys Ser Gly Tyr Tyr Asp Pro
Asn Tyr Leu Ser Thr Asp Ser65 70 75
80Asp Lys Asp Thr Phe Leu Lys Glu Ile Ile Lys Leu Phe Lys
Arg Ile 85 90 95Asn Ser
Arg Glu Ile Gly Glu Glu Leu Ile Tyr Arg Leu Ser Thr Asp 100
105 110Ile Pro Phe Pro Gly Asn Asn Asn Thr
Pro Ile Asn Thr Phe Asp Phe 115 120
125Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg Gln Gly Asn Asn
130 135 140Trp Val Lys Thr Gly Ser Ile
Asn Pro Ser Val Ile Ile Thr Gly Pro145 150
155 160Arg Glu Asn Ile Ile Asp Pro Glu Thr Ser Thr Phe
Lys Leu Thr Asn 165 170
175Asn Thr Phe Ala Ala Gln Glu Gly Phe Gly Ala Leu Ser Ile Ile Ser
180 185 190Ile Ser Pro Arg Phe Met
Leu Thr Tyr Ser Asn Ala Thr Asn Asp Val 195 200
205Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro
Ile Leu 210 215 220Ile Leu Met His Glu
Leu Asn His Ala Met His Asn Leu Tyr Gly Ile225 230
235 240Ala Ile Pro Asn Asp Gln Thr Ile Ser Ser
Val Thr Ser Asn Ile Phe 245 250
255Tyr Ser Gln Tyr Asn Val Lys Leu Glu Tyr Ala Glu Ile Tyr Ala Phe
260 265 270Gly Gly Pro Thr Ile
Asp Leu Ile Pro Lys Ser Ala Arg Lys Tyr Phe 275
280 285Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser Ile Ala
Lys Arg Leu Asn 290 295 300Ser Ile Thr
Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr Ile Gly Glu305
310 315 320Tyr Lys Gln Lys Leu Ile Arg
Lys Tyr Arg Phe Val Val Glu Ser Ser 325
330 335Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu
Leu Tyr Asn Glu 340 345 350Leu
Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala Lys Ile Tyr Asn Val 355
360 365Gln Asn Arg Lys Ile Tyr Leu Ser Asn
Val Tyr Thr Pro Val Thr Ala 370 375
380Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile Gln Asn Gly Phe Asn Ile385
390 395 400Pro Lys Ser Asn
Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser Arg 405
410 415Asn Pro Ala Leu Arg Lys Val Asn Pro Glu
Asn Met Leu Tyr Leu Phe 420 425
430Thr Lys Phe Cys Val Asp Ala Ile Asp Gly Arg Ser Leu Tyr Asn Lys
435 440 445Thr Leu Gln Cys Arg Glu Leu
Leu Val Lys Asn Thr Asp Leu Pro Phe 450 455
460Ile Gly Asp Ile Ser Asp Val Lys Thr Asp Ile Phe Leu Arg Lys
Asp465 470 475 480Ile Asn
Glu Glu Thr Glu Val Ile Tyr Tyr Pro Asp Asn Val Ser Val
485 490 495Asp Gln Val Ile Leu Ser Lys
Asn Thr Ser Glu His Gly Gln Leu Asp 500 505
510Leu Leu Tyr Pro Ser Ile Asp Ser Glu Ser Glu Ile Leu Pro
Gly Glu 515 520 525Asn Gln Val Phe
Tyr Asp Asn Arg Thr Gln Asn Val Asp Tyr Leu Asn 530
535 540Ser Tyr Tyr Tyr Leu Glu Ser Gln Lys Leu Ser Asp
Asn Val Glu Asp545 550 555
560Phe Thr Phe Thr Arg Ser Ile Glu Glu Ala Leu Asp Asn Ser Ala Lys
565 570 575Val Tyr Thr Tyr Phe
Pro Thr Leu Ala Asn Lys Val Asn Ala Gly Val 580
585 590Gln Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val
Val Glu Asp Phe 595 600 605Thr Thr
Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys Ile Ser Asp Val 610
615 620Ser Ala Ile Ile Pro Tyr Ile Gly Pro Ala Leu
Asn Ile Ser Asn Ser625 630 635
640Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val Thr
645 650 655Ile Leu Leu Glu
Ala Phe Pro Glu Phe Thr Ile Pro Ala Leu Gly Ala 660
665 670Phe Val Ile Tyr Ser Lys Val Gln Glu Arg Asn
Glu Ile Ile Lys Thr 675 680 685Ile
Asp Asn Cys Leu Glu Gln Arg Ile Lys Arg Trp Lys Asp Ser Tyr 690
695 700Glu Trp Met Met Gly Thr Trp Leu Ser Arg
Ile Ile Thr Gln Phe Asn705 710 715
720Asn Ile Ser Tyr Gln Met Tyr Asp Ser Leu Asn Tyr Gln Ala Gly
Ala 725 730 735Ile Lys Ala
Lys Ile Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp 740
745 750Lys Glu Asn Ile Lys Ser Gln Val Glu Asn
Leu Lys Asn Ser Leu Asp 755 760
765Val Lys Ile Ser Glu Ala Met Asn Asn Ile Asn Lys Phe Ile Arg Glu 770
775 780Cys Ser Val Thr Tyr Leu Phe Lys
Asn Met Leu Pro Lys Val Ile Asp785 790
795 800Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys
Leu Ile Asn Leu 805 810
815Ile Asp Ser His Asn Ile Ile Leu Val Gly Glu Val Asp Lys Leu Lys
820 825 830Ala Lys Val Asn Asn Ser
Phe Gln Asn Thr Ile Pro Phe Asn Ile Phe 835 840
845Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp Ile Ile Asn Glu
Tyr Phe 850 855 860Asn Leu Glu Gly Gly
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly865 870
875 880Gly Ser Ala Leu Ala Pro Met Ala Glu Gly
Gly Gly Gln Asn His His 885 890
895Glu Val Val Lys Phe Met Asp Val Tyr Gln Arg Ser Tyr Cys His Pro
900 905 910Ile Glu Thr Leu Val
Asp Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu 915
920 925Tyr Ile Phe Lys Pro Ser Cys Val Pro Leu Met Arg
Cys Gly Gly Cys 930 935 940Cys Asn Asp
Glu Gly Leu Glu Cys Val Pro Thr Glu Glu Ser Asn Ile945
950 955 960Thr Met Gln Ile Met Arg Ile
Lys Pro His Gln Gly Gln His Ile Gly 965
970 975Glu Met Ser Phe Leu Gln His Asn Lys Cys Glu Cys
Arg Pro Lys Lys 980 985 990Asp
Arg Ala Arg Gln Glu Lys Lys Ser Val Arg Gly Lys Gly Lys Gly 995
1000 1005Gln Lys Arg Lys Arg Lys Lys Ser
Arg Tyr Lys Ser Trp Ser Val 1010 1015
1020Tyr Val Gly Ala Arg Cys Cys Leu Met Pro Trp Ser Leu Pro Gly
1025 1030 1035Pro His Pro Cys Gly Pro
Cys Ser Glu Arg Arg Lys His Leu Phe 1040 1045
1050Val Gln Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr
Asp 1055 1060 1065Ser Arg Cys Lys Ala
Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys 1070 1075
1080Arg Cys Asp Lys Pro Arg Arg 1085
10904510PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic peptide" 45Glu His Trp Ser Tyr Gly Leu Arg Pro
Gly1 5
10466PRTUnknownsource/note="Description of Unknown Transforming
growth factor alpha peptide" 46Trp Ser His Phe Asn Asp1
5475PRTUnknownsource/note="Description of Unknown Neuregulin
peptide" 47Ser His Leu Val Lys1
5485PRTUnknownsource/note="Description of Unknown Epidermal growth
factor peptide" 48Asn Ser Asp Ser Glu1 54944PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
polypeptide" 49Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu
Gly Gln1 5 10 15Leu Ser
Ala Arg Lys Leu Leu Gln Asp Ile Met Ser Arg Gln Gln Gly 20
25 30Glu Ser Asn Gln Glu Arg Gly Ala Arg
Ala Arg Leu 35 405029PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
peptide" 50Tyr Ala Asp Ala Ile Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly
Gln1 5 10 15Leu Ser Ala
Arg Lys Leu Leu Gln Asp Ile Met Ser Arg 20
255144PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 51Tyr Ala Asp Ala Ile Phe Thr Asn
Ser Tyr Arg Lys Ile Leu Gly Gln1 5 10
15Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Asn Arg Gln
Gln Gly 20 25 30Glu Arg Asn
Gln Glu Gln Gly Ala Lys Val Arg Leu 35
405244PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic polypeptide" 52Tyr Ala Asp Ala Ile Phe Thr Asn
Ser Tyr Arg Lys Val Leu Gly Gln1 5 10
15Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Met Asn Arg Gln
Gln Gly 20 25 30Glu Arg Asn
Gln Glu Gln Gly Ala Lys Val Arg Leu 35
405329PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic peptide" 53His Val Asp Ala Ile Phe Thr Gln Ser
Tyr Arg Lys Val Leu Ala Gln1 5 10
15Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Leu Asn Arg
20 255440PRTArtificial Sequencesource/note="Description
of Artificial Sequence Synthetic polypeptide" 54His Val Asp Ala Ile
Phe Thr Gln Ser Tyr Arg Lys Val Leu Ala Gln1 5
10 15Leu Ser Ala Arg Lys Leu Leu Gln Asp Ile Leu
Asn Arg Gln Gln Gly 20 25
30Glu Arg Asn Gln Glu Gln Gly Ala 35
405514PRTUnknownsource/note="Description of Unknown Frog bombesin
peptide" 55Glu Gln Arg Leu Gly Asn Gln Trp Ala Val Gly His Leu Met1
5 105627PRTUnknownsource/note="Description of
Unknown Porcine gastrin releasing peptide" 56Ala Pro Val Ser Val Gly
Gly Gly Thr Val Leu Ala Lys Met Tyr Pro1 5
10 15Arg Gly Asn His Trp Ala Val Gly His Leu Met
20 255727PRTHomo sapiens 57Val Pro Leu Pro Ala Gly
Gly Gly Thr Val Leu Thr Lys Met Tyr Pro1 5
10 15Arg Gly Asn His Trp Ala Val Gly His Leu Met
20 25585PRTUnknownsource/note="Description of
Unknown Enterokinase peptide" 58Asp Asp Asp Asp Lys1
5594PRTUnknownsource/note="Description of Unknown Factor Xa peptide"
59Ile Glu Gly Arg1604PRTUnknownsource/note="Description of Unknown Factor
Xa peptide" 60Ile Asp Gly Arg1617PRTTobacco etch virus 61Glu Asn Leu
Tyr Phe Gln Gly1 5626PRTUnknownsource/note="Description of
Unknown Thrombin peptide" 62Leu Val Pro Arg Gly Ser1
5638PRTArtificial Sequencesource/note="Description of Artificial Sequence
Synthetic peptide" 63Leu Glu Val Leu Phe Gln Gly Pro1
5646PRTArtificial Sequencesource/note="Description of Artificial Sequence
Synthetic 6xHis tag" 64His His His His His His1
5654PRTUnknownsource/note="Description of Unknown Caspase 3 peptide"
65Asp Met Gln Asp1666PRTUnknownsource/note="Description of Unknown
Thrombin peptide" 66Leu Val Pro Arg Gly Ser1
56710PRTUnknownsource/note="Description of Unknown ADAM17 peptide" 67Pro
Leu Ala Gln Ala Val Arg Ser Ser Ser1 5
106810PRTHomo sapiens 68Ser Lys Gly Arg Ser Leu Ile Gly Arg Val1
5 10696PRTUnknownsource/note="Description of
Unknown Elastase peptide" 69Met Glu Ala Val Thr Tyr1
5704PRTUnknownsource/note="Description of Unknown Granzyme peptide"
70Ile Glu Pro Asp1714PRTUnknownsource/note="Description of Unknown
Caspase 2 peptide" 71Asp Val Ala
Asp1724PRTUnknownsource/note="Description of Unknown Caspase 4
peptide" 72Leu Glu Val Asp1734PRTUnknownsource/note="Description of
Unknown Caspase 7 peptide" 73Asp Glu Val
Asp1744PRTUnknownsource/note="Description of Unknown Caspase 9
peptide" 74Leu Glu His Asp1754PRTUnknownsource/note="Description of
Unknown Caspase 10 peptide" 75Ile Glu His Asp17610PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
peptide" 76Cys His Ser Gly Tyr Val Gly Ala Arg Cys1 5
107712PRTUnknownsource/note="Description of Unknown Epidermal
growth factor peptide" 77Tyr His Trp Tyr Gly Tyr Thr Pro Gln Asn Val
Ile1 5 1078116PRTHomo sapiens 78Met Leu
Ser Cys Arg Leu Gln Cys Ala Leu Ala Ala Leu Ser Ile Val1 5
10 15Leu Ala Leu Gly Cys Val Thr Gly
Ala Pro Ser Asp Pro Arg Leu Arg 20 25
30Gln Phe Leu Gln Lys Ser Leu Ala Ala Ala Ala Gly Lys Gln Glu
Leu 35 40 45Ala Lys Tyr Phe Leu
Ala Glu Leu Leu Ser Glu Pro Asn Gln Thr Glu 50 55
60Asn Asp Ala Leu Glu Pro Glu Asp Leu Ser Gln Ala Ala Glu
Gln Asp65 70 75 80Glu
Met Arg Leu Glu Leu Gln Arg Ser Ala Asn Ser Asn Pro Ala Met
85 90 95Ala Pro Arg Glu Arg Lys Ala
Gly Cys Lys Asn Phe Phe Trp Lys Thr 100 105
110Phe Thr Ser Cys 11579155PRTHomo sapiens 79Met Tyr
Arg His Lys Asn Ser Trp Arg Leu Gly Leu Lys Tyr Pro Pro1 5
10 15Ser Ser Lys Glu Glu Thr Gln Val
Pro Lys Thr Leu Ile Ser Gly Leu 20 25
30Pro Gly Arg Lys Ser Ser Ser Arg Val Gly Glu Lys Leu Gln Ser
Ala 35 40 45His Lys Met Pro Leu
Ser Pro Gly Leu Leu Leu Leu Leu Leu Ser Gly 50 55
60Ala Thr Ala Thr Ala Ala Leu Pro Leu Glu Gly Gly Pro Thr
Gly Arg65 70 75 80Asp
Ser Glu His Met Gln Glu Ala Ala Gly Ile Arg Lys Ser Ser Leu
85 90 95Leu Thr Phe Leu Ala Trp Trp
Phe Glu Trp Thr Ser Gln Ala Ser Ala 100 105
110Gly Pro Leu Ile Gly Glu Glu Ala Arg Glu Val Ala Arg Arg
Gln Glu 115 120 125Gly Ala Pro Pro
Gln Gln Ser Ala Arg Arg Asp Arg Met Pro Cys Arg 130
135 140Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys Lys145
150 155807PRTHomo sapiens 80Asn Phe Phe Trp
Lys Thr Phe1 5818PRTHomo
sapiensVARIANT(1)..(1)/replace="Lys" 81Arg Asn Phe Phe Trp Lys Thr Phe1
5829PRTHomo sapiensVARIANT(2)..(2)/replace="Lys" 82Cys Arg
Asn Phe Phe Trp Lys Thr Phe1 58310PRTHomo
sapiensVARIANT(1)..(1)/replace="Gly" 83Pro Cys Arg Asn Phe Phe Trp Lys
Thr Phe1 5 10848PRTHomo
sapiensVARIANT(8)..(8)/replace="Thr" 84Asn Phe Phe Trp Lys Thr Phe Ser1
5859PRTHomo sapiensVARIANT(8)..(8)/replace="Thr" 85Asn Phe
Phe Trp Lys Thr Phe Ser Ser1 58610PRTHomo
sapiensVARIANT(8)..(8)/replace="Thr" 86Asn Phe Phe Trp Lys Thr Phe Ser
Ser Cys1 5 10879PRTHomo
sapiensVARIANT(1)..(1)/replace="Lys" 87Arg Asn Phe Phe Trp Lys Thr Phe
Ser1 58810PRTHomo sapiensVARIANT(1)..(1)/replace="Lys"
88Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser1 5
108911PRTHomo sapiensVARIANT(1)..(1)/replace="Lys" 89Arg Asn Phe Phe
Trp Lys Thr Phe Ser Ser Cys1 5
109010PRTHomo sapiensVARIANT(2)..(2)/replace="Lys" 90Cys Arg Asn Phe Phe
Trp Lys Thr Phe Ser1 5 109111PRTHomo
sapiensVARIANT(2)..(2)/replace="Lys" 91Cys Arg Asn Phe Phe Trp Lys Thr
Phe Ser Ser1 5 109212PRTHomo
sapiensVARIANT(2)..(2)/replace="Lys" 92Cys Arg Asn Phe Phe Trp Lys Thr
Phe Ser Ser Cys1 5 109311PRTHomo
sapiensVARIANT(1)..(1)/replace="Gly" 93Pro Cys Arg Asn Phe Phe Trp Lys
Thr Phe Ser1 5 109412PRTHomo
sapiensVARIANT(1)..(1)/replace="Gly" 94Pro Cys Arg Asn Phe Phe Trp Lys
Thr Phe Ser Ser1 5 109512PRTHomo
sapiensVARIANT(1)..(1)/replace="Gly" 95Pro Cys Arg Asn Phe Phe Trp Lys
Thr Phe Ser Cys1 5 109628PRTHomo sapiens
96Ser Ala Asn Ser Asn Pro Ala Met Ala Pro Arg Glu Arg Lys Ala Gly1
5 10 15Cys Lys Asn Phe Phe Trp
Lys Thr Phe Thr Ser Cys 20 259714PRTHomo
sapiens 97Ala Gly Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys1
5 109829PRTHomo sapiens 98Gln Glu Gly Ala Pro
Pro Gln Gln Ser Ala Arg Arg Asp Arg Met Pro1 5
10 15Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser
Cys Lys 20 259929PRTHomo sapiens 99Gln Glu
Arg Pro Pro Leu Gln Gln Pro Pro His Arg Asp Lys Lys Pro1 5
10 15Cys Lys Asn Phe Phe Trp Lys Thr
Phe Ser Ser Cys Lys 20 2510029PRTHomo sapiens
100Gln Glu Arg Pro Pro Pro Gln Gln Pro Pro His Leu Asp Lys Lys Pro1
5 10 15Cys Lys Asn Phe Phe Trp
Lys Thr Phe Ser Ser Cys Lys 20 2510117PRTHomo
sapiens 101Asp Arg Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser Ser
Cys1 5 10
15Lys10214PRTHomo sapiens 102Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Ser
Ser Cys Lys1 5 1010314PRTHomo sapiens
103Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys Lys1
5 1010410PRTUnknownsource/note="Description of Unknown
Bombesin peptide" 104Gly Asn Gln Trp Ala Val Gly His Leu Met1
5 1010510PRTUnknownsource/note="Description of
Unknown Neuromedin B peptide" 105Gly Asn Leu Trp Ala Thr Gly His Phe
Met1 5
1010610PRTUnknownsource/note="Description of Unknown Neuromedin C
peptide" 106Gly Asn His Trp Ala Val Gly His Leu Met1 5
101079PRTUnknownsource/note="Description of Unknown Litorin
peptide" 107Glu Gln Trp Ala Val Gly His Phe Met1
510810PRTHomo sapiens 108Gly Asn His Trp Ala Val Gly His Leu Met1
5 1010923PRTInfluenza virus 109Gly Leu Phe Gly
Ala Ile Ala Gly Phe Ile Glu Asn Gly Trp Glu Gly1 5
10 15Met Ile Asp Gly Trp Tyr Gly
2011023PRTArtificial Sequencesource/note="Description of Artificial
Sequence Synthetic peptide" 110Val Arg Gly Ile Ile Pro Phe Lys Thr
Lys Ser Leu Asp Glu Gly Tyr1 5 10
15Asn Lys Ala Leu Asn Asp Leu 2011123PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
peptide" 111Val Asp Gly Ile Ile Thr Ser Lys Thr Lys Ser Leu Ile Glu Gly
Arg1 5 10 15Asn Lys Ala
Leu Asn Leu Gln 2011223PRTArtificial
Sequencesource/note="Description of Artificial Sequence Synthetic
peptide" 112Val Asp Gly Ile Ile Thr Ser Lys Thr Lys Ser Asp Asp Asp Asp
Lys1 5 10 15Asn Lys Ala
Leu Asn Leu Gln 20
User Contributions:
Comment about this patent or add new information about this topic: