Patent application title: CD20-BINDING IMMUNOTOXINS FOR INDUCING CELLULAR INTERNALIZATION AND METHODS USING SAME
Inventors:
IPC8 Class: AC07K1425FI
USPC Class:
Class name:
Publication date: 2020-01-02
Patent application number: 20200002387
Abstract:
The present invention provides CD20-binding proteins that bind to and
rapidly internalize CD20 antigens from a cell surface location to the
interior of a cell. CD20-binding proteins of the invention comprise a
CD20 binding region and a Shiga toxin effector region. Certain of the
disclosed CD20-binding proteins kill cells that express CD20 on their
surface. Further, the presently disclosed CD20-binding proteins can
comprise additional exogenous materials and are capable of targeted
delivery of these additional exogenous materials into the interior of
CD20 expressing cells. Such additional materials may include peptides,
antigens, enzymes, and polynucleotides. These CD20-binding proteins have
uses in methods of internalizing themselves, targeted killing of CD20
expressing cells, delivering exogenous materials into CD20 expressing
cells, and treating a variety of diseases involving CD20 expressing
cells, such as cancers and immune disorders.
Claims:
1. A CD20-binding protein comprising: (a) a CD20 binding region
comprising an immunoglobulin-type binding region that specifically binds
an extracellular part of a CD20 polypeptide, (b) a Shiga toxin A subunit
effector polypeptide; and (c) an additional exogenous material.
2. The CD20-binding protein of claim 1, wherein the CD20-binding protein is capable of rapidly internalizing into a CD20-expressing cell within less than six hours when the CD20-binding protein is contacted with the CD20-expressing cell.
3. The CD20-binding protein of claim 1, wherein the CD20-binding region comprises a complementarity determining region 3 fragment, a constrained FR3-CDR3-FR4 polypeptide, a single-domain antibody fragment, a single-chain variable fragment, an antibody variable fragment, an antigen-binding fragment, an Fd fragment, a fibronectin-derived 10th fibronectin type III domain, a tenascin type III domain, an ankyrin repeat motif domain, a low-density-lipoprotein-receptor-derived A-domain, a lipocalin, a Kunitz domain, a Protein-A-derived Z domain, a gamma-B crystallin-derived domain, a ubiquitin-derived domain, a Sac7d-derived polypeptide, a Fyn-derived SH2 domain, or any genetically manipulated counterparts of any of the foregoing that retain CD20 binding function.
4. The CD20-binding protein of claim 1, wherein the Shiga toxin A subunit effector polypeptide comprises an amino acid sequence having at least 85% sequence identity to: (i) amino acids 75 to 251 of SEQ ID NO: 1, SEQ ID NO: 25, or SEQ ID NO: 26; (ii) amino acids 1 to 241 of SEQ ID NO: 1, SEQ ID NO: 25 or SEQ ID NO: 26; (iii) amino acids 1 to 251 of SEQ ID NO: 1, SEQ ID NO: 25 or SEQ ID NO: 26; or (iv) amino acids 1 to 261 of SEQ ID NO: 1, SEQ ID NO: 25 or SEQ ID NO: 26.
5. The CD20-binding protein of claim 1, wherein the CD20-binding region comprises: (a) a heavy chain variable (VH) domain comprising an HCDR1 of SEQ ID NO: 6, an HCDR2 of SEQ ID NO: 7, and an HCDR3 of SEQ ID NO: 8, and a light chain variable (VL) domain comprising an LCDR1 of SEQ ID NO: 9, an LCDR2 of SEQ ID NO: 10, and an LCDR3 of SEQ ID NO: 11, (b) a VH domain comprising an HCDR1 of SEQ ID NO: 21, an HCDR2 of SEQ ID NO: 22, and an HCDR3 of SEQ ID NO: 27, and a VL domain comprising an LCDR1 of SEQ ID NO: 24, an LCDR2 of SEQ ID NO: 10, and an LCDR3 of SEQ ID NO: 11, or (c) a VH domain comprising an HCDR1 of SEQ ID NO: 21, an HCDR2 of SEQ ID NO: 22, and an HCDR3 of SEQ ID NO: 23, and a VL domain comprising an LCDR1 of SEQ ID NO: 28, an LCDR2 of SEQ ID NO: 10, and an LCDR3 of SEQ ID NO: 29.
6. A pharmaceutical composition comprising the CD20-binding protein of claim 1 and at least one pharmaceutically acceptable excipient or carrier.
7. The pharmaceutical composition of claim 6, which comprises a solvate, salt, ester or amide of the CD20-binding protein.
8. The pharmaceutical composition of claim 6, wherein the excipient is: acetate, alcohol, alpha-tocopherol, aluminum monostearate, ascorbic acid, ascorbyl palmitate, benzyl alcohol, butylated hydroxyanisole, butylated hydroxytoluene, chlorobutanol, citrate, cysteine hydrochloride, dextrose, ethanol, ethylenediaminetetraacetic acid, ethyloleate, gelatin, glycerine, glycerol, lactic acid, lecithin, mannitol, methyl parabens, monostearate salt, organic ester, paraben, phenol phosphate, phosphoric acid, polyalcohol, polyethylene glycol, polyol, propylene glycol, propylgallate, Ringer's solution, saline, sodium bisulfate, sodium bisulfite, sodium chloride, sodium metabisulfite, sodium sulfite, sorbic acid, sorbitol, sugar, tartaric acid, vegetable oil or water.
9. The pharmaceutical composition of claim 6, which further comprises an acceptable solvent, vehicle, sterile aqueous solution, buffer, powder, sterile powder, surfactant, antioxidant, chelating agent, antimicrobial agent, preservative, isotonic agent, dispersion medium, coating, adjuvant, wetting agent, emulsifying agent, dispersing agent, adsorption delaying agent, stabilizer, or additive.
10. A method for delivering an exogenous material into a CD20-expressing cell, the method comprising contacting a CD20-expressing cell having an interior with a CD20-binding protein comprising: (a) a CD20 binding region comprising an immunoglobulin-type binding region and capable of specifically binding an extracellular part of a CD20 protein, (b) a Shiga toxin A subunit effector polypeptide; and (c) an additional exogenous material, and wherein the contacting step results in the CD20-binding protein delivering the additional exogenous material into the interior of the CD20-expressing cell.
11. The method of claim 10, wherein the CD20-binding region comprises: a complementarity determining region 3 fragment, a constrained FR3-CDR3-FR4 polypeptide, a single-domain antibody fragment, a single-chain variable fragment, an antibody variable fragment, an antigen-binding fragment, an Fd fragment, a fibronectin-derived 10th fibronectin type III domain, a tenascin type III domain, an ankyrin repeat motif domain, a low-density-lipoprotein-receptor-derived A-domain, a lipocalin, a Kunitz domain, a Protein-A-derived Z domain, a gamma-B crystallin-derived domain, a ubiquitin-derived domain, a Sac7d-derived polypeptide, a Fyn-derived SH2 domain, or any genetically manipulated counterparts of any of the foregoing that retain CD20-binding function.
12. The method of claim 10, wherein the contacting step results in the CD20-binding protein inducing cellular internalization of the CD20-binding protein in less than about an hour.
13. The method of claim 10, wherein the Shiga toxin A subunit effector polypeptide comprises an amino acid sequence having at least 85% sequence identity to: (i) amino acids 75 to 251 of SEQ ID NO: 1, SEQ ID NO: 25, or SEQ ID NO: 26; (ii) amino acids 1 to 241 of SEQ ID NO: 1, SEQ ID NO: 25 or SEQ ID NO: 26; (iii) amino acids 1 to 251 of SEQ ID NO: 1, SEQ ID NO: 25 or SEQ ID NO: 26, or (iv) amino acids 1 to 261 of SEQ ID NO: 1, SEQ ID NO: 25 or SEQ ID NO: 26.
14. The method of claim 10, wherein the CD20-binding region comprises: (a) a heavy chain variable (VH) domain comprising an HCDR1 of SEQ ID NO: 6, an HCDR2 of SEQ ID NO: 7, and an HCDR3 of SEQ ID NO: 8, and a light chain variable (VL) domain comprising an LCDR1 of SEQ ID NO: 9, an LCDR2 of SEQ ID NO: 10, and an LCDR3 of SEQ ID NO: 11, (b) a VH domain comprising an HCDR1 of SEQ ID NO: 21, an HCDR2 of SEQ ID NO: 22, and an HCDR3 of SEQ ID NO: 27, and a VL domain comprising an LCDR1 of SEQ ID NO: 24, an LCDR2 of SEQ ID NO: 10, and an LCDR3 of SEQ ID NO: 11, or (c) a VH domain comprising an HCDR1 of SEQ ID NO: 21, an HCDR2 of SEQ ID NO: 22, and an HCDR3 of SEQ ID NO: 23, and a VL domain comprising an LCDR1 of SEQ ID NO: 28, an LCDR2 of SEQ ID NO: 10, and an LCDR3 of SEQ ID NO: 29.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a Divisional of U.S. patent application Ser. No. 14/774,609, filed Sep. 10, 2015, which is a 35 U.S.C. § 371 National Phase Entry Application of International Patent Application No. PCT/US2014/023198, filed Mar. 11, 2014, which claims the benefit under 35 U.S.C. § 119(e) of the U.S. Provisional Application No. 61/777,130, filed Mar. 12, 2013, the contents of each of which are incorporated herein by reference in their entirety.
SEQUENCE LISTING
[0002] The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Mar. 24, 2014, is named 13-03PCT-Sequence Listing--later furnished.txt and is 167 bytes in size.
FIELD OF THE INVENTION
[0003] The present invention relates to CD20-binding proteins that are able to bind to CD20 antigens and force their rapid internalization from a cell surface location to the cell interior. These CD20-binding proteins have uses as therapeutic molecules for treatment of a variety of diseases, including cancer and immune disorders.
BACKGROUND OF THE INVENTION
[0004] An immunotoxin is a chimeric molecule which combines a cell surface binding region, such as from an immunoglobulin domain, and a toxin region typically derived from a naturally occurring protein toxin, such as those found in bacteria or plants. The potency of an immunotoxin greatly depends on its efficiency in transiting from the cell surface to the cytosol, a process that begins with cell internalization (see Pirie C et al., J Biol Chem 286: 4165-72 (2011)).
[0005] CD20 is a member of a family of polypeptides known as the membrane-spanning 4A (MS4A) family that includes at least 26 proteins in humans and mice (Ishibashi K et al., Gene 264: 87-93 (2001)). As with all MS4A members, the CD20 sequence predicts three hydrophobic regions forming a transmembrane molecule that spans the membrane four times, a structural characteristic believed central to its function. Also predicted is a single extracellular loop between the proposed third and fourth transmembrane domains and intracellular amino- and carboxy-terminal regions (Tedder T et al., Proc Natl Acad Sci 85: 208-12 (1988)). It is within this extracellular loop of approximately 40 amino acids that the majority of anti-CD20 monoclonal antibodies (mAbs), such as rituximab, are believed to bind with alanine-170 and proline-172 being the most critical residues. A crystal structure of an antibody binding a peptide fragment of CD20 using amino acids 163-187 of CD20 has confirmed amino acids 170 (alanine) through amino acids 173 (serine) as antigen-antibody interaction points for rituximab and CD20 (Du J et al., J Biol Chem 282: 15073-80 (2007)).
[0006] CD20 is believed to be present on the cell surface as a homo-multimer, likely a tetramer, and electron microscopy has shown that 90% of complexed CD20 is present in the membrane in lipid rafts and microvilli (Li H et al., J Biol Chem 279: 19893-901 (2004)). Lipid rafts are micro-domains found in the plasma membrane which have high polypeptide, sphingolipid, and cholesterol concentrations (Brown D, London E, Annu Rev Cell Dev Biol 14: 111-36 (1998)). Microvilli, or microvillar channels, are cell extensions from the plasma membrane surface (Reaven E et al., J Lipid Res 30: 1551-60 (1989)). Some antibodies to CD20, such as FMC7, are known to bind only when the molecule is present in lipid rafts (Polyak M et al., Leukemia 17: 1384-89 (2003)), and others, such as rituximab, are known to increase association of CD20 to rafts (Li H et al., supra). It is hypothesized that raft association is important to the proposed function of CD20 as an amplifier of calcium signals that are transduced through the B-cell antigen receptor (BCR), another protein commonly located within lipid rafts and found associated with CD20 multimers (Polyak M et al., J Biol Chem 283: 18545-52 (2008)).
[0007] Antibody-based therapies targeting a CD20 antigen are numerous (see Boross P, Leusen J, Am J Cancer Res 2: 676-90 (2012), for review). One of the attractive characteristics of CD20 as a target for therapies based on a mechanism in which the therapeutic remains on the cell surface in order to function is the lack of CD20 cellular internalization after being bound by antibody-based therapeutics (Anderson K et al., Blood 63: 1424-33 (1984); Press O et al., Blood 69: 584-91 (1987)). Although this has proven to be both cell-type and antibody-type specific, in general, CD20 appears to internalize at a much lower rate than do other cell surface antigens (Beers S et al., Sem Hematol 47: 107-14 (2010)).
[0008] There is a question in the art as to the utility of CD20 antigens as a target for therapies that require the therapeutic to internalize into a target cell after binding in order to be effective because of the general finding that CD20 does not readily internalize (Anderson K et al., Blood 63: 1424-33 (1984); Press O et al., Blood 69: 584-91 (1987); Beers S et al., Sem Hematol 47: 107-14 (2010)). Thus, there is an unsolved problem in targeting CD20 antigens with immunoglobulin-type therapeutics that require cell internalization for efficacy--how to force the CD20 bound therapeutic into the target cell's interior after binding. For example, therapies based on the delivery of an immunotoxin that targets a CD20 antigen are predicted to be ineffective based on insufficient CD20 internalization efficiency. Thus, there is a need in the art to develop effective compositions, therapeutics, and therapeutic methods that target cell-surface antigens which do not natively internalize at an efficient rate or upon binding by an immunoglobulin-type domain, like CD20.
[0009] In particular, there remains a need in the art to identify and develop CD20-targeted compositions that trigger rapid and efficient cellular internalization of the complex of the composition bound to CD20. For example, cytotoxic CD20-binding proteins comprising toxin-derived regions that induce cellular internalization of native CD20 molecules are desirable for the development of effective cancer and immuno-modulatory therapeutic molecules that target cells of B-cell lineages.
SUMMARY OF THE INVENTION
[0010] The present invention provides various CD20-binding proteins for inducing rapid cellular internalization of CD20, which comprise 1) a CD20 binding region, such as an immunoglobulin domain, and 2) a Shiga toxin effector region, such as a truncation of SLT-1A. Upon binding a CD20 antigen on the surface of a cell, the CD20-binding proteins of the invention are capable of inducing rapid cellular internalization of the complex comprising the CD20-binding protein and a CD20 antigen into the interior of a eukaryotic cell. The linking of CD20 binding regions with Shiga-toxin-Subunit-A-derived polypeptides enables the engineering of cytotoxic Shiga-toxin based molecules that are capable of inducing rapid cellular internalization of natively expressed CD20, as well as capable of delivering additional exogenous materials into the interior of CD20 expressing cells. The CD20-binding proteins of the invention have uses, e.g., for targeted killing of CD20 positive cell types, delivering exogenous materials, as diagnostic agents, and as therapeutics for the treatment of a variety of conditions in patients such as cancers, tumors, and immune disorders related to B-cell lineages.
[0011] A CD20-binding protein of the invention comprises (a) a CD20 binding region comprising an immunoglobulin-type binding region and capable of specifically binding an extracellular part of CD20 and (b) a Shiga toxin effector region comprising a polypeptide derived from the amino acid sequence of the A Subunit of at least one member of the Shiga toxin family; whereby upon administration of the CD20-binding protein to a cell expressing CD20 on a cellular surface, the CD20-binding protein is capable of inducing rapid cellular internalization of a protein complex comprising the CD20-binding protein bound to CD20.
[0012] For certain embodiments of the CD20-binding proteins of the present invention, the CD20 binding region comprises an immunoglobulin-type binding region comprising a polypeptide selected from the group consisting of: a complementarity determining region 3 fragment, constrained FR3-CDR3-FR4 polypeptide, single-domain antibody fragment, single-chain variable fragment, antibody variable fragment, antigen-binding fragment, Fd fragment, fibronectin-derived 10th fibronectin type III domain, tenascin type III domain, ankyrin repeat motif domain, low-density-lipoprotein-receptor-derived A-domain, lipocalin, Kunitz domain, Protein-A-derived Z domain, gamma-B crystallin-derived domain, ubiquitin-derived domain, Sac7d-derived polypeptide, Fyn-derived SH2 domain, engineered antibody mimic, and any genetically manipulated counterparts of any of the foregoing that retain CD20 binding functionality.
[0013] For certain embodiments, the CD20-binding proteins are capable of inducing rapid cellular internalization of a CD20 natively present on the surface of a cell. In certain further embodiments, the CD20-binding proteins are capable of inducing, in less than about one hour, cellular internalization of a CD20 natively present on the surface of a cell. In certain further embodiments, the CD20-binding proteins are capable of inducing, in less than about one hour, cellular internalization of a CD20 natively present on the surface of a member of a B-cell lineage.
[0014] For certain embodiments, upon administration of the CD20-binding protein to a cell which expresses CD20 on a cellular surface, the CD20-binding protein is capable of causing the death of the cell. In certain other embodiments, the CD20-binding proteins comprise Shiga toxin effector regions that lack catalytic activity and are not capable of causing the death of a cell.
[0015] For certain embodiments, upon administration of the CD20-binding protein to a first population of cells whose members express CD20, and a second population of cells whose members do not express CD20, the cytotoxic effect of the CD20-binding protein to members of the first population of cells relative to members of the second population of cells is at least 3-fold greater.
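The fold comparison described in paragraph [0015] can be illustrated with a minimal sketch. It assumes that cytotoxicity is summarized as a half-maximal cytotoxic concentration (written here as CD50) for each cell population, so that a lower value means greater potency; that metric, the function names, and all numeric values are hypothetical choices made for illustration and are not data or methods taken from this application.

```python
def fold_selectivity(cd50_negative_nm: float, cd50_positive_nm: float) -> float:
    """Return how many times more potent a protein is on CD20-expressing cells.

    Assumption: potency is expressed as a CD50 in nM, where a lower CD50
    means a greater cytotoxic effect, so the ratio is negative / positive.
    """
    if cd50_negative_nm <= 0 or cd50_positive_nm <= 0:
        raise ValueError("CD50 values must be positive")
    return cd50_negative_nm / cd50_positive_nm


def meets_fold_threshold(cd50_negative_nm: float,
                         cd50_positive_nm: float,
                         threshold: float = 3.0) -> bool:
    """Check the 'at least 3-fold greater' condition (threshold is adjustable)."""
    return fold_selectivity(cd50_negative_nm, cd50_positive_nm) >= threshold


# Hypothetical values, not measurements from this application.
example_ratio = fold_selectivity(cd50_negative_nm=300.0, cd50_positive_nm=100.0)
```

Other readouts of cytotoxic effect (for example, percent viability at a fixed concentration) would need a different ratio; the sketch covers only the concentration-based case stated above.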
[0016] For certain embodiments, the CD20-binding proteins comprise the Shiga toxin effector region comprising or consisting essentially of amino acids 75 to 251 of SEQ ID NO: 1, SEQ ID NO:25, or SEQ ID NO:26. Further embodiments are CD20-binding proteins in which the Shiga toxin effector region comprises or consists essentially of amino acids 1 to 241 of SEQ ID NO: 1, SEQ ID NO:25, or SEQ ID NO:26; amino acids 1 to 251 of SEQ ID NO:1, SEQ ID NO:25, or SEQ ID NO:26; and/or amino acids 1 to 261 of SEQ ID NO: 1, SEQ ID NO:25, or SEQ ID NO:26.
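The residue ranges recited in paragraph [0016] are 1-based and inclusive, which differs from the 0-based, end-exclusive slicing used by most programming languages. The sketch below shows the conversion. The sequence is a synthetic placeholder, not SEQ ID NO: 1, 25, or 26, and the helper name and dictionary are hypothetical.

```python
# The four effector-region ranges named in paragraph [0016] (1-based, inclusive).
EFFECTOR_RANGES = {
    "75-251": (75, 251),
    "1-241": (1, 241),
    "1-251": (1, 251),
    "1-261": (1, 261),
}


def residue_range(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of a sequence, counted 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("range lies outside the sequence")
    # Convert to Python's 0-based, end-exclusive slice.
    return sequence[start - 1:end]


# Synthetic 300-residue placeholder; NOT a sequence from the Sequence Listing.
placeholder = "ACDEFGHIKLMNPQRSTVWY" * 15

fragment_lengths = {
    name: len(residue_range(placeholder, start, end))
    for name, (start, end) in EFFECTOR_RANGES.items()
}
```

Under this convention "amino acids 75 to 251" spans 251 - 75 + 1 = 177 residues, which is the value stored under the "75-251" key of `fragment_lengths`.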
[0017] For certain embodiments, the CD20-binding protein comprises or consists essentially of amino acids of SEQ ID NO:4, SEQ ID NO:12, SEQ ID NO:14, or SEQ ID NO: 16.
[0018] In certain embodiments, the CD20-binding proteins comprise the CD20 binding region comprising: (a) a heavy chain variable domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8, respectively, and a light chain variable domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in SEQ ID NO:9, SEQ ID NO: 10, and SEQ ID NO: 11, respectively; (b) a heavy chain variable domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:23, respectively, and a light chain variable domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in SEQ ID NO:24, SEQ ID NO: 10, and SEQ ID NO: 11, respectively; or (c) a heavy chain variable (VH) domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:27, respectively, and a light chain variable (VL) domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in SEQ ID NO:28, SEQ ID NO: 10, and SEQ ID NO:29, respectively. Further embodiments are CD20-binding proteins comprising the immunoglobulin-type binding region comprising or consisting essentially of amino acids 2-245 of SEQ ID NO:4. Further embodiments are CD20-binding proteins comprising the immunoglobulin-type binding region comprising or consisting essentially of amino acids 2-245 of SEQ ID NO:4 and the Shiga toxin effector region comprising or consisting essentially of amino acids 75-251 of SEQ ID NO: 1. Further embodiments are CD20-binding proteins comprising or consisting essentially of SEQ ID NO:4 or SEQ ID NO: 16.
[0019] In certain embodiments, the CD20-binding proteins comprise Shiga toxin effector regions which comprise a mutation relative to a naturally occurring A Subunit of a member of the Shiga toxin family which changes the enzymatic activity of the Shiga toxin effector region, the mutation selected from at least one amino acid residue deletion or substitution.
[0020] Certain embodiments of the CD20-binding proteins can also be utilized for the delivery of additional exogenous material into a cell that expresses CD20 on a cellular surface. These embodiments comprise (a) a CD20 binding region comprising an immunoglobulin-type polypeptide capable of specifically binding an extracellular part of a CD20 molecule, (b) a Shiga toxin effector region comprising a polypeptide derived from the amino acid sequence of at least one member of the Shiga toxin family, and (c) an additional exogenous material; whereby upon administration of the CD20-binding protein to a cell expressing CD20 on a cellular surface, the CD20-binding protein is capable of inducing rapid cellular internalization of a protein complex comprising the CD20-binding protein bound to CD20 and capable of delivering the additional exogenous material into the interior of the cell. In certain further embodiments, the CD20-binding proteins comprise the CD20 binding region comprising: (a) a heavy chain variable domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8, respectively, and a light chain variable domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in SEQ ID NO:9, SEQ ID NO: 10, and SEQ ID NO: 11, respectively; (b) a heavy chain variable domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:23, respectively, and a light chain variable domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in SEQ ID NO:24, SEQ ID NO: 10, and SEQ ID NO: 11, respectively; or (c) a heavy chain variable (VH) domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:27, respectively, and a light chain variable (VL) domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in SEQ ID NO:28, SEQ ID NO: 10, and SEQ ID NO:29, respectively.
[0021] In certain embodiments, the additional exogenous material is selected from the group consisting of peptides, polypeptides, proteins, and polynucleotides. In certain embodiments, the additional exogenous material comprises a protein or polypeptide comprising an enzyme. In certain other embodiments, the additional exogenous material is a nucleic acid, such as, e.g., a ribonucleic acid that functions as a small inhibiting RNA (siRNA) or microRNA (miRNA).
[0022] In certain embodiments, the additional exogenous material is a peptide and the peptide is an antigen. In certain embodiments, the additional exogenous material is an antigen derived from a bacterial protein. In certain other embodiments, the antigen is derived from a protein mutated in cancer. Further embodiments are ones in which the antigen is derived from a protein aberrantly expressed in cancer. Still further embodiments are ones in which the antigen is derived from a T-cell complementarity determining region.
[0023] For certain embodiments, the antigen is included within the CD20-binding protein as part of a polypeptide fusion in which the peptide antigen is located between the binding region and the toxin effector region of a single-chain protein. In certain embodiments, the additional exogenous material is an antigen derived from a viral protein. In certain embodiments, the antigen comprises or consists essentially of SEQ ID NO:3, the influenza Matrix 58-66 antigen. In certain further embodiments, the CD20-binding protein comprises or consists essentially of SEQ ID NO: 16.
[0024] The invention also includes pharmaceutical compositions comprising a CD20-binding protein of the present invention and at least one pharmaceutically acceptable excipient or carrier; and the use of such a cytotoxic protein or a composition comprising it in methods of the invention as further described herein.
[0025] The present invention also provides polynucleotides that encode the CD20-binding proteins of the invention, expression vectors that comprise the polynucleotides of the invention, as well as host cells comprising the expression vectors of the invention.
[0026] Additionally, the present invention provides a method of rapidly inducing cellular internalization of a CD20-binding protein of the present invention into a CD20 expressing cell(s), the method comprising the step of contacting the cell(s) with a CD20-binding protein of the present invention or a pharmaceutical composition thereof. Similarly, the present invention provides a method of internalizing a cell surface localized CD20 bound by a CD20-binding protein in a patient, the method comprising the step of administering to the patient a CD20-binding protein or pharmaceutical composition of the present invention.
[0027] Additionally, the present invention provides a method of killing a CD20 expressing cell(s) comprising contacting the cell(s), either in vitro or in vivo, with a CD20-binding protein or pharmaceutical composition of the present invention.
[0028] Additionally, the present invention provides a method for delivering exogenous material to the inside of a cell(s) comprising contacting the cell(s), either in vitro or in vivo, with a CD20-binding protein or pharmaceutical composition of the present invention.
[0029] The present invention further provides a method for delivering exogenous material to the inside of a cell(s) in a patient, wherein the cell expresses CD20 on its surface, the method comprising the step of administering to the patient a CD20-binding protein of the present invention.
[0030] Additionally, the present invention provides methods of killing cells comprising the step of contacting the cell with a CD20-binding protein of the invention or a pharmaceutical composition of the invention. In certain embodiments of the cell killing method, the step of contacting the cell(s) occurs in vitro. In certain other embodiments, the step of contacting the cell(s) occurs in vivo.
[0031] Also, the present invention provides a method of treating a disease, disorder, or condition in patients comprising the step of administering to a patient in need thereof a therapeutically effective amount of a CD20-binding protein of the invention or a pharmaceutical composition of the invention. In certain embodiments of the treating method, the disease, disorder, or condition to be treated using this method of the invention involves a cell(s) or cell type(s) which express CD20 on a cellular surface, such as, e.g., a cancer cell, a tumor cell, or an immune cell. A further embodiment is a method of treating a disease involving a cancer or tumor cell associated with the disease selected from the group consisting of: bone cancer, leukemia, lymphoma, melanoma, and myeloma. In certain embodiments of this method, the disorder is an immune disorder associated with a disease selected from the group consisting of: amyloidosis, ankylosing spondylitis, asthma, Crohn's disease, diabetes, graft rejection, graft-versus-host disease, Hashimoto's thyroiditis, hemolytic uremic syndrome, HIV-related diseases, lupus erythematosus, multiple sclerosis, polyarteritis, psoriasis, psoriatic arthritis, rheumatoid arthritis, scleroderma, septic shock, Sjögren's syndrome, ulcerative colitis, and vasculitis.
[0032] Among certain embodiments of the present invention is the use of a CD20-binding protein of the invention in the manufacture of a medicament for the treatment or prevention of a cancer or immune disorder. Among certain embodiments of the present invention is a cytotoxic protein or a pharmaceutical composition comprising said protein for use in the treatment or prevention of a cancer, tumor, or immune disorder.
[0033] These and other features, aspects and advantages of the present invention will become better understood with regard to the following description, appended claims, and accompanying figures. The aforementioned elements of the invention may be combined or removed freely in order to make other embodiments, without any statement objecting to such combination or removal hereinafter.
BRIEF DESCRIPTION OF THE FIGURES
[0034] FIG. 1 shows the general architecture of exemplary CD20-binding proteins of the present invention.
[0035] FIG. 2 graphically shows the change in total body luminescence with the administration of αCD20scFv::SLT-1A version 1 and αCD20scFv::SLT-1A version 2 in a disseminated Raji-luc xenograft model.
[0036] FIG. 3 graphically shows the increased survival of Raji-luc xenograft model mice with the administration of αCD20scFv::SLT-1A version 1 and αCD20scFv::SLT-1A version 2.
[0037] FIG. 4 graphically shows the change in tumor volume with the administration of αCD20scFv::SLT-1A version 1 and αCD20scFv::SLT-1A version 2 in a Raji subcutaneous xenograft model.
[0038] FIG. 5 shows B-cell depletion in a non-human primate study with the administration of αCD20scFv::SLT-1A version 1. Specifically, the subsets of CD20+ B-cells that expressed CD21 were analyzed.
[0039] FIG. 6 shows B-cell depletion in a non-human primate study with the administration of αCD20scFv::SLT-1A version 1. Specifically, the subsets of CD20+ B-cells that did not express CD21 were analyzed.
DETAILED DESCRIPTION OF THE INVENTION
[0040] The present invention is described more fully hereinafter using illustrative, non-limiting embodiments, and references to the accompanying figures. This invention may, however, be embodied in many different forms and should not be construed as to be limited to the embodiments set forth below. Rather, these embodiments are provided so that this disclosure is thorough and conveys the scope of the invention to those skilled in the art.
[0041] In order that the present invention may be more readily understood, certain terms are defined below. Additional definitions may be found within the detailed description of the invention.
[0042] As used in the specification and the appended claims, the terms "a," "an" and "the" include both singular and plural referents unless the context clearly dictates otherwise.
[0043] As used in the specification and the appended claims, the term "and/or" when referring to two species, A and B, means at least one of A and B. As used in the specification and the appended claims, the term "and/or" when referring to greater than two species, such as A, B, and C, means at least one of A, B, or C, or at least one of any combination of A, B, or C (with each species in singular or multiple possibility).
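One way to read the "and/or" definition in paragraph [0043] is as the set of all non-empty selections from the listed species. The sketch below enumerates those selections. It is an interpretation offered for illustration only: the function name is hypothetical, and the sketch ignores the "singular or multiple possibility" of each species, treating each as simply present or absent.

```python
from itertools import combinations


def and_or(species):
    """Return every non-empty combination of the listed species.

    Each species is treated as present or absent; multiplicity is ignored.
    """
    items = list(species)
    return [
        set(combo)
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
    ]


# Two species: A alone, B alone, or both.
two_species = and_or(["A", "B"])

# Three species: any single species, any pair, or all three.
three_species = and_or(["A", "B", "C"])
```

For n listed species this reading yields 2^n - 1 selections, so three for "A and/or B" and seven for "A, B, and/or C".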
[0044] Throughout this specification, the word "comprise" or variations such as "comprises" or "comprising" will be understood to imply the inclusion of a stated integer (or components) or group of integers (or components), but not the exclusion of any other integer (or components) or group of integers (or components).
[0045] Throughout this specification, the term "including" is used to mean "including but not limited to." "Including" and "including but not limited to" are used interchangeably.
[0046] The term "amino acid residue" or "amino acid" includes reference to an amino acid that is incorporated into a protein, polypeptide, or peptide. The term "polypeptide" includes any polymer of amino acids or amino acid residues. The term "polypeptide sequence" refers to a series of amino acids or amino acid residues from which a polypeptide is physically composed. A "protein" is a macromolecule comprising one or more polypeptide chains. A "peptide" is a small polypeptide of fewer than a total of 15-20 amino acid residues.
[0047] The terms "amino acid," "amino acid residue," or "polypeptide sequence" include naturally occurring amino acids and, unless otherwise limited, also include known analogs of natural amino acids that can function in a similar manner as naturally occurring amino acids. The amino acids referred to herein are described by shorthand designations as follows in Table A:
TABLE-US-00001
TABLE A Amino Acid Nomenclature
Name                         3-letter  1-letter
Alanine                      Ala       A
Arginine                     Arg       R
Asparagine                   Asn       N
Aspartic Acid or Aspartate   Asp       D
Cysteine                     Cys       C
Glutamic Acid or Glutamate   Glu       E
Glutamine                    Gln       Q
Glycine                      Gly       G
Histidine                    His       H
Isoleucine                   Ile       I
Leucine                      Leu       L
Lysine                       Lys       K
Methionine                   Met       M
Phenylalanine                Phe       F
Proline                      Pro       P
Serine                       Ser       S
Threonine                    Thr       T
Tryptophan                   Trp       W
Tyrosine                     Tyr       Y
Valine                       Val       V
[0048] The phrase "conservative substitution" with regard to a polypeptide refers to a change in the amino acid composition of the polypeptide that does not substantially alter the function and structure of the overall polypeptide (see Creighton, Proteins: Structures and Molecular Properties (W. H. Freeman and Company, New York, 2nd ed., 1992)).
[0049] As used herein, the term "expressed," "expressing" or "expresses" refers to translation of a polynucleotide or nucleic acid into a polypeptide or protein. The expressed polypeptide or protein may remain intracellular, become a component of the cell surface membrane or be secreted into an extracellular space.
[0050] As used herein, the symbol ".alpha." is shorthand for an immunoglobulin-type binding region capable of binding to the biomolecule following the symbol. The symbol ".alpha." is used to refer to the functional characteristic of an immunoglobulin-type binding region based on its capability of binding to the biomolecule following the symbol.
[0051] The term "selective cytotoxicity" with regard to the cytotoxic activity of a CD20-binding protein refers to the relative levels of cytotoxicity between a targeted cell population and a non-targeted bystander cell population, which can be expressed as a ratio of the half-maximal cytotoxic concentration (CD.sub.50) for a targeted cell type over the CD.sub.50 for an untargeted cell type to show preferentiality of cell killing of the targeted cell type.
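The ratio described in the preceding definition can be sketched in a few lines of Python. This is only an illustration of the arithmetic as worded above (CD.sub.50 of the targeted cell type over CD.sub.50 of the untargeted cell type); the function name and the concentration values are hypothetical and are not data from this disclosure.

```python
def selectivity_ratio(cd50_targeted: float, cd50_untargeted: float) -> float:
    """Return CD50(targeted) / CD50(untargeted), following the wording of the definition.

    Both values must be in the same concentration units. Because a lower CD50
    means greater potency, a ratio below 1 indicates preferential killing of
    the targeted cell type.
    """
    if cd50_targeted <= 0 or cd50_untargeted <= 0:
        raise ValueError("CD50 values must be positive concentrations")
    return cd50_targeted / cd50_untargeted


# Hypothetical CD50 values (same units for both), for illustration only.
ratio = selectivity_ratio(cd50_targeted=0.5, cd50_untargeted=500.0)
is_selective = ratio < 1.0
```

Some reports express the same comparison as the inverse quotient (untargeted over targeted), so the direction of the ratio should be stated whenever a value is reported.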
[0052] For purposes of the present invention, the term "effector" means providing a biological activity, such as cytotoxicity, biological signaling, enzymatic catalysis, subcellular routing, and/or intermolecular binding resulting in the recruitment of one or more factors and/or allosteric effects.
[0053] For purposes of the present invention, the phrase "derived from" means the polypeptide region comprises amino acid sequences originally found in a protein and may now comprise additions, deletions, truncations, or other alterations from the original sequence such that overall function and structure is substantially conserved.
INTRODUCTION
[0054] The present invention solves problems for engineering therapeutics targeting CD20 that require cell internalization for function because Shiga-toxin-Subunit-A derived effector regions induce the cellular internalization of CD20. The present invention provides CD20-binding proteins that bind to extracellular CD20 antigens and rapidly internalize CD20 from a cell membrane location to the interior of a cell. Certain of the disclosed CD20-binding proteins kill cells which express CD20 on their surface. Certain of the disclosed CD20-binding proteins are capable of precisely delivering additional exogenous material in the form of molecular cargos to the interior of cells which express CD20 on their surface. The present invention expands the universe of immunotoxin-druggable targets to include CD20 and enables the precise delivery of payloads to the interiors of CD20 expressing cells.
I. The General Structure of the CD20-Binding Proteins of the Invention
[0055] The present invention provides various CD20-binding proteins for the selective killing of specific cell types, each CD20-binding protein comprising 1) a CD20 binding region comprising immunoglobulin-type binding regions for cell targeting and 2) a Shiga toxin effector region for cell killing. The linking of CD20 targeting immunoglobulin-type binding regions with Shiga-toxin-Subunit-A-derived regions enables the engineering of cell-type specific targeting of the potent Shiga toxin cytotoxicity. This system is modular, in that various Shiga toxin effector regions and additional exogenous materials may be linked to the same CD20 binding region to provide diverse applications involving CD20 expressing cells. CD20-binding proteins of the invention comprise Shiga toxin effector regions derived from the A Subunits of members of the Shiga toxin family linked to immunoglobulin-type CD20 binding regions which can bind specifically to at least one extracellular part of a CD20 molecule spanning the outer cell membrane of a eukaryotic cell. This general structure is modular in that various CD20 binding regions can be linked to Shiga-toxin-Subunit-A derived effector regions at various positions or with different linkers between them to produce variations of the same general structure (see e.g. FIG. 1).
A. CD20 Binding Regions Comprising an Immunoglobulin-Type Binding Region
[0056] For purposes of the present invention, the term "CD20 binding region" refers to a polypeptide region capable of specifically binding an extracellular part of a CD20 molecule. While the name CD20 might refer to multiple proteins with related structures and polypeptide sequences from various species, for the purposes of the present invention the term "CD20" refers to the B-lymphocyte antigen CD20 proteins present in mammals whose exact sequence might vary slightly based on the isoform and from individual to individual. For example, in humans CD20 refers to the protein represented by the predominant polypeptide sequence UniProt P11836 and NCBI accession NP_690605.1; however, different isoforms and variants may exist. The polypeptide sequences of various CD20 proteins have been described in various species, such as bats, cats, cattle, dogs, mice, marmosets, and rats, and can be predicted by bioinformatics in numerous other species based on genetic homology (e.g. CD20 has been predicted in various primates, including baboons, macaques, gibbons, chimpanzees, and gorillas) (see Zuccolo J et al., PLoS One 5: e9369 (2010) and NCBI protein database (National Center for Biotechnology Information, U.S.)). A skilled worker will be able to identify a CD20 protein in mammals, even if it differs from the referenced sequences slightly.
[0057] An extracellular part of a CD20 molecule refers to a portion of its structure exposed to the extracellular environment when the CD20 molecule is natively present in a cell membrane. In this context, exposed to the extracellular environment means that part of the CD20 molecule is accessible by, e.g., an antibody or at least a binding moiety smaller than an antibody such as a single-domain antibody domain, a nanobody, a heavy-chain antibody domain derived from camelids or cartilaginous fishes, a single-chain variable fragment, or any number of engineered alternative scaffolds to immunoglobulins (see below). The exposure of a part of CD20 may be empirically determined by the skilled worker using methods known in the art. Note that some portion of CD20, which was predicted not to be accessible to an antibody in the extracellular space based on its location within CD20, was empirically shown to be accessible by a monoclonal antibody (Teeling J et al., J. Immunol. 177: 362-71 (2006)).
[0058] CD20 binding regions are commonly derived from antibody or antibody-like structures; however, alternative scaffolds from other sources are contemplated within the scope of the term. In certain embodiments, the CD20 binding region is derived from an immunoglobulin-derived binding region, such as an antibody paratope. In certain other embodiments, the CD20 binding region comprises an immunoglobulin-type binding region that is an engineered polypeptide not derived from any immunoglobulin domain. There are numerous immunoglobulin-derived binding regions contemplated as components in the present invention.
[0059] CD20-binding proteins of the invention comprise an immunoglobulin-type binding region comprising one or more polypeptides capable of selectively and specifically binding an extracellular part of CD20. The term "immunoglobulin-type binding region" as used herein refers to a polypeptide region capable of binding one or more target biomolecules, such as an antigen or epitope. Immunoglobulin-type binding regions are functionally defined by their ability to bind to target molecules, and all the immunoglobulin-type binding regions of the present invention are capable of binding CD20. Immunoglobulin-type binding regions are commonly derived from antibody or antibody-like structures; however, alternative scaffolds from other sources are contemplated within the scope of the term.
[0060] Immunoglobulin (Ig) proteins have a structural domain known as an Ig domain. Ig domains range in length from about 70-110 amino acid residues and possess a characteristic Ig-fold, in which typically 7 to 9 antiparallel beta strands arrange into two beta sheets which form a sandwich-like structure. The Ig fold is stabilized by hydrophobic amino acid interactions on inner surfaces of the sandwich and highly conserved disulfide bonds between cysteine residues in the strands. Ig domains may be variable (IgV or V-set), constant (IgC or C-set) or intermediate (IgI or I-set). Some Ig domains may be associated with a complementarity determining region (CDR), also referred to as antigen binding region (ABR), which is important for the specificity of antibodies binding to their epitopes. Ig-like domains are also found in non-immunoglobulin proteins and are classified on that basis as members of the Ig superfamily of proteins. The HUGO Gene Nomenclature Committee (HGNC) provides a list of members of the Ig-like domain containing family.
[0061] An immunoglobulin-type binding region may be a polypeptide sequence of antibody or antigen-binding fragment thereof wherein the amino acid sequence has been varied from that of a native antibody or an Ig-like domain of a non-immunoglobulin protein, for example by molecular engineering or selection by library screening. Because of the relevance of recombinant DNA techniques and in vitro library screening in the generation of immunoglobulin-type binding regions, antibodies can be redesigned to obtain desired characteristics, such as smaller size, cell entry, or other therapeutic improvements. The possible variations are many and may range from the changing of just one amino acid to the complete redesign of, for example, a variable region. Typically, changes in the variable region will be made in order to improve the antigen-binding characteristics, improve variable region stability, or reduce the potential for immunogenic responses.
[0062] There are numerous immunoglobulin-type binding regions that bind an extracellular part of CD20 contemplated in the present invention. In certain embodiments, the immunoglobulin-type binding region is derived from an immunoglobulin binding region, such as an antibody paratope capable of binding an extracellular part of CD20. In certain other embodiments, the immunoglobulin-type binding region comprises an engineered polypeptide not derived from any immunoglobulin domain but that functions like an immunoglobulin binding region by providing high-affinity binding to an extracellular part of CD20. This engineered polypeptide may optionally include polypeptide scaffolds comprising or consisting essentially of complementarity determining regions from immunoglobulins as described herein.
[0063] There are numerous immunoglobulin-derived binding regions and non-immunoglobulin engineered polypeptides in the prior art that are useful for targeting the CD20-binding proteins of the invention to CD20 expressing cells. In certain embodiments, the immunoglobulin-type binding region of the present CD20-binding proteins is selected from the group which includes single-domain antibody domains (sdAb), nanobodies, heavy-chain antibody domains derived from camelids (V.sub.HH fragments), bivalent nanobodies, heavy-chain antibody domains derived from cartilaginous fishes, immunoglobulin new antigen receptors (IgNARs), VNAR fragments, single-chain variable (scFv) fragments, multimerizing scFv fragments (diabodies, triabodies, tetrabodies), bispecific tandem scFv fragments, disulfide stabilized antibody variable (Fv) fragments, disulfide stabilized antigen-binding (Fab) fragments consisting of the V.sub.L, V.sub.H, C.sub.L and C.sub.H1 domains, divalent F(ab')2 fragments, Fd fragments consisting of the heavy chain and C.sub.H1 domains, single chain Fv-C.sub.H3 minibodies, bispecific minibodies, dimeric C.sub.H2 domain fragments (C.sub.H2D), Fc antigen binding domains (Fcabs), isolated complementarity determining region 3 (CDR3) fragments, constrained framework region 3, CDR3, framework region 4 (FR3-CDR3-FR4) polypeptides, small modular immunopharmaceutical (SMIP) domains, and any genetically manipulated counterparts of the foregoing that retain their paratope and binding function (see Weiner L, Cell 148: 1081-4 (2012); Ahmad Z et al., Clin Dev Immunol 2012: 980250 (2012), for reviews).
[0064] In accordance with certain other embodiments, the immunoglobulin-type binding region of the CD20-binding proteins of the invention may include engineered, alternative scaffolds to immunoglobulin domains that exhibit similar functional characteristics, such as high-affinity and specific binding to CD20, and enable the engineering of improved characteristics, such as greater stability or reduced immunogenicity. For certain embodiments of the CD20-binding proteins of the invention, the immunoglobulin-type binding region is selected from the group which includes engineered, fibronectin-derived, 10.sup.th fibronectin type III (10Fn3) domains (monobodies, AdNectins.TM., or AdNexins.TM.); engineered, tenascin-derived, tenascin type III domains (Centryns.TM.); engineered, ankyrin repeat motif containing polypeptides (DARPins.TM.); engineered, low-density-lipoprotein-receptor-derived, A domains (LDLR-A) (Avimers.TM.); lipocalins (anticalins); engineered, protease inhibitor-derived, Kunitz domains; engineered, Protein-A-derived, Z domains (Affibodies.TM.); engineered, gamma-B crystallin-derived scaffolds or engineered, ubiquitin-derived scaffolds (Affilins); Sac7d-derived polypeptides (Nanoffitins.RTM. or affitins); engineered, Fyn-derived, SH2 domains (Fynomers.RTM.); and engineered antibody mimics and any genetically manipulated counterparts of the foregoing that retain their binding functionality (Worn A, Pluckthun A, J Mol Biol 305: 989-10 (2001); Xu L et al., Chem Biol 9: 933-42 (2002); Wikman M et al., Protein Eng Des Sel 17: 455-62 (2004); Binz H et al., Nat Biotechnol 23: 1257-68 (2005); Holliger P, Hudson P, Nat Biotechnol 23: 1126-36 (2005); Gill D, Damle N, Curr Opin Biotech 17: 653-8 (2006); Koide A, Koide S, Methods Mol Biol 352: 95-109 (2007)).
[0065] Non-limiting examples of protein constructs encompassed within the term "binding region" as used herein include: (i) an Fab fragment, a monovalent fragment consisting of the VL, VH, CL and CH1 domains; (ii) an F(ab')2 fragment, a bivalent fragment comprising two Fab fragments linked by a disulfide bridge at the hinge region; (iii) an Fd fragment consisting of the VH and CH1 domains; (iv) an Fv fragment consisting of the VL and VH domains of a single arm of an antibody; (v) a dAb fragment, which consists of a VH domain; and (vi) an isolated CDR. Furthermore, although the two domains of the Fv fragment, VL and VH, are coded for by separate genes, they may be recombinantly joined by a synthetic linker, creating a single protein chain in which the VL and VH domains pair to form monovalent molecules (known as single chain Fv (scFv)). The most commonly used linker is a 15-residue (Gly4Ser)3 peptide, but other linkers are also known in the art. Single chain antibodies are also intended to be encompassed within the term "binding region" as used herein.
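As a minimal sketch of the single-chain arrangement just described, the following Python builds the 15-residue (Gly4Ser)3 linker in one-letter code and joins two variable-domain sequences through it. The function names, the VH-linker-VL orientation, and the short placeholder domain strings are assumptions made for illustration; they are not sequences from this disclosure.

```python
def gly_ser_linker(repeats: int = 3, glycines: int = 4) -> str:
    """Build a (Gly_n Ser)_m linker in one-letter code; defaults give (Gly4Ser)3."""
    if repeats < 1 or glycines < 1:
        raise ValueError("repeats and glycines must be at least 1")
    return ("G" * glycines + "S") * repeats


def assemble_scfv(vh: str, vl: str, linker: str) -> str:
    """Join VH and VL through a linker (orientation VH-linker-VL is assumed here)."""
    return vh + linker + vl


# Placeholder fragments only, NOT actual variable-domain sequences.
VH_PLACEHOLDER = "QVQLQ"
VL_PLACEHOLDER = "DIQMT"

linker = gly_ser_linker()
scfv = assemble_scfv(VH_PLACEHOLDER, VL_PLACEHOLDER, linker)
```

The opposite orientation (VL-linker-VH) can be obtained by swapping the first two arguments.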
[0066] It is also anticipated that alternative scaffolds that provide binding function are within the scope of the term "binding region" as used herein. Some examples of the alternative scaffolds include diabodies, a CDR3 peptide, a constrained FR3-CDR3-FR4 peptide, a nanobody (U.S. patent application publication 2008/0107601), a bivalent nanobody, small modular immunopharmaceuticals (SMIPs), a shark variable IgNAR domain (WO 03/014161), a minibody and any fragment or chemically or genetically manipulated counterparts that retain target molecule binding function.
[0067] An "antibody-derived sequence" as used herein, means an amino acid sequence of an antibody or antigen-binding fragment thereof wherein the amino acid sequence has been varied from that of a native antibody. Because of the relevance of recombinant DNA techniques in the generation of antibodies, antibodies can be redesigned to obtain desired characteristics. The possible variations are many and range from the changing of just one or a few amino acids to the complete redesign of, for example, a variable region. Typically, changes in the variable region will be made in order to improve the antigen-binding characteristics, improve variable region stability, or reduce the risk of immunogenicity.
[0068] As used herein, the term "heavy chain variable (VH) domain" or "light chain variable (VL) domain" respectively refer to any native antibody VH or VL domain (e.g., a human VH or VL domain) as well as any derivative thereof retaining at least qualitative antigen binding ability of the corresponding native antibody (e.g., a humanized VH or VL domain derived from a native murine VH or VL domain). A VH or VL domain consists of a "framework" region interrupted by the three CDRs. The framework regions serve to align the CDRs for specific binding to an epitope of an antigen. From amino-terminus to carboxyl-terminus, both VH and VL domains comprise the following framework (FR) and CDR regions: FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4. The assignment of amino acids to each domain is in accordance with the definitions of Kabat, Sequences of Proteins of Immunological Interest (5th ed., National Institutes of Health, Bethesda, Md., 1991), or Chothia and Lesk, J. Mol. Biol. 196: 901-17 (1987); Chothia et al., Nature 342:878-83, (1989). CDRs 1, 2, and 3 of a VH domain are also referred to herein, respectively, as HCDR1, HCDR2, and HCDR3; CDRs 1, 2, and 3 of a VL domain are also referred to herein, respectively, as LCDR1, LCDR2, and LCDR3.
[0069] In some embodiments of the CD20-binding proteins of the present invention, the binding region comprises an antibody or an antibody-derived sequence that comprises a specific set of complementarity determining regions, or CDRs. CDRs are defined sequence regions within the variable domains of antibodies that are necessary for specific binding of the antibody to its antigenic determinants. In one embodiment of the invention, the set of CDRs comprises three CDRs derived from the heavy chain of the antibody and three CDRs derived from the light chain of the antibody. In some embodiments, the binding region comprises: (a) a heavy chain variable domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8, respectively, and a light chain variable domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in SEQ ID NO:9, SEQ ID NO: 10, and SEQ ID NO: 11, respectively; (b) a heavy chain variable domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:23, respectively, and a light chain variable domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in SEQ ID NO:24, SEQ ID NO: 10, and SEQ ID NO: 11, respectively; or (c) a heavy chain variable (VH) domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in SEQ ID NO:21, SEQ ID NO:22, and SEQ ID NO:27, respectively, and a light chain variable (VL) domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in SEQ ID NO:28, SEQ ID NO:10, and SEQ ID NO:29, respectively. Additionally, in certain embodiments of the invention, the binding region comprises or consists essentially of amino acids 2 to 245 of SEQ ID NO:4.
[0070] This system is modular, in that various, diverse immunoglobulin-type binding regions can be used with the same Shiga toxin effector region to target different extracellular epitopes of CD20. It will be appreciated by the skilled worker that any CD20 binding region of an immunoglobulin type capable of binding an extracellular part of CD20 may be used to design or select an immunoglobulin-type binding region to be linked to the Shiga toxin effector region to produce a CD20-binding protein of the invention.
B. Shiga Toxin Effector Regions Derived from A Subunits of Members of the Shiga Toxin Family
[0071] For purposes of the present invention, the phrase "Shiga toxin effector region" refers to a polypeptide region derived from a Shiga toxin A Subunit of a member of the Shiga toxin family that is capable of inactivating ribosomes and effectuating cytotoxicity and/or cytostatic effects. A member of the Shiga toxin family refers to any member of a family of naturally occurring protein toxins which are structurally and functionally related, notably toxins isolated from S. dysenteriae and E. coli (Johannes, Nat Rev Microbiol 8: 105-16 (2010)). For example, the Shiga toxin family encompasses true Shiga toxin (Stx) isolated from S. dysenteriae serotype 1, Shiga-like toxin 1 variants (SLT1 or Stx1 or SLT-1 or Slt-I) isolated from serotypes of enterohemorrhagic E. coli, and Shiga-like toxin 2 variants (SLT2 or Stx2 or SLT-2) isolated from serotypes of enterohemorrhagic E. coli. SLT1 differs by only one residue from Stx, and both have been referred to as Verocytotoxins or Verotoxins (VTs) (O'Brien, Curr Top Microbiol Immunol 180: 65-94 (1992)). Although SLT1 and SLT2 variants are only about 53-60% similar to each other at the amino acid sequence level, they share mechanisms of enzymatic activity and cytotoxicity common to the members of the Shiga toxin family (Johannes, Nat Rev Microbiol 8: 105-16 (2010)). Over 39 different Shiga toxins have been described, such as the defined subtypes Stx1a, Stx1c, Stx1d, and Stx2a-g (Scheutz F et al., J Clin Microbiol 50: 2951-63 (2012)). Members of the Shiga toxin family are not naturally restricted to any bacterial species because Shiga-toxin-encoding genes can spread among bacterial species via horizontal gene transfer (Strauch E et al., Infect Immun 69: 7588-95 (2001); Zhaxybayeva O, Doolittle W, Curr Biol. 21: R242-6 (2011)). As an example of interspecies transfer, a Shiga toxin was discovered in a strain of A. haemolyticus isolated from a patient (Grotiuz G et al., J Clin Microbiol 44: 3838-41 (2006)). Once a Shiga toxin encoding polynucleotide enters a new subspecies or species, the Shiga toxin amino acid sequence is presumed to be capable of developing slight sequence variations due to genetic drift and/or selective pressure while still maintaining a mechanism of cytotoxicity common to members of the Shiga toxin family (see Scheutz, J Clin Microbiol 50: 2951-63 (2012)).
[0072] Shiga toxin effector regions of the invention comprise or consist essentially of a polypeptide derived from a Shiga toxin A Subunit dissociated from any form of its native Shiga toxin B Subunit. In addition, the CD20-binding proteins of the present invention do not comprise any polypeptide comprising or consisting essentially of a functional binding domain of a Shiga toxin B subunit. Rather, the Shiga toxin A Subunit derived regions are functionally associated with heterologous CD20 binding regions to effectuate cell targeting to CD20 expressing cells.
[0073] In certain embodiments, a Shiga toxin effector region of the invention may comprise or consist essentially of a full length Shiga toxin A Subunit (e.g. SLT-1A (SEQ ID NO: 1), StxA (SEQ ID NO:25), or SLT-2A (SEQ ID NO:26)), noting that naturally occurring Shiga toxin A Subunits may comprise precursor forms containing signal sequences of about 22 amino acids at their amino-termini which are removed to produce mature Shiga toxin A Subunits. One specific example of a "toxin effector region" is one that is derived from the A chain of Shiga-like toxin 1 (SLT-1) (SEQ ID NO: 1). The A chain of SLT-1 is composed of 293 amino acids with the enzymatic (toxic) domain spanning residues 1 to 239. In other embodiments, the Shiga toxin effector region of the invention comprises or consists essentially of a truncated Shiga toxin A Subunit which is shorter than a full-length Shiga toxin A Subunit.
[0074] Shiga-like toxin 1 A Subunit truncations are catalytically active, capable of enzymatically inactivating ribosomes in vitro, and cytotoxic when expressed within a cell (LaPointe, J Biol Chem 280: 23310-18 (2005)). The smallest Shiga toxin A Subunit fragment exhibiting full enzymatic activity is a polypeptide composed of residues 1-239 of Slt1A (LaPointe, J Biol Chem 280: 23310-18 (2005)). Although the smallest fragment of the Shiga toxin A Subunit reported to retain substantial catalytic activity was residues 75-247 of StxA (Al-Jaufy, Infect Immun 62: 956-60 (1994)), a StxA truncation expressed de novo within a eukaryotic cell requires only up to residue 240 to reach the cytosol and exert catalytic inactivation of ribosomes (LaPointe, J Biol Chem 280: 23310-18 (2005)).
[0075] Shiga toxin effector regions may commonly be smaller than the full length A subunit. It is preferred that the Shiga toxin effector region maintain the polypeptide region from amino acid position 77 to 239 (SLT-1A (SEQ ID NO: 1), StxA (SEQ ID NO:25), or SLT-2A (SEQ ID NO:26)) or the equivalent in other A Subunits of members of the Shiga toxin family. For example, in certain embodiments of the invention, a Shiga toxin effector region derived from SLT-1A may comprise or consist essentially of amino acids 75 to 251 of SEQ ID NO:1, 1 to 241 of SEQ ID NO: 1, 1 to 251 of SEQ ID NO: 1, or amino acids 1 to 261 of SEQ ID NO: 1. Similarly, among certain other embodiments, the Shiga toxin effector regions derived from StxA may comprise or consist essentially of amino acids 75 to 251 of SEQ ID NO:25, 1 to 241 of SEQ ID NO:25, 1 to 251 of SEQ ID NO:25, or amino acids 1 to 261 of SEQ ID NO:25. Additionally, among certain other embodiments, the Shiga toxin effector regions derived from SLT-2A may comprise or consist essentially of amino acids 75 to 251 of SEQ ID NO:26, 1 to 241 of SEQ ID NO:26, 1 to 251 of SEQ ID NO:26, or amino acids 1 to 261 of SEQ ID NO:26.
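The residue ranges recited above use 1-indexed, inclusive numbering, which is easy to mishandle when slicing a sequence programmatically. The sketch below shows one way to extract such ranges and to check that each one retains the preferred region from position 77 to 239. The helper names are hypothetical, and the 293-character string is a stand-in for an A Subunit sequence, not an actual sequence from the sequence listing.

```python
def residue_range(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-indexed, inclusive numbering."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"range {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


def covers(fragment: tuple, core: tuple) -> bool:
    """True if the fragment range fully contains the core range."""
    return fragment[0] <= core[0] and fragment[1] >= core[1]


# Ranges recited in the paragraph above (start, end).
TRUNCATIONS = [(75, 251), (1, 241), (1, 251), (1, 261)]
PREFERRED_CORE = (77, 239)

# Stand-in for a 293-residue A Subunit; NOT a real sequence.
placeholder = "X" * 293

fragment_lengths = {r: len(residue_range(placeholder, *r)) for r in TRUNCATIONS}
all_cover_core = all(covers(r, PREFERRED_CORE) for r in TRUNCATIONS)
```

With a real sequence substituted for the placeholder, the same helper returns the corresponding fragment; numbering would need to be adjusted if the input still carries a signal sequence.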
[0076] The invention further provides variants of the CD20-binding proteins of the invention, wherein the Shiga toxin effector region differs from a naturally occurring Shiga toxin A Subunit by up to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40 or more amino acid residues (but by no more than that which retains at least 85%, 90%, 95%, 99% or more amino acid sequence identity). Thus, a polypeptide region derived from an A Subunit of a member of the Shiga toxin family may comprise additions, deletions, truncations, or other alterations from the original sequence so long as at least 85%, 90%, 95%, 99% or more amino acid sequence identity is maintained to a naturally occurring Shiga toxin A Subunit.
[0077] Accordingly, in certain embodiments, the Shiga toxin effector region comprises or consists essentially of amino acid sequences having at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, 99.5% or 99.7% overall sequence identity to a naturally occurring Shiga toxin A Subunit, such as SLT-1A (SEQ ID NO: 1), StxA (SEQ ID NO:25), and/or SLT-2A (SEQ ID NO:26).
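For illustration, a percent sequence identity of the kind referred to above can be computed from two pre-aligned sequences as follows. This sketch assumes one common convention (identical, non-gap positions divided by alignment length, with "-" marking gaps); other conventions exist, and the disclosure does not specify one. The function name and example strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned, equal-length sequences.

    Assumed convention: identical non-gap positions / alignment length * 100.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


# Hypothetical ten-residue example with a single substitution at the last position.
reference = "ACDEFGHIKL"
variant = "ACDEFGHIKV"

identity = percent_identity(reference, variant)
meets_85 = identity >= 85.0
```

Producing the alignment itself is outside the scope of this sketch and would normally be done with a dedicated alignment tool.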
[0078] Optionally, either a full length or a truncated version of the Shiga toxin A Subunit may comprise one or more mutations (e.g. substitutions, deletions, insertions or inversions). In certain embodiments that are potently cytotoxic, the Shiga toxin effector region has sufficient sequence identity to retain cytotoxicity after entry into a cell, either by well-known methods of host cell transformation, transfection, infection or induction, or by internalization mediated by the cell targeting, immunoglobulin-type binding region linked with the Shiga toxin effector region. The most critical residues for enzymatic activity and/or cytotoxicity in the Shiga toxin A Subunits have been mapped to the following residue-positions: asparagine-75, tyrosine-77, glutamate-167, arginine-170, and arginine-176 among others (Di, Toxicon 57: 535-39 (2011)). In any one of the embodiments of the present invention, the Shiga toxin effector region may preferably but not necessarily maintain one or more conserved amino acids at positions, such as those found at positions 77, 167, 170, and 203 in StxA, SLT-1A, or the equivalent conserved position in other members of the Shiga toxin family which are typically required for cytotoxic activity. The capacity of a CD20-binding protein of the invention to cause cell death, i.e. its cytotoxicity, may be measured using any one or more of a number of assays well known in the art.
[0079] In certain embodiments of the invention, one or more amino acid residues may be mutated or deleted in order to reduce or eliminate cytotoxic activity of the Shiga toxin effector region. The cytotoxicity of the A Subunits of members of the Shiga toxin family may be reduced or eliminated by mutation or truncation. The positions labeled tyrosine-77, glutamate-167, arginine-170, tyrosine-114, and tryptophan-203 have been shown to be important for the catalytic activity of Stx, Stx1, and Stx2 (Hovde C et al., Proc Natl Acad Sci USA 85: 2568-72 (1988); Deresiewicz R et al., Biochemistry 31: 3272-80 (1992); Deresiewicz R et al., Mol Gen Genet 241: 467-73 (1993); Ohmura M et al., Microb Pathog 15: 169-76 (1993); Cao C et al., Microbiol Immunol 38: 441-7 (1994); Suhan M, Hovde C, Infect Immun 66: 5252-9 (1998)). Mutating both glutamate-167 and arginine-170 eliminated the enzymatic activity of Slt-I A1 in a cell-free ribosome inactivation assay (LaPointe, J Biol Chem 280: 23310-18 (2005)). In another approach using de novo expression of Slt-I A1 in the endoplasmic reticulum, mutating both glutamate-167 and arginine-170 eliminated Slt-I A1 fragment cytotoxicity at that expression level (LaPointe, J Biol Chem 280: 23310-18 (2005)). A truncation analysis demonstrated that a fragment of StxA from residues 75 to 268 still retains significant enzymatic activity in vitro (Haddad, J Bacteriol 175: 4970-8 (1993)). A truncated fragment of Slt-I A1 containing residues 1-239 displayed significant enzymatic activity in vitro and cytotoxicity by de novo expression in the cytosol (LaPointe, J Biol Chem 280: 23310-18 (2005)). Expression of a Slt-I A1 fragment truncated to residues 1-239 in the endoplasmic reticulum was not cytotoxic because it could not retrotranslocate into the cytosol (LaPointe, J Biol Chem 280: 23310-18 (2005)).
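The residue positions named above can be checked in a candidate sequence with a short script such as the following sketch. It assumes 1-indexed numbering of the mature A Subunit and uses a synthetic 293-residue placeholder rather than a real sequence; the alanine substitutions in the example are chosen only for illustration, since the paragraph above does not state which replacement residues were used.

```python
# Positions and one-letter residues named in the paragraph above
# (tyrosine-77, tyrosine-114, glutamate-167, arginine-170, tryptophan-203).
CATALYTIC_RESIDUES = {77: "Y", 114: "Y", 167: "E", 170: "R", 203: "W"}


def altered_positions(sequence: str, expected: dict) -> list:
    """Return the sorted 1-indexed positions where the sequence lacks the expected residue."""
    altered = []
    for position, residue in sorted(expected.items()):
        if position > len(sequence) or sequence[position - 1] != residue:
            altered.append(position)
    return altered


def substitute(sequence: str, changes: dict) -> str:
    """Return a copy of the sequence with the given 1-indexed substitutions applied."""
    residues = list(sequence)
    for position, residue in changes.items():
        residues[position - 1] = residue
    return "".join(residues)


# Synthetic placeholder: 293 alanines carrying the named residues; NOT a real sequence.
placeholder = substitute("A" * 293, CATALYTIC_RESIDUES)

# Hypothetical double substitution at glutamate-167 and arginine-170.
double_mutant = substitute(placeholder, {167: "A", 170: "A"})

unaltered_report = altered_positions(placeholder, CATALYTIC_RESIDUES)
mutant_report = altered_positions(double_mutant, CATALYTIC_RESIDUES)
```

A truncated fragment that starts after position 1 would need its positions offset before applying the same check.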
[0080] For the purposes of the present invention, the specific order or orientation is not fixed for the Shiga toxin effector region and the CD20 binding region in relation to each other or the entire CD20-binding protein's N-terminal(s) and C-terminal(s) (see e.g. FIG. 1). In the above CD20-binding proteins, the CD20 binding regions and Shiga toxin effector regions may be directly linked to each other and/or suitably linked to each other via one or more intervening polypeptide sequences, such as with one or more linkers well known in the art.
II. Examples of Specific Structural Variations of the CD20-Binding Proteins of the Invention
[0081] Among certain embodiments of the present invention, the CD20-binding proteins comprise the Shiga toxin effector region comprising or consisting essentially of amino acids 75 to 251 of SLT-1A (SEQ ID NO: 1), StxA (SEQ ID NO:25), or SLT-2A (SEQ ID NO:26). Further embodiments are CD20-binding proteins in which the Shiga toxin effector region comprises or consists essentially of amino acids 1 to 241 of SLT-1A (SEQ ID NO: 1), StxA (SEQ ID NO:25), and/or SLT-2A (SEQ ID NO:26). Further embodiments are CD20-binding proteins in which the Shiga toxin effector region comprises or consists essentially of amino acids 1 to 251 of SLT-1A (SEQ ID NO: 1), StxA (SEQ ID NO:25), and/or SLT-2A (SEQ ID NO:26). Further embodiments are CD20-binding proteins in which the Shiga toxin effector region comprises or consists essentially of amino acids 1 to 261 of SLT-1A (SEQ ID NO: 1), StxA (SEQ ID NO:25), and/or SLT-2A (SEQ ID NO:26).
[0082] For certain embodiments, the CD20-binding protein of the present invention is one which comprises or consists essentially of the amino acid sequence of SEQ ID NO:4, SEQ ID NO:12, SEQ ID NO:14, or SEQ ID NO:16.
[0083] As used herein, the term "heavy chain variable (VH) domain" or "light chain variable (VL) domain" respectively refer to any antibody VH or VL domain (e.g. a human VH or VL domain) as well as any derivative thereof retaining at least qualitative antigen binding ability of the corresponding native antibody (e.g. a humanized VH or VL domain derived from a native murine VH or VL domain). A VH or VL domain consists of a "framework" region interrupted by the three CDRs. The framework regions serve to align the CDRs for specific binding to an epitope of an antigen. From amino-terminus to carboxyl-terminus, both VH and VL domains comprise the following framework (FR) and CDR regions: FR1, CDR1, FR2, CDR2, FR3, CDR3, and FR4. The assignment of amino acids to each domain is in accordance with the definitions of Kunik V et al., PLoS Comput Biol 8: e1002388 (2012) and Kunik V et al., Nucleic Acids Res 40: W521-4 (2012) or alternatively in accordance with the definitions of Kabat, Sequences of Proteins of Immunological Interest, (5th ed., National Institutes of Health, Bethesda, Md., 1991); or Chothia and Lesk, J. Mol. Biol. 196: 901-17 (1987); Chothia et al., Nature 342: 878-83 (1989).
[0084] In certain embodiments of the invention, the CDRs comprise three CDRs derived from a heavy chain of the antibody and three CDRs derived from a light chain of the antibody. In certain embodiments, the three heavy chain CDRs comprise SEQ ID NO:6 (HCDR1), SEQ ID NO:7 (HCDR2), and SEQ ID NO:8 (HCDR3), while the three light chain CDRs comprise SEQ ID NO:9 (LCDR1), SEQ ID NO:10 (LCDR2), and SEQ ID NO:11 (LCDR3). Additionally, in certain embodiments of the invention, the immunoglobulin-type binding region comprises or consists essentially of amino acids 2 to 245 of SEQ ID NO:4.
[0085] It is within the scope of the invention to use fragments, variants, and/or derivatives of the polypeptides of the CD20-binding proteins of the invention which contain a functional CD20 binding site to any extracellular part of CD20, and even more preferably capable of binding CD20 with high affinity (e.g. as shown by K.sub.D). For example, the invention provides immunoglobulin-derived polypeptide sequences that can bind to CD20. Any polypeptide may be substituted for this region which binds an extracellular part of CD20 with a dissociation constant (K.sub.D) of 10.sup.-5 to 10.sup.-12 moles/liter, preferably less than 200 nM.
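The affinity criterion stated above can be illustrated with a short check. This is a minimal sketch only; the function name `classify_affinity` and the constant names are illustrative assumptions and are not part of the disclosure.

```python
# Illustrative sketch: names are hypothetical, thresholds are those stated in the text.
KD_UPPER_M = 1e-5         # weakest binding in the stated range (moles/liter)
KD_LOWER_M = 1e-12        # tightest binding in the stated range (moles/liter)
KD_PREFERRED_M = 200e-9   # preferred: less than 200 nM


def classify_affinity(kd_molar: float) -> str:
    """Classify a dissociation constant (moles/liter) against the stated K_D range."""
    if kd_molar <= 0:
        raise ValueError("K_D must be a positive concentration")
    if kd_molar > KD_UPPER_M or kd_molar < KD_LOWER_M:
        return "outside range"
    if kd_molar < KD_PREFERRED_M:
        return "preferred"
    return "within range"
```

For example, a hypothetical binding region with a K.sub.D of 50 nM would be classified as "preferred", whereas one with a K.sub.D of 1 micromolar would fall within the broader range only.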
[0086] Thus it is within the scope of the invention to alter the immunoglobulin-type binding site of a disclosed exemplary CD20-binding protein so long as at least one polypeptide sequence is chosen from the group consisting of the CDR1 sequences, CDR2 sequences, and CDR3 sequences that are described. In particular, but without limitation, the polypeptide sequences of the invention may consist essentially of four framework regions (FR1 to FR4) and three complementarity determining regions (CDR1 to CDR3, respectively); or any suitable fragment of such amino acid sequence that exhibits target biomolecule binding functionality based on the presence of one or more CDRs.
[0087] In certain embodiments, the immunoglobulin-type binding region comprises (i) a heavy chain variable (VH) domain comprising CDR amino acid sequences as shown in SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8, and (ii) a light chain variable (VL) domain comprising CDR amino acid sequences as shown in SEQ ID NO:9, SEQ ID NO: 10, and SEQ ID NO: 11. In other embodiments, the immunoglobulin-type binding region comprises or consists essentially of amino acids 2 to 245 of SEQ ID NO:4.
[0088] Among certain embodiments of the present invention, the immunoglobulin-type binding region is derived from a nanobody or single domain immunoglobulin-derived region V.sub.HH which exhibits high affinity binding specifically to CD20. Generally, nanobodies are constructed from fragments of naturally occurring single, monomeric variable domain antibodies (sdAbs) of the sort found in camelids and cartilaginous fishes (Chondrichthyes). Nanobodies are engineered from these naturally occurring antibodies by truncating the single, monomeric variable domain to create a smaller and more stable molecule. Due to their small size, nanobodies are able to bind to antigens that are not accessible to whole antibodies.
III. The General Function of the CD20-Binding Proteins of the Invention
[0089] The present invention provides various CD20-binding proteins for the selective killing of specific cell types, the CD20-binding proteins comprising 1) immunoglobulin-type CD20 binding regions for cell targeting and 2) cytotoxic Shiga toxin effector regions for inducing cellular internalization and, optionally, cell killing as well. The linking of CD20 targeting immunoglobulin-type binding regions with Shiga-toxin-Subunit-A-derived regions enables the targeting of the potent Shiga toxin cytotoxicity specifically to CD20 expressing cells. In their preferred embodiments, the CD20-binding proteins of the invention are capable of binding CD20 natively present on a cell surface and entering the cell. Once internalized within a targeted cell type, certain embodiments of the CD20-binding proteins of the invention are capable of routing a cytotoxic Shiga toxin effector polypeptide fragment into the cytosol of the target cell. Once in the cytosol of a targeted cell type, certain embodiments of the CD20-binding proteins of the invention are capable of enzymatically inactivating ribosomes and eventually killing the cell. Alternatively, non-toxic variants may be used to deliver additional exogenous materials and/or label the interiors of CD20 expressing cells for diagnostic purposes.
[0090] Various types of cells which express CD20 may be targeted by the CD20-binding proteins of the invention for killing and/or receiving exogenous materials. Among the CD20 expressing cell types anticipated to internalize the CD20-binding proteins of the invention are those within the B-cell lineage. "B-cell lineage" is a term used to describe those cells that are cytologically or otherwise identified as B-cells themselves, e.g., through cell surface markers, or were once or presently derived from cells that are cytologically or otherwise identified as B-cells. The term "B-cell lineage" includes neoplastic cells derived from the B-cell lineage or precursors to the B-cell lineage. Among the CD20 expressing cell types that may be targeted are dysplastic or neoplastic cells of cell lineages which do not normally express CD20, e.g. melanoma cells. In particular, the CD20 expressing cells to be targeted with the CD20-binding proteins of the invention include neoplastic cells of B-cell lineages or non-B-cell lineages, such as neoplastic cells from a hematopoietic lineage that are not usually categorized as B-cells but which express CD20.
A. CD20-Binding Protein Capable of Inducing Rapid Internalization of CD20
[0091] The Shiga toxin effector regions of the present invention provide an internalization function, moving the CD20-binding proteins from the external surface of the target cell into the cytosol of the target cell. However, this internalization function is also an acceleration function in that the cellular internalization of CD20 is promoted or induced. As used in the specification and the claims herein, the phrase "rapid internalization" refers to a CD20-binding protein of the invention decreasing the time for CD20 cellular internalization upon binding as compared to a prior art reference molecule, such as the monoclonal antibody rituximab.
[0092] For the purposes of the present invention, cellular internalization is considered rapid if the time for internalization to occur due to the binding of the CD20-binding proteins is reduced as compared to the time for internalization of the target molecule with the binding of a well-characterized antibody recognizing a CD20 antigen, such as the 1H4 CD20 monoclonal antibody (Haisma H et al., Blood 92: 184-90 (1999)). For example, internalization timing for the CD20 antigen, although variable for cell type and antibody type, does not typically begin to reach maximal levels until approximately six hours after binding. Thus the term "rapid" as defined within the present specification is less than this six hour standard internalization window. In certain embodiments, rapid can be as quickly as less than about one hour, but can also encompass a range of from about 1 hour to about 2 hours, to about 3 hours, to about 4 hours, to about 5 hours; a range of about 2 hours to about 3 hours, to about 4 hours, to about 5 hours; a range of about 3 hours to about 4 hours, to about 5 hours; and a range of about 4 hours to about 5 hours.
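The definition of "rapid" given above reduces to a comparison against the six-hour reference window. The following minimal sketch illustrates that comparison; the function and constant names are hypothetical, and how the internalization time is measured is left to the assay used.

```python
# Illustrative sketch: names are hypothetical; the six-hour window is from the text.
STANDARD_INTERNALIZATION_WINDOW_H = 6.0


def is_rapid_internalization(hours_to_internalize: float) -> bool:
    """Return True if the measured internalization time is below the six-hour window."""
    if hours_to_internalize < 0:
        raise ValueError("time must be non-negative")
    return hours_to_internalize < STANDARD_INTERNALIZATION_WINDOW_H
```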
B. Cell Kill via Targeted Shiga Toxin Cytotoxicity
[0093] Because members of the Shiga toxin family are adapted to killing eukaryotic cells, CD20-binding proteins designed using Shiga toxin effector regions can show potent cell-kill activity. The A Subunits of members of the Shiga toxin family comprise enzymatic domains capable of killing a eukaryotic cell once in the cell's cytosol. Certain embodiments of the CD20-binding proteins of the invention take advantage of this cytotoxic mechanism.
[0094] In certain embodiments of the CD20-binding proteins of the invention, upon contacting a cell expressing CD20 such that at least a part of CD20 is accessible from the extracellular space, the CD20-binding protein is capable of causing death of the cell. CD20 positive "cell kill" may be accomplished using a CD20-binding protein of the invention under varied conditions of target cells, such as an ex vivo manipulated target cell, a target cell cultured in vitro, a target cell within a tissue sample cultured in vitro, or a target cell in vivo.
C. Selective Cytotoxicity between CD20 Expressing Cells and Non-CD20 Expressing Cells
[0095] By targeting the delivery of enzymatically active Shiga toxin regions using high-affinity immunoglobulin-type binding regions to CD20 expressing cells, this potent cell-kill activity can be restricted to preferentially killing CD20 positive cell types.
[0096] In certain embodiments, upon administration of the CD20-binding protein of the invention to a mixture of cell types, the CD20-binding protein is capable of selectively killing CD20 expressing cells displaying an extracellular CD20 target compared to cell types lacking extracellular CD20 targets. Because members of the Shiga toxin family are adapted for killing eukaryotic cells, CD20-binding proteins designed using Shiga toxin effector regions can show potent cytotoxic activity. By targeting the delivery of enzymatically active Shiga toxin regions to CD20 expressing cells using high-affinity immunoglobulin-type binding regions, this potent cell kill activity can be restricted to preferentially killing only CD20 expressing cells.
[0097] In certain embodiments, the CD20-binding protein of the invention is capable of selectively or preferentially causing the death of a specific cell type within a mixture of two or more different cell types. This enables targeting cytotoxic activity to specific cell types with a high preferentiality, such as with at least a 3-fold cytotoxic effect, over "bystander" cell types that do not express any significant amount of extracellular CD20 targets. This enables the targeted cell-killing of specific cell types expressing CD20 on cellular surfaces with a high preferentiality, such as with at least a 3-fold cytotoxic effect, over "bystander" cell types that do not express significant amounts of CD20 or are not exposing significant amounts of CD20 on a cellular surface.
[0098] In certain further embodiments, upon administration of the CD20-binding protein to two different populations of cell types, the CD20-binding protein is capable of causing cell death as defined by the half-maximal cytotoxic concentration (CD.sub.50) on a cell population which expresses CD20 on a cellular surface at a dose at least three times lower than the CD.sub.50 dose of the same CD20-binding protein to a cell population which does not express CD20.
[0099] In certain embodiments, the cytotoxic activity toward populations of cell types expressing CD20 on a cellular surface is at least 3-fold higher than the cytotoxic activity toward populations of cell types not physically coupled with any extracellular CD20 target of the CD20 binding region of the embodiment. According to the present invention, selective cytotoxicity may be quantified in terms of the ratio (a/b) of (a) cytotoxicity towards a population of cells expressing an extracellular CD20 target of the CD20 binding region of the embodiment to (b) cytotoxicity towards a population of cells of a cell type not physically coupled with any extracellular CD20 target of the CD20 binding region of the embodiment. In certain embodiments, the cytotoxicity ratio is indicative of selective cytotoxicity which is at least 3-fold, 5-fold, 10-fold, 15-fold, 20-fold, 25-fold, 30-fold, 40-fold, 50-fold, 75-fold, 100-fold, 250-fold, 500-fold, 750-fold, or 1000-fold higher for populations of cells or cell types expressing CD20 compared to populations of cells or cell types which do not express CD20.
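The selectivity comparison described above can be illustrated numerically. The sketch below assumes that cytotoxic potency is inversely proportional to the CD.sub.50, so that the fold selectivity is the CD.sub.50 toward cells lacking CD20 divided by the CD.sub.50 toward cells expressing CD20; that assumption, the function names, and the example values are illustrative and not taken from the disclosure.

```python
# Illustrative sketch: fold selectivity from two half-maximal cytotoxic concentrations.
# Assumption: potency is taken as 1/CD50, so fold = CD50(negative) / CD50(positive).

def selectivity_fold(cd50_cd20_positive: float, cd50_cd20_negative: float) -> float:
    """Return how many fold more potent the protein is toward CD20-expressing cells."""
    if cd50_cd20_positive <= 0 or cd50_cd20_negative <= 0:
        raise ValueError("CD50 values must be positive concentrations in the same units")
    return cd50_cd20_negative / cd50_cd20_positive


def is_selective(cd50_cd20_positive: float, cd50_cd20_negative: float,
                 min_fold: float = 3.0) -> bool:
    """Return True if the fold selectivity meets the threshold (3-fold by default)."""
    return selectivity_fold(cd50_cd20_positive, cd50_cd20_negative) >= min_fold
```

With hypothetical CD.sub.50 values of 10 nM toward CD20-expressing cells and 1000 nM toward cells lacking CD20, the sketch reports a 100-fold selectivity, which satisfies the 3-fold threshold and several of the higher thresholds listed above.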
[0100] This preferential cell-killing function allows a targeted cell to be killed by certain CD20-binding proteins of the invention under varied conditions and in the presence of non-targeted bystander cells, such as ex vivo manipulated mixtures of cell types, in vitro cultured tissues with mixtures of cell types, or in vivo in the presence of multiple cell types (e.g. in situ or in its native location within a multicellular organism).
[0101] In addition, catalytically inactive forms of CD20-binding proteins optionally may be used for diagnostic functions. The conjugating of additional diagnostic agents known in the art to CD20-binding proteins of the invention enables the imaging of intracellular organelles (e.g. Golgi, endoplasmic reticulum, and cytosolic compartments) of individual immune cells of the B-cell lineage or cancer cells in a patient or biopsy sample. For example, this may be useful in the diagnosis of neoplastic cell types, assaying the progression of anticancer therapies over time, and/or evaluating the presence of residual cancer cells after surgical excision of a tumor mass.
D. Delivery of Additional Exogenous Material
[0102] Because the CD20-binding proteins are capable of inducing cellular internalization of CD20 after binding to an extracellular part of CD20, certain embodiments of the CD20-binding proteins of the invention may be used to deliver additional exogenous materials into the interior of CD20 expressing cells. In one sense, the entire CD20-binding protein is an exogenous material which will enter the cell; thus, the "additional" exogenous materials are materials linked to but other than the core CD20-binding protein itself.
[0103] "Additional exogenous material" as used herein refers to one or more molecules, often not generally present within a native target cell, where the CD20-binding proteins of the present invention can be used to specifically transport such material to the interior of a cell. In general, additional exogenous material is selected from peptides, polypeptides, proteins, and polynucleotides. One example of an additional exogenous material that is a peptide is an influenza virus antigen, such as the influenza Matrix 58-66 peptide (SEQ ID NO:3). One exemplary embodiment of a CD20-binding protein that may deliver that antigen into a target cell that expresses CD20 is provided in SEQ ID NO:16.
[0104] Additional exogenous material may include an interior polypeptide sequence within the core CD20-binding protein structure, such as the influenza Matrix 58-66 peptide (SEQ ID NO:3). Similarly, additional exogenous material may include a terminally-located polypeptide sequence linked to a terminal of the CD20-binding structure. Certain embodiments of the CD20-binding proteins of the invention that may deliver that antigen, as an additional exogenous material, into a target cell that expresses CD20 on a cell surface are CD20-binding proteins that comprise or consist essentially of SEQ ID NO:4, SEQ ID NO:12, SEQ ID NO:14, or SEQ ID NO:16.
[0105] Additional examples of exogenous materials that may be linked to the CD20-binding proteins of the invention include antigens such as those derived from bacterial proteins, such as those characteristic of antigen-presenting cells infected by bacteria. Further examples of additional exogenous materials are proteins mutated in cancer or proteins that are aberrantly expressed in cancer. Further examples of additional exogenous materials include T-cell complementarity determining regions capable of functioning as exogenous antigens.
[0106] Further examples of exogenous materials that may be linked to the CD20-binding proteins of the invention include proteins other than antigens, such as enzymes. Further types of exogenous material are polynucleotides. Among the polynucleotides that can be transported are those formulated to have regulatory function, such as small interfering RNA (siRNA) and microRNA (miRNA).
[0107] Additional examples of exogenous materials include antigens such as those derived from bacterial proteins, such as those characteristic of antigen-presenting cells that are infected with bacteria. Further examples of exogenous antigens are ones that are derived from a protein mutated in cancer or proteins that are aberrantly expressed in cancer. T-cell complementarity determining regions (CDRs) can also act as exogenous antigens for the purposes of the present invention. Additional examples of exogenous material include proteins other than antigens, such as enzymes. A further type of exogenous material is nucleic acids. Among the nucleic acids that can be transported are those formulated to have regulatory function, such as small interfering RNA (siRNA) and microRNA (miRNA).
Variations in the Polypeptide Sequence of the CD20-Binding Proteins of the Invention, which Maintain Overall Structure and Function
[0108] In certain of the above embodiments, the CD20-binding protein of the invention is a variant in which there are one or more conservative amino acid substitutions introduced into the polypeptide region(s). As used herein, the term "conservative substitution" denotes that one or more amino acids are replaced by another, biologically similar amino acid residue. Examples include substitution of amino acid residues with similar characteristics, e.g. small amino acids, acidic amino acids, polar amino acids, basic amino acids, hydrophobic amino acids and aromatic amino acids (see, for example, Table B below). An example of a conservative substitution with a residue normally not found in endogenous, mammalian peptides and proteins is the conservative substitution of an arginine or lysine residue with, for example, ornithine, canavanine, aminoethylcysteine, or another basic amino acid. For further information concerning phenotypically silent substitutions in peptides and proteins, see e.g. Bowie J et al., Science 247: 1306-10 (1990). In the scheme below are conservative substitutions of amino acids grouped by physicochemical properties. I: neutral, hydrophilic; II: acids and amides; III: basic; IV: hydrophobic; V: aromatic, bulky amino acids.
TABLE B Examples of Conservative Amino Acid Substitutions
I     II    III   IV    V
A     N     H     M     F
S     D     R     L     Y
T     E     K     I     W
P     Q           V
G                 C
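The groupings of Table B can be expressed as a lookup for checking whether a proposed substitution is conservative under this scheme. This is a minimal sketch; the names are hypothetical, and the placement of proline and glycine in group I and of valine and cysteine in group IV is an assumed reading of the table based on the group descriptions above.

```python
# Illustrative sketch of the Table B groupings (one-letter amino acid codes).
# Assumption: P and G belong to group I; V and C belong to group IV.
CONSERVATIVE_GROUPS = {
    "I": set("ASTPG"),    # neutral, hydrophilic
    "II": set("NDEQ"),    # acids and amides
    "III": set("HRK"),    # basic
    "IV": set("MLIVC"),   # hydrophobic
    "V": set("FYW"),      # aromatic, bulky
}


def group_of(residue: str) -> str:
    """Return the Table B group label for a one-letter amino acid code."""
    for name, members in CONSERVATIVE_GROUPS.items():
        if residue.upper() in members:
            return name
    raise ValueError(f"residue {residue!r} is not listed in Table B")


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if both residues fall in the same Table B group."""
    return group_of(original) == group_of(replacement)
```

Under this sketch, leucine to isoleucine and lysine to arginine are conservative substitutions, whereas aspartate to lysine is not. Non-standard residues such as ornithine are outside the table and would need to be handled separately.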
[0109] In certain embodiments, a CD20-binding protein of the invention may comprise functional fragments or variants of a polypeptide region of the invention that have, at most, 20, 15, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 amino acid substitutions compared to a polypeptide sequence recited herein, as long as it retains measurable biological activity alone or as a component of a CD20-binding protein. Variants of CD20-binding proteins are within the scope of the invention as a result of changing a polypeptide of the CD20-binding protein by altering one or more amino acids or deleting or inserting one or more amino acids, such as within the immunoglobulin-type binding region or the Shiga toxin effector region, in order to achieve desired properties, such as changed cytotoxicity, changed cytostatic effects, changed immunogenicity, and/or changed serum half-life. A polypeptide of a CD20-binding protein of the invention may further be with or without a signal sequence.
[0110] In certain embodiments, a CD20-binding protein of the invention shares at least 85%, 90%, 95%, 96%, 97%, 98%, 99% or more amino acid sequence identity to any one of the amino acid sequences of a CD20-binding protein recited herein, as long as it retains measurable biological activity, such as cytotoxicity, extracellular target biomolecule binding, enzymatic catalysis, or subcellular routing. The immunoglobulin-type binding region may differ from the amino acid sequences of a CD20-binding protein recited herein, as long as it retains binding functionality to its extracellular target biomolecule. Binding functionality will most likely be retained if the amino acid sequences of the ABRs are identical. For example, a CD20-binding protein of the invention may consist essentially of a polypeptide with at least 85% amino acid identity to SEQ ID NO:4 or SEQ ID NO:16, in which, for the purposes of determining the degree of amino acid identity, the amino acid residues that form the ABRs are disregarded. Binding functionality can be determined by the skilled worker using standard techniques.
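The identity calculation described above, in which designated residues are disregarded, can be sketched as follows. The sketch assumes two sequences that have already been aligned to equal length and a caller-supplied set of 1-based positions to disregard; the function name, the toy sequences in the usage note, and the treatment of positions are illustrative assumptions and do not correspond to any SEQ ID NO.

```python
# Illustrative sketch: percent identity over pre-aligned sequences, skipping
# caller-specified positions (e.g. residues designated as forming the ABRs).

def percent_identity(seq_a: str, seq_b: str, ignored_positions=frozenset()) -> float:
    """Return percent identity, disregarding the given 1-based positions."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    compared = 0
    matches = 0
    for position, (a, b) in enumerate(zip(seq_a, seq_b), start=1):
        if position in ignored_positions:
            continue  # disregarded residue: counts neither for nor against identity
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no positions left to compare")
    return 100.0 * matches / compared
```

For two toy ten-residue sequences differing at positions 5 and 10, the sketch reports 80% identity when all positions are compared and 100% when both differing positions are disregarded.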
[0111] In certain embodiments, the Shiga toxin effector region may be altered to change the enzymatic activity and/or cytotoxicity of the Shiga toxin effector region. This change may or may not result in a change in the cytotoxicity of a CD20-binding protein of which the altered Shiga toxin effector region is a component. Possible alterations include mutations to the Shiga toxin effector region selected from the group consisting of: a truncation, deletion, inversion, insertion and substitution.
[0112] The cytotoxicity of the A Subunits of members of the Shiga toxin family may be reduced or eliminated by mutation or truncation. The positions labeled tyrosine-77, glutamate-167, arginine-170, tyrosine-114, and tryptophan-203 have been shown to be important for the catalytic activity of Stx, Stx1, and Stx2 (Hovde C et al., Proc Natl Acad Sci USA 85: 2568-72 (1988); Deresiewicz R et al., Biochemistry 31: 3272-80 (1992); Deresiewicz R et al., Mol Gen Genet 241: 467-73 (1993); Ohmura M et al., Microb Pathog 15: 169-76 (1993); Cao C et al., Microbiol Immunol 38: 441-7 (1994); Suhan M, Hovde C, Infect Immun 66: 5252-9 (1998)). Mutating both glutamate-167 and arginine-170 eliminated the enzymatic activity of Slt-I A1 in a cell-free ribosome inactivation assay (LaPointe, J Biol Chem 280: 23310-18 (2005)). In another approach using de novo expression of Slt-I A1 in the endoplasmic reticulum, mutating both glutamate-167 and arginine-170 eliminated Slt-I A1 fragment cytotoxicity at that expression level (LaPointe, J Biol Chem 280: 23310-18 (2005)). A truncation analysis demonstrated that a fragment of StxA from residues 75 to 268 still retains significant enzymatic activity in vitro (Haddad, J Bacteriol 175: 4970-8 (1993)). A truncated fragment of Slt-I A1 containing residues 1-239 displayed significant enzymatic activity in vitro and cytotoxicity by de novo expression in the cytosol (LaPointe, J Biol Chem 280: 23310-18 (2005)). Expression of a Slt-I A1 fragment truncated to residues 1-239 in the endoplasmic reticulum was not cytotoxic because it could not retrotranslocate to the cytosol (LaPointe, J Biol Chem 280: 23310-18 (2005)).
[0113] The most critical residues for enzymatic activity and/or cytotoxicity in the Shiga toxin A Subunits were mapped to the following residue-positions: asparagine-75, tyrosine-77, glutamate-167, arginine-170, and arginine-176 among others (Di, Toxicon 57: 535-39 (2011)). In particular, a double-mutant construct of Stx2A containing glutamate-167-to-lysine and arginine-176-to-lysine mutations was completely inactivated; whereas, many single mutations in Stx1 and Stx2 showed a 10-fold reduction in cytotoxicity. Further, truncation of Stx1A to 1-239 or 1-240 reduced its cytotoxicity, and similarly, truncation of Stx2A to a conserved hydrophobic residue reduced its cytotoxicity.
[0114] Shiga-like toxin 1 A Subunit truncations are catalytically active, capable of enzymatically inactivating ribosomes in vitro, and cytotoxic when expressed within a cell (LaPointe, J Biol Chem 280: 23310-18 (2005)). The smallest Shiga toxin A Subunit fragment exhibiting full enzymatic activity is a polypeptide composed of residues 1-239 of Slt1A (LaPointe, J Biol Chem 280: 23310-18 (2005)). Although the smallest fragment of the Shiga toxin A Subunit reported to retain substantial catalytic activity was residues 75-247 of StxA (Al-Jaufy, Infect Immun 62: 956-60 (1994)), a StxA truncation expressed de novo within a eukaryotic cell requires only up to residue 240 to reach the cytosol and exert catalytic inactivation of ribosomes (LaPointe, J Biol Chem 280: 23310-18 (2005)).
[0115] In certain embodiments derived from SLT-1A (SEQ ID NO: 1), StxA (SEQ ID NO:25), or SLT-2A (SEQ ID NO:26), these changes include substitution of the asparagine at position 75, tyrosine at position 77, tyrosine at position 114, glutamate at position 167, arginine at position 170, arginine at position 176, and/or substitution of the tryptophan at position 203. Examples of such substitutions will be known to the skilled worker based on the prior art, such as asparagine at position 75 to alanine, tyrosine at position 77 to serine, substitution of the tyrosine at position 114 to alanine, substitution of the glutamate at position 167 to aspartate, substitution of the arginine at position 170 to alanine, substitution of the arginine at position 176 to lysine, and/or substitution of the tryptophan at position 203 to alanine.
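Truncations and position-specific substitutions of the kind described above can be sketched as simple operations on a one-letter sequence string using 1-based residue numbering. The helper names below are hypothetical, and the toy sequence in the usage note does not correspond to any SEQ ID NO; the check of the expected original residue is an illustrative safeguard against numbering errors.

```python
# Illustrative sketch: 1-based truncation and point substitution of a sequence.

def truncate(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (inclusive, 1-based) of the sequence."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("residue range lies outside the sequence")
    return sequence[start - 1:end]


def apply_substitutions(sequence: str, substitutions) -> str:
    """Apply (position, expected_residue, replacement) substitutions, 1-based."""
    residues = list(sequence)
    for position, expected, replacement in substitutions:
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} lies outside the sequence")
        found = residues[position - 1]
        if found != expected:
            # Guard against applying a substitution at a mis-numbered position.
            raise ValueError(f"expected {expected} at {position}, found {found}")
        residues[position - 1] = replacement
    return "".join(residues)
```

For example, applying a tyrosine-to-serine substitution at position 3 of the toy sequence MKYEN yields MKSEN, and truncating the same toy sequence to residues 2 to 4 yields KYE.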
[0116] CD20-binding proteins of the invention may optionally be conjugated to one or more additional agents which may include therapeutic and/or diagnostic agents known in the art.
Production, Manufacture, and Purification of a CD20-Binding Protein of the Invention
[0117] The CD20-binding proteins of the invention may be produced using biochemical engineering techniques well known to those of skill in the art. For example, CD20-binding proteins of the invention may be manufactured by standard synthetic methods, by use of recombinant expression systems, or by any other suitable method. Thus, the CD20-binding proteins may be synthesized in a number of ways, including, e.g. methods comprising: (1) synthesizing a polypeptide or polypeptide component of a CD20-binding protein using standard solid-phase or liquid-phase methodology, either stepwise or by fragment assembly, and isolating and purifying the final peptide compound product; (2) expressing a polynucleotide that encodes a polypeptide or polypeptide component of a CD20-binding protein in a host cell and recovering the expression product from the host cell or host cell culture; or (3) cell-free in vitro expression of a polynucleotide encoding a polypeptide or polypeptide component of a CD20-binding protein, and recovering the expression product; or by any combination of the methods of (1), (2) or (3) to obtain fragments of the peptide component, subsequently joining (e.g. ligating) the fragments to obtain the peptide component, and recovering the peptide component.
[0118] It may be preferable to synthesize a polypeptide or polypeptide component of a CD20-binding protein of the invention by means of solid-phase or liquid-phase peptide synthesis. CD20-binding proteins of the invention may suitably be manufactured by standard synthetic methods. Thus, peptides may be synthesized by, e.g. methods comprising synthesizing the peptide by standard solid-phase or liquid-phase methodology, either stepwise or by fragment assembly, and isolating and purifying the final peptide product. In this context, reference may be made to WO 1998/11125 or, inter alia, Fields, G et al., Principles and Practice of Solid-Phase Peptide Synthesis (Synthetic Peptides, Gregory A. Grant, ed., Oxford University Press, U.K., 2nd ed., 2002) and the synthesis examples therein.
[0119] CD20-binding proteins of the invention may be prepared (produced and purified) using recombinant techniques well known in the art. In general, methods for preparing polypeptides by culturing host cells transformed or transfected with a vector comprising the encoding polynucleotide and recovering the polypeptide from cell culture are described in, e.g. Sambrook et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, NY, U.S., 1989); Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, N.Y., U.S., 1995). Any suitable host cell may be used to produce a CD20-binding protein of the invention. Host cells may be cells stably or transiently transfected, transformed, transduced or infected with one or more expression vectors which drive expression of a polypeptide of the invention. In addition, a CD20-binding protein of the invention may be produced by modifying the polynucleotide encoding the CD20-binding protein in a manner that results in altering one or more amino acids or deleting or inserting one or more amino acids in order to achieve desired properties, such as changed cytotoxicity, changed cytostatic effects, changed immunogenicity, and/or changed serum half-life.
[0120] Accordingly, the present invention also provides methods for producing a CD20-binding protein of the invention according to above recited methods and using a polynucleotide encoding part or all of a polypeptide of the invention, an expression vector comprising at least one polynucleotide of the invention capable of encoding part or all of a polypeptide of the invention when introduced into a host cell, and/or a host cell comprising a polynucleotide or expression vector of the invention.
[0121] When a polypeptide or protein is expressed using recombinant techniques in a host cell or cell-free system, it is advantageous to separate (or purify) the desired polypeptide or protein away from other components, such as host cell factors, in order to obtain preparations that are of high purity or are substantially homogeneous. Purification can be accomplished by methods well known in the art, such as centrifugation techniques, extraction techniques, chromatographic and fractionation techniques (e.g. size separation by gel filtration, charge separation by ion-exchange column, hydrophobic interaction chromatography, reverse phase chromatography, chromatography on silica or anion-exchange resins such as DEAE and the like, chromatofocusing, and Protein A Sepharose chromatography to remove contaminants), and precipitation techniques (e.g. ethanol precipitation or ammonium sulfate precipitation). Any number of biochemical purification techniques may be used to increase the purity of a CD20-binding protein of the invention. In certain embodiments, the CD20-binding proteins of the invention may optionally be purified in homo-multimeric forms (i.e. a protein complex of two or more identical CD20-binding proteins).
[0122] In the Examples below are descriptions of non-limiting examples of methods for producing a CD20-binding protein of the invention, as well as specific but non-limiting aspects of CD20-binding protein production for the disclosed, exemplary, CD20-binding proteins.
Pharmaceutical Compositions Comprising a CD20-Binding Protein of the Invention
[0123] The present invention provides CD20-binding proteins for use, alone or in combination with one or more additional therapeutic agents, in a pharmaceutical composition, for treatment or prophylaxis of conditions, diseases, disorders, or symptoms described in further detail below (e.g. cancers, malignant tumors, non-malignant tumors, and immune disorders). The present invention further provides pharmaceutical compositions comprising a CD20-binding protein of the invention, or a pharmaceutically acceptable salt or solvate thereof, according to the invention, together with at least one pharmaceutically acceptable carrier, excipient, or vehicle. In certain embodiments, the pharmaceutical composition of the invention may comprise homo-multimeric and/or hetero-multimeric forms of the CD20-binding proteins of the invention. The pharmaceutical compositions will be useful in methods of treating, ameliorating, or preventing a disease, condition, disorder, or symptom described in further detail below. Each such disease, condition, disorder, or symptom is envisioned to be a separate embodiment with respect to uses of a pharmaceutical composition according to the invention. The invention further provides pharmaceutical compositions for use in at least one method of treatment according to the invention, as described in more detail below.
[0124] As used herein, the terms "patient" and "subject" are used interchangeably to refer to any organism, commonly vertebrates such as humans and animals, which presents symptoms, signs, and/or indications of at least one disease, disorder, or condition. These terms include mammals such as the non-limiting examples of primates, livestock animals (e.g. cattle, horses, pigs, sheep, goats, etc.), companion animals (e.g. cats, dogs, etc.) and laboratory animals (e.g. mice, rabbits, rats, etc.).
[0125] As used herein, "treat," "treating," or "treatment" and grammatical variants thereof refer to an approach for obtaining beneficial or desired clinical results. The terms may refer to slowing the onset or rate of development of a condition, disorder or disease, reducing or alleviating symptoms associated with it, generating a complete or partial regression of the condition, or some combination of any of the above. For the purposes of this invention, beneficial or desired clinical results include, but are not limited to, reduction or alleviation of symptoms, diminishment of extent of disease, stabilization (e.g. not worsening) of state of disease, delay or slowing of disease progression, amelioration or palliation of the disease state, and remission (whether partial or total), whether detectable or undetectable. "Treat," "treating," or "treatment" can also mean prolonging survival relative to expected survival time if not receiving treatment. A subject (e.g. a human) in need of treatment may thus be a subject already afflicted with the disease or disorder in question. The terms "treat," "treating," and "treatment" include inhibition or reduction of an increase in severity of a pathological state or symptoms relative to the absence of treatment, and are not necessarily meant to imply complete cessation of the relevant disease, disorder, or condition.
[0126] As used herein, the terms "prevent," "preventing," "prevention" and grammatical variants thereof refer to an approach for preventing the development of, or altering the pathology of, a condition, disease, or disorder. Accordingly, "prevention" may refer to prophylactic or preventive measures. For the purposes of this invention, beneficial or desired clinical results include, but are not limited to, prevention or slowing of symptoms, progression or development of a disease, whether detectable or undetectable. A subject (e.g. a human) in need of prevention may thus be a subject not yet afflicted with the disease or disorder in question. The term "prevention" includes slowing the onset of disease relative to the absence of treatment, and is not necessarily meant to imply permanent prevention of the relevant disease, disorder or condition. Thus "preventing" or "prevention" of a condition may in certain contexts refer to reducing the risk of developing the condition, or preventing or delaying the development of symptoms associated with the condition.
[0127] As used herein, an "effective amount" or "therapeutically effective amount" is an amount or dose of a composition (e.g. a therapeutic composition or agent) that produces at least one desired therapeutic effect in a subject, such as preventing or treating a target condition or beneficially alleviating a symptom associated with the condition. The most desirable therapeutically effective amount is an amount that will produce a desired efficacy of a particular treatment selected by one of skill in the art for a given subject in need thereof. This amount will vary depending upon a variety of factors understood by the skilled worker, including but not limited to the characteristics of the therapeutic compound (including activity, pharmacokinetics, pharmacodynamics, and bioavailability), the physiological condition of the subject (including age, sex, disease type, disease stage, general physical condition, responsiveness to a given dosage, and type of medication), the nature of the pharmaceutically acceptable carrier or carriers in the formulation, and the route of administration. One skilled in the clinical and pharmacological arts will be able to determine a therapeutically effective amount through routine experimentation, namely by monitoring a subject's response to administration of a compound and adjusting the dosage accordingly (see e.g. Remington: The Science and Practice of Pharmacy (Gennaro A, ed., Mack Publishing Co., Easton, Pa., U.S., 19th ed., 1995)).
Production or Manufacture of a Pharmaceutical Composition Comprising a CD20-Binding Protein of the Invention
[0128] Pharmaceutically acceptable salts or solvates of any of the CD20-binding proteins of the invention are likewise within the scope of the present invention.
[0129] The term "solvate" in the context of the present invention refers to a complex of defined stoichiometry formed between a solute (in casu, a polypeptide compound or pharmaceutically acceptable salt thereof according to the invention) and a solvent. The solvent in this connection may, for example, be water, ethanol or another pharmaceutically acceptable, typically small-molecular organic species, such as, but not limited to, acetic acid or lactic acid. When the solvent in question is water, such a solvate is normally referred to as a hydrate.
[0130] CD20-binding proteins of the present invention, or salts thereof, may be formulated as pharmaceutical compositions prepared for storage or administration, which typically comprise a therapeutically effective amount of a compound of the invention, or a salt thereof, in a pharmaceutically acceptable carrier. The term "pharmaceutically acceptable carrier" includes any of the standard pharmaceutical carriers. Pharmaceutically acceptable carriers for therapeutic use are well known in the pharmaceutical art, and are described, for example, in Remington's Pharmaceutical Sciences (Mack Publishing Co. (A. Gennaro, ed., 1985)). As used herein, "pharmaceutically acceptable carrier" includes any and all physiologically acceptable, i.e. compatible, solvents, dispersion media, coatings, antimicrobial agents, isotonic and absorption delaying agents, and the like. Pharmaceutically acceptable carriers or diluents include those used in formulations suitable for oral, rectal, nasal or parenteral (including subcutaneous, intramuscular, intravenous, intradermal, and transdermal) administration. Exemplary pharmaceutically acceptable carriers include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. Examples of suitable aqueous and nonaqueous carriers that may be employed in the pharmaceutical compositions of the invention include water, ethanol, polyols (such as glycerol, propylene glycol, polyethylene glycol, and the like), and suitable mixtures thereof, vegetable oils, such as olive oil, and injectable organic esters, such as ethyl oleate. Proper fluidity can be maintained, for example, by the use of coating materials, such as lecithin, by the maintenance of the required particle size in the case of dispersions, and by the use of surfactants. In certain embodiments, the carrier is suitable for intravenous, intramuscular, subcutaneous, parenteral, spinal or epidermal administration (e.g. by injection or infusion). Depending on selected route of administration, the CD20-binding protein or other pharmaceutical component may be coated in a material intended to protect the compound from the action of low pH and other natural inactivating conditions which the active CD20-binding protein may encounter when administered to a patient by a particular route of administration.
[0131] The formulations of the pharmaceutical compositions of the invention may conveniently be presented in unit dosage form and may be prepared by any of the methods well known in the art of pharmacy. In such form, the composition is divided into unit doses containing appropriate quantities of the active component. The unit dosage form can be a packaged preparation, the package containing discrete quantities of the preparations, for example, packeted tablets, capsules, and powders in vials or ampoules. The unit dosage form can also be a capsule, cachet, or tablet itself, or it can be the appropriate number of any of these packaged forms. It may be provided in single dose injectable form, for example in the form of a pen. Compositions may be formulated for any suitable route and means of administration. Subcutaneous or transdermal modes of administration may be particularly suitable for therapeutic CD20-binding proteins described herein.
[0132] The pharmaceutical compositions of the invention may also contain adjuvants such as preservatives, wetting agents, emulsifying agents and dispersing agents. Prevention of the presence of microorganisms may be ensured both by sterilization procedures, and by the inclusion of various antibacterial and antifungal agents, for example, paraben, chlorobutanol, phenol, sorbic acid, and the like. The inclusion of isotonic agents, such as sugars, sodium chloride, and the like, in the compositions may also be desirable. In addition, prolonged absorption of the injectable pharmaceutical form may be brought about by the inclusion of agents which delay absorption, such as aluminum monostearate and gelatin.
[0133] A pharmaceutical composition of the invention also optionally includes a pharmaceutically acceptable antioxidant. Exemplary pharmaceutically acceptable antioxidants are water-soluble antioxidants, such as ascorbic acid, cysteine hydrochloride, sodium bisulfite, sodium metabisulfite, sodium sulfite and the like; oil-soluble antioxidants, such as ascorbyl palmitate, butylated hydroxyanisole (BHA), butylated hydroxytoluene (BHT), lecithin, propyl gallate, alpha-tocopherol, and the like; and metal chelating agents, such as citric acid, ethylenediamine tetraacetic acid (EDTA), sorbitol, tartaric acid, phosphoric acid, and the like.
[0134] In another aspect, the present invention provides pharmaceutical compositions comprising one or a combination of different CD20-binding proteins of the invention, or an ester, salt or amide of any of the foregoing, and at least one pharmaceutically acceptable carrier.
[0135] Therapeutic compositions are typically sterile and stable under the conditions of manufacture and storage. The composition may be formulated as a solution, microemulsion, liposome, or other ordered structure suitable to high drug concentration. The carrier may be a solvent or dispersion medium containing, for example, water, alcohol such as ethanol, polyol (e.g. glycerol, propylene glycol, and liquid polyethylene glycol), or any suitable mixtures. The proper fluidity may be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion, and by the use of surfactants according to formulation chemistry well known in the art. In certain embodiments, isotonic agents, e.g. sugars, polyalcohols such as mannitol or sorbitol, or sodium chloride, may be desirable in the composition. Prolonged absorption of injectable compositions may be brought about by including in the composition an agent that delays absorption, for example, monostearate salts and gelatin.
[0136] Solutions or suspensions used for intradermal or subcutaneous application typically include one or more of: a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates; and tonicity adjusting agents such as, e.g., sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide, or buffers with citrate, phosphate, acetate and the like. Such preparations may be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic.
[0137] Sterile injectable solutions may be prepared by incorporating a CD20-binding protein of the invention in the required amount in an appropriate solvent with one or a combination of ingredients described above, as required, followed by sterilization microfiltration. Dispersions may be prepared by incorporating the active compound into a sterile vehicle that contains a dispersion medium and other ingredients, such as those described above. In the case of sterile powders for the preparation of sterile injectable solutions, the methods of preparation are vacuum drying and freeze-drying (lyophilization) that yield a powder of the active ingredient in addition to any additional desired ingredient from a sterile-filtered solution thereof.
[0138] When a therapeutically effective amount of a CD20-binding protein of the invention is designed to be administered by, e.g. intravenous, cutaneous or subcutaneous injection, the binding agent will be in the form of a pyrogen-free, parenterally acceptable aqueous solution. Methods for preparing parenterally acceptable protein solutions, taking into consideration appropriate pH, isotonicity, stability, and the like, are within the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection will contain, in addition to binding agents, an isotonic vehicle such as sodium chloride injection, Ringer's injection, dextrose injection, dextrose and sodium chloride injection, lactated Ringer's injection, or other vehicle as known in the art. A pharmaceutical composition of the present invention may also contain stabilizers, preservatives, buffers, antioxidants, or other additives well known to those of skill in the art.
[0139] As described elsewhere herein, a compound may be prepared with carriers that will protect the compound against rapid release, such as a controlled release formulation, including implants, transdermal patches, and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Many methods for the preparation of such formulations are patented or generally known to those skilled in the art (see e.g. Sustained and Controlled Release Drug Delivery Systems (J. Robinson, ed., Marcel Dekker, Inc., NY, U.S., 1978)).
[0140] In certain embodiments, the pharmaceutical composition of the invention may be formulated to ensure a desired distribution in vivo. For example, the blood-brain barrier excludes many large and/or hydrophilic compounds. To target a therapeutic compound or composition of the invention to a particular in vivo location, it can be formulated, for example, in liposomes which may comprise one or more moieties that are selectively transported into specific cells or organs, thus enhancing targeted drug delivery. Exemplary targeting moieties include folate or biotin; mannosides; antibodies; surfactant protein A receptor; p120 catenin and the like.
Polynucleotides, Expression Vectors, and Host Cells
[0141] Beyond the CD20-binding proteins of the present invention, the polynucleotides which encode such CD20-binding proteins, or functional portions thereof, are within the scope of the present invention. The term "polynucleotide" is equivalent to the term "nucleic acids" both of which include polymers of deoxyribonucleic acids (DNAs), polymers of ribonucleic acids (RNAs), analogs of these DNAs or RNAs generated using nucleotide analogs, and derivatives, fragments and homologs thereof. The polynucleotide of the invention may be single-, double-, or triple-stranded. Disclosed polynucleotides are specifically disclosed to include all polynucleotides capable of encoding an exemplary CD20-binding protein, for example, taking into account the wobble known to be tolerated in the third position of RNA codons, yet encoding for the same amino acid as a different RNA codon (see Stothard P, Biotechniques 28: 1102-4 (2000)).
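The codon degeneracy referred to above can be illustrated with a short calculation. The following Python sketch is offered only as an illustration and is not part of the disclosed methods; it counts how many distinct RNA coding sequences encode a given peptide under the standard genetic code. The function name `count_coding_sequences` and the example peptides are hypothetical, and stop codons are ignored.

```python
from math import prod

# Standard genetic code, with codons enumerated in U, C, A, G order
# for the first, second, and third positions ("*" marks a stop codon).
BASES = "UCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Group synonymous codons by the amino acid they encode.
SYNONYMS = {}
for codon, amino_acid in CODON_TABLE.items():
    SYNONYMS.setdefault(amino_acid, []).append(codon)


def count_coding_sequences(peptide: str) -> int:
    """Return the number of distinct RNA sequences encoding `peptide`.

    `peptide` is a string of one-letter amino acid codes (hypothetical input).
    The count is the product of the number of synonymous codons per residue.
    """
    return prod(len(SYNONYMS[residue]) for residue in peptide.upper())
```

For instance, under these assumptions the tripeptide MKL is encoded by 1 x 2 x 6 = 12 distinct RNA sequences, since methionine has one codon, lysine two, and leucine six.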
[0142] In one aspect, the invention provides polynucleotides which encode a CD20-binding protein of the invention, or a fragment or derivative thereof. The polynucleotides may include, e.g., nucleic acid sequence encoding a polypeptide at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99% or more, identical to a polypeptide comprising one of the amino acid sequences of the CD20-binding protein. The invention also includes polynucleotides comprising nucleotide sequences that hybridize under stringent conditions to a polynucleotide which encodes a CD20-binding protein of the invention, or a fragment or derivative thereof, or the antisense or complement of any such sequence.
[0143] Derivatives or analogs of the polynucleotides (or CD20-binding proteins) of the invention include, inter alia, polynucleotide (or polypeptide) molecules having regions that are substantially homologous to the polynucleotides or CD20-binding proteins of the invention, e.g. by at least about 45%, 50%, 70%, 80%, 95%, 98%, or even 99% identity (with a preferred identity of 80-99%) over a polynucleotide or polypeptide sequence of the same size or when compared to an aligned sequence in which the alignment is done by a computer homology program known in the art. An exemplary program is the GAP program (Wisconsin Sequence Analysis Package, Version 8 for UNIX, Genetics Computer Group, University Research Park, Madison, Wis., U.S.) using the default settings, which uses the algorithm of Smith T, Waterman M, Adv. Appl. Math. 2: 482-9 (1981). Also included are polynucleotides capable of hybridizing to the complement of a sequence encoding the proteins of the invention under stringent conditions (see e.g. Ausubel F, et al., Current Protocols in Molecular Biology (John Wiley & Sons, New York, N.Y., U.S., 1993), and below). Stringent conditions are known to those skilled in the art and may be found in Current Protocols in Molecular Biology (John Wiley & Sons, NY, U.S., Ch. Sec. 6.3.1-6.3.6 (1989)).
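The percent identity thresholds recited above can be illustrated with a simple calculation over two sequences that have already been aligned. The following Python sketch is only an illustration and does not reproduce the GAP program or any particular alignment algorithm; the function names, the gap character, and the choice to divide by the full alignment length (gap columns included) are assumptions made for this example.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption for this sketch: identity is the number of columns with
    identical, non-gap residues divided by the total number of alignment
    columns. Other conventions use a different denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(
    aligned_a: str, aligned_b: str, threshold_percent: float
) -> bool:
    """Return True if the percent identity is at least `threshold_percent`."""
    return percent_identity(aligned_a, aligned_b) >= threshold_percent
```

Under these assumptions, two aligned ten-residue sequences that differ at a single position are 90% identical and would satisfy an 80% threshold but not a 95% threshold.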
[0144] The present invention further provides expression vectors that comprise the polynucleotides within the scope of the invention. The polynucleotides capable of encoding the CD20-binding proteins of the invention may be inserted into known vectors, including bacterial plasmids, viral vectors and phage vectors, using material and methods well known in the art to produce expression vectors. Such expression vectors will include the polynucleotides necessary to support production of contemplated CD20-binding proteins within any host cell of choice or cell-free expression systems (e.g. pTxb1 and pIVEX2.3 described in the Examples below). The specific polynucleotides comprising expression vectors for use with specific types of host cells or cell-free expression systems are well known to one of ordinary skill in the art, can be determined using routine experimentation, or may be purchased.
[0145] The term "expression vector," as used herein, refers to a polynucleotide, linear or circular, comprising one or more expression units. The term "expression unit" denotes a polynucleotide segment encoding a polypeptide of interest and capable of providing expression of the nucleic acid segment in a host cell. An expression unit typically comprises a transcription promoter, an open reading frame encoding the polypeptide of interest, and a transcription terminator, all in operable configuration. An expression vector contains one or more expression units. Thus, in the context of the present invention, an expression vector encoding a CD20-binding protein comprising a single polypeptide chain (e.g. an scFv linked to a Shiga toxin effector region) includes at least an expression unit for the single polypeptide chain, whereas an expression vector encoding a CD20-binding protein comprising, e.g. two or more polypeptide chains (e.g. one chain comprising a V.sub.L domain and a second chain comprising a V.sub.H domain linked to a toxin effector region) includes at least two expression units, one for each of the two polypeptide chains of the CD20-binding protein. For expression of multi-chain CD20-binding proteins, an expression unit for each polypeptide chain may also be separately contained on different expression vectors (e.g. expression may be achieved with a single host cell into which expression vectors for each polypeptide chain have been introduced).
[0146] Expression vectors capable of directing transient or stable expression of polypeptides and proteins are well known in the art. The expression vectors generally include, but are not limited to, one or more of the following: a heterologous signal sequence or peptide, an origin of replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination sequence, each of which is well known in the art. Optional regulatory control sequences, integration sequences, and useful markers that can be employed are known in the art.
[0147] The term "host cell" refers to a cell which can support the replication or expression of the expression vector. Host cells may be prokaryotic cells, such as E. coli or eukaryotic cells (e.g. yeast, insect, amphibian, bird, or mammalian cells). Creation and isolation of host cell lines comprising a polynucleotide of the invention or capable of producing a CD20-binding protein of the invention can be accomplished using standard techniques known in the art.
[0148] CD20-binding proteins within the scope of the present invention may be variants or derivatives of the CD20-binding proteins described herein that are produced by modifying the polynucleotide encoding a CD20-binding protein by altering one or more amino acids or deleting or inserting one or more amino acids that may render it more suitable to achieve desired properties, such as more optimal expression by a host cell.
Methods for Using a CD20-Binding Protein or a Pharmaceutical Composition of the Invention
[0149] Generally, it is an object of the invention to provide pharmacologically active agents, as well as compositions comprising the same, that can be used in the prevention and/or treatment of diseases, disorders, and conditions, such as cancers, tumors, immune disorders, or further pathological conditions mentioned herein. Accordingly, the present invention provides methods of using the CD20-binding proteins and pharmaceutical compositions of the invention for the killing of CD20 expressing cells, delivering of additional exogenous materials into CD20 expressing cells, labeling of the interior of CD20 expressing cells, and for treating diseases, disorders, and conditions as described herein.
[0150] In particular, it is an object of the invention to provide such pharmacologically active agents, compositions, and/or methods that have certain advantages compared to the agents, compositions, and/or methods that are currently known in the art. Accordingly, the present invention provides methods of using CD20-binding proteins with specified polypeptide sequences and pharmaceutical compositions thereof. For example, any of the polypeptide sequences in SEQ ID NOs:1, 3, 4, 6-12, 14, 16, and/or 18-29, can be specifically utilized as a component of the CD20-binding protein used in the following methods.
[0151] The present invention provides methods of killing a CD20 expressing cell comprising the step of contacting the cell, either in vitro or in vivo, with a CD20-binding protein or pharmaceutical composition of the present invention. In certain embodiments, a CD20-binding protein or pharmaceutical composition of the present invention can be used to kill CD20 expressing cells in a mixture of different cell types including non-CD20 expressing cells, such as mixtures comprising cancer cells, infected cells, and/or hematological cells.
[0152] In certain embodiments, a CD20-binding protein or pharmaceutical composition of the present invention can be used to kill cancer cells in a mixture of different cell types, such as within an organism. In certain embodiments, a CD20-binding protein or pharmaceutical composition of the present invention, alone or in combination with other compounds or pharmaceutical compositions can show potent cell-kill activity when administered to a population of cells, in vitro or in vivo in a subject such as in a patient in need of treatment. By targeting the delivery of enzymatically active Shiga toxin regions using high-affinity immunoglobulin-type binding regions to CD20, this potent cell-kill activity can be restricted to specifically and selectively kill certain cell types within an organism, such as cancer cells, neoplastic cells, malignant cells, non-malignant tumor cells, or infected cells.
[0153] The term "cancer cell" or "cancerous cell" refers to various neoplastic cells which grow and divide in an abnormally accelerated fashion and will be clear to the skilled person. The term cancer cell includes both malignant and non-malignant cells. Generally, cancers and/or tumors can be defined as diseases, disorders, or conditions that are amenable to treatment and/or prevention. The cancers and tumors (either malignant or non-malignant) which are comprised by cancer cells and/or tumor cells will be clear to the skilled person.
[0154] The present invention provides a method of killing a CD20 expressing cell in a patient, the method comprising the step of administering to the patient at least one CD20-binding protein of the present invention or a pharmaceutical composition thereof.
[0155] Certain embodiments of the CD20-binding protein or pharmaceutical compositions thereof can be used to kill a CD20 expressing immune cell (whether healthy or malignant) in a patient.
[0156] It is within the scope of the present invention to utilize the CD20-binding protein of the invention or pharmaceutical composition thereof for the purposes of ex vivo depletion of B-cells from isolated cell populations removed from a patient.
[0157] Additionally, the present invention provides a method of treating a disease, disorder, or condition in a patient comprising the step of administering to a patient in need thereof a therapeutically effective amount of at least one of the CD20-binding proteins of the present invention or a pharmaceutical composition thereof. Contemplated diseases, disorders, and conditions that can be treated using this method include cancers, malignant tumors, non-malignant tumors, and immune disorders. Administration of a "therapeutically effective dosage" of a compound of the invention can result in a decrease in severity of disease symptoms, an increase in frequency and duration of disease symptom-free periods, or a prevention of impairment or disability due to the disease affliction.
[0158] The therapeutically effective amount of a compound of the present invention will depend on the route of administration, the type of mammal being treated, and the physical characteristics of the specific patient under consideration. These factors and their relationship to determining this amount are well known to skilled practitioners in the medical arts. This amount and the method of administration can be tailored to achieve optimal efficacy, and may depend on such factors as weight, diet, concurrent medication and other factors, well known to those skilled in the medical arts. The dosage sizes and dosing regimen most appropriate for human use may be guided by the results obtained by the present invention, and may be confirmed in properly designed clinical trials. An effective dosage and treatment protocol may be determined by conventional means, starting with a low dose in laboratory animals and then increasing the dosage while monitoring the effects, and systematically varying the dosage regimen as well. Numerous factors may be taken into consideration by a clinician when determining an optimal dosage for a given subject. Such considerations are known to the skilled person.
[0159] An acceptable route of administration may refer to any administration pathway known in the art, including but not limited to aerosol, enteral, nasal, ophthalmic, oral, parenteral, rectal, vaginal, or transdermal (e.g. topical administration of a cream, gel or ointment, or by means of a transdermal patch). "Parenteral administration" is typically associated with injection at or in communication with the intended site of action, including intratumoral injection, infraorbital, infusion, intraarterial, intracapsular, intracardiac, intradermal, intramuscular, intraperitoneal, intrapulmonary, intraspinal, intrasternal, intrathecal, intrauterine, intravenous, subarachnoid, subcapsular, subcutaneous, transmucosal, or transtracheal administration.
[0160] For administration of a pharmaceutical composition of the invention, the dosage range will generally be from about 0.0001 to 100 milligrams per kilogram (mg/kg), and more usually 0.01 to 5 mg/kg, of the host body weight. Exemplary dosages may be 0.25 mg/kg body weight, 1 mg/kg body weight, 3 mg/kg body weight, 5 mg/kg body weight or 10 mg/kg body weight or within the range of 1-10 mg/kg. An exemplary treatment regime is a once or twice daily administration, or a once or twice weekly administration, once every two weeks, once every three weeks, once every four weeks, once a month, once every two or three months or once every three to six months. Dosages may be selected and readjusted by the skilled health care professional as required to maximize therapeutic benefit for a particular patient.
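The weight-based dosages recited above translate into a total amount per administration by simple multiplication. The following Python sketch illustrates only that arithmetic and is not a dosing recommendation; the function name `total_dose_mg` and the example body weights are hypothetical, and the range check merely mirrors the general range of about 0.0001 to 100 mg/kg stated above.

```python
# General dosage range stated in the text, in mg per kg of body weight.
GENERAL_RANGE_MG_PER_KG = (0.0001, 100.0)


def total_dose_mg(body_weight_kg: float, dose_mg_per_kg: float) -> float:
    """Return the total amount (mg) for one administration.

    Both parameters are illustrative inputs. A ValueError is raised for a
    non-positive body weight or a dosage outside the general range.
    """
    low, high = GENERAL_RANGE_MG_PER_KG
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if not low <= dose_mg_per_kg <= high:
        raise ValueError("dosage is outside the general range of the text")
    return body_weight_kg * dose_mg_per_kg
```

For example, at the exemplary dosage of 1 mg/kg, a hypothetical 70 kg subject would receive 70 mg per administration, and at 0.25 mg/kg a hypothetical 80 kg subject would receive 20 mg.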
[0161] Pharmaceutical compositions of the invention will typically be administered to the same patient on multiple occasions. Intervals between single dosages can be, for example, 2-5 days, weekly, monthly, every two or three months, every six months, or yearly. Intervals between administrations can also be irregular, based on regulating blood levels or other markers in the subject or patient. Dosage regimens for a compound of the invention include intravenous administration of 1 mg/kg body weight or 3 mg/kg body weight with the compound administered every two to four weeks for six dosages, then every three months at 3 mg/kg body weight or 1 mg/kg body weight.
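The exemplary regimen above (administration every two to four weeks for six dosages, then every three months) can be laid out as a list of administration days. The following Python sketch is one possible reading offered only for illustration; the function name `dosing_days`, the number of maintenance administrations, and the approximation of three months as 91 days are assumptions that do not appear in the text.

```python
def dosing_days(
    initial_interval_weeks: int = 2,
    initial_doses: int = 6,
    maintenance_doses: int = 2,
    maintenance_interval_days: int = 91,  # assumption: "three months" as 13 weeks
) -> list:
    """Return administration days counted from the first dose (day 0).

    The initial phase uses a fixed interval of two to four weeks; the
    maintenance phase follows the last initial dose at a fixed interval.
    """
    if not 2 <= initial_interval_weeks <= 4:
        raise ValueError("initial interval must be two to four weeks")
    if initial_doses < 1:
        raise ValueError("at least one initial dose is required")
    days = [7 * initial_interval_weeks * i for i in range(initial_doses)]
    last_initial_day = days[-1]
    for n in range(1, maintenance_doses + 1):
        days.append(last_initial_day + maintenance_interval_days * n)
    return days
```

With the default values, the sketch places the six initial administrations on days 0, 14, 28, 42, 56, and 70, followed by maintenance administrations on days 161 and 252.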
[0162] A pharmaceutical composition of the present invention may be administered via one or more routes of administration, using one or more of a variety of methods known in the art. As will be appreciated by the skilled worker, the route and/or mode of administration will vary depending upon the desired results. Routes of administration for CD20-binding proteins or pharmaceutical compositions of the invention include, e.g. intravenous, intramuscular, intradermal, intraperitoneal, subcutaneous, spinal, or other parenteral routes of administration, for example by injection or infusion at or in communication with the intended site of action (e.g. intratumoral injection). In other embodiments, a CD20-binding protein or pharmaceutical composition of the invention may be administered by a non-parenteral route, such as a topical, epidermal or mucosal route of administration, for example, intranasally, orally, vaginally, rectally, sublingually, or topically.
[0163] Therapeutic CD20-binding proteins or pharmaceutical compositions of the invention may be administered with one or more of a variety of medical devices known in the art. For example, in one embodiment, a pharmaceutical composition of the invention may be administered with a needleless hypodermic injection device. Examples of well-known implants and modules useful in the present invention are known in the art, including e.g., implantable micro-infusion pumps for controlled rate delivery; devices for administering through the skin; infusion pumps for delivery at a precise infusion rate; variable flow implantable infusion devices for continuous drug delivery; and osmotic drug delivery systems. These and other such implants, delivery systems, and modules are known to those skilled in the art.
[0164] A CD20-binding protein or pharmaceutical composition of the present invention may be administered alone or in combination with one or more other therapeutic or diagnostic agents. A combination therapy may include a CD20-binding protein of the invention or pharmaceutical composition thereof combined with at least one other therapeutic agent selected based on the particular patient, disease or condition to be treated. Examples of other such agents include, inter alia, a cytotoxic, anti-cancer or chemotherapeutic agent, an anti-inflammatory or anti-proliferative agent, an antimicrobial or antiviral agent, growth factors, cytokines, an analgesic, a therapeutically active small molecule or polypeptide, a single chain antibody, a classical antibody or fragment thereof, or a nucleic acid molecule which modulates one or more signaling pathways, and similar modulating therapeutics which might complement or otherwise be beneficial in a therapeutic or prophylactic treatment regimen.
[0165] Treatment of a patient with certain embodiments of the CD20-binding proteins or pharmaceutical compositions of the present invention will lead to cell death of targeted cells and/or the inhibition of growth of targeted cells. As such, CD20-binding proteins of the invention, and pharmaceutical compositions comprising them, will be useful in methods for treating a variety of pathological disorders in which killing or depleting target cells might be beneficial, such as, inter alia, cancer, immune disorders, and infected cells. The present invention provides methods for suppressing cell proliferation, and treating cell disorders, including neoplasia and overactive B-cells.
[0166] In certain embodiments, CD20-binding proteins and pharmaceutical compositions of the invention can be used to treat or prevent cancers, tumors (malignant and non-malignant), and immune disorders.
[0167] In certain embodiments, the present invention provides methods for treating malignancies or neoplasms and other blood cell-associated cancers in a mammalian subject, such as a human, the method comprising the step of administering to a subject in need thereof a therapeutically effective amount of a CD20-binding protein or pharmaceutical composition of the invention.
[0168] The CD20-binding proteins and pharmaceutical compositions of the invention have varied applications, including, e.g., uses as anti-neoplastic agents, uses in modulating immune responses, uses in purging transplantation tissues of unwanted cell types, and uses as diagnostic agents. The CD20-binding proteins and pharmaceutical compositions of the present invention are commonly anti-neoplastic agents, meaning they are capable of treating and/or preventing the development, maturation, or spread of neoplastic or malignant cells by inhibiting the growth and/or causing the death of cancer or tumor cells.
[0169] In certain embodiments, a CD20-binding protein or pharmaceutical composition of the present invention is used to treat a B-cell-mediated disease or disorder, such as, for example, leukemia, lymphoma, myeloma, amyloidosis, ankylosing spondylitis, asthma, Crohn's disease, diabetes, graft rejection, graft-versus-host disease, Hashimoto's thyroiditis, hemolytic uremic syndrome, HIV-related diseases, lupus erythematosus, multiple sclerosis, polyarteritis, psoriasis, psoriatic arthritis, rheumatoid arthritis, scleroderma, septic shock, Sjögren's syndrome, ulcerative colitis, and vasculitis.
[0170] The CD20-binding proteins and pharmaceutical compositions of the present invention can be utilized in a method of treating cancer comprising administering to a patient, in need thereof, a therapeutically effective amount of the CD20-binding protein or a pharmaceutical composition of the present invention. Some cancers shown to have expression of CD20 include, but are not limited to, B-cell lymphomas (including both non-Hodgkin's and Hodgkin's), hairy cell leukemia, B-cell chronic lymphocytic leukemia, some T-cell lymphomas, and melanoma cancer stem cells. In certain embodiments of the methods of the present invention, the cancer being treated is selected from the group consisting of bone cancer, leukemia, lymphoma, melanoma, and myeloma.
[0171] The CD20-binding proteins and pharmaceutical compositions of the present invention can be utilized in a method of treating an immune disorder comprising administering to a patient, in need thereof, a therapeutically effective amount of the CD20-binding protein or a pharmaceutical composition of the present invention. In certain embodiments of the methods of the present invention, the immune disorder is related to an inflammation associated with a disease selected from the group consisting of: amyloidosis, ankylosing spondylitis, asthma, Crohn's disease, diabetes, graft rejection, graft-versus-host disease, Hashimoto's thyroiditis, hemolytic uremic syndrome, HIV-related diseases, lupus erythematosus, multiple sclerosis, polyarteritis, psoriasis, psoriatic arthritis, rheumatoid arthritis, scleroderma, septic shock, Sjögren's syndrome, ulcerative colitis, and vasculitis.
[0172] Among certain embodiments of the present invention is using the CD20-binding protein as a component of a medicament for the treatment or prevention of a cancer, tumor, or immune disorder. For example, immune disorders presenting on the skin of a patient may be treated with such a medicament in efforts to reduce inflammation.
[0173] Beyond the CD20-binding proteins of the present invention, the polynucleotides which encode such molecules, when applicable, are within the scope of the present invention. The term "polynucleotides" is equivalent to the term "nucleic acids" both of which include polymers of deoxyribonucleic acids and ribonucleic acids. Such polynucleotides are specifically disclosed to include all polynucleotides capable of encoding a specified CD20-binding protein, for example, taking into account the wobble known to be tolerated in the third position of amino acid codons, yet encoding for an equivalent amino acid. Further, the present invention comprises expression vectors that comprise the polynucleotides within the scope of the invention. Such expression vectors will include the polynucleotides necessary to support production of the CD20-binding proteins of the invention within any host cell of choice. The specific polynucleotides comprising expression vectors for use with specific types of host cells are well known to one of ordinary skill in the art, can be determined using routine experimentation, or may be purchased.
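The codon degeneracy discussed above can be illustrated with a short back-translation sketch. It maps each amino acid to a single IUPAC-degenerate codon, which appears consistent with the lowercase consensus polynucleotide notation used in Table C (for example, "aar" for lysine and "ytn" for leucine). The function name and the dictionary layout are assumptions introduced for illustration; this is not presented as the procedure used to generate the sequences of this disclosure.

```python
# One IUPAC-degenerate codon per amino acid (lowercase DNA alphabet).
# n = any base, r = a/g, y = c/t, h = a/c/t, m = a/c, w = a/t, s = c/g.
AMBIGUOUS_CODON = {
    "A": "gcn", "C": "tgy", "D": "gay", "E": "gar", "F": "tty",
    "G": "ggn", "H": "cay", "I": "ath", "K": "aar", "L": "ytn",
    "M": "atg", "N": "aay", "P": "ccn", "Q": "car", "R": "mgn",
    "S": "wsn", "T": "acn", "V": "gtn", "W": "tgg", "Y": "tay",
}


def back_translate(peptide: str) -> str:
    """Return a degenerate consensus polynucleotide for a peptide sequence."""
    try:
        return "".join(AMBIGUOUS_CODON[aa] for aa in peptide.upper())
    except KeyError as exc:
        raise ValueError(f"unsupported residue: {exc.args[0]!r}") from None


# Influenza Matrix 58-66 peptide as listed in Table C.
print(back_translate("GILGFVFTL"))
```

Note that the single-codon consensus for leucine, arginine, and serine ("ytn", "mgn", "wsn") is a compact approximation: each of these patterns also matches some codons that encode a different amino acid, so a consensus string of this kind should be expanded with care before use as an actual coding sequence.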
[0174] The present invention also provides methods of rapidly internalizing the CD20-binding protein into the interior of a cell, by contacting the cell with a CD20-binding protein of the invention either in vivo or in vitro, such as within a patient. The present invention further provides methods of killing a CD20 expressing cell, where that cell expresses a CD20 antigen on its surface, by contacting the cell with a CD20-binding protein of the invention either in vivo or in vitro, such as within a patient.
[0175] If the CD20-binding proteins of the present invention comprise or are conjugated to exogenous material, as described above, those CD20-binding proteins can be utilized in a method of delivering that exogenous material into a target cell that expresses a CD20 antigen on its cell surface. The present invention also provides methods of delivering exogenous materials into the interior of a CD20 expressing cell, by contacting the cell with a CD20-binding protein of the invention either in vivo or in vitro, such as within a patient.
[0176] Additionally, the CD20-binding proteins of the invention can be utilized in a method for treating cancer, wherein the tumor or cancer cell expresses on its surface a CD20 antigen, which method comprises administering the protein of the present invention to a patient in need of such treatment. Some cancers shown to have expression of CD20 include, but are not limited to, B-cell lymphomas (including both non-Hodgkin's and Hodgkin's), hairy cell leukemia, B-cell chronic lymphocytic leukemia, some T-cell lymphomas, and melanoma cancer stem cells.
[0177] For purposes of the present invention, the term "lymphoma" includes B-cell lymphomas (such as non-Hodgkin's and Hodgkin's types), hairy cell leukemia, B-cell chronic lymphocytic leukemia, T-cell lymphomas, and melanoma cancer stem cell type lymphomas.
[0178] Certain embodiments of the invention are below, numbered 1-40 and referring to Table C for biological sequences: (1) A CD20-binding protein for the internalization of the CD20 antigen in a cell, wherein the protein comprises a binding region specific for CD20 and a toxin effector region derived from Shiga-like toxin 1 (SLT-1), wherein the protein induces rapid internalization of CD20 present on the surface of the cell. (2) The CD20-binding protein of embodiment 1, wherein the protein induces internalization of CD20 in a B-cell lineage cell in less than about one hour. (3) The CD20-binding protein of embodiment 1, wherein the toxin effector region comprises amino acids 75 to 251 of NO: 1 (see Table C). (4) The CD20-binding protein of embodiment 1, wherein the toxin effector region comprises amino acids 1 to 251 of NO:1. (5) The CD20-binding protein of embodiment 1, wherein the toxin effector region comprises amino acids 1 to 261 of NO: 1. (6) The CD20-binding protein of embodiment 1, wherein the protein is cytotoxic.
[0179] (7) The CD20-binding protein of embodiment 1, wherein the CD20 binding region is selected from the group consisting of an Fab fragment, an F(ab')2 fragment, an Fd fragment, an Fv fragment, a dAb fragment, a scFv, a diabody, a CDR3 peptide, a constrained FR3-CDR3-FR4 peptide, a nanobody, a bivalent nanobody, small modular immunopharmaceuticals (SMIPs), a shark variable IgNAR domain, a minibody, and any fragment or chemically or genetically manipulated counterparts that retain CD20 binding function. (8) The CD20-binding protein of embodiment 1, wherein the binding region is a scFv.
[0180] (9) The CD20-binding protein of embodiment 8, wherein the binding region comprises (A) (i) a heavy chain variable (VH) domain comprising HCDR1, HCDR2, HCDR3 amino acid sequences as shown in NO:6, NO:7, and NO:8, respectively, and (ii) a light chain variable (VL) domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in NO:9, NO: 10, and NO: 11, respectively; or (B) (i) a heavy chain variable (VH) domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in NO:21, NO:22, and NO:23, respectively, and (ii) a light chain variable (VL) domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in NO:24, NO: 10, and NO: 11, respectively.
[0181] (10) The CD20-binding protein of embodiment 8, wherein the CD20 binding region comprises amino acids 2 to 245 of NO:4. (11) The CD20-binding protein of embodiment 1, wherein the CD20 binding region comprises amino acids 2 to 245 of NO: 4 and the toxin effector region comprises amino acids 75 to 251 of NO: 1. (12) The CD20-binding protein of embodiment 1, which comprises NO:4. (13) A CD20-binding protein for killing a cell which expresses CD20 on its surface wherein the binding region comprises a heavy chain variable (VH) domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in NO:6, NO:7, and NO:8, respectively, and a light chain variable (VL) domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in NO:9, NO: 10, and NO: 11, respectively, whereby upon administration, the protein is capable of killing a cell which expresses CD20 on its surface.
[0182] (14) The CD20 binding-protein of embodiment 13, wherein the CD20 binding region comprises amino acids 2 to 245 of NO:4. (15) The CD20 binding-protein of embodiment 13, wherein the CD20 binding region comprises amino acids 2 to 245 of NO: 4 and the toxin effector region comprises amino acids 75 to 251 of NO:1. (16) The CD20 binding-protein of embodiment 13 which comprises NO:4.
[0183] (17) A CD20 binding-protein for the delivery of exogenous material into the cell that expresses CD20 on its surface, wherein the protein comprises a binding region specific for CD20, a toxin effector region wherein said toxin effector region is derived from Shiga-like toxin 1 (SLT-1), and the exogenous material, whereby upon administration, the protein is capable of delivering the exogenous material into a cell which expresses CD20 on its surface.
[0184] (18) The CD20-binding protein of embodiment 17, wherein the binding region comprises (A) (i) a heavy chain variable (VH) domain comprising HCDR1, HCDR2, HCDR3 amino acid sequences as shown in NO:6, NO:7, and NO:8, respectively, and (ii) a light chain variable (VL) domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in NO:9, NO: 10, and NO: 11, respectively; or (B) (i) a heavy chain variable (VH) domain comprising HCDR1, HCDR2, and HCDR3 amino acid sequences as shown in NO:21, NO:22, and NO:23, respectively, and (ii) a light chain variable (VL) domain comprising LCDR1, LCDR2, and LCDR3 amino acid sequences as shown in NO:24, NO: 10, and NO: 11, respectively.
[0185] (19) The CD20-binding protein of embodiment 18, wherein the exogenous material is selected from the group consisting of a peptide, a protein, and a nucleic acid. (20) The CD20 binding-protein of embodiment 19, wherein the exogenous material is a peptide and the peptide is an antigen. (21) The CD20 binding-protein of embodiment 20, wherein the antigen is encoded between the binding region and the toxin effector region of the protein. (22) The CD20 binding-protein of embodiment 19 wherein the antigen is derived from a viral protein. (23) The CD20 binding-protein of embodiment 21 wherein the antigen is NO:2. (24) The CD20 binding-protein of embodiment 21 comprising NO:5. (25) The CD20 binding-protein of embodiment 20, wherein the antigen is derived from a bacterial protein. (26) The CD20 binding-protein of embodiment 20, wherein the antigen is derived from a protein mutated in cancer. (27) The CD20 binding-protein of embodiment 20, wherein the antigen is derived from a protein aberrantly expressed in cancer. (28) The CD20 binding-protein of embodiment 20, wherein the antigen is derived from a T-cell CDR region.
[0186] (29) The CD20 binding-protein of embodiment 19, wherein the exogenous material is a protein. (30) The CD20 binding-protein of embodiment 29, wherein the protein is an enzyme. (31) The CD20 binding-protein of embodiment 19 wherein the exogenous material is a nucleic acid. (31) The CD20 binding-protein of embodiment 30 wherein the nucleic acid is a siRNA. (32) A polynucleotide that encodes the CD20 binding-protein of embodiment 1. (33) An expression vector that comprises the polynucleotide of embodiment 32. (34) A host cell comprising the expression vector of embodiment 33.
[0187] (35) A method of rapidly internalizing the CD20 antigen into the cell of a patient, the method comprising the step of administering to the patient a protein of any one of embodiments 1-12. (36) A method of killing a cell in a patient expressing the CD20 antigen on its surface, the method comprising the step of administering to a patient a protein of any of embodiments 1-16. (37) A method of delivering exogenous material into a cell of a patient that expresses CD20 on its surface, the method comprising the step of administering to the patient a protein of any one of embodiments 17-31. (39) A method of treating cancer in a patient, wherein the cancer expresses on the tumor or cancer cell surface a CD20 antigen, the method comprising the step of administering to the patient a protein of any one of embodiments 1-31. (40) The method of embodiment 39 wherein the cancer is lymphoma.
TABLE-US-00003 TABLE C Sequences referred to in embodiments 1-40 Text Number Description Sequence NO: 1 SLT-1 A KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGS subunit GDNLFAVDVRGIDPEEGFNNLRLIVERNNLYVTGFVNRTNNVFYR polypeptide FASHVTFPGTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTS YLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRTTLD DLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGS INAILGSVALILNCHHHARVARMASDEFPSMCPADGRVRGITHNK ILWDSSTLGAILMRRTISS NO: 2 SLT-1 A aargarttyacnytngayttywsnacngcnaaracntaygtngaywsnytnaaygtnathm subunit gnwsngcnathggnacnccnytncamcnathwsnwsnggnggnacnwsnytnytnatgath polynucleotide gaywsnggnwsnggngayaayytnttygcngtngaygtnmgnggnathgayccngargarg (consensus) gnmgnttyaayaayytnmgnytnathgtngarmgnaayaayytntaygtnacnggnttygt naaymgnacnaayaaygtnttytaymgnttygcngayttywsncaygtnacnttyccnggn acnacngcngtnacnytnwsnggngaywsnwsntayacnacnytncarmgngtngcnggna thwsnmgnacnggnatgcamthaaymgncaywsnytnacnacnwsntayytngayytnatg wsncaywsnggnacnwsnytnacncarwsngtngcnmgngcnatgytnmgnttygtnacng tnacngcngargcnytnmgnttymgncamthcarmgnggnttymgnacnacnytngaygay ytnwsnggnmgnwntaygtnatgacngcngargaygtngayytnacnytnaaytggggnmg nytnwsnwsngtnytnccngaytaycayggncargaywsngtnmgngtnggnmgnathwsn ttyggnwsnathaaygcnathytnggnwsngtngnytnathytnaaytgycaycaycaygc nwsnmgngtngcnmgnatggcnwsngaygarttyccnwsnatgtgyccngcngayggnmgn gtnmgnggnathacncayaayaarathytntgggaywsnwsnacnytnggngcnathytna tgmgnmgnacnathwsnwsn NO: 3 Influenza GILGFVFTL Matrix 58-66 NO: 4 MT-3724 QVQLQQPGAELVKPGASVKMSCKTSGYTFTSYNVHWVKQTPGQGL polypeptide EWIGAIYPGNGDTSFNQKFKGKATLTADKSSSTVYMQLSSLTSED SAVYYCARSNYYGSSYVWFFDVWGAGTTVTVSSGSTSGSGKPGSG EGSQIVLSQSPTILSASPGEKVTMTCRASSSVSYMDWYQQKPGSS PKPWIYATSNLASGVPARFSGSGSGTSYSLTISRVEAEDAATYYC QQWISNPPTFGAGTKLELKEFPKPSTPPGSSGGAPKEFTLDFSTA KTYVDSLNVIRSAIGTPLQIISSGGTSLLMIDSGSGDNLFAVDVR GIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTF PGTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSH SGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRTTLDDLSGRSY VMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGS VALILNCHHHASRVAR NO: 5 MT-3724 
cargtncarytncarcarccnggngcngarytngtnaarccnggngcnwsngtnaaratgw polynucleotide sntgyaamcnwsnggntayacnttyacnwsntayaaygtncaytgggtnaarcaracnccn consensus ggncarggnytngartggathggngcnathtayccnggnaayggngayacnwsnttyaayc araarttyaarggnaargcnacnytnacngcngayaarwsnwsnwsnacngtntayatgca rytnwsnwsnytnacnwsngargaywsngcngtntaytaytgygcnmgnwsnaaytaytay ggnwsnwsntaygtntggttyttygaygtntggggngcnggnacnacngtnacngtnwsnw snggnwsnacnwsnggnwsnggnaarccnggnwsnggngarggnwsncarathgtnytnws ncarwsnccnacnathytnwsngcnwsnccnggngaraargtnacnatgacntgymgngcn wsnwsnwsngtnwsntayatggaytggtaycarcaraarccnggnwsnwsnccnaarccnt ggathtaygcnacnwsnaayytngcnwsnggngtnccngcnmgnttywsnggnwsnggnws nggnacnwsntaywsnytnacnathwsnmgngtngargcngargaygcngcnacntaytay tgycarcartggathwsnaayccnccnacnttyggngcnggnacnaarytngarytnaarg arttyccnaarccnwsnacnccnccnggnwsnwsnggnggngcnccnaargarttyacnyt ngayttywsnacngcnaaracntaygtngaywsnytnaaygtnathmgnwsngcnathggn acnccnytncamcnathwsnwsnggnggnacnwsnytnytnatgathgaywsnggnwsngg ngayaayytnttygcngtngaygtnmgnggnathgayccngargarggnmgnttyaayaay ytnmgnytnathgtngarmgnaayaayytntaygtnacnggnttygtnaaymgnacnaaya aygtnttytaymgnttygcngayttywsncaygtnacnttyccnggnacnacngcngtnac nytnwsnggngaywsnwsntayacnacnytncarmgngtngcnggnathwsnmgnacnggn atgcamthaaymgncaywsnytnacnacnwsntayytngayytnatgwsncaywsnggnac nwsnytnacncarwsngtngcnmgngcnatgytnmgnttygtnacngtnacngcngargcn ytnmgnttymgncarathcarmgnggnttymgnacnacnytngaygayytnwsnggnmgnw sntaygtnatgacngcngargaygtngayytnacnytnaaytggggnmgnytnwsnwsngt nytnccngaytaycayggncargaywsngtnmgngtnggnmgnathwsnttyggnwsnath aaygcnathytnggnwsngtngcnytnathytnaaytgycaycaycaygcnwsnmgngtng cnmgn NO: 6 Heavy chain GYTFTSYNVH CDR1 NO: 7 Heavy chain AIYPGNGDTSFNQKFKG CDR2 NO: 8 Heavy chain SNYYGSSYVWFFDY CDR3 NO: 9 Light chain RASSSVSYMD CDR1 NO: 10 Light chain ATSNLAS CDR2 NO: 11 Light chain QQWISNPPT CDR3 NO: 12 B9E9-SLTA QVQLVQSGAELVKPGASVKMSCKASGYTFTSYNMEWVKQTPGQGL polypeptide EWIGAIYPGNGDTSYNQKFKGKATLTADKSSSTAYMQLSSLTSED SAVYYCARAQLRPNYWYFDVWGAGTTVTVSSGGGGSGGGGSGGGG 
SGGGGSGGGGSDIVLSQSPAILSASPGEKVTMTCRASSSVSYMHW YQQKPGSSPKPWIYATSNLASGVPARFSGSGSGTSYSLTISRVEA EDAATYYCQQWISNPPTFGAGTKLELKGGGGSGGKEFTLDFSTAK TYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRG IDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHS GTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRTTLDDLSGRSYV MTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSV ALILNCHHHASRVAR NO: 13 B9E9-SLTA cargtncarytngtncarwsnggngcngarytngtnaarccnggngcnwsngtnaaratgw polynucleotide sntgyaargcnwsnggntayacnttyacnwsntayaayatgcaytgggtnaarcaracncc consensus nggncarggnytngartggathggngcnathtayccnggnaayggngayacnwsntayaay caraarttyaarggnaargcnacnytnacngcngayaarwsnwsnwsnacngcntayatgc arytnwsnwsnytnacnwsngargaywsngcngtntaytaytgygcnmgngcncarytnmg nccnaaytaytggtayttygaygtntggggngcnggnacnacngtnacngtnwsnwsnggn ggnggnggnwsnggnggnggnggnwsnggnggnggnggnwsnggnggnggnggnwsnggng gnggnggnwsngayathgtnytnwsncarwsncmgcnathytnwsngcnwsnccnggngam argtnacnatgacntgymgngcnwsnwsnwsngtnwsntayatgcaytggtaycarcaraa rccnggnwsnwsnccnaarccntggathtaygcnacnwsnaayytngcnwsnggngtnccn gcnmgnttywsnggnwsnggnwsnggnacnwsntaywsnytnacnathwsnmgngtngarg cngargaygcngcnacntaytaytgycarcartggathwsnaayccnccnacnttyggngc nggnacnaarytngarytnaarggnggnggnggnwsnggnggnaargarttyacnytngay ttywsnacngcnaaracntaygtngaywsnytnaaygtnathmgnwsngcnathggnacnc cnytncaracnathwsnwsnggnggnacwsnytnytnatgathgaywsnggnwsnggngay aayytnttygcngtngaygtnmgnggnathgayccngargarggnmgnttyaayaayytnm gnytnathgtngarmgnaayaayytntaygtnacnggnttygtnaaymgnacnaayaaygt nttytaymgnttygcngayttywsncaygtnacnttyccnggnacnacngcngtnacnytn wsnggngaywsnwsntayacnacnytncarmgngtngcnggnathwsnmgnacnggnatgc arathaaymgncaywsnytnacnacnwsntayytngayytnatgwsncaywsnggnacnws nytnacncarwsngtngcnmgngcnatgytnmgnttygtnacngtnacngcngargcnytn mgnttymgncarathcarmgnggnttymgnacnacnytngaygayytnwsnggnmgnwsnt aygtnatgacngcngargaygtngayytnacnytnaaytggggnmgnytnwsnwsngtnyt nccngaytaycayggncargaywsngtnmgngtnggnmgnathwsnttyggnwsnathaay gcnathytnggnwsngtngcnytnathytnaaytgycaycaycaygcnwsnmgngtngcnm gn NO: 14 C2B8-SLTA 
QVQLQQPGAELVKPGASVKMSCKASGYTFTSYNMHWVKQTPGRGL polypeptide EWIGAIYPGNGDTSYNQKFKGKATLTADKSSSTAYMQLSSLTSED SAVYYCARSTYYGGDWYFNVWGAGTTVTVSAGSTSGSGKPGSGEG STKGQIVLSQSPAILSASPGEKVTMTCRASSSVSYIHWFQQKPGS SPKPWIYATSNLASGVPVRFSGSGSGTSYSLTISRVEAEDAATYY CQQWTSNPPTFGGGTKLEIKEFPKPSTPPGSSGGAPKEFTLDFST AKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDV RGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVT FPGTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMS HSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRTTLDDLSGRS YVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILG SVALILNCHHHASRVAR NO: 15 C2B8-SLTA cargtncarytncarcarccnggngcngarytngtnaarccnggngcnwsngtnaaratgw polynucleotide sntgyaargcnwsnggntayacnttyacnwsntayaayatgcaytgggtnaarcaracncc consensus nggnmgnggnytngartggathggngcnathtayccnggnaayggngayacnwsntayaay caraarttyaarggnaargcnacnytnacngcngayaarwsnwsnwsnacngcntayatgc arytnwsnwsnytnacnwsngargaywsngcngtntaytaytgygcnmgnwsnacntayta yggnggngaytggtayttyaaygtntggggngcnggnacnacngtnacngtnwsngcnggn wsnacnwsnggnwsnggnaarccnggnwsnggngarggnwsnacnaarggncarathgtny tnwsncarwsnccngcnathytnwsngcnwsnccnggngaraargtnacnatgacntgymg ngcnwsnwsnwsngtnwsntayathcaytggttycarcaraarccnggnwsnwsnccnaar ccntggathtaygcnacnwsnaayytngcnwsnggngtnccngtnmgnttywsnggnwsng gnwsnggnacnwsntaywsnytnacnathwsnmgngtngargcngargaygcngcnacnta ytaytgycarcartggacnwsnaayccnccnacnttyggnggnggnacnaarytngamtha argarttyccnaarccnwsnacnccnccnggnwsnwsnggnggngcnccnaargarttyac nytngayttywsnacngcnaaracntaygtngaywsnytnaaygtnathmgnwsngcnath ggnacnccnytncaracnathwsnwsnggnggnacnwsnytnytnatgathgaywsnggnw snggngayaayytnttygcngtngaygtnmgnggnathgayccngargarggnmgnttyaa yaayytnmgnytnathgtngarmgnaayaayytntaygtnacnggnttygtnaaymgnacn aayaaygtnttytaymgnttygcngayttywsncaygtnacnttyccnggnacnacngcng tnacnytnwsnggngaywsnwsntayacnacnytncarmgngtngcnggnathwsnmgnac nggnatgcarathaaymgncaywsnytnacnacnwsntayytngayytnatgwsncaywsn ggnacnwsnytnacncarwsngtngcnmgngcnatgytnmgnttygtnacngtnacngcng argcnytnmgnttymgncarathcarmgnggnttymgnacnacnytngaygayytnwsngg 
nmgnwsntaygtnatgacngcngargaygtngayytnacnytnaaytggggnmgnytnwsn wsngtnytnccngaytaycayggncargaywsngtnmgngtnggnmgnathwsnttyggnw snathaaygcnathytnggnwsngtngcnytnathytnaaytgycaycaycaygcnwsnmg ngtngcnmgn NO: 16 MT-3727 QVQLQQPGAELVKPGASVKMSCKTSGYTFTSYNVHWVKQTPGQGL polypeptide EWIGAIYPGNGDTSFNQKFKGKATLTADKSSSTVYMQLSSLTSED SAVYYCARSNYYGSSYVWFFDVWGAGTTVTVSSGSTSGSGKPGSG EGSQIVLSQSPTILSASPGEKVTMTCRASSSVSYMDWYQQKPGSS PKPWIYATSNLASGVPARFSGSGSGTSYSLTISRVEAEDAATYYC QQWISNPPTFGAGTKLELKEFPKPSTPPGSSGGAPGILGFVFTLK EFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSG DNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYR FADFSHVTFPGTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLT TSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRTT LDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISF GSINAILGSVALILNCHHHASRVAR NO: 17 MT-3727 cargtncarytncarcarccnggngcngarytngtnaarccnggngcnwsngtnaaratgw polynucleotide sntgyaaracnwsnggntayacnttyacnwsntayaaygtncaytgggtnaarcaracncc consensus nggncarggnytngartggathggngcnathtayccnggnaayggngayacnwsnttyaay caraarttyaarggnaargcnacnytnacngcngayaarwsnwsnwsnacngtntayatgc alytnwsnwsnytnacnwsngargaywsngcngtntaytaytgygcnmgnwsnaaytayta yggnwsnwsntaygtntggttyttygaygtntggggngcnggnacnacngtnacngtnwsn wsnggnwsnacnwsnggnwsnggnaarccnggnwsnggngarggnwsncarathgtnytnw sncarwsnccnacnathytnwsngcnwsnccnggngaraargtnacnatgacntgymgngc nwsnwsnwsngtnwsntayatggaytggtaycarcaraarccnggnwsnwsnccnaarccn tggathtaygcnacnwsnaayytngcnwsnggngtnccngcnmgnttywsnggnwsnggnw snggnacnwsntaywsnytnacnathwsnmgngtngargcngargaygcngcnacntayta ytgycarcartggathwsnaayccnccnacnttyggngcnggnacnaarytngarytnaar garttyccnaarccnwsnacnccnccnggnwsnwsnggnggngcnccnggnathytnggnt tygtnttyacnytnaargarttyacnytngayttywsnacngcnaaracntaygtngayws nytnaaygtnathmgnwsngcnathggnacnccnytncamcnathwsnwsnggnggnacnw snytnytnatgathgaywsnggnwsnggngayaayytnttygcngtngaygtnmgnggnat hgayccngargarggnmgnttyaayaayytnmgnytnathgtngarmgnaayaayytntay gtnacnggnttygtnaaymgnacnaayaaygtnttytaymgnttygcngayttywsncayg tnacnttyccnggnacnacngcngtnacnytnwsnggngaywsnwsntayacnacnytnca 
rmgngtngcnggnathwsnmgnacnggnatgcarathaaymgncaywsnytnacnacnwsn tayytngayytnatgwsncaywsnggnacnwsnytnacncarwsngtngcnmgngcnatgy tnmgnttygtnacngtnacngcngargcnytnmgnttymgncamthcarmgnggnttymgn acnacnytngaygayytnwsnggnmgnwsntaygtnatgacngcngargaygtngayytna cnytnaaytggggnmgnytnwsnwsngtnytnccngaytaycayggncargaywsngtnmg ngtnggnmgnathwsnttyggnwsnathaaygcnathytnggnwsngtngcnytnathytn aaytgycaycaycaygcnwsnmgngtngcnmgn NO: 18 218 Linker GSTSGSGKPGSGEGS NO: 19 Strep leader MWSHPQFEK sequence NO: 20 Murine IgG3 EFPKPSTPPGSSGGAP (mhinge) NO: 21 Heavy chain GYTFTSYNMH CDR1 NO: 22 Heavy chain AIYPGNGDTSYNQKFKG CDR2 NO: 23 Heavy chain AQLRPNYWYFDV CDR3 NO: 24 Light chain RASSSVSYMH CDR1
[0188] The present invention is further illustrated by the following non-limiting examples of CD20-binding proteins comprising Shiga toxin effector regions derived from A Subunits of members of the Shiga toxin family and CD20 binding regions comprising immunoglobulin-type polypeptides capable of binding extracellular parts of CD20.
EXAMPLES
[0189] The following examples demonstrate certain embodiments of the present invention. However, it is to be understood that these examples are for illustration purposes only and do not intend, nor should any be construed, to be wholly definitive as to conditions and scope of this invention. The examples were carried out using standard techniques, which are well known and routine to those of skill in the art, except where otherwise described in detail.
[0190] The following examples demonstrate the ability of exemplary CD20-binding proteins to selectively kill cells which express CD20 on their cell surfaces. The exemplary CD20-binding proteins bound to extracellular antigens on CD20 expressed by targeted cell types and entered the targeted cells. The internalized CD20-binding proteins routed their Shiga toxin effector region to the cytosol to inactivate ribosomes and subsequently caused the apoptotic death of the targeted cells. Thus, the exemplary CD20-binding proteins were capable of internalizing within CD20 expressing cell types by virtue of their Shiga toxin effector regions inducing rapid cellular internalization after the CD20-binding proteins formed a complex with cell surface CD20.
[0191] These exemplary CD20-binding proteins include αCD20scFv::SLT-1A version 1 (SEQ ID NO:4), αCD20scFv::SLT-1A version 2 (SEQ ID NO:16), B9E9-SLT-1A (SEQ ID NO:12), and C2B8-SLT-1A (SEQ ID NO:14).
Example 1--Construction, Production, and Purification of Exemplary CD20-Binding Proteins
[0192] First, a CD20 binding region and a Shiga toxin effector region were designed or selected. In the examples below, the Shiga toxin effector region was derived from the A subunit of Shiga-like Toxin 1 (SLT-1A). A polynucleotide was obtained containing a fragment of SLT-1A cloned into the pECHE9A plasmid and encoding amino acids 1-251 of SLT-1A (Cheung M et al., Mol Cancer 9: 28 (2010)).
[0193] The CD20 binding region was designed as a recombinant scFv derived from the 1H4 CD20 monoclonal antibody (Haisma et al. (1999), Blood 92: 184-90). The two immunoglobulin variable regions (VL and VH) were separated by a linker (SEQ ID NO:18).
[0194] Second, the binding region and Shiga toxin effector region were combined to form a single-chain, recombinant polypeptide. In this example, a polynucleotide encoding the recombinant scFv derived from 1H4 CD20 monoclonal antibody was cloned in frame with a "murine hinge" polynucleotide derived from polynucleotides encoding a murine IgG3 molecule (SEQ ID NO:20) and in frame with a polynucleotide encoding SLT-1A (residues 1-251 of SEQ ID NO:1). The full-length sequence begins with a polynucleotide sequence encoding Strep-Tag® (SEQ ID NO:19), cloned in frame to facilitate detection and purification. The polynucleotide sequence of this example was codon optimized for efficient expression in E. coli using services from DNA 2.0, Inc. (Menlo Park, Calif., U.S.) to produce the expression vector which encoded αCD20scFv::SLT-1A version 1.
[0195] A different CD20-binding protein comprising an influenza antigen was constructed and produced in a similar manner. DNA 2.0, Inc. (Menlo Park, Calif., U.S.) synthesized the multiple polynucleotides, including the antigen sequence (SEQ ID NO:3), and the required polynucleotide components were joined in frame using vector pJ201 to create the open reading frame coding for the following single-chain polypeptide (from amino-terminus to carboxy-terminus): Strep-Tag® (SEQ ID NO:19), the 1H4-derived recombinant scFv (described above), the murine IgG3-derived hinge (SEQ ID NO:20), the antigen (SEQ ID NO:3), and the SLT-1A-derived sequence (residues 1-251 of SEQ ID NO:1). This recombinant polynucleotide was cloned into pTXB1 for polypeptide production purposes. Again, codon optimization for efficient expression in E. coli was performed by DNA 2.0, Inc. (Menlo Park, Calif., U.S.) to produce the expression vector which encoded αCD20scFv::SLT-1A version 2.
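The amino-terminus to carboxy-terminus ordering of the two constructs can be sketched as a simple concatenation. In the sketch below, the Strep-Tag, murine hinge, and influenza antigen sequences are those listed in Table C (SEQ ID NO:19, NO:20, and NO:3); the function name and the bracketed stand-ins for the scFv and the SLT-1A fragment are hypothetical placeholders, used so that no full-length sequence is asserted here.

```python
STREP_TAG = "MWSHPQFEK"             # SEQ ID NO:19 (Table C)
MURINE_HINGE = "EFPKPSTPPGSSGGAP"   # SEQ ID NO:20 (Table C)
INFLUENZA_ANTIGEN = "GILGFVFTL"     # SEQ ID NO:3 (Table C)


def assemble(scfv: str, slt1a: str, antigen: str = "") -> str:
    """Join components in N-to-C order: tag, scFv, hinge, [antigen], toxin."""
    parts = [STREP_TAG, scfv, MURINE_HINGE]
    if antigen:
        # The optional antigen sits between the hinge and the toxin region.
        parts.append(antigen)
    parts.append(slt1a)
    return "".join(parts)


# Hypothetical placeholders standing in for the real component sequences.
SCFV = "[scFv]"
SLT1A = "[SLT-1A 1-251]"

version_1 = assemble(SCFV, SLT1A)
version_2 = assemble(SCFV, SLT1A, antigen=INFLUENZA_ANTIGEN)
print(version_1)
print(version_2)
```

The hinge-antigen junction produced for version 2 matches the corresponding stretch of the MT-3727 polypeptide listed in Table C, which is the reason the antigen is placed after the hinge in this sketch.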
[0196] Third, both versions 1 and 2 of the αCD20scFv::SLT-1A recombinant CD20-binding proteins were produced by using standard techniques for both bacterial and cell-free, protein translation systems. Then the CD20-binding proteins were purified and isolated using techniques well known in the art.
Example 2--Determining the Dissociation Constant (K_D) of Exemplary CD20-Binding Proteins
[0197] The cell binding characteristics of both versions 1 and 2 of the αCD20scFv::SLT-1A CD20-binding proteins were determined by a fluorescence-based flow cytometry assay. Each sample contained 0.5×10^6 cells, either CD20 expressing cells (Raji (CD20+)) or non-expressing cells (BC1 (CD20-)), and was incubated with 100 µL of various dilutions of the CD20-binding proteins in phosphate buffered saline Hyclone 1×PBS (Fisher Scientific, Waltham, Mass.) with 1% bovine serum albumin (BSA) (Calbiochem, San Diego, Calif., U.S.), hereinafter referred to as "1×PBS+1% BSA", for 1 hour at 4 degrees Celsius (°C). The highest concentration of CD20-binding protein was selected to lead to saturation of the reaction. The cells were washed twice with 1×PBS+1% BSA. The cells were incubated for 1 hour at 4 °C with 100 µL of 1×PBS+1% BSA containing 0.3 µg of anti-Strep Tag® mAb-FITC (# A01736-100, Genscript, Piscataway, N.J., U.S.). The cells were washed twice with 1×PBS+1% BSA, suspended in 200 µL of 1×PBS, and subjected to flow cytometry. The baseline corrected mean fluorescence intensity (MFI) data for all the samples were obtained by subtracting the MFI of the FITC alone sample (negative control) from each experimental sample. Graphs were plotted of MFI versus "concentration of protein" using Prism software (GraphPad Software, San Diego, Calif., U.S.). Using the Prism software function of one-site binding [Y=B_max*X/(K_D+X)] under the heading binding-saturation, the B_max and K_D were calculated using baseline corrected data. B_max is the maximum specific binding reported in MFI. K_D is the equilibrium dissociation constant, reported in nanomolar (nM).
[0198] Over multiple experiments, the K.sub.D of .alpha.CD20scFv::SLT-1A version 1 for Raji (CD20+) cells was determined to be about 80-100 nM. In one experiment, the B.sub.max for the .alpha.CD20scFv::SLT-1A version 1 CD20-binding protein binding to CD20+ cells was measured to be about 140,000 MFI with a K.sub.D of about 83 nM (Table 1), whereas no meaningful binding to CD20- cells was observed in this assay. In one experiment, the B.sub.max for .alpha.CD20scFv::SLT-1A version 2 binding to CD20+ cells was measured to be about 110,000 MFI with a K.sub.D of about 101 nM (Table 1), whereas no meaningful binding to CD20- cells was observed in this assay.
TABLE-US-00004 TABLE 1 Binding Characteristics: Representative values for B.sub.max and K.sub.D for exemplary CD20-binding proteins
                                                        Target Positive Cells          Target Negative Cells
CD20-Binding Protein               Target biomolecule   B.sub.max (MFI)  K.sub.D (nM)  B.sub.max (MFI)  K.sub.D (nM)
.alpha.CD20scFv::SLT-1A version 1  CD20                 139,000          82.5          15,800           1,050
.alpha.CD20scFv::SLT-1A version 2  CD20                 112,000          101.0         8,300            280
Example 3--Determining the Half Maximal Inhibitory Concentration (IC.sub.50) of Exemplary CD20-Binding Proteins
[0199] The ribosome inactivation capabilities of both versions 1 and 2 of the .alpha.CD20scFv::SLT-1A CD20-binding proteins were determined using a cell-free, in vitro protein translation assay using the TNT.RTM. Quick Coupled Transcription/Translation kit (L1170 Promega Madison, Wis., U.S.). The kit includes Luciferase T7 Control DNA (L4821 Promega Madison, Wis., U.S.) and TNT.RTM. Quick Master Mix. The ribosome activity reaction was prepared according to the manufacturer's instructions.
[0200] A series of 10-fold dilutions of the .alpha.CD20scFv::SLT-1A version to be tested was prepared in appropriate buffer, and a series of identical TNT reaction mixture components was created for each dilution. Each sample in the dilution series of the .alpha.CD20scFv::SLT-1A proteins was combined with each of the TNT reaction mixtures along with the Luciferase T7 Control DNA. The test samples were incubated for 1.5 hours at 30.degree. C. After the incubation, Luciferase Assay Reagent (E1483 Promega, Madison, Wis., U.S.) was added to all test samples and the amount of luciferase protein translation was measured by luminescence according to the manufacturer's instructions. The level of translational inhibition was determined by non-linear regression analysis of log-transformed concentrations of total protein versus relative luminescence units. Using statistical software (GraphPad Prism, San Diego, Calif., U.S.), the half maximal inhibitory concentration (IC.sub.50) value was calculated for each sample using the Prism software function of log(inhibitor) vs. response (three parameters) [Y=Bottom+((Top-Bottom)/(1+10^(X-Log IC50)))] under the heading dose-response-inhibition. The IC.sub.50 values for the experimental proteins and the SLT-1A-only control protein were calculated. The percent of SLT-1A-only control protein was calculated by [(IC.sub.50 of SLT-1A control protein/IC.sub.50 of experimental protein).times.100].
[0201] The inhibitory effect of both versions of .alpha.CD20scFv::SLT-1A on cell-free protein synthesis was strong. Multiple experiments determined that the IC.sub.50 of both versions of .alpha.CD20scFv::SLT-1A was around 50 picomolar (pM). In one experiment, the IC.sub.50 of .alpha.CD20scFv::SLT-1A version 1 on protein synthesis was about 38 pM or within 19% of the SLT-1A-only positive control (Table 2). Similarly, the IC.sub.50 of .alpha.CD20scFv::SLT-1A version 2 on protein synthesis in this cell-free assay was about 58 pM or within 18% of the SLT-1A-only positive control (Table 2).
TABLE-US-00005 TABLE 2 Ribosome Inactivation: Representative half-maximal inhibitory concentrations (IC.sub.50) for exemplary CD20-binding proteins
CD20-Binding Protein               IC.sub.50 (pM)  IC.sub.50 of SLT-1A-only positive control (pM)  Percentage of IC.sub.50 of SLT-1A control
.alpha.CD20scFv::SLT-1A version 1  38.3            31.2                                            81%
.alpha.CD20scFv::SLT-1A version 2  58.3            47.8                                            82%
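The two formulas used in this example, the three-parameter log(inhibitor) vs. response curve and the percent-of-control ratio, can be written out as a short sketch. The function names are hypothetical; the numeric inputs are the representative IC.sub.50 values reported in Table 2, and the sketch only evaluates the stated formulas, it does not reproduce the Prism regression.

```python
import math


def three_param_response(log_conc, bottom, top, log_ic50):
    """Y = Bottom + (Top - Bottom) / (1 + 10^(X - LogIC50)), with X in log10 units."""
    return bottom + (top - bottom) / (1 + 10 ** (log_conc - log_ic50))


def percent_of_control(ic50_control_pm, ic50_experimental_pm):
    """(IC50 of SLT-1A control protein / IC50 of experimental protein) x 100."""
    return ic50_control_pm / ic50_experimental_pm * 100


if __name__ == "__main__":
    # Representative values from Table 2: (experimental IC50, control IC50) in pM.
    table_2 = {"version 1": (38.3, 31.2), "version 2": (58.3, 47.8)}
    for name, (experimental, control) in table_2.items():
        pct = percent_of_control(control, experimental)
        print(f"{name}: {pct:.1f}% of SLT-1A-only control")

    # At X = log10(IC50) the modeled response is midway between Top and Bottom.
    print(three_param_response(math.log10(38.3), 0.0, 100.0, math.log10(38.3)))
```

A percentage near 100% indicates that fusing the toxin effector to the binding region left its ribosome-inactivating potency close to that of the SLT-1A-only control.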
Example 4--Determining Cellular Internalization by Immunofluorescence Assay
[0202] Immunofluorescence studies were carried out in order to analyze the binding and internalization profiles of .alpha.CD20scFv::SLT-1A version 1 in CD20+ cell lines (Daudi, Raji, and Ramos) as compared to CD20- cell lines (BC-1, Jurkat (J45.01), and U266). For example, 50 nM of the respective CD20-binding proteins were incubated with 0.8.times.10.sup.6 Raji cells for 1 hour at 37.degree. C. to allow for binding and internalization of the CD20-binding protein. The cells were then washed with 1.times.PBS, fixed and permeabilized with BD cytofix/cytoperm (BD Biosciences, San Jose, Calif., U.S.), and then washed twice with 1.times.BD Perm/Wash.TM. Buffer (BD Biosciences, San Jose, Calif., U.S.). The cells were incubated with Alexa Fluor.RTM.-555 labeled mouse anti-SLT-1A antibody (BEI Resources, Manassas, Va., U.S.) in 1.times.BD Perm/Wash.TM. Buffer for 45 minutes at room temperature. Cells were then washed and fixed with BD cytofix (BD Biosciences, San Jose, Calif., U.S.) for 10 minutes at 4.degree. C. The cells were then washed with 1.times.PBS and resuspended in 1.times.PBS, and then the cells were allowed to adhere onto poly-L-lysine coated glass slides (VWR, Radnor, Pa., U.S.). Slides were coverslipped with 4',6-diamidino-2-phenylindole (DAPI)-containing Vectashield (Fisher Scientific, Waltham, Mass., U.S.) and viewed with a Zeiss Fluorescence Microscope (Zeiss, Thornwood, N.Y., U.S.).
[0203] Immunofluorescence studies showed that .alpha.CD20scFv::SLT-1A version 1 and B9E9-SLT-1A bound to cell surfaces and entered into cells expressing CD20 within one hour at 37.degree. C.
Example 5--CD20+ Cell Kill Assay: Determining the Cytotoxic Selectivity and Half-Maximal Cytotoxic Concentrations (CD.sub.50) of CD20-Binding Proteins
[0204] The cytotoxicity profiles of both versions of .alpha.CD20scFv::SLT-1A were determined by a CD20+ cell kill assay. This assay determines the capacity of a CD20-binding protein to kill cells expressing CD20 on a cellular surface as compared to cells that do not express the target biomolecule. Cells were plated (2.times.10.sup.3 per well) in 20 .mu.L media in 384 well plates. The .alpha.CD20scFv::SLT-1A protein to be tested was diluted either 5-fold or 10-fold in 1.times.PBS, and 5 .mu.L of the dilutions or buffer control were added to the cells. Control wells containing only media were used for baseline correction. The cell samples were incubated for 3 days at 37.degree. C. and in an atmosphere of 5% carbon dioxide (CO.sub.2) with the .alpha.CD20scFv::SLT-1A to be tested or only PBS buffer. The total cell survival or percent viability was determined using a luminescent readout using the CellTiter-Glo.RTM. Luminescent Cell Viability Assay (G7573 Promega Madison, Wis., U.S.) according to the manufacturer's instructions. The "percent viability" of experimental wells was calculated using the following equation: (Test RLU - Average Media RLU)/(Average Cells RLU - Average Media RLU).times.100. Log polypeptide concentration versus Percent Viability was plotted using Prism software (GraphPad Prism, San Diego, Calif., U.S.) and log(inhibitor) vs. normalized response (variable slope) analysis was used to determine the half-maximal cytotoxic concentration (CD.sub.50) value for the exemplary CD20-binding proteins. In addition, cell samples from lymphoma patients were analyzed using this cell kill assay to determine the cytotoxicity profile of .alpha.CD20scFv::SLT-1A version 1.
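The percent viability equation in paragraph [0204] can be sketched as follows. The function name and the RLU readings are hypothetical and chosen only to illustrate the arithmetic; "Average Cells RLU" is assumed here to mean the mean signal of wells containing cells treated with buffer only.

```python
from statistics import mean


def percent_viability(test_rlu, media_rlus, cells_rlus):
    """(Test RLU - Average Media RLU) / (Average Cells RLU - Average Media RLU) x 100.

    media_rlus: readings from media-only wells (baseline).
    cells_rlus: readings from cells that received buffer only (assumed 100% viable).
    """
    media_avg = mean(media_rlus)
    cells_avg = mean(cells_rlus)
    return (test_rlu - media_avg) / (cells_avg - media_avg) * 100


if __name__ == "__main__":
    # Hypothetical plate readings, not data from the application.
    media_only = [90, 110]
    untreated_cells = [1000, 1200]
    for rlu in (1100, 600, 100):
        print(rlu, percent_viability(rlu, media_only, untreated_cells))
```

Subtracting the media-only signal from both numerator and denominator means a well with no surviving cells reads as 0% rather than as the background luminescence of the medium.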
[0205] Over multiple experiments, both versions of .alpha.CD20scFv::SLT-1A demonstrated CD20-specific cell kill with 10 to 1000-fold specificity compared to cell kill of CD20 negative cell lines (Table 3). The CD20-specific cell kill profile of both versions of .alpha.CD20scFv::SLT-1A also contrasted with the cell kill by the component SLT-1A (251) alone, which lacked CD20 specificity (Table 3). The CD.sub.50 values of both versions of .alpha.CD20scFv::SLT-1A protein were measured to be about 3-70 nM for CD20+ cells, depending on the cell line, as compared to over 600-2,000 nM for CD20- cell lines (Table 3). The CD.sub.50 of the .alpha.CD20scFv::SLT-1A version 1 CD20-binding protein was over 100 to 400 fold greater (less cytotoxic) for cells which did not express CD20 on a cellular surface as compared to cells expressing CD20 on a cellular surface. The CD.sub.50 of .alpha.CD20scFv::SLT-1A version toward human lymphoma cells from patient samples was about 7-40 nM (Table 3).
TABLE-US-00006 TABLE 3 Selective Cytotoxicity: Representative half-maximal cytotoxic concentrations (CD.sub.50) for exemplary CD20-binding proteins
                                                                   CD.sub.50 (nM)
                                                      CD20         .alpha.CD20scFv::SLT-1A  .alpha.CD20scFv::SLT-1A  SLT-1A only
                                                      status       version 1                version 2                negative control
Cell Line
Daudi                                                 positive     5.6                      67.0                     650
Raji                                                  positive     2.8                      4.5                      1,100
ST486                                                 positive     3.7                      7.0                      940
Ramos                                                 positive     27.0                     33.0                     470
BC-1                                                  negative     2,000                    2,100.0                  160
Jurkat                                                negative     1,400                    600.0                    120
U226                                                  negative     2,500                    not determined           960
Patient Samples
follicular lymphoma, rituximab refractory             positive     7.1                      39.0                     690,000
Burkitt's lymphoma transformed by Epstein-Barr Virus  positive     9.0                      12.0                     960
Example 6--Comparative CD20+ Cell Kill: Determining the Relative Cytotoxicities of CD20-Binding Proteins to CD20+ Cells
[0206] Three potentially cytotoxic CD20-binding proteins were tested using the CD20+ cell kill assay in Raji cells (CD20+) as described above in Example 5. A set of representative results is reported in Table 4. Over multiple experiments, .alpha.CD20scFv::SLT-1A version 1 exhibited a 50 to 100-fold greater cell kill function as compared to the CD20-binding protein B9E9 (SEQ ID NO: 12) (Table 4).
TABLE-US-00007 TABLE 4 Representative Half-Maximal Cytotoxic Concentrations (CD.sub.50) for Exemplary CD20-Binding Proteins to CD20+ Raji Cells
CD20-Binding Protein               CD.sub.50 (nM)
SLT-1A only negative control       429
.alpha.CD20scFv::SLT-1A version 1  2
.alpha.CD20scFv-B9E9::SLT-1A       103
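The fold comparison stated in paragraph [0206] follows from a ratio of CD.sub.50 values, since a lower CD.sub.50 means greater cytotoxic potency. The sketch below applies that ratio to the representative Table 4 values; the function and dictionary names are hypothetical.

```python
def fold_difference(cd50_comparator_nm, cd50_reference_nm):
    """How many times higher the comparator CD50 is than the reference CD50.

    A larger ratio means the reference protein is more potent than the comparator.
    """
    return cd50_comparator_nm / cd50_reference_nm


# Representative CD50 values (nM) for CD20+ Raji cells, taken from Table 4.
TABLE_4_CD50_NM = {
    "SLT-1A only negative control": 429,
    "aCD20scFv::SLT-1A version 1": 2,
    "aCD20scFv-B9E9::SLT-1A": 103,
}

if __name__ == "__main__":
    reference = TABLE_4_CD50_NM["aCD20scFv::SLT-1A version 1"]
    for name, cd50 in TABLE_4_CD50_NM.items():
        print(f"{name}: {fold_difference(cd50, reference):.1f}-fold vs. version 1")
```

For the single representative experiment in Table 4, the ratio for the B9E9 construct falls at the lower end of the 50 to 100-fold range reported over multiple experiments.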
Example 7--Determining the Targeted Cytotoxicity for CD20-Binding Proteins Using In Vivo Xenograft Studies
[0207] Two xenograft model systems based on immunocompromised mouse strains were used to study the ability of exemplary CD20-binding proteins to kill CD20+ tumor cells in vivo and in a tumor environment over time and for various dosages. These xenograft model systems rely on well-characterized mouse strains that lack graft versus host responses, among other immune system deficiencies. First, an intravenous tumor model was studied using SCID (severe combined immune deficiency) mice to create disseminated tumors throughout the mice in order to test the in vivo effects of exemplary CD20-binding proteins on human tumor cells. Second, a subcutaneous tumor model was studied using BALBc/nude mice to create subcutaneous tumors on the mice, again in order to test the in vivo effects of exemplary CD20-binding proteins on human tumor cells.
[0208] For the first xenograft system, thirty-two C.B.-17 SCID mice (in four groups of eight animals) were challenged with 1.times.10' Raji-luc human lymphoma derived cells (Molecular Imaging, Ann Arbor, Mich., U.S.) in 200 .mu.L PBS. On days 5-9 and 12-16 following tumor challenge, the groups received the following through intravenous administration: Group 1: PBS; Group 2: .alpha.CD20scFv::SLT-1A version 2 at a dose of 2 mg/kg; Group 3: .alpha.CD20scFv::SLT-1A version 1 at a dose of 2 mg/kg; and Group 4: .alpha.CD20scFv::SLT-1A version 1 at a dose of 4 mg/kg (days 5-9 only). Bioluminescence, in 1.times.10.sup.6 photons/second units (p/s), was measured on days 5, 10, 15, and 20 using a Caliper IVIS 50 optical imaging system (Perkin Elmer, Waltham, Mass., U.S.). FIG. 2 shows how both versions of .alpha.CD20scFv::SLT-1A, and .alpha.CD20scFv::SLT-1A version 1 at both dosage levels, resulted in statistically significantly less total bioluminescence compared to the PBS control. The decrease in total bioluminescence was reflective of statistically significant reductions in disseminated tumor burdens after treatment with a CD20-binding protein of the invention. FIG. 3 indicates a statistically significant increase in survival with administration of either version of .alpha.CD20scFv::SLT-1A. The mean survival age was increased by five days with all treatments compared to the PBS negative control.
[0209] For the second xenograft model, twenty-eight BALBc/nude mice (in four groups of six or seven animals) were challenged subcutaneously with 2.5.times.10.sup.6 Raji human lymphoma cells (Washington Biotechnology, Simpsonville, Md., U.S.). Tumor volume was determined using standard methods known in the art utilizing calipers. Day 0 was set at the point when the mean tumor volume for each mouse reached approximately 160 mm.sup.3 (one mouse from each group had a tumor greater than 260 mm.sup.3 so it was excluded). On days 0-4 and 7-11 the groups received intravenous administration of the following by group: Group 1: PBS; Group 2: .alpha.CD20scFv::SLT-1A version 2 at a dose of 2 mg/kg; Group 3: .alpha.CD20scFv::SLT-1A version 1 at a dose of 2 mg/kg; Group 4: .alpha.CD20scFv::SLT-1A version 1 at a dose of 4 mg/kg. Tumor volume was measured and graphed as a function of day of study. FIG. 4 demonstrates how treatment with .alpha.CD20scFv::SLT-1A version 1 (at both dosage levels) resulted in significantly reduced tumor volume compared to the PBS control through to Day 24. This is also reflected in the tumor free mouse number through Day 54, reported in Table 5.
TABLE-US-00008 TABLE 5 Elimination of Tumors by Exemplary CD20-Binding Proteins in a Subcutaneous-Tumor Mouse Model
Group                                       Tumor Free Mice/Total Mice
PBS                                         0/7
.alpha.CD20scFv::SLT-1A version 2, 2 mg/kg  6/7
.alpha.CD20scFv::SLT-1A version 1, 2 mg/kg  5/6
.alpha.CD20scFv::SLT-1A version 1, 4 mg/kg  6/7
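Paragraph [0209] states only that tumor volume was determined with calipers by standard methods and does not name a formula. As an illustration, the sketch below uses one commonly used caliper approximation, V = (length x width.sup.2)/2; this choice is an assumption and may not be the formula used in the study. The function name and the measurements are hypothetical.

```python
def caliper_tumor_volume_mm3(length_mm, width_mm):
    """Approximate tumor volume from two caliper measurements.

    Uses the common approximation V = (length * width^2) / 2, where length is
    the longest diameter and width the perpendicular diameter. This formula is
    an assumption; the application does not specify which method was used.
    """
    return length_mm * width_mm ** 2 / 2


if __name__ == "__main__":
    # Hypothetical measurements (mm), not data from the application.
    for length, width in [(8.0, 6.3), (10.0, 8.0)]:
        volume = caliper_tumor_volume_mm3(length, width)
        print(f"{length} x {width} mm -> {volume:.0f} mm^3")
```

With this approximation, a tumor of roughly 8 mm by 6.3 mm corresponds to a volume near the 160 mm.sup.3 enrollment point described for Day 0.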
Example 8--Determining In Vivo Effects of a CD20-Binding Protein in Non-Human Primates
[0210] The exemplary CD20-binding protein .alpha.CD20scFv::SLT-1A version 1 was administered to non-human primates in order to test for in vivo effects. In vivo depletion of peripheral blood B lymphocytes in cynomolgus primates was observed after parenteral administration of different doses of .alpha.CD20scFv::SLT-1A version 1.
[0211] In one experiment, ten cynomolgus primates were intravenously injected with PBS or .alpha.CD20scFv::SLT-1A version 1 at different doses (50, 150, and 450 micrograms drug/kilogram body weight (mcg/kg)) on alternate days for 2 weeks. Then, peripheral blood samples collected prior to dosing on days 3 and 8 were analyzed for the percentage of B-lymphocytes which expressed CD20 (FIGS. 5 and 6). In cynomolgus monkeys, two distinct B-cell subsets have been described by flow-cytometry: (1) CD21 negative, CD40 positive cells which express high levels of CD20, and (2) CD21 positive and CD40 positive cells which express lower levels of CD20 (Vugmeyster Y et al., Cytometry 52: 101-9 (2003)). Dose-dependent B-cell depletion as compared to baseline levels from blood samples collected prior to treatment was observed on day 3 (4, 14 and 45% decrease in animals dosed at 50, 150 and 450 mcg/kg) and day 8 (32, 52 and 75% decrease in animals dosed at 50, 150 and 450 mcg/kg) (Table 6). This experiment showed that .alpha.CD20scFv::SLT-1A version 1 was capable of killing CD20 positive, primate B-cells in vivo.
TABLE-US-00009 TABLE 6 CD20-Binding Protein Dose Dependent B-Cell Depletion in Non-Human Primates
      Percent Decrease in CD40+, CD20+ Cells (%)    Percent Decrease in CD21+, CD40+, CD20+ Cells (%)
Day   50 mcg/kg   150 mcg/kg   450 mcg/kg           50 mcg/kg   150 mcg/kg   450 mcg/kg
3     38          57           69                   4           14           45
8     65          81           86                   32          52           75
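The percent decreases in Table 6 are expressed relative to pre-treatment baseline levels. A minimal sketch of that calculation is given below; the function name is hypothetical, and the baseline and post-dose percentages are invented for illustration because the application reports only the resulting percent decreases.

```python
def percent_decrease(baseline_value, post_dose_value):
    """Percent decrease relative to the pre-treatment baseline."""
    return (baseline_value - post_dose_value) / baseline_value * 100


if __name__ == "__main__":
    # Hypothetical percentages of CD20+ B cells in peripheral blood,
    # not measurements from the application.
    baseline_pct_b_cells = 20.0
    for day, observed in [(3, 11.0), (8, 5.0)]:
        change = percent_decrease(baseline_pct_b_cells, observed)
        print(f"day {day}: {change:.0f}% decrease from baseline")
```

Normalizing to each animal's own baseline allows depletion to be compared across dose groups even when starting B-cell levels differ between animals.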
Example 9--A CD20-Binding Protein Derived from the A Subunit of Shiga-Like Toxin-1 and the Antibody Ofatumumab
[0212] In this example, the Shiga toxin effector region is derived from the A subunit of Shiga-like Toxin 1 (SLT-1A). An immunoglobulin-type binding region .alpha.CD20 is derived from the monoclonal antibody ofatumumab (Gupta I, Jewell R, Ann N Y Acad Sci 1263: 43-56 (2012)) which comprises an immunoglobulin-type binding region capable of binding human CD20.
Construction, Production, and Purification of the CD20-Binding Protein SLT-1A::.alpha.CD20
[0213] The immunoglobulin-type binding region .alpha.CD20 and Shiga toxin effector region are linked together to form a protein. For example, a fusion protein is produced by expressing a polynucleotide encoding the CD20-binding protein SLT-1A::.alpha.CD20. Expression of the SLT-1A::.alpha.CD20 CD20-binding protein is accomplished using bacterial and/or cell-free protein translation systems as described in the previous examples.
Determining the In Vitro Characteristics of the CD20-Binding Protein SLT-1A::.alpha.CD20
[0214] The binding characteristics of the CD20-binding protein of this example for CD20+ cells and CD20- cells are determined by a fluorescence-based, flow-cytometry assay as described above in the previous examples. The B.sub.max for SLT-1A::.alpha.CD20 binding to CD20+ cells is measured to be approximately 50,000-200,000 MFI with a K.sub.D within the range of 0.01-100 nM, whereas there is no significant binding to CD20- cells in this assay.
[0215] The ribosome inactivation capabilities of the SLT-1A::.alpha.CD20 CD20-binding protein are determined in a cell-free, in vitro protein translation assay as described above in the previous examples. The inhibitory effect of the CD20-binding protein of this example on cell-free protein synthesis is significant. The IC.sub.50 of SLT-1A::.alpha.CD20 on protein synthesis in this cell-free assay is approximately 0.1-100 pM.
Determining the Cytotoxicity of the CD20-Binding Protein SLT-1A::.alpha.CD20 Using a Cell-Kill Assay
[0216] The cytotoxicity characteristics of SLT-1A::.alpha.CD20 are determined by the general cell-kill assay as described above in the previous examples using CD20+ cells. In addition, the selective cytotoxicity characteristics of SLT-1A::.alpha.CD20 are determined by the same general cell-kill assay using CD20- cells as a comparison to the CD20+ cells. The CD.sub.50 of the CD20-binding protein of this example is approximately 0.01-100 nM for CD20+ cells depending on the cell line. The CD.sub.50 of the CD20-binding protein is approximately 10-10,000 fold greater (less cytotoxic) for cells not expressing CD20 on a cellular surface as compared to cells which do express CD20 on a cellular surface.
Determining the In Vivo Effects of the CD20-Binding Protein SLT-1A::.alpha.CD20 Using Animal Models
[0217] Animal models are used to determine the in vivo effects of the CD20-binding protein SLT-1A::.alpha.CD20 on neoplastic cells. Various mouse strains are used to test the effect of the CD20-binding protein after intravenous administration on xenograft tumors in mice resulting from the injection into those mice of human neoplastic cells which express CD20 on their cell surfaces. Non-human primates may be used to test the effect of SLT-1A::.alpha.CD20 on peripheral blood B-cells as described above in Example 8.
Example 10--CD20-Binding Proteins Based on Various CD20 Binding Domains
[0218] In this example, the Shiga toxin effector region is derived from the A subunit of Shiga-like Toxin 1 (SLT-1A), Shiga toxin (StxA), and/or Shiga-like Toxin 2 (SLT-2A). An immunoglobulin-type binding region is derived from the immunoglobulin domain from the molecule chosen from Table 7 and which binds an extracellular part of CD20. The exemplary cytotoxic proteins of this example are created and tested as described in the previous examples using CD20+ cells expressing CD20 on a cellular surface.
TABLE-US-00010 TABLE 7 Exemplary CD20 Binding Domains
Source of CD20 Binding Domain
ibritumomab
obinutuzumab
ocaratuzumab
ocrelizumab
ofatumumab
rituximab
tositumomab
ublituximab
CD20 binding scFv(s) in Geng S et al., Cell Mol Immunol 3: 439-43 (2006)
CD20 binding scFv(s) in Olafsen T et al., Protein Eng Des Sel 23: 243-9 (2010)
[0219] While certain embodiments of the invention have been described by way of illustration, it will be apparent that the invention may be put into practice with many modifications, variations and adaptations, and with the use of numerous equivalents or alternative solutions that are within the scope of persons skilled in the art, without departing from the spirit of the invention or exceeding the scope of the claims.
[0220] All publications, patents, and patent applications are herein incorporated by reference in their entirety to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety. The U.S. provisional patent application 61/777,130 is incorporated by reference in its entirety. The complete disclosures of all electronically available biological sequence information from GenBank (National Center for Biotechnology Information, U.S.) for amino acid and nucleotide sequences cited herein are each incorporated herein by reference in their entirety.
TABLE-US-00011 Sequence Listing ID Number Text Description Sequence SEQ ID NO: 1 Shiga-like Toxin 1 A KEFTLDFSTAKTYVDSLNVIRS Subunit (SLT-1A) AIGTPLQTISSGGTSLLMIDSG SGDNLFAVDVRGIDPEEGRFNN LRLIVERNNLYVTGFVNRTNNV FYRFADFSHVTFPGTTAVTLSG DSSYTTLQRVAGISRTGMQINR HSLTTSYLDLMSHSGTSLTQSV ARAMLRFVTVTAEALRFRQIQR GFRTTLDDLSGRSYVMTAEDVD LTLNWGRLSSVLPDYHGQDSVR VGRISFGSINAILGSVALILNC HHHASRVARMASDEFPSMCPAD GRVRGITHNKILWDSSTLGAIL MRRTISS SEQ ID NO: 2 polynucleotide aargarttyacnytngayttywsnacngcn encoding SLT-1 A aaracntaygtngaywsnytnaaygtnath Subunit (consensus) mgnwsngcnathggnacnccnytncaracn athwsnwsnggnggnacnwsnytnytnatg athgaywsnggnwsnggngayaayytntty gcngtngaygtnmgnggnathgayccngar garggnmgnttyaayaayytnmgnytnath gtngarmgnaayaayytntaygtnacnggn ttygtnaaymgnacnaayaaygtnttytay mgnttygcngayttywsncaygtnacntty ccnggnacnacngcngtnacnytnwsnggn gaywsnwsntayacnacnytncarmgngtn gcnggnathwsnmgnacnggnatgcarath aaymgncaywsnytnacnacnwsntayytn gayytnatgwsncaywsnggnacnwsnytn acncarwsngtngcnmgngcnatgytnmgn ttygtnacngtnacngcngargcnytnmgn ttymgncarathcarmgnggnttymgnacn acnytngaygayytnwsnggnmgnwsntay gtnatgacngcngargaygtngayytnacn ytnaaytggggnmgnytnwsnwsngtnytn ccngaytaycayggncargaywsngtnmgn gtnggnmgnathwsnttyggnwsnathaay gcnathytnggnwsngtngcnytnathytn aaytgycaycaycaygcnwsnmgngtngcn mgnatggcnwsngaygarttyccnwsnatg tgyccngcngayggnmgngtnmgnggnath acncayaayaarathytntgggaywsnwsn acnytnggngcnathytnatgmgnmgnacn athwsnwsntrr SEQ ID NO: 3 linker extension for GILGFVFTL anti-CD20-scFv::SLT- 1A version 2 (influenza Matrix 58- 66) polypeptide SEQ ID NO: 4 anti-CD20-scFv::SLT- MQVQLQQPGAELVKPGASNKMS 1A version 1 (MT- CKTSGYTFTSYNVHWVKQTPGQ 3724) polypeptide GLEWIGAIYPGNGDTSFNQKFK GKATLTADKSSSTVYMQLSSLT SEDSAVYYCARSNYYGSSYVWF FDVWGAGTTVTVSSGSTSGSGK PGSGEGSQIVLSQSPTILSASP GEKVTMTCRASSSVSYMDWYQQ KPGSSPKPWIYATSNLASGVPA RFSGSGSGTSYSLTISRVEAED AATYYCQQWISNPPTFGAGTKL ELKEFPKPSTPPGSSGGAPKEF TLDFSTAKTYVDSLNVIRSAIG TPLQTISSGGTSLLMIDSGSGD NLFAVDVRGIDPEEGRFNNLRL IVERNNLYVTGFVNRTNNVFYR FADFSHVTFPGTTAVTLSGDSS 
YTTLQRVAGISRTGMQINRHSL TTSYLDLMSHSGTSLTSVARAM LRFVTVTAEALRFRQIQRGFRT TLDDLSGRSYVMTAEDVDLTLN WGRLSSVLPDYHGQDSVRVGRI SFGSINAILGSVALILNCHHHA SRVAR SEQ ID NO: 5 polynucleotide atgcargtncarytncarcarccnggngcn encoding anti-CD20- garytngtnaarccnggngcnwsngtnaar scFv::SLT-1A version atgwsntgyaaracnwsnggntayacntty 1 (MT-3724) acnwsntayaaygtncaytgggtnaarcar acnccnggncarggnytngartggathggn gcnathtayccnggnaayggngayacnwsn ttyaaycaraarttyaarggnaargcnacn ytnacngcngayaarwsnwsnwsnacngtn tayatgcarytnwsnwsnytnacnwsngar gaywsngcngtntaytaytgygcnmgnwsn aaytaytayggnwsnwsntaygtntggtty ttygaygtntggggngcnggnacnacngtn acngtnwsnwsnggnwsnacnwsnggnwsn ggnaarccnggnwsnggngarggnwsncar athgtnytnwsncarwsnccnacnathytn wsngcnwsnccnggngaraargtnacnatg acntgymgngcnwsnwsnwsngtnwsntay atggaytggtaycarcaraarccnggnwsn wsnccnaarccntggathtaygcnacnwsn aayytngcnwsnggngtnccngcnmgntty wsnggnwsnggnwsnggnacnwsntaywsn ytnacnathwsnmgngtngargcngargay gcngcnacntaytaytgycarcartggath wsnaayccnccnacnttyggngcnggnacn aarytngarytnaargarttyccnaarccn wsnacnccnccnggnwsnwsnggnggngcn ccnaargarttyacnytngayttywsnacn gcnaaracntaygtngaywsnytnaaygtn athmgnwsngcnathggnacnccnytncar acnathwsnwsnggnggnacnwsnytnytn atgathgaywsnggnwsnggngayaayytn ttygcngtngaygtnmgnggnathgayccn gargarggnmgnttyaayaayytnmgnytn athgtngarmgnaayaayytntaygtnacn ggnttygtnaaymgnacnaayaaygtntty taymgnttygcngayttywsncaygtnacn ttyccnggnacnacngcngtnacnytnwsn ggngaywsnwsntayacnacnytncarmgn gtngcnggnathwsnmgnacnggnatgcar athaaymgncaywsnytnacnacnwsntay ytngayytnatgwsncaywsnggnacnwsn ytnacncarwsngtngcnmgngcnatgytn mgnttygtnacngtnacngcngargcnytn mgnttymgncarathcarmgnggnttymgn acnacnytngaygayytnwsnggnmgnwsn taygtnatgacngcngargaygtngayytn acnytnaaytggggnmgnytnwsnwsngtn ytnccngaytaycayggncargaywsngtn mgngtnggnmgnathwsnttyggnwsnath aaygcnathytnggnwsngtngcnytnath ytnaaytgycaycaycaygcnwsnmgngtn gcnmgn SEQ ID NO: 6 heavy chain CDR1 GYTFTSYNVH SEQ ID NO: 7 heavy chain CDR2 AIYPGNGDTSFNQKFKG SEQ ID NO: 8 heavy chain CDR3 SNYYGSSYVWFFDV SEQ ID NO: 9 light chain CDR1 RASSSVSYMD SEQ ID NO: 
10 light chain CDR2 ATSNLAS SEQ ID NO: 11 light chain CDR3 QQWISNPPT SEQ ID NO: 12 anti-CD20-scFv- MQVQLVQSGAELVKPGASVKMS B9E9::SLT-1A CKASGYTFTSYNMHWVKQTPGQ polypeptide GLEWIGAIYPGNGDTSYNQKFK GKATLTADKSSSTAYMQLSSLT SEDSAVYYCARAQLRPNYWYFD VWGAGTTVTVSSGGGGSGGGGS GGGGSGGGGSGGGGSDIVLSQS PAILSASPGEKVTMTCRASSSV SYMHWYQQKPGSSPKPWIYATS NLASGVPARFSGSGSGTSYSLT ISRVEAEDAATYYCQQWISNPP TFGAGTKLELKGGGGSGGKEFT LDFSTAKTYVDSLNVIRSAIGT PLQTISSGGTSLLMIDSGSGDN LFAVDVRGIDPEEGRFNNLRLI VERNNLYVTGFVNRTNNVFYRF ADFSHVTFPGTTAVTLSGDSSY TTLQRVAGISRTGMQINRHSLT TSYLDLMSHSGTSLTQSVARAM LRFVTVTAEALRFRQIQRGFRT TLDDLSGRSYVMTAEDVDLTLN WGRLSSVLPDYHGQDSVRVGRI SFGSINAILGSVALILNCHHHA SRVAR SEQ ID NO: 13 polynucleotide atgcargtncarytngtncarwsnggngcn encoding anti-CD20- garytngtnaarccnggngcnwsngtnaar scFv-B9E9::SLT-1A atgwsntgyaargcnwsnggntayacntty (consensus) acnwsntayaayatgcaytgggtnaarcar acnccnggncarggnytngartggathggn gcnathtayccnggnaayggngayacnwsn tayaaycaraarttyaarggnaargcnacn ytnacngcngayaarwsnwsnwsnacngcn tayatgcarytnwsnwsnytnacnwsngar gaywsngcngtntaytaytgygcnmgngcn carytnmgnccnaaytaytggtayttygay gtntggggngcnggnacnacngtnacngtn wsnwsnggnggnggnggnwsnggnggnggn ggnwsnggnggnggnggnwsnggnggnggn ggnwsnggnggnggnggnwsngayathgtn ytnwsncarwsnccngcnathytnwsngcn wsnccnggngaraargtnacnatgacntgy mgngcnwsnwsnwsngtnwsntayatgcay tggtaycarcaraarccnggnwsnwsnccn aarccntggathtaygcnacnwsnaayytn gcnwsnggngtnccngcnmgnttywsnggn wsnggnwsnggnacnwsntaywsnytnacn athwsnmgngtngargcngargaygcngcn acntaytaytgycarcartggathwsnaay ccnccnacnttyggngcnggnacnaarytn garytnaarggnggnggnggnwsnggnggn aargarttyacnytngayttywsnacngcn aaracntaygtngaywsnytnaaygtnath mgnwsngcnathggnacnccnytncaracn athwsnwsnggnggnacnwsnytnytnatg athgaywsnggnwsnggngayaayytntty gcngtngaygtnmgnggnathgayccngar garggnmgnttyaayaayytnmgnytnath gtngarmgnaayaayytntaygtnacnggn ttygtnaaymgnacnaayaaygtnttytay mgnttygcngayttywsncaygtnacntty ccnggnacnacngcngtnacnytnwsnggn gaywsnwsntayacnacnytncarmgngtn gcnggnathwsnmgnacnggnatgcarath aaymgncaywsnytnacnacnwsntayytn 
gayytnatgwsncaywsnggnacnwsnytn acncarwsngtngcnmgngcnatgytnmgn ttygtnacngtnacngcngargcnytnmgn ttymgncarathcarmgnggnttymgnacn acnytngaygayytnwsnggnmgnwsntay gtnatgacngcngargaygtngayytnacn ytnaaytggggnmgnytnwsnwsngtnytn ccngaytaycayggncargaywsngtnmgn gtnggnmgnathwsnttyggnwsnathaay gcnathytnggnwsngtngcnytnathytn aaytgycaycaycaygcnwsnmgngtngcn mgn SEQ ID NO: 14 anti-CD20-scFv- MQVQLQQPGAELVKPGASVKMS C2B8::SLT-1A CKASGYTFTSYNMHWVKQTPGR polypeptide GLEWIGAIYPGNGDTSYNQKFK GKATLTADKSSSTAYMQLSSLT SEDSAVYYCARSTYYGGDWYFN VWGAGTTVTVSAGSTSGSGKPG SGEGSTKGQIVLSQSPAILSAS PGEKVTMTCRASSSVSYIHWFQ QKPGSSPKPWIYATSNLASGVP VRFSGSGSGTSYSLTISRVEAE DAATYYCQQWTSNPPTFGGGTK LEIKEFPKPSTPPGSSGGAPKE FTLDFSTAKTYVDSLNVIRSAI GTPLQTISSGGTSLLMIDSGSG DNLFAVDVRGIDPEEGRFNNLR LIVERNNLYVTGFVNRTNNVFY RFADFSHVTFPGTTAVTLSGDS SYTTLQRVAGISRTGMQINRHS LTTSYLDLMSHSGTSLTQSVAR AMLRFVTVTAEALRFRQIQRGF RTTLDDLSGRSYVMTAEDVDLT LNWGRLSSVLPDYHGQDSVRVG RISFGSINAILGSVALILNCHH HASRVAR SEQ ID NO: 15 anti-CD20-scFv- atgcargtncarytncarcarccnggngcn
C2B8::SLT-1A garytngtnaarccnggngcnwsngtnaar polynucleotide atgwsntgyaargcnwsnggntayacntty (consensus) acnwsntayaayatgcaytgggtnaarcar acnccnggnmgnggnytngartggathggn gcnathtayccnggnaayggngayacnwsn tayaaycaraarttyaarggnaargcnacn ytnacngcngayaarwsnwsnwsnacngcn tayatgcarytnwsnwsnytnacnwsngar gaywsngcngtntaytaytgygcnmgnwsn acntaytayggnggngaytggtayttyaay gtntggggngcnggnacnacngtnacngtn wsngcnggnwsnacnwsnggnwsnggnaar ccnggnwsnggngarggnwsnacnaarggn carathgtnytnwsncarwsnccngcnath ytnwsngcnwsnccnggngaraargtnacn atgacntgymgngcnwsnwsnwsngtnwsn tayathcaytggttycarcaraarccnggn wsnwsnccnaarccntggathtaygcnacn wsnaayytngcnwsnggngtncengtnmgn ttywsnggnwsnggnwsnggnacnwsntay wsnytnacnathwsnmgngtngargcngar gaygcngcnacntaytaytgycarcartgg acnwsnaayccnccnacnttyggnggnggn acnaarytngarathaargarttyccnaar ccnwsnacnccnccnggnwsnwsnggnggn gcnccnaargarttyacnytngayttywsn acngcnaaracntaygtngaywsnytnaay gtnathmgnwsngcnathggnacnccnytn caracnathwsnwsnggnggnacnwsnytn ytnatgathgaywsnggnwsnggngayaay ytnttygcngtngaygtnmgnggnathgay ccngargarggnmgnttyaayaayytnmgn ytnathgtngarmgnaayaayytntaygtn acnggnttygtnaaymgnacnaayaaygtn ttytaymgnttygcngayttywsncaygtn acnttyccnggnacnacngcngtnacnytn wsnggngaywsnwsntayacnacnytncar mgngtngcnggnathwsnmgnacnggnatg carathaaymgncaywsnytnacnacnwsn tayytngayytnatgwsncaywsnggnacn wsnytnacncarwsngtngcnmgngcnatg ytnmgnttygtnacngtnacngcngargcn ytnmgnttymgncarathcarmgnggntty mgnacnacnytngaygayytnwsnggnmgn wsntaygtnatgacngcngargaygtngay ytnacnytnaaytggggnmgnytnwsnwsn gtnytnccngaytaycayggncargaywsn gtnmgngtnggnmgnathwsnttyggnwsn athaaygcnathytnggnwsngtngcnytn athytnaaytgycaycaycaygcnwsnmgn gtngcnmgn SEQ ID NO: 16 anti-CD20-scFv::SLT- MQVQLQQPGAELVKPGASVKMS 1A version 2 (MT- CKTSGYTFTSYNVHWVKQTPGQ 3727) polypeptide GLEWIGAIYPGNGDTSFNQKFK GKATLTADKSSSTVYMQLSSLT SEDSAVYYCARSNYYGSSYVWF FDVWGAGTTVTVSSGSTSGSGK PGSGEGSQIVLSQSPTILSASP GEKVTMTCRASSSVSYMDWYQQ KPGSSPKPWIYATSNLASGVPA RFSGSGSGTSYSLTISRVEAED AATYYCQQWISNPPTFGAGTKL ELKEFPKPSTPPGSSGGAPGIL GFVFTLKEFTLDFSTAKTYVDS LNVIRSAIGTPLQTISSGGTSL 
LMIDSGSGDNLFAVDVRGIDPE EGRFNNLRLIVERNNLYVTGFV NRTNNVFYRFADFSHVTFPGTT AVTLSGDSSYTTLQRVAGISRT GMQINRHSLTTSYLDLMSHSGT SLTQSVARAMLRFVTVTAEALR FRQIQRGFRTTLDDLSGRSYVM TAEDVDLTLNWGRLSSVLPDYH GQDSVRVGRISFGSINAILGSV ALILNCHHHASRVAR SEQ ID NO: 17 Polynucleotide atgcargtncarytncarcarccnggngcn encoding anti-CD20- garytngtnaarccnggngcnwsngtnaar scFv::SLT-1A version atgwsntgyaaracnwsnggntayacntty 2 (MT-3727) acnwsntayaaygtncaytgggtnaarcar (consensus) acnccnggncarggnytngartggathggn gcnathtayccnggnaayggngayacnwsn ttyaaycaraarttyaarggnaargcnacn ytnacngcngayaarwsnwsnwsnacngtn tayatgcarytnwsnwsnytnacnwsngar gaywsngcngtntaytaytgygcnmgnwsn aaytaytayggnwsnwsntaygtntggtty ttygaygtntggggngcnggnacnacngtn acngtnwsnwsnggnwsnacnwsnggnwsn ggnaarccnggnwsnggngarggnwsncar athgtnytnwsncarwsnccnacnathytn wsngcnwsnccnggngaraargtnacnatg acntgymgngcnwsnwsnwsngtnwsntay atggaytggtaycarcaraarccnggnwsn wsnccnaarccntggathtaygcnacnwsn aayytngcnwsnggngtnccngcnmgntty wsnggnwsnggnwsnggnacnwsntaywsn ytnacnathwsnmgngtngargcngargay gcngcnacntaytaytgycarcartggath wsnaayccnccnacnttyggngcnggnacn aarytngarytnaargarttyccnaarccn wsnacnccnccnggnwsnwsnggnggngcn ccnggnathytnggnttygtnttyacnytn aargarttyacnytngayttywsnacngcn aaracntaygtngaywsnytnaaygtnath mgnwsngcnathggnacnccnytncaracn athwsnwsnggnggnacnwsnytnytnatg athgaywsnggnwsnggngayaayytntty gcngtngaygtnmgnggnathgayccngar garggnmgnttyaayaayytnmgnytnath gtngarmgnaayaayytntaygtnacnggn ttygtnaaymgnacnaayaaygtnttytay mgnttygcngayttywsncaygtnacntty ccnggnacnacngcngtnacnytnwsnggn gaywsnwsntayacnacnytncarmgngtn gcnggnathwsnmgnacnggnatgcarath aaymgncaywsnytnacnacnwsntayytn gayytnatgwsncaywsnggnacnwsnytn acncarwsngtngcnmgngcnatgytnmgn ttygtnacngtnacngcngargcnytnmgn ttymgncarathcarmgnggnttymgnacn acnytngaygayytnwsnggnmgnwsntay gtnatgacngcngargaygtngayytnacn ytnaaytggggnmgnytnwsnwsngtnytn ccngaytaycayggncargaywsngtnmgn gtnggnmgnathwsnttyggnwsnathaay gcnathytnggnwsngtngcnytnathytn aaytgycaycaycaygcnwsnmgngtngcn mgn SEQ ID NO: 18 linker (218) GSTSGSGKPGSGEGS SEQ ID NO: 19 Strep-tag 
® WSHPQFEK
SEQ ID NO: 20 murine hinge (murine IgG3) EFPKPSTPPGSSGGAP
SEQ ID NO: 21 heavy chain CDR1 GYTFTSYNMH
SEQ ID NO: 22 heavy chain CDR2 AIYPGNGDTSYNQKFKG
SEQ ID NO: 23 heavy chain CDR3 AQLRPNYWYFDV
SEQ ID NO: 24 light chain CDR1 RASSSVSYMH
SEQ ID NO: 25 Shiga toxin Subunit A (StxA) KEFTLDFSTAKTYVDSLNVIRS AIGTPLQTISSGGTSLLMIDSG TGDNLFAVDVRGIDPEEGRFNN LRLIVERNNLYVTGFVNRTNNV FYRFADFSHVTFPGTTAVTLSG DSSYTTLQRVAGISRTGMQINR HSLTTSYLDLMSHSGTSLTQSV ARAMLRFVTVTAEALRFRQIQR GFRTTLDDLSGRSYVMTAEDVD LTLNWGRLSSVLPDYHGQDSVR VGRISFGSINAILGSVALILNC HHHASRVARMASDEFPSMCPAD GRVRGITHNKILWDSSTLGAIL MRRTISS
SEQ ID NO: 26 Shiga-like toxin 2 Subunit A (SLT-2A) DEFTVDFSSQKSYVDSLNSIRS AISTPLGNISQGGVSVSVINHV LGGNYISLNVRGLDPYSERFNH LRLIMERNNLYVAGFINTETNI FYRFSDFSHISVPDVITVSMTT DSSYSSLQRIADLERTGMQIGR HSLVGSYLDLMEFRGRSMTRAS SRAMLRFVTVIAEALRFRQIQR GFRPALSEASPLYTMTAQDVDL TLNWGRISNVLPEYRGEEGVRI GRISFNSLSAILGSVAVILNCH STGSYSVRSVSQKQKTECQIVG DRAAIKVNNVLWEANTIAALLN RKPQDLTEPNQ
SEQ ID NO: 27 heavy chain CDR3 STYYGGDWYFNV
SEQ ID NO: 28 light chain CDR1 RASSSVSYIH
SEQ ID NO: 29 light chain CDR3 QWTSNPPT
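The consensus polynucleotides in this table (for example SEQ ID NO: 17) are written with degenerate nucleotide codes, one three-letter pattern per encoded amino acid. The following minimal sketch illustrates that relationship. It assumes a codon table read off the consensus sequences themselves. The names `DEGENERATE_CODON`, `back_translate`, and `is_consistent` are hypothetical and are not part of the disclosure.

```python
# Illustrative sketch only. The codon patterns below are inferred from the
# consensus sequences in this listing (an assumption, not a disclosed method).
DEGENERATE_CODON = {
    "A": "gcn", "R": "mgn", "N": "aay", "D": "gay", "C": "tgy",
    "Q": "car", "E": "gar", "G": "ggn", "H": "cay", "I": "ath",
    "L": "ytn", "K": "aar", "M": "atg", "F": "tty", "P": "ccn",
    "S": "wsn", "T": "acn", "W": "tgg", "Y": "tay", "V": "gtn",
    "*": "trr",  # stop, as seen at the end of SEQ ID NO: 2
}

# IUPAC ambiguity codes used in the listing, mapped to the bases they allow.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "w": "at", "s": "cg", "m": "ac",
    "h": "act", "n": "acgt",
}


def back_translate(protein: str) -> str:
    """Return the degenerate consensus for a one-letter protein string."""
    return "".join(DEGENERATE_CODON[aa] for aa in protein.upper())


def is_consistent(dna: str, consensus: str) -> bool:
    """Check that each base of a concrete DNA string is allowed by the consensus."""
    dna = dna.lower()
    return len(dna) == len(consensus) and all(
        base in IUPAC[code] for base, code in zip(dna, consensus)
    )


if __name__ == "__main__":
    # The nine-residue peptide of SEQ ID NO: 3, in one-letter code.
    print(back_translate("GILGFVFTL"))  # ggnathytnggnttygtnttyacnytn
```

Some of these patterns are broader than the codon sets they stand for: `ytn` also covers the phenylalanine codons `ttt` and `ttc`, and `mgn` also covers the serine codons `agt` and `agc`. Agreement with a consensus string is therefore necessary for a concrete polynucleotide to encode the stated polypeptide, but not sufficient.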
Sequence CWU
1
1
29
1 293 PRT Artificial Sequence source /note="Description of Artificial Sequence Synthetic polypeptide"
1 Lys Glu Phe Thr Leu Asp Phe Ser Thr
Ala Lys Thr Tyr Val Asp Ser1 5 10
15Leu Asn Val Ile Arg Ser Ala Ile Gly Thr Pro Leu Gln Thr Ile
Ser 20 25 30Ser Gly Gly Thr
Ser Leu Leu Met Ile Asp Ser Gly Ser Gly Asp Asn 35
40 45Leu Phe Ala Val Asp Val Arg Gly Ile Asp Pro Glu
Glu Gly Arg Phe 50 55 60Asn Asn Leu
Arg Leu Ile Val Glu Arg Asn Asn Leu Tyr Val Thr Gly65 70
75 80Phe Val Asn Arg Thr Asn Asn Val
Phe Tyr Arg Phe Ala Asp Phe Ser 85 90
95His Val Thr Phe Pro Gly Thr Thr Ala Val Thr Leu Ser Gly
Asp Ser 100 105 110Ser Tyr Thr
Thr Leu Gln Arg Val Ala Gly Ile Ser Arg Thr Gly Met 115
120 125Gln Ile Asn Arg His Ser Leu Thr Thr Ser Tyr
Leu Asp Leu Met Ser 130 135 140His Ser
Gly Thr Ser Leu Thr Gln Ser Val Ala Arg Ala Met Leu Arg145
150 155 160Phe Val Thr Val Thr Ala Glu
Ala Leu Arg Phe Arg Gln Ile Gln Arg 165
170 175Gly Phe Arg Thr Thr Leu Asp Asp Leu Ser Gly Arg
Ser Tyr Val Met 180 185 190Thr
Ala Glu Asp Val Asp Leu Thr Leu Asn Trp Gly Arg Leu Ser Ser 195
200 205Val Leu Pro Asp Tyr His Gly Gln Asp
Ser Val Arg Val Gly Arg Ile 210 215
220Ser Phe Gly Ser Ile Asn Ala Ile Leu Gly Ser Val Ala Leu Ile Leu225
230 235 240Asn Cys His His
His Ala Ser Arg Val Ala Arg Met Ala Ser Asp Glu 245
250 255Phe Pro Ser Met Cys Pro Ala Asp Gly Arg
Val Arg Gly Ile Thr His 260 265
270Asn Lys Ile Leu Trp Asp Ser Ser Thr Leu Gly Ala Ile Leu Met Arg
275 280 285Arg Thr Ile Ser Ser
290
2 882 DNA Artificial Sequence source /note="Description of Artificial
Sequence Synthetic polynucleotide"modified_base(12)..(12)a, c, t, g,
unknown or othermodified_base(15)..(15)a, c, t, g, unknown or
othermodified_base(24)..(24)a, c, t, g, unknown or
othermodified_base(27)..(27)a, c, t, g, unknown or
othermodified_base(30)..(30)a, c, t, g, unknown or
othermodified_base(36)..(36)a, c, t, g, unknown or
othermodified_base(42)..(42)a, c, t, g, unknown or
othermodified_base(48)..(48)a, c, t, g, unknown or
othermodified_base(51)..(51)a, c, t, g, unknown or
othermodified_base(57)..(57)a, c, t, g, unknown or
othermodified_base(63)..(63)a, c, t, g, unknown or
othermodified_base(66)..(66)a, c, t, g, unknown or
othermodified_base(69)..(69)a, c, t, g, unknown or
othermodified_base(75)..(75)a, c, t, g, unknown or
othermodified_base(78)..(78)a, c, t, g, unknown or
othermodified_base(81)..(81)a, c, t, g, unknown or
othermodified_base(84)..(84)a, c, t, g, unknown or
othermodified_base(90)..(90)a, c, t, g, unknown or
othermodified_base(96)..(96)a, c, t, g, unknown or
othermodified_base(99)..(99)a, c, t, g, unknown or
othermodified_base(102)..(102)a, c, t, g, unknown or
othermodified_base(105)..(105)a, c, t, g, unknown or
othermodified_base(108)..(108)a, c, t, g, unknown or
othermodified_base(111)..(111)a, c, t, g, unknown or
othermodified_base(114)..(114)a, c, t, g, unknown or
othermodified_base(117)..(117)a, c, t, g, unknown or
othermodified_base(129)..(129)a, c, t, g, unknown or
othermodified_base(132)..(132)a, c, t, g, unknown or
othermodified_base(135)..(135)a, c, t, g, unknown or
othermodified_base(138)..(138)a, c, t, g, unknown or
othermodified_base(147)..(147)a, c, t, g, unknown or
othermodified_base(153)..(153)a, c, t, g, unknown or
othermodified_base(156)..(156)a, c, t, g, unknown or
othermodified_base(162)..(162)a, c, t, g, unknown or
othermodified_base(165)..(165)a, c, t, g, unknown or
othermodified_base(168)..(168)a, c, t, g, unknown or
othermodified_base(177)..(177)a, c, t, g, unknown or
othermodified_base(186)..(186)a, c, t, g, unknown or
othermodified_base(189)..(189)a, c, t, g, unknown or
othermodified_base(201)..(201)a, c, t, g, unknown or
othermodified_base(204)..(204)a, c, t, g, unknown or
othermodified_base(207)..(207)a, c, t, g, unknown or
othermodified_base(213)..(213)a, c, t, g, unknown or
othermodified_base(219)..(219)a, c, t, g, unknown or
othermodified_base(228)..(228)a, c, t, g, unknown or
othermodified_base(234)..(234)a, c, t, g, unknown or
othermodified_base(237)..(237)a, c, t, g, unknown or
othermodified_base(240)..(240)a, c, t, g, unknown or
othermodified_base(246)..(246)a, c, t, g, unknown or
othermodified_base(252)..(252)a, c, t, g, unknown or
othermodified_base(255)..(255)a, c, t, g, unknown or
othermodified_base(264)..(264)a, c, t, g, unknown or
othermodified_base(273)..(273)a, c, t, g, unknown or
othermodified_base(279)..(279)a, c, t, g, unknown or
othermodified_base(288)..(288)a, c, t, g, unknown or
othermodified_base(294)..(294)a, c, t, g, unknown or
othermodified_base(297)..(297)a, c, t, g, unknown or
othermodified_base(303)..(303)a, c, t, g, unknown or
othermodified_base(306)..(306)a, c, t, g, unknown or
othermodified_base(309)..(309)a, c, t, g, unknown or
othermodified_base(312)..(312)a, c, t, g, unknown or
othermodified_base(315)..(315)a, c, t, g, unknown or
othermodified_base(318)..(318)a, c, t, g, unknown or
othermodified_base(321)..(321)a, c, t, g, unknown or
othermodified_base(324)..(324)a, c, t, g, unknown or
othermodified_base(327)..(327)a, c, t, g, unknown or
othermodified_base(330)..(330)a, c, t, g, unknown or
othermodified_base(336)..(336)a, c, t, g, unknown or
othermodified_base(339)..(339)a, c, t, g, unknown or
othermodified_base(345)..(345)a, c, t, g, unknown or
othermodified_base(348)..(348)a, c, t, g, unknown or
othermodified_base(351)..(351)a, c, t, g, unknown or
othermodified_base(357)..(357)a, c, t, g, unknown or
othermodified_base(360)..(360)a, c, t, g, unknown or
othermodified_base(363)..(363)a, c, t, g, unknown or
othermodified_base(366)..(366)a, c, t, g, unknown or
othermodified_base(372)..(372)a, c, t, g, unknown or
othermodified_base(375)..(375)a, c, t, g, unknown or
othermodified_base(378)..(378)a, c, t, g, unknown or
othermodified_base(381)..(381)a, c, t, g, unknown or
othermodified_base(396)..(396)a, c, t, g, unknown or
othermodified_base(402)..(402)a, c, t, g, unknown or
othermodified_base(405)..(405)a, c, t, g, unknown or
othermodified_base(408)..(408)a, c, t, g, unknown or
othermodified_base(411)..(411)a, c, t, g, unknown or
othermodified_base(414)..(414)a, c, t, g, unknown or
othermodified_base(420)..(420)a, c, t, g, unknown or
othermodified_base(426)..(426)a, c, t, g, unknown or
othermodified_base(432)..(432)a, c, t, g, unknown or
othermodified_base(438)..(438)a, c, t, g, unknown or
othermodified_base(441)..(441)a, c, t, g, unknown or
othermodified_base(444)..(444)a, c, t, g, unknown or
othermodified_base(447)..(447)a, c, t, g, unknown or
othermodified_base(450)..(450)a, c, t, g, unknown or
othermodified_base(453)..(453)a, c, t, g, unknown or
othermodified_base(459)..(459)a, c, t, g, unknown or
othermodified_base(462)..(462)a, c, t, g, unknown or
othermodified_base(465)..(465)a, c, t, g, unknown or
othermodified_base(468)..(468)a, c, t, g, unknown or
othermodified_base(471)..(471)a, c, t, g, unknown or
othermodified_base(477)..(477)a, c, t, g, unknown or
othermodified_base(480)..(480)a, c, t, g, unknown or
othermodified_base(486)..(486)a, c, t, g, unknown or
othermodified_base(489)..(489)a, c, t, g, unknown or
othermodified_base(492)..(492)a, c, t, g, unknown or
othermodified_base(495)..(495)a, c, t, g, unknown or
othermodified_base(498)..(498)a, c, t, g, unknown or
othermodified_base(504)..(504)a, c, t, g, unknown or
othermodified_base(507)..(507)a, c, t, g, unknown or
othermodified_base(510)..(510)a, c, t, g, unknown or
othermodified_base(516)..(516)a, c, t, g, unknown or
othermodified_base(528)..(528)a, c, t, g, unknown or
othermodified_base(531)..(531)a, c, t, g, unknown or
othermodified_base(537)..(537)a, c, t, g, unknown or
othermodified_base(540)..(540)a, c, t, g, unknown or
othermodified_base(543)..(543)a, c, t, g, unknown or
othermodified_base(546)..(546)a, c, t, g, unknown or
othermodified_base(555)..(555)a, c, t, g, unknown or
othermodified_base(558)..(558)a, c, t, g, unknown or
othermodified_base(561)..(561)a, c, t, g, unknown or
othermodified_base(564)..(564)a, c, t, g, unknown or
othermodified_base(567)..(567)a, c, t, g, unknown or
othermodified_base(573)..(573)a, c, t, g, unknown or
othermodified_base(579)..(579)a, c, t, g, unknown or
othermodified_base(582)..(582)a, c, t, g, unknown or
othermodified_base(591)..(591)a, c, t, g, unknown or
othermodified_base(597)..(597)a, c, t, g, unknown or
othermodified_base(600)..(600)a, c, t, g, unknown or
othermodified_base(603)..(603)a, c, t, g, unknown or
othermodified_base(612)..(612)a, c, t, g, unknown or
othermodified_base(615)..(615)a, c, t, g, unknown or
othermodified_base(618)..(618)a, c, t, g, unknown or
othermodified_base(621)..(621)a, c, t, g, unknown or
othermodified_base(624)..(624)a, c, t, g, unknown or
othermodified_base(627)..(627)a, c, t, g, unknown or
othermodified_base(630)..(630)a, c, t, g, unknown or
othermodified_base(633)..(633)a, c, t, g, unknown or
othermodified_base(645)..(645)a, c, t, g, unknown or
othermodified_base(654)..(654)a, c, t, g, unknown or
othermodified_base(657)..(657)a, c, t, g, unknown or
othermodified_base(660)..(660)a, c, t, g, unknown or
othermodified_base(663)..(663)a, c, t, g, unknown or
othermodified_base(666)..(666)a, c, t, g, unknown or
othermodified_base(669)..(669)a, c, t, g, unknown or
othermodified_base(675)..(675)a, c, t, g, unknown or
othermodified_base(681)..(681)a, c, t, g, unknown or
othermodified_base(684)..(684)a, c, t, g, unknown or
othermodified_base(693)..(693)a, c, t, g, unknown or
othermodified_base(699)..(699)a, c, t, g, unknown or
othermodified_base(702)..(702)a, c, t, g, unknown or
othermodified_base(705)..(705)a, c, t, g, unknown or
othermodified_base(708)..(708)a, c, t, g, unknown or
othermodified_base(711)..(711)a, c, t, g, unknown or
othermodified_base(714)..(714)a, c, t, g, unknown or
othermodified_base(720)..(720)a, c, t, g, unknown or
othermodified_base(738)..(738)a, c, t, g, unknown or
othermodified_base(741)..(741)a, c, t, g, unknown or
othermodified_base(744)..(744)a, c, t, g, unknown or
othermodified_base(747)..(747)a, c, t, g, unknown or
othermodified_base(750)..(750)a, c, t, g, unknown or
othermodified_base(753)..(753)a, c, t, g, unknown or
othermodified_base(759)..(759)a, c, t, g, unknown or
othermodified_base(762)..(762)a, c, t, g, unknown or
othermodified_base(774)..(774)a, c, t, g, unknown or
othermodified_base(777)..(777)a, c, t, g, unknown or
othermodified_base(786)..(786)a, c, t, g, unknown or
othermodified_base(789)..(789)a, c, t, g, unknown or
othermodified_base(795)..(795)a, c, t, g, unknown or
othermodified_base(798)..(798)a, c, t, g, unknown or
othermodified_base(801)..(801)a, c, t, g, unknown or
othermodified_base(804)..(804)a, c, t, g, unknown or
othermodified_base(807)..(807)a, c, t, g, unknown or
othermodified_base(813)..(813)a, c, t, g, unknown or
othermodified_base(828)..(828)a, c, t, g, unknown or
othermodified_base(837)..(837)a, c, t, g, unknown or
othermodified_base(840)..(840)a, c, t, g, unknown or
othermodified_base(843)..(843)a, c, t, g, unknown or
othermodified_base(846)..(846)a, c, t, g, unknown or
othermodified_base(849)..(849)a, c, t, g, unknown or
othermodified_base(852)..(852)a, c, t, g, unknown or
othermodified_base(858)..(858)a, c, t, g, unknown or
othermodified_base(864)..(864)a, c, t, g, unknown or
othermodified_base(867)..(867)a, c, t, g, unknown or
othermodified_base(870)..(870)a, c, t, g, unknown or
othermodified_base(876)..(876)a, c, t, g, unknown or
othermodified_base(879)..(879)a, c, t, g, unknown or other 2aargarttya
cnytngaytt ywsnacngcn aaracntayg tngaywsnyt naaygtnath 60mgnwsngcna
thggnacncc nytncaracn athwsnwsng gnggnacnws nytnytnatg 120athgaywsng
gnwsnggnga yaayytntty gcngtngayg tnmgnggnat hgayccngar 180garggnmgnt
tyaayaayyt nmgnytnath gtngarmgna ayaayytnta ygtnacnggn 240ttygtnaaym
gnacnaayaa ygtnttytay mgnttygcng ayttywsnca ygtnacntty 300ccnggnacna
cngcngtnac nytnwsnggn gaywsnwsnt ayacnacnyt ncarmgngtn 360gcnggnathw
snmgnacngg natgcarath aaymgncayw snytnacnac nwsntayytn 420gayytnatgw
sncaywsngg nacnwsnytn acncarwsng tngcnmgngc natgytnmgn 480ttygtnacng
tnacngcnga rgcnytnmgn ttymgncara thcarmgngg nttymgnacn 540acnytngayg
ayytnwsngg nmgnwsntay gtnatgacng cngargaygt ngayytnacn 600ytnaaytggg
gnmgnytnws nwsngtnytn ccngaytayc ayggncarga ywsngtnmgn 660gtnggnmgna
thwsnttygg nwsnathaay gcnathytng gnwsngtngc nytnathytn 720aaytgycayc
aycaygcnws nmgngtngcn mgnatggcnw sngaygartt yccnwsnatg 780tgyccngcng
ayggnmgngt nmgnggnath acncayaaya arathytntg ggaywsnwsn 840acnytnggng
cnathytnat gmgnmgnacn athwsnwsnt rr
882
3 9 PRT Artificial Sequence source /note="Description of Artificial Sequence Synthetic peptide"
3 Gly Ile Leu Gly Phe Val Phe Thr Leu
1 5
4 512 PRT Artificial Sequence source /note="Description of Artificial Sequence Synthetic polypeptide"
4 Met Gln Val Gln Leu Gln
Gln Pro Gly Ala Glu Leu Val Lys Pro Gly1 5
10 15Ala Ser Val Lys Met Ser Cys Lys Thr Ser Gly Tyr
Thr Phe Thr Ser 20 25 30Tyr
Asn Val His Trp Val Lys Gln Thr Pro Gly Gln Gly Leu Glu Trp 35
40 45Ile Gly Ala Ile Tyr Pro Gly Asn Gly
Asp Thr Ser Phe Asn Gln Lys 50 55
60Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser Thr Val65
70 75 80Tyr Met Gln Leu Ser
Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr 85
90 95Cys Ala Arg Ser Asn Tyr Tyr Gly Ser Ser Tyr
Val Trp Phe Phe Asp 100 105
110Val Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ser Gly Ser Thr Ser
115 120 125Gly Ser Gly Lys Pro Gly Ser
Gly Glu Gly Ser Gln Ile Val Leu Ser 130 135
140Gln Ser Pro Thr Ile Leu Ser Ala Ser Pro Gly Glu Lys Val Thr
Met145 150 155 160Thr Cys
Arg Ala Ser Ser Ser Val Ser Tyr Met Asp Trp Tyr Gln Gln
165 170 175Lys Pro Gly Ser Ser Pro Lys
Pro Trp Ile Tyr Ala Thr Ser Asn Leu 180 185
190Ala Ser Gly Val Pro Ala Arg Phe Ser Gly Ser Gly Ser Gly
Thr Ser 195 200 205Tyr Ser Leu Thr
Ile Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr 210
215 220Tyr Cys Gln Gln Trp Ile Ser Asn Pro Pro Thr Phe
Gly Ala Gly Thr225 230 235
240Lys Leu Glu Leu Lys Glu Phe Pro Lys Pro Ser Thr Pro Pro Gly Ser
245 250 255Ser Gly Gly Ala Pro
Lys Glu Phe Thr Leu Asp Phe Ser Thr Ala Lys 260
265 270Thr Tyr Val Asp Ser Leu Asn Val Ile Arg Ser Ala
Ile Gly Thr Pro 275 280 285Leu Gln
Thr Ile Ser Ser Gly Gly Thr Ser Leu Leu Met Ile Asp Ser 290
295 300Gly Ser Gly Asp Asn Leu Phe Ala Val Asp Val
Arg Gly Ile Asp Pro305 310 315
320Glu Glu Gly Arg Phe Asn Asn Leu Arg Leu Ile Val Glu Arg Asn Asn
325 330 335Leu Tyr Val Thr
Gly Phe Val Asn Arg Thr Asn Asn Val Phe Tyr Arg 340
345 350Phe Ala Asp Phe Ser His Val Thr Phe Pro Gly
Thr Thr Ala Val Thr 355 360 365Leu
Ser Gly Asp Ser Ser Tyr Thr Thr Leu Gln Arg Val Ala Gly Ile 370
375 380Ser Arg Thr Gly Met Gln Ile Asn Arg His
Ser Leu Thr Thr Ser Tyr385 390 395
400Leu Asp Leu Met Ser His Ser Gly Thr Ser Leu Thr Gln Ser Val
Ala 405 410 415Arg Ala Met
Leu Arg Phe Val Thr Val Thr Ala Glu Ala Leu Arg Phe 420
425 430Arg Gln Ile Gln Arg Gly Phe Arg Thr Thr
Leu Asp Asp Leu Ser Gly 435 440
445Arg Ser Tyr Val Met Thr Ala Glu Asp Val Asp Leu Thr Leu Asn Trp 450
455 460Gly Arg Leu Ser Ser Val Leu Pro
Asp Tyr His Gly Gln Asp Ser Val465 470
475 480Arg Val Gly Arg Ile Ser Phe Gly Ser Ile Asn Ala
Ile Leu Gly Ser 485 490
495Val Ala Leu Ile Leu Asn Cys His His His Ala Ser Arg Val Ala Arg
500 505 510
5 1536 DNA Artificial Sequence source /note="Description of Artificial Sequence Synthetic
polynucleotide"modified_base(9)..(9)a, c, t, g, unknown or
othermodified_base(15)..(15)a, c, t, g, unknown or
othermodified_base(24)..(24)a, c, t, g, unknown or
othermodified_base(27)..(27)a, c, t, g, unknown or
othermodified_base(30)..(30)a, c, t, g, unknown or
othermodified_base(36)..(36)a, c, t, g, unknown or
othermodified_base(39)..(39)a, c, t, g, unknown or
othermodified_base(45)..(45)a, c, t, g, unknown or
othermodified_base(48)..(48)a, c, t, g, unknown or
othermodified_base(51)..(51)a, c, t, g, unknown or
othermodified_base(54)..(54)a, c, t, g, unknown or
othermodified_base(57)..(57)a, c, t, g, unknown or
othermodified_base(66)..(66)a, c, t, g, unknown or
othermodified_base(75)..(75)a, c, t, g, unknown or
othermodified_base(78)..(78)a, c, t, g, unknown or
othermodified_base(81)..(81)a, c, t, g, unknown or
othermodified_base(87)..(87)a, c, t, g, unknown or
othermodified_base(93)..(93)a, c, t, g, unknown or
othermodified_base(96)..(96)a, c, t, g, unknown or
othermodified_base(105)..(105)a, c, t, g, unknown or
othermodified_base(114)..(114)a, c, t, g, unknown or
othermodified_base(123)..(123)a, c, t, g, unknown or
othermodified_base(126)..(126)a, c, t, g, unknown or
othermodified_base(129)..(129)a, c, t, g, unknown or
othermodified_base(135)..(135)a, c, t, g, unknown or
othermodified_base(138)..(138)a, c, t, g, unknown or
othermodified_base(150)..(150)a, c, t, g, unknown or
othermodified_base(153)..(153)a, c, t, g, unknown or
othermodified_base(162)..(162)a, c, t, g, unknown or
othermodified_base(165)..(165)a, c, t, g, unknown or
othermodified_base(171)..(171)a, c, t, g, unknown or
othermodified_base(177)..(177)a, c, t, g, unknown or
othermodified_base(180)..(180)a, c, t, g, unknown or
othermodified_base(201)..(201)a, c, t, g, unknown or
othermodified_base(207)..(207)a, c, t, g, unknown or
othermodified_base(210)..(210)a, c, t, g, unknown or
othermodified_base(213)..(213)a, c, t, g, unknown or
othermodified_base(216)..(216)a, c, t, g, unknown or
othermodified_base(219)..(219)a, c, t, g, unknown or
othermodified_base(228)..(228)a, c, t, g, unknown or
othermodified_base(231)..(231)a, c, t, g, unknown or
othermodified_base(234)..(234)a, c, t, g, unknown or
othermodified_base(237)..(237)a, c, t, g, unknown or
othermodified_base(240)..(240)a, c, t, g, unknown or
othermodified_base(252)..(252)a, c, t, g, unknown or
othermodified_base(255)..(255)a, c, t, g, unknown or
othermodified_base(258)..(258)a, c, t, g, unknown or
othermodified_base(261)..(261)a, c, t, g, unknown or
othermodified_base(264)..(264)a, c, t, g, unknown or
othermodified_base(267)..(267)a, c, t, g, unknown or
othermodified_base(276)..(276)a, c, t, g, unknown or
othermodified_base(279)..(279)a, c, t, g, unknown or
othermodified_base(282)..(282)a, c, t, g, unknown or
othermodified_base(294)..(294)a, c, t, g, unknown or
othermodified_base(297)..(297)a, c, t, g, unknown or
othermodified_base(300)..(300)a, c, t, g, unknown or
othermodified_base(312)..(312)a, c, t, g, unknown or
othermodified_base(315)..(315)a, c, t, g, unknown or
othermodified_base(318)..(318)a, c, t, g, unknown or
othermodified_base(324)..(324)a, c, t, g, unknown or
othermodified_base(339)..(339)a, c, t, g, unknown or
othermodified_base(345)..(345)a, c, t, g, unknown or
othermodified_base(348)..(348)a, c, t, g, unknown or
othermodified_base(351)..(351)a, c, t, g, unknown or
othermodified_base(354)..(354)a, c, t, g, unknown or
othermodified_base(357)..(357)a, c, t, g, unknown or
othermodified_base(360)..(360)a, c, t, g, unknown or
othermodified_base(363)..(363)a, c, t, g, unknown or
othermodified_base(366)..(366)a, c, t, g, unknown or
othermodified_base(369)..(369)a, c, t, g, unknown or
othermodified_base(372)..(372)a, c, t, g, unknown or
othermodified_base(375)..(375)a, c, t, g, unknown or
othermodified_base(378)..(378)a, c, t, g, unknown or
othermodified_base(381)..(381)a, c, t, g, unknown or
othermodified_base(384)..(384)a, c, t, g, unknown or
othermodified_base(387)..(387)a, c, t, g, unknown or
othermodified_base(390)..(390)a, c, t, g, unknown or
othermodified_base(393)..(393)a, c, t, g, unknown or
othermodified_base(399)..(399)a, c, t, g, unknown or
othermodified_base(402)..(402)a, c, t, g, unknown or
othermodified_base(405)..(405)a, c, t, g, unknown or
othermodified_base(408)..(408)a, c, t, g, unknown or
othermodified_base(414)..(414)a, c, t, g, unknown or
othermodified_base(417)..(417)a, c, t, g, unknown or
othermodified_base(426)..(426)a, c, t, g, unknown or
othermodified_base(429)..(429)a, c, t, g, unknown or
othermodified_base(432)..(432)a, c, t, g, unknown or
othermodified_base(438)..(438)a, c, t, g, unknown or
othermodified_base(441)..(441)a, c, t, g, unknown or
othermodified_base(444)..(444)a, c, t, g, unknown or
othermodified_base(450)..(450)a, c, t, g, unknown or
othermodified_base(453)..(453)a, c, t, g, unknown or
othermodified_base(456)..(456)a, c, t, g, unknown or
othermodified_base(459)..(459)a, c, t, g, unknown or
othermodified_base(462)..(462)a, c, t, g, unknown or
othermodified_base(465)..(465)a, c, t, g, unknown or
othermodified_base(474)..(474)a, c, t, g, unknown or
othermodified_base(477)..(477)a, c, t, g, unknown or
othermodified_base(483)..(483)a, c, t, g, unknown or
othermodified_base(489)..(489)a, c, t, g, unknown or
othermodified_base(492)..(492)a, c, t, g, unknown or
othermodified_base(495)..(495)a, c, t, g, unknown or
othermodified_base(498)..(498)a, c, t, g, unknown or
othermodified_base(501)..(501)a, c, t, g, unknown or
othermodified_base(504)..(504)a, c, t, g, unknown or
othermodified_base(507)..(507)a, c, t, g, unknown or
othermodified_base(534)..(534)a, c, t, g, unknown or
othermodified_base(537)..(537)a, c, t, g, unknown or
othermodified_base(540)..(540)a, c, t, g, unknown or
othermodified_base(543)..(543)a, c, t, g, unknown or
othermodified_base(546)..(546)a, c, t, g, unknown or
othermodified_base(552)..(552)a, c, t, g, unknown or
othermodified_base(564)..(564)a, c, t, g, unknown or
othermodified_base(567)..(567)a, c, t, g, unknown or
othermodified_base(570)..(570)a, c, t, g, unknown or
othermodified_base(576)..(576)a, c, t, g, unknown or
othermodified_base(579)..(579)a, c, t, g, unknown or
othermodified_base(582)..(582)a, c, t, g, unknown or
othermodified_base(585)..(585)a, c, t, g, unknown or
othermodified_base(588)..(588)a, c, t, g, unknown or
othermodified_base(591)..(591)a, c, t, g, unknown or
othermodified_base(594)..(594)a, c, t, g, unknown or
othermodified_base(597)..(597)a, c, t, g, unknown or
othermodified_base(603)..(603)a, c, t, g, unknown or
othermodified_base(606)..(606)a, c, t, g, unknown or
othermodified_base(609)..(609)a, c, t, g, unknown or
othermodified_base(612)..(612)a, c, t, g, unknown or
othermodified_base(615)..(615)a, c, t, g, unknown or
othermodified_base(618)..(618)a, c, t, g, unknown or
othermodified_base(621)..(621)a, c, t, g, unknown or
othermodified_base(624)..(624)a, c, t, g, unknown or
othermodified_base(630)..(630)a, c, t, g, unknown or
othermodified_base(633)..(633)a, c, t, g, unknown or
othermodified_base(636)..(636)a, c, t, g, unknown or
othermodified_base(642)..(642)a, c, t, g, unknown or
othermodified_base(645)..(645)a, c, t, g, unknown or
othermodified_base(648)..(648)a, c, t, g, unknown or
othermodified_base(654)..(654)a, c, t, g, unknown or
othermodified_base(663)..(663)a, c, t, g, unknown or
othermodified_base(666)..(666)a, c, t, g, unknown or
othermodified_base(669)..(669)a, c, t, g, unknown or
othermodified_base(693)..(693)a, c, t, g, unknown or
othermodified_base(699)..(699)a, c, t, g, unknown or
othermodified_base(702)..(702)a, c, t, g, unknown or
othermodified_base(705)..(705)a, c, t, g, unknown or
othermodified_base(711)..(711)a, c, t, g, unknown or
othermodified_base(714)..(714)a, c, t, g, unknown or
othermodified_base(717)..(717)a, c, t, g, unknown or
othermodified_base(720)..(720)a, c, t, g, unknown or
othermodified_base(726)..(726)a, c, t, g, unknown or
othermodified_base(732)..(732)a, c, t, g, unknown or
othermodified_base(744)..(744)a, c, t, g, unknown or
othermodified_base(750)..(750)a, c, t, g, unknown or
othermodified_base(753)..(753)a, c, t, g, unknown or
othermodified_base(756)..(756)a, c, t, g, unknown or
othermodified_base(759)..(759)a, c, t, g, unknown or
othermodified_base(762)..(762)a, c, t, g, unknown or
othermodified_base(765)..(765)a, c, t, g, unknown or
othermodified_base(768)..(768)a, c, t, g, unknown or
othermodified_base(771)..(771)a, c, t, g, unknown or
othermodified_base(774)..(774)a, c, t, g, unknown or
othermodified_base(777)..(777)a, c, t, g, unknown or
othermodified_base(780)..(780)a, c, t, g, unknown or
othermodified_base(783)..(783)a, c, t, g, unknown or
othermodified_base(795)..(795)a, c, t, g, unknown or
othermodified_base(798)..(798)a, c, t, g, unknown or
othermodified_base(807)..(807)a, c, t, g, unknown or
othermodified_base(810)..(810)a, c, t, g, unknown or
othermodified_base(813)..(813)a, c, t, g, unknown or
othermodified_base(819)..(819)a, c, t, g, unknown or
othermodified_base(825)..(825)a, c, t, g, unknown or
othermodified_base(831)..(831)a, c, t, g, unknown or
othermodified_base(834)..(834)a, c, t, g, unknown or
othermodified_base(840)..(840)a, c, t, g, unknown or
othermodified_base(846)..(846)a, c, t, g, unknown or
othermodified_base(849)..(849)a, c, t, g, unknown or
othermodified_base(852)..(852)a, c, t, g, unknown or
othermodified_base(858)..(858)a, c, t, g, unknown or
othermodified_base(861)..(861)a, c, t, g, unknown or
othermodified_base(864)..(864)a, c, t, g, unknown or
othermodified_base(867)..(867)a, c, t, g, unknown or
othermodified_base(873)..(873)a, c, t, g, unknown or
othermodified_base(879)..(879)a, c, t, g, unknown or
othermodified_base(882)..(882)a, c, t, g, unknown or
othermodified_base(885)..(885)a, c, t, g, unknown or
othermodified_base(888)..(888)a, c, t, g, unknown or
othermodified_base(891)..(891)a, c, t, g, unknown or
othermodified_base(894)..(894)a, c, t, g, unknown or
othermodified_base(897)..(897)a, c, t, g, unknown or
othermodified_base(900)..(900)a, c, t, g, unknown or
othermodified_base(912)..(912)a, c, t, g, unknown or
othermodified_base(915)..(915)a, c, t, g, unknown or
othermodified_base(918)..(918)a, c, t, g, unknown or
othermodified_base(921)..(921)a, c, t, g, unknown or
othermodified_base(930)..(930)a, c, t, g, unknown or
othermodified_base(936)..(936)a, c, t, g, unknown or
othermodified_base(939)..(939)a, c, t, g, unknown or
othermodified_base(945)..(945)a, c, t, g, unknown or
othermodified_base(948)..(948)a, c, t, g, unknown or
othermodified_base(951)..(951)a, c, t, g, unknown or
othermodified_base(960)..(960)a, c, t, g, unknown or
othermodified_base(969)..(969)a, c, t, g, unknown or
othermodified_base(972)..(972)a, c, t, g, unknown or
othermodified_base(984)..(984)a, c, t, g, unknown or
othermodified_base(987)..(987)a, c, t, g, unknown or
othermodified_base(990)..(990)a, c, t, g, unknown or
othermodified_base(996)..(996)a, c, t, g, unknown or
othermodified_base(1002)..(1002)a, c, t, g, unknown or
othermodified_base(1011)..(1011)a, c, t, g, unknown or
othermodified_base(1017)..(1017)a, c, t, g, unknown or
othermodified_base(1020)..(1020)a, c, t, g, unknown or
othermodified_base(1023)..(1023)a, c, t, g, unknown or
othermodified_base(1029)..(1029)a, c, t, g, unknown or
othermodified_base(1035)..(1035)a, c, t, g, unknown or
othermodified_base(1038)..(1038)a, c, t, g, unknown or
othermodified_base(1047)..(1047)a, c, t, g, unknown or
othermodified_base(1056)..(1056)a, c, t, g, unknown or
othermodified_base(1062)..(1062)a, c, t, g, unknown or
othermodified_base(1071)..(1071)a, c, t, g, unknown or
othermodified_base(1077)..(1077)a, c, t, g, unknown or
othermodified_base(1080)..(1080)a, c, t, g, unknown or
othermodified_base(1086)..(1086)a, c, t, g, unknown or
othermodified_base(1089)..(1089)a, c, t, g, unknown or
othermodified_base(1092)..(1092)a, c, t, g, unknown or
othermodified_base(1095)..(1095)a, c, t, g, unknown or
othermodified_base(1098)..(1098)a, c, t, g, unknown or
othermodified_base(1101)..(1101)a, c, t, g, unknown or
othermodified_base(1104)..(1104)a, c, t, g, unknown or
othermodified_base(1107)..(1107)a, c, t, g, unknown or
othermodified_base(1110)..(1110)a, c, t, g, unknown or
othermodified_base(1113)..(1113)a, c, t, g, unknown or
othermodified_base(1119)..(1119)a, c, t, g, unknown or
othermodified_base(1122)..(1122)a, c, t, g, unknown or
othermodified_base(1128)..(1128)a, c, t, g, unknown or
othermodified_base(1131)..(1131)a, c, t, g, unknown or
othermodified_base(1134)..(1134)a, c, t, g, unknown or
othermodified_base(1140)..(1140)a, c, t, g, unknown or
othermodified_base(1143)..(1143)a, c, t, g, unknown or
othermodified_base(1146)..(1146)a, c, t, g, unknown or
othermodified_base(1149)..(1149)a, c, t, g, unknown or
othermodified_base(1155)..(1155)a, c, t, g, unknown or
othermodified_base(1158)..(1158)a, c, t, g, unknown or
othermodified_base(1161)..(1161)a, c, t, g, unknown or
othermodified_base(1164)..(1164)a, c, t, g, unknown or
othermodified_base(1179)..(1179)a, c, t, g, unknown or
othermodified_base(1185)..(1185)a, c, t, g, unknown or
othermodified_base(1188)..(1188)a, c, t, g, unknown or
othermodified_base(1191)..(1191)a, c, t, g, unknown or
othermodified_base(1194)..(1194)a, c, t, g, unknown or
othermodified_base(1197)..(1197)a, c, t, g, unknown or
othermodified_base(1203)..(1203)a, c, t, g, unknown or
othermodified_base(1209)..(1209)a, c, t, g, unknown or
othermodified_base(1215)..(1215)a, c, t, g, unknown or
othermodified_base(1221)..(1221)a, c, t, g, unknown or
othermodified_base(1224)..(1224)a, c, t, g, unknown or
othermodified_base(1227)..(1227)a, c, t, g, unknown or
othermodified_base(1230)..(1230)a, c, t, g, unknown or
othermodified_base(1233)..(1233)a, c, t, g, unknown or
othermodified_base(1236)..(1236)a, c, t, g, unknown or
othermodified_base(1242)..(1242)a, c, t, g, unknown or
othermodified_base(1245)..(1245)a, c, t, g, unknown or
othermodified_base(1248)..(1248)a, c, t, g, unknown or
othermodified_base(1251)..(1251)a, c, t, g, unknown or
othermodified_base(1254)..(1254)a, c, t, g, unknown or
othermodified_base(1260)..(1260)a, c, t, g, unknown or
othermodified_base(1263)..(1263)a, c, t, g, unknown or
othermodified_base(1269)..(1269)a, c, t, g, unknown or
othermodified_base(1272)..(1272)a, c, t, g, unknown or
othermodified_base(1275)..(1275)a, c, t, g, unknown or
othermodified_base(1278)..(1278)a, c, t, g, unknown or
othermodified_base(1281)..(1281)a, c, t, g, unknown or
othermodified_base(1287)..(1287)a, c, t, g, unknown or
othermodified_base(1290)..(1290)a, c, t, g, unknown or
othermodified_base(1293)..(1293)a, c, t, g, unknown or
othermodified_base(1299)..(1299)a, c, t, g, unknown or
othermodified_base(1311)..(1311)a, c, t, g, unknown or
othermodified_base(1314)..(1314)a, c, t, g, unknown or
othermodified_base(1320)..(1320)a, c, t, g, unknown or
othermodified_base(1323)..(1323)a, c, t, g, unknown or
othermodified_base(1326)..(1326)a, c, t, g, unknown or
othermodified_base(1329)..(1329)a, c, t, g, unknown or
othermodified_base(1338)..(1338)a, c, t, g, unknown or
othermodified_base(1341)..(1341)a, c, t, g, unknown or
othermodified_base(1344)..(1344)a, c, t, g, unknown or
othermodified_base(1347)..(1347)a, c, t, g, unknown or
othermodified_base(1350)..(1350)a, c, t, g, unknown or
othermodified_base(1356)..(1356)a, c, t, g, unknown or
othermodified_base(1362)..(1362)a, c, t, g, unknown or
othermodified_base(1365)..(1365)a, c, t, g, unknown or
othermodified_base(1374)..(1374)a, c, t, g, unknown or
othermodified_base(1380)..(1380)a, c, t, g, unknown or
othermodified_base(1383)..(1383)a, c, t, g, unknown or
othermodified_base(1386)..(1386)a, c, t, g, unknown or
othermodified_base(1395)..(1395)a, c, t, g, unknown or
othermodified_base(1398)..(1398)a, c, t, g, unknown or
othermodified_base(1401)..(1401)a, c, t, g, unknown or
othermodified_base(1404)..(1404)a, c, t, g, unknown or
othermodified_base(1407)..(1407)a, c, t, g, unknown or
othermodified_base(1410)..(1410)a, c, t, g, unknown or
othermodified_base(1413)..(1413)a, c, t, g, unknown or
othermodified_base(1416)..(1416)a, c, t, g, unknown or
othermodified_base(1428)..(1428)a, c, t, g, unknown or
othermodified_base(1437)..(1437)a, c, t, g, unknown or
othermodified_base(1440)..(1440)a, c, t, g, unknown or
othermodified_base(1443)..(1443)a, c, t, g, unknown or
othermodified_base(1446)..(1446)a, c, t, g, unknown or
othermodified_base(1449)..(1449)a, c, t, g, unknown or
othermodified_base(1452)..(1452)a, c, t, g, unknown or
othermodified_base(1458)..(1458)a, c, t, g, unknown or
othermodified_base(1464)..(1464)a, c, t, g, unknown or
othermodified_base(1467)..(1467)a, c, t, g, unknown or
othermodified_base(1476)..(1476)a, c, t, g, unknown or
othermodified_base(1482)..(1482)a, c, t, g, unknown or
othermodified_base(1485)..(1485)a, c, t, g, unknown or
othermodified_base(1488)..(1488)a, c, t, g, unknown or
othermodified_base(1491)..(1491)a, c, t, g, unknown or
othermodified_base(1494)..(1494)a, c, t, g, unknown or
othermodified_base(1497)..(1497)a, c, t, g, unknown or
othermodified_base(1503)..(1503)a, c, t, g, unknown or
othermodified_base(1521)..(1521)a, c, t, g, unknown or
othermodified_base(1524)..(1524)a, c, t, g, unknown or
othermodified_base(1527)..(1527)a, c, t, g, unknown or
othermodified_base(1530)..(1530)a, c, t, g, unknown or
othermodified_base(1533)..(1533)a, c, t, g, unknown or
othermodified_base(1536)..(1536)a, c, t, g, unknown or other 5atgcargtnc
arytncarca rccnggngcn garytngtna arccnggngc nwsngtnaar 60atgwsntgya
aracnwsngg ntayacntty acnwsntaya aygtncaytg ggtnaarcar 120acnccnggnc
arggnytnga rtggathggn gcnathtayc cnggnaaygg ngayacnwsn 180ttyaaycara
arttyaargg naargcnacn ytnacngcng ayaarwsnws nwsnacngtn 240tayatgcary
tnwsnwsnyt nacnwsngar gaywsngcng tntaytaytg ygcnmgnwsn 300aaytaytayg
gnwsnwsnta ygtntggtty ttygaygtnt ggggngcngg nacnacngtn 360acngtnwsnw
snggnwsnac nwsnggnwsn ggnaarccng gnwsnggnga rggnwsncar 420athgtnytnw
sncarwsncc nacnathytn wsngcnwsnc cnggngaraa rgtnacnatg 480acntgymgng
cnwsnwsnws ngtnwsntay atggaytggt aycarcaraa rccnggnwsn 540wsnccnaarc
cntggathta ygcnacnwsn aayytngcnw snggngtncc ngcnmgntty 600wsnggnwsng
gnwsnggnac nwsntaywsn ytnacnathw snmgngtnga rgcngargay 660gcngcnacnt
aytaytgyca rcartggath wsnaayccnc cnacnttygg ngcnggnacn 720aarytngary
tnaargartt yccnaarccn wsnacnccnc cnggnwsnws nggnggngcn 780ccnaargart
tyacnytnga yttywsnacn gcnaaracnt aygtngayws nytnaaygtn 840athmgnwsng
cnathggnac nccnytncar acnathwsnw snggnggnac nwsnytnytn 900atgathgayw
snggnwsngg ngayaayytn ttygcngtng aygtnmgngg nathgayccn 960gargarggnm
gnttyaayaa yytnmgnytn athgtngarm gnaayaayyt ntaygtnacn 1020ggnttygtna
aymgnacnaa yaaygtntty taymgnttyg cngayttyws ncaygtnacn 1080ttyccnggna
cnacngcngt nacnytnwsn ggngaywsnw sntayacnac nytncarmgn 1140gtngcnggna
thwsnmgnac nggnatgcar athaaymgnc aywsnytnac nacnwsntay 1200ytngayytna
tgwsncayws nggnacnwsn ytnacncarw sngtngcnmg ngcnatgytn 1260mgnttygtna
cngtnacngc ngargcnytn mgnttymgnc arathcarmg nggnttymgn 1320acnacnytng
aygayytnws nggnmgnwsn taygtnatga cngcngarga ygtngayytn 1380acnytnaayt
ggggnmgnyt nwsnwsngtn ytnccngayt aycayggnca rgaywsngtn 1440mgngtnggnm
gnathwsntt yggnwsnath aaygcnathy tnggnwsngt ngcnytnath 1500ytnaaytgyc
aycaycaygc nwsnmgngtn gcnmgn
1536

SEQ ID NO: 6 (length 10, PRT, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic peptide")
Gly Tyr Thr Phe Thr Ser Tyr Asn Val His
1               5                   10

SEQ ID NO: 7 (length 17, PRT, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic peptide")
Ala Ile Tyr Pro Gly Asn Gly Asp Thr Ser Phe Asn Gln Lys Phe Lys
1               5                   10                  15
Gly

SEQ ID NO: 8 (length 14, PRT, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic peptide")
Ser Asn Tyr Tyr Gly Ser Ser Tyr Val Trp Phe Phe Asp Val
1               5                   10

SEQ ID NO: 9 (length 10, PRT, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic peptide")
Arg Ala Ser Ser Ser Val Ser Tyr Met Asp
1               5                   10

SEQ ID NO: 10 (length 7, PRT, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic peptide")
Ala Thr Ser Asn Leu Ala Ser
1               5

SEQ ID NO: 11 (length 9, PRT, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic peptide")
Gln Gln Trp Ile Ser Asn Pro Pro Thr
1               5

SEQ ID NO: 12 (length 511, PRT, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic polypeptide")
Met Gln Val Gln Leu Val Gln Ser Gly Ala Glu Leu Val Lys
Pro Gly1 5 10 15Ala Ser
Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser 20
25 30Tyr Asn Met His Trp Val Lys Gln Thr
Pro Gly Gln Gly Leu Glu Trp 35 40
45Ile Gly Ala Ile Tyr Pro Gly Asn Gly Asp Thr Ser Tyr Asn Gln Lys 50
55 60Phe Lys Gly Lys Ala Thr Leu Thr Ala
Asp Lys Ser Ser Ser Thr Ala65 70 75
80Tyr Met Gln Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val
Tyr Tyr 85 90 95Cys Ala
Arg Ala Gln Leu Arg Pro Asn Tyr Trp Tyr Phe Asp Val Trp 100
105 110Gly Ala Gly Thr Thr Val Thr Val Ser
Ser Gly Gly Gly Gly Ser Gly 115 120
125Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
130 135 140Gly Gly Ser Asp Ile Val Leu
Ser Gln Ser Pro Ala Ile Leu Ser Ala145 150
155 160Ser Pro Gly Glu Lys Val Thr Met Thr Cys Arg Ala
Ser Ser Ser Val 165 170
175Ser Tyr Met His Trp Tyr Gln Gln Lys Pro Gly Ser Ser Pro Lys Pro
180 185 190Trp Ile Tyr Ala Thr Ser
Asn Leu Ala Ser Gly Val Pro Ala Arg Phe 195 200
205Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile Ser
Arg Val 210 215 220Glu Ala Glu Asp Ala
Ala Thr Tyr Tyr Cys Gln Gln Trp Ile Ser Asn225 230
235 240Pro Pro Thr Phe Gly Ala Gly Thr Lys Leu
Glu Leu Lys Gly Gly Gly 245 250
255Gly Ser Gly Gly Lys Glu Phe Thr Leu Asp Phe Ser Thr Ala Lys Thr
260 265 270Tyr Val Asp Ser Leu
Asn Val Ile Arg Ser Ala Ile Gly Thr Pro Leu 275
280 285Gln Thr Ile Ser Ser Gly Gly Thr Ser Leu Leu Met
Ile Asp Ser Gly 290 295 300Ser Gly Asp
Asn Leu Phe Ala Val Asp Val Arg Gly Ile Asp Pro Glu305
310 315 320Glu Gly Arg Phe Asn Asn Leu
Arg Leu Ile Val Glu Arg Asn Asn Leu 325
330 335Tyr Val Thr Gly Phe Val Asn Arg Thr Asn Asn Val
Phe Tyr Arg Phe 340 345 350Ala
Asp Phe Ser His Val Thr Phe Pro Gly Thr Thr Ala Val Thr Leu 355
360 365Ser Gly Asp Ser Ser Tyr Thr Thr Leu
Gln Arg Val Ala Gly Ile Ser 370 375
380Arg Thr Gly Met Gln Ile Asn Arg His Ser Leu Thr Thr Ser Tyr Leu385
390 395 400Asp Leu Met Ser
His Ser Gly Thr Ser Leu Thr Gln Ser Val Ala Arg 405
410 415Ala Met Leu Arg Phe Val Thr Val Thr Ala
Glu Ala Leu Arg Phe Arg 420 425
430Gln Ile Gln Arg Gly Phe Arg Thr Thr Leu Asp Asp Leu Ser Gly Arg
435 440 445Ser Tyr Val Met Thr Ala Glu
Asp Val Asp Leu Thr Leu Asn Trp Gly 450 455
460Arg Leu Ser Ser Val Leu Pro Asp Tyr His Gly Gln Asp Ser Val
Arg465 470 475 480Val Gly
Arg Ile Ser Phe Gly Ser Ile Asn Ala Ile Leu Gly Ser Val
485 490 495Ala Leu Ile Leu Asn Cys His
His His Ala Ser Arg Val Ala Arg 500 505
510

SEQ ID NO: 13 (length 1533, DNA, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic polynucleotide")
modified_base(9)..(9)a, c, t, g, unknown or other
modified_base(15)..(15)a, c, t, g, unknown or
othermodified_base(18)..(18)a, c, t, g, unknown or
othermodified_base(24)..(24)a, c, t, g, unknown or
othermodified_base(27)..(27)a, c, t, g, unknown or
othermodified_base(30)..(30)a, c, t, g, unknown or
othermodified_base(36)..(36)a, c, t, g, unknown or
othermodified_base(39)..(39)a, c, t, g, unknown or
othermodified_base(45)..(45)a, c, t, g, unknown or
othermodified_base(48)..(48)a, c, t, g, unknown or
othermodified_base(51)..(51)a, c, t, g, unknown or
othermodified_base(54)..(54)a, c, t, g, unknown or
othermodified_base(57)..(57)a, c, t, g, unknown or
othermodified_base(66)..(66)a, c, t, g, unknown or
othermodified_base(75)..(75)a, c, t, g, unknown or
othermodified_base(78)..(78)a, c, t, g, unknown or
othermodified_base(81)..(81)a, c, t, g, unknown or
othermodified_base(87)..(87)a, c, t, g, unknown or
othermodified_base(93)..(93)a, c, t, g, unknown or
othermodified_base(96)..(96)a, c, t, g, unknown or
othermodified_base(114)..(114)a, c, t, g, unknown or
othermodified_base(123)..(123)a, c, t, g, unknown or
othermodified_base(126)..(126)a, c, t, g, unknown or
othermodified_base(129)..(129)a, c, t, g, unknown or
othermodified_base(135)..(135)a, c, t, g, unknown or
othermodified_base(138)..(138)a, c, t, g, unknown or
othermodified_base(150)..(150)a, c, t, g, unknown or
othermodified_base(153)..(153)a, c, t, g, unknown or
othermodified_base(162)..(162)a, c, t, g, unknown or
othermodified_base(165)..(165)a, c, t, g, unknown or
othermodified_base(171)..(171)a, c, t, g, unknown or
othermodified_base(177)..(177)a, c, t, g, unknown or
othermodified_base(180)..(180)a, c, t, g, unknown or
othermodified_base(201)..(201)a, c, t, g, unknown or
othermodified_base(207)..(207)a, c, t, g, unknown or
othermodified_base(210)..(210)a, c, t, g, unknown or
othermodified_base(213)..(213)a, c, t, g, unknown or
othermodified_base(216)..(216)a, c, t, g, unknown or
othermodified_base(219)..(219)a, c, t, g, unknown or
othermodified_base(228)..(228)a, c, t, g, unknown or
othermodified_base(231)..(231)a, c, t, g, unknown or
othermodified_base(234)..(234)a, c, t, g, unknown or
othermodified_base(237)..(237)a, c, t, g, unknown or
othermodified_base(240)..(240)a, c, t, g, unknown or
othermodified_base(252)..(252)a, c, t, g, unknown or
othermodified_base(255)..(255)a, c, t, g, unknown or
othermodified_base(258)..(258)a, c, t, g, unknown or
othermodified_base(261)..(261)a, c, t, g, unknown or
othermodified_base(264)..(264)a, c, t, g, unknown or
othermodified_base(267)..(267)a, c, t, g, unknown or
othermodified_base(276)..(276)a, c, t, g, unknown or
othermodified_base(279)..(279)a, c, t, g, unknown or
othermodified_base(282)..(282)a, c, t, g, unknown or
othermodified_base(294)..(294)a, c, t, g, unknown or
othermodified_base(297)..(297)a, c, t, g, unknown or
othermodified_base(300)..(300)a, c, t, g, unknown or
othermodified_base(306)..(306)a, c, t, g, unknown or
othermodified_base(309)..(309)a, c, t, g, unknown or
othermodified_base(312)..(312)a, c, t, g, unknown or
othermodified_base(333)..(333)a, c, t, g, unknown or
othermodified_base(339)..(339)a, c, t, g, unknown or
othermodified_base(342)..(342)a, c, t, g, unknown or
othermodified_base(345)..(345)a, c, t, g, unknown or
othermodified_base(348)..(348)a, c, t, g, unknown or
othermodified_base(351)..(351)a, c, t, g, unknown or
othermodified_base(354)..(354)a, c, t, g, unknown or
othermodified_base(357)..(357)a, c, t, g, unknown or
othermodified_base(360)..(360)a, c, t, g, unknown or
othermodified_base(363)..(363)a, c, t, g, unknown or
othermodified_base(366)..(366)a, c, t, g, unknown or
othermodified_base(369)..(369)a, c, t, g, unknown or
othermodified_base(372)..(372)a, c, t, g, unknown or
othermodified_base(375)..(375)a, c, t, g, unknown or
othermodified_base(378)..(378)a, c, t, g, unknown or
othermodified_base(381)..(381)a, c, t, g, unknown or
othermodified_base(384)..(384)a, c, t, g, unknown or
othermodified_base(387)..(387)a, c, t, g, unknown or
othermodified_base(390)..(390)a, c, t, g, unknown or
othermodified_base(393)..(393)a, c, t, g, unknown or
othermodified_base(396)..(396)a, c, t, g, unknown or
othermodified_base(399)..(399)a, c, t, g, unknown or
othermodified_base(402)..(402)a, c, t, g, unknown or
othermodified_base(405)..(405)a, c, t, g, unknown or
othermodified_base(408)..(408)a, c, t, g, unknown or
othermodified_base(411)..(411)a, c, t, g, unknown or
othermodified_base(414)..(414)a, c, t, g, unknown or
othermodified_base(417)..(417)a, c, t, g, unknown or
othermodified_base(420)..(420)a, c, t, g, unknown or
othermodified_base(423)..(423)a, c, t, g, unknown or
othermodified_base(426)..(426)a, c, t, g, unknown or
othermodified_base(429)..(429)a, c, t, g, unknown or
othermodified_base(432)..(432)a, c, t, g, unknown or
othermodified_base(435)..(435)a, c, t, g, unknown or
othermodified_base(438)..(438)a, c, t, g, unknown or
othermodified_base(441)..(441)a, c, t, g, unknown or
othermodified_base(450)..(450)a, c, t, g, unknown or
othermodified_base(453)..(453)a, c, t, g, unknown or
othermodified_base(456)..(456)a, c, t, g, unknown or
othermodified_base(462)..(462)a, c, t, g, unknown or
othermodified_base(465)..(465)a, c, t, g, unknown or
othermodified_base(468)..(468)a, c, t, g, unknown or
othermodified_base(474)..(474)a, c, t, g, unknown or
othermodified_base(477)..(477)a, c, t, g, unknown or
othermodified_base(480)..(480)a, c, t, g, unknown or
othermodified_base(483)..(483)a, c, t, g, unknown or
othermodified_base(486)..(486)a, c, t, g, unknown or
othermodified_base(489)..(489)a, c, t, g, unknown or
othermodified_base(498)..(498)a, c, t, g, unknown or
othermodified_base(501)..(501)a, c, t, g, unknown or
othermodified_base(507)..(507)a, c, t, g, unknown or
othermodified_base(513)..(513)a, c, t, g, unknown or
othermodified_base(516)..(516)a, c, t, g, unknown or
othermodified_base(519)..(519)a, c, t, g, unknown or
othermodified_base(522)..(522)a, c, t, g, unknown or
othermodified_base(525)..(525)a, c, t, g, unknown or
othermodified_base(528)..(528)a, c, t, g, unknown or
othermodified_base(531)..(531)a, c, t, g, unknown or
othermodified_base(558)..(558)a, c, t, g, unknown or
othermodified_base(561)..(561)a, c, t, g, unknown or
othermodified_base(564)..(564)a, c, t, g, unknown or
othermodified_base(567)..(567)a, c, t, g, unknown or
othermodified_base(570)..(570)a, c, t, g, unknown or
othermodified_base(576)..(576)a, c, t, g, unknown or
othermodified_base(588)..(588)a, c, t, g, unknown or
othermodified_base(591)..(591)a, c, t, g, unknown or
othermodified_base(594)..(594)a, c, t, g, unknown or
othermodified_base(600)..(600)a, c, t, g, unknown or
othermodified_base(603)..(603)a, c, t, g, unknown or
othermodified_base(606)..(606)a, c, t, g, unknown or
othermodified_base(609)..(609)a, c, t, g, unknown or
othermodified_base(612)..(612)a, c, t, g, unknown or
othermodified_base(615)..(615)a, c, t, g, unknown or
othermodified_base(618)..(618)a, c, t, g, unknown or
othermodified_base(621)..(621)a, c, t, g, unknown or
othermodified_base(627)..(627)a, c, t, g, unknown or
othermodified_base(630)..(630)a, c, t, g, unknown or
othermodified_base(633)..(633)a, c, t, g, unknown or
othermodified_base(636)..(636)a, c, t, g, unknown or
othermodified_base(639)..(639)a, c, t, g, unknown or
othermodified_base(642)..(642)a, c, t, g, unknown or
othermodified_base(645)..(645)a, c, t, g, unknown or
othermodified_base(648)..(648)a, c, t, g, unknown or
othermodified_base(654)..(654)a, c, t, g, unknown or
othermodified_base(657)..(657)a, c, t, g, unknown or
othermodified_base(660)..(660)a, c, t, g, unknown or
othermodified_base(666)..(666)a, c, t, g, unknown or
othermodified_base(669)..(669)a, c, t, g, unknown or
othermodified_base(672)..(672)a, c, t, g, unknown or
othermodified_base(678)..(678)a, c, t, g, unknown or
othermodified_base(687)..(687)a, c, t, g, unknown or
othermodified_base(690)..(690)a, c, t, g, unknown or
othermodified_base(693)..(693)a, c, t, g, unknown or
othermodified_base(717)..(717)a, c, t, g, unknown or
othermodified_base(723)..(723)a, c, t, g, unknown or
othermodified_base(726)..(726)a, c, t, g, unknown or
othermodified_base(729)..(729)a, c, t, g, unknown or
othermodified_base(735)..(735)a, c, t, g, unknown or
othermodified_base(738)..(738)a, c, t, g, unknown or
othermodified_base(741)..(741)a, c, t, g, unknown or
othermodified_base(744)..(744)a, c, t, g, unknown or
othermodified_base(750)..(750)a, c, t, g, unknown or
othermodified_base(756)..(756)a, c, t, g, unknown or
othermodified_base(762)..(762)a, c, t, g, unknown or
othermodified_base(765)..(765)a, c, t, g, unknown or
othermodified_base(768)..(768)a, c, t, g, unknown or
othermodified_base(771)..(771)a, c, t, g, unknown or
othermodified_base(774)..(774)a, c, t, g, unknown or
othermodified_base(777)..(777)a, c, t, g, unknown or
othermodified_base(780)..(780)a, c, t, g, unknown or
othermodified_base(792)..(792)a, c, t, g, unknown or
othermodified_base(795)..(795)a, c, t, g, unknown or
othermodified_base(804)..(804)a, c, t, g, unknown or
othermodified_base(807)..(807)a, c, t, g, unknown or
othermodified_base(810)..(810)a, c, t, g, unknown or
othermodified_base(816)..(816)a, c, t, g, unknown or
othermodified_base(822)..(822)a, c, t, g, unknown or
othermodified_base(828)..(828)a, c, t, g, unknown or
othermodified_base(831)..(831)a, c, t, g, unknown or
othermodified_base(837)..(837)a, c, t, g, unknown or
othermodified_base(843)..(843)a, c, t, g, unknown or
othermodified_base(846)..(846)a, c, t, g, unknown or
othermodified_base(849)..(849)a, c, t, g, unknown or
othermodified_base(855)..(855)a, c, t, g, unknown or
othermodified_base(858)..(858)a, c, t, g, unknown or
othermodified_base(861)..(861)a, c, t, g, unknown or
othermodified_base(864)..(864)a, c, t, g, unknown or
othermodified_base(870)..(870)a, c, t, g, unknown or
othermodified_base(876)..(876)a, c, t, g, unknown or
othermodified_base(879)..(879)a, c, t, g, unknown or
othermodified_base(882)..(882)a, c, t, g, unknown or
othermodified_base(885)..(885)a, c, t, g, unknown or
othermodified_base(888)..(888)a, c, t, g, unknown or
othermodified_base(891)..(891)a, c, t, g, unknown or
othermodified_base(894)..(894)a, c, t, g, unknown or
othermodified_base(897)..(897)a, c, t, g, unknown or
othermodified_base(909)..(909)a, c, t, g, unknown or
othermodified_base(912)..(912)a, c, t, g, unknown or
othermodified_base(915)..(915)a, c, t, g, unknown or
othermodified_base(918)..(918)a, c, t, g, unknown or
othermodified_base(927)..(927)a, c, t, g, unknown or
othermodified_base(933)..(933)a, c, t, g, unknown or
othermodified_base(936)..(936)a, c, t, g, unknown or
othermodified_base(942)..(942)a, c, t, g, unknown or
othermodified_base(945)..(945)a, c, t, g, unknown or
othermodified_base(948)..(948)a, c, t, g, unknown or
othermodified_base(957)..(957)a, c, t, g, unknown or
othermodified_base(966)..(966)a, c, t, g, unknown or
othermodified_base(969)..(969)a, c, t, g, unknown or
othermodified_base(981)..(981)a, c, t, g, unknown or
othermodified_base(984)..(984)a, c, t, g, unknown or
othermodified_base(987)..(987)a, c, t, g, unknown or
othermodified_base(993)..(993)a, c, t, g, unknown or
othermodified_base(999)..(999)a, c, t, g, unknown or
othermodified_base(1008)..(1008)a, c, t, g, unknown or
othermodified_base(1014)..(1014)a, c, t, g, unknown or
othermodified_base(1017)..(1017)a, c, t, g, unknown or
othermodified_base(1020)..(1020)a, c, t, g, unknown or
othermodified_base(1026)..(1026)a, c, t, g, unknown or
othermodified_base(1032)..(1032)a, c, t, g, unknown or
othermodified_base(1035)..(1035)a, c, t, g, unknown or
othermodified_base(1044)..(1044)a, c, t, g, unknown or
othermodified_base(1053)..(1053)a, c, t, g, unknown or
othermodified_base(1059)..(1059)a, c, t, g, unknown or
othermodified_base(1068)..(1068)a, c, t, g, unknown or
othermodified_base(1074)..(1074)a, c, t, g, unknown or
othermodified_base(1077)..(1077)a, c, t, g, unknown or
othermodified_base(1083)..(1083)a, c, t, g, unknown or
othermodified_base(1086)..(1086)a, c, t, g, unknown or
othermodified_base(1089)..(1089)a, c, t, g, unknown or
othermodified_base(1092)..(1092)a, c, t, g, unknown or
othermodified_base(1095)..(1095)a, c, t, g, unknown or
othermodified_base(1098)..(1098)a, c, t, g, unknown or
othermodified_base(1101)..(1101)a, c, t, g, unknown or
othermodified_base(1104)..(1104)a, c, t, g, unknown or
othermodified_base(1107)..(1107)a, c, t, g, unknown or
othermodified_base(1110)..(1110)a, c, t, g, unknown or
othermodified_base(1116)..(1116)a, c, t, g, unknown or
othermodified_base(1119)..(1119)a, c, t, g, unknown or
othermodified_base(1125)..(1125)a, c, t, g, unknown or
othermodified_base(1128)..(1128)a, c, t, g, unknown or
othermodified_base(1131)..(1131)a, c, t, g, unknown or
othermodified_base(1137)..(1137)a, c, t, g, unknown or
othermodified_base(1140)..(1140)a, c, t, g, unknown or
othermodified_base(1143)..(1143)a, c, t, g, unknown or
othermodified_base(1146)..(1146)a, c, t, g, unknown or
othermodified_base(1152)..(1152)a, c, t, g, unknown or
othermodified_base(1155)..(1155)a, c, t, g, unknown or
othermodified_base(1158)..(1158)a, c, t, g, unknown or
othermodified_base(1161)..(1161)a, c, t, g, unknown or
othermodified_base(1176)..(1176)a, c, t, g, unknown or
othermodified_base(1182)..(1182)a, c, t, g, unknown or
othermodified_base(1185)..(1185)a, c, t, g, unknown or
othermodified_base(1188)..(1188)a, c, t, g, unknown or
othermodified_base(1191)..(1191)a, c, t, g, unknown or
othermodified_base(1194)..(1194)a, c, t, g, unknown or
othermodified_base(1200)..(1200)a, c, t, g, unknown or
othermodified_base(1206)..(1206)a, c, t, g, unknown or
othermodified_base(1212)..(1212)a, c, t, g, unknown or
othermodified_base(1218)..(1218)a, c, t, g, unknown or
othermodified_base(1221)..(1221)a, c, t, g, unknown or
othermodified_base(1224)..(1224)a, c, t, g, unknown or
othermodified_base(1227)..(1227)a, c, t, g, unknown or
othermodified_base(1230)..(1230)a, c, t, g, unknown or
othermodified_base(1233)..(1233)a, c, t, g, unknown or
othermodified_base(1239)..(1239)a, c, t, g, unknown or
othermodified_base(1242)..(1242)a, c, t, g, unknown or
othermodified_base(1245)..(1245)a, c, t, g, unknown or
othermodified_base(1248)..(1248)a, c, t, g, unknown or
othermodified_base(1251)..(1251)a, c, t, g, unknown or
othermodified_base(1257)..(1257)a, c, t, g, unknown or
othermodified_base(1260)..(1260)a, c, t, g, unknown or
othermodified_base(1266)..(1266)a, c, t, g, unknown or
othermodified_base(1269)..(1269)a, c, t, g, unknown or
othermodified_base(1272)..(1272)a, c, t, g, unknown or
othermodified_base(1275)..(1275)a, c, t, g, unknown or
othermodified_base(1278)..(1278)a, c, t, g, unknown or
othermodified_base(1284)..(1284)a, c, t, g, unknown or
othermodified_base(1287)..(1287)a, c, t, g, unknown or
othermodified_base(1290)..(1290)a, c, t, g, unknown or
othermodified_base(1296)..(1296)a, c, t, g, unknown or
othermodified_base(1308)..(1308)a, c, t, g, unknown or
othermodified_base(1311)..(1311)a, c, t, g, unknown or
othermodified_base(1317)..(1317)a, c, t, g, unknown or
othermodified_base(1320)..(1320)a, c, t, g, unknown or
othermodified_base(1323)..(1323)a, c, t, g, unknown or
othermodified_base(1326)..(1326)a, c, t, g, unknown or
othermodified_base(1335)..(1335)a, c, t, g, unknown or
othermodified_base(1338)..(1338)a, c, t, g, unknown or
othermodified_base(1341)..(1341)a, c, t, g, unknown or
othermodified_base(1344)..(1344)a, c, t, g, unknown or
othermodified_base(1347)..(1347)a, c, t, g, unknown or
othermodified_base(1353)..(1353)a, c, t, g, unknown or
othermodified_base(1359)..(1359)a, c, t, g, unknown or
othermodified_base(1362)..(1362)a, c, t, g, unknown or
othermodified_base(1371)..(1371)a, c, t, g, unknown or
othermodified_base(1377)..(1377)a, c, t, g, unknown or
othermodified_base(1380)..(1380)a, c, t, g, unknown or
othermodified_base(1383)..(1383)a, c, t, g, unknown or
othermodified_base(1392)..(1392)a, c, t, g, unknown or
othermodified_base(1395)..(1395)a, c, t, g, unknown or
othermodified_base(1398)..(1398)a, c, t, g, unknown or
othermodified_base(1401)..(1401)a, c, t, g, unknown or
othermodified_base(1404)..(1404)a, c, t, g, unknown or
othermodified_base(1407)..(1407)a, c, t, g, unknown or
othermodified_base(1410)..(1410)a, c, t, g, unknown or
othermodified_base(1413)..(1413)a, c, t, g, unknown or
othermodified_base(1425)..(1425)a, c, t, g, unknown or
othermodified_base(1434)..(1434)a, c, t, g, unknown or
othermodified_base(1437)..(1437)a, c, t, g, unknown or
othermodified_base(1440)..(1440)a, c, t, g, unknown or
othermodified_base(1443)..(1443)a, c, t, g, unknown or
othermodified_base(1446)..(1446)a, c, t, g, unknown or
othermodified_base(1449)..(1449)a, c, t, g, unknown or
othermodified_base(1455)..(1455)a, c, t, g, unknown or
othermodified_base(1461)..(1461)a, c, t, g, unknown or
othermodified_base(1464)..(1464)a, c, t, g, unknown or
othermodified_base(1473)..(1473)a, c, t, g, unknown or
othermodified_base(1479)..(1479)a, c, t, g, unknown or
othermodified_base(1482)..(1482)a, c, t, g, unknown or
othermodified_base(1485)..(1485)a, c, t, g, unknown or
othermodified_base(1488)..(1488)a, c, t, g, unknown or
othermodified_base(1491)..(1491)a, c, t, g, unknown or
othermodified_base(1494)..(1494)a, c, t, g, unknown or
othermodified_base(1500)..(1500)a, c, t, g, unknown or
othermodified_base(1518)..(1518)a, c, t, g, unknown or
othermodified_base(1521)..(1521)a, c, t, g, unknown or
othermodified_base(1524)..(1524)a, c, t, g, unknown or
othermodified_base(1527)..(1527)a, c, t, g, unknown or
othermodified_base(1530)..(1530)a, c, t, g, unknown or
othermodified_base(1533)..(1533)a, c, t, g, unknown or other 13atgcargtnc
arytngtnca rwsnggngcn garytngtna arccnggngc nwsngtnaar 60atgwsntgya
argcnwsngg ntayacntty acnwsntaya ayatgcaytg ggtnaarcar 120acnccnggnc
arggnytnga rtggathggn gcnathtayc cnggnaaygg ngayacnwsn 180tayaaycara
arttyaargg naargcnacn ytnacngcng ayaarwsnws nwsnacngcn 240tayatgcary
tnwsnwsnyt nacnwsngar gaywsngcng tntaytaytg ygcnmgngcn 300carytnmgnc
cnaaytaytg gtayttygay gtntggggng cnggnacnac ngtnacngtn 360wsnwsnggng
gnggnggnws nggnggnggn ggnwsnggng gnggnggnws nggnggnggn 420ggnwsnggng
gnggnggnws ngayathgtn ytnwsncarw snccngcnat hytnwsngcn 480wsnccnggng
araargtnac natgacntgy mgngcnwsnw snwsngtnws ntayatgcay 540tggtaycarc
araarccngg nwsnwsnccn aarccntgga thtaygcnac nwsnaayytn 600gcnwsnggng
tnccngcnmg nttywsnggn wsnggnwsng gnacnwsnta ywsnytnacn 660athwsnmgng
tngargcnga rgaygcngcn acntaytayt gycarcartg gathwsnaay 720ccnccnacnt
tyggngcngg nacnaarytn garytnaarg gnggnggngg nwsnggnggn 780aargarttya
cnytngaytt ywsnacngcn aaracntayg tngaywsnyt naaygtnath 840mgnwsngcna
thggnacncc nytncaracn athwsnwsng gnggnacnws nytnytnatg 900athgaywsng
gnwsnggnga yaayytntty gcngtngayg tnmgnggnat hgayccngar 960garggnmgnt
tyaayaayyt nmgnytnath gtngarmgna ayaayytnta ygtnacnggn 1020ttygtnaaym
gnacnaayaa ygtnttytay mgnttygcng ayttywsnca ygtnacntty 1080ccnggnacna
cngcngtnac nytnwsnggn gaywsnwsnt ayacnacnyt ncarmgngtn 1140gcnggnathw
snmgnacngg natgcarath aaymgncayw snytnacnac nwsntayytn 1200gayytnatgw
sncaywsngg nacnwsnytn acncarwsng tngcnmgngc natgytnmgn 1260ttygtnacng
tnacngcnga rgcnytnmgn ttymgncara thcarmgngg nttymgnacn 1320acnytngayg
ayytnwsngg nmgnwsntay gtnatgacng cngargaygt ngayytnacn 1380ytnaaytggg
gnmgnytnws nwsngtnytn ccngaytayc ayggncarga ywsngtnmgn 1440gtnggnmgna
thwsnttygg nwsnathaay gcnathytng gnwsngtngc nytnathytn 1500aaytgycayc
aycaygcnws nmgngtngcn mgn
1533

SEQ ID NO: 14 (length 513, PRT, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic polypeptide")
Met Gln Val Gln Leu Gln Gln Pro
Gly Ala Glu Leu Val Lys Pro Gly1 5 10
15Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe
Thr Ser 20 25 30Tyr Asn Met
His Trp Val Lys Gln Thr Pro Gly Arg Gly Leu Glu Trp 35
40 45Ile Gly Ala Ile Tyr Pro Gly Asn Gly Asp Thr
Ser Tyr Asn Gln Lys 50 55 60Phe Lys
Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser Thr Ala65
70 75 80Tyr Met Gln Leu Ser Ser Leu
Thr Ser Glu Asp Ser Ala Val Tyr Tyr 85 90
95Cys Ala Arg Ser Thr Tyr Tyr Gly Gly Asp Trp Tyr Phe
Asn Val Trp 100 105 110Gly Ala
Gly Thr Thr Val Thr Val Ser Ala Gly Ser Thr Ser Gly Ser 115
120 125Gly Lys Pro Gly Ser Gly Glu Gly Ser Thr
Lys Gly Gln Ile Val Leu 130 135 140Ser
Gln Ser Pro Ala Ile Leu Ser Ala Ser Pro Gly Glu Lys Val Thr145
150 155 160Met Thr Cys Arg Ala Ser
Ser Ser Val Ser Tyr Ile His Trp Phe Gln 165
170 175Gln Lys Pro Gly Ser Ser Pro Lys Pro Trp Ile Tyr
Ala Thr Ser Asn 180 185 190Leu
Ala Ser Gly Val Pro Val Arg Phe Ser Gly Ser Gly Ser Gly Thr 195
200 205Ser Tyr Ser Leu Thr Ile Ser Arg Val
Glu Ala Glu Asp Ala Ala Thr 210 215
220Tyr Tyr Cys Gln Gln Trp Thr Ser Asn Pro Pro Thr Phe Gly Gly Gly225
230 235 240Thr Lys Leu Glu
Ile Lys Glu Phe Pro Lys Pro Ser Thr Pro Pro Gly 245
250 255Ser Ser Gly Gly Ala Pro Lys Glu Phe Thr
Leu Asp Phe Ser Thr Ala 260 265
270Lys Thr Tyr Val Asp Ser Leu Asn Val Ile Arg Ser Ala Ile Gly Thr
275 280 285Pro Leu Gln Thr Ile Ser Ser
Gly Gly Thr Ser Leu Leu Met Ile Asp 290 295
300Ser Gly Ser Gly Asp Asn Leu Phe Ala Val Asp Val Arg Gly Ile
Asp305 310 315 320Pro Glu
Glu Gly Arg Phe Asn Asn Leu Arg Leu Ile Val Glu Arg Asn
325 330 335Asn Leu Tyr Val Thr Gly Phe
Val Asn Arg Thr Asn Asn Val Phe Tyr 340 345
350Arg Phe Ala Asp Phe Ser His Val Thr Phe Pro Gly Thr Thr
Ala Val 355 360 365Thr Leu Ser Gly
Asp Ser Ser Tyr Thr Thr Leu Gln Arg Val Ala Gly 370
375 380Ile Ser Arg Thr Gly Met Gln Ile Asn Arg His Ser
Leu Thr Thr Ser385 390 395
400Tyr Leu Asp Leu Met Ser His Ser Gly Thr Ser Leu Thr Gln Ser Val
405 410 415Ala Arg Ala Met Leu
Arg Phe Val Thr Val Thr Ala Glu Ala Leu Arg 420
425 430Phe Arg Gln Ile Gln Arg Gly Phe Arg Thr Thr Leu
Asp Asp Leu Ser 435 440 445Gly Arg
Ser Tyr Val Met Thr Ala Glu Asp Val Asp Leu Thr Leu Asn 450
455 460Trp Gly Arg Leu Ser Ser Val Leu Pro Asp Tyr
His Gly Gln Asp Ser465 470 475
480Val Arg Val Gly Arg Ile Ser Phe Gly Ser Ile Asn Ala Ile Leu Gly
485 490 495Ser Val Ala Leu
Ile Leu Asn Cys His His His Ala Ser Arg Val Ala 500
505 510
Arg

SEQ ID NO: 15 (length 1539, DNA, Artificial Sequence; source/note="Description of Artificial Sequence Synthetic polynucleotide")
modified_base(9)..(9)a, c, t, g, unknown or
othermodified_base(15)..(15)a, c, t, g, unknown or
othermodified_base(24)..(24)a, c, t, g, unknown or
othermodified_base(27)..(27)a, c, t, g, unknown or
othermodified_base(30)..(30)a, c, t, g, unknown or
othermodified_base(36)..(36)a, c, t, g, unknown or
othermodified_base(39)..(39)a, c, t, g, unknown or
othermodified_base(45)..(45)a, c, t, g, unknown or
othermodified_base(48)..(48)a, c, t, g, unknown or
othermodified_base(51)..(51)a, c, t, g, unknown or
othermodified_base(54)..(54)a, c, t, g, unknown or
othermodified_base(57)..(57)a, c, t, g, unknown or
othermodified_base(66)..(66)a, c, t, g, unknown or
othermodified_base(75)..(75)a, c, t, g, unknown or
othermodified_base(78)..(78)a, c, t, g, unknown or
othermodified_base(81)..(81)a, c, t, g, unknown or
othermodified_base(87)..(87)a, c, t, g, unknown or
othermodified_base(93)..(93)a, c, t, g, unknown or
othermodified_base(96)..(96)a, c, t, g, unknown or
othermodified_base(114)..(114)a, c, t, g, unknown or
othermodified_base(123)..(123)a, c, t, g, unknown or
othermodified_base(126)..(126)a, c, t, g, unknown or
othermodified_base(129)..(129)a, c, t, g, unknown or
othermodified_base(132)..(132)a, c, t, g, unknown or
othermodified_base(135)..(135)a, c, t, g, unknown or
othermodified_base(138)..(138)a, c, t, g, unknown or
othermodified_base(150)..(150)a, c, t, g, unknown or
othermodified_base(153)..(153)a, c, t, g, unknown or
othermodified_base(162)..(162)a, c, t, g, unknown or
othermodified_base(165)..(165)a, c, t, g, unknown or
othermodified_base(171)..(171)a, c, t, g, unknown or
othermodified_base(177)..(177)a, c, t, g, unknown or
othermodified_base(180)..(180)a, c, t, g, unknown or
othermodified_base(201)..(201)a, c, t, g, unknown or
othermodified_base(207)..(207)a, c, t, g, unknown or
othermodified_base(210)..(210)a, c, t, g, unknown or
othermodified_base(213)..(213)a, c, t, g, unknown or
othermodified_base(216)..(216)a, c, t, g, unknown or
othermodified_base(219)..(219)a, c, t, g, unknown or
othermodified_base(228)..(228)a, c, t, g, unknown or
othermodified_base(231)..(231)a, c, t, g, unknown or
othermodified_base(234)..(234)a, c, t, g, unknown or
othermodified_base(237)..(237)a, c, t, g, unknown or
othermodified_base(240)..(240)a, c, t, g, unknown or
othermodified_base(252)..(252)a, c, t, g, unknown or
othermodified_base(255)..(255)a, c, t, g, unknown or
othermodified_base(258)..(258)a, c, t, g, unknown or
othermodified_base(261)..(261)a, c, t, g, unknown or
othermodified_base(264)..(264)a, c, t, g, unknown or
othermodified_base(267)..(267)a, c, t, g, unknown or
othermodified_base(276)..(276)a, c, t, g, unknown or
othermodified_base(279)..(279)a, c, t, g, unknown or
othermodified_base(282)..(282)a, c, t, g, unknown or
othermodified_base(294)..(294)a, c, t, g, unknown or
othermodified_base(297)..(297)a, c, t, g, unknown or
othermodified_base(300)..(300)a, c, t, g, unknown or
othermodified_base(303)..(303)a, c, t, g, unknown or
othermodified_base(312)..(312)a, c, t, g, unknown or
othermodified_base(315)..(315)a, c, t, g, unknown or
othermodified_base(333)..(333)a, c, t, g, unknown or
othermodified_base(339)..(339)a, c, t, g, unknown or
othermodified_base(342)..(342)a, c, t, g, unknown or
othermodified_base(345)..(345)a, c, t, g, unknown or
othermodified_base(348)..(348)a, c, t, g, unknown or
othermodified_base(351)..(351)a, c, t, g, unknown or
othermodified_base(354)..(354)a, c, t, g, unknown or
othermodified_base(357)..(357)a, c, t, g, unknown or
othermodified_base(360)..(360)a, c, t, g, unknown or
othermodified_base(363)..(363)a, c, t, g, unknown or
othermodified_base(366)..(366)a, c, t, g, unknown or
othermodified_base(369)..(369)a, c, t, g, unknown or
othermodified_base(372)..(372)a, c, t, g, unknown or
othermodified_base(375)..(375)a, c, t, g, unknown or
othermodified_base(378)..(378)a, c, t, g, unknown or
othermodified_base(381)..(381)a, c, t, g, unknown or
othermodified_base(384)..(384)a, c, t, g, unknown or
othermodified_base(387)..(387)a, c, t, g, unknown or
othermodified_base(393)..(393)a, c, t, g, unknown or
othermodified_base(396)..(396)a, c, t, g, unknown or
othermodified_base(399)..(399)a, c, t, g, unknown or
othermodified_base(402)..(402)a, c, t, g, unknown or
othermodified_base(408)..(408)a, c, t, g, unknown or
othermodified_base(411)..(411)a, c, t, g, unknown or
othermodified_base(414)..(414)a, c, t, g, unknown or
othermodified_base(420)..(420)a, c, t, g, unknown or
othermodified_base(429)..(429)a, c, t, g, unknown or
othermodified_base(432)..(432)a, c, t, g, unknown or
othermodified_base(435)..(435)a, c, t, g, unknown or
othermodified_base(441)..(441)a, c, t, g, unknown or
othermodified_base(444)..(444)a, c, t, g, unknown or
othermodified_base(447)..(447)a, c, t, g, unknown or
othermodified_base(453)..(453)a, c, t, g, unknown or
othermodified_base(456)..(456)a, c, t, g, unknown or
othermodified_base(459)..(459)a, c, t, g, unknown or
othermodified_base(462)..(462)a, c, t, g, unknown or
othermodified_base(465)..(465)a, c, t, g, unknown or
othermodified_base(468)..(468)a, c, t, g, unknown or
othermodified_base(477)..(477)a, c, t, g, unknown or
othermodified_base(480)..(480)a, c, t, g, unknown or
othermodified_base(486)..(486)a, c, t, g, unknown or
othermodified_base(492)..(492)a, c, t, g, unknown or
othermodified_base(495)..(495)a, c, t, g, unknown or
othermodified_base(498)..(498)a, c, t, g, unknown or
othermodified_base(501)..(501)a, c, t, g, unknown or
othermodified_base(504)..(504)a, c, t, g, unknown or
othermodified_base(507)..(507)a, c, t, g, unknown or
othermodified_base(510)..(510)a, c, t, g, unknown or
othermodified_base(537)..(537)a, c, t, g, unknown or
othermodified_base(540)..(540)a, c, t, g, unknown or
othermodified_base(543)..(543)a, c, t, g, unknown or
othermodified_base(546)..(546)a, c, t, g, unknown or
othermodified_base(549)..(549)a, c, t, g, unknown or
othermodified_base(555)..(555)a, c, t, g, unknown or
othermodified_base(567)..(567)a, c, t, g, unknown or
othermodified_base(570)..(570)a, c, t, g, unknown or
othermodified_base(573)..(573)a, c, t, g, unknown or
othermodified_base(579)..(579)a, c, t, g, unknown or
othermodified_base(582)..(582)a, c, t, g, unknown or
othermodified_base(585)..(585)a, c, t, g, unknown or
othermodified_base(588)..(588)a, c, t, g, unknown or
othermodified_base(591)..(591)a, c, t, g, unknown or
othermodified_base(594)..(594)a, c, t, g, unknown or
othermodified_base(597)..(597)a, c, t, g, unknown or
othermodified_base(600)..(600)a, c, t, g, unknown or
othermodified_base(606)..(606)a, c, t, g, unknown or
othermodified_base(609)..(609)a, c, t, g, unknown or
othermodified_base(612)..(612)a, c, t, g, unknown or
othermodified_base(615)..(615)a, c, t, g, unknown or
othermodified_base(618)..(618)a, c, t, g, unknown or
othermodified_base(621)..(621)a, c, t, g, unknown or
othermodified_base(624)..(624)a, c, t, g, unknown or
othermodified_base(627)..(627)a, c, t, g, unknown or
othermodified_base(633)..(633)a, c, t, g, unknown or
othermodified_base(636)..(636)a, c, t, g, unknown or
othermodified_base(639)..(639)a, c, t, g, unknown or
othermodified_base(645)..(645)a, c, t, g, unknown or
othermodified_base(648)..(648)a, c, t, g, unknown or
othermodified_base(651)..(651)a, c, t, g, unknown or
othermodified_base(657)..(657)a, c, t, g, unknown or
othermodified_base(666)..(666)a, c, t, g, unknown or
othermodified_base(669)..(669)a, c, t, g, unknown or
othermodified_base(672)..(672)a, c, t, g, unknown or
othermodified_base(693)..(693)a, c, t, g, unknown or
othermodified_base(696)..(696)a, c, t, g, unknown or
othermodified_base(702)..(702)a, c, t, g, unknown or
othermodified_base(705)..(705)a, c, t, g, unknown or
othermodified_base(708)..(708)a, c, t, g, unknown or
othermodified_base(714)..(714)a, c, t, g, unknown or
othermodified_base(717)..(717)a, c, t, g, unknown or
othermodified_base(720)..(720)a, c, t, g, unknown or
othermodified_base(723)..(723)a, c, t, g, unknown or
othermodified_base(729)..(729)a, c, t, g, unknown or
othermodified_base(747)..(747)a, c, t, g, unknown or
othermodified_base(753)..(753)a, c, t, g, unknown or
othermodified_base(756)..(756)a, c, t, g, unknown or
othermodified_base(759)..(759)a, c, t, g, unknown or
othermodified_base(762)..(762)a, c, t, g, unknown or
othermodified_base(765)..(765)a, c, t, g, unknown or
othermodified_base(768)..(768)a, c, t, g, unknown or
othermodified_base(771)..(771)a, c, t, g, unknown or
othermodified_base(774)..(774)a, c, t, g, unknown or
othermodified_base(777)..(777)a, c, t, g, unknown or
othermodified_base(780)..(780)a, c, t, g, unknown or
othermodified_base(783)..(783)a, c, t, g, unknown or
othermodified_base(786)..(786)a, c, t, g, unknown or
othermodified_base(798)..(798)a, c, t, g, unknown or
othermodified_base(801)..(801)a, c, t, g, unknown or
othermodified_base(810)..(810)a, c, t, g, unknown or
othermodified_base(813)..(813)a, c, t, g, unknown or
othermodified_base(816)..(816)a, c, t, g, unknown or
othermodified_base(822)..(822)a, c, t, g, unknown or
othermodified_base(828)..(828)a, c, t, g, unknown or
othermodified_base(834)..(834)a, c, t, g, unknown or
othermodified_base(837)..(837)a, c, t, g, unknown or
othermodified_base(843)..(843)a, c, t, g, unknown or
othermodified_base(849)..(849)a, c, t, g, unknown or
othermodified_base(852)..(852)a, c, t, g, unknown or
othermodified_base(855)..(855)a, c, t, g, unknown or
othermodified_base(861)..(861)a, c, t, g, unknown or
othermodified_base(864)..(864)a, c, t, g, unknown or
othermodified_base(867)..(867)a, c, t, g, unknown or
othermodified_base(870)..(870)a, c, t, g, unknown or
othermodified_base(876)..(876)a, c, t, g, unknown or
othermodified_base(882)..(882)a, c, t, g, unknown or
othermodified_base(885)..(885)a, c, t, g, unknown or
othermodified_base(888)..(888)a, c, t, g, unknown or
othermodified_base(891)..(891)a, c, t, g, unknown or
othermodified_base(894)..(894)a, c, t, g, unknown or
othermodified_base(897)..(897)a, c, t, g, unknown or
othermodified_base(900)..(900)a, c, t, g, unknown or
othermodified_base(903)..(903)a, c, t, g, unknown or
othermodified_base(915)..(915)a, c, t, g, unknown or
othermodified_base(918)..(918)a, c, t, g, unknown or
othermodified_base(921)..(921)a, c, t, g, unknown or
othermodified_base(924)..(924)a, c, t, g, unknown or
othermodified_base(933)..(933)a, c, t, g, unknown or
othermodified_base(939)..(939)a, c, t, g, unknown or
othermodified_base(942)..(942)a, c, t, g, unknown or
othermodified_base(948)..(948)a, c, t, g, unknown or
othermodified_base(951)..(951)a, c, t, g, unknown or
othermodified_base(954)..(954)a, c, t, g, unknown or
othermodified_base(963)..(963)a, c, t, g, unknown or
othermodified_base(972)..(972)a, c, t, g, unknown or
othermodified_base(975)..(975)a, c, t, g, unknown or
othermodified_base(987)..(987)a, c, t, g, unknown or
othermodified_base(990)..(990)a, c, t, g, unknown or
othermodified_base(993)..(993)a, c, t, g, unknown or
othermodified_base(999)..(999)a, c, t, g, unknown or
othermodified_base(1005)..(1005)a, c, t, g, unknown or
othermodified_base(1014)..(1014)a, c, t, g, unknown or
othermodified_base(1020)..(1020)a, c, t, g, unknown or
othermodified_base(1023)..(1023)a, c, t, g, unknown or
othermodified_base(1026)..(1026)a, c, t, g, unknown or
othermodified_base(1032)..(1032)a, c, t, g, unknown or
othermodified_base(1038)..(1038)a, c, t, g, unknown or
othermodified_base(1041)..(1041)a, c, t, g, unknown or
othermodified_base(1050)..(1050)a, c, t, g, unknown or
othermodified_base(1059)..(1059)a, c, t, g, unknown or
othermodified_base(1065)..(1065)a, c, t, g, unknown or
othermodified_base(1074)..(1074)a, c, t, g, unknown or
othermodified_base(1080)..(1080)a, c, t, g, unknown or
othermodified_base(1083)..(1083)a, c, t, g, unknown or
othermodified_base(1089)..(1089)a, c, t, g, unknown or
othermodified_base(1092)..(1092)a, c, t, g, unknown or
othermodified_base(1095)..(1095)a, c, t, g, unknown or
othermodified_base(1098)..(1098)a, c, t, g, unknown or
othermodified_base(1101)..(1101)a, c, t, g, unknown or
othermodified_base(1104)..(1104)a, c, t, g, unknown or
othermodified_base(1107)..(1107)a, c, t, g, unknown or
othermodified_base(1110)..(1110)a, c, t, g, unknown or
othermodified_base(1113)..(1113)a, c, t, g, unknown or
othermodified_base(1116)..(1116)a, c, t, g, unknown or
othermodified_base(1122)..(1122)a, c, t, g, unknown or
othermodified_base(1125)..(1125)a, c, t, g, unknown or
othermodified_base(1131)..(1131)a, c, t, g, unknown or
othermodified_base(1134)..(1134)a, c, t, g, unknown or
othermodified_base(1137)..(1137)a, c, t, g, unknown or
othermodified_base(1143)..(1143)a, c, t, g, unknown or
othermodified_base(1146)..(1146)a, c, t, g, unknown or
othermodified_base(1149)..(1149)a, c, t, g, unknown or
othermodified_base(1152)..(1152)a, c, t, g, unknown or
othermodified_base(1158)..(1158)a, c, t, g, unknown or
othermodified_base(1161)..(1161)a, c, t, g, unknown or
othermodified_base(1164)..(1164)a, c, t, g, unknown or
othermodified_base(1167)..(1167)a, c, t, g, unknown or
othermodified_base(1182)..(1182)a, c, t, g, unknown or
othermodified_base(1188)..(1188)a, c, t, g, unknown or
othermodified_base(1191)..(1191)a, c, t, g, unknown or
othermodified_base(1194)..(1194)a, c, t, g, unknown or
othermodified_base(1197)..(1197)a, c, t, g, unknown or
othermodified_base(1200)..(1200)a, c, t, g, unknown or
othermodified_base(1206)..(1206)a, c, t, g, unknown or
othermodified_base(1212)..(1212)a, c, t, g, unknown or
othermodified_base(1218)..(1218)a, c, t, g, unknown or
othermodified_base(1224)..(1224)a, c, t, g, unknown or
othermodified_base(1227)..(1227)a, c, t, g, unknown or
othermodified_base(1230)..(1230)a, c, t, g, unknown or
othermodified_base(1233)..(1233)a, c, t, g, unknown or
othermodified_base(1236)..(1236)a, c, t, g, unknown or
othermodified_base(1239)..(1239)a, c, t, g, unknown or
othermodified_base(1245)..(1245)a, c, t, g, unknown or
othermodified_base(1248)..(1248)a, c, t, g, unknown or
othermodified_base(1251)..(1251)a, c, t, g, unknown or
othermodified_base(1254)..(1254)a, c, t, g, unknown or
othermodified_base(1257)..(1257)a, c, t, g, unknown or
othermodified_base(1263)..(1263)a, c, t, g, unknown or
othermodified_base(1266)..(1266)a, c, t, g, unknown or
othermodified_base(1272)..(1272)a, c, t, g, unknown or
othermodified_base(1275)..(1275)a, c, t, g, unknown or
othermodified_base(1278)..(1278)a, c, t, g, unknown or
othermodified_base(1281)..(1281)a, c, t, g, unknown or
othermodified_base(1284)..(1284)a, c, t, g, unknown or
othermodified_base(1290)..(1290)a, c, t, g, unknown or
othermodified_base(1293)..(1293)a, c, t, g, unknown or
othermodified_base(1296)..(1296)a, c, t, g, unknown or
othermodified_base(1302)..(1302)a, c, t, g, unknown or
othermodified_base(1314)..(1314)a, c, t, g, unknown or
othermodified_base(1317)..(1317)a, c, t, g, unknown or
othermodified_base(1323)..(1323)a, c, t, g, unknown or
othermodified_base(1326)..(1326)a, c, t, g, unknown or
othermodified_base(1329)..(1329)a, c, t, g, unknown or
othermodified_base(1332)..(1332)a, c, t, g, unknown or
othermodified_base(1341)..(1341)a, c, t, g, unknown or
othermodified_base(1344)..(1344)a, c, t, g, unknown or
othermodified_base(1347)..(1347)a, c, t, g, unknown or
othermodified_base(1350)..(1350)a, c, t, g, unknown or
othermodified_base(1353)..(1353)a, c, t, g, unknown or
othermodified_base(1359)..(1359)a, c, t, g, unknown or
othermodified_base(1365)..(1365)a, c, t, g, unknown or
othermodified_base(1368)..(1368)a, c, t, g, unknown or
othermodified_base(1377)..(1377)a, c, t, g, unknown or
othermodified_base(1383)..(1383)a, c, t, g, unknown or
othermodified_base(1386)..(1386)a, c, t, g, unknown or
othermodified_base(1389)..(1389)a, c, t, g, unknown or
othermodified_base(1398)..(1398)a, c, t, g, unknown or
othermodified_base(1401)..(1401)a, c, t, g, unknown or
othermodified_base(1404)..(1404)a, c, t, g, unknown or
othermodified_base(1407)..(1407)a, c, t, g, unknown or
othermodified_base(1410)..(1410)a, c, t, g, unknown or
othermodified_base(1413)..(1413)a, c, t, g, unknown or
othermodified_base(1416)..(1416)a, c, t, g, unknown or
othermodified_base(1419)..(1419)a, c, t, g, unknown or
othermodified_base(1431)..(1431)a, c, t, g, unknown or
othermodified_base(1440)..(1440)a, c, t, g, unknown or
othermodified_base(1443)..(1443)a, c, t, g, unknown or
othermodified_base(1446)..(1446)a, c, t, g, unknown or
othermodified_base(1449)..(1449)a, c, t, g, unknown or
othermodified_base(1452)..(1452)a, c, t, g, unknown or
othermodified_base(1455)..(1455)a, c, t, g, unknown or
othermodified_base(1461)..(1461)a, c, t, g, unknown or
othermodified_base(1467)..(1467)a, c, t, g, unknown or
othermodified_base(1470)..(1470)a, c, t, g, unknown or
othermodified_base(1479)..(1479)a, c, t, g, unknown or
othermodified_base(1485)..(1485)a, c, t, g, unknown or
othermodified_base(1488)..(1488)a, c, t, g, unknown or
othermodified_base(1491)..(1491)a, c, t, g, unknown or
othermodified_base(1494)..(1494)a, c, t, g, unknown or
othermodified_base(1497)..(1497)a, c, t, g, unknown or
othermodified_base(1500)..(1500)a, c, t, g, unknown or
othermodified_base(1506)..(1506)a, c, t, g, unknown or
othermodified_base(1524)..(1524)a, c, t, g, unknown or
othermodified_base(1527)..(1527)a, c, t, g, unknown or
othermodified_base(1530)..(1530)a, c, t, g, unknown or
othermodified_base(1533)..(1533)a, c, t, g, unknown or
othermodified_base(1536)..(1536)a, c, t, g, unknown or
othermodified_base(1539)..(1539)a, c, t, g, unknown or other
15
atgcargtnc arytncarca rccnggngcn garytngtna arccnggngc nwsngtnaar 60
atgwsntgya argcnwsngg ntayacntty acnwsntaya ayatgcaytg ggtnaarcar 120
acnccnggnm gnggnytnga rtggathggn gcnathtayc cnggnaaygg ngayacnwsn 180
tayaaycara arttyaargg naargcnacn ytnacngcng ayaarwsnws nwsnacngcn 240
tayatgcary tnwsnwsnyt nacnwsngar gaywsngcng tntaytaytg ygcnmgnwsn 300
acntaytayg gnggngaytg gtayttyaay gtntggggng cnggnacnac ngtnacngtn 360
wsngcnggnw snacnwsngg nwsnggnaar ccnggnwsng gngarggnws nacnaarggn 420
carathgtny tnwsncarws nccngcnath ytnwsngcnw snccnggnga raargtnacn 480
atgacntgym gngcnwsnws nwsngtnwsn tayathcayt ggttycarca raarccnggn 540
wsnwsnccna arccntggat htaygcnacn wsnaayytng cnwsnggngt nccngtnmgn 600
ttywsnggnw snggnwsngg nacnwsntay wsnytnacna thwsnmgngt ngargcngar 660
gaygcngcna cntaytaytg ycarcartgg acnwsnaayc cnccnacntt yggnggnggn 720
acnaarytng arathaarga rttyccnaar ccnwsnacnc cnccnggnws nwsnggnggn 780
gcnccnaarg arttyacnyt ngayttywsn acngcnaara cntaygtnga ywsnytnaay 840
gtnathmgnw sngcnathgg nacnccnytn caracnathw snwsnggngg nacnwsnytn 900
ytnatgathg aywsnggnws nggngayaay ytnttygcng tngaygtnmg nggnathgay 960
ccngargarg gnmgnttyaa yaayytnmgn ytnathgtng armgnaayaa yytntaygtn 1020
acnggnttyg tnaaymgnac naayaaygtn ttytaymgnt tygcngaytt ywsncaygtn 1080
acnttyccng gnacnacngc ngtnacnytn wsnggngayw snwsntayac nacnytncar 1140
mgngtngcng gnathwsnmg nacnggnatg carathaaym gncaywsnyt nacnacnwsn 1200
tayytngayy tnatgwsnca ywsnggnacn wsnytnacnc arwsngtngc nmgngcnatg 1260
ytnmgnttyg tnacngtnac ngcngargcn ytnmgnttym gncarathca rmgnggntty 1320
mgnacnacny tngaygayyt nwsnggnmgn wsntaygtna tgacngcnga rgaygtngay 1380
ytnacnytna aytggggnmg nytnwsnwsn gtnytnccng aytaycaygg ncargaywsn 1440
gtnmgngtng gnmgnathws nttyggnwsn athaaygcna thytnggnws ngtngcnytn 1500
athytnaayt gycaycayca ygcnwsnmgn gtngcnmgn 1539
16 521 PRT Artificial Sequence source /note="Description of Artificial
Sequence Synthetic polypeptide"
16
Met Gln Val Gln Leu Gln Gln Pro Gly Ala Glu Leu Val Lys Pro Gly 16
Ala Ser Val Lys Met Ser Cys Lys Thr Ser Gly Tyr Thr Phe Thr Ser 32
Tyr Asn Val His Trp Val Lys Gln Thr Pro Gly Gln Gly Leu Glu Trp 48
Ile Gly Ala Ile Tyr Pro Gly Asn Gly Asp Thr Ser Phe Asn Gln Lys 64
Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser Thr Val 80
Tyr Met Gln Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr 96
Cys Ala Arg Ser Asn Tyr Tyr Gly Ser Ser Tyr Val Trp Phe Phe Asp 112
Val Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ser Gly Ser Thr Ser 128
Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser Gln Ile Val Leu Ser 144
Gln Ser Pro Thr Ile Leu Ser Ala Ser Pro Gly Glu Lys Val Thr Met 160
Thr Cys Arg Ala Ser Ser Ser Val Ser Tyr Met Asp Trp Tyr Gln Gln 176
Lys Pro Gly Ser Ser Pro Lys Pro Trp Ile Tyr Ala Thr Ser Asn Leu 192
Ala Ser Gly Val Pro Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser 208
Tyr Ser Leu Thr Ile Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr 224
Tyr Cys Gln Gln Trp Ile Ser Asn Pro Pro Thr Phe Gly Ala Gly Thr 240
Lys Leu Glu Leu Lys Glu Phe Pro Lys Pro Ser Thr Pro Pro Gly Ser 256
Ser Gly Gly Ala Pro Gly Ile Leu Gly Phe Val Phe Thr Leu Lys Glu 272
Phe Thr Leu Asp Phe Ser Thr Ala Lys Thr Tyr Val Asp Ser Leu Asn 288
Val Ile Arg Ser Ala Ile Gly Thr Pro Leu Gln Thr Ile Ser Ser Gly 304
Gly Thr Ser Leu Leu Met Ile Asp Ser Gly Ser Gly Asp Asn Leu Phe 320
Ala Val Asp Val Arg Gly Ile Asp Pro Glu Glu Gly Arg Phe Asn Asn 336
Leu Arg Leu Ile Val Glu Arg Asn Asn Leu Tyr Val Thr Gly Phe Val 352
Asn Arg Thr Asn Asn Val Phe Tyr Arg Phe Ala Asp Phe Ser His Val 368
Thr Phe Pro Gly Thr Thr Ala Val Thr Leu Ser Gly Asp Ser Ser Tyr 384
Thr Thr Leu Gln Arg Val Ala Gly Ile Ser Arg Thr Gly Met Gln Ile 400
Asn Arg His Ser Leu Thr Thr Ser Tyr Leu Asp Leu Met Ser His Ser 416
Gly Thr Ser Leu Thr Gln Ser Val Ala Arg Ala Met Leu Arg Phe Val 432
Thr Val Thr Ala Glu Ala Leu Arg Phe Arg Gln Ile Gln Arg Gly Phe 448
Arg Thr Thr Leu Asp Asp Leu Ser Gly Arg Ser Tyr Val Met Thr Ala 464
Glu Asp Val Asp Leu Thr Leu Asn Trp Gly Arg Leu Ser Ser Val Leu 480
Pro Asp Tyr His Gly Gln Asp Ser Val Arg Val Gly Arg Ile Ser Phe 496
Gly Ser Ile Asn Ala Ile Leu Gly Ser Val Ala Leu Ile Leu Asn Cys 512
His His His Ala Ser Arg Val Ala Arg 521
17 1563 DNA Artificial Sequence source /note="Description of Artificial Sequence Synthetic polynucleotide"
modified_base(9)..(9)a, c, t, g, unknown or
othermodified_base(15)..(15)a, c, t, g, unknown or
othermodified_base(24)..(24)a, c, t, g, unknown or
othermodified_base(27)..(27)a, c, t, g, unknown or
othermodified_base(30)..(30)a, c, t, g, unknown or
othermodified_base(36)..(36)a, c, t, g, unknown or
othermodified_base(39)..(39)a, c, t, g, unknown or
othermodified_base(45)..(45)a, c, t, g, unknown or
othermodified_base(48)..(48)a, c, t, g, unknown or
othermodified_base(51)..(51)a, c, t, g, unknown or
othermodified_base(54)..(54)a, c, t, g, unknown or
othermodified_base(57)..(57)a, c, t, g, unknown or
othermodified_base(66)..(66)a, c, t, g, unknown or
othermodified_base(75)..(75)a, c, t, g, unknown or
othermodified_base(78)..(78)a, c, t, g, unknown or
othermodified_base(81)..(81)a, c, t, g, unknown or
othermodified_base(87)..(87)a, c, t, g, unknown or
othermodified_base(93)..(93)a, c, t, g, unknown or
othermodified_base(96)..(96)a, c, t, g, unknown or
othermodified_base(105)..(105)a, c, t, g, unknown or
othermodified_base(114)..(114)a, c, t, g, unknown or
othermodified_base(123)..(123)a, c, t, g, unknown or
othermodified_base(126)..(126)a, c, t, g, unknown or
othermodified_base(129)..(129)a, c, t, g, unknown or
othermodified_base(135)..(135)a, c, t, g, unknown or
othermodified_base(138)..(138)a, c, t, g, unknown or
othermodified_base(150)..(150)a, c, t, g, unknown or
othermodified_base(153)..(153)a, c, t, g, unknown or
othermodified_base(162)..(162)a, c, t, g, unknown or
othermodified_base(165)..(165)a, c, t, g, unknown or
othermodified_base(171)..(171)a, c, t, g, unknown or
othermodified_base(177)..(177)a, c, t, g, unknown or
othermodified_base(180)..(180)a, c, t, g, unknown or
othermodified_base(201)..(201)a, c, t, g, unknown or
othermodified_base(207)..(207)a, c, t, g, unknown or
othermodified_base(210)..(210)a, c, t, g, unknown or
othermodified_base(213)..(213)a, c, t, g, unknown or
othermodified_base(216)..(216)a, c, t, g, unknown or
othermodified_base(219)..(219)a, c, t, g, unknown or
othermodified_base(228)..(228)a, c, t, g, unknown or
othermodified_base(231)..(231)a, c, t, g, unknown or
othermodified_base(234)..(234)a, c, t, g, unknown or
othermodified_base(237)..(237)a, c, t, g, unknown or
othermodified_base(240)..(240)a, c, t, g, unknown or
othermodified_base(252)..(252)a, c, t, g, unknown or
othermodified_base(255)..(255)a, c, t, g, unknown or
othermodified_base(258)..(258)a, c, t, g, unknown or
othermodified_base(261)..(261)a, c, t, g, unknown or
othermodified_base(264)..(264)a, c, t, g, unknown or
othermodified_base(267)..(267)a, c, t, g, unknown or
othermodified_base(276)..(276)a, c, t, g, unknown or
othermodified_base(279)..(279)a, c, t, g, unknown or
othermodified_base(282)..(282)a, c, t, g, unknown or
othermodified_base(294)..(294)a, c, t, g, unknown or
othermodified_base(297)..(297)a, c, t, g, unknown or
othermodified_base(300)..(300)a, c, t, g, unknown or
othermodified_base(312)..(312)a, c, t, g, unknown or
othermodified_base(315)..(315)a, c, t, g, unknown or
othermodified_base(318)..(318)a, c, t, g, unknown or
othermodified_base(324)..(324)a, c, t, g, unknown or
othermodified_base(339)..(339)a, c, t, g, unknown or
othermodified_base(345)..(345)a, c, t, g, unknown or
othermodified_base(348)..(348)a, c, t, g, unknown or
othermodified_base(351)..(351)a, c, t, g, unknown or
othermodified_base(354)..(354)a, c, t, g, unknown or
othermodified_base(357)..(357)a, c, t, g, unknown or
othermodified_base(360)..(360)a, c, t, g, unknown or
othermodified_base(363)..(363)a, c, t, g, unknown or
othermodified_base(366)..(366)a, c, t, g, unknown or
othermodified_base(369)..(369)a, c, t, g, unknown or
othermodified_base(372)..(372)a, c, t, g, unknown or
othermodified_base(375)..(375)a, c, t, g, unknown or
othermodified_base(378)..(378)a, c, t, g, unknown or
othermodified_base(381)..(381)a, c, t, g, unknown or
othermodified_base(384)..(384)a, c, t, g, unknown or
othermodified_base(387)..(387)a, c, t, g, unknown or
othermodified_base(390)..(390)a, c, t, g, unknown or
othermodified_base(393)..(393)a, c, t, g, unknown or
othermodified_base(399)..(399)a, c, t, g, unknown or
othermodified_base(402)..(402)a, c, t, g, unknown or
othermodified_base(405)..(405)a, c, t, g, unknown or
othermodified_base(408)..(408)a, c, t, g, unknown or
othermodified_base(414)..(414)a, c, t, g, unknown or
othermodified_base(417)..(417)a, c, t, g, unknown or
othermodified_base(426)..(426)a, c, t, g, unknown or
othermodified_base(429)..(429)a, c, t, g, unknown or
othermodified_base(432)..(432)a, c, t, g, unknown or
othermodified_base(438)..(438)a, c, t, g, unknown or
othermodified_base(441)..(441)a, c, t, g, unknown or
othermodified_base(444)..(444)a, c, t, g, unknown or
othermodified_base(450)..(450)a, c, t, g, unknown or
othermodified_base(453)..(453)a, c, t, g, unknown or
othermodified_base(456)..(456)a, c, t, g, unknown or
othermodified_base(459)..(459)a, c, t, g, unknown or
othermodified_base(462)..(462)a, c, t, g, unknown or
othermodified_base(465)..(465)a, c, t, g, unknown or
othermodified_base(474)..(474)a, c, t, g, unknown or
othermodified_base(477)..(477)a, c, t, g, unknown or
othermodified_base(483)..(483)a, c, t, g, unknown or
othermodified_base(489)..(489)a, c, t, g, unknown or
othermodified_base(492)..(492)a, c, t, g, unknown or
othermodified_base(495)..(495)a, c, t, g, unknown or
othermodified_base(498)..(498)a, c, t, g, unknown or
othermodified_base(501)..(501)a, c, t, g, unknown or
othermodified_base(504)..(504)a, c, t, g, unknown or
othermodified_base(507)..(507)a, c, t, g, unknown or
othermodified_base(534)..(534)a, c, t, g, unknown or
othermodified_base(537)..(537)a, c, t, g, unknown or
othermodified_base(540)..(540)a, c, t, g, unknown or
othermodified_base(543)..(543)a, c, t, g, unknown or
othermodified_base(546)..(546)a, c, t, g, unknown or
othermodified_base(552)..(552)a, c, t, g, unknown or
othermodified_base(564)..(564)a, c, t, g, unknown or
othermodified_base(567)..(567)a, c, t, g, unknown or
othermodified_base(570)..(570)a, c, t, g, unknown or
othermodified_base(576)..(576)a, c, t, g, unknown or
othermodified_base(579)..(579)a, c, t, g, unknown or
othermodified_base(582)..(582)a, c, t, g, unknown or
othermodified_base(585)..(585)a, c, t, g, unknown or
othermodified_base(588)..(588)a, c, t, g, unknown or
othermodified_base(591)..(591)a, c, t, g, unknown or
othermodified_base(594)..(594)a, c, t, g, unknown or
othermodified_base(597)..(597)a, c, t, g, unknown or
othermodified_base(603)..(603)a, c, t, g, unknown or
othermodified_base(606)..(606)a, c, t, g, unknown or
othermodified_base(609)..(609)a, c, t, g, unknown or
othermodified_base(612)..(612)a, c, t, g, unknown or
othermodified_base(615)..(615)a, c, t, g, unknown or
othermodified_base(618)..(618)a, c, t, g, unknown or
othermodified_base(621)..(621)a, c, t, g, unknown or
othermodified_base(624)..(624)a, c, t, g, unknown or
othermodified_base(630)..(630)a, c, t, g, unknown or
othermodified_base(633)..(633)a, c, t, g, unknown or
othermodified_base(636)..(636)a, c, t, g, unknown or
othermodified_base(642)..(642)a, c, t, g, unknown or
othermodified_base(645)..(645)a, c, t, g, unknown or
othermodified_base(648)..(648)a, c, t, g, unknown or
othermodified_base(654)..(654)a, c, t, g, unknown or
othermodified_base(663)..(663)a, c, t, g, unknown or
othermodified_base(666)..(666)a, c, t, g, unknown or
othermodified_base(669)..(669)a, c, t, g, unknown or
othermodified_base(693)..(693)a, c, t, g, unknown or
othermodified_base(699)..(699)a, c, t, g, unknown or
othermodified_base(702)..(702)a, c, t, g, unknown or
othermodified_base(705)..(705)a, c, t, g, unknown or
othermodified_base(711)..(711)a, c, t, g, unknown or
othermodified_base(714)..(714)a, c, t, g, unknown or
othermodified_base(717)..(717)a, c, t, g, unknown or
othermodified_base(720)..(720)a, c, t, g, unknown or
othermodified_base(726)..(726)a, c, t, g, unknown or
othermodified_base(732)..(732)a, c, t, g, unknown or
othermodified_base(744)..(744)a, c, t, g, unknown or
othermodified_base(750)..(750)a, c, t, g, unknown or
othermodified_base(753)..(753)a, c, t, g, unknown or
othermodified_base(756)..(756)a, c, t, g, unknown or
othermodified_base(759)..(759)a, c, t, g, unknown or
othermodified_base(762)..(762)a, c, t, g, unknown or
othermodified_base(765)..(765)a, c, t, g, unknown or
othermodified_base(768)..(768)a, c, t, g, unknown or
othermodified_base(771)..(771)a, c, t, g, unknown or
othermodified_base(774)..(774)a, c, t, g, unknown or
othermodified_base(777)..(777)a, c, t, g, unknown or
othermodified_base(780)..(780)a, c, t, g, unknown or
othermodified_base(783)..(783)a, c, t, g, unknown or
othermodified_base(786)..(786)a, c, t, g, unknown or
othermodified_base(792)..(792)a, c, t, g, unknown or
othermodified_base(795)..(795)a, c, t, g, unknown or
othermodified_base(801)..(801)a, c, t, g, unknown or
othermodified_base(807)..(807)a, c, t, g, unknown or
othermodified_base(810)..(810)a, c, t, g, unknown or
othermodified_base(822)..(822)a, c, t, g, unknown or
othermodified_base(825)..(825)a, c, t, g, unknown or
othermodified_base(834)..(834)a, c, t, g, unknown or
othermodified_base(837)..(837)a, c, t, g, unknown or
othermodified_base(840)..(840)a, c, t, g, unknown or
othermodified_base(846)..(846)a, c, t, g, unknown or
othermodified_base(852)..(852)a, c, t, g, unknown or
othermodified_base(858)..(858)a, c, t, g, unknown or
othermodified_base(861)..(861)a, c, t, g, unknown or
othermodified_base(867)..(867)a, c, t, g, unknown or
othermodified_base(873)..(873)a, c, t, g, unknown or
othermodified_base(876)..(876)a, c, t, g, unknown or
othermodified_base(879)..(879)a, c, t, g, unknown or
othermodified_base(885)..(885)a, c, t, g, unknown or
othermodified_base(888)..(888)a, c, t, g, unknown or
othermodified_base(891)..(891)a, c, t, g, unknown or
othermodified_base(894)..(894)a, c, t, g, unknown or
othermodified_base(900)..(900)a, c, t, g, unknown or
othermodified_base(906)..(906)a, c, t, g, unknown or
othermodified_base(909)..(909)a, c, t, g, unknown or
othermodified_base(912)..(912)a, c, t, g, unknown or
othermodified_base(915)..(915)a, c, t, g, unknown or
othermodified_base(918)..(918)a, c, t, g, unknown or
othermodified_base(921)..(921)a, c, t, g, unknown or
othermodified_base(924)..(924)a, c, t, g, unknown or
othermodified_base(927)..(927)a, c, t, g, unknown or
othermodified_base(939)..(939)a, c, t, g, unknown or
othermodified_base(942)..(942)a, c, t, g, unknown or
othermodified_base(945)..(945)a, c, t, g, unknown or
othermodified_base(948)..(948)a, c, t, g, unknown or
othermodified_base(957)..(957)a, c, t, g, unknown or
othermodified_base(963)..(963)a, c, t, g, unknown or
othermodified_base(966)..(966)a, c, t, g, unknown or
othermodified_base(972)..(972)a, c, t, g, unknown or
othermodified_base(975)..(975)a, c, t, g, unknown or
othermodified_base(978)..(978)a, c, t, g, unknown or
othermodified_base(987)..(987)a, c, t, g, unknown or
othermodified_base(996)..(996)a, c, t, g, unknown or
othermodified_base(999)..(999)a, c, t, g, unknown or
othermodified_base(1011)..(1011)a, c, t, g, unknown or
othermodified_base(1014)..(1014)a, c, t, g, unknown or
othermodified_base(1017)..(1017)a, c, t, g, unknown or
othermodified_base(1023)..(1023)a, c, t, g, unknown or
other
modified_base (1029)..(1029) a, c, t, g, unknown or other
modified_base (1038)..(1038) a, c, t, g, unknown or other
modified_base (1044)..(1044) a, c, t, g, unknown or other
modified_base (1047)..(1047) a, c, t, g, unknown or other
modified_base (1050)..(1050) a, c, t, g, unknown or other
modified_base (1056)..(1056) a, c, t, g, unknown or other
modified_base (1062)..(1062) a, c, t, g, unknown or other
modified_base (1065)..(1065) a, c, t, g, unknown or other
modified_base (1074)..(1074) a, c, t, g, unknown or other
modified_base (1083)..(1083) a, c, t, g, unknown or other
modified_base (1089)..(1089) a, c, t, g, unknown or other
modified_base (1098)..(1098) a, c, t, g, unknown or other
modified_base (1104)..(1104) a, c, t, g, unknown or other
modified_base (1107)..(1107) a, c, t, g, unknown or other
modified_base (1113)..(1113) a, c, t, g, unknown or other
modified_base (1116)..(1116) a, c, t, g, unknown or other
modified_base (1119)..(1119) a, c, t, g, unknown or other
modified_base (1122)..(1122) a, c, t, g, unknown or other
modified_base (1125)..(1125) a, c, t, g, unknown or other
modified_base (1128)..(1128) a, c, t, g, unknown or other
modified_base (1131)..(1131) a, c, t, g, unknown or other
modified_base (1134)..(1134) a, c, t, g, unknown or other
modified_base (1137)..(1137) a, c, t, g, unknown or other
modified_base (1140)..(1140) a, c, t, g, unknown or other
modified_base (1146)..(1146) a, c, t, g, unknown or other
modified_base (1149)..(1149) a, c, t, g, unknown or other
modified_base (1155)..(1155) a, c, t, g, unknown or other
modified_base (1158)..(1158) a, c, t, g, unknown or other
modified_base (1161)..(1161) a, c, t, g, unknown or other
modified_base (1167)..(1167) a, c, t, g, unknown or other
modified_base (1170)..(1170) a, c, t, g, unknown or other
modified_base (1173)..(1173) a, c, t, g, unknown or other
modified_base (1176)..(1176) a, c, t, g, unknown or other
modified_base (1182)..(1182) a, c, t, g, unknown or other
modified_base (1185)..(1185) a, c, t, g, unknown or other
modified_base (1188)..(1188) a, c, t, g, unknown or other
modified_base (1191)..(1191) a, c, t, g, unknown or other
modified_base (1206)..(1206) a, c, t, g, unknown or other
modified_base (1212)..(1212) a, c, t, g, unknown or other
modified_base (1215)..(1215) a, c, t, g, unknown or other
modified_base (1218)..(1218) a, c, t, g, unknown or other
modified_base (1221)..(1221) a, c, t, g, unknown or other
modified_base (1224)..(1224) a, c, t, g, unknown or other
modified_base (1230)..(1230) a, c, t, g, unknown or other
modified_base (1236)..(1236) a, c, t, g, unknown or other
modified_base (1242)..(1242) a, c, t, g, unknown or other
modified_base (1248)..(1248) a, c, t, g, unknown or other
modified_base (1251)..(1251) a, c, t, g, unknown or other
modified_base (1254)..(1254) a, c, t, g, unknown or other
modified_base (1257)..(1257) a, c, t, g, unknown or other
modified_base (1260)..(1260) a, c, t, g, unknown or other
modified_base (1263)..(1263) a, c, t, g, unknown or other
modified_base (1269)..(1269) a, c, t, g, unknown or other
modified_base (1272)..(1272) a, c, t, g, unknown or other
modified_base (1275)..(1275) a, c, t, g, unknown or other
modified_base (1278)..(1278) a, c, t, g, unknown or other
modified_base (1281)..(1281) a, c, t, g, unknown or other
modified_base (1287)..(1287) a, c, t, g, unknown or other
modified_base (1290)..(1290) a, c, t, g, unknown or other
modified_base (1296)..(1296) a, c, t, g, unknown or other
modified_base (1299)..(1299) a, c, t, g, unknown or other
modified_base (1302)..(1302) a, c, t, g, unknown or other
modified_base (1305)..(1305) a, c, t, g, unknown or other
modified_base (1308)..(1308) a, c, t, g, unknown or other
modified_base (1314)..(1314) a, c, t, g, unknown or other
modified_base (1317)..(1317) a, c, t, g, unknown or other
modified_base (1320)..(1320) a, c, t, g, unknown or other
modified_base (1326)..(1326) a, c, t, g, unknown or other
modified_base (1338)..(1338) a, c, t, g, unknown or other
modified_base (1341)..(1341) a, c, t, g, unknown or other
modified_base (1347)..(1347) a, c, t, g, unknown or other
modified_base (1350)..(1350) a, c, t, g, unknown or other
modified_base (1353)..(1353) a, c, t, g, unknown or other
modified_base (1356)..(1356) a, c, t, g, unknown or other
modified_base (1365)..(1365) a, c, t, g, unknown or other
modified_base (1368)..(1368) a, c, t, g, unknown or other
modified_base (1371)..(1371) a, c, t, g, unknown or other
modified_base (1374)..(1374) a, c, t, g, unknown or other
modified_base (1377)..(1377) a, c, t, g, unknown or other
modified_base (1383)..(1383) a, c, t, g, unknown or other
modified_base (1389)..(1389) a, c, t, g, unknown or other
modified_base (1392)..(1392) a, c, t, g, unknown or other
modified_base (1401)..(1401) a, c, t, g, unknown or other
modified_base (1407)..(1407) a, c, t, g, unknown or other
modified_base (1410)..(1410) a, c, t, g, unknown or other
modified_base (1413)..(1413) a, c, t, g, unknown or other
modified_base (1422)..(1422) a, c, t, g, unknown or other
modified_base (1425)..(1425) a, c, t, g, unknown or other
modified_base (1428)..(1428) a, c, t, g, unknown or other
modified_base (1431)..(1431) a, c, t, g, unknown or other
modified_base (1434)..(1434) a, c, t, g, unknown or other
modified_base (1437)..(1437) a, c, t, g, unknown or other
modified_base (1440)..(1440) a, c, t, g, unknown or other
modified_base (1443)..(1443) a, c, t, g, unknown or other
modified_base (1455)..(1455) a, c, t, g, unknown or other
modified_base (1464)..(1464) a, c, t, g, unknown or other
modified_base (1467)..(1467) a, c, t, g, unknown or other
modified_base (1470)..(1470) a, c, t, g, unknown or other
modified_base (1473)..(1473) a, c, t, g, unknown or other
modified_base (1476)..(1476) a, c, t, g, unknown or other
modified_base (1479)..(1479) a, c, t, g, unknown or other
modified_base (1485)..(1485) a, c, t, g, unknown or other
modified_base (1491)..(1491) a, c, t, g, unknown or other
modified_base (1494)..(1494) a, c, t, g, unknown or other
modified_base (1503)..(1503) a, c, t, g, unknown or other
modified_base (1509)..(1509) a, c, t, g, unknown or other
modified_base (1512)..(1512) a, c, t, g, unknown or other
modified_base (1515)..(1515) a, c, t, g, unknown or other
modified_base (1518)..(1518) a, c, t, g, unknown or other
modified_base (1521)..(1521) a, c, t, g, unknown or other
modified_base (1524)..(1524) a, c, t, g, unknown or other
modified_base (1530)..(1530) a, c, t, g, unknown or other
modified_base (1548)..(1548) a, c, t, g, unknown or other
modified_base (1551)..(1551) a, c, t, g, unknown or other
modified_base (1554)..(1554) a, c, t, g, unknown or other
modified_base (1557)..(1557) a, c, t, g, unknown or other
modified_base (1560)..(1560) a, c, t, g, unknown or
other
modified_base (1563)..(1563) a, c, t, g, unknown or other
17
atgcargtnc arytncarca rccnggngcn garytngtna arccnggngc nwsngtnaar 60
atgwsntgya aracnwsngg ntayacntty acnwsntaya aygtncaytg ggtnaarcar 120
acnccnggnc arggnytnga rtggathggn gcnathtayc cnggnaaygg ngayacnwsn 180
ttyaaycara arttyaargg naargcnacn ytnacngcng ayaarwsnws nwsnacngtn 240
tayatgcary tnwsnwsnyt nacnwsngar gaywsngcng tntaytaytg ygcnmgnwsn 300
aaytaytayg gnwsnwsnta ygtntggtty ttygaygtnt ggggngcngg nacnacngtn 360
acngtnwsnw snggnwsnac nwsnggnwsn ggnaarccng gnwsnggnga rggnwsncar 420
athgtnytnw sncarwsncc nacnathytn wsngcnwsnc cnggngaraa rgtnacnatg 480
acntgymgng cnwsnwsnws ngtnwsntay atggaytggt aycarcaraa rccnggnwsn 540
wsnccnaarc cntggathta ygcnacnwsn aayytngcnw snggngtncc ngcnmgntty 600
wsnggnwsng gnwsnggnac nwsntaywsn ytnacnathw snmgngtnga rgcngargay 660
gcngcnacnt aytaytgyca rcartggath wsnaayccnc cnacnttygg ngcnggnacn 720
aarytngary tnaargartt yccnaarccn wsnacnccnc cnggnwsnws nggnggngcn 780
ccnggnathy tnggnttygt nttyacnytn aargarttya cnytngaytt ywsnacngcn 840
aaracntayg tngaywsnyt naaygtnath mgnwsngcna thggnacncc nytncaracn 900
athwsnwsng gnggnacnws nytnytnatg athgaywsng gnwsnggnga yaayytntty 960
gcngtngayg tnmgnggnat hgayccngar garggnmgnt tyaayaayyt nmgnytnath 1020
gtngarmgna ayaayytnta ygtnacnggn ttygtnaaym gnacnaayaa ygtnttytay 1080
mgnttygcng ayttywsnca ygtnacntty ccnggnacna cngcngtnac nytnwsnggn 1140
gaywsnwsnt ayacnacnyt ncarmgngtn gcnggnathw snmgnacngg natgcarath 1200
aaymgncayw snytnacnac nwsntayytn gayytnatgw sncaywsngg nacnwsnytn 1260
acncarwsng tngcnmgngc natgytnmgn ttygtnacng tnacngcnga rgcnytnmgn 1320
ttymgncara thcarmgngg nttymgnacn acnytngayg ayytnwsngg nmgnwsntay 1380
gtnatgacng cngargaygt ngayytnacn ytnaaytggg gnmgnytnws nwsngtnytn 1440
ccngaytayc ayggncarga ywsngtnmgn gtnggnmgna thwsnttygg nwsnathaay 1500
gcnathytng gnwsngtngc nytnathytn aaytgycayc aycaygcnws nmgngtngcn 1560
mgn
1563

18 15 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
18
Gly Ser Thr Ser Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser
1               5                   10                  15

19 8 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
19
Trp Ser His Pro Gln Phe Glu Lys
1               5

20 16 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
20
Glu Phe Pro Lys Pro Ser Thr Pro Pro Gly Ser Ser Gly Gly Ala Pro
1               5                   10                  15

21 10 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
21
Gly Tyr Thr Phe Thr Ser Tyr Asn Met His
1               5                   10

22 17 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
22
Ala Ile Tyr Pro Gly Asn Gly Asp Thr Ser Tyr Asn Gln Lys Phe Lys
1               5                   10                  15
Gly

23 12 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
23
Ala Gln Leu Arg Pro Asn Tyr Trp Tyr Phe Asp Val
1               5                   10

24 10 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
24
Arg Ala Ser Ser Ser Val Ser Tyr Met His
1               5                   10

25 293 PRT Artificial Sequence
source /note="Description of
Artificial Sequence Synthetic polypeptide"
25
Lys Glu Phe Thr Leu Asp Phe Ser Thr Ala Lys Thr Tyr Val Asp Ser
1               5                   10                  15
Leu Asn Val Ile Arg Ser Ala Ile Gly Thr Pro Leu Gln Thr Ile Ser
            20                  25                  30
Ser Gly Gly Thr Ser Leu Leu Met Ile Asp Ser Gly Thr Gly Asp Asn
        35                  40                  45
Leu Phe Ala Val Asp Val Arg Gly Ile Asp Pro Glu Glu Gly Arg Phe
    50                  55                  60
Asn Asn Leu Arg Leu Ile Val Glu Arg Asn Asn Leu Tyr Val Thr Gly
65                  70                  75                  80
Phe Val Asn Arg Thr Asn Asn Val Phe Tyr Arg Phe Ala Asp Phe Ser
                85                  90                  95
His Val Thr Phe Pro Gly Thr Thr Ala Val Thr Leu Ser Gly Asp Ser
            100                 105                 110
Ser Tyr Thr Thr Leu Gln Arg Val Ala Gly Ile Ser Arg Thr Gly Met
        115                 120                 125
Gln Ile Asn Arg His Ser Leu Thr Thr Ser Tyr Leu Asp Leu Met Ser
    130                 135                 140
His Ser Gly Thr Ser Leu Thr Gln Ser Val Ala Arg Ala Met Leu Arg
145                 150                 155                 160
Phe Val Thr Val Thr Ala Glu Ala Leu Arg Phe Arg Gln Ile Gln Arg
                165                 170                 175
Gly Phe Arg Thr Thr Leu Asp Asp Leu Ser Gly Arg Ser Tyr Val Met
            180                 185                 190
Thr Ala Glu Asp Val Asp Leu Thr Leu Asn Trp Gly Arg Leu Ser Ser
        195                 200                 205
Val Leu Pro Asp Tyr His Gly Gln Asp Ser Val Arg Val Gly Arg Ile
    210                 215                 220
Ser Phe Gly Ser Ile Asn Ala Ile Leu Gly Ser Val Ala Leu Ile Leu
225                 230                 235                 240
Asn Cys His His His Ala Ser Arg Val Ala Arg Met Ala Ser Asp Glu
                245                 250                 255
Phe Pro Ser Met Cys Pro Ala Asp Gly Arg Val Arg Gly Ile Thr His
            260                 265                 270
Asn Lys Ile Leu Trp Asp Ser Ser Thr Leu Gly Ala Ile Leu Met Arg
        275                 280                 285
Arg Thr Ile Ser Ser
    290

26 297 PRT Artificial Sequence
source /note="Description of
Artificial Sequence Synthetic polypeptide"
26
Asp Glu Phe Thr Val Asp Phe Ser Ser Gln Lys Ser Tyr Val Asp Ser
1               5                   10                  15
Leu Asn Ser Ile Arg Ser Ala Ile Ser Thr Pro Leu Gly Asn Ile Ser
            20                  25                  30
Gln Gly Gly Val Ser Val Ser Val Ile Asn His Val Leu Gly Gly Asn
        35                  40                  45
Tyr Ile Ser Leu Asn Val Arg Gly Leu Asp Pro Tyr Ser Glu Arg Phe
    50                  55                  60
Asn His Leu Arg Leu Ile Met Glu Arg Asn Asn Leu Tyr Val Ala Gly
65                  70                  75                  80
Phe Ile Asn Thr Glu Thr Asn Ile Phe Tyr Arg Phe Ser Asp Phe Ser
                85                  90                  95
His Ile Ser Val Pro Asp Val Ile Thr Val Ser Met Thr Thr Asp Ser
            100                 105                 110
Ser Tyr Ser Ser Leu Gln Arg Ile Ala Asp Leu Glu Arg Thr Gly Met
        115                 120                 125
Gln Ile Gly Arg His Ser Leu Val Gly Ser Tyr Leu Asp Leu Met Glu
    130                 135                 140
Phe Arg Gly Arg Ser Met Thr Arg Ala Ser Ser Arg Ala Met Leu Arg
145                 150                 155                 160
Phe Val Thr Val Ile Ala Glu Ala Leu Arg Phe Arg Gln Ile Gln Arg
                165                 170                 175
Gly Phe Arg Pro Ala Leu Ser Glu Ala Ser Pro Leu Tyr Thr Met Thr
            180                 185                 190
Ala Gln Asp Val Asp Leu Thr Leu Asn Trp Gly Arg Ile Ser Asn Val
        195                 200                 205
Leu Pro Glu Tyr Arg Gly Glu Glu Gly Val Arg Ile Gly Arg Ile Ser
    210                 215                 220
Phe Asn Ser Leu Ser Ala Ile Leu Gly Ser Val Ala Val Ile Leu Asn
225                 230                 235                 240
Cys His Ser Thr Gly Ser Tyr Ser Val Arg Ser Val Ser Gln Lys Gln
                245                 250                 255
Lys Thr Glu Cys Gln Ile Val Gly Asp Arg Ala Ala Ile Lys Val Asn
            260                 265                 270
Asn Val Leu Trp Glu Ala Asn Thr Ile Ala Ala Leu Leu Asn Arg Lys
        275                 280                 285
Pro Gln Asp Leu Thr Glu Pro Asn Gln
    290                 295

27 12 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
27
Ser Thr Tyr Tyr Gly Gly Asp Trp Tyr Phe Asn Val
1               5                   10

28 10 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
28
Arg Ala Ser Ser Ser Val Ser Tyr Ile His
1               5                   10

29 8 PRT Artificial Sequence
source /note="Description of Artificial Sequence Synthetic peptide"
29
Gln Trp Thr Ser Asn Pro Pro Thr
1               5