Patent application title: Modified Latency Associated Protein Construct
Inventors:
IPC8 Class: AC07K14495FI
USPC Class:
1 1
Class name:
Publication date: 2020-02-27
Patent application number: 20200062814
Abstract:
The present invention provides a fusion protein comprising a latency
associated peptide (LAP), a pharmaceutically active agent and an amino
acid sequence comprising a dimerisation domain, wherein the LAP and the
pharmaceutically active agent are connected by an amino acid sequence
comprising a proleolytic cleavage site. Also provided are nucleic acids
enclosing such fusion proteins, process for their preparation,
pharmaceutical compositions, kits and uses thereof in medicine.Claims:
1-30. (canceled)
31. A protein dimer comprising a pair of fusion proteins, each fusion protein comprising a latency associated peptide (LAP) which is the precursor domain of TGF.beta.-1, -2, -3, -4, or -5, a pharmaceutically active agent and an amino acid sequence comprising a dimerisation domain comprising an antibody fragment polypeptide, wherein the LAP and the pharmaceutically active agent are connected by an amino acid sequence comprising a proteolytic cleavage site, and the dimerisation domain is linked to the N-terminus of the LAP, and wherein the fusion proteins are associated at the dimerisation domain in each fusion protein and form a closed shell around the pharmaceutically active agent, wherein the pharmaceutically active agent is an antibody or antibody fragment.
32. The protein dimer as claimed in claim 31, wherein the antibody fragment polypeptide is an Fc region polypeptide, an immunoglobulin hinge polypeptide, a CH3 domain polypeptide, a CH4 domain polypeptide, a CH1 domain polypeptide, or a CL domain polypeptide.
33. The protein dimer as claimed in claim 31, wherein the antibody fragment polypeptide is an Fc region polypeptide.
34. The protein dimer as claimed in claim 33, wherein the Fc region polypeptide is derived from an IgG or IgA antibody.
35. The protein dimer as claimed in claim 31, wherein the dimerisation domain is linked to the latency associated peptide by a linker sequence.
36. The protein dimer as claimed in claim 31, wherein the proteolytic cleavage site is a matrix metalloproteinase or an aggrecanase cleavage site.
37. The protein dimer as claimed in claim 31, wherein the antibody fragment is an scFv, Fab, Fab', F(ab').sub.2, Fv, dsFv diabody, or Fd fragment.
38. The protein dimer as claimed in claim 31, wherein the antibody or antibody fragment thereof is an anti-TNF antibody, anti-interleukin antibody, anti-interferon antibody, anti-cytokine antibody, or fragment thereof.
39. The protein dimer as claimed in claim 31, wherein the antibody or antibody fragment thereof is an antibody against the HER2/neu receptor.
40. The protein dimer as claimed in claim 31, wherein the antibody fragment is a trastuzumab scFv.
41. A pharmaceutical composition comprising a protein dimer of claim 31.
42. A nucleic acid construct encoding a fusion protein as defined in claim 31 comprising a nucleic acid sequence encoding a pharmaceutically active agent, a nucleic acid sequence encoding a LAP, and a nucleic acid sequence encoding a dimerisation domain composed of an antibody fragment polypeptide, wherein the pharmaceutically active agent is an antibody or an antibody fragment.
43. The nucleic acid construct as claimed in claim 42, wherein the dimerisation domain polypeptide is an Fc region polypeptide derived from an IgG or IgA antibody.
44. The nucleic acid construct as claimed in claim 42, wherein the nucleic acid construct further comprises a nucleic acid sequence encoding a proteolytic cleavage site.
45. The nucleic acid construct as claimed in claim 42, which is in the form of a vector.
46. A fusion protein encoded by the nucleic acid construct of claim 42.
47. A cell comprising a nucleic acid construct of claim 42.
48. A pharmaceutical composition comprising a nucleic acid construct as claimed in claim 42.
49. A method for the treatment of an inflammatory condition or cancer comprising the administration to a subject of a composition comprising a protein dimer of claim 31.
50. A method for the treatment of an inflammatory condition or cancer comprising the administration to a subject of a composition comprising a nucleic acid construct of claim 42.
51. A kit comprising a protein dimer as claimed in claim 31 as an administration vehicle.
52. A process for preparing the protein dimer as claimed in claim 31, comprising producing the fusion proteins recombinantly by expression in a host cell, purifying the expressed fusion proteins, and associating the dimerisation domain of the fusion proteins at the N-terminus of the LAP to form a closed shell around the pharmaceutically active agent.
53. A process for preparing a nucleic acid construct of claim 42, comprising ligating together nucleic acid sequences encoding a latency associated peptide, a dimerisation domain, a proteolytic cleavage sequence, and a pharmaceutically active agent, optionally including a linker sequence on either side of the proteolytic cleavage site.
Description:
[0001] The present invention relates to the use of proteins, protein
derivatives and DNA constructs that confer latency to pharmaceutically
active agents. The present invention also relates to improved methods of
providing latency to pharmaceutically active agents.
[0002] Most cytokines and growth factors are expressed under tight control mechanisms. Their gene expression is regulated by environmental stimuli such as infection, cell-cell interactions, change in extracellular matrix composition and interactions with adhesion molecules or via stimulation with other cytokines.
[0003] In addition to the control at the transcriptional and post-transcriptional level, some cytokines are not released into the medium unless a second signal activates the cell. A third level of regulation for cytokine activity is found in molecules which are secreted in a latent form and become "activated" by releasing the cytokine moiety where processes of inflammation, wound healing and tissue repair takes place (Khalil N, Microbes and Infection, 1, 1255-1263 (1999). In this latter respect, transforming growth factor beta (TGF.beta.) has received greatest attention.
[0004] TGF.beta. is synthesized as a dimeric latent cytokine composed of an amino terminal latency associated protein (LAP) and the active TGF.beta. cytokine at its COOH terminal end (Roberts and Sporn, Peptide Growth Factors and their Receptors: Sporn, M B and Roberts, AB, Springer-Verlag, 419-472 (1996); Roth-Eicchorn et al., Hepatology, 28 1588-1596 (1998)). The precursor peptide contains a signal peptide (residues 1-29) necessary for protein secretion and guiding the molecule through the Golgi apparatus to become processed by proteolytic cleavage and glycosylation. The LAP domain is separated from TGF.beta. by proteolytic cleavage at arginines (277-278). Mature TGF.beta. begins at alanine 279. The LAP, in addition to protect TGF.beta., contains important residues necessary for the interaction with other molecules. Mutations in the LAP domain have recently been associated with the autosomal dominant Camurati-Engelmann disease (Janssens et al., Nature Genetics, 26, 273-275 (2000). Cysteines 224 and 226 are important in the intermolecular disulphide bond between two LAPs. Their mutation to serine renders the molecule "active" (Sanderson et al., Proc. Natl. Acad. Sci. USA, 92, 2572-2576 (1995); Brunner et al., Mol. Endocrinol. 6, 1691-1700 (1992); Brunner et al., J. Biol. Chem, 264, 13660-13664 (1989)). The RGD motif (245-247) facilitates the interaction with integrins (Munger et al., Mol, Biol. of the Cell, 9, 2627-2638 (1998; Derynck R, TIBS, 19, 548-553 (1994)). Nucleic acid encoding TGF.beta. is described in U.S. Pat. No. 5,801,231.
[0005] In most cell types studied, including those of mesenchymal, epithelial and endothelial origin, TGF.beta. is secreted in a latent form consisting of TGF.beta. and its latency associated peptide (LAP) propeptide dimers, covalently linked to latent TGF.beta.-binding proteins (LTBPs). LTBPs are also needed for the secretion and folding of TGF.beta. (Miyazano et al., EMBO J. 10, 1091-1101 (1991); Miyazano et al., J. Biol. Chem. 267, 5668-5675 (1992); Eklov et al., Cancer Res. 53, 3193-3197 (1993)). Cysteine 33 is important for the disulphide bridge with the third 8 cysteine-rich repeat of latent TGF.beta. binding protein (LTBP) (Saharinen et al., The EMBO Journal, 15, 245-253 (1996). Modification of LTBP by enzymes such as thrombospondin (Schultz et al., The Journal of Biological Chemistry, 269, 26783-26788 (1994); Crawford et al., Cell, 93, 1159-1170 (1998)), transglutaminase (Nunes et al., J. Cell, Biol. 136, 1151-1163 (1997); Kojima et al., The Journal of Cell Biology, 121, 439-448 (1993)) and MMP9, MMP2 (Yu and Stamenkovic, Genes and Dev, 14, 163-176 (2000)) could release the active portion of TGF.beta. from the latent complex.
[0006] Cytokines are natural products serving as soluble local mediators of cell-cell interactions. They have a variety of pleiotropic actions, some of which can be harnessed for therapeutic purposes. Targeting of cytokines to specific cell types using scFv (Lode et al., Pharmacol. Ther, 80, 277-292 (1998)) and vWF (Gordon et al., Human Gene Therapy, 8, 1385-1394 (1997)) have focused entirely on the active cytokine moiety of the cytokine complex.
[0007] Pharmacologically active proteins or other medicines based on such agents, which have to be administered at very high concentrations systemically in order to achieve biologically effective concentrations in the tissue being targeted, tend to give rise to undesirable systemic effects, for example toxicity, which limit their use and efficacy.
[0008] The principles underlying the construction of such a system for providing latency to pharmaceutically active agents using the LAP of TGF-.beta. was described in WO 02/055098 and WO 2009/077755. In the naturally occurring LAP-TGF-.beta. complex, the latency associated peptide forms a protective shell around TGF preventing it from being degraded. The closed nature of the shell is guaranteed because of internal interactions between TGF with LAP. These interactions are not necessarily expected when the `payload` is another cytokine, growth factor or peptide or pharmaceutically active compound, and may result in a permeable shell which allows entry of mobile target molecules into the shell. For example, if the target molecule is a soluble receptor, the receptor may be able to enter the shell and interact with the pharmaceutically active agent even when the pharmaceutically active agent is bound to the LAP. This could lead to off-site activity which may have undesirable side effects. Furthermore, while improving protein production utilizing suspension cell cultures according to WO 2009/077755, the inventors found that, along with dimers, the LAP fusion proteins also tended to form active monomers.
[0009] The present inventors have now developed an improved means for providing pharmaceutically active agents in latent form based on this system.
[0010] According to the first aspect of the invention there is provided a fusion protein comprising a latency associated peptide (LAP), a pharmaceutically active agent and an amino acid sequence comprising a dimerisation domain wherein the LAP and pharmaceutically active agent are connected by an amino acid sequence comprising a proteolytic cleavage site.
[0011] The fusion protein comprising a LAP, a proteolytic cleavage site, a pharmaceutically active agent and a dimerisation domain may provide for site specific activation of the latent pharmaceutically active agent. The term "site specific activation" as used herein means, in general terms and not limited to the removal or reduction of latency, conferred on a pharmaceutically active agent, by site-specific cleavage at the proteolytic cleavage site.
[0012] Site-specific cleavage at the proteolytic cleavage site is expected to take place concomitantly with the restored activation of the pharmaceutically active agent.
[0013] The term "latent pharmaceutically active agent" as used herein may include, but is not limited to, pharmaceutically active agents which are latent due to their association with LAP and a proteolytic cleavage site. Specifically, the pharmaceutically active agent may be latent by virtue of its fusion to a LAP associated proteolytic cleavage site to form a latent fusion protein. The pharmaceutically active agent may be of natural or synthetic origin.
[0014] The fusion protein may be constructed as shown in FIG. 4(c) in which the dimerisation domain is fused to the LAP. An additional proteolytic cleavage site and/or linker sequence may be inserted also between the dimerisation domain and the LAP. A secretory signal peptide (i.e. the precursor peptide) may be fused to the dimerisation domain also at the N-terminal of the dimerisation domain.
[0015] The term "protein" in this text means, in general terms, a plurality of amino acid residues joined together by peptide bonds. It is used interchangeably and means the same as peptide, oligopeptide, oligomer or polypeptide, and includes glycoproteins and derivatives thereof. The term "protein" is also intended to include fragments, analogues and derivatives of a protein wherein the fragment, analogue or derivative retains essentially the same biological activity or function as a reference protein.
[0016] The fragment, analogue or derivative of the protein as defined in this text, may be at least 6, preferably 10 or 20, or up to 50 or 100 amino acids long.
[0017] The fragment, derivative or analogue of the protein may be (i) one in which one or more of the amino acid residues are substituted with a conserved or non-conserved amino acid residue (preferably, a conserved amino acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic code, or (ii) one in which one or more of the amino acid residues includes a substituent group, or (iii) one in which the mature polypeptide is fused with another compound, such as a compound to increase the half life of the polypeptide (for example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to the mature polypeptide, such as a leader or secretory sequence which is employed for purification of the polypeptide. Such fragments, derivatives and analogues are deemed to be within the scope of those skilled in the art from the teachings herein.
[0018] Particularly preferred are variants, analogues, derivatives and fragments having the amino acid sequence of the protein in which several e.g. 5 to 10, or 1 to 5, or 1 to 3, 2, 1 or no amino acid residues are substituted, deleted or added in any combination. Especially preferred among these are silent substitutions, additions and deletions, which do not alter the properties and activities of the protein of the present invention. Also especially preferred in this regard are conservative substitutions.
[0019] An example of a variant of the present invention is a fusion protein as defined above, apart from the substitution of one or more amino acids with one or more other amino acids. The skilled person is aware that various amino acids have similar properties. One or more such amino acids of a substance can often be substituted by one or more other such amino acids without eliminating a desired activity of that substance.
[0020] Thus the amino acids glycine, alanine, valine, leucine and isoleucine can often be substituted for one another (amino acids having aliphatic side chains). Of these possible substitutions it is preferred that glycine and alanine are used to substitute for one another (since they have relatively short side chains) and that valine, leucine and isoleucine are used to substitute for one another (since they have larger aliphatic side chains which are hydrophobic). Other amino acids which can often be substituted for one another include: phenylalanine, tyrosine and tryptophan (amino acids having aromatic side chains); lysine, arginine and histidine (amino acids having basic side chains); aspartate and glutamate (amino acids having acidic side chains); asparagine and glutamine (amino acids having amide side chains); and cysteine and methionine (amino acids having sulphur containing side chains).
[0021] Substitutions of this nature are often referred to as "conservative" or "semi-conservative" amino acid substitutions.
[0022] Amino acid deletions or insertions may also be made relative to the amino acid sequence for the fusion protein referred to above. Thus, for example, amino acids which do not have a substantial effect on the activity of the polypeptide, or at least which do not eliminate such activity, may be deleted. Such deletions can be advantageous since the overall length and the molecular weight of a polypeptide can be reduced whilst still retaining activity. This can enable the amount of polypeptide required for a particular purpose to be reduced--for example, dosage levels can be reduced.
[0023] Amino acid insertions relative to the sequence of the fusion protein above can also be made. This may be done to alter the properties of a substance of the present invention (e.g. to assist in identification, purification or expression, as explained above in relation to fusion proteins).
[0024] Amino acid changes relative to the sequence for the fusion protein of the invention can be made using any suitable technique e.g. by using site-directed mutagenesis. It should be appreciated that amino acid substitutions or insertions within the scope of the present invention can be made using naturally occurring or non-naturally occurring amino acids. Whether or not natural or synthetic amino acids are used, it is preferred that only L-amino acids are present.
[0025] A protein according to the invention may have additional N-terminal and/or C-terminal amino acid sequences. Such sequences can be provided for various reasons, for example, glycosylation.
[0026] The term "fusion protein" in this text means, in general terms, one or more proteins joined together by chemical means, including hydrogen bonds or salt bridges, or by peptide bonds through protein synthesis or both.
[0027] The latency associated peptide (LAP) of the present invention may include, but is not limited to, the coding sequence for the precursor domain of TGF.beta. or a sequence which is substantially identical thereto.
[0028] "Identity" as known in the art is the relationship between two or more polypeptide sequences or two or more polynucleotide sequences, as determined by comparing the sequences. In the art, identity also means the degree of sequence relatedness (homology) between polypeptide or polynucleotide sequences, as the case may be, as determined by the match between strings of such sequences. While there exist a number of methods to measure identity between two polypeptide or two polynucleotide sequences, methods commonly employed to determine identity are codified in computer programs. Preferred computer programs to determine identity between two sequences include, but are not limited to, GCG program package (Devereux, et al., Nucleic acids Research, 12, 387 (1984), BLASTP, BLASTN, and FASTA (Atschul et al., J. Molec. Biol. 215, 403 (1990).
[0029] The LAP of the present invention may comprise the precursor domain of TGF.beta., for example, the precursor peptide of TGF.beta.-1, 2 or 3 (from human) (Derynck et al., Nature, 316, 701-705 (1985); De Martin et al., EMBO J. 6 3673-3677 (1987); Hanks et al., Proc. Natl. Acad. Sci. 85, 79-82 (1988); Derynck et al., EMBO J. 7, 3737-3743 (1988); Ten Dyke et al., Proc. Natl. Acad. Sci. USA, 85, 4715-4719 (1988)) TGF.beta.-4 (from chicken) (Jakowlew et al., Mol. Endocrinol. 2, 1186-1195 (1988)) or TGF.beta.-5 (from xenopus) (Kondaiah et al., J. Biol. Chem. 265, 1089-1093 (1990)). The term "precursor domain" is defined as a sequence encoding a secretory signal peptide (i.e. a precursor peptide) which does not include the sequence encoding the mature protein. The amino acid sequences of the precursor domain of TGF.beta.1, 2, 3, 4 and 5 (Roberts and Sporn, Peptide Growth Factors and their Receptors: Sporn, M B and Roberts, AB, Springer-Verlag, Chapter 8, 422 (1996)) are shown in FIG. 1.
[0030] Preferably, the amino acid sequence of the LAP has at least 50% identity, using the default parameters of the BLAST computer program (Atschul et al., J. Mol. Biol. 215, 403-410 (1990) provided by HGMP (Human Genome Mapping Project), at the amino acid level, to the precursor domain of TGF.beta.1, 2, 3, 4 or 5 (Roberts and Sporn, Peptide Growth Factors and their Receptors: Sporn, M B and Roberts, AB, Springer-Verlag, Chapter 8, 422 (1996)) as shown in FIG. 1. More preferably, the LAP may have at least 60%, 70%, 80%, 90% and still more preferably 95% (still more preferably at least 99%) identity, at the nucleic acid or amino acid level, to the precursor domain of TGF.beta.1, 2, 3, 4 or 5 as shown in FIG. 1 which comprises residues 1 to 278.
[0031] The LAP may comprise the LAP of TGF.beta.1, 2, 3, 4, or 5 (Roberts and Sporn, Peptide Growth Factors and their Receptors: Sporn, M B and Roberts, AB, Springer-Verlag, Chapter 8, 422 (1996)) as shown in FIG. 1.
[0032] The LAP may contain at least two, for example at least 4, 6, 8, 10 or 20 cysteine residues for the formation of disulphide bonds.
[0033] The LAP may also comprise a sequence which has at least 50%, 60%, 70%, 80%, 90%, 95% or 99% identity with a LAP sequence of FIG. 1, using the default parameters of the BLAST computer program provided by HGMP, thereto.
[0034] The "dimerisation domain" refers to a peptide having affinity for a second peptide, such that the two peptides associate under physiological conditions to form a dimer. The second peptide may be the same or a different peptide. The dimerisation domain may also refer to polypeptides. The peptides or polypeptides may interact with each other through covalent and/or non-covalent association(s).
[0035] The dimerisation domain may be linked to the latency associated peptide by a linker. The linker size can be varied to vary the size of the shell in order to accommodate the pharmaceutically active agent. The linker peptide may comprise the amino acid sequence GGGGS (SEQ ID NO:135) or a multimer thereof (for example a dimer, a trimer, or a tetramer), a suitable linker may be (GGGGS).sub.3 (SEQ ID NO:136), or a sequence of nucleotides which has at least 50%, 60%, 70%, 80%, 90%, 95% or 99% identity, using the default parameters of the BLAST computer program provided by HGMP, thereto.
[0036] Examples of dimerisation domains include antibody fragment polypeptides such as an immunoglobulin Fc polypeptide, an immunoglobulin hinge polypeptide, a CH3 domain polypeptide, a CH4 domain polypeptide, a CH1 domain or CL domain polypeptide; a leucine zipper domain (e.g., a jun/fos leucine zipper domain, see, e.g., Kostelney et al., J. Immunol., 148:1547-1553, 1992; or a yeast GCN4 leucine zipper domain); an isoleucine zipper domain; a dimerising region of a dimerising cell-surface receptor (e.g., interleukin-8 receptor (IL-8R); or an integrin heterodimer such as LFA-1 or GPIllb/111a); a dimerising region of a secreted, dimerising ligand (e.g., nerve growth factor (NGF), neurotrophin-3 (NT-3), interleukin-8 (IL-8), vascular endothelial growth factor (VEGF), or brain-derived neurotrophic factor (BDNF); see, e.g., Arakawa et al., J. Biol. Chem. 269:27833-27839, 1994, and Radziejewski et al., Biochem. 32:1350, 1993); or a polypeptide comprising at least one cysteine residue (e.g., one, two, or three to about ten cysteine residues) such that disulfide bond(s) can form between the polypeptide and a second polypeptide comprising at least one cysteine residue. Suitably the dimerisation domain may be an Fc polypeptide.
[0037] An immunogobulin hinge polypeptide typically comprises a region which is rich in proline and cysteine amino acid residues. A common sequence motif present in the hinge polypeptide region may be may be CPXCP (SEQ ID NO:137) where X can be another residue that does not interfere with dimerisation, for example proline (P), arginine (R) or serine (S). The hinge region polypeptide may be from around 10 to 75 amino acid residues. The hinge region polypeptide may contain a plurality of cysteine-cysteine disulphide bonds, for example of from 2 to 15. The hinge region polypeptide may comprise the sequence CPXCP where X can be another residue that does not interfere with dimerisation, for example proline (P), arginine (R) or serine (S). A number of repeats of the sequence CPXCP may be present also, for example 2, 3, 4, or 5 repeats, or greater.
[0038] Wypych et al., J. Biol. Chem. 28316194-16205, 2008 defined the following hinge region peptide sequences for IgG antibodies as shown in Table 1 below:
TABLE-US-00001 TABLE 1 IgG subtype Core hinge sequences IgG1 EPKSCDKTHTCPPCP (SEQ ID NO: 138) IgG2 ERKCCVECPPCP (SEQ ID NO: 139) IgG3 ELKTPLGDTTHTCPRCP (SEQ ID NO: 140) (EPKSCDTPPPCPRCP).sub.3 (SEQ ID NO: 141) IgG4 ESKYGPPCPSCP (SEQ ID NO: 142)
[0039] The terms "antibody" and "immunoglobulin" are used herein interchangeably. An antibody molecule is made up of two identical heavy (H) and two identical light (L) chains held together by disulphide bonds. Each heavy chain comprises an Fc polypeptide. The two Fc polypeptides from the two heavy chains dimerise to form the Fc region of the antibody molecule. The term "Fc region" refers to the constant region of an antibody excluding the first constant region immunoglobulin domain of the heavy chain (CH1) that interacts with the constant portion of the light chain (CL) forming a CH1-CL domain pair. Thus, Fc region comprises the last two constant region immunoglobulin domains (CH2 and CH3) of IgA, IgD, and IgG, and the last three constant region immunoglobulin domains of IgE and IgM (CH2, CH3 and CH4), Any polypeptide of the various immunoglobulin constant domains may therefore be used in accordance with the present invention as a dimerisation domain.
[0040] Several antibody effector functions are mediated through the binding of the Fc region to Fc receptors (FcR) found on the surface of many cells for example lymphocytes, macrophages, natural killer cells, etc. FcRs are defined by their specificity for antibody isotypes. For example, Fc receptors for IgG antibodies are referred to as Fc.gamma.R.
[0041] IgG is also bound by the neonatal Fc receptor (FcRn). In humans, IgG exhibits a long serum half-life. Studies indicate that this is due to the protective effect of FcRn which binds to the Fc region of IgG and prevents degradation by allowing intracellular recycling.
[0042] The Fc polypeptide may be selected to alter, e.g. increase or decrease the half-life of the fusion protein. As used herein, the term "half-life" refers to a biological half-life of a particular polypeptide or protein in vivo. Half-life may be represented by the time required for half the quantity administered to a subject to be cleared from the circulation and/or other tissues in the animal. In an embodiment of the invention the Fc polypeptide is an IgG Fc polypeptide.
[0043] IgG antibodies can be further subdivided into IgG1, IgG2, IgG3 and IgG4. In an embodiment of the invention the Fc polypeptide may be IgG1, IgG2, IgG3 and IgG4 polypeptide, for example an IgG1 polypeptide.
[0044] The Fc polypeptide may be selected to target the fusion protein to specific tissues, for example the mucosa. The IgA antibody plays an important role in mucosal immunity for e.g. in the respiratory tract and the gastrointestinal mucosal lining. In its secretory form, IgA is found in mucous secretions such as tears, saliva, colostrum, gastrointestinal and genitourinary fluids. IgA deficiency is associated with a number of autoimmune diseases such as rheumatoid arthritis, systemic lupus erythematosus and immune thrombocytopenic purpura. IgA deficiency is also associated with allergic diseases such as asthma. In another embodiment of the invention the Fc polypeptide may be an IgA polypeptide.
[0045] The Fc polypeptides may be derived from the same antibody isotype to form a homodimeric Fc region or from different antibody isotypes to form a heterodimeric Fc region. The Fc polypeptide may be a naturally occurring polypeptide or may be an engineered polypeptide.
[0046] The LAP may provide a protective "shell" around the pharmaceutically active agent thereby shielding it and hindering, or preventing, its interaction with other molecules in the cell surface or molecules important for its activity.
[0047] The dimerisation domain enables the fusion of the amino terminals of two latency associated proteins thereby effectively closing the LAP shell. The dimerisation domains are therefore complementary and permit dimerisation to occur. The closure of the shell does not depend on the interaction of LAP with the pharmaceutically active agent. The closure also prevents monomer formation of the LAP fusion proteins.
[0048] Closure of the shell also prevents any interaction between the pharmaceutically active agent and its target molecule unless the pharmaceutically active agent is released from the LAP fusion protein by proteolytic activity. This ensures site-specific delivery of the pharmaceutically active agent and may reduce off-site activity.
[0049] In one alternative embodiment of the invention, therefore, there is provided a protein construct comprising two fusion proteins as defined in the first aspect of the invention which are present as a dimer. The dimer is composed of monomers of the first aspect of the invention which may be the same or different with respect to the latency associated peptide (LAP), the pharmaceutically active agent, and the amino acid sequence comprising a proteolytic cleavage site.
[0050] The dimer may therefore be composed of a first monomer and a second monomer, in which the first monomer comprises a latency associated peptide (LAP), a pharmaceutically active agent and an amino acid sequence comprising a dimerisation domain wherein the LAP and pharmaceutically active agent are connected by an amino acid sequence comprising a proteolytic cleavage site, and a second monomer comprising a latency associated peptide (LAP), a pharmaceutically active agent and an amino acid sequence comprising a dimerisation domain wherein the LAP and pharmaceutically active agent are connected by an amino acid sequence comprising a proteolytic cleavage site.
[0051] The present invention therefore provides a composition comprising two fusion proteins according to the first aspect, wherein the fusion proteins are associated at the dimerisation domain in each fusion protein.
[0052] The proteolytic cleavage site may comprise any protease specific cleavage site. The proteolytic cleavage site may include, but is not limited to, a matrix metalloproteinase (MMP) cleavage site, a serine protease cleavage site, an aggrecanase cleavage site, a site cleavable by a parasitic protease derived from a pathogenic organism (Zhang et al., J. Mol. Biol. 289, 1239-1251 (1999); Voth et al., Molecular and Biochemical Parasitology, 93, 31-41 (1998); Yoshioka et al., Folia Pharmacologica Japonica, 110, 347-355 (1997); Tort et al, Advances in Parasitology, 43, 161-266 (1999); McKerrow, International Journal for Parasitology, 29, 833-837 (1999); Young et al., International Journal for Parasitology, 29, 861-867 (1999); Coombs and Mottram, Parasitology, 114, 61-80 (1997)) or a site cleavable by the proteins of the complement cascade (Carroll, Annu. Rev. Immunol. 16, 545-568 (1998); Williams et al., Ann. Allergy, 60, 293-300 (1988)).
[0053] Suitably, the proteolytic cleavage site of the fusion proteins of the present invention is cleaved when the fusion protein is located at or introduced to a site of disease diagnosed as an inflammatory condition, e.g. arthritis, or cancer which can be characterized by inflammation and/or tissue remodelling.
[0054] A MMP cleavage site may comprise a number of amino acid residues recognisable by MMP. Preferably, the MMP cleavage site comprises the minimum number of amino acid residues required for recognition and cleavage by MMP. Moreover, the amino acids of the MMP site may be linked by one or more peptide bonds which are cleavable, proteolytically, by MMP. MMPs which may cleave the MMP site include, but are not limited to, MMP1, MMP2, MMP3, MMP7, MMP8, MMP9, MMP10 and MMP13 (Yu and Stamenkovic, Genes and Dev. 14, 163-176 (2000); Nagase and Fields, Biopolymers, 40, 399-416 (1996); Massova et al., J. Mol. Model. 3, 17-30 (1997); reviewed in Vu and Werb; Genes and Dev. 14, 2123-2133 (2000)).
[0055] The MMP cleavage site e.g. any one or more of MMP1, MMP2, MMP3, MMP7, MMP8, MMP9 or MMP10 may be as shown in FIG. 2 or a sequence of amino acids which has at least 50%, 60%, 70%, 80%, 90%, 95% or 99% identity with the sequences shown in FIG. 2, using the default parameters of the BLAST computer program provided by HGMP, thereto. In an embodiment the MMP proteolytic cleavage site has the amino acid sequence PLGLWA.
[0056] The proteolytic cleavage site may also comprise any aggrecanase specific cleavage site which is cleavable by an aggrecanase. An aggrecanase cleavage site may comprise a number of amino acid residues recognisable by an aggrecanase. Moreover, the amino acids of the aggrecanase site may be linked by one or more peptide bonds which are cleavable, proteolytically, by aggrecanase.
[0057] Aggrecanases which may cleave the aggrecanase site include, but are not limited to ADAMTS-4 (aggrecanase-1), ADAMTS-5 (aggrecanase-2) and ADAMTS-11 (Tortorella, M. D., et al Osteoarthritis Cartilage, 2001. 9(6): p. 539-552); Abbaszade, I., et al J Biol Chem, 1999. 274(33): p. 23443-23450).
[0058] The sequences of the protein cleavage sites of ADAMTS-4 (aggrecanase-1) are shown in FIG. 3. Suitable ADAMTS-4 sites include:
TABLE-US-00002 (SEQ ID NO: 80) HNEFRQRETYMVF (SEQ ID NO: 97) DVQEFRGVTAVIR
[0059] The consensus ADAMTS-4 cleavage motif can be represented according to Hills et al (J. Biol. Chem. 282 11101-11109 (2007)) as:
TABLE-US-00003 (SEQ ID NO: 129) E-[AFVLMY]-X-.sub.(0,1)-[RK]-X.sub.(2,3)-[ST]-[VYIFWMLA]
[0060] The aggrecanase proteolytic cleavage site of the present invention may be cleaved by ADAMTS-4 (aggrecanase-1), ADAMTS-5 (aggrecanase-2) or ADAMTS-11.
[0061] The amino acid sequence of the aggrecanase cleavage site may include a sequence which has at least 50%, 60%, 70%, 80%, 90%, 95% or 99% identity, using the default parameters of the BLAST computer program provided by HGMP, thereto. Preferably, the nucleic acid sequence encoding the aggrecanase cleavage site comprises the minimum number of residues required for recognition and cleavage by an aggrecanase.
[0062] The present invention may further provide a "linker" peptide. Preferably the linker peptide is linked to the amino acid sequence of the proteolytic cleavage site. The linker peptide may be provided at the C terminal or N terminal end of the amino acid sequence encoding the proteolytic cleavage site. Preferably, the linker peptide is continuous with the amino acid sequence of the proteolytic cleavage site. The linker peptide may comprise the amino acid sequence GGGGS (SEQ ID NO:135) or a multimer thereof (for example a dimer, a trimer, or a tetramer), a suitable linker may be (GGGGS).sub.3 (SEQ ID NO:136), or a sequence of nucleotides which has at least 50%, 60%, 70%, 80%, 90%, 95% or 99% identity, using the default parameters of the BLAST computer program provided by HGMP, thereto.
[0063] The term "linker peptide" is intended to define any sequence of amino acid residues which preferably provide a hydrophilic region when contained in an expressed protein. Such a hydrophilic region may facilitate cleavage by an enzyme at the proteolytic cleavage site.
[0064] The constructs of the invention may also comprise an additional linker sequence and/or proteolytic cleavage site between the dimerisation domain and the LAP.
[0065] The term "latency" as used herein, may relate to a shielding effect which may hinder interaction between the fusion protein and other molecules in the cell surface. Alternatively the term latency may be used to describe a reduction in the activity (up to and including ablation of activity) of a molecule/agent associated with the fusion protein. The term latency may also relate to a stabilising effect of the fusion protein. The effect may be in full or partial, where a partial effect is sufficient to achieve the latency of the active agent.
[0066] The term "associating with" in the context of the present invention is intended to include all means of association including, but not limited to, chemical cross-linking or peptide bond linkage.
[0067] The pharmaceutically active agent may be a pharmaceutically active protein which can include, but is not limited to, an antibody, a growth factor (e.g. TGF.beta., epidermal growth factor (EGF), platelet derived growth factor (PDGF), nerve growth factor (NGF), colony stimulating factor (CSF), hepatocyte growth factor, insulin-like growth factor, placenta growth factor); a differentiation factor; a cytokine e.g. an interleukin, (e.g. IL1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, IL-18, IL-19, IL-20, IL-21, IL-22, IL-23, IL-24, IL-25, IL-26, IL-27, IL-28, IL-29, IL-30, IL-31, IL-32 or IL-33 or an interferon (e.g. IFN-.alpha., IFN-.beta. and IFN-.gamma.), tumour necrosis factor (TNF), IFN-.gamma. inducing factor (IGIF), a bone morphogenetic protein (BMP, e.g. BMP-1, BMP-2, BMP-3, BMP-4, BMP-4, BMP-5, BMP-6, BMP-7, BMP-8, BMP-9, BMP10, BMP-11, BMP-12, BMP-13); an interleukin receptor antagonist (e.g. IL-1 ra, IL-1R11), a tumor necrosis factor inhibitor (TNF-R or anti-TNF); a chemokine (e.g. MIPs (Macrophage Inflammatory Proteins) e.g. MIPla and MIP113; MCPs (Monocyte Chemotactic Proteins) e.g. MCP1, 2 or 3; RANTES (regulated upon activation normal T-cell expressed and secreted)); a trophic factor; a cytokine inhibitor; a cytokine receptor; a free-radical scavenging enzyme e.g. superoxide dismutase or catalase; a pro-drug converting enzyme (e.g. angiotensin converting enzyme, deaminases, dehydrogenases, reductases, kinases, urate oxidase and phosphatases); a peptide mimetic; a protease inhibitor; a tissue inhibitor of metalloproteinases (TIMPs e.g. TIMP1, TIMP2, TIMP3 or TIMP4) or a serpin (inhibitors of serine proteases). Preferably, the pharmaceutically active agent will be derived from the species to be treated e.g. human origin for the treatment of humans.
[0068] Preferably, the pharmaceutically active agent may be a cytokine, e.g. IFN.beta., IL-4, or IL-1ra, or a cytokine inhibitor, such as an antibody or antibody fragment, e.g. trastuzumab, and as defined herein below.
[0069] The interleukins and cytokines may be anti-inflammatory or pro-inflammatory. Anti-inflammatory cytokines and certain interleukins, such as IL-4 and/or IL-10, are suitable for the treatment of arthritis, whereas pro-inflammatory cytokines and other interleukins, such as IL-1 and IL-2, are suitable for the treatment of cancer.
[0070] As used herein "peptide mimetics" includes, but is not limited to, agents having a desired peptide backbone conformation embedded into a non-peptide skeleton which holds the peptide in a particular conformation. Peptide mimetics, which do not have some of the drawbacks of peptides, are of interest in those cases where peptides are not suitable in medicine.
[0071] Peptide mimetics may comprise a peptide backbone which is of the L- or D-conformation. Examples of peptides mimetics include melanocortin, adrenocorticotrophin hormone (ACTH) and other peptide mimetic agents which play a role in the central nervous system, endocrine system in signal transduction and in infection and immunity.
[0072] The pharmaceutically active agent may comprise a chemical compound such as a chemotherapeutic agent or other synthetic drug. Alternatively, the pharmaceutically active agent may comprise an siRNA or a peptide nucleic acid (PNA) sequence e.g. a poly-lysine sequence which binds to nucleic acids and permeabilises lipid bilayers (Wyman et al., Biological Chemistry, 379, 1045-1052 (1998)) or a KALA peptide which facilitates transfer through lipid bilayers (Wyman et al., Biochemistry, 36, 3008-3017 (1997)) or a protein transduction domain (PTD) that enables polypeptides to enter cells via the plasma membrane (Pi et al Molecular Therapy 2, 339-347 (2000)).
[0073] The pharmaceutically active agent may be suitable for interacting with soluble target molecules. Examples of soluble target molecules include cytokines, growth factors, signaling proteins and other ligands and receptors.
[0074] The pharmaceutically active agent may be a cytokine inhibitor. The term "cytokine inhibitor" refers to a molecule that can block, reduce, inhibit or neutralise a function, an activity and/or the expression of a cytokine. The cytokine inhibitor may be a protein (for example soluble cytokine receptor protein); an antibody or antibody fragment; nucleic acid (for example siRNA or anti-sense nucleic acid) or organic or inorganic molecules.
[0075] Examples of suitable antibodies include but are not limited to anti-TNF (e.g. anti-TNF .alpha., anti-TNF .beta.), anti-interleukins (e.g. anti-IL-1, anti-IL-2, anti-IL-3, anti-IL-4, anti-IL-5, anti-II-6, anti-IL-7, anti-IL-8, anti-IL-9, anti-IL-10, anti-IL-11, anti-IL-12, anti-IL-13, anti-IL-14, anti-IL-15, anti-IL-16, anti-IL-17, anti-IL-18, anti-IL-19, anti-IL-20, anti-IL-21, anti-IL-22, anti-IL-23, anti-IL-24, anti-IL-25, anti-IL-26, anti-IL-27, anti-IL-28, anti-IL-29, anti-IL-30, anti-IL-31, anti-IL-32, anti-IL-33, anti-IL-34, anti-IL-35 and IL-36), anti-interferons (e.g. anti-INF-.alpha., anti-INF-.beta., anti-INF-.gamma. and anti-INF-.omega.) and fragments thereof.
[0076] Examples of such molecules also include trastuzumab (also known as Herclon.TM./Herceptin.TM.), a monoclonal antibody to the HER2/neu receptor.
[0077] The pharmaceutically active agent may be an antibody or an antibody fragment. An "antibody fragment" as referred to herein means any portion of a full length antibody. Examples of antibody fragments include, but are not limited to, Fab, Fab', F(ab')2, scFv, Fv, dsFv diabody and Fd fragments.
[0078] The term "single chain variable fragment" or "scFv" refers to an Fv fragment in which the heavy chain domain and the light chain domain are linked. One or more scFv fragments may be linked to other antibody fragments (such as the constant domain of a heavy chain or a light chain) to form antibody constructs having one or more antigen recognition sites.
[0079] The antibody or antibody fragment may be suitable for use in the treatment of inflammatory conditions such as arthritis, gout, atherosclerosis, allograft rejection, Crohn's disease, inflammatory bowel disease, irritable bowel syndrome and colitis.
[0080] In an alternative embodiment, the invention further provides the fusion protein of the present invention optionally in association with latent TGF.beta. binding protein (LTBP). Typically, the fusion protein is covalently linked to LTBP to form a complex. Preferably, the association is mediated by disulphide bond(s) between Cys No. 33 of LAP and the third 8 Cys residue of LTBP. The LTBP associated with the fusion protein may include, but is not limited to, LTBP 1, 2, 3 or 4 (Kanzaki et al., Cell, 61, 1051-1061 (1990); Tsuji et al., Proc. Natl. Acad. Sci. USA, 87, 8835-8839 (1990); Moren et al., J. Biol. Chem. 269, 32469-32478 (1994); Yin et al., J. Biol. Chem. 270, 10147-10160 (1995); Gibson et al., Mol. Cell. Biol. 15, 6932-6942 (1995); Saharinen et al., J. Biol. Chem. 273, 18459-18469 (1998)), or fragments of LTBP such as that containing the third 8 Cys repeat, or homologues having a sequence of amino acids or nucleotides which has at least 50%, 60%, 70%, 80%, 90%, 95% or 99% identity, using the default parameters of the BLAST computer program provided by HGMP, to that of LTBP.
[0081] Cleavage of LTBP may release the fusion protein from the LTBP complex. Enzymes which may cleave LTBP in this manner include, but are not limited to, thrombospondin (Schultz et al., The Journal of Biological Chemistry, 269, 26783-26788 (1994); Crawford et al., Cell, 93, 1159-1170 (1998)), transglutaminase (Nunes et al., J. Cell, Biol. 136, 1151-1163 (1997); Kojima et al., The Journal of Cell Biology, 121, 439-448 (1993)) MMP9 and MMP2 (Yu and Stamenkovic, Genes and Dev, 14, 163-176 (2000)).
[0082] The invention further provides nucleic acid encoding the fusion protein of the first aspect of the invention as defined above. A second aspect of the invention provides a nucleic acid construct comprising a first nucleic acid sequence encoding a pharmaceutically active agent, a second nucleic acid sequence encoding a LAP and a third nucleic acid sequence encoding a dimerisation domain polypeptide.
[0083] The term "nucleic acid construct" generally refers to any length of nucleic acid which may be DNA, cDNA or RNA such as mRNA obtained by cloning or produced by chemical synthesis. The DNA may be single or double stranded. Single stranded DNA may be the coding sense strand, or it may be the non-coding or anti-sense strand. For therapeutic use, the nucleic acid construct is preferably in a form capable of being expressed in the subject to be treated.
[0084] The pharmaceutically active agent may be suitable for interacting with soluble target molecules. Examples of soluble target molecules include cytokines, growth factors, signaling proteins and other ligands and receptors.
[0085] In an embodiment of the invention, the first nucleic acid sequence encodes a cytokine inhibitor. The term "cytokine inhibitor" refers to a molecule that can block, reduce, inhibit or neutralise a function, an activity and/or the expression of a cytokine. The cytokine inhibitor may be a protein (for example soluble cytokine receptor protein); an antibody or antibody fragment; nucleic acid (for example siRNA or anti-sense nucleic acid).
[0086] Where the first nucleic acid construct encodes an antibody or an antibody fragment, the antibody fragment may be, for example, an Fab, Fab', F(ab')2, scFv, Fv, dsFv diabody or Fd fragment. Examples of such molecules include trastuzumab (also known as Herclon.TM./Herceptin.TM.), a monoclonal antibody to the HER2/neu receptor.
[0087] In some embodiments, the first nucleic acid sequence encodes the protein IFN.beta., IL-4 or IL-1ra. In one embodiment of the invention, the first nucleic acid sequence encodes IFN.beta., IL-4 or IL-1ra from a mouse or a human.
[0088] The nucleic acid construct of the second aspect of the invention may be in the form of a vector, for example, an expression vector, and may include, among others, chromosomal, episomal and virus-derived vectors, for example, vectors derived from bacterial plasmids, from bacteriophage, from transposons, from yeast episomes, from insertion elements, from yeast chromosomal elements, from viruses such as baculo-viruses, papova-viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, such as cosmids and phagemids. Generally, any vector suitable to maintain, propagate or express nucleic acid to express a polypeptide in a host, may be used for expression in this regard. The vector may comprise a plurality of the nucleic acid constructs defined above, for example 2 or more.
[0089] The invention further provides a protein encoded by the nucleic acid construct of the second aspect of the invention optionally in association with latent TGF.beta. binding protein (LTBP) described herein. Typically, the protein encoded by the nucleic acid construct is covalently linked to LTBP to form a complex. Preferably, the association is mediated by disulphide bond(s) between Cys No. 33 of LAP and the third 8 Cys residue of LTBP.
[0090] The nucleic acid construct of the second aspect of the invention preferably includes a promoter or other regulatory sequence which controls expression of the nucleic acid. Promoters and other regulatory sequences which control expression of a nucleic acid have been identified and are known in the art. The person skilled in the art will note that it may not be necessary to utilise the whole promoter or other regulatory sequence. Only the minimum essential regulatory element may be required and, in fact, such elements can be used to construct chimeric sequences or other promoters. The essential requirement is, of course, to retain the tissue and/or temporal specificity. The promoter may be any suitable known promoter, for example, the human cytomegalovirus (CMV) promoter, the CMV immediate early promoter, the HSV thymidine kinase, the early and late SV40 promoters or the promoters of retroviral LTRs, such as those of the Rous Sarcoma virus (RSV) and metallothionine promoters such as the mouse metallothionine-I promoter. The promoter may comprise the minimum comprised for promoter activity (such as a TATA element without enhancer elements) for example, the minimum sequence of the CMV promoter. Preferably, the promoter is contiguous to the first and/or second nucleic acid sequence.
[0091] As stated herein, the nucleic acid construct of the second aspect of the invention may be in the form of a vector. Vectors frequently include one or more expression markers which enable selection of cells transfected (or transformed) with them, and preferably, to enable a selection of cells containing vectors incorporating heterologous DNA. A suitable start and stop signal will generally be present.
[0092] One embodiment of the invention relates to a cell comprising the nucleic acid construct of the second aspect of the invention. The cell may be termed a "host" cell, which is useful for the manipulation of the nucleic acid, including cloning. Alternatively, the cell may be a cell in which to obtain expression of the nucleic acid. Representative examples of appropriate host cells for expression of the nucleic acid construct of the invention include virus packaging cells which allow encapsulation of the nucleic acid into a viral vector; bacterial cells, such as Streptococci, Staphylococci, E. coli, Streptomyces and Bacillus subtilis; single cells, such as yeast cells, for example, Saccharomyces cerevisiae, and Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells, animal cells such as CHO, COS, C127, 3T3, PHK.293, and Bowes Melanoma cells and other suitable human cells; and plant cells e.g. Arabidopsis thaliana.
[0093] Introduction of an expression vector into the host cell can be affected by calcium phosphate transfection, DEAE-dextran mediated transfection, microinjection, cationic lipid-mediated transfection, electroporation, transduction, scrape loading, ballistic introduction, infection or other methods. Such methods are described in many standard laboratory manuals, such as Sambrook et al, Molecular Cloning, a Laboratory Manual, Second Edition, Coldspring Harbor Laboratory Press, Coldspring Harbor, N.Y. (1989).
[0094] Mature proteins can be expressed in host cells, including mammalian cells such as CHO cells, yeast, bacteria, or other cells under the control of appropriate promoters. Cell-free translation systems can be employed to produce such proteins using RNAs derived from the nucleic acid construct of the second aspect of the present invention. Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook et al, Molecular Cloning, a Laboratory Manual, Second Edition, Coldspring Harbor Laboratory Press, Coldspring Harbor, N.Y. (1989).
[0095] Proteins can be recovered and purified from recombinant cell cultures by well-known methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography, high performance liquid chromatography, lectin and/or heparin chromatography. For therapy, the nucleic acid construct e.g. in the form of a recombinant vector, may be purified by techniques known in the art, such as by means of column chromatography as described in Sambrook et al, Molecular Cloning, a Laboratory Manual, Second Edition, Coldspring Harbor Laboratory Press, Coldspring Harbor, N.Y. (1989).
[0096] According to a third aspect of the invention, there is provided a composition in accordance with the first aspect of the invention for use in the treatment of inflammatory conditions or cancer. This aspect of the invention therefore extends to and includes a method for the treatment of inflammatory conditions or cancer comprising the administration to a subject of a composition comprising a fusion protein comprising a latency associated protein, a dimerisation domain and a pharmaceutically active agent.
[0097] The present invention provides a composition as described above for use in the treatment of inflammatory conditions or cancer. Inflammatory conditions include, without limitation, atherosclerosis, acute and chronic lung inflammation (e.g., chronic bronchitis, asthma, lung infection including bacterial and viral infections such as SARS and influenza, cystic fibrosis, etc.), inflammation of virus-infected tissues (e.g., viral lung infections, viral myocarditis, viral meningitis, etc.), ulcerative colitis, endotoxic shock, arthritis (e.g., rheumatoid arthritis, juvenile arthritis, osteoarthritis, psoriatic arthritis, reactive arthritis, viral or post-viral arthritis, ankylosing spondylarthritis, etc.), psoriasis, Crohn's disease, inflammatory bowel disease, insulin dependent diabetes mellitus, injury independent type II diabetes, ischemia induced inflammation, otitis media (middle ear infection), gout, multiple sclerosis, cachexia, and Ataxia Telangiectasia. Arthritis defines a group of disease conditions (or arthropathies) where damage is caused to the joints of the body and includes osteoarthritis (also known as degenerative joint disease) which can occur following trauma to the joint, following an infection of the joint or as a result of aging. Other forms of arthritis include rheumatoid arthritis and psoriatic arthritis, which are autoimmune diseases, and septic arthritis is caused by infection in the joints. Cancer defines a group of diseases characterized by an abnormal proliferation of cells in the body, which can be defined as tumors, for example glioma. Types of gliomas include ependymomas, astrocytomas, oligodendrogliomas and mixed gliomas. A Grade 4 astrocytoma is also known as a glioblastoma.
[0098] In a fourth aspect, the invention provides a nucleic acid sequence in accordance with the second aspect of the invention for use in the treatment of inflammatory conditions or cancer. This aspect therefore extends to and includes a method for the treatment of inflammatory conditions or cancer comprising the administration to a subject a nucleic acid construct of the second aspect of the invention. Where the nucleic acid construct is used in the therapeutic method of the invention, the construct may be used as part of an expression construct, e.g. in the form of an expression vector such as a plasmid or virus. In such a method, the construct may be administered intravenously, intradermally, intramuscularly, orally or by other routes.
[0099] The nucleic acid construct of the second aspect of the invention, and proteins derived therefrom, may be employed alone or in conjunction with other compounds, such as therapeutic compounds, e.g. anti-inflammatory drugs, cytotoxic agents, cytostatic agents or antibiotics. The nucleic acid constructs and proteins useful in the present invention are preferably provided in an isolated form, and preferably are purified to homogeneity.
[0100] As used herein, the term "treatment" includes any regime that can benefit a human or a non-human animal. The treatment of "non-human animals" extends to the treatment of domestic animals, including horses and companion animals (e.g. cats and dogs) and farm/agricultural animals including members of the ovine, caprine, porcine, bovine and equine families. The treatment may be in respect of any existing condition or disorder, or may be prophylactic (preventive treatment). The treatment may be of an inherited or an acquired disease. The treatment may be of an acute or chronic condition. Preferably, the treatment is of a condition/disorder associated with inflammation. The first nucleic acid sequence of the nucleic acid construct of the third aspect of the invention may encode a protein for use in the treatment of the disorder, including, but not limited to osteoarthritis, scleroderma, renal disease, rheumatoid arthritis, inflammatory bowel disease, multiple sclerosis, atherosclerosis, cancer, or any inflammatory disease.
[0101] The nucleic acid construct of the second aspect of the invention may be used therapeutically in a method of the invention by way of gene therapy. Alternatively, protein encoded by the nucleic acid construct may be directly administered as described herein.
[0102] Administration of the nucleic acid construct of the second aspect may be directed to the target site by physical methods. Examples of these include topical administration of the "naked" nucleic acid in the form of a vector in an appropriate vehicle, for example, in solution in a pharmaceutically acceptable excipient, such as phosphate buffered saline, or administration of a vector by physical method such as particle bombardment according to methods known in the art.
[0103] Other physical methods for administering the nucleic acid construct or proteins of the third aspect of the invention directly to the recipient include ultrasound, electrical stimulation, electroporation and microseeding. Further methods of administration include oral administration or administration through inhalation.
[0104] Particularly preferred is the microseeding mode of delivery which is a system for delivering genetic material into cells in situ in a patient. This method is described in U.S. Pat. No. 5,697,901.
[0105] The nucleic acid construct according to the second aspect of the invention may also be administered by means of delivery vectors. These include viral delivery vectors, such as adenovirus, retrovirus or lentivirus delivery vectors known in the art. Other non-viral delivery vectors include lipid delivery vectors, including liposome delivery vectors known in the art.
[0106] Administration may also take place via transformed host cells. Such cells include cells harvested from the subject, into which the nucleic acid construct is transferred by gene transfer methods known in the art. Followed by the growth of the transformed cells in culture and grafting to the subject.
[0107] As used herein the term "gene therapy" refers to the introduction of genes by recombinant genetic engineering of body cells (somatic gene therapy) for the benefit of the patient. Furthermore, gene therapy can be divided into ex vivo and in vivo techniques. Ex vivo gene therapy relates to the removal of body cells from a patient, treatment of the removed cells with a vector i.e., a recombinant vector, and subsequent return of the treated cells to the patient. In vivo gene therapy relates to the direct administration of the recombinant gene vector by, for example, intravenous or intravascular means. Preferably the method of gene therapy of the present invention is carried out ex vivo.
[0108] Preferably in gene therapy, the expression vector of the present invention is administered such that it is expressed in the subject to be treated. Thus for human gene therapy, the promoter is preferably a human promoter from a human gene, or from a gene which is typically expressed in humans, such as the promoter from human CMV.
[0109] For gene therapy, the present invention may provide a method for manipulating the somatic cells of human and non-human mammals.
[0110] The present invention also provides a gene therapy method which may involve the manipulation of the germ line cells of a non-human mammal.
[0111] The present invention therefore provides a method for providing a human with a therapeutic protein comprising introducing mammalian cells into a human, the human cells having been treated in vitro to insert therein a nucleic acid construct according to the second aspect of the invention.
[0112] Each of the individual steps of the ex vivo somatic gene therapy method are also covered by the present invention. For example, the step of manipulating the cells removed from a patient with the nucleic acid construct of the third aspect of the invention in an appropriate vector. As used herein, the term "manipulated cells" covers cells transfected with a recombinant vector. Also contemplated is the use of the transfected cells in the manufacture of a medicament for the treatment of inflammatory conditions, such as arthritis or cancer, as defined herein above.
[0113] The present invention may also find application in veterinary medicine for treatment/prophylaxis of domestic animals including horses and companion animals (e.g. cats and dogs) and farm animals which may include mammals of the ovine, porcine, caprine, bovine and equine families.
[0114] The present invention also relates to compositions comprising the nucleic acid construct or proteins of the first or second aspects of the invention. Therefore, the fusion protein or nucleic acid constructs of the present invention may be employed in combination with the pharmaceutically acceptable carrier or carriers. Such carriers may include, but are not limited to, saline, buffered saline, dextrose, liposomes, water, glycerol, ethanol and combinations thereof.
[0115] The pharmaceutical compositions of the invention may comprise two fusion proteins according to the first aspect of the invention, wherein the fusion proteins are associated at the dimerisation domain in each fusion protein, or a nucleic acid sequence encoding two fusion proteins according to the first aspect of the invention.
[0116] The pharmaceutical compositions may be administered in any effective, convenient manner effective for treating a patient's disease including, for instance, administration by oral, topical, intravenous, intramuscular, intranasal, or intradermal routes among others. In therapy or as a prophylactic, the active agent may be administered to an individual as an injectable composition, for example as a sterile aqueous dispersion, preferably isotonic.
[0117] For administration to mammals, and particularly humans, it is expected that the daily dosage of the active agent will be from 0.01 mg/kg up to 10 mg/kg body weight, typically around 1 mg/kg. The physician in any event will determine the actual dosage which will be most suitable for an individual which will be dependent on factors including the age, weight, sex and response of the individual. The above dosages are exemplary of the average case. There can, of course, be instances where higher or lower dosages are merited, and such are within the scope of this invention
[0118] References to uses of the fusions proteins, nucleic acid constructs, vectors, or host cells of the present invention in the treatment of diseases, such as inflammatory diseases or cancer, includes embodiments relating to the use of the fusion protein, nucleic acid construct, vector, or host cell in the manufacture of a medicament for the treatment of said diseases.
[0119] A fifth aspect of the invention provides a fusion protein comprising a LAP, a pharmaceutically active agent and an amino acid sequence comprising a dimerisation domain, wherein the LAP and the pharmaceutically active agent are connected by an amino acid sequence comprising a proteolytic cleavage site for use in the treatment of inflammatory conditions or cancer. The pharmaceutically active agent may be as described above. In some embodiments of this aspect of the invention, the pharmaceutically active agent may be an siRNA or PNA molecule.
[0120] The invention further provides a nucleic acid construct encoding the fusion protein of the fifth aspect of the invention. The nucleic acid construct preferably comprises a nucleic acid sequence encoding a LAP adjacent a nucleic acid sequence encoding a proteolytic cleavage site. Preferably, the nucleic acid sequence encoding a LAP is suitably operably linked to a nucleic acid sequence encoding a proteolytic cleavage site.
[0121] The invention further provides the fusion protein of the fifth aspect of the invention optionally in association with latent TGF.beta. binding protein (LTBP) described herein.
[0122] The fusion protein of the fifth aspect of the invention may be associated with the pharmaceutically active agent by means of a peptide bond linkage. Alternatively, the fusion protein may be associated with the pharmaceutically active agent by means of a chemical linkage e.g. by cross-linking the fusion protein to a chemical compound such as a chemotherapeutic agent, synthetic drug or PNA.
[0123] Preferably, the pharmaceutically active agent is linked to the C-terminal end of the amino acid sequence of the proteolytic cleavage site in the fusion protein of the seventh aspect of the invention. More preferably, the pharmaceutically active agent is continuous with the C-terminal residue of the amino acid sequence of the proteolytic cleavage site.
[0124] The fusion protein, and associated pharmaceutically active agent of the fifth aspect of the invention may be employed alone or in conjunction with other compounds, such as therapeutic compounds, e.g. anti-inflammatory drugs, cytotoxic agents, cytostatic agents or antibiotics. Such administration may be simultaneous, separate or sequential. The components may be prepared in the form of a kit which may comprise instructions as appropriate.
[0125] Preferably, the fusion protein and associated pharmaceutically active agent of the fifth aspect of the invention are directly administered to a patient as described herein.
[0126] The present invention also relates to compositions comprising the fusion protein and associated pharmaceutically active agent of the fifth aspect of the invention. Therefore, the fusion protein and associated pharmaceutically active agent may be employed in combination with the pharmaceutically acceptable carrier or carriers. Such carriers may include, but are not limited to, saline, buffered saline, dextrose, liposomes, water, glycerol, polyethylene glycol, ethanol and combinations thereof.
[0127] The pharmaceutical compositions may be administered in any effective, convenient manner effective for treating a disease of a patient including, for instance, administration by oral, topical, intravenous, intramuscular, intranasal, or intradermal routes among others. In therapy or as a prophylactic, the active agent may be administered to an individual as an injectable composition, for example as a sterile aqueous dispersion, preferably isotonic.
[0128] A sixth aspect of the invention provides a kit of parts comprising a fusion protein of the first aspect of the invention, a nucleic acid construct of the second aspect of the invention, or a fusion protein and associated pharmaceutically active agent according to the fifth aspect of the invention, and an administration vehicle including, but not limited to, tablets for oral administration, inhalers for lung administration and injectable solutions for intravenous administration.
[0129] A seventh aspect of the invention provides a process for preparing the fusion protein, of the first aspect of the invention comprising production of the fusion protein recombinant by expression in a host cell, purification of the expressed fusion protein and association of the pharmaceutically active agent to the purified fusion protein by means of peptide bond linkage, hydrogen or salt bond or chemical cross linking. In some embodiments of this aspect of the invention where the pharmaceutically active agent is a peptide, the fusion protein could be prepared using hydrogen or salt bonds where the peptide is capable or multimerisation, for example dimerisation or trimerisation.
[0130] An eighth aspect of the invention provides a process for preparing a nucleic acid construct of the second aspect of the invention comprising ligating together nucleic acid sequences encoding a latency associated peptide, a proteolytic cleavage sequence, and a pharmaceutically active agent, optionally including a linker sequence on either side of the proteolytic cleavage site.
[0131] One embodiment of the present invention provides a method of providing latency to a pharmaceutically active agent which is a cytokine, preferably interferon or an interleukin or a cytokine inhibitor such as a scFV or soluble cytokine receptor, the method comprising constructing a fusion protein having a latency associated peptide, preferably from TGF.beta., associated with a proteolytic cleavage site, preferably an ADAM-TS4 cleavage site, and the pharmaceutically active agent. For example, the pharmaceutically active agent may be followed by the proteolytic cleavage site and the LAP as follows: Ig-LAP-cleavage site-active agent.
[0132] The present invention therefore enables the formation of a dimer which solves a problem of protein production in suspension cultures. The invention also enables (due to the closure of the "shell" of the LAP construct) the production of latent antibody fragments (e.g. inhibitors of cytokine action) as therapeutic agents. Previously, the fusion of a single peptide therapeutic agent with LAP left the "shell" partially open and thus potentially exposed without the need for release by protease action at the site of disease i.e. MMP/aggrecanase cleavage. For example, where the therapeutic peptide was a cytokine inhibitor, such as a scFv, it could interact if exposed with the cytokine before release at the site of disease.
[0133] The present invention can use any type of immunoglobulin (Ig) from any species (as their structure is extremely similar), but for certain clinical applications particular isotypes may be advantageous for example for in mucosal tissues IgA may be used, for binding to Fc receptors IgG2 may be used. Additionally, mutated forms of Fc that prevent them from binding to Fc receptors or activating cell mediated immunity or complement activation could be used if it was desirable for a particular clinical application.
[0134] All preferred features of the second and subsequent aspects of the invention are as for the first aspect mutatis mutandis.
[0135] The present invention will now be described by way of example only with reference to the accompanying figures wherein:
[0136] FIG. 1 shows amino acid sequences of the precursor domain of TGF.beta.1, 2 and 3 (human, Hu) (SEQ ID NOs:1-3), TGF.beta.4 (chicken, Ck) (SEQ ID NO:4), TGF.beta. (frog, Fg) (SEQ ID NO:5). Arrows indicate the position of the proteolytic processing resulting in cleavage of the signal peptide of TGF.beta.1 and of the mature TGF.beta.s. N-linked glycosylation sites are underlined, as is the integrin cellular recognition sequence (Roberts and Sporn, Peptide Growth Factors and their Receptors: Sporn, M B and Roberts, AB, Springer-Verlag, Chapter 8, 422 (1996)).
[0137] FIG. 2 shows the sequences of protein cleavage sites of matrix metalloproteinases (MMPs) (Nagase and Fields, Biopolymers, 40, 399-416 (1996)) (SEQ ID NOs:6-78).
[0138] FIG. 3 shows a multiple sequence alignment of the ADAMTS-4 epitope sequences with corresponding average percentage of phagemid cleavage and the derived ADAMTS-4 cleavage motif. Predominant amino acids found at a frequency of greater than 40% in a particular position are illustrated with a black background, in contrast to related amino acids which are shown with a grey background (reproduced from Hills et al J. Biol. Chem. 282 11101-11109 (2007)) (SEQ ID NOs:79-129).
[0139] FIG. 4A shows the theoretical structure of LAP. FIG. 4B shows the theoretical structure of Ig-LAP. FIG. 4C shows a schematic representation of Ig-LAP.
[0140] FIG. 5 shows the DNA sequence (SEQ ID NO:130) and predicted amino acid sequence (SEQ ID NO:131) of mouse IgG1 (Fc)-LAP-MMP-IFN. Initiator ATG is at position 10 and stop codon at position 2025.
[0141] FIG. 6 shows the expression of Ig-LAP-IFN in CHO cells. Western blotting of LAP-IFN (slot 1) or Ig-LAP-IFN (slot 2). 20 .mu.l supernatant from suspension CHO cells was run on 4-12% SDS-PAGE in non-denaturing conditions and blotted onto PDVF membrane. Then probed with goat anti-LAP antibodies. The bands were detected using HRP-conjugated anti-goat antibody by chemiluminescence (ECL, Amersham) and exposed to autoradiography.
[0142] FIG. 7 shows the cleavage of Ig-LAP-IFN by MMP1. 20 .mu.l of CHO cell supernatant was incubated at 37.degree. C. overnight without (slot 1) or with MMP1 (slot 2). The products were then run on a 4-12% SDS-PAGE gradient gel in non-denaturing conditions and blotted to a PDVF membrane. Anti-LAP antibodies were used to detect the uncleaved (slot 1) and cleaved product (slot 2). The molecular weight of the cleaved Ig-LAP corresponds to the expected size of about 160 kDa.
[0143] FIG. 8 shows the DNA sequence of human Ig-LAP with anti-herNeu2 (Herceptin.TM.) antibody (SEQ ID NOs:132-134). The cleavage site in this construct is an aggrecanase-specific site. The secretory signal peptide is derived from IL-2 (nucleotides 1-54), the human CH2 and CH3 domains are derived from IgG1 (nucleotides 55-757) and is followed by a spacer with unique restriction sites HindIII and Nco1 (nucleotides 758-771) and the human LAP sequence (nucleotides 772-1515) is followed by the aggrecanase cleavage site which is flanked by GGGGS (SEQ ID NO:135) linkers (nucleotides 1516-1584) and finally the Herceptin.TM. scFv ending in a poly His tail (nucleotides 1585-2370).
[0144] FIG. 9 shows the detection of Herceptin.TM. by anti-His antibody labelled with a fluorophore. Ig-LAP Herceptin.TM. was produced in CHO cells in suspension and the fusion protein was purified from cell supernatants by affinity chromatography using a Protein A column. A fraction from the purified material was applied to breast cancer cells expressing Her/neu2 (SKBR3) or non-expressing (MDA-MD-231) before or after cleavage by aggrecanase. Herceptin.TM. scFv binds to the P185 Her Neu2 on breast cancer cells only after aggrecanase release of Herceptin.TM. from the hulg-LAP fusion.
[0145] The invention is now described with reference to the following non-limiting examples:
Example 1: Cloning the IgG1 Fc Between the Signal Peptide of IL-2 and Mouse LAP
[0146] An example of the preparation of a construct of the invention is as follows. PCR of mouse IgG1 was used for cloning the IgG1 Fc into an EcoR1 site after the signal peptide of muLAP-MMP-IFN. The following oligonucleotides were designed for linking in frame the coding regions.
TABLE-US-00004 Sense oligo: (SEQ ID NO: 143) 5' ATG AAT TCC GGT TGT AAG CCTTGCATA Anti-sense oligo: (SEQ ID NO: 144) 5' GT GA ATT CCT CCA TGG AAG CTT TTT ACC AGG AGA GTG GGA GAG
[0147] The EcoR1 sites in the oligos are underlined and the Nco1 and HindIII sites are in bold. These latter sites were introduced to allow for direct cloning of additional MMP or aggrecanase cleavage sites. Because the IgG1 fragment could be inserted on the opposite orientation needed for in-frame translation, the resulting clones were analysed by restriction analysis and these containing the IgG1 fragment in the right orientation sent for DNA sequencing. FIG. 5 depicts the DNA sequence of a representative positive clone.
[0148] Ig-LAP-MMP-IFN is effectively secreted from CHO cells grown in suspension:
[0149] Supernatant from transiently transfected CHO cells was analysed after non-reducing SDS-PAGE by western blotting using a goat anti-LAP antibody (R&D systems) (FIG. 6). The molecular weight of the Ig-LAP-IFN was bigger than that of LAP-IFN (i.e. above 200 kDa as expected from a glycosylated dimerised protein).
Id-LAP-Fusion is Cleaved by Recombinant MMP1:
[0150] In order to establish whether the Ig-LAP-IFN fusion is still cleavable by MMP the inventors digested the protein in the CHO supernatant with recombinant MMP-1 overnight at 37.degree. C. The reaction was stopped with 25 mM EDTA and then analysed by western blotting after non-denaturing SDS-PAGE (FIG. 7).
After Cleavage with MMP, Ig-LAP-IFN Releases IFN Biological Activity:
[0151] Mouse L929 cells were plated at 104 cells/well in 96 well plates. The cells were incubated overnight with supernatants of Ig-LAP-IFN transfected CHO cell cultures that was treated or untreated with MMP-1 at double dilutions starting at 1:10. Then the medium was removed and the cells were infected with encephalomyocarditis virus in a volume of 50 .mu.l for 16 hours as described (Adams et al. 2003). Cells were washed in PBS and 100 ml of Cell titer-Glo (Promega) cell lysis buffer. After 20 minutes and room temperature, 50 .mu.l were transferred to an opaque plate and the luminescence (endogenous ATP levels) read in a Luminometer. 50% cell viability was assessed as half of luciferase activity compared to uninfected L929 cells (see Table 2 which shows MMP cleavage releases IFN activity from Ig-LAP-IFN).
TABLE-US-00005 TABLE 2 Luciferase IFN biological Treatment activity activity (U/ml) Control No infection 10973 N/A Control (+EMC) 64 Ig-LAP-IFN (no MMP1) + EMC 64 0 IgLAP-IFN (+MMP) + EMC 5068 30
Example 2: Human IQ-Human LAP with Anti-herNeu2 (Herceptin.TM.) Antibody
[0152] Human Ig-human LAP with anti-herNeu2 (Herceptin.TM.--Markiv et al. BMC Biotechnology 2011, 11:117) antibody construct was created with an aggrecanase-specific cleavage site in CHO cells in suspension (FIG. 8). The secretory signal peptide was derived from IL-2 (nucleotides 1-54), the human CH2 and CH3 domains were derived from IgG1 (nucleotides 55-757). In the construct the human CH2 and CH3 domains are followed by a spacer with unique restriction sites HindIII and Nco1 (nucleotides 758-771). The human LAP sequence (nucleotides 772-1515) is followed by the aggrecanase cleavage site which is flanked by GGGGS linkers (nucleotides 1516-1584). The Herceptin.TM. scFv ends in a poly His tail (nucleotides 1585-2370). The fusion protein was purified from cell supernatants by affinity chromatography using a Protein A column. A fraction from the purified material was applied to breast cancer cells expressing Her/neu2 (SKBR3) or non-expressing (MDA-MD-231) before or after cleavage by aggrecanase. The bound Herceptin.TM. was detected by anti-His antibody labelled with a fluorophore (FIG. 9).
Sequence CWU
1
1
1441390PRTHomo sapiens 1Met Pro Pro Ser Gly Leu Arg Leu Leu Pro Leu Leu
Leu Pro Leu Leu1 5 10
15Trp Leu Leu Val Leu Thr Pro Gly Pro Pro Ala Ala Gly Leu Ser Thr
20 25 30Cys Lys Thr Ile Asp Met Glu
Leu Val Lys Arg Lys Arg Ile Glu Ala 35 40
45Ile Arg Gly Gln Ile Leu Ser Lys Leu Arg Leu Ala Ser Pro Pro
Ser 50 55 60Gln Gly Glu Val Pro Pro
Gly Pro Leu Pro Glu Ala Val Leu Ala Leu65 70
75 80Tyr Asn Ser Thr Arg Asp Arg Val Ala Gly Glu
Ser Ala Glu Pro Glu 85 90
95Pro Glu Pro Glu Ala Asp Tyr Tyr Ala Lys Glu Val Ile Arg Val Leu
100 105 110Met Val Glu Thr His Asn
Glu Ile Tyr Asp Lys Phe Lys Gln Ser Thr 115 120
125His Ser Ile Tyr Met Phe Phe Asn Thr Ser Glu Leu Arg Glu
Ala Val 130 135 140Pro Glu Pro Val Leu
Leu Ser Arg Ala Glu Leu Arg Leu Leu Arg Leu145 150
155 160Lys Leu Lys Val Glu Gln His Val Glu Leu
Tyr Gln Lys Tyr Ser Asn 165 170
175Asn Ser Trp Arg Tyr Leu Ser Asn Arg Leu Leu Ala Pro Ser Asp Ser
180 185 190Pro Glu Trp Leu Ser
Phe Asp Val Thr Gly Val Val Arg Gln Trp Leu 195
200 205Ser Arg Gly Gly Glu Ile Glu Gly Phe Arg Leu Ser
Ala His Cys Ser 210 215 220Gly Asp Ser
Arg Asp Asn Thr Leu Gln Val Asp Ile Asn Gly Phe Thr225
230 235 240Thr Gly Arg Arg Gly Asp Leu
Ala Thr Ile His Gly Met Trp Arg Pro 245
250 255Phe Leu Leu Leu Met Ala Thr Pro Leu Glu Arg Ala
Gln Lys Leu Gln 260 265 270Ser
Ser Arg His Arg Arg Ala Leu Asp Thr Asn Tyr Cys Phe Ser Ser 275
280 285Thr Glu Lys Asn Cys Cys Val Arg Gln
Leu Tyr Ile Asp Phe Arg Lys 290 295
300Asp Leu Gly Trp Lys Trp Ile His Glu Pro Lys Gly Tyr His Ala Asn305
310 315 320Phe Cys Leu Gly
Pro Cys Pro Tyr Ile Trp Ser Leu Asp Thr Gln Tyr 325
330 335Ser Lys Val Leu Ala Leu Tyr Asn Gln His
Asn Pro Gly Ala Ser Ala 340 345
350Ala Pro Cys Cys Val Pro Gln Ala Leu Glu Pro Leu Pro Ile Val Tyr
355 360 365Tyr Val Gly Arg Lys Pro Lys
Val Glu Gln Leu Ser Asn Met Ile Val 370 375
380Arg Ser Cys Lys Cys Ser385 3902414PRTHomo sapiens
2Met His Tyr Cys Val Leu Ser Ala Phe Leu Ile Leu His Leu Val Thr1
5 10 15Val Ala Leu Ser Leu Ser
Thr Cys Ser Thr Leu Asp Met Asp Gln Phe 20 25
30Met Arg Lys Arg Ile Glu Ala Ile Arg Gly Gln Ile Leu
Ser Lys Leu 35 40 45Lys Leu Thr
Ser Pro Pro Glu Asp Tyr Pro Glu Pro Glu Glu Val Pro 50
55 60Pro Glu Val Ile Ser Ile Tyr Asn Ser Thr Arg Asp
Leu Leu Gln Glu65 70 75
80Lys Ala Ser Arg Arg Ala Ala Ala Cys Glu Arg Glu Arg Ser Asp Glu
85 90 95Glu Tyr Tyr Ala Lys Glu
Val Tyr Lys Ile Asp Met Pro Pro Phe Phe 100
105 110Pro Ser Glu Asn Ala Ile Pro Pro Thr Phe Tyr Arg
Pro Tyr Phe Arg 115 120 125Ile Val
Arg Phe Asp Val Ser Ala Met Glu Lys Asn Ala Ser Asn Leu 130
135 140Val Lys Ala Glu Phe Arg Val Phe Arg Leu Gln
Asn Pro Lys Ala Arg145 150 155
160Val Pro Glu Gln Arg Ile Glu Leu Tyr Gln Ile Leu Lys Ser Lys Asp
165 170 175Leu Ile Ser Pro
Thr Gln Arg Tyr Ile Asp Ser Lys Val Val Lys Thr 180
185 190Arg Ala Glu Gly Glu Trp Leu Ser Phe Asp Val
Thr Asp Ala Val His 195 200 205Glu
Trp Leu His His Lys Asp Arg Trp Leu Gly Phe Lys Ile Ser Leu 210
215 220His Cys Pro Cys Cys Thr Phe Val Pro Ser
Asn Asn Tyr Ile Ile Pro225 230 235
240Asn Lys Ser Glu Glu Leu Glu Ala Arg Phe Ala Gly Ile Asp Gly
Thr 245 250 255Ser Thr Tyr
Thr Ser Gly Asp Gln Lys Thr Ile Lys Ser Thr Arg Lys 260
265 270Lys Asn Ser Gly Lys Thr Pro His Leu Leu
Leu Met Leu Leu Pro Ser 275 280
285Tyr Arg Leu Glu Ser Gln Gln Thr Asn Arg Arg Lys Lys Arg Ala Leu 290
295 300Asp Ala Ala Tyr Cys Phe Arg Asn
Val Gln Asp Asn Cys Cys Leu Arg305 310
315 320Pro Leu Tyr Ile Asp Phe Lys Arg Asp Leu Gly Trp
Lys Trp Ile His 325 330
335Glu Pro Lys Gly Tyr Asn Ala Asn Phe Cys Ala Gly Ala Cys Pro Tyr
340 345 350Leu Trp Ser Ser Asp Thr
Gln His Ser Arg Val Leu Ser Leu Tyr Asn 355 360
365Thr Ile Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys Val Ser
Gln Asp 370 375 380Leu Glu Pro Leu Thr
Ile Leu Tyr Tyr Ile Gly Lys Ile Pro Lys Ile385 390
395 400Glu Gln Leu Ser Asn Met Ile Val Lys Ser
Cys Lys Cys Ser 405 4103412PRTHomo sapiens
3Met Lys Met His Leu Gln Arg Ala Leu Val Val Leu Ala Leu Leu His1
5 10 15Phe Ala Thr Val Ser Leu
Ser Leu Ser Thr Cys Thr Thr Leu Asp Phe 20 25
30Gly His Ile Lys Lys Lys Arg Val Glu Ala Ile Arg Gly
Gln Ile Leu 35 40 45Ser Lys Leu
Arg Leu Thr Ser Pro Pro Glu Pro Thr Val Met Thr His 50
55 60Val Pro Tyr Gln Val Leu Ala Leu Tyr Asn Ser Thr
Arg Glu Leu Leu65 70 75
80Glu Glu His Gly Glu Arg Lys Glu Glu Gly Cys Thr Gln Glu Asn Thr
85 90 95Glu Ser Glu Tyr Tyr Ala
Lys Glu Ile His Lys Phe Asp Met Ile Gln 100
105 110Gly Leu Ala Glu His Asn Glu Leu Ala Val Cys Pro
Lys Gly Ile Thr 115 120 125Ser Lys
Val Phe Arg Phe Asn Val Ser Ser Val Glu Lys Asn Arg Thr 130
135 140Asn Leu Phe Arg Ala Glu Phe Arg Val Leu Arg
Val Pro Asn Pro Ser145 150 155
160Ser Lys Arg Asn Glu Gln Arg Ile Glu Leu Phe Gln Ile Leu Arg Pro
165 170 175Asp Glu His Ile
Ala Lys Gln Arg Tyr Ile Gly Gly Lys Asn Leu Pro 180
185 190Thr Arg Gly Thr Ala Glu Trp Leu Ser Phe Asp
Val Thr Asp Thr Val 195 200 205Arg
Glu Trp Leu Leu Arg Arg Glu Ser Asn Leu Gly Leu Glu Ile Ser 210
215 220Ile His Cys Pro Cys His Thr Phe Gln Pro
Asn Gly Asp Ile Leu Glu225 230 235
240Asn Ile His Glu Val Met Glu Ile Lys Phe Lys Gly Val Asp Asn
Glu 245 250 255Asp Asp His
Gly Arg Gly Asp Leu Gly Arg Leu Lys Lys Gln Lys Asp 260
265 270Asn Asn Asn Pro His Leu Ile Leu Met Met
Ile Pro Pro His Arg Leu 275 280
285Asp Asn Pro Gly Gln Gly Gly Gln Arg Lys Lys Arg Ala Leu Asp Thr 290
295 300Asn Tyr Cys Phe Arg Asn Leu Glu
Glu Asn Cys Cys Val Arg Pro Leu305 310
315 320Tyr Ile Asp Phe Arg Gln Asp Leu Gly Trp Lys Trp
Val His Glu Pro 325 330
335Lys Gly Tyr Tyr Ala Asn Phe Cys Ser Gly Pro Cys Pro Tyr Leu Arg
340 345 350Ser Ala Asp Thr Thr His
Ser Thr Val Leu Gly Leu Tyr Asn Thr Leu 355 360
365Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys Val Pro Gln Asp
Leu Glu 370 375 380Pro Leu Thr Ile Leu
Tyr Tyr Val Gly Arg Ile Pro Lys Val Glu Gln385 390
395 400Leu Ser Asn Met Val Val Lys Ser Cys Lys
Cys Ser 405 4104304PRTGallus gallus 4Met
Asp Pro Met Ser Ile Gly Pro Lys Ser Cys Gly Gly Ser Pro Trp1
5 10 15Arg Pro Pro Gly Thr Ala Pro
Trp Ser Ile Gly Ser Arg Arg Ala Thr 20 25
30Ala Ser Ser Ser Cys Ser Thr Ser Ser Arg Val Arg Ala Glu
Val Gly 35 40 45Gly Arg Ala Leu
Leu His Arg Ala Glu Leu Arg His Leu Arg Gln Lys 50 55
60Ala Ala Ala Asp Ser Ala Gly Thr Glu Gln Arg Leu Glu
Leu Tyr Gln65 70 75
80Gly Tyr Gly Asn Ala Ser Trp Arg Tyr Leu His Gly Arg Ser Val Arg
85 90 95Ala Thr Ala Asp Asp Glu
Trp Leu Ser Phe Asp Val Thr Asp Ala Val 100
105 110His Gln Trp Leu Ser Gly Ser Glu Leu Ile Gly Val
Phe Lys Leu Ser 115 120 125Val His
Cys Pro Cys Glu Met Gly Pro Gly His Ala Asp Glu Met Arg 130
135 140Ile Ser Ile Glu Gly Phe Glu Gln Gln Arg Gly
Asp Met Gln Ser Ile145 150 155
160Ala Lys Lys His Arg Arg Val Pro Tyr Val Leu Ala Met Ala Leu Pro
165 170 175Ala Glu Arg Ala
Asn Glu Leu His Ser Ala Arg Arg Arg Arg Asp Leu 180
185 190Asp Thr Asp Tyr Cys Phe Gly Pro Gly Thr Asp
Glu Lys Asn Cys Cys 195 200 205Val
Arg Pro Leu Tyr Ile Asp Phe Arg Lys Asp Leu Gln Trp Lys Trp 210
215 220Ile His Glu Pro Lys Gly Tyr Met Ala Asn
Phe Cys Met Gly Pro Cys225 230 235
240Pro Tyr Ile Trp Ser Ala Asp Thr Gln Tyr Ile Lys Val Leu Ala
Leu 245 250 255Tyr Asn Gln
His Asn Pro Gly Ala Ser Ala Ala Pro Cys Cys Val Pro 260
265 270Gln Ile Leu Asp Pro Leu Pro Ile Ile Tyr
Tyr Val Gly Arg Asn Val 275 280
285Arg Val Glu Gln Leu Ser Asn Met Val Val Arg Ala Cys Lys Cys Ser 290
295 3005383PRTXenopus laevis 5Met Glu Val
Leu Trp Met Leu Leu Val Leu Leu Val Leu His Leu Ser1 5
10 15Ser Leu Ala Asn Ser Leu Ser Thr Cys
Lys Ala Val Asp Met Glu Glu 20 25
30Val Arg Lys Arg Arg Ile Glu Ala Ile Arg Gly Gln Ile Leu Ser Lys
35 40 45Leu Lys Leu Asp Lys Ile Pro
Asp Val Asp Ser Glu Lys Met Thr Val 50 55
60Pro Ser Glu Ala Ile Phe Leu Tyr Asn Ser Thr Leu Glu Val Ile Arg65
70 75 80Glu Lys Ala Thr
Arg Glu Glu Glu Glu His Val Gly His Asp Gln Asn 85
90 95Ile Gln Asp Tyr Tyr Ala Lys Gln Val Tyr
Arg Phe Glu Ser Ile Thr 100 105
110Glu Leu Glu Asp His Glu Phe Lys Phe Lys Phe Asn Ala Ser His Val
115 120 125Arg Glu Asn Val Gly Met Asn
Ser Leu Leu His His Ala Glu Leu Arg 130 135
140Met Tyr Lys Lys Gln Thr Asp Lys Asn Met Asp Gln Arg Met Glu
Leu145 150 155 160Phe Trp
Lys Tyr Gln Glu Asn Gly Thr Thr His Ser Arg Tyr Leu Glu
165 170 175Ser Lys Tyr Ile Thr Pro Val
Thr Asp Gln Glu Trp Asn Ser Phe Asp 180 185
190Val Thr Lys Thr Val Asn Glu Trp Leu Lys Arg Ala Glu Glu
Asn Glu 195 200 205Gln Phe Gly Leu
Gln Pro Ala Gly Lys Gly Pro Thr Pro Gln Ala Lys 210
215 220Asp Ile Asp Ile Glu Gly Phe Pro Ala Leu Arg Gly
Asp Leu Ala Ser225 230 235
240Leu Ser Ser Lys Glu Asn Thr Lys Pro Tyr Leu Met Ile Thr Ser Met
245 250 255Pro Ala Glu Arg Ile
Asp Thr Val Thr Ser Ser Arg Lys Lys Arg Gly 260
265 270Val Gly Gln Glu Tyr Cys Phe Gly Asn Asn Gly Pro
Asn Cys Cys Val 275 280 285Lys Pro
Leu Tyr Ile Asn Phe Arg Lys Asp Leu Gly Trp Lys Trp Ile 290
295 300His Glu Pro Lys Gly Tyr Glu Ala Asn Tyr Cys
Leu Gly Asn Cys Pro305 310 315
320Tyr Ile Trp Ser Met Asp Thr Gln Tyr Ser Lys Val Leu Ser Leu Tyr
325 330 335Asn Gln His Asn
Pro Gly Ala Ser Ile Ser Pro Cys Cys Val Pro Asp 340
345 350Val Leu Glu Pro Leu Pro Ile Ile Tyr Tyr Val
Gly Arg Ile Ala Lys 355 360 365Val
Glu Gln Leu Ser Asn Met Val Val Arg Ser Cys Asn Cys Ser 370
375 38068PRTHomo sapiens 6Ala Pro Gln Gly Ile Ala
Gly Gln1 578PRTHomo sapiens 7Gly Pro Gln Gly Leu Leu Gly
Ala1 588PRTHomo sapiens 8Gly Pro Gln Gly Leu Ala Gly Gln1
598PRTHomo sapiens 9Gly Pro Leu Gly Ile Ala Gly Ile1
5108PRTHomo sapiens 10Gly Pro Glu Gly Leu Arg Val Gly1
5118PRTRattus sp. 11Ala Ala Tyr His Leu Val Ser Gln1
5128PRTRattus sp. 12Met Asp Ala Phe Leu Glu Ser Ser1
5138PRTRattus sp. 13Glu Pro Gln Ala Leu Ala Met Ser1
5148PRTRattus sp. 14Gln Ala Leu Ala Met Ser Ala Ile1
5158PRTGallus gallus 15Pro Ser Tyr Phe Leu Asn Ala Gly1
5168PRTHomo sapiens 16Tyr Glu Ala Gly Leu Gly Val Val1
5178PRTHomo sapiens 17Ala Gly Leu Gly Val Val Glu Arg1
5188PRTHomo sapiens 18Ala Gly Leu Gly Ile Ser Ser Thr1
5198PRTHomo sapiens 19Gly Ala Met Phe Leu Glu Ala Ile1
5208PRTHomo sapiens 20Ile Pro Glu Asn Phe Phe Gly Val1
5218PRTHomo sapiens 21Thr Glu Gly Glu Ala Arg Gly Ser1
5228PRTHomo sapiens 22Arg Ala Ile His Ile Gln Ala Glu1
5238PRTHomo sapiens 23Leu Arg Ala Tyr Leu Leu Pro Ala1
5248PRTCavia porcellusMOD_RES(3)..(3)Xaa is Hyp 24Gly Ala Xaa Gly Leu Glx
Gly His1 5258PRTRattus sp. 25Gly Pro Gln Gly Val Arg Gly
Glu1 5268PRTRattus sp. 26Gly Pro Ala Gly Val Gln Gly Pro1
5278PRTRattus sp.MOD_RES(6)..(6)Xaa is Hyp 27Gly Pro Ser Gly
Leu Xaa Gly Pro1 5288PRTRattus sp. 28Gly Pro Ala Gly Glu
Arg Gly Ser1 5298PRTRattus sp. 29Gly Ala Lys Gly Leu Thr
Gly Ser1 5308PRTRattus sp. 30Gly Pro Ala Gly Gln Asp Gly
Pro1 5318PRTRattus sp. 31Gly Pro Ala Gly Phe Ala Gly Pro1
5328PRTRattus sp. 32Gly Pro Ile Gly Asn Val Gly Ala1
5338PRTRattus sp.MOD_RES(3)..(3)Hyl 33Gly Pro Lys Gly Ser Arg Gly
Ala1 5348PRTBos taurus 34Gly Pro Gln Gly Ile Ala Gly Gln1
5358PRTBos taurus 35Gly Pro Gln Gly Leu Leu Gly Ala1
5368PRTHomo sapiens 36Ile Pro Glu Asn Phe Phe Gly Val1
5378PRTHomo sapiens 37Pro Pro Gly Ala Tyr His Gly Ala1
5388PRTHomo sapiens 38Arg Ala Ile His Ile Gln Ala Glu1
5398PRTHomo sapiens 39Gly Pro His Leu Leu Val Glu Ala1
5408PRTHomo sapiens 40Leu Arg Ala Tyr Leu Leu Pro Ala1
5418PRTHomo sapiens 41Gly Pro Glu Gly Leu Arg Val Gly1
5428PRTHomo sapiens 42Arg Val Gly Phe Tyr Glu Ser Asp1
5438PRTHomo sapiens 43Leu Leu Ser Ala Leu Val Glu Thr1
5448PRTHomo sapiens 44Glu Ala Ile Pro Met Ser Ile Pro1
5458PRTHomo sapiens 45Ile Ala Gly Arg Ser Leu Asn Pro1
5468PRTGallus gallus 46Leu Asn Ala Gly Phe Thr Ala Ser1
5478PRTHomo sapiens 47Ile Pro Glu Asn Phe Phe Gly Val1
5488PRTHomo sapiens 48Lys Pro Gln Gln Phe Phe Gly Leu1
5498PRTHomo sapiens 49Asp Val Ala Gln Phe Val Leu Thr1
5508PRTHomo sapiens 50Asp Thr Leu Glu Val Met Arg Lys1
5518PRTHomo sapiens 51Asp Val Gly His Phe Arg Thr Phe1
5528PRTHomo sapiens 52Asp Ser Gly Gly Phe Met Leu Thr1
5538PRTHomo sapiens 53Arg Val Ala Glu Met Arg Gly Glu1
5548PRTHomo sapiens 54Asp Leu Gly Arg Phe Gln Thr Phe1
5558PRTHomo sapiens 55Pro Phe Ser Pro Leu Val Ala Thr1
5568PRTHomo sapiens 56Leu Arg Ala Tyr Leu Leu Pro Ala1
5578PRTHomo sapiens 57Ala Pro Gly Asn Ala Ser Glu Ser1
5588PRTHomo sapiens 58Phe Ser Ser Glu Ser Lys Arg Glu1
5598PRTBos taurus 59Ala Gly Gly Ala Gln Met Gly Val1
5608PRTBos taurus 60Gln Met Gly Val Met Gln Gly Pro1
5618PRTBos taurus 61Met Ala Ala Ser Leu Lys Arg Pro1
5624PRTBos taurus 62Ala Lys Arg Glu1634PRTBos taurus 63Leu Arg Lys
Pro1648PRTBos taurus 64Gln Ala Gln Ala Ile Leu Gln Gln1
5658PRTHomo sapiens 65Arg Ala Ile His Ile Gln Ala Glu1
5668PRTBos taurus 66Leu Val Glu Ala Leu Tyr Leu Val1
5678PRTBos taurus 67Glu Ala Leu Tyr Leu Val Cys Gly1
5688PRTHomo sapiens 68Ile Pro Glu Asn Phe Phe Gly Val1
5698PRTHomo sapiens 69Gly Pro His Leu Leu Val Glu Ala1
5708PRTHomo sapiens 70Pro Pro Glu Glu Leu Lys Phe Gln1
5718PRTHomo sapiens 71Gly Pro Pro Gly Val Val Gly Pro1
5728PRTHomo sapiens 72Gly Pro Pro Gly Leu Arg Gly Glu1
5738PRTHomo sapiens 73Gly Pro Gly Gly Val Val Gly Glu1
5748PRTHomo sapiens 74Ile Pro Glu Asn Phe Phe Gly Val1
5758PRTHomo sapiens 75Pro Pro Gly Ala Tyr His Gly Ala1
5768PRTHomo sapiens 76Arg Ala Ile His Ile Gln Ala Glu1
5778PRTHomo sapiens 77Arg Ala Ile His Ile Gln Ala Glu1
5788PRTHomo sapiens 78Gly Pro His Leu Leu Val Glu Ala1
57913PRTArtificial sequenceSynthetic sequence Protein cleavage site of
ADAMTS-4 (aggrecanase-1) 79Met Met Phe Lys Gly Gln Arg Val Glu Arg Val
Leu Thr1 5 108013PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 80His Asn Glu Phe Arg Gln Arg Glu Thr Tyr Met Val Phe1
5 108113PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 81Asn Trp
Gln Glu Phe Gln Ala Lys Arg Ser Val Ala Tyr1 5
108213PRTArtificial sequenceSynthetic sequence Protein cleavage site
of ADAMTS-4 (aggrecanase-1) 82Leu Glu Leu Lys Ser Asn Ser Val Ile
Met Arg Trp Pro1 5 108313PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 83Asp Tyr Met Glu Val Arg Arg Gln Met Ser Met Gln Met1
5 108413PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 84Ala Leu
Glu Met Arg Ala Ala Asp Val Glu Tyr His Phe1 5
108513PRTArtificial sequenceSynthetic sequence Protein cleavage site
of ADAMTS-4 (aggrecanase-1) 85Val Glu His Leu Met Glu Val Gln Arg
Lys Thr Thr Trp1 5 108613PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 86Gly Val Glu Val Lys Arg Gln Leu Ser Tyr His Tyr Met1
5 108713PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 87Gln Glu
Leu Val Gly Ala Asn Ile Glu Thr Tyr Met Leu1 5
108813PRTArtificial sequenceSynthetic sequence Protein cleavage site
of ADAMTS-4 (aggrecanase-1) 88Gln Gln Met Glu Val Ser Arg Tyr Val
Gln Tyr Lys Trp1 5 108913PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 89Leu Gln Ser Phe Arg Gln Ala Pro Val Asp Ile Trp Trp1
5 109013PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 90Gln Glu
Leu Arg Gly Lys Ile Ser Ile Gln Pro Phe Lys1 5
109113PRTArtificial sequenceSynthetic sequence Protein cleavage site
of ADAMTS-4 (aggrecanase-1) 91Gln Gln Glu Tyr Met Ser Gly Gln Tyr
Asp Ile Ile Phe1 5 109213PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 92Ser Met Glu Phe Ala Ala Thr Val Thr Ser Thr Phe Glu1
5 109313PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 93Glu Gln
Gln Leu Lys Gly Arg Gln Thr His Ile Ile Ile1 5
109413PRTArtificial sequenceSynthetic sequence Protein cleavage site
of ADAMTS-4 (aggrecanase-1) 94Met Glu Leu Lys Gly Gln Thr Asp Met
Phe Tyr Ile Ile1 5 109513PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 95Gly Ala Tyr Ala Val Gly Arg Trp Ser Tyr Val Asp Ala1
5 109613PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 96Gly Gln
Phe Ala Thr Ser Pro Lys Ile Thr Ile His Lys1 5
109713PRTArtificial sequenceSynthetic sequence Protein cleavage site
of ADAMTS-4 (aggrecanase-1) 97Asp Val Gln Glu Phe Arg Gly Val Thr
Ala Val Ile Arg1 5 109813PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 98His Glu Ala Arg Thr Val Ser Thr Thr Tyr Leu Met Leu1
5 109913PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 99Tyr Met
Glu Met Arg Gly Ser Thr Thr Val Phe Phe Asn1 5
1010013PRTArtificial sequenceSynthetic sequence Protein cleavage
site of ADAMTS-4 (aggrecanase-1) 100Gln Glu Leu Ile Gly Ser Tyr Ser
Val Met Pro Thr Asn1 5
1010113PRTArtificial sequenceSynthetic sequence Protein cleavage site of
ADAMTS-4 (aggrecanase-1) 101His Tyr Tyr Met Glu Ala Thr Arg Asp Ile
Glu Met Val1 5 1010213PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 102Asn Glu Ala His Ser Ser Gly Ile Thr Ile Met Leu Arg1
5 1010313PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 103Asp
His Pro Met Glu Phe Arg Ser Lys Ile Thr Met Lys1 5
1010413PRTArtificial sequenceSynthetic sequence Protein cleavage
site of ADAMTS-4 (aggrecanase-1) 104Thr Phe Ala Glu Met Lys Gly Thr
Val Ser Tyr Ala Leu1 5
1010513PRTArtificial sequenceSynthetic sequence Protein cleavage site of
ADAMTS-4 (aggrecanase-1) 105Gly Val His Met Glu Ser Met Arg Arg Tyr
Thr Val Ile1 5 1010613PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 106Phe Gln Glu Tyr Thr Gly Thr Tyr Asp Ile Met Asp Pro1
5 1010713PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 107Phe
Gln Ala Val Glu Ala Ser Lys Thr Leu His Phe Trp1 5
1010813PRTArtificial sequenceSynthetic sequence Protein cleavage
site of ADAMTS-4 (aggrecanase-1) 108Tyr Leu Glu Thr Ser Arg Thr Tyr
Thr Thr Val Trp Pro1 5
1010913PRTArtificial sequenceSynthetic sequence Protein cleavage site of
ADAMTS-4 (aggrecanase-1) 109Thr Asp Tyr Leu Glu Val Arg Ser Gln Pro
Ile Ile Tyr1 5 1011013PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 110Thr Phe Glu Gln Glu Val Arg Ala Pro Asn Ile Ser Trp1
5 1011112PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 111Pro
Gln Glu Val Gln Gly Ile Ala Val Glu Trp Val1 5
1011213PRTArtificial sequenceSynthetic sequence Protein cleavage
site of ADAMTS-4 (aggrecanase-1) 112Ala Glu Ala Lys Ala Ser Thr Leu
His Val Tyr Leu Met1 5
1011313PRTArtificial sequenceSynthetic sequence Protein cleavage site of
ADAMTS-4 (aggrecanase-1) 113Asp Tyr Met Glu Val Val Gly Asn Lys Ile
Ser Tyr Ile1 5 1011413PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 114Val Ile Met Glu Ala Val Gly Arg Lys Thr Ile Leu Gln1
5 1011513PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 115Phe
Gln Ala Glu Ala Ala Arg Ala Val Thr Tyr Ser Ser1 5
1011613PRTArtificial sequenceSynthetic sequence Protein cleavage
site of ADAMTS-4 (aggrecanase-1) 116Glu Asp Tyr Val Tyr Val Lys Asp
Val Gly Thr Thr Asn1 5
1011713PRTArtificial sequenceSynthetic sequence Protein cleavage site of
ADAMTS-4 (aggrecanase-1) 117Gln Glu Tyr Lys Ala His His Ser Tyr Lys
Leu Met Ser1 5 1011813PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 118Tyr Asn Glu Tyr Arg Ala Thr Pro Thr Phe Ala Val Val1
5 1011913PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 119Glu
Tyr Phe His Ala Asn Thr Thr Arg Ile Val Gln Ser1 5
1012013PRTArtificial sequenceSynthetic sequence Protein cleavage
site of ADAMTS-4 (aggrecanase-1) 120Ala Leu Glu Ala Ser Arg Phe Ile
Ser Trp Asp Ile Asn1 5
1012113PRTArtificial sequenceSynthetic sequence Protein cleavage site of
ADAMTS-4 (aggrecanase-1) 121Trp Glu Ala Val Ala Ala Pro Ile Met His
Thr Trp Val1 5 1012212PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 122Phe Gln Glu Leu Lys Ala Ala Glu Thr Phe Trp Met1
5 1012313PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 123Asn
Thr Leu Tyr Ala Val Ala Pro Pro Val Ile Tyr Val1 5
1012413PRTArtificial sequenceSynthetic sequence Protein cleavage
site of ADAMTS-4 (aggrecanase-1) 124Phe Gln Pro Tyr Glu Val Gln Arg
Ile Thr Thr Val Met1 5
1012513PRTArtificial sequenceSynthetic sequence Protein cleavage site of
ADAMTS-4 (aggrecanase-1) 125Lys Pro Met Glu Ser Gly Arg Arg Thr Thr
Val Tyr Tyr1 5 1012613PRTArtificial
sequenceSynthetic sequence Protein cleavage site of ADAMTS-4
(aggrecanase-1) 126Met Glu Phe Lys Gly Ala Leu Gln Tyr Arg Leu Gln Pro1
5 1012713PRTArtificial sequenceSynthetic
sequence Protein cleavage site of ADAMTS-4 (aggrecanase-1) 127Pro
Gln Glu Val Lys Gln Ala Arg Lys Trp Ile Ile Glu1 5
1012813PRTArtificial sequenceSynthetic sequence Protein cleavage
site of ADAMTS-4 (aggrecanase-1) 128Tyr Arg Gln Gln Glu Val Lys Arg
His Ile Gln Ile Val1 5
101299PRTArtificial sequenceSynthetic sequence Consensus ADAMTS-4
cleavage motifVARIANT(2)..(2)Xaa is Ala, Phe, Val, Leu, Met, or
TyrVARIANT(3)..(3)May be present or absent. If present, Xaa
indicates no obvious consensusVARIANT(4)..(4)Xaa is Arg or
LysVARIANT(5)..(7)Any 1 Xaa may be present or absent represents a
range of 2-3 amino acids. If present, Xaa indicates no obvious
consensusVARIANT(8)..(8)Xaa is Ser or ThrVARIANT(9)..(9)Xaa is Val, Tyr,
Ile, Phe, Trp, Met, Leu, or Ala 129Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa1 51302037DNAArtificial sequenceSynthetic sequence Mouse
lG1(Fc)-LAP-MMP-IFNCDS(1)..(2028) 130gga tcc acc atg tac agg atg caa ctc
ctg tgc att gca cta agt ctt 48Gly Ser Thr Met Tyr Arg Met Gln Leu
Leu Cys Ile Ala Leu Ser Leu1 5 10
15gca ctt gtc acg aat tcc ggt tgt aag cct tgc ata tgt aca gtc
cca 96Ala Leu Val Thr Asn Ser Gly Cys Lys Pro Cys Ile Cys Thr Val
Pro 20 25 30gaa gta tca tct
gtc ttc atc ttc ccc cca aag ccc aag gat gtg ctc 144Glu Val Ser Ser
Val Phe Ile Phe Pro Pro Lys Pro Lys Asp Val Leu 35
40 45acc att act ctg act cct aag gtc acg tgt gtt gtg
gta gac atc agc 192Thr Ile Thr Leu Thr Pro Lys Val Thr Cys Val Val
Val Asp Ile Ser 50 55 60aag gat gat
ccc gag gtc cag ttc agc tgg ttt gta gat gat gtg gag 240Lys Asp Asp
Pro Glu Val Gln Phe Ser Trp Phe Val Asp Asp Val Glu65 70
75 80gtg cac aca gct cag acg caa ccc
cgg gag gag cag ttc aac agc act 288Val His Thr Ala Gln Thr Gln Pro
Arg Glu Glu Gln Phe Asn Ser Thr 85 90
95ttc cgc tca gtc agt gaa ctt ccc atc atg cac cag gac tgg
ctc aat 336Phe Arg Ser Val Ser Glu Leu Pro Ile Met His Gln Asp Trp
Leu Asn 100 105 110ggc aag gag
ttc aaa tgc agg gtc aac agt gca gct ttc cct gcc ccc 384Gly Lys Glu
Phe Lys Cys Arg Val Asn Ser Ala Ala Phe Pro Ala Pro 115
120 125atc gag aaa acc atc tcc aaa acc aaa ggc aga
ccg aag gct cca cag 432Ile Glu Lys Thr Ile Ser Lys Thr Lys Gly Arg
Pro Lys Ala Pro Gln 130 135 140gtg tac
acc att cca cct ccc aag gag cag atg gcc aag gat aaa gtc 480Val Tyr
Thr Ile Pro Pro Pro Lys Glu Gln Met Ala Lys Asp Lys Val145
150 155 160agt ctg acc tgc atg ata aca
gac ttc ttc cct gaa gac att act gtg 528Ser Leu Thr Cys Met Ile Thr
Asp Phe Phe Pro Glu Asp Ile Thr Val 165
170 175gag tgg cag tgg aat ggg cag cca gcg gag aac tac
aag aac act cag 576Glu Trp Gln Trp Asn Gly Gln Pro Ala Glu Asn Tyr
Lys Asn Thr Gln 180 185 190ccc
atc atg gac aca gat ggc tct tac ttc gtc tac agc aag ctc aat 624Pro
Ile Met Asp Thr Asp Gly Ser Tyr Phe Val Tyr Ser Lys Leu Asn 195
200 205gtg cag aag agc aac tgg gag gca gga
aat act ttc acc tgc tct gtg 672Val Gln Lys Ser Asn Trp Glu Ala Gly
Asn Thr Phe Thr Cys Ser Val 210 215
220tta cat gag ggc ctg cac aac cac cat act gag aag agc ctc tcc cac
720Leu His Glu Gly Leu His Asn His His Thr Glu Lys Ser Leu Ser His225
230 235 240tct cct ggt aaa
aag ctt cca tgg agg aat tcc tcc acc agc aag acc 768Ser Pro Gly Lys
Lys Leu Pro Trp Arg Asn Ser Ser Thr Ser Lys Thr 245
250 255atc gac atg gag ctg gtg aaa cgg aag cgc
atc gaa gcc atc cgt ggc 816Ile Asp Met Glu Leu Val Lys Arg Lys Arg
Ile Glu Ala Ile Arg Gly 260 265
270cag atc ctg tcc aaa cta agg ctc gcc agt ccc cca agc cag ggg gag
864Gln Ile Leu Ser Lys Leu Arg Leu Ala Ser Pro Pro Ser Gln Gly Glu
275 280 285gta ccg ccc ggc ccg ctg ccc
gag gcg gtg ctc gct ttg tac aac agc 912Val Pro Pro Gly Pro Leu Pro
Glu Ala Val Leu Ala Leu Tyr Asn Ser 290 295
300acc cgc gac cgg gtg gca ggc gag agc gcc gac cca gag ccg gag ccc
960Thr Arg Asp Arg Val Ala Gly Glu Ser Ala Asp Pro Glu Pro Glu Pro305
310 315 320gaa gcg gac tac
tat gct aaa gag gtc acc cgc gtg cta atg gtg gac 1008Glu Ala Asp Tyr
Tyr Ala Lys Glu Val Thr Arg Val Leu Met Val Asp 325
330 335cgc aac aac gcc atc tat gag aaa acc aaa
gac atc tca cac agt ata 1056Arg Asn Asn Ala Ile Tyr Glu Lys Thr Lys
Asp Ile Ser His Ser Ile 340 345
350tat atg ttc ttc aat acg tca gac att cgg gaa gca gtg ccc gaa ccc
1104Tyr Met Phe Phe Asn Thr Ser Asp Ile Arg Glu Ala Val Pro Glu Pro
355 360 365cca ttg ctg tcc cgt gca gag
ctg cgc ttg cag aga tta aaa tca agt 1152Pro Leu Leu Ser Arg Ala Glu
Leu Arg Leu Gln Arg Leu Lys Ser Ser 370 375
380gtg gag caa cat gtg gaa ctc tac cag aaa tat agc aac aat tcc tgg
1200Val Glu Gln His Val Glu Leu Tyr Gln Lys Tyr Ser Asn Asn Ser Trp385
390 395 400cgt tac ctt ggt
aac cgg ctg ctg acc ccc act gat acg cct gag tgg 1248Arg Tyr Leu Gly
Asn Arg Leu Leu Thr Pro Thr Asp Thr Pro Glu Trp 405
410 415ctg tct ttt gac gtc act gga gtt gta cgg
cag tgg ctg aac caa gga 1296Leu Ser Phe Asp Val Thr Gly Val Val Arg
Gln Trp Leu Asn Gln Gly 420 425
430gac gga ata cag ggc ttt cga ttc agc gct cac tgc tct tgt gac agc
1344Asp Gly Ile Gln Gly Phe Arg Phe Ser Ala His Cys Ser Cys Asp Ser
435 440 445aaa gat aac aaa ctc cac gtg
gaa atc aac ggg atc agc ccc aaa cgt 1392Lys Asp Asn Lys Leu His Val
Glu Ile Asn Gly Ile Ser Pro Lys Arg 450 455
460cgg ggc gac ctg ggc acc atc cat gac atg aac cgg ccc ttc ctg ctc
1440Arg Gly Asp Leu Gly Thr Ile His Asp Met Asn Arg Pro Phe Leu Leu465
470 475 480ctc atg gcc acc
ccc ctg gaa agg gcc cag cac ctg cac agc ctg cag 1488Leu Met Ala Thr
Pro Leu Glu Arg Ala Gln His Leu His Ser Leu Gln 485
490 495ggg gga ggc ggt tcc ccg ctc ggg ctt tgg
gcg gga ggg ggc tca gcg 1536Gly Gly Gly Gly Ser Pro Leu Gly Leu Trp
Ala Gly Gly Gly Ser Ala 500 505
510gcc gca atc aac tat aag cag ctc cag ctc caa gaa agg acg aac att
1584Ala Ala Ile Asn Tyr Lys Gln Leu Gln Leu Gln Glu Arg Thr Asn Ile
515 520 525cgg aaa tgt cag gag ctc ctg
gag cag ctg aat gga aag atc aac ctc 1632Arg Lys Cys Gln Glu Leu Leu
Glu Gln Leu Asn Gly Lys Ile Asn Leu 530 535
540acc tac agg gcg gac ttc aag atc cct atg gag atg acg gag aag atg
1680Thr Tyr Arg Ala Asp Phe Lys Ile Pro Met Glu Met Thr Glu Lys Met545
550 555 560cag aag agt tac
act gcc ttt gcc atc caa gag atg ctc cag aat gtc 1728Gln Lys Ser Tyr
Thr Ala Phe Ala Ile Gln Glu Met Leu Gln Asn Val 565
570 575ttt ctt gtc ttc aga aac aat ttc tcc agc
act ggg tgg aat gag act 1776Phe Leu Val Phe Arg Asn Asn Phe Ser Ser
Thr Gly Trp Asn Glu Thr 580 585
590att gtt gta cgt ctc ctg gat gaa ctc cac cag cag aca gtg ttt ctg
1824Ile Val Val Arg Leu Leu Asp Glu Leu His Gln Gln Thr Val Phe Leu
595 600 605aag aca gta cta gag gaa aag
caa gag gaa aga ttg acg tgg gag atg 1872Lys Thr Val Leu Glu Glu Lys
Gln Glu Glu Arg Leu Thr Trp Glu Met 610 615
620tcc tca act gct ctc cac ttg aag agc tat tac tgg agg gtg caa agg
1920Ser Ser Thr Ala Leu His Leu Lys Ser Tyr Tyr Trp Arg Val Gln Arg625
630 635 640tac ctt aaa ctc
atg aag tac aac agc tac gcc tgg atg gtg gtc cga 1968Tyr Leu Lys Leu
Met Lys Tyr Asn Ser Tyr Ala Trp Met Val Val Arg 645
650 655gca gag atc ttc agg aac ttt ctc atc att
cga aga ctt acc aga aac 2016Ala Glu Ile Phe Arg Asn Phe Leu Ile Ile
Arg Arg Leu Thr Arg Asn 660 665
670ttc caa aac tga atctagacc
2037Phe Gln Asn 675131675PRTArtificial sequenceSynthetic Construct
131Gly Ser Thr Met Tyr Arg Met Gln Leu Leu Cys Ile Ala Leu Ser Leu1
5 10 15Ala Leu Val Thr Asn Ser
Gly Cys Lys Pro Cys Ile Cys Thr Val Pro 20 25
30Glu Val Ser Ser Val Phe Ile Phe Pro Pro Lys Pro Lys
Asp Val Leu 35 40 45Thr Ile Thr
Leu Thr Pro Lys Val Thr Cys Val Val Val Asp Ile Ser 50
55 60Lys Asp Asp Pro Glu Val Gln Phe Ser Trp Phe Val
Asp Asp Val Glu65 70 75
80Val His Thr Ala Gln Thr Gln Pro Arg Glu Glu Gln Phe Asn Ser Thr
85 90 95Phe Arg Ser Val Ser Glu
Leu Pro Ile Met His Gln Asp Trp Leu Asn 100
105 110Gly Lys Glu Phe Lys Cys Arg Val Asn Ser Ala Ala
Phe Pro Ala Pro 115 120 125Ile Glu
Lys Thr Ile Ser Lys Thr Lys Gly Arg Pro Lys Ala Pro Gln 130
135 140Val Tyr Thr Ile Pro Pro Pro Lys Glu Gln Met
Ala Lys Asp Lys Val145 150 155
160Ser Leu Thr Cys Met Ile Thr Asp Phe Phe Pro Glu Asp Ile Thr Val
165 170 175Glu Trp Gln Trp
Asn Gly Gln Pro Ala Glu Asn Tyr Lys Asn Thr Gln 180
185 190Pro Ile Met Asp Thr Asp Gly Ser Tyr Phe Val
Tyr Ser Lys Leu Asn 195 200 205Val
Gln Lys Ser Asn Trp Glu Ala Gly Asn Thr Phe Thr Cys Ser Val 210
215 220Leu His Glu Gly Leu His Asn His His Thr
Glu Lys Ser Leu Ser His225 230 235
240Ser Pro Gly Lys Lys Leu Pro Trp Arg Asn Ser Ser Thr Ser Lys
Thr 245 250 255Ile Asp Met
Glu Leu Val Lys Arg Lys Arg Ile Glu Ala Ile Arg Gly 260
265 270Gln Ile Leu Ser Lys Leu Arg Leu Ala Ser
Pro Pro Ser Gln Gly Glu 275 280
285Val Pro Pro Gly Pro Leu Pro Glu Ala Val Leu Ala Leu Tyr Asn Ser 290
295 300Thr Arg Asp Arg Val Ala Gly Glu
Ser Ala Asp Pro Glu Pro Glu Pro305 310
315 320Glu Ala Asp Tyr Tyr Ala Lys Glu Val Thr Arg Val
Leu Met Val Asp 325 330
335Arg Asn Asn Ala Ile Tyr Glu Lys Thr Lys Asp Ile Ser His Ser Ile
340 345 350Tyr Met Phe Phe Asn Thr
Ser Asp Ile Arg Glu Ala Val Pro Glu Pro 355 360
365Pro Leu Leu Ser Arg Ala Glu Leu Arg Leu Gln Arg Leu Lys
Ser Ser 370 375 380Val Glu Gln His Val
Glu Leu Tyr Gln Lys Tyr Ser Asn Asn Ser Trp385 390
395 400Arg Tyr Leu Gly Asn Arg Leu Leu Thr Pro
Thr Asp Thr Pro Glu Trp 405 410
415Leu Ser Phe Asp Val Thr Gly Val Val Arg Gln Trp Leu Asn Gln Gly
420 425 430Asp Gly Ile Gln Gly
Phe Arg Phe Ser Ala His Cys Ser Cys Asp Ser 435
440 445Lys Asp Asn Lys Leu His Val Glu Ile Asn Gly Ile
Ser Pro Lys Arg 450 455 460Arg Gly Asp
Leu Gly Thr Ile His Asp Met Asn Arg Pro Phe Leu Leu465
470 475 480Leu Met Ala Thr Pro Leu Glu
Arg Ala Gln His Leu His Ser Leu Gln 485
490 495Gly Gly Gly Gly Ser Pro Leu Gly Leu Trp Ala Gly
Gly Gly Ser Ala 500 505 510Ala
Ala Ile Asn Tyr Lys Gln Leu Gln Leu Gln Glu Arg Thr Asn Ile 515
520 525Arg Lys Cys Gln Glu Leu Leu Glu Gln
Leu Asn Gly Lys Ile Asn Leu 530 535
540Thr Tyr Arg Ala Asp Phe Lys Ile Pro Met Glu Met Thr Glu Lys Met545
550 555 560Gln Lys Ser Tyr
Thr Ala Phe Ala Ile Gln Glu Met Leu Gln Asn Val 565
570 575Phe Leu Val Phe Arg Asn Asn Phe Ser Ser
Thr Gly Trp Asn Glu Thr 580 585
590Ile Val Val Arg Leu Leu Asp Glu Leu His Gln Gln Thr Val Phe Leu
595 600 605Lys Thr Val Leu Glu Glu Lys
Gln Glu Glu Arg Leu Thr Trp Glu Met 610 615
620Ser Ser Thr Ala Leu His Leu Lys Ser Tyr Tyr Trp Arg Val Gln
Arg625 630 635 640Tyr Leu
Lys Leu Met Lys Tyr Asn Ser Tyr Ala Trp Met Val Val Arg
645 650 655Ala Glu Ile Phe Arg Asn Phe
Leu Ile Ile Arg Arg Leu Thr Arg Asn 660 665
670Phe Gln Asn 6751322370DNAArtificial
sequenceSynthetic sequence Human lg Lap with anti- herNeu2
(Herceptin) antibodyCDS(1)..(2370) 132atg tac agg atg caa ctc ctg tct tgc
att gca cta agt ctt gca ctt 48Met Tyr Arg Met Gln Leu Leu Ser Cys
Ile Ala Leu Ser Leu Ala Leu1 5 10
15gtc acg aat tcc gag ccc aaa tct tgt gac aaa act cac aca tgc
cca 96Val Thr Asn Ser Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys
Pro 20 25 30ccg tgc cca gca
cct gaa gct gcg ggg gga ccg tca gtc ttc ctc ttc 144Pro Cys Pro Ala
Pro Glu Ala Ala Gly Gly Pro Ser Val Phe Leu Phe 35
40 45ccc cca aaa ccc aag gac acc ctc atg atc tcc cgg
acc cct gag gtc 192Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
Thr Pro Glu Val 50 55 60aca tgc gtg
gtg gtg gac gtg agc cac gaa gac cct gag gtc aag ttc 240Thr Cys Val
Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe65 70
75 80aac tgg tac gtg gac ggc gtg gag
gtg cat aat gcc aag aca aag ccg 288Asn Trp Tyr Val Asp Gly Val Glu
Val His Asn Ala Lys Thr Lys Pro 85 90
95cgg gag gag cag tac aac agc acg tac cgt gtg gtc agc gtc
ctc acc 336Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val
Leu Thr 100 105 110gtc ctg cac
cag gac tgg ctg aat ggc aag gag tac aag tgc aag gtc 384Val Leu His
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val 115
120 125tcc aac aaa gcc ctc cca gcc ccc atc gag aaa
acc atc tcc aaa gcc 432Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys
Thr Ile Ser Lys Ala 130 135 140aaa ggg
cag ccc cga gaa cca cag gtg tac acc ctg ccc cca tcc cgg 480Lys Gly
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg145
150 155 160gag gag atg acc aag aac cag
gtc agc ctg acc tgc ctg gtc aaa ggc 528Glu Glu Met Thr Lys Asn Gln
Val Ser Leu Thr Cys Leu Val Lys Gly 165
170 175ttc tat ccc agc gac atc gcc gtg gag tgg gag agc
aat ggg cag ccg 576Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
Asn Gly Gln Pro 180 185 190gag
aac aac tac aag acc acg cct ccc gtg ctg gac tcc gac ggc tcc 624Glu
Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser 195
200 205ttc ttc ctc tat agc aag ctc acc gtg
gac aag agc agg tgg cag cag 672Phe Phe Leu Tyr Ser Lys Leu Thr Val
Asp Lys Ser Arg Trp Gln Gln 210 215
220ggg aac gtc ttc tca tgc tcc gtg atg cat gag gct ctg cac aac cac
720Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His225
230 235 240tac acg cag aag
agc ctc tcc ctg tct ccg ggt aaa aag ctt cca tgg 768Tyr Thr Gln Lys
Ser Leu Ser Leu Ser Pro Gly Lys Lys Leu Pro Trp 245
250 255agg aat tcc cta tcc acc agt aag act atc
gac atg gag ctg gtg aag 816Arg Asn Ser Leu Ser Thr Ser Lys Thr Ile
Asp Met Glu Leu Val Lys 260 265
270cgg aag cgc atc gag gcc atc cgc ggc cag atc ctg tcc aag ctg cgg
864Arg Lys Arg Ile Glu Ala Ile Arg Gly Gln Ile Leu Ser Lys Leu Arg
275 280 285ctc gcc agc ccc ccg agc cag
ggg gag gtg ccg ccc ggc ccg ctg ccc 912Leu Ala Ser Pro Pro Ser Gln
Gly Glu Val Pro Pro Gly Pro Leu Pro 290 295
300gag gcc gtg ctc gcc ctg tac aac agc acc cgc gac cgg gtg gcc ggg
960Glu Ala Val Leu Ala Leu Tyr Asn Ser Thr Arg Asp Arg Val Ala Gly305
310 315 320gag agt gca gaa
ccg gag ccc gag cct gag gcc gac tac tac gcc aag 1008Glu Ser Ala Glu
Pro Glu Pro Glu Pro Glu Ala Asp Tyr Tyr Ala Lys 325
330 335gag gtc acc cgc gtg cta atg gtg gaa acc
cac aac gaa atc tat gac 1056Glu Val Thr Arg Val Leu Met Val Glu Thr
His Asn Glu Ile Tyr Asp 340 345
350aag ttc aag cag agt aca cac agc ata tat atg ttc ttc aac aca tca
1104Lys Phe Lys Gln Ser Thr His Ser Ile Tyr Met Phe Phe Asn Thr Ser
355 360 365gag ctc cga gaa gcg gta cct
gaa ccc gtg ttg ctc tcc cgg gca gag 1152Glu Leu Arg Glu Ala Val Pro
Glu Pro Val Leu Leu Ser Arg Ala Glu 370 375
380ctg cgt ctg ctg agg ctc aag tta aaa gtg gag cag cac gtg gag ctg
1200Leu Arg Leu Leu Arg Leu Lys Leu Lys Val Glu Gln His Val Glu Leu385
390 395 400tac cag aaa tac
agc aac aat tcc tgg cga tac ctc agc aac cgg ctg 1248Tyr Gln Lys Tyr
Ser Asn Asn Ser Trp Arg Tyr Leu Ser Asn Arg Leu 405
410 415ctg gca ccc agc gac tcg cca gag tgg tta
tct ttt gat gtc acc gga 1296Leu Ala Pro Ser Asp Ser Pro Glu Trp Leu
Ser Phe Asp Val Thr Gly 420 425
430gtt gtg cgg cag tgg ttg agc cgt gga ggg gaa att gag ggc ttt cgc
1344Val Val Arg Gln Trp Leu Ser Arg Gly Gly Glu Ile Glu Gly Phe Arg
435 440 445ctt agc gcc cac tgc tcc tgt
gac agc agg gat aac aca ctg caa gtg 1392Leu Ser Ala His Cys Ser Cys
Asp Ser Arg Asp Asn Thr Leu Gln Val 450 455
460gac atc aac ggg ttc act acc ggc cgc cga ggt gac ctg gcc acc att
1440Asp Ile Asn Gly Phe Thr Thr Gly Arg Arg Gly Asp Leu Ala Thr Ile465
470 475 480cat ggc atg aac
cgg cct ttc ctg ctt ctc atg gcc acc ccg ctg gag 1488His Gly Met Asn
Arg Pro Phe Leu Leu Leu Met Ala Thr Pro Leu Glu 485
490 495agg gcc cag cat ctg caa agc ctg cag gga
ggc ggg ggt tca gac gtc 1536Arg Ala Gln His Leu Gln Ser Leu Gln Gly
Gly Gly Gly Ser Asp Val 500 505
510caa gag ttc cgc ggc gtc aca gct gtg atc cgt gga ggc ggg ggt tca
1584Gln Glu Phe Arg Gly Val Thr Ala Val Ile Arg Gly Gly Gly Gly Ser
515 520 525gcg gcc cag atg gcc gag gtt
cag ctg gtg gag tct ggc ggt ggc ctg 1632Ala Ala Gln Met Ala Glu Val
Gln Leu Val Glu Ser Gly Gly Gly Leu 530 535
540gtg cag cca ggg ggc tca ctc cgt ttg tcc tgt gca gct tct ggc ttc
1680Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe545
550 555 560aac att aaa gac
acc tat ata cac tgg gtg cgt cag gcc ccg ggt aag 1728Asn Ile Lys Asp
Thr Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys 565
570 575ggc ctg gaa tgg gtt gca agg att tat cct
acg aat ggt tat gct aga 1776Gly Leu Glu Trp Val Ala Arg Ile Tyr Pro
Thr Asn Gly Tyr Ala Arg 580 585
590tat gcc gat agc gtc aag ggc cgt ttc act ata agc gca gac aca tcc
1824Tyr Ala Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser
595 600 605aaa aac aca gcc tac ctg cag
atg aac agc ctg cgt gct gag gac act 1872Lys Asn Thr Ala Tyr Leu Gln
Met Asn Ser Leu Arg Ala Glu Asp Thr 610 615
620gcc gtc tat tat tgt tct aga tgg gga ggg gac ggc ttc tat gct atg
1920Ala Val Tyr Tyr Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met625
630 635 640gac tac tgg ggt
caa gga acc ctg gtc acc gtc tcc tcg ggc gga ggc 1968Asp Tyr Trp Gly
Gln Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly 645
650 655gga tcc ggc gga ggc gga tcg gga ggc gga
gga tcc gat atc cag atg 2016Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly
Gly Ser Asp Ile Gln Met 660 665
670acc cag tcc ccg agc tcc ctg tcc gcc tct gtg ggc gat agg gtc acc
2064Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr
675 680 685atc acc tgc cgt gcc agt cag
gat gtg aat act gct gta gcc tgg tat 2112Ile Thr Cys Arg Ala Ser Gln
Asp Val Asn Thr Ala Val Ala Trp Tyr 690 695
700caa cag aaa cca gga aaa gct ccg aaa cta ctg att tac tcg gca tcc
2160Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Ser Ala Ser705
710 715 720ttc ctc tac tct
gga gtc ctt tct cgc ttc tct ggg tcc aga tct ggg 2208Phe Leu Tyr Ser
Gly Val Leu Ser Arg Phe Ser Gly Ser Arg Ser Gly 725
730 735acg gat ttc act ctg acc atc agc agt ctg
cag ccg gag gac ttc gca 2256Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu
Gln Pro Glu Asp Phe Ala 740 745
750act tat tac tgt cag caa cat tat act act cct ccc acg ttc gga cag
2304Thr Tyr Tyr Cys Gln Gln His Tyr Thr Thr Pro Pro Thr Phe Gly Gln
755 760 765ggt acc aag gtg gag atc aaa
cgc gcg gat gcg gcc gca ctc gag cac 2352Gly Thr Lys Val Glu Ile Lys
Arg Ala Asp Ala Ala Ala Leu Glu His 770 775
780cac cac cac cac cac tga
2370His His His His His785133789PRTArtificial sequenceSynthetic
Construct 133Met Tyr Arg Met Gln Leu Leu Ser Cys Ile Ala Leu Ser Leu Ala
Leu1 5 10 15Val Thr Asn
Ser Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 20
25 30Pro Cys Pro Ala Pro Glu Ala Ala Gly Gly
Pro Ser Val Phe Leu Phe 35 40
45Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val 50
55 60Thr Cys Val Val Val Asp Val Ser His
Glu Asp Pro Glu Val Lys Phe65 70 75
80Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr
Lys Pro 85 90 95Arg Glu
Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr 100
105 110Val Leu His Gln Asp Trp Leu Asn Gly
Lys Glu Tyr Lys Cys Lys Val 115 120
125Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala
130 135 140Lys Gly Gln Pro Arg Glu Pro
Gln Val Tyr Thr Leu Pro Pro Ser Arg145 150
155 160Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys
Leu Val Lys Gly 165 170
175Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro
180 185 190Glu Asn Asn Tyr Lys Thr
Thr Pro Pro Val Leu Asp Ser Asp Gly Ser 195 200
205Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp
Gln Gln 210 215 220Gly Asn Val Phe Ser
Cys Ser Val Met His Glu Ala Leu His Asn His225 230
235 240Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
Gly Lys Lys Leu Pro Trp 245 250
255Arg Asn Ser Leu Ser Thr Ser Lys Thr Ile Asp Met Glu Leu Val Lys
260 265 270Arg Lys Arg Ile Glu
Ala Ile Arg Gly Gln Ile Leu Ser Lys Leu Arg 275
280 285Leu Ala Ser Pro Pro Ser Gln Gly Glu Val Pro Pro
Gly Pro Leu Pro 290 295 300Glu Ala Val
Leu Ala Leu Tyr Asn Ser Thr Arg Asp Arg Val Ala Gly305
310 315 320Glu Ser Ala Glu Pro Glu Pro
Glu Pro Glu Ala Asp Tyr Tyr Ala Lys 325
330 335Glu Val Thr Arg Val Leu Met Val Glu Thr His Asn
Glu Ile Tyr Asp 340 345 350Lys
Phe Lys Gln Ser Thr His Ser Ile Tyr Met Phe Phe Asn Thr Ser 355
360 365Glu Leu Arg Glu Ala Val Pro Glu Pro
Val Leu Leu Ser Arg Ala Glu 370 375
380Leu Arg Leu Leu Arg Leu Lys Leu Lys Val Glu Gln His Val Glu Leu385
390 395 400Tyr Gln Lys Tyr
Ser Asn Asn Ser Trp Arg Tyr Leu Ser Asn Arg Leu 405
410 415Leu Ala Pro Ser Asp Ser Pro Glu Trp Leu
Ser Phe Asp Val Thr Gly 420 425
430Val Val Arg Gln Trp Leu Ser Arg Gly Gly Glu Ile Glu Gly Phe Arg
435 440 445Leu Ser Ala His Cys Ser Cys
Asp Ser Arg Asp Asn Thr Leu Gln Val 450 455
460Asp Ile Asn Gly Phe Thr Thr Gly Arg Arg Gly Asp Leu Ala Thr
Ile465 470 475 480His Gly
Met Asn Arg Pro Phe Leu Leu Leu Met Ala Thr Pro Leu Glu
485 490 495Arg Ala Gln His Leu Gln Ser
Leu Gln Gly Gly Gly Gly Ser Asp Val 500 505
510Gln Glu Phe Arg Gly Val Thr Ala Val Ile Arg Gly Gly Gly
Gly Ser 515 520 525Ala Ala Gln Met
Ala Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu 530
535 540Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala
Ala Ser Gly Phe545 550 555
560Asn Ile Lys Asp Thr Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys
565 570 575Gly Leu Glu Trp Val
Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Ala Arg 580
585 590Tyr Ala Asp Ser Val Lys Gly Arg Phe Thr Ile Ser
Ala Asp Thr Ser 595 600 605Lys Asn
Thr Ala Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr 610
615 620Ala Val Tyr Tyr Cys Ser Arg Trp Gly Gly Asp
Gly Phe Tyr Ala Met625 630 635
640Asp Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Gly Gly Gly
645 650 655Gly Ser Gly Gly
Gly Gly Ser Gly Gly Gly Gly Ser Asp Ile Gln Met 660
665 670Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
Gly Asp Arg Val Thr 675 680 685Ile
Thr Cys Arg Ala Ser Gln Asp Val Asn Thr Ala Val Ala Trp Tyr 690
695 700Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu
Leu Ile Tyr Ser Ala Ser705 710 715
720Phe Leu Tyr Ser Gly Val Leu Ser Arg Phe Ser Gly Ser Arg Ser
Gly 725 730 735Thr Asp Phe
Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala 740
745 750Thr Tyr Tyr Cys Gln Gln His Tyr Thr Thr
Pro Pro Thr Phe Gly Gln 755 760
765Gly Thr Lys Val Glu Ile Lys Arg Ala Asp Ala Ala Ala Leu Glu His 770
775 780His His His His
His7851342370DNAArtificial sequenceSynthetic construct 134tcagtggtgg
tggtggtggt gctcgagtgc ggccgcatcc gcgcgtttga tctccacctt 60ggtaccctgt
ccgaacgtgg gaggagtagt ataatgttgc tgacagtaat aagttgcgaa 120gtcctccggc
tgcagactgc tgatggtcag agtgaaatcc gtcccagatc tggacccaga 180gaagcgagaa
aggactccag agtagaggaa ggatgccgag taaatcagta gtttcggagc 240ttttcctggt
ttctgttgat accaggctac agcagtattc acatcctgac tggcacggca 300ggtgatggtg
accctatcgc ccacagaggc ggacagggag ctcggggact gggtcatctg 360gatatcggat
cctccgcctc ccgatccgcc tccgccggat ccgcctccgc ccgaggagac 420ggtgaccagg
gttccttgac cccagtagtc catagcatag aagccgtccc ctccccatct 480agaacaataa
tagacggcag tgtcctcagc acgcaggctg ttcatctgca ggtaggctgt 540gtttttggat
gtgtctgcgc ttatagtgaa acggcccttg acgctatcgg catatctagc 600ataaccattc
gtaggataaa tccttgcaac ccattccagg cccttacccg gggcctgacg 660cacccagtgt
atataggtgt ctttaatgtt gaagccagaa gctgcacagg acaaacggag 720tgagccccct
ggctgcacca ggccaccgcc agactccacc agctgaacct cggccatctg 780ggccgctgaa
cccccgcctc cacggatcac agctgtgacg ccgcggaact cttggacgtc 840tgaacccccg
cctccctgca ggctttgcag atgctgggcc ctctccagcg gggtggccat 900gagaagcagg
aaaggccggt tcatgccatg aatggtggcc aggtcacctc ggcggccggt 960agtgaacccg
ttgatgtcca cttgcagtgt gttatccctg ctgtcacagg agcagtgggc 1020gctaaggcga
aagccctcaa tttcccctcc acggctcaac cactgccgca caactccggt 1080gacatcaaaa
gataaccact ctggcgagtc gctgggtgcc agcagccggt tgctgaggta 1140tcgccaggaa
ttgttgctgt atttctggta cagctccacg tgctgctcca cttttaactt 1200gagcctcagc
agacgcagct ctgcccggga gagcaacacg ggttcaggta ccgcttctcg 1260gagctctgat
gtgttgaaga acatatatat gctgtgtgta ctctgcttga acttgtcata 1320gatttcgttg
tgggtttcca ccattagcac gcgggtgacc tccttggcgt agtagtcggc 1380ctcaggctcg
ggctccggtt ctgcactctc cccggccacc cggtcgcggg tgctgttgta 1440cagggcgagc
acggcctcgg gcagcgggcc gggcggcacc tccccctggc tcggggggct 1500ggcgagccgc
agcttggaca ggatctggcc gcggatggcc tcgatgcgct tccgcttcac 1560cagctccatg
tcgatagtct tactggtgga tagggaattc ctccatggaa gctttttacc 1620cggagacagg
gagaggctct tctgcgtgta gtggttgtgc agagcctcat gcatcacgga 1680gcatgagaag
acgttcccct gctgccacct gctcttgtcc acggtgagct tgctatagag 1740gaagaaggag
ccgtcggagt ccagcacggg aggcgtggtc ttgtagttgt tctccggctg 1800cccattgctc
tcccactcca cggcgatgtc gctgggatag aagcctttga ccaggcaggt 1860caggctgacc
tggttcttgg tcatctcctc ccgggatggg ggcagggtgt acacctgtgg 1920ttctcggggc
tgccctttgg ctttggagat ggttttctcg atgggggctg ggagggcttt 1980gttggagacc
ttgcacttgt actccttgcc attcagccag tcctggtgca ggacggtgag 2040gacgctgacc
acacggtacg tgctgttgta ctgctcctcc cgcggctttg tcttggcatt 2100atgcacctcc
acgccgtcca cgtaccagtt gaacttgacc tcagggtctt cgtggctcac 2160gtccaccacc
acgcatgtga cctcaggggt ccgggagatc atgagggtgt ccttgggttt 2220tggggggaag
aggaagactg acggtccccc cgcagcttca ggtgctgggc acggtgggca 2280tgtgtgagtt
ttgtcacaag atttgggctc ggaattcgtg acaagtgcaa gacttagtgc 2340aatgcaagac
aggagttgca tcctgtacat
23701355PRTArtificial sequenceSynthetic sequence Linker 135Gly Gly Gly
Gly Ser1 513615PRTArtificial sequenceSynthetic sequence
Linker 136Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser1
5 10 151375PRTArtificial
sequenceCommon sequence motif present in hinge polypeptide
regionVARIANT(3)..(3)Xaa can be another residue that does not
interfere with dimerisation, for example proline (P), arginine (R)
or serine (S) 137Cys Pro Xaa Cys Pro1 513815PRTHomo sapiens
138Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro1
5 10 1513912PRTHomo sapiens 139Glu
Arg Lys Cys Cys Val Glu Cys Pro Pro Cys Pro1 5
1014017PRTHomo sapiens 140Glu Leu Lys Thr Pro Leu Gly Asp Thr Thr
His Thr Cys Pro Arg Cys1 5 10
15Pro14115PRTHomo sapiens 141Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro
Cys Pro Arg Cys Pro1 5 10
1514212PRTHomo sapiens 142Glu Ser Lys Tyr Gly Pro Pro Cys Pro Ser Cys
Pro1 5 1014327DNAArtificial
sequenceSynthetic primer 143atgaattccg gttgtaagcc ttgcata
2714443DNAArtificial sequenceSynthetic primer
144gtgaattcct ccatggaagc tttttaccag gagagtggga gag
43
User Contributions:
Comment about this patent or add new information about this topic: