Patent application title: NOVEL TRAF FAMILY PROTEINS
Inventors:
Juan M. Zapata (San Diego, CA, US)
John C. Reed (Rancho Santa Fe, CA, US)
Assignees:
Burnham Institute for Medical Research
IPC8 Class: AC07K1400FI
USPC Class:
530350
Class name: Chemistry: natural resins or derivatives; peptides or proteins; lignins or reaction products thereof proteins, i.e., more than 100 amino acid residues
Publication date: 2010-12-23
Patent application number: 20100324267
Claims:
1. A TRAF-Protein-Binding-Domain (TPBD) polypeptide comprising SEQ ID
NO:19, provided said polypeptide is no longer than 213 amino acids.
2. The TPBD polypeptide of claim 1, further comprising SEQ ID NO:20.
3. The TPBD polypeptide of claim 1, further comprising SEQ ID NO:21.
4. The TPBD of claim 1, wherein the amino acid sequence of said protein comprises substantially the same sequence as any of SEQ ID NOS:8, 10, 12, 23, 24 or 25.
5. The TPBD of claim 1, comprising the same amino acid sequence as set forth in any of SEQ ID NOS:8, 10, 12, 23, 24 or 25.
6. A TPBD according to claim 4, wherein said polypeptide is encoded by a nucleotide sequence comprising substantially the same nucleotide sequence as set forth in SEQ ID NOs:7, 9 or 11.
7. A TPBD according to claim 1, wherein said polypeptide is encoded by a nucleotide sequence comprising the same sequence as set forth in SEQ ID NOs:7, 9 or 11.
8-11. (canceled)
12. Isolated nucleic acid encoding a TRAF protein binding domain (TPBD), or functional fragments thereof, selected from:(a) DNA encoding the amino acid sequence set forth in SEQ ID NOS:8, 10 or 12, 23, 24 or 25, or(b) DNA that hybridizes to the DNA of (a) under moderately stringent conditions, wherein said DNA encodes biologically active TPBD, or(c) DNA degenerate with (b), wherein said DNA encodes biologically active TPBD.
13. A nucleic acid according to claim 12, wherein said nucleic acid hybridizes under high stringency conditions to the TPBD coding portion of any of SEQ ID NOs:7, 9 and 11.
14. A nucleic acid according to claim 12, wherein the nucleotide sequence of said nucleic acid is substantially the same as set forth in any of SEQ ID NO:7, 9 and 11.
15-50. (canceled)
51. A chimeric protein comprising the TPBD of claim 1.
52. A chimeric TPBD-containing protein, comprising the sequence SEQ ID NO:19, provided said chimeric protein is not naturally occurring.
53. The chimeric TPBD-containing protein of claim 52, comprising:(a) the sequence SEQ ID NO:19; and(b) a sequence from a heterologous protein.
54. The chimeric TPBD-containing protein of claim 53, further comprising SEQ ID NO:20.
55. The chimeric TPBD-containing protein of claim 53, further comprising SEQ ID NO:21.
56. The chimeric protein of claim 53 further comprising a RING finger domain.
57. An isolated TPBD-containing TRAF protein, or fragment thereof, comprising the sequence SEQ ID NO:19, provided said TRAF domain containing protein does not consist of the sequence SEQ ID NOs:2, 4 or 6.
58. The TPBD-containing TRAF protein of claim 57, further comprising the sequence SEQ ID NO:20.
59. The TPBD-containing TRAF protein of claim 57, further comprising the sequence SEQ ID NO:21.
60-73. (canceled)
Description:
[0001]This application is a continuation of U.S. application Ser. No.
11/788,741, filed Apr. 20, 2007, which is a divisional of U.S.
application Ser. No. 09/706,325, filed Nov. 3, 2000, now U.S. Patent No.
7,208,581, which claims the benefit of U.S. Provisional Application No.
60/287,568, filed Nov. 5, 1999, which was converted from U.S. Ser. No.
09/434,784, filed Nov. 5, 1999, each of which the entire contents are
incorporated herein by reference.
BACKGROUND OF THE INVENTION
Field of the Invention
[0002]This invention relates generally to the fields of molecular biology and molecular medicine and more specifically to proteins involved in the regulation of immunological response and cell death.
[0003]This invention was made with government support under grant number CA69381 awarded by the National Institute of Health. The United States Government has certain rights in this invention.
[0004]Tumor Necrosis Factor (TNF) family cytokines play an important role in a wide variety of immunological, allergic, and inflammatory responses. Several members of the TNF family have been identified, including TNFα, Lymphotoxin-α, Lymphotoxin-β, LIGHT, CD27 Ligand (CD27L), CD30L, CD40L, Fas-L, Trail, and others. These molecules are generally produced as Type-II integral membrane proteins on the surface of cells, undergoing subsequent release into the excellular milieu as a result of proteolytic cleavage. Many of the TNF-family cytokines however remain anchored in the plasma membrane, relying on interactions with receptor-bearing cells through cell-cell contact.
[0005]The receptors for TNF-family cytokines are equally diverse. All members of the family have a conserved arrangement of cysteines in their extracellular domains, which is one of the criteria for membership in this family. The intracellular cytosolic domain of TNF-family receptors are diverse in their amino acid sequences, but can be broadly classified into two types: [0006]a) those that contain a protein-interaction module known as a Death-Domain (TNFR1, Fas,DR3, DR4, DR5) and those that do not (TNFR2, CD27, CD30, CD40, LTβR, 4B1 and others). Death Domains are responsible for interactions of a subgroup of the TNF-Receptor (TNFR) family with adapter proteins which bind in turn to caspase-family intracellular proteases involved in inducing apoptosis (programmed cell death). However, the Death Domains can also mediate binding to other types of adaptor molecules which bind kinases or other types of signaling molecules rather than proteases.
[0007]Those TNFR family members that do not contain a Death Domain in their cytosolic tail rely on a family of intracellular adapter proteins for transducing signals. This family of adaptor proteins is known as the "TNF Receptor Associated Factors" (TRAFs). TRAF-family proteins contain a protein interaction domain known as the TRAF-domain that mediates their binding to the cytosolic domains of TNF-family receptors. The TRAF domain also allows for interactions among TRAF-family members, creating opportunities for homo- and hetero-oligomerization that can have important functional consequences. In humans and mice, six members of the TRAF family have been described, including TRAF1, TRAF2, TRAF3, TRAF4, TRAF5, and TRAF6. The crystal structure of the TRAF domain of TRAF2 has been solved, revealing a trimeric assembly with 3-fold symmetry and demonstrating the presence of a surface pocket on each monomer that accounts for binding to discrete peptidyl motifs found within the cytosolic domains of many members of the TNF-family receptors.
[0008]Some TRAF family proteins physically associate with protein kinases and control the activation of these kinases. TRAF2, TRAF5, and TRAF6 for example have been reported to be capable of inducing the activation of kinases involved in activation of: (a) several members of the stress-kinase family, such as the Jun N-terminal kinase (JNK), and (b) phosphorylation of IκB, an inhibitor of the transcription factor NF-κB. In contrast, TRAF1, TRAF3, and TRAF4 do not activate these kinase, and in some contexts, may interfere with kinase activation by other TRAFs. All kinase-activating members of the TRAF family contain additional protein domains (besides the TRAF-domain) which are important for their function as kinase-activators, including a RING domain and sometimes zinc-finger domains.
[0009]Gene knock-out studies in mice have demonstrated critical roles for several of the TRAF-family proteins in signal transduction pathways stimulated by TNF-family receptors. Moreover, mutational analyses of the TRAF-binding sites within the cytosolic domains of TNF-family receptors have also provided evidence that interactions of TRAFs with these receptors are critical for many of the biochemical signal-transduction and cellular biological responses induced via TNF-family receptors.
[0010]In addition to their involvement in signaling by TNF-family receptors, at least one of the known TRAFs can also participate in signaling mediated by the Interleukin-1 Receptor/Toll family of receptors. TRAF6 interacts with a Death Domain-containing adapter protein (MyD88) and a Death Domain-containing protein kinase (IRAK) which are critical for NF-κB induction by IL-1 receptors. Thus, TRAFs may in some instances participate in signal transduction by other cytokine receptors beside those of the TNFR family. In addition, some TRAFs bind to viral proteins, suggesting that such protein interactions may play a role the mechanisms employed by viruses to either evade immune surveillance mechanisms or in virus-mediated malignant transformation.
[0011]A need exists, therefore, to identify novel TRAF family proteins or TRAF protein binding domains. The present invention satisfies this need and provides additional advantages as well.
SUMMARY OF THE INVENTION
[0012]In accordance with the present invention, there are provided novel TRAF-Protein-Binding-Domain polypeptides (TPBDs). The invention also provides nucleic acid molecules encoding TPBDs, vectors containing these nucleic acid molecules and host cells containing the vectors. The invention also provides antibodies that can specifically bind to invention TPBDs. Such TPBDs and/or anti-TPBD antibodies are useful for discovery of drugs that suppress autoimmunity, inflammation, allergy, allograph rejection, sepsis, and other diseases.
[0013]The present invention also provides a screening assay useful for identifying agents that can effectively alter the association of an invention TPBD with itself or with other proteins. By altering the self-association of TPBD or by altering their interactions with other proteins, an effective agent may increase or decrease the activation of kinases, or modulate cellular pathways that effect apoptosis, cell proliferation, cell adhesion, cell stress responses, B cell immunoglobulin class switching, and the like.
[0014]The invention also provides methods of altering the activity of TPBD in a cell, wherein such increased or decreased activity of TPBD can modulate the level of kinase activity or cellular pathways that effect apoptosis, cell proliferation, cell adhesion, cell stress responses, B cell immunoglobulin class switching, and the like. For example, the activity of TPBD in a cell can be increased by introducing into the cell and expressing a nucleic acid sequence encoding this polypeptide or proteins comprising such TPBD. In addition, the activity of TPBD or TPBD-comprising proteins in a cell can be decreased by introducing into the cell and expressing an antisense nucleotide sequence that is complementary to a portion of a nucleic acid molecule encoding the TPBD or TPBD-comprising proteins.
[0015]The invention also provides methods for using an agent that can specifically bind TPBD or a nucleotide sequence that can bind to a nucleic acid molecule encoding TPBD to diagnose a pathology that is characterized by an altered level of apoptosis, cell proliferation, cell adhesion, cell stress responses and B cell immunoglobulin class switching due to an increased or decreased level of TPBD in a cell.
BRIEF DESCRIPTION OF THE FIGURES
[0016]FIG. 1 shows a schematic representation of the structure of newly identified TRAF proteins referred to as HAUSP (SEQ ID NO:8)(also known as USP7), SPOP (SEQ ID NO:10) and TRAF 7 (SEQ ID NO:12); and additionally, TRAF 1 and TRAF 2, showing in shaded boxes the relative positions of the indicated protein domains: TRAF domain, protease domain, ring finger, Z-b box, zinc finger, poly-acidic domain, coiled coil, and nuclear localization signal.
[0017]FIG. 2 shows amino acid sequences of the TRAF domains of TRAF 7 (SEQ ID NO:25)(also known as KIAA), HAUSP (SEQ ID NO:23)(also known as USP7), and SPOP (SEQ ID NO:24) in alignment with TRAF domains of six other known human TRAF proteins (hT1td through hT6td, SEQ ID NOS:26-31). Dark boxes indicate identical residues between family members. Light boxes indicate structurally related residues.
[0018]FIG. 3 shows the predicted amino acid sequence of TRAF7 (SEQ ID NO:32, corresponding to amino acids 16-979 of SEQ ID NO:6). The different protein domains and regions of the protein are indicated: ring finger domain (amino acids 15-55); ZF-B Box domain (amino acids 90-132); coiled coil (amino acids 132-177); coiled coil (amino acids 195-231); two leucine zipper domains (amino acids 197-218 and 222-245); TRAF domain (amino acids 277-403); coiled coil (amino acids 427-446); and two poly-acidic regions (amino acids)(amino acids 868-964).
[0019]FIG. 4 shows Northern blot analysis of the mRNA levels of TRAF7, USP7 and SPOP in human tissues. Northern blot analyses were performed using the human 12 lane multiple tissue northern blot (MTN; Clontech; Palo Alto Calif.), as recommended by the manufacturer. The TRAF domains of TRAF7, USP7 and SPOP, as well as actin as a control, were labeled with 32P-cytidine using a nick translation assay kit.
[0020]FIG. 5 shows an analysis of the binding of the invention TRAF-protein-binding-domains (TPBDs) to different members of the TNF receptor family. In vitro binding assay results of the cytosolic domains of selected members of the TNF-R family are presented. GST-Fas(ct), GST-TNF-R2(ct), GST-CD40(ct), GST-LTβR(ct), GST-NGFR(ct) and GST-DR4(ct) immobilized on glutathione-Sepharose were incubated with in vitro translated (35)S-TRAF2,-SPOP, -HAUSP, and -TRAF7 TRAF domains. Alternatively, TRAF domains of TRAF7 or TRAF2 were expressed in 293 cells. Bound TRAF proteins were detected by SDS-PAGE and fluorography.
[0021]FIG. 6 shows an analysis of the binding of human TRAF proteins 1-6 and I-TRAF (Rothe et al., Proc. Natl. Acad. Sci. USA 93:8241-8246 (1996)) to TRAF-7, HAUSP and SPOP. GST-TRAF7 (282-435), GST HAUSP (1-213) and SPOP (1-180) proteins immobilized on glutathione-Sepharose were incubated with in vitro translated 35S-TRAF proteins and I-TRAF, as indicated. Bound TRAF proteins were detected by SDS-PAGE and fluorography.
[0022]FIG. 7 shows that TPBDs of HAUSP, SPOP and TRAF7 can specifically inhibit the NF-κB activity induced by TRAF containing proteins. In upper panel relative NF-κB activities effected by different TRAF domains are shown. Cells were transfected with control plasmid, pcDNA3-myc-hTRAF6 alone or with Tug of either pcDNA3-myc-HAUSP(1-213) or pcDNA3-myc-SPOP(1-180), together with 0.5 μg pUC13-4xNFκB-luc plasmid and 1 μg pCMV-β-galactosidase plasmid, as indicated. Relative NF-κB activity was assessed by luciferase assays, with normalization for β-galactosidase activity. The results are presented as fold of activation relative to the control.
[0023]FIG. 8 shows the effect of TRAF7 and USP7 in regulating NFκB activity induced by TRAF2 and TRAF6 in a mammalian cell line.
[0024]FIG. 9 shows the effect of the TRAF domains of TRAF7, USP7 and SPOP in regulating the NFκB activity induced by TNFα or CD40 overexpression.
[0025]FIG. 10 shows regulation of NIK induced NFκB activation by TRAF7 and USP7.
[0026]FIG. 11 shows the subcellular localization of different TRAF7 deletion mutants.
[0027]FIG. 12 shows the subcellular localization of different domains of USP7.
DETAILED DESCRIPTION OF THE INVENTION
[0028]In accordance with the present invention, there are provided novel TRAF-protein-binding-domains (TPBDs) of newly identified TRAF proteins, and fragments thereof. As used herein, an invention TPBD refers to a peptide region that binds to one or more TRAF proteins. Invention TPBDs share sequence homology with the C-TRAF domain of TRAF proteins, and have been found herein to have binding properties similar to those of other known TRAF proteins. This domain corresponds to the portion of the TRAF domain commonly referred to as the C-TRAF or TRAF-C domain (Arch et al., supra.; and Wajant et al., Cytokine Growth Factor Rev. 10:15-26 (1999), each of which is incorporated herein by reference). Previously characterized human TRAF domain proteins have also been found to have N-TRAF domains immediately amino-terminal of the C-TRAF domain.
[0029]TRAF proteins were initially identified as a family of proteins that are associated with members of the TNF-receptors family. TRAF proteins are known to influence a variety of cellular processes such as class switching in B cells, apoptosis, cell proliferation, and stress response (Arch et al., Genes Dev. 12:2821-2830 (1998)). These effects on cellular process are thought to occur by TRAF-mediated modulation of NF-κB and/or cJun N-terminal kinase (JNK) activity. TRAF proteins thus represent an important class of intracellular proteins that mediate the signal transduction of cell surface receptors such as TNFR-family members that ultimately influence a variety of cellular processes.
[0030]In addition to binding TNF-family receptors, TRAF domain proteins also interact with other TRAF domain proteins. For example, TRAF 1, TRAF 2 and TRAF 3 all have been shown to homo-oligomerize. TRAF 2 has further been demonstrated capable of hetero-oligomerization with TRAF 1 and TRAF 5. As shown herein, TRAF domains of the present invention have all been demonstrated as capable of interacting with other TRAF-domain proteins.
[0031]The functions of the TRAF domain containing proteins, generally, supports the role of invention TPBDs and invention TRAF domain proteins in cellular pathways that effect apoptosis, cell proliferation, cell adhesion, cell stress responses and B cell immunoglobulin class switching.
[0032]For example, invention TPBDs have been found to associate with other proteins, including proteins comprising TRAF domains. Exemplary TRAF proteins to which invention TPBDs bind are human TRAF1, TRAF2, TRAF3, TRAF4, TRAF5 or TRAF6. As used herein, the term "bind" or "binding" refers to the association of an invention TPBD with another protein relatively specifically and, therefore, can form a bound complex. In particular, the binding of a TPBD to a TRAF protein is sufficiently specific such that the bound complex can form in vivo in a cell or in vitro under suitable condition.
[0033]In one embodiment, it has been found that the invention HAUSP TPBD (SEQ ID NOS:8 or 23) binds TRAF1, TRAF2, TRAF3, TRAF4, TRAF5 and TRAF6, and can inhibit the NF-κB activation activity of TRAF2, TRAF5 and TRAF6. In another embodiment, it has been found that the invention SPOP TPBD domain (SEQ ID NOS:10 or 24) binds TRAF1 and TRAF6, and can inhibit the NF-κB activation activity of TRAF6. In yet another embodiment, it has been found that the invention TRAF-7 TPBD (SEQ ID NOS:12 or 25) binds TRAF1, TRAF2, TRAF3, TRAF4, TRAF5 and TRAF6, and can inhibit the NF-κB activation activity of TRAF2 and TRAF6, and can increase the NF-κB activation activity of TRAF5. Furthermore, it has been found that the TRAF domain of TRAF7 and USP7 can inhibit the NF-κB activation induced by TNFα and CD40 overexpression, whereas the TRAF domain of SPOP showed no inhibitory activity.
[0034]In an additional embodiment, it was found that TRAF7, the TRAF domain of TRAF7, and the TRAF domain of USP7 (HAUSP) inhibited NIK-induced NFκB activation. Therefore, the invention provides TPBDs that can be used to inhibit NIK-induced NFκB activation.
[0035]In another embodiment, invention TPBDs also bind to TNF receptor family proteins. For example, the SPOP TPBD of SEQ ID NO:10 binds TNF-R2, and the TRAF-7 TPBD (SEQ ID NOS:12 or 25) binds to TNF-R2, CD40, lymphotoxin-βR, NGFRp75 and DR4, and the TRAF domain of TRAF7 also binds with itself. Further, TRAF proteins generally, are known to bind to a wide variety of cell-surface receptors, and in particular, with cytokine receptors of the TNF receptor family. Exemplary receptors to which TRAF proteins are known to bind are: TNFR1, TNFR2, CD27, CD30, CD40, 4-1BB, Ox40, LT-βR, Fas, DR3, DR4, DR5, HVEM, LMP-1 and IL-1R. In addition, TRAF proteins have been observed to bind to all members of the TNF receptor family that do not comprise a death domain. Thus, it is contemplated herein that invention TPBDs also bind one or more receptors selected from TNFR1, TNFR2, CD27, CD30, CD40, 4-1BB, Ox40, LT-βR, Fas, DR3, DR4, DR5, HVEM, LMP-1, IL-1R, and members of the TNF receptor family that do not comprise a death domain.
[0036]In another embodiment of the invention, invention TPBDs have been found to bind to TRAF-associated proteins. For example, the TRAF-7 TPBD (SEQ ID NOS:12 or 25) binds I-TRAF. Further, TRAF proteins generally are known to interact with numerous TRAF-associated proteins, such as TRADD, FADD, I-TRAF, TRIP, A20, c-IAP1, c-IAP2, Casper, RIP, RIP2, NIK, Peg3, GCK, NIK, ASK1 and IRAK. Thus, it is contemplated herein that TPBDs also bind one or more TRAF-associated proteins selected from TRADD, FADD, I-TRAF, TRIP, A20, c-IAP1, c-IAP2, Casper, RIP, RIP2, NIK, Peg3, GCK, NIK, ASK1 and IRAK.
[0037]Structurally, an invention TPBD is characterized as having the sequence: E(X)17-21LXW(X)3VXP(X)15-16L(X)24-28K(X).su- b.15-16W (SEQ ID NO:19), where X is any amino acid. Alternatively, an invention TRAF domain is characterized as having the sequence: LXWX(X')XVXP (SEQ ID NO:20) where X is any amino acid and X' is selected from L and I. Preferably, an invention TRAF domain has the sequence: E(X)10-13S(X)6-7LXW(X)3VXP(x)10-11S(X)4L(X)24-28- K(X)9-10F(X)5WG(X)3F(X)16D(X)5-7V (SEQ ID NO:21), where X is any amino acid. More preferably, an invention TRAF domain has the following sequence: E(X)4(XA)(X)5-8SX(XB)(X)4-5LXWX(XA)XVXP(X)1- 0-11S(XA)(X)3L(X)16-18(XA)(X)4-6(XC)(X)2K(X)9-10F(X)5WG(XD)(X)2F(X)5(XA)X(XC(X- )7(XC) DX(XA)(X)2-4(XC)V (SEQ ID NO:22), where X is any amino acid, XA is selected from V, L and I; XB is selected from P and G; XC is selected from D, E, N and Q; and XD is selected from Y and F. Most preferably, an invention TRAF domain comprises the sequence SEQ ID NOS:8, 10, 12, 23, 24 or 25.
[0038]TRAF domain proteins, generally, are well known in the art to modulate the activity of NF-κB and JNK. In accordance with another embodiment of the invention, invention TPBDs, HAUSP (preferably SEQ ID NOS:8 or 23), SPOP (SEQ ID NOS:10 or 24), and TRAF-7 (SEQ ID NOS:12 or 25), have been found to modulate the activity NF-κB or cJun N-terminal kinase (JNK).
[0039]It has also been found that invention TPBDs modulate a variety of cellular pathways. TRAF domain proteins, generally, are well known in the art as modulating the cellular pathways that effect apoptosis, cell proliferation, cell adhesion, cell stress responses and B cell immunoglobulin class switching, and NF-kB and JNK are further known to modulate these pathways. Thus, those of skill in the art will recognize that it is within the scope of the invention that TPBDs modulate one or more cellular pathways that effect apoptosis, cell proliferation, cell adhesion, cell stress responses and B cell immunoglobulin class switching.
[0040]Presently preferred TRAF domains of the invention include amino acid sequences that comprise substantially the same protein sequence set forth in SEQ ID NOS:8, 10, 12, 23, 24 and 25, as well as biologically active, modified forms thereof.
[0041]In another embodiment, invention TPBDs include proteins comprising TPBD fragments having the sequence SEQ ID NO:19 or SEQ ID NO:20, which retain at least one native biological TPBD activity, such as immunogenicity, the ability to bind to a TNF family receptor, the ability to bind a TRAF domain protein, the ability to bind a TRAF-associated protein, the ability to modulate NF-κB activity or JNK activity, the ability to modulate apoptosis, cell proliferation, cell adhesion, cell stress responses or B cell immunoglobulin class switching, provided the TRAF protein is no longer than 213 amino acids.
[0042]Use of the terms "isolated" and/or "purified" in the present specification and claims as a modifier of DNA, RNA, polypeptides or proteins means that the DNA, RNA, polypeptides or proteins so designated have been produced in such form by the hand of man, and thus are separated from their native in vivo cellular environment, and are substantially free of any other species of nucleic acid or protein. As a result of this human intervention, the recombinant DNAs, RNAs, polypeptides and proteins of the invention are useful in ways described herein that the DNAs, RNAs, polypeptides or proteins as they naturally occur are not.
[0043]As used herein, "eukaryotic" refers to the variety of species from which an invention TPBD is derived, e.g., yeast, slime mold, plant, insect, nematode, mammal, and the like. A preferred TPBD herein, is mammalian TRAF.
[0044]As used herein, "mammalian" refers to the variety of species from which a preferred invention TPBD is derived, e.g., human, rat, mouse, rabbit, monkey, baboon, bovine, porcine, ovine, canine, feline, and the like. A more preferred TPBD herein, is human TRAF.
[0045]The term "biologically active" or "functional", when used herein as a modifier of invention TPBD(s), or polypeptide fragment thereof, refers to a polypeptide that exhibits functional characteristics similar to a TPBD. For example, one biological activity of a TPBD is the ability to bind, preferably in vivo, to a TRAF protein. Such TRAF binding activity can be assayed, for example, using the methods described in the Examples described herein.
[0046]Another biological activity of TRAF is the ability to act as an immunogen for the production of polyclonal and monoclonal antibodies that bind specifically to an invention TPBD. Thus, an invention TPBD will encode a polypeptide specifically recognized by an antibody that also specifically recognizes the TPBD having the amino acid sequence SEQ ID NO:19 or SEQ ID NO:20, preferably including an amino acid set forth in SEQ ID NOS:8, 10, 12, 23, 24 or 25. Such immunologic activity may be assayed by any method known to those of skill in the art. For example, a test-TPBD polypeptide can be used to produce antibodies, which are then assayed for their ability to bind to an invention TPBD comprising SEQ ID NO:19 or SEQ ID NO:20, preferably including a sequence set forth in SEQ ID NOS:8, 10, 12, 23, 24 or 25. If the antibody binds to the test-polypeptide and a protein including the sequence SEQ ID NO:19 or SEQ ID NO:20, and preferably also a protein including SEQ ID NOS:8, 10, 12, 23, 24 or 25 with substantially the same affinity, then the polypeptide possesses the requisite immunologic biological activity.
[0047]The TPBD containing protein set forth in SEQ ID NO:2 was initially identified as a herpes-associated ubiquitin-specific protease (HAUSP) associated with the nuclear domain of the PML protein and with Herpes simplex virus type 1 protein Vmw110 (Everett et al., EMBO J. 16:566-577 (1997)). The subject application represents the first identification of a portion of this protein as forming a TRAF domain.
[0048]The TPBD containing protein set forth in SEQ ID NO:4 was initially identified in serum from a scleroderma patient as having a speckled pattern in nuclei, and was termed a speckle-type POZ protein, or SPOP (Nagai et al., FEBS Lett. 418:23-26 (1997)). The POZ domain of SPOP has an undetermined function, but the protein was found to be widely expressed.
[0049]The TPBD containing protein set forth in SEQ ID NO:6 was identified as having a leucine-zipper region, a zinc finger region and a leucine zipper-like K-box region (Nagase et al., DNA Res. 5:355-364 (1998)). Initially named KIAA0898, this protein is also referred to herein as TRAF7.
[0050]In addition to the TRAF domain, TRAF7 was also found to have a number of previously unrecognized domains (see FIG. 3). The existence of a RING finger domain (amino acids 15-55) close to the N-terminus of TRAF7 was previously known. TRAF7 is KIAA0898, which was described in the database as a RING finger protein of unknown function. Besides this domain, several other new protein domains were identified in this molecule. The RING finger domain is followed by a ZF-BBox domain (amino acids 90-132) and a coiled coil (amino acids 132-177). These three protein domains form what is denominated the tripartite motif. The tripartite motif has been found in a very restricted list of proteins, including transcription factors, ribonucleoproteins, and oncoproteins and appears to be involved in protein-protein interaction (Borden, Biochem. Cell. Biol. 76:351-358 (1998)). After the tripartite motif, a second coiled coil (amino acids 195-231) is found. Two putative leucine zipper domains are also found in this region of the protein (amino acids 197-218 and 222-245). The TRAF domain (amino acids 277-403) is located after the leucine zippers. The TRAF domain is followed by another coiled coil (amino acids 427-446) and by two regions rich in acidic residues (amino acids 452-577 and 868-964). Therefore, in addition to TRAF domains, including the TRAF domain of TRAF7 (SEQ ID NOS:12 or 25), the invention also provides additional functional domains of TRAF7, including a ZF-BBox domain (amino acids 90-132 of SEQ ID NO:32); coiled coil domains (amino acids 132-177, 195-231 and 427-446 of SEQ ID NO:32); leucine zipper domains (amino acids 197-218 and 222-245 of SEQ ID NO:32); and poly-acidic domains (amino acids 452-577 and 868-964 of SEQ ID NO:32).
[0051]In accordance with one embodiment of the invention, it has been found that the invention TPBD (SEQ ID NOS:8 or 23) can bind all known human TRAF proteins (i.e., TRAF proteins 1 through 6), although the TRAF domain of HAUSP (SEQ ID NOS:8 or 23) binds TRAF 3 somewhat weakly, and binds TRAF 6 quite strongly. The invention TPBD of HAUSP (SEQ ID NOS:8 or 23) has also been found to inhibit NF-κB activation mediated by any of TRAF2, TRAF5 or TRAF6. The TPBD of TRAF7 (SEQ ID NOS:12 or 25) also was found to inhibit NFκB activation mediated by TRAF2 or TRAF6 (see Examples).
[0052]In another embodiment of the invention, the invention TPBD of SPOP (SEQ ID NOS:10 or 24) shows selectivity in TRAF protein interaction, binding human TRAF 1 and TRAF 6, but not human TRAF proteins 2, 3, 4 or 5. The invention TPBD of SPOP (SEQ ID NOS:10 or 24) has also been found to be specific in TNF-receptor interaction, binding TNF-receptor 2, but not FAS, CD40, lymphotoxin-β receptor, NGF receptor and DR4. The invention TPBD of SPOP (SEQ ID NOS:10 or 24) has also found to inhibit NF-κB activation mediated by TRAF6.
[0053]In a further embodiment of the invention, the invention TPBD of TRAF7 (SEQ ID NOS:12 or 25) binds all known human TRAF proteins, but binds TRAF 2 somewhat weakly, and binds TRAF 6 quite strongly. The invention TPBD of SPOP (SEQ ID NOS:10 or 24) also binds numerous members of the TNF-receptor family, including TNF-receptor 2, CD40 (albeit weakly), lymphotoxin-β receptor, NGF receptor and DR4, but not to FAS. The invention TPBD of SPOP (SEQ ID NOS:10 or 24) has also been demonstrated to inhibit NF-κB activation by either TRAF2 or TRAF6.
[0054]Those of skill in the art will recognize that numerous residues of the above-described sequences can be substituted with other, chemically, sterically and/or electronically similar residues without substantially altering the biological activity of the resulting receptor species. In addition, larger polypeptide sequences containing substantially the same sequence as amino acids set forth in SEQ ID NOS:8, 10, 12, 23, 24 and 25 therein (e.g., splice variants) are contemplated.
[0055]As employed herein, the term "substantially the same amino acid sequence" refers to amino acid sequences having at least about 70% identity with respect to the reference amino acid sequence, and retaining comparable functional and biological activity characteristic of the protein defined by the reference amino acid sequence. Preferably, proteins having "substantially the same amino acid sequence" will have at least about 80%, more preferably 90% amino acid identity with respect to the reference amino acid sequence; with greater than about 95% amino acid sequence identity being especially preferred. It is recognized, however, that polypeptides (or nucleic acids referred to hereinbefore) containing less than the described levels of sequence identity arising as splice variants or that are modified by conservative amino acid substitutions, or by substitution of degenerate codons are also encompassed within the scope of the present invention. Identity of any two amino acid sequences can be determined by those skilled in the art based, for example, on a BLAST 2.0 computer alignment, using default parameters.
[0056]The invention TPBDs can be isolated by a variety of methods well-known in the art, e.g., recombinant expression systems described herein, precipitation, gel filtration, ion-exchange, reverse-phase and affinity chromatography, and the like. Other well-known methods are described in Deutscher et al., Guide to Protein Purification: Methods in Enzymology Vol. 182, (Academic Press, (1990)), which is incorporated herein by reference. Alternatively, the isolated polypeptides of the present invention can be obtained using well-known recombinant methods as described, for example, in Sambrook et al., supra., 1989).
[0057]An example of the means for preparing the invention TPBD(s) is to express nucleic acids encoding the TPBD in a suitable host cell, such as a bacterial cell, a yeast cell, an amphibian cell (i.e., oocyte), or a mammalian cell, using methods well known in the art, and recovering the expressed polypeptide, again using well-known methods. Invention polypeptides can be isolated directly from cells that have been transformed with expression vectors as described below herein. The invention polypeptide, biologically functional fragments, and functional equivalents thereof can also be produced by chemical synthesis. For example, synthetic polypeptides can be produced using Applied Biosystems, Inc. Model 430A or 431A automatic peptide synthesizer (Foster City, Calif.) employing the chemistry provided by the manufacturer.
[0058]Also encompassed by the term TPBD are functional fragments or polypeptide analogs thereof. The term "functional fragment" refers to a peptide fragment that is a portion of a full length TPBD, provided that the portion has a biological activity, as defined above, that is characteristic of the corresponding full length protein. For example, a functional fragment of an invention TPBD can have an activity such as the ability, for example, to bind a TNF-family receptor, to bind another TRAF protein, to bind a TRAF-associated protein, or to modulate NF-κB activity or JNK activity, or to modulate the level of cell proliferation, apoptosis, cell adhesion, cell stress responses, class switching, and the like. In addition, the characteristic of a functional fragment of invention TPBDs to elicit an immune response is useful for obtaining an anti-TPBD antibodies. Thus, the invention also provides functional fragments of invention TPBDs, which can be identified using the binding and routine methods, such as bioassays described herein.
[0059]The term "polypeptide analog" includes any polypeptide having an amino acid residue sequence substantially the same as a sequence specifically shown herein in which one or more residues have been conservatively substituted with a functionally similar residue and which displays the ability to functionally mimic a TPBD as described herein. Examples of conservative substitutions include the substitution of one non-polar (hydrophobic) residue such as isoleucine, valine, leucine or methionine for another, the substitution of one polar (hydrophilic) residue for another such as between arginine and lysine, between glutamine and asparagine, between glycine and serine, the substitution of one basic residue such as lysine, arginine or histidine for another, or the substitution of one acidic residue, such as aspartic acid or glutamic acid for another.
[0060]The amino acid length of functional fragments or polypeptide analogs of the present invention can range from about 5 amino acids up to one residue less than a full-length protein sequence of an invention TPBD. In certain embodiments, the amino acid lengths include, for example, at least about 10 amino acids, at least about 20, at least about 30, at least about 40, at least about 50, at least about 75, at least about 100, at least about 150, at least about 200, at least 213, at least about 250, at least about 300, at least about 350 or more amino acids in length up to one residue less than a full-length TPBD-containing protein sequence, provided as sequence SEQ ID NO:2, 4 or 6.
[0061]Preferably, a fragment comprises a sequence selected from SEQ ID NO:19, 21 or 22. Such a fragment can also include at least about 10 residues at its amino-terminus, carboxy-terminus, or both; at least about 20 residues at its amino-terminus, carboxy-terminus, or both; at least about 30 residues at its amino-terminus, carboxy-terminus, or both; at least about 40 residues at its amino-terminus, carboxy-terminus, or both; at least about 50 residues at its amino-terminus, carboxy-terminus, or both; at least about 60 residues at its amino-terminus, carboxy-terminus, or both.
[0062]More preferably, a fragment comprises a sequence selected from SEQ ID NO:19, 21 or 22, further comprising one or more domains selected from N-TRAF, Ring Finger, Zinc finger, coiled-coil, POZ, and USP. Most preferably, a fragment has at least one fewer domains than the domains in proteins from SEQ ID NO:2, 4 or 6, wherein the domains are selected from N-TRAF, Ring Finger, Zinc finger, coiled-coil, POZ, and USP. Such domains are known in the art, as exemplified in Arch et al., supra.; and Wajant et al., supra. Identification of the domains in proteins from SEQ ID NO:2, 4 and 6 is carried out by reference to the publication reporting such proteins (i.e., Everett et al., supra., Nagai et al., supra., and Nagase et al., supra., for HAUSP (SEQ ID NO:2), SPOP (SEQ ID NO:4) and TRAF7 (SEQ ID NO:6), respectively).
[0063]As used herein the phrase "conservative substitution" also includes the use of a chemically derivatized residue in place of a non-derivatized residue, provided that such polypeptide displays the required binding activity. The phrase "chemical derivative" refers to a subject polypeptide having one or more residues chemically derivatized by reaction of a functional side group. Such derivatized molecules include, for example, those molecules in which free amino groups have been derivatized to form amine hydrochlorides, p-toluene sulfonyl groups, carbobenzoxy groups, t-butyloxycarbonyl groups, chloroacetyl groups or formyl groups. Free carboxyl groups may be derivatized to form salts, methyl and ethyl esters or other types of esters or hydrazides. Free hydroxyl groups may be derivatized to form O-acyl or O-alkyl derivatives. The imidazole nitrogen of histidine may be derivatized to form N-im-benzylhistidine. Also included as chemical derivatives are those peptides which contain one or more naturally occurring amino acid derivatives of the twenty standard amino acids. For example: 4-hydroxyproline may be substituted for proline; 5-hydroxylysine may be substituted for lysine; 3-methylhistidine may be substituted for histidine; homoserine may be substituted for serine; and ornithine may be substituted for lysine. Polypeptides of the present invention also include any polypeptide having one or more additions and/or deletions of residues, relative to the sequence of a polypeptide whose sequence is shown herein, so long as the required activity is maintained.
[0064]In accordance with another embodiment, novel TPBD-containing TRAF proteins are provided. Invention TPBD-containing TRAF proteins refer to a protein comprising an invention TPBD including SEQ ID NO:19 or including SEQ ID NO:20, or a recombinantly produced invention TPBD-containing TRAF protein, including naturally occurring allelic variants thereof encoded by mRNA generated by alternative splicing of a primary transcript, provided the TPBD-containing TRAF protein are not the sequence SEQ ID NO:2, 4 or 6. Preferably, a TPBD-containing TRAF protein comprises an invention TPBD with a sequence substantially the same as SEQ ID NO:8, 10, 12, 23, 24, or 25. More preferably, a TPBD-containing TRAF protein comprises an invention TPBD with the sequence of SEQ ID NO:8, 10, 12, 23, 24, or 25.
[0065]A TPBD-containing TRAF protein comprising an invention TPBD domain is further characterized as binding one or more members of the TNF-family of receptors, or binding to one or more TRAF domain proteins, or binding to one or more TRAF associated proteins; or modulating NF-kB activity, or modulating cJun N-terminal kinase (JNK) activity; or modulating apoptosis, cell proliferation, cell adhesion, cell stress responses or B cell immunoglobulin class switching; or any combination thereof.
[0066]In another embodiment of the invention, TPBD-containing chimeric proteins are provided comprising an invention TPBD, or fragments thereof, having the sequence of SEQ ID NO:19 or SEQ ID NO:20, and further comprising one or more sequences from a heterologous protein. Invention TPBD-containing chimeric proteins include, for example, polypeptides having the sequence SEQ ID NO:8, 10, 12, 23, 24 or 25. Sequences from heterologous proteins with which the TPBD or functional fragment thereof are fused will include, for example, glutathione-S-transferase, an antibody, or other proteins or functional fragments thereof which facilitate recovery of the chimera. Further proteins with which the TPBD or functional fragment thereof are fused will include, for example, luciferase, green fluorescent protein, an antibody, or other proteins or functional fragments thereof which facilitate identification of the chimera. Still further proteins with which the TPBD or functional fragment thereof are fused will include, for example, the LexA DNA binding domain, ricin, α-sarcin, an antibody, or other proteins which have therapeutic properties or other biological activity.
[0067]As such chimeric proteins include sequences from two different proteins, the resultant amino acid sequence of the chimeric protein will typically be a non-naturally occurring sequence. Thus, in accordance with this embodiment of the invention, there are provided chimeric proteins comprising an invention TPBD, or fragments thereof, having the sequence of SEQ ID NO:19 or SEQ ID NO:20, provided the sequence of the chimeric protein is not naturally occurring.
[0068]Further invention chimeric proteins contemplated herein are chimeric proteins wherein an invention TPBD is combined with one or more domains selected from a Ring-Finger domain, a Zinc-finger domain and an N-TRAF domain from a heterologous protein. For example, the TPBD of SEQ ID NO:8, 10, 12, 23, 24, or 25 can be fused with the Ring-Finger domain of TRAF proteins such as human TRAF 2, 3, 4, 5 or 6, and the like. Another example of such a chimera is a protein wherein the TPBD of SEQ ID NO:8, 10, 12, 23, 24, or 25 is fused with the N-TRAF domain from a TRAF protein such as human TRAF 1, 2, 3, 4, 5 or 6, and the like.
[0069]Another embodiment of the invention provides TPBD, or a functional fragment thereof, fused with a moiety to form a conjugate. As used herein, a "moiety" can be a physical, chemical or biological entity which contributes functionality to TPBD or a functional fragment thereof. Functionalities contributed by a moiety include therapeutic or other biological activity, or the ability to facilitate identification or recovery of TPBD. Therefore, a moiety will include molecules known in the art to be useful for detection of the conjugate by, for example, by fluorescence, magnetic imaging, detection of radioactive emission, and the like. A moiety may also be useful for recovery of the conjugate, for example a His tag or other known tags used for protein isolation/purification, or a physical substance such as a bead. A moiety can be a therapeutic compound, for example, a cytotoxic drug which can be useful to effect a biological change in cells to which the conjugate localizes.
[0070]In accordance with another embodiment of the invention there are provided oligomers comprising invention TPBDs and fragments thereof, invention TPBD-containing proteins, TPBD-containing chimeric proteins, or combinations thereof. It has been found that a TPBD such as the TPBD of TRAF 7 (SEQ ID NOS:12 or 25) can bind at least one other, equivalent TPBD in forming a homo-oligomer. Thus in one embodiment, the invention comprises homo-oligomers of invention TPBDs and fragments thereof, invention TPBD-containing proteins, TPBD-containing chimeric proteins, or combinations thereof.
[0071]In another embodiment of the invention, there are provided hetero-oligomers comprising invention TPBDs and fragments thereof, invention TPBD-containing proteins, TPBD-containing chimeric proteins, or combinations thereof. It was found that invention HAUSP-TPBD (SEQ ID NOS:8 or 23), SPOP-TPBD (SEQ ID NOS:10 or 24) and TRAF7-TPBD (SEQ ID NOS:12 or 25) all bind, for example, multiple TRAF proteins. Thus hetero-oligomers comprising invention TPBDs and fragments thereof, invention TPBD-containing proteins, TPBD-containing chimeric proteins, or combinations thereof, and further comprising TRAF proteins such as human TRAF1, TRAF2, TRAF3, TRAF4, TRAF5, TRAF6, or combinations thereof. For example, the HAUSP-TPBD (SEQ ID NOS:8 or 23) can form a hetero-oligomer with human TRAF1, TRAF2, TRAF3, TRAF4, TRAF5, TRAF6, or combinations thereof. In another example, the SPOP-TPBD (SEQ ID NOS:10 or 24) can form a hetero-oligomer with human TRAF1, TRAF6, or combinations thereof. In a further example, the TRAF7-TPBD (SEQ ID NOS:12 or 25) can form a hetero-oligomer with human TRAF1, TRAF2, TRAF3, TRAF4, TRAF5, TRAF6, or combinations thereof.
[0072]In accordance with another embodiment of the invention, there are provided isolated nucleic acids, which encode a novel TPBD, and fragments thereof, TPBD-containing TRAF proteins and TPBD-containing chimeric proteins. Nucleic acids that encode a TPBD are those that encode a protein with the ability to bind, preferably in vivo, to one or more members of the Tumor Necrosis Factor Receptor-family (TNFR-family), or to one or more members of the TRAF family, or to one or more TRAF-associated proteins, or any combination thereof, or have the ability to modulate NF-κB activity, JNK activity, apoptosis, cell proliferation, cell adhesion, cell stress responses or B cell immunoglobulin class switching. An invention nucleic acid encodes a TPBD having the sequence: E(X)17-21L(X)2W(X)3VXP(X)15-16L(X)24-28K(X)15-16W (SEQ ID NO:19), where X is any amino acid. Alternatively, an invention nucleic acid encodes a TPBD having the sequence: LXWX(X')XVXP (SEQ ID NO:20) where X is any amino acid and X' is selected from L and I. Preferably, an invention nucleic acid encodes a TPBD having the sequence: E(X)10-13S(X)6-7LXW(X)3VXP(x)10-11S(X)4L(X)24-28K(X)9-10F(X)5WG(X)3F(X)16D(X)5-7V (SEQ ID NO:21), where X is any amino acid. More preferably, an invention nucleic acid encodes a TPBD having the sequence: E(X)4(XA)(X)5-8SX(XB)(X)4-5LXWX(XA)XVXP(X).- sub.10-11S (XA)(X)3L(X)16-18(XA)(X)4-6(XC)(X- )2K(X)9-10F(X)5WG(XA)(X)2F(X)5(XA)X(X.s- ub.C)(X)7(XC) DX(XA)(X)2-4(XC)V (SEQ ID NO:22), where X is any amino acid, XA is selected from V, L and I; XB is selected from P and G; XC is selected from D, E, N and Q; and XD is selected from Y and F. Most preferably, an invention nucleic acid encodes a TPBD comprising the sequence SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0073]The nucleic acid molecules described herein are useful for producing invention proteins, when such nucleic acids are incorporated into a variety of protein expression systems known to those of skill in the art. In addition, such nucleic acid molecules or fragments thereof can be labeled with a readily detectable substituent and used as hybridization probes for assaying for the presence and/or amount of an invention TPBD gene or mRNA transcript in a given sample. The nucleic acid molecules described herein, and fragments thereof, are also useful as primers and/or templates in a Polymerase Chain Reacion (PCR) for amplifying genes encoding invention proteins described herein.
[0074]The term "nucleic acid" (also referred to as polynucleotides) encompasses ribonucleic acid (RNA) or deoxyribonucleic acid (DNA), probes, oligonucleotides, and primers. DNA can be either complementary DNA (cDNA) or genomic DNA, e.g. a gene encoding a TPBD. One means of isolating a nucleic acid encoding a TPBD is to probe a mammalian genomic library with a natural or artificially designed DNA probe using methods well known in the art. DNA probes derived from the TPBD gene are particularly useful for this purpose. DNA and cDNA molecules that encode TPBDs can be used to obtain complementary genomic DNA, cDNA or RNA from eukaryotic (e.g., human, primate, mammal, plant, nematode, insect, yeast, and the like), or mammalian sources, or to isolate related cDNA or genomic clones by the screening of cDNA or genomic libraries, by methods described in more detail below. Examples of nucleic acids are RNA, cDNA, or isolated genomic DNA encoding a TPBD, provided the nucleic acids do not comprise the nucleotide sequence set forth in SEQ ID NOs:1, 3, or 5. Such nucleic acids may include, but are not limited to, nucleic acids comprising substantially the same nucleotide sequence as set forth in SEQ ID NOs:7, 9, and 11.
[0075]In one embodiment of the present invention, cDNAs encoding the invention TPBD disclosed herein comprise substantially the same nucleotide sequence as set forth in SEQ ID NOs:19 or 20, provided they do not comprise the sequence set forth in SEQ ID NO: 1, 3 or 5. Preferably, cDNAs encoding the invention TPBDs disclosed herein comprise substantially the same nucleotide sequence as set forth in any of SEQ ID NOs:7, 9, and 11, provided they do not comprise the sequence set forth in SEQ ID NO: 1, 3 or 5. Preferred cDNA molecules encoding the invention proteins comprise the same nucleotide sequence as set forth in SEQ ID No: 7, 9 and 11.
[0076]In another embodiment of the present invention, cDNAs encoding the invention TPBDs disclosed herein comprise substantially the same nucleotide sequence as set forth in SEQ ID NOs:19 or 20, provided they are no longer than 639 bases in length. Preferably, cDNAs encoding the invention TPBDs disclosed herein comprise substantially the same nucleotide sequence as set forth in any of SEQ ID NOs:7, 9, and 11, provided they are not longer than 639 bases in length. Preferred cDNA molecules encoding the invention proteins comprise the same nucleotide sequence as set forth in SEQ ID No: 7, 9 and 11.
[0077]cDNA molecules SEQ ID NOs:7, 9 and 11 encoding the invention TRAF domains respectively represent the same nucleotide sequence as nucleotides 1-639 set forth in SEQ ID No:1; nucleotides 1-540 set forth in SEQ ID NO:3; and nucleotides 847-1305 set forth in SEQ ID NO:5.
[0078]As employed herein, the term "substantially the same nucleotide sequence" refers to DNA having sufficient identity to the reference polynucleotide, such that it will hybridize to the reference nucleotide under moderately stringent hybridization conditions. In one embodiment, DNA having substantially the same nucleotide sequence as the reference nucleotide sequence encodes substantially the same amino acid sequence as that set forth in any of SEQ ID Nos:8, 10, 12, 23, 24 and 25, provided the DNA does not encode the sequence set forth in SEQ ID NO: 2, 4 or 6. In another embodiment, DNA having "substantially the same nucleotide sequence" as the reference nucleotide sequence has at least 60% identity with respect to the reference nucleotide sequence. DNA having at least 70%, more preferably at least 90%, yet more preferably at least 95%, identity to the reference nucleotide sequence is preferred. Identity of any two nucleic acid sequences can be determined by those skilled in the art based, for example, on a BLAST 2.0 computer alignment, using default parameters. BLAST 2.0 searching is available at http://www.ncbi.nlm.nih.gov/gorf/b12.html., as described by Tatiana et al., FEMS Microbiol Lett. 174:247-250 (1999).
[0079]This invention also encompasses nucleic acids which differ from the nucleic acids shown in SEQ ID NOs:7, 9, and 11, but which have the same phenotype. Phenotypically similar nucleic acids are also referred to as "functionally equivalent nucleic acids". As used herein, the phrase "functionally equivalent nucleic acids" encompasses nucleic acids characterized by slight and non-consequential sequence variations that will function in substantially the same manner to produce the same protein product(s) as the nucleic acids disclosed herein. In particular, functionally equivalent nucleic acids encode polypeptides that are the same as those encoded by the nucleic acids disclosed herein or that have conservative amino acid variations. For example, conservative variations include substitution of a non-polar residue with another non-polar residue, or substitution of a charged residue with a similarly charged residue. These variations include those recognized by skilled artisans as those that do not substantially alter the tertiary structure of the protein.
[0080]Further provided are nucleic acids encoding TPBDs that, by virtue of the degeneracy of the genetic code, do not necessarily hybridize to the invention nucleic acids under specified hybridization conditions. Preferred nucleic acids encoding the invention TPBDs are comprised of nucleotides that encode substantially the same amino acid sequence as set forth in SEQ ID NOs:8, 10, and 12, provided they do not encode the sequence set forth in SEQ ID NO: 2, 4 or 6.
[0081]Thus, an exemplary nucleic acid encoding an invention TPBD may be selected from: [0082](a) DNA encoding the amino acid sequence set forth in SEQ ID NOs:19 or 20 [0083](b) DNA that hybridizes to the DNA of (a) under moderately stringent conditions, wherein said DNA encodes biologically active TPBD, or [0084](c) DNA degenerate with (b), wherein said DNA encodes biologically active TPBD,wherein the nucleic acid sequence does not encode an amino acid sequence longer than 213 residues.
[0085]Another exemplary nucleic acid encoding an invention TPBD may be selected from: [0086](a) DNA encoding the amino acid sequence set forth in SEQ ID NOS:8, 10, 12, 23, 24 and 25, [0087](b) DNA that hybridizes to the DNA of (a) under moderately stringent conditions, wherein said DNA encodes biologically active TPBD, or [0088](c) DNA degenerate with (b), wherein said DNA encodes biologically active TPBD,wherein the nucleic acid sequence does not encode the amino acid sequence set forth in SEQ ID Nos:2, 4 or 6.
[0089]Hybridization refers to the binding of complementary strands of nucleic acid (i.e., sense:antisense strands or probe:target-DNA) to each other through hydrogen bonds, similar to the bonds that naturally occur in chromosomal DNA. Stringency levels used to hybridize a given probe with target-DNA can be readily varied by those of skill in the art.
[0090]The phrase "stringent hybridization" is used herein to refer to conditions under which polynucleic acid hybrids are stable. As known to those of skill in the art, the stability of hybrids is reflected in the melting temperature (Tm) of the hybrids. In general, the stability of a hybrid is a function of sodium ion concentration and temperature. Typically, the hybridization reaction is performed under conditions of lower stringency, followed by washes of varying, but higher, stringency. Reference to hybridization stringency relates to such washing conditions.
[0091]As used herein, the phrase "moderately stringent hybridization" refers to conditions that permit target-DNA to bind a complementary nucleic acid that has about 60% identity, preferably about 75% identity, more preferably about 85% identity to the target DNA; with greater than about 90% identity to target-DNA being especially preferred. Preferably, moderately stringent conditions are conditions equivalent to hybridization in 50% formamide, 5× Denhart's solution, 5×SSPE, 0.2% SDS at 42° C., followed by washing in 0.2×SSPE, 0.2% SDS, at 42° C.
[0092]The phrase "high stringency hybridization" refers to conditions that permit hybridization of only those nucleic acid sequences that form stable hybrids in 0.018M NaCl at 65° C. (i.e., if a hybrid is not stable in 0.018M NaCl at 65° C., it will not be stable under high stringency conditions, as contemplated herein). High stringency conditions can be provided, for example, by hybridization in 50% formamide, 5× Denhart's solution, 5×SSPE, 0.2% SDS at 42° C., followed by washing in 0.1×SSPE, and 0.1% SDS at 65° C.
[0093]The phrase "low stringency hybridization" refers to conditions equivalent to hybridization in 10% formamide, 5× Denhart's solution, 6×SSPE, 0.2% SDS at 42° C., followed by washing in 1×SSPE, 0.2% SDS, at 50° C. Denhart's solution and SSPE (see, e.g., Sambrook et al., Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989) are well known to those of skill in the art as are other suitable hybridization buffers.
[0094]As used herein, the term "degenerate" refers to codons that differ in at least one nucleotide from a reference nucleic acid, e.g., SEQ ID NOs:7, 9, and 11, but encode the same amino acids as the reference nucleic acid. For example, codons specified by the triplets "UCU", "UCC", "UCA", and "UCG" are degenerate with respect to each other since all four of these codons encode the amino acid serine.
[0095]Preferred nucleic acids encoding the invention polypeptide(s) hybridize under moderately stringent, preferably high stringency, conditions to substantially the entire sequence, or substantial portions (i.e., typically at least 15-30 nucleotides) of the nucleic acid sequence set forth in SEQ ID NOs:7, 9, and 11, provided they do not comprise the sequence set forth in SEQ ID NO: 1, 3 or 5.
[0096]The invention nucleic acids can be produced by a variety of methods well-known in the art, e.g., the methods described herein, employing PCR amplification using oligonucleotide primers from various regions of SEQ ID NOs:7, 9, and 11, and the like.
[0097]In accordance with a further embodiment of the present invention, optionally labeled TRAF-encoding cDNAs, or fragments thereof, can be employed to probe library(ies) (e.g., cDNA, genomic, and the like) for additional nucleic acid sequences encoding novel eukaryotic TPBDs. Construction of suitable eukaryotic cDNA libraries is well-known in the art. Screening of such a cDNA library is initially carried out under low-stringency conditions, which comprise a temperature of less than about 42° C., a formamide concentration of less than about 50%, and a moderate to low salt concentration.
[0098]Presently preferred probe-based screening conditions comprise a temperature of about 37° C., a formamide concentration of about 20%, and a salt concentration of about 5× standard saline citrate (SSC; 20×SSC contains 3M sodium chloride, 0.3M sodium citrate, pH 7.0). Such conditions will allow the identification of sequences which have a substantial degree of similarity with the probe sequence, without requiring perfect homology. The phrase "substantial similarity" refers to sequences which share at least 50% homology. Preferably, hybridization conditions will be selected which allow the identification of sequences having at least 70% homology with the probe, while discriminating against sequences which have a lower degree of homology with the probe. As a result, nucleic acids having substantially the same nucleotide sequence as SEQ ID NOs:1, 3, and 5, are obtained.
[0099]As used herein, a nucleic acid "probe" is single-stranded DNA or RNA, or analogs thereof, that has a sequence of nucleotides that includes at least 14, at least 20, at least 50, at least 100, at least 200, at least 300, at least 400, or at least 500 contiguous bases that are the same as (or the complement of) any contiguous bases set forth in any of SEQ ID NOs:7, 9, and 11. Preferred regions from which to construct probes include 5' and/or 3' coding regions of SEQ ID NOs:7, 9, and 11. In addition, the entire cDNA encoding region of an invention TRAF, or the entire sequence corresponding to SEQ ID NOs:7, 9, and 11, may be used as a probe. Probes may be labeled by methods well-known in the art, as described hereinafter, and used in various diagnostic kits.
[0100]As used herein, the terms "label" and "indicating means" in their various grammatical forms refer to single atoms and molecules that are either directly or indirectly involved in the production of a detectable signal. Any label or indicating means can be linked to invention nucleic acid probes, expressed proteins, polypeptide fragments, or antibody molecules. These atoms or molecules can be used alone or in conjunction with additional reagents. Such labels are themselves well-known in clinical diagnostic chemistry.
[0101]The labeling means can be a fluorescent labeling agent that chemically binds to antibodies or antigens without denaturation to form a fluorochrome (dye) that is a useful immunofluorescent tracer. A description of immunofluorescent analytic techniques is found in DeLuca, "Immunofluorescence Analysis", in Antibody As a Tool, Marchalonis et al., eds., John Wiley & Sons, Ltd., pp. 189-231 (1982), which is incorporated herein by reference.
[0102]In one embodiment, the indicating group is an enzyme, such as horseradish peroxidase (HRP), glucose oxidase, and the like. In another embodiment, radioactive elements are employed labeling agents. The linking of a label to a substrate, i.e., labeling of nucleic acid probes, antibodies, polypeptides, and proteins, is well known in the art. For instance, an invention antibody can be labeled by metabolic incorporation of radiolabeled amino acids provided in the culture medium. See, for example, Galfre et al., Meth. Enzymol., 73:3-46 (1981). Conventional means of protein conjugation or coupling by activated functional groups are particularly applicable. See, for example, Aurameas et al., Scand. J. Immunol., Vol. 8, Suppl. 7:7-23 (1978), Rodwell et al., Biotech., 3:889-894 (1984), and U.S. Pat. No. 4,493,795.
[0103]In another embodiment of the invention, nucleic acids are provided encoding chimeric proteins comprising an invention TPBD, or fragment thereof, having the sequence of SEQ ID NO:19 or SEQ ID NO:20, and further comprising one or more sequences from a heterologous protein. Functional fragments of TPBD include, for example, polypeptides having the sequence SEQ ID NO:8, 10, 12, 23, 24 or 25. Nucleic acids encoding proteins with which the TPBD or functional fragment thereof are fused will also encode, for example, glutathione-S-transferase, an antibody, or other proteins or functional fragments thereof which facilitate recovery of the chimera. Nucleic acids of the invention can also encode proteins with which the TPBD or functional fragment thereof are fused, for example, luciferase, green fluorescent protein, an antibody, or other proteins or functional fragments thereof which facilitate identification of the chimera. Still further nucleic acids of the invention encode proteins with which the TPBD or functional fragment thereof are fused including, for example, the LexA DNA binding domain, ricin, α-sarcin, an antibody, or other proteins which have therapeutic properties or other biological activity.
[0104]The present invention also provides compositions containing an acceptable carrier and any of an isolated, purified TPBD-containing protein or functional polypeptide fragments thereof, alone or in combination with each other. These polypeptides or proteins can be recombinantly derived, chemically synthesized or purified from native sources. As used herein, the term "acceptable carrier" encompasses any of the standard pharmaceutical carriers, such as phosphate buffered saline solution, water and emulsions such as an oil/water or water/oil emulsion, and various types of wetting agents.
[0105]The TPBD compositions described herein can be used, for example, in methods for modulating the activity of members of the TNFR family. TNF family receptor binding is well known in the art as mediating the signal transduction activity of the receptor, and it is demonstrated herein that invention TPBDs can bind TNF receptors. Thus it is within the scope of the present invention that a protein comprising the sequence SEQ ID NO:19 or 20, or a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20, modulates the activity of one or more TNF family receptors.
[0106]In one embodiment, modulation of a member of the TNFR family will comprise the step of contacting a member of the TNFR family with a protein comprising the sequence SEQ ID NO:19 or 20. Preferably, the method comprises contacting a cell with a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0107]In another embodiment, modulation of a member of the TNFR family will comprise the step of contacting a member of the TNFR family with a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20. Preferably, the method comprises contacting a cell with a nucleic acid encoding a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0108]In another embodiment, the TPBD compositions described herein can be used, for example, in methods for modulating the activity of TRAF domain containing proteins. Thus it is within the scope of the present invention that a protein comprising the sequence SEQ ID NO:19 or 20, or a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20, modulates the activity of one or more TRAF domain containing proteins.
[0109]In one embodiment, modulation of a TRAF domain containing protein will comprise the step of contacting a TRAF domain containing protein with a protein comprising the sequence SEQ ID NO:19 or 20. Preferably, the method comprises contacting a cell with a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0110]In another embodiment, modulation of a TRAF domain containing protein will comprise the step of contacting a TRAF domain containing protein with a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20. Preferably, the method comprises contacting a cell with a nucleic acid encoding a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0111]In another embodiment, a TPBD comprising the sequence SEQ ID NO:19 or 20, or a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20, modulates the activity of one or more TRAF-associated proteins. While some TRAF-associated proteins are known to modulate NF-κB activity, others are known to modulate cJun N-terminal kinase (JNK) activity, and still others are known to modulate the activity of other proteins. For example, c-IAP1 and c-IAP2 modulate caspase activity and thus influence apoptosis. Thus it is within the scope of the invention that an invention TRAF domain protein can modulate the activity of TRAF-associated proteins any protein with which TRAF-associated proteins are known to interact.
[0112]In one embodiment, modulation of a TRAF-associated protein will comprise the step of contacting a TRAF-associated protein with a protein comprising the sequence SEQ ID NO:19 or 20. Preferably, the method comprises contacting a cell with a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0113]In another embodiment, modulation of a TRAF-associated protein will comprise the step of contacting a TRAF-associated protein with a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20. Preferably, the method comprises contacting a cell with a nucleic acid encoding a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0114]TPBD compositions can also be used, for example, in methods for modulating the activity of NF-κB and cJun N-terminal kinase (JNK). Proteins homologous to invention TPBDs, for example, human TRAF domain proteins, are well known in the art as modulating the activity of NF-κB and JNK, and it is further shown herein that SEQ ID NOS:8, 10, 12, 23, 24 and 25 can modulate NF-κB activity. Thus, in accordance with another embodiment of the invention, a protein comprising the sequence SEQ ID NO:19 or 20, or a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20, modulates the activity of NF-κB or JNK.
[0115]In one embodiment, modulation of NF-κB or JNK activity will comprise the step of contacting a cell containing NF-κB or JNK activity with a protein comprising the sequence SEQ ID NO:19 or 20. Preferably, the method comprises contacting a cell with a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0116]In another embodiment, modulation of NF-κB or JNK activity will comprise the step of contacting a cell containing NF-κB or JNK activity with a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20. Preferably, the method comprises contacting a cell with a nucleic acid encoding a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0117]The functions of the invention TPBDs support the role of TRAFs in modulating cellular pathways that effect apoptosis, cell proliferation, cell adhesion, cell stress responses and B cell immunoglobulin class switching. Thus, in accordance with another embodiment of the invention, a protein comprising the sequence SEQ ID NO:19 or 20, or a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20, modulates apoptosis, cell proliferation, cell adhesion, cell stress responses or B cell immunoglobulin class switching.
[0118]In one embodiment, modulation of apoptosis, cell proliferation, cell adhesion, cell stress responses or B cell immunoglobulin class switching will comprise the step of contacting a cell with a protein comprising the sequence SEQ ID NO:19 or 20, whereby apoptosis, cell proliferation, cell adhesion, cell stress responses or B cell immunoglobulin class switching is modulated. Preferably, the method comprises contacting a cell with a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0119]In another embodiment, modulation of apoptosis, cell proliferation, cell adhesion, cell stress responses or B cell immunoglobulin class switching will comprise the step of contacting a cell with a nucleic acid encoding a protein comprising the sequence SEQ ID NO:19 or 20, whereby apoptosis, cell proliferation, cell adhesion, cell stress responses or B cell immunoglobulin class switching is modulated. Preferably, the method comprises contacting a cell with a nucleic acid encoding a protein comprising the sequence of SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0120]Also provided are antisense-nucleic acids having a sequence capable of binding specifically with full-length or any portion of an mRNA that encodes TPBD polypeptides so as to prevent translation of the mRNA. The antisense-nucleic acid may have a sequence capable of binding specifically with any portion of the sequence of the cDNA encoding TPBD polypeptides. As used herein, the phrase "binding specifically" encompasses the ability of a nucleic acid sequence to recognize a complementary nucleic acid sequence and to form double-helical segments therewith via the formation of hydrogen bonds between the complementary base pairs. An example of an antisense-nucleic acid is an antisense-nucleic acid comprising chemical analogs of nucleotides.
[0121]Compositions comprising an amount of the antisense-nucleic acid, described above, effective to reduce expression of TPBD polypeptides by passing through a cell membrane and binding specifically with mRNA encoding TPBD polypeptides so as to prevent translation and an acceptable hydrophobic carrier capable of passing through a cell membrane are also provided herein. Suitable hydrophobic carriers are described, for example, in U.S. Pat. Nos. 5,334,761; 4,889,953; 4,897,355, and the like. The acceptable hydrophobic carrier capable of passing through cell membranes may also comprise a structure which binds to a receptor specific for a selected cell type and is thereby taken up by cells of the selected cell type. The structure may be part of a protein known to bind to a cell-type specific receptor.
[0122]Antisense-nucleic acid compositions are useful to inhibit translation of mRNA encoding invention polypeptides. Synthetic oligonucleotides, or other antisense chemical structures are designed to bind to mRNA encoding TPBD polypeptides and inhibit translation of mRNA and are useful as compositions to inhibit expression of TPBD associated genes in a tissue sample or in a subject.
[0123]In accordance with another embodiment of the invention, kits for detecting mutations, duplications, deletions, rearrangements and aneuploidies in TPBD genes comprising at least one invention probe or antisense nucleotide.
[0124]The present invention provides means to modulate levels of expression of TPBD polypeptides by employing synthetic antisense-nucleic acid compositions (hereinafter SANC) which inhibit translation of mRNA encoding these polypeptides. Synthetic oligonucleotides, or other antisense-nucleic acid chemical structures designed to recognize and selectively bind to mRNA, are constructed to be complementary to full-length or portions of a TPBD coding strand, including nucleotide sequences set forth in SEQ ID NOs:7, 9, and 11. The SANC is designed to be stable in the blood stream for administration to a subject by injection, or in laboratory cell culture conditions. The SANC is designed to be capable of passing through the cell membrane in order to enter the cytoplasm of the cell by virtue of physical and chemical properties of the SANC which render it capable of passing through cell membranes, for example, by designing small, hydrophobic SANC chemical structures, or by virtue of specific transport systems in the cell which recognize and transport the SANC into the cell. In addition, the SANC can be designed for administration only to certain selected cell populations by targeting the SANC to be recognized by specific cellular uptake mechanisms which bind and take up the SANC only within select cell populations. In a particular embodiment the SANC is an antisense oligonucleotide.
[0125]For example, the SANC may be designed to bind to a receptor found only in a certain cell type, as discussed supra. The SANC is also designed to recognize and selectively bind to target mRNA sequence, which may correspond to a sequence contained within the sequences shown in SEQ ID NOs:7, 9, and 11. The SANC is designed to inactivate target mRNA sequence by either binding thereto and inducing degradation of the mRNA by, for example, RNase I digestion, or inhibiting translation of mRNA target sequence by interfering with the binding of translation-regulating factors or ribosomes, or inclusion of other chemical structures, such as ribozyme sequences or reactive chemical groups which either degrade or chemically modify the target mRNA. SANCs have been shown to be capable of such properties when directed against mRNA targets (see Cohen et al., TIPS, 10:435 (1989) and Weintraub, Sci. American, January (1990), pp.40; both incorporated herein by reference).
[0126]In accordance with yet another embodiment of the present invention, there is provided a method for the recombinant production of invention TPBDs by expressing the above-described nucleic acid sequences in suitable host cells. Recombinant DNA expression systems that are suitable to produce TPBDs described herein are well-known in the art. For example, the above-described nucleotide sequences can be incorporated into vectors for further manipulation. As used herein, vector (or plasmid) refers to discrete elements that are used to introduce heterologous DNA into cells for either expression or replication thereof.
[0127]Suitable expression vectors are well-known in the art, and include vectors capable of expressing DNA operatively linked to a regulatory sequence, such as a promoter region that is capable of regulating expression of such DNA. Thus, an expression vector refers to a recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant virus or other vector that, upon introduction into an appropriate host cell, results in expression of the inserted DNA. Appropriate expression vectors are well known to those of skill in the art and include those that are replicable in eukaryotic cells and/or prokaryotic cells and those that remain episomal or those which integrate into the host cell genome.
[0128]As used herein, a promoter region refers to a segment of DNA that controls transcription of DNA to which it is operatively linked. The promoter region includes specific sequences that are sufficient for RNA polymerase recognition, binding and transcription initiation. In addition, the promoter region includes sequences that modulate this recognition, binding and transcription initiation activity of RNA polymerase. These sequences may be cis acting or may be responsive to trans acting factors. Promoters, depending upon the nature of the regulation, may be constitutive or regulated. Exemplary promoters contemplated for use in the practice of the present invention include the SV40 early promoter, the cytomegalovirus (CMV) promoter, the mouse mammary tumor virus (MMTV) steroid-inducible promoter, Moloney murine leukemia virus (MMLV) promoter, and the like.
[0129]As used herein, the term "operatively linked" refers to the functional relationship of DNA with regulatory and effector nucleotide sequences, such as promoters, enhancers, transcriptional and translational stop sites, and other signal sequences. For example, operative linkage of DNA to a promoter refers to the physical and functional relationship between the DNA and the promoter such that the transcription of such DNA is initiated from the promoter by an RNA polymerase that specifically recognizes, binds to and transcribes the DNA.
[0130]As used herein, expression refers to the process by which polynucleic acids are transcribed into mRNA and translated into peptides, polypeptides, or proteins. If the polynucleic acid is derived from genomic DNA, expression may, if an appropriate eukaryotic host cell or organism is selected, include splicing of the mRNA.
[0131]Prokaryotic transformation vectors are well-known in the art and include pBlueskript and phage Lambda ZAP vectors (Stratagene, La Jolla, Calif.), and the like. Other suitable vectors and promoters are disclosed in detail in U.S. Pat. No. 4,798,885, issued Jan. 17, 1989, the disclosure of which is incorporated herein by reference in its entirety.
[0132]Other suitable vectors for transformation of E. coli cells include the pET expression vectors (Novagen, see U.S. Pat. No. 4,952,496), e.g., pET11a, which contains the T7 promoter, T7 terminator, the inducible E. coli lac operator, and the lac repressor gene; and pET 12a-c, which contain the T7 promoter, T7 terminator, and the E. coli ompT secretion signal. Another suitable vector is the pIN-IIIompA2 (see Duffaud et al., Meth. in Enzymology, 153:492-507, 1987), which contains the 1 pp promoter, the lacUV5 promoter operator, the ompA secretion signal, and the lac repressor gene.
[0133]Exemplary, eukaryotic transformation vectors, include the cloned bovine papilloma virus genome, the cloned genomes of the murine retroviruses, and eukaryotic cassettes, such as the pSV-2 gpt system (described by Mulligan and Berg, Nature Vol. 277:108-114 (1979)) the Okayama-Berg cloning system (Mol. Cell Biol. 2:161-170 (1982)), and the expression cloning vector described by Genetics Institute (Wong et al., Science 228:810-815 (1985)), are available which provide substantial assurance of at least some expression of the protein of interest in the transformed eukaryotic cell line.
[0134]Particularly preferred base vectors which contain regulatory elements that can be linked to the invention TPBD-encoding DNAs for transfection of mammalian cells are cytomegalovirus (CMV) promoter-based vectors such as pcDNA1 (Invitrogen, San Diego, Calif.), MMTV promoter-based vectors such as pMAMNeo (Clontech, Palo Alto, Calif,.) and pMSG (Pharmacia, Piscataway, N.J.), and SV40 promoter-based vectors such as pSVβ (Clontech, Palo Alto, Calif.).
[0135]In accordance with another embodiment of the present invention, there are provided "recombinant cells" containing the nucleic acid molecules (i.e., DNA or mRNA) of the present invention. Methods of transforming suitable host cells, preferably bacterial cells, and more preferably E. coli cells, as well as methods applicable for culturing said cells containing a gene encoding a heterologous protein, are generally known in the art. See, for example, Sambrook et al., Molecular Cloning: A Laboratory Manual (2 ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., USA (1989).
[0136]Exemplary methods of transformation include, e.g., transformation employing plasmids, viral, or bacterial phage vectors, transfection, electroporation, lipofection, and the like. The heterologous DNA can optionally include sequences which allow for its extrachromosomal maintenance, or said heterologous DNA can be caused to integrate into the genome of the host (as an alternative means to ensure stable maintenance in the host).
[0137]Host organisms contemplated for use in the practice of the present invention include those organisms in which recombinant production of heterologous proteins has been carried out. Examples of such host organisms include bacteria (e.g., E. coli), yeast (e.g., Saccharomyces cerevisiae, Candida tropicalis, Hansenula polymorpha and P. pastoris; see, e.g., U.S. Pat. Nos. 4,882,279, 4,837,148, 4,929,555 and 4,855,231), mammalian cells (e.g., HEK293, CHO and Ltk.sup.- cells), insect cells, and the like. Presently preferred host organisms are bacteria. The most preferred bacteria is E. coli.
[0138]In one embodiment, nucleic acids encoding the invention TPBDs can be delivered into mammalian cells, either in vivo or in vitro using suitable viral vectors well-known in the art. Suitable retroviral vectors, designed specifically for "gene therapy" methods, are described, for example, in WIPO publications WO 9205266 and WO 9214829, which provide a description of methods for efficiently introducing nucleic acids into human cells. In addition, where it is desirable to limit or reduce the in vivo expression of the invention TRAF, the introduction of the antisense strand of the invention nucleic acid is contemplated.
[0139]Viral based systems provide the advantage of being able to introduce relatively high levels of the heterologous nucleic acid into a variety of cells. Suitable viral vectors for introducing invention nucleic acid encoding an TPBD into mammalian cells (e.g., vascular tissue segments) are well known in the art. These viral vectors include, for example, Herpes simplex virus vectors (e.g., Geller et al., Science, 241:1667-1669 (1988)), Vaccinia virus vectors (e.g., Piccini et al., Meth. in Enzymology, 153:545-563 (1987); Cytomegalovirus vectors (Mocarski et al., in Viral Vectors, Y. Gluzman and S. H. Hughes, Eds., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1988, pp. 78-84), Moloney murine leukemia virus vectors (Danos et al., PNAS, USA, 85:6469 (1980)), adenovirus vectors (e.g., Logan et al., PNAS, USA, 81:3655-3659 (1984); Jones et al., Cell, 17:683-689 (1979); Berkner, Biotechniques, 6:616-626 (1988); Cotten et al., PNAS, USA, 89:6094-6098 (1992); Graham et al., Meth. Mol. Biol., 7:109-127 (1991)), adeno-associated virus vectors, retrovirus vectors (see, e.g., U.S. Pat. Nos. 4,405,712 and 4,650,764), and the like. Especially preferred viral vectors are the adenovirus and retroviral vectors.
[0140]For example, in one embodiment of the present invention, adenovirus-transferrin/polylysine-DNA (TfAdpl-DNA) vector complexes (Wagner et al., PNAS, USA, 89:6099-6103 (1992); Curiel et al., Hum. Gene Ther., 3:147-154 (1992); Gao et al., Hum. Gene Ther., 4:14-24 (1993)) are employed to transduce mammalian cells with heterologous TPBD nucleic acid. Any of the plasmid expression vectors described herein may be employed in a TfAdpl-DNA complex.
[0141]As used herein, "retroviral vector" refers to the well-known gene transfer plasmids that have an expression cassette encoding an heterologous gene residing between two retroviral LTRs. Retroviral vectors typically contain appropriate packaging signals that enable the retroviral vector, or RNA transcribed using the retroviral vector as a template, to be packaged into a viral virion in an appropriate packaging cell line (see, e.g., U.S. Pat. No. 4,650,764).
[0142]Suitable retroviral vectors for use herein are described, for example, in U.S. Pat. No. 5,252,479, and in WIPO publications WO 92/07573, WO 90/06997, WO 89/05345, WO 92/05266 and WO 92/14829, incorporated herein by reference, which provide a description of methods for efficiently introducing nucleic acids into human cells using such retroviral vectors. Other retroviral vectors include, for example, the mouse mammary tumor virus vectors (e.g., Shackleford et al., PNAS, USA, 85:9655-9659 (1988)), and the like.
[0143]In accordance with yet another embodiment of the present invention, there are provided anti-TPBD antibodies having specific reactivity with one or more TPBD polypeptides of the present invention. Active fragments of antibodies are encompassed within the definition of "antibody". Invention antibodies can be produced by methods known in the art using invention polypeptides, proteins or portions thereof as antigens. For example, polyclonal and monoclonal antibodies can be produced by methods well known in the art, as described, for example, in Harlow and Lane, Antibodies: A Laboratory Manual (Cold Spring Harbor Laboratory (1988)), which is incorporated herein by reference. Invention polypeptides can be used as immunogens in generating such antibodies. Alternatively, synthetic peptides can be prepared (using commercially available synthesizers) and used as immunogens. Amino acid sequences can be analyzed by methods well known in the art to determine whether they encode hydrophobic or hydrophilic domains of the corresponding polypeptide. Altered antibodies such as chimeric, humanized, CDR-grafted or bifunctional antibodies can also be produced by methods well known in the art. Such antibodies can also be produced by hybridoma, chemical synthesis or recombinant methods described, for example, in Sambrook et al., supra., and Harlow and Lane, supra. Both anti-peptide and anti-fusion protein antibodies can be used. (see, for example, Bahouth et al., Trends Pharmacol. Sci. 12:338 (1991); Ausubel et al., Current Protocols in Molecular Biology (John Wiley and Sons, NY (1989) which are incorporated herein by reference).
[0144]Antibody so produced can be used, inter alia, in diagnostic methods and systems to detect the level of TPBD present in a mammalian, preferably human, body sample, such as tissue or vascular fluid. Such antibodies can also be used for the immunoaffinity or affinity chromatography purification of the invention TRAF. In addition, methods are contemplated herein for detecting the presence of an invention TPBD either within a cell, or on the surface of a cell, comprising contacting the cell with an antibody that specifically binds to TPBD polypeptides, under conditions permitting binding of the antibody to the TPBD polypeptides, detecting the presence of the antibody bound to the TPBD polypeptide, and thereby detecting the presence of invention polypeptides on the surface of the cell. With respect to the detection of such polypeptides, the antibodies can be used for in vitro diagnostic or in vivo imaging methods.
[0145]Immunological procedures useful for in vitro detection of target TPBD polypeptides in a sample include immunoassays that employ a detectable antibody. Such immunoassays include, for example, ELISA, Pandex microfluorimetric assay, agglutination assays, flow cytometry, serum diagnostic assays and immunohistochemical staining procedures which are well known in the art. An antibody can be made detectable by various means well known in the art. For example, a detectable marker can be directly or indirectly attached to the antibody. Useful markers include, for example, radionucleotides, enzymes, fluorogens, chromogens and chemiluminescent labels.
[0146]Invention anti-TPBD antibodies are contemplated for use herein to modulate the activity of the TPBD polypeptide in living animals, in humans, or in biological tissues or fluids isolated therefrom. The term "modulate" refers to a compound's ability to increase (e.g., via an agonist), decrease (e.g., via an antagonist), or otherwise modify (e.g., increasing a first TPBD activity while decreasing a second TPBD activity) the biological activity of an invention TPBD protein, such as TNFR family-binding, TRAF protein binding activity, TRAF-associated protein binding activity, NF-κB modulating activity, JNK modulating activity, apoptosis modulating activity, cell proliferation modulating activity, cell adhesion, cell stress responses modulating activity, or B cell immunoglobulin class switching modulating activity, and the like. Accordingly, compositions comprising a carrier and an amount of an antibody having specificity for TPBD polypeptides effective to block naturally occurring ligands or other TPBD-associated proteins, and the like, from binding to invention TPBD polypeptides are contemplated herein. For example, a monoclonal antibody directed to an epitope of an invention TPBD polypeptide including an amino acid sequence set forth in SEQ ID NOs:8, 10, 12, 23, 24 and 25 can be useful for this purpose.
[0147]The present invention further provides transgenic non-human mammals that are capable of expressing exogenous nucleic acids encoding TPBDs. As employed herein, the phrase "exogenous nucleic acid" refers to nucleic acid sequence which is not native to the host, or which is present in the host in other than its native environment (e.g., as part of a genetically engineered DNA construct). In addition to naturally occurring levels of TPBD-containing proteins, invention TPBDs can either be overexpressed or underexpressed (such as in the well-known knock-out transgenics) in transgenic mammals.
[0148]Also provided are transgenic non-human mammals capable of expressing nucleic acids encoding TPBD polypeptides so mutated as to be incapable of normal activity, i.e., do not express native TPBD. The present invention also provides transgenic non-human mammals having a genome comprising antisense nucleic acids complementary to nucleic acids encoding TPBD polypeptides, placed so as to be transcribed into antisense mRNA complementary to mRNA encoding TPBD polypeptides, which hybridizes to the mRNA and, thereby, reduces the translation thereof. The nucleic acid may additionally comprise an inducible promoter and/or tissue specific regulatory elements, so that expression can be induced, or restricted to specific cell types. Examples of nucleic acids are DNA or cDNA having a coding sequence substantially the same as the coding sequence shown in SEQ ID NOs:1, 3, and 5. An example of a non-human transgenic mammal is a transgenic mouse. Examples of tissue specificity-determining elements are the metallothionein promoter and the L7 promoter.
[0149]Animal model systems which elucidate the physiological and behavioral roles of TPBD polypeptides are also provided, and are produced by creating transgenic animals in which the expression of the TPBD polypeptide is altered using a variety of techniques. Examples of such techniques include the insertion of normal or mutant versions of nucleic acids encoding an TRAF polypeptide by microinjection, retroviral infection or other means well known to those skilled in the art, into appropriate fertilized embryos to produce a transgenic animal. (See, for example, Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual (Cold Spring Harbor Laboratory, (1986)).
[0150]Also contemplated herein, is the use of homologous recombination of mutant or normal versions of TPBD genes with the native gene locus in transgenic animals, to alter the regulation of expression or the structure of TPBD polypeptides (see, Capecchi et al., Science 244:1288 (1989); Zimmer et al., Nature 338:150 (1989); which are incorporated herein by reference). Homologous recombination techniques are well known in the art. Homologous recombination replaces the native (endogenous) gene with a recombinant or mutated gene to produce an animal that cannot express native (endogenous) protein but can express, for example, a mutated protein which results in altered expression of TPBD polypeptides.
[0151]In contrast to homologous recombination, microinjection adds genes to the host genome, without removing host genes. Microinjection can produce a transgenic animal that is capable of expressing both endogenous and exogenous TPBDs. Inducible promoters can be linked to the coding region of nucleic acids to provide a means to regulate expression of the transgene. Tissue specific regulatory elements can be linked to the coding region to permit tissue-specific expression of the transgene. Transgenic animal model systems are useful for in vivo screening of compounds for identification of specific ligands, i.e., agonists and antagonists, which activate or inhibit protein responses.
[0152]Invention nucleic acids, oligonucleotides (including antisense), vectors containing same, transformed host cells, polypeptides and combinations thereof, as well as antibodies of the present invention, can be used to screen compounds in vitro to determine whether a compound functions as a potential agonist or antagonist to invention TPBDs. These in vitro screening assays provide information regarding the function and activity of invention TPBDs, which can lead to the identification and design of compounds that are capable of specific interaction with one or more types of polypeptides, peptides or proteins.
[0153]An invention TPBD, as used herein, contains the sequence E(X)17-21L(X)2W(X)3VXP(X)15-16L(X)24-28K(X)15-16- W (SEQ ID NO:19), where X is any amino acid. Alternatively, an invention TPBD is characterized as having the sequence: LXWX(X')XVXP (SEQ ID NO:20) where X is any amino acid and X' is selected from L and I. Preferably, an invention TPBD has the sequence: E(X)10-13S(X)6-7LXW(X)3VXP(x)10-11S(X)4L(X)24-28K(X)9-10F(X)5WG(X)3F(X)16D(X)5-7V (SEQ ID NO:21), where X is any amino acid. More preferably, an invention TPBD has the following sequence: E(X)4(XA)(X)5-8SX(XB)(X)4-5LXWX(XA)XVXP(X).- sub.10-11S(XA)(X)3L(X)16-18(XA)(X)4-6(XC)(X)- 9-10F(X)5WG(XA)(X)2F(X)5(XA)X(XC)(X)7(X- C) DX(XA)(X)2-4(XC)V (SEQ ID NO:22), where X is any amino acid, XA is selected from V, L and I; XB is selected from P and G; XC is selected from D, E, N and Q; and XD is selected from Y and F. Most preferably, an invention TPBD comprises the sequence SEQ ID NO:8, 10, 12, 23, 24 or 25.
[0154]By the known homology of invention TPBDs to known TRAF domains, it is within the scope of the invention that invention TPBD also have a role in celluar pathways that effect apoptosis, cell proliferation, cell adhesion, cell stress responses and B cell immunoglobulin class switching. Thus, invention TPBDs also provide drug discovery targets for a broad variety of pathologies including autoimmunity, inflammation, allergy, allograph-rejection and sepsis, and for a broad variety of cancer pathologies, such as, gliomas, carcinomas, sarcomas, melanomas, hamartomas and the like. In certain aspects of the invention, invention TRAF proteins, agonist or antagonists thereto, are used to treat autoimmunity, inflammation, allergy, allograph-rejection, sepsis, keratinocyte hyperplasia, neoplasia, keloid, benign prostatic hypertrophy, inflammatory hyperplasia, fibrosis, smooth muscle cell proliferation in arteries following balloon angioplasty (restenosis), and the like. Exemplary cancer pathologies contemplated herein for treatment include, gliomas, carcinomas, adenocarcinomas, sarcomas, melanomas, hamartomas, leukemias, lymphomas, and the like.
[0155]Also provided herein are methods of treating pathologies, said method comprising administering an effective amount of an invention therapeutic composition. Such compositions are typically administered in a physiologically compatible composition.
[0156]Methods of treating pathologies of abnormal cell proliferation will include methods of modulating the activity of one or more oncogenic proteins, wherein the oncogenic proteins specifically interact with a TPBD. Methods of modulating the activity of such oncogenic proteins will include contacting the oncogenic protein with a substantially pure TPBD or an active fragment (i.e., oncogenic protein-binding fragment) thereof. This contacting will modulate the activity of the oncogenic protein, thereby providing a method of treating a pathology caused by the oncogenic protein. Further methods of modulating the activity of oncogenic proteins will include contacting the oncogenic protein with an agent, wherein the agent modulates the interactions between the TPBD and the oncogenic protein.
[0157]Methods of treating immune-based pathologies such as autoimmunity, inflammation, allergy, allograph-rejection, and sepsis will include modulating the activity of one or more proteins that modulate immune response, wherein the protein that modulates immune response specifically interact with a TPBD. Methods of modulating the activity of such protein that modulates immune response will include contacting the protein that modulates immune response with a substantially pure TPBD or an active fragment (i.e., protein-binding fragment) thereof. This contacting will modulate the activity of the protein that modulates immune response, thereby providing a method of treating a pathology caused by the protein that modulates immune response. Further methods of modulating the activity of a protein that modulates immune response will include contacting the protein that modulates immune response with an agent, wherein the agent modulates the interactions between the TPBD and the protein that modulates immune response.
[0158]Also contemplated herein, are therapeutic methods using invention pharmaceutical compositions for the treatment of pathological disorders in which there is too little cell division, such as, for example, bone marrow aplasias, immunodeficiencies due to a decreased number of lymphocytes, and the like. Methods of treating a variety of inflammatory diseases with invention therapeutic compositions are also contemplated herein, such as treatment of sepsis, fibrosis (e.g., scarring), arthritis, graft versus host disease, and the like.
[0159]The present invention also provides therapeutic compositions useful for practicing the therapeutic methods described herein. Therapeutic compositions of the present invention, such as pharmaceutical compositions, contain a physiologically compatible carrier together with an invention TPBD (or functional fragment thereof), a TPBD modulating agent, such as a compound (agonist or antagonist) identified by the methods described herein, or an anti-TPBD antibody, as described herein, dissolved or dispersed therein as an active ingredient. In a preferred embodiment, the therapeutic composition is not immunogenic when administered to a mammal or human patient for therapeutic purposes.
[0160]As used herein, the terms "pharmaceutically acceptable", "physiologically compatible" and grammatical variations thereof, as they refer to compositions, carriers, diluents and reagents, are used interchangeably and represent that the materials are capable of administration to a mammal without the production of undesirable physiological effects such as nausea, dizziness, gastric upset, and the like.
[0161]The preparation of a pharmacological composition that contains active ingredients dissolved or dispersed therein is well known in the art. Typically such compositions are prepared as injectables either as liquid solutions or suspensions; however, solid forms suitable for solution, or suspension, in liquid prior to use can also be prepared. The preparation can also be emulsified.
[0162]The active ingredient can be mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient in amounts suitable for use in the therapeutic methods described herein. Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like, as well as combinations of any two or more thereof. In addition, if desired, the composition can contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, and the like, which enhance the effectiveness of the active ingredient.
[0163]The therapeutic composition of the present invention can include pharmaceutically acceptable salts of the components therein. Pharmaceutically acceptable nontoxic salts include the acid addition salts (formed with the free amino groups of the polypeptide) that are formed with inorganic acids such as, for example, hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, phosphoric acid, acetic acid, propionic acid, glycolic acid, lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, fumaric acid, anthranilic acid, cinnamic acid, naphthalene sulfonic acid, sulfanilic acid, and the like.
[0164]Salts formed with the free carboxyl groups can also be derived from inorganic bases such as, for example, sodium hydroxide, ammonium hydroxide, potassium hydroxide, and the like; and organic bases such as mono-, di-, and tri-alkyl and -aryl amines (e.g., triethylamine, diisopropyl amine, methyl amine, dimethyl amine, and the like) and optionally substituted ethanolamines (e.g., ethanolamine, diethanolamine, and the like).
[0165]Physiologically tolerable carriers are well known in the art. Exemplary liquid carriers are sterile aqueous solutions that contain no materials in addition to the active ingredients and water, or contain a buffer such as sodium phosphate at physiological pH, physiological saline or both, such as phosphate-buffered saline. Still further, aqueous carriers can contain more than one buffer salt, as well as salts such as sodium and potassium chlorides, dextrose, polyethylene glycol and other solutes.
[0166]Liquid compositions can also contain liquid phases in addition to and to the exclusion of water. Exemplary additional liquid phases include glycerin, vegetable oils such as cottonseed oil, and water-oil emulsions.
[0167]As described herein, an "effective amount" is a predetermined amount calculated to achieve the desired therapeutic effect, e.g., to modulate activity of an invention TPBD. The required dosage will vary with the particular treatment and with the duration of desired treatment; however, it is anticipated that dosages between about 10 micrograms and about 1 milligram per kilogram of body weight per day will be used for therapeutic treatment. It may be particularly advantageous to administer such compounds in depot or long-lasting form as discussed hereinafter. A therapeutically effective amount is typically an amount of an TPBD-modulating agent or compound identified herein that, when administered in a physiologically acceptable composition, is sufficient to achieve a plasma concentration of from about 0.1 μg/ml to about 100 μg/ml, preferably from about 1.0 μg/ml to about 50 μg/ml, more preferably at least about 2 μg/ml and usually 5 to 10 μg/ml. Therapeutic invention anti-TPBD antibodies can be administered in proportionately appropriate amounts in accordance with known practices in this art.
[0168]In accordance with still another embodiment of the present invention, there are provided methods for identifying compounds which bind to TPBD polypeptides. The invention proteins may be employed in a competitive binding assay. Such an assay can accommodate the rapid screening of a large number of compounds to determine which compounds, if any, are capable of binding to TPBDs. Subsequently, more detailed assays can be carried out with those compounds found to bind, to further determine whether such compounds act as modulators, agonists or antagonists of invention TPBDs. Compounds that bind to and/or modulate invention TPBDs can be used to treat a variety of pathologies mediated by invention TPBDs.
[0169]In another embodiment of the invention, there is provided a bioassay for identifying compounds which modulate the activity of invention TPBD polypeptides. Invention TPBD polypeptides are known to influence the activities of, for example, NF-κB. Further, homologous TRAF polypeptides are known to influence the activities of, for example, NF-κB and cJun N-terminal kinase (JNK). Thus a reporter gene construct to assay for NF-κB or JNK activity can be used to test invention TPBD activity (see Examples). According to this method, invention TPBD polypeptides are contacted with an "unknown" or test substance, the activity of the invention TPBD polypeptide is monitored subsequent to the contact with the "unknown" or test substance, and those substances which effect a resultant modulation of, for example, NF-κB activity or JNK activity are identified as functional ligands for TPBD polypeptides.
[0170]Alternative bioassays for identifying compounds which modulate the activity of invention TPBD polypeptides can be used which routinely are used to test for protein:protein interactions. Such bioassays include yeast two-hybrid assays, glutathione-S-transferase fusion protein binding assays, co-immmunoprecipitation assays, and the like. Such assays are well known in the art and can be found in standard reference texts such as Sambrook et al., supra., and Current Protocols in Molecular Biology, supra.
[0171]In accordance with another embodiment of the present invention, transformed host cells that recombinantly express invention polypeptides can be contacted with a test compound, and the modulating effect(s) thereof can then be evaluated by comparing the TPBD-mediated response (e.g., via reporter gene expression) in the presence and absence of test compound, or by comparing the response of test cells or control cells (i.e., cells that do not express TPBD polypeptides), to the presence of the compound.
[0172]As used herein, a compound or a signal that "modulates the activity" of invention TPBD polypeptides refers to a compound or a signal that alters the activity of TRAF polypeptides so that the activity of the invention polypeptide is different in the presence of the compound or signal than in the absence of the compound or signal. In particular, such compounds or signals include agonists and antagonists. An agonist encompasses a compound or a signal that activates TPBD protein expression. Alternatively, an antagonist includes a compound or signal that interferes with TPBD expression. Typically, the effect of an antagonist is observed as a blocking of agonist-induced protein activation. Antagonists include competitive and non-competitive antagonists. A competitive antagonist (or competitive blocker) interacts with or near the site specific for agonist binding. A non-competitive antagonist or blocker inactivates the function of the polypeptide by interacting with a site other than the agonist interaction site.
[0173]As understood by those of skill in the art, assay methods for identifying compounds that modulate TPBD activity generally require comparison to a control. One type of a "control" is a cell or culture that is treated substantially the same as the test cell or test culture exposed to the compound, with the distinction that the "control" cell or culture is not exposed to the compound. For example, in methods that use voltage clamp electrophysiological procedures, the same cell can be tested in the presence or absence of compound, by merely changing the external solution bathing the cell. Another type of "control" cell or culture may be a cell or culture that is identical to the transfected cells, with the exception that the "control" cell or culture do not express native proteins. Accordingly, the response of the transfected cell to compound is compared to the response (or lack thereof) of the "control" cell or culture to the same compound under the same reaction conditions.
[0174]In yet another embodiment of the present invention, the activation of TPBD polypeptides can be modulated by contacting the polypeptides with an effective amount of at least one compound identified by the above-described bioassays.
[0175]In accordance with another embodiment of the present invention, there are provided methods for diagnosing cancer, said method comprising:
[0176]detecting, in said subject, a defective sequence or mutant of SEQ ID NOs:7, 9, and 11.
[0177]In accordance with another embodiment of the present invention, there are provided diagnostic systems, preferably in kit form, comprising at least one invention nucleic acid in a suitable packaging material. The diagnostic nucleic acids are derived from the TPBD-encoding nucleic acids described herein. In one embodiment, for example, the diagnostic nucleic acids are derived from any of SEQ ID NOs:7, 9, and 11. Invention diagnostic systems are useful for assaying for the presence or absence of nucleic acid encoding TPBD in either genomic DNA or in transcribed nucleic acid (such as mRNA or cDNA) encoding TRAF.
[0178]A suitable diagnostic system includes at least one invention nucleic acid, preferably two or more invention nucleic acids, as a separately packaged chemical reagent(s) in an amount sufficient for at least one assay. Instructions for use of the packaged reagent are also typically included. Those of skill in the art can readily incorporate invention nucleic probes and/or primers into kit form in combination with appropriate buffers and solutions for the practice of the invention methods as described herein.
[0179]As employed herein, the phrase "packaging material" refers to one or more physical structures used to house the contents of the kit, such as invention nucleic acid probes or primers, and the like. The packaging material is constructed by well known methods, preferably to provide a sterile, contaminant-free environment. The packaging material has a label which indicates that the invention nucleic acids can be used for detecting a particular sequence encoding TPBD including the nucleotide sequences set forth in SEQ ID NOs:7, 9, and 11 or mutations or deletions therein, thereby diagnosing the presence of, or a predisposition for, cancer. In addition, the packaging material contains instructions indicating how the materials within the kit are employed both to detect a particular sequence and diagnose the presence of, or a predisposition for, cancer.
[0180]The packaging materials employed herein in relation to diagnostic systems are those customarily utilized in nucleic acid-based diagnostic systems. As used herein, the term "package" refers to a solid matrix or material such as glass, plastic, paper, foil, and the like, capable of holding within fixed limits an isolated nucleic acid, oligonucleotide, or primer of the present invention. Thus, for example, a package can be a glass vial used to contain milligram quantities of a contemplated nucleic acid, oligonucleotide or primer, or it can be a microtiter plate well to which microgram quantities of a contemplated nucleic acid probe have been operatively affixed.
[0181]"Instructions for use" typically include a tangible expression describing the reagent concentration or at least one assay method parameter, such as the relative amounts of reagent and sample to be admixed, maintenance time periods for reagent/sample admixtures, temperature, buffer conditions, and the like.
[0182]All U.S. patents and all publications mentioned herein are incorporated in their entirety by reference thereto. The invention will now be described in greater detail by reference to the following non-limiting examples.
Examples
I. Isolation of the TPBDs of TRAF7, HAUSP and SPOP
[0183]Nucleic acids encoding the TRAF protein binding domain fragments (TPBDs) of TRAF7 (residues 282-435 of SEQ ID NO:6), HAUSP (residues 1-213 of SEQ ID NO:2) and SPOP (residues 1-180 of SEQ ID NO:4) were isolated by polymerase chain reaction (PCR) using 2 μg of cDNA from Jurkat cells (The method for RNA extraction and cDNA synthesis was provided by the manufacturer (Pharmingen). The PCR was performed for 1 cycle of 94° C. for 2 min followed by 35 cycles of 94° C. for 15 s, 60° C. for 20 s and 72° C. for 100 s, followed by a final cycle of 72° C. for 5 min, using the following primers HAUSP: 5' GCGAATTCCAGGCCGCG 3' (SEQ ID NO:13) and 5' TTCCTCGAGCCGACTTAGCCTGTGTGC 3' (SEQ ID NO:14); SPOP: 5' CTTCGAATTCGCGATGTCAAGGGTTCC 3' (SEQ ID NO:15) and 5' CCATGCTCGAGGTATTCTAGCCAGAAATG 3' (SEQ ID NO:16); TRAF 7: 5' CCAGAATTCACCAGTGAATTAGTGCC 3' (SEQ ID NO:17) and 5' CCACTCGAGTAATGTACCAATGCTAGTCC 3' (SEQ ID NO:18). The amplified fragments were purified, digested with EcoR1 and Xho 1 restriction enzymes and subcloned into pCDNA-3-myc tag and into pGEX4T (Pharmacia).
II. Expression of TPBCs TRAF7, USP7 and SPOP
[0184]To determine mRNA expression of genes encoding various TRAF domain proteins, Northern blot analysis of the mRNA levels of TRAF7, USP7 and SPOP was performed in human tissues. 32P-labeled cDNA fragments of TRAF7, USP7 or SPOP corresponding to the TRAF domains were sequentially hybridized, in the order of TRAF7, USP7 and SPOP, to filter-immobilized poly-A.sup.+ RNA from various tissues (1 μg/lane) (Clontech; Palo Alto Calif.). The tissues tested were brain, heart, skeletal muscle, colon, thymus, spleen, kidney, liver, small intestine, placenta, lung and peripheral blood leukocytes (PBL). Hybridized fragments were visualized by autoradiography. The same RNA blot was finally hybridized with a human β-actin cDNA probe to controll for RNA loading (bottom panel).
[0185]As shown in FIG. 4, the TRAF7 probe hybridized to mRNA of about 5 kb. TRAF7 mRNA was most highly expressed in brain, heart, skeletal muscle, kidney, liver and placenta. TRAF7 was expressed at a lower level in thymus, small intestine, lung, spleen, PBL and colon.
[0186]USP7, also known as HAUSP, was expressed as a doublet of about 7 and 5.5 kb. USP7 was most highly expressed in heart, skeletal muscle, and kidney, with lower expression in brain, thymus, liver, placenta and PBLs, and lowest expression, barely detectable, in colon, spleen, small intestine and lung.
[0187]SPOP was expressed as mRNA of about 2.6 kb. SPOP was most highly expressed in heart and skeletal muscle, with lower expression in kidney and liver, and even lower expression in brain, skeletal muscle, colon, thymus, small intestine, placenta, lung, and PBLs.
III. Binding to the TNF-Receptor Family
[0188]For GST-fusion protein production, pGEX-plasmids were transformed into competent XL-1 blue bacteria cells and grown in LB medium and induced at an A600=1.0 with 1 mM isopropyl-1-thio-β-D-galactopyranoside for 4 h at 25° C. Cells were then recovered and resuspended in PBS containing 1 mM dithiothreitol, 1 mM phenyl-methyl-sulphonyl-fluoride and 100 μg/ml lysozyme and lysed by sonication. The GST-TRAF2 (263-501), (GST-HAUSP (SEQ ID NO: 8), GST-SPOP (SEQ ID NO: 10) and GST-TRAF7 (SEQ ID NO: 12) proteins were purified from bacterial lysates by affinity chromatography using glutathione-Sepharose (Amersham Pharmacia Biotech). The resins were then washed with PBS containing 1mM dithiothreitol until the OD280 nm reached <0.01.
[0189]Plasmids pGEX-TRAF2(263-501), pGEX-CD40(ct), pGEX-Fas(ct), pGEX-LT13R(ct), pGEX-DR4(ct) and pGEX-NGFR(ct) have been previously described (Leo et al., 1999, JBC 274, 22414; Sato et al., 1995, FEBS lett.358 113; Crowe et al., J Exp Med 1995,181,1205; McFarlane et al., J Biol Chem 1997, 272,25417; Rabizadeh et al., Proc Natl Acad Sci USA 1994, 91,10703). Plasmid pGEX-TNF-R2(ct) was kindly provided by Dr. C. Ware (La Jolla Institute for Alergy and Immunology).
[0190]In vitro GST-protein binding assays were performed as previously described (Hanada, M., Aime-Sempe, C., Sato, T. and Reed, J C. 1995, JBC 270, 11962-11968; Sato, T., Irie, S., Kitada, S. and Reed, J C. 1995, Science 268, 411-415; Takayama, S., Sato, T., Krajewski, S., Kochel, C., Irie, S., Millan, J. and Reed, J C. 1995, Cell 80, 279-284, Leo et al., 1999, JBC 274, 22414). Briefly, (35S)-methionine labeled GST-fused TNF family receptors FAS(ct), TNF-R2(ct), CD40(ct), LTβR(ct), NGFRp75(ct) and DR4(ct) were produced by in vitro translation using the TnT coupled reticulocyte system following the manufacturer indications (Promega Inc.). Equal amounts of each labeled protein (2-6 μl lysate) were then diluted with 250 μl of binding buffer (142 mM KCl, 5 mM MgCl12, 10 mM Hepes pH 7.4, 0.2% Nonidet-P40, 0.5 mM dithiothreitol, 1 mM EGTA, 0.5 mM phenyl-methyl-sulphonyl-fluoride and a mixture of other protease inhibitors (Boehringer 1697498) and incubated with the GST-protein resins (0.25 μg protein) at 4° C. for 2 h. The resins were then extensively washed with binding buffer and the GST-protein binding complexes were eluted with buffer containing 50 mM Tris-HCl pH 8, 1 mM dithiothreitol and 100 mM glutathione and analyzed by SDS-PAGE and fluorography.
[0191]The results of the binding assay are shown in FIG. 5. In vitro protein binding assays were performed by in vitro translating TRAF2 or the TRAF domain of TRAF7 in the presence of 35S-L-methionine in reticulocyte lysates, as described above, or following in vivo overexpression of TRAF2 or the TRAF domain of TRAF7 by transfection of 293T cells. Equal volumes of the in vitro translation mixtures (10 μl) were incubated with GST-fusion proteins containing the cytosolic domains of different members of the TNF-R family (1 μg)(FAS, TNF-R2, CD40, LTβR, NGFRp75, and DR4) immobilized on glutathione-Sepharose. For TRAF domain protein expression in vivo, 293T lysates (50 μl) were incubated with GST-fusion proteins (1 μg) containing the cytosolic domains of the TNF-R family members DR4, HVEM, TNF-R2, LTβR, NGFRp75, CD40, and TRAF7 and then immobilized on glutathione-Sepharose. Control GST and other GST control proteins were included in all assays After washing, bound proteins were analyzed by SDS-PAGE followed by fluorography for binding assays using in vitro translated proteins. For transfected 293 cells, samples were immunoblotted using anti-TRFA2 or anti-Myc antibodies to detect TRAF2 or TRAF7 TD binding, respectively. Bound proteins were then detected by standard chemiluminescence assay (ECL; Amersham; Piscataway N.J.).
[0192]As shown in FIG. 5, three TNF family receptors interacted with the GST-fused TRAF domain of TRAF2: TNF-R2, CD40 and LTβR. However, FAS, NGFRp75 and DR4 did not appreciably interact with the TRAF domain of TRAF2. In contrast, the GST-SPOP fusion protein showed significant interaction with only one of these six receptors, TNF-R2; and GST-HAUSP did not significantly interact with any of the six receptors. In contrast, GST-TRAF7 binds TNF-R2, CD40 (albeit weakly), LTβR, NRFRp75 and DR4. This result shows that invention TPBDs SPOP (SEQ ID NO: 10) and HAUSP (SEQ ID NO: 8) can demonstrate higher selectivity in receptor binding than the TRAF domain of TRAF2 and the invention TPBD of TRAF7 (SEQ ID NO:12).
[0193]In cell lysates of 293 cells transfected with TRAF2 or the TRAF domain of TRAF7, binding was observed with several TNF-R family members. TRAF2 interacted with HVEM, TNF-R2, LTβR, CD40 and TRAF7, with low binding activity also observed with NGFRp75. The TRAF domain of TRAF7 interacted with DR4, LTβR, and NGFRp75, with strongest binding observed with TRAF7. A faint band of TRAF7 was also observed to interact with TNF-R2 under these conditions, but no binding of TRAF7 to CD40 or HVEM was detected. Therefore, TRAF7 has strong binding activity with itself.
IV. Binding to TRAF Family Proteins
[0194]The above described GST-HAUSP and GST-SPOP, and GST-TRAF7 were produced and purified using the above described procedure.
[0195]Plasmids containing cDNAs encompasing the complete open reading frames of hTRAF 1 (pSG5-TRAF 1), hTRAF 2 (pcDNA3-HA-TRAF2), hTRAF3 (pbluscriptKS-TRAF3b), hTRAF4 (pcDNA3-HA-TRAF4), hTRAF5 (pcDNA3-Flag-TRAF5) and hTRAF6 (pcDNA3-myc-TRAF6) have been previously described (Sato et al., 1995, FEBS lett.358, 113; Nakano et al., 1996, JBC 271, 14661; Mosialos et al., 1995, Cell 80, 389; Krajewska et al., 1998, Am. J. Pathol. 152, 1549; Leo et al., 1999, JBC 274, 22414; Song and Donner.1995, Biochem. J. 309, 825; Rothe et al., 1995, Cell 78, 281).
[0196]In vitro translated TRAF1, TRAF2, TRAF3, TRAF4, TRAF5, TRAF6, full-length I-TRAF (Rothe et al., Proc. Natl. Acad. Sci. USA 93:8241-8246 (1996)), HAUSP (1-213, SEQ ID NO:8), SPOP (1-180, SEQ ID NO:10) and TRAF7 (282-435, SEQ ID NO:12) were incubated with the GST-protein resins using the previously described method.
[0197]The resultant images showed that the GST-fused TPBD of TRAF7 (SEQ ID NO: 12) interacted with all previously identified TRAF proteins, TRAFS1-6, and with I-TRAF (see FIG. 6). Additionally, the TPBD of TRAF7 showed an ability to self-associate. The GST-fused TPBD of HAUSP (SEQ ID NO: 8) also was able to interact with all previously known TRAF proteins, although somewhat weakly with TRAF3, and weakly with I-TRAF. However, HAUSP did not demonstrate an ability to self-associate. In contrast, the GST-fused TPBD of SPOP (SEQ ID NO: 10) demonstrated a higher selectivity, having significant interactions with only TRAF1 and TRAF6, and no ability to self-associate. Interestingly, no heterologous interaction between invention TPBDs (i.e., HAUSP:SPOP, HAUSP:TRAF7 and SPOP:TRAF7) were observed.
V. Reporter Gene Assays
[0198]293T cells were obtained from ATCC (Rockville, Mass.) and cultured in Dulbecco modified Eagle's-high glucose medium (Life technology, Inc) supplemented with 10% FCS (Hyclone, Utah), 1 mM glutamine and antibiotics. The promoter-containing reporter gene plasmids pUC13-4xNFkB-luc (containing 4 tandem HIV-NFkB response elements and the minimal fos promoter) and pCMV-β-galactosidase has been previously described (Miyashita and Reed, 1995, Cell 80, 293 ; Lin and Stavnezer, 1996, MCB 16, 4591).
[0199]For NF-κB reporter gene assays, 293T cells were calcium phosphate-transfected with 12 μg DNA, including 3.5 μg of either pcDNA3-myc-hTRAF2, pcDNA3-myc-hTRAF5 or pcDNA3-myc-hTRAF6, in combination with either 3.5 or 7 μg of control plasmid, pcDNA3-myc-HAUSP (1-213, SEQ ID NO: 8), pcDNA3-myc-SPOP (1-180, SEQ ID NO: 10) or pcDNA3-myc-TRAF7 (282-435, SEQ ID NO: 12) at 60% confluency in 6-well plates in duplicate, except in the case of TRAF5 in which only 7 μg of pcDNA3-myc-HAUSP or pcDNA3-myc-TRAF7 were transfected. In the case of SPOP, only 3.5 μg of either pcDNA3-myc-hTRAF6 was used. Also transfected were a total of 0.5 μg pUC13-4xNFκB-luc plasmid and 1 μg pCMV-β-galactosidase plasmid. After 36 h, cells were lysed with 0.5 ml of Promega lysis buffer. The luciferase activity from 10 μl of each cell lysate was determined using the Luciferase assay system from Promega, following the manufacturer protocol, and read using a luminometer (EG&G Berthold). The luciferase activity was normalized relative to B-galactosidase activity (Miyashita and Reed, 1995, Cell 80, 293).
[0200]Immunobloting was performed as described (Krajewski et al., 1996, Anal. Biochem. 236, 221). Briefly, 5 μl of each cell lysate was analyzed by immunoblotting using anti-TRAF6 or anti-myc antibodies and standard chemiluminiscence methods (Amersham) for detection.
[0201]The results show that TPBDs of HAUSP (SEQ ID NO: 8) and SPOP (SEQ ID NO: 10) and TRAF 7 (SEQ ID NO: 12)can inhibit NF-κB activitation induced by TRAF6 (see FIG. 7). In particular, the TPBD of HAUSP is capable of canceling nearly all of the TRAF6-induced increase in NF-κB activitation. Further, the TPBD of HAUSP also strongly inhibits NF-κB activation induced by TRAF 2 and TRAF 5. The TPBD of TRAF 7 also demonstrates the ability to inhibit TRAF 2-mediated NF-κB activation, but actually increases TRAF 5 mediated NF-κB activation.
[0202]The activity of various domains of the TRAF proteins USP7 and TRAF7 were further characterized with respect to regulating NFκB activity. The effect of TRAF7 and USP7 in regulating the NFκB activity induced by TRAF2 and TRAF6 in a mammalian cell line was tested. Briefly, 293T cells were transfected in 6 well plates with approximately 10 μg of pcDNA3 control plasmid, 3.5 μg of either pcDNA3-myc-hTRAF6 or pcDNA3-hTRAF2, and 7 μg of any of several TRAF7 or USP7 plasmid constructs containing various domains (see FIG. 8, bottom). The transfections also included 0.5 μg pUC13-4xNFκB-luc plasmid and 1 μg pCMV-β-galactosidase plasmid. Relative NFκB activity was assessed by luciferase assays using 10 μof each of the cell lysates prepared 36 h after transfection, with normalization for β-galactosidase activity. At least three to eight independent experiments were performed. The results are presented as fold of activation, with one representative experiment shown (FIG. 8).
[0203]As shown in FIG. 8, full length USP7 and a USP7 construct lacking the TRAF domain had a slight inhibitory effect on NFκB activity induced by TRAF2 or TRAF6. The USP7 TRAF domain alone strongly inhibited the TRAF2 and TRAF6-induced increase in NFκB activation. In contrast to USP7, full length TRAF7, as well as the TRAF domain of TRAF7 alone, had inhibitory activity for TRAF2 and TRAF6-induced increase in NFκB activation. A TRAF7 mutant construct lacking the TRAF domain did not show any effect on inhibiting the NFκB activation mediated by TRAF2 or TRAF6.
[0204]The effect of the TRAF domains of TRAF7, USP7 and SPOP in regulating the NFkB activity induced by TNFα or CD40 overexpression was also tested. 293T cells were transfected in 6 well plates with approximately 10 μg of pcDNA3 control plasmid, 7 μg of either TRAF7 TD, USP7 TD or SPOP TD plasmids, and either no plasmid (TNFα in FIG. 9) or 3.5 μg of pcDNA3-myc-CD40 (CD40 in FIG. 9), together with 0.5 μg pUC13-4xNFkB-luc plasmid and 1 μg pCMV-β-galactosidase plasmid. TNFαCtreatment was performed by adding 100 ng/ml of TNFα to the cells 12 h before harvesting. Relative NFκB activity was assessed by luciferase assays using 10 μl of each of the cell lysates prepared 36 h after transfection, with normalization for β-galactosidase activity. At least three independent experiments were performed. The results of a representative experiment are presented as fold of activation.
[0205]As shown in FIG. 9, the TRAF domain of TRAF7 had some inhibitory activity for activation of NFκB induced by both TNFα and CD40 overexpression. The TRAF domain of USP7 had even stronger inhibitory activity than the TRAF domain of TRAF7. In contrast, the TRAF domain of SPOP showed essentially no inhibitory activity of NFκB activation by either TNFα or CD40 overexpression.
[0206]Regulation by TRAF7 and USP7 of NFκB activation induced by the TRAF-associated protein NIK was also examined. 293T cells were transfected in 6 well plates with approximately 10 μg of pcDNA3 control plasmid, 7 μg of either empty pcDNA3, or pcDNA3-myc-containing TRAF7, TRAF7 TD, USP7 TD or SPOP TD plasmids, and 3.5 μg of pcDNA3-Flag-NIK, together with 0.5 μg pUC13-4xNFκB-luc plasmid and 1 μg pCMV-β-galactosidase plasmid. Relative NFκB activity was assessed by luciferase assays using 10 μl of each of the cell lysates prepared 36 h after transfection, with normalization for β-galactosidase activity. At least three independent experiments were performed. The results of a representative experiment are presented as fold of activation (FIG. 10).
[0207]As shown in FIG. 10, NIK induced NFκB activation can be partially regulated by TRAF7 and USP7. Both TRAF7 and the TRAF domain of TRAF7 partially inhibited NIK-induced NFκB activation. USP7 TRAF domain also partially inhibited NIK induced NFκB activation, although the full length USP7 did not inhibit NIK-induced NFκB activation (FIG. 10)
VI. Subcellular Localization of Domains of TPBDs
[0208]To test the function of various domains in subcellular localization, TRAF7 and USP7 deletion mutants were constructed. For TRAF7 deletion mutants, Cos7 cells were transfected with Lipofectamin plus (Life Technologies; Rockville Md.) and a total of 3 μg of plasmid. The TRAF7 deletion mutants (shown in FIG. 11) were constructed as Myc fusion proteins. 24 hours after transfection, cells were plated onto poly-lysinated cover-glasses and allowed to settle for 24 h more. Cells were fixed with methanol-acetone (50% each). After blocking, cells were stained with anti-myc mAb (Santa Cruz Biotechnology; Santa Cruz Calif.) and a secondary FITC-labelled rabbit anti-mouse Ig (Dako; Carpinteria Calif.). Analysis was performed using a Bio-Rad confocal microscope (BioRad; Hercules Calif.).
[0209]FIG. 11 shows the subcellular localization of TRAF7 and different deletion mutants of the molecule. A scheme of the different TRAF7 mutants analyzed is also shown. TRAF7 localizes in cytosolic corpuscles, which can appear as individual dots (panel A) or as large aggregates (A, small panel). Deletion mutants of TRAF7 lacking the RING finger domain (B), containing only the tripartite domain (C), lacking the poly-acidic region (D), or lacking the TRAF domain (E) have a similar subcellular localization as the full length TRAF7. In contrast, the TRAF domain alone (F) or the TRAF domain containing also the leucine zippers (G) did not show this particular subcellular localization, having a diffuse cytosolic localization. The C-terminal region of the molecule containing the TRAF domain and the poly-acidic region (H) or the poly-acidic region alone (I) have a similar diffuse cytosolic localization, although in this case they appear to be also located in the nucleus. These results indicate that the region of TRAF7 which is determinant for its subcellular localization can be mapped to the Z-b box and the first coiled coil.
[0210]For characterizing the localization of USP7 deletion mutants, Cos7 cells were transfected with Lipofectamin plus (Life Technologies) and a total of 3 μg of plasmid. USP7 full length and deletion mutants were constructed as Myc fusion proteins. The USP7 deletion mutants tested were a mutant lacking the TRAF domain and a mutant containing USP7 TRAF domain. 24 hours after transfection, cells were plated onto poly-lysinated cover-glasses and allowed to settle for 24 h more. Cells were fixed with methanol-acetone (50% each). After blocking, cells were stained with anti-myc mAb (Santa Cruz Biothech.) and a secondary FITC-labelled rabbit anti-mouse Ig (Dako). Analysis was performed using a Bio-Rad confocal microscope. Two representative examples of each construct are shown.
[0211]FIG. 12 shows the subcellular localization of USP7 full length (A and B), a deletion mutant of USP7 lacking the TRAF domain (C and D) and USP7 TRAF domain (E and F). USP7 is mainly located in the nucleus, although some cytosolic staining can also be observed in some cells. The deletion mutant lacking the TRAF domain is completely excluded from the nucleus, even when it still contains the nuclear localization signal. Finally, the TRAF domain of USP7 appears to have a diffuse localization, with a larger concentration in the perinuclear region. These results indicate that the TRAF domain of USP7 is necessary, but not sufficient, to target this molecule to the nucleus.
[0212]Although the invention has been described with reference to the examples above, it should be understood that various modifications can be made without departing from the spirit of the invention. Accordingly, the invention is limited only by the following claims.
Sequence CWU
1
3214022DNAHomo sapiensCDS(200)..(3505) 1gtacgtgcgc gtctccctgc cgccgccgcc
gcccgccgcg ggccgccccg gggccgccgt 60cgccgacgac gcgcgggagg aggaggagga
ggccgccccg ccgccgccgc cgccgccgcc 120gccccggctc gccgccgccc gcccgccggg
ctcgcagccc cggcccccgg ccgcaggcga 180ggcccaggcc gcggccgac atg aac cac
cag cag cag cag cag cag cag aaa 232 Met Asn His
Gln Gln Gln Gln Gln Gln Gln Lys 1 5
10gcg ggc gag cag cag ttg agc gag ccc gag gac atg gag atg
gaa gcg 280Ala Gly Glu Gln Gln Leu Ser Glu Pro Glu Asp Met Glu Met
Glu Ala 15 20 25gga gat aca
gat gac cca cca aga att act cag aac cct gtg atc aat 328Gly Asp Thr
Asp Asp Pro Pro Arg Ile Thr Gln Asn Pro Val Ile Asn 30
35 40ggg aat gtg gcc ctg agt gat gga cac aac acc
gcg gag gag gac atg 376Gly Asn Val Ala Leu Ser Asp Gly His Asn Thr
Ala Glu Glu Asp Met 45 50 55gag gat
gac acc agt tgg cgc tcc gag gca acc ttt cag ttc act gtg 424Glu Asp
Asp Thr Ser Trp Arg Ser Glu Ala Thr Phe Gln Phe Thr Val60
65 70 75gag cgc ttc agc aga ctg agt
gag tcg gtc ctt agc cct ccg tgt ttt 472Glu Arg Phe Ser Arg Leu Ser
Glu Ser Val Leu Ser Pro Pro Cys Phe 80 85
90gtg cga aat ctg cca tgg aag att atg gtg atg cca cgc
ttt tat cca 520Val Arg Asn Leu Pro Trp Lys Ile Met Val Met Pro Arg
Phe Tyr Pro 95 100 105gac aga
cca cac caa aaa agc gta gga ttc ttt ctc cag tgc aat gct 568Asp Arg
Pro His Gln Lys Ser Val Gly Phe Phe Leu Gln Cys Asn Ala 110
115 120gaa tct gat tcc acg tca tgg tct tgc cat
gca caa gca gtg ctg aag 616Glu Ser Asp Ser Thr Ser Trp Ser Cys His
Ala Gln Ala Val Leu Lys 125 130 135ata
ata aat tac aga gat gat gaa aag tcg ttc agt cgt cgt att agt 664Ile
Ile Asn Tyr Arg Asp Asp Glu Lys Ser Phe Ser Arg Arg Ile Ser140
145 150 155cat ttg ttc ttc cat aaa
gaa aat gat tgg gga ttt tcc aat ttt atg 712His Leu Phe Phe His Lys
Glu Asn Asp Trp Gly Phe Ser Asn Phe Met 160
165 170gcc tgg agt gaa gtg acc gat cct gag aaa gga ttt
ata gat gat gac 760Ala Trp Ser Glu Val Thr Asp Pro Glu Lys Gly Phe
Ile Asp Asp Asp 175 180 185aaa
gtt acc ttt gaa gtc ttt gta cag gcg gat gct ccc cat gga gtt 808Lys
Val Thr Phe Glu Val Phe Val Gln Ala Asp Ala Pro His Gly Val 190
195 200gcg tgg gat tca aag aag cac aca ggc
tac gtc ggc tta aag aat cag 856Ala Trp Asp Ser Lys Lys His Thr Gly
Tyr Val Gly Leu Lys Asn Gln 205 210
215gga gcg act tgt tac atg aac agc ctg cta cag acg tta ttt ttc acg
904Gly Ala Thr Cys Tyr Met Asn Ser Leu Leu Gln Thr Leu Phe Phe Thr220
225 230 235aat cag cta cga
aag gct gtg tac atg atg cca acc gag ggg gat gat 952Asn Gln Leu Arg
Lys Ala Val Tyr Met Met Pro Thr Glu Gly Asp Asp 240
245 250tcg tct aaa agc gtc cct tta gca tta caa
aga gtg ttc tat gaa tta 1000Ser Ser Lys Ser Val Pro Leu Ala Leu Gln
Arg Val Phe Tyr Glu Leu 255 260
265cag cat agt gat aaa cct gta gga aca aaa aag tta aca aag tca ttt
1048Gln His Ser Asp Lys Pro Val Gly Thr Lys Lys Leu Thr Lys Ser Phe
270 275 280ggg tgg gaa act tta gat agc
ttc atg caa cat gat gtt cag gag ctt 1096Gly Trp Glu Thr Leu Asp Ser
Phe Met Gln His Asp Val Gln Glu Leu 285 290
295tgt cga gtg ttg ctc gat aat gtg gaa aat aag atg aaa ggc acc tgt
1144Cys Arg Val Leu Leu Asp Asn Val Glu Asn Lys Met Lys Gly Thr Cys300
305 310 315gta gag ggc acc
ata ccc aaa tta ttc cgc ggc aaa atg gtg tcc tat 1192Val Glu Gly Thr
Ile Pro Lys Leu Phe Arg Gly Lys Met Val Ser Tyr 320
325 330atc cag tgt aaa gaa gta gac tat cgg tct
gat aga aga gaa gat tat 1240Ile Gln Cys Lys Glu Val Asp Tyr Arg Ser
Asp Arg Arg Glu Asp Tyr 335 340
345tat gat atc cag cta agt atc aaa gga aag aaa aat ata ttt gaa tca
1288Tyr Asp Ile Gln Leu Ser Ile Lys Gly Lys Lys Asn Ile Phe Glu Ser
350 355 360ttt gtg gat tat gtg gca gta
gaa cag ctc gat ggg gac aat aaa tac 1336Phe Val Asp Tyr Val Ala Val
Glu Gln Leu Asp Gly Asp Asn Lys Tyr 365 370
375gac gct ggg gaa cat ggc tta cag gaa gca gag aaa ggt gtg aaa ttc
1384Asp Ala Gly Glu His Gly Leu Gln Glu Ala Glu Lys Gly Val Lys Phe380
385 390 395cta aca ttg cca
cca gtg tta cat cta caa ctg atg aga ttt atg tat 1432Leu Thr Leu Pro
Pro Val Leu His Leu Gln Leu Met Arg Phe Met Tyr 400
405 410gac cct cag acg gac caa aat atc aag atc
aat gat agg ttt gaa ttc 1480Asp Pro Gln Thr Asp Gln Asn Ile Lys Ile
Asn Asp Arg Phe Glu Phe 415 420
425cca gag cag tta cca ctt gat gaa ttt ttg caa aaa aca gat cct aag
1528Pro Glu Gln Leu Pro Leu Asp Glu Phe Leu Gln Lys Thr Asp Pro Lys
430 435 440gac cct gca aat tat att ctt
cat gca gtc ctg gtt cat agt gga gat 1576Asp Pro Ala Asn Tyr Ile Leu
His Ala Val Leu Val His Ser Gly Asp 445 450
455aat cat ggt gga cat tat gtg gtt tat cta aac ccc aaa ggg gat ggc
1624Asn His Gly Gly His Tyr Val Val Tyr Leu Asn Pro Lys Gly Asp Gly460
465 470 475aaa tgg tgt aaa
ttt gat gac gac gtg gtg tca agg tgt act aaa gag 1672Lys Trp Cys Lys
Phe Asp Asp Asp Val Val Ser Arg Cys Thr Lys Glu 480
485 490gaa gca att gag cac aat tat ggg ggt cac
gat gac gac ctg tct gtt 1720Glu Ala Ile Glu His Asn Tyr Gly Gly His
Asp Asp Asp Leu Ser Val 495 500
505cga cac tgc act aat gct tac atg tta gtc tac atc agg gaa tca aaa
1768Arg His Cys Thr Asn Ala Tyr Met Leu Val Tyr Ile Arg Glu Ser Lys
510 515 520ctg agt gaa gtt tta cag gcg
gtc acc gac cat gat att cct cag cag 1816Leu Ser Glu Val Leu Gln Ala
Val Thr Asp His Asp Ile Pro Gln Gln 525 530
535ttg gtg gag cga tta caa gaa gag aaa agg atc gag gct cag aag cgg
1864Leu Val Glu Arg Leu Gln Glu Glu Lys Arg Ile Glu Ala Gln Lys Arg540
545 550 555aag gag cgg cag
gaa gcc cat ctc tat atg caa gtg cag ata gtc gca 1912Lys Glu Arg Gln
Glu Ala His Leu Tyr Met Gln Val Gln Ile Val Ala 560
565 570gag gac cag ttt tgt ggc cac caa ggg aat
gac atg tac gat gaa gaa 1960Glu Asp Gln Phe Cys Gly His Gln Gly Asn
Asp Met Tyr Asp Glu Glu 575 580
585aaa gtg aaa tac act gtg ttc aaa gta ttg aag aac tcc tcg ctt gct
2008Lys Val Lys Tyr Thr Val Phe Lys Val Leu Lys Asn Ser Ser Leu Ala
590 595 600gag ttt gtt cag agc ctc tct
cag acc atg gga ttt cca caa gat caa 2056Glu Phe Val Gln Ser Leu Ser
Gln Thr Met Gly Phe Pro Gln Asp Gln 605 610
615att cga ttg tgg ccc atg caa gca agg agt aat gga aca aaa cga cca
2104Ile Arg Leu Trp Pro Met Gln Ala Arg Ser Asn Gly Thr Lys Arg Pro620
625 630 635gca atg tta gat
aat gaa gcc gac ggc aat aaa aca atg att gag ctc 2152Ala Met Leu Asp
Asn Glu Ala Asp Gly Asn Lys Thr Met Ile Glu Leu 640
645 650agt gat aat gaa aac cct tgg aca ata ttc
ctg gaa aca gtt gat ccc 2200Ser Asp Asn Glu Asn Pro Trp Thr Ile Phe
Leu Glu Thr Val Asp Pro 655 660
665gag ctg gct gct agt gga gcg acc tta ccc aag ttt gat aaa gat cat
2248Glu Leu Ala Ala Ser Gly Ala Thr Leu Pro Lys Phe Asp Lys Asp His
670 675 680gat gta atg tta ttt ttg aag
atg tat gat ccc aaa acg cgg agc ttg 2296Asp Val Met Leu Phe Leu Lys
Met Tyr Asp Pro Lys Thr Arg Ser Leu 685 690
695aat tac tgt ggg cat atc tac aca cca ata tcc tgt aaa ata cgt gac
2344Asn Tyr Cys Gly His Ile Tyr Thr Pro Ile Ser Cys Lys Ile Arg Asp700
705 710 715ttg ctc cca gtt
atg tgt gac aga gca gga ttt att caa gat act agc 2392Leu Leu Pro Val
Met Cys Asp Arg Ala Gly Phe Ile Gln Asp Thr Ser 720
725 730ctt atc ctc tat gag gaa gtt aaa ccg aat
tta aca gag aga att cag 2440Leu Ile Leu Tyr Glu Glu Val Lys Pro Asn
Leu Thr Glu Arg Ile Gln 735 740
745gac tat gac gtg tct ctt gat aaa gcc ctt gat gaa cta atg gat ggt
2488Asp Tyr Asp Val Ser Leu Asp Lys Ala Leu Asp Glu Leu Met Asp Gly
750 755 760gac atc ata gta ttt cag aag
gat gac cct gaa aat gat aac agt gaa 2536Asp Ile Ile Val Phe Gln Lys
Asp Asp Pro Glu Asn Asp Asn Ser Glu 765 770
775tta ccc acc gca aag gag tat ttc cga gat ctc tac cac cgc gtt gat
2584Leu Pro Thr Ala Lys Glu Tyr Phe Arg Asp Leu Tyr His Arg Val Asp780
785 790 795gtc att ttc tgt
gat aaa aca atc cct aat gat cct gga ttt gtg gtt 2632Val Ile Phe Cys
Asp Lys Thr Ile Pro Asn Asp Pro Gly Phe Val Val 800
805 810acg tta tca aat aga atg aat tat ttt cag
gtt gca aag aca gtt gca 2680Thr Leu Ser Asn Arg Met Asn Tyr Phe Gln
Val Ala Lys Thr Val Ala 815 820
825cag agg ctc aac aca gat cca atg ttg ctg cag ttt ttc aag tct caa
2728Gln Arg Leu Asn Thr Asp Pro Met Leu Leu Gln Phe Phe Lys Ser Gln
830 835 840ggt tat agg gat ggc cca ggt
aat cct ctt aga cat aat tat gaa ggt 2776Gly Tyr Arg Asp Gly Pro Gly
Asn Pro Leu Arg His Asn Tyr Glu Gly 845 850
855act tta aga gat ctt cta cag ttc ttc aag cct aga caa cct aag aaa
2824Thr Leu Arg Asp Leu Leu Gln Phe Phe Lys Pro Arg Gln Pro Lys Lys860
865 870 875ctt tac tat cag
cag ctt aag atg aaa atc aca gac ttt gag aac agg 2872Leu Tyr Tyr Gln
Gln Leu Lys Met Lys Ile Thr Asp Phe Glu Asn Arg 880
885 890cga agt ttt aaa tgt ata tgg tta aac agc
caa ttt agg gaa gag gaa 2920Arg Ser Phe Lys Cys Ile Trp Leu Asn Ser
Gln Phe Arg Glu Glu Glu 895 900
905ata aca cta tat cca gac aag cat ggg tgt gtc cgg gac ctg tta gaa
2968Ile Thr Leu Tyr Pro Asp Lys His Gly Cys Val Arg Asp Leu Leu Glu
910 915 920gaa tgt aaa aag gcc gtg gag
ctt ggg gag aaa gca tca ggg aaa ctt 3016Glu Cys Lys Lys Ala Val Glu
Leu Gly Glu Lys Ala Ser Gly Lys Leu 925 930
935agg ctg cta gaa att gta agc tac aaa atc att ggt gtt cat caa gaa
3064Arg Leu Leu Glu Ile Val Ser Tyr Lys Ile Ile Gly Val His Gln Glu940
945 950 955gat gaa cta tta
gaa tgt tta tct cct gca acg agc cgg acg ttt cga 3112Asp Glu Leu Leu
Glu Cys Leu Ser Pro Ala Thr Ser Arg Thr Phe Arg 960
965 970ata gag gaa atc cct ttg gac cag gtg gac
ata gac aaa gag aat gag 3160Ile Glu Glu Ile Pro Leu Asp Gln Val Asp
Ile Asp Lys Glu Asn Glu 975 980
985atg ctt gtc aca gtg gcg cat ttc cac aaa gag gtc ttc gga acg ttc
3208Met Leu Val Thr Val Ala His Phe His Lys Glu Val Phe Gly Thr Phe
990 995 1000gga atc ccg ttt ttg ctg agg
ata cac cag ggc gag cat ttt cga gaa 3256Gly Ile Pro Phe Leu Leu Arg
Ile His Gln Gly Glu His Phe Arg Glu 1005 1010
1015gtg atg aag cga atc cag agc ctg ctg gac atc cag gag aag gag ttt
3304Val Met Lys Arg Ile Gln Ser Leu Leu Asp Ile Gln Glu Lys Glu
Phe1020 1025 1030 1035gag
aag ttt aaa ttt gca att gta atg acg ggc cga cac cag tac ata 3352Glu
Lys Phe Lys Phe Ala Ile Val Met Thr Gly Arg His Gln Tyr Ile
1040 1045 1050aat gaa gac gag tat gaa gta
aat ttg aaa gac ttt gag cca cag ccc 3400Asn Glu Asp Glu Tyr Glu Val
Asn Leu Lys Asp Phe Glu Pro Gln Pro 1055 1060
1065ggt aat atg tct cat cct cgg cct tgg cta ggg ctc gac cac
ttc aac 3448Gly Asn Met Ser His Pro Arg Pro Trp Leu Gly Leu Asp His
Phe Asn 1070 1075 1080aaa gcc cca
aag agg agt cgc tac act tac ctt gaa aag gcc att aaa 3496Lys Ala Pro
Lys Arg Ser Arg Tyr Thr Tyr Leu Glu Lys Ala Ile Lys 1085
1090 1095atc cat aac tgatttccaa gctggtgtgt tcaaggcgag
gacggtgtgt 3545Ile His Asn1100gggtggcccc ttaacagcct
agaactttgg tgcacgtgcc ctctagccga agtcttcagc 3605aagaggattc gctgctggtg
ttaattttat tttattgagg ctgttcagtt tggcttctct 3665gtatctattg actgcccttt
ttgagcaaaa tgaagatgtt tttataaagc ttggatgcca 3725atgagagtta ttttatggta
accacagtgc aaggcaactg tcagcgcaat gggggagaag 3785aggttagtgg atcgggggtc
cctggctcaa ggtctctggg ctgtccctag tgggcacgag 3845tggctcggct gccttcctgg
ggtcccgtgc accagccctg cagctagcaa gtcttgtgtt 3905taggctcgtc tgacctattt
ccttcagtta tactttcaat gaccttttgt gcatctgtta 3965aggcaaaaca gagaaactca
caacctaata aatagcgctc ttcccttcaa aaaaaaa 402221102PRTHomo sapiens
2Met Asn His Gln Gln Gln Gln Gln Gln Gln Lys Ala Gly Glu Gln Gln1
5 10 15Leu Ser Glu Pro Glu Asp
Met Glu Met Glu Ala Gly Asp Thr Asp Asp 20 25
30Pro Pro Arg Ile Thr Gln Asn Pro Val Ile Asn Gly Asn
Val Ala Leu 35 40 45Ser Asp Gly
His Asn Thr Ala Glu Glu Asp Met Glu Asp Asp Thr Ser 50
55 60Trp Arg Ser Glu Ala Thr Phe Gln Phe Thr Val Glu
Arg Phe Ser Arg65 70 75
80Leu Ser Glu Ser Val Leu Ser Pro Pro Cys Phe Val Arg Asn Leu Pro
85 90 95Trp Lys Ile Met Val Met
Pro Arg Phe Tyr Pro Asp Arg Pro His Gln 100
105 110Lys Ser Val Gly Phe Phe Leu Gln Cys Asn Ala Glu
Ser Asp Ser Thr 115 120 125Ser Trp
Ser Cys His Ala Gln Ala Val Leu Lys Ile Ile Asn Tyr Arg 130
135 140Asp Asp Glu Lys Ser Phe Ser Arg Arg Ile Ser
His Leu Phe Phe His145 150 155
160Lys Glu Asn Asp Trp Gly Phe Ser Asn Phe Met Ala Trp Ser Glu Val
165 170 175Thr Asp Pro Glu
Lys Gly Phe Ile Asp Asp Asp Lys Val Thr Phe Glu 180
185 190Val Phe Val Gln Ala Asp Ala Pro His Gly Val
Ala Trp Asp Ser Lys 195 200 205Lys
His Thr Gly Tyr Val Gly Leu Lys Asn Gln Gly Ala Thr Cys Tyr 210
215 220Met Asn Ser Leu Leu Gln Thr Leu Phe Phe
Thr Asn Gln Leu Arg Lys225 230 235
240Ala Val Tyr Met Met Pro Thr Glu Gly Asp Asp Ser Ser Lys Ser
Val 245 250 255Pro Leu Ala
Leu Gln Arg Val Phe Tyr Glu Leu Gln His Ser Asp Lys 260
265 270Pro Val Gly Thr Lys Lys Leu Thr Lys Ser
Phe Gly Trp Glu Thr Leu 275 280
285Asp Ser Phe Met Gln His Asp Val Gln Glu Leu Cys Arg Val Leu Leu 290
295 300Asp Asn Val Glu Asn Lys Met Lys
Gly Thr Cys Val Glu Gly Thr Ile305 310
315 320Pro Lys Leu Phe Arg Gly Lys Met Val Ser Tyr Ile
Gln Cys Lys Glu 325 330
335Val Asp Tyr Arg Ser Asp Arg Arg Glu Asp Tyr Tyr Asp Ile Gln Leu
340 345 350Ser Ile Lys Gly Lys Lys
Asn Ile Phe Glu Ser Phe Val Asp Tyr Val 355 360
365Ala Val Glu Gln Leu Asp Gly Asp Asn Lys Tyr Asp Ala Gly
Glu His 370 375 380Gly Leu Gln Glu Ala
Glu Lys Gly Val Lys Phe Leu Thr Leu Pro Pro385 390
395 400Val Leu His Leu Gln Leu Met Arg Phe Met
Tyr Asp Pro Gln Thr Asp 405 410
415Gln Asn Ile Lys Ile Asn Asp Arg Phe Glu Phe Pro Glu Gln Leu Pro
420 425 430Leu Asp Glu Phe Leu
Gln Lys Thr Asp Pro Lys Asp Pro Ala Asn Tyr 435
440 445Ile Leu His Ala Val Leu Val His Ser Gly Asp Asn
His Gly Gly His 450 455 460Tyr Val Val
Tyr Leu Asn Pro Lys Gly Asp Gly Lys Trp Cys Lys Phe465
470 475 480Asp Asp Asp Val Val Ser Arg
Cys Thr Lys Glu Glu Ala Ile Glu His 485
490 495Asn Tyr Gly Gly His Asp Asp Asp Leu Ser Val Arg
His Cys Thr Asn 500 505 510Ala
Tyr Met Leu Val Tyr Ile Arg Glu Ser Lys Leu Ser Glu Val Leu 515
520 525Gln Ala Val Thr Asp His Asp Ile Pro
Gln Gln Leu Val Glu Arg Leu 530 535
540Gln Glu Glu Lys Arg Ile Glu Ala Gln Lys Arg Lys Glu Arg Gln Glu545
550 555 560Ala His Leu Tyr
Met Gln Val Gln Ile Val Ala Glu Asp Gln Phe Cys 565
570 575Gly His Gln Gly Asn Asp Met Tyr Asp Glu
Glu Lys Val Lys Tyr Thr 580 585
590Val Phe Lys Val Leu Lys Asn Ser Ser Leu Ala Glu Phe Val Gln Ser
595 600 605Leu Ser Gln Thr Met Gly Phe
Pro Gln Asp Gln Ile Arg Leu Trp Pro 610 615
620Met Gln Ala Arg Ser Asn Gly Thr Lys Arg Pro Ala Met Leu Asp
Asn625 630 635 640Glu Ala
Asp Gly Asn Lys Thr Met Ile Glu Leu Ser Asp Asn Glu Asn
645 650 655Pro Trp Thr Ile Phe Leu Glu
Thr Val Asp Pro Glu Leu Ala Ala Ser 660 665
670Gly Ala Thr Leu Pro Lys Phe Asp Lys Asp His Asp Val Met
Leu Phe 675 680 685Leu Lys Met Tyr
Asp Pro Lys Thr Arg Ser Leu Asn Tyr Cys Gly His 690
695 700Ile Tyr Thr Pro Ile Ser Cys Lys Ile Arg Asp Leu
Leu Pro Val Met705 710 715
720Cys Asp Arg Ala Gly Phe Ile Gln Asp Thr Ser Leu Ile Leu Tyr Glu
725 730 735Glu Val Lys Pro Asn
Leu Thr Glu Arg Ile Gln Asp Tyr Asp Val Ser 740
745 750Leu Asp Lys Ala Leu Asp Glu Leu Met Asp Gly Asp
Ile Ile Val Phe 755 760 765Gln Lys
Asp Asp Pro Glu Asn Asp Asn Ser Glu Leu Pro Thr Ala Lys 770
775 780Glu Tyr Phe Arg Asp Leu Tyr His Arg Val Asp
Val Ile Phe Cys Asp785 790 795
800Lys Thr Ile Pro Asn Asp Pro Gly Phe Val Val Thr Leu Ser Asn Arg
805 810 815Met Asn Tyr Phe
Gln Val Ala Lys Thr Val Ala Gln Arg Leu Asn Thr 820
825 830Asp Pro Met Leu Leu Gln Phe Phe Lys Ser Gln
Gly Tyr Arg Asp Gly 835 840 845Pro
Gly Asn Pro Leu Arg His Asn Tyr Glu Gly Thr Leu Arg Asp Leu 850
855 860Leu Gln Phe Phe Lys Pro Arg Gln Pro Lys
Lys Leu Tyr Tyr Gln Gln865 870 875
880Leu Lys Met Lys Ile Thr Asp Phe Glu Asn Arg Arg Ser Phe Lys
Cys 885 890 895Ile Trp Leu
Asn Ser Gln Phe Arg Glu Glu Glu Ile Thr Leu Tyr Pro 900
905 910Asp Lys His Gly Cys Val Arg Asp Leu Leu
Glu Glu Cys Lys Lys Ala 915 920
925Val Glu Leu Gly Glu Lys Ala Ser Gly Lys Leu Arg Leu Leu Glu Ile 930
935 940Val Ser Tyr Lys Ile Ile Gly Val
His Gln Glu Asp Glu Leu Leu Glu945 950
955 960Cys Leu Ser Pro Ala Thr Ser Arg Thr Phe Arg Ile
Glu Glu Ile Pro 965 970
975Leu Asp Gln Val Asp Ile Asp Lys Glu Asn Glu Met Leu Val Thr Val
980 985 990Ala His Phe His Lys Glu
Val Phe Gly Thr Phe Gly Ile Pro Phe Leu 995 1000
1005Leu Arg Ile His Gln Gly Glu His Phe Arg Glu Val Met Lys
Arg Ile 1010 1015 1020Gln Ser Leu Leu
Asp Ile Gln Glu Lys Glu Phe Glu Lys Phe Lys Phe1025 1030
1035 1040Ala Ile Val Met Thr Gly Arg His Gln
Tyr Ile Asn Glu Asp Glu Tyr 1045 1050
1055Glu Val Asn Leu Lys Asp Phe Glu Pro Gln Pro Gly Asn Met Ser
His 1060 1065 1070Pro Arg Pro
Trp Leu Gly Leu Asp His Phe Asn Lys Ala Pro Lys Arg 1075
1080 1085Ser Arg Tyr Thr Tyr Leu Glu Lys Ala Ile Lys
Ile His Asn 1090 1095 110031642DNAHomo
sapiensCDS(158)..(1279) 3gaatcggcgg tcccgcaggt cccggatgtt gcggacagta
tgaggcaagc gcagggggac 60ggggaccagc agctgtcgcc gccgctctca gggtgaagag
ggaacagaaa tctttgcccc 120ctgactttgg aaatctcgtt taaccttcaa actggcg atg
tca agg gtt cca agt 175 Met
Ser Arg Val Pro Ser 1
5cct cca cct ccg gca gaa atg tcg agt ggc ccc gta gct gag agt tgg
223Pro Pro Pro Pro Ala Glu Met Ser Ser Gly Pro Val Ala Glu Ser Trp
10 15 20tgc tac aca cag atc aag
gta gtg aaa ttc tcc tac atg tgg acc atc 271Cys Tyr Thr Gln Ile Lys
Val Val Lys Phe Ser Tyr Met Trp Thr Ile 25 30
35aat aac ttt agc ttt tgc cgg gag gaa atg ggt gaa gtc att
aaa agt 319Asn Asn Phe Ser Phe Cys Arg Glu Glu Met Gly Glu Val Ile
Lys Ser 40 45 50tct aca ttt tca tca
gga gca aat gat aaa ctg aaa tgg tgt ttg cga 367Ser Thr Phe Ser Ser
Gly Ala Asn Asp Lys Leu Lys Trp Cys Leu Arg 55 60
65 70gta aac ccc aaa ggg tta gat gaa gaa agc
aaa gat tac ctg tca ctt 415Val Asn Pro Lys Gly Leu Asp Glu Glu Ser
Lys Asp Tyr Leu Ser Leu 75 80
85tac ctg tta ctg gtc agc tgt cca aag agt gaa gtt cgg gca aaa ttc
463Tyr Leu Leu Leu Val Ser Cys Pro Lys Ser Glu Val Arg Ala Lys Phe
90 95 100aaa ttc tcc atc ctg aat
gcc aag gga gaa gaa acc aaa gct atg gag 511Lys Phe Ser Ile Leu Asn
Ala Lys Gly Glu Glu Thr Lys Ala Met Glu 105 110
115agt caa cgg gca tat agg ttt gtg caa ggc aaa gac tgg gga
ttc aag 559Ser Gln Arg Ala Tyr Arg Phe Val Gln Gly Lys Asp Trp Gly
Phe Lys 120 125 130aaa ttc atc cgt aga
gat ttt ctt ttg gat gag gcc aac ggg ctt ctc 607Lys Phe Ile Arg Arg
Asp Phe Leu Leu Asp Glu Ala Asn Gly Leu Leu135 140
145 150cct gat gac aag ctt acc ctc ttc tgc gag
gtg agt gtt gtg caa gat 655Pro Asp Asp Lys Leu Thr Leu Phe Cys Glu
Val Ser Val Val Gln Asp 155 160
165tct gtc aac att tct ggc cag aat acc atg aac atg gta aag gtt cct
703Ser Val Asn Ile Ser Gly Gln Asn Thr Met Asn Met Val Lys Val Pro
170 175 180gag tgc cgg ctg gca gat
gag tta gga gga ctg tgg gag aat tcc cgg 751Glu Cys Arg Leu Ala Asp
Glu Leu Gly Gly Leu Trp Glu Asn Ser Arg 185 190
195ttc aca gac tgc tgc ttg tgt gtt gcc ggc cag gaa ttc cag
gct cac 799Phe Thr Asp Cys Cys Leu Cys Val Ala Gly Gln Glu Phe Gln
Ala His 200 205 210aag gct atc tta gca
gct cgt tct ccg gtt ttt agt gcc atg ttt gaa 847Lys Ala Ile Leu Ala
Ala Arg Ser Pro Val Phe Ser Ala Met Phe Glu215 220
225 230cat gaa atg gag gag agc aaa aag aat cga
gtt gaa atc aat gat gtg 895His Glu Met Glu Glu Ser Lys Lys Asn Arg
Val Glu Ile Asn Asp Val 235 240
245gag cct gaa gtt ttt aag gaa atg atg tgc ttc att tac acg ggg aag
943Glu Pro Glu Val Phe Lys Glu Met Met Cys Phe Ile Tyr Thr Gly Lys
250 255 260gct cca aac ctc gac aaa
atg gct gat gat ttg ctg gca gct gct gac 991Ala Pro Asn Leu Asp Lys
Met Ala Asp Asp Leu Leu Ala Ala Ala Asp 265 270
275aag tat gcc ctg gag cgc tta aag gtc atg tgt gag gat gcc
ctc tgc 1039Lys Tyr Ala Leu Glu Arg Leu Lys Val Met Cys Glu Asp Ala
Leu Cys 280 285 290agt aac ctg tcc gtg
gag aac gct gca gaa att ctc atc ctg gcc gac 1087Ser Asn Leu Ser Val
Glu Asn Ala Ala Glu Ile Leu Ile Leu Ala Asp295 300
305 310ctc cac agt gca gat cag ttg aaa act cag
gca gtg gat ttc atc aac 1135Leu His Ser Ala Asp Gln Leu Lys Thr Gln
Ala Val Asp Phe Ile Asn 315 320
325tat cat gct tcg gat gtc ttg gag acc tct ggg tgg aag tca atg gtg
1183Tyr His Ala Ser Asp Val Leu Glu Thr Ser Gly Trp Lys Ser Met Val
330 335 340gtg tca cat ccc cac ttg
gtg gct gag gca tac cgc tct ctg gct tca 1231Val Ser His Pro His Leu
Val Ala Glu Ala Tyr Arg Ser Leu Ala Ser 345 350
355gca cag tgc cct ttt ctg gga ccc cca cgc aaa cgc ctg aag
caa tcc 1279Ala Gln Cys Pro Phe Leu Gly Pro Pro Arg Lys Arg Leu Lys
Gln Ser 360 365 370taagatcctg
cttgttgtaa gactccgttt aatttccaga agcagcagcc actgttgctg 1339ccactgacca
ccaggtagac agcgcaatct gtggagcttt tactctgttg tgaggggaag 1399agactgcatt
gtggccccag acttttaaaa cagcactaaa taacttgggg gaaacggggg 1459gagggaaaat
gaaatgaaaa ccctgttgct gcgtcactgt gttccctttg gcctgtctga 1519gtttgatact
gtggggattc agtttaggcg ctggcccgag gatatcccag cggtggtact 1579tcggagacac
ctgtctgcat ctgactgagc agaacaaatc gtcaggtgcc tggagcaaaa 1639agg
16424374PRTHomo
sapiens 4Met Ser Arg Val Pro Ser Pro Pro Pro Pro Ala Glu Met Ser Ser Gly1
5 10 15Pro Val Ala Glu
Ser Trp Cys Tyr Thr Gln Ile Lys Val Val Lys Phe 20
25 30Ser Tyr Met Trp Thr Ile Asn Asn Phe Ser Phe
Cys Arg Glu Glu Met 35 40 45Gly
Glu Val Ile Lys Ser Ser Thr Phe Ser Ser Gly Ala Asn Asp Lys 50
55 60Leu Lys Trp Cys Leu Arg Val Asn Pro Lys
Gly Leu Asp Glu Glu Ser65 70 75
80Lys Asp Tyr Leu Ser Leu Tyr Leu Leu Leu Val Ser Cys Pro Lys
Ser 85 90 95Glu Val Arg
Ala Lys Phe Lys Phe Ser Ile Leu Asn Ala Lys Gly Glu 100
105 110Glu Thr Lys Ala Met Glu Ser Gln Arg Ala
Tyr Arg Phe Val Gln Gly 115 120
125Lys Asp Trp Gly Phe Lys Lys Phe Ile Arg Arg Asp Phe Leu Leu Asp 130
135 140Glu Ala Asn Gly Leu Leu Pro Asp
Asp Lys Leu Thr Leu Phe Cys Glu145 150
155 160Val Ser Val Val Gln Asp Ser Val Asn Ile Ser Gly
Gln Asn Thr Met 165 170
175Asn Met Val Lys Val Pro Glu Cys Arg Leu Ala Asp Glu Leu Gly Gly
180 185 190Leu Trp Glu Asn Ser Arg
Phe Thr Asp Cys Cys Leu Cys Val Ala Gly 195 200
205Gln Glu Phe Gln Ala His Lys Ala Ile Leu Ala Ala Arg Ser
Pro Val 210 215 220Phe Ser Ala Met Phe
Glu His Glu Met Glu Glu Ser Lys Lys Asn Arg225 230
235 240Val Glu Ile Asn Asp Val Glu Pro Glu Val
Phe Lys Glu Met Met Cys 245 250
255Phe Ile Tyr Thr Gly Lys Ala Pro Asn Leu Asp Lys Met Ala Asp Asp
260 265 270Leu Leu Ala Ala Ala
Asp Lys Tyr Ala Leu Glu Arg Leu Lys Val Met 275
280 285Cys Glu Asp Ala Leu Cys Ser Asn Leu Ser Val Glu
Asn Ala Ala Glu 290 295 300Ile Leu Ile
Leu Ala Asp Leu His Ser Ala Asp Gln Leu Lys Thr Gln305
310 315 320Ala Val Asp Phe Ile Asn Tyr
His Ala Ser Asp Val Leu Glu Thr Ser 325
330 335Gly Trp Lys Ser Met Val Val Ser His Pro His Leu
Val Ala Glu Ala 340 345 350Tyr
Arg Ser Leu Ala Ser Ala Gln Cys Pro Phe Leu Gly Pro Pro Arg 355
360 365Lys Arg Leu Lys Gln Ser
37054111DNAHomo sapiensCDS(1)..(2937) 5gcg gag cct gcg ggt cgc cag cgg
ccc cgc cga gag ccg gag gca atg 48Ala Glu Pro Ala Gly Arg Gln Arg
Pro Arg Arg Glu Pro Glu Ala Met1 5 10
15gat gaa cag agc gtg gag agc att gct gag gtt ttc cga tgt
ttc att 96Asp Glu Gln Ser Val Glu Ser Ile Ala Glu Val Phe Arg Cys
Phe Ile 20 25 30tgt atg gag
aaa ttg cgg gat gca cgc ctg tgt cct cat tgc tcc aaa 144Cys Met Glu
Lys Leu Arg Asp Ala Arg Leu Cys Pro His Cys Ser Lys 35
40 45ctg tgt tgt ttc agc tgt att agg cgc tgg ctg
aca gag cag aga gct 192Leu Cys Cys Phe Ser Cys Ile Arg Arg Trp Leu
Thr Glu Gln Arg Ala 50 55 60caa tgt
cct cat tgc cgt gct cca ctc cag cta cga gaa cta gta aat 240Gln Cys
Pro His Cys Arg Ala Pro Leu Gln Leu Arg Glu Leu Val Asn65
70 75 80tgt cgt tgg gca gaa gaa gta
aca caa cag ctt gat act ctt caa ctc 288Cys Arg Trp Ala Glu Glu Val
Thr Gln Gln Leu Asp Thr Leu Gln Leu 85 90
95tgc agt ctc acc aaa cat gaa gaa aat gaa aag gac aaa
tgt gaa aat 336Cys Ser Leu Thr Lys His Glu Glu Asn Glu Lys Asp Lys
Cys Glu Asn 100 105 110cac cat
gaa aaa ctt agt gta ttt tgc tgg act tgt aag aag tgt atc 384His His
Glu Lys Leu Ser Val Phe Cys Trp Thr Cys Lys Lys Cys Ile 115
120 125tgc cat cag tgt gca ctt tgg gga gga atg
cat ggc gga cat acc ttt 432Cys His Gln Cys Ala Leu Trp Gly Gly Met
His Gly Gly His Thr Phe 130 135 140aaa
cct ttg gca gaa att tat gag caa cac gtc act aaa gtg aat gaa 480Lys
Pro Leu Ala Glu Ile Tyr Glu Gln His Val Thr Lys Val Asn Glu145
150 155 160gag gta gcc aaa ctt cgt
cgg cgt ctc atg gaa ctg atc agc tta gtt 528Glu Val Ala Lys Leu Arg
Arg Arg Leu Met Glu Leu Ile Ser Leu Val 165
170 175caa gaa gtg gaa agg aat gta gaa gct gta aga aat
gca aaa gat gag 576Gln Glu Val Glu Arg Asn Val Glu Ala Val Arg Asn
Ala Lys Asp Glu 180 185 190cgt
gtt cgg gaa att agg aat gca gtg gag atg atg att gca cgg tta 624Arg
Val Arg Glu Ile Arg Asn Ala Val Glu Met Met Ile Ala Arg Leu 195
200 205gac aca cag ctg aag aat aag ctt ata
aca ctg atg ggt cag aag aca 672Asp Thr Gln Leu Lys Asn Lys Leu Ile
Thr Leu Met Gly Gln Lys Thr 210 215
220tct cta acc caa gaa aca gag ctt ttg gaa tcc tta ctt cag gag gtg
720Ser Leu Thr Gln Glu Thr Glu Leu Leu Glu Ser Leu Leu Gln Glu Val225
230 235 240gag cac cag ttg
cgg tct tgt agt aag agt gag ttg ata tct aag agc 768Glu His Gln Leu
Arg Ser Cys Ser Lys Ser Glu Leu Ile Ser Lys Ser 245
250 255tca gag atc ctt atg atg ttt cag caa gtt
cat cgg aag ccc atg gca 816Ser Glu Ile Leu Met Met Phe Gln Gln Val
His Arg Lys Pro Met Ala 260 265
270tct ttt gtt acc act cct gtt cca cca gac ttt acc agt gaa tta gtg
864Ser Phe Val Thr Thr Pro Val Pro Pro Asp Phe Thr Ser Glu Leu Val
275 280 285cca tct tac gat tca gct act
ttt gtt tta gag aat ttc agc act ttg 912Pro Ser Tyr Asp Ser Ala Thr
Phe Val Leu Glu Asn Phe Ser Thr Leu 290 295
300cgt cag aga gca gat cct gtt tac agt cca cct ctt caa gtt tca gga
960Arg Gln Arg Ala Asp Pro Val Tyr Ser Pro Pro Leu Gln Val Ser Gly305
310 315 320ctt tgc tgg agg
tta aaa gtt tac cca gat gga aat gga gtt gtg cga 1008Leu Cys Trp Arg
Leu Lys Val Tyr Pro Asp Gly Asn Gly Val Val Arg 325
330 335ggt tac tac tta tct gtg ttt ctg gag ctc
tca gct ggc ttg cct gaa 1056Gly Tyr Tyr Leu Ser Val Phe Leu Glu Leu
Ser Ala Gly Leu Pro Glu 340 345
350act tct aaa tat gaa tat cgt gta gag atg gtt cac cag tcc tgt aat
1104Thr Ser Lys Tyr Glu Tyr Arg Val Glu Met Val His Gln Ser Cys Asn
355 360 365gat cct aca aaa aat atc att
cga gaa ttt gca tct gac ttt gaa gtt 1152Asp Pro Thr Lys Asn Ile Ile
Arg Glu Phe Ala Ser Asp Phe Glu Val 370 375
380gga gaa tgc tgg ggc tat aat aga ttt ttc cgt ttg gac tta ctc gca
1200Gly Glu Cys Trp Gly Tyr Asn Arg Phe Phe Arg Leu Asp Leu Leu Ala385
390 395 400aat gaa gga tac
ttg aat cca caa aat gat aca gtg att tta agg ttt 1248Asn Glu Gly Tyr
Leu Asn Pro Gln Asn Asp Thr Val Ile Leu Arg Phe 405
410 415cag gta cgt tca cca act ttc ttt caa aaa
tcc cgg gac cag cat tgg 1296Gln Val Arg Ser Pro Thr Phe Phe Gln Lys
Ser Arg Asp Gln His Trp 420 425
430tac att act cag ttg gaa gct gca cag act agt tat atc caa caa ata
1344Tyr Ile Thr Gln Leu Glu Ala Ala Gln Thr Ser Tyr Ile Gln Gln Ile
435 440 445aac aac ctt aaa gag aga ctt
act att gag ctg tct cga act cag aag 1392Asn Asn Leu Lys Glu Arg Leu
Thr Ile Glu Leu Ser Arg Thr Gln Lys 450 455
460tca aga gat ttg tca cca cca gat aac cat ctt agc ccc caa aat gat
1440Ser Arg Asp Leu Ser Pro Pro Asp Asn His Leu Ser Pro Gln Asn Asp465
470 475 480gat gct ctg gag
aca cga gct aag aag tct gca tgc tct gac atg ctt 1488Asp Ala Leu Glu
Thr Arg Ala Lys Lys Ser Ala Cys Ser Asp Met Leu 485
490 495ctc gaa ggt ggt cct act aca gct tct gta
aga gag gcc aaa gag gat 1536Leu Glu Gly Gly Pro Thr Thr Ala Ser Val
Arg Glu Ala Lys Glu Asp 500 505
510gaa gaa gat gag gag aag att cag aat gaa gat tat cat cac gag ctt
1584Glu Glu Asp Glu Glu Lys Ile Gln Asn Glu Asp Tyr His His Glu Leu
515 520 525tca gat gga gat ctg gat ctg
gat ctt gtt tat gag gat gaa gta aat 1632Ser Asp Gly Asp Leu Asp Leu
Asp Leu Val Tyr Glu Asp Glu Val Asn 530 535
540cag ctc gat ggc agc agt tcc tct gct agt tcc aca gca aca agt aat
1680Gln Leu Asp Gly Ser Ser Ser Ser Ala Ser Ser Thr Ala Thr Ser Asn545
550 555 560aca gaa gaa aat
gat att gat gaa gaa act atg tct gga gaa aat gat 1728Thr Glu Glu Asn
Asp Ile Asp Glu Glu Thr Met Ser Gly Glu Asn Asp 565
570 575gtg gaa tat aac aac atg gaa tta gaa gag
gga gaa ctc atg gaa gat 1776Val Glu Tyr Asn Asn Met Glu Leu Glu Glu
Gly Glu Leu Met Glu Asp 580 585
590gca gct gct gca gga ccc gca ggt agt agc cat ggt tat gtg ggt tcc
1824Ala Ala Ala Ala Gly Pro Ala Gly Ser Ser His Gly Tyr Val Gly Ser
595 600 605agt agt aga ata tca aga aga
aca cat tta tgc tcc gct gct acc agt 1872Ser Ser Arg Ile Ser Arg Arg
Thr His Leu Cys Ser Ala Ala Thr Ser 610 615
620agt tta cta gac att gat cca tta att tta ata cat ttg ttg gac ctt
1920Ser Leu Leu Asp Ile Asp Pro Leu Ile Leu Ile His Leu Leu Asp Leu625
630 635 640aag gac cgg agc
agt ata gaa aat ttg tgg ggc tta cag cct cgc cca 1968Lys Asp Arg Ser
Ser Ile Glu Asn Leu Trp Gly Leu Gln Pro Arg Pro 645
650 655cct gct tca ctt ctg cag ccc aca gca tca
tat tct cga aaa gat aaa 2016Pro Ala Ser Leu Leu Gln Pro Thr Ala Ser
Tyr Ser Arg Lys Asp Lys 660 665
670gac caa agg aag caa cag gca atg tgg cga gtg ccc tct gat tta aag
2064Asp Gln Arg Lys Gln Gln Ala Met Trp Arg Val Pro Ser Asp Leu Lys
675 680 685atg cta aaa aga ctc aaa act
caa atg gcc gaa gtt cga tgt atg aaa 2112Met Leu Lys Arg Leu Lys Thr
Gln Met Ala Glu Val Arg Cys Met Lys 690 695
700act gat gta aag aat aca ctt tca gaa ata aaa agc agc agt gct gct
2160Thr Asp Val Lys Asn Thr Leu Ser Glu Ile Lys Ser Ser Ser Ala Ala705
710 715 720tct gga gac atg
cag aca agc ctt ttt tct gct gac cag gca gct ctg 2208Ser Gly Asp Met
Gln Thr Ser Leu Phe Ser Ala Asp Gln Ala Ala Leu 725
730 735gct gca tgt gga act gaa aac tct ggc aga
ttg cag gat ttg gga atg 2256Ala Ala Cys Gly Thr Glu Asn Ser Gly Arg
Leu Gln Asp Leu Gly Met 740 745
750gaa ctc ctg gca aag tca tca gtt gcc aat tgt tac ata cga aac tcc
2304Glu Leu Leu Ala Lys Ser Ser Val Ala Asn Cys Tyr Ile Arg Asn Ser
755 760 765aca aat aag aag agt aat tcg
ccc aag cca gct cga tcc agt gta gca 2352Thr Asn Lys Lys Ser Asn Ser
Pro Lys Pro Ala Arg Ser Ser Val Ala 770 775
780ggt agt cta tca ctt cga aga gca gtg gac cct gga gaa aat agt cgt
2400Gly Ser Leu Ser Leu Arg Arg Ala Val Asp Pro Gly Glu Asn Ser Arg785
790 795 800tca aag gga gac
tgt cag act ctg tct gaa ggc tcc cca gga agc tct 2448Ser Lys Gly Asp
Cys Gln Thr Leu Ser Glu Gly Ser Pro Gly Ser Ser 805
810 815cag tct ggg agc agg cac agt tct ccc cga
gcc ttg ata cat ggc agt 2496Gln Ser Gly Ser Arg His Ser Ser Pro Arg
Ala Leu Ile His Gly Ser 820 825
830atc ggt gat att ctg cca aaa act gaa gac cgg cag tgt aaa gct ttg
2544Ile Gly Asp Ile Leu Pro Lys Thr Glu Asp Arg Gln Cys Lys Ala Leu
835 840 845gat tca gat gct gtt gtg gtt
gca gtt ttc agt ggc ttg cct gcg gtt 2592Asp Ser Asp Ala Val Val Val
Ala Val Phe Ser Gly Leu Pro Ala Val 850 855
860gag aaa agg agg aaa atg gtc acc ttg ggg gct aat gct aaa gga ggt
2640Glu Lys Arg Arg Lys Met Val Thr Leu Gly Ala Asn Ala Lys Gly Gly865
870 875 880cat ctg gaa gga
ctg cag atg act gat ttg gaa aat aat tct gaa act 2688His Leu Glu Gly
Leu Gln Met Thr Asp Leu Glu Asn Asn Ser Glu Thr 885
890 895gga gag tta cag cct gta cta cct gaa gga
gct tca gct gcc cct gaa 2736Gly Glu Leu Gln Pro Val Leu Pro Glu Gly
Ala Ser Ala Ala Pro Glu 900 905
910gaa gga atg agt agc gac agt gac att gaa tgt gac act gag aat gag
2784Glu Gly Met Ser Ser Asp Ser Asp Ile Glu Cys Asp Thr Glu Asn Glu
915 920 925gag cag gaa gag cat acc agt
gtg ggc ggg ttt cac gac tcc ttc atg 2832Glu Gln Glu Glu His Thr Ser
Val Gly Gly Phe His Asp Ser Phe Met 930 935
940gtc atg aca cag ccc ccg gat gaa gat aca cat tcc agt ttt cct gat
2880Val Met Thr Gln Pro Pro Asp Glu Asp Thr His Ser Ser Phe Pro Asp945
950 955 960ggt gaa caa ata
ggc cct gaa gat ctc agc ttc aat aca gat gaa aat 2928Gly Glu Gln Ile
Gly Pro Glu Asp Leu Ser Phe Asn Thr Asp Glu Asn 965
970 975agt gga agg taattgccaa atcaagagaa
ctgacttgca agctaccttg 2977Ser Gly Argaccctgaatt ttgctgtagt
tggtgctcaa atttgtcatc agtcagataa tcagatttgg 3037tcttatttct tcattatctc
gacctgaaat agtaatttgg aaactgttgg aaggtggcac 3097agtttagtct aagacagcag
tagtacatgg gaaaaacagt atgggaagag ttctttgtaa 3157tgtaaggaaa taacaatgta
gttctctatt aatttagcaa atttgtacat tcacaaaagg 3217cagtttgtct actacagcag
aaggctggtt aactgccaga aaatgtacct ccaggccctg 3277catgccgtca gtaacccgcc
cggcattggt gctctactgt ctttggctag agcttagttg 3337tgtttaaata atcatcttta
tatttggggt tttaattaca gttccattag tgcctgtaga 3397ttagtgaaca gaaaattgct
ttggaagaga ttctgccctg tagacactat gtgaataact 3457gaagtaacac tagactgaat
ctcctttttg gagtatgtat cttctctcac ttgttcaagt 3517acaggcacac tgttcaaccg
catggtatct ttctgttgtg tgacttctac aaatgtaatt 3577ttaaatgaaa ttaagttaac
atggattcat tacgttcctg gccctgtaga cacgtgtaag 3637attatttaaa attctttcat
ttttttctgc ctcttactat acgactgtag tgcaacaaat 3697attttaaagc ccccttttct
tctttatttt cattagttgt acattgattt cagtgtcaac 3757acatttaaag attcattcat
gttgcacagt ggcttacatg aacgtgaaac tgtgatataa 3817ggttttcttt catactcata
attagcccaa aacagttgcc aaactttgcc attgtgctcc 3877tgcatttgtg tttgagctgc
tatatatttg tggaaattac actgaaagtt gactaagaga 3937ctattgaaaa agcatgaata
attaaatata catgtgagag acatctcatc tgctgtattt 3997tacttagtga atattgttca
ctcttccgtg tctgatgtct tgctgaatgc tgtgactcat 4057agtttacttt tgttcaaaat
agtttgcact ttttgttaat aaaatcaact tgag 41116979PRTHomo sapiens
6Ala Glu Pro Ala Gly Arg Gln Arg Pro Arg Arg Glu Pro Glu Ala Met1
5 10 15Asp Glu Gln Ser Val Glu
Ser Ile Ala Glu Val Phe Arg Cys Phe Ile 20 25
30Cys Met Glu Lys Leu Arg Asp Ala Arg Leu Cys Pro His
Cys Ser Lys 35 40 45Leu Cys Cys
Phe Ser Cys Ile Arg Arg Trp Leu Thr Glu Gln Arg Ala 50
55 60Gln Cys Pro His Cys Arg Ala Pro Leu Gln Leu Arg
Glu Leu Val Asn65 70 75
80Cys Arg Trp Ala Glu Glu Val Thr Gln Gln Leu Asp Thr Leu Gln Leu
85 90 95Cys Ser Leu Thr Lys His
Glu Glu Asn Glu Lys Asp Lys Cys Glu Asn 100
105 110His His Glu Lys Leu Ser Val Phe Cys Trp Thr Cys
Lys Lys Cys Ile 115 120 125Cys His
Gln Cys Ala Leu Trp Gly Gly Met His Gly Gly His Thr Phe 130
135 140Lys Pro Leu Ala Glu Ile Tyr Glu Gln His Val
Thr Lys Val Asn Glu145 150 155
160Glu Val Ala Lys Leu Arg Arg Arg Leu Met Glu Leu Ile Ser Leu Val
165 170 175Gln Glu Val Glu
Arg Asn Val Glu Ala Val Arg Asn Ala Lys Asp Glu 180
185 190Arg Val Arg Glu Ile Arg Asn Ala Val Glu Met
Met Ile Ala Arg Leu 195 200 205Asp
Thr Gln Leu Lys Asn Lys Leu Ile Thr Leu Met Gly Gln Lys Thr 210
215 220Ser Leu Thr Gln Glu Thr Glu Leu Leu Glu
Ser Leu Leu Gln Glu Val225 230 235
240Glu His Gln Leu Arg Ser Cys Ser Lys Ser Glu Leu Ile Ser Lys
Ser 245 250 255Ser Glu Ile
Leu Met Met Phe Gln Gln Val His Arg Lys Pro Met Ala 260
265 270Ser Phe Val Thr Thr Pro Val Pro Pro Asp
Phe Thr Ser Glu Leu Val 275 280
285Pro Ser Tyr Asp Ser Ala Thr Phe Val Leu Glu Asn Phe Ser Thr Leu 290
295 300Arg Gln Arg Ala Asp Pro Val Tyr
Ser Pro Pro Leu Gln Val Ser Gly305 310
315 320Leu Cys Trp Arg Leu Lys Val Tyr Pro Asp Gly Asn
Gly Val Val Arg 325 330
335Gly Tyr Tyr Leu Ser Val Phe Leu Glu Leu Ser Ala Gly Leu Pro Glu
340 345 350Thr Ser Lys Tyr Glu Tyr
Arg Val Glu Met Val His Gln Ser Cys Asn 355 360
365Asp Pro Thr Lys Asn Ile Ile Arg Glu Phe Ala Ser Asp Phe
Glu Val 370 375 380Gly Glu Cys Trp Gly
Tyr Asn Arg Phe Phe Arg Leu Asp Leu Leu Ala385 390
395 400Asn Glu Gly Tyr Leu Asn Pro Gln Asn Asp
Thr Val Ile Leu Arg Phe 405 410
415Gln Val Arg Ser Pro Thr Phe Phe Gln Lys Ser Arg Asp Gln His Trp
420 425 430Tyr Ile Thr Gln Leu
Glu Ala Ala Gln Thr Ser Tyr Ile Gln Gln Ile 435
440 445Asn Asn Leu Lys Glu Arg Leu Thr Ile Glu Leu Ser
Arg Thr Gln Lys 450 455 460Ser Arg Asp
Leu Ser Pro Pro Asp Asn His Leu Ser Pro Gln Asn Asp465
470 475 480Asp Ala Leu Glu Thr Arg Ala
Lys Lys Ser Ala Cys Ser Asp Met Leu 485
490 495Leu Glu Gly Gly Pro Thr Thr Ala Ser Val Arg Glu
Ala Lys Glu Asp 500 505 510Glu
Glu Asp Glu Glu Lys Ile Gln Asn Glu Asp Tyr His His Glu Leu 515
520 525Ser Asp Gly Asp Leu Asp Leu Asp Leu
Val Tyr Glu Asp Glu Val Asn 530 535
540Gln Leu Asp Gly Ser Ser Ser Ser Ala Ser Ser Thr Ala Thr Ser Asn545
550 555 560Thr Glu Glu Asn
Asp Ile Asp Glu Glu Thr Met Ser Gly Glu Asn Asp 565
570 575Val Glu Tyr Asn Asn Met Glu Leu Glu Glu
Gly Glu Leu Met Glu Asp 580 585
590Ala Ala Ala Ala Gly Pro Ala Gly Ser Ser His Gly Tyr Val Gly Ser
595 600 605Ser Ser Arg Ile Ser Arg Arg
Thr His Leu Cys Ser Ala Ala Thr Ser 610 615
620Ser Leu Leu Asp Ile Asp Pro Leu Ile Leu Ile His Leu Leu Asp
Leu625 630 635 640Lys Asp
Arg Ser Ser Ile Glu Asn Leu Trp Gly Leu Gln Pro Arg Pro
645 650 655Pro Ala Ser Leu Leu Gln Pro
Thr Ala Ser Tyr Ser Arg Lys Asp Lys 660 665
670Asp Gln Arg Lys Gln Gln Ala Met Trp Arg Val Pro Ser Asp
Leu Lys 675 680 685Met Leu Lys Arg
Leu Lys Thr Gln Met Ala Glu Val Arg Cys Met Lys 690
695 700Thr Asp Val Lys Asn Thr Leu Ser Glu Ile Lys Ser
Ser Ser Ala Ala705 710 715
720Ser Gly Asp Met Gln Thr Ser Leu Phe Ser Ala Asp Gln Ala Ala Leu
725 730 735Ala Ala Cys Gly Thr
Glu Asn Ser Gly Arg Leu Gln Asp Leu Gly Met 740
745 750Glu Leu Leu Ala Lys Ser Ser Val Ala Asn Cys Tyr
Ile Arg Asn Ser 755 760 765Thr Asn
Lys Lys Ser Asn Ser Pro Lys Pro Ala Arg Ser Ser Val Ala 770
775 780Gly Ser Leu Ser Leu Arg Arg Ala Val Asp Pro
Gly Glu Asn Ser Arg785 790 795
800Ser Lys Gly Asp Cys Gln Thr Leu Ser Glu Gly Ser Pro Gly Ser Ser
805 810 815Gln Ser Gly Ser
Arg His Ser Ser Pro Arg Ala Leu Ile His Gly Ser 820
825 830Ile Gly Asp Ile Leu Pro Lys Thr Glu Asp Arg
Gln Cys Lys Ala Leu 835 840 845Asp
Ser Asp Ala Val Val Val Ala Val Phe Ser Gly Leu Pro Ala Val 850
855 860Glu Lys Arg Arg Lys Met Val Thr Leu Gly
Ala Asn Ala Lys Gly Gly865 870 875
880His Leu Glu Gly Leu Gln Met Thr Asp Leu Glu Asn Asn Ser Glu
Thr 885 890 895Gly Glu Leu
Gln Pro Val Leu Pro Glu Gly Ala Ser Ala Ala Pro Glu 900
905 910Glu Gly Met Ser Ser Asp Ser Asp Ile Glu
Cys Asp Thr Glu Asn Glu 915 920
925Glu Gln Glu Glu His Thr Ser Val Gly Gly Phe His Asp Ser Phe Met 930
935 940Val Met Thr Gln Pro Pro Asp Glu
Asp Thr His Ser Ser Phe Pro Asp945 950
955 960Gly Glu Gln Ile Gly Pro Glu Asp Leu Ser Phe Asn
Thr Asp Glu Asn 965 970
975Ser Gly Arg7639DNAHomo sapiensCDS(1)..(639) 7atg aac cac cag cag cag
cag cag cag cag aaa gcg ggc gag cag cag 48Met Asn His Gln Gln Gln
Gln Gln Gln Gln Lys Ala Gly Glu Gln Gln1 5
10 15ttg agc gag ccc gag gac atg gag atg gaa gcg gga
gat aca gat gac 96Leu Ser Glu Pro Glu Asp Met Glu Met Glu Ala Gly
Asp Thr Asp Asp 20 25 30cca
cca aga att act cag aac cct gtg atc aat ggg aat gtg gcc ctg 144Pro
Pro Arg Ile Thr Gln Asn Pro Val Ile Asn Gly Asn Val Ala Leu 35
40 45agt gat gga cac aac acc gcg gag gag
gac atg gag gat gac acc agt 192Ser Asp Gly His Asn Thr Ala Glu Glu
Asp Met Glu Asp Asp Thr Ser 50 55
60tgg cgc tcc gag gca acc ttt cag ttc act gtg gag cgc ttc agc aga
240Trp Arg Ser Glu Ala Thr Phe Gln Phe Thr Val Glu Arg Phe Ser Arg65
70 75 80ctg agt gag tcg gtc
ctt agc cct ccg tgt ttt gtg cga aat ctg cca 288Leu Ser Glu Ser Val
Leu Ser Pro Pro Cys Phe Val Arg Asn Leu Pro 85
90 95tgg aag att atg gtg atg cca cgc ttt tat cca
gac aga cca cac caa 336Trp Lys Ile Met Val Met Pro Arg Phe Tyr Pro
Asp Arg Pro His Gln 100 105
110aaa agc gta gga ttc ttt ctc cag tgc aat gct gaa tct gat tcc acg
384Lys Ser Val Gly Phe Phe Leu Gln Cys Asn Ala Glu Ser Asp Ser Thr
115 120 125tca tgg tct tgc cat gca caa
gca gtg ctg aag ata ata aat tac aga 432Ser Trp Ser Cys His Ala Gln
Ala Val Leu Lys Ile Ile Asn Tyr Arg 130 135
140gat gat gaa aag tcg ttc agt cgt cgt att agt cat ttg ttc ttc cat
480Asp Asp Glu Lys Ser Phe Ser Arg Arg Ile Ser His Leu Phe Phe His145
150 155 160aaa gaa aat gat
tgg gga ttt tcc aat ttt atg gcc tgg agt gaa gtg 528Lys Glu Asn Asp
Trp Gly Phe Ser Asn Phe Met Ala Trp Ser Glu Val 165
170 175acc gat cct gag aaa gga ttt ata gat gat
gac aaa gtt acc ttt gaa 576Thr Asp Pro Glu Lys Gly Phe Ile Asp Asp
Asp Lys Val Thr Phe Glu 180 185
190gtc ttt gta cag gcg gat gct ccc cat gga gtt gcg tgg gat tca aag
624Val Phe Val Gln Ala Asp Ala Pro His Gly Val Ala Trp Asp Ser Lys
195 200 205aag cac aca ggc tac
639Lys His Thr Gly Tyr
2108213PRTHomo sapiens 8Met Asn His Gln Gln Gln Gln Gln Gln Gln Lys Ala
Gly Glu Gln Gln1 5 10
15Leu Ser Glu Pro Glu Asp Met Glu Met Glu Ala Gly Asp Thr Asp Asp
20 25 30Pro Pro Arg Ile Thr Gln Asn
Pro Val Ile Asn Gly Asn Val Ala Leu 35 40
45Ser Asp Gly His Asn Thr Ala Glu Glu Asp Met Glu Asp Asp Thr
Ser 50 55 60Trp Arg Ser Glu Ala Thr
Phe Gln Phe Thr Val Glu Arg Phe Ser Arg65 70
75 80Leu Ser Glu Ser Val Leu Ser Pro Pro Cys Phe
Val Arg Asn Leu Pro 85 90
95Trp Lys Ile Met Val Met Pro Arg Phe Tyr Pro Asp Arg Pro His Gln
100 105 110Lys Ser Val Gly Phe Phe
Leu Gln Cys Asn Ala Glu Ser Asp Ser Thr 115 120
125Ser Trp Ser Cys His Ala Gln Ala Val Leu Lys Ile Ile Asn
Tyr Arg 130 135 140Asp Asp Glu Lys Ser
Phe Ser Arg Arg Ile Ser His Leu Phe Phe His145 150
155 160Lys Glu Asn Asp Trp Gly Phe Ser Asn Phe
Met Ala Trp Ser Glu Val 165 170
175Thr Asp Pro Glu Lys Gly Phe Ile Asp Asp Asp Lys Val Thr Phe Glu
180 185 190Val Phe Val Gln Ala
Asp Ala Pro His Gly Val Ala Trp Asp Ser Lys 195
200 205Lys His Thr Gly Tyr 2109540DNAHomo
sapiensCDS(1)..(540) 9atg tca agg gtt cca agt cct cca cct ccg gca gaa atg
tcg agt ggc 48Met Ser Arg Val Pro Ser Pro Pro Pro Pro Ala Glu Met
Ser Ser Gly1 5 10 15ccc
gta gct gag agt tgg tgc tac aca cag atc aag gta gtg aaa ttc 96Pro
Val Ala Glu Ser Trp Cys Tyr Thr Gln Ile Lys Val Val Lys Phe 20
25 30tcc tac atg tgg acc atc aat aac
ttt agc ttt tgc cgg gag gaa atg 144Ser Tyr Met Trp Thr Ile Asn Asn
Phe Ser Phe Cys Arg Glu Glu Met 35 40
45ggt gaa gtc att aaa agt tct aca ttt tca tca gga gca aat gat aaa
192Gly Glu Val Ile Lys Ser Ser Thr Phe Ser Ser Gly Ala Asn Asp Lys
50 55 60ctg aaa tgg tgt ttg cga gta aac
ccc aaa ggg tta gat gaa gaa agc 240Leu Lys Trp Cys Leu Arg Val Asn
Pro Lys Gly Leu Asp Glu Glu Ser65 70 75
80aaa gat tac ctg tca ctt tac ctg tta ctg gtc agc tgt
cca aag agt 288Lys Asp Tyr Leu Ser Leu Tyr Leu Leu Leu Val Ser Cys
Pro Lys Ser 85 90 95gaa
gtt cgg gca aaa ttc aaa ttc tcc atc ctg aat gcc aag gga gaa 336Glu
Val Arg Ala Lys Phe Lys Phe Ser Ile Leu Asn Ala Lys Gly Glu
100 105 110gaa acc aaa gct atg gag agt
caa cgg gca tat agg ttt gtg caa ggc 384Glu Thr Lys Ala Met Glu Ser
Gln Arg Ala Tyr Arg Phe Val Gln Gly 115 120
125aaa gac tgg gga ttc aag aaa ttc atc cgt aga gat ttt ctt ttg
gat 432Lys Asp Trp Gly Phe Lys Lys Phe Ile Arg Arg Asp Phe Leu Leu
Asp 130 135 140gag gcc aac ggg ctt ctc
cct gat gac aag ctt acc ctc ttc tgc gag 480Glu Ala Asn Gly Leu Leu
Pro Asp Asp Lys Leu Thr Leu Phe Cys Glu145 150
155 160gtg agt gtt gtg caa gat tct gtc aac att tct
ggc cag aat acc atg 528Val Ser Val Val Gln Asp Ser Val Asn Ile Ser
Gly Gln Asn Thr Met 165 170
175aac atg gta aag
540Asn Met Val Lys 18010180PRTHomo sapiens 10Met Ser Arg Val
Pro Ser Pro Pro Pro Pro Ala Glu Met Ser Ser Gly1 5
10 15Pro Val Ala Glu Ser Trp Cys Tyr Thr Gln
Ile Lys Val Val Lys Phe 20 25
30Ser Tyr Met Trp Thr Ile Asn Asn Phe Ser Phe Cys Arg Glu Glu Met
35 40 45Gly Glu Val Ile Lys Ser Ser Thr
Phe Ser Ser Gly Ala Asn Asp Lys 50 55
60Leu Lys Trp Cys Leu Arg Val Asn Pro Lys Gly Leu Asp Glu Glu Ser65
70 75 80Lys Asp Tyr Leu Ser
Leu Tyr Leu Leu Leu Val Ser Cys Pro Lys Ser 85
90 95Glu Val Arg Ala Lys Phe Lys Phe Ser Ile Leu
Asn Ala Lys Gly Glu 100 105
110Glu Thr Lys Ala Met Glu Ser Gln Arg Ala Tyr Arg Phe Val Gln Gly
115 120 125Lys Asp Trp Gly Phe Lys Lys
Phe Ile Arg Arg Asp Phe Leu Leu Asp 130 135
140Glu Ala Asn Gly Leu Leu Pro Asp Asp Lys Leu Thr Leu Phe Cys
Glu145 150 155 160Val Ser
Val Val Gln Asp Ser Val Asn Ile Ser Gly Gln Asn Thr Met
165 170 175Asn Met Val Lys
18011459DNAHomo sapiensCDS(1)..(459) 11ttt acc agt gaa tta gtg cca tct
tac gat tca gct act ttt gtt tta 48Phe Thr Ser Glu Leu Val Pro Ser
Tyr Asp Ser Ala Thr Phe Val Leu1 5 10
15gag aat ttc agc act ttg cgt cag aga gca gat cct gtt tac
agt cca 96Glu Asn Phe Ser Thr Leu Arg Gln Arg Ala Asp Pro Val Tyr
Ser Pro 20 25 30cct ctt caa
gtt tca gga ctt tgc tgg agg tta aaa gtt tac cca gat 144Pro Leu Gln
Val Ser Gly Leu Cys Trp Arg Leu Lys Val Tyr Pro Asp 35
40 45gga aat gga gtt gtg cga ggt tac tac tta tct
gtg ttt ctg gag ctc 192Gly Asn Gly Val Val Arg Gly Tyr Tyr Leu Ser
Val Phe Leu Glu Leu 50 55 60tca gct
ggc ttg cct gaa act tct aaa tat gaa tat cgt gta gag atg 240Ser Ala
Gly Leu Pro Glu Thr Ser Lys Tyr Glu Tyr Arg Val Glu Met65
70 75 80gtt cac cag tcc tgt aat gat
cct aca aaa aat atc att cga gaa ttt 288Val His Gln Ser Cys Asn Asp
Pro Thr Lys Asn Ile Ile Arg Glu Phe 85 90
95gca tct gac ttt gaa gtt gga gaa tgc tgg ggc tat aat
aga ttt ttc 336Ala Ser Asp Phe Glu Val Gly Glu Cys Trp Gly Tyr Asn
Arg Phe Phe 100 105 110cgt ttg
gac tta ctc gca aat gaa gga tac ttg aat cca caa aat gat 384Arg Leu
Asp Leu Leu Ala Asn Glu Gly Tyr Leu Asn Pro Gln Asn Asp 115
120 125aca gtg att tta agg ttt cag gta cgt tca
cca act ttc ttt caa aaa 432Thr Val Ile Leu Arg Phe Gln Val Arg Ser
Pro Thr Phe Phe Gln Lys 130 135 140tcc
cgg gac cag cat tgg tac att act 459Ser
Arg Asp Gln His Trp Tyr Ile Thr145 15012153PRTHomo
sapiens 12Phe Thr Ser Glu Leu Val Pro Ser Tyr Asp Ser Ala Thr Phe Val
Leu1 5 10 15Glu Asn Phe
Ser Thr Leu Arg Gln Arg Ala Asp Pro Val Tyr Ser Pro 20
25 30Pro Leu Gln Val Ser Gly Leu Cys Trp Arg
Leu Lys Val Tyr Pro Asp 35 40
45Gly Asn Gly Val Val Arg Gly Tyr Tyr Leu Ser Val Phe Leu Glu Leu 50
55 60Ser Ala Gly Leu Pro Glu Thr Ser Lys
Tyr Glu Tyr Arg Val Glu Met65 70 75
80Val His Gln Ser Cys Asn Asp Pro Thr Lys Asn Ile Ile Arg
Glu Phe 85 90 95Ala Ser
Asp Phe Glu Val Gly Glu Cys Trp Gly Tyr Asn Arg Phe Phe 100
105 110Arg Leu Asp Leu Leu Ala Asn Glu Gly
Tyr Leu Asn Pro Gln Asn Asp 115 120
125Thr Val Ile Leu Arg Phe Gln Val Arg Ser Pro Thr Phe Phe Gln Lys
130 135 140Ser Arg Asp Gln His Trp Tyr
Ile Thr145 1501317DNAArtificial SequenceDescription of
Artificial Sequence Primer 13gcgaattcca ggccgcg
171427DNAArtificial SequenceDescription of
Artificial Sequence Primer 14ttcctcgagc cgacttagcc tgtgtgc
271527DNAArtificial SequenceDescription of
Artificial Sequence Primer 15cttcgaattc gcgatgtcaa gggttcc
271629DNAArtificial SequenceDescription of
Artificial Sequence Primer 16ccatgctcga ggtattctag ccagaaatg
291726DNAArtificial SequenceDescription of
Artificial Sequence Primer 17ccagaattca ccagtgaatt agtgcc
261829DNAArtificial SequenceDescription of
Artificial Sequence Primer 18ccactcgagt aatgtaccaa tgctagtcc
291994PRTArtificial SequenceDescription of
Artificial Sequence Consensus Sequence 19Glu Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa1 5 10
15Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Trp Xaa Xaa Xaa Val
Xaa Pro Xaa 20 25 30Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu 35
40 45Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa 50 55 60Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Xaa Xaa Xaa65
70 75 80Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp 85
90209PRTArtificial SequenceSITE(2)Xaa =any amino acid 20Leu Xaa Trp Xaa
Xaa Xaa Val Xaa Pro1 521124PRTArtificial
SequenceSITE(2)..(14)Xaa=any amino acid 21Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa1 5 10
15Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Trp Xaa Xaa Xaa Val Xaa
Pro Xaa 20 25 30Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Leu 35
40 45Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa 50 55 60Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Xaa Xaa Xaa65
70 75 80Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Phe Xaa Xaa Xaa Xaa Xaa Trp Gly Xaa 85 90
95Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa 100 105 110Xaa Xaa
Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val 115
12022124PRTArtificial SequenceSITE(2)..(5)Xaa =any amino acid 22Glu Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa1 5
10 15Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa
Trp Xaa Xaa Xaa Val Xaa Pro Xaa 20 25
30Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa
Leu 35 40 45Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 50 55
60Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Xaa
Xaa Xaa65 70 75 80Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Trp Gly Xaa
85 90 95Xaa Xaa Phe Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 100 105
110Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val
115 12023139PRTHomo sapiens 23Asp Met Glu Asp Asp Thr Ser
Trp Arg Ser Glu Ala Thr Phe Gln Phe1 5 10
15Thr Val Glu Arg Phe Ser Arg Leu Ser Glu Ser Val Leu
Ser Pro Pro 20 25 30Cys Phe
Val Arg Asn Leu Pro Trp Lys Ile Met Val Met Pro Arg Phe 35
40 45Tyr Pro Asp Arg Pro His Gln Lys Ser Val
Gly Phe Phe Leu Gln Cys 50 55 60Asn
Ala Glu Ser Asp Ser Thr Ser Trp Ser Cys His Ala Gln Ala Val65
70 75 80Leu Lys Ile Ile Asn Tyr
Arg Asp Asp Glu Lys Ser Phe Ser Arg Arg 85
90 95Ile Ser His Leu Phe Phe His Lys Glu Asn Asp Trp
Gly Phe Ser Asn 100 105 110Phe
Met Ala Trp Ser Glu Val Thr Asp Pro Glu Lys Gly Phe Ile Asp 115
120 125Asp Asp Lys Val Thr Phe Glu Val Phe
Val Gln 130 13524132PRTHomo sapiens 24Ser Tyr Met Trp
Thr Ile Asn Asn Phe Ser Phe Cys Arg Glu Glu Met1 5
10 15Gly Glu Val Ile Lys Ser Ser Thr Phe Ser
Ser Gly Ala Asn Asp Lys 20 25
30Leu Lys Trp Cys Leu Arg Val Asn Pro Lys Gly Leu Asp Glu Glu Ser
35 40 45Lys Asp Tyr Leu Ser Leu Tyr Leu
Leu Leu Val Ser Cys Pro Lys Ser 50 55
60Glu Val Arg Ala Lys Phe Lys Phe Ser Ile Leu Asn Ala Lys Gly Glu65
70 75 80Glu Thr Lys Ala Met
Glu Ser Gln Arg Ala Tyr Arg Phe Val Gln Gly 85
90 95Lys Asp Trp Gly Phe Lys Lys Phe Ile Arg Arg
Asp Phe Leu Leu Asp 100 105
110Glu Ala Asn Gly Leu Leu Pro Asp Asp Lys Leu Thr Leu Phe Cys Glu
115 120 125Val Ser Val Val
13025135PRTHomo sapiens 25Glu Leu Val Pro Ser Tyr Asp Ser Ala Thr Phe Val
Leu Glu Asn Phe1 5 10
15Ser Thr Leu Arg Gln Arg Ala Asp Pro Val Tyr Ser Pro Pro Leu Gln
20 25 30Val Ser Gly Leu Cys Trp Arg
Leu Lys Val Tyr Pro Asp Gly Asn Gly 35 40
45Val Val Arg Gly Tyr Tyr Leu Ser Val Phe Leu Glu Leu Ser Ala
Gly 50 55 60Leu Pro Glu Thr Ser Lys
Tyr Glu Tyr Arg Val Glu Met Val His Gln65 70
75 80Ser Cys Asn Asp Pro Thr Lys Asn Ile Ile Arg
Glu Phe Ala Ser Asp 85 90
95Phe Glu Val Gly Glu Cys Trp Gly Tyr Asn Arg Phe Phe Arg Leu Asp
100 105 110Leu Leu Ala Asn Glu Gly
Tyr Leu Asn Pro Gln Asn Asp Thr Val Ile 115 120
125Leu Arg Phe Gln Val Arg Ser 130
13526150PRTHomo sapiens 26Gly Thr Phe Leu Trp Lys Ile Thr Asn Val Thr Arg
Arg Cys His Glu1 5 10
15Ser Ala Cys Gly Arg Thr Val Ser Leu Phe Ser Pro Ala Phe Tyr Thr
20 25 30Ala Lys Tyr Gly Tyr Lys Leu
Cys Leu Arg Leu Tyr Leu Asn Gly Asp 35 40
45Gly Thr Gly Lys Arg Thr His Leu Ser Leu Phe Ile Val Ile Met
Arg 50 55 60Gly Glu Tyr Asp Ala Leu
Leu Pro Trp Pro Phe Arg Asn Lys Val Thr65 70
75 80Phe Met Leu Leu Asp Gln Asn Asn Arg Glu His
Ala Ile Asp Ala Phe 85 90
95Arg Pro Asp Leu Ser Ser Ala Ser Phe Gln Arg Pro Gln Ser Glu Thr
100 105 110Asn Val Ala Ser Gly Cys
Pro Leu Phe Phe Pro Leu Ser Lys Leu Gln 115 120
125Ser Pro Lys His Ala Tyr Val Lys Asp Asp Thr Met Phe Leu
Lys Cys 130 135 140Ile Val Glu Thr Ser
Thr145 15027150PRTHomo sapiens 27Gly Val Phe Ile Trp Lys
Ile Ser Asp Phe Ala Arg Lys Arg Gln Glu1 5
10 15Ala Val Ala Gly Arg Ile Pro Ala Ile Phe Ser Pro
Ala Phe Tyr Thr 20 25 30Ser
Arg Tyr Gly Tyr Lys Met Cys Leu Arg Ile Tyr Leu Asn Gly Asp 35
40 45Gly Thr Gly Arg Gly Thr His Leu Ser
Leu Phe Phe Val Val Met Lys 50 55
60Gly Pro Asn Asp Ala Leu Leu Arg Trp Pro Phe Asn Gln Lys Val Thr65
70 75 80Leu Met Leu Leu Asp
Gln Asn Asn Arg Glu His Val Ile Asp Ala Phe 85
90 95Arg Pro Asp Val Thr Ser Ser Ser Phe Gln Arg
Pro Val Asn Asp Met 100 105
110Asn Ile Ala Ser Gly Cys Pro Leu Phe Cys Pro Val Ser Lys Met Glu
115 120 125Ala Lys Asn Ser Tyr Val Arg
Asp Asp Ala Ile Phe Ile Lys Ala Ile 130 135
140Val Asp Leu Thr Gly Leu145 15028153PRTHomo
sapiens 28Gly Val Leu Ile Trp Lys Ile Arg Asp Tyr Lys Arg Arg Lys Gln
Glu1 5 10 15Ala Val Met
Gly Lys Thr Leu Ser Leu Tyr Ser Gln Pro Phe Tyr Thr 20
25 30Gly Tyr Phe Gly Tyr Lys Met Cys Ala Arg
Val Tyr Leu Asn Gly Asp 35 40
45Gly Met Gly Lys Gly Thr His Leu Ser Leu Phe Phe Val Ile Met Arg 50
55 60Gly Glu Tyr Asp Ala Leu Leu Pro Trp
Pro Phe Lys Gln Lys Val Thr65 70 75
80Leu Met Leu Met Asp Gln Gly Ser Ser Arg Arg His Leu Gly
Asp Ala 85 90 95Phe Lys
Pro Asp Pro Asn Ser Ser Ser Phe Lys Lys Pro Ile Gly Glu 100
105 110Met Asn Ile Ala Ser Gly Cys Pro Val
Phe Val Ala Gln Thr Val Leu 115 120
125Glu Asn Gly Thr Tyr Ile Lys Asp Asp Thr Ile Phe Ile Lys Val Ile
130 135 140Val Asp Thr Ser Asp Leu Pro
Asp Pro145 15029163PRTHomo sapiens 29Gly Val Leu Ile Trp
Lys Ile Gly Ser Tyr Gly Arg Arg Leu Gln Glu1 5
10 15Ala Lys Ala Lys Pro Asn Leu Glu Cys Phe Ser
Pro Ala Phe Tyr Thr 20 25
30His Lys Tyr Gly Tyr Lys Leu Gln Val Ser Ala Phe Leu Asn Gly Asn
35 40 45Gly Ser Gly Glu Gly Thr His Leu
Ser Leu Tyr Ile Arg Val Leu Pro 50 55
60Gly Ala Phe Asp Asn Leu Leu Glu Trp Pro Phe Ala Arg Arg Val Thr65
70 75 80Phe Ser Leu Leu Asp
Gln Ser Asp Pro Gly Leu Ala Lys Pro Gln His 85
90 95Val Thr Glu Thr Phe His Pro Asp Pro Asn Trp
Lys Asn Phe Gln Lys 100 105
110Pro Gly Thr Trp Arg Gly Ser Leu Asp Glu Ser Ser Leu Gly Phe Gly
115 120 125Tyr Pro Lys Phe Ile Ser His
Gln Asp Ile Arg Lys Arg Asn Tyr Val 130 135
140Arg Asp Asp Ala Val Phe Ile Arg Ala Ala Val Glu Leu Pro Arg
Lys145 150 155 160Ile Leu
Ser30154PRTHomo sapiens 30Gly Lys Leu Ile Trp Lys Val Thr Asp Tyr Lys Met
Lys Lys Arg Glu1 5 10
15Ala Val Asp Gly His Thr Val Ser Ile Phe Ser Gln Ser Phe Tyr Thr
20 25 30Ser Arg Cys Gly Tyr Arg Leu
Cys Ala Arg Ala Tyr Leu Asn Gly Asp 35 40
45Gly Ser Gly Arg Gly Ser His Leu Ser Leu Tyr Phe Val Val Met
Arg 50 55 60Gly Glu Phe Asp Ser Leu
Leu Gln Trp Pro Phe Arg Gln Arg Val Thr65 70
75 80Leu Met Leu Leu Asp Gln Ser Gly Lys Lys Asn
Ile Met Glu Thr Phe 85 90
95Lys Pro Asp Pro Asn Ser Ser Ser Phe Lys Arg Pro Asp Gly Glu Met
100 105 110Asn Ile Ala Ser Gly Cys
Pro Arg Phe Val Ala His Ser Val Leu Glu 115 120
125Asn Ala Lys Asn Ala Tyr Ile Lys Asp Asp Thr Leu Phe Leu
Lys Val 130 135 140Ala Val Asp Leu Thr
Asp Leu Glu Asp Leu145 15031157PRTHomo sapiens 31Gly Ile
Tyr Ile Trp Lys Ile Gly Asn Phe Gly Met His Leu Lys Cys1 5
10 15Gln Glu Glu Glu Lys Pro Val Val
Ile His Ser Pro Gly Phe Tyr Thr 20 25
30Gly Lys Pro Gly Tyr Lys Leu Cys Met Arg Leu His Leu Gln Leu
Pro 35 40 45Thr Ala Gln Arg Cys
Ala Asn Tyr Ile Ser Leu Phe Val His Thr Met 50 55
60Gln Gly Glu Tyr Asp Ser His Leu Pro Trp Pro Phe Gln Gly
Thr Ile65 70 75 80Arg
Leu Thr Ile Leu Asp Gln Ser Glu Ala Pro Val Arg Gln Asn His
85 90 95Glu Glu Ile Met Asp Ala Lys
Pro Glu Leu Leu Ala Phe Gln Arg Pro 100 105
110Thr Ile Pro Arg Asn Pro Lys Gly Phe Gly Tyr Val Thr Phe
Met His 115 120 125Leu Glu Ala Leu
Arg Gln Arg Thr Phe Ile Lys Asp Asp Thr Leu Leu 130
135 140Val Arg Cys Glu Val Ser Thr Arg Phe Asp Met Gly
Ser145 150 15532964PRTHomo sapiens 32Met
Asp Glu Gln Ser Val Glu Ser Ile Ala Glu Val Phe Arg Cys Phe1
5 10 15Ile Cys Met Glu Lys Leu Arg
Asp Ala Arg Leu Cys Pro His Cys Ser 20 25
30Lys Leu Cys Cys Phe Ser Cys Ile Arg Arg Trp Leu Thr Glu
Gln Arg 35 40 45Ala Gln Cys Pro
His Cys Arg Ala Pro Leu Gln Leu Arg Glu Leu Val 50 55
60Asn Cys Arg Trp Ala Glu Glu Val Thr Gln Gln Leu Asp
Thr Leu Gln65 70 75
80Leu Cys Ser Leu Thr Lys His Glu Glu Asn Glu Lys Asp Lys Cys Glu
85 90 95Asn His His Glu Lys Leu
Ser Val Phe Cys Trp Thr Cys Lys Lys Cys 100
105 110Ile Cys His Gln Cys Ala Leu Trp Gly Gly Met His
Gly Gly His Thr 115 120 125Phe Lys
Pro Leu Ala Glu Ile Tyr Glu Gln His Val Thr Lys Val Asn 130
135 140Glu Glu Val Ala Lys Leu Arg Arg Arg Leu Met
Glu Leu Ile Ser Leu145 150 155
160Val Gln Glu Val Glu Arg Asn Val Glu Ala Val Arg Asn Ala Lys Asp
165 170 175Glu Arg Val Arg
Glu Ile Arg Asn Ala Val Glu Met Met Ile Ala Arg 180
185 190Leu Asp Thr Gln Leu Lys Asn Lys Leu Ile Thr
Leu Met Gly Gln Lys 195 200 205Thr
Ser Leu Thr Gln Glu Thr Glu Leu Leu Glu Ser Leu Leu Gln Glu 210
215 220Val Glu His Gln Leu Arg Ser Cys Ser Lys
Ser Glu Leu Ile Ser Lys225 230 235
240Ser Ser Glu Ile Leu Met Met Phe Gln Gln Val His Arg Lys Pro
Met 245 250 255Ala Ser Phe
Val Thr Thr Pro Val Pro Pro Asp Phe Thr Ser Glu Leu 260
265 270Val Pro Ser Tyr Asp Ser Ala Thr Phe Val
Leu Glu Asn Phe Ser Thr 275 280
285Leu Arg Gln Arg Ala Asp Pro Val Tyr Ser Pro Pro Leu Gln Val Ser 290
295 300Gly Leu Cys Trp Arg Leu Lys Val
Tyr Pro Asp Gly Asn Gly Val Val305 310
315 320Arg Gly Tyr Tyr Leu Ser Val Phe Leu Glu Leu Ser
Ala Gly Leu Pro 325 330
335Glu Thr Ser Lys Tyr Glu Tyr Arg Val Glu Met Val His Gln Ser Cys
340 345 350Asn Asp Pro Thr Lys Asn
Ile Ile Arg Cys Phe Ala Ser Asp Phe Glu 355 360
365Val Gly Glu Cys Trp Gly Tyr Asn Arg Phe Phe Arg Leu Asp
Leu Leu 370 375 380Ala Asn Glu Gly Tyr
Leu Asn Pro Gln Asn Asp Thr Val Ile Leu Arg385 390
395 400Phe Gln Val Arg Ser Pro Thr Phe Phe Gln
Lys Ser Arg Asp Gln His 405 410
415Trp Tyr Thr Ile Gln Leu Glu Ala Ala Gln Thr Ser Tyr Ile Gln Gln
420 425 430Ile Asn Asn Leu Lys
Glu Arg Leu Thr Ile Glu Leu Ser Arg Thr Gln 435
440 445Lys Ser Arg Asp Leu Ser Pro Pro Asp Asn His Leu
Ser Pro Gln Asn 450 455 460Asp Asp Ala
Leu Glu Thr Arg Ala Lys Lys Ser Ala Cys Ser Asp Met465
470 475 480Leu Leu Glu Gly Gly Pro Thr
Thr Ala Ser Val Arg Glu Ala Lys Glu 485
490 495Asp Glu Glu Asp Glu Glu Lys Ile Gln Asn Glu Asp
Tyr His His Glu 500 505 510Leu
Ser Asp Gly Asp Leu Asp Leu Asp Leu Val Tyr Glu Asp Glu Val 515
520 525Asn Gln Leu Asp Gly Ser Ser Ser Ser
Ala Ser Ser Thr Ala Thr Ser 530 535
540Asn Thr Glu Glu Asn Asp Ile Asp Glu Glu Thr Met Ser Gly Glu Asn545
550 555 560Asp Val Glu Tyr
Asn Asn Met Glu Leu Glu Glu Gly Glu Leu Met Glu 565
570 575Asp Ala Ala Ala Ala Gly Pro Ala Gly Ser
Ser His Gly Tyr Val Gly 580 585
590Ser Ser Ser Arg Ile Ser Arg Arg Thr His Leu Cys Ser Ala Ala Thr
595 600 605Ser Ser Leu Leu Asp Ile Asp
Pro Leu Ile Leu Ile His Leu Leu Asp 610 615
620Leu Lys Asp Arg Ser Ser Ile Glu Asn Leu Trp Gly Leu Gln Pro
Arg625 630 635 640Pro Pro
Ala Ser Leu Leu Gln Pro Thr Ala Ser Tyr Ser Arg Lys Asp
645 650 655Lys Asp Gln Arg Lys Gln Gln
Ala Met Trp Arg Val Pro Ser Asp Leu 660 665
670Lys Met Leu Lys Arg Leu Lys Thr Gln Met Ala Glu Val Arg
Cys Met 675 680 685Lys Thr Asp Val
Lys Asn Thr Leu Ser Glu Ile Lys Ser Ser Ser Ala 690
695 700Ala Ser Gly Asp Met Gln Thr Ser Leu Phe Ser Ala
Asp Gln Ala Ala705 710 715
720Leu Ala Ala Cys Gly Thr Glu Asn Ser Gly Arg Leu Gln Asp Leu Gly
725 730 735Met Glu Leu Leu Ala
Lys Ser Ser Val Ala Asn Cys Tyr Ile Arg Asn 740
745 750Ser Thr Asn Lys Lys Ser Asn Ser Pro Lys Pro Ala
Arg Ser Ser Val 755 760 765Ala Gly
Ser Leu Ser Leu Arg Arg Ala Val Asp Pro Gly Glu Asn Ser 770
775 780Arg Ser Lys Gly Asp Cys Gln Thr Leu Ser Glu
Gly Ser Pro Gly Ser785 790 795
800Ser Gln Ser Gly Ser Arg His Ser Ser Pro Arg Ala Leu Ile His Gly
805 810 815Ser Ile Gly Asp
Ile Leu Pro Lys Thr Glu Asp Arg Gln Cys Lys Ala 820
825 830Leu Asp Ser Asp Ala Val Val Val Ala Val Phe
Ser Gly Leu Pro Ala 835 840 845Val
Glu Lys Arg Arg Lys Met Val Thr Leu Gly Ala Asn Ala Lys Gly 850
855 860Gly His Leu Phe Gly Leu Gln Met Thr Asp
Leu Glu Asn Asn Ser Glu865 870 875
880Thr Gly Phe Leu Gln Pro Val Leu Pro Glu Gly Ala Ser Ala Ala
Pro 885 890 895Glu Glu Gly
Met Ser Ser Asp Ser Asp Ile Glu Gly Asp Thr Glu Asn 900
905 910Glu Glu Gln Glu Glu His Thr Ser Val Gly
Gly Phe His Asp Ser Phe 915 920
925Met Val Met Thr Gln Pro Pro Asp Glu Asp Thr His Ser Ser Glu Pro 930
935 940Asp Gly Phe Gln Ile Gly Pro Glu
Asp Leu Ser Phe Asn Thr Asp Glu945 950
955 960Asn Ser Gly Arg
User Contributions:
Comment about this patent or add new information about this topic: