Patent application title: PURIFIED SR-P70 PROTEIN
Inventors:
Daniel Caput (Labege, FR)
Pascual Ferrara (Avignonet Lauragais, FR)
Ahmed Mourad Kaghad (Montgiscard, FR)
Assignees:
SANOFI-AVENTIS
IPC8 Class: AG01N3353FI
USPC Class:
436501
Class name: Chemistry: analytical and immunological testing biospecific ligand binding assay
Publication date: 2011-01-27
Patent application number: 20110020951
Inventors list |
Agents list |
Assignees list |
List by place |
Classification tree browser |
Top 100 Inventors |
Top 100 Agents |
Top 100 Assignees |
Usenet FAQ Index |
Documents |
Other FAQs |
Patent application title: PURIFIED SR-P70 PROTEIN
Inventors:
Daniel CAPUT
Pascual FERRARA
Ahmed Mourad KAGHAD
Agents:
ANDREA Q. RYAN;SANOFI-AVENTIS U.S. LLC
Assignees:
Origin: BRIDGEWATER, NJ US
IPC8 Class: AG01N3353FI
USPC Class:
Publication date: 01/27/2011
Patent application number: 20110020951
Abstract:
The invention relates to new nucleic acid sequences of the family of
tumor-suppressing genes related to the gene for the p53 protein, and to
corresponding protein sequences.Claims:
1. A monoclonal or polyclonal antibody or their fragments, a chimeric
antibody or an immunoconjugate capable of specifically recognizing a
polypeptide comprising amino acid sequence SEQ ID NO:6.
2. A method of purifying or detecting a polypeptide comprising amino acid sequence SEQ ID NO:6, the method comprising:a) contacting the antibody or the immunoconjugate of claim 1 to a biological sample; andb) purifying or detecting a polypeptide comprising amino acid sequence SEQ ID NO:6 bound by the antibody or the immunoconjugate of claim 1.
3. A method of in vitro diagnosis of pathologies correlated with an expression or an abnormal accumulation of SR-p70 proteins from a biological sample, the method comprising:a) contacting the antibody or the immunoconjugate of claim 1 to a biological sample; andb) identifying specific immunological complexes between an SR-p70 protein and the antibody or the immunoconjugate of claim 1wherein an abnormal accumulation of SR-p70 proteins identifies a pathology associated with an abnormal accumulaton of SR-p70.
4. A kit for the in vitro diagnosis of an expression or an abnormal accumulation of SR-p70 proteins in a biological sample or for measuring the level of expression of these proteins in the said sample, comprising:at least one antibody according to claim 1, optionally bound to a support,means of visualization of the formation of specific antigen-antibody complexes between an SR-p70 protein and the said antibody, or means of quantification of these complexes.
Description:
[0001]The invention relates to new nucleic acid sequences of the family of
tumour-suppressing genes related to the gene for the p53 protein, and to
the corresponding protein sequences.
[0002]The invention also relates to the prophylactic, therapeutic and diagnostic applications of these sequences, in particular in the field of pathologies linked to the phenomena of apoptosis or of cell transformation.
[0003]Tumour-suppressing genes perform a key role in protection against the phenomena of carcinogenesis, and any modification capable of bringing about the loss of one of these genes, its inactivation or its dysfunction may have oncogenic character, thereby creating favourable conditions for the development of a malignant tumour.
[0004]The authors of the present invention have identified transcription products of a new gene, as well as the corresponding proteins. This gene, SR-p70, is related to the p53 tumour-suppressing gene, the antitumour activity of which is linked to its transcription factor activity, and more specifically to the controls exerted on the activity of the Bax and Bcl-2 genes which are instrumental in the mechanisms of cell death.
[0005]Hence the present invention relates to purified SR-p70 proteins, or biologically active fragments of the latter.
[0006]The invention also relates to isolated nucleic acid sequences coding for the said proteins or their biologically active fragments, and to specific oligonucleotides obtained from these sequences.
[0007]It relates, in addition, to the cloning and/or expression vectors containing at least one of the nucleotide sequences defined above, and the host cells transfected by these cloning and/or expression vectors under conditions permitting the replication and/or expression of one of the said nucleotide sequences.
[0008]The methods of production of recombinant SR-p70 proteins or their biologically active fragments by the transfected host cells also form part of the invention.
[0009]The invention also comprises antibodies or antibody derivatives specific for the proteins defined above.
[0010]It relates, in addition, to methods of detection of cancers, either by measuring the accumulation of SR-p70 proteins in the tumours according to immunohistochemical techniques, or by demonstrating autoantibodies directed against these proteins in patients' serum.
[0011]The invention also relates to any inhibitor or activator of SR-p70 activity, for example of protein-protein interaction, involving SR-p70.
[0012]It also relates to antisense oligonucleotide sequences specific for the above nucleic acid sequences, capable of modulating in vivo the expression of the SR-p70 gene.
[0013]Lastly, the invention comprises a method of gene therapy, in which vectors such as, for example, inactivated viral vectors capable of transferring coding sequences for a protein according to the invention are injected into cells deficient for this protein, for purposes of regulating the phenomena of apoptosis or of reversion of transformation.
[0014]A subject of the present invention is a purified polypeptide comprising an amino acid sequence selected from: [0015]a) the sequence SEQ ID No. 2; [0016]b) the sequence SEQ ID No. 4; [0017]c) the sequence SEQ ID No. 6; [0018]d) the sequence SEQ ID No. 8; [0019]e) the sequence SEQ ID No. 10; [0020]f) the sequence SEQ ID No. 13; [0021]g) the sequence SEQ ID No. 15; [0022]h) the sequence SEQ ID No. 17; [0023]i) the sequence SEQ ID No. 19; [0024]j) any biologically active sequence derived from SEQ ID No. 2, SEQ ID No. 4, SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No. 17 or SEQ ID No. 19.
[0025]In the description of the invention, the following definitions are used: [0026]SR-p70 protein: a polypeptide comprising an amino acid sequence selected from the sequences SEQ ID No. 2, SEQ ID No. 4, SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No. 17 or SEQ ID No. 19, or any biologically active fragment or derivative of this polypeptide; [0027]derivative: any variant polypeptide of the polypeptide of sequence SEQ ID No. 2, SEQ ID No. 4, SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No. 17 or SEQ ID No. 19, or any molecule resulting from a modification of a genetic and/or chemical nature of the sequence SEQ ID No. 2, SEQ ID No. 4, SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No. 17 or SEQ ID No. 19, that is to say obtained by mutation, deletion, addition, substitution and/or chemical modification of a single amino acid or of a limited number of amino acids, as well as any isoform sequence, that is to say sequence identical to the sequence SEQ ID No. 2, SEQ ID No. 4, SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No. 17 or SEQ ID No. 19, or to one of its fragments or modified sequences, containing one or more amino acids in the form of the D enantiomer, the said variant, modified or isoform sequences having retained at least one of the properties that make them biologically active; [0028]biologically active: capable of binding to DNA and/or of exerting transcription factor activity and/or of participating in the control of the cell cycle, of differentiation and of apoptosis and/or capable of being recognized by the antibodies specific for the polypeptide of sequence SEQ ID No. 2, SEQ ID No. 4, SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No. 17 or SEQ ID No. 19, and/or capable of inducing antibodies which recognize this polypeptide.
[0029]The manufacture of derivatives may have different objectives, including especially that of increasing the affinity of the polypeptide for DNA or its transcription factor activity, and that of improving its levels of production, of increasing its resistance to proteases, of modifying its biological activities or of endowing it with new pharmaceutical and/or biological properties.
[0030]Among the polypeptides of the invention, the polypeptide of human origin comprising the sequence SEQ ID No. 6, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No.17 or SEQ ID No. 19 is preferred. The polypeptide of 636 amino acids corresponding to the sequence SEQ ID No. 6 is more than 97% identical to the polypeptide of sequence SEQ ID No. 2.
[0031]The polypeptide of sequence SEQ ID No. 2 and that of sequence SEQ ID No. 4 are two expression products of the same gene, and the same applies to the sequences SEQ ID No. 8 and SEQ ID No. 10 and to the sequences SEQ ID No. 6, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No. 17 or SEQ ID No. 19.
[0032]As will be explained in the examples, the polypeptide of sequence SEQ ID No. 4 corresponds to a premature termination of the peptide of sequence SEQ ID No. 2, linked to an alternative splicing of the longer transcript (messenger RNA), coding for the polypeptide of SEQ ID No. 2, of the corresponding gene. Similarly, in humans, the polypeptides corresponding to the sequences SEQ ID No. 6, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No. 17 and SEQ ID No. 19, diverge in their composition in respect of the N- and/or C-terminal portions, this being the outcome of alternative splicing of the same primary transcript. The N-terminal peptide sequence of the sequence SEQ ID No. 10 is deleted, this being linked to an alternative splicing of its coding transcript.
[0033]Advantageously, the invention relates to a polypeptide corresponding to the DNA binding domain of one of the above polypeptides.
[0034]This domain corresponds to the sequence lying between residue 110 and residue 310 for the sequences SEQ ID No. 2 or 6, and between residue 60 and residue 260 for the sequence SEQ ID No. 8.
[0035]A subject of the present invention is also nucleic acid sequences coding for a SR-p70 protein or biologically active fragments or derivatives of the latter.
[0036]More preferably, a subject of the invention is an isolated nucleic acid sequence selected from: [0037]a) the sequence SEQ ID No. 1; [0038]b) the sequence SEQ ID No. 3; [0039]c) the sequence SEQ ID No. 5; [0040]d) the sequence SEQ ID No. 7; [0041]e) the sequence SEQ ID No. 9; [0042]f) the sequence SEQ ID No. 11; [0043]g) the sequence SEQ ID No. 12; [0044]h) the sequence SEQ ID No. 14; [0045]i) the sequence SEQ ID No. 16; [0046]j) the sequence SEQ ID No. 18; [0047]k) the nucleic acid sequences capable of hybridizing specifically with the sequence SEQ ID No. 1, SEQ ID No. 3, SEQ ID No. 5, SEQ ID No. 7, SEQ ID No. 9, SEQ ID No. 11, SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 16 or SEQ ID No. 18 or with the sequences complementary to them, or of hybridizing specifically with their proximal sequences; [0048]l) the sequences derived from the sequences a), b), c), d), e), f), g), h), i), j) or k) as a result of the degeneracy of the genetic code.
[0049]According to a preferred embodiment, a subject of the invention is nucleotide sequences SEQ ID No. 5, SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 16 and SEQ ID No. 18, corresponding, respectively, to the cDNAs of the human proteins of the sequences SEQ ID No. 6, SEQ ID No. 13, SEQ ID No. 15, SEQ ID No. 17 and SEQ ID No. 19.
[0050]The different nucleotide sequences of the invention may be of artificial origin or otherwise. They can be DNA or RNA sequences obtained by the screening of libraries of sequences by means of probes prepared on the basis of the sequences SEQ ID No. 1, 3, 5, 7, 9, 11, 12, 14, 16 or 18. Such libraries may be prepared by traditional techniques of molecular biology which are known to a person skilled in the art.
[0051]The nucleotide sequences according to the invention may also be prepared by chemical synthesis, or alternatively by mixed methods including the chemical or enzymatic modification of sequences obtained by the screening of libraries.
[0052]These nucleotide sequences enable nucleotide probes to be produced which are capable of hybridizing strongly and specifically with a nucleic acid sequence, of a genomic DNA or of a messenger RNA, coding for a polypeptide according to the invention or a biologically active fragment of the latter. Such probes also form part of the invention. They may be used as an in vitro diagnostic tool for the detection, by hybridization experiments, of transcripts specific for the polypeptides of the invention in biological samples, or for the demonstration of aberrant syntheses or of genetic abnormalities such as loss of heterozygosity or genetic rearrangement resulting from a polymorphism, from mutations or from a different splicing.
[0053]The probes of the invention contain at least 10 nucleotides, and contain at most the whole of the sequence of the SR-p70 gene or of its cDNA contained, for example, in a cosmid.
[0054]Among the shortest probes, that is to say of approximately 10 to 20 nucleotides, the appropriate hybridization conditions correspond to the stringent conditions normally used by a person skilled in the art.
[0055]The temperature used is preferably between Tm-5° C. and Tm-30° C., and as a further preference between Tm-5° C. and Tm-10° C., Tm being the melting temperature, the temperature at which 50% of the paired DNA strands separate.
[0056]The hybridization is preferably conducted in solutions of high ionic strength, such as, in particular, 6×SSC solutions.
[0057]Advantageously, the hybridization conditions used are as follows: [0058]temperature: 42° C., [0059]hybridization buffer: 6×SSC, 5×Denhart's, 0.1% SDS, as described in Example III.
[0060]Advantageously, these probes are represented by the following oligonucleotides or the sequences complementary to them:
TABLE-US-00001 SEQ ID No. 20: GCG AGC TGC CCT CGG AG SEQ ID No. 21: GGT TCT GCA GGT GAC TCA G SEQ ID No. 22: GCC ATG CCT GTC TAC AAG SEQ ID No. 23: ACC AGC TGG TTG ACG GAG SEQ ID No. 24: GTC AAC CAG CTG GTG GGC CAG SEQ ID No. 25: GTG GAT CTC GGC CTC C SEQ ID No. 26: AGG CCG GCG TGG GGA AG SEQ ID No. 27: CTT GGC GAT CTG GCA GTA G SEQ ID No. 28: GCG GCC ACG ACC GTG AC SEQ ID No. 29: GGC AGC TTG GGT CTC TGG SEQ ID No. 30: CTG TAC GTC GGT GAC CCC SEQ ID No. 31: TCA GTG GAT CTC GGC CTC SEQ ID No. 32: AGG GGA CGC AGC GAA ACC SEQ ID No. 33: CCA TCA GCT CCA GGC TCT C SEQ ID No. 34: CCA GGA CAG GCG CAG ATG SEQ ID No. 35: GAT GAG GTG GCT GGC TGG A SEQ ID No. 36: TGG TCA GGT TCT GCA GGT G SEQ ID No. 37: CAC CTA CTC CAG GGA TGC SEQ ID No. 38: AGG AAA ATA GAA GCG TCA GTC SEQ ID No. 39: CAG GCC CAC TTG CCT GCC SEQ ID No. 40: CTG TCC CCA AGC TGA TGA G
[0061]Preferably, the probes of the invention are labelled prior to their use. To this end, several techniques are within the capacity of a person skilled in the art (fluorescent, radioactive, chemoluminescence, enzyme, and the like, labelling).
[0062]The in vitro diagnostic methods in which these nucleotide probes are employed are included in the subject of the present invention.
[0063]These methods relate, for example, to the detection of abnormal syntheses (e.g. accumulation of transcription products) or of genetic abnormalities, such as loss of heterozygosity and genetic rearrangement, and point mutations in the nucleotide sequences of nucleic acids coding for an SR-p70 protein, according to the definition given above.
[0064]The nucleotide sequences of the invention are also useful for the manufacture and use of oligonucleotide primers for sequencing reactions or specific amplification reactions according to the so-called PCR technique or any variant of the latter (ligase chain reaction (LCR), etc).
[0065]Preferred primer pairs consist of primers selected from the nucleotide sequences: SEQ ID No. 1: monkey sequence of 2,874 nucleotides, and SEQ ID No. 5: human SR-p70a cDNA, in particular upstream of the ATG translation initiation codon and downstream of the TGA translation stop codon.
[0066]Advantageously, these primers are represented by the following pairs:
TABLE-US-00002 pair No. 1: sense primer: GCG AGC TGC CCT CGG AG (SEQ ID No. 20) antisense primer: GGT TCT GCA GGT GAC TCA G (SEQ ID No. 21) pair No. 2: sense primer: GCC ATG CCT GTC TAC AAG (SEQ ID No. 22) antisense primer: ACC AGC TGG TTG ACG GAG (SEQ ID No. 23) pair No. 3: sense primer: GTC AAC CAG CTG GTG GGC CAG (SEQ ID No. 24) antisense primer: GTG GAT CTC GGC CTC C (SEQ ID No. 25) pair No. 4: sense primer: AGG CCG GCG TGG GGA AG (SEQ ID No. 26) antisense primer: CTT GGC GAT CTG GCA GTA G (SEQ ID No. 27) pair No. 5: sense primer: GCG GCC ACG ACC GTG A (SEQ ID No. 28) antisense primer: GGC AGC TTG GGT CTC TGG (SEQ ID No. 29) pair No. 6: sense primer: CTG TAC GTC GGT GAC CCC (SEQ ID No. 30) antisense primer: TCA GTG GAT CTC GGC CTC (SEQ ID No. 31) pair No. 7: sense primer: AGG GGA CGC AGC GAA ACC (SEQ ID No. 32) antisense primer: GGC AGC TTG GGT CTC TGG (SEQ ID No. 29) pair No. 8: sense primer: CCCCCCCCCCCCCCN (where N equals G, A or T) antisense primer: CCA TCA GCT CCA GGC TCT C (SEQ ID No. 33) pair No. 9: sense primer: CCCCCCCCCCCCCCN (where N equals G, A or T) antisense primer: CCA GGA CAG GCG CAG ATG (SEQ ID No. 34) pair No. 10: sense primer: CCCCCCCCCCCCCCCN (where N equals G, A or T) antisense primer: CTT GGC GAT CTG GCA GTA G (SEQ ID No. 27) pair No. 11: sense primer: CAC CTA CTC CAG GGA TGC (SEQ ID No. 37) antisense primer: AGG AAA ATA GAA GCG TCA GTC (SEQ ID No. 38) pair No. 12: sense primer: CAG GCC CAC TTG CCT GCC (SEQ ID No. 39) antisense primer: CTG TCC CCA AGC TGA TGA G (SEQ ID No. 40)
[0067]These primers correspond to the sequences extending, respectively: [0068]from nucleotide No. 124 to nucleotide No. 140 on SEQ ID No. 1 and from nucleotide No. 1 to nucleotide No. 17 on SEQ ID No. 5 for SEQ ID No. 20 [0069]from nucleotide No. 2280 to nucleotide No. 2262 on SEQ ID No. 1 and from nucleotide No. 2156 to nucleotide 2138 on SEQ ID No. 5 for SEQ ID No. 21 [0070]from nucleotide No. 684 to nucleotide No. 701 on SEQ ID No. 1 for SEQ ID No. 22 [0071]from nucleotide No. 1447 to nucleotide No. 1430 on SEQ ID No. 1 and from nucleotide 1324 to nucleotide 1307 on SEQ ID No. 5 for SEQ ID No. 23 [0072]from nucleotide 1434 to nucleotide 1454 on SEQ ID No. 1 and from nucleotide 1311 to nucleotide 1331 on SEQ ID No. 5 for SEQ ID No. 24 [0073]from nucleotide 2066 to nucleotide 2051 on SEQ ID No. 1 and from nucleotide 1940 to nucleotide 1925 on SEQ ID No. 5 for SEQ ID No. 25 [0074]from nucleotide 16 to nucleotide 32 on SEQ ID No. 5 for SEQ ID No. 26 [0075]from nucleotide 503 to nucleotide 485 on SEQ ID No. 5 for SEQ ID No. 27 [0076]from nucleotide 160 to nucleotide 176 on SEQ ID No. 11 for SEQ ID No. 28 [0077]from nucleotide 1993 to nucleotide 1976 on SEQ ID No. 5 for SEQ ID No. 29 [0078]from nucleotide 263 to nucleotide 280 on SEQ ID No. 11 for SEQ ID No. 30 [0079]from nucleotide 1943 to nucleotide 1926 on SEQ ID No. 5 for SEQ ID No. 31 [0080]from nucleotide 128 to nucleotide 145 on the nucleotide sequence depicted in FIG. 22 for SEQ ID No. 32 [0081]from nucleotide 1167 to nucleotide 1149 on SEQ ID No. 5 for SEQ ID No. 33 [0082]from nucleotide 928 to nucleotide 911 on SEQ ID No. 5 for SEQ ID No. 34 [0083]from nucleotide 677 to nucleotide 659 on SEQ ID No. 5 for SEQ ID No. 35 [0084]from nucleotide 1605 to nucleotide 1587 on SEQ ID No. 5 for SEQ ID No. 36 [0085]from nucleotide 1 to nucleotide 18 on the nucleotide sequence depicted in FIG. 13 for SEQ ID No. 37 [0086]from nucleotide 833 to nucleotide 813 on the nucleotide sequence depicted in FIG. 13 for SEQ ID No. 38 [0087]from nucleotide 25 to nucleotide 42 on the nucleotide sequence depicted in FIG. 13 for SEQ ID No. 39 [0088]from nucleotide 506 to nucleotide 488 on the nucleotide sequence depicted in FIG. 13 for SEQ ID No. 40
[0089]The nucleotide sequences according to the invention can have, moreover, uses in gene therapy, in particular for controlling the phenomena of apoptosis and of reversion of transformation.
[0090]The nucleotide sequences according to the invention may, moreover, be used for the production of recombinant SR-p70 proteins, according to the definition which has been given to this term.
[0091]These proteins may be produced from the nucleotide sequences defined above, according to techniques of production of recombinant products which are known to a person skilled in the art. In this case, the nucleotide sequence used is placed under the control of signals permitting its expression in a cell host.
[0092]An effective system for production of a recombinant protein necessitates having at one's disposal a vector, for example of plasmid or viral origin, and a compatible host cell.
[0093]The cell host may be selected from prokaryotic systems such as bacteria, or eukaryotic systems such as, for example, yeasts, insect cells, CHO cells (Chinese hamster ovary cells) or any other system advantageously available. A preferred cell host for the expression of proteins of the invention consists of the E. coli bacterium, in particular the strain MC 1061 (Clontec).
[0094]The vector must contain a promoter, translation initiation and termination signals and also the appropriate transcription regulation regions. It must be capable of being maintained stably in the cell and can, where appropriate, possess particular signals specifying the secretion of the translated protein.
[0095]These various control signals are selected in accordance with the cell host used. To this end, the nucleotide sequences according to the invention may be inserted into vectors which are autonomously replicating within the selected host, or vectors which are integrative for the chosen host. Such vectors will be prepared according to methods commonly used by a person skilled in the art, and the clones resulting therefrom may be introduced into a suitable host by standard methods such as, for example, electroporation.
[0096]The cloning and/or expression vectors containing at least one of the nucleotide sequences defined above also form part of the present invention.
[0097]A preferred cloning and expression vector is the plasmid pSE1, which contains the elements necessary for its use both as a cloning vector in E. coli (origin of replication in E. coli and ampicillin resistance gene originating from the plasmid pTZ 18R) and as an expression vector in animal cells (promoter, intron, polyadenylation site, origin of replication of the SV40 virus), as well as the elements enabling it to be copied as a single strand with the object of sequencing (origin of replication of phage f1).
[0098]The characteristics of this plasmid are described in Application EP 0,506,574.
[0099]Its construction and also the integration of the cDNAs originating from the nucleic acid sequences of the invention are, moreover, described in the examples below. According to a preferred embodiment, the proteins of the invention are in the form of fusion proteins, in particular in the form of a protein fused with glutathione S-transferase (GST). A designated expression vector in this case is represented by the plasmid vector pGEX-4T-3 (Pharmacia ref-27.4583).
[0100]The invention relates, in addition, to the host cells transfected by these aforementioned vectors. These cells may be obtained by introducing into host cells a nucleotide sequence inserted into a vector as defined above, followed by culturing of the said cells under conditions permitting the replication and/or expression of the transfected nucleotide sequence.
[0101]These cells are usable in a method of production of a recombinant polypeptide of sequence SEQ ID No. 2, SEQ ID No. 4, SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 16 or SEQ ID No. 18 or any biologically active fragment or derivative of the latter.
[0102]The method of production of a polypeptide of the invention in recombinant form is itself included in the present invention, and is characterized in that the transfected cells are cultured under conditions permitting the expression of a recombinant polypeptide of sequence SEQ ID No. 2, SEQ ID No. 4, SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No. 12, SEQ ID No. 14, SEQ ID No. 16 or SEQ ID No. 18 or of any biologically active fragment or derivative of the latter, and in that the said recombinant polypeptide is recovered.
[0103]The purification methods used are known to a person skilled in the art. The recombinant polypeptide may be purified from lysates and cell extracts or from the culture medium supernatant, by methods used individually or in combination, such as fractionation, chromatographic methods, immunoaffinity techniques using specific mono- or polyclonal antibodies, and the like. A preferred variant consists in producing a recombinant polypeptide fused to a "carrier" protein (chimeric protein). The advantage of this system is that it permits a stabilization and a decrease in proteolysis of the recombinant product, an increase in solubility during in vitro renaturation and/or a simplification of the purification when the fusion partner possesses an affinity for a specific ligand.
[0104]Advantageously, the polypeptides of the invention are fused with glutathione S-transferase at the N-terminal position (Pharmacia "GST" system). The fusion product is, in this case, detected and quantified by means of the enzyme activity of the GST. The colorimetric reagent used is a glutathione acceptor, a substrate for GST. The recombinant product is purified on a chromatographic support to which glutathione molecules have been coupled beforehand.
[0105]The mono- or polyclonal antibodies capable of specifically recognizing an SR-p70 protein according to the definition given above also form part of the invention. Polyclonal antibodies may be obtained from the serum of an animal immunized against protein, produced, for example, by genetic recombination according to the method described above, according to standard procedures.
[0106]The monoclonal antibodies may be obtained according to the traditional hybridoma culture method described by Kohler and Milstein, Nature, 1975, 256, 495-497.
[0107]Advantageous antibodies are antibodies directed against the central region lying between residue 110 and residue 310 for the sequences SEQ ID No. 2 or 6, or between residue 60 and residue 260 for the sequence SEQ ID No. 8.
[0108]The antibodies according to the invention are, for example, chimeric antibodies, humanized antibodies or Fab and F(ab')2 fragments. They may also take the form of immunoconjugates or labelled antibodies.
[0109]Moreover, besides their use for the purification of the recombinant polypeptides, the antibodies of the invention, especially the monoclonal antibodies, may also be used for detecting these polypeptides in a biological sample.
[0110]Thus they constitute a means of immunocytochemical or immunohistochemical analysis of the expression of SR-p70 proteins on sections of specific tissues, for example by immunofluorescence, gold labelling or enzyme immunoconj ugates.
[0111]They make it possible, in particular, to demonstrate an abnormal accumulation of SR-p70 proteins in certain tissues or biological samples, which makes them useful for detecting cancers or monitoring the progression or remission of pre-existing cancers.
[0112]More generally, the antibodies of the invention may be advantageously employed in any situation where the expression of an SR-p70 protein has to be observed.
[0113]Hence the invention also relates to a method of in vitro diagnosis of pathologies correlated with an expression or an abnormal accumulation of SR-p70 proteins, in particular the phenomena of carcinogenesis, from a biological sample, characterized in that at least one antibody of the invention is brought into contact with the said biological sample under conditions permitting the possible formation of specific immunological complexes between an SR-p70 protein and the said antibody or antibodies, and in that the specific immunological complexes possibly formed are detected.
[0114]The invention also relates to a kit for the in vitro diagnosis of an expression or an abnormal accumulation of SR-p70 proteins in a biological sample and/or for measuring the level of expression of this protein in the said sample, comprising: [0115]at least one antibody specific for an SR-p70 protein, optionally bound to a support, [0116]means of visualization of the formation of specific antigen-antibody complexes between an SR-p70 protein and the said antibody, and/or means of quantification of these complexes.
[0117]The invention also relates to a method of early diagnosis of tumour formation, by detecting autoantibodies directed against an SR-p70 protein in an individual's serum.
[0118]Such a method of early diagnosis is characterized in that a serum sample drawn from an individual is brought into contact with a polypeptide of the invention, optionally bound to a support, under conditions permitting the formation of specific immunological complexes between the said polypeptide and the autoantibodies possibly present in the serum sample, and in that the specific immunological complexes possibly formed are detected.
[0119]A subject of the invention is also a method of determination of an allelic variability, a mutation, a deletion, an insertion, a loss of heterozygosity or a genetic abnormality of the SR-p70 gene which may be involved in pathologies, characterized in that it utilizes at least one nucleotide sequence described above. Among the methods of determination of an allelic variability, a mutation, a deletion, an insertion, a loss of heterozygosity or a genetic abnormality of the SR-p70 gene, preference is given to the method which is characterized in that it comprises at least one step of PCR amplification of the target nucleic acid sequence of SR-p70 liable to exhibit a polymorphism, a mutation, a deletion or an insertion, using a pair of primers of nucleotide sequences defined above, a step during which the amplified products are treated using a suitable restriction enzyme and a step during which at least one of the products of the enzyme reaction is detected or assayed.
[0120]The invention also comprises pharmaceutical compositions comprising as active principle a polypeptide corresponding to the above definitions, preferably in soluble form, in combination with a pharmaceutically acceptable vehicle.
[0121]Such compositions afford a novel approach to treating the phenomena of carcinogenesis at the level of the control of multiplication and cell differentiation.
[0122]Preferably, these compositions can be administered systemically, preferably intravenously, intramuscularly, intradermally or orally.
[0123]Their optimal modes of administration, dosages and pharmaceutical dosage forms may be determined according to the criteria generally borne in mind in establishing a therapeutic treatment suitable for a patient, such as, for example, the patient's age or body weight, the severity of his or her general state, the tolerability of treatment and the observed side effects, and the like.
[0124]Lastly, the invention comprises a method of gene therapy, in which nucleotide sequences coding for an SR-p70 protein are transferred to target cells by means of inactivated viral vectors.
[0125]Other features and advantages of the invention are to be found in the remainder of the description, with the examples and the figures for which the legends are given below.
LEGEND TO THE FIGURES
[0126]FIG. 1: Nucleic acid comparison of monkey SR-p70a cDNA (corresponding to SEQ ID No. 1) with the nucleic acid sequence of monkey p53 cDNA.
[0127]FIG. 2: Protein comparison of monkey SR-p70a with monkey p53 protein (sw: p53-cerae).
[0128]FIG. 3: Comparison of the nucleic acid sequence of monkey SR-p70a and b cDNA (corresponding, respectively, to SEQ ID No. 1 and SEQ ID No. 3).
[0129]FIG. 4: Nucleic acid sequence and deduced protein sequence of monkey SR-p70a.
[0130]FIG. 5: Partial nucleic acid sequence and complete deduced protein sequence of monkey SR-p70b.
[0131]FIG. 6: Partial nucleic acid sequence and deduced complete protein sequence of human SR-p70a (corresponding to SEQ ID No. 5).
[0132]FIG. 7: Partial nucleic acid sequence and complete deduced protein sequence of mouse SR-p70c (corresponding to SEQ ID No. 7).
[0133]FIG. 8: Partial nucleic acid sequence and partially deduced protein sequence of mouse SR-p70a (corresponding to SEQ ID No. 9).
[0134]FIG. 9: Multialignment of the proteins deduced from monkey (a and b), human (a) and mouse (a and c) SR-p70 cDNAs.
[0135]FIG. 10a: Immunoblot of the SR-p70 protein.
[0136]FIG. 10b: Detection of the endogenous SR-p70 protein.
[0137]FIG. 11: Chromosomal localization of the human SR-p70 gene. The signal appears on chromosome 1, in the p36 region.
[0138]FIG. 12: Genomic structure of the SR-p70 gene and comparison with that of the p53 gene. The human protein sequences of SR-p70a (upper line of the alignment) and of p53 (lower line) are divided up into peptides on the basis of the respective exons from which they are encoded. The figures beside the arrows correspond to the numbering of the corresponding exons.
[0139]FIG. 13: Human genomic sequence of SR-p70 from the 3' end of intron 1 to the 5' end of exon 3. The introns are boxed. At positions 123 and 133, two variable nucleic acid positions are localized (G→A at 123 and C→T at 133). The restriction sites for the enzyme StyI are underlined (position 130 in the case where a T is present instead of a C at position 133, position 542 and position 610). The arrows indicate the positions of the nucleic acid primers used in Example XI.
[0140]FIG. 14: Nucleic acid comparison of the 5' region of the human cDNAs of SR-p70d and of SR-p70a.
[0141]FIG. 15: Multialignment of the nucleic acid sequences corresponding to human SR-p70a, b, d, e, and f.
[0142]FIG. 16: Multialignment of the proteins deduced from human SR-p70 (a, b, d, e and f) cDNAs.
[0143]FIG. 17: Partial nucleic acid sequence and partial deduced protein sequence of human SR-p70a. The two bases in bold characters correspond to two variable positions (see FIG. 6). This sequence possesses a more complete non-coding 5' region than the one presented in FIG. 6.
[0144]FIG. 18: Analysis of the SR-p70a transcripts after PCR amplification. [0145]lane M: 1 kb ladder (GIBCO-BRL) molecular weight markers [0146]lane 1: line HT29 [0147]lane 3: line SK-N-AS [0148]lane 5: line UMR-32 [0149]lane 7: line U-373 MG [0150]lane 9: line SW 480 [0151]lane 11: line CHP 212 [0152]lane 13: line SK-N-MC [0153]lanes 2, 4, 6, 8, 10, 12, 14: negative controls corresponding to lanes 1, 3, 5, 7, 9, 11 and 13, respectively (absence of inverse transcriptase in the RT-PCR reaction).
[0154]FIG. 19: A: Analysis by agarose gel electrophoresis of genomic fragments amplified by PCR (from the 3' end of intron 1 to the 5' end of exon 3). The numbering of the lanes corresponds to the numbering of the control population. Lane M: molecular weight markers (1 kb ladder). [0155]B: Analysis identical to that of part A, after digestion of the same samples with the restriction enzyme StyI.
[0156]FIG. 20: Diagrammatic representation with a partial restriction map of the plasmid pCDNA3 containing human SR-p70a.
EXAMPLE I
Cloning of SR-p70 cDNA from COS-3 Cells
1. Culturing of COS-3 Cells
[0157]COS-3 cells (African green monkey kidney cells transformed with the SV 40 virus T antigen) are cultured in DMEM medium (GIBCO-BRL reference 41 965-047) containing 2 mM L-glutamine and supplemented with 50 mg/l of gentamicin and 5% of foetal bovine serum (GIBCO-BRL reference 10231-074) to semi-confluence.
2. Preparation of the Messenger RNA
a) Extraction of the Messenger RNA
[0158]The cells are recovered in the following manner: [0159]the adherent cells are washed twice with PBS buffer (phosphate buffered saline, reference 04104040-GIBCO-BRL), then scraped off with a rubber scraper and centrifuged.
[0160]The cell pellet is suspended in the lysis buffer of the following composition: 4 M guanidine thiocyanate; 25 mM sodium citrate pH 7; 0.5% sarcosyl; 0.1 M β-mercaptoethanol. The suspension is sonicated using an Ultra-Turrax No. 231256 sonicator (Janke and Kundel) at maximum power for one minute. Sodium acetate pH 4 is added to a concentration of 0.2 M. The solution is extracted with one volume of a phenol/chloroform (5:1 v/v) mixture. The RNA contained in the aqueous phase is precipitated at -20° C. using one volume of isopropanol. The pellet is resuspended in the lysis buffer. The solution is extracted again with a phenol/chloroform mixture and the RNA is precipitated with isopropanol. After washing of the pellet with 70% and then 100% ethanol, the RNA is resuspended in water.
b) Purification of the poly(A).sup.+ Fraction of the RNA
[0161]Purification of poly(A).sup.+ fraction of the RNA is carried out using the DYNAL Dynabeads oligo(dT)25 kit (reference 610.05) according to the protocol recommended by the manufacturer. The principle is based on the use of superparamagnetic polystyrene beads to which an oligonucleotide poly(dT)25 is attached. The poly(A).sup.+ fraction of the RNA is hybridized with the oligo(dT)25 coupled to the beads, which are trapped on a magnetic support.
3. Production of the Complementary DNA Library
a) Preparation of the Complementary DNA
[0162]From 0.5 μg of the poly(A).sup.+ RNA from COS-3 cells obtained at the end of step 2, the [32P] dCTP-labelled single-stranded complementary DNA is prepared (the complementary DNA obtained possesses a specific activity of 3000 dpm/ng) with the synthetic primer of the following sequence (comprising a BamHI site):
5'<GATCCGGGCC CTTTTTTTTT TTT<3'
[0163]in a volume of 30 μl of buffer of composition: 50 mM Tris-HCl pH 8.3, 6 mM MgCl2, 10 mM DDT, 40 mM KCl, containing 0.5 mM each of the deoxynucleotide triphosphates, 30 μCi of [α-32P]dCTP and 30 U of RNasin (Promega). After one hour of incubation at 37° C., then 10 minutes at 50° C., then 10 minutes again at 37° C., with 200 units of the enzyme reverse transcriptase RNase H.sup.(GIBCO-BRL reference 8064A), 4 μl of EDTA are added.
b) Alkaline Hydrolysis of the RNA Template
[0164]6 μl of 2N NaOH solution are added and the mixture is then incubated for 5 minutes at 65° C.
c) Purification on a Sephacryl S-400 Column
[0165]In order to remove the synthetic primer, the complementary DNA is purified on a column of 1 ml of Sephacryl S-400 (Pharmacia) equilibrated in TE buffer.
[0166]The first two radioactive fractions are pooled and precipitated with 1/10 volume of 10 M ammonium acetate solution and 2.5 volumes of ethanol, this being done after extraction with one volume of chloroform.
d) Homopolymer Addition of dG
[0167]The complementary DNA is elongated at the 3' end with a dG tail with 20 units of the enzyme terminal transferase (Pharmacia 27073001). The mixture is incubated in 20 μl of buffer of composition: 30 mM Tris-HCl pH 7.6, 1 mM cobalt chloride, 140 mM cacodylic acid, 0.1 mM DTT, 1 mM dGTP, for 15 minutes at 37° C., and 2 μl of 0.5 M EDTA are then added.
e) Steps b) and c) are Repeated Again
[0168]f) Pairing of the cloning vector pSE1 (EP 506,574) and the complementary DNA in the presence of the adaptor.
[0169]The mixture is centrifuged, the pellet is dissolved in 33 μl of TE buffer, 5 μl (125 ng) of cloning vector pSE1, 1 μl (120 ng) of the adaptor of the following sequence (comprising an ApaI site):
5'AAAAAAAAAAAAAGGGCCCG3'
[0170]and 10 μl of 200 mM NaCl solution are added, and the reaction mixture is incubated for 5 minutes at 65° C. and then allowed to cool to room temperature.
g) Ligation
[0171]The cloning vector and the single-stranded cDNA are ligated in a volume of 100 μl with 32.5 units of the enzyme phage T4 DNA ligase (Pharmacia reference 270 87002) overnight at 15° C. in a buffer of composition: 50 mM Tris-HCl pH 7.5, 10 mM MgCl2, 1 mM ATP.
h) Synthesis of the Second Strand of the cDNA
[0172]The proteins are removed by phenol extraction followed by chloroform extraction, and 1/10 volume of 10 mM ammonium acetate solution and then 2.5 volumes of ethanol are then added. The mixture is centrifuged, the pellet is dissolved in a buffer of composition 33 mM Tris-acetate pH 7.9, 62.5 mM potassium acetate, 1 mM magnesium acetate and 1 mM dithiothreitol (DTT), and the second strand of complementary DNA is synthesized in a volume of 30 μl with 30 units of the enzyme phage T4 DNA polymerase (Pharmacia reference 270718) and a mixture of 1 mM the four deoxynucleotide triphosphates dATP, dCTP, dGTP and dTTP as well as two units of phage T4 gene 32 protein (Pharmacia reference 27-0213) for one hour at 37° C. The mixture is extracted with phenol and the traces of phenol are removed with a column of polyacrylamide P10 (Biogel P10-200-400 mesh-reference 15011050-Biorad).
i) Transformation by Electroporation
[0173]E. coli MC 1061 cells are transformed with the recombinant DNA obtained above by electroporation using a Biorad Gene Pulser apparatus (Biorad) used at 2.5 kV under the conditions specified by the manufacturer, and the bacteria are then grown for one hour in the medium known as LB medium (Sambrook op. cit.) of composition: bactotryptone 10 g/l; yeast extract 5 g/l; NaCl 10 g/l.
[0174]The number of independent clones is determined by plating out a 1/1000 dilution of the transformation after the first hour of incubation on a dish of LB medium with the addition of 1.5% of agar (w/v) and 100 μg/ml of ampicillin, hereinafter referred to as LB agar medium. The number of independent clones is 1 million.
j) Analysis of the cDNAs of the Library
[0175]In the context of the analysis of individual clones of the library by nucleic acid sequencing of the 5' region of the cDNAs, one clone, designated SR-p70a, was shown to exhibit a partial homology with the cDNA of the already known protein, the p53 protein (Genbank X 02469 and X 16384) (FIG. 1). The sequences were produced with the United States Biochemical kit (reference 70770) and/or the Applied Biosystems kit (references 401434 and/or 401628), which use the method of Sanger et al., Proc. Natl. Acad. Sci. USA; 1977, 14, 5463-5467. The plasmid DNA is prepared from the WIZARD minipreparation kit (Promega reference A7510). The primers used are 16- to 22-mer oligonucleotides, complementary either to the vector pSE1 in the region immediately at the 5' end of the cDNA, or to the sequence of the cDNA.
[0176]A second cDNA was isolated from the same library by screening, in a manner similar to the technique described in EXAMPLE III.3) below, with a fragment of SR-p70a the DNA labelled with 32P with the BRL "Random Primers DNA labelling systems" kit (reference 18187-013). The hybridization and washing buffers are treated by adding 50% of formamide. The last wash is carried out in 0.1×SSC/0.1% SDS at 60° C. This second sequence (SR-p79b cDNA) is identical to the first but an internal fragment has been deleted from it (FIG. 3).
[0177]The two SR-p70 cDNAs, of length 2874 nucleotides (SR-p70a) and 2780 nucleotides (SR-p70b), correspond to the products of a single gene, an alternative splicing bringing about a deletion of 94 bases between nucleotides 1637 and 1732 and a premature termination of the corresponding encoded protein. The proteins deduced from the two cDNAs possess 637 amino acids and 499 amino acids, respectively (FIGS. 4 and 5).
EXAMPLE II
Obtaining of the Sequence and Cloning of the cDNA of the SR-p70a Protein from HT-29 (Human Colon Adenocarcinoma) Cells
1) Culturing of HT-29 Cells
[0178]The cells are cultured in McCoy's 5 medium (GIBCO 26600-023) with the addition of 10% of foetal calf serum (GIBCO 10081-23) and 50 mg/l of gentamicin, to semi-confluence.
2) Preparation of the Complementary DNA
[0179]The messenger RNA is prepared as described in EXAMPLE I.2. The cDNA is prepared in a manner similar to that described in EXAMPLE I.3, with 5 μg of total messenger RNA, using a poly(T)12 primer. The reaction is not interrupted with EDTA.
3) Specific Amplification of the Human cDNA by the So-Called PCR Technique
[0180]The polymerization is carried out with 4 μl of cDNA in 50 μl final with the buffer of the following composition: 10 mM Tris-HCl pH 8.3, 2.5 mM MgCl2, 50 mM KCl in the presence of 10% DMSO, 0.5 mM dNTP, 4 μg/ml of each of the two nucleic acid primers and 2.5 units of TAQ DNA polymerase (Boehringer). The primer pairs were selected on the basis of the nucleic acid sequence of the COS-3 SR-p70 clone, in particular upstream of the translation initiation ATG and downstream of the translation stop TGA, and are of the following compositions:
TABLE-US-00003 sense primer: ACT GGT ACC GCG AGC TGC CCT CGG AG Kpn I restriction site antisense primer: GAC TCT AGA GGT TCT GCA GGT GAC TCA G Xba I restriction site
[0181]The reaction is carried out for 30 cycles of 94° C./1 minute, 54-60° C./1 minute 30 seconds and 72° C./1 minute 30 seconds, followed by a final cycle of 72° C./6 minutes.
4) Obtaining of the Sequence of the Human cDNA
[0182]In a first step, the PCR product is removed from the oligonucleotides on a column of Sephacryl S-400, and then desalted by exclusion chromatography on a column of polyacrylamide P10 (Biorad reference 1504144). The sequencing reactions are carried out using the Applied Biosystems kit (reference 401628) with oligonucleotides specific for the cDNA. The sequence obtained is very similar to that of monkey SR-p70a, and the deduced protein contains 636 amino acids (FIG. 6).
[0183]In a similar manner, other sequences originating from human lines or tissues were obtained for the coding portion of human SR-p70, in particular from the lung or pancreas. The proteins deduced from these sequences are identical to those obtained for the HT-29 line.
5) Cloning of the Human cDNA into Plasmid pCDNA3 (Invitrogen V 790-20)
[0184]The PCR product obtained in 3) and also the plasmid are digested with the two restriction enzymes Kpn I and Xba I and then purified after migration on a 1% agarose gel using the Geneclean kit (Bio 101 reference 3105). After ligation with 100 ng of insert and 10 ng of vector and transformation (technique described in EXAMPLE I.3.g and i), the recombinant clones are verified by sequencing using the Applied Biosystems kit mentioned above.
EXAMPLE III
Cloning of Mouse SR-p70 cDNA from AtT-20 (Pituitary Tumour) Cells
1) Cell Culturing of the Line AtT-20
[0185]The cells are cultured in Ham F10 medium (GIBCO 31550-023) with the addition of 15% of horse serum (GIBCO 26050-047), 2.5% of foetal calf serum (GIBCO 10081-073) and 50 mg/l of gentamicin, to semi-confluence.
2) Preparation of the Complementary DNA Library
[0186]The library is produced as described in EXAMPLE I. 2 and 3 from the cells cultured above.
3) Screening of the Library
a) Preparation of the Membranes
[0187]The clones of the library are plated out on LB agar medium (Petri dishes 150 mm in diameter) coated with Biodyne A membranes (PALL reference BNNG 132). After one night at 37° C., the clones are transferred by contact onto fresh membranes. The latter are treated by depositing them on 3 mm Whatman paper soaked with the following solutions: 0.5 N NaOH, 1.5 M NaCl for 5 minutes, then 0.5 M Tris-HCl pH 8, 1.5 M NaCl for 5 minutes. After treatment with proteinase K in the following buffer: 10 mM Tris-HCl pH 8, 10 mM EDTA, 50 mM NaCl, 0.1% SDS, 100 μg/ml proteinase K, for one hour at room temperature, the membranes are washed copiously in 2×SSC (sodium citrate, NaCl), dried and then incubated in an oven under vacuum at 80° C. for 20 minutes.
b) Preparation of the Probe
[0188]On the basis of monkey and human SR-p70 cDNA sequences, a first sequence was produced on a fragment amplified from line AtT-20 mRNA as described in EXAMPLE II.3 and 4, with the oligomers of the following compositions:
TABLE-US-00004 sense primer: GCC ATG CCT GTC TAC AAG antisense primer: ACC AGC TGG TTG ACG GAG.
[0189]On the basis of this sequence, an oligomeric probe specific for mouse was chosen and possesses the following composition:
GAG CAT GTG ACC GAC ATT G.
[0190]100 ng of the probe are labelled at the 3' end with 10 units of terminal transferase (Pharmacia) and 100 μCi of [α-32P]dCTP 3000 Ci/mmol (Amersham reference PB 10205) in 10 μl of the following buffer: 30 mM Tris-HCl pH 7.6, 140 mM cacodylic acid, 1 mM CoCl2, 0.1 mM DTT for 15 minutes at 37° C. The radiolabelled nucleotides not incorporated are removed on a column of polyacrylamide P10 (Biorad, reference 1504144). The probe obtained has a specific activity of approximately 5×108 dpm/μg.
c) Prehybridization and Hybridization
[0191]The membranes prepared in a) are prehybridized for 30 minutes at 42° C. in 6×SSC, 5×Denhart's, 0.1% SDS, and then hybridized for a few hours in the same buffer with the addition of the probe prepared in b) in the proportion of 106 dpm/ml.
d) Washing and Exposure of the Membranes
[0192]The membranes are washed twice at room temperature in 2×SSC/0.1% SDS buffer and then for one hour at 56° C. in 6×SSC/0.1% SDS. The hybridized clones are visualized with KODAK XOMAT films. A positive clone containing the mouse SR-p70 is selected and hereinafter designated as SR-p70c.
4) Sequencing of Mouse SR p70 and Analysis of the Sequence
[0193]The sequence is obtained using the Applied Biosystem kit (reference 401628). The protein sequence deduced from mouse SR-p70c cDNA (FIG. 7) exhibits a very strong homology with the human and monkey sequences, except in the N-terminal portion which diverges strongly (see FIG. 9). Using the so-called PCR technique in a similar manner to that described in EXAMPLE II.3 and 4, a second 5' sequence (originating from the same AtT-20 library) was obtained (FIG. 8). The deduced N-terminal protein sequence (sequence designated SR-p70a) is very similar to that deduced from human and monkey SR-p70 cDNAs (SR-p70a) (FIG. 9). The line AtT-20 hence affords at least two SR-p70 transcripts. The latter 2 diverge in the N-terminal portion through different splicings.
EXAMPLE IV
[0194]1) Production of Recombinant SR-p70 Protein in E. coli
a) Construction of the Expression Plasmid
[0195]This consists in placing the COOH-terminal portion of the monkey SR-p70a protein, from the valine at position 427 to the COOH-terminal histidine at position 637, in fusion with the glutathione S-transferase (GST) of the plasmid vector pGEX-4T-3 (Pharmacia reference 27-4583). For this purpose, the corresponding insert of SR-p70a (position 1434 to 2066) was amplified by PCR with 10 ng of plasmid containing monkey SR-p70a cDNA. The nucleic acid primers are of the following composition:
TABLE-US-00005 sense primer: TTT GGA TCC GTC AAC CAG CTG GTG GGC CAG BamHI restriction site antisense primer: AAA GTC GAC GTG GAT CTC GGC CTC C. Sal I site
[0196]The fragment obtained and also the vector are digested with the restriction enzymes BamHI and Sal I and cloning is carried out as described in EXAMPLE II.5. The selected clone is referred to as pG SR-p70.
b) Expression and Purification of the GST-pSR-p70 Fusion Protein
[0197]This step was carried out using the "bulk GST purification module" kit (Pharmacia Reference 27-4570-01).
[0198]In outline, the recombinant clone was cultured at 37° C. in one litre of 2×YTA medium+100 μg/ml ampicillin. At OD 0.8, expression is induced with 0.5 mM IPTG for 2 hours at 37° C. After centrifugation, the cell pellet is taken up in cold PBS and then sonicated by ultrasound. After the addition of 1% Triton X-100, the preparation is incubated for 30 minutes with agitation at room temperature. After centrifugation at 12,000 g for 10 minutes at 4° C., the supernatant is recovered. Purification is then carried out on a glutathione-Sepharose 4B affinity chromatography column. Binding and washing are carried out in PBS buffer and elution is carried out by competition with reduced glutathione. The final concentration is brought to 300 μg/ml of fusion protein.
2) Production of SR-p70a Protein in COS-3 Cells
[0199]COS-3 cells are transfected with pSE1 plasmid DNA into which monkey SR-p70a cDNA has been cloned (EXAMPLE I.1), or with the vector pSE1 plasmid DNA as control, by the DEAE-dextran technique: the COS-3 cells are inoculated at 5×105 cells per 6 cm dish in culture medium containing 5% of foetal bovine serum (EXAMPLE I.1). After culture, the cells are rinsed with PBS. 1 ml of the following mixture is added: medium containing 6.5 μg of DNA, 250 μg/ml of DEAE-dextran and 100 μM chloroquine. The cells are incubated at 37° C. in 5% CO2 for 4 to 5 hours. The medium is aspirated off, 2 ml of PBS containing 10% of DMSO are added and the cells are incubated for one minute, shaking the dishes gently. The medium is aspirated off again and the cells are rinsed twice with PBS. The cells are then incubated at 37° C. with medium containing 2% of foetal bovine serum for the period during which expression takes place, which is generally 3 days.
[0200]The SR-p70a protein is then analysed as described in EXAMPLE IV by immunoblotting.
EXAMPLE V
Preparation of Specific Antibodies
[0201]150 μg of proteins of the sample prepared according to EXAMPLE IV were used to immunize a rabbit (New Zealand male weighing 1.5 to 2 kg approximately). The immunizations were performed every 15 days according to the protocol described by Vaitukaitis, Methods in Enzymology, 1981, 73, 46. At the first injection, one volume of antigenic solution is emulsified with one volume of Freund's complete adjuvant (Sigma reference 4258). Five boosters were administered in Freund's incomplete adjuvant (Sigma reference 5506).
EXAMPLE VI
Detection of the SR-p70 Protein: Western Immunoblotting
1) Materials Used for Immunoblotting
a) Cell Lines Used for Immunoblotting
[0202]The following cell lines were cultured as described in the catalogue "Catalogue of cell lines and hybridomas, 7th edition, 1992" of the ATCC (American Type Culture Collection): COS-3, CV-1 (monkey kidney cell line), HT-29, U-373MG (human glioblastoma), MCF7 (human mammary adenocarcinoma), SKNAS (human neuroblastoma cultured under the same conditions as COS-3), SK-N-MC (human neuroblastoma), IMR-32 (human neuroblastoma), CHP212 (human neuroblastoma cultured under the same conditions as CV-1), Saos-2 (osteosarcoma), SK-OV-3 (ovarian adenocarcinoma) and SW 480 (human colon adenocarcinoma).
b) COS-3 Cells Transfected by SR-p70a cDNA
[0203]COS-3 cells were transfected as described in EXAMPLE IV.2. As a control, the cells were transfected with pSE1 plasmid DNA not containing recombinant SR-p70a cDNA.
2) Preparation of Protein Samples from a Eukaryotic Cell Culture or From Transfected Cells
[0204]After culture, the cells are washed with PBS and then taken up in RIPA buffer (PBS with 1% NP40, 0.5% sodium deoxycholate, 0.5% SDS) supplemented with 10 μg/ml RNAse A, 20 μg/ml DNAse 1, 2 μg/ml aprotinin, 0.5 μg/ml leupeptin, 0.7 μg/ml pepstatin and 170 μg/ml PMSF. The cells are sonicated by ultrasound at 4° C. and left for 30 minutes at 4° C. After microcentrifugation at 12,000 rpm, the supernatant is recovered. The protein concentration is measured by the Bradford method.
3) Western Blotting
[0205]5 or 50 μg of proteins (50 μg for the cell lines and 5 μg for transfected cells) are placed in 0.2 volume of the following 6×electrophoresis buffer: 0.35 mM Tris-HCl pH 6.8, 10.3% SDS, 36% glycerol, 0.6 mM DTT, 0.012% bromophenol blue. The samples are applied and run in a 10% SDS-PAGE gel (30:0.8 Bis) and then electrotransferred onto a nitrocellulose membrane.
4) Visualization with the Antibody
[0206]The membrane is incubated for 30 minutes in TBST blocking buffer (10 mM Tris-HCl pH 8, 150 mM NaCl, 0.2% Tween 20) with the addition of 5% of milk (GIBCO--SKIM MILK) at room temperature. The membrane is brought into contact successively with the anti-SR-p70 (aSR-p70) antibody in the same buffer for 16 hours at 4° C., washed 3 times for 10 minutes with TBST and then incubated for one hour at 37° C. with a second, anti-rabbit immunoglobulin antibody coupled to peroxidase (SIGMA A055). After three washes of 15 minutes, the visualization is performed using the ECL kit (Amersham RPN2106) by chemiluminescence.
[0207]In parallel, the same samples were subjected to visualization with an anti-p53 (αp53) antibody (Sigma BP5312) followed by a second, anti-mouse immunoglobulin antibody.
5) Figures and Results
[0208]FIG. 10: Immunoblot of the SR-p70 proteinFIG. 10a: Detection of the recombinant SR-p70 protein [0209]columns 1 and 3: COS-3 transfected by the vector pSE1. [0210]columns 2 and 4: COS-3 transfected by plasmid pSE1 containing SR-p70a cDNA. [0211]columns 1 and 2: visualization with the anti-SR-p70 (αSR-p70) antibody. [0212]columns 3 and 4: visualization with the anti-p53 (αp53) antibody.FIG. 10b: Detection of the endogenous SR-p70 protein [0213]columns 1: COS-3; 2: CV-1; 3: HT-29; 4: U-373 MG; 5: MCF7; 6: SKNAS; 7: SK-N-MC; 8: IMR-32; 9: CHP212; 10: Saos-2; 11: SK-OV-3 and 12: SW480.A: Visualization with the αSR-p70 antibodyB: Visualization with the αp53 antibody.
[0214]The αSR-p70 antibody specifically recognizes the recombinant proteins (FIG. 10a) and endogenous proteins (FIG. 10b) and does not cross with p53. The analysis of human or monkey cell lines shows the SR-p70 protein, like p53, is generally weakly detectable. In contrast, when an accumulation of p53 exists, SR-p70 becomes, for its part also, more readily detectable (FIG. 10b). A study by RT-PCR of the distribution of SR-p70 transcripts shows that the gene is expressed in all the cell types tested.
EXAMPLE VII
Cloning of the SR-p70 Gene and Chromosomal Localization
1) Cloning of SR-p70 Gene
[0215]The library used is a cosmid library prepared with purified human genomic DNA from placenta and marketed by Stratagene (reference 95 1202).
[0216]Screening of the gene is carried out as described in the EXAMPLE III.3, with an SR-p70 DNA fragment labelled with 32P with the BRL "Random Primers DNA Labelling Systems" kit (reference 18187-013). The hybridization and washing buffers are treated by adding 50% of formaldehyde. The last wash is carried out in 0.1×SSC/0.1% SDS at 60° C. In a similar manner, the SR-p70 gene was isolated from a library prepared with C57 black mouse genomic DNA.
[0217]An analysis and a partial sequencing of the clones demonstrate the presence of 14 exons with a structure close to that of the p53 gene, in particular in the central portion where the size and positioning of the exons are highly conserved (FIG. 12). This structure was partially defined in mouse and in man.
[0218]As an example, the human genomic sequences of the 3' region of intron 1, of exon 2, of intron 3 and of the 5' region of exon 3 are presented in FIG. 13.
2) Chromosomal Localization of the SR-p70 Gene in Man
[0219]This was carried out with human SR-70 gene DNA using the technique described by R. Slim et al., Hum. Genet., 1991, 88, 21-26. Fifty mitoses were analysed, more than 80% of which had double spots localized at 1p36 on both chromosomes and more especially at 1p36.2-1p36.3 (FIG. 11). The identification of chromosome 1 and its orientation are based on the heterochromatin of the secondary constriction. The pictures were produced on a Zeiss Axiophot microscope, taken with a LHESA cooled CCD camera and treated with Optilab.
EXAMPLE VIII
[0220]A) Demonstration of an mRNA Coding for a Deduced Human SR-p70 Protein Possessing both a Shorter N-Terminal End and a Divergence.
1) Culturing of IMR-32 (Human Neuroblastoma) Cells
[0221]The cells were cultured as described in the catalogue "Catalogue of cell lines and hybridomas, 7th edition, 1992" of the ATCC (American Type Culture Collection).
2) Preparation of the cDNA
[0222]The RNA is prepared as described in Example I.2.a. The cDNA is prepared in a manner similar to that described in Example I.3, with 5 μg total RNA in a final volume of 20 μl using a poly(T)12 primer and with cold nucleotides. The reaction is not interrupted with EDTA.
3) Specific Amplification of SR-p70 cDNA by the So-Called PCR Technique
[0223]The polymerization is carried out with 2 μl of cDNA in 50 μl final with the buffer of the following composition: 50 mM Tris-HCl pH 9.2, 16 mM (NH4)2SO4, 1.75 mM MgCl2, in the presence of 10% DMSO, 0.4 mM NTP, 100 ng of each of the two nucleic acid primers and 3.5 units of the mixture of TAQ and PWO polymerases (Boehringer Mannheim, ref. 1681 842).
[0224]The primer pair is of the following composition:
TABLE-US-00006 sense primer: AGGCCGGCGTGGGGAAG (position 16 to 32, FIG. 6) antisense primer: CTTGGCGATCTGGCAGTAG (position 503 to 485, FIG. 6).
[0225]The reaction is carried out for 30 cycles at 95° C./30 seconds, 58° C./1 minute and 68° C./2 minutes 30 seconds, followed by a final cycle of 68° C./10 minutes.
[0226]The PCR product is subjected to electrophoresis on a 1% agarose gel (TAE buffer). After ethidium bromide staining, two major bands are revealed: a band approximately 490 by in size (expected size (see FIG. 6)) and an additional band approximately 700 by in size. The latter is extracted from the gel using the "Geneclean" kit (Bio 101, ref 1001 400). After a desalting on a column of polyacrylamide P10 (Biorad, ref 15011050), the fragment is subjected to a further PCR amplification for 10 cycles as described above.
4) Determination of the Sequence of the Amplified Product
[0227]In a first step, the PCR product is removed from the oligonucleotides on a column of Sephacryl S-400 (Pharmacia 17-0609-01) and then desalted on a column of P10. The sequencing reaction is carried out using the Applied Biosystems kit (ref. 401 628) (373 DNA sequencer) with the antisense primer.
[0228]The sequence obtained is identical to the SR-p70 cDNA sequence (Example II.4) with an insertion of 198 by between positions 217 and 218 (FIG. 14). The deduced N-terminal protein sequence (sequence designated SR-p70d) is 49 amino acids shorter, with a divergence of the first 13 amino acids (sequence ID No. 13). There is hence coexistence of at least two different SR-p70 transcripts as already described for the mouse AtT-20 line.
B) Cloning of Human SR-P70 and Demonstration of an mRNA Coding for a Deduced Human SR-p70 Protein Possessing the Same N-Terminal End as SR-p70d and a Divergence in the C-Terminal Portion1) Specific Amplification of SR-p70 cDNA by the So-Called PCR Technique
[0229]The amplification was carried out as described in EXAMPLE VIII.A from purified RNA of IMR-32 cells with the primer pair of the following composition:
TABLE-US-00007 sense primer: GCG GCC ACG ACC GTG AC (position 160 to 176, sequence ID No. 11) antisense primer: GGC AGC TTG GGT CTC TGG (position 1933 to 1976, FIG. 6).
[0230]After removal of the excess primers on an S400 column and desalting on a P10 column, 1 μl of the sample is subjected again to a PCR with the primer pair of the following composition:
TABLE-US-00008 sense primer: TAT CTC GAG CTG TAC GTC GGT GAC CCC XhoI (position 263 to 280, sequence ID No. 11) antisense primer: ATA TCT AGA TCA GTG GAT CTC GGC CTC XbaI (position 1943 to 1926, FIG. 6).
2) Cloning of the Amplified Product into Plasmid pCDNA3
[0231]The PCR product obtained in 1) is desalted on a P10 column, digested with the restriction enzymes XhoI and XbaI and then cloned into plasmid pCDNA3 as described in EXAMPLE II.5. Two recombinant clones are sequenced using the Applied Biosystems kit with the oligonu-cleotides specific for SR-p70 cDNA.
[0232]The first sequence obtained corresponds to the complete sequence of the mRNA coding for SR-p70 described in EXAMPLE VIII.a. The deduced protein contains 587 amino acids (sequence ID No. 13 and FIG. 16).
[0233]The second sequence obtained is identical to the SR-p70d cDNA sequence described above, but with two deletions, of 149 by and of 94 by between positions 1049 and 1050 on the one hand, and between positions 1188 and 1189 on the other hand (sequence ID No. 14 and FIG. 15). The protein sequence deduced from this second sequence reveals a protein having an N-terminal portion 49 amino acids shorter, with a divergence in the first 13 amino acids as well as a divergence of protein sequence between amino acids 350 and 397 (sequence ID No. 15 and FIG. 16) (sequence designated SR-p70e). The deduced protein contains 506 amino acids.
C) Demonstration of an mRNA Coding for a Deduced Human SR-p70 Protein Possessing a Shorter N-Terminal End
1) Culturing of SK-N-SH (Human Neuroblastoma) Cells
[0234]The cells are cultivated as described in the "Catalogue of cell lines and hybridomas, 7th edition, 1992" of the ATCC (American Type Culture Collection).
[0235]2) Preparation of the cDNA and Amplification of SR p70 cDNA by the So-Called PCR Technique
[0236]These steps are carried out as described in EXAMPLE VIKA with the primer pair of the following composition:
TABLE-US-00009 sense primer: AGG GGA CGC AGC GAA ACC (position 128 to 145, FIG. 17) antisense primer: GGC AGC TTG GGT CTC TGG (position 1993 to 1976, FIG. 6).
[0237]The sequencing is carried out with the Applied Biosystem kit with primers specific for SR-p70 cDNA, and reveals two cDNAs: [0238]a first cDNA corresponding to the mRNA coding for SR-p70a [0239]a second cDNA having a deletion of 98 by between positions 24 and 25 (sequence ID No. 16 and FIG. 15).
[0240]This deletion comprises the translation initiation ATG of SR-p70a. The protein deduced (designated SR-p70f) from this second cDNA possesses a translation initiation ATG downstream corresponding to an internal ATG of SR-p70a. The deduced protein hence contains 588 amino acids (sequence ID No. 17 and FIG. 16) and is truncated with respect to the 48 N-terminal amino acids of SR-p70a.
D) Demonstration of an mRNA Coding for Human SR-p70b
1) Culturing of K562 Cells
[0241]The cells are cultured as described in the "Catalogue of cell lines and hybridomas, 7th edition, 1992" of ATCC (American Type Culture Collection).
2) Preparation of the cDNA, Amplification of SR-p70 cDNA by the So-Called PCR Technique and Sequencing
[0242]These steps are carried out as described in EXAMPLE VIII.C.
[0243]The sequencing reveals two cDNAs:
A first cDNA corresponding to the mRNA coding for SR-p70a, and a second cDNA having a deletion of 94 by between positions 1516 and 1517 (sequence ID No. 18 and FIG. 15). The deduced protein (designated SR-p70b) contains 199 amino acids and possesses a C-terminal sequence truncated by 137 amino acids relative to SR-p70a, with the last 4 amino acids divergent (sequence ID No. 19 and FIG. 21).
[0244]This cDNA is similar to the one described in EXAMPLE I relating to monkey SR-p70b.
[0245]The molecules described in this example (EXAMPLE VIII.A, B, C and D) reveal SR-p70 variants which are the outcome of differential splicings of the primary mRNA, transcribed by the SR-p70 gene.
[0246]The SR-p70a is encoded by an mRNA composed of 14 exons (see EXAMPLE VII). This is the reference protein. SR-p70b is the outcome of an insertion between exons 3 and 4 and of the absence of exons 11 and 13. SR-p70f is the outcome of the absence of exon 2. This example describes the existance of SR-p70 variants non-exhaustively, with a strong probability of existence of other variants. Similarly, the existence of these variants described in this example, as well as SR-p70a, is not limited to the lines in which they have been demonstrated. In effect, studies performed by RT-PCR showed that these variants are to be found in the various lines studied.
[0247]Furthermore, the initiation methionine of SR-p70f corresponds to an internal methionine of SR-p70a, suggesting the possibility of initiation downstream on the mRNA coding for SR-p70a.
EXAMPLE IX
Obtaining a 5' Sequence of Human SR-p70a mRNA
[0248]1) Amplification of the 5' end of SR-p70 cDNA by PCR
[0249]The cell culturing and the preparations of total RNA and of cDNA are carried out as described in EXAMPLE VIII.1 and 2. The RNA template is hydrolysed by incubation for 5 minutes at 65° C. after the addition of 4 μl of 500 mM EDTA and 4 μl of 2 N NaOH. The sample is then desalted on a P10 column. The cDNA is elongated at the 3' end with a dG tail as described in EXAMPLE I.3.d, in a final volume of 40 μl. After the addition of 4 μl of 500 mM EDTA and 4 μl of 2 N NaOH, the cDNA is incubated at 65° C. for 3 minutes and then desalted on a P10 column. PCR amplification is carried out as described in EXAMPLE VIII.3 with 8 μl of cDNA and for 30 cycles with the primer pair of the following composition:
TABLE-US-00010 sense primer: C C C C C C C C C C C C C C N (where N equals G, A or T) antisense primer: CCATCAGCTCCAGGCTCTC (position 1167 to 1149, FIG. 6).
[0250]After removal of the excess primers on an S-400 column and desalting on a P10 column, 1 μl of the sample is subjected again to a PCR with the pair of the following composition:
TABLE-US-00011 sense primer: C C C C C C C C C C C C C C N antisense primer: CCAGGACAGGCGCAGATG (position 928 to 911, FIG. 6).
[0251]The sample, passed again through an S-400 column and a P10 column, is subjected to a third amplification for 20 cycles with the following pair:
TABLE-US-00012 sense primer: C C C C C C C C C C C C C C C N antisense primer: CTTGGCGATCTGGCAGTAG (position 503 to 485, FIG. 6).
2) Determination of the SR-p70 cDNA 5' Sequence
[0252]The sequence is produced as described in EXAMPLE VIII.4. This sequence reveals a non-coding 5' region of at least 237 bases upstream of the initiation ATG of SR-p70a (FIG. 17). By comparison of this sequence (obtained from the line IMR-32) with the one obtained from the line HT-29 in particular (FIG. 6), two point differences (FIG. 17: see bold characters) are revealed (G→A and C→T), positioned, respectively, at -20 and -30 from the initiation ATG of SR-p70a (FIGS. 6 and 17). This variability is located in exon 2 (FIG. 13). It is not ruled out that this variability is also to be found within a coding frame as the outcome of an alternative splicing as described in EXAMPLES III in mouse and VIII in man, or alternatively as the outcome of a translation initiation on a CTG (as has been demonstrated for FGFb (Proc. Natl. Acad. Sci USA, 1989, 86, 1836-1840)).
[0253]Similarly, it is not ruled out that this variability has a repercussion on the translation of SR-p70 or on the splicing of the primary RNA.
[0254]At all events, this variability, probably of allelic origin, may serve as a marker, either at genomic level (see EXAMPLE XI) or at mRNA level (see EXAMPLE X).
EXAMPLE X
1) Analysis by PCR of the Transcriptional Expression of SR-p70a in Cell Samples (RT-PCR)
[0255]Cell culturing (SK-N-AS, SK-N-MC, HT-29, U-373MG, SW480, IMR-32, CHP212) is carried out as described in Example VI.1.a (referred to the catalogue "Catalogue of cell lines and hybridomas, 7th edition 1992" of the ATCC).
[0256]The preparation of the cDNA and the PCR amplification are carried out as described in EXAMPLE VIII.2 and 3. The primer pair used is of the following composition:
TABLE-US-00013 sense primer: AGGGGACGCAGCGAAACC (position 128 to 145, FIG. 17) antisense primer: GGCAGCTTGGGTCTCTGG (position 1993 to 1976, FIG. 6).
[0257]The samples are analysed by electrophoresis on a 1% agarose gel and visualization with ethidium bromide (FIG. 18).
[0258]The size of the band obtained in the samples corresponds to the expected size (approximately 2 kb, FIGS. 6 and 17). The intensity of the bands obtained is reproducible. A reamplification of 1 μl of the sample under the same conditions for 20 cycles reveals a band in each of the samples.
[0259]2) Determination of the Sequence of the Amplified Products
[0260]After passage of the samples through S-400 and P10 columns, sequencing is carried out on an Applied Biosystems sequencer 373 with the reference kit 401 628. The primers used are, inter alia, the following:
TABLE-US-00014 position Figure AGGGGACGCAGCGAAACC 128 to 145 22 CTTGGCGATCTGGCAGTAG 503 to 485 6 GATGAGGTGGCTGGCTGGA 677 to 659 6 CCATCAGCTCCAGGCTCTC 1167 to 1149 6 TGGTCAGGTTCTGCAGGTG 1605 to 1587 6 GGCAGCTTGGGTCTCTGG 1993 to 1976 6
[0261]No protein difference in the SR-p70a was detected. However, sequences obtained reveal a double variability at positions -20 and -30 upstream of the initiation ATG of SR-p70a (FIGS. 6 and 17). This variability, probably of allelic origin, enables two classes of transcripts to be defined: a first class possessing a G at position -30 and a C at position -20 (class G.sup.+/G-20) and a second class possessing a difference at two positions: an A at -30 and a T at -20 (class)A-30/T-20 .
First class: SK-N-AS, SK-N-MC, HT-29, U-373MG, SW480.Second class: IMR-32, CHP212.
EXAMPLE XI
Analytical Method of Determination of the Allelic Distribution of the SR-p70 Gene in a Population of 10 Persons
[0262]This allelic distribution is based on the allelic variability demonstrated in EXAMPLES IX and X: [0263]G-30/C-20 allele possessing, respectively, a G and a C at positions -30 and -20 upstream of the initiation ATG of SR-p70a. [0264]A-30/T-20 allele possessing, respectively, an A and a T at the same positions. This variability may be demonstrated by the use of restriction enzymes that differentiate the two alleles (FIG. 13). As an example: [0265]Enzyme Bpl I having a cleavage site only on the G-30/C-20 allele in the zone of interest (this site encompasses both variable positions). [0266]Enzyme StyI having a cleavage site only on the A-30/T-20 allele in the zone of interest.
[0267]1) Genomic Amplification of Exon 2 by PCR
[0268]The polymerization reaction is carried out with 500 ng of purified genomic DNA, in 50 μl final with the conditions described in Example VIII.3.
[0269]The primer pair is of the following position:
TABLE-US-00015 Sense primer: CACCTACTCCAGGGATGC (position 1 to 18, FIG. 13) Antisense primer: AGGAAAATAGAAGCGTCAGTC (position 833 to 813, FIG. 13).
The reaction is carried out for 30 cycles as described in EXAMPLE VIII.3.
[0270]After removal of the excess primer on an S-400 column and desalting on a P10 column, 1 μl of the sample is amplified again for 25 cycles under the same conditions with the following primer pair:
TABLE-US-00016 Sense primer: CAGGCCCACTTGCCTGCC (position 25 to 32, FIG. 13) Antisense primer: CTGTCCCCAAGCTGATGAG (position 506 to 488, FIG. 13).
[0271]The amplified products are subjected to electrophoresis on a 1% agarose gel (FIG. 19-A).
2) Digestion with the Restriction Enzyme StyI
[0272]The samples are desalted beforehand on a P10 column and then digested with the restriction enzyme StyI (BRL 15442-015) in the buffer of the following composition: 50 mM Tris-HCl pH 8, 100 mM NaCl, 10 mM MgCl2, at 37° C. for 30 mM. The digestion products are analysed by electrophoresis on a 1% agarose gel (TAE buffer). Visualization is carried out by ethidium bromide staining (FIG. 19-B).
[0273]A band of 482 base pairs characterizes the G-30/C-20 allele (FIGS. 13 and 19). The presence of a band of 376 base pairs and a band of 106 base pairs characterize the A-30/T-20 allele (allele possessing a StyI cleavage site).
[0274]On the population of 10 persons, 2 persons exhibit the G-30/C-20 and A-30/T-20 alleles, the other 8 persons being homozygous with the G-30/C-20 allele. The study of a fresh population of 9 persons demonstrated 3 heterozygous persons exhibiting the G-30/C-20 and A-30/T-20 alleles, the other 6 persons being homozygous for the G-30/C-20 allele.
EXAMPLE XII
Test of reversion of transformation of the line SK-N-AS by transfection with SR-p70 cDNA
[0275]The expression vector used is described in EXAMPLE II.5 and shown diagrammatically in FIG. 15. The method used is the so-called calcium phosphate method described by Graham et al. (Virology 1973, 54, 2, 536-539). The line is inoculated in the proportion of 5×105 cells per dish 6 cm in diameter in 5 ml of the medium described in Example I.1. The cells are cultured at 37° C. and with 5% CO2 overnight. The transfection medium is prepared in the following manner: the following mixture is prepared by adding, in order, 1 ml of HEBS buffer (8 mg/ml NaCl, 370 μg/ml KCl, 125 μg/ml Na2HPO4.2H2O, 1 mg/ml dextrose, 5 mg/ml Hepes pH 7.05), 10 μg of the plasmid to be transfected and 50 μl of 2.5 M CaCl2 added dropwise. The transfection medium is left for 30 min at room temperature and then added dropwise to the medium contained in the culture dish. The cells are incubated for 5 to 6 hours at 37° C./5% CO2. After the medium is aspirated off, 5 ml of fresh medium containing 2% of foetal bovine serum are added. After 48 hours at 37° C./5% CO2, the cells are rinsed with PBS, detached by trypsinization, diluted in 10 ml of culture medium (5% foetal bovine serum) and plated out in a dish 10 cm in diameter (the dilution may be adjusted in accordance with the efficiency of transfection). After a further incubation for 10 hours (the time for the cells to adhere), the cells are subjected to selection by adding G418 at a final concentration of 600 μg/ml Geneticin equivalent for 15 to 21 days (the medium is changed every day). The clones obtained are then rinsed with PBS, fixed in 70% ethanol, dried, stained with 1% crystal violet and then counted.
[0276]Four plasmid transfections were carried out in duplicate: [0277]plasmid pCDNA3 without insert [0278]plasmid pCDNA3/SR-p70 containing human SR-p70a cDNA [0279]plasmid pCDNA3/SR-p70 Mut containing SR-p70a cDNA possessing a mutation at position 293 AA (R→H) which is analogous to the mutation 273 (R→H) in the DNA-binding domain of p53 [0280]control without plasmid.
[0281]The result is expressed as the number of clones per dish.
TABLE-US-00017 Experiment 1 Experiment 2 Mean pCDNA3 172 353 262 pCDNA3/SR-p70 13 8 10 pCDNA3/SR-p70 Mut 92 87 89 Absence of plasmid 1 3 2
[0282]The number of clones obtained by transfection with plasmid pCDNA3/SR-p70 is 25-fold less than the number of clones obtained with the control pCDNA3 and 9-fold less than the number of clones obtained with pCDNA3/SR-p70 Mut, indicating a mortality or an arrest of cell division of the cells transfected with SR-p70 cDNA. This result is not the consequence of a toxicity in view of the clones obtained with the mutated SR-p70 cDNA, but probably of an apoptosis as has been demonstrated for the p53 protein (Koshland et al., Sciences, 1993, 262, 1953-1981).
EXAMPLE XIII
Biological Role of the SR-p70 Protein
[0283]The structural homology between the DNA-binding domain of p53 and the central region of the SR-p70 protein enables it to be inferred that SR-p70 is a transcription factor (see FIGS. 1 and 2). In effect, p53 (393 amino acids) consists of several functional domains. The N-terminal region (1-91 amino acids) is involved in the activation of transcription, and contains sites for interaction with different cellular and viral proteins. The central portion (amino acids 92 to 292) permits binding to the specific DNA sequences located in the promoter regions of certain genes (the majority of point mutations that inactivate p53 are localized in this region), and also possesses numerous sites for interaction with viral proteins which inhibit its activity. Finally, the last 100 amino acids of p53 are responsible for its oligomerization as well as for the regulation of the latter (Hainaut P., Current Opinion in Oncology, 1995, 7, 76-82; Prokocimer M., Blood, 1994, 84 No. 8, 2391-2411).
[0284]The sequence homology between p53 and SR-p70 is significant, in particular as regards the amino acids involved directly in the interaction with DNA, suggesting that SR-p70 binds to the p53 sites on DNA. These amino acids correspond very exactly to what are referred to as the "hot spots", amino acids frequently mutated in human tumours (SWISS PROT: SW: P53_human and Prokocimer M., Blood, 1994, 84 No. 8, 2391-2411). From this homology, it may be deduced that the SR-p70 protein exerts a control over the activity of the genes regulated by p53, either independently of the latter or by forming heterooligomers with it.
[0285]Consequently, like p53, the products of the SR-p70 gene must be involved in the control and regulation of the cell cycle, causing the cycle to stop (momentarily or permanently), and the implementation of programmes such as DNA repair, differentiation or cell death. The likelihood of the existence of "p53-like" activities had been strongly felt with the demonstration in p53.sup.-/- mice of activities of DNA repair and cell death in response to ionizing radiations (Strasser et al., Cell, 1994, 79, 329-339). The authors of the present invention have localized the human SR-p70 gene in the telomeric region of the short arm of chromosome 1, precisely at 1p36.2-36.3, the smallest deleted region (SRO) common to a majority of neuroblastomas and of other types of tumours (melanomas and sarcomas) (White et al., PNAS, 1995, 92, 5520-5524). This region of loss of heterozygosity (LOH) defines the locus of a tumour-suppressing gene whose loss of activity is considered to be the cause of tumour formation. It is important to recall that this region is also subject to "maternal imprinting"; the maternal allele is preferentially lost in neuroblastomas having the 1p36 deletion (without amplification of N-Myc) (Caron et al., Hum. Mol. Gen., 1995, 4, 535-539). The wide-type SR-p70 gene introduced into neuroblastoma cells and expressed therein permits the reversion of their transformation. The loss of this anti-oncogenic activity is hence associated with the development of the tumour. The 1p36 region possesses a syngeneic homology with the distal segment of the mouse chromosome 4. In this region, the curly tail (ct) gene (Beier et al., Mammalian Genome, 1995, 6, 269-272) involved in congenital malformations of the neural tube (NTM: spina bifida, anencephaly, etc). The ct mouse is the best animal model for studying these malformations. It is accepted that these malformations result from abnormalities of cell proliferation. Bearing in mind the nature of the SR-p70 gene and its chromosomal localization, one of the hypotheses is that SR-p70 could be the human homologue of ct and that, on this basis, the detection of early mutations and chromosomal abnormalities affecting this gene should permit, for example, as an application, the identification of persons at risk (0.5-1% of newborn babies affected by NTM) and the implementation of preventive treatments (Neumann et al., Nature Genetics, 1994, 6, 357-362; Di Vinci et al., Int. J. Cancer, 1994, 59, 422-426; Moll et al., PNAS, 1995, 92, 4407-4411; Chen et al., Development, 1995, 121, 681-691).
EXAMPLE XIV
Allelic Study of the SR-p70 Gene
[0286]The GC and AT alleles are readily identified by StyI restriction of the PCR products of exon 2 (see Example XI). Hence it was possible to determine in this way, in GC/AT heterozygous individuals bearing neuroblastoma tumours, the lost SR-p70 allele (GC or AT), in spite of the presence of contaminating healthy tissue.
[0287]Surprisingly, when the same analysis is carried out on the RNA, a single allele is demonstrated independently of the presence or otherwise of a deletion and, still more surprisingly, in spite of the presence of healthy tissue. This suggests that the imprint (differential expression of the two alleles) would also exist in the contaminating tissue.
[0288]In order to verify this, the same analysis was repeated on the RNA originating from blood cells of healthy GC/AT heterozygous individuals. Only one of the two types of transcript was detected also in these cells. This result confirms the observation made on the tumour samples regarding the existence of a generalized genetic imprint for the SR-p70 gene.
[0289]The implications of this discovery are important, since it enables it to be postulated that a single sporadic mutation inactivating the active SR-p70 allele will give rise to a loss of activity, this potentially occurring in all the tissues.
[0290]The absence of precise data on the biological function of SR-p70 does not enable the consequences of this loss of SR-p70 activity for the cell to be measured. Nevertheless, its strong homology with the p53 tumour-suppressing protein, as well as the demonstration that SR-p70 is a transcription factor capable of utilizing the P21waf promoter, suggests a role of this protein in the control of the cell cycle and in differentiation.
[0291]Knudson and Meadows, 1980 (New Eng. J. Med. 302: 1254-56), consider the IV-S neuroblastomas to be a collection of non-malignant cells from the neural crest carrying a mutation which interferes with their normal differentiation.
[0292]It is conceivable that the loss of SR-p70 activity, like the loss of p53 control over the cell cycle, favours the appearance of cellular abnormalities such as aneuploidy, amplification (described in the case of neuroblastomas) and other genetic reorganizations capable of causing cell transformation (Livingstone et al., 1992, Cell 71:923-25; Yin et al. 1992, Cell 72:937-48; Cross et al. 1995, Science 267:1353-56; Fukasawa et al. 1996, Science 271:1744-47). Neuroblastomas might hence arise originally from a temporary or permanent loss of activity of SR-p70, thereby favouring the occurrence of oncogenic events and hence tumour progression.
[0293]In the case of the 1p36 constitutional deletion described by Biegel et al., 1993 (Am. J. Hum. Genet. 52:176-82), IV-S neuroblastoma does indeed occur and the gene affected is NBS-1 (SR-p70).
[0294]In conclusion, what is described for neuroblastomas might also apply to other types of tumours, in particular those associated with reorganization of the end of the short arm of chromosome 1 (Report 2 international workshop on human chr 1 mapping 1995, Cytogenetics and Cell Genet. 72:113-154). From a therapeutic standpoint, the involvement of SR-p70 in the occurrence of tumours should lead to the avoidance of the use of mutagenic agents in chemotherapy, bearing in mind the risks of cell transformation by these products, and to the use, in preference to these products, of non-mutagenic substances which stimulate differentiation.
[0295]Moreover, the frequency of occurrence of the GC and AT alleles is as follows: in the population, Frequency(AT)=0.15, and on a sample of 25 (neuroblastoma) patients, F(AT)=0.30. These statistics indicate that the AT allele could be a predisposing factor.
Sequence CWU
1
5512874DNACebus apellaCDS(156)..(2066) 1tgcctccccg cccgcgcacc cgccccgagg
cctgtgctcc tgcgaagggg acgcagcgaa 60gccggggccc gcgccaggcc ggccgggacg
gacgccgatg cccggagctg cgacggctgc 120agagcgagct gccctcggag gccggtgtga
ggaag atg gcc cag tcc acc acc 173
Met Ala Gln Ser Thr Thr 1
5acc tcc ccc gat ggg ggc acc acg ttt gag cac ctc tgg agc tct ctg
221Thr Ser Pro Asp Gly Gly Thr Thr Phe Glu His Leu Trp Ser Ser Leu
10 15 20gaa cca gac agc acc tac
ttc gac ctt ccc cag tca agc cgg ggg aat 269Glu Pro Asp Ser Thr Tyr
Phe Asp Leu Pro Gln Ser Ser Arg Gly Asn 25 30
35aat gag gtg gtg ggt ggc acg gat tcc agc atg gac gtc ttc
cac cta 317Asn Glu Val Val Gly Gly Thr Asp Ser Ser Met Asp Val Phe
His Leu 40 45 50gag ggc atg acc aca
tct gtc atg gcc cag ttc aat ttg ctg agc agc 365Glu Gly Met Thr Thr
Ser Val Met Ala Gln Phe Asn Leu Leu Ser Ser55 60
65 70acc atg gac cag atg agc agc cgc gct gcc
tcg gcc agc ccg tac acc 413Thr Met Asp Gln Met Ser Ser Arg Ala Ala
Ser Ala Ser Pro Tyr Thr 75 80
85ccg gag cac gcc gcc agc gtg ccc acc cat tca ccc tac gca cag ccc
461Pro Glu His Ala Ala Ser Val Pro Thr His Ser Pro Tyr Ala Gln Pro
90 95 100agc tcc acc ttc gac acc
atg tcg ccc gcg cct gtc atc ccc tcc aac 509Ser Ser Thr Phe Asp Thr
Met Ser Pro Ala Pro Val Ile Pro Ser Asn 105 110
115acc gac tat ccc gga ccc cac cac ttc gag gtc act ttc cag
cag tcc 557Thr Asp Tyr Pro Gly Pro His His Phe Glu Val Thr Phe Gln
Gln Ser 120 125 130agc acg gcc aag tca
gcc acc tgg acg tac tcc cca ctc ttg aag aaa 605Ser Thr Ala Lys Ser
Ala Thr Trp Thr Tyr Ser Pro Leu Leu Lys Lys135 140
145 150ctc tac tgc cag atc gcc aag aca tgc ccc
atc cag atc aag gtg tcc 653Leu Tyr Cys Gln Ile Ala Lys Thr Cys Pro
Ile Gln Ile Lys Val Ser 155 160
165gcc cca ccg ccc ccg ggc acc gcc atc cgg gcc atg cct gtc tac aag
701Ala Pro Pro Pro Pro Gly Thr Ala Ile Arg Ala Met Pro Val Tyr Lys
170 175 180aag gcg gag cac gtg acc
gac atc gtg aag cgc tgc ccc aac cac gag 749Lys Ala Glu His Val Thr
Asp Ile Val Lys Arg Cys Pro Asn His Glu 185 190
195ctc ggg agg gac ttc aac gaa gga cag tct gcc cca gcc agc
cac ctc 797Leu Gly Arg Asp Phe Asn Glu Gly Gln Ser Ala Pro Ala Ser
His Leu 200 205 210atc cgt gtg gaa ggc
aat aat ctc tcg cag tat gtg gac gac cct gtc 845Ile Arg Val Glu Gly
Asn Asn Leu Ser Gln Tyr Val Asp Asp Pro Val215 220
225 230acc ggc agg cag agc gtc gtg gtg ccc tat
gag cca cca cag gtg ggg 893Thr Gly Arg Gln Ser Val Val Val Pro Tyr
Glu Pro Pro Gln Val Gly 235 240
245aca gaa ttc acc acc atc ctg tac aac ttc atg tgt aac agc agc tgt
941Thr Glu Phe Thr Thr Ile Leu Tyr Asn Phe Met Cys Asn Ser Ser Cys
250 255 260gtg ggg ggc atg aac cga
cgg ccc atc ctc atc atc atc acc ctg gag 989Val Gly Gly Met Asn Arg
Arg Pro Ile Leu Ile Ile Ile Thr Leu Glu 265 270
275acg cgg gat ggg cag gtg ctg ggc cgc cgg tcc ttc gag ggc
cgc atc 1037Thr Arg Asp Gly Gln Val Leu Gly Arg Arg Ser Phe Glu Gly
Arg Ile 280 285 290tgc gcc tgt cct ggc
cgc gac cga aaa gcc gat gag gac cac tac cgg 1085Cys Ala Cys Pro Gly
Arg Asp Arg Lys Ala Asp Glu Asp His Tyr Arg295 300
305 310gag cag cag gcc ttg aat gag agc tcc gcc
aag aac ggg gct gcc agc 1133Glu Gln Gln Ala Leu Asn Glu Ser Ser Ala
Lys Asn Gly Ala Ala Ser 315 320
325aag cgc gcc ttc aag cag agt ccc cct gcc gtc ccc gcc ctg ggc ccg
1181Lys Arg Ala Phe Lys Gln Ser Pro Pro Ala Val Pro Ala Leu Gly Pro
330 335 340ggt gtg aag aag cgg cgg
cac gga gac gag gac acg tac tac ctg cag 1229Gly Val Lys Lys Arg Arg
His Gly Asp Glu Asp Thr Tyr Tyr Leu Gln 345 350
355gtg cga ggc cgc gag aac ttc gag atc ctg atg aag ctg aag
gag agc 1277Val Arg Gly Arg Glu Asn Phe Glu Ile Leu Met Lys Leu Lys
Glu Ser 360 365 370ctg gag ctg atg gag
ttg gtg ccg cag ccg ctg gta gac tcc tat cgg 1325Leu Glu Leu Met Glu
Leu Val Pro Gln Pro Leu Val Asp Ser Tyr Arg375 380
385 390cag cag cag cag ctc cta cag agg ccg agt
cac cta cag ccc cca tcc 1373Gln Gln Gln Gln Leu Leu Gln Arg Pro Ser
His Leu Gln Pro Pro Ser 395 400
405tac ggg ccg gtc ctc tcg ccc atg aac aag gtg cac ggg ggc gtg aac
1421Tyr Gly Pro Val Leu Ser Pro Met Asn Lys Val His Gly Gly Val Asn
410 415 420aag ctg ccc tcc gtc aac
cag ctg gtg ggc cag cct ccc ccg cac agc 1469Lys Leu Pro Ser Val Asn
Gln Leu Val Gly Gln Pro Pro Pro His Ser 425 430
435tcg gca gct aca ccc aac ctg gga cct gtg ggc tct ggg atg
ctc aac 1517Ser Ala Ala Thr Pro Asn Leu Gly Pro Val Gly Ser Gly Met
Leu Asn 440 445 450aac cac ggc cac gca
gtg cca gcc aac agc gag atg acc agc agc cac 1565Asn His Gly His Ala
Val Pro Ala Asn Ser Glu Met Thr Ser Ser His455 460
465 470ggc acc cag tcc atg gtc tcg ggg tcc cac
tgc act ccg cca ccc ccc 1613Gly Thr Gln Ser Met Val Ser Gly Ser His
Cys Thr Pro Pro Pro Pro 475 480
485tac cac gcc gac ccc agc ctc gtc agt ttt tta aca gga ttg ggg tgt
1661Tyr His Ala Asp Pro Ser Leu Val Ser Phe Leu Thr Gly Leu Gly Cys
490 495 500cca aac tgc atc gag tat
ttc acg tcc cag ggg tta cag agc att tac 1709Pro Asn Cys Ile Glu Tyr
Phe Thr Ser Gln Gly Leu Gln Ser Ile Tyr 505 510
515cac ctg cag aac ctg acc atc gag gac ctg ggg gcc ctg aag
atc ccc 1757His Leu Gln Asn Leu Thr Ile Glu Asp Leu Gly Ala Leu Lys
Ile Pro 520 525 530gag cag tat cgc atg
acc atc tgg cgg ggc ctg cag gac ctg aag cag 1805Glu Gln Tyr Arg Met
Thr Ile Trp Arg Gly Leu Gln Asp Leu Lys Gln535 540
545 550ggc cac gac tac ggc gcc gcc gcg cag cag
ctg ctc cgc tcc agc aac 1853Gly His Asp Tyr Gly Ala Ala Ala Gln Gln
Leu Leu Arg Ser Ser Asn 555 560
565gcg gcc gcc att tcc atc ggc ggc tcc ggg gag ctg cag cgc cag cgg
1901Ala Ala Ala Ile Ser Ile Gly Gly Ser Gly Glu Leu Gln Arg Gln Arg
570 575 580gtc atg gag gcc gtg cac
ttc cgc gtg cgc cac acc atc acc atc ccc 1949Val Met Glu Ala Val His
Phe Arg Val Arg His Thr Ile Thr Ile Pro 585 590
595aac cgc ggc ggc ccc ggc gcc ggc ccc gac gag tgg gcg gac
ttc ggc 1997Asn Arg Gly Gly Pro Gly Ala Gly Pro Asp Glu Trp Ala Asp
Phe Gly 600 605 610ttc gac ctg ccc gac
tgc aag gcc cgc aag cag ccc atc aag gag gag 2045Phe Asp Leu Pro Asp
Cys Lys Ala Arg Lys Gln Pro Ile Lys Glu Glu615 620
625 630ttc acg gag gcc gag atc cac tgaggggccg
ggcccagcca gagcctgtgc 2096Phe Thr Glu Ala Glu Ile His
635caccgcccag agacccaggc cgcctcgctc tccttcctgt gtccaaaact gcctccggag
2156gcagggcctc caggctgtgc ccggggaaag gcaaggtccg gcccatgccc cggcacctca
2216ccggccccag gagaggccca gccaccaaag ccgcctgcgg acagcctgag tcacctgcag
2276aaccttctgg agctgcccta atgctgggct tgcggggcag gggccggccc actctcagcc
2336ctgccactgc cgggcgtgct ccatggcagg cgtgggtggg gaccgcagtg tcagctccga
2396cctccaggcc tcatcctaga gactctgtca tctgccgatc aagcaaggtc cttccagagg
2456aaagaatcct cttcgctggt ggactgccaa aaagtatttt gcgacatctt ttggttctgg
2516agagtggtga gcagccaagc gactgtgtct gaaacaccgt gcattttcag ggaatgtccc
2576taacgggctg gggactctct ctgctggact tgggagtggc ctttgccccc agcacactgt
2636attctgcggg accgcctcct tcctgcccct aacaaccacc aaagtgttgc tgaaattgga
2696gaaaactggg gaaggcgcaa cccctcccag gtgcgggaag catctggtac cgcctcggcc
2756agtgcccctc agcctggcca cagtcacctc tccttgggga accctgggca gaaagggaca
2816gcctgtcctt agaggaccgg aaattgtcaa tatttgataa aatgataccc ttttctac
28742637PRTCebus apella 2Met Ala Gln Ser Thr Thr Thr Ser Pro Asp Gly Gly
Thr Thr Phe Glu1 5 10
15His Leu Trp Ser Ser Leu Glu Pro Asp Ser Thr Tyr Phe Asp Leu Pro
20 25 30Gln Ser Ser Arg Gly Asn Asn
Glu Val Val Gly Gly Thr Asp Ser Ser 35 40
45Met Asp Val Phe His Leu Glu Gly Met Thr Thr Ser Val Met Ala
Gln 50 55 60Phe Asn Leu Leu Ser Ser
Thr Met Asp Gln Met Ser Ser Arg Ala Ala65 70
75 80Ser Ala Ser Pro Tyr Thr Pro Glu His Ala Ala
Ser Val Pro Thr His 85 90
95Ser Pro Tyr Ala Gln Pro Ser Ser Thr Phe Asp Thr Met Ser Pro Ala
100 105 110Pro Val Ile Pro Ser Asn
Thr Asp Tyr Pro Gly Pro His His Phe Glu 115 120
125Val Thr Phe Gln Gln Ser Ser Thr Ala Lys Ser Ala Thr Trp
Thr Tyr 130 135 140Ser Pro Leu Leu Lys
Lys Leu Tyr Cys Gln Ile Ala Lys Thr Cys Pro145 150
155 160Ile Gln Ile Lys Val Ser Ala Pro Pro Pro
Pro Gly Thr Ala Ile Arg 165 170
175Ala Met Pro Val Tyr Lys Lys Ala Glu His Val Thr Asp Ile Val Lys
180 185 190Arg Cys Pro Asn His
Glu Leu Gly Arg Asp Phe Asn Glu Gly Gln Ser 195
200 205Ala Pro Ala Ser His Leu Ile Arg Val Glu Gly Asn
Asn Leu Ser Gln 210 215 220Tyr Val Asp
Asp Pro Val Thr Gly Arg Gln Ser Val Val Val Pro Tyr225
230 235 240Glu Pro Pro Gln Val Gly Thr
Glu Phe Thr Thr Ile Leu Tyr Asn Phe 245
250 255Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg
Arg Pro Ile Leu 260 265 270Ile
Ile Ile Thr Leu Glu Thr Arg Asp Gly Gln Val Leu Gly Arg Arg 275
280 285Ser Phe Glu Gly Arg Ile Cys Ala Cys
Pro Gly Arg Asp Arg Lys Ala 290 295
300Asp Glu Asp His Tyr Arg Glu Gln Gln Ala Leu Asn Glu Ser Ser Ala305
310 315 320Lys Asn Gly Ala
Ala Ser Lys Arg Ala Phe Lys Gln Ser Pro Pro Ala 325
330 335Val Pro Ala Leu Gly Pro Gly Val Lys Lys
Arg Arg His Gly Asp Glu 340 345
350Asp Thr Tyr Tyr Leu Gln Val Arg Gly Arg Glu Asn Phe Glu Ile Leu
355 360 365Met Lys Leu Lys Glu Ser Leu
Glu Leu Met Glu Leu Val Pro Gln Pro 370 375
380Leu Val Asp Ser Tyr Arg Gln Gln Gln Gln Leu Leu Gln Arg Pro
Ser385 390 395 400His Leu
Gln Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Met Asn Lys
405 410 415Val His Gly Gly Val Asn Lys
Leu Pro Ser Val Asn Gln Leu Val Gly 420 425
430Gln Pro Pro Pro His Ser Ser Ala Ala Thr Pro Asn Leu Gly
Pro Val 435 440 445Gly Ser Gly Met
Leu Asn Asn His Gly His Ala Val Pro Ala Asn Ser 450
455 460Glu Met Thr Ser Ser His Gly Thr Gln Ser Met Val
Ser Gly Ser His465 470 475
480Cys Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser Leu Val Ser Phe
485 490 495Leu Thr Gly Leu Gly
Cys Pro Asn Cys Ile Glu Tyr Phe Thr Ser Gln 500
505 510Gly Leu Gln Ser Ile Tyr His Leu Gln Asn Leu Thr
Ile Glu Asp Leu 515 520 525Gly Ala
Leu Lys Ile Pro Glu Gln Tyr Arg Met Thr Ile Trp Arg Gly 530
535 540Leu Gln Asp Leu Lys Gln Gly His Asp Tyr Gly
Ala Ala Ala Gln Gln545 550 555
560Leu Leu Arg Ser Ser Asn Ala Ala Ala Ile Ser Ile Gly Gly Ser Gly
565 570 575Glu Leu Gln Arg
Gln Arg Val Met Glu Ala Val His Phe Arg Val Arg 580
585 590His Thr Ile Thr Ile Pro Asn Arg Gly Gly Pro
Gly Ala Gly Pro Asp 595 600 605Glu
Trp Ala Asp Phe Gly Phe Asp Leu Pro Asp Cys Lys Ala Arg Lys 610
615 620Gln Pro Ile Lys Glu Glu Phe Thr Glu Ala
Glu Ile His625 630 63532034DNACebus
apellaCDS(156)..(1652) 3tgcctccccg cccgcgcacc cgccccgagg cctgtgctcc
tgcgaagggg acgcagcgaa 60gccggggccc gcgccaggcc ggccgggacg gacgccgatg
cccggagctg cgacggctgc 120agagcgagct gccctcggag gccggtgtga ggaag atg
gcc cag tcc acc acc 173 Met
Ala Gln Ser Thr Thr 1
5acc tcc ccc gat ggg ggc acc acg ttt gag cac ctc tgg agc tct ctg
221Thr Ser Pro Asp Gly Gly Thr Thr Phe Glu His Leu Trp Ser Ser Leu
10 15 20gaa cca gac agc acc tac ttc
gac ctt ccc cag tca agc cgg ggg aat 269Glu Pro Asp Ser Thr Tyr Phe
Asp Leu Pro Gln Ser Ser Arg Gly Asn 25 30
35aat gag gtg gtg ggt ggc acg gat tcc agc atg gac gtc ttc cac
cta 317Asn Glu Val Val Gly Gly Thr Asp Ser Ser Met Asp Val Phe His
Leu 40 45 50gag ggc atg acc aca tct
gtc atg gcc cag ttc aat ttg ctg agc agc 365Glu Gly Met Thr Thr Ser
Val Met Ala Gln Phe Asn Leu Leu Ser Ser55 60
65 70acc atg gac cag atg agc agc cgc gct gcc tcg
gcc agc ccg tac acc 413Thr Met Asp Gln Met Ser Ser Arg Ala Ala Ser
Ala Ser Pro Tyr Thr 75 80
85ccg gag cac gcc gcc agc gtg ccc acc cat tca ccc tac gca cag ccc
461Pro Glu His Ala Ala Ser Val Pro Thr His Ser Pro Tyr Ala Gln Pro
90 95 100agc tcc acc ttc gac acc
atg tcg ccc gcg cct gtc atc ccc tcc aac 509Ser Ser Thr Phe Asp Thr
Met Ser Pro Ala Pro Val Ile Pro Ser Asn 105 110
115acc gac tat ccc gga ccc cac cac ttc gag gtc act ttc cag
cag tcc 557Thr Asp Tyr Pro Gly Pro His His Phe Glu Val Thr Phe Gln
Gln Ser 120 125 130agc acg gcc aag tca
gcc acc tgg acg tac tcc cca ctc ttg aag aaa 605Ser Thr Ala Lys Ser
Ala Thr Trp Thr Tyr Ser Pro Leu Leu Lys Lys135 140
145 150ctc tac tgc cag atc gcc aag aca tgc ccc
atc cag atc aag gtg tcc 653Leu Tyr Cys Gln Ile Ala Lys Thr Cys Pro
Ile Gln Ile Lys Val Ser 155 160
165gcc cca ccg ccc ccg ggc acc gcc atc cgg gcc atg cct gtc tac aag
701Ala Pro Pro Pro Pro Gly Thr Ala Ile Arg Ala Met Pro Val Tyr Lys
170 175 180aag gcg gag cac gtg acc
gac atc gtg aag cgc tgc ccc aac cac gag 749Lys Ala Glu His Val Thr
Asp Ile Val Lys Arg Cys Pro Asn His Glu 185 190
195ctc ggg agg gac ttc aac gaa gga cag tct gcc cca gcc agc
cac ctc 797Leu Gly Arg Asp Phe Asn Glu Gly Gln Ser Ala Pro Ala Ser
His Leu 200 205 210atc cgt gtg gaa ggc
aat aat ctc tcg cag tat gtg gac gac cct gtc 845Ile Arg Val Glu Gly
Asn Asn Leu Ser Gln Tyr Val Asp Asp Pro Val215 220
225 230acc ggc agg cag agc gtc gtg gtg ccc tat
gag cca cca cag gtg ggg 893Thr Gly Arg Gln Ser Val Val Val Pro Tyr
Glu Pro Pro Gln Val Gly 235 240
245aca gaa ttc acc acc atc ctg tac aac ttc atg tgt aac agc agc tgt
941Thr Glu Phe Thr Thr Ile Leu Tyr Asn Phe Met Cys Asn Ser Ser Cys
250 255 260gtg ggg ggc atg aac cga
cgg ccc atc ctc atc atc atc acc ctg gag 989Val Gly Gly Met Asn Arg
Arg Pro Ile Leu Ile Ile Ile Thr Leu Glu 265 270
275acg cgg gat ggg cag gtg ctg ggc cgc cgg tcc ttc gag ggc
cgc atc 1037Thr Arg Asp Gly Gln Val Leu Gly Arg Arg Ser Phe Glu Gly
Arg Ile 280 285 290tgc gcc tgt cct ggc
cgc gac cga aaa gcc gat gag gac cac tac cgg 1085Cys Ala Cys Pro Gly
Arg Asp Arg Lys Ala Asp Glu Asp His Tyr Arg295 300
305 310gag cag cag gcc ttg aat gag agc tcc gcc
aag aac ggg gct gcc agc 1133Glu Gln Gln Ala Leu Asn Glu Ser Ser Ala
Lys Asn Gly Ala Ala Ser 315 320
325aag cgc gcc ttc aag cag agt ccc cct gcc gtc ccc gcc ctg ggc ccg
1181Lys Arg Ala Phe Lys Gln Ser Pro Pro Ala Val Pro Ala Leu Gly Pro
330 335 340ggt gtg aag aag cgg cgg
cac gga gac gag gac acg tac tac ctg cag 1229Gly Val Lys Lys Arg Arg
His Gly Asp Glu Asp Thr Tyr Tyr Leu Gln 345 350
355gtg cga ggc cgc gag aac ttc gag atc ctg atg aag ctg aag
gag agc 1277Val Arg Gly Arg Glu Asn Phe Glu Ile Leu Met Lys Leu Lys
Glu Ser 360 365 370ctg gag ctg atg gag
ttg gtg ccg cag ccg ctg gta gac tcc tat cgg 1325Leu Glu Leu Met Glu
Leu Val Pro Gln Pro Leu Val Asp Ser Tyr Arg375 380
385 390cag cag cag cag ctc cta cag agg ccg agt
cac cta cag ccc cca tcc 1373Gln Gln Gln Gln Leu Leu Gln Arg Pro Ser
His Leu Gln Pro Pro Ser 395 400
405tac ggg ccg gtc ctc tcg ccc atg aac aag gtg cac ggg ggc gtg aac
1421Tyr Gly Pro Val Leu Ser Pro Met Asn Lys Val His Gly Gly Val Asn
410 415 420aag ctg ccc tcc gtc aac
cag ctg gtg ggc cag cct ccc ccg cac agc 1469Lys Leu Pro Ser Val Asn
Gln Leu Val Gly Gln Pro Pro Pro His Ser 425 430
435tcg gca gct aca ccc aac ctg gga cct gtg ggc tct ggg atg
ctc aac 1517Ser Ala Ala Thr Pro Asn Leu Gly Pro Val Gly Ser Gly Met
Leu Asn 440 445 450aac cac ggc cac gca
gtg cca gcc aac agc gag atg acc agc agc cac 1565Asn His Gly His Ala
Val Pro Ala Asn Ser Glu Met Thr Ser Ser His455 460
465 470ggc acc cag tcc atg gtc tcg ggg tcc cac
tgc act ccg cca ccc ccc 1613Gly Thr Gln Ser Met Val Ser Gly Ser His
Cys Thr Pro Pro Pro Pro 475 480
485tac cac gcc gac ccc agc ctc gtc agg acc tgg ggg ccc tgaagatccc
1662Tyr His Ala Asp Pro Ser Leu Val Arg Thr Trp Gly Pro 490
495cgagcagtat cgcatgacca tctggcgggg cctgcaggac ctgaagcagg
gccacgacta 1722cggcgccgcc gcgcagcagc tgctccgctc cagcaacgcg gccgccattt
ccatcggcgg 1782ctccggggag ctgcagcgcc agcgggtcat ggaggccgtg cacttccgcg
tgcgccacac 1842catcaccatc cccaaccgcg gcggccccgg cgccggcccc gacgagtggg
cggacttcgg 1902cttcgacctg cccgactgca aggcccgcaa gcagcccatc aaggaggagt
tcacggaggc 1962cgagatccac tgaggggccg ggcccagcca gagcctgtgc caccgcccag
agacccaggc 2022cgcctcgctc tc
20344499PRTCebus apella 4Met Ala Gln Ser Thr Thr Thr Ser Pro
Asp Gly Gly Thr Thr Phe Glu1 5 10
15His Leu Trp Ser Ser Leu Glu Pro Asp Ser Thr Tyr Phe Asp Leu
Pro 20 25 30Gln Ser Ser Arg
Gly Asn Asn Glu Val Val Gly Gly Thr Asp Ser Ser 35
40 45Met Asp Val Phe His Leu Glu Gly Met Thr Thr Ser
Val Met Ala Gln 50 55 60Phe Asn Leu
Leu Ser Ser Thr Met Asp Gln Met Ser Ser Arg Ala Ala65 70
75 80Ser Ala Ser Pro Tyr Thr Pro Glu
His Ala Ala Ser Val Pro Thr His 85 90
95Ser Pro Tyr Ala Gln Pro Ser Ser Thr Phe Asp Thr Met Ser
Pro Ala 100 105 110Pro Val Ile
Pro Ser Asn Thr Asp Tyr Pro Gly Pro His His Phe Glu 115
120 125Val Thr Phe Gln Gln Ser Ser Thr Ala Lys Ser
Ala Thr Trp Thr Tyr 130 135 140Ser Pro
Leu Leu Lys Lys Leu Tyr Cys Gln Ile Ala Lys Thr Cys Pro145
150 155 160Ile Gln Ile Lys Val Ser Ala
Pro Pro Pro Pro Gly Thr Ala Ile Arg 165
170 175Ala Met Pro Val Tyr Lys Lys Ala Glu His Val Thr
Asp Ile Val Lys 180 185 190Arg
Cys Pro Asn His Glu Leu Gly Arg Asp Phe Asn Glu Gly Gln Ser 195
200 205Ala Pro Ala Ser His Leu Ile Arg Val
Glu Gly Asn Asn Leu Ser Gln 210 215
220Tyr Val Asp Asp Pro Val Thr Gly Arg Gln Ser Val Val Val Pro Tyr225
230 235 240Glu Pro Pro Gln
Val Gly Thr Glu Phe Thr Thr Ile Leu Tyr Asn Phe 245
250 255Met Cys Asn Ser Ser Cys Val Gly Gly Met
Asn Arg Arg Pro Ile Leu 260 265
270Ile Ile Ile Thr Leu Glu Thr Arg Asp Gly Gln Val Leu Gly Arg Arg
275 280 285Ser Phe Glu Gly Arg Ile Cys
Ala Cys Pro Gly Arg Asp Arg Lys Ala 290 295
300Asp Glu Asp His Tyr Arg Glu Gln Gln Ala Leu Asn Glu Ser Ser
Ala305 310 315 320Lys Asn
Gly Ala Ala Ser Lys Arg Ala Phe Lys Gln Ser Pro Pro Ala
325 330 335Val Pro Ala Leu Gly Pro Gly
Val Lys Lys Arg Arg His Gly Asp Glu 340 345
350Asp Thr Tyr Tyr Leu Gln Val Arg Gly Arg Glu Asn Phe Glu
Ile Leu 355 360 365Met Lys Leu Lys
Glu Ser Leu Glu Leu Met Glu Leu Val Pro Gln Pro 370
375 380Leu Val Asp Ser Tyr Arg Gln Gln Gln Gln Leu Leu
Gln Arg Pro Ser385 390 395
400His Leu Gln Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Met Asn Lys
405 410 415Val His Gly Gly Val
Asn Lys Leu Pro Ser Val Asn Gln Leu Val Gly 420
425 430Gln Pro Pro Pro His Ser Ser Ala Ala Thr Pro Asn
Leu Gly Pro Val 435 440 445Gly Ser
Gly Met Leu Asn Asn His Gly His Ala Val Pro Ala Asn Ser 450
455 460Glu Met Thr Ser Ser His Gly Thr Gln Ser Met
Val Ser Gly Ser His465 470 475
480Cys Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser Leu Val Arg Thr
485 490 495Trp Gly
Pro52156DNAHomo sapiensCDS(33)..(1940) 5gcgagctgcc ctcggaggcc ggcgtgggga
ag atg gcc cag tcc acc gcc acc 53
Met Ala Gln Ser Thr Ala Thr 1
5tcc cct gat ggg ggc acc acg ttt gag cac ctc tgg agc tct ctg gaa
101Ser Pro Asp Gly Gly Thr Thr Phe Glu His Leu Trp Ser Ser Leu Glu
10 15 20cca gac agc acc tac ttc gac
ctt ccc cag tca agc cgg ggg aat aat 149Pro Asp Ser Thr Tyr Phe Asp
Leu Pro Gln Ser Ser Arg Gly Asn Asn 25 30
35gag gtg gtg ggc gga acg gat tcc agc atg gac gtc ttc cac ctg gag
197Glu Val Val Gly Gly Thr Asp Ser Ser Met Asp Val Phe His Leu Glu40
45 50 55ggc atg act aca
tct gtc atg gcc cag ttc aat ctg ctg agc agc acc 245Gly Met Thr Thr
Ser Val Met Ala Gln Phe Asn Leu Leu Ser Ser Thr 60
65 70atg gac cag atg agc agc cgc gcg gcc tcg
gcc agc ccc tac acc cca 293Met Asp Gln Met Ser Ser Arg Ala Ala Ser
Ala Ser Pro Tyr Thr Pro 75 80
85gag cac gcc gcc agc gtg ccc acc cac tcg ccc tac gca caa ccc agc
341Glu His Ala Ala Ser Val Pro Thr His Ser Pro Tyr Ala Gln Pro Ser
90 95 100tcc acc ttc gac acc atg tcg
ccg gcg cct gtc atc ccc tcc aac acc 389Ser Thr Phe Asp Thr Met Ser
Pro Ala Pro Val Ile Pro Ser Asn Thr 105 110
115gac tac ccc gga ccc cac cac ttt gag gtc act ttc cag cag tcc agc
437Asp Tyr Pro Gly Pro His His Phe Glu Val Thr Phe Gln Gln Ser Ser120
125 130 135acg gcc aag tca
gcc acc tgg acg tac tcc ccg ctc ttg aag aaa ctc 485Thr Ala Lys Ser
Ala Thr Trp Thr Tyr Ser Pro Leu Leu Lys Lys Leu 140
145 150tac tgc cag atc gcc aag aca tgc ccc atc
cag atc aag gtg tcc acc 533Tyr Cys Gln Ile Ala Lys Thr Cys Pro Ile
Gln Ile Lys Val Ser Thr 155 160
165ccg cca ccc cca ggc act gcc atc cgg gcc atg cct gtt tac aag aaa
581Pro Pro Pro Pro Gly Thr Ala Ile Arg Ala Met Pro Val Tyr Lys Lys
170 175 180gcg gag cac gtg acc gac gtc
gtg aaa cgc tgc ccc aac cac gag ctc 629Ala Glu His Val Thr Asp Val
Val Lys Arg Cys Pro Asn His Glu Leu 185 190
195ggg agg gac ttc aac gaa gga cag tct gct cca gcc agc cac ctc atc
677Gly Arg Asp Phe Asn Glu Gly Gln Ser Ala Pro Ala Ser His Leu Ile200
205 210 215cgc gtg gaa ggc
aat aat ctc tcg cag tat gtg gat gac cct gtc acc 725Arg Val Glu Gly
Asn Asn Leu Ser Gln Tyr Val Asp Asp Pro Val Thr 220
225 230ggc agg cag agc gtc gtg gtg ccc tat gag
cca cca cag gtg ggg acg 773Gly Arg Gln Ser Val Val Val Pro Tyr Glu
Pro Pro Gln Val Gly Thr 235 240
245gaa ttc acc acc atc ctg tac aac ttc atg tgt aac agc agc tgt gta
821Glu Phe Thr Thr Ile Leu Tyr Asn Phe Met Cys Asn Ser Ser Cys Val
250 255 260ggg ggc atg aac cgg cgg ccc
atc ctc atc atc atc acc ctg gag atg 869Gly Gly Met Asn Arg Arg Pro
Ile Leu Ile Ile Ile Thr Leu Glu Met 265 270
275cgg gat ggg cag gtg ctg ggc cgc cgg tcc ttt gag ggc cgc atc tgc
917Arg Asp Gly Gln Val Leu Gly Arg Arg Ser Phe Glu Gly Arg Ile Cys280
285 290 295gcc tgt cct ggc
cgc gac cga aaa gct gat gag gac cac tac cgg gag 965Ala Cys Pro Gly
Arg Asp Arg Lys Ala Asp Glu Asp His Tyr Arg Glu 300
305 310cag cag gcc ctg aac gag agc tcc gcc aag
aac ggg gcc gcc agc aag 1013Gln Gln Ala Leu Asn Glu Ser Ser Ala Lys
Asn Gly Ala Ala Ser Lys 315 320
325cgt gcc ttc aag cag agc ccc cct gcc gtc ccc gcc ctt ggt gcc ggt
1061Arg Ala Phe Lys Gln Ser Pro Pro Ala Val Pro Ala Leu Gly Ala Gly
330 335 340gtg aag aag cgg cgg cat gga
gac gag gac acg tac tac ctt cag gtg 1109Val Lys Lys Arg Arg His Gly
Asp Glu Asp Thr Tyr Tyr Leu Gln Val 345 350
355cga ggc cgg gag aac ttt gag atc ctg atg aag ctg aaa gag agc ctg
1157Arg Gly Arg Glu Asn Phe Glu Ile Leu Met Lys Leu Lys Glu Ser Leu360
365 370 375gag ctg atg gag
ttg gtg ccg cag cca ctg gtg gac tcc tat cgg cag 1205Glu Leu Met Glu
Leu Val Pro Gln Pro Leu Val Asp Ser Tyr Arg Gln 380
385 390cag cag cag ctc cta cag agg ccg agt cac
cta cag ccc ccg tcc tac 1253Gln Gln Gln Leu Leu Gln Arg Pro Ser His
Leu Gln Pro Pro Ser Tyr 395 400
405ggg ccg gtc ctc tcg ccc atg aac aag gtg cac ggg ggc atg aac aag
1301Gly Pro Val Leu Ser Pro Met Asn Lys Val His Gly Gly Met Asn Lys
410 415 420ctg ccc tcc gtc aac cag ctg
gtg ggc cag cct ccc ccg cac agt tcg 1349Leu Pro Ser Val Asn Gln Leu
Val Gly Gln Pro Pro Pro His Ser Ser 425 430
435gca gct aca ccc aac ctg ggg ccc gtg ggc ccc ggg atg ctc aac aac
1397Ala Ala Thr Pro Asn Leu Gly Pro Val Gly Pro Gly Met Leu Asn Asn440
445 450 455cat ggc cac gca
gtg cca gcc aac ggc gag atg agc agc agc cac agc 1445His Gly His Ala
Val Pro Ala Asn Gly Glu Met Ser Ser Ser His Ser 460
465 470gcc cag tcc atg gtc tcg ggg tcc cac tgc
act ccg cca ccc ccc tac 1493Ala Gln Ser Met Val Ser Gly Ser His Cys
Thr Pro Pro Pro Pro Tyr 475 480
485cac gcc gac ccc agc ctc gtc agt ttt tta aca gga ttg ggg tgt cca
1541His Ala Asp Pro Ser Leu Val Ser Phe Leu Thr Gly Leu Gly Cys Pro
490 495 500aac tgc atc gag tat ttc acc
tcc caa ggg tta cag agc att tac cac 1589Asn Cys Ile Glu Tyr Phe Thr
Ser Gln Gly Leu Gln Ser Ile Tyr His 505 510
515ctg cag aac ctg acc att gag gac ctg ggg gcc ctg aag atc ccc gag
1637Leu Gln Asn Leu Thr Ile Glu Asp Leu Gly Ala Leu Lys Ile Pro Glu520
525 530 535cag tac cgc atg
acc atc tgg cgg ggc ctg cag gac ctg aag cag ggc 1685Gln Tyr Arg Met
Thr Ile Trp Arg Gly Leu Gln Asp Leu Lys Gln Gly 540
545 550cac gac tac agc acc gcg cag cag ctg ctc
cgc tct agc aac gcg gcc 1733His Asp Tyr Ser Thr Ala Gln Gln Leu Leu
Arg Ser Ser Asn Ala Ala 555 560
565acc atc tcc atc ggc ggc tca ggg gaa ctg cag cgc cag cgg gtc atg
1781Thr Ile Ser Ile Gly Gly Ser Gly Glu Leu Gln Arg Gln Arg Val Met
570 575 580gag gcc gtg cac ttc cgc gtg
cgc cac acc atc acc atc ccc aac cgc 1829Glu Ala Val His Phe Arg Val
Arg His Thr Ile Thr Ile Pro Asn Arg 585 590
595ggc ggc cca ggc ggc ggc cct gac gag tgg gcg gac ttc ggc ttc gac
1877Gly Gly Pro Gly Gly Gly Pro Asp Glu Trp Ala Asp Phe Gly Phe Asp600
605 610 615ctg ccc gac tgc
aag gcc cgc aag cag ccc atc aag gag gag ttc acg 1925Leu Pro Asp Cys
Lys Ala Arg Lys Gln Pro Ile Lys Glu Glu Phe Thr 620
625 630gag gcc gag atc cac tgagggcctc gcctggctgc
agcctgcgcc accgcccaga 1980Glu Ala Glu Ile His
635gacccaagct gcctcccctc tccttcctgt gtgtccaaaa ctgcctcagg aggcaggacc
2040ttcgggctgt gcccggggaa aggcaaggtc cggcccatcc ccaggcacct cacaggcccc
2100aggaaaggcc cagccaccga agccgcctgt ggacagcctg agtcacctgc agaacc
21566636PRTHomo sapiens 6Met Ala Gln Ser Thr Ala Thr Ser Pro Asp Gly Gly
Thr Thr Phe Glu1 5 10
15His Leu Trp Ser Ser Leu Glu Pro Asp Ser Thr Tyr Phe Asp Leu Pro
20 25 30Gln Ser Ser Arg Gly Asn Asn
Glu Val Val Gly Gly Thr Asp Ser Ser 35 40
45Met Asp Val Phe His Leu Glu Gly Met Thr Thr Ser Val Met Ala
Gln 50 55 60Phe Asn Leu Leu Ser Ser
Thr Met Asp Gln Met Ser Ser Arg Ala Ala65 70
75 80Ser Ala Ser Pro Tyr Thr Pro Glu His Ala Ala
Ser Val Pro Thr His 85 90
95Ser Pro Tyr Ala Gln Pro Ser Ser Thr Phe Asp Thr Met Ser Pro Ala
100 105 110Pro Val Ile Pro Ser Asn
Thr Asp Tyr Pro Gly Pro His His Phe Glu 115 120
125Val Thr Phe Gln Gln Ser Ser Thr Ala Lys Ser Ala Thr Trp
Thr Tyr 130 135 140Ser Pro Leu Leu Lys
Lys Leu Tyr Cys Gln Ile Ala Lys Thr Cys Pro145 150
155 160Ile Gln Ile Lys Val Ser Thr Pro Pro Pro
Pro Gly Thr Ala Ile Arg 165 170
175Ala Met Pro Val Tyr Lys Lys Ala Glu His Val Thr Asp Val Val Lys
180 185 190Arg Cys Pro Asn His
Glu Leu Gly Arg Asp Phe Asn Glu Gly Gln Ser 195
200 205Ala Pro Ala Ser His Leu Ile Arg Val Glu Gly Asn
Asn Leu Ser Gln 210 215 220Tyr Val Asp
Asp Pro Val Thr Gly Arg Gln Ser Val Val Val Pro Tyr225
230 235 240Glu Pro Pro Gln Val Gly Thr
Glu Phe Thr Thr Ile Leu Tyr Asn Phe 245
250 255Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg
Arg Pro Ile Leu 260 265 270Ile
Ile Ile Thr Leu Glu Met Arg Asp Gly Gln Val Leu Gly Arg Arg 275
280 285Ser Phe Glu Gly Arg Ile Cys Ala Cys
Pro Gly Arg Asp Arg Lys Ala 290 295
300Asp Glu Asp His Tyr Arg Glu Gln Gln Ala Leu Asn Glu Ser Ser Ala305
310 315 320Lys Asn Gly Ala
Ala Ser Lys Arg Ala Phe Lys Gln Ser Pro Pro Ala 325
330 335Val Pro Ala Leu Gly Ala Gly Val Lys Lys
Arg Arg His Gly Asp Glu 340 345
350Asp Thr Tyr Tyr Leu Gln Val Arg Gly Arg Glu Asn Phe Glu Ile Leu
355 360 365Met Lys Leu Lys Glu Ser Leu
Glu Leu Met Glu Leu Val Pro Gln Pro 370 375
380Leu Val Asp Ser Tyr Arg Gln Gln Gln Gln Leu Leu Gln Arg Pro
Ser385 390 395 400His Leu
Gln Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Met Asn Lys
405 410 415Val His Gly Gly Met Asn Lys
Leu Pro Ser Val Asn Gln Leu Val Gly 420 425
430Gln Pro Pro Pro His Ser Ser Ala Ala Thr Pro Asn Leu Gly
Pro Val 435 440 445Gly Pro Gly Met
Leu Asn Asn His Gly His Ala Val Pro Ala Asn Gly 450
455 460Glu Met Ser Ser Ser His Ser Ala Gln Ser Met Val
Ser Gly Ser His465 470 475
480Cys Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser Leu Val Ser Phe
485 490 495Leu Thr Gly Leu Gly
Cys Pro Asn Cys Ile Glu Tyr Phe Thr Ser Gln 500
505 510Gly Leu Gln Ser Ile Tyr His Leu Gln Asn Leu Thr
Ile Glu Asp Leu 515 520 525Gly Ala
Leu Lys Ile Pro Glu Gln Tyr Arg Met Thr Ile Trp Arg Gly 530
535 540Leu Gln Asp Leu Lys Gln Gly His Asp Tyr Ser
Thr Ala Gln Gln Leu545 550 555
560Leu Arg Ser Ser Asn Ala Ala Thr Ile Ser Ile Gly Gly Ser Gly Glu
565 570 575Leu Gln Arg Gln
Arg Val Met Glu Ala Val His Phe Arg Val Arg His 580
585 590Thr Ile Thr Ile Pro Asn Arg Gly Gly Pro Gly
Gly Gly Pro Asp Glu 595 600 605Trp
Ala Asp Phe Gly Phe Asp Leu Pro Asp Cys Lys Ala Arg Lys Gln 610
615 620Pro Ile Lys Glu Glu Phe Thr Glu Ala Glu
Ile His625 630 63572040DNAMus
musculusCDS(124)..(1890) 7tgatctccct gtggcctgca ggggactgag ccagggagta
gatgccctga gaccccaagg 60gacacccaag gaaaccttgc tggctttgag aaagggatcg
tctctctcct gcccaagaga 120agc atg tgt atg ggc cct gtg tat gaa tcc ttg
ggg cag gcc cag ttc 168 Met Cys Met Gly Pro Val Tyr Glu Ser Leu
Gly Gln Ala Gln Phe 1 5 10
15aat ttg ctc agc agt gcc atg gac cag atg ggc agc cgt gcg gcc ccg
216Asn Leu Leu Ser Ser Ala Met Asp Gln Met Gly Ser Arg Ala Ala Pro
20 25 30gcg agc ccc tac acc
ccg gag cac gcc gcc agc gcg ccc acc cac tcg 264Ala Ser Pro Tyr Thr
Pro Glu His Ala Ala Ser Ala Pro Thr His Ser 35
40 45ccc tac gcg cag ccc agc tcc acc ttc gac acc atg
tct ccg gcg cct 312Pro Tyr Ala Gln Pro Ser Ser Thr Phe Asp Thr Met
Ser Pro Ala Pro 50 55 60gtc atc
cct tcc aat acc gac tac ccc ggc ccc cac cac ttc gag gtc 360Val Ile
Pro Ser Asn Thr Asp Tyr Pro Gly Pro His His Phe Glu Val 65
70 75acc ttc cag cag tcg agc act gcc aag tcg gcc
acc tgg aca tac tcc 408Thr Phe Gln Gln Ser Ser Thr Ala Lys Ser Ala
Thr Trp Thr Tyr Ser80 85 90
95cca ctc ttg aag aag ttg tac tgt cag att gct aag aca tgc ccc atc
456Pro Leu Leu Lys Lys Leu Tyr Cys Gln Ile Ala Lys Thr Cys Pro Ile
100 105 110cag atc aaa gtg tcc
aca cca cca ccc ccg ggc acg gcc atc cgg gcc 504Gln Ile Lys Val Ser
Thr Pro Pro Pro Pro Gly Thr Ala Ile Arg Ala 115
120 125atg cct gtc tac aag aag gca gag cat gtg acc gac
att gtt aag cgc 552Met Pro Val Tyr Lys Lys Ala Glu His Val Thr Asp
Ile Val Lys Arg 130 135 140tgc ccc
aac cac gag ctt gga agg gac ttc aat gaa gga cag tct gcc 600Cys Pro
Asn His Glu Leu Gly Arg Asp Phe Asn Glu Gly Gln Ser Ala 145
150 155ccg gct agc cac ctc atc cgt gta gaa ggc aac
aac ctc gcc cag tac 648Pro Ala Ser His Leu Ile Arg Val Glu Gly Asn
Asn Leu Ala Gln Tyr160 165 170
175gtg gat gac cct gtc acc gga agg cag agt gtg gtt gtg ccg tat gaa
696Val Asp Asp Pro Val Thr Gly Arg Gln Ser Val Val Val Pro Tyr Glu
180 185 190ccc cca cag gtg gga
aca gaa ttt acc acc atc ctg tac aac ttc atg 744Pro Pro Gln Val Gly
Thr Glu Phe Thr Thr Ile Leu Tyr Asn Phe Met 195
200 205tgt aac agc agc tgt gtg ggg ggc atg aat cgg agg
ccc atc ctt gtc 792Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg Arg
Pro Ile Leu Val 210 215 220atc atc
acc ctg gag acc cgg gat gga cag gtc ctg ggc cgc cgg tct 840Ile Ile
Thr Leu Glu Thr Arg Asp Gly Gln Val Leu Gly Arg Arg Ser 225
230 235ttc gag ggt cgc atc tgt gcc tgt cct ggc cgt
gac cgc aaa gct gat 888Phe Glu Gly Arg Ile Cys Ala Cys Pro Gly Arg
Asp Arg Lys Ala Asp240 245 250
255gaa gac cat tac cgg gag caa cag gct ctg aat gaa agt acc acc aaa
936Glu Asp His Tyr Arg Glu Gln Gln Ala Leu Asn Glu Ser Thr Thr Lys
260 265 270aat gga gct gcc agc
aaa cgt gca ttc aag cag agc ccc cct gcc atc 984Asn Gly Ala Ala Ser
Lys Arg Ala Phe Lys Gln Ser Pro Pro Ala Ile 275
280 285cct gcc ctg ggt acc aac gtg aag aag aga cgc cac
ggg gac gag gac 1032Pro Ala Leu Gly Thr Asn Val Lys Lys Arg Arg His
Gly Asp Glu Asp 290 295 300atg ttc
tac atg cac gtg cga ggc cgg gag aac ttt gag atc ttg atg 1080Met Phe
Tyr Met His Val Arg Gly Arg Glu Asn Phe Glu Ile Leu Met 305
310 315aaa gtc aag gag agc cta gaa ctg atg gag ctt
gtg ccc cag cct ttg 1128Lys Val Lys Glu Ser Leu Glu Leu Met Glu Leu
Val Pro Gln Pro Leu320 325 330
335gtt gac tcc tat cga cag cag cag cag cag cag ctc cta cag agg ccg
1176Val Asp Ser Tyr Arg Gln Gln Gln Gln Gln Gln Leu Leu Gln Arg Pro
340 345 350agt cac ctg cag cct
cca tcc tat ggg ccc gtg ctc tcc cca atg aac 1224Ser His Leu Gln Pro
Pro Ser Tyr Gly Pro Val Leu Ser Pro Met Asn 355
360 365aag gta cac ggt ggt gtc aac aaa ctg ccc tcc gtc
aac cag ctg gtg 1272Lys Val His Gly Gly Val Asn Lys Leu Pro Ser Val
Asn Gln Leu Val 370 375 380ggc cag
cct ccc ccg cac agc tca gca gct ggg ccc aac ctg ggg ccc 1320Gly Gln
Pro Pro Pro His Ser Ser Ala Ala Gly Pro Asn Leu Gly Pro 385
390 395atg ggc tcc ggg atg ctc aac agc cac ggc cac
agc atg ccg gcc aat 1368Met Gly Ser Gly Met Leu Asn Ser His Gly His
Ser Met Pro Ala Asn400 405 410
415ggt gag atg aat gga ggc cac agc tcc cag acc atg gtt tcg gga tcc
1416Gly Glu Met Asn Gly Gly His Ser Ser Gln Thr Met Val Ser Gly Ser
420 425 430cac tgc acc ccg cca
ccc ccc tat cat gca gac ccc agc ctc gtc agt 1464His Cys Thr Pro Pro
Pro Pro Tyr His Ala Asp Pro Ser Leu Val Ser 435
440 445ttt ttg aca ggg ttg ggg tgt cca aac tgc atc gag
tgc ttc act tcc 1512Phe Leu Thr Gly Leu Gly Cys Pro Asn Cys Ile Glu
Cys Phe Thr Ser 450 455 460caa ggg
ttg cag agc atc tac cac ctg cag aac ctt acc atc gag gac 1560Gln Gly
Leu Gln Ser Ile Tyr His Leu Gln Asn Leu Thr Ile Glu Asp 465
470 475ctt ggg gct ctg aag gtc cct gac cag tac cgt
atg acc atc tgg agg 1608Leu Gly Ala Leu Lys Val Pro Asp Gln Tyr Arg
Met Thr Ile Trp Arg480 485 490
495ggc cta cag gac ctg aag cag agc cat gac tgc ggc cag caa ctg cta
1656Gly Leu Gln Asp Leu Lys Gln Ser His Asp Cys Gly Gln Gln Leu Leu
500 505 510cgc tcc agc agc aac
gcg gcc acc atc tcc atc ggc ggc tct ggc gag 1704Arg Ser Ser Ser Asn
Ala Ala Thr Ile Ser Ile Gly Gly Ser Gly Glu 515
520 525ctg cag cgg cag cgg gtc atg gaa gcc gtg cat ttc
cgt gtg cgc cac 1752Leu Gln Arg Gln Arg Val Met Glu Ala Val His Phe
Arg Val Arg His 530 535 540acc atc
aca atc ccc aac cgt gga ggc gca ggt gcg gtg aca ggt ccc 1800Thr Ile
Thr Ile Pro Asn Arg Gly Gly Ala Gly Ala Val Thr Gly Pro 545
550 555gac gag tgg gcg gac ttt ggc ttt gac ctg cct
gac tgc aag tcc cgt 1848Asp Glu Trp Ala Asp Phe Gly Phe Asp Leu Pro
Asp Cys Lys Ser Arg560 565 570
575aag cag ccc atc aaa gag gag ttc aca gag aca gag agc cac
1890Lys Gln Pro Ile Lys Glu Glu Phe Thr Glu Thr Glu Ser His
580 585tgaggaacgt accttcttct cctgtccttc ctctgtgaga
aactgctctt ggaagtggga 1950cctgttggct gtgcccacag aaaccagcaa ggaccttctg
ccggatgcca ttcctgaagg 2010gaagtcgctc atgaactaac tccctcttgg
20408589PRTMus musculus 8Met Cys Met Gly Pro Val
Tyr Glu Ser Leu Gly Gln Ala Gln Phe Asn1 5
10 15Leu Leu Ser Ser Ala Met Asp Gln Met Gly Ser Arg
Ala Ala Pro Ala 20 25 30Ser
Pro Tyr Thr Pro Glu His Ala Ala Ser Ala Pro Thr His Ser Pro 35
40 45Tyr Ala Gln Pro Ser Ser Thr Phe Asp
Thr Met Ser Pro Ala Pro Val 50 55
60Ile Pro Ser Asn Thr Asp Tyr Pro Gly Pro His His Phe Glu Val Thr65
70 75 80Phe Gln Gln Ser Ser
Thr Ala Lys Ser Ala Thr Trp Thr Tyr Ser Pro 85
90 95Leu Leu Lys Lys Leu Tyr Cys Gln Ile Ala Lys
Thr Cys Pro Ile Gln 100 105
110Ile Lys Val Ser Thr Pro Pro Pro Pro Gly Thr Ala Ile Arg Ala Met
115 120 125Pro Val Tyr Lys Lys Ala Glu
His Val Thr Asp Ile Val Lys Arg Cys 130 135
140Pro Asn His Glu Leu Gly Arg Asp Phe Asn Glu Gly Gln Ser Ala
Pro145 150 155 160Ala Ser
His Leu Ile Arg Val Glu Gly Asn Asn Leu Ala Gln Tyr Val
165 170 175Asp Asp Pro Val Thr Gly Arg
Gln Ser Val Val Val Pro Tyr Glu Pro 180 185
190Pro Gln Val Gly Thr Glu Phe Thr Thr Ile Leu Tyr Asn Phe
Met Cys 195 200 205Asn Ser Ser Cys
Val Gly Gly Met Asn Arg Arg Pro Ile Leu Val Ile 210
215 220Ile Thr Leu Glu Thr Arg Asp Gly Gln Val Leu Gly
Arg Arg Ser Phe225 230 235
240Glu Gly Arg Ile Cys Ala Cys Pro Gly Arg Asp Arg Lys Ala Asp Glu
245 250 255Asp His Tyr Arg Glu
Gln Gln Ala Leu Asn Glu Ser Thr Thr Lys Asn 260
265 270Gly Ala Ala Ser Lys Arg Ala Phe Lys Gln Ser Pro
Pro Ala Ile Pro 275 280 285Ala Leu
Gly Thr Asn Val Lys Lys Arg Arg His Gly Asp Glu Asp Met 290
295 300Phe Tyr Met His Val Arg Gly Arg Glu Asn Phe
Glu Ile Leu Met Lys305 310 315
320Val Lys Glu Ser Leu Glu Leu Met Glu Leu Val Pro Gln Pro Leu Val
325 330 335Asp Ser Tyr Arg
Gln Gln Gln Gln Gln Gln Leu Leu Gln Arg Pro Ser 340
345 350His Leu Gln Pro Pro Ser Tyr Gly Pro Val Leu
Ser Pro Met Asn Lys 355 360 365Val
His Gly Gly Val Asn Lys Leu Pro Ser Val Asn Gln Leu Val Gly 370
375 380Gln Pro Pro Pro His Ser Ser Ala Ala Gly
Pro Asn Leu Gly Pro Met385 390 395
400Gly Ser Gly Met Leu Asn Ser His Gly His Ser Met Pro Ala Asn
Gly 405 410 415Glu Met Asn
Gly Gly His Ser Ser Gln Thr Met Val Ser Gly Ser His 420
425 430Cys Thr Pro Pro Pro Pro Tyr His Ala Asp
Pro Ser Leu Val Ser Phe 435 440
445Leu Thr Gly Leu Gly Cys Pro Asn Cys Ile Glu Cys Phe Thr Ser Gln 450
455 460Gly Leu Gln Ser Ile Tyr His Leu
Gln Asn Leu Thr Ile Glu Asp Leu465 470
475 480Gly Ala Leu Lys Val Pro Asp Gln Tyr Arg Met Thr
Ile Trp Arg Gly 485 490
495Leu Gln Asp Leu Lys Gln Ser His Asp Cys Gly Gln Gln Leu Leu Arg
500 505 510Ser Ser Ser Asn Ala Ala
Thr Ile Ser Ile Gly Gly Ser Gly Glu Leu 515 520
525Gln Arg Gln Arg Val Met Glu Ala Val His Phe Arg Val Arg
His Thr 530 535 540Ile Thr Ile Pro Asn
Arg Gly Gly Ala Gly Ala Val Thr Gly Pro Asp545 550
555 560Glu Trp Ala Asp Phe Gly Phe Asp Leu Pro
Asp Cys Lys Ser Arg Lys 565 570
575Gln Pro Ile Lys Glu Glu Phe Thr Glu Thr Glu Ser His
580 5859758DNAMus musculusCDS(389)..(757) 9tggtcccgct
tcgaccaaga ctccggctac cagcttgcgg gccccgcgga ggaggagacc 60ccgctggggc
tagctgggcg acgcgcgcca agcggcggcg ggaaggaggc gggaggagcg 120gggcccgaga
ccccgactcg ggcagagcca gctggggagg cggggcgcgc gtgggagcca 180ggggcccggg
tggccggccc tcctccgcca cggctgagtg cccgcgctgc cttcccgccg 240gtccgccaag
aaaggcgcta agcctgcggc agtcccctcg ccgccgcctc cctgctccgc 300acccttataa
cccgccgtcc cgcatccagg cgaggaggca acgctgcagc ccagccctcg 360ccgacgccga
cgcccggccc ggagcaga atg agc ggc agc gtt ggg gag atg 412
Met Ser Gly Ser Val Gly Glu Met
1 5gcc cag acc tct tct tcc tcc tcc tcc acc ttc gag
cac ctg tgg agt 460Ala Gln Thr Ser Ser Ser Ser Ser Ser Thr Phe Glu
His Leu Trp Ser 10 15 20tct cta gag
cca gac agc acc tac ttt gac ctc ccc cag ccc agc caa 508Ser Leu Glu
Pro Asp Ser Thr Tyr Phe Asp Leu Pro Gln Pro Ser Gln25 30
35 40ggg act agc gag gca tca ggc agc
gag gag tcc aac atg gat gtc ttc 556Gly Thr Ser Glu Ala Ser Gly Ser
Glu Glu Ser Asn Met Asp Val Phe 45 50
55cac ctg caa ggc atg gcc cag ttc aat ttg ctc agc agt gcc
atg gac 604His Leu Gln Gly Met Ala Gln Phe Asn Leu Leu Ser Ser Ala
Met Asp 60 65 70cag atg ggc
agc cgt gcg gcc ccg gcg agc ccc tac acc ccg gag cac 652Gln Met Gly
Ser Arg Ala Ala Pro Ala Ser Pro Tyr Thr Pro Glu His 75
80 85gcc gcc agc gcg ccc acc cac tcg ccc tac gcg
cag ccc agc tcc acc 700Ala Ala Ser Ala Pro Thr His Ser Pro Tyr Ala
Gln Pro Ser Ser Thr 90 95 100ttc gac
acc atg tct ccg gcg cct gtc atc cct tcc aat acc gac tac 748Phe Asp
Thr Met Ser Pro Ala Pro Val Ile Pro Ser Asn Thr Asp Tyr105
110 115 120ccc ggc ccc c
758Pro Gly Pro10123PRTMus musculus
10Met Ser Gly Ser Val Gly Glu Met Ala Gln Thr Ser Ser Ser Ser Ser1
5 10 15Ser Thr Phe Glu His Leu
Trp Ser Ser Leu Glu Pro Asp Ser Thr Tyr 20 25
30Phe Asp Leu Pro Gln Pro Ser Gln Gly Thr Ser Glu Ala
Ser Gly Ser 35 40 45Glu Glu Ser
Asn Met Asp Val Phe His Leu Gln Gly Met Ala Gln Phe 50
55 60Asn Leu Leu Ser Ser Ala Met Asp Gln Met Gly Ser
Arg Ala Ala Pro65 70 75
80Ala Ser Pro Tyr Thr Pro Glu His Ala Ala Ser Ala Pro Thr His Ser
85 90 95Pro Tyr Ala Gln Pro Ser
Ser Thr Phe Asp Thr Met Ser Pro Ala Pro 100
105 110Val Ile Pro Ser Asn Thr Asp Tyr Pro Gly Pro
115 12011559DNAHomo sapiens 11cgaccttccc cagtcaagcc
gggggaataa tgaggtggtg ggcggaacgg attccagcat 60ggacgtcttc cacctggagg
gcatgactac atctgtcatg catcctcggc tcctgcctca 120ctagctgcgg agcctctccc
gctcggtcca cgctgccggg cggccacgac cgtgaccctt 180cccctcgggc cgcccagatc
catgcctcgt cccacgggac accagttccc tggcgtgtgc 240agaccccccg gcgcctacca
tgctgtacgt cggtgacccc gcacggcacc tcgccacggc 300ccagttcaat ctgctgagca
gcaccatgga ccagatgagc agccgcgcgg cctcggccag 360cccctacacc ccagagcacg
ccgccagcgt gcccacccac tcgccctacg cacaacccag 420ctccaccttc gacaccatgt
cgccggcgcc tgtcatcccc tccaacaccg actaccccgg 480accccaccac tttgaggtca
ctttccagca gtccagcacg gccaagtcag ccacctggac 540gtactccccg ctcttgaag
559121764DNAHomo sapiens
12atgctgtacg tcggtgaccc cgcacggcac ctcgccacgg cccagttcaa tctgctgagc
60agcaccatgg accagatgag cagccgcgcg gcctcggcca gcccctacac cccagagcac
120gccgccagcg tgcccaccca ctcgccctac gcacaaccca gctccacctt cgacaccatg
180tcgccggcgc ctgtcatccc ctccaacacc gactaccccg gaccccacca ctttgaggtc
240actttccagc agtccagcac ggccaagtca gccacctgga cgtactcccc gctcttgaag
300aaactctact gccagatcgc caagacatgc cccatccaga tcaaggtgtc caccccgcca
360cccccaggca ctgccatccg ggccatgcct gtttacaaga aagcggagca cgtgaccgac
420gtcgtgaaac gctgccccaa ccacgagctc gggagggact tcaacgaagg acagtctgct
480ccagccagcc acctcatccg cgtggaaggc aataatctct cgcagtatgt ggatgaccct
540gtcaccggca ggcagagcgt cgtggtgccc tatgagccac cacaggtggg gacggaattc
600accaccatcc tgtacaactt catgtgtaac agcagctgtg tagggggcat gaaccggcgg
660cccatcctca tcatcatcac cctggagatg cgggatgggc aggtgctggg ccgccggtcc
720tttgagggcc gcatctgcgc ctgtcctggc cgcgaccgaa aagctgatga ggaccactac
780cgggagcagc aggccctgaa cgagagctcc gccaagaacg gggccgccag caagcgtgcc
840ttcaagcaga gcccccctgc cgtccccgcc cttggtgccg gtgtgaagaa gcggcggcat
900ggagacgagg acacgtacta ccttcaggtg cgaggccggg agaactttga gatcctgatg
960aagctgaaag agagcctgga gctgatggag ttggtgccgc agccactggt ggactcctat
1020cggcagcagc agcagctcct acagaggccg agtcacctac agcccccgtc ctacgggccg
1080gtcctctcgc ccatgaacaa ggtgcacggg ggcatgaaca agctgccctc cgtcaaccag
1140ctggtgggcc agcctccccc gcacagttcg gcagctacac ccaacctggg gcccgtgggc
1200cccgggatgc tcaacaacca tggccacgca gtgccagcca acggcgagat gagcagcagc
1260cacagcgccc agtccatggt ctcggggtcc cactgcactc cgccaccccc ctaccacgcc
1320gaccccagcc tcgtcagttt tttaacagga ttggggtgtc caaactgcat cgagtatttc
1380acctcccaag ggttacagag catttaccac ctgcagaacc tgaccattga ggacctgggg
1440gccctgaaga tccccgagca gtaccgcatg accatctggc ggggcctgca ggacctgaag
1500cagggccacg actacagcac cgcgcagcag ctgctccgct ctagcaacgc ggccaccatc
1560tccatcggcg gctcagggga actgcagcgc cagcgggtca tggaggccgt gcacttccgc
1620gtgcgccaca ccatcaccat ccccaaccgc ggcggcccag gcggcggccc tgacgagtgg
1680gcggacttcg gcttcgacct gcccgactgc aaggcccgca agcagcccat caaggaggag
1740ttcacggagg ccgagatcca ctga
176413587PRTHomo sapiens 13Met Leu Tyr Val Gly Asp Pro Ala Arg His Leu
Ala Thr Ala Gln Phe1 5 10
15Asn Leu Leu Ser Ser Thr Met Asp Gln Met Ser Ser Arg Ala Ala Ser
20 25 30Ala Ser Pro Tyr Thr Pro Glu
His Ala Ala Ser Val Pro Thr His Ser 35 40
45Pro Tyr Ala Gln Pro Ser Ser Thr Phe Asp Thr Met Ser Pro Ala
Pro 50 55 60Val Ile Pro Ser Asn Thr
Asp Tyr Pro Gly Pro His His Phe Glu Val65 70
75 80Thr Phe Gln Gln Ser Ser Thr Ala Lys Ser Ala
Thr Trp Thr Tyr Ser 85 90
95Pro Leu Leu Lys Lys Leu Tyr Cys Gln Ile Ala Lys Thr Cys Pro Ile
100 105 110Gln Ile Lys Val Ser Thr
Pro Pro Pro Pro Gly Thr Ala Ile Arg Ala 115 120
125Met Pro Val Tyr Lys Lys Ala Glu His Val Thr Asp Val Val
Lys Arg 130 135 140Cys Pro Asn His Glu
Leu Gly Arg Asp Phe Asn Glu Gly Gln Ser Ala145 150
155 160Pro Ala Ser His Leu Ile Arg Val Glu Gly
Asn Asn Leu Ser Gln Tyr 165 170
175Val Asp Asp Pro Val Thr Gly Arg Gln Ser Val Val Val Pro Tyr Glu
180 185 190Pro Pro Gln Val Gly
Thr Glu Phe Thr Thr Ile Leu Tyr Asn Phe Met 195
200 205Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg Arg
Pro Ile Leu Ile 210 215 220Ile Ile Thr
Leu Glu Met Arg Asp Gly Gln Val Leu Gly Arg Arg Ser225
230 235 240Phe Glu Gly Arg Ile Cys Ala
Cys Pro Gly Arg Asp Arg Lys Ala Asp 245
250 255Glu Asp His Tyr Arg Glu Gln Gln Ala Leu Asn Glu
Ser Ser Ala Lys 260 265 270Asn
Gly Ala Ala Ser Lys Arg Ala Phe Lys Gln Ser Pro Pro Ala Val 275
280 285Pro Ala Leu Gly Ala Gly Val Lys Lys
Arg Arg His Gly Asp Glu Asp 290 295
300Thr Tyr Tyr Leu Gln Val Arg Gly Arg Glu Asn Phe Glu Ile Leu Met305
310 315 320Lys Leu Lys Glu
Ser Leu Glu Leu Met Glu Leu Val Pro Gln Pro Leu 325
330 335Val Asp Ser Tyr Arg Gln Gln Gln Gln Leu
Leu Gln Arg Pro Ser His 340 345
350Leu Gln Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Met Asn Lys Val
355 360 365His Gly Gly Met Asn Lys Leu
Pro Ser Val Asn Gln Leu Val Gly Gln 370 375
380Pro Pro Pro His Ser Ser Ala Ala Thr Pro Asn Leu Gly Pro Val
Gly385 390 395 400Pro Gly
Met Leu Asn Asn His Gly His Ala Val Pro Ala Asn Gly Glu
405 410 415Met Ser Ser Ser His Ser Ala
Gln Ser Met Val Ser Gly Ser His Cys 420 425
430Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser Leu Val Ser
Phe Leu 435 440 445Thr Gly Leu Gly
Cys Pro Asn Cys Ile Glu Tyr Phe Thr Ser Gln Gly 450
455 460Leu Gln Ser Ile Tyr His Leu Gln Asn Leu Thr Ile
Glu Asp Leu Gly465 470 475
480Ala Leu Lys Ile Pro Glu Gln Tyr Arg Met Thr Ile Trp Arg Gly Leu
485 490 495Gln Asp Leu Lys Gln
Gly His Asp Tyr Ser Thr Ala Gln Gln Leu Leu 500
505 510Arg Ser Ser Asn Ala Ala Thr Ile Ser Ile Gly Gly
Ser Gly Glu Leu 515 520 525Gln Arg
Gln Arg Val Met Glu Ala Val His Phe Arg Val Arg His Thr 530
535 540Ile Thr Ile Pro Asn Arg Gly Gly Pro Gly Gly
Gly Pro Asp Glu Trp545 550 555
560Ala Asp Phe Gly Phe Asp Leu Pro Asp Cys Lys Ala Arg Lys Gln Pro
565 570 575Ile Lys Glu Glu
Phe Thr Glu Ala Glu Ile His 580
585141521DNAHomo sapiens 14atgctgtacg tcggtgaccc cgcacggcac ctcgccacgg
cccagttcaa tctgctgagc 60agcaccatgg accagatgag cagccgcgcg gcctcggcca
gcccctacac cccagagcac 120gccgccagcg tgcccaccca ctcgccctac gcacaaccca
gctccacctt cgacaccatg 180tcgccggcgc ctgtcatccc ctccaacacc gactaccccg
gaccccacca ctttgaggtc 240actttccagc agtccagcac ggccaagtca gccacctgga
cgtactcccc gctcttgaag 300aaactctact gccagatcgc caagacatgc cccatccaga
tcaaggtgtc caccccgcca 360cccccaggca ctgccatccg ggccatgcct gtttacaaga
aagcggagca cgtgaccgac 420gtcgtgaaac gctgccccaa ccacgagctc gggagggact
tcaacgaagg acagtctgct 480ccagccagcc acctcatccg cgtggaaggc aataatctct
cgcagtatgt ggatgaccct 540gtcaccggca ggcagagcgt cgtggtgccc tatgagccac
cacaggtggg gacggaattc 600accaccatcc tgtacaactt catgtgtaac agcagctgtg
tagggggcat gaaccggcgg 660cccatcctca tcatcatcac cctggagatg cgggatgggc
aggtgctggg ccgccggtcc 720tttgagggcc gcatctgcgc ctgtcctggc cgcgaccgaa
aagctgatga ggaccactac 780cgggagcagc aggccctgaa cgagagctcc gccaagaacg
gggccgccag caagcgtgcc 840ttcaagcaga gcccccctgc cgtccccgcc cttggtgccg
gtgtgaagaa gcggcggcat 900ggagacgagg acacgtacta ccttcaggtg cgaggccggg
agaactttga gatcctgatg 960aagctgaaag agagcctgga gctgatggag ttggtgccgc
agccactggt ggactcctat 1020cggcagcagc agcagctcct acagaggccg ccccgggatg
ctcaacaacc atggccacgc 1080agtgccagcc aacggcgaga tgagcagcag ccacagcgcc
cagtccatgg tctcggggtc 1140ccactgcact ccgccacccc cctaccacgc cgaccccagc
ctcgtcagga cctgggggcc 1200ctgaagatcc ccgagcagta ccgcatgacc atctggcggg
gcctgcagga cctgaagcag 1260ggccacgact acagcaccgc gcagcagctg ctccgctcta
gcaacgcggc caccatctcc 1320atcggcggct caggggaact gcagcgccag cgggtcatgg
aggccgtgca cttccgcgtg 1380cgccacacca tcaccatccc caaccgcggc ggcccaggcg
gcggccctga cgagtgggcg 1440gacttcggct tcgacctgcc cgactgcaag gcccgcaagc
agcccatcaa ggaggagttc 1500acggaggccg agatccactg a
152115506PRTHomo sapiens 15Met Leu Tyr Val Gly Asp
Pro Ala Arg His Leu Ala Thr Ala Gln Phe1 5
10 15Asn Leu Leu Ser Ser Thr Met Asp Gln Met Ser Ser
Arg Ala Ala Ser 20 25 30Ala
Ser Pro Tyr Thr Pro Glu His Ala Ala Ser Val Pro Thr His Ser 35
40 45Pro Tyr Ala Gln Pro Ser Ser Thr Phe
Asp Thr Met Ser Pro Ala Pro 50 55
60Val Ile Pro Ser Asn Thr Asp Tyr Pro Gly Pro His His Phe Glu Val65
70 75 80Thr Phe Gln Gln Ser
Ser Thr Ala Lys Ser Ala Thr Trp Thr Tyr Ser 85
90 95Pro Leu Leu Lys Lys Leu Tyr Cys Gln Ile Ala
Lys Thr Cys Pro Ile 100 105
110Gln Ile Lys Val Ser Thr Pro Pro Pro Pro Gly Thr Ala Ile Arg Ala
115 120 125Met Pro Val Tyr Lys Lys Ala
Glu His Val Thr Asp Val Val Lys Arg 130 135
140Cys Pro Asn His Glu Leu Gly Arg Asp Phe Asn Glu Gly Gln Ser
Ala145 150 155 160Pro Ala
Ser His Leu Ile Arg Val Glu Gly Asn Asn Leu Ser Gln Tyr
165 170 175Val Asp Asp Pro Val Thr Gly
Arg Gln Ser Val Val Val Pro Tyr Glu 180 185
190Pro Pro Gln Val Gly Thr Glu Phe Thr Thr Ile Leu Tyr Asn
Phe Met 195 200 205Cys Asn Ser Ser
Cys Val Gly Gly Met Asn Arg Arg Pro Ile Leu Ile 210
215 220Ile Ile Thr Leu Glu Met Arg Asp Gly Gln Val Leu
Gly Arg Arg Ser225 230 235
240Phe Glu Gly Arg Ile Cys Ala Cys Pro Gly Arg Asp Arg Lys Ala Asp
245 250 255Glu Asp His Tyr Arg
Glu Gln Gln Ala Leu Asn Glu Ser Ser Ala Lys 260
265 270Asn Gly Ala Ala Ser Lys Arg Ala Phe Lys Gln Ser
Pro Pro Ala Val 275 280 285Pro Ala
Leu Gly Ala Gly Val Lys Lys Arg Arg His Gly Asp Glu Asp 290
295 300Thr Tyr Tyr Leu Gln Val Arg Gly Arg Glu Asn
Phe Glu Ile Leu Met305 310 315
320Lys Leu Lys Glu Ser Leu Glu Leu Met Glu Leu Val Pro Gln Pro Leu
325 330 335Val Asp Ser Tyr
Arg Gln Gln Gln Gln Leu Leu Gln Arg Pro Pro Arg 340
345 350Asp Ala Gln Gln Pro Trp Pro Arg Ser Ala Ser
Gln Arg Arg Asp Glu 355 360 365Gln
Gln Pro Gln Arg Pro Val His Gly Leu Gly Val Pro Leu His Ser 370
375 380Ala Thr Pro Leu Pro Arg Arg Pro Gln Pro
Arg Gln Asp Leu Gly Ala385 390 395
400Leu Lys Ile Pro Glu Gln Tyr Arg Met Thr Ile Trp Arg Gly Leu
Gln 405 410 415Asp Leu Lys
Gln Gly His Asp Tyr Ser Thr Ala Gln Gln Leu Leu Arg 420
425 430Ser Ser Asn Ala Ala Thr Ile Ser Ile Gly
Gly Ser Gly Glu Leu Gln 435 440
445Arg Gln Arg Val Met Glu Ala Val His Phe Arg Val Arg His Thr Ile 450
455 460Thr Ile Pro Asn Arg Gly Gly Pro
Gly Gly Gly Pro Asp Glu Trp Ala465 470
475 480Asp Phe Gly Phe Asp Leu Pro Asp Cys Lys Ala Arg
Lys Gln Pro Ile 485 490
495Lys Glu Glu Phe Thr Glu Ala Glu Ile His 500
505161870DNAHomo sapiensCDS(104)..(1867) 16tgcccggggc tgcgacggct
gcagggaacc agacagcacc tacttcgacc ttccccagtc 60aagccggggg aataatgagg
tggtgggcgg aacggattcc agc atg gac gtc ttc 115
Met Asp Val Phe
1cac ctg gag ggc atg act aca tct gtc atg gcc cag ttc aat ctg
ctg 163His Leu Glu Gly Met Thr Thr Ser Val Met Ala Gln Phe Asn Leu
Leu5 10 15 20agc agc
acc atg gac cag atg agc agc cgc gcg gcc tcg gcc agc ccc 211Ser Ser
Thr Met Asp Gln Met Ser Ser Arg Ala Ala Ser Ala Ser Pro 25
30 35tac acc cca gag cac gcc gcc agc
gtg ccc acc cac tcg ccc tac gca 259Tyr Thr Pro Glu His Ala Ala Ser
Val Pro Thr His Ser Pro Tyr Ala 40 45
50caa ccc agc tcc acc ttc gac acc atg tcg ccg gcg cct gtc atc
ccc 307Gln Pro Ser Ser Thr Phe Asp Thr Met Ser Pro Ala Pro Val Ile
Pro 55 60 65tcc aac acc gac tac
ccc gga ccc cac cac ttt gag gtc act ttc cag 355Ser Asn Thr Asp Tyr
Pro Gly Pro His His Phe Glu Val Thr Phe Gln 70 75
80cag tcc agc acg gcc aag tca gcc acc tgg acg tac tcc ccg
ctc ttg 403Gln Ser Ser Thr Ala Lys Ser Ala Thr Trp Thr Tyr Ser Pro
Leu Leu85 90 95 100aag
aaa ctc tac tgc cag atc gcc aag aca tgc ccc atc cag atc aag 451Lys
Lys Leu Tyr Cys Gln Ile Ala Lys Thr Cys Pro Ile Gln Ile Lys
105 110 115gtg tcc acc ccg cca ccc cca
ggc act gcc atc cgg gcc atg cct gtt 499Val Ser Thr Pro Pro Pro Pro
Gly Thr Ala Ile Arg Ala Met Pro Val 120 125
130tac aag aaa gcg gag cac gtg acc gac gtc gtg aaa cgc tgc
ccc aac 547Tyr Lys Lys Ala Glu His Val Thr Asp Val Val Lys Arg Cys
Pro Asn 135 140 145cac gag ctc ggg
agg gac ttc aac gaa gga cag tct gct cca gcc agc 595His Glu Leu Gly
Arg Asp Phe Asn Glu Gly Gln Ser Ala Pro Ala Ser 150
155 160cac ctc atc cgc gtg gaa ggc aat aat ctc tcg cag
tat gtg gat gac 643His Leu Ile Arg Val Glu Gly Asn Asn Leu Ser Gln
Tyr Val Asp Asp165 170 175
180cct gtc acc ggc agg cag agc gtc gtg gtg ccc tat gag cca cca cag
691Pro Val Thr Gly Arg Gln Ser Val Val Val Pro Tyr Glu Pro Pro Gln
185 190 195gtg ggg acg gaa ttc
acc acc atc ctg tac aac ttc atg tgt aac agc 739Val Gly Thr Glu Phe
Thr Thr Ile Leu Tyr Asn Phe Met Cys Asn Ser 200
205 210agc tgt gta ggg ggc atg aac cgg cgg ccc atc ctc
atc atc atc acc 787Ser Cys Val Gly Gly Met Asn Arg Arg Pro Ile Leu
Ile Ile Ile Thr 215 220 225ctg gag
atg cgg gat ggg cag gtg ctg ggc cgc cgg tcc ttt gag ggc 835Leu Glu
Met Arg Asp Gly Gln Val Leu Gly Arg Arg Ser Phe Glu Gly 230
235 240cgc atc tgc gcc tgt cct ggc cgc gac cga aaa
gct gat gag gac cac 883Arg Ile Cys Ala Cys Pro Gly Arg Asp Arg Lys
Ala Asp Glu Asp His245 250 255
260tac cgg gag cag cag gcc ctg aac gag agc tcc gcc aag aac ggg gcc
931Tyr Arg Glu Gln Gln Ala Leu Asn Glu Ser Ser Ala Lys Asn Gly Ala
265 270 275gcc agc aag cgt gcc
ttc aag cag agc ccc cct gcc gtc ccc gcc ctt 979Ala Ser Lys Arg Ala
Phe Lys Gln Ser Pro Pro Ala Val Pro Ala Leu 280
285 290ggt gcc ggt gtg aag aag cgg cgg cat gga gac gag
gac acg tac tac 1027Gly Ala Gly Val Lys Lys Arg Arg His Gly Asp Glu
Asp Thr Tyr Tyr 295 300 305ctt cag
gtg cga ggc cgg gag aac ttt gag atc ctg atg aag ctg aaa 1075Leu Gln
Val Arg Gly Arg Glu Asn Phe Glu Ile Leu Met Lys Leu Lys 310
315 320gag agc ctg gag ctg atg gag ttg gtg ccg cag
cca ctg gtg gac tcc 1123Glu Ser Leu Glu Leu Met Glu Leu Val Pro Gln
Pro Leu Val Asp Ser325 330 335
340tat cgg cag cag cag cag ctc cta cag agg ccg agt cac cta cag ccc
1171Tyr Arg Gln Gln Gln Gln Leu Leu Gln Arg Pro Ser His Leu Gln Pro
345 350 355ccg tcc tac ggg ccg
gtc ctc tcg ccc atg aac aag gtg cac ggg ggc 1219Pro Ser Tyr Gly Pro
Val Leu Ser Pro Met Asn Lys Val His Gly Gly 360
365 370atg aac aag ctg ccc tcc gtc aac cag ctg gtg ggc
cag cct ccc ccg 1267Met Asn Lys Leu Pro Ser Val Asn Gln Leu Val Gly
Gln Pro Pro Pro 375 380 385cac agt
tcg gca gct aca ccc aac ctg ggg ccc gtg ggc ccc ggg atg 1315His Ser
Ser Ala Ala Thr Pro Asn Leu Gly Pro Val Gly Pro Gly Met 390
395 400ctc aac aac cat ggc cac gca gtg cca gcc aac
ggc gag atg agc agc 1363Leu Asn Asn His Gly His Ala Val Pro Ala Asn
Gly Glu Met Ser Ser405 410 415
420agc cac agc gcc cag tcc atg gtc tcg ggg tcc cac tgc act ccg cca
1411Ser His Ser Ala Gln Ser Met Val Ser Gly Ser His Cys Thr Pro Pro
425 430 435ccc ccc tac cac gcc
gac ccc agc ctc gtc agt ttt tta aca gga ttg 1459Pro Pro Tyr His Ala
Asp Pro Ser Leu Val Ser Phe Leu Thr Gly Leu 440
445 450ggg tgt cca aac tgc atc gag tat ttc acc tcc caa
ggg tta cag agc 1507Gly Cys Pro Asn Cys Ile Glu Tyr Phe Thr Ser Gln
Gly Leu Gln Ser 455 460 465att tac
cac ctg cag aac ctg acc att gag gac ctg ggg gcc ctg aag 1555Ile Tyr
His Leu Gln Asn Leu Thr Ile Glu Asp Leu Gly Ala Leu Lys 470
475 480atc ccc gag cag tac cgc atg acc atc tgg cgg
ggc ctg cag gac ctg 1603Ile Pro Glu Gln Tyr Arg Met Thr Ile Trp Arg
Gly Leu Gln Asp Leu485 490 495
500aag cag ggc cac gac tac agc acc gcg cag cag ctg ctc cgc tct agc
1651Lys Gln Gly His Asp Tyr Ser Thr Ala Gln Gln Leu Leu Arg Ser Ser
505 510 515aac gcg gcc acc atc
tcc atc ggc ggc tca ggg gaa ctg cag cgc cag 1699Asn Ala Ala Thr Ile
Ser Ile Gly Gly Ser Gly Glu Leu Gln Arg Gln 520
525 530cgg gtc atg gag gcc gtg cac ttc cgc gtg cgc cac
acc atc acc atc 1747Arg Val Met Glu Ala Val His Phe Arg Val Arg His
Thr Ile Thr Ile 535 540 545ccc aac
cgc ggc ggc cca ggc ggc ggc cct gac gag tgg gcg gac ttc 1795Pro Asn
Arg Gly Gly Pro Gly Gly Gly Pro Asp Glu Trp Ala Asp Phe 550
555 560ggc ttc gac ctg ccc gac tgc aag gcc cgc aag
cag ccc atc aag gag 1843Gly Phe Asp Leu Pro Asp Cys Lys Ala Arg Lys
Gln Pro Ile Lys Glu565 570 575
580gag ttc acg gag gcc gag atc cac tga
1870Glu Phe Thr Glu Ala Glu Ile His 58517588PRTHomo
sapiens 17Met Asp Val Phe His Leu Glu Gly Met Thr Thr Ser Val Met Ala
Gln1 5 10 15Phe Asn Leu
Leu Ser Ser Thr Met Asp Gln Met Ser Ser Arg Ala Ala 20
25 30Ser Ala Ser Pro Tyr Thr Pro Glu His Ala
Ala Ser Val Pro Thr His 35 40
45Ser Pro Tyr Ala Gln Pro Ser Ser Thr Phe Asp Thr Met Ser Pro Ala 50
55 60Pro Val Ile Pro Ser Asn Thr Asp Tyr
Pro Gly Pro His His Phe Glu65 70 75
80Val Thr Phe Gln Gln Ser Ser Thr Ala Lys Ser Ala Thr Trp
Thr Tyr 85 90 95Ser Pro
Leu Leu Lys Lys Leu Tyr Cys Gln Ile Ala Lys Thr Cys Pro 100
105 110Ile Gln Ile Lys Val Ser Thr Pro Pro
Pro Pro Gly Thr Ala Ile Arg 115 120
125Ala Met Pro Val Tyr Lys Lys Ala Glu His Val Thr Asp Val Val Lys
130 135 140Arg Cys Pro Asn His Glu Leu
Gly Arg Asp Phe Asn Glu Gly Gln Ser145 150
155 160Ala Pro Ala Ser His Leu Ile Arg Val Glu Gly Asn
Asn Leu Ser Gln 165 170
175Tyr Val Asp Asp Pro Val Thr Gly Arg Gln Ser Val Val Val Pro Tyr
180 185 190Glu Pro Pro Gln Val Gly
Thr Glu Phe Thr Thr Ile Leu Tyr Asn Phe 195 200
205Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg Arg Pro
Ile Leu 210 215 220Ile Ile Ile Thr Leu
Glu Met Arg Asp Gly Gln Val Leu Gly Arg Arg225 230
235 240Ser Phe Glu Gly Arg Ile Cys Ala Cys Pro
Gly Arg Asp Arg Lys Ala 245 250
255Asp Glu Asp His Tyr Arg Glu Gln Gln Ala Leu Asn Glu Ser Ser Ala
260 265 270Lys Asn Gly Ala Ala
Ser Lys Arg Ala Phe Lys Gln Ser Pro Pro Ala 275
280 285Val Pro Ala Leu Gly Ala Gly Val Lys Lys Arg Arg
His Gly Asp Glu 290 295 300Asp Thr Tyr
Tyr Leu Gln Val Arg Gly Arg Glu Asn Phe Glu Ile Leu305
310 315 320Met Lys Leu Lys Glu Ser Leu
Glu Leu Met Glu Leu Val Pro Gln Pro 325
330 335Leu Val Asp Ser Tyr Arg Gln Gln Gln Gln Leu Leu
Gln Arg Pro Ser 340 345 350His
Leu Gln Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Met Asn Lys 355
360 365Val His Gly Gly Met Asn Lys Leu Pro
Ser Val Asn Gln Leu Val Gly 370 375
380Gln Pro Pro Pro His Ser Ser Ala Ala Thr Pro Asn Leu Gly Pro Val385
390 395 400Gly Pro Gly Met
Leu Asn Asn His Gly His Ala Val Pro Ala Asn Gly 405
410 415Glu Met Ser Ser Ser His Ser Ala Gln Ser
Met Val Ser Gly Ser His 420 425
430Cys Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser Leu Val Ser Phe
435 440 445Leu Thr Gly Leu Gly Cys Pro
Asn Cys Ile Glu Tyr Phe Thr Ser Gln 450 455
460Gly Leu Gln Ser Ile Tyr His Leu Gln Asn Leu Thr Ile Glu Asp
Leu465 470 475 480Gly Ala
Leu Lys Ile Pro Glu Gln Tyr Arg Met Thr Ile Trp Arg Gly
485 490 495Leu Gln Asp Leu Lys Gln Gly
His Asp Tyr Ser Thr Ala Gln Gln Leu 500 505
510Leu Arg Ser Ser Asn Ala Ala Thr Ile Ser Ile Gly Gly Ser
Gly Glu 515 520 525Leu Gln Arg Gln
Arg Val Met Glu Ala Val His Phe Arg Val Arg His 530
535 540Thr Ile Thr Ile Pro Asn Arg Gly Gly Pro Gly Gly
Gly Pro Asp Glu545 550 555
560Trp Ala Asp Phe Gly Phe Asp Leu Pro Asp Cys Lys Ala Arg Lys Gln
565 570 575Pro Ile Lys Glu Glu
Phe Thr Glu Ala Glu Ile His 580
585181817DNAHomo sapiens 18atggcccagt ccaccgccac ctcccctgat gggggcacca
cgtttgagca cctctggagc 60tctctggaac cagacagcac ctacttcgac cttccccagt
caagccgggg gaataatgag 120gtggtgggcg gaacggattc cagcatggac gtcttccacc
tggagggcat gactacatct 180gtcatggccc agttcaatct gctgagcagc accatggacc
agatgagcag ccgcgcggcc 240tcggccagcc cctacacccc agagcacgcc gccagcgtgc
ccacccactc gccctacgca 300caacccagct ccaccttcga caccatgtcg ccggcgcctg
tcatcccctc caacaccgac 360taccccggac cccaccactt tgaggtcact ttccagcagt
ccagcacggc caagtcagcc 420acctggacgt actccccgct cttgaagaaa ctctactgcc
agatcgccaa gacatgcccc 480atccagatca aggtgtccac cccgccaccc ccaggcactg
ccatccgggc catgcctgtt 540tacaagaaag cggagcacgt gaccgacgtc gtgaaacgct
gccccaacca cgagctcggg 600agggacttca acgaaggaca gtctgctcca gccagccacc
tcatccgcgt ggaaggcaat 660aatctctcgc agtatgtgga tgaccctgtc accggcaggc
agagcgtcgt ggtgccctat 720gagccaccac aggtggggac ggaattcacc accatcctgt
acaacttcat gtgtaacagc 780agctgtgtag ggggcatgaa ccggcggccc atcctcatca
tcatcaccct ggagatgcgg 840gatgggcagg tgctgggccg ccggtccttt gagggccgca
tctgcgcctg tcctggccgc 900gaccgaaaag ctgatgagga ccactaccgg gagcagcagg
ccctgaacga gagctccgcc 960aagaacgggg ccgccagcaa gcgtgccttc aagcagagcc
cccctgccgt ccccgccctt 1020ggtgccggtg tgaagaagcg gcggcatgga gacgaggaca
cgtactacct tcaggtgcga 1080ggccgggaga actttgagat cctgatgaag ctgaaagaga
gcctggagct gatggagttg 1140gtgccgcagc cactggtgga ctcctatcgg cagcagcagc
agctcctaca gaggccgagt 1200cacctacagc ccccgtccta cgggccggtc ctctcgccca
tgaacaaggt gcacgggggc 1260atgaacaagc tgccctccgt caaccagctg gtgggccagc
ctcccccgca cagttcggca 1320gctacaccca acctggggcc cgtgggcccc gggatgctca
acaaccatgg ccacgcagtg 1380ccagccaacg gcgagatgag cagcagccac agcgcccagt
ccatggtctc ggggtcccac 1440tgcactccgc caccccccta ccacgccgac cccagcctcg
tcaggacctg ggggccctga 1500agatccccga gcagtaccgc atgaccatct ggcggggcct
gcaggacctg aagcagggcc 1560acgactacag caccgcgcag cagctgctcc gctctagcaa
cgcggccacc atctccatcg 1620gcggctcagg ggaactgcag cgccagcggg tcatggaggc
cgtgcacttc cgcgtgcgcc 1680acaccatcac catccccaac cgcggcggcc caggcggcgg
ccctgacgag tgggcggact 1740tcggcttcga cctgcccgac tgcaaggccc gcaagcagcc
catcaaggag gagttcacgg 1800aggccgagat ccactga
181719499PRTHomo sapiens 19Met Ala Gln Ser Thr Ala
Thr Ser Pro Asp Gly Gly Thr Thr Phe Glu1 5
10 15His Leu Trp Ser Ser Leu Glu Pro Asp Ser Thr Tyr
Phe Asp Leu Pro 20 25 30Gln
Ser Ser Arg Gly Asn Asn Glu Val Val Gly Gly Thr Asp Ser Ser 35
40 45Met Asp Val Phe His Leu Glu Gly Met
Thr Thr Ser Val Met Ala Gln 50 55
60Phe Asn Leu Leu Ser Ser Thr Met Asp Gln Met Ser Ser Arg Ala Ala65
70 75 80Ser Ala Ser Pro Tyr
Thr Pro Glu His Ala Ala Ser Val Pro Thr His 85
90 95Ser Pro Tyr Ala Gln Pro Ser Ser Thr Phe Asp
Thr Met Ser Pro Ala 100 105
110Pro Val Ile Pro Ser Asn Thr Asp Tyr Pro Gly Pro His His Phe Glu
115 120 125Val Thr Phe Gln Gln Ser Ser
Thr Ala Lys Ser Ala Thr Trp Thr Tyr 130 135
140Ser Pro Leu Leu Lys Lys Leu Tyr Cys Gln Ile Ala Lys Thr Cys
Pro145 150 155 160Ile Gln
Ile Lys Val Ser Thr Pro Pro Pro Pro Gly Thr Ala Ile Arg
165 170 175Ala Met Pro Val Tyr Lys Lys
Ala Glu His Val Thr Asp Val Val Lys 180 185
190Arg Cys Pro Asn His Glu Leu Gly Arg Asp Phe Asn Glu Gly
Gln Ser 195 200 205Ala Pro Ala Ser
His Leu Ile Arg Val Glu Gly Asn Asn Leu Ser Gln 210
215 220Tyr Val Asp Asp Pro Val Thr Gly Arg Gln Ser Val
Val Val Pro Tyr225 230 235
240Glu Pro Pro Gln Val Gly Thr Glu Phe Thr Thr Ile Leu Tyr Asn Phe
245 250 255Met Cys Asn Ser Ser
Cys Val Gly Gly Met Asn Arg Arg Pro Ile Leu 260
265 270Ile Ile Ile Thr Leu Glu Met Arg Asp Gly Gln Val
Leu Gly Arg Arg 275 280 285Ser Phe
Glu Gly Arg Ile Cys Ala Cys Pro Gly Arg Asp Arg Lys Ala 290
295 300Asp Glu Asp His Tyr Arg Glu Gln Gln Ala Leu
Asn Glu Ser Ser Ala305 310 315
320Lys Asn Gly Ala Ala Ser Lys Arg Ala Phe Lys Gln Ser Pro Pro Ala
325 330 335Val Pro Ala Leu
Gly Ala Gly Val Lys Lys Arg Arg His Gly Asp Glu 340
345 350Asp Thr Tyr Tyr Leu Gln Val Arg Gly Arg Glu
Asn Phe Glu Ile Leu 355 360 365Met
Lys Leu Lys Glu Ser Leu Glu Leu Met Glu Leu Val Pro Gln Pro 370
375 380Leu Val Asp Ser Tyr Arg Gln Gln Gln Gln
Leu Leu Gln Arg Pro Ser385 390 395
400His Leu Gln Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Met Asn
Lys 405 410 415Val His Gly
Gly Met Asn Lys Leu Pro Ser Val Asn Gln Leu Val Gly 420
425 430Gln Pro Pro Pro His Ser Ser Ala Ala Thr
Pro Asn Leu Gly Pro Val 435 440
445Gly Pro Gly Met Leu Asn Asn His Gly His Ala Val Pro Ala Asn Gly 450
455 460Glu Met Ser Ser Ser His Ser Ala
Gln Ser Met Val Ser Gly Ser His465 470
475 480Cys Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser
Leu Val Arg Thr 485 490
495Trp Gly Pro2017DNAArtificialprimer 20gcgagctgcc ctcggag
172119DNAArtificialantisense primer
21ggttctgcag gtgactcag
192218DNAArtificialprimer 22gccatgcctg tctacaag
182318DNAArtificialantisense primer 23accagctggt
tgacggag
182421DNAArtificialprimer 24gtcaaccagc tggtgggcca g
212516DNAArtificialantisense primer 25gtggatctcg
gcctcc
162617DNAArtificialprimer 26aggccggcgt ggggaag
172719DNAArtificialantisense primer 27cttggcgatc
tggcagtag
192817DNAArtificialprimer 28gcggccacga ccgtgac
172918DNAArtificialantisense primer 29ggcagcttgg
gtctctgg
183018DNAArtificialprimer 30ctgtacgtcg gtgacccc
183118DNAArtificialantisense primer 31tcagtggatc
tcggcctc
183218DNAArtificialprimer 32aggggacgca gcgaaacc
183319DNAArtificialantisense primer 33ccatcagctc
caggctctc
193418DNAArtificialantisense primer 34ccaggacagg cgcagatg
183519DNAArtificialantisense primer
35gatgaggtgg ctggctgga
193619DNAArtificialantisense primer 36tggtcaggtt ctgcaggtg
193718DNAArtificialprimer 37cacctactcc
agggatgc
183821DNAArtificialantisense primer 38aggaaaatag aagcgtcagt c
213918DNAArtificialprimer 39caggcccact
tgcctgcc
184019DNAArtificialantisense primer 40ctgtccccaa gctgatgag
194115DNAArtificialprimer 41cccccccccc
ccccd
154216DNAArtificialprimer 42cccccccccc cccccd
16431400DNACebus apella 43ggggctccgg ggacacttgg
cgtccgggct ggaagcgtgc tttccaagac ggtgacacgc 60ttccctgagg attggcagcc
agactgctta cgggtcactg ccatggagga gccgcagtca 120gatcccagca tcgagccccc
tctgagtcag gaaacatttt cagacctatg gaaactactt 180cctgaaaaca acgttctgtc
ccccttgccg tcccaagcgg tggatgattt gatgctctct 240ccggatgatc ttgcacaatg
gttaactgaa gacccaggtc cagatgaagc tcccagaatg 300tcagaggctg ctccccacat
ggcccccaca ccagcagctc ctacaccggc ggcccctgca 360ccagccccct cctggcccct
gtcatcctct gtcccttccc agaaaaccta ccacggcagc 420tacggtttcc gtctgggctt
cctgcattct ggaacagcca agtctgtgac ttgcacgtac 480tcccctgacc tcaacaagat
gttttgccag ctggccaaga cctgccccgt gcagctgtgg 540gttgattcca cacccccgcc
cggcagccgc gtccgcgcca tggccatcta caagcagtca 600cagcacatga ctgaggtcgt
gaggcgctgc ccccaccatg agcgctgctc agacagcgat 660ggactggccc ctcctcaaca
tcttatccga gtggaaggaa atttgcgtgt ggagtattcg 720gatgacagaa acacttttcg
acatagtgtg gtggtgccct atgagccgcc tgaggttggc 780tctgactgta ccaccatcca
ctacaactac atgtgtaaca gttcctgcat gggcggcatg 840aaccggaggc ccatcctcac
aattatcaca ctggaagact ccagtggtaa tctactggga 900cggaacagct ttgaggtgcg
agtttgtgcc tgtcctggga gagaccggcg cacagaggaa 960gagaatttcc gcaagaaagg
ggagccttgc cacgagctgc cccctgggag cactaagcga 1020gcactgccca acaacaccag
ctcctctccc cagccaaaga agaaaccact ggatggagaa 1080tatttcaccc ttcagatccg
cgggcgtgag cgcttcgaga tgttccgaga gctgaatgag 1140gccttggaac tcaaggatgc
ccaggctggg aaagagccag cggggagcag ggctcactcc 1200agccacctga agtccaagaa
ggggcaatct acctcccgcc ataaaaaatt catgttcaag 1260acagaggggc ctgactcaga
ctgacattct cagcttcttg ttcccccact gagcctccca 1320cccccatctc tccctcccct
gccattttga gttctgggtc tttaaaccct tgcttgcaat 1380aggtgtgtgt cagaagcaaa
140044393PRTCebus apella
44Met Glu Glu Pro Gln Ser Asp Pro Ser Ile Glu Pro Pro Leu Ser Gln1
5 10 15Glu Thr Phe Ser Asp Leu
Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 20 25
30Ser Pro Leu Pro Ser Gln Ala Val Asp Asp Leu Met Leu
Ser Pro Asp 35 40 45Asp Leu Ala
Gln Trp Leu Thr Glu Asp Pro Gly Pro Asp Glu Ala Pro 50
55 60Arg Met Ser Glu Ala Ala Pro His Met Ala Pro Thr
Pro Ala Ala Pro65 70 75
80Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Trp Pro Leu Ser Ser Ser
85 90 95Val Pro Ser Gln Lys Thr
Tyr His Gly Ser Tyr Gly Phe Arg Leu Gly 100
105 110Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys
Thr Tyr Ser Pro 115 120 125Asp Leu
Asn Lys Met Phe Cys Gln Leu Ala Lys Thr Cys Pro Val Gln 130
135 140Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Ser
Arg Val Arg Ala Met145 150 155
160Ala Ile Tyr Lys Gln Ser Gln His Met Thr Glu Val Val Arg Arg Cys
165 170 175Pro His His Glu
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gln 180
185 190His Leu Ile Arg Val Glu Gly Asn Leu Arg Val
Glu Tyr Ser Asp Asp 195 200 205Arg
Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu 210
215 220Val Gly Ser Asp Cys Thr Thr Ile His Tyr
Asn Tyr Met Cys Asn Ser225 230 235
240Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
Thr 245 250 255Leu Glu Asp
Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val 260
265 270Arg Val Cys Ala Cys Pro Gly Arg Asp Arg
Arg Thr Glu Glu Glu Asn 275 280
285Phe Arg Lys Lys Gly Glu Pro Cys His Glu Leu Pro Pro Gly Ser Thr 290
295 300Lys Arg Ala Leu Pro Asn Asn Thr
Ser Ser Ser Pro Gln Pro Lys Lys305 310
315 320Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Ile
Arg Gly Arg Glu 325 330
335Arg Phe Glu Met Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp
340 345 350Ala Gln Ala Gly Lys Glu
Pro Ala Gly Ser Arg Ala His Ser Ser His 355 360
365Leu Lys Ser Lys Lys Gly Gln Ser Thr Ser Arg His Lys Lys
Phe Met 370 375 380Phe Lys Thr Glu Gly
Pro Asp Ser Asp385 39045393PRTHomo sapiens 45Met Glu Glu
Pro Gln Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gln1 5
10 15Glu Thr Phe Ser Asp Leu Trp Lys Leu
Leu Pro Glu Asn Asn Val Leu 20 25
30Ser Pro Leu Pro Ser Gln Ala Met Asp Asp Leu Met Leu Ser Pro Asp
35 40 45Asp Ile Glu Gln Trp Phe Thr
Glu Asp Pro Gly Pro Asp Glu Ala Pro 50 55
60Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala Pro Ala Ala Pro65
70 75 80Thr Pro Ala Ala
Pro Ala Pro Ala Pro Ser Trp Pro Leu Ser Ser Ser 85
90 95Val Pro Ser Gln Lys Thr Tyr Gln Gly Ser
Tyr Gly Phe Arg Leu Gly 100 105
110Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro
115 120 125Ala Leu Asn Lys Met Phe Cys
Gln Leu Ala Lys Thr Cys Pro Val Gln 130 135
140Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala
Met145 150 155 160Ala Ile
Tyr Lys Gln Ser Gln His Met Thr Glu Val Val Arg Arg Cys
165 170 175Pro His His Glu Arg Cys Ser
Asp Ser Asp Gly Leu Ala Pro Pro Gln 180 185
190His Leu Ile Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu
Asp Asp 195 200 205Arg Asn Thr Phe
Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu 210
215 220Val Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr
Met Cys Asn Ser225 230 235
240Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile Thr
245 250 255Leu Glu Asp Ser Ser
Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val 260
265 270Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr
Glu Glu Glu Asn 275 280 285Leu Arg
Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr 290
295 300Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
Pro Gln Pro Lys Lys305 310 315
320Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Ile Arg Gly Arg Glu
325 330 335Arg Phe Glu Met
Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp 340
345 350Ala Gln Gln Gly Lys Glu Pro Gly Gly Arg Ser
Ala His Ser Ser His 355 360 365Leu
Lys Ser Lys Lys Gly Gln Ser Thr Ser Arg His Lys Lys Leu Met 370
375 380Phe Lys Thr Glu Gly Pro Asp Ser Asp385
39046889DNAHomo sapiens 46cacctactcc agggatgccc caggcaggcc
cacttgcctg ccgcccccac cgaggctgtc 60acaggaggac agagcacgag ttcccagggt
gctcaggtgt cattccttcc ttcctgcaga 120gcgagctgcc ctcggaggcc ggcgtgggga
agatggccca gtccaccgcc acctcccctg 180atgggggcac cacgtttgag cacctctgga
gctctctgtg agtgcgcttg gctggccaga 240gctgggggcc cccctgggag gcactctggg
ctagcctcag ccaccttcgc tgggctaact 300gggccagagc aggaggggtg gccccgggag
gactctgggc tagccccagc caccctcact 360gagactttgg gctaaacttg gcaaccctca
ctgggattct gggctagcct cgaccaccct 420tgctgcacta actggaccag agcaggagag
gtggctccac actagtcttg ggctagcctt 480agccaccctc atcagcttgg ggacagggcg
ggtcggaggg gcagggaaga gggactgctg 540ccctaggcct tccctgggga tgcaggacca
aaattcagac tcttttctct ggccagctct 600ggagagggcc catggccagc agaggcccag
aataacagag cccatgactg gctctgcctc 660tctggcactc acagcagccc tggaatggca
ggtggaggac agagatggga tgagagggaa 720tgggaagggc aggagacgta ggcctcacca
ggagtctcag gctagccttg agctctgggc 780ctgggaggta ttggggtgac acccaaactg
gggactgacg cttctatttt cctctccctg 840ccccagggaa ccagacagca cctacttcga
ccttccccag tcaagccgg 8894723DNAArtificialprimer comprising
BamHI site 47gatccgggcc cttttttttt ttt
234820DNAArtificialprimer comprising ApaI site 48aaaaaaaaaa
aaagggcccg
204926DNAArtificialprimer comprising KpnI site 49actggtaccg cgagctgccc
tcggag
265028DNAArtificialantisense primer comprising Xba I site 50gactctagag
gttctgcagg tgactcag
285119DNAArtificialprimer 51gagcatgtga ccgacattg
195230DNAArtificialprimer comprising BamHI site
52tttggatccg tcaaccagct ggtgggccag
305325DNAArtificialantisense primer comprising a Sal I site 53aaagtcgacg
tggatctcgg cctcc
255427DNAArtificialprimer 54tatctcgagc tgtacgtcgg tgacccc
275527DNAArtificialantisense primer 55atatctagat
cagtggatct cggcctc 27
User Contributions:
comments("1"); ?> comment_form("1"); ?>Inventors list |
Agents list |
Assignees list |
List by place |
Classification tree browser |
Top 100 Inventors |
Top 100 Agents |
Top 100 Assignees |
Usenet FAQ Index |
Documents |
Other FAQs |
User Contributions:
Comment about this patent or add new information about this topic: