Patent application title: Hyaluronic Acid Binding Domain-Growth Factor Fusion Protein cDNAs and Fusion Proteins for Cartilage Matrix Preservation and Report
Inventors:
IPC8 Class: AA61K4800FI
USPC Class:
1 1
Class name:
Publication date: 2020-02-06
Patent application number: 20200038521
Abstract:
The present invention provides isolated nucleic acid compounds, amino
acids compounds and related materials, along with methods to make and use
the compounds. In particular, there are provided gene therapy materials
useful for inducing positive physiological responses in tissues, and
fusion proteins encoded by the gene therapy materials.Claims:
1. A fusion protein, comprising: a.) a hyaluronic acid-binding domain
(HAB) of a cartilage matrix protein linked to b.) a conserved region of a
growth factor (GF) protein, wherein said fusion protein is capable of
upregulating glycosaminoglycan (GAG) expression in chondrocytes.
2. The fusion protein of claim 1, which further comprises at least one additional element selected from the group consisting of: signal peptide (SP); linker peptide (linker); protease cleavage site; E peptide; and additional functional peptide.
3. The fusion protein of claim 2, wherein the order of elements, N-terminus to C-terminus is: SP-HAB-GF-E peptide.
4. The fusion protein of claim 2, wherein the order of elements, N-terminus to C-terminus is selected from the group consisting of: SP-HAB-GF; SP-HAB-GF-E peptide; SP-HAB-GF-additional peptide sequence; SP-HAB-linker-GF-additional peptide sequence; SP-HAB-linker-GF-E peptide; SP-HAB-linker-protease cleavage sequence-GF-additional peptide sequence; and SP-HAB-linker-protease cleavage sequence-GF-E peptide.
5. The fusion protein of claim 2, wherein the SP is selected from the group consisting of: aggrecan signal peptide, CD44 signal peptide; link protein signal peptide; TSG-6 signal peptide, versican signal peptide; and other HA-binding protein signal peptide.
6. The fusion protein of claim 1, wherein the HAB is selected from the group consisting of: aggrecan; CD44; link protein; TSG-6; versican; and other HA-binding proteins.
7. The fusion protein of claim 2, wherein the linker is selected from the group consisting of: Linker 1: GGSG (SEQ ID NO: 1); Linker 2: GGSGGGSG (SEQ ID NO: 2); Linker 3: GGSGGGSGGGSG (SEQ ID NO: 3); Linker 4: GGGGS (SEQ ID NO: 4); Linker 5: GGGGSGGGGS (SEQ ID NO: 5); Linker 6: GGGGSGGGGSGGGGS (SEQ ID NO: 6); Linker 7: GGSGGS (SEQ ID NO: 7); and Linker 8: VIGHPIDSE (SEQ ID NO: 8).
8. The fusion protein of claim 2, wherein the protease cleavage site is selected from the group consisting of: SEQ ID NO: 9; SEQ ID NO: 10; SEQ ID NO: 11; SEQ ID NO: 12; and SEQ ID NO: 13.
9. The fusion protein of claim 1, wherein the GF is selected from the group consisting of: IGF-1; BMP2; BMP4; BMP7; FGF2; FGF18; GDF5; TGF-.beta.1 and TGF-.beta.3
10. The fusion protein of claim 1, wherein the CD44 HAB element is a polypeptide having at least 90 percent identity to a polypeptide selected from the group consisting of: amino acids 1-132 of SEQ ID NO: 64; amino acids 1-156 of SEQ ID NO: 68; amino acids 1-178 of SEQ ID NO: 72; and amino acids 1-222 of SEQ ID NO: 76.
11. The fusion protein of claim 1, further comprising: flanking amino acids to the HAB, wherein the number of flanking amino acids is selected from the group consisting of: 5 amino acids; 10 amino acids; 15 amino acids; 20 amino acids; 25 amino acids; 30 amino acids; 35 amino acids; 40 amino acids; 45 amino acids; 50 amino acids; 55 amino acids; and 60 amino acids.
12. The fusion protein of claim 11, wherein the flanking amino acids are at the N terminus of HAB.
13. The fusion protein of claim 11, wherein the flanking amino acids are at the C terminus of HAB.
14. A nucleic acid molecule comprising a nucleic acid sequence that encodes a fusion protein comprising: a.) a hyaluronic acid-binding domain of a cartilage matrix protein (HAB) operably linked to b.) a conserved region of a growth factor protein (GF), wherein said fusion protein is capable of up-regulating glycosaminoglycan (GAG) expression in chondrocytes.
15. The nucleic acid molecule of claim 14, wherein the nucleic acid molecule further comprises at least one additional nucleic acid sequence, wherein the additional nucleic acid sequence is selected from the group consisting of: a nucleic acid sequence that encodes at least one signal peptide (SP); a nucleic acid sequence that encodes at least one linker sequence (linker); a nucleic acid sequence that encodes at least one protease cleavage sequence; a nucleic acid sequence that encodes at least one E peptide; and a nucleic acid sequence that encodes at least one additional functional peptide; wherein the nucleic acids are operably linked so as to express a functional fusion protein.
16. The nucleic acid molecule of claim 15, wherein the order of operably-linked elements, 5' to 3' is selected from the group consisting of: SP-HAB-GF; SP-HAB-GF-additional functional peptide; SP-HAB-GF-E peptide; SP-HAB-linker-GF-additional functional peptide; SP-HAB-linker-GF-E peptide; SP-HAB-linker-protease cleavage sequence-GF-E peptide; and SP-HAB-linker-protease cleavage sequence-GF-additional functional peptide.
17. The nucleic acid molecule of claim 15, wherein the SP is selected from the group consisting of: aggrecan signal peptide; CD44 signal peptide; link protein signal peptide; TSG-6 signal peptide; versican signal peptide; and other HA-binding protein signal peptide.
18. The nucleic acid molecule of claim 14, wherein the HAB comprises a polynucleotide fragment of a nucleotide sequence encoding a protein selected from the group consisting of: aggrecan; CD44; link protein; TSG-6; versican; and other HA-binding proteins.
19. The nucleic acid molecule of claim 15, wherein the linker is selected from the group consisting of: Linker 1: GGSG (SEQ ID NO: 1); Linker 2: GGSGGGSG (SEQ ID NO: 2); Linker 3: GGSGGGSGGGSG (SEQ ID NO: 3); Linker 4: GGGGS (SEQ ID NO: 4); Linker 5: GGGGSGGGGS (SEQ ID NO: 5); Linker 6: GGGGSGGGGSGGGGS (SEQ ID NO: 6); Linker 7: GGSGGS (SEQ ID NO: 7); and Linker 8: VIGHPIDSE (SEQ ID NO: 8).
20. The nucleic acid molecule of claim 15, comprising a cleavage site for a protease selected from the group consisting of: enterokinase (EK); Furin; Factor Xa; Matrix metalloproteinase (MMP); and Aggrecanase.
21. The nucleic acid molecule of claim 14, wherein the GF is selected from the group consisting of: IGF-1; BMP2; BMP4; BMP7; FGF2; FGF18; GDF5; TGF-.beta.1; and TGF-.beta.3.
22. The nucleic acid molecule of claim 14, wherein the HAB element comprises a sequence that is at least 80% identical to at least one sequence selected from the group consisting of: nucleotides 1-222; 1-178; 1-156; and 1-132 of SEQ ID NO: 54.
23. The nucleic acid molecule of claim 14, wherein the nucleic acid molecule encodes a fusion protein comprising HAB and IGF-1 linked by a number of amino acids selected from the group consisting of: at least about 60 amino acids, at least about 50 amino acids, at least about 40 amino acids, at least about 30 amino acids, at least about 20 amino acids, fewer than 20 amino acids; fewer than 15 amino acids; fewer than 10 amino acids; fewer than 5 amino acids; no amino acids.
24. A composition comprising the nucleic acid molecule of claim 14 and a pharmaceutically-acceptable carrier.
25. A composition comprising the fusion protein of claim 1 and a pharmaceutically-acceptable carrier.
26. An expression vector encoding the fusion protein of claim 1, wherein the expression vector is a plasmid or a virus.
27. The expression vector of claim 26, which is an adeno-associated virus plasmid (pAAV).
28. A cell transformed, transfected, or transduced by a vector of claim 26.
29. The cell of claim 28, wherein the cell is a chondrocyte.
30. An animal model comprising the cell of claim 28.
31. A method of producing a fusion protein comprising, expressing a nucleic acid molecule in the cell of claim 28.
32. A method to upregulate glycosaminoglycan expression in at least one chondrocyte, comprising: expressing a nucleic acid molecule in at least one chondrocyte, wherein the nucleic acid molecule comprises a HAB element and a GF element.
33. The method of claim 32 wherein the nucleic acid molecule comprises at least one segment having at least 95% identity to a sequence selected from the group consisting of: SEQ ID NO: 28; SEQ ID NO: 35; SEQ ID NO: 37; SEQ ID NO: 42; SEQ ID NO: 46; SEQ ID NO: 50; SEQ ID NO: 62; SEQ ID NO: 66; SEQ ID NO: 70; and SEQ ID NO: 74.
34. A method to treat a cartilage matrix protein-related pathology or injury in a subject, comprising: administering a fusion protein, wherein the fusion protein comprises: a.) a hyaluronic acid-binding domain (HAB) of a cartilage matrix protein linked to b.) a conserved region of a growth factor (GF) protein, wherein the fusion protein is capable of upregulating glycosaminoglycan (GAG) expression in chondrocytes; upregulating GAG expression in chondrocytes in the subject; and treating a cartilage matrix protein-related pathology or injury.
35. The method of claim 34 wherein the GF is IGF-I.
36. The method of claim 34 wherein the administering is performed by transforming at least one chondrocyte in vivo to express the fusion protein.
37. The method of claim 34 wherein the administering is performed by injection of a pharmaceutical composition comprising the fusion protein, wherein the fusion protein is produced in vitro.
38. The method of claim 34, wherein the cartilage matrix protein-related pathology or injury is selected from the group consisting of: joint stiffness; joint pain; intervertebral disc pain; degenerative disease; facet disease; traumatic cartilage injury; surgical cartilage injury; osteoarthritis; and rheumatoid arthritis.
39. The method of claim 34 further comprising: increasing sports performance in the subject, wherein the sports performance is selected from the group consisting of: increased speed; increased endurance; increased weight-lifting ability; increased flexibility; increased strength; increased resistance to impact; increased concentration; and increased career length.
40. The method of claim 34 further comprising: decreasing GF removal rate from the joint synovium or intervertebral disc synovium of a subject, by introducing the fusion protein to at least one joint or intevertebral disc of a subject and decreasing GF removal rate from the joint synovium or intervertebral disc of the subject.
41. The method of claim 34 further comprising: increasing responsiveness rates of chondrocytes to GF in a subject, by introducing the fusion protein to at least one chondrocyte of a subject and increasing responsiveness rates of chondrocytes to GF in the subject.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a Continuation Application of U.S. application Ser. No. 14/294,305 filed Jun. 3, 2014 which claims priority to U.S. Provisional Application No. 61/830,925, filed on Jun. 4, 2013, the entire disclosure of which is expressly incorporated herein by reference for all purposes.
SEQUENCE LISTING
[0003] The Sequence Listing, filed electronically in ASCII text format and identified as 3619_54880_SEQ_LIST_IURTC-11114.txt, was created on May 29, 2014, is 140,103 bytes in size and is hereby incorporated by reference.
BACKGROUND OF THE INVENTION
[0004] Articular cartilage damage is a major cause of disability in the form of arthritis and joint trauma. Because articular cartilage lacks the ability to effectively repair itself, articular cartilage repair is an unsolved clinical problem.
[0005] Growth factors, such as insulin-like growth factor-I (IGF-1), promote articular chondrocyte reparative activity. When delivered to articular cartilage as potential therapeutic agents, these factors are limited by rapid elimination from the joint, slower uptake by the articular cartilage containing the target cells, and, when present in the cartilage or repair tissue, are subject to diffusion out of the tissue where it is needed.
[0006] Biologic agents are needed that can promote cartilage biosynthesis in vitro and in vivo. A promising candidate for improving articular chondrocyte function is IGF-1. This polypeptide growth factor has been shown to stimulate the synthesis of type-2 collagen and aggrecan, two principle constituents of cartilage matrix, to stimulate the division of articular chondrocytes and to decrease the endogenous catabolic activity of these cells. Therapeutic application of IGF-1 to cartilage repair has been reported in animal models when delivered as a protein, or by gene transfer. However, an unpublished clinical trial of IGF-1 delivery to human knee joints was evidently unsuccessful. A major limitation of current approaches to IGF-1 therapy include: 1) The rapid removal of IGF-1 from the joint through the synovium, 2) the relatively slow diffusion of IGF-1 into the articular cartilage where it can act on the cells, and 3) limited responsiveness of arthritic chondrocytes to the IGF-1. A means of simultaneously providing cells, IGF-1, and of gradual IGF-1 release over time is needed.
[0007] The foregoing example of the related art and limitations related therewith are intended to be illustrative and not exclusive. Other limitations of the related art will become apparent to those of skill in the art upon a reading of the specification and a study of the drawings.
SUMMARY OF THE INVENTION
[0008] The following embodiments and aspects thereof are described and illustrated in conjunction with systems, tool and methods which are meant to be exemplary and illustrative, not limiting in scope. In various embodiments, one or more of the above described problems have been reduced or eliminated, while other embodiments are directed to other improvements.
[0009] The present invention provides isolated nucleic acid molecules comprising a nucleic acid sequence that encodes a fusion protein comprising: a.) a hyaluronic acid-binding domain of a cartilage matrix protein (HAB) operably linked to b.) a conserved region of a growth factor protein (GF), wherein said fusion protein is capable of up-regulating glycosaminoglycan (GAG) expression in chondrocytes.
[0010] Also provided are such isolated nucleic acid molecules herein, wherein the nucleic acid molecule further comprises a nucleic acid sequence that encodes at least one signal peptide (SP).
[0011] Also provided are such isolated nucleic acid molecules herein, wherein the nucleic acid molecule encodes a fusion protein which further comprises at least one linker sequence.
[0012] Also provided are such isolated nucleic acid molecules herein, wherein the nucleic acid molecule encodes a fusion protein which further comprises at least one protease cleavage sequence.
[0013] Also provided are such isolated nucleic acid molecules herein, wherein the nucleic acid molecule encodes a fusion protein which further comprises at least one additional functional peptide.
[0014] Also provided are such isolated nucleic acid molecules herein, wherein the order of operably-linked elements, 5' to 3' is: SP-HAB-GF.
[0015] Also provided are such isolated nucleic acid molecules herein, wherein the order of operably-linked elements, 5' to 3' is: SP-HAB-GF-additional peptide sequence.
[0016] Also provided are such isolated nucleic acid molecules herein, wherein the order of operably-linked elements, 5' to 3' is: SP-HAB-linker-GF-additional peptide sequence.
[0017] Also provided are such isolated nucleic acid molecules herein, wherein the order of operably-linked elements, 5' to 3' is: SP-HAB-linker-protease cleavage sequence-GF-additional peptide sequence.
[0018] Also provided are such isolated nucleic acid molecules herein, wherein the SP is selected from the group consisting of: aggrecan signal peptide; CD44 signal peptide; link protein signal peptide; TSG-6 signal peptide; versican signal peptide; and other HA-binding protein signal peptide.
[0019] Also provided are such isolated nucleic acid molecules herein, wherein the HAB element comprises a polynucleotide fragment of a nucleotide sequence encoding a protein selected from the group consisting of: aggrecan; CD44; link protein; TSG-6; versican; and other HA-binding proteins.
[0020] Also provided are such isolated nucleic acid molecules herein, wherein the linker is selected from the group consisting of: Linker 1: GGSG (SEQ ID NO: 1); Linker 2: GGSGGGSG (SEQ ID NO: 2); Linker 3: GGSGGGSGGGSG (SEQ ID NO: 3); Linker 4: GGGGS (SEQ ID NO: 4); Linker 5: GGGGSGGGGS (SEQ ID NO: 5); Linker 6: GGGGSGGGGSGGGGS (SEQ ID NO: 6); Linker 7: GGSGGS (SEQ ID NO: 7); and Linker 8: VIGHPIDSE (SEQ ID NO: 8).
[0021] Also provided are such isolated amino acid molecules herein, further comprising a cleavage site for a protease selected from the group consisting of: enterokinase (EK); Furin; Factor Xa; Matrix metalloproteinase (MMP); and Aggrecanase.
[0022] Also provided are such isolated nucleic acid molecules herein, wherein the GF is selected from the group consisting of: IGF-1; BMP2; BMP4; BMP7; FGF2; FGF18; GDF5; TGF-.beta.1; TGF-.beta.3; and other growth factors that influence the target cells.
[0023] Also provided are such isolated nucleic acid molecules herein, wherein the additional peptide sequence is selected from the group consisting of: IGF-I signal peptide; and IGF-I E peptide.
[0024] Also provided are such polypeptides, wherein the CD44 HAB element is selected from the group consisting of: amino acids 1-132 of SEQ ID NO: 64; amino acids 1-156 of SEQ ID NO: 68; 1-178 of SEQ ID NO: 72; and 1-222 of SEQ ID NO: 76.
[0025] Also provided are such isolated nucleic acid molecules herein, wherein the nucleic acid molecule encodes a fusion protein comprising HAB and IGF-1 linked by a number of amino acids selected from the group consisting of: at least about 60 amino acids, at least about 50 amino acids, at least about 40 amino acids, at least about 30 amino acids, at least about 20 amino acids, fewer than 20 amino acids; fewer than 15 amino acids; fewer than 10 amino acids; fewer than 5 amino acids; no amino acids.
[0026] Also provided are such isolated nucleic acid molecules herein, wherein the nucleic acid molecule is selected from the group consisting of DNA and RNA.
[0027] Also provided are such isolated nucleic acid molecules herein, wherein the nucleic acid molecule comprises:
[0028] a.) one or more signal peptide (SP);
[0029] b.) one or more hyaluronic acid-binding domain nucleic acid construct (HAB);
[0030] c.) one or more linker sequence (linker sequence);
[0031] d.) one or more protease cleavage sequence (protease cleavage sequence);
[0032] e.) one or more conserved region of a growth factor protein sequence (GF conserved region sequence);
[0033] f.) one or more additional peptide sequence (additional peptide); wherein the nucleic acids are operably linked so as to express a functional fusion protein.
[0034] Also provided are plasmids comprising a nucleic acid molecule herein.
[0035] Also provided are expression vectors comprising a nucleic acid molecule herein.
[0036] Also provided are expression vectors herein, which further comprises at least one regulatory element.
[0037] Also provided are expression vectors herein, wherein the regulatory element is selected from the group consisting of: promoter; repressor; enhancer; activator; and transcription factor.
[0038] Also provided are expression vectors herein, which is a viral vector.
[0039] Also provided are expression vectors herein, which is an adeno-associated virus plasmid (pAAV).
[0040] Also provided are cells transformed, transfected or transduced by a vector herein.
[0041] Also provided are cells herein, which is a chondrocyte.
[0042] Also provided are animal models comprising an expression vector herein.
[0043] Also provided are fusion proteins, comprising: a.) at least one hyaluronic acid-binding domain of a cartilage matrix protein (HAB) linked to b.) at least one conserved region of a growth factor protein (GF), wherein said fusion protein is capable of upregulating glycosaminoglycan (GAG) expression in chondrocytes.
[0044] Also provided are fusion proteins herein, wherein the fusion protein upregulates GAG expression in chondrocytes by 20-50%, 30-70%, 50-100%, 75-200%, 100-300%, or 200-500%.
[0045] Also provided are fusion proteins herein, which further comprise at least one signal peptide (SP).
[0046] Also provided are fusion proteins herein, which further comprise at least one linker peptide.
[0047] Also provided are fusion proteins herein, which further comprise at least one protease cleavage sequence.
[0048] Also provided are fusion proteins herein, which further comprise at least one additional functional peptide.
[0049] Also provided are fusion proteins herein, wherein the order of elements, N-terminus to C-terminus is: SP-HAB-GF.
[0050] Also provided are fusion proteins herein, wherein the order of elements, N-terminus to C-terminus is: SP-HAB-GF-additional peptide sequence.
[0051] Also provided are fusion proteins herein, wherein the order of elements, N-terminus to C-terminus is: SP-HAB-linker-GF-additional peptide sequence.
[0052] Also provided are fusion proteins herein, wherein the order of elements, N-terminus to C-terminus is: SP-HAB-linker-protease cleavage sequence-GF-additional peptide sequence.
[0053] Also provided are fusion proteins herein, wherein the SP is selected from the group consisting of: aggrecan signal peptide, CD44 signal peptide; link protein signal peptide; TSG-6 signal peptide, versican signal peptide; and other HA-binding protein signal peptide.
[0054] Also provided are fusion proteins herein, wherein the HAB is selected from the group consisting of: aggrecan; CD44; link protein; TSG-6; versican; and other HA-binding proteins.
[0055] Also provided are fusion proteins herein, wherein the linker is selected from the group consisting of: Linker 1: GGSG (SEQ ID NO: 1); Linker 2: GGSGGGSG (SEQ ID NO: 2); Linker 3: GGSGGGSGGGSG (SEQ ID NO: 3); Linker 4: GGGGS (SEQ ID NO: 4); Linker 5: GGGGSGGGGS (SEQ ID NO: 5); Linker 6: GGGGSGGGGSGGGGS (SEQ ID NO: 6); Linker 7: GGSGGS (SEQ ID NO: 7); and Linker 8: VIGHPIDSE (SEQ ID NO: 8).
[0056] Also provided are fusion proteins herein, comprising a protease cleavage site selected from the group consisting of: SEQ ID NO: 9; SEQ ID NO: 10; SEQ ID NO: 11; SEQ ID NO: 12; and SEQ ID NO: 13.
[0057] Also provided are fusion proteins herein, wherein the GF is selected from the group consisting of: IGF-1; BMP2; BMP4; BMP7; FGF2; FGF18; GDF5; TGF-.beta.1 and TGF-.beta.3 and other growth factors that influence the target cells.
[0058] Also provided are fusion proteins herein, wherein the additional peptide sequence is selected from the group consisting of: IGF-I signal peptide; and IGF-I E peptide.
[0059] Also provided are fusion proteins herein, wherein the CD44 HAB element is selected from the group consisting of: CD44(132); CD44(156); CD44(178); and CD44(222).
[0060] Also provided are fusion proteins herein, which comprise a number of flanking amino acids to the HAB, wherein the number of flanking amino acids is selected from the group consisting of: 5 amino acids; 10 amino acids; 15 amino acids; 20 amino acids; 25 amino acids; 30 amino acids; 35 amino acids; 40 amino acids; 45 amino acids; 50 amino acids; 55 amino acids; and 60 amino acids.
[0061] Also provided are fusion proteins herein, which comprise a number of flanking amino acids to the HAB, wherein the flanking amino acids are at the N terminus of HAB, and wherein the number of flanking amino acids is selected from the group consisting of: 5 amino acids; 10 amino acids; 15 amino acids; 20 amino acids; 25 amino acids; 30 amino acids; 35 amino acids; 40 amino acids; 45 amino acids; 50 amino acids; 55 amino acids; and 60 amino acids.
[0062] Also provided are fusion proteins herein, which comprise a number of flanking amino acids to the HAB, wherein the flanking amino acids are at the C terminus of HAB, and wherein the number of flanking amino acids is selected from the group consisting of: 5 amino acids; 10 amino acids; 15 amino acids; 20 amino acids; 25 amino acids; 30 amino acids; 35 amino acids; 40 amino acids; 45 amino acids; 50 amino acids; 55 amino acids; and 60 amino acids.
[0063] Also provided are compositions comprising a nucleic acid molecule herein and a pharmaceutically-acceptable carrier.
[0064] Also provided are compositions comprising a fusion protein herein and a pharmaceutically-acceptable carrier.
[0065] Also provided are methods of producing a fusion protein comprising expressing a nucleic acid molecule herein in at least one cell.
[0066] Also provided are methods herein, wherein the cell is selected from the group consisting of prokaryotic cell and eukaryotic cell.
[0067] Also provided are methods herein, wherein the cell is a chondrocyte.
[0068] Also provided are methods to upregulate glycosaminoglycan expression in at least one chondrocyte, comprising expressing a nucleic acid molecule herein in at least one chondrocyte.
[0069] Also provided are methods to increase GF diffusion rates to the articular cartilage of a subject, comprising expressing a nucleic acid molecule herein in at least one chondrocyte in at least one joint of a subject and increasing GF diffusion rates to the articular cartilage of the subject.
[0070] Also provided are methods to increase responsiveness rates of arthritic chondrocytes to GF in a subject, comprising expressing a nucleic acid molecule herein in at least one chondrocyte in at least one arthritic joint of a subject and increasing responsiveness rates of arthritic chondrocytes to GF in the subject.
[0071] Also provided are methods to treat cartilage matrix protein-related pathology in a subject, comprising expressing a nucleic acid molecule herein in at least one chondrocyte in the cartilage matrix of a subject with cartilage matrix protein-related pathology and treating cartilage matrix protein-related pathology in the subject.
[0072] Also provided are methods herein, wherein the cartilage matrix protein-related pathology is selected from the group consisting of: joint stiffness; degenerative disease; facet disease; osteoarthritis; and rheumatoid arthritis.
[0073] Also provided are methods to ameliorating joint pain or intervertebral disc pain in a subject, comprising expressing a nucleic acid molecule herein in at least one painful joint or painful intervertebral disc of a subject and ameliorating joint pain or intervertebral disc pain in the subject.
[0074] Also provided are methods to ameliorate the symptoms of joint injury or intervertebral disc injury in a subject, comprising expressing a nucleic acid molecule herein in at least one injured joint or injured intervertebral disc of a subject and ameliorating the symptoms of joint injury or intervertebral disc injury in the subject.
[0075] Also provided are methods herein, wherein the cartilage matrix, joint, and/or intervetebral disc injury is selected from the group consisting of: traumatic injury; surgical injury; degenerative disease; developmental defect; and work injury.
[0076] Also provided are methods to increase sports performance in an athlete, comprising expressing a nucleic acid molecule herein in at least one joint or intervertebral disc of an athlete and increasing sports performance in the athlete.
[0077] Also provided are methods herein, wherein the sports performance is selected from the group consisting of: increased speed; increased endurance; increased weight-lifting ability; increased flexibility; increased strength; increased resistance to impact; increased concentration; and increased career length.
[0078] Also provided are methods herein, wherein the subject or athlete is a mammal selected from the group consisting of: laboratory animal; companion animal; draft animal; meat animal; and human.
[0079] Also provided are methods herein, wherein the subject or athlete is a mammal selected from the group consisting of: cat; dog; horse; bovine; and human.
[0080] Also provided are methods herein, wherein the cartilage matrix is selected from the group consisting of: toe; ankle; knee; hip; spine; shoulder; neck; elbow; wrist; fingers; and thumb.
[0081] Also provided are methods herein, wherein the cartilage matrix is selected from the group consisting of: cartilage of the nose; and cartilage of the ear.
[0082] Also provided are methods herein, wherein the cartilage matrix is at least one intervertebral disc.
[0083] Also provided are methods to upregulate glycosaminoglycan expression in at least one chondrocyte, comprising introducing a fusion protein herein to at least one chondrocyte.
[0084] Also provided are methods to decrease GF removal rates from the joint synovium or intervertebral disc synovium of a subject, comprising introducing a fusion protein herein in at least one joint or intevertebral disc of a subject and decreasing GF removal rate from the joint synovium or intervertebral disc of the subject.
[0085] Also provided are methods to increase responsiveness rates of chondrocytes to GF in a subject, comprising introducing a fusion protein herein in at least one chondrocyte of a subject and increasing responsiveness rates of chondrocytes to GF in the subject.
[0086] Also provided are methods to ameliorate cartilage matrix protein-related pathology in a subject, comprising introducing a fusion protein herein in the cartilage matrix of a subject with cartilage matrix protein-related pathology and ameliorating cartilage matrix protein-related pathology in the subject.
[0087] Also provided are method herein, wherein the cartilage matrix protein-related pathology is selected from the group consisting of: joint stiffness; degenerative joint disease; facet disease; osteoarthritis; and rheumatoid arthritis.
[0088] Also provided are methods to ameliorate joint pain or intervertebral disc pain in a subject, comprising introducing a fusion protein herein in at least one painful joint or painful intervertebral disc of a subject and ameliorating joint pain or intervetebral disc pain in the subject.
[0089] Also provided are methods to ameliorate the symptoms of joint or intervertebral disc injury in a subject, comprising introducing a fusion protein herein in at least one injured joint or intervertebral disc of a subject and ameliorating the symptoms of joint or intervertebral disc injury in the subject.
[0090] Also provided are methods herein, wherein the joint or intervertebral disc injury is selected from the group consisting of: traumatic injury; surgical injury; degenerative disease; developmental defect; and work injury.
[0091] Also provided are methods to increase sports performance in an athlete, comprising introducing a fusion protein herein in at least one joint or intervertebral disc of an athlete and increasing sports performance in the athlete.
[0092] Also provided are methods herein, wherein the sports performance is selected from the group consisting of: increased speed; increased endurance; increased weight-lifting ability; increased flexibility; increased strength; increased resistance to impact; increased concentration; and increased career length.
[0093] Also provided are methods herein, wherein the subject or athlete is a mammal selected from the group consisting of: laboratory animal; companion animal; draft animal; meat animal; and human.
[0094] Also provided are methods herein, wherein the subject or athlete is a mammal selected from the group consisting of: cat; dog; horse; bovine; and human.
[0095] Also provided are methods herein, wherein the cartilage matrix is selected from the group consisting of: toe; ankle; knee; hip; spine; shoulder; neck; elbow; wrist; fingers; and thumb.
[0096] Also provided are methods herein, wherein the cartilage matrix is selected from the group consisting of: cartilage of the nose; and cartilage of the ear.
[0097] Also provided are methods herein, wherein the cartilage matrix is at least one intervertebral disc.
[0098] Also provided is the invention as shown and described herein.
[0099] The term "joint cavity," "joint space," and "joint capsule" refer to a hollow or fluid-filled place or depression in the place of union or junction between two or more bones of the skeleton, including the space of a synovial joint, enclosed by the synovial membrane and articular cartilages.
[0100] The terms "treat", "treatment," and "treating" and/or "ameliorating" include pathology reduction, reduction in symptoms, preventative (e.g., prophylactic) and palliative care.
[0101] In addition to the exemplary aspects and embodiments described above, further aspects and embodiments will become apparent by reference to the accompanying drawings forming a part of this specification wherein like reference characters designate corresponding parts in the several views.
BRIEF DESCRIPTION OF THE FIGURES
[0102] The patent or application file may contain one or more drawings executed in color and/or one or more photographs. Copies of this patent or patent application publication with color drawing(s) and/or photograph(s) will be provided by the Patent Office upon request and payment of the necessary fee.
[0103] FIGS. 1A-1B. Schematic illustration of three HA binding proteins (aggrecan, link protein and versican) and human IGF-I (FIG. 1A), and HAB-IGF-I fusion proteins (FIG. 1B).
[0104] FIG. 1A: Human aggrecan and versican precursors each contains a signal peptide (SP) and G1 domain thereafter, which is responsible for aggrecan and versican binding to HA. Link protein consists of a signal peptide and a mature link protein peptide, which is homologous to the G1 domains of aggrecan and versican and also functions to bind to HA. Human IGF-I precursor consists of a signal peptide (aa 1-48), mature IGF-I peptide (aa 49-118) and E peptide (aa 119-153).
[0105] FIG. 1B: HAB-IGF-I fusion proteins (AG1-IGF-I, LP-IGF-I and VG1-IGF-I) were made up of human HA binding protein (aggrecan, link protein and versican) and human IGF-I, each box representing a native domain or region of human aggrecan, link protein and versican. HAB-IGF-I fusion protein AG1-IGF-I was created by fusion of aggrecan (aa 1-357) to IGF-I (aa 49-153), HAB-IGF-I fusion protein LP-IGF-I was created by fusion of link protein (aa 1-354) to IGF-I (aa 49-153) and HAB-IGF-I fusion protein VG1-IGF-I was created by fusion of versican (aa 1-363) to IGF-I (aa 49-153).
[0106] FIGS. 2A-2B. Schematic illustration of human HA receptor CD44 and human IGF-I (FIG. 2A), and CD44-IGF-I fusion proteins (FIG. 2B).
[0107] FIG. 2A: Human CD44 precursor consists of a 20 aa signal peptide (SP), an extracellular region from AA21 to AA268, a 20 aa transmembrane region (TM) and a 73 aa of the intracellular region at c-terminal. The extracellular region from AA21 to AA268 contains the HA-binding domain, which is responsible for the binding of CD44 to HA. The human IGF-I precursor consists of a signal peptide (aa 1-48), mature IGF-I peptide (aa 49-118) and E peptide (aa 119-153).
[0108] FIG. 2B: CD44-IGF-I fusion proteins were made up of human CD44 and human IGF-I, each box representing a native domain or region of human CD44 and human IGF-I. CD44(132)-IGF-I was created by fusion of CD44(aa 1-132) to IGF-I(aa 49-153), CD44(156)-IGF-I was created by fusion of CD44 (aa 1-156) to IGF-I(aa 49-153), CD44(178)-IGF-I was created by fusion of CD44(aa 1-178) to IGF-I(aa 49-153), CD44(222)-IGF-I was created by fusion of CD44(aa 1-222) to IGF-I(aa 49-153).
[0109] FIGS. 3A-3C. Schematic illustration of human HA receptor CD44 and human IGF-I (FIG. 3A), CD44(178)-IGF-I fusion protein (FIG. 3B) and CD44(178)-IGF-I fusion protein with linker (FIG. 3C).
[0110] FIG. 3A: Human CD44 precursor consists of a 20 aa signal peptide (SP), an extracellular region from AA21 to AA268, a 20 aa transmembrane region (TM) and a 73 aa of the intracellular region at c-terminal. The extracellular region from AA21 to AA268 contains the HA-binding domain, which is responsible for the binding of CD44 to HA. The human IGF-I precursor consists of a signal peptide (aa 1-48), mature IGF-I peptide (aa 49-118) and E peptide (aa 119-153).
[0111] FIG. 3B: CD44(178)-IGF-I fusion protein was created by fusion of CD44 (aa 1-178) to IGF-I(aa 49-153).
[0112] FIG. 3C: CD44(178)-IGF-I fusion protein with linker was created by inserting a linker sequence after the sequence of CD44(178) and before the sequence of IGF-I in CD44(178)-IGF-I. FIG. 3C discloses SEQ ID NOS 1-8, respectively, in order of appearance.
[0113] FIGS. 4A-4C. Schematic illustration of human versican and human IGF-I (FIG. 4A), VG1-IGF-I fusion protein (FIG. 4B) and VG1-IGF-I fusion proteins with a cleavage site (FIG. 4C).
[0114] FIG. 4A) Human versican precursor contains G1 domain (VG1). It is responsible for versican binding to HA. The human IGF-I precursor consists of a signal peptide (aa 1-48), mature IGF-I peptide (aa 49-118) and E peptide (aa 119-153).
[0115] FIG. 4B) VG1-IGF-I was created by fusion of versican (aa 1-363) to IGF-I(aa 49-153).
[0116] FIG. 4C) VG1-IGF-I fusion protein with protease cleavage site was created by inserting the designated protease cleavage site sequences after the sequence of VG1 and before the sequence of IGF-I in VG1-IGF-I. The triangle () indicates the point of cleavage within the cleavage site sequence. FIG. 4C discloses SEQ ID NOS 9-13, respectively, in order of appearance.
[0117] Before explaining the disclosed embodiment of the present invention in detail, it is to be understood that the invention is not limited in its application to the details of the particular arrangement shown, since the invention is capable of other embodiments. Exemplary embodiments are illustrated in referenced figures of the drawings. It is intended that the embodiments and figures disclosed herein are to be considered illustrative rather than limiting. Also, the terminology used herein is for the purpose of description and not of limitation.
DETAILED DESCRIPTION OF THE INVENTION
[0118] Described are a set of related but distinct nucleic acid molecules that encode fusion proteins comprised by a hyaluronic acid-binding domain and insulin-like growth factor-I. Cells to which such nucleic acid molecules are transferred manufacture the fusion protein. Using a gene therapy approach, the fusion protein is made by the chondrocytes themselves. The nucleic acid molecules are designed to enable their protein products to 1) remain within the articular cartilage and/or repair tissue by matrix hyaluronic acid (HA) binding to the fusion protein's hyaluronic acid-binding domain and 2) stimulate chondrocyte reparative activity by insulin-like growth factor-I (IGF-1) binding to its cell surface receptors.
[0119] In addition, by putting a protease cleavage site between the HA tags and IGF-1, the binding of the HA to the cartilage matrix may be regulated. Further, a sustained release formulation is contemplated herein. The present invention may also be used to release growth factor (GF) in other organs, such as by creating an HA scaffolding in an area of the body so that the GF can be concentrated in one place and released as needed.
[0120] An important aspect of embodiments of the present invention is the retention of the molecule within a tissue when delivered as a protein. Further, those tissues that are deficient in matrix elements, for example--if it had already lost constituents, are particularly influenced by such fusion proteins. Without being bound by any particular theory, the fusion proteins facilitate the ability of the growth factor to enter the tissue.
[0121] In addition, the fusion protein, when purified from the conditioned medium of producer cells carrying the fusion protein nucleic acid molecule(s), may be used as a therapeutic agent using any of a variety of available protein delivery methods.
[0122] In one embodiment, there are provided DNA constructs encoding fusion proteins. The constructs contain the coding region for the mature form of insulin-like 1 (IGF-1). The constructs contain the coding region of at least one cartilage matrix protein hyaluronic acid-binding domain (HAB). The constructs may optionally contain signal peptide and carboxy-terminal extension peptide (E peptide). Vectors containing these cDNA constructs are designed to be transferred into target cells, enabling the cells to produce the encoded fusion proteins and provide local delivery of IGF-1 for tissue repair.
[0123] In another embodiment, vectors containing the DNA constructs are designed to be transferred into cultured cells and the fusion protein is recovered from the conditioned media. Examples of cell lines useful for this embodiment include Chinese Hamster Ovary (CHO) cells and HEK-293 cells Human Embryonic Kidney 293 (HEK-293) cells. The fusion protein is then purified, combined with a pharmaceutically acceptable excipient, and formulated into a therapeutically effective dose.
EXAMPLES
Example 1. Materials and Methods
[0124] Fusion Proteins
[0125] The nucleic acid sequences encoding the described proteins and chimeric nucleic acid sequences encoding fusion proteins may be constructed. The chimeric nucleic acid molecules may be prepared with sections encoding a linking peptide connecting the protein portions of the encoded fusion protein. Optionally, the peptide linker may be selected to include residues imparting stearic flexibility in order to enhance the function of the fusion protein.
[0126] Fusion Protein Linkers
[0127] The components of the fusion proteins may be operatively linked directly one to the other. The two protein components of the fusion protein may be directly linked via an amino terminus to carboxy terminus peptide bond.
[0128] Alternatively, one or more linker molecules connect the two portions of the fusion protein. The linker molecule allows the two portions of the fusion protein increased stearic freedom. In one embodiment, the linkers are peptide linkers. Peptides for linking protein chains are known to the art (Huston et al., 1993, Immunotechnology, ed. by J. Gosling et al., 47-60; Huston et al., Molecular Design and Modeling: Concepts and Applications, Part B, ed. J. J. Langone, Methods in Enzymology 203:46-88; Chaudhary et al, 1989, Nature 339:394-397). For example, a DNA construct may be prepared by recombinant methods or by polymerase chain reaction.
[0129] Examples of linkers include, for example, the linkers of FIG. 3C. Linker 1: GGSG; Linker 2: GGSGGGSG; Linker 3: GGSGGGSGGGSG; Linker 4: GGGGS; Linker 5: GGGGSGGGGS; Linker 6: GGGGSGGGGSGGGGS; Linker 7: GGSGGS; and Linker 8: VIGHPIDSE (SEQ ID NOS 1-8, respectively, in order of appearance).
[0130] The DNA construct sequentially, 5' to 3', encodes a first part of a fusion protein, then a peptide linker region, followed by a second part of a fusion protein as a single open reading frame, flanked by regulatory elements suitable for expressing the encoded fusion protein in a host cell. The peptide linker according to the invention may optionally include serine, glycine or other residues that will act as "molecular hinges" to allow greater stearic freedom.
[0131] The peptide linker may be encoded by the chimeric nucleic acid molecule, protein parts may be prepared separately and the fusion protein assembled by peptide chemical methods.
[0132] In another alternative, the linker molecule may be comprised of peptide and non-peptide polymers or may be exclusively non-peptide in composition. A wide variety of organic linkers are known in the art. For example, polyalkane polymers (e.g., --(CH.sub.2).sub.n--) may be readily linked by well known methods to thiol groups on sulfhydral containing amino acids.
[0133] Vectors and Promoters
[0134] The present invention further relates to vectors expressing the fusion proteins according to the invention. The vectors may be selected from any suitable vectors for inserting nucleic acid molecules into human host cells. The vectors may be deoxyribonucleic acid) ("DNA") vectors such as plasmids, adenovirus or even naked DNA inserted directly into cells to be treated by art known methods. The vectors may also be ribonucleic acid ("RNA") vectors, such as safe strains of the retroviruses.
[0135] Methods for the insertion of nucleic acid molecular fragments into a vector, as described, for example, in Maniatis, T., Fritsch, E. F., and Sambrook, J. (1989): Molecular Cloning (A Laboratory Manual), Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.; and Ausubel, F. M., Brent, R., Kingston, R. E., Moore, D. D., Seidman, J. G., Smith, J. A., and Struhl, K. (1992): Current Protocols in Molecular Biology, John Wiley & Sons, New York, may be used to construct fusion protein-encoding expression vectors consisting of appropriate transcriptional/translational control signals. These methods may include in vitro DNA recombinant and synthetic techniques and in vivo genetic recombination. Expression of a nucleic acid sequence encoding the fusion proteins may be regulated by a second nucleic acid sequence so that the fusion protein is expressed in a host infected or transfected with the recombinant chimeric nucleic acid molecule. For example, expression of the fusion proteins may be controlled by a selected promoter/enhancer element. The promoter activation may be tissue specific or inducible by a metabolic product or administered substance.
[0136] Promoters/enhancers which may be used to control the fusion protein gene expression include, but are not limited to, the native IGF-1 or EGF promoter, the cytomegalovirus (CMV) promoter/enhancer (Karasuyama, H., et al., 1989, J. Exp. Med., 169:13), the human .beta.-actin promoter (Gunning, et al., 1987, Proc. Natl. Acad. Sci. USA, 84:4831-4835), the glucocorticoid-inducible promoter present in the mouse mammary tumor virus long terminal repeat (MMTV LTR) (Klessig, D. F., et al., 1984, Mol. Cell Biol., 4:1354-1362), the long terminal repeat sequences of Moloney murine leukemia virus (MuLV LTR) (Weiss, R., et al., 1985, RNA Tumor Viruses, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.), the SV40 early region promoter (Bernoist and Chambon, 1981, Nature 290:304-310), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus (RSV) (Yamamoto et al., 1980, Cell 22:787-797), the herpes simplex virus (HSV) thymidine kinase promoter/enhancer (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences of the metallothionein gene (Brinster et al., 1982, Nature 296:39-42), the adenovirus promoter (Yamada et al., 1985, Proc. Natl. Acad. Sci. U.S.A. 82(11):3567-71), and the herpes simplex virus LAT promoter (Wolfe, J. H., et al., 1992, Nature Genetics, 1:379-384).
[0137] Expression vectors compatible with mammalian host cells for use in genetic therapy of cells, include, but are not limited to: plasmids, retroviral vectors, adenovirus vectors, herpes viral vectors, poxvirus vectors and non-replicative avipox viruses, as disclosed, for example, by U.S. Pat. No. 5,174,993. The viral vectors may be suitably modified for safety and modified to be non-replicative in human host cells.
[0138] Assays for Chondrocyte-Stimulating Activity
[0139] The effectiveness of the fusion proteins herein may be ascertained according to the present examples.
[0140] Treatment Methods
[0141] The present invention is also directed to methods of ameliorating joint and chondrocyte-associated maladies in any mammal, including laboratory animals, companion animals, draft animals, meat animals, and humans. For instance, the present invention may be used to treat joint disease, injury (surgical or accident), pain, structural defects, or to increase sports performance. In particular, dogs, cats, horses, and humans would be treatable with the compositions and methods of the present invention. Methods of treatment include the direct introduction of the chimeric nucleic acid into the cells of the subject to be treated. Proteins may be introduced to the subject directly.
[0142] Alternatively, the methods of treatment include the ex vivo treatment of cells, e.g., chondrocytes that have been removed from a subject's body, followed by the reintroduction of the treated cells into the subject. The ex vivo treatment can be conducted with the fusion protein according to the invention, in order to affect the IGF-1 functionality in the treated tissue. In addition, the ex vivo treatment can be conducted with the chimeric nucleic acid molecules according to the invention, in order that the reintroduced cells will continue to affect IGF-1 functionality after reintroduction.
[0143] In another embodiment, vectors according to the invention can be applied to the skin and internal organs by suspension in appropriate physiological carriers.
[0144] For example, a physiologically appropriate solution containing an effective concentration of active vectors can be administered topically, interarticularly, intraocularly, parenterally, orally, intranasally, intravenously, intramuscularly, subcutaneously or by any other effective means. In particular, the vector may be directly injected into a target tissue by a needle in amounts effective to treat the cells of the target tissue. Alternatively, a body cavity or tissue can receive a physiologically appropriate composition (e.g., a solution such as a saline or phosphate buffer, a suspension, or an emulsion, which is sterile except for the vector) containing an effective concentration of active vectors via direct injection with a needle or via a catheter or other delivery tube placed into the afflicted tissue. Any effective imaging device such as X-ray, sonogram, or fiberoptic visualization system may be used to locate the target area and guide the needle or other administration device.
[0145] In another alternative, a physiologically appropriate solution containing an effective concentration of active vectors can be administered systemically into the blood circulation to treat cells or tissues which cannot be directly reached or anatomically isolated.
[0146] In yet another alternative, target cells can be treated by introducing a fusion protein according to the invention into the cells. For example, liposomes are artificial membrane vesicles that are available to deliver drugs, proteins, and plasmid vectors both in vitro or in vivo (Mannino, R. J. et al., 1988, Biotechniques, 6:682-690) into target cells (Newton and Huestis, Biochemistry, 1988, 27: 4655-4659; Tanswell, A. K. et al., 1990, Biochemica et Biophysica Acta, 1044:269-274; and Ceccoll, J. et al. Journal of Investigative Dermatology, 1989, 93:190-194). Thus, fusion protein can be encapsulated at high efficiency with liposome vesicles and delivered into mammalian cells in vitro or in vivo.
[0147] Liposome-encapsulated fusion protein may be administered topically, intraocularly, parenterally, intranasally, intratracheally, intrabronchially, intramuscularly, subcutaneously or by any other effective means at a dose efficacious to treat the injured tissue or abnormal cells of the target. The liposomes may be administered in any physiologically appropriate composition containing an effective concentration of encapsulated fusion protein.
[0148] An effective concentration of vector or fusion protein may be determined by the ordinary artisan, for example, by screening chimeric nucleic acid vectors encoding fusion protein, or a fusion protein, in a chondrocyte assay system as described herein.
[0149] cDNA constructs encoding the present fusion proteins are described in the examples. The human IGF-1 cDNA sequence, including the coding sequence for the mature peptide, with and without the E-peptide were linked to the cDNA sequence encoding the hyaluronic acid-binding domain of cartilage matrix protein (HAB) with and without additional upstream cDNA sequences. When these gene sequences are placed into a vector and transferred into articular chondrocytes, the fusion protein is produced by the cells. The fusion protein is designed to bind to the hyaluronic acid-like molecules present in the tissue matrix that surrounds articular chondrocytes in vivo and during culture in vitro. Because this binding is reversible, the IGF-1 will be released over time. The significance of this approach is that it achieves local IGF-1 delivery over time in the presence of a responsive cell population.
[0150] The gene sequences, when expressed by chondrocytes, produce fusion proteins with superior IGF-1 attributes. When the fusion protein is secreted from the chondrocytes, the secreted fusion protein acts upon the cells that produced it and neighboring cells to stimulate cell division, the production of new matrix, and to inhibit cartilage degradation. The compositions are preferably administered locally, preferably by an instrument of injecting liquids into a joint cavity according to conventional methods. In one embodiment of the present method, the solution is directly injected via a syringe needle into the joint space (including connective tissue and synovial) of the patient. The size of the syringe needle is determined by the health professional administering the solution and the amount of solution to be injected into joint space of the patient. Preferably, the syringe is capable of containing at least 0.1 cc of liquid and administering the liquid percutaneously into the joint space of the patient via a syringe needle.
[0151] The typical volumes of solution injected into joint space of the patient ranges from 0.1 cc to 10 cc of solution per administration, depending on the size of the joint being treated. Preferably, the solution volume is ranges between 0.5 cc to 5 cc of solution and most preferably, the volume is lcc to 5 cc of solution.
[0152] The treatment may optionally include administration of an anesthetic/anti-inflammatory component of the solution dosage typically comprises at least one anesthetic that preferably has a volume that is between 20% to 60% of the total solution volume of each solution dosage. The anti-inflammatory component of the solution dosage may optionally comprise corticosteroids, including, optionally, two corticosteroids, each corticosteroid preferably having a volume that is between 10% to 50% of the total solution volume of each solution dosage.
[0153] The anesthetic component of the solution of the present method is selected based on the desired duration of pain relief provided to the painful (arthritic) or connective tissue target of the patient. Exemplary anesthetic compounds include procaine hydrochloride ("Procaine").
[0154] The corticosteroids components are selected from the group consisting of Betamethasone, Methylprednisolone acetate ("Depomedrol"), Cortisone acetate, Dexamethasone, Hydrocortisone, Methylprednisolone, Prednisolone, Prednisolone sodium phosphate, and Prednisone. In one embodiment of the present method corticosteroids component includes a mixture of Dexamethasone and Depomedrol. In one embodiment of the method the anesthetic is buffered.
[0155] The needle of the syringe needle of the present method is selected based on the size of the joint space of the patient. In one embodiment of the present method the needle is a hypodermic needle, preferably an arthroscopic needle, but may be other types of needles depending on the application including non-arthroscopic needles. Preferably, the present method provides for needle gauges in the range of between 22 and 25 gauge and from 3/4 inch to 11/2 inches in length.
[0156] Further, the treatment solution is injected into the joint. Preferably, this is done by inserting the syringe needle through the treated topical skin area and then slowly guiding the needle of the syringe needle into the synovia capsule of the target joint. Depending upon the joint or target space capacity, preferably 10 ml or less of the present treatment composition is introduced slowly into the joint capsule.
[0157] Further, the treatment may be repeated periodically. Repeat injections may be administered in 1, 5, 7, 10, 20, 30 day or yearly intervals after the first injection. The solution can be mixed in a separate container prior to be loaded into a syringe or they can be mixed in the syringe directly prior to administration.
Example 2. Creation of Hyaluronic Acid Binding--Insulin-Like Growth Factor I Fusion Protein cDNA Constructs
[0158] Three cDNAs were created, encoding three hyaluronic acid-binding domain (HAB)--insulin-like growth factor I (IGF-I) fusion proteins (HAB-IGF-I) by coupling the sequence encoding IGF-I with those encoding a hyaluronic acid binding region derived from three cartilage matrix proteins. These regions include 1) the aggrecan G1 domain (AG1), 2) the versican G1 domain (VG1) and 3) link protein (LP). The resulting fusion proteins are designated AG1-IGF-I, VG1-IGF-I, and LP-IGF-I respectively (FIG. 1B). Further, the encoded fusion proteins were created by transferring the cDNAs into producer cells and the selected therapeutic target cells, articular chondrocytes.
[0159] To facilitate the HAB-IGF-I constructs to be used for human therapeutic purposes, the DNA constructs encoding all the HAB-IGF-I fusion proteins were generated from native human matrix protein gene sequences and the native human IGF-I gene sequence (Accession number: X57025) and were designed to include signal peptides.
[0160] All the constructs described and illustrated in FIG. 1B were created using specifically designed primers by polymerase chain reaction (Table 4).
[0161] As shown in FIG. 1A, Aggrecan has two globular structural domains (G1 and G2) near the N-terminal end and one globular domain (G3) at the C-terminal end, separated by a large extended domain modified with glycosaminoglycans. The two main modifier moieties are arranged into distinct regions, a chondroitin sulfate (CS) and a keratan sulfate (KS) region.
Example 3. Production of Fusion Proteins and Demonstration of Fusion Protein Integrity
[0162] To promote articular cartilage preservation or repair using these fusion proteins as therapeutic agents, it is necessary to have a production source for the proteins. A method of production employs the fusion protein-encoding cDNAs described above and a non-viral (plasmid) vector that the inventors developed previously, Shi et al., Effect of Transfection Strategy on Growth Factor Overexpression by Articular Chondrocytes, J. of Orthopaedic Research, January 2010, hereby incorporated by reference. The cDNAs were inserted into the vector and used to transfect HEK 293 cells (Stratagene). The HEK 293 cells then use these inserted transgenes to synthesize the HAB-IGF-I fusion proteins. On day 3 after transfection conditioned medium was collected for measurement of HAB-IGF-I fusion proteins produced as transgene products by the HEK 293 cells. The production and integrity of these proteins was confirmed by IGF-I ELISA (R&D Systems) and a HA-binding functional ELISA.
[0163] It was found that pAAV-VG-IGF-I transfected cells produced a high level of IGF-I (1939 ng/ml), pAAV-AG1-IGF-I transfected cells produced 865 ng/ml of IGF-I, and pAAV-LP-IGF-I transfected cells only produced 5 ng/ml of IGF-I. HA-binding functional ELISA data demonstrated that the VG-IGF-I and the AG1-IGF-I fusion proteins successfully attached to hyaluronic acid, and AG1 or VG1 and IGF-I are linked together.
Example 4. Demonstration of Bioactivity of Transferred HAB-IGF-I Genes in Articular Chondrocytes
[0164] To promote articular cartilage preservation or repair using the fusion protein cDNAs as agents for gene therapy, the transgenes must stimulate reparative activity by their host cells. To test this capability, primary bovine articular chondrocytes were transfected and cultured for 5 days. On day 5, the chondrocytes were collected and used to measure biosynthesis of cell-associated matrix molecules. An index of articular chondrocyte reparative activity is the production of glycosaminoglycan (GAG). GAG is an essential component of articular cartilage. It is produced by articular chondrocytes incorporated into their surrounding matrix.
[0165] Chondrocytes transfected with pAAV-VG1-IGF-I produced a high level of GAG (71.4.+-.3.3 ug/well), pAAV-AG1-IGF-I transfected cells produced 65.5.+-.0.3 ug/well of GAG, and pAAV-LP-IGF-I transfected cells produced 42.4.+-.0.6 ug/well of GAG.
Example 5. CD44-IGF-I Constructs
[0166] Four different constructs of CD44-IGF-I were generated: 1) CD44(132)-IGF-I (SEQ ID NO: 64), 2) CD44(156)-IGF-I (SEQ ID NO: 68), 3) CD44(178)-IGF-I (SEQ ID NO: 72), and 4) CD44(222)-IGF-I (SEQ ID NO: 76). Each construct was separately delivered to bovine chondrocytes using a pAAV-based vector. The empty vector, pAAV-MCS, was used as a control. The conditioned medium was harvested on day 2 after transfection. The conditioned medium was analyzed by both IGF-I ELISA and HA-binding functional ELISA. Different constructs generated dramatically different production of CD44-IGF-I fusion protein in chondrocytes. Construct CD44(156)-IGF-I generated the least amount of fusion protein (13.50 ng/ml) while construct CD44(178)-IGF-I produced the highest amount of fusion protein (289.51 ng/ml), 21.4 fold higher than that of the construct CD44(156)-IGF-I. HA-binding functional ELISA showed that the fusion protein from construct CD44(178)-IGF-I in chondrocytes binds to HA, and CD44(178) and IGF-I are linked together. HA-binding activity was not detectable in the conditioned medium from the chondrocytes transfected by construct CD44(132)-IGF-I, CD44(156)-IGF-I or CD44(222)-IGF-I. Low production of CD44-IGF-I fusion protein in chondoocytes transfected by CD44(132)-IGF-I or CD44(156)-IGF-I interfered with HA-binding activity detection in the condition medium (Table 1).
TABLE-US-00001 TABLE 1 Production and HA-binding activity of CD44-IGF-I fusion protein Arbitrary unit IGF-I (ng/ml) (HA-binding (IGF-I ELISA) functional ELISA) Control 0.00 0.00 CD44(132)-IGF-I 19.95 0.00 CD44(156)-IGF-I 13.50 0.00 CD44(178)-IGF-I 289.51 1.70 CD44(222)-IGF-I 172.91 0.00
[0167] To improve the HA-binding activity of CD44(178)-IGF-I, nine different constructs were generated, each containing a different linker between CD44(178) and IGF-I, using pAAV-CD44(178)-IGF-I as a template and QuickChange Lighting Site-Directed Mutagenesis Kit (Stratagene). Each construct was separately delivered to bovine chondrocytes using a pAAV vector. The original construct, CD44(178)-IGF-I, in the pAAV vector was included for comparison. The conditioned medium was harvested on day 2 after transfection. The conditioned medium was analyzed by both IGF-I ELISA and HA-binding functional ELISA. Different linkers have varied effects on the fusion protein production and HA-binding activity. Compared to the original construct, CD44(178)-IGF-I, the insertion of Linker 8 increases the fusion protein production and HA-binding activity by 1.7 fold and 2.9 fold respectively (Table 2).
TABLE-US-00002 TABLE 2 Production and HA-binding activity of CD44-IGF-I fusion protein with or without a linker Arbitrary unit IGF-I (ng/ml) (HA-binding (IGF-I ELISA) functional ELISA) CD44(178)-IGF-I 278.77 1.56 CD44(178)-Linker 1-IGF-I 278.20 0.77 CD44(178)-Linker 2-IGF-I 236.56 0.99 CD44(178)-Linker 3-IGF-I 144.40 0.33 CD44(178)-Linker 4-IGF-I 271.98 1.11 CD44(178)-Linker 5-IGF-I 267.40 0.21 CD44(178)-Linker 6-IGF-I 240.64 0.40 CD44(178)-Linker 7-IGF-I 343.39 0.30 CD44(178)-Linker 8-IGF-I 463.64 4.47
Example 6. Protease Cleavage Site Constructs
[0168] Constructs were generated, each containing a different specific protease cleavage site between VG1 and IGF-I, using pAAV-VG1-IGF-I as a template and QuickChange Lighting Site-Directed Mutagenesis Kit (Stratagene). The resulting constructs and the original construct were delivered to bovine chondrocytes using a pAAV-based vector. The conditioned medium was harvested on day 2 after transfection. The conditioned medium was analyzed by IGF-I ELISA and HA-binding functional ELISA respectively. The constructs with a protease cleavage site generated similar amounts of fusion protein and the resulting fusion proteins had similar HA-binding activity (Table 3), compared with the original construct VG1-IGF-I (Table 1). These VG1 fusion proteins with protease cleavage site are assessed to examine specific proteases cleavage efficiency and to measure the activity of the released growth factor. This method can be applied to aggrecan, CD44, and other HA-binding domains, and other growth factors.
TABLE-US-00003 TABLE 3 Production and HA-binding activity of CD44-IGF-I fusion protein Arbitrary unit IGF-I (ng/ml) (HA binding (IGF-I ELISA) functional ELISA) Control 0.00 0.00 VG1-I1 299.03 199.26 VG1-EK-IGF-I 358.97 249.86 VG1-Furin-IGF-I 221.79 228.85 VG1-Xa-IGF-I 312.21 256.90
[0169] A depiction of a fusion protein with a cleavage site is shown in FIG. 4C. Examples of cleavage sites include: cleavage site for EK protease: DDDDK(SEQ ID NO: 9); cleavage site for Furin protease: RVRR (SEQ ID NO: 10); cleavage site for Factor Xa protease: IEGR (SEQ ID NO: 11); cleavage site for MMP protease: DIPENFFG (SEQ ID NO: 12); and cleavage site for Aggrecanase: NITEGEARGSVI (SEQ ID NO: 13).
Example 7. Sequences
[0170] Specific nucleotide and amino acid sequences may be utilized in embodiments of the invention.
[0171] Examples of primers useful for building HAB-IGF-I fusion protein constructs are provided in Table 4.
TABLE-US-00004 TABLE 4 Primers for building HAB-IGF-I fusion protein constructs SEQ Prim- ID er Sequence No. AG1 F 5'-GAATTCACCATGGCCACTTTACTCTGGGTTTTCGT 14 GACTC-3' AG1 R 5'-GGGCCCGATGTCCACAAAGTCTTCACCTGTGTAG- 15 3' LP F 5'-GAATTCACCATGGAGAGTCTACTTCTTCTGGTGCT 16 GATTTC-3' LP R 5'-GGGCCCGTTGTATGCTCTGAAGCAGTAGACACC-3' 17 VG1 F 5'-GAATTCACCATGGTCATAAATATAAAGAGCATCTT 18 ATGGA-3' VG1 R 5'-GTCGACCAATTGGATGACCAATTACACTCAAATCA 19 CTC-3' IGF-I 5'-GTCGACGGGCCCGAGACGCTCTGCGGGGCTGAGCT 20 F1 GGTG-3' IGF-I 5'-CAATTGATTCAGAAGGACCGGAGACGCTCTGCGGG 21 F2 GCTGAG-3' IGF-I 5'-CTCGAGCTACATCCTGTAGTTCTTGTTTCCTG-3' 22 R IGF-I 5'-GAATTCACAATGGGAAAAATCAGCAGTCTTCC-3' 23 F3 The underlined 6-bp sequence represents EcoR I (GAATTC), Xho I (CTCGAG), Mfe I (CAATTG), BamH I (GGATCC) or Apa I (GGGCCC) sites. Sal I (GTCGAC) site in primer VG1R and IGF-I F1I is included for cloning.
[0172] Human aggrecan gene nucleotide sequence is provided (SEQ ID No. 24) and the human aggrecan gene nucleotide sequence segment encoding the N-terminal 357 aa of human aggrecan (SEQ ID No. 25), which includes aggrecan G1 domain and a partial IGD domain (FIG. 1A). The aggrecan G1 domain (AG1) is responsible for the binding of aggrecan to hyaluronic acid.
[0173] Human aggrecan protein amino acid sequence is provided (SEQ ID No. 26) and the N-terminal 357 aa of human aggrecan (SEQ ID No. 27), which includes aggrecan G1 domain and a partial IGD domain (FIG. 1A). The aggrecan G1 domain (AG1) is responsible for the binding of aggrecan to hyaluronic acid.
[0174] Human link protein gene nucleotide sequence is provided (SEQ ID No. 28). Link protein (LP) binds to hyaluronic acid and aggrecan G1 domain (AG1) in stabilizing the interaction between aggrecan and hyaluronan to form aggrecan aggregates in articular cartilage.
[0175] Human link protein amino acid sequence is provided (SEQ ID No. 29).
[0176] Human versican gene nucleotide sequence is provided (SEQ ID No. 30) and the N-terminal 363 aa of human versican (SEQ ID No. 31), which includes versican G1 domain (FIG. 1A). The versican G1 domain (VG1) is responsible for the binding of versican to hyaluronic acid.
[0177] Human versican protein amino acid sequence is provided (SEQ ID No. 32) and the N-terminal 363 aa of human versican (SEQ ID No. 33), which includes versican G1 domain (FIG. 1A). The versican G1 domain (VG1) is responsible for the binding of versican to hyaluronic acid.
[0178] Human IGF-I (signal peptide and mature peptide and E peptide) gene nucleotide sequence is provided (SEQ ID No. 34). The 70aa gene nucleotide sequence encoding the IGF-I mature peptide (SEQ ID No. 35) is provided. The 48aa gene nucleotide sequence encoding the IGF-I signal peptide (SEQ ID No. 36) is provided. The 35aa gene nucleotide sequence encoding the IGF-I E peptide (SEQ ID No. 37) is also provided (FIG. 1A).
[0179] Human IGF-I (signal peptide and mature peptide and E peptide) protein amino acid sequence is provided (SEQ ID No. 38). The amino acid sequence (70aa) encoding the IGF-I mature peptide (SEQ ID No. 39) is provided. The amino acid sequence (48aa) encoding the IGF-I signal peptide (SEQ ID No. 40) is provided. The amino acid sequence (35aa) encoding the IGF-I E peptide (SEQ ID No. 41) is also provided (FIG. 1A).
[0180] HAB-IGF-I Fusion Protein
[0181] AG1-IGF-I fusion protein (FIG. 1B) nucleotide sequence is provided (SEQ ID No. 42). A segment of the AG1-IGF-I fusion protein nucleotide sequence encodes IGF-I mature peptide and E peptide (SEQ ID No. 43). This is fused to a c-terminal truncated aggrecan from AA 1 to AA 357, which includes aggrecan G1 domain (AG1) and a partial IGD domain. The aggrecan G1 domain (AG1) is responsible for the binding of aggrecan to hyaluronic acid.
[0182] AG1-IGF-I fusion protein (FIG. 1B) amino acid sequence is provided (SEQ ID No. 44). A segment of the AG1-IGF-I fusion protein amino acid sequence is IGF-I mature peptide and E peptide (SEQ ID No. 45). This is fused to a c-terminal truncated aggrecan from AA 1 to AA 357, which includes aggrecan G1 domain (AG1) and a partial IGD domain. The aggrecan G1 domain (AG1) is responsible for the binding of aggrecan to hyaluronic acid.
[0183] LP-IGF-I fusion protein (FIG. 1B) nucleotide sequence is provided (SEQ ID No. 46). A segment of the LP-IGF-I fusion protein nucleotide sequence encodes IGF-I mature peptide and E peptide (SEQ ID No. 47). This is fused to human link protein at c-terminal. Link protein (LP) binds to hyaluronic acid and aggrecan G1 domain (AG1) in stabilizing the interaction between aggrecan and hyaluronan to form aggrecan aggregates in articular cartilage.
[0184] LP-I1 (FIG. 1B) amino acid sequence is provided (SEQ ID No. 48). A segment of the LP-II amino acid sequence is IGF-I mature peptide and E peptide (SEQ ID No. 49). This is fused to human link protein at c-terminal. Link protein (LP) binds to hyaluronic acid and aggrecan G1 domain (AG1) in stabilizing the interaction between aggrecan and hyaluronan to form aggrecan aggregates in articular cartilage.
[0185] VG1-I1 (FIG. 1B) nucleotide sequence is provided (SEQ ID No. 50). A segment of the VG1-I1 nucleotide sequence encodes IGF-I mature peptide and E peptide (SEQ ID No. 51). This is fused to a c-terminal truncated versican from AA 1 to AA 363, which includes versican G1 domain (VG1). The versican G1 domain (VG1) is responsible for the binding of versican to hyaluronic acid.
[0186] VG1-I1 (FIG. 1B) amino acid sequence is provided (SEQ ID No. 52). A segment of the VG1-I1 sequence is IGF-I mature peptide and E peptide (SEQ ID No. 53). This is fused to a c-terminal truncated versican from AA 1 to AA 363, which includes versican G1 domain (VG1). The versican G1 domain (VG1) is responsible for the binding of versican to hyaluronic acid.
[0187] Human CD44 gene nucleotide sequence is provided (SEQ ID No. 54). The protein encoded by this gene sequence is a cell-surface glycoprotein. It is a receptor for hyaluronic acid (HA). This sequence (NM_0010013931) is one of CD44 mRNA splicing variants. The 5'-804 nt sequence (SEQ ID No. 55) encodes the N-terminal 268 aa of CD44: a signal peptide (SP) from AA1 to AA20 and an extracellular region from AA21 to AA268. The extracellular region from AA21 to AA268 contains the HA binding domain, which is responsible for the binding of CD44 to HA. The 60 nt sequence after the 804 nt sequence encodes the 20 aa of human CD44 transmembrane region (TM) (SEQ ID No. 56). The 3'-222 nt sequence encodes the 73 aa of the C-terminal intracellular region of CD44 (SEQ ID No. 57).
[0188] Human CD44 protein amino acid sequence is provided (SEQ ID No. 58). The N-terminal 268 aa sequence (SEQ ID No. 59) consists of a signal peptide (SP) from AA1 to AA20 and an extracellular region from AA21 to AA268. The CD44 extracellular region from AA21 to AA268 contains the HA-binding domain responsible for the binding of CD44 to HA. The C-terminal 73 aa sequence (SEQ ID No. 61) is the intracellular region of CD44. The 20 aa sequence between them is CD44 transmembrane region (TM) (SEQ ID No. 60) (FIG. 4A).
[0189] CD44-IGF-I Fusion Protein
[0190] CD44(132)-IGF-I nucleotide sequence is provided (SEQ ID No. 62). A segment of the CD44(132)-IGF-I nucleotide sequence (SEQ ID No. 63) encodes IGF-I mature peptide and E peptide. This is fused to a c-terminal truncated CD44 from AA 1 to AA 132 (FIG. 4B).
[0191] CD44(132)-IGF-I amino acid sequence is provided (SEQ ID No. 64). A segment of the CD44(132)-IGF-I amino acid sequence is IGF-I mature peptide and E peptide (SEQ ID No. 65). This is fused to a c-terminal truncated CD44 from AA 1 to AA 132 (FIG. 4B).
[0192] CD44(156)-IGF-I nucleotide sequence is provided (SEQ ID No. 66). A segment of the CD44(156)-IGF-I nucleotide sequence encodes IGF-I mature peptide and E peptide (SEQ ID No. 67). This is fused to a c-terminal truncated CD44 from AA 1 to AA 156 (FIG. 4B).
[0193] CD44(156)-IGF-I amino acid sequence is provided (SEQ ID No. 68). A segment of the CD44(156)-IGF-I amino acid sequence (SEQ ID No. 69) is IGF-I mature peptide and E peptide. This is fused to a c-terminal truncated CD44 from AA 1 to AA 156 (FIG. 4B).
[0194] CD44(178)-IGF-I nucleotide sequence is provided (SEQ ID No. 70). A segment of the CD44(178)-IGF-I nucleotide sequence encodes IGF-I mature peptide and E peptide (SEQ ID No. 71). This is fused to a c-terminal truncated CD44 from AA 1 to AA 178 (FIG. 4B).
[0195] CD44(178)-IGF-I amino acid sequence is provided (SEQ ID No. 72). A segment of the CD44(178)-IGF-I amino acid sequence encodes IGF-I mature peptide and E peptide (SEQ ID No. 73). This is fused to a c-terminal truncated CD44 from AA 1 to AA 178 (FIG. 4B).
[0196] CD44(222)-IGF-I nucleotide sequence is provided (SEQ ID No. 74). A segment of the CD44(222)-IGF-I nucleotide sequence encodes IGF-I mature peptide and E peptide (SEQ ID No. 75). This is fused to a c-terminal truncated CD44 from AA 1 to AA 222 (FIG. 4B).
[0197] CD44(222)-IGF-I amino acid sequence is provided (SEQ ID No. 76). A segment of the CD44(222)-IGF-I amino acid sequence is IGF-I mature peptide and E peptide (SEQ ID No. 77). This is fused to a c-terminal truncated CD44 from AA 1 to AA 222 (FIG. 4B).
TABLE-US-00005 Sequence Listing Table SEQ ID Abbreviation NO (as applicable) Name 1 Linker Linker 1 2 Linker Linker 2 3 Linker Linker 3 4 Linker Linker 4 5 Linker Linker 5 6 Linker Linker 6 7 Linker Linker 7 8 Linker Linker 8 9 Cleavage site cleavage site for EK protease 10 Cleavage site cleavage site for Furin protease 11 Cleavage site cleavage site for Factor Xa protease 12 Cleavage site cleavage site for MMP protease 13 Cleavage site cleavage site for Aggrecanase 14 AG1 F Primer: Aggrecan G1 domain (forward) 15 AG1 R Primer: Aggrecan G1 domain (reverse) 16 LP F Primer: Link Protein (forward) 17 LP R Primer: Link Protein (reverse) 18 VG1 F Primer: Versican G1 domain (forward) 19 VG1 R Primer: Versican G1 domain (reverse) 20 IGF-I F1 Primer: insulin-like growth factor (forward) 21 IGF-I F2 Primer: insulin-like growth factor (forward) 22 IGF-I R Primer: insulin-like growth factor (reverse) 23 IGF-I F3 Primer: insulin-like growth factor (forward) 24 Aggrecan Human aggrecan gene nucleotide sequence 25 nucleotide sequence N-terminal 357 aa of human aggrecan - includes aggrecan G1 domain (AG1) and a partial IGD domain 26 Aggrecan Human aggrecan protein amino acid sequence 27 amino acid sequence N-terminal 357 aa of human aggrecan 28 LP Human link protein gene nucleotide sequence 29 LP Human link protein amino acid sequence 30 Versican Human versican gene nucleotide sequence 31 nucleotide sequence N-terminal 363 aa of human versican - includes versican G1 domain (VG1) 32 Versican Human versican protein amino acid sequence 33 amino acid sequence N-terminal 363 aa of human versican - includes versican G1 domain (VG1) 34 Human IGF-I (signal peptide and mature peptide and E peptide) gene nucleotide sequence 35 IGF-I (70aa) IGF-I mature peptide nucleotide sequence 36 SP IGF-I signal peptide nucleotide sequence 37 E peptide IGF-I E nucleotide sequence 38 Human IGF-I (signal peptide, mature peptide, and E peptide) amino acid sequence 39 IGF-I (70aa) IGF-I mature peptide amino acid sequence 40 SP IGF-I signal peptide amino acid sequence 41 E peptide IGF-I E peptide amino acid sequence 42 AG1-IGF-I fusion protein nucleotide sequence 43 segment of the AG1-IGF-I fusion protein nucleotide sequence encoding IGF-I mature peptide and E peptide 44 AG1-IGF-I fusion protein amino acid sequence 45 segment of the AG1-IGF-I fusion protein amino acid sequence: IGF-I mature peptide and E peptide 46 LP-IGF-I fusion protein nucleotide sequence 47 segment of the LP-IGF-I fusion protein nucleotide sequence encoding IGF-I mature peptide and E peptide 48 LP-I1 amino acid sequence 49 segment of the LP-I1 amino acid sequence: IGF-I mature peptide and E peptide 50 VG1-IGF-1 nucleotide sequence 51 segment of the VG1-IGF-1 nucleotide sequence encoding IGF-I mature peptide and E peptide 52 VG1-IGF-1 amino acid sequence 53 segment of the VG1-IGF-1 nucleotide sequence: IGF-I mature peptide and E peptide 54 CD44 Human CD44 gene nucleotide sequence 55 N-terminal 268 aa of CD44 (including SP & HAB) 56 TM human CD44 transmembrane region 57 C-terminal intracellular region of CD44 58 Human CD44 protein amino acid sequence 59 CD44 N-terminal 268 aa sequence consists of a signal peptide (SP) from AA1 to AA20 and an extracellular region from AA21 to AA268 60 CD44 transmembrane region 61 C-terminal intracellular region of CD44 62 CD44(132)-IGF-I nucleotide sequence 63 segment of the CD44(132)-IGF-I nucleotide sequence encoding IGF-I mature peptide and E peptide 64 CD44(132)-IGF-I amino acid sequence 65 segment of the CD44(132)-IGF-I amino acid sequence is IGF-I mature peptide and E peptide 66 CD44(156)-IGF-I nucleotide sequence 67 segment of the CD44(156)-IGF-I nucleotide sequence encodes IGF-I mature peptide and E peptide 68 CD44(156)-IGF-I amino acid sequence 69 segment of the CD44(156)-IGF-I amino acid sequence: IGF-I mature peptide and E peptide 70 CD44(178)-IGF-I nucleotide sequence 71 segment of the CD44(178)-IGF-I nucleotide sequence encoding IGF-I mature peptide and E peptide 72 CD44(178)-IGF-I amino acid sequence 73 segment of the CD44(178)-IGF-I amino acid sequence: IGF-I mature peptide and E peptide 74 CD44(222)-IGF-I nucleotide sequence 75 segment of the CD44(222)-IGF-I nucleotide sequence encoding IGF-I mature peptide and E peptide 76 CD44(222)-IGF-I amino acid sequence 77 segment of the CD44(222)-IGF-I amino acid sequence: IGF-I mature peptide and E peptide
[0198] The terms and expressions which have been employed are used as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof. Thus, it should be understood that although the present invention has been specifically disclosed by particular embodiments and optional features, modification and variation of the concepts herein disclosed may be used by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention as defined by the appended claims. Whenever a range is given in the specification, all intermediate ranges and subranges, as well as all individual values included in the ranges given are intended to be included in the disclosure. When a Markush group or other grouping is used herein, all individual members of the group and all combinations and sub-combinations possible of the group are intended to be individually included in the disclosure. While the present invention has been described in terms of those particular embodiments and examples, it will be appreciated that the spirit and scope of the invention is not limited to those embodiments, but extends to the various modifications and equivalents as defined in the appended claims.
Sequence CWU
1
1
7714PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 1Gly Gly Ser Gly128PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 2Gly Gly Ser Gly Gly Gly Ser
Gly1 5312PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 3Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly
Ser Gly1 5 1045PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 4Gly
Gly Gly Gly Ser1 5510PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 5Gly Gly Gly Gly Ser Gly Gly
Gly Gly Ser1 5 10615PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 6Gly
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser1 5
10 1576PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 7Gly
Gly Ser Gly Gly Ser1 589PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 8Val Ile Gly His Pro Ile
Asp Ser Glu1 595PRTUnknownDescription of Unknown EK
protease cleavage site peptide 9Asp Asp Asp Asp Lys1
5104PRTUnknownDescription of Unknown Furin protease cleavage site
peptide 10Arg Val Arg Arg1114PRTUnknownDescription of Unknown factor Xa
protease cleavage site peptide 11Ile Glu Gly
Arg1128PRTUnknownDescription of Unknown MMP protease cleavage site
peptide 12Asp Ile Pro Glu Asn Phe Phe Gly1
51312PRTUnknownDescription of Unknown Aggrecanase protease cleavage
site peptide 13Asn Ile Thr Glu Gly Glu Ala Arg Gly Ser Val Ile1
5 101440DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 14gaattcacca tggccacttt
actctgggtt ttcgtgactc 401534DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
15gggcccgatg tccacaaagt cttcacctgt gtag
341641DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 16gaattcacca tggagagtct acttcttctg gtgctgattt c
411733DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 17gggcccgttg tatgctctga agcagtagac acc
331840DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 18gaattcacca tggtcataaa tataaagagc
atcttatgga 401938DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
19gtcgaccaat tggatgacca attacactca aatcactc
382039DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 20gtcgacgggc ccgagacgct ctgcggggct gagctggtg
392141DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 21caattgattc agaaggaccg gagacgctct gcggggctga g
412232DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 22ctcgagctac atcctgtagt tcttgtttcc tg
322332DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 23gaattcacaa tgggaaaaat
cagcagtctt cc 32247296DNAHomo sapiens
24atgaccactt tactctgggt tttcgtgact ctgagggtca tcactgcagc tgtcactgta
60gaaacttcag accatgacaa ctcgctgagt gtcagcatcc cccaaccgtc cccgctgagg
120gtcctcctgg ggacctccct caccatcccc tgctatttca tcgaccccat gcaccctgtg
180accaccgccc cttctaccgc cccactggcc ccaagaatca agtggagccg tgtgtccaag
240gagaaggagg tagtgctgct ggtggccact gaagggcgcg tgcgggtcaa cagtgcctat
300caggacaagg tctcactgcc caactacccg gccatcccca gtgacgccac cttggaagtc
360cagagcctgc gctccaatga ctctggggtc taccgctgcg aggtgatgca tggcatcgag
420gacagcgagg ccaccctgga agtcgtggtg aaaggcatcg tgttccatta cagagccatc
480tctacacgct acaccctcga ctttgacagg gcgcagcggg cctgcctgca gaacagtgcc
540atcattgcca cgcctgagca gctgcaggcc gcctacgaag acggcttcca ccagtgtgac
600gccggctggc tggctgacca gactgtcaga taccccatcc acactccccg ggaaggctgc
660tatggagaca aggatgagtt tcctggtgtg aggacgtatg gcatccgaga caccaacgag
720acctatgatg tgtactgctt cgccgaggag atggagggtg aggtctttta tgcaacatct
780ccagagaagt tcaccttcca ggaagcagcc aatgagtgcc ggcggctggg tgcccggctg
840gccaccacgg gccagctcta cctggcctgg caggctggca tggacatgtg cagcgccggc
900tggctggccg accgcagcgt gcgctacccc atctccaagg cccggcccaa ctgcggtggc
960aacctcctgg gcgtgaggac cgtctacgtg catgccaacc agacgggcta ccccgacccc
1020tcatcccgct acgacgccat ctgctacaca ggtgaagact ttgtggacat cccagaaaac
1080ttctttggag tggggggtga ggaggacatc accgtccaga cagtgacctg gcctgacatg
1140gagctgccac tgcctcgaaa catcactgag ggtgaagccc gaggcagcgt gatccttacc
1200gtaaagccca tcttcgaggt ctcccccagt cccctggaac ccgaggagcc cttcacgttt
1260gcccctgaaa taggggccac tgccttcgct gaggttgaga atgagactgg agaggccacc
1320aggccctggg gctttcccac acctggcctg ggccctgcca cggcattcac cagtgaggac
1380ctcgtcgtgc aggtgaccgc tgtccctggg cagccgcatt tgccaggggg ggtcgtcttc
1440cactaccgcc cgggacccac ccgctactcg ctgacctttg aggaggcaca gcaggcctgc
1500ctgcgcacgg gggcggtcat tgcctcgccg gagcagctcc aggccgccta cgaagcaggc
1560tatgagcagt gtgacgccgg ctggctgcgg gaccagaccg tcagataccc cattgtgagc
1620ccccggaccc catgcgtggg tgacaaggac agcagcccag gggtcaggac ctatggcgtg
1680cgcccatcaa cagagaccta cgatgtctac tgctttgtag acagacttga gggggaggtg
1740ttcttcgcca cacgccttga gcagttcacc ttccaggaag cactggagtt ctgtgaatct
1800cacaatgcta cgctggccac cacgggccag ctctacgccg cctggagccg cggcctggac
1860aagtgctatg ccggctggct ggccgacggc agcctccgct accccatcgt caccccaagg
1920cctgcctgcg gtggggacaa gccaggcgtg agaacggtct acctctaccc taaccagacg
1980ggcctcccag acccactgtc ccggcaccat gccttctgct tccgaggcat ttcagcggtt
2040ccttctccag gagaagaaga gggtggcaca cccacatcac cctctggtgt ggaggagtgg
2100atcgtgaccc aagtggttcc tggtgtggct gctgtccccg tagaagagga gacaactgct
2160gtaccctcag gggagactac tgccatccta gagttcacca ccgagccaga aaaccagaca
2220gaatgggaac cagcctatac cccagtgggc acatccccgc tgccagggat ccttcctact
2280tggcctccca ctggcgcagc aacagaggaa agtacagaag gcccttctgc aactgaagtg
2340ccctctgcct cagaggaacc atccccctca gaggtgccat tcccctcaga ggagccatcc
2400ccctcagagg aaccattccc ctcagtgagg ccattcccct cagtggagct gttcccctca
2460gaggagccat tcccctccaa ggagccatcc ccctcagagg aaccatcagc ctcggaagag
2520ccgtatacac cttcaccccc cgtgcccagc tggactgagc tgcccagctc tggggaggaa
2580tctggggccc ctgatgtcag tggtgacttc acaggcagtg gagatgtttc aggacacctt
2640gacttcagtg ggcagctgtc aggggacagg gcaagtggac tgccctctgg agacctggac
2700tccagtggtc ttacttccac agtgggctca ggcctgcctg tggaaagtgg actaccctca
2760ggggatgaag agagaattga gtggcccagc actcctacgg ttggtgaact gccctctgga
2820gctgagatcc tagagggctc tgcctctgga gttggggatc tcagtggact tccttctgga
2880gaagttctag agacctctgc ctctggagta ggagacctca gtgggcttcc ttctggagaa
2940gttctagaga ccactgcccc tggagtagag gacatcagcg ggcttccttc tggagaagtt
3000ctagagacca ctgcccctgg agtagaggac atcagcgggc ttccttctgg agaagttcta
3060gagaccactg cccctggagt agaggacatc agcgggcttc cttctggaga agttctagag
3120accactgccc ctggagtaga ggacatcagc gggcttcctt ctggagaagt tctagagacc
3180actgcccctg gagtagagga catcagcggg cttccttctg gagaagttct agagaccact
3240gcccctggag tagaggacat cagcgggctt ccttctggag aagttctaga gaccgctgcc
3300cctggagtag aggacatcag cgggcttcct tctggagaag ttctagagac cgctgcccct
3360ggagtagagg acatcagcgg gcttccttct ggagaagttc tagagaccgc tgcccctgga
3420gtagaggaca tcagcgggct tccttctgga gaagttctag agaccgctgc ccctggagta
3480gaggacatca gcgggcttcc ttctggagaa gttctagaga ccgctgcccc tggagtagag
3540gacatcagcg ggcttccttc tggagaagtt ctagagaccg ctgcccctgg agtagaggac
3600atcagcgggc ttccttctgg agaagttcta gagaccgctg cccctggagt agaggacatc
3660agcgggcttc cttctggaga agttctagag actgctgccc ctggagtaga ggacatcagc
3720gggcttcctt ctggagaagt tctagagact gctgcccctg gagtagagga catcagcggg
3780cttccttctg gagaagttct agagactgct gcccctggag tagaggacat cagcgggctt
3840ccttctggag aagttctaga gactgctgcc cctggagtag aggacatcag cgggcttcct
3900tctggagaag ttctagagac tgctgcccct ggagtagagg acatcagcgg gcttccttct
3960ggagaagttc tagagactgc tgcccctgga gtagaggaca tcagcgggct tccttctgga
4020gaagttctag agactgctgc ccctggagta gaggacatca gcgggcttcc ttctggagaa
4080gttctagaga ctgctgcccc tggagtagag gacatcagcg ggcttccttc tggagaagtt
4140ctagagactg ctgcccctgg agtagaggac atcagcgggc ttccttctgg agaagttcta
4200gagactactg cccctggagt agaggagatc agcgggcttc cttctggaga agttctagag
4260actactgccc ctggagtaga tgagatcagt gggcttcctt ctggagaagt tctagagact
4320actgcccctg gagtagagga gatcagcggg cttccttctg gagaagttct agagacttct
4380acctctgcgg taggggacct cagtggactt ccttctggag gagaagttct agagatttct
4440gtctctggag tagaggacat cagtgggctt ccttctggag aggttgtaga gacttctgcc
4500tctggaatag aggatgtcag tgaacttcct tcaggagaag gtctagagac ctctgcttct
4560ggagtagagg acctcagcag gctcccttct ggagaagaag ttctagagat ttctgcctct
4620ggatttgggg acctcagtgg acttccttct ggaggagaag gtctagagac ctctgcttct
4680gaagtaggga ctgacctcag tgggcttcct tctggaaggg agggtctaga gacttcagct
4740tctggagctg aggacctcag tgggttgcct tctggaaaag aagacttggt ggggtcagct
4800tctggagact tggacttggg caaactgcct tctggaactc taggaagtgg gcaagctcca
4860gaaacaagtg gtcttccctc tggatttagt ggtgagtatt ctggggtgga ccttggaagt
4920ggcccaccct ctggcctgcc tgactttagt ggacttccat ctggattccc aactgtttcc
4980ctagtggatt ctacattggt ggaagtggtc acagcctcca ctgcaagtga actggaaggg
5040aggggaacca ttggcatcag tggtgcagga gaaatatctg gactgccctc cagtgagctg
5100gacattagtg ggagagctag tggactccct tcaggaactg aactcagtgg ccaagcatct
5160gggtctcctg atgtcagtgg ggaaatacct ggactctttg gtgtcagtgg acagccatca
5220gggtttcctg acactagtgg ggaaacatct ggagtgactg agcttagcgg gctgtcctct
5280ggacaaccag gtattagtgg agaagcatct ggagttcttt atggcactag tcaacccttt
5340ggcataactg atctgagtgg agaaacatct ggggtccctg atctcagtgg gcagccttca
5400gggttaccag ggttcagtgg ggcaacatca ggagtccctg acctggtttc tggtaccacg
5460agtggcagcg gtgaatcttc tgggattaca tttgtggaca ccagtttggt tgaagtggcc
5520cctactacat ttaaagaaga agaaggctta gggtctgtgg aactcagtgg cctcccttcc
5580ggagaggcag atctgtcagg caaatctggg atggtggatg tcagtggaca gttttctgga
5640acagtcgatt ccagtgggtt tacatcccag actccggaat tcagtggcct accaagtggc
5700atagctgagg tcagtggaga atcctccaga gctgagattg ggagcagcct gccctcggga
5760gcatattatg gcagtggaac tccatctagt ttccccactg tctctcttgt agacagaact
5820ttggtggaat ctgtaaccca ggctccaaca gcccaagagg caggagaagg gccttctggc
5880attttagaac tcagtggtgc tcattctgga gcaccagaca tgtctgggga gcattctgga
5940tttctggacc taagtgggct gcagtccggg ctgatagagc ccagcggaga gccaccaggt
6000actccatatt ttagtgggga ttttgccagc accaccaatg taagtggaga atcctctgta
6060gccatgggca ccagtggaga ggcctcagga cttccagaag ttactttaat cacttctgag
6120ttcgtggagg gtgttactga accaactatt tctcaggaac taggccaaag gccccctgtg
6180acacacacac cccagctttt tgagtccagt ggaaaagtct ccacagctgg ggacattagt
6240ggagctaccc cagtgctccc tgggtctgga gtagaagtat catcagtccc agaatctagc
6300agtgagacgt ccgcctatcc tgaagctggg ttcggggcat ctgccgcccc tgaggccagc
6360agagaagatt ctgggtcccc tgatctgagt gaaaccacct ctgcattcca cgaagctaac
6420cttgagagat cctctggcct aggagtgagc ggcagcactt tgacatttca agaaggcgag
6480gcgtccgctg ccccagaagt gagtggagaa tccaccacca ccagtgatgt ggggacagag
6540gcaccaggct tgccttcagc cactcccacg gcttctggag acaggactga aatcagcgga
6600gacctgtctg gtcacacctc gcagctgggc gttgtcatca gcaccagcat cccagagtct
6660gagtggaccc agcagaccca gcgccctgca gagacgcatc tagaaattga gtcctcaagc
6720ctcctgtact caggagaaga gactcacaca gtcgaaacag ccacctcccc aacagatgct
6780tccatcccag cttctccgga atggaaacgt gaatcagaat caactgctgc agaccaggag
6840gtatgtgagg agggctggaa caagtaccag ggccactgtt accgccactt cccggaccgc
6900gagacctggg tggatgctga gcgccggtgt cgggagcagc agtcacacct gagcagcatc
6960gtcacccccg aggagcagga gtttgtcaac aacaatgccc aagactacca gtggatcggc
7020ctgaacgaca ggaccatcga aggggacttc cgctggtcag atggacaccc catgcaattt
7080gagaactggc gccccaacca gcctgacaac ttttttgccg ctggagagga ctgtgtggtg
7140atgatctggc acgagaaggg cgagtggaat gatgttccct gcaattacca cctccccttc
7200acgtgtaaaa agggcacagc caccacctac aaacgcagac tacagaagcg gagctcacgg
7260caccctcgga ggagccgccc cagcacagcc cactga
7296251071DNAHomo sapiens 25atgaccactt tactctgggt tttcgtgact ctgagggtca
tcactgcagc tgtcactgta 60gaaacttcag accatgacaa ctcgctgagt gtcagcatcc
cccaaccgtc cccgctgagg 120gtcctcctgg ggacctccct caccatcccc tgctatttca
tcgaccccat gcaccctgtg 180accaccgccc cttctaccgc cccactggcc ccaagaatca
agtggagccg tgtgtccaag 240gagaaggagg tagtgctgct ggtggccact gaagggcgcg
tgcgggtcaa cagtgcctat 300caggacaagg tctcactgcc caactacccg gccatcccca
gtgacgccac cttggaagtc 360cagagcctgc gctccaatga ctctggggtc taccgctgcg
aggtgatgca tggcatcgag 420gacagcgagg ccaccctgga agtcgtggtg aaaggcatcg
tgttccatta cagagccatc 480tctacacgct acaccctcga ctttgacagg gcgcagcggg
cctgcctgca gaacagtgcc 540atcattgcca cgcctgagca gctgcaggcc gcctacgaag
acggcttcca ccagtgtgac 600gccggctggc tggctgacca gactgtcaga taccccatcc
acactccccg ggaaggctgc 660tatggagaca aggatgagtt tcctggtgtg aggacgtatg
gcatccgaga caccaacgag 720acctatgatg tgtactgctt cgccgaggag atggagggtg
aggtctttta tgcaacatct 780ccagagaagt tcaccttcca ggaagcagcc aatgagtgcc
ggcggctggg tgcccggctg 840gccaccacgg gccagctcta cctggcctgg caggctggca
tggacatgtg cagcgccggc 900tggctggccg accgcagcgt gcgctacccc atctccaagg
cccggcccaa ctgcggtggc 960aacctcctgg gcgtgaggac cgtctacgtg catgccaacc
agacgggcta ccccgacccc 1020tcatcccgct acgacgccat ctgctacaca ggtgaagact
ttgtggacat c 1071262431PRTHomo sapiens 26Met Thr Thr Leu Leu
Trp Val Phe Val Thr Leu Arg Val Ile Thr Ala1 5
10 15Ala Val Thr Val Glu Thr Ser Asp His Asp Asn
Ser Leu Ser Val Ser 20 25
30Ile Pro Gln Pro Ser Pro Leu Arg Val Leu Leu Gly Thr Ser Leu Thr
35 40 45Ile Pro Cys Tyr Phe Ile Asp Pro
Met His Pro Val Thr Thr Ala Pro 50 55
60Ser Thr Ala Pro Leu Ala Pro Arg Ile Lys Trp Ser Arg Val Ser Lys65
70 75 80Glu Lys Glu Val Val
Leu Leu Val Ala Thr Glu Gly Arg Val Arg Val 85
90 95Asn Ser Ala Tyr Gln Asp Lys Val Ser Leu Pro
Asn Tyr Pro Ala Ile 100 105
110Pro Ser Asp Ala Thr Leu Glu Val Gln Ser Leu Arg Ser Asn Asp Ser
115 120 125Gly Val Tyr Arg Cys Glu Val
Met His Gly Ile Glu Asp Ser Glu Ala 130 135
140Thr Leu Glu Val Val Val Lys Gly Ile Val Phe His Tyr Arg Ala
Ile145 150 155 160Ser Thr
Arg Tyr Thr Leu Asp Phe Asp Arg Ala Gln Arg Ala Cys Leu
165 170 175Gln Asn Ser Ala Ile Ile Ala
Thr Pro Glu Gln Leu Gln Ala Ala Tyr 180 185
190Glu Asp Gly Phe His Gln Cys Asp Ala Gly Trp Leu Ala Asp
Gln Thr 195 200 205Val Arg Tyr Pro
Ile His Thr Pro Arg Glu Gly Cys Tyr Gly Asp Lys 210
215 220Asp Glu Phe Pro Gly Val Arg Thr Tyr Gly Ile Arg
Asp Thr Asn Glu225 230 235
240Thr Tyr Asp Val Tyr Cys Phe Ala Glu Glu Met Glu Gly Glu Val Phe
245 250 255Tyr Ala Thr Ser Pro
Glu Lys Phe Thr Phe Gln Glu Ala Ala Asn Glu 260
265 270Cys Arg Arg Leu Gly Ala Arg Leu Ala Thr Thr Gly
Gln Leu Tyr Leu 275 280 285Ala Trp
Gln Ala Gly Met Asp Met Cys Ser Ala Gly Trp Leu Ala Asp 290
295 300Arg Ser Val Arg Tyr Pro Ile Ser Lys Ala Arg
Pro Asn Cys Gly Gly305 310 315
320Asn Leu Leu Gly Val Arg Thr Val Tyr Val His Ala Asn Gln Thr Gly
325 330 335Tyr Pro Asp Pro
Ser Ser Arg Tyr Asp Ala Ile Cys Tyr Thr Gly Glu 340
345 350Asp Phe Val Asp Ile Pro Glu Asn Phe Phe Gly
Val Gly Gly Glu Glu 355 360 365Asp
Ile Thr Val Gln Thr Val Thr Trp Pro Asp Met Glu Leu Pro Leu 370
375 380Pro Arg Asn Ile Thr Glu Gly Glu Ala Arg
Gly Ser Val Ile Leu Thr385 390 395
400Val Lys Pro Ile Phe Glu Val Ser Pro Ser Pro Leu Glu Pro Glu
Glu 405 410 415Pro Phe Thr
Phe Ala Pro Glu Ile Gly Ala Thr Ala Phe Ala Glu Val 420
425 430Glu Asn Glu Thr Gly Glu Ala Thr Arg Pro
Trp Gly Phe Pro Thr Pro 435 440
445Gly Leu Gly Pro Ala Thr Ala Phe Thr Ser Glu Asp Leu Val Val Gln 450
455 460Val Thr Ala Val Pro Gly Gln Pro
His Leu Pro Gly Gly Val Val Phe465 470
475 480His Tyr Arg Pro Gly Pro Thr Arg Tyr Ser Leu Thr
Phe Glu Glu Ala 485 490
495Gln Gln Ala Cys Leu Arg Thr Gly Ala Val Ile Ala Ser Pro Glu Gln
500 505 510Leu Gln Ala Ala Tyr Glu
Ala Gly Tyr Glu Gln Cys Asp Ala Gly Trp 515 520
525Leu Arg Asp Gln Thr Val Arg Tyr Pro Ile Val Ser Pro Arg
Thr Pro 530 535 540Cys Val Gly Asp Lys
Asp Ser Ser Pro Gly Val Arg Thr Tyr Gly Val545 550
555 560Arg Pro Ser Thr Glu Thr Tyr Asp Val Tyr
Cys Phe Val Asp Arg Leu 565 570
575Glu Gly Glu Val Phe Phe Ala Thr Arg Leu Glu Gln Phe Thr Phe Gln
580 585 590Glu Ala Leu Glu Phe
Cys Glu Ser His Asn Ala Thr Leu Ala Thr Thr 595
600 605Gly Gln Leu Tyr Ala Ala Trp Ser Arg Gly Leu Asp
Lys Cys Tyr Ala 610 615 620Gly Trp Leu
Ala Asp Gly Ser Leu Arg Tyr Pro Ile Val Thr Pro Arg625
630 635 640Pro Ala Cys Gly Gly Asp Lys
Pro Gly Val Arg Thr Val Tyr Leu Tyr 645
650 655Pro Asn Gln Thr Gly Leu Pro Asp Pro Leu Ser Arg
His His Ala Phe 660 665 670Cys
Phe Arg Gly Ile Ser Ala Val Pro Ser Pro Gly Glu Glu Glu Gly 675
680 685Gly Thr Pro Thr Ser Pro Ser Gly Val
Glu Glu Trp Ile Val Thr Gln 690 695
700Val Val Pro Gly Val Ala Ala Val Pro Val Glu Glu Glu Thr Thr Ala705
710 715 720Val Pro Ser Gly
Glu Thr Thr Ala Ile Leu Glu Phe Thr Thr Glu Pro 725
730 735Glu Asn Gln Thr Glu Trp Glu Pro Ala Tyr
Thr Pro Val Gly Thr Ser 740 745
750Pro Leu Pro Gly Ile Leu Pro Thr Trp Pro Pro Thr Gly Ala Ala Thr
755 760 765Glu Glu Ser Thr Glu Gly Pro
Ser Ala Thr Glu Val Pro Ser Ala Ser 770 775
780Glu Glu Pro Ser Pro Ser Glu Val Pro Phe Pro Ser Glu Glu Pro
Ser785 790 795 800Pro Ser
Glu Glu Pro Phe Pro Ser Val Arg Pro Phe Pro Ser Val Glu
805 810 815Leu Phe Pro Ser Glu Glu Pro
Phe Pro Ser Lys Glu Pro Ser Pro Ser 820 825
830Glu Glu Pro Ser Ala Ser Glu Glu Pro Tyr Thr Pro Ser Pro
Pro Val 835 840 845Pro Ser Trp Thr
Glu Leu Pro Ser Ser Gly Glu Glu Ser Gly Ala Pro 850
855 860Asp Val Ser Gly Asp Phe Thr Gly Ser Gly Asp Val
Ser Gly His Leu865 870 875
880Asp Phe Ser Gly Gln Leu Ser Gly Asp Arg Ala Ser Gly Leu Pro Ser
885 890 895Gly Asp Leu Asp Ser
Ser Gly Leu Thr Ser Thr Val Gly Ser Gly Leu 900
905 910Pro Val Glu Ser Gly Leu Pro Ser Gly Asp Glu Glu
Arg Ile Glu Trp 915 920 925Pro Ser
Thr Pro Thr Val Gly Glu Leu Pro Ser Gly Ala Glu Ile Leu 930
935 940Glu Gly Ser Ala Ser Gly Val Gly Asp Leu Ser
Gly Leu Pro Ser Gly945 950 955
960Glu Val Leu Glu Thr Ser Ala Ser Gly Val Gly Asp Leu Ser Gly Leu
965 970 975Pro Ser Gly Glu
Val Leu Glu Thr Thr Ala Pro Gly Val Glu Asp Ile 980
985 990Ser Gly Leu Pro Ser Gly Glu Val Leu Glu Thr
Thr Ala Pro Gly Val 995 1000
1005Glu Asp Ile Ser Gly Leu Pro Ser Gly Glu Val Leu Glu Thr Thr
1010 1015 1020Ala Pro Gly Val Glu Asp
Ile Ser Gly Leu Pro Ser Gly Glu Val 1025 1030
1035Leu Glu Thr Thr Ala Pro Gly Val Glu Asp Ile Ser Gly Leu
Pro 1040 1045 1050Ser Gly Glu Val Leu
Glu Thr Thr Ala Pro Gly Val Glu Asp Ile 1055 1060
1065Ser Gly Leu Pro Ser Gly Glu Val Leu Glu Thr Thr Ala
Pro Gly 1070 1075 1080Val Glu Asp Ile
Ser Gly Leu Pro Ser Gly Glu Val Leu Glu Thr 1085
1090 1095Ala Ala Pro Gly Val Glu Asp Ile Ser Gly Leu
Pro Ser Gly Glu 1100 1105 1110Val Leu
Glu Thr Ala Ala Pro Gly Val Glu Asp Ile Ser Gly Leu 1115
1120 1125Pro Ser Gly Glu Val Leu Glu Thr Ala Ala
Pro Gly Val Glu Asp 1130 1135 1140Ile
Ser Gly Leu Pro Ser Gly Glu Val Leu Glu Thr Ala Ala Pro 1145
1150 1155Gly Val Glu Asp Ile Ser Gly Leu Pro
Ser Gly Glu Val Leu Glu 1160 1165
1170Thr Ala Ala Pro Gly Val Glu Asp Ile Ser Gly Leu Pro Ser Gly
1175 1180 1185Glu Val Leu Glu Thr Ala
Ala Pro Gly Val Glu Asp Ile Ser Gly 1190 1195
1200Leu Pro Ser Gly Glu Val Leu Glu Thr Ala Ala Pro Gly Val
Glu 1205 1210 1215Asp Ile Ser Gly Leu
Pro Ser Gly Glu Val Leu Glu Thr Ala Ala 1220 1225
1230Pro Gly Val Glu Asp Ile Ser Gly Leu Pro Ser Gly Glu
Val Leu 1235 1240 1245Glu Thr Ala Ala
Pro Gly Val Glu Asp Ile Ser Gly Leu Pro Ser 1250
1255 1260Gly Glu Val Leu Glu Thr Ala Ala Pro Gly Val
Glu Asp Ile Ser 1265 1270 1275Gly Leu
Pro Ser Gly Glu Val Leu Glu Thr Ala Ala Pro Gly Val 1280
1285 1290Glu Asp Ile Ser Gly Leu Pro Ser Gly Glu
Val Leu Glu Thr Ala 1295 1300 1305Ala
Pro Gly Val Glu Asp Ile Ser Gly Leu Pro Ser Gly Glu Val 1310
1315 1320Leu Glu Thr Ala Ala Pro Gly Val Glu
Asp Ile Ser Gly Leu Pro 1325 1330
1335Ser Gly Glu Val Leu Glu Thr Ala Ala Pro Gly Val Glu Asp Ile
1340 1345 1350Ser Gly Leu Pro Ser Gly
Glu Val Leu Glu Thr Ala Ala Pro Gly 1355 1360
1365Val Glu Asp Ile Ser Gly Leu Pro Ser Gly Glu Val Leu Glu
Thr 1370 1375 1380Ala Ala Pro Gly Val
Glu Asp Ile Ser Gly Leu Pro Ser Gly Glu 1385 1390
1395Val Leu Glu Thr Thr Ala Pro Gly Val Glu Glu Ile Ser
Gly Leu 1400 1405 1410Pro Ser Gly Glu
Val Leu Glu Thr Thr Ala Pro Gly Val Asp Glu 1415
1420 1425Ile Ser Gly Leu Pro Ser Gly Glu Val Leu Glu
Thr Thr Ala Pro 1430 1435 1440Gly Val
Glu Glu Ile Ser Gly Leu Pro Ser Gly Glu Val Leu Glu 1445
1450 1455Thr Ser Thr Ser Ala Val Gly Asp Leu Ser
Gly Leu Pro Ser Gly 1460 1465 1470Gly
Glu Val Leu Glu Ile Ser Val Ser Gly Val Glu Asp Ile Ser 1475
1480 1485Gly Leu Pro Ser Gly Glu Val Val Glu
Thr Ser Ala Ser Gly Ile 1490 1495
1500Glu Asp Val Ser Glu Leu Pro Ser Gly Glu Gly Leu Glu Thr Ser
1505 1510 1515Ala Ser Gly Val Glu Asp
Leu Ser Arg Leu Pro Ser Gly Glu Glu 1520 1525
1530Val Leu Glu Ile Ser Ala Ser Gly Phe Gly Asp Leu Ser Gly
Leu 1535 1540 1545Pro Ser Gly Gly Glu
Gly Leu Glu Thr Ser Ala Ser Glu Val Gly 1550 1555
1560Thr Asp Leu Ser Gly Leu Pro Ser Gly Arg Glu Gly Leu
Glu Thr 1565 1570 1575Ser Ala Ser Gly
Ala Glu Asp Leu Ser Gly Leu Pro Ser Gly Lys 1580
1585 1590Glu Asp Leu Val Gly Ser Ala Ser Gly Asp Leu
Asp Leu Gly Lys 1595 1600 1605Leu Pro
Ser Gly Thr Leu Gly Ser Gly Gln Ala Pro Glu Thr Ser 1610
1615 1620Gly Leu Pro Ser Gly Phe Ser Gly Glu Tyr
Ser Gly Val Asp Leu 1625 1630 1635Gly
Ser Gly Pro Pro Ser Gly Leu Pro Asp Phe Ser Gly Leu Pro 1640
1645 1650Ser Gly Phe Pro Thr Val Ser Leu Val
Asp Ser Thr Leu Val Glu 1655 1660
1665Val Val Thr Ala Ser Thr Ala Ser Glu Leu Glu Gly Arg Gly Thr
1670 1675 1680Ile Gly Ile Ser Gly Ala
Gly Glu Ile Ser Gly Leu Pro Ser Ser 1685 1690
1695Glu Leu Asp Ile Ser Gly Arg Ala Ser Gly Leu Pro Ser Gly
Thr 1700 1705 1710Glu Leu Ser Gly Gln
Ala Ser Gly Ser Pro Asp Val Ser Gly Glu 1715 1720
1725Ile Pro Gly Leu Phe Gly Val Ser Gly Gln Pro Ser Gly
Phe Pro 1730 1735 1740Asp Thr Ser Gly
Glu Thr Ser Gly Val Thr Glu Leu Ser Gly Leu 1745
1750 1755Ser Ser Gly Gln Pro Gly Ile Ser Gly Glu Ala
Ser Gly Val Leu 1760 1765 1770Tyr Gly
Thr Ser Gln Pro Phe Gly Ile Thr Asp Leu Ser Gly Glu 1775
1780 1785Thr Ser Gly Val Pro Asp Leu Ser Gly Gln
Pro Ser Gly Leu Pro 1790 1795 1800Gly
Phe Ser Gly Ala Thr Ser Gly Val Pro Asp Leu Val Ser Gly 1805
1810 1815Thr Thr Ser Gly Ser Gly Glu Ser Ser
Gly Ile Thr Phe Val Asp 1820 1825
1830Thr Ser Leu Val Glu Val Ala Pro Thr Thr Phe Lys Glu Glu Glu
1835 1840 1845Gly Leu Gly Ser Val Glu
Leu Ser Gly Leu Pro Ser Gly Glu Ala 1850 1855
1860Asp Leu Ser Gly Lys Ser Gly Met Val Asp Val Ser Gly Gln
Phe 1865 1870 1875Ser Gly Thr Val Asp
Ser Ser Gly Phe Thr Ser Gln Thr Pro Glu 1880 1885
1890Phe Ser Gly Leu Pro Ser Gly Ile Ala Glu Val Ser Gly
Glu Ser 1895 1900 1905Ser Arg Ala Glu
Ile Gly Ser Ser Leu Pro Ser Gly Ala Tyr Tyr 1910
1915 1920Gly Ser Gly Thr Pro Ser Ser Phe Pro Thr Val
Ser Leu Val Asp 1925 1930 1935Arg Thr
Leu Val Glu Ser Val Thr Gln Ala Pro Thr Ala Gln Glu 1940
1945 1950Ala Gly Glu Gly Pro Ser Gly Ile Leu Glu
Leu Ser Gly Ala His 1955 1960 1965Ser
Gly Ala Pro Asp Met Ser Gly Glu His Ser Gly Phe Leu Asp 1970
1975 1980Leu Ser Gly Leu Gln Ser Gly Leu Ile
Glu Pro Ser Gly Glu Pro 1985 1990
1995Pro Gly Thr Pro Tyr Phe Ser Gly Asp Phe Ala Ser Thr Thr Asn
2000 2005 2010Val Ser Gly Glu Ser Ser
Val Ala Met Gly Thr Ser Gly Glu Ala 2015 2020
2025Ser Gly Leu Pro Glu Val Thr Leu Ile Thr Ser Glu Phe Val
Glu 2030 2035 2040Gly Val Thr Glu Pro
Thr Ile Ser Gln Glu Leu Gly Gln Arg Pro 2045 2050
2055Pro Val Thr His Thr Pro Gln Leu Phe Glu Ser Ser Gly
Lys Val 2060 2065 2070Ser Thr Ala Gly
Asp Ile Ser Gly Ala Thr Pro Val Leu Pro Gly 2075
2080 2085Ser Gly Val Glu Val Ser Ser Val Pro Glu Ser
Ser Ser Glu Thr 2090 2095 2100Ser Ala
Tyr Pro Glu Ala Gly Phe Gly Ala Ser Ala Ala Pro Glu 2105
2110 2115Ala Ser Arg Glu Asp Ser Gly Ser Pro Asp
Leu Ser Glu Thr Thr 2120 2125 2130Ser
Ala Phe His Glu Ala Asn Leu Glu Arg Ser Ser Gly Leu Gly 2135
2140 2145Val Ser Gly Ser Thr Leu Thr Phe Gln
Glu Gly Glu Ala Ser Ala 2150 2155
2160Ala Pro Glu Val Ser Gly Glu Ser Thr Thr Thr Ser Asp Val Gly
2165 2170 2175Thr Glu Ala Pro Gly Leu
Pro Ser Ala Thr Pro Thr Ala Ser Gly 2180 2185
2190Asp Arg Thr Glu Ile Ser Gly Asp Leu Ser Gly His Thr Ser
Gln 2195 2200 2205Leu Gly Val Val Ile
Ser Thr Ser Ile Pro Glu Ser Glu Trp Thr 2210 2215
2220Gln Gln Thr Gln Arg Pro Ala Glu Thr His Leu Glu Ile
Glu Ser 2225 2230 2235Ser Ser Leu Leu
Tyr Ser Gly Glu Glu Thr His Thr Val Glu Thr 2240
2245 2250Ala Thr Ser Pro Thr Asp Ala Ser Ile Pro Ala
Ser Pro Glu Trp 2255 2260 2265Lys Arg
Glu Ser Glu Ser Thr Ala Ala Asp Gln Glu Val Cys Glu 2270
2275 2280Glu Gly Trp Asn Lys Tyr Gln Gly His Cys
Tyr Arg His Phe Pro 2285 2290 2295Asp
Arg Glu Thr Trp Val Asp Ala Glu Arg Arg Cys Arg Glu Gln 2300
2305 2310Gln Ser His Leu Ser Ser Ile Val Thr
Pro Glu Glu Gln Glu Phe 2315 2320
2325Val Asn Asn Asn Ala Gln Asp Tyr Gln Trp Ile Gly Leu Asn Asp
2330 2335 2340Arg Thr Ile Glu Gly Asp
Phe Arg Trp Ser Asp Gly His Pro Met 2345 2350
2355Gln Phe Glu Asn Trp Arg Pro Asn Gln Pro Asp Asn Phe Phe
Ala 2360 2365 2370Ala Gly Glu Asp Cys
Val Val Met Ile Trp His Glu Lys Gly Glu 2375 2380
2385Trp Asn Asp Val Pro Cys Asn Tyr His Leu Pro Phe Thr
Cys Lys 2390 2395 2400Lys Gly Thr Ala
Thr Thr Tyr Lys Arg Arg Leu Gln Lys Arg Ser 2405
2410 2415Ser Arg His Pro Arg Arg Ser Arg Pro Ser Thr
Ala His 2420 2425 243027357PRTHomo
sapiens 27Met Thr Thr Leu Leu Trp Val Phe Val Thr Leu Arg Val Ile Thr
Ala1 5 10 15Ala Val Thr
Val Glu Thr Ser Asp His Asp Asn Ser Leu Ser Val Ser 20
25 30Ile Pro Gln Pro Ser Pro Leu Arg Val Leu
Leu Gly Thr Ser Leu Thr 35 40
45Ile Pro Cys Tyr Phe Ile Asp Pro Met His Pro Val Thr Thr Ala Pro 50
55 60Ser Thr Ala Pro Leu Ala Pro Arg Ile
Lys Trp Ser Arg Val Ser Lys65 70 75
80Glu Lys Glu Val Val Leu Leu Val Ala Thr Glu Gly Arg Val
Arg Val 85 90 95Asn Ser
Ala Tyr Gln Asp Lys Val Ser Leu Pro Asn Tyr Pro Ala Ile 100
105 110Pro Ser Asp Ala Thr Leu Glu Val Gln
Ser Leu Arg Ser Asn Asp Ser 115 120
125Gly Val Tyr Arg Cys Glu Val Met His Gly Ile Glu Asp Ser Glu Ala
130 135 140Thr Leu Glu Val Val Val Lys
Gly Ile Val Phe His Tyr Arg Ala Ile145 150
155 160Ser Thr Arg Tyr Thr Leu Asp Phe Asp Arg Ala Gln
Arg Ala Cys Leu 165 170
175Gln Asn Ser Ala Ile Ile Ala Thr Pro Glu Gln Leu Gln Ala Ala Tyr
180 185 190Glu Asp Gly Phe His Gln
Cys Asp Ala Gly Trp Leu Ala Asp Gln Thr 195 200
205Val Arg Tyr Pro Ile His Thr Pro Arg Glu Gly Cys Tyr Gly
Asp Lys 210 215 220Asp Glu Phe Pro Gly
Val Arg Thr Tyr Gly Ile Arg Asp Thr Asn Glu225 230
235 240Thr Tyr Asp Val Tyr Cys Phe Ala Glu Glu
Met Glu Gly Glu Val Phe 245 250
255Tyr Ala Thr Ser Pro Glu Lys Phe Thr Phe Gln Glu Ala Ala Asn Glu
260 265 270Cys Arg Arg Leu Gly
Ala Arg Leu Ala Thr Thr Gly Gln Leu Tyr Leu 275
280 285Ala Trp Gln Ala Gly Met Asp Met Cys Ser Ala Gly
Trp Leu Ala Asp 290 295 300Arg Ser Val
Arg Tyr Pro Ile Ser Lys Ala Arg Pro Asn Cys Gly Gly305
310 315 320Asn Leu Leu Gly Val Arg Thr
Val Tyr Val His Ala Asn Gln Thr Gly 325
330 335Tyr Pro Asp Pro Ser Ser Arg Tyr Asp Ala Ile Cys
Tyr Thr Gly Glu 340 345 350Asp
Phe Val Asp Ile 355281065DNAHomo sapiens 28atgaagagtc tacttcttct
ggtgctgatt tcaatctgct gggctgatca tctttcagac 60aactatactc tggatcatga
cagagctatt cacatccaag cagaaaatgg cccccatcta 120cttgtggaag cagagcaagc
caaggtgttt tcacacagag gtggcaatgt tacactgcca 180tgtaaatttt atcgagaccc
tacagcattt ggctcaggaa tccataaaat ccgaattaag 240tggaccaagc taacttcgga
ttacctcaag gaagtggatg tttttgtttc catgggatac 300cacaaaaaaa cctatggagg
ctaccagggt agagtgtttc tgaagggagg cagtgatagt 360gatgcttctc tggtcatcac
agacctcact ctggaagatt atgggagata taagtgtgag 420gtgattgaag gattagaaga
tgatactgtt gtggtagcac tggacttaca aggtgtggta 480ttcccttact ttccacgact
ggggcgctac aatctcaatt ttcacgaggc gcagcaggcg 540tgtctggacc aggatgctgt
gatcgcctcc ttcgaccagc tgtacgacgc ctggcggggc 600gggctggact ggtgcaatgc
cggctggctc agtgatggct ctgtgcaata tcccatcaca 660aagcccagag agccctgtgg
ggggcagaac acagtgcccg gagtcaggaa ctacggattt 720tgggataaag ataaaagcag
atatgatgtt ttctgtttta catccaattt caatggccgt 780ttttactatc tgatccaccc
caccaaactg acctatgatg aagcggtgca agcttgtctc 840aatgatggtg ctcagattgc
aaaagtgggc cagatatttg ctgcctggaa aattctcgga 900tatgaccgct gtgatgcggg
ctggttggcg gatggcagcg tccgctaccc catctctagg 960ccaagaaggc gctgcagtcc
tactgaggct gcagtgcgct tcgtgggttt cccagataaa 1020aagcataagc tgtatggtgt
ctactgcttc agagcataca actga 106529354PRTHomo sapiens
29Met Lys Ser Leu Leu Leu Leu Val Leu Ile Ser Ile Cys Trp Ala Asp1
5 10 15His Leu Ser Asp Asn Tyr
Thr Leu Asp His Asp Arg Ala Ile His Ile 20 25
30Gln Ala Glu Asn Gly Pro His Leu Leu Val Glu Ala Glu
Gln Ala Lys 35 40 45Val Phe Ser
His Arg Gly Gly Asn Val Thr Leu Pro Cys Lys Phe Tyr 50
55 60Arg Asp Pro Thr Ala Phe Gly Ser Gly Ile His Lys
Ile Arg Ile Lys65 70 75
80Trp Thr Lys Leu Thr Ser Asp Tyr Leu Lys Glu Val Asp Val Phe Val
85 90 95Ser Met Gly Tyr His Lys
Lys Thr Tyr Gly Gly Tyr Gln Gly Arg Val 100
105 110Phe Leu Lys Gly Gly Ser Asp Ser Asp Ala Ser Leu
Val Ile Thr Asp 115 120 125Leu Thr
Leu Glu Asp Tyr Gly Arg Tyr Lys Cys Glu Val Ile Glu Gly 130
135 140Leu Glu Asp Asp Thr Val Val Val Ala Leu Asp
Leu Gln Gly Val Val145 150 155
160Phe Pro Tyr Phe Pro Arg Leu Gly Arg Tyr Asn Leu Asn Phe His Glu
165 170 175Ala Gln Gln Ala
Cys Leu Asp Gln Asp Ala Val Ile Ala Ser Phe Asp 180
185 190Gln Leu Tyr Asp Ala Trp Arg Gly Gly Leu Asp
Trp Cys Asn Ala Gly 195 200 205Trp
Leu Ser Asp Gly Ser Val Gln Tyr Pro Ile Thr Lys Pro Arg Glu 210
215 220Pro Cys Gly Gly Gln Asn Thr Val Pro Gly
Val Arg Asn Tyr Gly Phe225 230 235
240Trp Asp Lys Asp Lys Ser Arg Tyr Asp Val Phe Cys Phe Thr Ser
Asn 245 250 255Phe Asn Gly
Arg Phe Tyr Tyr Leu Ile His Pro Thr Lys Leu Thr Tyr 260
265 270Asp Glu Ala Val Gln Ala Cys Leu Asn Asp
Gly Ala Gln Ile Ala Lys 275 280
285Val Gly Gln Ile Phe Ala Ala Trp Lys Ile Leu Gly Tyr Asp Arg Cys 290
295 300Asp Ala Gly Trp Leu Ala Asp Gly
Ser Val Arg Tyr Pro Ile Ser Arg305 310
315 320Pro Arg Arg Arg Cys Ser Pro Thr Glu Ala Ala Val
Arg Phe Val Gly 325 330
335Phe Pro Asp Lys Lys His Lys Leu Tyr Gly Val Tyr Cys Phe Arg Ala
340 345 350Tyr Asn307230DNAHomo
sapiens 30atgttcataa atataaagag catcttatgg atgtgttcaa ccttaatagt
aacccatgcg 60ctacataaag tcaaagtggg aaaaagccca ccggtgaggg gctccctctc
tggaaaagtc 120agcctacctt gtcatttttc aacgatgcct actttgccac ccagttacaa
caccagtgaa 180tttctccgca tcaaatggtc taagattgaa gtggacaaaa atggaaaaga
tttgaaagag 240actactgtcc ttgtggccca aaatggaaat atcaagattg gtcaggacta
caaagggaga 300gtgtctgtgc ccacacatcc cgaggctgtg ggcgatgcct ccctcactgt
ggtcaagctg 360ctggcaagtg atgcgggtct ttaccgctgt gacgtcatgt acgggattga
agacacacaa 420gacacggtgt cactgactgt ggatggggtt gtgtttcact acagggcggc
aaccagcagg 480tacacactga attttgaggc tgctcagaag gcttgtttgg acgttggggc
agtcatagca 540actccagagc agctctttgc tgcctatgaa gatggatttg agcagtgtga
cgcaggctgg 600ctggctgatc agactgtcag atatcccatc cgggctccca gagtaggctg
ttatggagat 660aagatgggaa aggcaggagt caggacttat ggattccgtt ctccccagga
aacttacgat 720gtgtattgtt atgtggatca tctggatggt gatgtgttcc acctcactgt
ccccagtaaa 780ttcaccttcg aggaggctgc aaaagagtgt gaaaaccagg atgccaggct
ggcaacagtg 840ggggaactcc aggcggcatg gaggaacggc tttgaccagt gcgattacgg
gtggctgtcg 900gatgccagcg tgcgccaccc tgtgactgtg gccagggccc agtgtggagg
tggtctactt 960ggggtgagaa ccctgtatcg ttttgagaac cagacaggct tccctccccc
tgatagcaga 1020tttgatgcct actgctttaa acgtcgaatg agtgatttga gtgtaattgg
tcatccaata 1080gattcagaat ctaaagaaga tgaaccttgt agtgaagaaa cagatccagt
gcatgatcta 1140atggctgaaa ttttacctga attccctgac ataattgaaa tagacctata
ccacagtgaa 1200gaaaatgaag aagaagaaga agagtgtgca aatgctactg atgtgacaac
caccccatct 1260gtgcagtaca taaatgggaa gcatctcgtt accactgtgc ccaaggaccc
agaagctgca 1320gaagctaggc gtggccagtt tgaaagtgtt gcaccttctc agaatttctc
ggacagctct 1380gaaagtgata ctcatccatt tgtaatagcc aaaacggaat tgtctactgc
tgtgcaacct 1440aatgaatcta cagaaacaac tgagtctctt gaagttacat ggaagcctga
gacttaccct 1500gaaacatcag aacatttttc aggtggtgag cctgatgttt tccccacagt
cccattccat 1560gaggaatttg aaagtggaac agccaaaaaa ggggcagaat cagtcacaga
gagagatact 1620gaagttggtc atcaggcaca tgaacatact gaacctgtat ctctgtttcc
tgaagagtct 1680tcaggagaga ttgccattga ccaagaatct cagaaaatag cctttgcaag
ggctacagaa 1740gtaacatttg gtgaagaggt agaaaaaagt acttctgtca catacactcc
cactatagtt 1800ccaagttctg catcagcata tgtttcagag gaagaagcag ttaccctaat
aggaaatcct 1860tggccagatg acctgttgtc taccaaagaa agctgggtag aagcaactcc
tagacaagtt 1920gtagagctct cagggagttc ttcgattcca attacagaag gctctggaga
agcagaagaa 1980gatgaagata caatgttcac catggtaact gatttatcac agagaaatac
tactgataca 2040ctcattactt tagacactag caggataatc acagaaagct tttttgaggt
tcctgcaacc 2100accatttatc cagtttctga acaaccttct gcaaaagtgg tgcctaccaa
gtttgtaagt 2160gaaacagaca cttctgagtg gatttccagt accactgttg aggaaaagaa
aaggaaggag 2220gaggagggaa ctacaggtac ggcttctaca tttgaggtat attcatctac
acagagatcg 2280gatcaattaa ttttaccctt tgaattagaa agtccaaatg tagctacatc
tagtgattca 2340ggtaccagga aaagttttat gtccttgaca acaccaacac agtctgaaag
ggaaatgaca 2400gattctactc ctgtctttac agaaacaaat acattagaaa atttgggggc
acagaccact 2460gagcacagca gtatccatca acctggggtt caggaagggc tgaccactct
cccacgtagt 2520cctgcctctg tctttatgga gcagggctct ggagaagctg ctgccgaccc
agaaaccacc 2580actgtttctt cattttcatt aaacgtagag tatgcaattc aagccgaaaa
ggaagtagct 2640ggcactttgt ctccgcatgt ggaaactaca ttctccactg agccaacagg
actggttttg 2700agtacagtaa tggacagagt agttgctgaa aatataaccc aaacatccag
ggaaatagtg 2760atttcagagc gattaggaga accaaattat ggggcagaaa taaggggctt
ttccacaggt 2820tttcctttgg aggaagattt cagtggtgac tttagagaat actcaacagt
gtctcatccc 2880atagcaaaag aagaaacggt aatgatggaa ggctctggag atgcagcatt
tagggacacc 2940cagacttcac catctacagt acctacttca gttcacatca gtcacatatc
tgactcagaa 3000ggacccagta gcaccatggt cagcacttca gccttcccct gggaagagtt
tacatcctca 3060gctgagggct caggtgagca actggtcaca gtcagcagct ctgttgttcc
agtgcttccc 3120agtgctgtgc aaaagttttc tggtacagct tcctccatta tcgacgaagg
attgggagaa 3180gtgggtactg tcaatgaaat tgatagaaga tccaccattt taccaacagc
agaagtggaa 3240ggtacgaaag ctccagtaga gaaggaggaa gtaaaggtca gtggcacagt
ttcaacaaac 3300tttccccaaa ctatagagcc agccaaatta tggtctaggc aagaagtcaa
ccctgtaaga 3360caagaaattg aaagtgaaac aacatcagag gaacaaattc aagaagaaaa
gtcatttgaa 3420tcccctcaaa actctcctgc aacagaacaa acaatctttg attcacagac
atttactgaa 3480actgaactca aaaccacaga ttattctgta ctaacaacaa agaaaactta
cagtgatgat 3540aaagaaatga aggaggaaga cacttcttta gttaacatgt ctactccaga
tccagatgca 3600aatggcttgg aatcttacac aactctccct gaagctactg aaaagtcaca
ttttttctta 3660gctactgcat tagtaactga atctatacca gctgaacatg tagtcacaga
ttcaccaatc 3720aaaaaggaag aaagtacaaa acattttccg aaaggcatga gaccaacaat
tcaagagtca 3780gatactgagc tcttattctc tggactggga tcaggagaag aagttttacc
tactctacca 3840acagagtcag tgaattttac tgaagtggaa caaatcaata acacattata
tccccacact 3900tctcaagtgg aaagtacctc aagtgacaaa attgaagact ttaacagaat
ggaaaatgtg 3960gcaaaagaag ttggaccact cgtatctcaa acagacatct ttgaaggtag
tgggtcagta 4020accagcacaa cattaataga aattttaagt gacactggag cagaaggacc
cacggtggca 4080cctctccctt tctccacgga catcggacat cctcaaaatc agactgtcag
gtgggcagaa 4140gaaatccaga ctagtagacc acaaaccata actgaacaag actctaacaa
gaattcttca 4200acagcagaaa ttaacgaaac aacaacctca tctactgatt ttctggctag
agcttatggt 4260tttgaaatgg ccaaagaatt tgttacatca gcaccaaaac catctgactt
gtattatgaa 4320ccttctggag aaggatctgg agaagtggat attgttgatt catttcacac
ttctgcaact 4380actcaggcaa ccagacaaga aagcagcacc acatttgttt ctgatgggtc
cctggaaaaa 4440catcctgagg tgccaagcgc taaagctgtt actgctgatg gattcccaac
agtttcagtg 4500atgctgcctc ttcattcaga gcagaacaaa agctcccctg atccaactag
cacactgtca 4560aatacagtgt catatgagag gtccacagac ggtagtttcc aagaccgttt
cagggaattc 4620gaggattcca ccttaaaacc taacagaaaa aaacccactg aaaatattat
catagacctg 4680gacaaagagg acaaggattt aatattgaca attacagaga gtaccatcct
tgaaattcta 4740cctgagctga catcggataa aaatactatc atagatattg atcatactaa
acctgtgtat 4800gaagacattc ttggaatgca aacagatata gatacagagg taccatcaga
accacatgac 4860agtaatgatg aaagtaatga tgacagcact caagttcaag agatctatga
ggcagctgtc 4920aacctttctt taactgagga aacatttgag ggctctgctg atgttctggc
tagctacact 4980caggcaacac atgatgaatc aatgacttat gaagatagaa gccaactaga
tcacatgggc 5040tttcacttca caactgggat ccctgctcct agcacagaaa cagaattaga
cgttttactt 5100cccacggcaa catccctgcc aattcctcgt aagtctgcca cagttattcc
agagattgaa 5160ggaataaaag ctgaagcaaa agccctggat gacatgtttg aatcaagcac
tttgtctgat 5220ggtcaagcta ttgcagacca aagtgaaata ataccaacat tgggccaatt
tgaaaggact 5280caggaggagt atgaagacaa aaaacatgct ggtccttctt ttcagccaga
attctcttca 5340ggagctgagg aggcattagt agaccatact ccctatctaa gtattgctac
tacccacctt 5400atggatcaga gtgtaacaga ggtgcctgat gtgatggaag gatccaatcc
cccatattac 5460actgatacaa cattagcagt ttcaacattt gcgaagttgt cttctcagac
accatcatct 5520cccctcacta tctactcagg cagtgaagcc tctggacaca cagagatccc
ccagcccagt 5580gctctgccag gaatagacgt cggctcatct gtaatgtccc cacaggattc
ttttaaggaa 5640attcatgtaa atattgaagc gactttcaaa ccatcaagtg aggaatacct
tcacataact 5700gagcctccct ctttatctcc tgacacaaaa ttagaacctt cagaagatga
tggtaaacct 5760gagttattag aagaaatgga agcttctccc acagaactta ttgctgtgga
aggaactgag 5820attctccaag atttccaaaa caaaaccgat ggtcaagttt ctggagaagc
aatcaagatg 5880tttcccacca ttaaaacacc tgaggctgga actgttatta caactgccga
tgaaattgaa 5940ttagaaggtg ctacacagtg gccacactct acttctgctt ctgccaccta
tggggtcgag 6000gcaggtgtgg tgccttggct aagtccacag acttctgaga ggcccacgct
ttcttcttct 6060ccagaaataa accctgaaac tcaagcagct ttaatcagag ggcaggattc
cacgatagca 6120gcatcagaac agcaagtggc agcgagaatt cttgattcca atgatcaggc
aacagtaaac 6180cctgtggaat ttaatactga ggttgcaaca ccaccatttt cccttctgga
gacttctaat 6240gaaacagatt tcctgattgg cattaatgaa gagtcagtgg aaggcacggc
aatctattta 6300ccaggacctg atcgctgcaa aatgaacccg tgccttaacg gaggcacctg
ttatcctact 6360gaaacttcct acgtatgcac ctgtgtgcca ggatacagcg gagaccagtg
tgaacttgat 6420tttgatgaat gtcactctaa tccctgtcgt aatggagcca cttgtgttga
tggttttaac 6480acattcaggt gcctctgcct tccaagttat gttggtgcac tttgtgagca
agataccgag 6540acatgtgact atggctggca caaattccaa gggcagtgct acaaatactt
tgcccatcga 6600cgcacatggg atgcagctga acgggaatgc cgtctgcagg gtgcccatct
cacaagcatc 6660ctgtctcacg aagaacaaat gtttgttaat cgtgtgggcc atgattatca
gtggataggc 6720ctcaatgaca agatgtttga gcatgacttc cgttggactg atggcagcac
actgcaatac 6780gagaattgga gacccaacca gccagacagc ttcttttctg ctggagaaga
ctgtgttgta 6840atcatttggc atgagaatgg ccagtggaat gatgttccct gcaattacca
tctcacctat 6900acgtgcaaga aaggaacagt cgcttgcggc cagccccctg ttgtagaaaa
tgccaagacc 6960tttggaaaga tgaaacctcg ttatgaaatc aactccctga ttagatacca
ctgcaaagat 7020ggtttcattc aacgtcacct tccaactatc cggtgcttag gaaatggaag
atgggctata 7080cctaaaatta cctgcatgaa cccatctgca taccaaagga cttattctat
gaaatacttt 7140aaaaattcct catcagcaaa ggacaattca ataaatacat ccaaacatga
tcatcgttgg 7200agccggaggt ggcaggagtc gaggcgctga
7230311089DNAHomo sapiens 31atgttcataa atataaagag catcttatgg
atgtgttcaa ccttaatagt aacccatgcg 60ctacataaag tcaaagtggg aaaaagccca
ccggtgaggg gctccctctc tggaaaagtc 120agcctacctt gtcatttttc aacgatgcct
actttgccac ccagttacaa caccagtgaa 180tttctccgca tcaaatggtc taagattgaa
gtggacaaaa atggaaaaga tttgaaagag 240actactgtcc ttgtggccca aaatggaaat
atcaagattg gtcaggacta caaagggaga 300gtgtctgtgc ccacacatcc cgaggctgtg
ggcgatgcct ccctcactgt ggtcaagctg 360ctggcaagtg atgcgggtct ttaccgctgt
gacgtcatgt acgggattga agacacacaa 420gacacggtgt cactgactgt ggatggggtt
gtgtttcact acagggcggc aaccagcagg 480tacacactga attttgaggc tgctcagaag
gcttgtttgg acgttggggc agtcatagca 540actccagagc agctctttgc tgcctatgaa
gatggatttg agcagtgtga cgcaggctgg 600ctggctgatc agactgtcag atatcccatc
cgggctccca gagtaggctg ttatggagat 660aagatgggaa aggcaggagt caggacttat
ggattccgtt ctccccagga aacttacgat 720gtgtattgtt atgtggatca tctggatggt
gatgtgttcc acctcactgt ccccagtaaa 780ttcaccttcg aggaggctgc aaaagagtgt
gaaaaccagg atgccaggct ggcaacagtg 840ggggaactcc aggcggcatg gaggaacggc
tttgaccagt gcgattacgg gtggctgtcg 900gatgccagcg tgcgccaccc tgtgactgtg
gccagggccc agtgtggagg tggtctactt 960ggggtgagaa ccctgtatcg ttttgagaac
cagacaggct tccctccccc tgatagcaga 1020tttgatgcct actgctttaa acgtcgaatg
agtgatttga gtgtaattgg tcatccaata 1080gattcagaa
1089322409PRTHomo sapiens 32Met Phe Ile
Asn Ile Lys Ser Ile Leu Trp Met Cys Ser Thr Leu Ile1 5
10 15Val Thr His Ala Leu His Lys Val Lys
Val Gly Lys Ser Pro Pro Val 20 25
30Arg Gly Ser Leu Ser Gly Lys Val Ser Leu Pro Cys His Phe Ser Thr
35 40 45Met Pro Thr Leu Pro Pro Ser
Tyr Asn Thr Ser Glu Phe Leu Arg Ile 50 55
60Lys Trp Ser Lys Ile Glu Val Asp Lys Asn Gly Lys Asp Leu Lys Glu65
70 75 80Thr Thr Val Leu
Val Ala Gln Asn Gly Asn Ile Lys Ile Gly Gln Asp 85
90 95Tyr Lys Gly Arg Val Ser Val Pro Thr His
Pro Glu Ala Val Gly Asp 100 105
110Ala Ser Leu Thr Val Val Lys Leu Leu Ala Ser Asp Ala Gly Leu Tyr
115 120 125Arg Cys Asp Val Met Tyr Gly
Ile Glu Asp Thr Gln Asp Thr Val Ser 130 135
140Leu Thr Val Asp Gly Val Val Phe His Tyr Arg Ala Ala Thr Ser
Arg145 150 155 160Tyr Thr
Leu Asn Phe Glu Ala Ala Gln Lys Ala Cys Leu Asp Val Gly
165 170 175Ala Val Ile Ala Thr Pro Glu
Gln Leu Phe Ala Ala Tyr Glu Asp Gly 180 185
190Phe Glu Gln Cys Asp Ala Gly Trp Leu Ala Asp Gln Thr Val
Arg Tyr 195 200 205Pro Ile Arg Ala
Pro Arg Val Gly Cys Tyr Gly Asp Lys Met Gly Lys 210
215 220Ala Gly Val Arg Thr Tyr Gly Phe Arg Ser Pro Gln
Glu Thr Tyr Asp225 230 235
240Val Tyr Cys Tyr Val Asp His Leu Asp Gly Asp Val Phe His Leu Thr
245 250 255Val Pro Ser Lys Phe
Thr Phe Glu Glu Ala Ala Lys Glu Cys Glu Asn 260
265 270Gln Asp Ala Arg Leu Ala Thr Val Gly Glu Leu Gln
Ala Ala Trp Arg 275 280 285Asn Gly
Phe Asp Gln Cys Asp Tyr Gly Trp Leu Ser Asp Ala Ser Val 290
295 300Arg His Pro Val Thr Val Ala Arg Ala Gln Cys
Gly Gly Gly Leu Leu305 310 315
320Gly Val Arg Thr Leu Tyr Arg Phe Glu Asn Gln Thr Gly Phe Pro Pro
325 330 335Pro Asp Ser Arg
Phe Asp Ala Tyr Cys Phe Lys Arg Arg Met Ser Asp 340
345 350Leu Ser Val Ile Gly His Pro Ile Asp Ser Glu
Ser Lys Glu Asp Glu 355 360 365Pro
Cys Ser Glu Glu Thr Asp Pro Val His Asp Leu Met Ala Glu Ile 370
375 380Leu Pro Glu Phe Pro Asp Ile Ile Glu Ile
Asp Leu Tyr His Ser Glu385 390 395
400Glu Asn Glu Glu Glu Glu Glu Glu Cys Ala Asn Ala Thr Asp Val
Thr 405 410 415Thr Thr Pro
Ser Val Gln Tyr Ile Asn Gly Lys His Leu Val Thr Thr 420
425 430Val Pro Lys Asp Pro Glu Ala Ala Glu Ala
Arg Arg Gly Gln Phe Glu 435 440
445Ser Val Ala Pro Ser Gln Asn Phe Ser Asp Ser Ser Glu Ser Asp Thr 450
455 460His Pro Phe Val Ile Ala Lys Thr
Glu Leu Ser Thr Ala Val Gln Pro465 470
475 480Asn Glu Ser Thr Glu Thr Thr Glu Ser Leu Glu Val
Thr Trp Lys Pro 485 490
495Glu Thr Tyr Pro Glu Thr Ser Glu His Phe Ser Gly Gly Glu Pro Asp
500 505 510Val Phe Pro Thr Val Pro
Phe His Glu Glu Phe Glu Ser Gly Thr Ala 515 520
525Lys Lys Gly Ala Glu Ser Val Thr Glu Arg Asp Thr Glu Val
Gly His 530 535 540Gln Ala His Glu His
Thr Glu Pro Val Ser Leu Phe Pro Glu Glu Ser545 550
555 560Ser Gly Glu Ile Ala Ile Asp Gln Glu Ser
Gln Lys Ile Ala Phe Ala 565 570
575Arg Ala Thr Glu Val Thr Phe Gly Glu Glu Val Glu Lys Ser Thr Ser
580 585 590Val Thr Tyr Thr Pro
Thr Ile Val Pro Ser Ser Ala Ser Ala Tyr Val 595
600 605Ser Glu Glu Glu Ala Val Thr Leu Ile Gly Asn Pro
Trp Pro Asp Asp 610 615 620Leu Leu Ser
Thr Lys Glu Ser Trp Val Glu Ala Thr Pro Arg Gln Val625
630 635 640Val Glu Leu Ser Gly Ser Ser
Ser Ile Pro Ile Thr Glu Gly Ser Gly 645
650 655Glu Ala Glu Glu Asp Glu Asp Thr Met Phe Thr Met
Val Thr Asp Leu 660 665 670Ser
Gln Arg Asn Thr Thr Asp Thr Leu Ile Thr Leu Asp Thr Ser Arg 675
680 685Ile Ile Thr Glu Ser Phe Phe Glu Val
Pro Ala Thr Thr Ile Tyr Pro 690 695
700Val Ser Glu Gln Pro Ser Ala Lys Val Val Pro Thr Lys Phe Val Ser705
710 715 720Glu Thr Asp Thr
Ser Glu Trp Ile Ser Ser Thr Thr Val Glu Glu Lys 725
730 735Lys Arg Lys Glu Glu Glu Gly Thr Thr Gly
Thr Ala Ser Thr Phe Glu 740 745
750Val Tyr Ser Ser Thr Gln Arg Ser Asp Gln Leu Ile Leu Pro Phe Glu
755 760 765Leu Glu Ser Pro Asn Val Ala
Thr Ser Ser Asp Ser Gly Thr Arg Lys 770 775
780Ser Phe Met Ser Leu Thr Thr Pro Thr Gln Ser Glu Arg Glu Met
Thr785 790 795 800Asp Ser
Thr Pro Val Phe Thr Glu Thr Asn Thr Leu Glu Asn Leu Gly
805 810 815Ala Gln Thr Thr Glu His Ser
Ser Ile His Gln Pro Gly Val Gln Glu 820 825
830Gly Leu Thr Thr Leu Pro Arg Ser Pro Ala Ser Val Phe Met
Glu Gln 835 840 845Gly Ser Gly Glu
Ala Ala Ala Asp Pro Glu Thr Thr Thr Val Ser Ser 850
855 860Phe Ser Leu Asn Val Glu Tyr Ala Ile Gln Ala Glu
Lys Glu Val Ala865 870 875
880Gly Thr Leu Ser Pro His Val Glu Thr Thr Phe Ser Thr Glu Pro Thr
885 890 895Gly Leu Val Leu Ser
Thr Val Met Asp Arg Val Val Ala Glu Asn Ile 900
905 910Thr Gln Thr Ser Arg Glu Ile Val Ile Ser Glu Arg
Leu Gly Glu Pro 915 920 925Asn Tyr
Gly Ala Glu Ile Arg Gly Phe Ser Thr Gly Phe Pro Leu Glu 930
935 940Glu Asp Phe Ser Gly Asp Phe Arg Glu Tyr Ser
Thr Val Ser His Pro945 950 955
960Ile Ala Lys Glu Glu Thr Val Met Met Glu Gly Ser Gly Asp Ala Ala
965 970 975Phe Arg Asp Thr
Gln Thr Ser Pro Ser Thr Val Pro Thr Ser Val His 980
985 990Ile Ser His Ile Ser Asp Ser Glu Gly Pro Ser
Ser Thr Met Val Ser 995 1000
1005Thr Ser Ala Phe Pro Trp Glu Glu Phe Thr Ser Ser Ala Glu Gly
1010 1015 1020Ser Gly Glu Gln Leu Val
Thr Val Ser Ser Ser Val Val Pro Val 1025 1030
1035Leu Pro Ser Ala Val Gln Lys Phe Ser Gly Thr Ala Ser Ser
Ile 1040 1045 1050Ile Asp Glu Gly Leu
Gly Glu Val Gly Thr Val Asn Glu Ile Asp 1055 1060
1065Arg Arg Ser Thr Ile Leu Pro Thr Ala Glu Val Glu Gly
Thr Lys 1070 1075 1080Ala Pro Val Glu
Lys Glu Glu Val Lys Val Ser Gly Thr Val Ser 1085
1090 1095Thr Asn Phe Pro Gln Thr Ile Glu Pro Ala Lys
Leu Trp Ser Arg 1100 1105 1110Gln Glu
Val Asn Pro Val Arg Gln Glu Ile Glu Ser Glu Thr Thr 1115
1120 1125Ser Glu Glu Gln Ile Gln Glu Glu Lys Ser
Phe Glu Ser Pro Gln 1130 1135 1140Asn
Ser Pro Ala Thr Glu Gln Thr Ile Phe Asp Ser Gln Thr Phe 1145
1150 1155Thr Glu Thr Glu Leu Lys Thr Thr Asp
Tyr Ser Val Leu Thr Thr 1160 1165
1170Lys Lys Thr Tyr Ser Asp Asp Lys Glu Met Lys Glu Glu Asp Thr
1175 1180 1185Ser Leu Val Asn Met Ser
Thr Pro Asp Pro Asp Ala Asn Gly Leu 1190 1195
1200Glu Ser Tyr Thr Thr Leu Pro Glu Ala Thr Glu Lys Ser His
Phe 1205 1210 1215Phe Leu Ala Thr Ala
Leu Val Thr Glu Ser Ile Pro Ala Glu His 1220 1225
1230Val Val Thr Asp Ser Pro Ile Lys Lys Glu Glu Ser Thr
Lys His 1235 1240 1245Phe Pro Lys Gly
Met Arg Pro Thr Ile Gln Glu Ser Asp Thr Glu 1250
1255 1260Leu Leu Phe Ser Gly Leu Gly Ser Gly Glu Glu
Val Leu Pro Thr 1265 1270 1275Leu Pro
Thr Glu Ser Val Asn Phe Thr Glu Val Glu Gln Ile Asn 1280
1285 1290Asn Thr Leu Tyr Pro His Thr Ser Gln Val
Glu Ser Thr Ser Ser 1295 1300 1305Asp
Lys Ile Glu Asp Phe Asn Arg Met Glu Asn Val Ala Lys Glu 1310
1315 1320Val Gly Pro Leu Val Ser Gln Thr Asp
Ile Phe Glu Gly Ser Gly 1325 1330
1335Ser Val Thr Ser Thr Thr Leu Ile Glu Ile Leu Ser Asp Thr Gly
1340 1345 1350Ala Glu Gly Pro Thr Val
Ala Pro Leu Pro Phe Ser Thr Asp Ile 1355 1360
1365Gly His Pro Gln Asn Gln Thr Val Arg Trp Ala Glu Glu Ile
Gln 1370 1375 1380Thr Ser Arg Pro Gln
Thr Ile Thr Glu Gln Asp Ser Asn Lys Asn 1385 1390
1395Ser Ser Thr Ala Glu Ile Asn Glu Thr Thr Thr Ser Ser
Thr Asp 1400 1405 1410Phe Leu Ala Arg
Ala Tyr Gly Phe Glu Met Ala Lys Glu Phe Val 1415
1420 1425Thr Ser Ala Pro Lys Pro Ser Asp Leu Tyr Tyr
Glu Pro Ser Gly 1430 1435 1440Glu Gly
Ser Gly Glu Val Asp Ile Val Asp Ser Phe His Thr Ser 1445
1450 1455Ala Thr Thr Gln Ala Thr Arg Gln Glu Ser
Ser Thr Thr Phe Val 1460 1465 1470Ser
Asp Gly Ser Leu Glu Lys His Pro Glu Val Pro Ser Ala Lys 1475
1480 1485Ala Val Thr Ala Asp Gly Phe Pro Thr
Val Ser Val Met Leu Pro 1490 1495
1500Leu His Ser Glu Gln Asn Lys Ser Ser Pro Asp Pro Thr Ser Thr
1505 1510 1515Leu Ser Asn Thr Val Ser
Tyr Glu Arg Ser Thr Asp Gly Ser Phe 1520 1525
1530Gln Asp Arg Phe Arg Glu Phe Glu Asp Ser Thr Leu Lys Pro
Asn 1535 1540 1545Arg Lys Lys Pro Thr
Glu Asn Ile Ile Ile Asp Leu Asp Lys Glu 1550 1555
1560Asp Lys Asp Leu Ile Leu Thr Ile Thr Glu Ser Thr Ile
Leu Glu 1565 1570 1575Ile Leu Pro Glu
Leu Thr Ser Asp Lys Asn Thr Ile Ile Asp Ile 1580
1585 1590Asp His Thr Lys Pro Val Tyr Glu Asp Ile Leu
Gly Met Gln Thr 1595 1600 1605Asp Ile
Asp Thr Glu Val Pro Ser Glu Pro His Asp Ser Asn Asp 1610
1615 1620Glu Ser Asn Asp Asp Ser Thr Gln Val Gln
Glu Ile Tyr Glu Ala 1625 1630 1635Ala
Val Asn Leu Ser Leu Thr Glu Glu Thr Phe Glu Gly Ser Ala 1640
1645 1650Asp Val Leu Ala Ser Tyr Thr Gln Ala
Thr His Asp Glu Ser Met 1655 1660
1665Thr Tyr Glu Asp Arg Ser Gln Leu Asp His Met Gly Phe His Phe
1670 1675 1680Thr Thr Gly Ile Pro Ala
Pro Ser Thr Glu Thr Glu Leu Asp Val 1685 1690
1695Leu Leu Pro Thr Ala Thr Ser Leu Pro Ile Pro Arg Lys Ser
Ala 1700 1705 1710Thr Val Ile Pro Glu
Ile Glu Gly Ile Lys Ala Glu Ala Lys Ala 1715 1720
1725Leu Asp Asp Met Phe Glu Ser Ser Thr Leu Ser Asp Gly
Gln Ala 1730 1735 1740Ile Ala Asp Gln
Ser Glu Ile Ile Pro Thr Leu Gly Gln Phe Glu 1745
1750 1755Arg Thr Gln Glu Glu Tyr Glu Asp Lys Lys His
Ala Gly Pro Ser 1760 1765 1770Phe Gln
Pro Glu Phe Ser Ser Gly Ala Glu Glu Ala Leu Val Asp 1775
1780 1785His Thr Pro Tyr Leu Ser Ile Ala Thr Thr
His Leu Met Asp Gln 1790 1795 1800Ser
Val Thr Glu Val Pro Asp Val Met Glu Gly Ser Asn Pro Pro 1805
1810 1815Tyr Tyr Thr Asp Thr Thr Leu Ala Val
Ser Thr Phe Ala Lys Leu 1820 1825
1830Ser Ser Gln Thr Pro Ser Ser Pro Leu Thr Ile Tyr Ser Gly Ser
1835 1840 1845Glu Ala Ser Gly His Thr
Glu Ile Pro Gln Pro Ser Ala Leu Pro 1850 1855
1860Gly Ile Asp Val Gly Ser Ser Val Met Ser Pro Gln Asp Ser
Phe 1865 1870 1875Lys Glu Ile His Val
Asn Ile Glu Ala Thr Phe Lys Pro Ser Ser 1880 1885
1890Glu Glu Tyr Leu His Ile Thr Glu Pro Pro Ser Leu Ser
Pro Asp 1895 1900 1905Thr Lys Leu Glu
Pro Ser Glu Asp Asp Gly Lys Pro Glu Leu Leu 1910
1915 1920Glu Glu Met Glu Ala Ser Pro Thr Glu Leu Ile
Ala Val Glu Gly 1925 1930 1935Thr Glu
Ile Leu Gln Asp Phe Gln Asn Lys Thr Asp Gly Gln Val 1940
1945 1950Ser Gly Glu Ala Ile Lys Met Phe Pro Thr
Ile Lys Thr Pro Glu 1955 1960 1965Ala
Gly Thr Val Ile Thr Thr Ala Asp Glu Ile Glu Leu Glu Gly 1970
1975 1980Ala Thr Gln Trp Pro His Ser Thr Ser
Ala Ser Ala Thr Tyr Gly 1985 1990
1995Val Glu Ala Gly Val Val Pro Trp Leu Ser Pro Gln Thr Ser Glu
2000 2005 2010Arg Pro Thr Leu Ser Ser
Ser Pro Glu Ile Asn Pro Glu Thr Gln 2015 2020
2025Ala Ala Leu Ile Arg Gly Gln Asp Ser Thr Ile Ala Ala Ser
Glu 2030 2035 2040Gln Gln Val Ala Ala
Arg Ile Leu Asp Ser Asn Asp Gln Ala Thr 2045 2050
2055Val Asn Pro Val Glu Phe Asn Thr Glu Val Ala Thr Pro
Pro Phe 2060 2065 2070Ser Leu Leu Glu
Thr Ser Asn Glu Thr Asp Phe Leu Ile Gly Ile 2075
2080 2085Asn Glu Glu Ser Val Glu Gly Thr Ala Ile Tyr
Leu Pro Gly Pro 2090 2095 2100Asp Arg
Cys Lys Met Asn Pro Cys Leu Asn Gly Gly Thr Cys Tyr 2105
2110 2115Pro Thr Glu Thr Ser Tyr Val Cys Thr Cys
Val Pro Gly Tyr Ser 2120 2125 2130Gly
Asp Gln Cys Glu Leu Asp Phe Asp Glu Cys His Ser Asn Pro 2135
2140 2145Cys Arg Asn Gly Ala Thr Cys Val Asp
Gly Phe Asn Thr Phe Arg 2150 2155
2160Cys Leu Cys Leu Pro Ser Tyr Val Gly Ala Leu Cys Glu Gln Asp
2165 2170 2175Thr Glu Thr Cys Asp Tyr
Gly Trp His Lys Phe Gln Gly Gln Cys 2180 2185
2190Tyr Lys Tyr Phe Ala His Arg Arg Thr Trp Asp Ala Ala Glu
Arg 2195 2200 2205Glu Cys Arg Leu Gln
Gly Ala His Leu Thr Ser Ile Leu Ser His 2210 2215
2220Glu Glu Gln Met Phe Val Asn Arg Val Gly His Asp Tyr
Gln Trp 2225 2230 2235Ile Gly Leu Asn
Asp Lys Met Phe Glu His Asp Phe Arg Trp Thr 2240
2245 2250Asp Gly Ser Thr Leu Gln Tyr Glu Asn Trp Arg
Pro Asn Gln Pro 2255 2260 2265Asp Ser
Phe Phe Ser Ala Gly Glu Asp Cys Val Val Ile Ile Trp 2270
2275 2280His Glu Asn Gly Gln Trp Asn Asp Val Pro
Cys Asn Tyr His Leu 2285 2290 2295Thr
Tyr Thr Cys Lys Lys Gly Thr Val Ala Cys Gly Gln Pro Pro 2300
2305 2310Val Val Glu Asn Ala Lys Thr Phe Gly
Lys Met Lys Pro Arg Tyr 2315 2320
2325Glu Ile Asn Ser Leu Ile Arg Tyr His Cys Lys Asp Gly Phe Ile
2330 2335 2340Gln Arg His Leu Pro Thr
Ile Arg Cys Leu Gly Asn Gly Arg Trp 2345 2350
2355Ala Ile Pro Lys Ile Thr Cys Met Asn Pro Ser Ala Tyr Gln
Arg 2360 2365 2370Thr Tyr Ser Met Lys
Tyr Phe Lys Asn Ser Ser Ser Ala Lys Asp 2375 2380
2385Asn Ser Ile Asn Thr Ser Lys His Asp His Arg Trp Ser
Arg Arg 2390 2395 2400Trp Gln Glu Ser
Arg Arg 240533363PRTHomo sapiens 33Met Phe Ile Asn Ile Lys Ser Ile Leu
Trp Met Cys Ser Thr Leu Ile1 5 10
15Val Thr His Ala Leu His Lys Val Lys Val Gly Lys Ser Pro Pro
Val 20 25 30Arg Gly Ser Leu
Ser Gly Lys Val Ser Leu Pro Cys His Phe Ser Thr 35
40 45Met Pro Thr Leu Pro Pro Ser Tyr Asn Thr Ser Glu
Phe Leu Arg Ile 50 55 60Lys Trp Ser
Lys Ile Glu Val Asp Lys Asn Gly Lys Asp Leu Lys Glu65 70
75 80Thr Thr Val Leu Val Ala Gln Asn
Gly Asn Ile Lys Ile Gly Gln Asp 85 90
95Tyr Lys Gly Arg Val Ser Val Pro Thr His Pro Glu Ala Val
Gly Asp 100 105 110Ala Ser Leu
Thr Val Val Lys Leu Leu Ala Ser Asp Ala Gly Leu Tyr 115
120 125Arg Cys Asp Val Met Tyr Gly Ile Glu Asp Thr
Gln Asp Thr Val Ser 130 135 140Leu Thr
Val Asp Gly Val Val Phe His Tyr Arg Ala Ala Thr Ser Arg145
150 155 160Tyr Thr Leu Asn Phe Glu Ala
Ala Gln Lys Ala Cys Leu Asp Val Gly 165
170 175Ala Val Ile Ala Thr Pro Glu Gln Leu Phe Ala Ala
Tyr Glu Asp Gly 180 185 190Phe
Glu Gln Cys Asp Ala Gly Trp Leu Ala Asp Gln Thr Val Arg Tyr 195
200 205Pro Ile Arg Ala Pro Arg Val Gly Cys
Tyr Gly Asp Lys Met Gly Lys 210 215
220Ala Gly Val Arg Thr Tyr Gly Phe Arg Ser Pro Gln Glu Thr Tyr Asp225
230 235 240Val Tyr Cys Tyr
Val Asp His Leu Asp Gly Asp Val Phe His Leu Thr 245
250 255Val Pro Ser Lys Phe Thr Phe Glu Glu Ala
Ala Lys Glu Cys Glu Asn 260 265
270Gln Asp Ala Arg Leu Ala Thr Val Gly Glu Leu Gln Ala Ala Trp Arg
275 280 285Asn Gly Phe Asp Gln Cys Asp
Tyr Gly Trp Leu Ser Asp Ala Ser Val 290 295
300Arg His Pro Val Thr Val Ala Arg Ala Gln Cys Gly Gly Gly Leu
Leu305 310 315 320Gly Val
Arg Thr Leu Tyr Arg Phe Glu Asn Gln Thr Gly Phe Pro Pro
325 330 335Pro Asp Ser Arg Phe Asp Ala
Tyr Cys Phe Lys Arg Arg Met Ser Asp 340 345
350Leu Ser Val Ile Gly His Pro Ile Asp Ser Glu 355
36034462DNAHomo sapiens 34atgggaaaaa tcagcagtct tccaacccaa
ttatttaagt gctgcttttg tgatttcttg 60aaggtgaaga tgcacaccat gtcctcctcg
catctcttct acctggcgct gtgcctgctc 120accttcacca gctctgccac ggctggaccg
gagacgctct gcggggctga gctggtggat 180gctcttcagt tcgtgtgtgg agacaggggc
ttttatttca acaagcccac agggtatggc 240tccagcagtc ggagggcgcc tcagacaggc
atcgtggatg agtgctgctt ccggagctgt 300gatctaagga ggctggagat gtattgcgca
cccctcaagc ctgccaagtc agctcgctct 360gtccgtgccc agcgccacac cgacatgccc
aagacccaga aggaagtaca tttgaagaac 420gcaagtagag ggagtgcagg aaacaagaac
tacaggatgt ag 46235210DNAHomo sapiens 35ggaccggaga
cgctctgcgg ggctgagctg gtggatgctc ttcagttcgt gtgtggagac 60aggggctttt
atttcaacaa gcccacaggg tatggctcca gcagtcggag ggcgcctcag 120acaggcatcg
tggatgagtg ctgcttccgg agctgtgatc taaggaggct ggagatgtat 180tgcgcacccc
tcaagcctgc caagtcagct 21036144DNAHomo
sapiens 36atgggaaaaa tcagcagtct tccaacccaa ttatttaagt gctgcttttg
tgatttcttg 60aaggtgaaga tgcacaccat gtcctcctcg catctcttct acctggcgct
gtgcctgctc 120accttcacca gctctgccac ggct
14437108DNAHomo sapiens 37cgctctgtcc gtgcccagcg ccacaccgac
atgcccaaga cccagaagga agtacatttg 60aagaacgcaa gtagagggag tgcaggaaac
aagaactaca ggatgtag 10838153PRTHomo sapiens 38Met Gly Lys
Ile Ser Ser Leu Pro Thr Gln Leu Phe Lys Cys Cys Phe1 5
10 15Cys Asp Phe Leu Lys Val Lys Met His
Thr Met Ser Ser Ser His Leu 20 25
30Phe Tyr Leu Ala Leu Cys Leu Leu Thr Phe Thr Ser Ser Ala Thr Ala
35 40 45Gly Pro Glu Thr Leu Cys Gly
Ala Glu Leu Val Asp Ala Leu Gln Phe 50 55
60Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly65
70 75 80Ser Ser Ser Arg
Arg Ala Pro Gln Thr Gly Ile Val Asp Glu Cys Cys 85
90 95Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu
Met Tyr Cys Ala Pro Leu 100 105
110Lys Pro Ala Lys Ser Ala Arg Ser Val Arg Ala Gln Arg His Thr Asp
115 120 125Met Pro Lys Thr Gln Lys Glu
Val His Leu Lys Asn Ala Ser Arg Gly 130 135
140Ser Ala Gly Asn Lys Asn Tyr Arg Met145
1503970PRTHomo sapiens 39Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val Asp
Ala Leu Gln Phe1 5 10
15Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly
20 25 30Ser Ser Ser Arg Arg Ala Pro
Gln Thr Gly Ile Val Asp Glu Cys Cys 35 40
45Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro
Leu 50 55 60Lys Pro Ala Lys Ser
Ala65 704048PRTHomo sapiens 40Met Gly Lys Ile Ser Ser
Leu Pro Thr Gln Leu Phe Lys Cys Cys Phe1 5
10 15Cys Asp Phe Leu Lys Val Lys Met His Thr Met Ser
Ser Ser His Leu 20 25 30Phe
Tyr Leu Ala Leu Cys Leu Leu Thr Phe Thr Ser Ser Ala Thr Ala 35
40 454135PRTHomo sapiens 41Arg Ser Val Arg
Ala Gln Arg His Thr Asp Met Pro Lys Thr Gln Lys1 5
10 15Glu Val His Leu Lys Asn Ala Ser Arg Gly
Ser Ala Gly Asn Lys Asn 20 25
30Tyr Arg Met 35421389DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 42atggccactt tactctgggt
tttcgtgact ctgagggtca tcactgcagc tgtcactgta 60gaaacttcag accatgacaa
ctcgctgagt gtcagcatcc cccaaccgtc cccgctgagg 120gtcctcctgg ggacctccct
caccatcccc tgctatttca tcgaccccat gcaccctgtg 180accaccgccc cttctaccgc
cccactggcc ccaagaatca agtggagccg tgtgtccaag 240gagaaggagg tagtgctgct
ggtggccact gaagggcgcg tgcgggtcaa cagtgcctat 300caggacaagg tctcactgcc
caactacccg gccatcccca gtgacgccac cttggaagtc 360cagagcctgc gctccaatga
ctctggggtc taccgctgcg aggtgatgca tggcatcgag 420gacagcgagg ccaccctgga
agtcgtggtg aaaggcatcg tgttccatta cagagccatc 480tctacacgct acaccctcga
ctttgacagg gcgcagcggg cctgcctgca gaacagtgcc 540atcattgcca cgcctgagca
gctgcaggcc gcctacgaag acggcttcca ccagtgtgac 600gccggctggc tggctgacca
gactgtcaga taccccatcc acactccccg ggaaggctgc 660tatggagaca aggatgagtt
tcctggtgtg aggacgtatg gcatccgaga caccaacgag 720acctatgatg tgtactgctt
cgccgaggag atggagggtg aggtctttta tgcaacatct 780ccagagaagt tcaccttcca
ggaagcagcc aatgagtgcc ggcggctggg tgcccggctg 840gccaccacgg gccagctcta
cctggcctgg caggctggca tggacatgtg cagcgccggc 900tggctggccg accgcagcgt
gcgctacccc atctccaagg cccggcccaa ctgcggtggc 960aacctcctgg gcgtgaggac
cgtctacgtg catgccaacc agacgggcta ccccgacccc 1020tcatcccgct acgacgccat
ctgctacaca ggtgaagact ttgtggacat cgggcccgag 1080acgctctgcg gggctgagct
ggtggatgct cttcagttcg tgtgtggaga caggggcttt 1140tatttcaaca agcccacagg
gtatggctcc agcagtcgga gggcgcctca gacaggcatc 1200gtggatgagt gctgcttccg
gagctgtgat ctaaggaggc tggagatgta ttgcgcaccc 1260ctcaagcctg ccaagtcagc
tcgctctgtc cgtgcccagc gccacaccga catgcccaag 1320acccagaagg aagtacattt
gaagaacgca agtagaggga gtgcaggaaa caagaactac 1380aggatgtag
138943318DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
43gggcccgaga cgctctgcgg ggctgagctg gtggatgctc ttcagttcgt gtgtggagac
60aggggctttt atttcaacaa gcccacaggg tatggctcca gcagtcggag ggcgcctcag
120acaggcatcg tggatgagtg ctgcttccgg agctgtgatc taaggaggct ggagatgtat
180tgcgcacccc tcaagcctgc caagtcagct cgctctgtcc gtgcccagcg ccacaccgac
240atgcccaaga cccagaagga agtacatttg aagaacgcaa gtagagggag tgcaggaaac
300aagaactaca ggatgtag
31844462PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 44Met Ala Thr Leu Leu Trp Val Phe Val Thr Leu
Arg Val Ile Thr Ala1 5 10
15Ala Val Thr Val Glu Thr Ser Asp His Asp Asn Ser Leu Ser Val Ser
20 25 30Ile Pro Gln Pro Ser Pro Leu
Arg Val Leu Leu Gly Thr Ser Leu Thr 35 40
45Ile Pro Cys Tyr Phe Ile Asp Pro Met His Pro Val Thr Thr Ala
Pro 50 55 60Ser Thr Ala Pro Leu Ala
Pro Arg Ile Lys Trp Ser Arg Val Ser Lys65 70
75 80Glu Lys Glu Val Val Leu Leu Val Ala Thr Glu
Gly Arg Val Arg Val 85 90
95Asn Ser Ala Tyr Gln Asp Lys Val Ser Leu Pro Asn Tyr Pro Ala Ile
100 105 110Pro Ser Asp Ala Thr Leu
Glu Val Gln Ser Leu Arg Ser Asn Asp Ser 115 120
125Gly Val Tyr Arg Cys Glu Val Met His Gly Ile Glu Asp Ser
Glu Ala 130 135 140Thr Leu Glu Val Val
Val Lys Gly Ile Val Phe His Tyr Arg Ala Ile145 150
155 160Ser Thr Arg Tyr Thr Leu Asp Phe Asp Arg
Ala Gln Arg Ala Cys Leu 165 170
175Gln Asn Ser Ala Ile Ile Ala Thr Pro Glu Gln Leu Gln Ala Ala Tyr
180 185 190Glu Asp Gly Phe His
Gln Cys Asp Ala Gly Trp Leu Ala Asp Gln Thr 195
200 205Val Arg Tyr Pro Ile His Thr Pro Arg Glu Gly Cys
Tyr Gly Asp Lys 210 215 220Asp Glu Phe
Pro Gly Val Arg Thr Tyr Gly Ile Arg Asp Thr Asn Glu225
230 235 240Thr Tyr Asp Val Tyr Cys Phe
Ala Glu Glu Met Glu Gly Glu Val Phe 245
250 255Tyr Ala Thr Ser Pro Glu Lys Phe Thr Phe Gln Glu
Ala Ala Asn Glu 260 265 270Cys
Arg Arg Leu Gly Ala Arg Leu Ala Thr Thr Gly Gln Leu Tyr Leu 275
280 285Ala Trp Gln Ala Gly Met Asp Met Cys
Ser Ala Gly Trp Leu Ala Asp 290 295
300Arg Ser Val Arg Tyr Pro Ile Ser Lys Ala Arg Pro Asn Cys Gly Gly305
310 315 320Asn Leu Leu Gly
Val Arg Thr Val Tyr Val His Ala Asn Gln Thr Gly 325
330 335Tyr Pro Asp Pro Ser Ser Arg Tyr Asp Ala
Ile Cys Tyr Thr Gly Glu 340 345
350Asp Phe Val Asp Ile Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val
355 360 365Asp Ala Leu Gln Phe Val Cys
Gly Asp Arg Gly Phe Tyr Phe Asn Lys 370 375
380Pro Thr Gly Tyr Gly Ser Ser Ser Arg Arg Ala Pro Gln Thr Gly
Ile385 390 395 400Val Asp
Glu Cys Cys Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met
405 410 415Tyr Cys Ala Pro Leu Lys Pro
Ala Lys Ser Ala Arg Ser Val Arg Ala 420 425
430Gln Arg His Thr Asp Met Pro Lys Thr Gln Lys Glu Val His
Leu Lys 435 440 445Asn Ala Ser Arg
Gly Ser Ala Gly Asn Lys Asn Tyr Arg Met 450 455
46045105PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 45Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val
Asp Ala Leu Gln Phe1 5 10
15Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly
20 25 30Ser Ser Ser Arg Arg Ala Pro
Gln Thr Gly Ile Val Asp Glu Cys Cys 35 40
45Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro
Leu 50 55 60Lys Pro Ala Lys Ser Ala
Arg Ser Val Arg Ala Gln Arg His Thr Asp65 70
75 80Met Pro Lys Thr Gln Lys Glu Val His Leu Lys
Asn Ala Ser Arg Gly 85 90
95Ser Ala Gly Asn Lys Asn Tyr Arg Met 100
105461380DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 46atggagagtc tacttcttct ggtgctgatt
tcaatctgct gggctgatca tctttcagac 60aactatactc tggatcatga cagagctatt
cacatccaag cagaaaatgg cccccatcta 120cttgtggaag cagagcaagc caaggtgttt
tcacacagag gtggcaatgt tacactgcca 180tgtaaatttt atcgagaccc tacagcattt
ggctcaggaa tccataaaat ccgaattaag 240tggaccaagc taacttcgga ttacctcaag
gaagtggatg tttttgtttc catgggatac 300cacaaaaaaa cctatggagg ctaccagggt
agagtgtttc tgaagggagg cagtgatagt 360gatgcttctc tggtcatcac agacctcact
ctggaagatt atgggagata taagtgtgag 420gtgattgaag gattagaaga tgatactgtt
gtggtagcac tggacttaca aggtgtggta 480ttcccttact ttccacgact ggggcgctac
aatctcaatt ttcacgaggc gcagcaggcg 540tgtctggacc aggatgctgt gatcgcctcc
ttcgaccagc tgtacgacgc ctggcggggc 600gggctggact ggtgcaatgc cggctggctc
agtgatggct ctgtgcaata tcccatcaca 660aagcccagag agccctgtgg ggggcagaac
acagtgcccg gagtcaggaa ctacggattt 720tgggataaag ataaaagcag atatgatgtt
ttctgtttta catccaattt caatggccgt 780ttttactatc tgatccaccc caccaaactg
acctatgatg aagcggtgca agcttgtctc 840aatgatggtg ctcagattgc aaaagtgggc
cagatatttg ctgcctggaa aattctcgga 900tatgaccgct gtgatgcggg ctggttggcg
gatggcagcg tccgctaccc catctctagg 960ccaagaaggc gctgcagtcc tactgaggct
gcagtgcgct tcgtgggttt cccagataaa 1020aagcataagc tgtatggtgt ctactgcttc
agagcataca acgggcccga gacgctctgc 1080ggggctgagc tggtggatgc tcttcagttc
gtgtgtggag acaggggctt ttatttcaac 1140aagcccacag ggtatggctc cagcagtcgg
agggcgcctc agacaggcat cgtggatgag 1200tgctgcttcc ggagctgtga tctaaggagg
ctggagatgt attgcgcacc cctcaagcct 1260gccaagtcag ctcgctctgt ccgtgcccag
cgccacaccg acatgcccaa gacccagaag 1320gaagtacatt tgaagaacgc aagtagaggg
agtgcaggaa acaagaacta caggatgtag 138047318DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
47gggcccgaga cgctctgcgg ggctgagctg gtggatgctc ttcagttcgt gtgtggagac
60aggggctttt atttcaacaa gcccacaggg tatggctcca gcagtcggag ggcgcctcag
120acaggcatcg tggatgagtg ctgcttccgg agctgtgatc taaggaggct ggagatgtat
180tgcgcacccc tcaagcctgc caagtcagct cgctctgtcc gtgcccagcg ccacaccgac
240atgcccaaga cccagaagga agtacatttg aagaacgcaa gtagagggag tgcaggaaac
300aagaactaca ggatgtag
31848459PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 48Met Glu Ser Leu Leu Leu Leu Val Leu Ile Ser
Ile Cys Trp Ala Asp1 5 10
15His Leu Ser Asp Asn Tyr Thr Leu Asp His Asp Arg Ala Ile His Ile
20 25 30Gln Ala Glu Asn Gly Pro His
Leu Leu Val Glu Ala Glu Gln Ala Lys 35 40
45Val Phe Ser His Arg Gly Gly Asn Val Thr Leu Pro Cys Lys Phe
Tyr 50 55 60Arg Asp Pro Thr Ala Phe
Gly Ser Gly Ile His Lys Ile Arg Ile Lys65 70
75 80Trp Thr Lys Leu Thr Ser Asp Tyr Leu Lys Glu
Val Asp Val Phe Val 85 90
95Ser Met Gly Tyr His Lys Lys Thr Tyr Gly Gly Tyr Gln Gly Arg Val
100 105 110Phe Leu Lys Gly Gly Ser
Asp Ser Asp Ala Ser Leu Val Ile Thr Asp 115 120
125Leu Thr Leu Glu Asp Tyr Gly Arg Tyr Lys Cys Glu Val Ile
Glu Gly 130 135 140Leu Glu Asp Asp Thr
Val Val Val Ala Leu Asp Leu Gln Gly Val Val145 150
155 160Phe Pro Tyr Phe Pro Arg Leu Gly Arg Tyr
Asn Leu Asn Phe His Glu 165 170
175Ala Gln Gln Ala Cys Leu Asp Gln Asp Ala Val Ile Ala Ser Phe Asp
180 185 190Gln Leu Tyr Asp Ala
Trp Arg Gly Gly Leu Asp Trp Cys Asn Ala Gly 195
200 205Trp Leu Ser Asp Gly Ser Val Gln Tyr Pro Ile Thr
Lys Pro Arg Glu 210 215 220Pro Cys Gly
Gly Gln Asn Thr Val Pro Gly Val Arg Asn Tyr Gly Phe225
230 235 240Trp Asp Lys Asp Lys Ser Arg
Tyr Asp Val Phe Cys Phe Thr Ser Asn 245
250 255Phe Asn Gly Arg Phe Tyr Tyr Leu Ile His Pro Thr
Lys Leu Thr Tyr 260 265 270Asp
Glu Ala Val Gln Ala Cys Leu Asn Asp Gly Ala Gln Ile Ala Lys 275
280 285Val Gly Gln Ile Phe Ala Ala Trp Lys
Ile Leu Gly Tyr Asp Arg Cys 290 295
300Asp Ala Gly Trp Leu Ala Asp Gly Ser Val Arg Tyr Pro Ile Ser Arg305
310 315 320Pro Arg Arg Arg
Cys Ser Pro Thr Glu Ala Ala Val Arg Phe Val Gly 325
330 335Phe Pro Asp Lys Lys His Lys Leu Tyr Gly
Val Tyr Cys Phe Arg Ala 340 345
350Tyr Asn Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val Asp Ala Leu
355 360 365Gln Phe Val Cys Gly Asp Arg
Gly Phe Tyr Phe Asn Lys Pro Thr Gly 370 375
380Tyr Gly Ser Ser Ser Arg Arg Ala Pro Gln Thr Gly Ile Val Asp
Glu385 390 395 400Cys Cys
Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala
405 410 415Pro Leu Lys Pro Ala Lys Ser
Ala Arg Ser Val Arg Ala Gln Arg His 420 425
430Thr Asp Met Pro Lys Thr Gln Lys Glu Val His Leu Lys Asn
Ala Ser 435 440 445Arg Gly Ser Ala
Gly Asn Lys Asn Tyr Arg Met 450 45549105PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
49Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val Asp Ala Leu Gln Phe1
5 10 15Val Cys Gly Asp Arg Gly
Phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly 20 25
30Ser Ser Ser Arg Arg Ala Pro Gln Thr Gly Ile Val Asp
Glu Cys Cys 35 40 45Phe Arg Ser
Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro Leu 50
55 60Lys Pro Ala Lys Ser Ala Arg Ser Val Arg Ala Gln
Arg His Thr Asp65 70 75
80Met Pro Lys Thr Gln Lys Glu Val His Leu Lys Asn Ala Ser Arg Gly
85 90 95Ser Ala Gly Asn Lys Asn
Tyr Arg Met 100 105501407DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
50atggtcataa atataaagag catcttatgg atgtgttcaa ccttaatagt aacccatgcg
60ctacataaag tcaaagtggg aaaaagccca ccggtgaggg gctccctctc tggaaaagtc
120agcctacctt gtcatttttc aacgatgcct actttgccac ccagttacaa caccagtgaa
180tttctccgca tcaaatggtc taagattgaa gtggacaaaa atggaaaaga tttgaaagag
240actactgtcc ttgtggccca aaatggaaat atcaagattg gtcaggacta caaagggaga
300gtgtctgtgc ccacacatcc cgaggctgtg ggcgatgcct ccctcactgt ggtcaagctg
360ctggcaagtg atgcgggtct ttaccgctgt gacgtcatgt acgggattga agacacacaa
420gacacggtgt cactgactgt ggatggggtt gtgtttcact acagggcggc aaccagcagg
480tacacactga attttgaggc tgctcagaag gcttgtttgg acgttggggc agtcatagca
540actccagagc agctctttgc tgcctatgaa gatggatttg agcagtgtga cgcaggctgg
600ctggctgatc agactgtcag atatcccatc cgggctccca gagtaggctg ttatggagat
660aagatgggaa aggcaggagt caggacttat ggattccgtt ctccccagga aacttacgat
720gtgtattgtt atgtggatca tctggatggt gatgtgttcc acctcactgt ccccagtaaa
780ttcaccttcg aggaggctgc aaaagagtgt gaaaaccagg atgccaggct ggcaacagtg
840ggggaactcc aggcggcatg gaggaacggc tttgaccagt gcgattacgg gtggctgtcg
900gatgccagcg tgcgccaccc tgtgactgtg gccagggccc agtgtggagg tggtctactt
960ggggtgagaa ccctgtatcg ttttgagaac cagacaggct tccctccccc tgatagcaga
1020tttgatgcct actgctttaa acgtcgaatg agtgatttga gtgtaattgg tcatccaatt
1080gattcagaag gaccggagac gctctgcggg gctgagctgg tggatgctct tcagttcgtg
1140tgtggagaca ggggctttta tttcaacaag cccacagggt atggctccag cagtcggagg
1200gcgcctcaga caggcatcgt ggatgagtgc tgcttccgga gctgtgatct aaggaggctg
1260gagatgtatt gcgcacccct caagcctgcc aagtcagctc gctctgtccg tgcccagcgc
1320cacaccgaca tgcccaagac ccagaaggaa gtacatttga agaacgcaag tagagggagt
1380gcaggaaaca agaactacag gatgtag
140751318DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 51ggaccggaga cgctctgcgg ggctgagctg
gtggatgctc ttcagttcgt gtgtggagac 60aggggctttt atttcaacaa gcccacaggg
tatggctcca gcagtcggag ggcgcctcag 120acaggcatcg tggatgagtg ctgcttccgg
agctgtgatc taaggaggct ggagatgtat 180tgcgcacccc tcaagcctgc caagtcagct
cgctctgtcc gtgcccagcg ccacaccgac 240atgcccaaga cccagaagga agtacatttg
aagaacgcaa gtagagggag tgcaggaaac 300aagaactaca ggatgtag
31852468PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
52Met Val Ile Asn Ile Lys Ser Ile Leu Trp Met Cys Ser Thr Leu Ile1
5 10 15Val Thr His Ala Leu His
Lys Val Lys Val Gly Lys Ser Pro Pro Val 20 25
30Arg Gly Ser Leu Ser Gly Lys Val Ser Leu Pro Cys His
Phe Ser Thr 35 40 45Met Pro Thr
Leu Pro Pro Ser Tyr Asn Thr Ser Glu Phe Leu Arg Ile 50
55 60Lys Trp Ser Lys Ile Glu Val Asp Lys Asn Gly Lys
Asp Leu Lys Glu65 70 75
80Thr Thr Val Leu Val Ala Gln Asn Gly Asn Ile Lys Ile Gly Gln Asp
85 90 95Tyr Lys Gly Arg Val Ser
Val Pro Thr His Pro Glu Ala Val Gly Asp 100
105 110Ala Ser Leu Thr Val Val Lys Leu Leu Ala Ser Asp
Ala Gly Leu Tyr 115 120 125Arg Cys
Asp Val Met Tyr Gly Ile Glu Asp Thr Gln Asp Thr Val Ser 130
135 140Leu Thr Val Asp Gly Val Val Phe His Tyr Arg
Ala Ala Thr Ser Arg145 150 155
160Tyr Thr Leu Asn Phe Glu Ala Ala Gln Lys Ala Cys Leu Asp Val Gly
165 170 175Ala Val Ile Ala
Thr Pro Glu Gln Leu Phe Ala Ala Tyr Glu Asp Gly 180
185 190Phe Glu Gln Cys Asp Ala Gly Trp Leu Ala Asp
Gln Thr Val Arg Tyr 195 200 205Pro
Ile Arg Ala Pro Arg Val Gly Cys Tyr Gly Asp Lys Met Gly Lys 210
215 220Ala Gly Val Arg Thr Tyr Gly Phe Arg Ser
Pro Gln Glu Thr Tyr Asp225 230 235
240Val Tyr Cys Tyr Val Asp His Leu Asp Gly Asp Val Phe His Leu
Thr 245 250 255Val Pro Ser
Lys Phe Thr Phe Glu Glu Ala Ala Lys Glu Cys Glu Asn 260
265 270Gln Asp Ala Arg Leu Ala Thr Val Gly Glu
Leu Gln Ala Ala Trp Arg 275 280
285Asn Gly Phe Asp Gln Cys Asp Tyr Gly Trp Leu Ser Asp Ala Ser Val 290
295 300Arg His Pro Val Thr Val Ala Arg
Ala Gln Cys Gly Gly Gly Leu Leu305 310
315 320Gly Val Arg Thr Leu Tyr Arg Phe Glu Asn Gln Thr
Gly Phe Pro Pro 325 330
335Pro Asp Ser Arg Phe Asp Ala Tyr Cys Phe Lys Arg Arg Met Ser Asp
340 345 350Leu Ser Val Ile Gly His
Pro Ile Asp Ser Glu Gly Pro Glu Thr Leu 355 360
365Cys Gly Ala Glu Leu Val Asp Ala Leu Gln Phe Val Cys Gly
Asp Arg 370 375 380Gly Phe Tyr Phe Asn
Lys Pro Thr Gly Tyr Gly Ser Ser Ser Arg Arg385 390
395 400Ala Pro Gln Thr Gly Ile Val Asp Glu Cys
Cys Phe Arg Ser Cys Asp 405 410
415Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro Leu Lys Pro Ala Lys Ser
420 425 430Ala Arg Ser Val Arg
Ala Gln Arg His Thr Asp Met Pro Lys Thr Gln 435
440 445Lys Glu Val His Leu Lys Asn Ala Ser Arg Gly Ser
Ala Gly Asn Lys 450 455 460Asn Tyr Arg
Met46553105PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 53Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val
Asp Ala Leu Gln Phe1 5 10
15Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly
20 25 30Ser Ser Ser Arg Arg Ala Pro
Gln Thr Gly Ile Val Asp Glu Cys Cys 35 40
45Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro
Leu 50 55 60Lys Pro Ala Lys Ser Ala
Arg Ser Val Arg Ala Gln Arg His Thr Asp65 70
75 80Met Pro Lys Thr Gln Lys Glu Val His Leu Lys
Asn Ala Ser Arg Gly 85 90
95Ser Ala Gly Asn Lys Asn Tyr Arg Met 100
105541086DNAHomo sapiens 54atggacaagt tttggtggca cgcagcctgg ggactctgcc
tcgtgccgct gagcctggcg 60cagatcgatt tgaatataac ctgccgcttt gcaggtgtat
tccacgtgga gaaaaatggt 120cgctacagca tctctcggac ggaggccgct gacctctgca
aggctttcaa tagcaccttg 180cccacaatgg cccagatgga gaaagctctg agcatcggat
ttgagacctg caggtatggg 240ttcatagaag ggcacgtggt gattccccgg atccacccca
actccatctg tgcagcaaac 300aacacagggg tgtacatcct cacatccaac acctcccagt
atgacacata ttgcttcaat 360gcttcagctc cacctgaaga agattgtaca tcagtcacag
acctgcccaa tgcctttgat 420ggaccaatta ccataactat tgttaaccgt gatggcaccc
gctatgtcca gaaaggagaa 480tacagaacga atcctgaaga catctacccc agcaacccta
ctgatgatga cgtgagcagc 540ggctcctcca gtgaaaggag cagcacttca ggaggttaca
tcttttacac cttttctact 600gtacacccca tcccagacga agacagtccc tggatcaccg
acagcacaga cagaatccct 660gctaccagag accaagacac attccacccc agtggggggt
cccataccac tcatggatct 720gaatcagatg gacactcaca tgggagtcaa gaaggtggag
caaacacaac ctctggtcct 780ataaggacac cccaaattcc agaatggctg atcatcttgg
catccctctt ggccttggct 840ttgattcttg cagtttgcat tgcagtcaac agtcgaagaa
ggtgtgggca gaagaaaaag 900ctagtgatca acagtggcaa tggagctgtg gaggacagaa
agccaagtgg actcaacgga 960gaggccagca agtctcagga aatggtgcat ttggtgaaca
aggagtcgtc agaaactcca 1020gaccagttta tgacagctga tgagacaagg aacctgcaga
atgtggacat gaagattggg 1080gtgtaa
108655804DNAHomo sapiens 55atggacaagt tttggtggca
cgcagcctgg ggactctgcc tcgtgccgct gagcctggcg 60cagatcgatt tgaatataac
ctgccgcttt gcaggtgtat tccacgtgga gaaaaatggt 120cgctacagca tctctcggac
ggaggccgct gacctctgca aggctttcaa tagcaccttg 180cccacaatgg cccagatgga
gaaagctctg agcatcggat ttgagacctg caggtatggg 240ttcatagaag ggcacgtggt
gattccccgg atccacccca actccatctg tgcagcaaac 300aacacagggg tgtacatcct
cacatccaac acctcccagt atgacacata ttgcttcaat 360gcttcagctc cacctgaaga
agattgtaca tcagtcacag acctgcccaa tgcctttgat 420ggaccaatta ccataactat
tgttaaccgt gatggcaccc gctatgtcca gaaaggagaa 480tacagaacga atcctgaaga
catctacccc agcaacccta ctgatgatga cgtgagcagc 540ggctcctcca gtgaaaggag
cagcacttca ggaggttaca tcttttacac cttttctact 600gtacacccca tcccagacga
agacagtccc tggatcaccg acagcacaga cagaatccct 660gctaccagag accaagacac
attccacccc agtggggggt cccataccac tcatggatct 720gaatcagatg gacactcaca
tgggagtcaa gaaggtggag caaacacaac ctctggtcct 780ataaggacac cccaaattcc
agaa 8045660DNAHomo sapiens
56tggctgatca tcttggcatc cctcttggcc ttggctttga ttcttgcagt ttgcattgca
6057222DNAHomo sapiens 57gtcaacagtc gaagaaggtg tgggcagaag aaaaagctag
tgatcaacag tggcaatgga 60gctgtggagg acagaaagcc aagtggactc aacggagagg
ccagcaagtc tcaggaaatg 120gtgcatttgg tgaacaagga gtcgtcagaa actccagacc
agtttatgac agctgatgag 180acaaggaacc tgcagaatgt ggacatgaag attggggtgt
aa 22258361PRTHomo sapiens 58Met Asp Lys Phe Trp
Trp His Ala Ala Trp Gly Leu Cys Leu Val Pro1 5
10 15Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr
Cys Arg Phe Ala Gly 20 25
30Val Phe His Val Glu Lys Asn Gly Arg Tyr Ser Ile Ser Arg Thr Glu
35 40 45Ala Ala Asp Leu Cys Lys Ala Phe
Asn Ser Thr Leu Pro Thr Met Ala 50 55
60Gln Met Glu Lys Ala Leu Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly65
70 75 80Phe Ile Glu Gly His
Val Val Ile Pro Arg Ile His Pro Asn Ser Ile 85
90 95Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu
Thr Ser Asn Thr Ser 100 105
110Gln Tyr Asp Thr Tyr Cys Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp
115 120 125Cys Thr Ser Val Thr Asp Leu
Pro Asn Ala Phe Asp Gly Pro Ile Thr 130 135
140Ile Thr Ile Val Asn Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly
Glu145 150 155 160Tyr Arg
Thr Asn Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp Asp
165 170 175Asp Val Ser Ser Gly Ser Ser
Ser Glu Arg Ser Ser Thr Ser Gly Gly 180 185
190Tyr Ile Phe Tyr Thr Phe Ser Thr Val His Pro Ile Pro Asp
Glu Asp 195 200 205Ser Pro Trp Ile
Thr Asp Ser Thr Asp Arg Ile Pro Ala Thr Arg Asp 210
215 220Gln Asp Thr Phe His Pro Ser Gly Gly Ser His Thr
Thr His Gly Ser225 230 235
240Glu Ser Asp Gly His Ser His Gly Ser Gln Glu Gly Gly Ala Asn Thr
245 250 255Thr Ser Gly Pro Ile
Arg Thr Pro Gln Ile Pro Glu Trp Leu Ile Ile 260
265 270Leu Ala Ser Leu Leu Ala Leu Ala Leu Ile Leu Ala
Val Cys Ile Ala 275 280 285Val Asn
Ser Arg Arg Arg Cys Gly Gln Lys Lys Lys Leu Val Ile Asn 290
295 300Ser Gly Asn Gly Ala Val Glu Asp Arg Lys Pro
Ser Gly Leu Asn Gly305 310 315
320Glu Ala Ser Lys Ser Gln Glu Met Val His Leu Val Asn Lys Glu Ser
325 330 335Ser Glu Thr Pro
Asp Gln Phe Met Thr Ala Asp Glu Thr Arg Asn Leu 340
345 350Gln Asn Val Asp Met Lys Ile Gly Val
355 36059268PRTHomo sapiens 59Met Asp Lys Phe Trp Trp His
Ala Ala Trp Gly Leu Cys Leu Val Pro1 5 10
15Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg
Phe Ala Gly 20 25 30Val Phe
His Val Glu Lys Asn Gly Arg Tyr Ser Ile Ser Arg Thr Glu 35
40 45Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser
Thr Leu Pro Thr Met Ala 50 55 60Gln
Met Glu Lys Ala Leu Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly65
70 75 80Phe Ile Glu Gly His Val
Val Ile Pro Arg Ile His Pro Asn Ser Ile 85
90 95Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr
Ser Asn Thr Ser 100 105 110Gln
Tyr Asp Thr Tyr Cys Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp 115
120 125Cys Thr Ser Val Thr Asp Leu Pro Asn
Ala Phe Asp Gly Pro Ile Thr 130 135
140Ile Thr Ile Val Asn Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly Glu145
150 155 160Tyr Arg Thr Asn
Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp Asp 165
170 175Asp Val Ser Ser Gly Ser Ser Ser Glu Arg
Ser Ser Thr Ser Gly Gly 180 185
190Tyr Ile Phe Tyr Thr Phe Ser Thr Val His Pro Ile Pro Asp Glu Asp
195 200 205Ser Pro Trp Ile Thr Asp Ser
Thr Asp Arg Ile Pro Ala Thr Arg Asp 210 215
220Gln Asp Thr Phe His Pro Ser Gly Gly Ser His Thr Thr His Gly
Ser225 230 235 240Glu Ser
Asp Gly His Ser His Gly Ser Gln Glu Gly Gly Ala Asn Thr
245 250 255Thr Ser Gly Pro Ile Arg Thr
Pro Gln Ile Pro Glu 260 2656021PRTHomo sapiens
60Trp Leu Ile Ile Leu Ala Ser Leu Leu Ala Leu Ala Leu Ile Leu Ala1
5 10 15Val Cys Ile Ala Val
206172PRTHomo sapiens 61Asn Ser Arg Arg Arg Cys Gly Gln Lys Lys Lys
Leu Val Ile Asn Ser1 5 10
15Gly Asn Gly Ala Val Glu Asp Arg Lys Pro Ser Gly Leu Asn Gly Glu
20 25 30Ala Ser Lys Ser Gln Glu Met
Val His Leu Val Asn Lys Glu Ser Ser 35 40
45Glu Thr Pro Asp Gln Phe Met Thr Ala Asp Glu Thr Arg Asn Leu
Gln 50 55 60Asn Val Asp Met Lys Ile
Gly Val65 7062714DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 62atggacaagt tttggtggca
cgcagcctgg ggactctgcc tcgtgccgct gagcctggcg 60cagatcgatt tgaatataac
ctgccgcttt gcaggtgtat tccacgtgga gaaaaatggt 120cgctacagca tctctcggac
ggaggccgct gacctctgca aggctttcaa tagcaccttg 180cccacaatgg cccagatgga
gaaagctctg agcatcggat ttgagacctg caggtatggg 240ttcatagaag ggcacgtggt
gattccccgg atccacccca actccatctg tgcagcaaac 300aacacagggg tgtacatcct
cacatccaac acctcccagt atgacacata ttgcttcaat 360gcttcagctc cacctgaaga
agattgtaca tcagtcgggc ccgagacgct ctgcggggct 420gagctggtgg atgctcttca
gttcgtgtgt ggagacaggg gcttttattt caacaagccc 480acagggtatg gctccagcag
tcggagggcg cctcagacag gcatcgtgga tgagtgctgc 540ttccggagct gtgatctaag
gaggctggag atgtattgcg cacccctcaa gcctgccaag 600tcagctcgct ctgtccgtgc
ccagcgccac accgacatgc ccaagaccca gaaggaagta 660catttgaaga acgcaagtag
agggagtgca ggaaacaaga actacaggat gtag 71463318DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
63gggcccgaga cgctctgcgg ggctgagctg gtggatgctc ttcagttcgt gtgtggagac
60aggggctttt atttcaacaa gcccacaggg tatggctcca gcagtcggag ggcgcctcag
120acaggcatcg tggatgagtg ctgcttccgg agctgtgatc taaggaggct ggagatgtat
180tgcgcacccc tcaagcctgc caagtcagct cgctctgtcc gtgcccagcg ccacaccgac
240atgcccaaga cccagaagga agtacatttg aagaacgcaa gtagagggag tgcaggaaac
300aagaactaca ggatgtag
31864237PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 64Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly
Leu Cys Leu Val Pro1 5 10
15Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala Gly
20 25 30Val Phe His Val Glu Lys Asn
Gly Arg Tyr Ser Ile Ser Arg Thr Glu 35 40
45Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met
Ala 50 55 60Gln Met Glu Lys Ala Leu
Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly65 70
75 80Phe Ile Glu Gly His Val Val Ile Pro Arg Ile
His Pro Asn Ser Ile 85 90
95Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn Thr Ser
100 105 110Gln Tyr Asp Thr Tyr Cys
Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp 115 120
125Cys Thr Ser Val Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu
Val Asp 130 135 140Ala Leu Gln Phe Val
Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys Pro145 150
155 160Thr Gly Tyr Gly Ser Ser Ser Arg Arg Ala
Pro Gln Thr Gly Ile Val 165 170
175Asp Glu Cys Cys Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met Tyr
180 185 190Cys Ala Pro Leu Lys
Pro Ala Lys Ser Ala Arg Ser Val Arg Ala Gln 195
200 205Arg His Thr Asp Met Pro Lys Thr Gln Lys Glu Val
His Leu Lys Asn 210 215 220Ala Ser Arg
Gly Ser Ala Gly Asn Lys Asn Tyr Arg Met225 230
23565105PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 65Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val
Asp Ala Leu Gln Phe1 5 10
15Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly
20 25 30Ser Ser Ser Arg Arg Ala Pro
Gln Thr Gly Ile Val Asp Glu Cys Cys 35 40
45Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro
Leu 50 55 60Lys Pro Ala Lys Ser Ala
Arg Ser Val Arg Ala Gln Arg His Thr Asp65 70
75 80Met Pro Lys Thr Gln Lys Glu Val His Leu Lys
Asn Ala Ser Arg Gly 85 90
95Ser Ala Gly Asn Lys Asn Tyr Arg Met 100
10566786DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 66atggacaagt tttggtggca cgcagcctgg
ggactctgcc tcgtgccgct gagcctggcg 60cagatcgatt tgaatataac ctgccgcttt
gcaggtgtat tccacgtgga gaaaaatggt 120cgctacagca tctctcggac ggaggccgct
gacctctgca aggctttcaa tagcaccttg 180cccacaatgg cccagatgga gaaagctctg
agcatcggat ttgagacctg caggtatggg 240ttcatagaag ggcacgtggt gattccccgg
atccacccca actccatctg tgcagcaaac 300aacacagggg tgtacatcct cacatccaac
acctcccagt atgacacata ttgcttcaat 360gcttcagctc cacctgaaga agattgtaca
tcagtcacag acctgcccaa tgcctttgat 420ggaccaatta ccataactat tgttaaccgt
gatggcaccc gctatgtcgg gcccgagacg 480ctctgcgggg ctgagctggt ggatgctctt
cagttcgtgt gtggagacag gggcttttat 540ttcaacaagc ccacagggta tggctccagc
agtcggaggg cgcctcagac aggcatcgtg 600gatgagtgct gcttccggag ctgtgatcta
aggaggctgg agatgtattg cgcacccctc 660aagcctgcca agtcagctcg ctctgtccgt
gcccagcgcc acaccgacat gcccaagacc 720cagaaggaag tacatttgaa gaacgcaagt
agagggagtg caggaaacaa gaactacagg 780atgtag
78667318DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
67gggcccgaga cgctctgcgg ggctgagctg gtggatgctc ttcagttcgt gtgtggagac
60aggggctttt atttcaacaa gcccacaggg tatggctcca gcagtcggag ggcgcctcag
120acaggcatcg tggatgagtg ctgcttccgg agctgtgatc taaggaggct ggagatgtat
180tgcgcacccc tcaagcctgc caagtcagct cgctctgtcc gtgcccagcg ccacaccgac
240atgcccaaga cccagaagga agtacatttg aagaacgcaa gtagagggag tgcaggaaac
300aagaactaca ggatgtag
31868261PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 68Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly
Leu Cys Leu Val Pro1 5 10
15Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala Gly
20 25 30Val Phe His Val Glu Lys Asn
Gly Arg Tyr Ser Ile Ser Arg Thr Glu 35 40
45Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met
Ala 50 55 60Gln Met Glu Lys Ala Leu
Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly65 70
75 80Phe Ile Glu Gly His Val Val Ile Pro Arg Ile
His Pro Asn Ser Ile 85 90
95Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn Thr Ser
100 105 110Gln Tyr Asp Thr Tyr Cys
Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp 115 120
125Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe Asp Gly Pro
Ile Thr 130 135 140Ile Thr Ile Val Asn
Arg Asp Gly Thr Arg Tyr Val Gly Pro Glu Thr145 150
155 160Leu Cys Gly Ala Glu Leu Val Asp Ala Leu
Gln Phe Val Cys Gly Asp 165 170
175Arg Gly Phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly Ser Ser Ser Arg
180 185 190Arg Ala Pro Gln Thr
Gly Ile Val Asp Glu Cys Cys Phe Arg Ser Cys 195
200 205Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro Leu
Lys Pro Ala Lys 210 215 220Ser Ala Arg
Ser Val Arg Ala Gln Arg His Thr Asp Met Pro Lys Thr225
230 235 240Gln Lys Glu Val His Leu Lys
Asn Ala Ser Arg Gly Ser Ala Gly Asn 245
250 255Lys Asn Tyr Arg Met
26069105PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 69Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val
Asp Ala Leu Gln Phe1 5 10
15Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly
20 25 30Ser Ser Ser Arg Arg Ala Pro
Gln Thr Gly Ile Val Asp Glu Cys Cys 35 40
45Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro
Leu 50 55 60Lys Pro Ala Lys Ser Ala
Arg Ser Val Arg Ala Gln Arg His Thr Asp65 70
75 80Met Pro Lys Thr Gln Lys Glu Val His Leu Lys
Asn Ala Ser Arg Gly 85 90
95Ser Ala Gly Asn Lys Asn Tyr Arg Met 100
10570852DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 70atggacaagt tttggtggca cgcagcctgg
ggactctgcc tcgtgccgct gagcctggcg 60cagatcgatt tgaatataac ctgccgcttt
gcaggtgtat tccacgtgga gaaaaatggt 120cgctacagca tctctcggac ggaggccgct
gacctctgca aggctttcaa tagcaccttg 180cccacaatgg cccagatgga gaaagctctg
agcatcggat ttgagacctg caggtatggg 240ttcatagaag ggcacgtggt gattccccgg
atccacccca actccatctg tgcagcaaac 300aacacagggg tgtacatcct cacatccaac
acctcccagt atgacacata ttgcttcaat 360gcttcagctc cacctgaaga agattgtaca
tcagtcacag acctgcccaa tgcctttgat 420ggaccaatta ccataactat tgttaaccgt
gatggcaccc gctatgtcca gaaaggagaa 480tacagaacga atcctgaaga catctacccc
agcaacccta ctgatgatga cgtggggccc 540gagacgctct gcggggctga gctggtggat
gctcttcagt tcgtgtgtgg agacaggggc 600ttttatttca acaagcccac agggtatggc
tccagcagtc ggagggcgcc tcagacaggc 660atcgtggatg agtgctgctt ccggagctgt
gatctaagga ggctggagat gtattgcgca 720cccctcaagc ctgccaagtc agctcgctct
gtccgtgccc agcgccacac cgacatgccc 780aagacccaga aggaagtaca tttgaagaac
gcaagtagag ggagtgcagg aaacaagaac 840tacaggatgt ag
85271318DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
71gggcccgaga cgctctgcgg ggctgagctg gtggatgctc ttcagttcgt gtgtggagac
60aggggctttt atttcaacaa gcccacaggg tatggctcca gcagtcggag ggcgcctcag
120acaggcatcg tggatgagtg ctgcttccgg agctgtgatc taaggaggct ggagatgtat
180tgcgcacccc tcaagcctgc caagtcagct cgctctgtcc gtgcccagcg ccacaccgac
240atgcccaaga cccagaagga agtacatttg aagaacgcaa gtagagggag tgcaggaaac
300aagaactaca ggatgtag
31872283PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 72Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly
Leu Cys Leu Val Pro1 5 10
15Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala Gly
20 25 30Val Phe His Val Glu Lys Asn
Gly Arg Tyr Ser Ile Ser Arg Thr Glu 35 40
45Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met
Ala 50 55 60Gln Met Glu Lys Ala Leu
Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly65 70
75 80Phe Ile Glu Gly His Val Val Ile Pro Arg Ile
His Pro Asn Ser Ile 85 90
95Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn Thr Ser
100 105 110Gln Tyr Asp Thr Tyr Cys
Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp 115 120
125Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe Asp Gly Pro
Ile Thr 130 135 140Ile Thr Ile Val Asn
Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly Glu145 150
155 160Tyr Arg Thr Asn Pro Glu Asp Ile Tyr Pro
Ser Asn Pro Thr Asp Asp 165 170
175Asp Val Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val Asp Ala Leu
180 185 190Gln Phe Val Cys Gly
Asp Arg Gly Phe Tyr Phe Asn Lys Pro Thr Gly 195
200 205Tyr Gly Ser Ser Ser Arg Arg Ala Pro Gln Thr Gly
Ile Val Asp Glu 210 215 220Cys Cys Phe
Arg Ser Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala225
230 235 240Pro Leu Lys Pro Ala Lys Ser
Ala Arg Ser Val Arg Ala Gln Arg His 245
250 255Thr Asp Met Pro Lys Thr Gln Lys Glu Val His Leu
Lys Asn Ala Ser 260 265 270Arg
Gly Ser Ala Gly Asn Lys Asn Tyr Arg Met 275
28073105PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 73Gly Pro Glu Thr Leu Cys Gly Ala Glu Leu Val
Asp Ala Leu Gln Phe1 5 10
15Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly
20 25 30Ser Ser Ser Arg Arg Ala Pro
Gln Thr Gly Ile Val Asp Glu Cys Cys 35 40
45Phe Arg Ser Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro
Leu 50 55 60Lys Pro Ala Lys Ser Ala
Arg Ser Val Arg Ala Gln Arg His Thr Asp65 70
75 80Met Pro Lys Thr Gln Lys Glu Val His Leu Lys
Asn Ala Ser Arg Gly 85 90
95Ser Ala Gly Asn Lys Asn Tyr Arg Met 100
10574984DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 74atggacaagt tttggtggca cgcagcctgg
ggactctgcc tcgtgccgct gagcctggcg 60cagatcgatt tgaatataac ctgccgcttt
gcaggtgtat tccacgtgga gaaaaatggt 120cgctacagca tctctcggac ggaggccgct
gacctctgca aggctttcaa tagcaccttg 180cccacaatgg cccagatgga gaaagctctg
agcatcggat ttgagacctg caggtatggg 240ttcatagaag ggcacgtggt gattccccgg
atccacccca actccatctg tgcagcaaac 300aacacagggg tgtacatcct cacatccaac
acctcccagt atgacacata ttgcttcaat 360gcttcagctc cacctgaaga agattgtaca
tcagtcacag acctgcccaa tgcctttgat 420ggaccaatta ccataactat tgttaaccgt
gatggcaccc gctatgtcca gaaaggagaa 480tacagaacga atcctgaaga catctacccc
agcaacccta ctgatgatga cgtgagcagc 540ggctcctcca gtgaaaggag cagcacttca
ggaggttaca tcttttacac cttttctact 600gtacacccca tcccagacga agacagtccc
tggatcaccg acagcacaga cagaatccct 660gctaccgggc ccgagacgct ctgcggggct
gagctggtgg atgctcttca gttcgtgtgt 720ggagacaggg gcttttattt caacaagccc
acagggtatg gctccagcag tcggagggcg 780cctcagacag gcatcgtgga tgagtgctgc
ttccggagct gtgatctaag gaggctggag 840atgtattgcg cacccctcaa gcctgccaag
tcagctcgct ctgtccgtgc ccagcgccac 900accgacatgc ccaagaccca gaaggaagta
catttgaaga acgcaagtag agggagtgca 960ggaaacaaga actacaggat gtag
98475318DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
75gggcccgaga cgctctgcgg ggctgagctg gtggatgctc ttcagttcgt gtgtggagac
60aggggctttt atttcaacaa gcccacaggg tatggctcca gcagtcggag ggcgcctcag
120acaggcatcg tggatgagtg ctgcttccgg agctgtgatc taaggaggct ggagatgtat
180tgcgcacccc tcaagcctgc caagtcagct cgctctgtcc gtgcccagcg ccacaccgac
240atgcccaaga cccagaagga agtacatttg aagaacgcaa gtagagggag tgcaggaaac
300aagaactaca ggatgtag
31876327PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 76Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly
Leu Cys Leu Val Pro1 5 10
15Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala Gly
20 25 30Val Phe His Val Glu Lys Asn
Gly Arg Tyr Ser Ile Ser Arg Thr Glu 35 40
45Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met
Ala 50 55 60Gln Met Glu Lys Ala Leu
Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly65 70
75 80Phe Ile Glu Gly His Val Val Ile Pro Arg Ile
His Pro Asn Ser Ile 85 90
95Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn Thr Ser
100 105 110Gln Tyr Asp Thr Tyr Cys
Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp 115 120
125Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe Asp Gly Pro
Ile Thr 130 135 140Ile Thr Ile Val Asn
Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly Glu145 150
155 160Tyr Arg Thr Asn Pro Glu Asp Ile Tyr Pro
Ser Asn Pro Thr Asp Asp 165 170
175Asp Val Ser Ser Gly Ser Ser Ser Glu Arg Ser Ser Thr Ser Gly Gly
180 185 190Tyr Ile Phe Tyr Thr
Phe Ser Thr Val His Pro Ile Pro Asp Glu Asp 195
200 205Ser Pro Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro
Ala Thr Gly Pro 210 215 220Glu Thr Leu
Cys Gly Ala Glu Leu Val Asp Ala Leu Gln Phe Val Cys225
230 235 240Gly Asp Arg Gly Phe Tyr Phe
Asn Lys Pro Thr Gly Tyr Gly Ser Ser 245
250 255Ser Arg Arg Ala Pro Gln Thr Gly Ile Val Asp Glu
Cys Cys Phe Arg 260 265 270Ser
Cys Asp Leu Arg Arg Leu Glu Met Tyr Cys Ala Pro Leu Lys Pro 275
280 285Ala Lys Ser Ala Arg Ser Val Arg Ala
Gln Arg His Thr Asp Met Pro 290 295
300Lys Thr Gln Lys Glu Val His Leu Lys Asn Ala Ser Arg Gly Ser Ala305
310 315 320Gly Asn Lys Asn
Tyr Arg Met 32577105PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 77Gly Pro Glu Thr Leu Cys
Gly Ala Glu Leu Val Asp Ala Leu Gln Phe1 5
10 15Val Cys Gly Asp Arg Gly Phe Tyr Phe Asn Lys Pro
Thr Gly Tyr Gly 20 25 30Ser
Ser Ser Arg Arg Ala Pro Gln Thr Gly Ile Val Asp Glu Cys Cys 35
40 45Phe Arg Ser Cys Asp Leu Arg Arg Leu
Glu Met Tyr Cys Ala Pro Leu 50 55
60Lys Pro Ala Lys Ser Ala Arg Ser Val Arg Ala Gln Arg His Thr Asp65
70 75 80Met Pro Lys Thr Gln
Lys Glu Val His Leu Lys Asn Ala Ser Arg Gly 85
90 95Ser Ala Gly Asn Lys Asn Tyr Arg Met
100 105
User Contributions:
Comment about this patent or add new information about this topic: