Patent application title: METHOD FOR REJUVENATING CELLS
Inventors:
IPC8 Class: AC12N5074FI
USPC Class:
1 1
Class name:
Publication date: 2018-11-01
Patent application number: 20180312813
Abstract:
The invention relates to a method for reprogramming cells from aged
donors or senescent cells to pluripotent cells that have lost marks of
senescence. In particular, the invention relates to an ex vivo method for
preparing induced pluripotent stein cells (iPSCs) from a target cell
population comprising cells from aged donors or senescent cells, said
method comprising the steps of culturing said target cell population
tinder appropriate conditions for reprogramming said cells into iPSCs,
wherein said appropriate conditions comprises increasing expression in
said target cells, of at least the following reprogramming factors: Oct4,
Klf4, Sox2, Myc, Lin28 and, optionally Nanog.Claims:
1-15. (canceled)
16. A method for regenerating damaged tissues in a subject in need thereof, comprising the steps of a) providing a target cell population comprising cells from aged donors or senescent cells; b) culturing said target cell population under appropriate conditions for reprogramming said target cell population into induced pluripotent stem cells (iPSCs), wherein said appropriate conditions comprise increasing expression in said target cell population of at least the following combination of reprogramming factors: i. a reprogramming factor encoded by one gene of the Oct gene family, ii. a reprogramming factor encoded by one gene of the Klf gene family, iii. a reprogramming factor encoded by one gene of the Sox gene family, iv. a reprogramming factor encoded by one gene of the Myc gene family, v. Lin28, and, optionally, vi. Nanog; and c) implanting said iPSCs into said subject.
17. The method of claim 16, wherein said target cell population is an autologous cell population.
18. A method of performing an autologous graft in an aged patient in need thereof comprising the steps of a) providing a target cell population comprising cells from aged donors or senescent cells; b) culturing said target cell population under appropriate conditions for reprogramming said target cell population into induced pluripotent stem cells (iPSCs), wherein said appropriate conditions comprise increasing expression in said target cell population of at least the following combination of reprogramming factors: i. a reprogramming factor encoded by one gene of the Oct gene family, ii. a reprogramming factor encoded by one gene of the Klf gene family, iii. a reprogramming factor encoded by one gene of the Sox gene family, iv. a reprogramming factor encoded by one gene of the Myc gene family, v. Lin28, and, optionally, vi. Nanog; and c) performing an autologous graft in an aged patient using said iPSCs.
19. The method of claim 18, wherein said iPSCs are further differentiated into a desired cell lineage prior to said step of performing.
Description:
FIELD OF THE INVENTION
[0001] The invention relates to a method for reprogramming cells from aged donors or senescent cells to pluripotent cells that have lost marks of senescence. In particular, the invention relates to an ex vivo method for preparing induced pluripotent stem cells (iPSCs) from a target cell population from aged donors or senescent cells, said method comprising the steps of culturing said target cell population under appropriate conditions for reprogramming said cells into iPSCs, wherein said appropriate conditions comprises increasing expression in said target cells, of at least the following combination of reprogramming factors: Oct4, Klf4, Sox2, c-Myc, Lin28 and, optionally Nanog.
BACKGROUND OF THE INVENTION
[0002] The discovery of induced Pluripotent Stem Cells (iPSCs) by S. Yamanaka.sup.1,2, and very rapid progress in iPSC technology have opened up a new avenue in autologous regenerative medicine, whereby patient-specific pluripotent cells could potentially be derived from adult somatic cells. iPSCs have been reproducibly obtained in different cell types by forced expression of the OCT4, SOX2, c-MYC and KLF4 transcription factor cocktail or by an alternative combination of factors, substituting KLF4 and c-MYC by NANOG and LIN28.sup.3.
[0003] Cellular senescence is linked to physiological aging, and is characterized by a stable cell cycle arrest in response to various forms of stress stimuli, including oncogene activation or extremely shortened telomeres called replicative senescence.sup.9,10. A common feature is the activation of the p53/p21.sup.CIP1 and pRb/p16.sup.INK4A tumor suppressor pathways in these cells, associated with alteration of morphology, increase in senescence-associated .beta.-galactosidase (SA-.beta.-Gal) activity, a specific SA secretome (SASP) and formation of senescence-associated heterochromatic foci (SAHF), which are thought to be involved in repression of genes that promote cell division.sup.11.
[0004] EP 2 096 169 discloses a process for generating induced pluripotent stem cells from somatic cells, comprising the step of introducing the following six genes: Oct family gene, Klf family gene, Sox family gene, Myc family gene, Lin28 and Nanog into somatic cells. However, this specific combination of reprogramming factors has never been applied to senescent cells or cells from aged donors.
[0005] It has been recently described by several groups, that cellular senescence is a barrier to reprogramming, due to up-regulation of p53, p16.sup.INK4A, and p21.sup.CIP1, suggesting that cellular aging might be an important limitation of this technology. Accordingly, ablation of different senescence effectors has been proposed as a potential solution to improve the efficiency of iPSCs generation.sup.4-8.
[0006] WO 2011/016588 suggests using functional inhibitors of p53 together with a cocktail of reprogramming factors consisting of Oct3/4, Sox2, Klf4, L-myc and Lin28 p53 shRNA were used as functional inhibitors of p53.
[0007] The inventors have now shown that using the specific combination of the six factors OCT4, NANOG, SOX2, KLF4, c-MYC and LIN28 allows efficient reprogramming of both proliferative centenarian and senescent fibroblasts into human iPSCs, without the need of ablating senescence effectors, contrary to the technical prejudice in the art related to reprogramming of senescent cells.
[0008] Moreover, the inventors showed that this reprogramming restores telomere size, gene expression profile, oxidative stress and mitochondrial metabolism as observed in human embryonic stem cells (hESCs). Surprisingly, iPSCs derived from aged and senescent cells do not retain detectable marks of the cellular aging phenotype, and are indistinguishable from hESCs. Finally, iPSCs re-differentiated into fibroblasts exhibit an increased potential to proliferate, and gene expression profile equivalent to young proliferative fibroblasts, demonstrating that the reprogramming strategy according to the present invention erases the hallmarks of the cellular aging phenotype, defining a new method to produce rejuvenated cells.
[0009] To the applicant's knowledge, the invention is the first description of a method for producing iPSCs with cells from aged donors or senescent cells, therefore the invention may be highly useful in particular in autologous regenerative medicine, whereby patient-specific pluripotent cells could potentially be derived from adult aged or senescent somatic cells, and will also find numerous applications in the research field. Moreover, the invention is useful as a general method to rejuvenating senescent cells or cells from aged donors, either in vitro or in vivo.
SUMMARY OF THE INVENTION
[0010] A first aspect of the invention relates to an ex vivo method for preparing induced pluripotent stem cells (iPSCs) from a target cell population comprising cells from aged donors or senescent cells or cells overexpressing p16.sup.INK4A or p21.sup.CIP senescence effectors, said method comprising the steps of:
[0011] a) providing said target cell population comprising cells from aged donors or senescent cells or cells overexpressing p16.sup.INK4A or p21.sup.CIP senescence effectors, and,
[0012] b) culturing said target cell population under appropriate conditions for reprogramming said target cell population into iPSCs, wherein said appropriate conditions comprises increasing expression in said target cell population of at least the following combination of reprogramming factors:
[0013] i. a reprogramming factor encoded by one gene of the Oct family gene,
[0014] ii. a reprogramming factor encoded by one gene of the Klf family gene,
[0015] iii. a reprogramming factor encoded by one gene of the Sox family gene,
[0016] iv. a reprogramming factor encoded by one gene of the Myc family gene, and
[0017] v. Lin28,
[0018] vi. and, optionally, Nanog.
[0019] In a preferred embodiment, said appropriate conditions comprise increasing expression in said target cell population of the following reprogramming factors: Oct4, Klf4, Sox2, c-Myc (or L-myc), Lin28 and, optionally Nanog. In a related embodiment, said appropriate conditions comprise increasing expression in said target cell population of the following reprogramming factors: Oct4, Klf4, Sox2, c-Myc (or L-myc), Lin28 and Nanog.
[0020] The inventors have shown that marks of senescence can be fully erased by the method of the invention. Thus, advantageously, the target cell population may be selected from cell population comprising adult somatic cells from aged donors or senescent cells such as senescent fibroblasts, or from any kind of cells harboring aged or senescent associated physiology, like premature aging syndrome. In one specific embodiment, the target cell population is a human cell population obtained from an adult subject being at least 50 years old, with apparently no limits in aged as a 101 years old donor cell population was efficiently reprogrammed with this strategy, for example at least 60, 70, 80, 90, or 100 years old, for example in need of regenerative autologous cell therapy.
[0021] Advantageously, the method may not comprise any step of direct silencing of senescence effectors, such as p21.sup.CIP1 and/or p16.sup.INK4a and or p53. In particular, the method may not comprise any use of functional inhibitors of p53, such as p53 shRNA.
[0022] In one embodiment, conditions for increasing expression of the reprogramming factors listed above comprise either
[0023] (a) introducing one or more expression vectors comprising the coding sequences of said reprogramming factors; or,
[0024] (b) directly delivering an effective amount of each reprogramming factor or their precursor RNA,
[0025] into said target cell population.
[0026] In one specific embodiment, the method of the invention comprises the step of transfecting said target cell population with a combination of viral vectors, each viral vector comprising the coding sequence of each of the reprogramming factors, Oct4, Klf4, Sox2, c-Myc (or L-myc), Lin28 and, optionally, Nanog.
[0027] The invention further relates to the induced pluripotent stem cells obtainable by the method described above, in particular to induced pluripotent stem cells obtainable by the method and obtained from aged or senescent cells.
[0028] The invention further relates to an in vitro method for rejuvenating cells from aged donors or senescent cells comprising reprogramming said cells from aged donors or senescent cells to induced pluripotent stem cells, by increasing expression in said cells from aged donors or senescent cells of at least a combination of the following reprogramming factors:
[0029] i. a reprogramming factor encoded by one gene of the Oct family gene,
[0030] ii. a reprogramming factor encoded by one gene of the Klf family gene,
[0031] iii. a reprogramming factor encoded by one gene of the Sox family gene,
[0032] iv. a reprogramming, factor encoded by one gene of the Myc family gene, and,
[0033] v. Lin28,
[0034] vi. and, optionally, Nanog.
[0035] The invention further relates to a composition for in vivo use in rejuvenating cells from aged donors or senescent cells in a subject in need thereof, said composition comprising means for increasing expression of the following reprogramming factors Oct4, Klf4, Sox2, c-Myc, Lin28 and, optionally, Nanog into said aged or senescent cells.
[0036] In one embodiment, said means for increasing expression of said reprogramming factors comprise a combination of Oct4 protein, Klf4 protein, Sox2 protein, c-Myc protein, Lin28 protein and, optionally, Nanog protein, wherein each protein is associated to appropriate means for delivery of said protein into the nucleus of the cells to be rejuvenated.
[0037] Alternatively, said means for increasing expression of said reprogramming factors may comprise a combination of Oct4 precursor RNA, Klf4 precursor RNA, Sox2 precursor RNA, c-Myc precursor RNA, Lin28 precursor RNA and, optionally, Nanog precursor RNA, wherein each precursor RNA is associated to appropriate means for delivery of each precursor RNA into the cytoplasm of the cells to be rejuvenated.
[0038] The compositions of the invention as described above may advantageously be suitable for topical application.
DETAILED DESCRIPTION OF THE INVENTION
[0039] A first aspect of the invention relates to an ex vivo method for preparing induced pluripotent stem cells (iPSCs) from a target cell population comprising cells from aged donors or senescent cells or cells overexpressing p16.sup.INK4A or p21.sup.CIP senescence effectors, said method comprising the steps of:
[0040] a) providing said target cell population comprising cells from aged donors or senescent cells or cells overexpressing p16.sup.INK4A or p21.sup.CIP senescence effectors, and,
[0041] b) culturing said target cell population under appropriate conditions for reprogramming said target cell population into iPSCs, wherein said appropriate conditions comprises increasing expression in said target cell population, of at least the following combination of reprogramming factors:
[0042] i. a reprogramming factor encoded by one gene of the Oct family gene,
[0043] ii. a reprogramming factor encoded by one gene of the Klf family gene,
[0044] iii. a reprogramming factor encoded by one gene of the Sox family gene,
[0045] iv. a reprogramming factor encoded by one gene of the Myc family gene, and,
[0046] v. Lin28, and, optionally,
[0047] vi. Nanog.
[0048] As used herein, the term "pluripotent" refers to cells with the ability to give rise to progeny that can undergo differentiation, under appropriate conditions, into cell types that collectively exhibit characteristics associated with cell lineages from the three germ layers (endoderm, mesoderm, and ectoderm). Pluripotent stem cells can contribute to tissues of a prenatal, postnatal or adult organism. A standard art-accepted test, such as the ability to form a teratoma in 8-12 week old SCID mice, can be used to establish the pluripotency of a cell population. However, identification of various pluripotent stem cell characteristics can also be used to identify pluripotent cells.
[0049] More specifically, human pluripotent stem cells may express at least some, and optionally all, of the markers from the following non-limiting list: SSEA-3, SSEA-4, TRA-I-60, TRA-I-81, TRA-2-49/6E, ALP, Sox2, E-cadherin, UTF-I, Oct4, Lin28, Rexl, and Nanog.
[0050] As used herein, the term "induced pluripotent stem cell" refers to a pluripotent stem cell artificially derived from a non-pluripotent cell. A non-pluripotent cell can be a cell of lesser potency to self-renew and differentiate than a pluripotent stem cell. Cells of lesser potency can be, but are not limited to, somatic stem cells, tissue specific progenitor cells, primary or secondary cells. One remarkable advantage of the present method is that it enables the production of induced pluripotent stem cells from any somatic cells, including aged or senescent cells, previously believed as non inducible to pluripotency due to their aging phenotype.
[0051] The term "reprogramming" refers to the process of changing the fate of a target cell into that of a different cell type, caused by the expression of a small set of factors (or reprogramming factors) in the target cells. For example, methods for reprogramming fibroblast cells to induced pluripotent stem cells by expressing ectopically Oct3/4, Sox2, c-myc and Klf4 have been described by Takahashi and Yamanaka, 2006.sup.1.
[0052] Accordingly, a "reprogramming factor" is a factor, for example, it may be a transcription factor, which can be used to reprogram a target cell. The term "reprogramming factor" further includes any analogue molecule that mimics the function of the factor with respect to reprogramming capacity.
[0053] The Target Cell Population for Use in the Method of the Invention
[0054] The target cell population for use in the method of the present invention is advantageously a cell population comprising either cells from aged donors or senescent cells or cells overexpressing p16.sup.INK4A or p21.sup.CIP senescence effectors. These cells may be obtained from living of frozen tissues of animals.
[0055] The term "senescent cells" refers to cells that exhibit cell cycle arrest, generally during the G1 transition of the cell cycle or in few cases in G2, elicited by replicative exhaustion due to telomere attrition or in response to stresses such as DNA damage, chemotherapeutic drugs, or aberrant expression of oncogenes. This arrest is implemented primarily through activation of p53 and the up-regulation of the cyclin-dependent kinase (CDK) inhibitors p16.sup.INK4a and p21.sup.CIP1 (Collado et al. 2007, Cell, 130: 223-233).
[0056] "Senescent cells" may be characterized by at least one or more of the following characteristics:
[0057] activation of the p53/p21.sup.CIP1 and pRb/p16.sup.INK4A tumor suppressor pathways (hereafter referred as senescence effectors),
[0058] cells arrested irreversibly in G1,
[0059] shortening of telomere size,
[0060] expression of senescent-associated .beta.-galactosidase activity (SA .beta.-Gal),
[0061] Specific chromatin modification as senescence-associated heterochromatic foci (SAHF),
[0062] Specific secretome,
[0063] reduced/altered overall mitochondrial activity.
[0064] Irreversible cell arrest in G1 may be assessed by FACS as described in Matsuura et al.sup.25 and briefly summarized below:
[0065] To analyze the cell cycle, trypsinized cells are fixed with cooled 70% EtOH during at least 15 min at 4.degree. C. Fixed cells are centrifuged and resuspended in PBS before staining with Propidium Iodide (10 .mu.g/ml) plus RNase A (250 .mu.g/ml) during 30 min and analysis by flow cytometry, using for example a FacsCalibur II (BD Biosciences).
[0066] Shortening of telomere size may be characterized by evaluating the mean terminal restriction fragment (TRF) length for example by Southern blot analysis, for example as described in the Examples below.
[0067] A method for detecting expression of senescent-associated .beta.-galactosidase activity (SA (3-Gal) is described in Matsuura et al.sup.25 and briefly summarized below:
[0068] Cell cultures are stained as described (Dimri et al. Proc Natl Acad Sci USA. 1995 Sep. 26; 92(20):9363-7). Briefly, cells are washed with phosphate-buffered saline (PBS) and fixed with 1% paraformaldehyde for 3 minutes at room temperature, then washed three times with PBS for 5 minutes each at room temperature. Staining is performed overnight in a non-CO2 enriched incubator at 37.degree. C. using a solution pH 6 containing 40 mM sodium phosphate (dibasic), 40 mM citric acid, 150 mM NaCl, 2 mM MgCl2, 5 mM potassium ferrocyanide, 5 mM potassium ferricyanide, 1 mg/ml X-gal (5-bromo-4-chloro-3-indolyl-b-D-galactoside, Pierce Chemical Co., Rockford, Ill.). Cyanide salts and X-gal are added from freshly made 100 X_stocks in PBS and dimethylformamide, respectively. Cells are then washed three times with PBS for 5 minutes each at room temperature, before microscopic examination and photography.
[0069] A method for detecting expression of senescence-associated heterochromatic foci (SAHF) by indirect immunofluorescence is described in the Examples below.
[0070] Overall mitochondrial activity can be evaluated by measuring the transmembrane potential generated by the proton gradient. A method for measuring this parameter using the cationic dye JC-1 is for example described in the Examples below.
[0071] Cells from aged donors comprise a number of proliferative cells that exhibit certain characteristics of senescent cells, in particular
[0072] upregulation of the tumor suppressors p16.sup.INK4a and p21.sup.CIP1 (hereafter referred as senescence effectors).
[0073] decreased ability to proliferate,
[0074] global genome CpG hypomethylation,
[0075] unscheduled heterochromatinization.
[0076] Upregulation of the tumor suppressors p16.sup.INK4a and p21.sup.CIP1 can be observed using usual techniques in the Art for measuring protein expression and/or mRNA expression, for example, Western Blot, Northern Blot or Real-Time PCR. Upregulation is observed when significant higher expression of p16.sup.INK4a and p21.sup.CIP1 is observed in the test cells compared to controlled (non senescent) cells, for example embryonic stem cells.
[0077] The inventors have shown that proliferative and senescent cells from aged donors (more than 70 years old) have a different signature, for example in terms of gene expression or metabolism, from cells of young donors. The iPSCs obtained according to the method have a signature that is closer to the embryonic stem cells and distinguish in that respect with iPSCs obtained with the conventional cocktail of 4 reprogramming factors OCT4, SOX2, c-MYC and KLF4.
[0078] To applicant's knowledge, the method of the invention comprising the use of a combination of at least 5, preferably 6 specific reprogramming factors is the only method described in the art for generating induced pluripotent stem cells from senescent cells or cells of aged donors. Accordingly, the method of the invention is particularly useful for cell population susceptible to contain senescent cells in a high proportion.
[0079] In one preferred embodiment of the method of the invention, said target cell population comprises at least 10%, 20%, 30%, 40% or at least 50% of the cells displaying at least one or more (or all) of the following characteristics of the aging phenotype:
[0080] upregulation of the tumor suppressors p16.sup.INK4a and p21.sup.CIP1 (hereafter referred as senescence effectors),
[0081] cells arrested irreversibly in G1,
[0082] expression of senescent-associated .beta.-galactosidase activity (SA .beta.-Gal),
[0083] expression of senescence-associated heterochromatic foci (SAHF),
[0084] altered overall mitochondrial activity.
[0085] In another specific embodiment, said target cell population are cells, such as dermal cells or fibroblast cells, obtained from an adult subject being at least 50 years old, for example at least 60, 70, 80, 90 or at least 100 years old.
[0086] Such target cell population may be obtained from mammal species, and preferably from rodent, primate or human species, more preferably from human species.
[0087] The target cell population may be obtained from various tissues, preferably from a human aged patient in need of autologous regenerative treatment.
[0088] Methods to obtain samples from various tissues and methods to establish primary cells are well-known in the art (see e.g. Jones and Wise, Methods Mol Biol. 1997).
[0089] In one specific embodiment, said target cell population is obtained from primary cells from blood, bone marrow, adipose tissue, skin, hair, skin appendages, internal organs such as heart, gut or liver, mesenchymal tissues, muscle, bone, cartilage or skeletal tissues.
[0090] The Combination of Reprogramming Factors for Use in Rejuvenating or Generating iPSCs
[0091] One essential feature of the present invention is the use of the following combination of reprogramming factors for use in rejuvenating or inducing pluripotent stem cells from the target cell population:
[0092] i. a reprogramming factor encoded by one gene of the Oct family gene, preferably Oct4,
[0093] ii. a reprogramming factor encoded by one gene of the Klf family gene, preferably Klf4,
[0094] iii. a reprogramming factor encoded by one gene of the Sox family gene, preferably Sox2,
[0095] iv. a reprogramming factor encoded by one gene of the Myc family gene, preferably c-Myc,
[0096] v. Lin28, and, optionally,
[0097] vi. Nanog.
[0098] The combination of reprogramming factors for use in rejuvenating or inducing pluripotent stem cells from the target cell population such as senescent cells or cells from aged donors, may include for example the combination of the 5 reprogramming factors Oct4, Klf4, Sox2, c-Myc (or L-myc) and Lin28, or the 6 reprogramming factors Oct4, Klf4, Sox2, c-Myc (or L-myc), Lin28 and Nanog. In one preferred embodiment, no functional inhibitors of p53 are used.
[0099] As used, herein, a functional inhibitor of p53 is any substance capable of inhibiting either (a) the function of the p53 protein or (b) the expression of the p53 gene. Such substances are for example described in WO 2011/016588. Most specifically, the present method does not contain the use of any means for expressing siRNA or shRNA against p53 into the target cell population (e.g the senescent cells).
[0100] As used herein, the term "Oct family" refers to the family of octamer ("Oct") transcription factors which play a crucial role in maintaining pluripotency. POU5F1 (POU domain, class 5, transcription factor 1) also known as Oct3/4 is one representative of Oct family. The absence of Oct3/4 in Oct-3/4+ cells, such as blastomeres and embryonic stem cells, leads to spontaneous trophoblast differentiation, and presence of Oct-3/4 thus gives rise to the pluripotency and differentiation potential of embryonic stem cells. Exemplary Oct3/4 proteins are the proteins encoded by the murine Oct3/4 gene (Genbank accession number NM_013633) and the human Oct3/4 gene (Genbank accession number NM_002701)
[0101] The terms "Oct3/4", "Oct4," "OCT4," "Oct4 protein," "OCT4 protein" and the like thus refer to any of the naturally-occurring forms of the Octomer 4 transcription factor, or variants thereof that maintain Oct4 transcription factor activity (e.g. within at least 50%, 80%, 90% or 100% activity compared to wild type Oct4 as measured by methods known in the art). In some embodiments, variants have at least 90% amino acid sequence identity across their whole sequence compared to the naturally occurring Oct4 polypeptide. In other embodiments, the Oct4 protein is the protein as identified by the Genbank reference ADW77327.1.
[0102] As used herein, the term "Sox family" refers to Sox genes associated with maintaining pluripotency similar to Oct-3/4, although it is associated with multipotent and unipotent stem cells in contrast with Oct-3/4, which is exclusively expressed in pluripotent stem cells. While Sox2 was the initial gene used for induction.sup.1,3, other genes in the Sox family have been found to work as well in the induction process. Sox1 yields iPSCs with a similar efficiency as Sox2, and genes Sox3, Sox15, and Sox18 also generate iPSCs.
[0103] Exemplary Sox2 proteins are the proteins encoded by the murine Sox2 gene (Genbank accession number NM_011443) and the human Sox2 gene (Genbank accession number NM_003106).
[0104] The terms "Sox2," "SOX2," "Sox2 protein," "SOX2 protein" and the like as referred to herein thus includes any of the naturally-occurring forms of the Sox2 transcription factor, or variants thereof that maintain Sox2 transcription factor activity (e.g. within at least 50%, 80%, 90% or 100% activity compared to wild type Sox2 as measured by methods known in the art). In some embodiments, variants have at least 90% amino acid sequence identity across their whole sequence compared to the naturally occurring Sox2 polypeptide. In other embodiments, the Sox2 protein is the protein as identified by the NCBI reference NP 003097.1.
[0105] As used herein, the term "Klf family" refers to Klf genes initially identified as a factor for the generation of mouse iPSCs and also demonstrated to be a factor for generation of human iPSCs. Exemplary Klf4 proteins are the proteins encoded by the murine klf4 gene (Genbank accession number NM_010637) and the human klf4 gene (Genbank accession number NM_004235).
[0106] The terms "KLF4," "KLF4 protein" and the like as referred to herein thus includes any of the naturally-occurring forms of the KLF4 transcription factor, or variants thereof that maintain KLF4 transcription factor activity (e.g. within at least 50%, 80%, 90% or 100% activity compared to wild type KLF4 as measured by methods known in the art). In some embodiments, variants have at least 90% amino acid sequence identity across their whole sequence compared to the naturally occurring KLF4 polypeptide. In other embodiments, the KLF4 protein is the protein as identified by the NCBI reference NP_004226.3.
[0107] As used herein, factors of the Myc family refers to factors encoded by myc proto-oncogenes implicated in cancer. c-Myc was shown to be a factor implicated in the generation of mouse iPSCs and of human iPSCs. Exemplary c-Myc proteins are the proteins encoded by the murine c-myc gene (Genbank accession number NM_010849) and the human c-myc gene (Genbank accession number NM_002467) N-Myc or L-myc was also used as possible reprogramming factor replacing c-Myc
[0108] The terms "c-Myc," C-MYC," "c-Myc protein", "C-MYC protein" and the like as referred to herein thus includes any of the naturally-occurring forms of the cMyc transcription factor, or variants thereof that maintain cMyc transcription factor activity (e.g. within at least 50%, 80%, 90% or 100% activity compared to wild type cMyc as measured by methods known in the art). In some embodiments, variants have at least 90% amino acid sequence identity across their whole sequence compared to the naturally occurring c-Myc polypeptide. In other embodiments, the c-Myc protein is the protein as identified by the NCBI reference NP_002458.2.
[0109] The term "Nanog" or "nanog" refers to a transcription factor critically involved with self-renewal of undifferentiated embryonic stem cells. In humans, this protein is encoded by the NANOG gene. Exemplary nanog is the protein encoded by murine gene (Genbank accession number) XM_132755) and human Nanog gene (Genbank accession number NM_024865).
[0110] The term "Nanog" or "nanog" and the like as referred to herein thus includes any of the naturally-occurring forms of the Nanog transcription factor, or variants thereof that maintain Nanog transcription factor activity (e.g. within at least 50%, 80%, 90% or 100% activity compared to wild type Nanog as measured by methods known in the art). In some embodiments, variants have at least 90% amino acid sequence identity across their whole sequence compared to the naturally occurring Nanog polypeptide. In other embodiments, the Nanog protein is the protein as identified by the NCBI reference NP_079141.
[0111] The term "Lin28" or "Lin-28 homolog A" is a protein that is encoded by the LIN28 gene in humans. It is a marker of undifferentiated human embryonic stem cells and encodes a cytoplasmic mRNA-binding protein that binds to and enhances the translation of the IGF-2 (Insulin-like growth factor 2) mRNA. Lin28 has also been shown to bind to the let-7 pre-miRNA and block production of the mature let-7 microRNA in mouse embryonic stem cells. Yu et al. demonstrated that it is a factor in iPSCs generation, although it is not mandatory.sup.3. Exemplary Lin28 is the protein encoded by murine gene (Genbank accession number NM_145833) and human Lin28 gene (Genbank accession number NM_024674).
[0112] The term "Lin28" or "Lin28 homolog A" and the like as referred to herein thus includes any of the naturally-occurring forms of the Lin28 transcription factor, or variants thereof that maintain Lin28 transcription factor activity (e.g. within at least 50%, 80%, 90% or 100% activity compared to wild type Lin28 as measured by methods known in the art). In some embodiments, variants have at least 90% amino acid sequence identity across their whole sequence compared to the naturally occurring Lin28 polypeptide. In other embodiments, the Lin28 protein is the protein as identified by the NCBI reference NP_078950.
[0113] As used herein, the percent identity between the two amino-acid sequences is a function of the number of identical positions shared by the sequences (i. e., % identity=# of identical positions/total # of positions.times.100), taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences. The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm, as described below.
[0114] The percent identity between two amino-acid sequences can be determined using the algorithm of E. Meyers and W. Miller (Comput. Appl. Biosci., 4:11-17, 1988) which has been incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4.
[0115] The skilled person may select other corresponding reprogramming factors originating from other mammals, such as mice, rats, cows, horses, sheep, pigs, goats, camels, antelopes, and dogs. Advantageously the skilled person may select the corresponding reprogramming factor from the same species as the target cells used as starting material in the method of the invention.
[0116] The skilled person may also select analogues of one or more of the above reprogramming factors. As used herein the term "analogue" refers to a compound that has a different structure but provides the same result as the reprogramming factor for use in generating iPSCs, and can thus replace said reprogramming factor in a method for generating induced pluripotent stem cells.
[0117] For example, analogues of such reprogramming factors have been described in Wenlin Li and Sheng Ding. Trends in Pharmacological Sciences Volume 31, Issue 1, January 2010, Pages 36-45 or Feng et al. Cell Stem Cell. 2009 Apr. 3; 4(4):301-12 (see in particular analogues disclosed in Tables 1 and 2 of Feng et al., 2009).
[0118] Conditions for Increasing the Expression of Reprogramming Factor
[0119] Any conditions available in the art for increasing expression of a reprogramming factor can be used in the methods of the invention, as long as such conditions result in the presence of reprogramming factor in an appropriate amount for reprogramming said target cells to induced pluripotent stem cells.
[0120] Various methods for increasing expression of reprogramming factors have been described in the art. For a review, see Hanna J H, Saha K, Jaenisch R. Cell. 2010 November 12; 143(4):508-25; or, Sheng Ding. Trends in Pharmacological Sciences Volume 31, Issue 1, January 2010, Pages 36-45; and, Feng et al. Cell Stem Cell. 2009 Apr. 3; 4(4):301-12.
[0121] In preferred embodiments, the following alternative may be used for increasing expression of the reprogramming factors:
[0122] (i) enhancing endogenous expression of the gene encoding said reprogramming factor,
[0123] (ii) allowing ectopic expression of said reprogramming factor by introducing an expression vector comprising a coding sequence of said reprogramming factor operably linked to control sequences into the target cell population, or
[0124] (iii) delivering into the cells an appropriate amount of said reprogramming factor or its precursor RNA.
[0125] In another embodiment, one or more expression vectors are used which comprise the coding sequence of the combination of reprogramming factors, for example, Oct4 coding sequence, Sox2 coding sequence, Klf4 coding sequence, c-Myc coding sequence, Lin28 coding sequence, and, optionally, Nanog coding sequence and/or coding sequences having at least 60%, 70%, 80%, 90% or 95% identity to the corresponding native coding sequences of Oct4, Sox2, Klf4, c-Myc, Lin28 and, optionally, Nanog.
[0126] As used herein, the term "coding sequence" relates to a nucleotide sequence that upon transcription gives rise to the encoded product. The transcription of the coding sequence in accordance with the present invention can readily be effected in connection with a suitable promoter. Preferably, the coding sequence corresponds to the cDNA sequence of a gene that gives rise upon transcription to a reprogramming factor.
[0127] The percent identity between two nucleotide sequences may be determined using for example algorithms such as the BLASTN program for nucleic acid sequences using as defaults a word length (W) of 11, an expectation (E) of 10, M=5, N=4, and a comparison of both strands.
[0128] Expression vectors for ectopic expression of the reprogramming factors may be, for example, plasmid vector, cosmid vector, bacterial artificial chromosome (BAC) vector, transposon-based vector (such as PiggyBac) or viral vector.
[0129] In one specific embodiment, the expression vectors used for increasing expression of said reprogramming factors are viral vectors. Examples of such viral vectors includes vectors originated from retroviruses such as HIV (Human Immunodeficiency Virus), MLV (Murine Leukemia Virus), ASLV (Avian Sarcoma/Leukosis Virus), SNV (Spleen Necrosis Virus), RSV (Rous Sarcoma Virus), MMTV (Mouse Mammary Tumor Virus), etc, lentivirus, Adeno-associated viruses, and Herpes Simplex Virus, but are not limited to.
[0130] Methods for generating induced pluripotent stem cells based on expression vectors encoding reprogramming factors have been described in the Art, see for example WO2007/69666, EP2096169-A1 or WO2010/042490.
[0131] Typically, the coding sequence of any reprogramming factors as used in the method of the invention, for example, Oct4 coding sequence, Sox2 coding sequence, Klf4 coding sequence, c-Myc coding sequence, Nanog coding sequence and/or Lin28 coding sequence, may be operably linked to control sequences, for example a promoter, capable of effecting the expression of the coding sequence in the target cell population. Such expression vector may further include regulatory elements controlling its expression, such as a promoter, an initiation codon, a stop codon, a polyadenylation signal and an enhancer. The promoter may be constitutive, or inducible. The vector may be self-replicable or may be integrated into the DNA of the host cell.
[0132] Alternatively, the vector for ectopic expression is a viral vector and viral particles are produced and used to introduce the coding sequence of said reprogramming factors into said target cell population comprising aged or senescent cells. The term viral particles is intended to refer to the particles containing viral structural proteins and a sequence coding said reprogramming factors.
[0133] Viral particles may be prepared by transforming or transfecting a packaging cell with a viral vector carrying the nucleotide coding sequences of said combination of reprogramming factors. In the examples below, viral particles are prepared from lentivirus.
[0134] The target cell population may then be transfected using the expression vectors as described above.
[0135] The term "transfection" or "transfecting" refers to a process of introducing nucleic acid molecules into a cell. The nucleic acid molecules may be gene sequences encoding complete proteins or functional portions thereof. Any appropriate transfection method is useful in the methods described herein.
[0136] Incorporating the coding sequence and its control sequences directly into the genome of the target cells may cause activating or inactivating mutations of oncogenes or tumor suppressor genes, respectively. For certain applications, in particular medical applications, it may be required to avoid any genetic modifications of the target cells.
[0137] In a third embodiment, the reprogramming factors, for example, Oct4, Sox2, Flk4, c-Myc, Nanog and Lin28, or corresponding coding DNA or RNA, are introduced into the target cells without integration of exogenous genetic material in the host DNA, i.e. without introduction of the nucleotide sequence in the cell's genome.
[0138] An expression vector such as a plasmid vector can be delivered into said cells for ectopic expression of the reprogramming factor, in the form of naked DNA. Alternatively, RNAs coding for said reprogramming factors either chemically modified or not, can be introduced into the cells to reprogram them (see for example Warren L, et al, 2010, Cell Stem Cell. November 5; 7(5):618-30).
[0139] Other expression vectors have been described for example in WO 2009115295.
[0140] These nucleic acids can be delivered into the target cells with the aid, for example, of a liposome or a cationic polymer, for example, using conventional transfection protocols in mammalian cells.
[0141] In particular, appropriate transfection methods that do not use viral DNA or viral particles as a delivery system to introduce the nucleic acid molecules into the target cell may be used in the methods described herein. Exemplary transfection methods include without limitation calcium phosphate transfection, liposomal transfection, nucleofection, sonoporation, transfection through heat shock, magnetofection and electroporation. In some embodiments, the nucleic acid molecules are introduced into the target cells using electroporation following standard procedures well known in the art.
[0142] Alternatively, the reprogramming factor protein or fragments thereof showing similar properties to the intact proteins with respect to the reprogramming of target cells can be delivered into said target cells with the aid of chemical carriers such as cell-penetrating peptides including, without limitation, penetratin or TAT-derived peptides.
[0143] Methods to improve efficiency of the generation of iPSCs have also been described in the Art. In particular, in the method of the invention, introduction and/or addition of various generation efficiency improving agents may be performed. Examples of substance for improving generation efficiency of iPSCs include, without limitation, histone deacetylase inhibitors (such as for example valproic acid, trichostatin A, sodium lactate, MC1293 and M344), and nucleic acid expression inhibitors such as siRNAs and shRNA for HDAC, and G9a histone methyltransferase inhibitors, and nucleic acid expression system inhibitors such as siRNA and shRNA for G9a (see also Feng et al., 2009, supra).
[0144] In one specific embodiment, the methods according to the invention do not comprise any step of direct silencing of senescence effectors, and in particular direct silencing of p53 effectors.
[0145] By "direct silencing", it is meant to use a substance that will act directly on the expression of the gene of interest, for example, p53 gene, and not on a substance that will act on a factor upstream. One way to silence a gene is the use of siRNA or shRNA directly directed on the gene sequence to inhibit.
[0146] Compositions Comprising iPSCs Obtainable from the Methods of the Invention
[0147] The invention further relates to a cell-based composition comprising iPSCs obtainable from the method as described above (hereafter referred as "the iPSCs compositions"), and a pharmaceutically acceptable vehicle. Remarkably, iPSCs according to the present invention have no characteristics of the aging phenotype, though deriving from cells from aged donors or senescent cells.
[0148] These iPSCs compositions typically may comprise iPSCs obtained from a patient suffering from aged-related disorder such as disorders caused by defective helicase, including Werner Syndrome, Cockayne, Rothmund-Thomson, and Bloom syndromes, and xeroderma pigmentosa and trichothiodystrophy, or other disorders including without limitation, Hutchinson-Giford progeria or Wiedemann-Rautenstrauch syndrome.
[0149] Composition of Cells Obtained by Differentiation of iPSCs Obtainable from the Methods of the Invention and Uses Thereof
[0150] The iPSCs obtained from the methods of the invention, in particular from cells of aged donors, may advantageously be cultured in vitro under differentiation conditions, to generate differentiated cells, such as muscle, cartilage, bone, dermal tissue, cardiac or vascular tissue, or other tissues of interest.
[0151] Thus, the invention relates to the methods for preparing compositions comprising differentiated cells, said method comprising the steps of
[0152] (a) providing a composition comprising iPS cells obtained from the methods of the invention from target cells of aged donors; and,
[0153] (b) culturing said composition comprising iPS cells, under appropriate conditions for their differentiation into the desired cell lineages.
[0154] The skilled person may use known protocols for differentiating stem cells, such as induced pluripotent stem cells, ES cells or mesenchymal stem cells into the desired cell lineages.
[0155] Another aspect of the invention relates to the use of said composition comprising said cell lineages derived from differentiation of iPSC, hereafter referred as the Differentiated Cells of the Invention.
[0156] The Differentiated Cells of the Invention have the particularity to have a rejuvenated phenotype, for example, with respect to the size of the telomeres, gene expression profile, metabolism, and the number of cell cycle prior to appearance of senescence phenotype, while being derived from cells of aged donors, for example donors more than 70 years old. The Differentiated Cells of the Invention may thus be used in a variety of application, in particular, in research or therapeutic field.
[0157] One major field of application is cell therapy or regenerative medicine. These iPSCs or Differentiated Cells composition may also be useful for generating cellular models of aged-related disorders as described above.
[0158] For example, primary cells, such as fibroblast cells obtained from a patient suffering from a genetic defect, may be cultured and genetically corrected according to methods known in the art, and subsequently reprogrammed and rejuvenated into iPSCs according to the methods of the present invention and differentiated into the suitable cell lineages for re-administration into the patient, for example the same patient as the cell donor (autologous treatment).
[0159] Similarly, regenerative medicine can be used to potentially cure any disease that results from malfunctioning, damaged or failing tissue by either regenerating the damaged tissues in vivo by direct in vivo implanting of a composition comprising iPSCs or their derivatives comprising appropriate progenitors or cell lineages or differentiated cells of the Invention. Preferably, such damaged tissues are tissue damaged from aged-related disorders or aged patients, being more than 50, 60, 70, 80, 90 or more than 100 years old.
[0160] In one aspect, the iPS cells or the Differentiated Cells of the Invention may be useful for autologous regenerative therapy of a patient suffering from aged related disorders or an aged patient in need of regenerative therapy due to specific disorders or treatments associated to such disorders, including without limitation, cancer disorders, inflammatory and autoimmune disorders, muscle and skeletal disorders, neurologic disorders, diabete and other metabolic disorders.
[0161] Therefore, in one aspect, the invention relates to the iPSCs compositions or Differentiated Cells of the Invention for use as a cell therapy product for implanting into mammal, for example a human patient, preferably, an aged patient being more than 50, 60, 70, 80, 90 or more than 100 years old, most preferably as an autologous graft (i.e the cells have the same genotype as the patient's cells)
[0162] In another specific embodiment, the iPSCs compositions or Differentiated Cells of the Invention are used for the treatment of joint or cartilage, muscle or bone damages.
[0163] In another specific embodiment, the iPSCs compositions or Differentiated Cells of the Invention may also be used advantageously for the production of dermal tissues, for example, skin tissues, for use in regenerative medicine (cell-based therapy) or in research.
[0164] In another specific embodiment, the iPSCs compositions or Differentiated Cells of the Invention may also be used advantageously for the production of, but not restricted to, dermal, muscle or skeletal cells from healthy or diseased patients for screening applications in the pharmaceutical industry. Such screening tests can be used to search for new drugs with clinical applications or for toxicology tests.
[0165] In another specific embodiment, the iPSCs compositions or Differentiated Cells of the Invention may also be used for regenerating cardiac or vascular tissue.
[0166] In another specific embodiment, the iPSCs compositions or Differentiated Cells of the Invention may also be used for regenerating brain tissue or neuronal tissue, for example in patient suffering from neurodegenerative disorders.
[0167] Methods for Rejuvenating Target Cells
[0168] In another aspect, the invention relates to a method for rejuvenating cells from aged donors or senescent cells.
[0169] In particular, the invention relates to an in vitro method for rejuvenating cells from aged donors or senescent cells, said method comprising reprogramming said target cells to induced pluripotent stem cell, by increasing expression in said target cells of at least the following reprogramming factors:
[0170] (i) a reprogramming factor encoded by one gene of the Oct family gene, for example Oct4,
[0171] (ii) a reprogramming factor encoded by one gene of the Klf family gene, for example Klf4,
[0172] (iii) a reprogramming factor encoded by one gene of the Sox family gene, for example Sox2,
[0173] (iv) a reprogramming factor encoded by one gene of the Myc family gene, for example, c-myc or L-myc, and,
[0174] (v) Lin28,
[0175] (vi) and, optionally, Nanog.
[0176] The term "rejuvenating" refers to the process of erasing epigenetic modifications participating in the cellular aging phenotype. The cellular aging phenotype can be characterized, inter alia by the following markers:
[0177] activation of the p53/p21.sup.CIP1 and pRb/p16.sup.INK4A tumor suppressor pathways (hereafter referred as senescence effectors),
[0178] cells arrested irreversibly in G1, shortening of telomere size,
[0179] expression of senescent-associated .beta.-galactosidase activity (SA .beta.-Gal),
[0180] Specific chromatin modification as senescence-associated heterochromatic foci (SAHF),
[0181] Specific secretome,
[0182] reduced/altered overall mitochondrial activity.
[0183] A process of rejuvenation is observed when one or all of these markers of aging phenotype is reduced or suppressed in an aged or senescent cell type due to the rejuvenating process.
[0184] In one preferred embodiment, the in vitro method for rejuvenating cells from aged donors or senescent cells comprise culturing said cells from aged donors or senescent cells under appropriate conditions for increasing expression of the following combination of reprogramming factors consisting of Oct4, Klf4, Sox2, c-Myc, Lin28 and, optionally, Nanog.
[0185] The combination of reprogramming factors may be used in vivo for rejuvenating tissue of a subject in need thereof. Thus the invention further relates to a composition for in vivo use in rejuvenating senescent or aged cells in a subject in need thereof, said composition comprising means for increasing expression of the following combination of reprogramming factors consisting of Oct4, Klf4, Sox2, c-Myc, Lin28 and, optionally, Nanog.
[0186] In some embodiments, said means for increasing expression of said reprogramming factors comprises an appropriate amount of Oct4 protein, Klf4 protein, Sox2 protein, c-Myc protein, Lin28 protein and, optionally, Nanog protein, each protein being associated to appropriate means for delivery of said protein into the nucleus of the senescent cells to be rejuvenated.
[0187] Means for delivery of a protein into a cell includes, without limitation, the chemical carriers such as cell-penetrating peptides such as penetratin or TAT-derived peptides.
[0188] In other embodiments, said means for increasing expression of said reprogramming factors comprise an appropriate amount of Oct4 precursor RNA, Klf4 precursor RNA, Sox2 precursor RNA, c-Myc precursor RNA, Lin28 precursor RNA and, optionally, Nanog precursor RNA, associated to appropriate means for delivery of each precursor RNA into the cytoplasm of the senescent cells to be rejuvenated.
[0189] These compositions as described above are particularly suitable for topical application, for example for skin or dermal application, for example for treating skin disorders.
[0190] The invention will be further illustrated by the following examples. However, these examples should not be interpreted in any way as limiting the scope of the present invention.
EXAMPLES
[0191] Methods
[0192] Cells from aged donors were obtained from the Coriell Institute for Medical Research (NJ, USA). Replicative senescent cells were obtained by extensive cell culture until cell cycle growth arrest, assessed by FACS, and senescence-Associated .beta.-Galactosidase activity detected as previously described.sup.25.
[0193] H9 and H1 human embryonic stem cells were obtained from the WiCell Research Institute (WI, USA). hESCs and iPSCs were either maintained using standard hESC procedures on Mitomycin-C treated OF-1 mouse embryonic fibroblasts (MEF) in KO DMEM culture medium (hESC medium) supplemented with 20% KnockOut serum replacement, 0.1 mM non-essential amino acids, 2 mM L-glutamine (all from Invitrogen), 0.1 mM .beta.-mercaptoethanol and 10 ng/ml basic fibroblast growth factor (bFGF, Peprotech) or in feeder-free culture on matrigel (BD Biosciences) with chemically defined mTeSR medium (Stemcell Technologies) as previously described.sup.26.
[0194] For generation of human iPSCs, lentiviral vectors containing cDNAs of human OCT4, SOX2, NANOG and LIN28 genes were obtained from Addgene and previously described by Yu et al.sup.3. KLF4 and c-MYC cDNAs were subcloned into the same vector backbone from vectors described by Takahashi et al..sup.2,3. The 293T cell line (Invitrogen) was used to produce transgene-expressing lentiviruses. Human primary fibroblasts were seeded at 2 10.sup.5 cells per 35 mm dish one day before transduction. Equal amounts of supernatants containing each of the six lentiviruses were mixed, transferred to the fibroblast dish, and incubated overnight. Twenty-four hours after transduction, the lentivirus-containing medium was replaced with the second supernatant. Six days after transduction, fibroblasts were harvested by trypsinization and replated in a 100 mm dish on MEF feeder layer. Next day, the medium was replaced with hESCs medium supplemented with 10 ng/ml bFGF. The medium was changed every other day. Thirty at forty days after transduction, colonies were picked up and transferred into 35 mm dish on a feeder layer with 2 ml of hESCs medium supplemented with 10 ng/ml bFGF.
[0195] Results
[0196] 74-year-old donor proliferative human diploid fibroblasts (hereafter 74P) were induced into replicative senescence by serial passaging (74S). Cellular senescence was assessed after 18 passages (51 population doublings) by FACS analysis, increase in SA-13-Gal activity, up-regulation of the CDK inhibitors p16.sup.INK4A and p21.sup.CIP1 and the formation of SAHFs. These senescent cells were also maintained more than 2 months in culture without any detectable increase in the cell number, confirming the robustness of cell cycle arrest. We initially tried to generate iPSCs from these senescent fibroblasts by overexpression of the LIN28-containing set of genes (OCT4, NANOG, SOX2, LIN28) to compare with previous experimental results published with the set of 4 reprogramming factors OCT4, SOX2, KLF4, c-MYC and originally described as inefficient for senescence bypass. All of these reprogramming factor genes were transduced by individual lentiviruses. After 40 days, we did not observe any new proliferation or formation of hESC colonies resembling iPSCs. This was confirmed by the absence of detectable expression of endogenous pluripotency genes in the infected cell population, whereas quantitative RT-PCR demonstrated efficient viral transduction. These results demonstrate that the LIN28-containing set of reprogramming factors is not able to reverse the replicative senescence state to generate iPSCs, as previously described with the OCT4, SOX2, KLF4, c-MYC combination.
[0197] Interestingly, NANOG overexpression has been described to accelerate reprogramming in a predominantly cell-division-rate-independent manner, and overexpression of LIN28, similar to inhibition of the p53/p21.sup.CIP1 pathway, increases the cell division rate, resulting in an accelerated kinetics of iPSCs production.sup.13,14. We therefore hypothesized that the senescence barrier might be overcome using a protocol optimized for an increased reprogramming efficiency based on the combination of the 6 reprogramming factors OCT4, NANOG, SOX2, KLF4, LIN28, and c-MYC introduced by individual lentiviral particles, without additional transient or permanent inhibition of senescence inductors. To test our hypothesis, 74P and 74S cells were infected twice with a mix of the individual lentiviruses carrying each of the 6 genes. One week after infection, we observed disappearance of SAHFs in infected senescent fibroblasts (74S inf) revealing a first step of reprogramming. Then, cells were plated onto mouse fibroblast feeders in hESCs medium, and after 18-20 days, proliferation was recovered in infected senescent cells. Colonies resembling hESCs appeared at 35-40 days post-infection. iPSC-like colonies were produced from senescent fibroblasts (74S), with a mean reprogramming efficiency of 0.015-0.03%, similar to proliferative fibroblasts (74P), infected under the same conditions. We randomly selected 6 colonies from proliferative (iPSC 74P) and senescent (iPSC 74S) 74-year-old donor fibroblasts, and further characterized 3 clones, which were then successfully expanded in either regular hESCs feeder or feeder-free culture conditions. Long-term culture and assessment for the expression of stem cell markers confirmed successful maintenance and reprogramming of these cells, which have now been grown for more than 35 passages. Immunocytochemistry analysis demonstrated the continued presence of the cell surface markers SSEA-4 and TRA-1-60 that characterize human pluripotent stem cells. Quantitative RT-PCR analyses showed that the iPSCs re-expressed endogenous OCT4, SOX2, NANOG and the REX1 pluripotent marker genes at the same level as H1 and H9 hESC lines or the IMR90 clone 4 iPSC line generated in the Thomson Laboratory (iPSC IMR90 TH Cl 4.sup.3) and grown in parallel, while no transcript was detected in the parental fibroblasts. In order to corroborate this reactivation of endogenous genes, we investigated the DNA methylation status of CpG dinucleotides in one described CpG-rich region in OCT4 and NANOG promoters. Bisulfite genomic sequencing analysis showed that both the OCT- and NANOG promoters were demethylated either in iPSCs from senescent (iPSC 74S Cl F) or proliferative (iPSC 74P Cl H) cells, as efficiently as the previously published iPSC IMR90 TH Cl 4.sup.3, when compared to the methylation status of hESC H9, whereas the same regions were highly methylated in parental fibroblasts.
[0198] To exclude any cell type-specific effects, we repeated the same protocol using the IMR90 human embryonic fibroblasts induced into replicative senescence by serial passaging. Similar to the 74-year-old donor fibroblasts, we did not succeed in generating iPSCs using both sets of 4 factors described previously, whereas we obtained similar reprogramming efficiency from proliferative or senescent IMR90 fibroblasts with the combination of the 6 factors.
[0199] Next we assessed the pluripotent state of our generated iPSCs by evaluating the differentiation abilities of iPSC 74S compared to the iPSC 74P clones. All iPSCs were able to differentiate into the three early embryonic lineages, endoderm, ectoderm and mesoderm, as demonstrated using immunostaining with specific antibodies against SMA, MAP2 and FOXA2 respectively. We obtained similar results with senescent IMR90 fibroblasts. Altogether, these results indicate that the combination of the 6 transcription factors, OCT4, NANOG, SOX2, KLF4, LIN28, and c-MYC is a successful reprogramming strategy for reversing the cellular senescence state leading to generation of iPSCs.
[0200] During aging, the number of senescent cells increases in the human body, and is thought to impair tissue homeostasis. However, increased expression of p16.sup.INK4A and p21.sup.CIP1 occurs in proliferative cells from aged donors, and is thought to decrease progressively their proliferative capacity''. Whether proliferative cells from centenarians might be efficiently reprogrammed towards pluripotency, and whether they retain some specific markers of the cellular aging phenotype, is an unresolved issue. To investigate this specific point, we used fibroblasts from 92-, 94-, 96- and 101-year-old donors for reprogramming experiments, by using the 6 factors in combination. As suggested by our previous experiments on senescent cells, we were able to generate iPSCs from all the aged donor fibroblasts, with similar efficiency to those obtained with senescent fibroblasts. All iPSC clones generated re-expressed endogenous pluripotency genes OCT4, SOX2, NANOG and REX1, as presented for two clones for each parental fibroblast line, that was also confirmed by demethylation of CpG in the OCT4 and NANOG promoter regions. Interestingly, we also observed partial demethylation of the NANOG promoter in parental fibroblasts from aged donors that is in accordance with the global demethylation of the genome already described in the elderlyl.sup.6, that might also have contributing effects in reprogramming. In addition, we detected re-expression of the pluripotency cell surface markers SSEA-4 and TRA-1-60, and finally, we demonstrated their ability to differentiate into endoderm, ectoderm and mesoderm derivatives as judged by immunostaining with SMA, MAP2 and FOXA2 antibodies respectively.
[0201] These results demonstrate the efficiency of our reprogramming procedure to successfully reinstate the pluripotent state and self-renewal from centenarian fibroblasts; thus, cellular aging and the frequently-associated senescent phenotype is not a limit to reprogramming towards pluripotency.
[0202] Although we succeeded in generating iPSCs using aged donors and senescent fibroblasts, an essential issue was to elucidate whether induction of a pluripotent state might erase the main markers of the cellular aging phenotype. We previously showed that reprogramming induced SAHF disappearance, demonstrating the genome organization plasticity of senescent cells. We then analyzed other hallmarks of aging and senescence in our iPSCs and found that iPSCs generated from replicative senescent fibroblasts from the 74-year-old donor did not retain increased expression of p21.sup.CIP1 and p16.sup.4A inherited from their previous senescent state, as shown by immunoblot analysis. Similar results were also obtained with IMR90 embryonic replicative senescent fibroblasts. In addition, all iPSCs generated from centenarians also have downregulated expression of p21.sup.CIP1 and p16.sup.INK4A proteins, similar to hESC lines. These results indicate that we were able to reset the expression of p21.sup.CIP1 and p16.sup.INK4A to the low level found in hESCs.
[0203] Proliferative fibroblasts from aged donors are usually characterized by telomeres of heterogeneous size, whose mean length depends on the parental inherited size and on the various number of divisions occurring during the lifespan, but which do not necessarily correlate with proliferation capacity. However replicative senescence is always associated with short telomeres. Short telomeres are recognized as damaged DNA, leading to activation of the DNA damage response signaling cascade and triggering senescence-associated cell cycle arrest. Although, iPSCs generally exhibit an increased telomere size compared to the parental differentiated cells.sup.17, we wondered whether iPSCs obtained from senescent cells or from centenarian cells exhibited telomeres of increased length. To address this question, we used Southern blot analysis to examine the mean terminal restriction fragment (TRF) length of iPSCs obtained from replicative senescent cells, compared to proliferative cells. We found that telomere length of iPSCs from the aged 74 donor, both from proliferative or senescent cells, increased to a size equivalent to those observed in H9 hESC. Unlike parental fibroblasts, which entered into replicative senescence after 50 population doublings, and 60-63 doublings for embryonic fibroblasts IMR90, we were able, to culture all iPSC lines continuously, which remained stable after more than 110 population doublings. Similarly, telomeric DNA length was increased in iPSCs derived from senescent or proliferative IMR90 embryonic fibroblasts, after reprogramming. Although, telomeres from centenarians were shortened less in size than in senescent fibroblasts, we were able to reset their size to the same length as hESCs. Interestingly, in some iPSC clones, we found a longer upper size than found in H9 hESC, suggesting that telomere size in pluripotent cells does not have an inherited maximum size. It also suggests some possible additional developments in iPSC generation for increased proliferation ability of differentiated cells, suitable for cell-based therapy in regenerative medicine.
[0204] Collectively, these data highlight that our reprogramming protocol leads to erasure of the most common marks of senescence and aging in the generated iPSCs.
[0205] To further evaluate the pluripotent capacity of senescent and aged derived iPSCs, we selected 3 iPSC clones from aged proliferative and senescent fibroblasts, iPSC 74P Cl H, iPSC 74S Cl F and iPSC 96 CI 1. We first confirmed the full capacity of these clones to progress into terminal differentiation by the formation of teratoma in mice, leading to appearance of organized organ-like structures in the three embryonic lineages. DNA fingerprinting analyses (short tandem repeat, STR) were also performed to confirm that iPSC clones were derived from their corresponding parental fibroblasts. We also verified that the 6 transgenes used for the reprogramming were almost completely downregulated.
[0206] Then, we performed transcriptome analysis of 3 selected clones and their parental counterpart, that we compared to a hESCs and iPSCs data set, built as a compendium (.sup.18). We first confirmed that the specific pluripotent genes were expressed in our iPSCs at a similar level as hESCs and iPSCs from the compendium (.sup.19). Then, we performed a hierarchical clustering of our 3 iPSC clones and their parental fibroblasts, combined with several hESCs, iPSCs and post-natal fibroblasts. Strikingly, we found that, proliferative, senescent and aged fibroblasts were clustered together compared to post-natal fibroblasts, suggesting that they share a general common aging signature. Further, the derived iPSCs from proliferative and senescent aged fibroblasts obtained using the infection of the 6 factors are significantly more similar to hESCs than previously described iPSCs derived from a 4 factor infection.
[0207] Since oxidative stress and mitochondrial dysfunction are well described in senescence and aging.sup.20,21, we wondered whether these functions were also specifically reprogrammed from senescent and aged cells. Transcriptome analysis allowed us to study genes involved in both processes, as previously described.sup.22. Again, clustering of transcriptomes with this subset of specific genes indicated that global modifications in expression profiles associated with these altered functions were specific to aged and senescent fibroblasts, when compared to young proliferative embryonic or post-natal fibroblasts, and that our derived iPSCs have reset these functions to an embryonic-like status. Next, we evaluated the overall mitochondrial activity in the derived iPSCs compared to hESCs, by measuring the (.DELTA..PSI.m) transmembrane potential generated by the proton gradient, which is an indicator of healthy mitochondrial function. For this purpose, we used the cationic dye JC-1 and quantified the fluorescence intensity ratio of its two forms by confocal microscopy and flow cytometry analysis. As previously shown, the red/green ratio decreased with senescence.sup.20,21 and also seems associated with aging. Strikingly, we found an increased ratio in iPSCs to a level similar to that found in hESCs, confirming that reprogramming restored the mitochondrial activity of iPSCs derived from old and senescent fibroblasts. Similar results were obtained with iPSCs from proliferative or senescent IMR90 fibroblasts. Furthermore, we did not observe differences in distribution and morphology of mitochondria in iPSCs by electronic microscopy when compared to H1 hESC. Analysis of mitochondrial properties illustrates how nuclear reprogramming, in resetting gene expression programs, might rejuvenate to a healthy cell physiology through restoring impaired function of cellular organelles whose dysfunction is involved in cellular aging. Finally, using a fibroblasts differentiation assay.sup.23,24, we demonstrated that these cells did not enter prematurely into senescence. Indeed, fibroblasts derived from 74P, S and 96 iPSCs did not reveal SA-.beta.-Gal activity after 10 population doublings, and exhibited a proliferation rate equivalent to young proliferative fibroblasts. To exclude the possibility that our reprogramming strategy was not associated with any mutations in the senescence induction pathways, we demonstrated the ability of re-differentiated fibroblasts to re-enter into replicative senescence. After extensive culture, these cells became senescent, as shown by increased SA-G3-Gal activity associated with cell cycle arrest, the re-increased expression of p16.sup.INK4A, and p21.sup.CIP1 and a re-shortened telomere size. More interestingly, the number of population doublings (PD) necessary to trigger replicative senescence was increased. While the age 74 parental fibroblasts entered into replicative senescence after 51 PD, the re-differentiated fibroblasts from iPSC 74S Cl F entered into replicative senescence only after 58 PD. This re-acquired proliferation potential is similar to iPSC 74P Cl H derived from the age 74 proliferative parental fibroblasts PD 60, which was infected at PD 12. These cells exhibited a population doubling potential of 39 PD before exhaustion by replicative senescence. A similar resetting of the proliferation ability of the age 96 fibroblasts was observed and explained by the lengthening of telomeres. We conclude that our reprogramming strategy, which overcame the senescent state, was also able to increase the cellular lifespan. Transcriptome analysis by hierarchical clustering, comparing parental fibroblasts with post-natal and differentiated H1 hESCs in fibroblasts, finally demonstrated that the global gene expression profile of early re-differentiated fibroblasts from our iPSCs generated from old donors and senescent fibroblasts are distinct from parental fibroblasts, and closer to embryonic fibroblasts derived from the H1 hESC line. This result was also confirmed by gene expression profiles associated with oxidative stress and mitochondrial activity, confirming the rejuvenated physiology of our aged and senescent cells.
[0208] Taken together, our results show that it is possible to reprogram replicative senescent cells and cells derived from centenarians into iPSCs using a specific combination of genes, demonstrating that aging and senescence are not a barrier to reprogramming towards pluripotency. It also improves our understanding of fundamental cell reprogramming and underlines the underestimated importance of epigenetic modifications participating in the cellular aging process, which is evidently susceptible to be reprogrammed as well. But most importantly, we also demonstrate that, using an adequate reprogramming strategy, it is possible to rejuvenate the cell physiology, suggesting the potential reversibility of major aspects of the aging phenotype. These results also promote the potential development of age-related disease models, and support the development of new therapeutic cell-based strategies to erase some pathologies related to aging.
[0209] Useful Nucleotide and Amino Acid Sequences for Practicing the Methods of the Invention
TABLE-US-00001 TABLE 1 NO: Description Sequence 1 Human Oct3/4 Gene CCTTCGCAAGCCCTCATTTCACCAGGCCCCCGCCTTGGGGCGCCTT sequence CCTTCCCCATGGCGGGACACCTGGCTTCGGATTTCGCCTTCTCGCC (NM_002701) CCCTCCAGGTGGTGGAGGTGATGGGCCAGGGGGGCCGGAGCCGGGC TGGGTTGATCCTCGGACCTGGCTAAGCTTCCAAGGCCCTCCTGGAG GGCCAGGAATCGGGCCGGGGGTTGGGCCAGGCTCTGAGGTGTGGGG GATTCCCCCATGCCCCCCGCCGTATGAGTTCTGTGGGGGGATGGCG TACTGTGGGCCCCAGGTTGGAGTGGGGCTAGTGCCCCAAGGCGGCT TGGAGACCTCTCAGCCTGAGGGCGAAGCAGGAGTCGGGGTGGAGAG CAACTCCGATGGGGCCTCCCCGGAGCCCTGCACCGTCACCCCTGGT GCCGTGAAGCTGGAGAAGGAGAAGCTGGAGCAAAACCCGGAGGAGT CCCAGGACATCAAACCTCTGCAGAAAGAACTCCAGCAATTTGCCAA GCTCCTGAAGCAGAAGAGGATCACCCTGGGATATACACAGGCCGAT GTGGGGCTCACCCTGGGGGTTCTATTTGGGAAGGTATTCAGCCAAA CGACCATCTGCCGCTTTGAGGCTCTGCAGCTTAGCTTCAAGAACAT GTGTAAGCTGCGGCCCTTGCTGCAGAAGTGGGTGGAGGAAGCTGAC AACAATGAAAATCTTCAGGAGATATGCAAAGCAGAAACCCTCGTGC AGGCCCGAAAGAGAAAGCGAACCAGTATCGAGAACCGAGTGAGAGG CAACCTGGAGAATTTGTTCCTGCAGTGCCCGAAACCCACACTGCAG CAGATCAGCCACATCGCCCAGCAGCTTGGGCTCGAGAAGGATGTGG TCCGAGTGTGGTTCTGTAACCGGCGCCAGAAGGGCAAGCGATCAAG CAGCGACTATGCACAACGAGAGGATTTTGAGGCTGCTGGGTCTCCT TTCTCAGGGGGACCAGTGTCCTTTCCTCTGGCCCCAGGGCCCCATT TTGGTACCCCAGGCTATGGGAGCCCTCACTTCACTGCACTGTACTC CTCGGTCCCTTTCCCTGAGGGGGAAGCCTTTCCCCCTGTCTCCGTC ACCACTCTGGGCTCTCCCATGCATTCAAACTGAGGTGCCTGCCCTT CTAGGAATGGGGGACAGGGGGAGGGGAGGAGCTAGGGAAAGAAAAC CTGGAGTTTGTGCCAGGGTTTTTGGGATTAAGTTCTTCATTCACTA AGGAAGGAATTGGGAACACAAAGGGTGGGGGCAGGGGAGTTTGGGG CAACTGGTTGGAGGGAAGGTGAAGTTCAATGATGCTCTTGAT TTTAATCCCACATCATGTATCACTTTTTTCTTAAATAAAGAAGCCT GGGACACAGTAGATAGACACACTTAAAAAAAAAAA 7 Human Oct4 amino MAGHLASDFAFSPPPGGGGDGPGGPEPGWVDPRTWLSFQGPPGGPG acid sequence IGPGVGPGSEVWGIPPCPPPYEFCGGMAYCGPQVGVGLVPQGGLET (ADW77327.1) SQPEGEAGVGVESNSDGASPEPCTVTPGAVKLEKEKLEQNPEESQD IKALQKELEQFAKLLKQKRITLGYTQADVGLTLGVLFGKVFSQTTI CRFEALQLSFKNMCKLRPLLQKWVEEADNNENLQEICKAETLVQAR KRKRTSIENRVRGNLENLFLQCPKPTLQQISHIAQQLGLEKDVVRV WFCNRRQKGKRSSSDYAQREDFEAAGSPFSGGPVSFPLAPGPHFGT PGYGSPHFTALYSSVPFPEGEAFPPVSVTTLGSPMHSN 3 Human Sox2 gene CTATTAACTTGTTCAAAAAAGTATCAGGAGTTGTCAAGGCAGAGAA sequence GAGAGTGTTTGCAAAAAGGGAAAAGTACTTTGCTGCCTCTTTAAGA (NM_003106) CTAGGGCTGGGAGAAAGAAGAGGAGAGAGAAAGAAAGGAGAGAAGT TTGGAGCCCGAGGCTTAAGCCTTTCCAAAAACTAATCACAACAATC GCGGCGGCCCGAGGAGGAGAGCGCCTGTTTTTTCATCCCAATTGCA CTTCGCCCGTCTCGAGCTCCGCTTCCCCCCAACTATTCTCCGCCAG ATCTCCGCGCAGGGCCGTGCACGCCGAGGCCCCCGCCCGOGGCCCC TGCATCCCGGCCCCCGAGCGCGGCCCCCACAGTCCCGGCCGGGCCG AGGGTTGGCGGCCGCCGGCGGGCCGCGCCCGCCCAGCGCCCGCATG TATAACATGATGGAGACGGAGCTGAAGCCGCCGGGCCCGCAGCAAG CTTCGGGGGGCGGCGGCGGAGGAGGCAACGCCACGGCGGCGGCGAC CGGCGGCAACCAGAAGAACAGCCCGGACCGCGTCAAGAGGCCCATG AACGCCTTCATGGTATGGTCCCGGGGGCAGCGGCGTAAGATGGCCC AGGAGAACCCCAAGATGCACAACTCGGAGATCAGCAAGCGCCTGGG CGCGGAGTGGAAACTTTTGTCCGAGACCGAGAAGCGGCCGTTCATC GACGAGGCCAAGCGGCTGCGCGCTCTGCACATGAAGGAGCACCCGG ATTATAAATACCGCCCGCCGCGGAAAACCAAGACGCTCATGAAGAA GGATAAGTACACGCTTCCCGGAGGCTTGCTGGCCCCCGGCGGGAAC AGCATGGCGAGCGGGGTTGGGGTGGGCGCCGCCCTGGGTGCGGGCG TGAACCAGCGCATGGACAGCTACGCGCACATGAACGGCTCGAGCAA CGGCAGCTACAGCATGATGCAGGAGCAGCTGGGCTACCCGCAGCAC CCGGGCCTCAACGCTCACGGCGCGGCACAGATGCAACCGATGCACC GCTACGACGTCAGCGCCCTGCAGTACAACTCCATGACCAGCTCGCA GACCTACATGAACGGCTCGCCCACCTACAGCATGTCCTACTCGCAG CAGGGCACCCCCGGTATGGCGCTGGGCTCCATGGGCTCTGTGGTCA AGTCCGAGGCCAGCTCCAGCCCCCCCGTGGTTACCTCTTCCTCCCA CTCCAGGGCGCCCTGCCAGGCCGGGGACCTCCGGGACATGATCAGC ATGTACCTCCCCGGCGCCGAGGTGCCGGAGCCCGCTGCGCCCAGTA GACTGCACATGGCCCAGCACTACCAGAGCGGCCCGGTGCCCGGCAC GGCCATTAACGGCACACTGCCCCTGTCGCACATGTGAGGGCTGGAC TGCGAACTGGAGAAGGGGAGAGATTTTCAAAGAGATACAAGGGAAT TGGGAGGGGTGCAAAAAGAGGAGAGTAGGAAAAATCTGATAATGCT CAAAAGGAAAAAAAATCTCCGCAGCGAAACGACAGCTGCGGAAAAA AACCACCAATCCCATCCAAATTAACGCAAAAACCGTGATGCCGACT AGAAAACTTTTATGAGAGATCTTGGGACTTCTTTTTGGGGGACTAT TTTTGTACAGAGAAAACCTGAGGGCGGCGGGGAGGGCGGGGGAATC GGACCATGTATAGATCTGGAGGAAAAAAAACTACGCAAAACTTTTT TTAAAGTTCTAGTGGTACGTTAGGCGCTTCGCAGGGAGTTCGCAAA AGTCTTAACCAGTAATATTCAGAGCTAGACTCCGGGCGATGAAAAA AAAGTTTTAATATTTGCAAGCAACTTTTGTACAGTATTTATCGAGA TAAACATGGCAATCAAATGTCCATTGTTTATAAGCTGAGAATTTGC CAATATTTTTCGAGGAAAGGGTTCTTGCTGGGTTTTGATTCTGCAG CTTAAATTTAGGACCGTTACAAACAAGGAAGGAGTTTATTCGGATT TGAACATTTTAGTTTTAAAATTGTACAAAAGGAAAACATGAGAGCA AGTACTGGCAAGACCGTTTTCGTGGTCTTGTTTAAGGCAAACGTTC TAGATTGTACTAAATTTTTAACTTACTGTTAAAGGCAAAAAAAAAA TGTCCATGCAGGTTGATATCGTTGGTAATTTATAATAGCTTTTGTT CAATCCTACCCTTTCATTTTGTTCACATAAAAAATATGGAATTACT GTGTTTGAAATATTTTCTTATGGTTTGTAATATTTCTGTAAATTGT GATATTTTAAGGTTTTTCCCCCCTTTTATTTTCCGTAGTTGTATTT TAAAAGATTCGGCTCTGTTATTGGAATCAGGCTGCCGAGAATCCAT GTATATATTTGAACTAATACCATCCTTATAACAGCTACATTTTCAA CTTAAGTTTTTACTCCATTATGCACAGTTTGACATAAATAAATTTT TGAAATATGGACACTGAAA 4 Human Sox2 amino MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMN acid sequence (NCBI AFMVWSRGQRRKMAQEWPKMHNSEISKRLGAEWKLLSETEKRPFID ref NP_003097.1) EAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLPGGLLAPGGNS MASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGYPQHP GLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQ GTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISM YLPGPAEVPEPAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM 5 Human Klf4 gene AGTTTCCCGACCAGAGAGAACGAACGTGTGTGCGGGCGCGCGGGGA sequence GCAGAGGCGGTGGCGGGCGGCGGCGGCACCGGGAGCCGCCGAGTGA (NM_004235) CCCTCCCCCGCCCCTCTGGCCCCCCACCCTCCCACCCGCCCGTGGC CCGCGCCCATGGCCGCGCGCGCTCCACACAACTCACCGGAGTCCGC GCCTTGCGCCGCCGACCAGTTCGCAGCTCCGCGCCACGGCAGCCAG TCTCACCTGGCGGCACCGCCCGCCCACCGCCCCGGCCACAGCCCCT GCGCCCACGGCAGCACTCGAGGCGACCGCGACAGTGGTGGGGGACG CTGCTGAGTGGAAGAGAGCGCAGCCCGGCCACCGGACCTACTTACT CGCCTTGCTGATTGTCTATTTTTGCGTTTACAACTTTTCTAAGAAC TTTTGTATACAAAGGAACTTTTTAAAAAAGACGCTTCCAAGTTATA TTTAATCCAAAGAAGAAGGATCTCGGCCAATTTGGGGTTTTGGGTT TTGGCTTCGTTTCTTCTCTTCGTTGACTTTGGGGTTCAGGTGCCCC AGCTGCTTCGGGCTGCCGAGGACCTTCTGGGCCCCCACATTAATGA GGCAGCCACCTGGCGAGTCTGACATGGCTGTCAGCGACGCGCTGCT CCCATCTTTCTCCACGTTCGCGTCTGGCCCGGCGGGAAGGGAGAAG ACACTGCGTCAAGCAGGTGCCCCGAATAACCGCTGGCGGGAGGAGC TCTCCCACATGAAGCGACTTCCCCCAGTGCTTCCCGGCCGCCCCTA TGACCTGGCGGCGGCGACCGTGGCCACAGACCTGGAGAGCGGCGGA GCCGGTGCGGCTTGCGGCGGTAGCAACCTGGCGCCCCTACCTCGGA GAGAGACCGAGGAGTTCAACGATCTCCTGGACCTGG ACTTTATTCTCTCCAATTCGCTGACCCATCCTCCGGAGTCAGTGGC CGCCACCGTGTCCTCGTCAGCGTCAGCCTCCTCTTCGTCGTCGCCG TCGACCAGCCGCCCTGCCAGCGCGCCCTCCACCTCCACCTTCACCT ATCCGATCCGGGCCGGGAACGACCCGGGCGTGGCGCCGGGCGGCAC GGGCGGAGGCCTCCTCTATGGCAGGGAGTCCGCTCCCCCTCCGACG GCTCCCTTCAACCTGGCGGACATCAACGACGTGAGCCCCTCGGGCG GCTTCGTGGCCGAGCTCCTGCGGCCAGAATTGCACCCGGTGTACAT TCCGCCGCAGGAGCCGCAGCCGCCAGGTGGCGGGCTGATGGGCAAG TTCGTGCTGAAGGCGTCGOTGAGCGCCCCTGGCAGCGAGTACGGCA GCCCGTCGGTCATCAGCGTCAGCAAAGGCAGCCCTGACGGCAGCCA CCCGCTGGTGGTGGCGCCCTACAACGGCGGGCCGCCGCGCACGTGC CCCAAGATCAACCAGGAGGCGGTCTCTTCGTGCACCCACTTGGGCG CTGGACCCCCTCTCAGCAATGGCCACCGGCCGGCTGCACACGACTT CCCCCTGGGGCGGCAGCTCCCCAGGAGGACTACCCCGACCCTGGGT CTTGAGGAAGTGCTGAGGAGCAGGGACTGTCACCCTGCCCTGCCGC TTCCTCCCGGCTTCCATCCCCACCCGGGGCCCAATTACCCATCCTT CCTGCCCGATCAGATGCAGCCGCAAGTCCCGCCGCTCCATTACCAA GAGCTCATGCCACCCGGTTCCTGCATGCCAGAGGAGCCCAAGCCAA AGAGGGGAAGACGATCGTGGCCCCGGAAAAGGACCGCCACCCACAC TTGTGATTACGCGGGCTGCGGCAAAACCTACACAAAGAGTTCCCAT CTCAAGGCACACCTGCGAACCCACACAGGTGAGAAACCTTACCACT GTGACTGGGACGGCTGTGGATGGAAATTCGCCCGCTCAGATGAACT GACCAGGCACTACCGTAAACACACGGGGCACCGCCCGTTCCAGTGC CAAAAATGCGACCGAGCATTTTCCAGGTCGGACCACCTCGCCTTAC ACATGAAGAGGCATTTTTAAATCCCAGACAGTGGATATGACCCACA CTGCCAGAAGAGAATTCAGTATTTTTTACTTTTCACACTGTCTTCC CGATCAGGGAAGGAGCCCAGCCAGAAAGCACTACAATCATGGTCAA GTTCCCAACTGAGTCATCTTGTGAGTGGATAATCAGGAAAAATGAG GAATCCAAAAGACAAAAATCAAAGAACAGATGGGGTCTGTGACTGG ATCTTCTATCATTCCAATTCTAAATCCGACTTGAATATTCCTGGAC TTACAAAATGCCAAGGGGGTGACTGGAAGTTGTGGATATCAGGGTA TAAATTATATCCGTGAGTTGGGGGAGGGAAGACCAGAATTCCCT TGAATTGTGTATTGATGCAATATAAGCATAAAAGATCACCTTGTAT TCTCTTTACCTTCTAAAAGCCATTATTATGATGTTAGAAGAAGAGG AAGAAATTCAGGTACAGAAAACATGTTTAAATAGCCTAAATGATGG TGCTTGGTGAGTCTTGGTTCTAAAGGTACCAAACAAGGAAGCCAAA GTTTTCAAACTGCTGCATACTTTGACAAGGAAAATCTATATTTGTC TTCCGATCAACATTTATGACCTAAGTCAGGTAATATACCTGGTTTA CTTCTTTAGCATTTTTATGCAGACAGTCTGTTATGCACTGTGGTTT CAGATGTGCAATAATTTGTACAATGGTTTATTCCCAAGTATGCCTT AAGCAGAACAAATGTGTTTTTCTATATAGTTCCTTGCCTTAATAAA TATGTAATATAAATTTAAGCAAACGTCTATTTTGTATATTTGTAAA CTACAAAGTAAAATGAACATTTTGTGGAGTTTGTATTTTGCATACT CAAGGTGAGAATTAAGTTTTAAATAAACCTATAATATTTTATCTGA AAAAAAAAAAAAAAAAA 6 Human Klf4 amino MRQPPGESDMAVSDALLPSFSTFASGPAGREKTLRQAGAPNNRWRE acid sequence ELSHMKRLPPVLPGRPYDLAAATVATDLESGGAGAACGGSNLAPLP (NP_004226.3) RRETEEFNDLLDLDFILSNSLTHPPESVAATVSSSASASSSSSPSS SGPASAPSTCSFTYPIRAGNDPGVAPGGTGGGLLYGRESAPPPTAP FNLADINDVSPSGGFVAELLRPELDPVYIPPQQPQPPGGGLMGKFV LKASLSAPGSEYGSPSVISVSKGSPDGSHPVVVAPYNGGPPRTCPK IKQEAVSSCTHLGAGPPLSNGHRPAAHDFPLGRQLPSRTTPTLGLE EVLSSRDCHPALPLPPGFHPHPGPNYPSFLPDQMQPQVPPLHYQEL MPPGSCMPEEPKPKRGRRSWPRKRTATHTCDYAGCGKTYTKSSHLK AHLRTHTGEKPYHCDWDGCGWKFARSDELTRHYRKHTGHRPFQCQK CDRAFSRSDHLALHMKRHFE 7 Human c-Myc gene GACCCCCGAGCTGTGCTGCTCGCGGCCGCCACCGCCGGGCCCCGGC sequence (NCBI Ref CGTCCCTGGCTCCCCTCCTGCCTCGAGAAGGGCAGGGCTTCTCAGA NM_002467) GGCTTGGCGGGAAAAAGAACGGAGGGAGGGATCGCGCTGAGTATAA AAGCCGGTTTTCGGGGCTTTATCTAACTCGCTGTAGTAATTCCAGC GAGAGGCAGAGGGAGCGAGCGGGCGGCCGGCTAGGGTGGAAGAGCC GGGCGAGCAGAGCTGCGCTGCGGGCGTCCTGGGAAGGGAGATCCGG AGCGAATAGGGGGCTTCGCCTCTGGCCCAGCCCTCCCGCTGATCCC CCAGCCAGCGGTCCGCAACCCTTGCCGCATCCACGAAACTTTGCCC ATAGCAGCGGGCGGGCACTTTGCACTGGAACTTACAACACCCGAGC AAGGACGCGACTCTCCCGACGCGGGGAGGCTATTCTGCCCATTTGG GGACACTTCCCCGCCGCTGCCAGGACCCGCTTCTCTGAAAGGCTCT CCTTGCAGCTGCTTAGACGCTGGATTTTTTTCGGGTAGTGGAAAAC CAGCAGCCTCCCGCGACGATGCCCCTCAACGTTAGCTTCACCAACA GGAACTATGACCTCGACTACGACTCGGTGCAGCCGTATTTCTACTG CGACGAGGAGGAGAACTTCTACCAGCAGCAGCAGCAGAGCGAGCTG CAGCCCCCGGCGCCCAGCGAGGATATCTGGAAGAAATTCGAGCTGC TGCCCACCCCGCCCCTGTCCCCTAGCCGCCGCTCCGGGCTCTGCTC GCCCTCCTACGTTGCGGTCACACCOTTCTCCCTTCGGGGAGACAAC GACGGCGGTGGCGGGAGCTTCTCCACGGCCGACCAGCTGGAGATGG TGACCGAGCTGCTGGGAGGAGACATGGTGAACCAGAGTTTCATCTG CGACCCGGACGACGAGACCTTCATCAAAAACATCATCATCCAGGAC TGTATGTGGAGCGGCTTCTCGGCCGCCGCCAAGCTCGTCTCAGAGA AGCTGGCCTCCTACCAGGCTGCGCGCAAAGACAGCGGCAGCCCGAA CCCCGCCCGCGGCCACAGCGTCTGCTCCACCTCCAGCTTGTACCTG CAGGATCTGAGCGCCGCCGCCTCAGAGTGCATCGACCCCTCGGTGG TCTTCCCCTACCCICTCAACGACAGCAGCTCGCCCAAGTCCTGCGC CTCGCAAGACTCCAGCGCCTTCTCTCCGTCCTCGGATTCTCTGCTC TCCTCGACGGAGTCCTCCCCGCAGGGCAGCCCCGAGCCCCTGGTGC TCCATGAGGAGACACCGCCCACCACCAGCAGCGACTCTGAGGAGGA ACAAGAAGATGAGGAAGAAATCGATGTTGTTTCTGTGGAAAAGAGG CAGGCTCCTGGCAAAAGGTCAGAGTCTGGATCACCTTCTGCTGGAG GCCACAGCAAACCTCCTCACAGCCCACTGGTCCTCAAGAGGTGCCA CGTCTCCACACATCAGCACAACTACCCACCGCCTCCCTCCACTCGC AAGGACTATCCTGCTGCCAAGAGGGTCAAGTTGGACAGTGTCAGAG TCCTGAGACAGATCAGCAACAACCGAAAATGCACCAGCCCCAGGTC CTCGGACACCGAGGAGAATGTCAAGAGGCGAACACACAACGTCTTG GAGCGCCAGAGGAGGAACGAGCTAAAACGGAGCTTTTTTGCCCTGC GTGACCAGATCCCGGAGTTGGAAAACAATGAAAAGGCCCCCAAGGT AGTTATCCTTAAAAAAGCCACAGCATACATCCTGTCCGTCCAAGCA GAGGAGCAAAACCTCATTTCTGAACAGGACTTGTTGCGGAAACGAC GAGAACAGTTGAAACACAAACTTGAACAGCTACGGAACTCTTGTGC GTAAGGAAAAGTAAGGAAAACGATTCCTTCTAACAGAAATGTCCTG AGCAATCACCTATGAACTTGTTTCAAATGCATGATCAAATGCAACC TCACAACCTTGGCTGACTCTTGAGACTGAAAGATTTAGCCATAATG TAAACTGCCTCAAATTGGACTTTGGGCATAAAAGAACTTTTTTATG CTTACCATCTTTTTTTTTTCTTTAACAGATTTGTATTTAAGAATTG TTTTTAAAAAATTTTAAGATTTACACAATGTTTCTCTGTAAATATT GCCATTAAATGTAAATAACTITAATAAAACGTTTATACCAGTTACA CAGAATTTCAATCCTAGTATATAGTACCTAGTATTATAGCTACTAT AAACCCTAATTTTTTTTATTTAAGTACATTTTGCTTTTTAAAGTTG ATTTTTTTCTATTGTTTTTAGAAAAAATAAAATAACTGGCAAATAT ATCATTGAGCCAAATCTTAAAAAAAAAAAAAAA 8 Human c-Myc amino MDFFRVVENQQPPATMPLNVSFTNRNYCLDYDSVQPYFYCDEEENF acid sequence (NCBI YQQQQQSELQPPAPSEDIWKKFELLPTPPLSPSRRSGLCSPSYVAV ref NP_002458.2) TPFSLRGDNDGGGGSFSTADQLEMVTELLGGDMVNQSFICDPDDET FIKNIIIQDCMWSGFSAAAKLVSEKLASYQAARKDSGSPNPARGHS VCSTSSLYLQDLSAAASECIDPSVVFPYPLNDSSSPKSCASQDSSA FSPSSDSLLSSTESSPQGSPEPLVLHEETPPTTSSDSEEEQEDEEE IDVVSVEKRQAPCKRSESGSPSAGGHSKPPHSPLVLKRCHVSTHQH NYAAPPSTRKDYPAAKRVKLCSVRVLRQISNNRKCTSPRSSDTEEN VKRRTHNVLERQRRNELKRSFFALRDQIPELENNEKAPKVVILKKA TAYILSVQAEEQKLISEEDLLRKRREQLKHKLEQLRNSCA
9 Human Nanog gene ATTATAAATCTACAGACTCCAGGATTTTAACGTTCTGCTGGACTGA sequence GCTGGTTGCCTCATGTTATTATGCAGGCAACTCACTTTATCCCAAT (NM_024865.2) TTCTTGATACTTTTCCTTCTGGAGGTCCTATTTCTCTAACATCTTC CAGAAAAGTCTTAAAGCTGCCTTAACCTTTTTTCCAGTCCACCTCT TAAATTTTTTCCTCCTCTTCCTCTATACTAACATGAGTGTGGATCC AGCTTGTCCCCAAAGCTTGCCTTGCTTTGAAGCATCCGACTGTAAA GAATCTTCACCTATGCCTGTGATTTGTGGGCCTGAAGAAAACTATC CATCCTTGCAAATGTCTTCTGCTGAGATGCCTCACACGGAGACTGT CTCTCCTCTTCCTTCCTCCATGGATCTGCTTATTCAGGACAGCCCT GATTCTTCCACCAGTCCCAAAGGCAAACAACCCACTTCTGCAGAGA AGAGTGTCGCAAAAAAGGAAGACAAGGTCCCGGTCAAGAAACAGAA GACCAGAACTGTGTTCTCTTCCACCCAGCTGTGTGTACTCAATGAT AGATTTCAGAGACAGAAATACCTCAGCCTCCAGCAGATGCAAGAAC TCTCCAACATCCTGAACCTCAGCTACAAACAGGTGAAGACCTGGTT CCAGAACCAGAGAATGAAATCTAAGAGGTGGCAGAAAAACAACTGG CCGAAGAATAGCAATGGTGTGACGCAGAAGGCCTCAGCACCTACCT ACCCCAGCCTTTACTCTTCCTACCACCAGGGATGCCTGGTGAACCC GACTGGGAACCTTCCAATGTGGAGCAACCAGACCTGGAACAATTCA ACCTGGAGCAACCAGACCCAGAACATCCAGTCCTGGAGCAACCACT CCTGGAACACTCAGACCTGGTGCACCCAATCCTGGAACAATCAGGC CTGGAACAGTCCCTTCTATAACTGTGGAGAGGAATCTCTGCAGTCC TGCATGCAGTTCCAGCCAAATTCTCCTGCCAGTGACTTGGAGGCTG CCTTGGAAGCTGCTGGGGAAGGCCTTAATGTAATACAGCAGACCAC TAGGTATTTTAGTACTCCACAAACCATGGATTTATTCCTAAACTAC TCCATGAACATGCAACCTGAAGACGTGTGAAGATGAGTGAAACTGA TATTACTCAATTTCAGTCTGGACACTGGCTGAATCCTTCCTCTCCC CTCCTCCCATCCCTCATAGGATTTTTCTTGTTTGGAAACCACGTGT TCTGGTTTCCATGATGCCCATCCAGTCAATCTCATGGAGGGTGGAG TATGGTTGGAGCCTAATCAGCGAGGTTTCTTTTTTTTTTTTTTTCC TATTGGATCTTCCTGGAGAAAATACTTTTTTTTTTTTTTTTTTTGA AACGGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCGGT CTTGGCTCACTGCAAGCTCCGTCTCCCGGGTTCACGCCATTCTCCT GCCTCAGCCTCCCGAGCAGCTGGGACTACAGGCGCCCGCCACCTCG CCCGGCTAATATTTTGTATTTTTAGTAGAGACGGGGTTTCACTGTG TTAGCCAGGATGGTCTCGATCTCCTGACCTTGTGATCCACCCGCCT CGGCCTCCCTAACAGCTGGGATTTACAGGCGTGAGCCACCGCGCCC TGCCTAGAAAAGACATTTTAATAACCTTGGCTGCCGTCTCTGGCTA TAGATAAGTAGATCTAATACTAGTTTGGATATCTTTAGGGTTTAGA ATCTAACCTCAAGAATAAGAAATACAAGTACAAATTGGTGATGAAG ATGTATTCGTATTGTTTGGGATTGGCAGGCTTTGCTTATTTTTTAA AAACTATTGAGGTAAAGGGTTAAGCTGTAACATACTTAATTGATTT CTTACCGTTTTTGGCTCTGTTTTGCTATATCCCCTAATTTGTTGGT TGTGCTAATCTTTGTAGAAAGAGGTCTCGTATTTGCTGCATCGTAA TGACATGAGTACTGCTTTAGTTGGTTTAAGTTCAAATGAATGAAAC AACTATTTTTCCTTTAGTTGATTTTACCCTGATTTCACCGAGTGTT TCAATGAGTAAATATACAGCTTAAACAT 10 Human Nanog amino MSVDPACPQSLPCFEASDCKESSPMPVICGPEENYPSLQMSSAEMP acid sequence HTETVSPLPSSMDLLIQDSPDSSTSPKGKQPTSAEKSVAKKEDKVP (NP_079141) VKKQKTRTVFSSTQLCVLNDRFQRQKYLSLQQMQELSNILNLSYKQ VKTWFQNQRMKSKRWQKNNWPKNSNGVTQKASAPTYPSLYSSYHQG CLVNPTGNLPMWSNQTWNNSTWSNQTQNIQSWSNHSWNTQTWCTQS WNNQAWNSPFYNCGEESLQSCMQFQPNSPASDLEAALEAAGEGLNV IQQTTRYFSTPQTMDLFLNYSMNMQPEDV 11 Human Lin28 gene GTGCGGGGGAAGATGTAGCAGCTTCTTCTCCGAACCAACCCTTTGC sequence CTTCGGACTTCTCCGGGGCCAGCAGCCGCCCCACCAGGGGCCCGGG (NM_024674) GCCACGGGCTCAGCCGACGACCATGGGCTCCGTGTCCAACCAGCAG TTTGCAGGTGGCTGCGCCAAGGCGGCAGAAGAGGCGCCCGAGGAGG CGCCGGAGGACGCGGCCCGGGCGGCGGACGAGCCTCAGCTGCTGCA CGGTGCGGGCATCTGTAAGTGGTTCAACGTGCGCATGGGGTTCGGC TTCCTGTCCATGACCGCCCGCGCCGGGGTCGCGCTCGACCCCCCAG TGGATGTCTTTGTGCACCAGAGTAAGCTGCACATGGAAGGGTTCCG GAGCTTGAAGGAGGGTGAGGCAGTGGAGTTCACCTTTAAGAAGTCA GCCAAGGGTCTGGAATCCATCCGTGTCACCGGACCTGGTGGAGTAT TCTGTATTGGGAGTGAGAGGCGGCCAAAAGGAAAGAGCATGCAGAA GCGCAGATCAAAAGGAGACAGGTGCTACAACTGTGGAGGTCTAGAT CATCATGCCAAGGAATGCAAGCTGCCACCCCACCCCAAGAAGTGCC ACTTCTGCCAGAGCATCAGCCATATGGTAGCCTCATGTCCGCTGAA GGCCCAGCAGGGCCCTAGTGCACAGGGAAAGCCAACCTACTTTCGA GAGGAAGAAGAAGAAATCCACAGCCCTACCCTGCTCCCGGAGGCAC AGAATTGAGCCACAATGGGTGGGGGCTATTCTTTTGCTATCAGGAA GTTTTGAGGAGCAGGCAGAGTGGAGAAAGTGGGAATAGGCTGCATT GGGGCTAGTTGGCACTGCCATGTATCTCAGGCTTGGGTTCACACCA TCACCCTTTCTTCCCTCTAGGTGGGGGGAAAGGGTGAGTCAAAGGA ACTCCAACCATGCTCTGTCCAAATGCAAGTGAGGGTTCTGGGGGCA ACCAGGAGGGGGGAATCACCCTACAACCTGCATACTTTGAGTCTCC ATCCCCAGAATTTCCAGCTTTTGAAAGTGGCCTGGATAGGGAAGTT GTTTTCCTTTTAAAGAAGGATATATAATAATTCCCATGCCAGAGTG AAATGATTAAGTATAAGACCAGATTCATGGAGCCAAGCCACTACAT TCTGTGGAAGGAGATCTCTCAGGAGTAAGCATTGTTTTTTTTTCAC ATCTTGTATCCTCATACCCACTTTTGGGATAGGGTGCTGGCAGCTG TCCCAAGCAATGGGTAATGATGATGGCAAAAAGGGTGTTTGGGGGA ACAGCTGCAGACCTGCTGCTCTATGCTCACCCCCGCCCCATTCTGG GCCAATGTGATTTTATTTATTTGCTCCOTTGGATACTGCACCTTGG GTCCCACTTTCTCCAGGATGCCAACTGCACTAGCTGTGTGCGAATG ACGTATCTTGTGCATTTTAACTTTTTTTCCTTAATATAAATATTCT GGTTTTGTATTTTTGTATATTTTAATCTAAGGCCCTCATTTCCTGC ACTGTGTTCTCAGGTACATGAGCAATCTCAGGGATAGCCAGCAGCA GCTCCAGGTCTGCGCAGCAGGAATTACTTTTTGTTGTTTTTGCCAC CGTGGAGAGCAACTATTTGGAGTGCACAGCCTATTGAACTACCTCA TTTTTGCCAATAAGAGCTGGCTTTTCTGCCATAGTGTCCTCTTGAA ACCCCCTCTGCCTTGAAAATGTTTTATGGGAGACTAGGTTTTAACT GGGTGGCCCCATGACTTGATTGCCTTCTACTGGAAGATTGGGAATT AGTCTAAACAGGAAATGGTGGTACACAGAGGCTAGGAGAGGCTGGG CCCGGTGAAAAGGCCAGAGAGCAAGCCAAGATTAGGTGAGGGTTGT CTAATCCTATGGCACAGGACGTGCTTTACATCTCCAGATCTGTTCT TCACCAGATTAGGTTAGGCCTACCATGTGCCACAGGGTGTGTGTGT GTTTGTAAAACTAGAGTTGCTAAGGATAAGTTTAAAGACCAATACC CCTGTACTTAATCCTGTGCTGTCGAGGGATGGATATATGAAGTAAG GTGAGATCCTTAACCTTTCAAAATTTTCGGGTTCCAGGGAGACACA CAAGCGAGGGTTTTGTGGTGCCTGGAGCCTGTGTCCTGCCCTGCTA CAGTAGTGATTAATAGTGTCATGGTAGCTAAAGGAGAAAAAGGGGG TTTCGTTTACACGCTGTGAGATCACCGCAAACCTACCTTACTGTGT TGAAACGGGACAAATGCAATAGAACGCATTGGGTGGTGTGTGTCTG ATCCTGGGTTCTTGTCTCCCCTAAATGCTGCCCCCCAAGTTACTGT ATTTGTCTGGGCTTTGTAGGACTTCACTACGTTGATTGCTAGGTGG CCTAGTTTGTGTAAATATAATGTATTGGTCTTTCTCCGTGTTCTTT GGGGGTTTTGTTTACAAACTTCTTTTTGTATTGAGAGAAAAATAGC CAAAGCATCTTTGACAGAAGGTTCTGCACCAGGCAAAAAGATCTGA AACATTAGTTTGGGGGGCCCTCTTCTTAAAGTGGGGATCTTGAACC ATCCTTTCTTTTGTATTCCCCTTCCCCTATTACCTATTACACCAGA TCTTCTGTCCTAAAAACTTGTCTTCTACCCTGCCCTCTTTTCTGTT CACCCCCAAAAGAAAACTTACACACCCACACACATACACATTTCAT GCTTGGAGTGTCTCCACAACTCTTAAATGATGTATGCAAAAATACT GAAGCTAGGAAAACCCTCCATCCCTTGTTCCCAACCTCCTAAGTCA AGACCATTACCATTTCTTTCTTTCTTTTTTTTTTTTTTTTAAAATG GAGTCTCACTGTGTCACCCAGGCTGGAGTGCAGTGGCATGATCGGC TCACTGCAGCCTCTGCCTCTTGGGTTCAAGTGATTCTCCTGCCTCA GCCTCCTGAGTAGCTGGGATTTCAGGCACCCGCCACACTCAGCTAA TTTTTGTATTTTTAGTAGAGACGCGGTTTCACCATGTTGTCCAGGC TGGTCTGGAACTCCTGACCTCAGGTGATCTGCCCACCTTGGCTTCC CAAAGTGCTGGCATTACAGGCATGAGCCACCATGCTGGGCCAACCA TTTCTTGGTGTATTCATGCCAAACACTTAAGACACTGCTGTAGCCC AGGCGCGGTGGCTCACACCTGTAATCCCAGCACTTTGGAAGGCTGA GGCGGGCGGATCACAAGGTCACGAGTTCAAAACTATCCTGGCCAAC ACAGTGAAACCCCGTCTCTACTAAAATACAAAAAAATTAGCCGGGT GTGGTGGTGCATGCCTTTAGTCCTAGCTATTCAGGAGGCTGAGGCA GGGGAATCGCTTGAACCCGAGAGGCAGAGGTTGCAGTGAGCTGAGA TCGCACCACTGCACTCCAGCCTGGTTACAGAGCAAGACTCTGTCTC AAACAAAACAAAACAAAACAAAAACACACTACTGTATTTTGGATGG ATCAAACCTCCTTAATTTTAATTTCTAATCCTAAAGTAAAGAGATG CAATTGGGGGCCTTCCATGTAGAAAGTGGGGTCAGGAGGCCAAGAA AGGGAATATGAATGTATATCCAAGTCACTCAGGAACTTTTATGCAG GTGCTAGAAACTTTATGTCAAAGTGGCCACAAGATTGTTTAATAGG AGACGAACGAATGTAACTCCATGTTTACTGCTAAAAACCAAAGCTT TGTGTAAAATCTTGAATTTATGGGGCGGGAGGGTAGGAAAGCCTGT ACCTGTCTGTTTTTTTCCTGATCCTTTTCCCTCATTCCTGAACTGC AGGAGACTGAGCCCCTTTGGGCTTTGGTGACCCCATCACTGGGGTG TGTTTATTTGATGGTTGATTTTGCTGTACTGGGTACTTCCTTTCCC ATTTTCTAATCATTTTTTAACACAAGCTGACTCTTCCCTTCCCTTC TCCTTTCCCTGGGAAAATACAATGAATAAATAAAGACTTATTGGTA CGCAAACTGTCA 12 Human Lin28 amino MGSVSNQQFAGGCAKAAEEAPEEAPEDAARAADEPQLLHGAGICKW acid sequence FNVRMGFGFLSMTARAGVALDPPVDVFVHQSKLHMEGFRSLKEGEA (NP_078950) VEFTFKKSAKGLESIRVTGPGGVFCIGSERRPKGKSMQKRRSKGDR CYNCGGLDHHAKECKLPPQPKKCHFCQSISHMVASCPLKAQQGPSA QGKPTYFREEEEEIHSPTLLPEAQN
REFERENCES
[0210] .sup.1 Takahashi, K. & Yamanaka, S., Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 126 (4), 663-676 (2006).
[0211] .sup.2 Takahashi, K. et al., Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell 131 (5), 861-872 (2007).
[0212] .sup.3 Yu, J. et al., Induced pluripotent stem cell lines derived from human somatic cells. Science 318 (5858), 1917-1920 (2007).
[0213] .sup.4 Marion, R. M. et al., A p53-mediated DNA damage response limits reprogramming to ensure iPS cell genomic integrity. Nature 460 (7259), 1149-1153 (2009).
[0214] .sup.5 Li, H. et al., The Ink4/Arf locus is a barrier for iPS cell reprogramming. Nature 460 (7259), 1136-1139 (2009).
[0215] .sup.6 Banito, A. et al., Senescence impairs successful reprogramming to pluripotent stem cells. Genes Der 23 (18), 2134-2139 (2009).
[0216] .sup.7 Utikal, J. et al., Immortalization eliminates a roadblock during cellular reprogramming into iPS cells. Nature 460 (7259), 1145-1148 (2009).
[0217] .sup.8 Kawamura, T. et al., Linking the p53 tumour suppressor pathway to somatic cell reprogramming. Nature 460 (7259), 1140-1144 (2009).
[0218] .sup.9 Campisi, J. & d'Adda di Fagagna, F., Cellular senescence: when bad things happen to good cells. Nat Rev Mol Cell Biol 8 (9), 729-740 (2007).
[0219] .sup.10 Collado, M., Blasco, M. A., & Serrano, M., Cellular senescence in cancer and aging. Cell 130 (2), 223-233 (2007).
[0220] .sup.11 Zhang, R., Chen, W., & Adams, P. D., Molecular dissection of formation of senescence-associated heterochromatin foci. Mol Cell Biol 27 (6), 2343-2358 (2007).
[0221] .sup.12 Liao, J. et al., Enhanced efficiency of generating induced pluripotent stem (iPS) cells from human somatic cells by a combination of six transcription factors. Cell Res 18 (5), 600-603 (2008).
[0222] .sup.13 Hanna, J. et al., Direct cell reprogramming is a stochastic process amenable to acceleration. Nature 462 (7273), 595-601 (2009).
[0223] .sup.14 Yu, J. et al., Human Induced Pluripotent Stem Cells Free of Vector and Transgene Sequences. Science (2009).
[0224] .sup.15 Krishnamurthy, J. et al., Ink4a/Arf expression is a biomarker of aging. J Clin Invest 114 (9), 1299-1307 (2004).
[0225] .sup.16 Zhang, W. et al., Comparison of global DNA methylation profiles in replicative versus premature senescence. Life Sci 83 (13-14), 475-480 (2008).
[0226] .sup.17 Marion, R. M. et al., Telomeres acquire embryonic stem cell characteristics in induced pluripotent stem cells. Cell Stein Cell 4 (2), 141-154 (2009).
[0227] .sup.18 Assou, S. et al., A meta-analysis of human embryonic stem cells transcriptome integrated into a web-based expression atlas. Stein Cells 25 (4), 961-973 (2007).
[0228] .sup.19 Guenther, M. G. et al., Chromatin structure and gene expression programs of human embryonic and induced pluripotent stem cells. Cell Stem Cell 7 (2), 249-257.
[0229] .sup.20 Passos, J. F. et al., Mitochondrial dysfunction accounts for the stochastic heterogeneity in telomere-dependent senescence. PLoS Biol 5 (5), el 10 (2007).
[0230] .sup.21 Moiseeva, O., Bourdeau, V., Roux, A., Deschenes-Simard, X., & Ferbeyre, G., Mitochondrial dysfunction contributes to oncogene-induced senescence. Mol Cell Biol 29 (16), 4495-4507 (2009).
[0231] .sup.22 Prigione, A., Fauler, B., Lurz, R., Lehrach, H., & Adjaye, J., The senescence-related mitochondrial/oxidative stress pathway is repressed in human induced pluripotent stem cells. Stem Cells 28 (4), 721-733 (2010).
[0232] .sup.23 Park, I. H. et al., Reprogramming of human somatic cells to pluripotency with defined factors. Nature 451 (7175), 141-146 (2008).
[0233] .sup.24 Assou, S. el al., A gene expression signature shared by human mature oocytes and embryonic stem cells. BMC Genomics 10, 10 (2009).
[0234] .sup.25 Matsuura, F. et al., Senescent phenotypes of skin fibroblasts from patients with Tangier disease. Biochemical and biophysical research communications 357 (2), 493-498 (2007).
[0235] .sup.26 Ludwig, T. E. et al., Feeder-independent culture of human embryonic stem cells. Nat Methods 3 (8), 637-646 (2006).
[0236] .sup.27 Freberg, C. T., Dahl, J. A., Timoskainen, S., & Collas, P., Epigenetic Reprogramming of OCT4 and NANOG regulatory regions by embryonal carcinoma cell extract. Molecular biology of the cell 18 (5), 1543-1553 (2007).
Sequence CWU
1
1
1211411DNAhomo sapiens 1ccttcgcaag ccctcatttc accaggcccc cggcttgggg
cgccttcctt ccccatggcg 60ggacacctgg cttcggattt cgccttctcg ccccctccag
gtggtggagg tgatgggcca 120ggggggccgg agccgggctg ggttgatcct cggacctggc
taagcttcca aggccctcct 180ggagggccag gaatcgggcc gggggttggg ccaggctctg
aggtgtgggg gattccccca 240tgccccccgc cgtatgagtt ctgtgggggg atggcgtact
gtgggcccca ggttggagtg 300gggctagtgc cccaaggcgg cttggagacc tctcagcctg
agggcgaagc aggagtcggg 360gtggagagca actccgatgg ggcctccccg gagccctgca
ccgtcacccc tggtgccgtg 420aagctggaga aggagaagct ggagcaaaac ccggaggagt
cccaggacat caaagctctg 480cagaaagaac tcgagcaatt tgccaagctc ctgaagcaga
agaggatcac cctgggatat 540acacaggccg atgtggggct caccctgggg gttctatttg
ggaaggtatt cagccaaacg 600accatctgcc gctttgaggc tctgcagctt agcttcaaga
acatgtgtaa gctgcggccc 660ttgctgcaga agtgggtgga ggaagctgac aacaatgaaa
atcttcagga gatatgcaaa 720gcagaaaccc tcgtgcaggc ccgaaagaga aagcgaacca
gtatcgagaa ccgagtgaga 780ggcaacctgg agaatttgtt cctgcagtgc ccgaaaccca
cactgcagca gatcagccac 840atcgcccagc agcttgggct cgagaaggat gtggtccgag
tgtggttctg taaccggcgc 900cagaagggca agcgatcaag cagcgactat gcacaacgag
aggattttga ggctgctggg 960tctcctttct cagggggacc agtgtccttt cctctggccc
cagggcccca ttttggtacc 1020ccaggctatg ggagccctca cttcactgca ctgtactcct
cggtcccttt ccctgagggg 1080gaagcctttc cccctgtctc cgtcaccact ctgggctctc
ccatgcattc aaactgaggt 1140gcctgccctt ctaggaatgg gggacagggg gaggggagga
gctagggaaa gaaaacctgg 1200agtttgtgcc agggtttttg ggattaagtt cttcattcac
taaggaagga attgggaaca 1260caaagggtgg gggcagggga gtttggggca actggttgga
gggaaggtga agttcaatga 1320tgctcttgat tttaatccca catcatgtat cacttttttc
ttaaataaag aagcctggga 1380cacagtagat agacacactt aaaaaaaaaa a
14112360PRTHomo sapiens 2Met Ala Gly His Leu Ala
Ser Asp Phe Ala Phe Ser Pro Pro Pro Gly 1 5
10 15 Gly Gly Gly Asp Gly Pro Gly Gly Pro Glu Pro
Gly Trp Val Asp Pro 20 25
30 Arg Thr Trp Leu Ser Phe Gln Gly Pro Pro Gly Gly Pro Gly Ile
Gly 35 40 45 Pro
Gly Val Gly Pro Gly Ser Glu Val Trp Gly Ile Pro Pro Cys Pro 50
55 60 Pro Pro Tyr Glu Phe Cys
Gly Gly Met Ala Tyr Cys Gly Pro Gln Val 65 70
75 80 Gly Val Gly Leu Val Pro Gln Gly Gly Leu Glu
Thr Ser Gln Pro Glu 85 90
95 Gly Glu Ala Gly Val Gly Val Glu Ser Asn Ser Asp Gly Ala Ser Pro
100 105 110 Glu Pro
Cys Thr Val Thr Pro Gly Ala Val Lys Leu Glu Lys Glu Lys 115
120 125 Leu Glu Gln Asn Pro Glu Glu
Ser Gln Asp Ile Lys Ala Leu Gln Lys 130 135
140 Glu Leu Glu Gln Phe Ala Lys Leu Leu Lys Gln Lys
Arg Ile Thr Leu 145 150 155
160 Gly Tyr Thr Gln Ala Asp Val Gly Leu Thr Leu Gly Val Leu Phe Gly
165 170 175 Lys Val Phe
Ser Gln Thr Thr Ile Cys Arg Phe Glu Ala Leu Gln Leu 180
185 190 Ser Phe Lys Asn Met Cys Lys Leu
Arg Pro Leu Leu Gln Lys Trp Val 195 200
205 Glu Glu Ala Asp Asn Asn Glu Asn Leu Gln Glu Ile Cys
Lys Ala Glu 210 215 220
Thr Leu Val Gln Ala Arg Lys Arg Lys Arg Thr Ser Ile Glu Asn Arg 225
230 235 240 Val Arg Gly Asn
Leu Glu Asn Leu Phe Leu Gln Cys Pro Lys Pro Thr 245
250 255 Leu Gln Gln Ile Ser His Ile Ala Gln
Gln Leu Gly Leu Glu Lys Asp 260 265
270 Val Val Arg Val Trp Phe Cys Asn Arg Arg Gln Lys Gly Lys
Arg Ser 275 280 285
Ser Ser Asp Tyr Ala Gln Arg Glu Asp Phe Glu Ala Ala Gly Ser Pro 290
295 300 Phe Ser Gly Gly Pro
Val Ser Phe Pro Leu Ala Pro Gly Pro His Phe 305 310
315 320 Gly Thr Pro Gly Tyr Gly Ser Pro His Phe
Thr Ala Leu Tyr Ser Ser 325 330
335 Val Pro Phe Pro Glu Gly Glu Ala Phe Pro Pro Val Ser Val Thr
Thr 340 345 350 Leu
Gly Ser Pro Met His Ser Asn 355 360 32457DNAHomo
sapiens 3ctattaactt gttcaaaaaa gtatcaggag ttgtcaaggc agagaagaga
gtgtttgcaa 60aaagggaaaa gtactttgct gcctctttaa gactagggct gggagaaaga
agaggagaga 120gaaagaaagg agagaagttt ggagcccgag gcttaagcct ttccaaaaac
taatcacaac 180aatcgcggcg gcccgaggag gagagcgcct gttttttcat cccaattgca
cttcgcccgt 240ctcgagctcc gcttcccccc aactattctc cgccagatct ccgcgcaggg
ccgtgcacgc 300cgaggccccc gcccgcggcc cctgcatccc ggcccccgag cgcggccccc
acagtcccgg 360ccgggccgag ggttggcggc cgccggcggg ccgcgcccgc ccagcgcccg
catgtataac 420atgatggaga cggagctgaa gccgccgggc ccgcagcaag cttcgggggg
cggcggcgga 480ggaggcaacg ccacggcggc ggcgaccggc ggcaaccaga agaacagccc
ggaccgcgtc 540aagaggccca tgaacgcctt catggtatgg tcccgggggc agcggcgtaa
gatggcccag 600gagaacccca agatgcacaa ctcggagatc agcaagcgcc tgggcgcgga
gtggaaactt 660ttgtccgaga ccgagaagcg gccgttcatc gacgaggcca agcggctgcg
cgctctgcac 720atgaaggagc acccggatta taaataccgg ccgcggcgga aaaccaagac
gctcatgaag 780aaggataagt acacgcttcc cggaggcttg ctggcccccg gcgggaacag
catggcgagc 840ggggttgggg tgggcgccgg cctgggtgcg ggcgtgaacc agcgcatgga
cagctacgcg 900cacatgaacg gctggagcaa cggcagctac agcatgatgc aggagcagct
gggctacccg 960cagcacccgg gcctcaacgc tcacggcgcg gcacagatgc aaccgatgca
ccgctacgac 1020gtcagcgccc tgcagtacaa ctccatgacc agctcgcaga cctacatgaa
cggctcgccc 1080acctacagca tgtcctactc gcagcagggc acccccggta tggcgctggg
ctccatgggc 1140tctgtggtca agtccgaggc cagctccagc ccccccgtgg ttacctcttc
ctcccactcc 1200agggcgccct gccaggccgg ggacctccgg gacatgatca gcatgtacct
ccccggcgcc 1260gaggtgccgg agcccgctgc gcccagtaga ctgcacatgg cccagcacta
ccagagcggc 1320ccggtgcccg gcacggccat taacggcaca ctgcccctgt cgcacatgtg
agggctggac 1380tgcgaactgg agaaggggag agattttcaa agagatacaa gggaattggg
aggggtgcaa 1440aaagaggaga gtaggaaaaa tctgataatg ctcaaaagga aaaaaaatct
ccgcagcgaa 1500acgacagctg cggaaaaaaa ccaccaatcc catccaaatt aacgcaaaaa
ccgtgatgcc 1560gactagaaaa cttttatgag agatcttggg acttcttttt gggggactat
ttttgtacag 1620agaaaacctg agggcggcgg ggagggcggg ggaatcggac catgtataga
tctggaggaa 1680aaaaactacg caaaactttt ttttaaagtt ctagtggtac gttaggcgct
tcgcagggag 1740ttcgcaaaag tctttaccag taatatttag agctagactc cgggcgatga
aaaaaaagtt 1800ttaatatttg caagcaactt ttgtacagta tttatcgaga taaacatggc
aatcaaatgt 1860ccattgttta taagctgaga atttgccaat atttttcgag gaaagggttc
ttgctgggtt 1920ttgattctgc agcttaaatt taggaccgtt acaaacaagg aaggagttta
ttcggatttg 1980aacattttag ttttaaaatt gtacaaaagg aaaacatgag agcaagtact
ggcaagaccg 2040ttttcgtggt cttgtttaag gcaaacgttc tagattgtac taaattttta
acttactgtt 2100aaaggcaaaa aaaaaatgtc catgcaggtt gatatcgttg gtaatttata
atagcttttg 2160ttcaatccta ccctttcatt ttgttcacat aaaaaatatg gaattactgt
gtttgaaata 2220ttttcttatg gtttgtaata tttctgtaaa ttgtgatatt ttaaggtttt
tccccccttt 2280tattttccgt agttgtattt taaaagattc ggctctgtta ttggaatcag
gctgccgaga 2340atccatgtat atatttgaac taataccatc cttataacag ctacattttc
aacttaagtt 2400tttactccat tatgcacagt ttgagataaa taaatttttg aaatatggac
actgaaa 24574317PRTHomo sapiens 4Met Tyr Asn Met Met Glu Thr Glu Leu
Lys Pro Pro Gly Pro Gln Gln 1 5 10
15 Thr Ser Gly Gly Gly Gly Gly Asn Ser Thr Ala Ala Ala Ala
Gly Gly 20 25 30
Asn Gln Lys Asn Ser Pro Asp Arg Val Lys Arg Pro Met Asn Ala Phe
35 40 45 Met Val Trp Ser
Arg Gly Gln Arg Arg Lys Met Ala Gln Glu Asn Pro 50
55 60 Lys Met His Asn Ser Glu Ile Ser
Lys Arg Leu Gly Ala Glu Trp Lys 65 70
75 80 Leu Leu Ser Glu Thr Glu Lys Arg Pro Phe Ile Asp
Glu Ala Lys Arg 85 90
95 Leu Arg Ala Leu His Met Lys Glu His Pro Asp Tyr Lys Tyr Arg Pro
100 105 110 Arg Arg Lys
Thr Lys Thr Leu Met Lys Lys Asp Lys Tyr Thr Leu Pro 115
120 125 Gly Gly Leu Leu Ala Pro Gly Gly
Asn Ser Met Ala Ser Gly Val Gly 130 135
140 Val Gly Ala Gly Leu Gly Ala Gly Val Asn Gln Arg Met
Asp Ser Tyr 145 150 155
160 Ala His Met Asn Gly Trp Ser Asn Gly Ser Tyr Ser Met Met Gln Asp
165 170 175 Gln Leu Gly Tyr
Pro Gln His Pro Gly Leu Asn Ala His Gly Ala Ala 180
185 190 Gln Met Gln Pro Met His Arg Tyr Asp
Val Ser Ala Leu Gln Tyr Asn 195 200
205 Ser Met Thr Ser Ser Gln Thr Tyr Met Asn Gly Ser Pro Thr
Tyr Ser 210 215 220
Met Ser Tyr Ser Gln Gln Gly Thr Pro Gly Met Ala Leu Gly Ser Met 225
230 235 240 Gly Ser Val Val Lys
Ser Glu Ala Ser Ser Ser Pro Pro Val Val Thr 245
250 255 Ser Ser Ser His Ser Arg Ala Pro Cys Gln
Ala Gly Asp Leu Arg Asp 260 265
270 Met Ile Ser Met Tyr Leu Pro Gly Ala Glu Val Pro Glu Pro Ala
Ala 275 280 285 Pro
Ser Arg Leu His Met Ser Gln His Tyr Gln Ser Gly Pro Val Pro 290
295 300 Gly Thr Ala Ile Asn Gly
Thr Leu Pro Leu Ser His Met 305 310 315
52949DNAHomo sapiens 5agtttcccga ccagagagaa cgaacgtgtc tgcgggcgcg
cggggagcag aggcggtggc 60gggcggcggc ggcaccggga gccgccgagt gaccctcccc
cgcccctctg gccccccacc 120ctcccacccg cccgtggccc gcgcccatgg ccgcgcgcgc
tccacacaac tcaccggagt 180ccgcgccttg cgccgccgac cagttcgcag ctccgcgcca
cggcagccag tctcacctgg 240cggcaccgcc cgcccaccgc cccggccaca gcccctgcgc
ccacggcagc actcgaggcg 300accgcgacag tggtggggga cgctgctgag tggaagagag
cgcagcccgg ccaccggacc 360tacttactcg ccttgctgat tgtctatttt tgcgtttaca
acttttctaa gaacttttgt 420atacaaagga actttttaaa aaagacgctt ccaagttata
tttaatccaa agaagaagga 480tctcggccaa tttggggttt tgggttttgg cttcgtttct
tctcttcgtt gactttgggg 540ttcaggtgcc ccagctgctt cgggctgccg aggaccttct
gggcccccac attaatgagg 600cagccacctg gcgagtctga catggctgtc agcgacgcgc
tgctcccatc tttctccacg 660ttcgcgtctg gcccggcggg aagggagaag acactgcgtc
aagcaggtgc cccgaataac 720cgctggcggg aggagctctc ccacatgaag cgacttcccc
cagtgcttcc cggccgcccc 780tatgacctgg cggcggcgac cgtggccaca gacctggaga
gcggcggagc cggtgcggct 840tgcggcggta gcaacctggc gcccctacct cggagagaga
ccgaggagtt caacgatctc 900ctggacctgg actttattct ctccaattcg ctgacccatc
ctccggagtc agtggccgcc 960accgtgtcct cgtcagcgtc agcctcctct tcgtcgtcgc
cgtcgagcag cggccctgcc 1020agcgcgccct ccacctgcag cttcacctat ccgatccggg
ccgggaacga cccgggcgtg 1080gcgccgggcg gcacgggcgg aggcctcctc tatggcaggg
agtccgctcc ccctccgacg 1140gctcccttca acctggcgga catcaacgac gtgagcccct
cgggcggctt cgtggccgag 1200ctcctgcggc cagaattgga cccggtgtac attccgccgc
agcagccgca gccgccaggt 1260ggcgggctga tgggcaagtt cgtgctgaag gcgtcgctga
gcgcccctgg cagcgagtac 1320ggcagcccgt cggtcatcag cgtcagcaaa ggcagccctg
acggcagcca cccggtggtg 1380gtggcgccct acaacggcgg gccgccgcgc acgtgcccca
agatcaagca ggaggcggtc 1440tcttcgtgca cccacttggg cgctggaccc cctctcagca
atggccaccg gccggctgca 1500cacgacttcc ccctggggcg gcagctcccc agcaggacta
ccccgaccct gggtcttgag 1560gaagtgctga gcagcaggga ctgtcaccct gccctgccgc
ttcctcccgg cttccatccc 1620cacccggggc ccaattaccc atccttcctg cccgatcaga
tgcagccgca agtcccgccg 1680ctccattacc aagagctcat gccacccggt tcctgcatgc
cagaggagcc caagccaaag 1740aggggaagac gatcgtggcc ccggaaaagg accgccaccc
acacttgtga ttacgcgggc 1800tgcggcaaaa cctacacaaa gagttcccat ctcaaggcac
acctgcgaac ccacacaggt 1860gagaaacctt accactgtga ctgggacggc tgtggatgga
aattcgcccg ctcagatgaa 1920ctgaccaggc actaccgtaa acacacgggg caccgcccgt
tccagtgcca aaaatgcgac 1980cgagcatttt ccaggtcgga ccacctcgcc ttacacatga
agaggcattt ttaaatccca 2040gacagtggat atgacccaca ctgccagaag agaattcagt
attttttact tttcacactg 2100tcttcccgat gagggaagga gcccagccag aaagcactac
aatcatggtc aagttcccaa 2160ctgagtcatc ttgtgagtgg ataatcagga aaaatgagga
atccaaaaga caaaaatcaa 2220agaacagatg gggtctgtga ctggatcttc tatcattcca
attctaaatc cgacttgaat 2280attcctggac ttacaaaatg ccaagggggt gactggaagt
tgtggatatc agggtataaa 2340ttatatccgt gagttggggg agggaagacc agaattccct
tgaattgtgt attgatgcaa 2400tataagcata aaagatcacc ttgtattctc tttaccttct
aaaagccatt attatgatgt 2460tagaagaaga ggaagaaatt caggtacaga aaacatgttt
aaatagccta aatgatggtg 2520cttggtgagt cttggttcta aaggtaccaa acaaggaagc
caaagttttc aaactgctgc 2580atactttgac aaggaaaatc tatatttgtc ttccgatcaa
catttatgac ctaagtcagg 2640taatatacct ggtttacttc tttagcattt ttatgcagac
agtctgttat gcactgtggt 2700ttcagatgtg caataatttg tacaatggtt tattcccaag
tatgccttaa gcagaacaaa 2760tgtgtttttc tatatagttc cttgccttaa taaatatgta
atataaattt aagcaaacgt 2820ctattttgta tatttgtaaa ctacaaagta aaatgaacat
tttgtggagt ttgtattttg 2880catactcaag gtgagaatta agttttaaat aaacctataa
tattttatct gaaaaaaaaa 2940aaaaaaaaa
29496479PRTHomo sapiens 6Met Arg Gln Pro Pro Gly
Glu Ser Asp Met Ala Val Ser Asp Ala Leu 1 5
10 15 Leu Pro Ser Phe Ser Thr Phe Ala Ser Gly Pro
Ala Gly Arg Glu Lys 20 25
30 Thr Leu Arg Gln Ala Gly Ala Pro Asn Asn Arg Trp Arg Glu Glu
Leu 35 40 45 Ser
His Met Lys Arg Leu Pro Pro Val Leu Pro Gly Arg Pro Tyr Asp 50
55 60 Leu Ala Ala Ala Thr Val
Ala Thr Asp Leu Glu Ser Gly Gly Ala Gly 65 70
75 80 Ala Ala Cys Gly Gly Ser Asn Leu Ala Pro Leu
Pro Arg Arg Glu Thr 85 90
95 Glu Glu Phe Asn Asp Leu Leu Asp Leu Asp Phe Ile Leu Ser Asn Ser
100 105 110 Leu Thr
His Pro Pro Glu Ser Val Ala Ala Thr Val Ser Ser Ser Ala 115
120 125 Ser Ala Ser Ser Ser Ser Ser
Pro Ser Ser Ser Gly Pro Ala Ser Ala 130 135
140 Pro Ser Thr Cys Ser Phe Thr Tyr Pro Ile Arg Ala
Gly Asn Asp Pro 145 150 155
160 Gly Val Ala Pro Gly Gly Thr Gly Gly Gly Leu Leu Tyr Gly Arg Glu
165 170 175 Ser Ala Pro
Pro Pro Thr Ala Pro Phe Asn Leu Ala Asp Ile Asn Asp 180
185 190 Val Ser Pro Ser Gly Gly Phe Val
Ala Glu Leu Leu Arg Pro Glu Leu 195 200
205 Asp Pro Val Tyr Ile Pro Pro Gln Gln Pro Gln Pro Pro
Gly Gly Gly 210 215 220
Leu Met Gly Lys Phe Val Leu Lys Ala Ser Leu Ser Ala Pro Gly Ser 225
230 235 240 Glu Tyr Gly Ser
Pro Ser Val Ile Ser Val Ser Lys Gly Ser Pro Asp 245
250 255 Gly Ser His Pro Val Val Val Ala Pro
Tyr Asn Gly Gly Pro Pro Arg 260 265
270 Thr Cys Pro Lys Ile Lys Gln Glu Ala Val Ser Ser Cys Thr
His Leu 275 280 285
Gly Ala Gly Pro Pro Leu Ser Asn Gly His Arg Pro Ala Ala His Asp 290
295 300 Phe Pro Leu Gly Arg
Gln Leu Pro Ser Arg Thr Thr Pro Thr Leu Gly 305 310
315 320 Leu Glu Glu Val Leu Ser Ser Arg Asp Cys
His Pro Ala Leu Pro Leu 325 330
335 Pro Pro Gly Phe His Pro His Pro Gly Pro Asn Tyr Pro Ser Phe
Leu 340 345 350 Pro
Asp Gln Met Gln Pro Gln Val Pro Pro Leu His Tyr Gln Glu Leu 355
360 365 Met Pro Pro Gly Ser Cys
Met Pro Glu Glu Pro Lys Pro Lys Arg Gly 370 375
380 Arg Arg Ser Trp Pro Arg Lys Arg Thr Ala Thr
His Thr Cys Asp Tyr 385 390 395
400 Ala Gly Cys Gly Lys Thr Tyr Thr Lys Ser Ser His Leu Lys Ala His
405 410 415 Leu Arg
Thr His Thr Gly Glu Lys Pro Tyr His Cys Asp Trp Asp Gly 420
425 430 Cys Gly Trp Lys Phe Ala Arg
Ser Asp Glu Leu Thr Arg His Tyr Arg 435 440
445 Lys His Thr Gly His Arg Pro Phe Gln Cys Gln Lys
Cys Asp Arg Ala 450 455 460
Phe Ser Arg Ser Asp His Leu Ala Leu His Met Lys Arg His Phe 465
470 475 72379DNAHomo sapiens
7gacccccgag ctgtgctgct cgcggccgcc accgccgggc cccggccgtc cctggctccc
60ctcctgcctc gagaagggca gggcttctca gaggcttggc gggaaaaaga acggagggag
120ggatcgcgct gagtataaaa gccggttttc ggggctttat ctaactcgct gtagtaattc
180cagcgagagg cagagggagc gagcgggcgg ccggctaggg tggaagagcc gggcgagcag
240agctgcgctg cgggcgtcct gggaagggag atccggagcg aatagggggc ttcgcctctg
300gcccagccct cccgctgatc ccccagccag cggtccgcaa cccttgccgc atccacgaaa
360ctttgcccat agcagcgggc gggcactttg cactggaact tacaacaccc gagcaaggac
420gcgactctcc cgacgcgggg aggctattct gcccatttgg ggacacttcc ccgccgctgc
480caggacccgc ttctctgaaa ggctctcctt gcagctgctt agacgctgga tttttttcgg
540gtagtggaaa accagcagcc tcccgcgacg atgcccctca acgttagctt caccaacagg
600aactatgacc tcgactacga ctcggtgcag ccgtatttct actgcgacga ggaggagaac
660ttctaccagc agcagcagca gagcgagctg cagcccccgg cgcccagcga ggatatctgg
720aagaaattcg agctgctgcc caccccgccc ctgtccccta gccgccgctc cgggctctgc
780tcgccctcct acgttgcggt cacacccttc tcccttcggg gagacaacga cggcggtggc
840gggagcttct ccacggccga ccagctggag atggtgaccg agctgctggg aggagacatg
900gtgaaccaga gtttcatctg cgacccggac gacgagacct tcatcaaaaa catcatcatc
960caggactgta tgtggagcgg cttctcggcc gccgccaagc tcgtctcaga gaagctggcc
1020tcctaccagg ctgcgcgcaa agacagcggc agcccgaacc ccgcccgcgg ccacagcgtc
1080tgctccacct ccagcttgta cctgcaggat ctgagcgccg ccgcctcaga gtgcatcgac
1140ccctcggtgg tcttccccta ccctctcaac gacagcagct cgcccaagtc ctgcgcctcg
1200caagactcca gcgccttctc tccgtcctcg gattctctgc tctcctcgac ggagtcctcc
1260ccgcagggca gccccgagcc cctggtgctc catgaggaga caccgcccac caccagcagc
1320gactctgagg aggaacaaga agatgaggaa gaaatcgatg ttgtttctgt ggaaaagagg
1380caggctcctg gcaaaaggtc agagtctgga tcaccttctg ctggaggcca cagcaaacct
1440cctcacagcc cactggtcct caagaggtgc cacgtctcca cacatcagca caactacgca
1500gcgcctccct ccactcggaa ggactatcct gctgccaaga gggtcaagtt ggacagtgtc
1560agagtcctga gacagatcag caacaaccga aaatgcacca gccccaggtc ctcggacacc
1620gaggagaatg tcaagaggcg aacacacaac gtcttggagc gccagaggag gaacgagcta
1680aaacggagct tttttgccct gcgtgaccag atcccggagt tggaaaacaa tgaaaaggcc
1740cccaaggtag ttatccttaa aaaagccaca gcatacatcc tgtccgtcca agcagaggag
1800caaaagctca tttctgaaga ggacttgttg cggaaacgac gagaacagtt gaaacacaaa
1860cttgaacagc tacggaactc ttgtgcgtaa ggaaaagtaa ggaaaacgat tccttctaac
1920agaaatgtcc tgagcaatca cctatgaact tgtttcaaat gcatgatcaa atgcaacctc
1980acaaccttgg ctgagtcttg agactgaaag atttagccat aatgtaaact gcctcaaatt
2040ggactttggg cataaaagaa cttttttatg cttaccatct tttttttttc tttaacagat
2100ttgtatttaa gaattgtttt taaaaaattt taagatttac acaatgtttc tctgtaaata
2160ttgccattaa atgtaaataa ctttaataaa acgtttatag cagttacaca gaatttcaat
2220cctagtatat agtacctagt attataggta ctataaaccc taattttttt tatttaagta
2280cattttgctt tttaaagttg atttttttct attgttttta gaaaaaataa aataactggc
2340aaatatatca ttgagccaaa tcttaaaaaa aaaaaaaaa
23798454PRTHomo sapiens 8Met Asp Phe Phe Arg Val Val Glu Asn Gln Gln Pro
Pro Ala Thr Met 1 5 10
15 Pro Leu Asn Val Ser Phe Thr Asn Arg Asn Tyr Asp Leu Asp Tyr Asp
20 25 30 Ser Val Gln
Pro Tyr Phe Tyr Cys Asp Glu Glu Glu Asn Phe Tyr Gln 35
40 45 Gln Gln Gln Gln Ser Glu Leu Gln
Pro Pro Ala Pro Ser Glu Asp Ile 50 55
60 Trp Lys Lys Phe Glu Leu Leu Pro Thr Pro Pro Leu Ser
Pro Ser Arg 65 70 75
80 Arg Ser Gly Leu Cys Ser Pro Ser Tyr Val Ala Val Thr Pro Phe Ser
85 90 95 Leu Arg Gly Asp
Asn Asp Gly Gly Gly Gly Ser Phe Ser Thr Ala Asp 100
105 110 Gln Leu Glu Met Val Thr Glu Leu Leu
Gly Gly Asp Met Val Asn Gln 115 120
125 Ser Phe Ile Cys Asp Pro Asp Asp Glu Thr Phe Ile Lys Asn
Ile Ile 130 135 140
Ile Gln Asp Cys Met Trp Ser Gly Phe Ser Ala Ala Ala Lys Leu Val 145
150 155 160 Ser Glu Lys Leu Ala
Ser Tyr Gln Ala Ala Arg Lys Asp Ser Gly Ser 165
170 175 Pro Asn Pro Ala Arg Gly His Ser Val Cys
Ser Thr Ser Ser Leu Tyr 180 185
190 Leu Gln Asp Leu Ser Ala Ala Ala Ser Glu Cys Ile Asp Pro Ser
Val 195 200 205 Val
Phe Pro Tyr Pro Leu Asn Asp Ser Ser Ser Pro Lys Ser Cys Ala 210
215 220 Ser Gln Asp Ser Ser Ala
Phe Ser Pro Ser Ser Asp Ser Leu Leu Ser 225 230
235 240 Ser Thr Glu Ser Ser Pro Gln Gly Ser Pro Glu
Pro Leu Val Leu His 245 250
255 Glu Glu Thr Pro Pro Thr Thr Ser Ser Asp Ser Glu Glu Glu Gln Glu
260 265 270 Asp Glu
Glu Glu Ile Asp Val Val Ser Val Glu Lys Arg Gln Ala Pro 275
280 285 Gly Lys Arg Ser Glu Ser Gly
Ser Pro Ser Ala Gly Gly His Ser Lys 290 295
300 Pro Pro His Ser Pro Leu Val Leu Lys Arg Cys His
Val Ser Thr His 305 310 315
320 Gln His Asn Tyr Ala Ala Pro Pro Ser Thr Arg Lys Asp Tyr Pro Ala
325 330 335 Ala Lys Arg
Val Lys Leu Asp Ser Val Arg Val Leu Arg Gln Ile Ser 340
345 350 Asn Asn Arg Lys Cys Thr Ser Pro
Arg Ser Ser Asp Thr Glu Glu Asn 355 360
365 Val Lys Arg Arg Thr His Asn Val Leu Glu Arg Gln Arg
Arg Asn Glu 370 375 380
Leu Lys Arg Ser Phe Phe Ala Leu Arg Asp Gln Ile Pro Glu Leu Glu 385
390 395 400 Asn Asn Glu Lys
Ala Pro Lys Val Val Ile Leu Lys Lys Ala Thr Ala 405
410 415 Tyr Ile Leu Ser Val Gln Ala Glu Glu
Gln Lys Leu Ile Ser Glu Glu 420 425
430 Asp Leu Leu Arg Lys Arg Arg Glu Gln Leu Lys His Lys Leu
Glu Gln 435 440 445
Leu Arg Asn Ser Cys Ala 450 92098DNAHomo sapiens
9attataaatc tagagactcc aggattttaa cgttctgctg gactgagctg gttgcctcat
60gttattatgc aggcaactca ctttatccca atttcttgat acttttcctt ctggaggtcc
120tatttctcta acatcttcca gaaaagtctt aaagctgcct taaccttttt tccagtccac
180ctcttaaatt ttttcctcct cttcctctat actaacatga gtgtggatcc agcttgtccc
240caaagcttgc cttgctttga agcatccgac tgtaaagaat cttcacctat gcctgtgatt
300tgtgggcctg aagaaaacta tccatccttg caaatgtctt ctgctgagat gcctcacacg
360gagactgtct ctcctcttcc ttcctccatg gatctgctta ttcaggacag ccctgattct
420tccaccagtc ccaaaggcaa acaacccact tctgcagaga agagtgtcgc aaaaaaggaa
480gacaaggtcc cggtcaagaa acagaagacc agaactgtgt tctcttccac ccagctgtgt
540gtactcaatg atagatttca gagacagaaa tacctcagcc tccagcagat gcaagaactc
600tccaacatcc tgaacctcag ctacaaacag gtgaagacct ggttccagaa ccagagaatg
660aaatctaaga ggtggcagaa aaacaactgg ccgaagaata gcaatggtgt gacgcagaag
720gcctcagcac ctacctaccc cagcctttac tcttcctacc accagggatg cctggtgaac
780ccgactggga accttccaat gtggagcaac cagacctgga acaattcaac ctggagcaac
840cagacccaga acatccagtc ctggagcaac cactcctgga acactcagac ctggtgcacc
900caatcctgga acaatcaggc ctggaacagt cccttctata actgtggaga ggaatctctg
960cagtcctgca tgcagttcca gccaaattct cctgccagtg acttggaggc tgccttggaa
1020gctgctgggg aaggccttaa tgtaatacag cagaccacta ggtattttag tactccacaa
1080accatggatt tattcctaaa ctactccatg aacatgcaac ctgaagacgt gtgaagatga
1140gtgaaactga tattactcaa tttcagtctg gacactggct gaatccttcc tctcccctcc
1200tcccatccct cataggattt ttcttgtttg gaaaccacgt gttctggttt ccatgatgcc
1260catccagtca atctcatgga gggtggagta tggttggagc ctaatcagcg aggtttcttt
1320tttttttttt ttcctattgg atcttcctgg agaaaatact tttttttttt ttttttttga
1380aacggagtct tgctctgtcg cccaggctgg agtgcagtgg cgcggtcttg gctcactgca
1440agctccgtct cccgggttca cgccattctc ctgcctcagc ctcccgagca gctgggacta
1500caggcgcccg ccacctcgcc cggctaatat tttgtatttt tagtagagac ggggtttcac
1560tgtgttagcc aggatggtct cgatctcctg accttgtgat ccacccgcct cggcctccct
1620aacagctggg atttacaggc gtgagccacc gcgccctgcc tagaaaagac attttaataa
1680ccttggctgc cgtctctggc tatagataag tagatctaat actagtttgg atatctttag
1740ggtttagaat ctaacctcaa gaataagaaa tacaagtaca aattggtgat gaagatgtat
1800tcgtattgtt tgggattggg aggctttgct tattttttaa aaactattga ggtaaagggt
1860taagctgtaa catacttaat tgatttctta ccgtttttgg ctctgttttg ctatatcccc
1920taatttgttg gttgtgctaa tctttgtaga aagaggtctc gtatttgctg catcgtaatg
1980acatgagtac tgctttagtt ggtttaagtt caaatgaatg aaacaactat ttttccttta
2040gttgatttta ccctgatttc accgagtgtt tcaatgagta aatatacagc ttaaacat
209810305PRTHomo sapiens 10Met Ser Val Asp Pro Ala Cys Pro Gln Ser Leu
Pro Cys Phe Glu Ala 1 5 10
15 Ser Asp Cys Lys Glu Ser Ser Pro Met Pro Val Ile Cys Gly Pro Glu
20 25 30 Glu Asn
Tyr Pro Ser Leu Gln Met Ser Ser Ala Glu Met Pro His Thr 35
40 45 Glu Thr Val Ser Pro Leu Pro
Ser Ser Met Asp Leu Leu Ile Gln Asp 50 55
60 Ser Pro Asp Ser Ser Thr Ser Pro Lys Gly Lys Gln
Pro Thr Ser Ala 65 70 75
80 Glu Lys Ser Val Ala Lys Lys Glu Asp Lys Val Pro Val Lys Lys Gln
85 90 95 Lys Thr Arg
Thr Val Phe Ser Ser Thr Gln Leu Cys Val Leu Asn Asp 100
105 110 Arg Phe Gln Arg Gln Lys Tyr Leu
Ser Leu Gln Gln Met Gln Glu Leu 115 120
125 Ser Asn Ile Leu Asn Leu Ser Tyr Lys Gln Val Lys Thr
Trp Phe Gln 130 135 140
Asn Gln Arg Met Lys Ser Lys Arg Trp Gln Lys Asn Asn Trp Pro Lys 145
150 155 160 Asn Ser Asn Gly
Val Thr Gln Lys Ala Ser Ala Pro Thr Tyr Pro Ser 165
170 175 Leu Tyr Ser Ser Tyr His Gln Gly Cys
Leu Val Asn Pro Thr Gly Asn 180 185
190 Leu Pro Met Trp Ser Asn Gln Thr Trp Asn Asn Ser Thr Trp
Ser Asn 195 200 205
Gln Thr Gln Asn Ile Gln Ser Trp Ser Asn His Ser Trp Asn Thr Gln 210
215 220 Thr Trp Cys Thr Gln
Ser Trp Asn Asn Gln Ala Trp Asn Ser Pro Phe 225 230
235 240 Tyr Asn Cys Gly Glu Glu Ser Leu Gln Ser
Cys Met Gln Phe Gln Pro 245 250
255 Asn Ser Pro Ala Ser Asp Leu Glu Ala Ala Leu Glu Ala Ala Gly
Glu 260 265 270 Gly
Leu Asn Val Ile Gln Gln Thr Thr Arg Tyr Phe Ser Thr Pro Gln 275
280 285 Thr Met Asp Leu Phe Leu
Asn Tyr Ser Met Asn Met Gln Pro Glu Asp 290 295
300 Val 305 114014DNAHomo sapiens 11gtgcggggga
agatgtagca gcttcttctc cgaaccaacc ctttgccttc ggacttctcc 60ggggccagca
gccgcccgac caggggcccg gggccacggg ctcagccgac gaccatgggc 120tccgtgtcca
accagcagtt tgcaggtggc tgcgccaagg cggcagaaga ggcgcccgag 180gaggcgccgg
aggacgcggc ccgggcggcg gacgagcctc agctgctgca cggtgcgggc 240atctgtaagt
ggttcaacgt gcgcatgggg ttcggcttcc tgtccatgac cgcccgcgcc 300ggggtcgcgc
tcgacccccc agtggatgtc tttgtgcacc agagtaagct gcacatggaa 360gggttccgga
gcttgaagga gggtgaggca gtggagttca cctttaagaa gtcagccaag 420ggtctggaat
ccatccgtgt caccggacct ggtggagtat tctgtattgg gagtgagagg 480cggccaaaag
gaaagagcat gcagaagcgc agatcaaaag gagacaggtg ctacaactgt 540ggaggtctag
atcatcatgc caaggaatgc aagctgccac cccagcccaa gaagtgccac 600ttctgccaga
gcatcagcca tatggtagcc tcatgtccgc tgaaggccca gcagggccct 660agtgcacagg
gaaagccaac ctactttcga gaggaagaag aagaaatcca cagccctacc 720ctgctcccgg
aggcacagaa ttgagccaca atgggtgggg gctattcttt tgctatcagg 780aagttttgag
gagcaggcag agtggagaaa gtgggaatag ggtgcattgg ggctagttgg 840cactgccatg
tatctcaggc ttgggttcac accatcaccc tttcttccct ctaggtgggg 900ggaaagggtg
agtcaaagga actccaacca tgctctgtcc aaatgcaagt gagggttctg 960ggggcaacca
ggagggggga atcaccctac aacctgcata ctttgagtct ccatccccag 1020aatttccagc
ttttgaaagt ggcctggata gggaagttgt tttcctttta aagaaggata 1080tataataatt
cccatgccag agtgaaatga ttaagtataa gaccagattc atggagccaa 1140gccactacat
tctgtggaag gagatctctc aggagtaagc attgtttttt tttcacatct 1200tgtatcctca
tacccacttt tgggataggg tgctggcagc tgtcccaagc aatgggtaat 1260gatgatggca
aaaagggtgt ttgggggaac agctgcagac ctgctgctct atgctcaccc 1320ccgccccatt
ctgggccaat gtgattttat ttatttgctc ccttggatac tgcaccttgg 1380gtcccacttt
ctccaggatg ccaactgcac tagctgtgtg cgaatgacgt atcttgtgca 1440ttttaacttt
ttttccttaa tataaatatt ctggttttgt atttttgtat attttaatct 1500aaggccctca
tttcctgcac tgtgttctca ggtacatgag caatctcagg gatagccagc 1560agcagctcca
ggtctgcgca gcaggaatta ctttttgttg tttttgccac cgtggagagc 1620aactatttgg
agtgcacagc ctattgaact acctcatttt tgccaataag agctggcttt 1680tctgccatag
tgtcctcttg aaaccccctc tgccttgaaa atgttttatg ggagactagg 1740ttttaactgg
gtggccccat gacttgattg ccttctactg gaagattggg aattagtcta 1800aacaggaaat
ggtggtacac agaggctagg agaggctggg cccggtgaaa aggccagaga 1860gcaagccaag
attaggtgag ggttgtctaa tcctatggca caggacgtgc tttacatctc 1920cagatctgtt
cttcaccaga ttaggttagg cctaccatgt gccacagggt gtgtgtgtgt 1980ttgtaaaact
agagttgcta aggataagtt taaagaccaa tacccctgta cttaatcctg 2040tgctgtcgag
ggatggatat atgaagtaag gtgagatcct taacctttca aaattttcgg 2100gttccaggga
gacacacaag cgagggtttt gtggtgcctg gagcctgtgt cctgccctgc 2160tacagtagtg
attaatagtg tcatggtagc taaaggagaa aaagggggtt tcgtttacac 2220gctgtgagat
caccgcaaac ctaccttact gtgttgaaac gggacaaatg caatagaacg 2280cattgggtgg
tgtgtgtctg atcctgggtt cttgtctccc ctaaatgctg ccccccaagt 2340tactgtattt
gtctgggctt tgtaggactt cactacgttg attgctaggt ggcctagttt 2400gtgtaaatat
aatgtattgg tctttctccg tgttctttgg gggttttgtt tacaaacttc 2460tttttgtatt
gagagaaaaa tagccaaagc atctttgaca gaaggttctg caccaggcaa 2520aaagatctga
aacattagtt tggggggccc tcttcttaaa gtggggatct tgaaccatcc 2580tttcttttgt
attccccttc ccctattacc tattagacca gatcttctgt cctaaaaact 2640tgtcttctac
cctgccctct tttctgttca cccccaaaag aaaacttaca cacccacaca 2700catacacatt
tcatgcttgg agtgtctcca caactcttaa atgatgtatg caaaaatact 2760gaagctagga
aaaccctcca tcccttgttc ccaacctcct aagtcaagac cattaccatt 2820tctttctttc
tttttttttt ttttttaaaa tggagtctca ctgtgtcacc caggctggag 2880tgcagtggca
tgatcggctc actgcagcct ctgcctcttg ggttcaagtg attctcctgc 2940ctcagcctcc
tgagtagctg ggatttcagg cacccgccac actcagctaa tttttgtatt 3000tttagtagag
acggggtttc accatgttgt ccaggctggt ctggaactcc tgacctcagg 3060tgatctgccc
accttggctt cccaaagtgc tgggattaca ggcatgagcc accatgctgg 3120gccaaccatt
tcttggtgta ttcatgccaa acacttaaga cactgctgta gcccaggcgc 3180ggtggctcac
acctgtaatc ccagcacttt ggaaggctga ggcgggcgga tcacaaggtc 3240acgagttcaa
aactatcctg gccaacacag tgaaaccccg tctctactaa aatacaaaaa 3300aattagccgg
gtgtggtggt gcatgccttt agtcctagct attcaggagg ctgaggcagg 3360ggaatcgctt
gaacccgaga ggcagaggtt gcagtgagct gagatcgcac cactgcactc 3420cagcctggtt
acagagcaag actctgtctc aaacaaaaca aaacaaaaca aaaacacact 3480actgtatttt
ggatggatca aacctcctta attttaattt ctaatcctaa agtaaagaga 3540tgcaattggg
ggccttccat gtagaaagtg gggtcaggag gccaagaaag ggaatatgaa 3600tgtatatcca
agtcactcag gaacttttat gcaggtgcta gaaactttat gtcaaagtgg 3660ccacaagatt
gtttaatagg agacgaacga atgtaactcc atgtttactg ctaaaaacca 3720aagctttgtg
taaaatcttg aatttatggg gcgggagggt aggaaagcct gtacctgtct 3780gtttttttcc
tgatcctttt ccctcattcc tgaactgcag gagactgagc ccctttgggc 3840tttggtgacc
ccatcactgg ggtgtgttta tttgatggtt gattttgctg tactgggtac 3900ttcctttccc
attttctaat cattttttaa cacaagctga ctcttccctt cccttctcct 3960ttccctggga
aaatacaatg aataaataaa gacttattgg tacgcaaact gtca 401412209PRTHomo
sapiens 12Met Gly Ser Val Ser Asn Gln Gln Phe Ala Gly Gly Cys Ala Lys Ala
1 5 10 15 Ala Glu
Glu Ala Pro Glu Glu Ala Pro Glu Asp Ala Ala Arg Ala Ala 20
25 30 Asp Glu Pro Gln Leu Leu His
Gly Ala Gly Ile Cys Lys Trp Phe Asn 35 40
45 Val Arg Met Gly Phe Gly Phe Leu Ser Met Thr Ala
Arg Ala Gly Val 50 55 60
Ala Leu Asp Pro Pro Val Asp Val Phe Val His Gln Ser Lys Leu His 65
70 75 80 Met Glu Gly
Phe Arg Ser Leu Lys Glu Gly Glu Ala Val Glu Phe Thr 85
90 95 Phe Lys Lys Ser Ala Lys Gly Leu
Glu Ser Ile Arg Val Thr Gly Pro 100 105
110 Gly Gly Val Phe Cys Ile Gly Ser Glu Arg Arg Pro Lys
Gly Lys Ser 115 120 125
Met Gln Lys Arg Arg Ser Lys Gly Asp Arg Cys Tyr Asn Cys Gly Gly 130
135 140 Leu Asp His His
Ala Lys Glu Cys Lys Leu Pro Pro Gln Pro Lys Lys 145 150
155 160 Cys His Phe Cys Gln Ser Ile Ser His
Met Val Ala Ser Cys Pro Leu 165 170
175 Lys Ala Gln Gln Gly Pro Ser Ala Gln Gly Lys Pro Thr Tyr
Phe Arg 180 185 190
Glu Glu Glu Glu Glu Ile His Ser Pro Thr Leu Leu Pro Glu Ala Gln
195 200 205 Asn
User Contributions:
Comment about this patent or add new information about this topic: