Patent application title: Identification of Tumor-Associated Markers for Diagnosis and Therapy
Inventors:
IPC8 Class: AC12Q168FI
USPC Class:
1 1
Class name:
Publication date: 2016-07-28
Patent application number: 20160215351
Abstract:
The present technology relates to genetic products the expression of
which is associated with cancer diseases. The present technology also
relates to the therapy and diagnosis of diseases in which the genetic
products are expressed or aberrantly expressed, in particular cancer
diseases.Claims:
1.-48. (canceled)
49. A method of diagnosing or monitoring a lung cancer or a colon cancer, wherein the method comprises the steps of: detecting the presence of or determining the quantity of a tumor-associated nucleic acid in a biological sample comprising lung or colon tissue isolated from a mammal having or suspected of having lung or colon cancer, and diagnosing or monitoring lung or colon cancer based on the presence or quantity of the tumor-associated nucleic acid in the biological sample, wherein the tumor-associated nucleic acid is selected from the group consisting of (a) a nucleic acid that comprises a nucleic acid sequence consisting essentially of SEQ ID NO: 587, and (b) a nucleic acid that has at least 90% sequence identity with the nucleic acid of (a); the detecting or determining comprises (i) contacting the biological sample with an agent that binds specifically to the tumor-associated nucleic acid, and (ii) detecting the formation of or determining the quantity of a complex between the agent and the tumor-associated nucleic acid wherein said agent is an oligonucleotide or polynucleotide that hybridizes specifically to the tumor-associated nucleic acid or to the complementary nucleic acid sequence, and has a nucleic acid sequence comprising SEQ ID NO: 589 or 590; and the lung or colon cancer is characterized by expression of or abnormal expression of a tumor-associated antigen encoded by the tumor-associated nucleic acid.
50. The method of claim 49, wherein the monitoring of the lung or the colon cancer comprises determining regression, course or onset of the lung or colon cancer in the mammal.
51. The method of claim 49, wherein the method comprises a detection of the presence of or a determination of the quantity of the tumor-associated nucleic acid in a first sample at a first point in time and in a further sample at a second point in time and a comparison of the presence of or quantity of the tumor-associated nucleic acid in the two samples.
52. The method of claim 49, wherein the agent is labeled in a detectable manner.
53. The method of claim 49, wherein the lung or colon tissue is from a tissue biopsy.
54. The method of claim 49, wherein the tumor-associated antigen comprises an amino acid sequence consisting essentially of SEQ ID NO: 588.
55. A method of diagnosing or monitoring lung or colon cancer, wherein the method comprises the steps of: detecting or determining the quantity of a tumor-associated nucleic acid in a biological sample comprising lung or colon tissue isolated from a mammal having or suspected of having lung or colon cancer, and diagnosing or monitoring lung or colon cancer based on the presence or quantity of the tumor-associated nucleic acid in the biological sample, wherein the tumor-associated nucleic acid is selected from the group consisting of (a) a nucleic acid that comprises a nucleic acid sequence consisting essentially of SEQ ID NO: 587, and (b) a nucleic acid that has at least 90% sequence identity with the nucleic acid of (a); the detecting or determining comprises (i) contacting the biological sample with an agent that binds specifically to the tumor-associated nucleic acid, and (ii) detecting the formation of or determining the quantity of a complex between the agent and the tumor-associated nucleic acid via real-time reverse-transcription polymerase chain reaction (RT-PCR); the lung or colon cancer is characterized by expression or abnormal expression of a tumor-associated antigen encoded by the tumor-associated nucleic acid; and the agent is an oligonucleotide or polynucleotide that hybridizes specifically to the tumor-associated nucleic acid or to the complementary nucleic acid sequence, and has a nucleic acid sequence comprising SEQ ID NO: 589 or 590.
56. The method of claim 55, wherein the monitoring of the lung or colon cancer comprises determining regression, course or onset of the lung or colon cancer in the mammal.
57. The method of claim 55, wherein the method comprises a detection of the presence of or determination of the quantity of the tumor-associated nucleic acid in a first sample at a first point in time and in a further sample at a second point in time and a comparison of the presence of or quantity of the tumor-associated nucleic acid in the two samples.
58. The method of claim 55, wherein the agent is labeled in a detectable manner.
59. The method of claim 55, wherein the lung or colon tissue is from a tissue biopsy.
60. The method of claim 55, wherein the tumor-associated antigen comprises an amino acid sequence consisting essentially of SEQ ID NO: 588.
61. The method of claim 55, wherein the agent is an oligonucleotide or polynucleotide that hybridizes specifically to the tumor-associated nucleic acid and has a nucleic acid sequence comprising SEQ ID NO: 589.
62. The method of claim 55, wherein the agent is an oligonucleotide or polynucleotide that hybridizes specifically to the tumor-associated nucleic acid and has a nucleic acid sequence comprising SEQ ID NO: 590.
63. The method of claim 49, wherein the lung cancer is an adenocarcinoma or a squamous cell carcinoma.
64. The method of claim 55, wherein the lung cancer is an adenocarcinoma or a squamous cell carcinoma.
Description:
RELATED APPLICATIONS
[0001] The present application is a continuation of International Patent Application No. PCT/EP08/08924, which was filed Oct. 22, 2008, claiming the benefit of priority to European Patent Application No. 07020730.3, which was filed on Oct. 23, 2007. The entire text of the aforementioned applications is incorporated herein by reference in its entirety.
FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT
[0002] [Not Applicable]
BACKGROUND OF THE INVENTION
[0003] The present technology relates to nucleic acids and encoded polypeptides which are expressed in cancers. The present technology also relates to agents which bind the polypeptides. The nucleic acids, polypeptides coded for by such nucleic acids and peptides derived therefrom, as well as related antibodies and cytolytic T lymphocytes, are useful, inter alia, in diagnostic and therapeutic contexts.
[0004] Despite interdisciplinary approaches and exhaustive use of classical therapeutic procedures, cancers are still among the leading causes of death.
[0005] More recent therapeutic concepts in cancer therapy aim at incorporating the patient's immune system into the overall therapeutic concept by using recombinant tumor vaccines and other specific measures such as antibody therapy. A prerequisite for the success of such a strategy is the recognition of tumor-specific or tumor-associated antigens or epitopes by the patient's immune system whose effector functions are to be interventionally enhanced.
[0006] Tumor cells biologically differ substantially from their nonmalignant cells of origin. These differences are due to genetic alterations acquired during tumor development and result, inter alia, also in the formation of qualitatively or quantitatively altered molecular structures in the cancer cells. Tumor-associated structures of this kind which are recognized by the specific immune system of the tumor-harboring host are referred to as tumor-associated antigens.
[0007] The specific recognition of tumor-associated antigens involves cellular and humoral mechanisms which are two functionally interconnected units: CD4.sup.+ and CD8.sup.+ T lymphocytes recognize the processed antigens presented on the molecules of the MHC (major histocompatibility complex) classes II and I, respectively, while B lymphocytes produce circulating antibody molecules which bind directly to unprocessed antigens. The potential clinical-therapeutical importance of tumor-associated antigens results from the fact that the recognition of antigens on neoplastic cells by the immune system leads to the initiation of cytotoxic effector mechanisms and, in the presence of T helper cells, can cause elimination of the cancer cells (Pardon, Nat. Med. 4:525-31, 1998).
[0008] Antibody based cancer therapies have been successfully introduced into the clinic and have emerged as the most promising therapeutics in oncology over the last decade. Eight antibodies have been approved for treatment of neoplastic diseases, most of them, however in lymphoma and leukemia (Adams G P, Weiner L M, Nat Biotechnol 23:1147-57, 2005).
[0009] One of the challenges to be mastered for the advent of the next generation of upgraded antibody-based cancer therapeutics is the selection of appropriate target molecules, which is the key for a favorable toxicity/efficacy profile.
[0010] The search for genes tightly silenced in the vast majority of healthy tissues moves into the focus of attention the intriguing observation that genes of the gametogenic and/or trophoblastic lineage are frequently ectopically activated and robustly expressed in human cancer. Based on phenotypical similarities between germ cells, pregnancy trophoblast and cancer cells, John Beard proposed as much as 100 years ago a "trophoblastic theory of cancer" (Beard J, Lancet 1:1758-63, 1902; Gurchot C, Oncology 31:310-3, 1975). The discovery of the sporadic production of chorionic gonadotropin, alpha-fetoprotein, CEA and other trophoblastic hormones by cancer cells provided the first molecules shared between neoplastic and trophoblastic cells (Acevedo H F et al., Cancer 76:1467-75, 1995; Dirnhofer S et al., Hum Pathol 29:377-82, 1998; Gurchot C, Oncology 31:310-3, 1975; Iles R K, Chard T, J Ural 145:453-8, 1991; Laurence D J, Neville A M, Br J Cancer 26:335-55, 1972). The concept was reignited by the inauguration of the steadily growing so-called cancer/germline (CO) class of genes, which represents more than 100 members, each expressed in a variety of tumor types. The observation that entire trophoblastic and gametogenic programs escape transcriptional silencing and are ectopically activated in cancer cells (Koslowski M et al., Cancer Res 64:5988-93, 2004; Simpson A J et al., Nat Rev Cancer 5:615-25, 2005) indicates that within this class of genes with exquisitely selective tissue distribution, appropriate targets for mAB therapy may be found.
[0011] It was the object of the present technology to provide target structures for a diagnosis and therapy of cancers. This object is achieved by the subject matter of the claims.
BRIEF SUMMARY OF THE INVENTION
[0012] According to the present technology, placenta-specific genes are identified which are selectively or aberrantly expressed in tumor cells and thus, provide target structures for therapeutic and diagnostic approaches.
[0013] The nucleic acids identified according to the present technology to be selectively or aberrantly expressed in tumor cells are selected from the group consisting of (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-540, 541, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624 of the sequence listing, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c). These nucleic acids are also termed "tumor-associated nucleic acids" herein.
[0014] In another aspect, the present technology relates to antigens encoded by the tumor-associated nucleic acids identified according to the present technology. Accordingly, the tumor-associated antigens identified according to the present technology have an amino acid sequence encoded by a nucleic acid which is selected from the group consisting of (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-540, 541, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624 of the sequence listing, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c). In a preferred embodiment, the tumor-associated antigens identified according to the present technology comprise an amino acid sequence selected from the group consisting of SEQ ID NOs: 542, 546, 550, 554, 567, 571, 584, 588, 592, 596, 603, 607, 614, 621, and 625 of the sequence listing, a part or derivative thereof.
[0015] If, according to the present technology, reference is made to nucleic acids comprising certain nucleic acid sequences or tumor-associated antigens comprising certain amino acid sequences this also includes embodiments wherein the nucleic acids or tumor-associated antigens consist of these certain nucleic acid sequences or amino acid sequences, respectively.
[0016] The present technology generally relates to the use of tumor-associated nucleic acids and tumor-associated antigens identified according to the present technology or of parts or derivatives thereof, of nucleic acids directed against said tumor-associated nucleic acids, of antibodies or T cells directed against the tumor-associated antigens identified according to the present technology or parts or derivatives thereof and/or of host cells expressing the tumor-associated antigens identified according to the present technology or parts or derivatives thereof for therapy, prophylaxis, diagnosis and/or monitoring of neoplastic diseases.
[0017] This may also involve the use of a combination of two or more of these nucleic acids, antigens, antibodies, T cells and/or host cells.
[0018] In those embodiments of the present technology relating to the use of antibodies directed against the tumor-associated antigens identified according to the present technology or parts or derivatives thereof also T cell receptors directed against the tumor-associated antigens identified according to the present technology or parts or derivatives thereof, optionally in a complex with MHC molecules, may be used.
[0019] Especially suitable for therapy, prophylaxis, diagnosis and/or monitoring is a part of the tumor-associated antigens identified according to the present technology which corresponds to the non-transmembrane portion, in particular the extracellular portion of the tumor-associated antigens or is comprised thereof. Therefore, according to the present technology, a part of the tumor-associated antigens identified according to the present technology which corresponds to the non-transmembrane portion, in particular the extracellular portion of the tumor-associated antigens or is comprised thereof, or a corresponding part of the nucleic acids coding for the tumor-associated antigens identified according to the present technology is preferred for therapy, prophylaxis, diagnosis and/or monitoring. Similarly the use of antibodies is preferred which are directed against a part of the tumor-associated antigens identified according to the present technology which corresponds to the non-transmembrane portion, in particular the extracellular portion of the tumor-associated antigens or is comprised thereof.
[0020] Preferred diseases for a therapy, prophylaxis, diagnosis and/or monitoring are those in which one or more of the tumor-associated nucleic acids identified according to the present technology are selectively expressed or abnormally expressed. Particularly preferred diseases for a therapy, prophylaxis, diagnosis and/or monitoring are those in which one or more of the tumor-associated nucleic acids identified according to the present technology and/or one or more of the tumor-associated antigens encoded thereby are selectively expressed or abnormally expressed.
[0021] In one aspect, the present technology relates to a pharmaceutical composition comprising an agent which recognizes a tumor-associated antigen identified according to the present technology or a nucleic acid coding for the tumor-associated antigen and which is preferably selective for cells which have expression or abnormal expression of a tumor-associated antigen identified according to the present technology.
[0022] In a further aspect, the present technology relates to a pharmaceutical composition comprising an agent which (I) inhibits expression or activity of a tumor-associated antigen identified according to the present technology, and/or (II) has tumor-inhibiting or tumor-destroying activity and is selective for cells expressing or abnormally expressing a tumor-associated antigen identified according to the present technology, and/or (III) when administered, selectively increases the amount of complexes between an MHC molecule and a tumor-associated antigen identified according to the present technology or a part thereof, such as a peptide epitope. In particular embodiments, said agent may cause induction of cell death, reduction in cell growth, damage to the cell membrane or secretion of cytokines and preferably have a tumor-inhibiting activity.
[0023] In one embodiment, the agent is an antisense nucleic acid which hybridizes selectively with the nucleic acid coding for the tumor-associated antigen. In a further embodiment, the agent is a siRNA preferably comprising a sense RNA strand and an antisense RNA strand, wherein the sense and antisense RNA strands form an RNA duplex, and wherein the sense RNA strand comprises a nucleotide sequence substantially identical to a target sequence of about 19 to about 25 contiguous nucleotides in a nucleic acid coding for the tumor-associated antigen, preferably mRNA coding for the tumor-associated antigen. In a further embodiment, the agent is an antibody which binds selectively to the tumor-associated antigen, in particular a complement-activated or toxin conjugated antibody which binds selectively to the tumor-associated antigen. In a preferred embodiment, the antibody which binds selectively to the tumor-associated antigen is coupled to a therapeutically useful substance and/or recruits natural or artificial effector mechanisms to said cell expressing or abnormally expressing said tumor-associated antigen. In a further embodiment, the agent is a cytotoxic T lymphocyte which recognizes the tumor-associated antigen or a part thereof bound by an MHC molecule on a cell and lyses the cells labeled in this way. In a further embodiment, the agent is a T helper lymphocyte which recognizes the tumor-associated antigen or a part thereof bound by an MHC molecule on a cell and enhances effector functions of other cells specifically recognizing said tumor-associated antigen or a part thereof.
[0024] In a further embodiment, the agent comprises two or more agents which each recognize different tumor-associated antigens or different nucleic acids coding for tumor-associated antigens and/or inhibit expression or activity of different tumor-associated antigens, and/or have tumor-inhibiting or tumor-destroying activity and are selective for cells expressing or abnormally expressing different tumor-associated antigens, and/or when administered, selectively increase the amount of complexes between MHC molecules and different tumor-associated antigens or parts thereof, wherein at least one of said different tumor-associated antigens is a tumor-associated antigen identified according to the present technology.
[0025] Preferably, a tumor-associated antigen selectively limited to tumors serves as a label for recruiting effector mechanisms to this specific location. In this aspect, the present technology includes embodiments wherein the agent itself does not have an ability to inhibit activity of a tumor-associated antigen or a tumor-inhibiting or tumor-destroying activity but mediates such effect, in particular by recruiting effector mechanisms, in particular those having cell damaging potential, to a specific location, in particular a tumor or tumor cells.
[0026] Preferably, said cells expressing or abnormally expressing a tumor-associated antigen identified according to the present technology are non-placenta cells.
[0027] The activity of a tumor-associated antigen identified according to the present technology can be any activity of a protein or a peptide. In one embodiment this activity is an enzymatic activity.
[0028] According to the present technology the phrase "inhibit expression or activity" includes a complete or essentially complete inhibition of expression or activity and a reduction in expression or activity.
[0029] The agent which, when administered, selectively increases the amount of complexes between an MHC molecule and a tumor-associated antigen identified according to the present technology or a part thereof comprises one or more components selected from the group consisting of (i) the tumor-associated antigen or a part thereof, (ii) a nucleic acid which codes for said tumor-associated antigen or a part thereof, (iii) a host cell which expresses said tumor-associated antigen or a part thereof, and (iv) isolated complexes between peptide epitopes from said tumor-associated antigen and an MHC molecule.
[0030] The present technology furthermore relates to a pharmaceutical composition which comprises one or more components selected from the group consisting of (i) a tumor-associated antigen identified according to the present technology or a part thereof, (ii) a nucleic acid which codes for a tumor-associated antigen identified according to the present technology or a part thereof, (iii) an antibody which binds to a tumor-associated antigen identified according to the present technology or to a part thereof, (iv) an antisense nucleic acid which hybridizes specifically with a tumor-associated nucleic acid identified according to the present technology/a nucleic acid coding for a tumor-associated antigen identified according to the present technology, (v) an siRNA directed against a tumor-associated nucleic acid identified according to the present technology/a nucleic acid coding for a tumor-associated antigen identified according to the present technology, (vi) a host cell which expresses a tumor-associated antigen identified according to the present technology or a part thereof, and (vii) isolated complexes between a tumor-associated antigen identified according to the present technology or a part thereof and an MHC molecule.
[0031] In one embodiment, a nucleic acid coding for a tumor-associated antigen identified according to the present technology or a part thereof is present in the pharmaceutical composition in an expression vector and functionally linked to a promoter. In a further embodiment, a nucleic acid coding for a tumor-associated antigen identified according to the present technology or a part thereof is present in the pharmaceutical composition in a virus as further described below.
[0032] A host cell present in a pharmaceutical composition of the present technology may secrete the tumor-associated antigen or the part thereof, may express it on the surface and preferably may additionally express an MHC molecule which binds to said tumor-associated antigen or said part thereof. In one embodiment, the host cell expresses the MHC molecule endogenously. In a further embodiment, the host cell expresses the MHC molecule and/or the tumor-associated antigen or the part thereof in a recombinant manner. The host cell is preferably nonproliferative. In a preferred embodiment, the host cell is an antigen-presenting cell, in particular a dendritic cell, a monocyte or a macrophage.
[0033] In a further embodiment, an antibody present in a pharmaceutical composition of the present technology is a monoclonal antibody. In further embodiments, the antibody is a chimeric or humanized antibody, a fragment of an antibody or a synthetic antibody. The antibody may be coupled to a therapeutically or diagnostically useful agent also termed therapeutic or diagnostic agent herein.
[0034] An antisense nucleic acid present in a pharmaceutical composition of the present technology may comprise a sequence of 6-50, in particular 10-30, 15-30 and 20-30, contiguous nucleotides of the nucleic acid coding for the tumor-associated antigen identified according to the present technology.
[0035] In further embodiments, a tumor-associated antigen or a part thereof, provided by a pharmaceutical composition of the present technology either directly or via expression of a nucleic acid, binds to MHC molecules on the surface of cells, said binding preferably causing a cytolytic response and/or inducing cytokine release.
[0036] A pharmaceutical composition of the present technology may comprise a pharmaceutically compatible carrier and/or an adjuvant.
[0037] A pharmaceutical composition of the present technology is preferably used for the treatment or prevention of a disease characterized by selective expression or abnormal expression of a tumor-associated nucleic acid and/or tumor-associated antigen. In a preferred embodiment, the disease is a neoplastic disease, preferably cancer.
[0038] In a preferred embodiment, the pharmaceutical composition of the present technology is in the form of a vaccine which may be used therapeutically or prophylactically. Such vaccine preferably comprises a tumor-associated antigen identified according to the present technology or a part thereof, and/or a nucleic acid which codes for a tumor-associated antigen identified according to the present technology or a part thereof. In particular embodiments, the nucleic acid is present in a virus or host cell.
[0039] The present technology furthermore relates to methods of treating, preventing, diagnosing or monitoring, i.e. determining the regression, progression, course and/or onset of, a disease characterized by expression or abnormal expression of one of more tumor-associated nucleic acids identified according to the present technology, preferably also resulting in expression or abnormal expression of one of more tumor-associated antigens identified according to the present technology, preferably a neoplastic disease, in particular cancer. In one embodiment, the treatment or prevention comprises administering a pharmaceutical composition of the present technology.
[0040] The methods of diagnosing and/or methods of monitoring according to the present technology generally concern the detection of and/or determination of the quantity of one or more parameters selected from the group consisting of (i) a tumor-associated nucleic acid identified according to the present technology, or a part thereof, (ii) a tumor-associated antigen identified according to the present technology, or a part thereof, (iii) an antibody against a tumor-associated antigen identified according to the present technology or a part thereof, and (iv) T lymphocytes, preferably cytotoxic or T helper lymphocytes, which are specific for a tumor-associated antigen identified according to the present technology or a part thereof and/or a complex between the tumor-associated antigen or a part thereof and an MHC molecule, in a biological sample isolated from a patient, preferably from a patient having said disease, being suspected of having or falling ill with said disease or having a potential for said disease. Means for accomplishing said detection and/or determination of the quantity are described herein and will be apparent to the skilled person.
[0041] Preferably, the presence of said nucleic acid or said part thereof, said tumor-associated antigen or said part thereof, said antibody and/or said T lymphocytes and/or a quantity of said nucleic acid or said part thereof, said tumor-associated antigen or said part thereof, said antibody and/or said T lymphocytes which is increased compared to a patient without said disease is indicative for the presence of said disease or a potential for a development of said disease.
[0042] The methods of diagnosing and/or monitoring of the present technology also include embodiments wherein by detection or determination of the quantity of said nucleic acid or said part thereof, said tumor-associated antigen or said part thereof, said antibody and/or said T lymphocytes it is possible to assess and/or prognose the metastatic behavior of said disease, wherein, preferably, the presence of said nucleic acid or said part thereof, said tumor-associated antigen or said part thereof, said antibody and/or said T lymphocytes and/or a quantity of said nucleic acid or said part thereof, said tumor-associated antigen or said part thereof, said antibody and/or said T lymphocytes which is increased compared to a patient without said disease or without a metastasis of said disease is indicative for a metastatic behavior of said disease or a potential for a metastatic behavior of said disease.
[0043] In particular embodiments, said detection or determination of the quantity comprises (i) contacting a biological sample with an agent which binds specifically to said tumor-associated nucleic acid or said part thereof, to said tumor-associated antigen or said part thereof, to said antibody or to said T lymphocytes, and (ii) detecting the formation of or determining the amount of a complex between the agent and the nucleic acid or the part thereof, the tumor-associated antigen or the part thereof, the antibody, or the T lymphocytes.
[0044] In one embodiment, the disease is characterized by expression or abnormal expression of two or more different tumor-associated nucleic acids preferably also resulting in expression or abnormal expression of two or more different tumor-associated antigens and a detection or determination of the quantity comprises a detection or determination of the quantity of two or more different tumor-associated nucleic acids or of parts thereof, of two or more different tumor-associated antigens or of parts thereof, of two or more antibodies binding to said two or more different tumor-associated antigens or to parts thereof and/or of two or more T lymphocytes specific for said two or more different tumor-associated antigens or parts thereof, or complexes thereof with MHC molecules. In a further embodiment, the biological sample isolated from the patient is compared to a comparable normal biological sample.
[0045] The methods of monitoring according to the present technology preferably comprise a detection of and/or determination of the quantity of one or more of the parameters mentioned above in a first sample at a first point in time and in a further sample at a second point in time, wherein the course of the disease is determined by comparing the two samples.
[0046] Preferably, a level of said nucleic acid or said part thereof, said tumor-associated antigen or said part thereof, said antibody and/or said T lymphocytes which is increased in a sample compared to a sample taken earlier from a patient indicates that the patient has developed or is about to develop cancer and/or a metastasis of cancer and/or a relapse of cancer. Preferably, a level of said nucleic acid or said part thereof, said tumor-associated antigen or said part thereof, said antibody and/or said T lymphocytes which is decreased in a sample compared to a sample taken earlier from a patient indicates regression of cancer and/or a metastasis of cancer in said patient and thus, preferably indicates a successful cancer therapy.
[0047] According to the present technology, detection of a nucleic acid or of a part thereof or determining the quantity of a nucleic acid or of a part thereof may be carried out using a oligo- or polynucleotide probe which hybridizes specifically to said nucleic acid or said part thereof or may be carried out by selective amplification of said nucleic acid or said part thereof, e.g. by means of PCR amplification. In one embodiment, the oligo- or polynucleotide probe comprises a sequence of 6-50, in particular 10-30, 15-30 and 20-30, contiguous nucleotides of said nucleic acid.
[0048] In particular embodiments, the tumor-associated antigen or the part thereof which is to be detected or the quantity of which is to be determined in the methods of the present technology is present intracellularly, on the cell surface or in a complex with an MHC molecule.
[0049] According to the present technology, detection of a tumor-associated antigen or of a part thereof or determining the quantity of a tumor-associated antigen or of a part thereof may be carried out using an antibody binding specifically to said tumor-associated antigen or said part thereof.
[0050] According to the present technology, detection of an antibody or determining the quantity of an antibody may be carried out using a protein or peptide binding specifically to said antibody.
[0051] According to the present technology, detection of or determining the quantity of T lymphocytes which are specific for a tumor-associated antigen or a part thereof and/or a complex thereof with an MHC molecule may be carried out using a cell presenting the complex between said tumor-associated antigen or said part thereof and an MHC molecule. T lymphocytes may additionally be detected by detecting their proliferation, their cytokine production, and their cytotoxic activity triggered by specific stimulation with a complex of an MHC molecule and a tumor-associated antigen or a part thereof. T lymphocytes may also be detected with aid of a recombinant MHC molecule or a complex of two or more MHC molecules loaded with immunogenic fragments of one or more tumor-associated antigens.
[0052] An agent which is used for detection or determining the quantity in the methods of the present technology such as a oligo- or polynucleotide probe, an antibody, a protein or peptide or a cell is preferably labeled in a detectable manner, in particular by a detectable marker such as a radioactive marker or an enzymic marker.
[0053] In a particular aspect, the present technology relates to a method of treating, preventing, diagnosing or monitoring a disease characterized by expression or abnormal expression of a tumor-associated antigen identified according to the present technology, which method comprises administering an antibody which binds to said tumor-associated antigen or to a part thereof and which is coupled to a therapeutic or diagnostic agent. The antibody may be a monoclonal antibody. In further embodiments, the antibody is a chimeric or humanized antibody or a fragment of an antibody.
[0054] In certain embodiments, the methods of the present technology of diagnosing or monitoring a disease are performed with a biological sample containing or suspected of containing disseminating tumor cells or metastatic tumor cells. Such biological samples include, for example, blood, serum, bone marrow, sputum, bronchial aspirate, and/or bronchial lavage. Preferably, the methods of the present technology of diagnosing or monitoring a disease are performed with a biological sample not containing placental cells and, in particular, being a non-placenta biological sample isolated from a subject.
[0055] In one particular aspect, the present technology relates to a method of treating a patient having a disease characterized by expression or abnormal expression of a tumor-associated antigen identified according to the present technology, which method comprises (i) providing a sample containing immunoreactive cells, either obtained from said patient or from another individual of the same species, in particular a healthy individual, or an individual of a different species, (ii) contacting said sample with a host cell expressing said tumor-associated antigen or a part thereof, under conditions which favor production of cytolytic T cells against said tumor-associated antigen or a part thereof, and (iii) introducing the cytolytic T cells into the patient in an amount suitable for lysing cells expressing the tumor-associated antigen or a part thereof. In one embodiment, the method includes cloning of the T cell receptor of cytolytic T cells obtained and transferring the nucleic acid coding for the T cell receptor to T cells, either obtained from said patient or from another individual of the same species, in particular a healthy individual, or an individual of a different species, which T cells thus receive the desired specificity and, as under (iii), may be introduced into the patient.
[0056] In one embodiment, the host cell endogenously expresses an MHC molecule. In a further embodiment, the host cell recombinantly expresses an MHC molecule and/or the tumor-associated antigen or the part thereof. Preferably, the host cell presents the tumor-associated antigen or the part thereof by MHC molecules on its surface. The host cell is preferably nonproliferative. In a preferred embodiment, the host cell is an antigen-presenting cell, in particular a dendritic cell, a monocyte or a macrophage.
[0057] The present technology also relates to a method of treating a disease characterized by expression or abnormal expression of a tumor-associated antigen identified according to the present technology, which method comprises (i) identifying cells from the patient which express abnormal amounts of the tumor-associated antigen, (ii) isolating a sample of said cells, (iii) culturing said cells, and (iv) introducing said cells into the patient in an amount suitable for triggering an immune response to the cells.
[0058] The present technology furthermore relates to a nucleic acid selected from the group consisting of (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-540, 541, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c).
[0059] In a further aspect, the present technology relates to a recombinant nucleic acid molecule, in particular DNA or RNA molecule, which comprises a nucleic acid of the present technology.
[0060] The present technology also relates to host cells which contain a nucleic acid or recombinant nucleic acid molecule of the present technology.
[0061] The host cell may also comprise a nucleic acid coding for a MHC molecule. In one embodiment, the host cell endogenously expresses the MHC molecule. In a further embodiment, the host cell recombinantly expresses the MHC molecule and/or the nucleic acid or recombinant nucleic acid molecule of the present technology or a part thereof. Preferably, the host cell is nonproliferative. In a preferred embodiment, the host cell is an antigen-presenting cell, in particular a dendritic cell, a monocyte or a macrophage.
[0062] In a further embodiment, the present technology relates to oligonucleotides which hybridize with a nucleic acid identified according to the present technology and which may be used as genetic probes or as "antisense" molecules. Nucleic acid molecules in the form of oligonucleotide primers or competent probes, which hybridize with a nucleic acid identified according to the present technology or parts thereof, may be used for detecting said nucleic acid and/or finding nucleic acids which are homologous to said nucleic acid identified according to the present technology, e.g. by PCR amplification, Southern and Northern hybridization. Hybridization may be carried out under low stringency, more preferably under medium stringency and most preferably under high stringency conditions.
[0063] In a further aspect, the present technology relates to a protein or peptide which is encoded by a nucleic acid selected from the group consisting of (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-540, 541, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c). In a preferred embodiment, the protein or peptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 542, 546, 550, 554, 567, 571, 584, 588, 592, 596, 603, 607, 614, 621, and 625 of the sequence listing, a part or derivative thereof.
[0064] In a further aspect, the present technology relates to an immunogenic fragment of a tumor-associated antigen identified according to the present technology. Said fragment preferably binds to a MHC molecule or an antibody, preferably to a human HLA receptor or a human antibody. According to the present technology, a part or fragment preferably comprises a sequence of at least 5, at least 6, in particular at least 8, at least 10, at least 12, at least 15, at least 20, at least 30 or at least 50, amino acids.
[0065] In a further aspect, the present technology relates to an agent which binds to a tumor-associated antigen identified according to the present technology or to a part thereof. In a preferred embodiment, the agent is a protein or peptide, in particular an antibody, a T cell receptor or an MHC molecule. In further embodiments, the antibody is a monoclonal, chimeric, or humanized antibody, an antibody produced by combinatory techniques, or a fragment of an antibody. In one preferred embodiment, the present technology relates to an antibody which binds selectively to a complex of (i) a tumor-associated antigen identified according to the present technology or a part thereof and (ii) an MHC molecule to which said tumor-associated antigen identified according to the present technology or said part thereof binds, with said antibody not binding to (i) or (ii) alone.
[0066] According to the present technology, the term "binding" preferably relates to a specific binding. "Specific binding" means that an agent such as an antibody binds stronger to a target such as an epitope for which it is specific compared to the binding to another target. An agent binds stronger to a first target compared to a second target if it binds to the first target with a dissociation constant (K.sub.D) which is lower than the dissociation constant for the second target. Preferably the dissociation constant (K.sub.D) for the target to which the agent binds specifically is more than 10-fold, preferably more than 20-fold, more preferably more than 50-fold, even more preferably more than 100-fold, 200-fold, 500-fold or 1000-fold lower than the dissociation constant (K.sub.D) for the target to which the agent does not bind specifically.
[0067] Such specific antibodies may, for example, be obtained by immunization using the aforementioned peptides.
[0068] The present technology furthermore relates to a conjugate between an agent of the present technology which binds to a tumor-associated antigen identified according to the present technology or to a part thereof or an antibody of the present technology and a therapeutic or diagnostic agent. In one embodiment, the therapeutic or diagnostic agent is a toxin.
[0069] In a further aspect, the present technology relates to a kit for detecting a disease characterized by expression or abnormal expression of one of more tumor-associated nucleic acids identified according to the present technology, preferably also resulting in expression or abnormal expression of one of more tumor-associated antigens identified according to the present technology, preferably a neoplastic disease, in particular cancer, which kit comprises agents for detection or determining the quantity (i) of the tumor-associated nucleic acid or of a part thereof, (ii) of the tumor-associated antigen or of a part thereof, (iii) of antibodies which bind to the tumor-associated antigen or to a part thereof, and/or (iv) of T cells which are specific for the tumor-associated antigen or a part thereof or a complex thereof with an MHC molecule. Such agents are described herein above.
[0070] In one embodiment, the present technology relates to a pharmaceutical composition which comprises an agent that (I) inhibits expression or activity of a tumor-associated antigen and/or (II) has tumor-inhibiting activity, and is selective for cells expressing or abnormally expressing a tumor-associated antigen and/or OW when administered, selectively increases the amount of complexes between an MHC molecule and a tumor-associated antigen or a part thereof, the tumor-associated antigen having a sequence encoded by a nucleic acid which is selected from the group consisting of: (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 541, 1-540, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c).
[0071] In another embodiment, the present technology relates to a pharmaceutical composition which comprises one or more components selected from the group consisting of: (i) a tumor-associated antigen or a part thereof, (ii) a nucleic acid which codes for a tumor-associated antigen or a part thereof, (iii) an antibody which binds to a tumor-associated antigen or a part thereof, (iv) an antisense nucleic acid which hybridizes specifically with a nucleic acid coding for a tumor-associated antigen, (v) an siRNA directed against a nucleic acid coding for a tumor-associated antigen, (vi) a host cell which expresses a tumor-associated antigen or a part thereof, and (vii) isolated complexes between a tumor-associated antigen or a part thereof and an MHC molecule, said tumor-associated antigen having a sequence encoded by a nucleic acid which is selected from the group consisting of: (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 541, 1-540, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c).
[0072] In yet another embodiment, the present technology relates to a method of diagnosing or monitoring a cancer disease which comprises detecting or determining the quantity (i) of a tumor-associated nucleic acid or of a part thereof, and/or (ii) of a tumor-associated antigen or of a part thereof, and/or (iii) of an antibody to the tumor-associated antigen or a part thereof and/or (iv) of T lymphocytes which are specific to the tumor-associated antigen or to a part thereof in a biological sample isolated from a patient, said tumor-associated nucleic acid being selected from the group consisting of: (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 541, 1-540, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c), and said tumor-associated antigen having a sequence encoded by a nucleic acid which is selected from said group of nucleic acids.
[0073] In a further embodiment, the present technology relates to a method of treating or preventing a disease characterized by expression or abnormal expression of a tumor-associated antigen which comprises administration of a pharmaceutical composition of the present technology, said tumor-associated antigen having a sequence encoded by a nucleic acid which is selected from the group consisting of: (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 541, 1-540, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c).
[0074] In yet another embodiment, the present technology relates to a method of treating, preventing, diagnosing or monitoring a disease characterized by expression or abnormal expression of a tumor-associated antigen which comprises administering an antibody that binds to said tumor-associated antigen or to a part thereof and is coupled to a therapeutic or diagnostic agent, said tumor-associated antigen having a sequence encoded by a nucleic acid which is selected from the group consisting of: (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 541, 1-540, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c).
[0075] Another embodiment of the present technology relates to a method of treating a patient having a disease characterized by expression or abnormal expression of a tumor-associated antigen which comprises: (i) providing a sample containing immunoreactive cells, (ii) contacting said sample with a host cell expressing said tumor-associated antigen or a part thereof, under conditions which favor production of cytolytic or cytokine-releasing T cells against said tumor-associated antigen or said part thereof, and (iii) introducing the cytolytic or cytokine-releasing T cells into the patient in an amount suitable for lysing cells expressing the tumor-associated antigen or a part thereof, said tumor-associated antigen having a sequence encoded by a nucleic acid which is selected from the group consisting of: (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 541, 1-540, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c).
[0076] An additional embodiment of the present technology relates to a method of inhibiting the development of cancer in a patient which comprises administering an effective amount of a pharmaceutical composition of the present technology.
[0077] In yet another embodiment, the present technology relates to an agent, which binds specifically to a protein or polypeptide or to a part thereof, said protein or polypeptide being encoded by a nucleic acid selected from the group consisting of: (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 541, 1-540, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c).
[0078] In an additional embodiment, the present technology relates to an antibody, which binds selectively to a complex of: (i) a protein or polypeptide or a part thereof and (ii) an MHC molecule to which said protein or polypeptide or said part thereof binds, with said antibody not binding to (i) or (ii) alone and said protein or polypeptide being encoded by a nucleic acid selected from the group consisting of: (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 541, 1-540, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c).
[0079] In yet another embodiment, the present technology relates to a kit for detecting cancer, which comprises agents for detecting or determining the quantity of (i) of a tumor-associated nucleic acid or of a part thereof, and/or (ii) of a tumor-associated antigen or of a part thereof, and/or (iii) of antibodies which bind to the tumor-associated antigen or to a part thereof, and/or (iv) of T cells which are specific for a complex between the tumor-associated antigen or a part thereof and an MHC molecule, said tumor-associated nucleic acid being selected from the group consisting of: (a) a nucleic acid which comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 541, 1-540, 545, 549, 553, 557, 560, 563, 566, 570, 574, 577, 580, 583, 587, 591, 595, 599, 602, 606, 610, 613, 617, 620, and 624, a part or derivative thereof, (b) a nucleic acid which hybridizes with the nucleic acid of (a) under stringent conditions, (c) a nucleic acid which is degenerate with respect to the nucleic acid of (a) or (b), and (d) a nucleic acid which is complementary to the nucleic acid of (a), (b) or (c), and said tumor-associated antigen having a sequence encoded by a nucleic acid which is selected from said group of nucleic acids.
BRIEF DESCRIPTION OF SEVERAL VIEWS OF THE DRAWINGS
[0080] FIG. 1. Expression of a tumor-associated nucleic acid identified according to the present technology in normal tissues and cancer tissue. Significant expression of the nucleic acid sequence according to SEQ ID NO:540 was found only in placenta tissue and mamma carcinomas.
[0081] FIG. 2. Quantitative expression of a tumor-associated nucleic acid identified according to the present technology in normal tissues and cancer tissue. Quantitative RT-PCR showed selective expression of the nucleic acid sequence according to SEQ ID NO:540 in placenta tissue and mamma carcinomas.
[0082] FIG. 3. Quantitative expression of SEQ ID NO:540 mRNA in MCF-7 breast cancer cells. Real-time RT-PCR 24 h after transfection with siRNA oligos showed that both SEQ ID NO:540-specific siRNAs (siRNA#1 (SEQ ID NO:630, 631), siRNA#2 (SEQ ID NO:632, 633)) induce robust silencing of SEQ ID NO:540 expression.
[0083] FIG. 4. Silencing of SEQ ID NO:540 expression by transfection with siRNA oligos results in impaired proliferation of MCF-7 breast cancer cells. Proliferation was quantified 96 h after transfection with siRNAs by measuring incorporation of BrdU in newly synthesized DNA strands. These results show that SEQ ID NO:540 is a positive factor for the proliferation of breast cancer cells.
[0084] FIG. 5. Quantitative expression of SEQ ID NO:541 in normal tissues and cancer tissue. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:541 in lung cancer.
[0085] FIG. 6. Quantitative expression of SEQ ID NO:545 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:545 in malignant melanomas.
[0086] FIG. 7. Quantitative expression of SEQ ID NO:549 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:549 in ovarian cancer.
[0087] FIG. 8. Quantitative expression of SEQ ID NO:553 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:553 in colon cancer and ovarian cancer.
[0088] FIG. 9. Quantitative expression of SEQ ID NO:557 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:557 in breast cancer.
[0089] FIG. 10. Quantitative expression of SEQ ID NO:560 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:560 in colon cancer and ovarian cancer.
[0090] FIG. 11. Quantitative expression of SEQ ID NO:563 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:563 in breast cancer, colon cancer, ovarian cancer, lung cancer and melanoma.
[0091] FIG. 12. Quantitative expression of SEQ ID NO:566 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:566 in gastric cancer, breast cancer, colon cancer, ovarian cancer, lung cancer and melanoma.
[0092] FIG. 13. Quantitative expression of SEQ ID NO:570 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:570 in ovarian cancer, lung cancer and melanoma.
[0093] FIG. 14. Quantitative expression of SEQ ID NO:574 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:574 in lung cancer and melanoma.
[0094] FIG. 15. Quantitative expression of SEQ ID NO:577 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:577 in gastric cancer, breast cancer and lung cancer.
[0095] FIG. 16. Quantitative expression of SEQ ID NO:580 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:580 in ovarian cancer and lung cancer.
[0096] FIG. 17. Quantitative expression of SEQ ID NO:583 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:583 in colon cancer, ovarian cancer and lung cancer.
[0097] FIG. 18. Quantitative expression of SEQ ID NO:587 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:587 in lung cancer.
[0098] FIG. 19. Quantitative expression of SEQ ID NO:591 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:591 in breast cancer, colon cancer, ovarian cancer, lung cancer and melanoma.
[0099] FIG. 20. Quantitative expression of SEQ ID NO:595 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:595 in gastric cancer, colon cancer, ovarian cancer, lung cancer and melanoma.
[0100] FIG. 21. Quantitative expression of SEQ ID NO:599 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:599 in gastric cancer, breast cancer, lung cancer and melanoma.
[0101] FIG. 22. Quantitative expression of SEQ ID NO:602 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:602 in ovarian cancer and lung cancer.
[0102] FIG. 23. Quantitative expression of SEQ ID NO:606 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:606 in gastric cancer, colon cancer and lung cancer.
[0103] FIG. 24. Quantitative expression of SEQ ID NO:610 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:610 in gastric cancer, breast cancer and lung cancer.
[0104] FIG. 25. Quantitative expression of SEQ ID NO:613 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:613 in breast cancer, lung cancer and melanoma.
[0105] FIG. 26. Quantitative expression of SEQ ID NO:617 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:617 in lung cancer and melanoma.
[0106] FIG. 27. Quantitative expression of SEQ ID NO:620 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:620 in ovarian cancer and melanoma.
[0107] FIG. 28. Quantitative expression of SEQ ID NO:624 in normal tissues and cancer tissues. Real-time RT-PCR showed overexpression of the nucleic acid sequence according to SEQ ID NO:624 in gastric cancer and lung cancer.
DETAILED DESCRIPTION OF THE INVENTION
[0108] A reference herein to a range of numerical values is to be understood so as to specify and mention each of the individual numerical values comprised by said range. For example, a reference to SEQ ID NOs: 1-540 is to be understood so as to refer to each and every of the following individual SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, and 540.
[0109] According to the present technology, a "reference" such as a reference sample or reference organism may be used to correlate and compare the results obtained in the methods of the present technology from a test sample or test organism, i.e. a patient. Typically the reference organism is a healthy organism, in particular an organism which does not suffer from cancer.
[0110] A "reference value" can be determined from a reference empirically by measuring a sufficiently large number of references. Preferably the reference value is determined by measuring at least 2, preferably at least 3, preferably at least 5, preferably at least 8, preferably at least 12, preferably at least 20, preferably at least 30, preferably at least 50, or preferably at least 100 references.
[0111] According to the present technology, a nucleic acid is preferably deoxyribonucleic acid (DNA) or ribonucleic acid (RNA). Nucleic acids comprise according to the present technology genomic DNA, cDNA, mRNA, recombinantly produced and chemically synthesized molecules. According to the present technology, a nucleic acid may be present as a single-stranded or double-stranded and linear or covalently circularly closed molecule.
[0112] The terms "tumor-associated nucleic acid identified according to the present technology" and "nucleic acid encoding a tumor-associated antigen identified according to the present technology" have similar meanings. However, the different terms are used herein to account for the fact that in some embodiments only the expression of nucleic acid, in particular mRNA, is of relevance while the expression of protein is not a critical factor.
[0113] As used herein, the term "RNA" means a molecule comprising at least one ribonucleotide residue. By "ribonucleotide" is meant a nucleotide with a hydroxyl group at the 2'-position of a beta-D-ribo-furanose moiety. The term includes double stranded RNA, single stranded RNA, isolated RNA such as partially purified RNA, essentially pure RNA, synthetic RNA, recombinantly produced RNA, as well as altered RNA that differs from naturally occurring RNA by the addition, deletion, substitution and/or alteration of one or more nucleotides. Such alterations can include addition of non-nucleotide material, such as to the end(s) of a RNA or internally, for example at one or more nucleotides of the RNA. Nucleotides in RNA molecules can also comprise non-standard nucleotides, such as non-naturally occurring nucleotides or chemically synthesized nucleotides or deoxynucleotides. These altered RNAs can be referred to as analogs or analogs of naturally-occurring RNA.
[0114] If reference is made herein to the detection of or the determination of the quantity of a nucleic acid, the nucleic acid which is actually to be detected or the quantity of which is actually to be determined is preferably mRNA. However, it should be understood that this may also include embodiments wherein mRNA is detected or the quantity of mRNA is determined indirectly. For example, mRNA may be transformed into cDNA and the cDNA detected or its quantity determined. mRNA is given herein as the cDNA equivalent. One skilled in the art would understand that the cDNA sequence is equivalent to the mRNA sequence, and can be used for the same purpose herein, e.g., the generation of probes hybridizing to the nucleic acid to be detected. Thus, if reference is made herein to the sequences shown in the sequence listing this is also to include the RNA equivalents of said sequences.
[0115] The nucleic acids described according to the present technology have preferably been isolated. The term "isolated nucleic acid" means according to the present technology that the nucleic acid was (i) amplified in vitro, for example by polymerase chain reaction (PCR), (ii) recombinantly produced by cloning, (iii) purified, for example by cleavage and gel-electrophoretic fractionation, or (iv) synthesized, for example by chemical synthesis. An isolated nucleic acid is a nucleic acid which is available for manipulation by recombinant DNA techniques.
[0116] A degenerate nucleic acid according to the present technology is a nucleic acid that differs from a reference nucleic acid in codon sequence due to the degeneracy of the genetic code.
[0117] "Derivative" of a nucleic acid means according to the present technology that single or multiple such as at least 2, at least 4, or at least 6 and preferably up to 3, up to 4, up to 5, up to 6, up to 10, up to 15, or up to 20 nucleotide substitutions, deletions and/or additions are present in said nucleic acid. Furthermore, the term "derivative" also comprises chemical derivatization of a nucleic acid on a nucleotide base, on the sugar or on the phosphate. The term "derivative" also comprises nucleic acids which contain nucleotides and nucleotide analogs not occurring naturally.
[0118] Preferably the degree of identity between a specific nucleic acid sequence described herein and a nucleic acid sequence which is a derivative of said specific nucleic acid sequence, which hybridizes with said specific nucleic acid sequence and/or which is degenerate with respect to said specific nucleic acid sequence will be at least 70%, preferably at least 75%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90% or most preferably at least 95%, 96%, 97%, 98% or 99%. The degree of identity is preferably given for a region of at least about 30, at least about 50, at least about 70, at least about 90, at least about 100, at least about 150, at least about 200, at least about 250, at least about 300, or at least about 400 nucleotides. In preferred embodiments, the degree of identity is given for the entire length of the reference nucleic acid sequence, such as the nucleic acid sequences given in the sequence listing.
[0119] A nucleic acid is "complementary" to another nucleic acid if the two sequences are capable of hybridizing and forming a stable duplex with one another, with hybridization preferably being carried out under conditions which allow specific hybridization between polynucleotides (stringent conditions). Stringent conditions are described, for example, in Molecular Cloning: A Laboratory Manual, J. Sambrook et al., Editors, 2nd Edition, Cold Spring Harbor Laboratory press, Cold Spring Harbor, N.Y., 1989 or Current Protocols in Molecular Biology, F. M. Ausubel et al., Editors, John Wiley & Sons, Inc., New York and refer, for example, to hybridization at 65.degree. C. in hybridization buffer (3.5.times.SSC, 0.02% Ficoll, 0.02% polyvinylpyrrolidone, 0.02% bovine serum albumin, 2.5 mM NaH.sub.2PO.sub.4 (pH 7), 0.5% SDS, 2 mM EDTA). SSC is 0.15 M sodium chloride/0.15 M sodium citrate, pH 7. After hybridization, the membrane to which the DNA has been transferred is washed, for example, in 2.times.SSC at room temperature and then in 0.1-0.5.times.SSC/0.1.times.SDS at temperatures of up to 68.degree. C.
[0120] A percent complementarity indicates the percentage of contiguous residues in a nucleic acid molecule that can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). "Perfectly complementary" or "fully complementary" means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. Preferably, the degree of complementarity according to the present technology is at least 70%, preferably at least 75%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90% or most preferably at least 95%, 96%, 97%, 98% or 99%.
[0121] "Sequence similarity" indicates the percentage of amino acids that either are identical or that represent conservative amino acid substitutions. "Sequence identity" between two polypeptide or nucleic acid sequences indicates the percentage of amino acids or nucleotides that are identical between the sequences.
[0122] The term "percentage identity" is intended to denote a percentage of nucleotides or of amino acid residues which are identical between the two sequences to be compared, obtained after the best alignment, this percentage being purely statistical and the differences between the two sequences being distributed randomly and over their entire length. Sequence comparisons between two nucleotide or amino acid sequences are conventionally carried out by comparing these sequences after having aligned them optimally, said comparison being carried out by segment or by "window of comparison" in order to identify and compare local regions of sequence similarity. The optimal alignment of the sequences for comparison may be produced, besides manually, by means of the local homology algorithm of Smith and Waterman, 1981, Ads App. Math. 2, 482, by means of the local homology algorithm of Neddleman and Wunsch, 1970, J. Mol. Biol. 48, 443, by means of the similarity search method of Pearson and Lipman, 1988, Proc. Natl Acad. Sci. USA 85, 2444, or by means of computer programs which use these algorithms (GAP, BESTFIT, FASTA, BLAST P, BLAST N and TFASTA in Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Drive, Madison, Wis.).
[0123] The percentage identity is calculated by determining the number of identical positions between the two sequences being compared, dividing this number by the number of positions compared and multiplying the result obtained by 100 so as to obtain the percentage identity between these two sequences.
[0124] In one embodiment, a nucleic acid sequence which is a derivative of a specific nucleic acid sequence, which is degenerate with respect to a specific nucleic acid sequence or which is a part of a specific nucleic acid sequence has a relevant function and/or activity of the specific nucleic acid sequence, i.e. it may encode a protein or peptide having the same activity or immunological properties as the protein or peptide encoded by the specific nucleic acid sequence and, in one embodiment, encodes the same protein or peptide.
[0125] Nucleic acids coding for tumor-associated antigens may, according to the present technology, be present alone or in combination with other nucleic acids, in particular heterologous nucleic acids. In preferred embodiments, a nucleic acid is functionally linked to expression control sequences or regulatory sequences which may be homologous or heterologous with respect to said nucleic acid. A coding sequence and a regulatory sequence are "functionally" linked to one another, if they are covalently linked to one another in such a way that expression or transcription of said coding sequence is under the control or under the influence of said regulatory sequence. If the coding sequence is to be translated into a functional protein, then, with a regulatory sequence functionally linked to said coding sequence, induction of said regulatory sequence results in transcription of said coding sequence, without causing a frame shift in the coding sequence or said coding sequence not being capable of being translated into the desired protein or peptide.
[0126] The term "expression control sequence" or "regulatory sequence" comprises according to the present technology promoters, enhancers and other control elements which regulate expression of a gene. In particular embodiments of the present technology, the expression control sequences can be regulated. The exact structure of regulatory sequences may vary as a function of the species or cell type, but generally comprises 5'untranscribed and 5'untranslated sequences which are involved in initiation of transcription and translation, respectively, such as TATA box, capping sequence, CAAT sequence, and the like. More specifically, 5'untranscribed regulatory sequences comprise a promoter region which includes a promoter sequence for transcriptional control of the functionally linked gene. Regulatory sequences may also comprise enhancer sequences or upstream activator sequences.
[0127] According to the present technology, a nucleic acid may furthermore be present in combination with another nucleic acid which codes for a peptide controlling secretion of the protein or peptide encoded by said nucleic acid from a host cell. According to the present technology, a nucleic acid may also be present in combination with another nucleic acid which codes for a peptide causing the encoded protein or peptide to be anchored on the cell membrane of the host cell or compartmentalized into particular organelles of said cell. Similarly, a combination with a nucleic acid is possible which represents a reporter gene or any "tag".
[0128] In a preferred embodiment, a recombinant nucleic acid molecule is according to the present technology a vector, where appropriate with a promoter, which controls expression of a nucleic acid, for example a nucleic acid coding for a tumor-associated antigen identified according to the present technology. The term "vector" is used here in its most general meaning and comprises any intermediary vehicle for a nucleic acid which enables said nucleic acid, for example, to be introduced into prokaryotic and/or eukaryotic cells and, where appropriate, to be integrated into a genome. Vectors of this kind are preferably replicated and/or expressed in the cells. An intermediary vehicle may be adapted, for example, to the use in electroporation, in bombardment with microprojectiles, in liposomal administration, in the transfer with the aid of agrobacteria or in insertion via DNA or RNA viruses. Vectors comprise plasmids, phagemids, bacteriophages or viral genomes.
[0129] The nucleic acids coding for a tumor-associated antigen identified according to the present technology may be used for transfection of host cells. Nucleic acids here mean both recombinant DNA and RNA. Recombinant RNA may be prepared by in-vitro transcription of a DNA template. Furthermore, it may be modified by stabilizing sequences, capping and polyadenylation prior to application.
[0130] According to the present technology, the term "host cell" relates to any cell which can be transformed or transfected with an exogenous nucleic acid. The term "host cells" comprises according to the present technology prokaryotic (e.g. E. coli) or eukaryotic cells (e.g. dendritic cells, B cells, CHO cells, COS cells, K562 cells, yeast cells and insect cells). Particular preference is given to mammalian cells such as cells from humans, mice, hamsters, pigs, goats, primates. The cells may be derived from a multiplicity of tissue types and comprise primary cells and cell lines. Specific examples comprise keratinocytes, peripheral blood leukocytes, stem cells of the bone marrow and embryonic stem cells. In further embodiments, the host cell is an antigen-presenting cell, in particular a dendritic cell, monocyte or a macrophage. A nucleic acid may be present in the host cell in the form of a single copy or of two or more copies and, in one embodiment, is expressed in the host cell.
[0131] According to the present technology, the term "expression" is used in its most general meaning and comprises the production of RNA or of RNA and protein. It also comprises partial expression of nucleic acids. Furthermore, expression may be carried out transiently or stably. Preferred expression systems in mammalian cells comprise pcDNA3.1 and pRc/CMV (Invitrogen, Carlsbad, Calif.), which contain a selectable marker such as a gene imparting resistance to G418 (and thus enabling stably transfected cell lines to be selected) and the enhancer-promoter sequences of cytomegalovirus (CMV).
[0132] In those cases of the present technology in which a MHC molecule presents a tumor-associated antigen or a part thereof, an expression vector may also comprise a nucleic acid sequence coding for said MHC molecule. The nucleic acid sequence coding for the MHC molecule may be present on the same expression vector as the nucleic acid coding for the tumor-associated antigen or the part thereof, or both nucleic acids may be present on different expression vectors. In the latter case, the two expression vectors may be cotransfected into a cell. If a host cell expresses neither the tumor-associated antigen or the part thereof nor the MHC molecule, both nucleic acids coding therefor may be transfected into the cell either on the same expression vector or on different expression vectors. If the cell already expresses the MHC molecule, only the nucleic acid sequence coding for the tumor-associated antigen or the part thereof can be transfected into the cell.
[0133] The present technology also comprises kits for detection and/or determination of the quantity of nucleic acids. Such kits comprise, for example, a pair of amplification primers which hybridize to the nucleic acid which is to be detected or the amount of which is to be determined. The primers preferably comprise a sequence of 6-50, in particular 10-30, 15-30 and 20-30 contiguous nucleotides of the nucleic acid and are nonoverlapping, in order to avoid the formation of primer dimers. One of the primers will hybridize to one strand of the nucleic acid, and the other primer will hybridize to the complementary strand in an arrangement which allows amplification of the nucleic acid.
[0134] "Antisense molecules" or "antisense nucleic acids" may be used for regulating, in particular reducing, expression of a nucleic acid. The term "antisense molecule" or "antisense nucleic acid" refers according to the present technology to an oligonucleotide which is an oligoribonucleotide, oligodeoxyribonucleotide, modified oligoribonucleotide or modified oligodeoxyribonucleotide and which hybridizes under physiological conditions to DNA comprising a particular gene or to mRNA of said gene, thereby inhibiting transcription of said gene and/or translation of said mRNA. According to the present technology, an "antisense molecule" also comprises a construct which contains a nucleic acid or a part thereof in reverse orientation with respect to its natural promoter. An antisense transcript of a nucleic acid or of a part thereof may form a duplex with naturally occurring mRNA and thus prevent accumulation of or translation of the mRNA. Another possibility is the use of ribozymes for inactivating a nucleic acid.
[0135] Antisense oligonucleotides preferred according to the present technology have a sequence of 6-50, in particular 10-30, 15-30 and 20-30, contiguous nucleotides of the target nucleic acid and preferably are fully complementary to the target nucleic acid or to a part thereof.
[0136] In preferred embodiments, the antisense oligonucleotide hybridizes with an N-terminal or 5' upstream site such as a translation initiation site, transcription initiation site or promoter site. In further embodiments, the antisense oligonucleotide hybridizes with a 3'untranslated region or mRNA splicing site.
[0137] In one embodiment, an oligonucleotide of the present technology consists of ribonucleotides, deoxyribonucleotides or a combination thereof, with the 5' end of one nucleotide and the 3' end of another nucleotide being linked to one another by a phosphodiester bond. These oligonucleotides may be synthesized in the conventional manner or produced recombinantly.
[0138] In preferred embodiments, an oligonucleotide of the present technology is a "modified" oligonucleotide. Here, the oligonucleotide may be modified in very different ways, without impairing its ability to bind its target, in order to increase, for example, its stability or therapeutic efficacy. According to the present technology, the term "modified oligonucleotide" means an oligonucleotide in which (i) at least two of its nucleotides are linked to one another by a synthetic internucleoside bond (i.e. an internucleoside bond which is not a phosphodiester bond) and/or (ii) a chemical group which is usually not found in nucleic acids is covalently linked to the oligonucleotide. Preferred synthetic internucleoside bonds are phosphorothioates, alkyl phosphonates, phosphorodithioates, phosphate esters, alkyl phosphonothioates, phosphoramidates, carbamates, carbonates, phosphate triesters, acetamidates, carboxymethyl esters and peptides.
[0139] The term "modified oligonucleotide" also comprises oligonucleotides having a covalently modified base and/or sugar. "Modified oligonucleotides" comprise, for example, oligonucleotides with sugar residues which are covalently bound to low molecular weight organic groups other than a hydroxyl group at the 3' position and a phosphate group at the 5' position. Modified oligonucleotides may comprise, for example, a 2'-O-alkylated ribose residue or another sugar instead of ribose, such as arabinose.
[0140] It is to be understood that all embodiments described above with respect to oligonucleotides may also apply to polynucleotides.
[0141] By "small interfering RNA" or "siRNA" as used herein is meant an isolated RNA molecule, preferably greater than 10 nucleotides in length, more preferably greater than 15 nucleotides in length, and most preferably 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in length that is used to identify a target gene or mRNA to be degraded. A range of 19-25 nucleotides is the most preferred size for siRNAs.
[0142] siRNA according to the present technology can comprise partially purified RNA, substantially pure RNA, synthetic RNA, or recombinantly produced RNA, as well as altered RNA that differs from naturally-occurring RNA by the addition, deletion, substitution and/or alteration of one or more nucleotides. Such alterations can include addition of non-nucleotide material, such as to the end(s) of the siRNA or to one or more internal nucleotides of the siRNA; modifications that make the siRNA resistant to nuclease digestion (e. g., the use of 2'-substituted ribonucleotides or modifications to the sugar-phosphate backbone); or the substitution of one or more nucleotides in the siRNA with deoxyribonucleotides. Furthermore, siRNA may be modified to increase the stability thereof as described above for modified oligonucleotides, in particular by introducing one or more phosphorothioate linkages.
[0143] One or both strands of the siRNA can also comprise a 3'-overhang. As used herein, a "3'-overhang" refers to at least one unpaired nucleotide extending from the 3'-end of an RNA strand. Thus in one embodiment, the siRNA comprises at least one 3'-overhang of from 1 to about 6 nucleotides (which includes ribonucleotides or deoxynucleotides) in length, preferably from 1 to about 5 nucleotides in length, more preferably from 1 to about 4 nucleotides in length, and particularly preferably from about 2 to about 4 nucleotides in length. In the embodiment in which both strands of the siRNA molecule comprise a 3'-overhang, the length of the overhangs can be the same or different for each strand. In a most preferred embodiment, the 3'-overhang is present on both strands of the siRNA, and is 2 nucleotides in length. For example, each strand of the siRNA of the present technology can comprise 3'-overhangs of dideoxythymidylic acid ("TT") or diuridylic acid ("uu").
[0144] In order to enhance the stability of the siRNA, the 3'-overhangs can be also stabilized against degradation. In one embodiment, the overhangs are stabilized by including purine nucleotides, such as adenosine or guanosine nucleotides. Alternatively, substitution of pyrimidine nucleotides by modified analogues, e.g., substitution of uridine nucleotides in the 3'-overhangs with 2'-deoxythymidine, is tolerated and does not affect the efficiency of RNAi degradation. In particular, the absence of a 2'-hydroxyl in the 2'-deoxythymidine significantly enhances the nuclease resistance of the 3'-overhang in tissue culture medium.
[0145] The sense and antisense strands of the siRNA can comprise two complementary, single-stranded RNA molecules or can comprise a single molecule in which two complementary portions are base-paired and are covalently linked by a single-stranded "hairpin" area. That is, the sense region and antisense region can be covalently connected via a linker molecule. The linker molecule can be a polynucleotide or non-nucleotide linker. Without wishing to be bound by any theory, it is believed that the hairpin area of the latter type of siRNA molecule is cleaved intracellularly by the "Dicer" protein (or its equivalent) to form a siRNA of two individual base-paired RNA molecules.
[0146] As used herein, "target mRNA" refers to an RNA molecule that is a target for downregulation.
[0147] siRNA can be expressed from pol III expression vectors without a change in targeting site, as expression of RNAs from pol III promoters is only believed to be efficient when the first transcribed nucleotide is a purine.
[0148] siRNA according to the present technology can be targeted to any stretch of approximately 19-25 contiguous nucleotides in any of the target mRNA sequences (the "target sequence"). Techniques for selecting target sequences for siRNA are given, for example, in Tuschl T. et al., "The siRNA User Guide", revised Oct. 11, 2002, the entire disclosure of which is herein incorporated by reference. "The siRNA User Guide" is available on the world wide web at a website maintained by Dr. Thomas Tuschl, Laboratory of RNA Molecular Biology, Rockefeller University, New York, USA, and can be found by accessing the website of the Rockefeller University and searching with the keyword "siRNA". Thus, the sense strand of the present siRNA comprises a nucleotide sequence substantially identical to any contiguous stretch of about 19 to about 25 nucleotides in the target mRNA.
[0149] Generally, a target sequence on the target mRNA can be selected from a given cDNA sequence corresponding to the target mRNA, preferably beginning 50 to 100 nt downstream (i.e., in the 3'-direction) from the start codon. The target sequence can, however, be located in the 5'- or 3'-untranslated regions, or in the region nearby the start codon.
[0150] siRNA can be obtained using a number of techniques known to those of skill in the art. For example, siRNA can be chemically synthesized or recombinantly produced using methods known in the art, such as the Drosophila in vitro system described in U.S. published application 2002/0086356 of Tuschl et al., the entire disclosure of which is herein incorporated by reference.
[0151] Preferably, siRNA is chemically synthesized using appropriately protected ribonucleoside phosphoramidites and a conventional DNA/RNA synthesizer. siRNA can be synthesized as two separate, complementary RNA molecules, or as a single RNA molecule with two complementary regions.
[0152] Alternatively, siRNA can also be expressed from recombinant circular or linear DNA plasmids using any suitable promoter. Such embodiments are included according to the present technology when reference is made herein to the administration of siRNA or the incorporation of siRNA into pharmaceutical compositions. Suitable promoters for expressing siRNA of the present technology from a plasmid include, for example, the U6 or Hi RNA pol III promoter sequences and the cytomegalovirus promoter.
[0153] Selection of other suitable promoters is within the skill in the art. The recombinant plasmids of the present technology can also comprise inducible or regulatable promoters for expression of the siRNA in a particular tissue or in a particular intracellular environment.
[0154] The siRNA expressed from recombinant plasmids can either be isolated from cultured cell expression systems by standard techniques, or can be expressed intracellularly. The use of recombinant plasmids to deliver siRNA to cells in vivo is discussed in more detail below. siRNA can be expressed from a recombinant plasmid either as two separate, complementary RNA molecules, or as a single RNA molecule with two complementary regions.
[0155] Selection of plasmids suitable for expressing siRNA, methods for inserting nucleic acid sequences for expressing the siRNA into the plasmid, and methods of delivering the recombinant plasmid to the cells of interest are within the skill in the art.
[0156] siRNA can also be expressed from recombinant viral vectors intracellularly in vivo. The recombinant viral vectors comprise sequences encoding the siRNA and any suitable promoter for expressing the siRNA sequences. The recombinant viral vectors can also comprise inducible or regulatable promoters for expression of the siRNA in a particular tissue or in a particular intracellular environment. siRNA can be expressed from a recombinant viral vector either as two separate, complementary RNA molecules, or as a single RNA molecule with two complementary regions.
[0157] The term "peptide" comprises oligo- and polypeptides and refers to substances comprising two or more, preferably 3 or more, preferably 4 or more, preferably 6 or more, preferably 8 or more, preferably 10 or more, preferably 13 or more, preferably 16 more, preferably 21 or more and up to preferably 8, 10, 20, 30, 40 or 50, in particular 100 amino acids joined covalently by peptide bonds. The term "protein" refers to large peptides, preferably to peptides with more than 100 amino acid residues, but in general the terms "peptides" and "proteins" are synonyms and are used interchangeably herein.
[0158] Preferably, the proteins and peptides described according to the present technology have been isolated. The terms "isolated protein" or "isolated peptide" mean that the protein or peptide has been separated from its natural environment. An isolated protein or peptide may be in an essentially purified state. The term "essentially purified" means that the protein or peptide is essentially free of other substances with which it is associated in nature or in vivo.
[0159] Such proteins and peptides may be used, for example, in producing antibodies and in an immunological or diagnostic assay or as therapeutics. Proteins and peptides described according to the present technology may be isolated from biological samples such as tissue or cell homogenates and may also be expressed recombinantly in a multiplicity of pro- or eukaryotic expression systems.
[0160] For the purposes of the present technology, "derivatives" of a protein or peptide or of an amino acid sequence comprise amino acid insertion variants, amino acid deletion variants and/or amino acid substitution variants.
[0161] Amino acid insertion variants comprise amino- and/or carboxy-terminal fusions and also insertions of single or two or more amino acids in a particular amino acid sequence. In the case of amino acid sequence variants having an insertion, one or more amino acid residues are inserted into a particular site in an amino acid sequence, although random insertion with appropriate screening of the resulting product is also possible.
[0162] Amino acid deletion variants are characterized by the removal of one or more amino acids from the sequence.
[0163] Amino acid substitution variants are characterized by at least one residue in the sequence being removed and another residue being inserted in its place. Preference is given to the modifications being in positions in the amino acid sequence which are not conserved between homologous proteins or peptides and/or to replacing amino acids with other ones having similar properties.
[0164] "Conservative substitutions" may be made, for instance, on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues involved. For example: (a) nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; (b) polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; (c) positively charged (basic) amino acids include arginine, lysine, and histidine; and (d) negatively charged (acidic) amino acids include aspartic acid and glutamic acid. Substitutions typically may be made within groups (a)-(d). In addition, glycine and proline may be substituted for one another based on their ability to disrupt .alpha.-helices. Some preferred substitutions may be made among the following groups: (i) S and T; (ii) P and G; and (iii) A, V, L and I. Given the known genetic code, and recombinant and synthetic DNA techniques, the skilled scientist readily can construct DNAs encoding the conservative amino acid variants.
[0165] Preferably the degree of similarity, preferably identity between a specific amino acid sequence described herein and an amino acid sequence which is a derivative of said specific amino acid sequence will be at least 70%, preferably at least 80%, preferably at least 85%, even more preferably at least 90% or most preferably at least 95%, 96%, 97%, 98% or 99%. The degree of similarity or identity is given preferably for a region of at least about 20, at least about 40, at least about 60, at least about 80, at least about 100, at least about 120, at least about 140, at least about 160, at least about 200 or 250 amino acids. In preferred embodiments, the degree of similarity or identity is given for the entire length of the reference amino acid sequence.
[0166] In one embodiment, a protein or peptide which is a derivative of a specific protein or peptide or which is a part of a specific protein or peptide has a relevant function and/or activity of the specific protein or peptide, i.e. it may have the same activity or immunological properties as the specific protein or peptide.
[0167] The amino acid variants described above may be readily prepared with the aid of known peptide synthesis techniques such as, for example, by solid phase synthesis (Merrifield, 1964) and similar methods or by recombinant DNA manipulation. The manipulation of DNA sequences for preparing proteins and peptides having substitutions, insertions or deletions, is described in detail in Sambrook et al. (1989), for example.
[0168] According to the present technology, "derivatives" of proteins and peptides also comprise single or multiple substitutions, deletions and/or additions of any molecules associated with the protein or peptide, such as carbohydrates, lipids and/or proteins or peptides. The term "derivative" also extends to all functional chemical equivalents of said proteins and peptides.
[0169] According to the present technology, a part or fragment of a tumor-associated antigen preferably has a functional property of the protein or peptide from which it has been derived. Such functional properties comprise the interaction with antibodies, the interaction with other peptides or proteins, the selective binding of nucleic acids and an enzymatic activity. A particular property is the ability to form a complex with MHC molecules and, where appropriate, generate an immune response, preferably by stimulating cytotoxic or T helper cells. A part or fragment of a tumor-associated antigen preferably comprises a sequence of at least 6, in particular at least 8, at least 10, at least 12, at least 15, at least 20, at least 30 or at least 50, consecutive amino acids of the tumor-associated antigen. A part or fragment of a tumor-associated antigen preferably comprises a sequence of up to 8, in particular up to 10, up to 12, up to 15, up to 20, up to 30 or up to 55, consecutive amino acids of the tumor-associated antigen. A part or fragment of a tumor-associated antigen is preferably a part of the tumor-associated antigen which corresponds to the non-transmembrane portion, in particular the extracellular portion of the antigen, or is comprised thereof.
[0170] Preferred parts or fragments of a tumor-associated antigen are in particular suitable for the stimulation of cytotoxic T-lymphocytes in vivo but also for the production of expanded and stimulated T-lymphocytes for the therapeutic adoptive transfer ex vivo.
[0171] A part or a fragment of a nucleic acid coding for a tumor-associated antigen relates according to the present technology to the part of the nucleic acid, which codes at least for the tumor-associated antigen and/or for a part or a fragment of said tumor-associated antigen, as defined above. A part or fragment of a nucleic acid coding for a tumor-associated antigen is preferably that part of the nucleic acid corresponding to the open reading frame.
[0172] According to the present technology, particular embodiments ought to involve providing "dominant negative" proteins or peptides derived from tumor-associated antigens. A dominant negative protein or peptide is an inactive protein or peptide variant which, by way of interacting with the cellular machinery, displaces an active protein or peptide from its interaction with the cellular machinery or which competes with the active protein or peptide, thereby reducing the effect of said active protein.
[0173] Antisera which contain specific antibodies specifically binding to the target protein can be prepared by various standard processes; see, for example, "Monoclonal Antibodies: A Practical Approach" by Philip Shepherd, Christopher Dean ISBN 0-19-963722-9; "Antibodies: A Laboratory Manual" by Ed Harlow, David Lane, ISBN: 0879693142 and "Using Antibodies: A Laboratory Manual: Portable Protocol NO" by Edward Harlow, David Lane, Ed Harlow ISBN 0879695447. Thereby it is also possible to generate affine and specific antibodies which recognize complex membrane proteins in their native form (Azorsa et al., J. Immunol. Methods 229: 35-48, 1999; Anderson et al., J. Immunol. 143: 1899-1904, 1989; Gardsvoll, J. Immunol. Methods 234: 107-116, 2000). This is in particular relevant for the preparation of antibodies which are to be used therapeutically, but also for many diagnostic applications. In this respect, it is possible to immunize with the whole protein, with extracellular partial sequences as well as with cells which express the target molecule in physiologically folded form.
[0174] Monoclonal antibodies are traditionally prepared using the hybridoma technology. (for technical details see: "Monoclonal Antibodies: A Practical Approach" by Philip Shepherd, Christopher Dean ISBN 0-19-963722-9; "Antibodies: A Laboratory Manual" by Ed Harlow, David Lane ISBN: 0879693142; "Using Antibodies: A Laboratory Manual: Portable Protocol NO" by Edward Harlow, David Lane, Ed Harlow ISBN: 0879695447).
[0175] It is known that only a small part of an antibody molecule, the paratope, is involved in binding of the antibody to its epitope (cf. Clark, W. R. (1986), The Experimental Foundations of Modern Immunology, Wiley & Sons, Inc., New York; Roitt, I. (1991), Essential Immunology, 7th Edition, Blackwell Scientific Publications, Oxford). The pFc' and Fc regions are, for example, effectors of the complement cascade but are not involved in antigen binding. An antibody from which the pFc' region has been enzymatically removed or which has been produced without the pFc' region, referred to as F(ab').sub.2 fragment, carries both antigen binding sites of a complete antibody. Similarly, an antibody from which the Fc region has been enzymatically removed or which has been produced without said Fc region, referred to as Fab fragment, carries one antigen binding site of an intact antibody molecule. Furthermore, Fab fragments consist of a covalently bound light chain of an antibody and part of the heavy chain of said antibody, referred to as Fd. The Fd fragments are the main determinants of antibody specificity (a single Fd fragment can be associated with up to ten different light chains, without altering the specificity of the antibody) and Fd fragments, when isolated, retain the ability to bind to an epitope.
[0176] Located within the antigen-binding part of an antibody are complementary-determining regions (CDRs) which interact directly with the antigen epitope and framework regions (FRs) which maintain the tertiary structure of the paratope. Both the Fd fragment of the heavy chain and the light chain of IgG immunoglobulins contain four framework regions (FR1 to FR4) which are separated in each case by three complementary-determining regions (CDR1 to CDR3). The CDRs and, in particular, the CDR3 regions and, still more particularly, the CDR3 region of the heavy chain are responsible to a large extent for antibody specificity.
[0177] Non-CDR regions of a mammalian antibody are known to be able to be replaced by similar regions of antibodies with the same or a different specificity, with the specificity for the epitope of the original antibody being retained. This made possible the development of "humanized" antibodies in which nonhuman CDRs are covalently linked to human FR and/or Fc/pFc' regions to produce a functional antibody.
[0178] As another example, WO 92/04381 describes the production and use of humanized murine RSV antibodies in which at least part of the murine FR regions have been replaced with FR regions of a human origin. Antibodies of this kind, including fragments of intact antibodies with antigen-binding capability, are often referred to as "chimeric" antibodies.
[0179] According to the present technology, the term "antibody" also includes F(ab').sub.2, Fab, Fv, and Fd fragments of antibodies, chimeric antibodies, in which the Fc and/or FR and/or CDR1 and/or CDR2 and/or light chain-CDR3 regions have been replaced with homologous human or nonhuman sequences, chimeric F(ab').sub.2-fragment antibodies in which the FR and/or CDR1 and/or CDR2 and/or light chain-CDR3 regions have been replaced with homologous human or nonhuman sequences, chimeric Fab-fragment antibodies in which the FR and/or CDR1 and/or CDR2 and/or light chain-CDR3 regions have been replaced with homologous human or nonhuman sequences, and chimeric Fd-fragment antibodies in which the FR and/or CDR1 and/or CDR2 regions have been replaced with homologous human or nonhuman sequences. The term "antibody" also comprises "single-chain" antibodies.
[0180] The present technology also comprises proteins and peptides which bind specifically to tumor-associated antigens. Binding substances of this kind may be provided, for example, by degenerate peptide libraries which may be prepared simply in solution in an immobilized form or as phage-display libraries. It is likewise possible to prepare combinatorial libraries of peptides with one or more amino acids. Libraries of peptoids and nonpeptidic synthetic residues may also be prepared.
[0181] Antibodies may also be coupled to specific diagnostic substances for displaying cells and tissues expressing tumor-associated antigens. They may also be coupled to therapeutically useful substances.
[0182] Diagnostic substances or agents include any label that functions to: (i) provide a detectable signal; (ii) interact with a second label to modify the detectable signal provided by the first or second label, e.g. FRET (Fluorescence Resonance Energy Transfer); (iii) affect mobility, e.g. electrophoretic mobility, by charge, hydrophobicity, shape, or other physical parameters, or (iv) provide a capture moiety, e.g., affinity, antibody/antigen, or ionic complexation. Suitable as label are structures, such as fluorescent labels, luminescent labels, chromophore labels, radioisotopic labels, isotopic labels, preferably stable isotopic labels, isobaric labels, enzyme labels, particle labels, in particular metal particle labels, magnetic particle labels, polymer particle labels, small organic molecules such as biotin, ligands of receptors or binding molecules such as cell adhesion proteins or lectins, label-sequences comprising nucleic acids and/or amino acid residues which can be detected by use of binding agents, etc. Diagnostic substances comprise, in a nonlimiting manner, barium sulfate, iocetamic acid, iopanoic acid, calcium ipodate, sodium diatrizoate, meglumine diatrizoate, metrizamide, sodium tyropanoate and radio diagnostic, including positron emitters such as fluorine-18 and carbon-11, gamma emitters such as iodine-123, technetium-99m, iodine-131 and indium-111, nuclides for nuclear magnetic resonance, such as fluorine and gadolinium.
[0183] According to the present technology, the terms "therapeutically useful substance", "therapeutic substance" or "therapeutic agent" means any molecule which may exert a therapeutic effect. According to the present technology, a therapeutically useful substance is preferably selectively guided to a cell which expresses one or more tumor-associated antigens and includes anticancer agents, radioactive iodine-labeled compounds, toxins, cytostatic or cytolytic drugs, etc. Anticancer agents comprise, for example, aminoglutethimide, azathioprine, bleomycin sulfate, busulfan, carmustine, chlorambucil, cisplatin, cyclophosphamide, cyclosporine, cytarabidine, dacarbazine, dactinomycin, daunorubin, doxorubicin, taxol, etoposide, fluorouracil, interferon-.alpha., lomustine, mercaptopurine, methotrexate, mitotane, procarbazine HCl, thioguanine, vinblastine sulfate and vincristine sulfate. Other anticancer agents are described, for example, in Goodman and Gilman, "The Pharmacological Basis of Therapeutics", 8th Edition, 1990, McGraw-Hill, Inc., in particular Chapter 52 (Antineoplastic Agents (Paul Calabresi and Bruce A. Chabner). Toxins may be proteins such as pokeweed antiviral protein, cholera toxin, pertussis toxin, ricin, gelonin, abrin, diphtheria exotoxin or Pseudomonas exotoxin. Toxin residues may also be high energy-emitting radionuclides such as cobalt-60.
[0184] The term "major histocompatibility complex" or "MHC" relates to a complex of genes present in all vertebrates. MHC proteins or molecules are involved in signaling between lymphocytes and antigen presenting cells in normal immune reactions by binding peptides and presenting them for recognition by T cell receptors (TCR). MHC molecules bind peptides within an intracellular processing compartment and present these peptides on the surface of antigen presenting cells for recognition by T cells. The human MHC region also termed HLA is located on chromosome 6 and includes the class I and class II region. In one preferred embodiment of all aspects of the present technology an MHC molecule is an HLA molecule.
[0185] "Reduce" or "inhibit" as used herein means the ability to cause an overall decrease, preferably of 20% or greater, more preferably of 50% or greater, and most preferably of 75% or greater, in the level, e.g. in the level of protein or mRNA as compared to a reference sample (e.g., a sample not treated with siRNA). This reduction or inhibition of RNA or protein expression can occur through targeted mRNA cleavage or degradation. Assays for protein expression or nucleic acid expression are known in the art and include, for example, ELISA, western blot analysis for protein expression, and northern blotting or RNase protection assays for RNA.
[0186] The term "patient" means according to the present technology a human being, a nonhuman primate or another animal, in particular a mammal such as a cow, horse, pig, sheep, goat, dog, cat or a rodent such as a mouse and rat. In a particularly preferred embodiment, the patient is a human being.
[0187] According to the present technology the term "increased" or "increased amount" preferably refers to an increase by at least 10%, in particular at least 20%, at least 50% or at least 100%. The amount of a substance is also increased in a test sample such as a biological sample compared to a reference sample if it is detectable in the test sample but absent or not detectable in the reference sample.
[0188] According to the present technology, the term "disease" refers to any pathological state in which tumor-associated nucleic acids and/or tumor-associated antigens are expressed or abnormally expressed. "Abnormal expression" means according to the present technology that expression is altered, preferably increased, compared to the state in a healthy individual. An increase in expression refers to an increase by at least 10%, in particular at least 20%, at least 50% or at least 100%. In one embodiment, expression is only found in tissue of a diseased individual, while expression in a healthy individual is repressed or is repressed in a healthy individual except for placenta. One example of such a disease is cancer, wherein the term "cancer" according to the present technology comprises leukemias, seminomas, melanomas, teratomas, lymphomas, neuroblastomas, gliomas, rectal cancer, endometrial cancer, kidney cancer, adrenal cancer, thyroid cancer, blood cancer, skin cancer, cancer of the brain, cervical cancer, intestinal cancer, liver cancer, colon cancer, stomach cancer, intestine cancer, head and neck cancer, gastrointestinal cancer, lymph node cancer, esophagus cancer, colorectal cancer, pancreas cancer, ear, nose and throat (ENT) cancer, breast cancer, prostate cancer, cancer of the uterus, ovarian cancer and lung cancer and the matastases thereof. Examples thereof are lung carcinomas, mamma carcinomas, prostate carcinomas, colon carcinomas, renal cell carcinomas, cervical carcinomas, or metastases of the cancer types or tumors described above. The term cancer according to the present technology also comprises cancer metastases.
[0189] By "tumor" is meant an abnormal group of cells or tissue that grows by a rapid, uncontrolled cellular proliferation and continues to grow after the stimuli that initiated the new growth cease. Tumors show partial or complete lack of structural organization and functional coordination with the normal tissue, and usually form a distinct mass of tissue, which may be either benign or malignant.
[0190] By "metastasis" is meant the spread of cancer cells from its original site to another part of the body. The formation of metastasis is a very complex process and depends on detachment of malignant cells from the primary tumor, invasion of the extracellular matrix, penetration of the endothelial basement membranes to enter the body cavity and vessels, and then, after being transported by the blood, infiltration of target organs. Finally, the growth of a new tumor at the target site depends on angiogenesis. Tumor metastasis often occurs even after the removal of the primary tumor because tumor cells or components may remain and develop metastatic potential. In one embodiment, the term "metastasis" according to the present technology relates to "distant metastasis" which relates to a metastasis which is remote from the primary tumor and the regional lymph node system.
[0191] According to the present technology, a biological sample may be a tissue sample, including bodily fluids, and/or a cellular sample and may be obtained in the conventional manner such as by tissue biopsy, including punch biopsy, and by taking blood, bronchial aspirate, sputum, urine, feces or other body fluids. According to the present technology, the term "biological sample" also includes fractions of biological samples. Preferably, the term "biological sample" according to the present technology does not include samples derived from placental tissue.
[0192] According to the present technology, the term "immunoreactive cell" means a cell which can mature into an immune cell (such as B cell, T helper cell, or cytolytic T cell) with suitable stimulation. Immunoreactive cells comprise CD34.sup.+ hematopoietic stem cells, immature and mature T cells and immature and mature B cells. If production of cytolytic or T helper cells recognizing a tumor-associated antigen is desired, the immunoreactive cell is contacted with a cell expressing a tumor-associated antigen under conditions which favor production, differentiation and/or selection of cytolytic T cells and of T helper cells. The differentiation of T cell precursors into a cytolytic T cell, when exposed to an antigen, is similar to clonal selection of the immune system.
[0193] The terms "T cell" and "T lymphocyte" are used interchangeably herein and include T helper cells and cytotoxic T cells which comprise cytolytic T cells.
[0194] Some therapeutic methods are based on a reaction of the immune system of a patient, which results in a lysis of antigen-presenting cells such as cancer cells which present one or more tumor-associated antigens. In this connection, for example autologous cytotoxic T lymphocytes specific for a complex of a tumor-associated antigen and an MHC molecule are administered to a patient having a cellular abnormality. The production of such cytotoxic T lymphocytes in vitro is known. An example of a method of differentiating T cells can be found in WO-A-9633265. Generally, a sample containing cells such as blood cells is taken from the patient and the cells are contacted with a cell which presents the complex and which can cause propagation of cytotoxic T lymphocytes (e.g. dendritic cells). The target cell may be a transfected cell such as a COS cell. These transfected cells present the desired complex on their surface and, when contacted with cytotoxic T lymphocytes, stimulate propagation of the latter. The clonally expanded autologous cytotoxic T lymphocytes are then administered to the patient.
[0195] In another method of selecting antigen-specific cytotoxic T lymphocytes, fluorogenic tetramers of MHC class I molecule/peptide complexes are used for obtaining specific clones of cytotoxic T lymphocytes (Altman et al., Science 274:94-96, 1996; Dunbar et al., Curr. Biol. 8:413-416, 1998).
[0196] The present technology also includes therapeutic methods referred to as adoptive transfer (Greenberg, J. Immunol. 136(5):1917, 1986; Riddel et al., Science 257:238, 1992; Lynch et al., Eur. J. Immunol. 21:1403-1410, 1991; Kast et al., Cell 59:603-614, 1989), wherein cells presenting the desired complex (e.g. dendritic cells) are combined with cytotoxic T lymphocytes of the patient to be treated, resulting in a propagation of specific cytotoxic T lymphocytes. The propagated cytotoxic T lymphocytes are then administered to a patient having a cellular anomaly characterized by particular abnormal cells presenting the specific complex. The cytotoxic T lymphocytes then lyse the abnormal cells, thereby achieving a desired therapeutic effect.
[0197] Furthermore, cells presenting the desired complex (e.g. dendritic cells) may be combined with cytotoxic T lymphocytes of healthy individuals or another species (e.g. mouse) which may result in propagation of specific cytotoxic T lymphocytes with high affinity. The high affinity T cell receptor of these propagated specific T lymphocytes may be cloned and optionally humanized to a different extent, and the T cell receptors thus obtained then transduced via gene transfer, for example using retroviral vectors, into T cells of patients. Adoptive transfer may then be carried out using these genetically altered T lymphocytes (Stanislawski et al., Nat Immunol. 2:962-70, 2001; Kessels et al., Nat Immunol. 2:957-61, 2001).
[0198] Adoptive transfer is not the only form of therapy which can be applied according to the present technology. Cytotoxic T lymphocytes may also be generated in vivo in a manner known per se. One method uses nonproliferative cells expressing the complex. The cells used here will be those which usually express the complex, such as irradiated tumor cells or cells transfected with one or both genes necessary for presentation of the complex (i.e. the antigenic peptide and the presenting MHC molecule). Another preferred form is the introduction of the tumor-associated antigen in the form of recombinant RNA which may be introduced into cells by liposomal transfer or by electroporation, for example. The resulting cells present the complex of interest and are recognized by autologous cytotoxic T lymphocytes which then propagate.
[0199] A similar effect can be achieved by combining the tumor-associated antigen or a fragment thereof with an adjuvant in order to make incorporation into antigen-presenting cells in vivo possible. The tumor-associated antigen or a fragment thereof may be represented as protein, as DNA (e.g. within a vector) or as RNA. The tumor-associated antigen is processed to produce a peptide partner for the MHC molecule, while a fragment thereof may be presented without the need for further processing. The latter is the case in particular, if these can bind to MHC molecules. Preference is given to administration forms in which the complete antigen is processed in vivo by a dendritic cell, since this may also produce T helper cell responses which are needed for an effective immune response (Ossendorp et al., Immunol Lett. 74:75-9, 2000; Ossendorp et al., J. Exp. Med. 187:693-702, 1998). In general, it is possible to administer an effective amount of the tumor-associated antigen to a patient by intradermal injection, for example. However, injection may also be carried out intranodally into a lymph node (Malay et al., Proc Natl Acad Sci USA 98:3299-303, 2001).
[0200] The pharmaceutical compositions and methods of treatment described according to the present technology may also be used for immunization or vaccination to therapeutically treat or prevent a disease described herein. According to the present technology, the terms "immunization" or "vaccination" preferably relate to an increase in or activation of an immune response to an antigen. It is possible to use animal models for testing an immunizing effect on cancer by using a tumor-associated antigen or a nucleic acid coding therefor. For example, human cancer cells may be introduced into a mouse to generate a tumor, and one or more nucleic acids coding for tumor-associated antigens may be administered. The effect on the cancer cells (for example reduction in tumor size) may be measured as a measure for the effectiveness of an immunization by the nucleic acid.
[0201] As part of the composition for an immunization or a vaccination, preferably one or more tumor-associated antigens or stimulating fragments thereof are administered together with one or more adjuvants for inducing an immune response or for increasing an immune response. An adjuvant is a substance which is incorporated into the antigen or administered together with the latter and which enhances the immune response. Adjuvants may enhance the immune response by providing an antigen reservoir (extracellularly or in macrophages), activating macrophages and/or stimulating particular lymphocytes. Adjuvants are known and comprise in a nonlimiting way monophosphoryl lipid A (MPL, SmithKline Beecham), saponins such as QS21 (SmithKline Beecham), DQS21 (SmithKline Beecham; WO 96/33739), QS7, QS17, QS18 and QS-L1 (So et al., Mol. Cells 7:178-186, 1997), incomplete Freund's adjuvant, complete Freund's adjuvant, vitamin E, montanide, alum, CpG oligonucleotides (cf. Kreig et al., Nature 374:546-9, 1995) and various water-in-oil emulsions prepared from biologically degradable oils such as squalene and/or tocopherol. Preferably, the peptides are administered in a mixture with DQS21/MPL. The ratio of DQS21 to MPL is typically about 1:10 to 10:1, preferably about 1:5 to 5:1 and in particular about 1:1. For administration to humans, a vaccine formulation typically contains DQS21 and MPL in a range from about 1 .mu.g to about 100 .mu.g.
[0202] Other substances which stimulate an immune response of the patient may also be administered. It is possible, for example, to use cytokines in a vaccination, owing to their regulatory properties on lymphocytes. Such cytokines comprise, for example, interleukin-12 (IL-12) which was shown to increase the protective actions of vaccines (cf. Science 268:1432-1434, 1995), GM-CSF and IL-18.
[0203] There are a number of compounds which enhance an immune response and which therefore may be used in a vaccination. Said compounds comprise costimulating molecules provided in the form of proteins or nucleic acids such as B7-1 and B7-2 (CD80 and CD86, respectively).
[0204] The present technology also provides for administration of nucleic acids, proteins or peptides. Proteins and peptides may be administered in a manner known per se. In one embodiment, nucleic acids are administered by ex vivo methods, i.e. by removing cells from a patient, genetic modification of said cells in order to incorporate a tumor-associated antigen and reintroduction of the altered cells into the patient. This generally comprises introducing a functional copy of a gene into the cells of a patient in vitro and reintroducing the genetically altered cells into the patient. The functional copy of the gene is under the functional control of regulatory elements which allow the gene to be expressed in the genetically altered cells. Transfection and transduction methods are known to the skilled worker. The present technology also provides for administering nucleic acids in vivo by using vectors such as viruses and target-controlled liposomes. If according to the present technology reference is made to the administration or incorporation into pharmaceutical compositions of nucleic acids this includes embodiments wherein the nucleic acid is present in such vectors.
[0205] In a preferred embodiment, a virus or viral vector for administering a nucleic acid coding for a tumor-associated antigen is selected from the group consisting of adenoviruses, adeno-associated viruses, pox viruses, including vaccinia virus and attenuated pox viruses, Semliki Forest virus, retroviruses, Sindbis virus and Ty virus-like particles. Particular preference is given to adenoviruses and retroviruses. The retroviruses are typically replication-deficient (i.e. they are incapable of generating infectious particles).
[0206] Methods of introducing nucleic acids into cells in vitro or in vivo comprise transfection of nucleic acid calcium phosphate precipitates, transfection of nucleic acids associated with DEAE, transfection or infection with the above viruses carrying the nucleic acids of interest, liposome-mediated transfection, and the like. In particular embodiments, preference is given to directing the nucleic acid to particular cells. In such embodiments, a carrier used for administering a nucleic acid to a cell (e.g. a retrovirus or a Liposome) may have a bound target control molecule. For example, a molecule such as an antibody specific for a surface membrane protein on the target cell or a ligand for a receptor on the target cell may be incorporated into or attached to the nucleic acid carrier. Preferred antibodies comprise antibodies which bind selectively a tumor-associated antigen. If administration of a nucleic acid via liposomes is desired, proteins binding to a surface membrane protein associated with endocytosis may be incorporated into the liposome formulation in order to make target control and/or uptake possible. Such proteins comprise capsid proteins or fragments thereof which are specific for a particular cell type, antibodies to proteins which are internalized, proteins addressing an intracellular site, and the like.
[0207] The therapeutic compositions of the present technology may be administered in pharmaceutically compatible preparations. Such preparations may usually contain pharmaceutically compatible concentrations of salts, buffer substances, preservatives, carriers, supplementing immunity-enhancing substances such as adjuvants, e.g. CpG oligonucleotides, cytokines, chemokines, saponin, GM-CSF and/or RNA and, where appropriate, other therapeutically active compounds.
[0208] The therapeutically active compounds of the present technology may be administered via any conventional route, including by injection or infusion. The administration may be carried out, for example, orally, intravenously, intraperitoneally, intramuscularly, subcutaneously or transdermally. Preferably, antibodies are therapeutically administered by way of a lung aerosol. Antisense nucleic acids are preferably administered by slow intravenous administration.
[0209] The compositions of the present technology are administered in effective amounts. An "effective amount" refers to the amount which achieves a desired reaction or a desired effect alone or together with further doses. In the case of treatment of a particular disease or of a particular condition characterized by expression of one or more tumor-associated antigens, the desired reaction preferably relates to inhibition of the course of the disease. This comprises slowing down the progress of the disease and, in particular, interrupting or reversing the progress of the disease. The desired reaction in a treatment of a disease or of a condition may also be delay of the onset or a prevention of the onset of said disease or said condition. According to the present technology, a diagnosis or treatment of cancer may also include the diagnosis or treatment of cancer metastases which have already formed or will form. According to the present technology, the term "treatment" comprises therapeutic and prophylactic treatment, i.e. prevention.
[0210] An effective amount of a composition of the present technology will depend on the condition to be treated, the severeness of the disease, the individual parameters of the patient, including age, physiological condition, size and weight, the duration of treatment, the type of an accompanying therapy (if present), the specific route of administration and similar factors.
[0211] The pharmaceutical compositions of the present technology are preferably sterile and contain an effective amount of the therapeutically active substance to generate the desired reaction or the desired effect.
[0212] The doses administered of the compositions of the present technology may depend on various parameters such as the type of administration, the condition of the patient, the desired period of administration, etc. In the case that a reaction in a patient is insufficient with an initial dose, higher doses (or effectively higher doses achieved by a different, more localized route of administration) may be used.
[0213] Generally, doses of the tumor-associated antigen of from 1 ng to 1 mg, preferably from 10 ng to 100 .mu.g, are formulated and administered for a treatment or for generating or increasing an immune response. If the administration of nucleic acids (DNA and RNA) coding for tumor-associated antigens is desired, doses of from 1 ng to 0.1 mg are formulated and administered.
[0214] The pharmaceutical compositions of the present technology are generally administered in pharmaceutically compatible amounts and in pharmaceutically compatible compositions. The term "pharmaceutically compatible" refers to a nontoxic material which does not interact with the action of the active component of the pharmaceutical composition. Preparations of this kind may usually contain salts, buffer substances, preservatives, carriers and, where appropriate, other therapeutically active compounds. When used in medicine, the salts should be pharmaceutically compatible. However, salts which are not pharmaceutically compatible may used for preparing pharmaceutically compatible salts and are included in the present technology. Pharmacologically and pharmaceutically compatible salts of this kind comprise in a nonlimiting way those prepared from the following acids: hydrochloric, hydrobromic, sulfuric, nitric, phosphoric, maleic, acetic, salicylic, citric, formic, malonic, succinic acids, and the like. Pharmaceutically compatible salts may also be prepared as alkali metal salts or alkaline earth metal salts, such as sodium salts, potassium salts or calcium salts.
[0215] A pharmaceutical composition of the present technology may comprise a pharmaceutically compatible carrier. According to the present technology, the term "pharmaceutically compatible carrier" refers to one or more compatible solid or liquid fillers, diluents or encapsulating substances, which are suitable for administration to humans. The term "carrier" refers to an organic or inorganic component, of a natural or synthetic nature, in which the active component is combined in order to facilitate application. The components of the pharmaceutical composition of the present technology are usually such that no interaction occurs which substantially impairs the desired pharmaceutical efficacy.
[0216] The pharmaceutical compositions of the present technology may contain suitable buffer substances such as acetic acid in a salt, citric acid in a salt, boric acid in a salt and phosphoric acid in a salt.
[0217] The pharmaceutical compositions may, where appropriate, also contain suitable preservatives such as benzalkonium chloride, chlorobutanol, paraben and thimerosal.
[0218] The pharmaceutical compositions are usually provided in a uniform dosage form and may be prepared in a manner known per se. Pharmaceutical compositions of the present technology may be in the form of capsules, tablets, lozenges, solutions, suspensions, syrups, elixirs or in the form of an emulsion, for example.
[0219] Compositions suitable for parenteral administration usually comprise a sterile aqueous or nonaqueous preparation of the active compound, which is preferably isotonic to the blood of the recipient. Examples of compatible carriers and solvents are Ringer solution and isotonic sodium chloride solution. In addition, usually sterile, fixed oils are used as solution or suspension medium.
[0220] The present technology is described in detail by the figures and examples below, which are used only for illustration purposes and are not meant to be limiting. Owing to the description and the examples, further embodiments which are likewise included in the present technology are accessible to the skilled worker.
EXAMPLES
[0221] The techniques and methods mentioned herein are carried out in a manner known per se and are described, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. All methods including the use of kits and reagents are carried out according to the manufacturers' information unless specifically indicated.
Example 1
Screening for Placenta-Specific Genes Aberrantly Activated in Tumors
[0222] Tissues and Cell Lines
[0223] Tissues were obtained as human surplus materials during routine diagnostic or therapeutic procedures and were stored at -80.degree. C. until use. Cell lines were purchased from the American Type Culture Collection (ATCC) and the German Resource Collection of Microorganisms and Cell Culture (DSMZ).
[0224] RNA Isolation and Microarray Hybridization
[0225] Total RNA was isolated using the RNeasy Mini Kit protocol (Qiagen). Quantification of isolated RNA was performed using UV-spectroscopy and the quality was determined both by A.sub.260/A.sub.280 ratio and Agilent bioanalyzer (Agilent Technologies). Five micrograms total RNA were used for cDNA synthesis with 5 pmol .mu.l.sup.-1 T7-oligo(dT).sub.24 primer and was performed at 43.degree. C. for 90 minutes with the "Superscript First-Strand Synthesis-System" for RT-PCR (Invitrogen). Second-strand synthesis was performed with complete cDNA. The cDNA solution was incubated at 16.degree. C. for 2 hours followed by an incubation step for 20 min with 6 U T4-DNA polymerase at 16.degree. C. and the reaction was stopped using 10 .mu.l of 0.5 M EDTA. After purification of the double stranded cDNA using the GeneChip Sample Cleanup Module (Affymetrix) labeled cRNA was generated from the cDNA sample by an in vitro transcription reaction that was supplemented with biotin-11-CTP and biotin-16-UTP (Enzo Diagnostics) according to the manufacturer's instructions. The cRNA was quantified by A.sub.260, and the quality was determined using the labchip bioanalyzer (Agilent). Only cRNA specimens with a high quality were selected for further analyses. Fragmented cRNA (15 .mu.g) was used to prepare 300 .mu.l hybridization cocktail (100 mM MES, 1 M NaCl, 20 mM EDTA, 0.01% Tween-20) containing 0.1 mg ml.sup.-1 of herring sperm DNA, and 0.5 mg ml.sup.-1 acetylated bovine serum albumin. Control cRNA was used in order to compare hybridization efficiencies between arrays and to standardize the quantification of measured transcript levels and was included as component of the `Eukaryotic Hybridization Control kit` (Affymetrix, Santa Clara, Calif., USA). The cocktails were heated to 95.degree. C. for 5 minutes, equilibrated at 45.degree. C. for 5 minutes, and clarified by centrifugation. The cocktail was hybridized to HG 0133 Plus 2.0 arrays (Affymetrix) at 45.degree. C. for 16 hours. The arrays were washed and stained with a streptavidin-conjugated fluor using the GeneChip fluidics station protocol EukGE-WS2 (Affymetrix) according to the manufacturer's instructions. Arrays were scanned with an argon-ion laser confocal scanner (Hewlett-Packard, Santa Clara, Calif.) with detection at 570 nm. Data were extracted using Microarray Suite version 5.0 (Affymetrix) and linearly scaled to achieve an average intensity of 2,500 per gene. Text files were exported to determine the intensity of each interrogating oligonucleotide perfect match probe cells or mismatch probe cells. In addition, the ratios of 5'- and 3'-ends of mRNA were analyzed of six randomly selected specimens (two of each group) using microarray test-chips (Test3 Array) containing 24 human housekeeping/maintenance genes (Affymetrix) and RNA degradation was not observed.
[0226] Bioinformatic Analysis
[0227] The GeneChip.RTM. Operating Software 1.4 (Affymetrix) and ArrayAssist software package 5.2 (Stratagene) were used for statistical analyses.
[0228] Results
[0229] Screening of samples from the 18 normal tissues shown below in table 1 and 30 tumor cell lines of different entities shown below in table 2 resulted in the sequences described herein which are expressed in placenta among the normal tissues investigated and in tumor cell lines.
TABLE-US-00001 TABLE 1 Tissues used for microarray expression analysis Tissue Number Placenta 2 Testis 2 Mammary gland 2 Thymus 2 Skin 2 Liver 2 Colon 2 Esophagus 2 Stomach 2 Lung 2 Kidney 2 Lymph node 2 Skeletal muscle 2 Myocard 1 Brain 1 Cerebellum 1 resting PBMCs 2 activ. PBMCs 2
TABLE-US-00002 TABLE 2 Cell lines used for microarray expression analysis Cell line Tissue BT-549 Breast cancer MDA-MB-231 metastasizing Breast cancer MDA-MB-231 non-metastasizing Breast cancer MDA-MB-435S Breast cancer MDA-MB-468 Breast cancer SK-BR-3 Breast cancer Caov-3 Ovarian cancer FU-OV Ovarian cancer NIH-OVCAR-3 Ovarian cancer COLO-205 Colorectal cancer HCT-116 Colorectal cancer HCT-116 DKO Colorectal cancer HCT-15 Colorectal cancer HT-29 Colorectal cancer LOVO Colorectal cancer SW-480 Colorectal cancer CPC-N Lung cancer LOU-NH-91 Lung cancer SHP-77 Lung cancer SK-MES-1 Lung cancer NCI-H-187 Lung cancer NCI-H-209 Lung cancer NCI-H-522 Lung cancer DU-145 Prostate cancer Uncap Prostate cancer PC-3 Prostate cancer MEL-JUSO Melanoma Murkowski Melanoma SK-MEL-37 Melanoma HELA Cervical cancer
Example 2
Validation of the Identified Tumor-Associated Markers
[0230] 1. Examination of RNA Expression
[0231] The identified tumor-associated markers are first validated with the aid of RNA which is obtained from various tissues or from tissue-specific cell lines. Since the differential expression pattern of healthy tissue in comparison with tumor tissue is of decisive importance for the subsequent therapeutic application, the target genes are preferably characterized with the aid of these tissue samples.
[0232] Total RNA is isolated from native tissue samples or from tumor cell lines by standard methods of molecular biology. Said isolation may be carried out, for example, with the aid of the RNeasy Maxi kit (Qiagen, Cat. No. 75162) according to the manufacturer's instructions. This isolation method is based on the use of chaotropic reagent guanidinium isothiocyanate. Alternatively, acidic phenol can be used for isolation (Chomczynski & Sacchi, Anal. Biochem. 162: 156-159, 1987). After the tissue has been worked up by means of guanidinium isothiocyanate, RNA is extracted with acidic phenol, subsequently precipitated with isopropanol and taken up in DEPC-treated water.
[0233] 2-4 .mu.g of the RNA isolated in this way are subsequently transcribed into cDNA, for example by means of Superscript II (Invitrogen) according to the manufacturer's protocol. cDNA synthesis is primed with the aid of random hexamers (e.g. Roche Diagnostics) according to standard protocols of the relevant manufacturer. For quality control, the cDNAs are amplified over 30 cycles, using primers specific for the p53 gene which is expressed only lowly. Only p53-positive cDNA samples will be used for the subsequent reaction steps.
[0234] The targets are analyzed in detail by carrying out an expression analysis by means of PCR or quantitative PCR (qPCR) on the basis of a cDNA archive which has been isolated from various normal and tumor tissues and from tumor cell lines. For this purpose, 0.5 .mu.l of cDNA of the above reaction mixture is amplified by a DNA polymerase (e.g. 1 U of HotStarTaq DNA polymerase, Qiagen) according to the protocols of the particular manufacturer (total volume of the reaction mixture: 25-50 .mu.l). Aside from said polymerase, the amplification mixture comprises 0.3 mM dNTPs, reaction buffer (final concentration 1.times., depending on the manufacturer of the DNA polymerase) and in each case 0.3 mM gene-specific "sense" and "antisense" primers.
[0235] The specific primers of the target gene are, as far as possible, selected in such a way that they are located in two different exons so that genomic contaminations do not lead to false-positive results. In a non-quantitative end point PCR, the cDNA is typically incubated at 95.degree. C. for 15 minutes in order to denature the DNA and to activate the Hot-Start enzyme. Subsequently the DNA is amplified over 35 cycles (1 min at 95.degree. C., 1 min at the primer-specific hybridization temperature (approx. 55-65.degree. C.), 1 min at 72.degree. C. to elongate the amplicons). Subsequently, 10 .mu.l of the PCR mixture are applied to agarose gels and fractionated in the electric field. The DNA is made visible in the gels by staining with ethidium bromide and the PCR result is documented by way of a photograph.
[0236] As an alternative to conventional PCR, expression of a target gene may also be analyzed by quantitative real time PCR. Meanwhile various analytical systems are available for this analysis, of which the best known ones are the ABI PRISM sequence detection system (TagMan, Applied Biosystems), the iCycler (Biorad) and the Light cycler (Roche Diagnostics). As described above, a specific PCR mixture is subjected to a run in the real time instruments. By adding a DNA-intercalating dye (e.g. ethidium bromide, CybrGreen), the newly synthesized DNA is made visible by specific light excitation (according to the dye manufacturers' information). A multiplicity of points measured during amplification enables the entire process to be monitored and the nucleic acid concentration of the target gene to be determined quantitatively. The PCR mixture is normalized by measuring a housekeeping gene (e.g. 18S RNA, .beta.-actin). Alternative strategies via fluorescently labeled DNA probes likewise allow quantitative determination of the target gene of a specific tissue sample (see TaqMan applications from Applied Biosystems).
[0237] As shown in FIG. 1, placenta was confirmed in RT-PCR analyses as the only healthy tissue expressing the nucleic acid sequence according to SEQ ID NO:540. No significant expression was found in any other normal tissue. However, high and significant levels of expression were found in breast cancer.
[0238] Quantitative real-time RT-PCR analyses revealed that the nucleic acid sequence according to SEQ ID NO:540 was expressed in significant levels in the majority of breast cancer samples analyzed; cf. FIG. 2.
[0239] 2. Cloning
[0240] The complete target gene which is required for further characterization of the tumor-associated marker is cloned according to common molecular-biological methods (e.g. in "Current Protocols in Molecular Biology", John Wiley & Sons Ltd., Wiley InterScience). In order to clone the target gene or to analyze its sequence, said gene is first amplified by a DNA polymerase having a proof reading function (e.g. pfu, Roche Diagnostics). The amplicon is then ligated by standard methods into a cloning vector. Positive clones are identified by sequence analysis and subsequently characterized with the aid of prediction programs and known algorithms.
[0241] 3. Prediction of the Protein
[0242] Genes found according to the present technology (in particular those from the RefSeq XM domain) may require cloning of the full-length gene, determination of the open reading frame and deduction and analysis of the protein sequence.
[0243] In order to clone the full-length sequence, common protocols for the rapid amplification of cDNA ends and the screening of cDNA expression libraries with gene-specific probes may be used (Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd edition (1989), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.).
[0244] After assembling the fragments found in this way, potential open reading frames (ORF) can be predicted using common prediction programs. Since the position of the PolyA tail and of polyadenylation motifs predetermines the orientation of the potential gene product, only the 3 reading frames of that particular orientation remain out of a possible 6 reading frames. The former often yield only one sufficiently large open reading frame which may code for a protein, while the other reading frames have too many stop codons and would not code for any realistic protein. In the case of alternative open reading frames, identification of the authentic ORF is assisted by taking into account the Kozak criteria for optimal transcription initiation and by analyzing the deduced protein sequences which may arise. Said ORF is further verified by generating immune sera against proteins deduced from the potential ORFS and analyzing said immune sera for recognition of a real protein in tissues and cell lines.
[0245] 4. Production of Antibodies
[0246] The tumor-associated antigens identified according to the present technology are characterized, for example, by using antibodies. The present technology further comprises the diagnostic or therapeutic use of antibodies. Antibodies may recognize proteins in the native and/or denatured state (Anderson et al., J. Immunol. 143: 1899-1904, 1989; Gardsvoll, J. Immunol. Methods 234: 107-116, 2000; Kayyem et al., Eur. J. Biochem. 208: 1-8, 1992; Spiller et al., J. Immunol. Methods 224: 51-60, 1999).
[0247] Antisera comprising specific antibodies which specifically bind to the target protein may be prepared by various standard methods; cf., for example, "Monoclonal Antibodies: A Practical Approach" by Phillip Shepherd, Christopher Dean ISBN 0-19-963722-9, "Antibodies: A Laboratory Manual" by Ed Harlow, David Lane ISBN: 0879693142 and "Using Antibodies: A Laboratory Manual: Portable Protocol NO" by Edward Harlow, David Lane, Ed Harlow ISBN: 0879695447. It is also possible here to generate affine and specific antibodies which recognize complex membrane proteins in their native form (Azorsa et al., J. Immunol. Methods 229: 35-48, 1999; Anderson et al., J. Immunol. 143: 1899-1904, 1989; Gardsvoll, J. Immunol. Methods. 234: 107-116, 2000). This is especially important in the preparation of antibodies which are intended to be used therapeutically but also for many diagnostic applications. For this purpose, both the complete protein and extracellular partial sequences may be used for immunization.
[0248] Immunization and Production of Polyclonal Antibodies
[0249] Various immunization protocols are published. A species (e.g. rabbits, mice) is immunized by a first injection of the desired target protein. The immune response of the animal to the immunogen can be enhanced by a second or third immunization within a defined period of time (approx. 2-4 weeks after the previous immunization). Blood is taken from said animals and immune sera obtained, again after various defined time intervals (1st bleeding after 4 weeks, then every 2-3 weeks, up to 5 takings). The immune sera taken in this way comprise polyclonal antibodies which may be used to detect and characterize the target protein in Western blotting, by flow cytometry, immunofluorescence or immunohistochemistry.
[0250] The animals are usually immunized by any of four well-established methods, with other methods also in existence. The immunization may be carried out using peptides specific for the target protein, using the complete protein, or using extracellular partial sequences of a protein which can be identified experimentally or via prediction programs. Since the prediction programs do not always work perfectly, it is also possible to employ two domains separated from one another by a transmembrane domain. In this case, one of the two domains has to be extracellular, which may then be proved experimentally (see below). Immunization is offered commercially by different service providers.
[0251] (1) In the first case, peptides (length: 8-12 amino acids) are synthesized by in vitro methods (possibly carried out by a commercial service), and said peptides are used for immunization. Normally 3 immunizations are carried out (e.g. with a concentration of 5-100 .mu.g/immunization).
[0252] (2) Alternatively, immunization may be carried out using recombinant proteins. For this purpose, the cloned DNA of the target gene is cloned into an expression vector and the target protein is synthesized, for example, cell-free in vitro, in bacteria (e.g. E. coli), in yeast (e.g. S. pombe), in insect cells or in mammalian cells, according to the conditions of the particular manufacturer (e.g. Roche Diagnostics, Invitrogen, Clontech, Qiagen). It is also possible to synthesize the target protein with the aid of viral expression systems (e.g. baculovirus, vacciniavirus, adenovirus). After it has been synthesized in one of said systems, the target protein is purified, normally by employing chromatographic methods. In this context, it is also possible to use for immunization proteins which have a molecular anchor as an aid for purification (e.g. His tag, Qiagen; FLAG tag, Roche Diagnostics; GST fusion proteins). A multiplicity of protocols can be found, for example, in "Current Protocols in Molecular Biology", John Wiley & Sons Ltd., Wiley InterScience. After the target protein has been purified, an immunization is carried out as described above.
[0253] (3) If a cell line is available which synthesizes the desired protein endogenously, it is also possible to use this cell line directly for preparing the specific antiserum. In this case, immunization is carried out by 1-3 injections with in each case approx. 1-5.times.10.sup.7 cells.
[0254] (4) The immunization may also be carried out by injecting DNA (DNA immunization). For this purpose, the target gene is first cloned into an expression vector so that the target sequence is under the control of a strong eukaryotic promoter (e.g. CMV promoter). Subsequently, DNA (e.g. 1-10 .mu.g per injection) is transferred as immunogen using a gene gun into capillary regions with a strong blood flow in an organism (e.g. mouse, rabbit). The transferred DNA is taken up by the animal's cells, the target gene is expressed, and the animal finally develops an immune response to the target protein (Jung et al., Mol. Cells 12: 41-49, 2001; Kasinrerk et at., Hybrid Hybridomics 21: 287-293, 2002).
[0255] Production of Monoclonal Antibodies
[0256] Monoclonal antibodies are traditionally produced with the aid of the hybridoma technology (technical details: see "Monoclonal Antibodies: A Practical Approach" by Philip Shepherd, Christopher Dean ISBN 0-19-963722-9; "Antibodies: A Laboratory Manual" by Ed Harlow, David Lane ISBN: 0879693142, "Using Antibodies: A Laboratory Manual: Portable Protocol NO" by Edward Harlow, David Lane, Ed Harlow ISBN: 0879695447). A new method which is also used is the "SLAM" technology. Here, B cells are isolated from whole blood and the cells are made monoclonal. Subsequently the supernatant of the isolated B cell is analyzed for its antibody specificity. In contrast to the hybridoma technology, the variable region of the antibody gene is then amplified by single-cell PCR and cloned into a suitable vector. In this manner production of monoclonal antibodies is accelerated (de Wildt et al., J. Immunol. Methods 207:61-67, 1997).
[0257] 5. Validation of the Targets by Protein-Chemical Methods Using Antibodies
[0258] The antibodies which can be produced as described above can be used to further analyze the target protein as follows:
[0259] Specificity of the Antibody
[0260] Assays based on cell culture with subsequent Western blotting are most suitable for demonstrating the fact that an antibody binds specifically only to the desired target protein (various variations are described, for example, in "Current Protocols in Protein Chemistry", John Wiley & Sons Ltd., Wiley InterScience). For the demonstration, cells are transfected with a cDNA for the target protein, which is under the control of a strong eukaryotic promoter (e.g. cytomegalovirus promoter; CMV). A wide variety of methods (e.g. electroporation, liposome-based transfection, calcium phosphate precipitation) are well established for transfecting cell lines with DNA (e.g. Lemoine et al., Methods Mol. Biol. 75: 441-7, 1997). As an alternative, it is also possible to use cell lines which express the target gene endogenously (detection via target gene-specific RT-PCR). As a control, in the ideal case, homologous genes are cotransfected in the experiment, in order to be able to demonstrate in the following Western blot the specificity of the analyzed antibody.
[0261] In the subsequent Western blotting, cells from cell culture or tissue samples which might contain the target protein are lysed in a 1% strength SDS solution, and the proteins are denatured in the process. The lysates are fractionated according to size by electrophoresis on 8-15% strength denaturing polyacrylamide gels (contain 1% SDS) (SDS polyacrylamide gel electrophoresis, SDS-PAGE). The proteins are then transferred by one of a plurality of blotting methods (e.g. semi-dry electroblot; Biorad) to a specific membrane (e.g. nitrocellulose, Schleicher & Schull). The desired protein can be visualized on this membrane. For this purpose, the membrane is first incubated with the antibody which recognizes the target protein (dilution approx. 1:20-1:200, depending on the specificity of said antibody), for 60 minutes. After a washing step, the membrane is incubated with a second antibody which is coupled to a marker (e.g. enzymes such as peroxidase or alkaline phosphatase) and which recognizes the first antibody. It is then possible to make the target protein visible on the membrane in a color or chemi-luminescent reaction (e.g. ECL, Amersham Bioscience). An antibody with a high specificity for the target protein should in the ideal case only recognize the desired protein itself.
[0262] Localization of the Target Protein
[0263] Various methods are used to confirm the membrane localization, identified in the in silico approach, of the target protein. An important and well-established method using the antibodies described above is immunofluorescence (IF). For this purpose, cells of established cell lines which either synthesize the target protein (detection of the RNA by RT-PCR or of the protein by Western blotting) or else have been transfected with plasmid DNA are utilized. A wide variety of methods (e.g. electroporation, liposome-based transfection, calcium phosphate precipitation) are well established for transfection of cell lines with DNA (e.g. Lemoine et al., Methods Mol. Biol. 75: 441-7, 1997). The plasmid transfected into the cells, in immunofluorescence, may encode the unmodified protein or else couple different amino acid markers to the target protein. The principle markers are, for example, the fluorescent green fluorescent protein (GFP) in various differentially fluorescent forms, short peptide sequences of 6-12 amino acids for which high-affinity and specific antibodies are available, or the short amino acid sequence Cys-Cys-X-X-Cys-Cys which can bind via its cysteines specific fluorescent substances (Invitrogen). Cells which synthesize the target protein are fixed, for example, with paraformaldehyde or methanol. The cells may then, if required, be permeabilized by incubation with detergents (e.g. 0.2% Triton X-100). The cells are then incubated with a primary antibody which is directed against the target protein or against one of the coupled markers. After a washing step, the mixture is incubated with a second antibody coupled to a fluorescent marker (e.g. fluorescein, Texas Red, Dako), which binds to the first antibody. The cells labeled in this way are then overlaid with glycerol and analyzed with the aid of a fluorescence microscope according to the manufacturer's information. Specific fluorescence emissions are achieved in this case by specific excitation depending on the substances employed. The analysis usually permits reliable localization of the target protein, the antibody quality and the target protein being confirmed in double stainings with, in addition to the target protein, also the coupled amino acid markers or other marker proteins whose localization has already been described in the literature being stained. GFP and its derivatives represent a special case, being excitable directly and themselves fluorescing. The membrane permeability which may be controlled through the use of detergents, in immunofluorescence, allows demonstration of whether an immunogenic epitope is located inside or outside the cell. The prediction of the selected proteins can thus be supported experimentally. An alternative possibility is to detect extracellular domains by means of flow cytometry. For this purpose, cells are fixed under non-permeabilizing conditions (e.g. with PBS/Na azide/2%, FCS/5 mM EDTA) and analyzed in a flow cytometer in accordance with the manufacturer's instructions. Only extracellular epitopes can be recognized by the antibody to be analyzed in this method. A difference from immunofluorescence is that it is possible to distinguish between dead and living cells by using, for example, propidium iodide or trypan blue, and thus avoid false-positive results.
[0264] Another important detection is by immunohistochemistry (IHC) on specific tissue samples. The aim of this method is to identify the localization of a protein in a functionally intact tissue aggregate. IHC serves specifically for (1) being able to estimate the amount of target protein in tumor and normal tissues, (2) analyzing how many cells in tumor and healthy tissues synthesize the target gene, and (3) defining the cell type in a tissue (tumor, healthy cells) in which the target protein is detectable. Alternatively, the amounts of protein of a target gene may be quantified by tissue immunofluorescence using a digital camera and suitable software (e.g. Tillvision, Till-photonics, Germany). The technology has frequently been published, and details of staining and microscopy can therefore be found, for example, in "Diagnostic Immunohistochemistry" by David J., MD Dabbs ISBN: 0443065667 or in "Microscopy, Immunohistochemistry, and Antigen Retrieval Methods: For Light and Electron Microscopy" ISBN: 0306467704. It should be noted that, owing to the properties of antibodies, different protocols have to be used (an example is described below) in order to obtain a meaningful result.
[0265] Normally, histologically defined tumor tissues and, as reference, comparable healthy tissues are employed in IHC. It is also possible to use as positive and negative controls cell lines in which the presence of the target gene is known through RT-PCR analyses. A background control must always be included.
[0266] Formalin-fixed (another fixation method, for example with methanol, is also possible) and paraffin-embedded tissue pieces with a thickness of 4 .mu.m are applied to a glass support and deparaffinated with xylene, for example. The samples are washed with TBS-T and blocked in serum. This is followed by incubation with the first antibody (dilution: 1:2 to 1:2000) for 1-18 hours, with affinity-purified antibodies normally being used. A washing step is followed by incubation with a second antibody which is coupled to an alkaline phosphatase (alternative: for example peroxidase) and directed against the first antibody, for approx. 30-60 minutes. This is followed by a color reaction using alkaline phosphatase (cf., for example, Shi et al., J. Histochem. Cytochem. 39: 741-748, 1991; Shin et al., Lab. Invest. 64: 693-702, 1991). To demonstrate antibody specificity, the reaction can be blocked by previous addition of the immunogen.
[0267] Analysis of Protein Modifications
[0268] Secondary protein modifications such as, for example, N- or O-glycosylations or myristilations may impair or even completely prevent the accessibility of immunogenic epitopes and thus call into question the efficacy of antibody therapies. Moreover, it has frequently been demonstrated that the type and amount of secondary modifications differ in normal and tumor tissues (e.g. Durand & Seta, 2000; Clin. Chem. 46: 795-805; Hakomori, 1996; Cancer Res. 56: 5309-18). The analysis of these modifications is therefore essential to the therapeutic success of an antibody. Potential binding sites can be predicted by specific algorithms.
[0269] Analysis of protein modifications usually takes place by Western blotting (see above). Glycosylations which usually have a size of several kDa, especially lead to a larger total mass of the target protein, which can be fractionated in SDS-PAGE. To detect specific O- and N-glycosidic bonds, protein lysates are incubated prior to denaturation by SDS with O- or N-glycosylases (in accordance with their respective manufacturer's instructions, e.g. PNgase, endoglycosidase F, endoglycosidase H, Roche Diagnostics). This is followed by Western blotting as described above. Thus, if there is a reduction in the size of a target protein after incubation with a glycosidase, it is possible to detect a specific glycosylation and, in this way, also analyze the tumor specificity of a modification.
[0270] Functional Analysis of the Target Gene
[0271] The function of the target molecule may be crucial for its therapeutic usefulness, so that functional analyses are an important component in the characterization of therapeutically utilizable molecules. The functional analysis may take place either in cells in cell culture experiments or else in vivo with the aid of animal models. This involves either switching off the gene of the target molecule by mutation (knockout) or inserting the target sequence into the cell or the organism (knockin). Thus it is possible to analyze functional modifications in a cellular context firstly by way of the loss of function of the gene to be analyzed (loss of function). In the second case, modifications caused by addition of the analyzed gene can be analyzed (gain of function).
[0272] a. Functional Analysis in Cells
[0273] Transfection. In order to analyze the gain of function, the gene of the target molecule must be transferred into the cell. For this purpose, cells which allow synthesis of the target molecule are transfected with a DNA. Normally, the gene of the target molecule here is under the control of a strong eukaryotic promoter (e.g. cytomegalovirus promoter; CMV). A wide variety of methods (e.g. electroporation, liposome-based transfection, calcium phosphate precipitation) are well established for transfecting cell lines with DNA (e.g. Lemoine et al., Methods Mol. Biol. 75: 441-7, 1997). The gene may be synthesized either transiently, without genomic integration, or else stably, with genomic integration after selection with neomycin, for example.
[0274] RNA interference (siRNA). An inhibition of expression of the target gene, which may induce a complete loss of function of the target molecule in cells, may be generated by the RNA interference (siRNA) technology in cells (Hannon, G J. 2002. RNA interference. Nature 418: 244-51; Czauderna et al. 2003. Nucl. Acid Res. 31: 670-82). For this purpose, cells are transfected with short, double-stranded RNA molecules of approx. 20-25 nucleotides in length, which are specific for the target molecule. An enzymic process then results in degradation of the specific RNA of the target gene and thus in reduced expression of the target protein and consequently enables the target gene to be functionally analyzed.
[0275] Cell lines which have been modified by means of transfection or siRNA may subsequently be analyzed in different ways. The most common examples are listed below.
[0276] 1. Proliferation and Cell Cycle Behavior
[0277] A multiplicity of methods for analyzing cell proliferation are established and are commercially supplied by various companies (e.g. Roche Diagnostics, Invitrogen; details of the assay methods are described in the numerous application protocols). The number of cells in cell culture experiments can be determined by simple counting or by calorimetric assays which measure the metabolic activity of the cells (e.g. wst-1, Roche Diagnostics). Metabolic assay methods measure the number of cells in an experiment indirectly via enzymic markers. Cell proliferation may be measured directly by analyzing the rate of DNA synthesis, for example by adding bromodeoxyuridine (BrdU), with the integrated BrdU being detected colorimetrically via specific antibodies.
[0278] 2. Apoptosis and Cytotoxicity
[0279] A large number of assay systems for detecting cellular apoptosis and cytotoxicity are available. A decisive characteristic is the specific, enzyme-dependent fragmentation of genomic DNA, which is irreversible and in any case results in death of the cell. Methods for detecting these specific DNA fragments are commercially obtainable. An additional method available is the TUNEL assay which can detect DNA single-strand breaks also in tissue sections. Cytotoxicity is mainly detected via an altered cell permeability which serves as marker of the vitality state of cells. This involves on the one hand the analysis of markers which can typically be found intracellularly in the cell culture supernatant. On the other hand, it is also possible to analyze the absorbability of dye markers which are not absorbed by intact cells. The best-known examples of dye markers are Trypan blue and propidium iodide, a common intracellular marker is lactate dehydrogenase which can be detected enzymatically in the supernatant. Different assay systems of various commercial suppliers (e.g. Roche Diagnostics, Invitrogen) are available.
[0280] 3. Migration Assay
[0281] The ability of cells to migrate is analyzed in a specific migration assay, preferably with the aid of a Boyden chamber (Corning Costar) (Cinamon G., Alon R. J. Immunol. Methods. 2003 February; 273(1-2):53-62; Stockton et al. 2001. Mol. Biol. Cell. 12: 1937-56). For this purpose, cells are cultured on a filter with a specific pore size. Cells which can migrate are capable of migrating through this filter into another culture vessel below. Subsequent microscopic analysis then permits determination of a possibly altered migration behavior induced by the gain of function or loss of function of the target molecule.
[0282] b. Functional Analysis in Animal Models
[0283] A possible alternative of cell culture experiments for the analysis of target gene function are complicated in vivo experiments in animal models. Compared to the cell-based methods, these models have the advantage of being able to detect faulty developments or diseases which are detectable only in the context of the whole organism. A multiplicity of models for human disorders are available by now (Abate-Shen & Shen. 2002. Trends in Genetics S1-5; Matsusue et al. 2003. J. Clin. Invest. 111:737-47). Various animal models such as, for example, yeast, nematodes or zebra fish have since been characterized intensively. However, models which are preferred over other species are mammalian animal models such as, for example, mice (Mus musculus) because they offer the best possibility of reproducing the biological processes in a human context. For mice, on the one hand transgenic methods which integrate new genes into the mouse genome have been established in recent years (gain of function; Jegstrup I. et al. 2003. Lab Anim. 2003 January; 37(1):1-9). On the other hand, other methodical approaches switch off genes in the mouse genome and thus induce a loss of function of a desired gene (knockout models, loss of function; Zambrowicz B P & Sands A T. 2003. Nat. Rev. Drug Discov. 2003 January; 2(1):38-51; Niwa H. 2001. Cell Struct. Funct. 2001 June; 26(3):137-48); technical details have been published in large numbers.
[0284] After the mouse models have been generated, alterations induced by the transgene or by the loss of function of a gene can be analyzed in the context of the whole organism (Balling R, 2001. Ann. Rev. Genomics Hum. Genet. 2:463-92). Thus it is possible to carry out, for example, behavior tests as well as to biochemically study established blood parameters. Histological analyses, immunohistochemistry or electron microscopy enable alterations to be characterized at the cellular level. The specific expression pattern of a gene can be detected by in-situ hybridization (Peters T. et al. 2003. Hum. Mol. Genet 12:2109-20).
Example 3
Detailed Analysis of the Identified Tumor-Associated Markers
[0285] RNA-Isolation, RT-PCR and Real-Time RT-PCR
[0286] RNA extraction, first-strand cDNA synthesis, RT-PCR and real-time RT-PCR was performed as previously described (Koslowski, M. et al., Cancer Res. 62, 6750-6755 (2002), Koslowski, M. et al., Cancer Res. 64, 5988-5993 (2004)). Real-time quantitative expression analysis was performed in a 40 cycle RT-PCR. After normalization to HPRT (sense 5'-TGA CAC TGG CAA AAC MT GCA-3'; antisense 5'-GGT CCT TTT CAC CAG CAA GCT-3', 62.degree. C. annealing) gene-specific transcripts in tumor samples were quantified relative to normal tissues using .DELTA..DELTA.CT calculation.
[0287] siRNA Duplexes
[0288] The SEQ ID NO:540 siRNA duplexes (Qiagen, Hilden, Germany) were directed against target sequences 5'-NNC CAC AGA AGG UAC CAG UUA-3' (siRNA#1; sense (5'-CCA CAG AAG GUA CCA GUU AUU-3'), antisense (5'-UAA CUG GUA CCU UCU GUG GUU-3') and 5'-NNC AGC AAG ACU CCC UCU AAA-3' (siRNA#2; sense (5'-CAG CAA GAC UCC CUC UAA AUU-3'), antisense (5'-UUU AGA GGG AGU CUU GCU GUU-3') of the SEQ ID NO:540 mRNA sequence.
[0289] Cell Proliferation Analysis
[0290] 24 h after transfection with siRNA duplexes 1.times.10.sup.4 cells were cultured for 48 h in medium supplemented with 10% FCS. Proliferation was analyzed by measuring the incorporation of BrdU into newly synthesized DNA strands using the DELFIA cell proliferation Kit (Perkin Elmer, Boston, Mass.) according to the manufacturer's instructions on a Wallac Victor.sup.2 multi-label counter (Perkin Elmer, Boston, Mass.).
[0291] FIG. 3 shows the quantification of SEQ ID NO:540 mRNA expression in MCF-7 breast cancer cells by real-time RT-PCR 24 h after transfection with siRNA oligos. Compared to non-transfected cells and cells transfected with non-silencing (ns) siRNA both SEQ ID NO:540-specific siRNAs (siRNA#1 (SEQ ID NO:630, 631), siRNA#2 (SEQ ID NO:632, 633)) induce robust silencing of SEQ ID NO:540 expression.
[0292] FIG. 4 shows that silencing of SEQ ID NO:540 expression by transfection with siRNA oligos results in impaired proliferation of MCF-7 breast cancer cells. Proliferation was quantified 96 h after transfection with siRNAs by measuring incorporation of BrdU in newly synthesized DNA strands. These results show that SEQ ID NO:540 is a positive factor for the proliferation of breast cancer cells.
[0293] The nucleotide sequence according to SEQ ID NO:541 was deduced from SEQ ID NO:65 and codes for a 177 aa protein (SEQ ID NO:542) of unknown function. Expression of SEQ ID NO:541 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:543, 544); see FIG. 5. In normal tissues SEQ ID NO:541 is highly expressed in placenta and shows only weak expression in thymus. SEQ ID NO:541 is overexpressed in lung cancer. Based on these expression results, SEQ ID NO:541 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for this particular tumor type.
[0294] The nucleotide sequence according to SEQ ID NO:545 was deduced from SEQ ID NO:249 and codes for a member of the solute carrier (SLC) group of membrane proteins (SEQ ID NO:546). As is typical of integral membrane proteins, SLCs contain a number of hydrophobic transmembrane alpha helices connected to each other by hydrophilic intra- or extra-cellular loops. Depending on the SLC, these transporters are functional as either monomers or obligate homo- or hetero-oligomers. The protein encoded by SEQ ID NO:545 is a cell surface protein. Expression of SEQ ID NO:545 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:547, 548); see FIG. 6. Compared to normal tissues, SEQ ID NO:545 is overexpressed in malignant melanomas. Based on these expression results, SEQ ID NO:545 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for this particular tumor type.
[0295] The nucleotide sequence according to SEQ ID NO:549 was deduced from SEQ ID NO:4 and codes for a 763 aa protein (SEQ ID NO:550) of unknown function. The protein harbors two potential transmembrane domains and a typical fibronectin type III domain. Fibronectin is a high-molecular-weight extracellular matrix glycoprotein that binds to membrane spanning receptor proteins (integrins). In addition to integrins, they also bind extracellular matrix components such as collagen, fibrin and heparan sulfate. The protein encoded by SEQ ID NO:549 might represent a hitherto unknown new fibronection-like protein. Expression of SEQ ID NO:549 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:551, 552); see FIG. 7. Compared to normal tissues, SEQ ID NO:549 is overexpressed in ovarian cancer. Based on these expression results, SEQ ID NO:549 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular of this particular tumor type.
[0296] The nucleotide sequence according to SEQ ID NO:553 was deduced from SEQ ID NO:156 and codes for a 496 as protein (SEQ ID NO:554) of unknown function. The protein harbors a potential transmembrane protein. Expression of SEQ ID NO:553 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:555, 556); see FIG. 8. In normal tissues SEQ ID NO:553 is highly expressed in placenta. Compared to other normal tissues, SEQ ID NO:553 is overexpressed in colon cancer and ovarian cancer. Based on these expression results, SEQ ID NO:553 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0297] The nucleotide sequence according to SEQ ID NO:557 was deduced from SEQ ID NO:273. SEQ ID NO:557 represents a partial cDNA with no apparent open reading frame. Expression of SEQ ID NO:557 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:558, 559); see FIG. 9. In normal tissues high expression of SEQ ID NO:557 is detectable in breast. Compared to normal tissues, SEQ ID NO:557 is overexpressed in breast cancer. Based on these expression results, SEQ ID NO:557 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for this particular tumor type.
[0298] The nucleotide sequence according to SEQ ID NO:560 was deduced from SEQ ID NO:135. SEQ ID NO:560 has no apparent open reading frame. Expression of SEQ ID NO:560 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:561, 562); see FIG. 10. In normal tissues expression of SEQ ID NO:560 is detectable in duodenum and colon. Compared to normal tissues, SEQ ID NO:560 is overexpressed in colon cancer and ovarian cancer. Based on these expression results, SEQ ID NO:560 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0299] The nucleotide sequence according to SEQ ID NO:563 was deduced from SEQ ID NO:177. SEQ ID NO:563 has no apparent open reading frame. Expression of SEQ ID NO:563 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:564, 565); see FIG. 11. SEQ ID NO:563 is highly expressed in placenta. Compared to normal tissues, SEQ ID NO:563 is overexpressed in breast cancer, colon cancer, ovarian cancer, lung cancer and melanoma. Based on these expression results, SEQ ID NO:563 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0300] The nucleotide sequence according to SEQ ID NO:566 was deduced from SEQ ID NO:149 and codes for a 155 aa protein (SEQ ID NO:567) of unknown function. The protein sequence is partially homologous to members of the tumor necrosis factor receptor superfamily and harbors a potential transmembrane domain. The protein encoded by SEQ ID NO:566 might represent a new member of the tumor necrosis factor receptor superfamily. Expression of SEQ ID NO:566 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:568, 569); see FIG. 12. Compared to normal tissues, SEQ ID NO:566 is overexpressed in gastric cancer, breast cancer, colon cancer, ovarian cancer, lung cancer and melanoma. Based on these expression results, SEQ ID NO:566 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0301] The nucleotide sequence according to SEQ ID NO:570 was deduced from SEQ ID NO:53 and codes for a member of the kernel lipocain superfamily (SEQ ID NO:571). These secreted glycoproteins have distinct and essential roles in regulating an uterine environment suitable for pregnancy and in the timing and occurrence of the appropriate sequence of events in the fertilization process. Expression of SEQ ID NO:570 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:572, 573); see FIG. 13. SEQ ID NO:570 is highly expressed in placenta. Compared to other normal tissues, SEQ ID NO:570 is overexpressed in ovarian cancer, lung cancer and melanoma. Based on these expression results, SEQ ID NO:570 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0302] The nucleotide sequence according to SEQ ID NO:574 has no apparent open reading frame. Expression of SEQ ID NO:574 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:575, 576); see FIG. 14. SEQ ID NO:574 is highly expressed in placenta. Compared to other normal tissues, SEQ ID NO:574 is overexpressed in lung cancer and melanoma. Based on these expression results, SEQ ID NO:574 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0303] The nucleotide sequence according to SEQ ID NO:577 was deduced from SEQ ID NO:20. SEQ ID NO:577 represents a partial cDNA with no apparent open reading frame. Expression of SEQ ID NO:577 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:578, 579); see FIG. 15. SEQ ID NO:577 is highly expressed in placenta. Compared to other normal tissues, SEQ ID NO:577 is overexpressed in gastric cancer, breast cancer and lung cancer. Based on these expression results, SEQ ID NO:577 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0304] The nucleotide sequence according to SEQ ID NO:580 was deduced from SEQ ID NO:32. SEQ ID NO:580 represents a partial cDNA with no apparent open reading frame. Expression of SEQ ID NO:580 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:581, 582); see FIG. 16. SEQ ID NO:580 is highly expressed in placenta. Compared to other normal tissues, SEQ ID NO:580 is overexpressed in ovarian cancer and lung cancer. Based on these expression results, SEQ ID NO:580 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0305] The nucleotide sequence according to SEQ ID NO:583 was deduced from SEQ ID NO:257 and codes for a member of the homeobox class of transcription factors (SEQ ID NO:584). Expression of these proteins is spatially and temporally regulated during embryonic development. Expression of SEQ ID NO:583 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:585, 586); see FIG. 17. SEQ ID NO:583 is highly expressed in placenta and prostate. Compared to other normal tissues, SEQ ID NO:583 is overexpressed in colon cancer, ovarian cancer and lung cancer. Based on these expression results, SEQ ID NO:583 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0306] The nucleotide sequence according to SEQ ID NO:587 was deduced from SEQ ID NO:148 and codes for a member of the IGF-II mRNA-binding protein (IMP) family (SEQ ID NO:588). It functions by binding to the 5' UTR of the insulin-like growth factor 2 (IGF2) mRNA and regulating IGF2 translation. Expression of SEQ ID NO:587 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:589, 590); see FIG. 18. Compared to normal tissues, SEQ ID NO:587 is overexpressed in lung cancer. Based on these expression results, SEQ ID NO:587 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for this particular tumor type.
[0307] The nucleotide sequence according to SEQ ID NO:591 was deduced from SEQ ID NO:194 and codes for a 372 aa protein (SEQ ID NO:592) of unknown function. Expression of SEQ ID NO:591 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:593, 594); see FIG. 19. SEQ ID NO:591 is highly expressed in testis. Compared to other normal tissues, SEQ ID NO:591 is overexpressed in breast cancer, colon cancer, ovarian cancer, lung cancer and melanoma. Based on these expression results, SEQ ID NO:591 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0308] The nucleotide sequence according to SEQ ID NO:595 was deduced from SEQ ID NO:191 and codes for a 357 aa protein (SEQ ID NO:596) of unknown function. Expression of SEQ ID NO:595 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:597, 598); see FIG. 20. SEQ ID NO:595 is highly expressed in testis. Compared to other normal tissues, SEQ ID NO:595 is overexpressed in gastric cancer, colon cancer, ovarian cancer, lung cancer and melanoma. Based on these expression results, SEQ ID NO:595 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0309] The nucleotide sequence according to SEQ ID NO:599 was deduced from SEQ ID NO:18 and has no apparent open reading frame. Expression of SEQ ID NO:599 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:600, 601); see FIG. 21. SEQ ID NO:599 is highly expressed in placenta. Compared to other normal tissues, SEQ ID NO:599 is overexpressed in gastric cancer, breast cancer, lung cancer and melanoma. Based on these expression results, SEQ ID NO:599 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0310] The nucleotide sequence according to SEQ ID NO:602 was deduced from SEQ ID NO:133 and codes for a member of the von Willebrand factor domain superfamily of extracellular matrix proteins (SEQ ID NO:603). Expression of SEQ ID NO:602 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:604, 605); see FIG. 22. Compared to normal tissues, SEQ ID NO:602 is overexpressed in ovarian cancer and lung cancer. Based on these expression results, SEQ ID NO:602 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0311] The nucleotide sequence according to SEQ ID NO:606 was deduced from SEQ ID NO:128 and codes for a member of the Borg family of CDC42 effector proteins (SEQ ID NO:607). Borg family proteins contain a CRIB (Cdc42/Rac interactive-binding) domain. They bind to, and negatively regulate the function of CDC42. CDC42, a small Rho GTPase, regulates the formation of F-actin-containing structures through its interaction with the downstream effector proteins. Expression of SEQ ID NO:606 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:608, 609); see FIG. 23. Compared to normal tissues, SEQ ID NO:606 is overexpressed in gastric cancer, colon cancer and lung cancer. Based on these expression results, SEQ ID NO:606 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0312] The nucleotide sequence according to SEQ ID NO:610 was deduced from SEQ ID NO:118 and has no apparent open reading frame. Expression of SEQ ID NO:610 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:611, 612); see FIG. 24. Compared to normal tissues, SEQ ID NO:610 is overexpressed in gastric cancer, breast cancer and lung cancer. Based on these expression results, SEQ ID NO:610 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0313] The nucleotide sequence according to SEQ ID NO:613 was deduced from SEQ ID NO:116 and codes for a 76 aa protein (SEQ ID NO:614) of unknown function. Expression of SEQ ID NO:613 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:615, 616); see FIG. 25. SEQ ID NO:613 is highly expressed in placenta. Compared to other normal tissues, SEQ ID NO:613 is overexpressed in breast cancer, lung cancer and melanoma. Based on these expression results, SEQ ID NO:613 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0314] The nucleotide sequence according to SEQ ID NO:617 was deduced from SEQ ID NO:267. SEQ ID NO:617 represents a partial cDNA with no apparent open reading frame. Expression of SEQ ID NO:617 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:618, 619); see FIG. 26. SEQ ID NO:617 is highly expressed in placenta and endometrium. Compared to other normal tissues, SEQ ID NO:617 is overexpressed in lung cancer and melanoma. Based on these expression results, SEQ ID NO:617 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0315] The nucleotide sequence according to SEQ ID NO:620 was deduced from SEQ ID NO:182 and codes for a 829 as protein (SEQ ID NO:621) harboring multiple putative transmembrane domains and a patched family domain. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals. SEQ ID NO:620 might represent a novel member of the Patched family. Expression of SEQ ID NO:620 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:622, 623); see FIG. 27. SEQ ID NO:620 is highly expressed in lung. Compared to other normal tissues, SEQ ID NO:620 is overexpressed in ovarian cancer and melanoma. Based on these expression results, SEQ ID NO:620 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0316] The nucleotide sequence according to SEQ ID NO:624 was deduced from SEQ ID NO:184 and codes for a 323 as protein (SEQ ID NO:625) similar to TWIK-related acid-sensitive K.sup.+ channel, a member of the superfamily of potassium channel proteins that contain two pore-forming P domains. Expression of SEQ ID NO:624 in normal and cancerous tissues was quantified by real-time RT-PCR using sequence-specific oligos (SEQ ID NO:626, 627); see FIG. 28. SEQ ID NO:624 is highly expressed in lung. Compared to other normal tissues, SEQ ID NO:624 is overexpressed in gastric cancer and lung cancer. Based on these expression results, SEQ ID NO:624 and its expression products qualify as molecular markers and/or target candidates for targeted therapies, in particular for these particular tumor types.
[0317] The presently described technology is now described in such full, clear, concise and exact terms as to enable any person skilled in the art to which it pertains, to practice the same. It is to be understood that the foregoing describes preferred embodiments of the technology and that modifications may be made therein without departing from the spirit or scope of the invention as set forth in the appended claims.
Sequence CWU
1
1
6411398DNAHomo sapiensmisc_feature(272)..(272)n is a, c, g, or t
1aacataggtg gaccgctgct gagtccaggc ttacttgcag agatctatgc tggccaggcc
60ctgtgctagg cagcagagga catggaataa aatcaaataa ggtcactgtg tgcaggcacc
120tcacggtgtg gtaaaggagc agccccatcc acaggttcta ttaattccag cctgtgagaa
180ttggaaccac agggtgaatt ttggaggaca ggcacttaca ctaatctgga agcataatat
240ataaagagta cctacaaatc aataaaaaaa ananaaaaaa aaanagcaaa gtatatgaac
300agaaaattca atgaaaagga aatagaaatg gctcttaaat gaatgaaaac atactctcac
360tcagagaaat gaaaatttaa cccatgtcaa gatacttg
398299DNAHomo sapiens 2atgcacagag catcacgtac aatggctcca tggacagccc
agtgcccttg taccctaccg 60attgcccccc ttcttatgag gcagtcatgg gactacgag
993266DNAHomo sapiens 3aggagaaaac ctctacttgt
cctgcttcgc ggaatctaac ccaccggcag agtatttttg 60gacaattaat gggaagtttc
agcaatcagg acaaaagctc tctatccccc aaattactac 120aaagcataga gggctctata
cttgctctgt tcgtaactca gccactggca aggaaagctc 180caaatccatg acagtcgaag
tctctgctcc ttcaggaata ggacgtcttc ctctccttaa 240tccaatatag cagccgtgaa
gtcatt 2664492DNAHomo
sapiensmisc_feature(369)..(369)n is a, c, g, or t 4aaaggagtca tcagcgtctc
tttcctggat tatatctggg ttacctaaag ctctgtagct 60ctgtggatca aatcaaagtt
cttgttaccc aaaagttgcc caacattctc tgccacgtga 120agatccgtga aaacaataat
atttctagag aggaatggga atggatccaa aagctttctg 180gctctgaatc tatggaaagt
gtggatcata cttctgactg ccccatgcaa ttgttcttct 240acgagctcca gatggcagtg
aaagctctcc ttcagcagat caatatacct ctacaccagg 300caaggaactt ccgcctctac
acacaggagg tgttggaaat gggtcacaat gtgtcctttc 360ttctcctgnt cnctgcctca
gacgacgtct gtacagcccc aggacagaat aatccttnna 420cccnacactc agggtttctt
aanctccctc ttcagatgtt tgaacttggt atagtagctt 480gtttcaccta ga
4925133DNAHomo sapiens
5ttcaaaaact gctggtgagc ctatggaaga ggagccagcc ttgtgaagtg ccaagtcccc
60ctctgatatt tcctgtgtgt gacatcattg tgtatccccc caccccagta ccctcagaca
120tgtcttgtct gct
1336371DNAHomo sapiensmisc_feature(178)..(178)n is a, c, g, or t
6ggtttcggcc tctgcgaaag tgaaatgccc aagcctccgg ccaaagccca gaccagtaca
60gtatgaattg tcctatgaga ctgaggggtt cggcttcatt cctacctgcc cgcaaagctc
120gcccccagcc tcgaaaacaa agcgactggt ctgacgtggg gtccctgcgc ccctcctnta
180gcgcgacagg acccccccag ggaagagcca gtacccgtgg gatgtcaccc cgtccccatc
240taccggggtg gggggcctga aaggagaacg atttaaaata atcttcagaa agaaaaggga
300ggagggagcg ggtgacacat cgttcacata aacccaattt ctggtttcga gtgaagtcaa
360gatctccgcc c
3717215DNAHomo sapiensmisc_feature(153)..(153)n is a, c, g, or t
7ccatttaaca tgtatagtag gtcaacattg gtgcatccag aaaatgaagc atttaggaaa
60tctgtttcag tgtcttttca atgtgtgtaa cttttacttg caaaccaatg gaaccaagaa
120agtcatcatt tgcctaaaat gcagtcatca ccncaaatga ttcatttata ctatgtgagt
180taattgcctt catctcatta atggccaagg aggga
2158322DNAHomo sapiens 8actgttcagt actgcaaccc caggatacac aatggcatct
ggctctgttt attcaccacc 60tactcggcca ctacctagaa acaccctatc aagaagtgct
tttaaattca agaagtcttc 120aaagtactgt agctggaaat gcactgcact gtgtgccgta
ggggtctcgg tgctcctggc 180aatactcctg tcttatttta tagcaatgca tctctttggc
ctcaactggc agctacagca 240gactgaaaat gacacatttg agaatggaaa agtgaattct
gataccatgc caacaaacac 300tgtgtcatta ccttctggag ac
3229547DNAHomo sapiens 9ttacccttca ttcacctatt
acggttcagg agaaaacctc gacttgtcct gcttcacgga 60atctaaccca ccggcagagt
atttttggac aattaatggg aagtttcagc aatcaggaca 120aaagctcttt atcccccaaa
ttactagaaa tcatagcggg ctctatgttt gctctgttca 180taactcagcc actggcaagg
aaatctccaa atccatgaca gtcaaagtct ctggtccctg 240ccatggagac ctgacagagt
ttcagtcatg actgcaacaa ctgagacact gagaaaaaga 300acaggctgat accttcatga
aattcaagac aaagaagaaa aaaactcaat gttattggac 360taaataatca aaaggataat
gttttcataa ttttttattg gaaaatgtgc tgattctttg 420aatgttttat tctccagatt
tatgaacttt ttttcttcag caattggtaa agtatacttt 480tgtaaacaaa aattgaaata
tttgcttttg ctgtctatct gaatgcccca gaattgtgaa 540actactc
54710396DNAHomo sapiens
10gcaggggacg caccaaggat ggagatgttc caggggctgc tgctgttgct gctgctgagc
60atgggcggga catgggcatc caaggagccg cttcggccac ggtgccgccc catcaatgcc
120accctggctg tggagaagga gggctgcccc gtgtgcatca ccgtcaacac caccatctgt
180gccggctact gccccaccat gacccgcgtg ctgcaggggg tcctgccggc cctgcctcag
240gtggtgtgca actaccgcga tgtgcgcttc gagtccatcc ggctccctgg ctgcccgcgc
300ggcgtgaacc ccgtggtctc ctacgccgtg gctctcagct gtcaatgtgc actctgccgc
360cgcagcacca ctgactgcgg gggtcccaag gaccac
39611311DNAHomo sapiens 11aaatggtggt gtttgactgg tatatgacct tcctctggag
gtgatcaacc agtaagggaa 60aatcgctcca agtgagcatg cacacaacct cagtaaacac
actgtgcatg tggcttctcc 120caagtactag caggccactg cacatgtcac aactgagcaa
cagcccaccc caatggaggg 180atcaagggag gagaagaaaa accccggaac caaaagccag
tttataaaaa tcctgagcca 240aaggctgagg ggggcacttg atctctcaag ttccctactt
ggccctcttc caagtgtgat 300ttgcttcttt t
31112246DNAHomo sapiensmisc_feature(184)..(184)n
is a, c, g, or t 12gccttagccc ccgggattta gagcatcctc gcgaccaccc ggaggcttct
gggggccact 60ctgcggatga ggaagctgac gcctgggtgc agaaccccgg acccccggat
tcagagccca 120ggtccagccg cgcttccgca caaacttgcg ctcggagcaa gtcccctcct
tcccagcact 180catntgagac cagaggtgtc cccaccgtcc ccgctagcag cgctggttat
attgtgggcc 240aacctt
24613516DNAHomo sapiensmisc_feature(149)..(150)n is a, c, g,
or t 13ccttttgctt gagcagggtt cccaggaggg agaaagagaa gacaagagcc tgatgcccaa
60ctttgtgtgt gtggggacgg gggagtcagg gccccccaag tcccacaata gccccaatgt
120ttgcctatcc acctccccca agccccttnn ccnnnnnnnc nnntnacnnn nnnnctgctg
180ctgctgctgc tgctgcttaa aggctcatgc ttggagtggg gactggtcgg tgcccagaaa
240gtctcttctg ccactgacgc ccccatcagg gattgggcct tctttccccc ttcctttctg
300tgtctcctgc ctcatcggcc tgccatgacc tgcagccaag cccagccccg tggggaaggg
360gagaaagtgg gggatggcta agaaagctgg gagataggga acagaagagg gtagtgggtg
420ggctaggggg gctgccttat ttaaagtggt tgtttatgat tcttatacta atttatacaa
480agatattaag gccctgttca ttaagannnn nnnnnn
51614162DNAHomo sapiensmisc_feature(92)..(92)n is a, c, g, or t
14gagcctagag agtaaggaac gttatatagt tttccccaaa ggttcacttg aaagaacttt
60tcattggttg tcatggtagt aatgtcctga tnttgaaatc tcccagaacc tagtagctct
120taaacatgct ttcatcttgg ttcctttggt ctgacggaaa ct
16215523DNAHomo sapiensmisc_feature(49)..(49)n is a, c, g, or t
15tttgcaaaag gtttccggga cactggaaat ggccgaagag aaaaaagana acagctcacc
60ctgcagtcca tgagggtgtt tgatgaaaga cacaaaaagg agaatgggac ctctgatgag
120tcctccagtg aacaagcagc tttcaactgc ttcgcccagg cttcttctcc agccgcctcc
180actgtaggga catcgaacct caaagattta tgtcccagcg agggtgagag cgacgccgag
240gccgagagca aagaggagca tggccccgag gcctgcgacg cggccaagat ctccaccacc
300acgtcggagg agccctgccg tgacaagggc agccccgcgg tcaaggctca ccttttcgct
360gctgagcggc cccgggacag cgggcggctg gacaaagcgt cgcccgactc acgccatagc
420cccgccacca tctcgtccag cactcgcggc ctgggcgcgg aggagcgcag gagcccggtt
480cgcgagggca cagcgccggc caaggtggaa gaggcgcgcg cgc
52316424DNAHomo sapiens 16actcggtcac actcagtaag tccttgcaga gtccatgggt
ttcttcgaca agtggcttca 60aggaagggaa ttcccaccct tgtcttccag caaggccaca
cacatgaaac cagcagaaaa 120gagtcttatt tgctggaaag acccccagca agggcatagt
gagcccttac agtggttcca 180gtcagaaaag gcaccacttg ggtgggcaca gccccatggg
tgtccaactt ggtaagcaga 240gcaaggctgg acttgagtcc ccgtcctcca caaaacacag
agccacaagc cccagccctg 300cagcagccct ccggaagcag cggggcactg gtttccttgt
cccctgccat ctaccgagtg 360gctcactctc aggtgggagt gctggtgatg gttaattagg
actgcagaaa catgagcctc 420ctta
42417524DNAHomo sapiens 17ctatctttct ggtcacattg
tcggtgtttc tgcatgttct ccattccgct cctgatgtgc 60aggattgccc agaatgcacg
ctacaggaaa acccattctt ctcccagccg ggtgccccaa 120tacttcagtg catgggctgc
tgcttctcta gagcatatcc cactccacta aggtccaaga 180agacgatgtt ggtccaaaag
aacgtcacct cagagtccac ttgctgtgta gctaaatcat 240ataacagggt cacagtaatg
gggggtttca aagtggagaa ccacacggcg tgccactgca 300gtacttgtta ttatcacaaa
tcttaaatgt tttaccaagt gctgtcttga tgactgctga 360ttttctggaa tggaaaatta
agttgtttag tgtttatggc tttgtgagat aaaactctcc 420ttttccttac cataccactt
tgacacgctt caaggatata ctgcagcttt actgccttcc 480tccttatcct acagtacaat
cagcagtcta gttcttttca tttg 52418538DNAHomo sapiens
18gtagggcgaa ctctgctata cagtttatga tgtcagagtg aatactttct ttgagttgca
60gtcagaaact gtagattttt aaaaatttaa aattcattat tctctgtcag tattccaaag
120tgtatacaga aagctattgc actgttcagg agatggcgct taacattttg gaaattcaag
180gtgatgaatg tccagataag actatctctc ctggtacaaa gtttgacaat gctgaacatt
240tttaaaggtt ctttttgata tacaaagtgc accaatgagt gctttttaat tcttacaata
300attctgggtg aggtaggtat ttttccaatt cccattttat gcttcggtag ccctttgtat
360ttatacttca aaacacttgg ctctcttgta attatttaag aaattagttg tgattatttg
420tttaatgtgc aggagttaca aaaggcaagc tttagaacaa gacagacctg gttatgattc
480ctggctctga aagctgtaca ccctgtgacc ctagacaggt gttttaatgc ctcgctgc
53819324DNAHomo sapiensmisc_feature(294)..(295)n is a, c, g, or t
19tcttggtctt ctgctcatgg atgacgcacc agtcgtgcat gttcttggtg aagtcggcca
60gcctgagcaa gcggaggacc tacgccggcc tggcattcca cgcctacggg aaggcaggca
120agatgctggt ggagaccagc atgatcgggc tgatgctggg cacctgcatc gccttctacg
180tcgtgatcgg cgacttgggg tccaacttct ttgcccggct gttcgggttt caggtgggcg
240gcaccttccg catgttcctg ctgttcgccg tgtcgctgtg catcgtgctc ccgnncagcc
300tgcagcggaa catgatggcc tcca
32420414DNAHomo sapiens 20agaaagcaga gcagcctcct ggaagaaggc cttgtcagct
ttgtctgtgc ctcgcaaatc 60agaggcaagg gagaggttgt taccagggga cactgagaat
gtacatttga tctgccccag 120ccacggaagt cagagtagga tgcacagtac aaaggagggg
ggagtggagg cctgagaggg 180aagtttctgg agttcagata ctctctgttg ggaacaggac
atctcaacag tctcaggttc 240gatcagtggg tcttttggca ctttgaacct tgaccacagg
gaccaagaag tggcaatgag 300gacacctgca ggaggggcta gcctgactcc cagaacttta
agactttctc cccactgcct 360tctgctgcag cccaagcagg gagtgtcccc ctcccagaag
catatcccag atga 41421531DNAHomo sapiens 21caaagtcatc tgaacttccg
tttccccagg gcctccagct gccctcagac actgatgtct 60gtccccaggt gctctctgcc
cctcatgccc ctctcaccgg cccagtgccc cgactctcca 120ggctttatca aggtgctaag
gcccgggtgg gcagctcctc gtctcagagc cctcctccgg 180cctggtgctg cctttacaaa
cacctgcagg agaagggcca cggaagcccc aggctttaga 240gccctcagca ggtctgggga
gctagagcaa aggagggacc tcaggccttc cgtttcttct 300tccagggtgg ggtggcctgg
tgttccccta gccttccaaa cccaggtggc ctgcccttct 360ccccagaggg aggcggcctc
cgcccattgg tgctcatgca gactctgggg ctgaggtgcc 420ccggggggtg atctctggtg
ctcacagccg agggagccgt ggctccatgg ccagatgacg 480gaaacagggt ctgaccaagt
gccaggaaga cctgtgctat aaaccaccct g 53122522DNAHomo sapiens
22atgtcctgcc atttggtcat aagacagttg catttactct gctaccattg cttcagttga
60tatgaagaga gaaagctgtg ttgtgattta cactggatat ggaaatagag aggaacaaat
120ctgtctgatc tactttcttc aacctctgta gtagctaata atataggaca gaatgctcca
180aagaatgaaa atgaaagtca agattcaatg gatgaaagtg agaactcctc caggtcctgg
240aaacaaacca tttagcatca ggtcagaagc tactccatgg aattctgaga ccacgaaagc
300caggtcaggt ctcaaattca gtagcccacc acccacacca ccacccacac caccctgctt
360cccctcatgc ttgctgcctc catttccttc tggaccacca ataattcccc caccacctcc
420cacaggtcta gattttcttg atgatgttaa tgttttatga agtatgctaa tctcttggta
480cattaagtgg ctatcatact ggctattata cagggctcaa gc
52223520DNAHomo sapiensmisc_feature(98)..(98)n is a, c, g, or t
23gaaaggcctg caattgtgtc ttcacgatgc ttttccaaga cagccaaggc aggtataatt
60ttcctcagca agaaagagga acctcggagg tgtgcacngc ctggctggcc acccaggtat
120tggcaaaagt gactgtcggg cntgctggcc cggcccccgc ccgccgtccc tggagcactc
180acgatgcggt ccggcggcgg cgtgctccgg atgaagcact tgatctggcc cttctcgccg
240tggagggcgt gctgggtctg ggtgctggag atgatggggg gtcctgttga gaaacagcgt
300cccattaggc acccgggaag ggcacgtccc tgctggcgcc ctcttgggtg ggttcagaag
360tgtattcatt aatccaagca ttcagcaaac atttgccgaa ggcctgtatg tgcaaggtaa
420agtgcaaggt agaggactca gagataaatt aggcattcag tcataaacct ctcaagggat
480catgagcgaa tgcttctaag tcagaacccc cagaagatac
52024488DNAHomo sapiens 24agcagaacct cctagggttt ctccttccac tccttgccat
gatcttcttc tactcccgta 60ttggttgtgt cttggtgagg ctgaggcccg caggccaggg
ccgggcttta aaaatagctg 120cagccttggt ggtggccttc ttcgtgctat ggttcccata
caatctcacc ttgtttctgc 180atacgctgtt ggacctgcaa gtattcggga actgtgaggt
cagccagcat ctagactacg 240cactccaggt aacagagagc atcgccttcc ttcactgctg
cttttccccc atcctgtatg 300ccttctccag tcaccgcttc cgccagtacc tgaaggcttt
cctggctgcc gtgcttggat 360ggcacctggc acctggcact gcccaggcct cattatccag
ctgttctgag agcagcatac 420ttactgccct tgaggaaatg actggcatga atgaccttgg
agagaggcag tctgagaact 480accctaac
48825552DNAHomo sapiens 25gaggagagac ctgcgtggga
taatcaacag gggtctggag gacggggaga gctgggaata 60tcagatctga ctgcgtgttc
tcacttcgct tcctggaact tgctctcatt ttcctgggtg 120catcaaacaa aacaaaaacc
aaacacccag aggtctcatc tcccaggccc caggggagaa 180agaggagtag catgaacgcc
aaggaatgta cgttgagaat cactgctcca ggcctgcatt 240actccttcag ctctggggca
gaggaagccc agcccaagca cggggctggc agggcgtgag 300gaactctcct gtggcctgct
catcaccctt ccgacaggag cactgcatgt cagagcactt 360taaaaacagg ccagcctgct
tgggcgctcg gtctccaccc cagggtcata agtggggaga 420gagcccttcc cagggcaccc
aggcaggtgc agggaagtgc agagcttgtg gaaagcgtgt 480gagtgaggga gacaggaacg
gctctggggg tgggaagtgg ggctaggtct tgccaactcc 540atcttcaata aa
55226511DNAHomo sapiens
26aagcctcaag gcacttctag gacctggctc ttctcaccaa gatgaactca ctggtttctt
60ggcagctact gcttttcctc tgtgccaccc actttgggga gccattagaa aaggtggcct
120ctgtggggaa ttctagaccc acaggccagc agctagaatc cctgggcctc ctggcccccg
180gggagcagag cctgccgtgc accgagagga agccagctgc tactgccagg ctgagccgtc
240gggggacctc gctgtccccg ccccccgaga gctccgggag ccgccagcag ccgggcctgt
300ccgcccccca cagccgccag atccccgcac cccagggcgc ggtgctggtg cagcgggaga
360aggacctgcc gaactacaac tggaactcct tcggcctgcg cttcggcaag cgggaggcgg
420caccagggaa ccacggcaga agcgctgggc ggggctgggg cgcaggtgcg gggcagtgaa
480cttcagaccc caaaggagtc agagcatgcg g
51127131DNAHomo sapiens 27ctcctccagc aagacagatg cctagcccgt cctcaggaat
ctgccgccag ggagaatggc 60aaccctggcc agatagctgg aagcacaggg ttgctcttca
acctgcctcc cggctcagtt 120cactataaga a
13128304DNAHomo sapiensmisc_feature(41)..(41)n is
a, c, g, or t 28tttcctctga gagaacagcg gtcttctgtc tgctgtggca nagcaagtca
cttcttcttg 60tagtgagaac tgaaaccaga accnatcatg tgccacttcc tggacacctc
ctattaaata 120ttaaagtcct ctcaccacag aagccggagt ttagtggtta ggggcacagg
ttcttagata 180tgaacatcag ttgcaaccta ccaactgcat gctcttggac aatttacatt
tctgtgtatc 240agctttcctt tttctttaga atgagatatt aatagtagca acccagaatt
gtcatgaagc 300ctaa
30429226DNAHomo sapiens 29catggcaaag gcttgcccca aatctcaact
tctcagacgt tccatacccc cacatgccaa 60tttcagcacc caactgagat ccgaggagct
cctgggaagc cctgggtgca ggacactggt 120cgagagccaa aggtccctcc ccagacatct
ggacactggg catagatttc tcaagaagga 180agactcccct gcctccccag ggcctctgct
ctcctgggag acaaag 22630567DNAHomo
sapiensmisc_feature(195)..(195)n is a, c, g, or t 30ggccaccaga ggattctcag
ggctcctttg tcctggactg tggaactggg ggcagctggt 60ccctggggtc tctgaagtca
gtgtctccct cactgctcac tgccatggtg tctctgcctc 120tgcttctctg tgtccctcat
cttcctccca cttcattctg actggcaagc cctgtcctgc 180acagcttctt ccccnacccc
taggccttcc ccaganactc cctctnacta ggctggctgt 240tctgttccct tcccnnctaa
nactgtggcc tggcccacct cccnaggaaa taggaaaggt 300gcagaaatca ccntggagtt
gccactcntg ccnnggcttc atctcgagcc aatgtnccca 360ggtcactaag agaatgagct
tccactgtat tcccatccag ggctctttcc ntttgtgagg 420ctgacctgtg gacaagacaa
tgggacaggg ataggcagtt cctccatcca ntntcataat 480tgccaggcaa gntcttnnnn
ccncctgcan nancctcccc agtggatcag gggttagaga 540tattcaaggg tagtttcagg
agcacag 56731448DNAHomo
sapiensmisc_feature(82)..(82)n is a, c, g, or t 31taatgcggac gtaccgactg
ccagatcttt acactcaccc ctccacctgc cccgaggagt 60ccggtcacaa gggcccagcc
antcacaaag acaccnnggt gtcccttcca tttttttcca 120cgaaggccca gaatccattt
taggtttcca aacagacctt tcgtcccttc aaggtgtaac 180caccgttttc cattccagcc
attttattgg ccacaccgtt accttactta taggtatttc 240cccagaagaa gactccagag
aggaagctca tctgaggaaa gctgagaggg aagagaaacc 300caaacatact gaagcaaaaa
aaagcctatc cttcagaaaa aagcaacaaa aagatttctg 360ttttatcttt cgaaactaaa
actattggat ttgaagatta agtatcctaa acatcactga 420ctagaaactg ttctctttgt
cagcagtg 44832396DNAHomo
sapiensmisc_feature(141)..(141)n is a, c, g, or t 32agtgtgcatg ttcactgggc
atcttccctt cgaccccttt gcccacgtgg tgaccgctgg 60ggagctgtga gagtgtgagg
ggcacgttcc agccgtctgg actctttctc tcctactgag 120acgcagccta taggtccgca
ngccagtcct cccaggaact gaaatagtga aatatgagtt 180ggcgaggaag atcaacatat
aggcctaggc caagaagaag tttacagcct cctgagctga 240ttggggctat gcttgaaccc
actgatgaag agcctaaaga agagaaacca cccactaaaa 300gtcggaatcc tacacctgat
cagaagagag aagatgatca gggtgcagct gagattcaag 360tgcctgacct ggaagccgat
ctccaggagc tatgtc 39633484DNAHomo sapiens
33cggtggcttg caacatgctc atgccagagc ccgctcaacg ctggctggtg ggcttcgtgt
60tgtacacatt tctcatgggc ttcctgctgc ccgtgggggc tatctgcctg tgctacgtgc
120tcatcattgc taagatgcgc atggtggccc tcaaggccgg ctggcagcag cgcaagcgct
180cggagcgcaa gatcacctta atggtgatga tggtggtgat ggtgtttgtc atctgctgga
240tgcctttcta cgtggtgcag ctggttaacg tgtttgctga gcaggacgac gccacggtga
300gtcagctgtc ggtcatcctc ggctatgcca acagctgcgc caaccccatc ctctatggct
360ttctctcaga caacttcaag cgctctttcc aacgcatcct atgcctcagc tggatggaca
420acgccgcgga ggagccggtt gactattacg ccaccgcgct caagagccgt gcctacagtg
480tgga
48434393DNAHomo sapiens 34tctgacttcc agaacctacg ataatagact ccatgaaatc
tgtaatcagt ggccacagga 60aactcatgca acagcccttc caaggggctc cccagcaaag
cctccgtggt gtctgccccc 120aacccctgtg cctcctggga cacaagacag gcccagcaag
ggtgggggtg ccacggaaag 180cttggtggct gggcaggtcc ccagagggcc gccatcagtc
ctcaaagaca tgctcagatg 240cagtggctca ggcctggcac cagctggtcc caaggtgggg
tggtgagggt acatctgctg 300tgcacacgtg gctggacgcg ctgggggcag gtccaggtca
gcttcaagga ctctgcccag 360gctaacccta gaggcctcta gtgccagcag tta
39335493DNAHomo sapiens 35aggcccatgt gctgtttttg
acttcagtac ttcagattgc tgtgggaaca caggaggcag 60cagccagatg agaaattgag
tctgactctg gagtattata aagtccttat agttactggc 120attaggtata gggtctgtat
tattaaagag aaattattca ccaaacactt gttaaaaatg 180gcaagacagt ttatttaaga
gcattgcaat aggtaagtgc tatggtctca atgtttgtgt 240ctccctcaaa ttcataagtt
gaaacttcac ttccaagatg aaggaattag gaggtgggca 300ctttaaggga tgattatgtc
ataggccaga gccctcatga acgagatcag tgcccttcta 360aaagaggcat tgggagagac
ccctcacctt ttccatcata tgaggacaca gccaggaagc 420atcatccacg aaccagaaaa
ttggccctta ccagacactg aatctgctga tgtcctgacc 480atggacttct gag
49336336DNAHomo sapiens
36acgctgcagc gtcacattaa tctttgtgcc gccagtgcct atgccatgct tagtatgcat
60caaatatttg agcagtacac aagtgagtac tctgagagct ccccccacca aaaatatgat
120gattaaatac agttatgatc agatccccag agtgtggctc taaactgtat gggggccaag
180tttgaatact gttgtgtctt acactgttat tacctatcca gtatctattt ccccatattc
240cttataaata aaacctagat tttgattggg acagtaaggt gtcccactga aaactcattt
300ctctaaccaa tgtgatgcca gtgcttgccc aaaaag
33637507DNAHomo sapiens 37gtgagtgaac gtgaaggcct gcagcattac tgtacactac
catagatttt tatcaacact 60gtacacttag ggacactaaa cttatttaaa cattttttct
tcaaaaataa attaacctca 120gctcactgta actttataag ctttatattt aaaaaaactt
tttgactctt ttgtagtaac 180acttagctta aaacacaaac acattgtaca gttacacaaa
atattttctt aaaaaatatt 240ttattatatc ctattctata agcttttcct tgtttttcac
ttttttttaa cttttaaact 300ttttataaaa actaagacac aaacacacac attagtgcag
gcctgcatag catcaggatc 360atcagtatca ctgtctccca cctccgcatc ttgtcccact
gaaaggtctt cagcgggaat 420atcatgcatg gagctgtcat ctcctgtgat aacaatgcct
tcttctggat acctcctgaa 480ggacctggtt gagcctgttt tacagtt
50738423DNAHomo sapiens 38gaatccctta agcagaacaa
ccgtgatgcc atggaactca agcccaacgg cggtgctgac 60caaaaatgtc tcaaagtcaa
cagcccaata agaatgaaga atggaaatgg aaaagggtgg 120ctgcgactca agaataatat
gggagcccat gaggagaaaa aggaagactg gaataatgtc 180actaaagctg agtcaatggg
gctattgtct gaggacccca agagcagtga ttcagagaac 240agtgtgacca aaaacccact
aaggaaaaca gattcttgtg acagtggaat tacaaaaagt 300gaccttcgtt tggataaggc
tggggaggcc cgaagtccgc tagagcacag tcccatccag 360gctgatgcca agcacccctt
ttatcccatc cccgagcagg ccttacagac cacactgcag 420gaa
42339365DNAHomo
sapiensmisc_feature(244)..(244)n is a, c, g, or t 39ttactctgtg attaacttcc
ttctccctca ccccaaaata cagaagagtg aaatctctga 60ccagaaagtt cctggcacct
accttgggtt ctgtgaaaaa ataatggccc tggttttcaa 120tgctgccaaa gttaagaaaa
gttttcaccc cttcatttta aagcagccat aaagtgccat 180gtgtttaacc gcaggaaaaa
aagggtcttt ttaactattg agaagtagct tttcatatcc 240ccancagggg aangaaagag
cgggaaccag gagactcgtg aggactgcaa agatggtcct 300ccctgggtac ttctgctgct
ctcttctctc cagagctact ttgtgattgg cctgatggtc 360agacc
36540389DNAHomo sapiens
40gatggagcat catgggttgg attattactg agttcaataa tctggtgggt tttgccagct
60agaaacataa taaaatacat gataaaggaa tagaaaggaa atatatttat ttgaaattaa
120attactgctt ataaattcat gtctctgatt ttacaaagtg taatgggtaa aattaccata
180ttctttttct tatttcaatc catacaatga gagtcatgtt cagtttttca ctgacttcat
240gctgggtaat gttcactctg cattagcggt tgccatgttc accgttttct tacaatgtct
300atccagtgct tgttactgtc tcactgacag acagaagtct agctgttttc atccacataa
360tggcaggcag ggctagtgtt gctgctgct
38941537DNAHomo sapiens 41ttatgtccct gctggtattt ttgctttttc ataaaaatta
tcatttattt ttctcaactg 60catattgcca tcttcatttt ttaattttct cacatattca
tagaattgtt ctctgtaata 120gatactgtac agtaattatt tgttgatcga ttaacctttt
caatgctact tccgacacct 180acttcccatc ctccgtgtaa cagatactgt cagttaccta
tccttaacag cctttccact 240cccaacttct gtgaatggac aagagatgca attgtgatca
ctgaacatga ggcaacatct 300tctaggaaga catttccata gtcttcagac aaaagggaga
gatatctttt cagacaatct 360ttgaacaatc ctatatgaag cttacctgaa gttgctgtag
ccgtttggca agtctgggga 420gactaacaga cacactgagg atagcagaaa ataaagatag
aaacagccca ggtttttggt 480gaaattcatg agcttctgaa taacgaaccc cataccaccc
tacctctata aaagaat 53742351DNAHomo sapiens 42tggatcccag catcgttggc
aatagggttt taggtggagt ctatctggca ttcagagaag 60agtcaggaaa acaattgtat
tcccagcctg tgtccctagg gcacaagcaa atcccaaatt 120ctcctcctga accctccaaa
tttgtctaag aacttcgaaa actttaacaa acaggctgat 180atcttcataa tattcccagc
ctagaccaag caggaagaac attgatttca ttgaaataat 240tgataataat gaagataatg
tttttatgat ttttatttga aaatttgcta attctttaaa 300tggtttgttt tctacattga
tggaattttt ctcttttaat ctatctacag c 35143528DNAHomo sapiens
43tctgtttatc ccccaaatta ctacaaagca tagcgggctc tatgtttgct ctgttcgtaa
60ctcagccact gggcaggaaa gctccacatc gttgacagtc aaagtctctg cttctacaag
120aataggactt cttcctctcc ttaatccaac atagcagctg tgatgtcatt tctgtatttc
180aggaagactg gcaggagatt tatggaaagg tctcttacaa ggactcttga atacaagctc
240ctgataactt caagatcata ccactggact aagaactttc aaaattttaa tgaacaggct
300gataccttca tgaaattcaa gacaaagaag aaaaatactc aatgttattg gactaaataa
360tcaaaaggat aatgatttca taattttcta tttgaaaatg tgctgattct tggaatgttt
420cattctccag atttatgaac attttttctt gagcaattgg taaagtatac ttttgtaaac
480aaaaattgaa acatttcctt ttgctctcta tctgagtgcc ccagaatt
52844545DNAHomo sapiens 44gggacacacc agcacagtct ggtaggctac agcagcaagt
ctctaaagaa aggctgagaa 60cacccagaac aggagagttc aggtccagga tggccagcct
gttccggtcc tatctgccag 120caatctggct gctgctgagc caactcctta gagaaagcct
agcagcagag ctgaggggat 180gtggtccccg atttggaaaa cacttgctgt catattgccc
catgcctgag aagacattca 240ccaccacccc aggagggtgg ctgctggaat ctggacgtcc
caaagaaatg gtgtcaacct 300ccaacaacaa agatggacaa gccttaggta cgacatcaga
attcattcct aatttgtcac 360cagagctgaa gaaaccactg tctgaagggc agccatcatt
gaagaaaata atactttccc 420gcaaaaagag aagtggacgt cacagatttg atccattctg
ttgtgaagta atttgtgacg 480atggaacttc agttaaatta tgtacatagt agagtaatca
tggactggac atctcatcca 540ttctc
54545166DNAHomo sapiensmisc_feature(35)..(36)n is
a, c, g, or t 45tgctgtttgt gtgaaacctc cactgtgcca agcannannn nannngactg
tgaatanttt 60aacatttatt cacagatagc atgaaaagcc acagtccatt tgccatttag
cttatttgat 120tgagagaaaa ctgaggcaca ggaaggcaca gtgactgagc aagagt
16646205DNAHomo sapiensmisc_feature(28)..(28)n is a, c, g, or
t 46ggatcagtct taagaggagc tttttttngg agcgagaaat catataaaat aaaatgaaat
60aaaacaagga ggaaggcaac cagctgttag gggaaaaata aggcagataa aggagcgggg
120agagaaatta attgccaacc aggaggagtt gggctgtatt tttcaaaggt ggggagagtg
180gagcacacac cttgaggagg aaagc
20547294DNAHomo sapiensmisc_feature(68)..(68)n is a, c, g, or t
47gaaccatttg agattcaatg cctgtgtcca gctcccagga gtccaaccgt gaaatccaca
60agtgcagncc ccaccctgtc ctgcagttct ctttccctta tgataatgtg gttgagtcct
120ttgtcactcc cntcctcctg ctggctgcag aaatgacctc agcccaggcc agagacccca
180gctctggcaa ggncctcttg tggtcgncca ggncccagnn tgaaagccaa gcagaatcag
240gncaggatct ctagcgggan gggaaancct gataggacct ttgtcagact tttg
29448432DNAHomo sapiens 48acatttaccg tattacctag cactttcatt ccttgttgtc
tactccaaag gaaaaaaacc 60tatgtccaca caacacatga atgtgaatat tagtagcagc
tttatccata atagtccata 120aagtagaaac acatcaaata tctatcagct gatgaaagaa
taaacaaatg ggagtgatcc 180atacaattta atagaatcta gcacctaaaa aaataaaata
ttgatacgtg ctacaacaca 240ggtgaaccac aaaagcacat taatctaagt gaaagaagac
agatacaaaa aaccacatgt 300tgtatgactc tatttttatg atatccagaa aagacaaatc
tgtagtgtca gtaagtcaat 360taggggttgt ctggagctgg ggagtgggaa taaggggtgg
tattgatgag catgagggat 420ttcttaggaa tt
43249541DNAHomo sapiensmisc_feature(54)..(54)n is
a, c, g, or t 49gtgaatccta gagtagtttg ctatcaactt ctgatctttg cacattctgg
attnggcata 60taatgtnaca gcagtgccna ttgtaatgtt gcacaaagta gtntagcaat
ttcttggttc 120accaggntta gagataacat tgtagaaatg atccagcatc tttaacantc
tgtggtttaa 180ggtggggcac ttaggggtag aatcaataac aatgttagaa atcaaattag
acaagataac 240tgaaacagca tgatccatgt gtgactccaa gttataaagg aggacatgga
ttaatggtat 300acttctaggc tataggggta gtacaagtgg aaggacacca tcttagcatc
agatcacttt 360ctgagcaact ttggcaaatc ttttaaattc tctaatgtgt agttttttaa
tatatgacac 420aggtgtaaag aaaataaagc aagtgaatgt atgtgaaagc caatgctgac
tgggcacggg 480ggctcacgcc tgaaattnnt agcactttgg gaggcagagc cggggatatc
acttgagccc 540a
54150393DNAHomo sapiensmisc_feature(63)..(63)n is a, c, g, or
t 50tcatctacac gctggccagc aaggagatgc ggcgggcctt cttccgtctg gtctgcaact
60gcntngnnng gggacggggg gcccgngncc tcacccatcc agcctgcgct cgacccaagc
120agaagtaaat caagcagcag caacaatagc agccactctc cgaaggtcaa ggaagacctg
180ccccacacag ncccctcatc ctgcatcatg gacaagaacg cagcacttca gaatgggatc
240ttctgcaact gatcgtctcc atgcgccctg ctctgcggct gtgtncttat ttattgcatg
300cgtcgcttcc acaggggccc ctcaagagct gtgactcggg agagctacct tactttgacc
360aacagcctgc ccagtgtgga tgtctcttac aga
39351543DNAHomo sapiens 51cctccttagc cttagaagcc agtggtgccc tgaccagagg
ggaccctgtg ttcacaggca 60tcctcaagga ttatcttcga gagttgccca ccccactcat
cacccagccc ctgtataagg 120tggtactgga ggccatggcc cgggaccccc caaacagagt
tccccccacc actgagggca 180cccgagggct cctcagctgc ctgccagatg tggaaagggc
cacgctgacg cttctcctgg 240accacctgcg cctcgtctcc tccttccatg cctacaaccg
catgacccca cagaacttgg 300ccgtgtgctt cgggcctgtg ctgctgccgg cacgccaggc
gcccacaagg cctcgtgccc 360gcagctccgg cccaggcctt gccagtgcag tggacttcaa
gcaccacatc gaggtgctgc 420actacctgct gcagtcttgg ccaggtgagt tcatgcccag
ggcctgcacc accaatctga 480gccaggctgc tacaatcccc gcctgccccg acaatctcca
gatgtcgcgc cttacttgcg 540acc
54352367DNAHomo sapiens 52tcgcctgtac cagctggcat
atgacaccta tcaggagttt aacccccaga cctccctctg 60cttctcagag tctattccaa
caccttccaa cagggtgaaa acgcagcaga aatctaacct 120agagctgctc cgcatctccc
tgctgctcat ccagtcatgg ctggagcccg tgcagctcct 180caggagcgtc ttcgccaaca
gcctggtgta tggcgcctcg gacagcaacg tctatcgcca 240cctgaaggac ctagaggaag
gcatccaaac gctgatgtgg aggctggaag atggcagccc 300ccggactggg cagatcttca
atcagtccta cagcaagttt gacacaaaat cgcacaacga 360tgacgca
36753470DNAHomo sapiens
53cccccgagga caacctggag atcgttctgc acagatggga gaacaacagc tgtgttgaga
60agaaggtcct tggagagaag actgggaatc caaagaagtt caagatcaac tatacggtgg
120cgaacgaggc cacgctgctc gatactgact acgacaattt cctgtttctc tgcctacagg
180acaccaccac ccccatccag agcatgatgt gccagtacct ggccagagtc ctggtggagg
240acgatgagat catgcaggga ttcatcaggg ctttcaggcc cctgcccagg cacctatggt
300acttgctgga cttgaaacag atggaagagc cgtgccgttt ctagctcacc tccgcctcca
360ggaagaccag actcccaccc ttccacacct ccagagcagt gggacttcct cctgcccttt
420caaagaataa ccacagctca gaagacgatg acgtggtcat ctgtgtcgcc
47054504DNAHomo sapiens 54gtgtgtggat tcaacagtcg accccagctg tcgcagagcg
cgaggaagct gcgcagtaaa 60ccccttacat atcaacctct gaggaccggt ttttctgcac
ctggtggtcc ttctagacgt 120ctaggaggat cgtgttctca ggagagggtt cttcagcatc
tgtgctgaag aacactgccc 180cagcgggtca catgcaagat tccaccttcg agcaacatag
ctgacactct gcagcccagt 240tgtcacttgt aacaaacccc agtgggtcac atagtgaggg
gaggcaaggc agcgtaaggc 300agtggctgaa ctatcccaga aaacaaggat cacaggcccc
cagtgacacc aatgttgcag 360aaacacctgc agtggcaagt cagatgtcct ccaggaccag
gcagataaca aggagtaggg 420gtctgcagag gcctcgggag ggtctgcacc atccaaagaa
atcaattgtt ctgcacagtg 480gtaaggatcc agtgttccca gcac
50455382DNAHomo sapiensmisc_feature(27)..(27)n is
a, c, g, or t 55gaacaccatt gtcttcaata acctgtnggg catatccagg aggcacatag
ataggaggca 60caganncatn tngggacatc attggaacct gagcaggacc tgtaatgcac
tgaaactgtc 120catcttctct tcttattgta aatgcttctc ctgggttaac ttgtaccaga
ataacctgtt 180gtgttccatc tgcacttaca ataggggcag acaaaagaga aatatcacta
cttaagatct 240gagttgtatc cagtagtggt ggatgttctg ccattatcaa taagacatta
atatactgaa 300taacgctcca attctccgag tcacgccgtt ctgaggcaga aggcngctcc
tctggcgcct 360cttcttaggg ttcctgatcg tt
38256440DNAHomo sapiensmisc_feature(83)..(83)n is a, c, g, or
t 56gaagtgggag cggctcagca taggatgggc acgccatcag ccgtcaccag gcgcccggtg
60gtggggtcgt aggtgcccgc cangtagtag aggtcctgcc ggtcctggca cttgcggctc
120cgggccatct gctcatactg ntcgcgcgcc acgacctggc acttgtggga gatggtctgc
180acgtcgttct catcctcgtg gcaggactgg tacagcgcat tcttgccgtc gcactgcctc
240ttgcccagct tggtctcctc angggtggta gaaccacttg accttgacca ccatgttgct
300gccccacgac tcccacatgc tctcgatgcg gccgatgtag gggaggttgg gccgcccagc
360tgacaggaag acngcacagt ccccgacacg cagggtctcc tcgccccgca cgatggcctt
420gtagaacagc ttccgggcct
44057265DNAHomo sapiens 57catcgcccac caaggcctgg gtgggtgaga acagtgccca
caaggagacc ctgagtaaca 60gagactcaca gcccatccag gtctctgggc aggaaattga
aggaatcatc acattttaca 120gaggaggaga ctgcagctca gagtggggga agtgtgtgca
ccaggccaca ggcaagtctg 180tccagagcac tggtaggaat gagggaaact aggaatgacc
actttaaaaa gttagatgag 240aagaatttca aggccgggcg cggtg
26558355DNAHomo sapiensmisc_feature(229)..(229)n
is a, c, g, or t 58gctttatgca gtttgtcctt tcagttttca ggaatgagac ctcttgaccc
ctcccctcca 60atgcagcccc tactaagggg gagtttaagg agccatacat agttctataa
ttcaaatcaa 120gtaaacatgc ttcttgtccc aggttaactt gtgctgcctc agtcgctgtt
taaacatttt 180tatacgcact gttaacctgc ctgcccatta ccctattact tttaatggnt
aaactactgt 240tccctgggca gttgtctctt ttaacgtccc accctaaact tgccaaccct
catatgaagg 300cctcaggctt gttattggca aaggtcagaa gtcttaagct agtgaccttg
caggc 35559443DNAHomo sapiens 59ccctgacggc agaagagccc agcttcctgc
agcccctgag gcgacaggct ttcctgagga 60gtgtgagtat gccagccgag acagcccaca
tctcttcacc ccaccatgag ctccggcggc 120cggtgctgca acgccagacg tccatcacac
agaccatccg cagggggacc gccgactggt 180ttggagtgag caaggacagt gacagcaccc
agaaatggca gcgcaagagc atccgtcact 240gcagccagcg ctacgggaag ctgaagcccc
aggtcctccg ggagctggac ctgcccagcc 300aggacaacgt gtcgctgacc agcaccgaga
cgccaccccc actctacgtg gggccatgcc 360agctgggcat gcagaagatc atagaccccc
tggcccgtgg ccgtgccttc cgtgtggcag 420atgacactgc ggaaggcctg agt
44360552DNAHomo sapiens 60gtctcgaggc
agggctgaca catggtgcca tagccagcgg agggcgctca gtgagtgccc 60cgggccttct
agacaacagg caggaaggat gaacctcagg gcacccccag gtggtgcgga 120aagccaggca
gttgggacag aggtgcccac gagggcagag gccggtgcta aggggatggg 180gaagaaggga
caagattccc agagaggaga ggaggctgtt ggtaggaaag tggcagggct 240gggggagacc
cagccccaag ggtccggggc ggaggatgct ttgttctttt ctggttttgg 300ttcctctttc
gcggggggtg ggggaggtca acagggactg agtggggcag aggcccagaa 360gtgccagcct
ggggagccgt ttgggggcag ccccttctgc ccaccccatc cttcttcctc 420tccagagatg
ccaggggggc gtgtatgctc tgccccttcc ctcagacagg ggctgggtgg 480ggaggctctt
taggctcagg agaagcattt taaagaaacc cccaccctgc cgcccgcatt 540ataaacacag
ga 55261361DNAHomo
sapiens 61ctctttatcc ctcagattac tccaaagcat aatgggctct atgcttgctc
tgctcgtaac 60tcagccactg gcgaggaaag ctccacatcc ttgacaatca gagtcattgc
tcctccagga 120ttaggaactt tttgctttca ataatccaag tagcagccct gatgtcattt
ttgtatttca 180ggaagactgg caggagattt atggaaaaga ctatgaaaag gactcttgaa
tacaagttcc 240tgataacttc aagatcatac cactggacta agaactttca aaattttgat
gaacaggctg 300ataccttcat gaaattcaag acaaagaaga aaagaactcc atttcattgg
actaaataac 360a
36162238DNAHomo sapiensmisc_feature(29)..(29)n is a, c, g, or
t 62caaagtggga ggattacaag tgttatccna ccnatgcntg gacaggaata tttttaaata
60atgaaaccna agttccnttt cgctttgtaa ngttaatgca tgtattgatg gtgagtagag
120aacaatgaca caatctctag agagacatag gtgttcggcc tggctcaatc actagcctta
180tagtctcaca ggaaaatatg aacttcatca aaatagctaa ttattaccac atcatgga
23863355DNAHomo sapiens 63atgcagatga cgttgtggcc accgcactgg ccgtggagcc
catgaagttt gtctacagag 60gcaggatcgc tgtgttctct gtgaccgtgc tgcacgacga
ccggattgtc ctggtggctg 120agcagcggcc ggatgcctcg gaggaggaca gcttccagtg
gatgagccgt gtgctgcagg 180tgggcgcccc ggcacggcct atggttcggt gaatctccca
agctggcacc cccactccac 240tccaagtgcc aagtggttgg cttgtcccgc ccggtcctcc
ctggctccag ctttgtttat 300ctgtattttt cattgcaaat tgacaaatta cagctgtatg
tatttacggg ataca 35564230DNAHomo sapiens 64cctccctcaa agctactaaa
catgaaaaca ttgtgcctat atgataaaaa tgtcaatatt 60gctggtgata ctgatgctga
tggaaatgac gatattagct gccattaacg tagtatctaa 120tgtgtgccaa acaatattaa
aaattgctgt atatacatgt ttgccattta ttatttataa 180ccttaacaag atgtctcact
cataagacta ctttccgcac tatgatacag 23065552DNAHomo sapiens
65agtggcctta gacataactg ctgcccaagg agccacctgt gcccttttag gaacacaatg
60ttgtacctta tccctgacaa tcagcagaac ataacagcag ccctgcaaag gggtcttcca
120ggagattaag gtgactgaga gcctcactgt caaccccctg cagagatggt gagcatccct
180aggttctggc gtacattggg ccctaatagt cataagtatc atagctgaga tcctagtagt
240gagctgttgc tctctgtatt gttgttgtgg gttatggact cagggctccg ccatataggc
300atgtgtccct gcctggagga cgccctcagc ctagggggtg tagtgtaagg gaaatggctg
360tgctttagtc aggagtaggc tgaggcagcc ttctggtgca gcatgactca gtgggtttgg
420agtgcaagca cacaaccttg ctcgttatgt aaccacacca catgaggccc attaggtaac
480aactcacatg agctcgtgtt tggctcagag ccactattgt ctgtaaaagg tataccttgc
540tgatgctgca ca
55266508DNAHomo sapiensmisc_feature(48)..(48)n is a, c, g, or t
66gggtgactgg tctaagtgct caattacctg tggcaaagga atgcagtncc gtgtnatcca
60atgcatgcat aagatcacag gaagacatgg aaatgaatgt ttttcctcag aaaaacctgc
120agcanannng cnnnnnnanc ttcaaccctg caatgagaaa attaatgtaa ataccataac
180atcacccaga ctggctgctc tgactttcaa gtgcctggga gatcagtggc cagtgtactg
240ccgagtgata cgtgaaaaga acctatgtca ggacatgcgg tggtatcagc gctgctgtga
300aacatgcagg gacttctatg cccaaaagct gcagcagaag agttgacctc tagcaggctg
360gctggatcac agctcttngc aattacatta tttataaaca cacacactag catgtttttc
420nagaccaaat attatcagat tacatataat ttaatcaaat taatttattt tttntgcctg
480ccaaacatcc aatgtggtgc ttgttttg
50867410DNAHomo sapiens 67gcatgtgtaa aaagtccttc agccacaaaa ccaacctgcg
gtctcatgag agaatccaca 60caggagaaaa gccttataca tgtccctttt gtaagacaag
ctaccgccag tcatccacat 120accaccgcca tatgaggact catgagaaaa ttaccctgcc
aagtgttccc tccacaccag 180aagcttccta agctgctggt ctgataatgt gtataaatat
gtatgcaagt atgtatattc 240ctatagtatt tatctactta ggatataaga tataatctcc
tgattatgct ttcaatttat 300tgtcttgctt cattaaaatg taaggctaag gagagcatgg
aatttgtcag ttttgttcac 360taaagtattc caagtggttg ggaaagtgga acatttccaa
gaaccaataa 41068291DNAHomo sapiens 68cacaggatgt ggtctctacc
gtgattcctg agcatgcatg caccccttct cctgccaata 60gaggggagga agtcggaggg
gtgtctttat gcctataaac ttgccttgga atccagcctc 120actccctttc ctcctggagt
tgagaagccc ccacagagac tggctatggg ggagtgactg 180tctataggtt ccttggatgt
cctgcctatc tgcaaaatga gaatgagatc gataccttca 240tgaggctgta agatggcaga
tataaaagtg ctgtgttatc tcaaaagggt g 29169326DNAHomo
sapiensmisc_feature(47)..(47)n is a, c, g, or t 69actgtgtgca gcatattgca
ggctttcact catttaatat ctacaangtc ctcaatangn 60atatnaatta cttatgattt
ccctgttttt tcttcctata aggaagctga ggcacaagtt 120aatcaaagtc tcttggccta
gggtgacaca gctaagattt gtacctagag atttctgagt 180gttgacttct ctcctgcccc
cacctatctc cccccccnna aaaaaaaaca caacaacaac 240aacaacagaa cataccaggg
attcatggct tgcccaatgt tggaggggga gaagagagga 300gagggatgag ataagctcct
cccacc 32670352DNAHomo
sapiensmisc_feature(61)..(61)n is a, c, g, or t 70ttctgttttc ttcttaaagt
catttatatt atgtattact cttaaagaat gttttagtct 60ncattttagt agtctgtgca
taaggtagta atacatgtac acaaagaaaa attcacaagn 120cccattcagg tgtcttttag
aacattattt anccactaaa tatttataca gttgacataa 180tgcttattat gcccttgaat
aatagaattt gttttgtttt tacttcttat ccataagcat 240tggccttaca ttgcctcaag
aggaacagaa tttattatta aacaggattc ttaaatccat 300aactcatatt gtgacttcat
acattttgta accctagtag tgaatatacc ct 35271414DNAHomo sapiens
71gcccaaatcg cgcaggtctg ggacctgatt gcgggccacg aggcgcaatt cggggcggag
60ctgctgctca ggctcttcac ggtgtacccc agcaccaagg tctacttccc gcacctgagc
120gcctgccagg acgcgacgca gctgctgagc cacgggcagc gcatgctggc ggctgtgggc
180gcggcggtgc agcacgtgga caacctgcgc gccgcgctga gcccgctggc ggacctgcac
240gcgctcgtgc tgcgcgtgga cccagccaac tttccgctgc taatccagtg tttccacgtc
300gtgctggcct cccacctgca ggacgagttc accgtgcaaa tgcaagcggc gtgggacaag
360ttcctgactg gtgtggccgt ggtgctgacc gaaaaatacc gctgagccct gtgc
41472533DNAHomo sapiensmisc_feature(51)..(51)n is a, c, g, or t
72tgccagctac aggtgctcac ctgaaaagca agccagacca tattaaccct nggcattgct
60ggtacctngg aagactttct gattcaatgc tttccacctc ctcctacccc tcaccacccc
120cgtnggcatg aaatcctngg gggctgcttt agaaattgtt ttctttggct gctggtgggg
180gtgctgctgg tgggggtttg cacagctngg canactgcan ccagtctggt gggggtttgc
240anagctggca nactgcancc agtctcctgc ctgctgccaa naaggnccat ttcccaagca
300ctggctttgg agaagttggg gctctgaagt gggaacacaa ggctgccttt tgcaggncca
360ggtgtaaatt ctccccctgc cactttcagc ctagcgtgaa acagatggag tgtgcattcc
420cacttccctt tatggtaccc tggaatgatg gagctgccca gggcatcgcc acgttactct
480ctagacagtc tctttgtctt cctgcaatgg cagcgccgag gttgtatatt tct
53373492DNAHomo sapiensmisc_feature(226)..(226)n is a, c, g, or t
73gaagggctgc cttattttag agcacagatt ttctgaatat ctattttgac aggttcgatc
60ctctcccctt cctgccttcc ttctgtcgat tttcaatgtc ttgatggtgt cccacctgag
120tggcctttag agatgtgagt tgtgaggcac tggggaggca ggcacacgtc ctccagccca
180agactgccta atttaacagg gatttctgca ttctggaaca agcctnccat tttnncccca
240agcaggatta ctnccagagg gcaaaacaca gncccaatag tatcacattt cctttctgct
300ttagcaaaaa taaccactgt ctcattcatg ggaaaaggcc gccaaacaaa tttgttactg
360gaaccatttg taacaacttc tagtttgcac tgccttggag caagcacact ttgtagagga
420gggatttgca gttacttggg caacaaggta accactgatc attacaggaa gcttcagaaa
480ccgtgggacc ag
49274354DNAHomo sapiensmisc_feature(90)..(90)n is a, c, g, or t
74ctgttgctgc tgctgagcat gggcggggca tgggcatcca gggagccgct tcggccatgg
60tgccacccca tcaatgccat cctggctgtn gagaaggagg gctgcccngt gtgcatcacc
120gtcaacacca ccatctgtgc cggctactgc cccaccatga tgcgcgtgct gcaggcggtc
180ctgccgcccc tgcctcaggt ggtgtgcacc taccgtgatg tgcgcttcga gtccatccgg
240ctccctggct gcccgcgtgg ngtggacccc gtggtctcct tccctgtggc tctcagctgt
300cgctgtggac cctgccgccg cagcacctct gactgtgggg gtcccaaaga ccac
35475275DNAHomo sapiens 75agttccagaa atccagtgac gaatgtggta tacaaaaaaa
tatataaatt ctttcaactt 60agaataatta agtcataaaa tacatagggt acaaatacca
cattccgttc taaaatgata 120tcttaggatc atcaaaagaa aaagaggatt tggattatgc
aaaaaatgat tcctatatat 180ataatcaatt atctaactga catttttgca aatctaccac
aacttcgcct tttattgcat 240atgctaaaca agcagatgct aagtctgtaa actgt
2757662DNAHomo sapiens 76ttgcttaatc atgcgctttg
ttttttatgc attcacttcc tgtctttatc tctattttct 60tt
6277471DNAHomo sapiens
77ttaacctaag tatcagccct ggcatgctta tactggtcca agcaagcatt acgtcacagc
60ctgttcctct tctttatcta aaagtgcttt ttcctttctc agcattccac aagttacttc
120ctccttcctt tgttctcctc tgcctttgcc tcttttaaat agttccaagg tgctggccaa
180tcgggacaaa tacagaatgt gaggtcccat tccagccctg gaaactggac acagcagtag
240ggcggacgca tcaagtgata aatgaccctg tcccctttgt tcgctgtact ctcctggcaa
300aactgctgga gagtgtaccc tttctgcaga aagtaaaaaa aaatggcctt gctgaggaaa
360ttaatgttca agtgctattt ctttatggca ctggggaaca agcatttcaa acagacctga
420ggtttacccg atttctgctg gaaaagaaac ctcaggtctg ctgccttaga a
47178373DNAHomo sapiens 78tctgtaggag atcttccaaa ttactgctta tatacatgta
tattctatta caaaaattac 60accactcaat gtagtctaaa ttattgagag taaattgtag
ccattctttt acatgttttc 120tgaacttagt tgccaataat cataatcatt agcttttcaa
ggtttgctct gaaacttaca 180aaccatgcaa aagtgaaaac ttaggcttaa catatttggc
aatttaaatc aactaaattg 240aatcaatcta aatactgctt tgcaaagtaa aaaaggaatc
aaaatgacac ataagacaat 300cactaatccc tatattttta gggtctattt caagaaattt
actactactt cttaccagcc 360taaggactgt gta
37379505DNAHomo sapiensmisc_feature(334)..(334)n
is a, c, g, or t 79aggccaggtg ctatgctcag agttcacacc tgcctgatac tgtgaggatt
gggctacaga 60ttctaaacca cactctccat agaggacatg gcaggtgagc ggctggcttc
tgtgggtctg 120ggcctggtgg gttagtgtgg gctgcatggc cccaaggctg ggagctgtgt
tgggatctgg 180tggcaggggg tttatctgac aacctcacta ttccatgtct cctctctgtg
tggaggaatg 240ggatgcagcg aggaggccag gctggagttc tgtagagtgt aaaatcctgg
atgtcctctc 300agcctgtctc cttgagagga cctgctgcct gccnttctgg agcacgtcat
tctcttcttg 360gatgaccaaa taaatcattc aagaatgaaa tgaaaactcc ttatctcctt
ataggatctg 420agctcagtga tgagaagtgg aaggacaata attgaccaat cacacattta
natgaataaa 480ttaggccgtt ggtgttcagc agcaa
50580366DNAHomo sapiens 80tgtttccttt ccacttgcta gaagttattt
tgccaatcac atatgattat tttatcattt 60tttaattacc atcagtgcat gaaattatct
ttattattca cttgttttta ttataatctt 120ataatttcaa ataaaatgta aatctactgt
cccttgcttt acctccgtgt cttcagtgcc 180tagaacagga ctgtcataca cagtgactca
atacacattt acttatgggt gattccctgc 240ctgactgtta caggaagaag gaccaggaat
atcagaatct gaagtgtcct ctaaagtcat 300aaagactaga aggcattgaa taatgtttct
taactatgca aggacttcag aattagatct 360cacata
36681455DNAHomo sapiens 81agatctcatt
ttctggaggt gcatgtctcc cgtgaccccc tctttggatt gcccgcagag 60cccgtgaaga
tggtgttatc actcctgtga ttactttact gatcaggtga ctttgagtca 120atcaaaaggt
agattatcca ggtgtgcctg atttgatcag gtggtccctt aaggaggctt 180aaaatgaccc
tttctgaagt agagtaattg gaaaagtaag agggtctatg ggtggggtca 240cctggcaagg
aactgaactc agcctccatg agctctggcc accagctgac ctttagcaag 300aaagcaaatc
tttctttggt cagtctccac aacaggacga agctggctga gcccttgcct 360ttggccctgt
gagatgctga cccgagtatc cagcgaacac gtgccagagt cctgacccat 420ggaaactgag
atgatgagtc tgtgttgctt taagc 45582119DNAHomo
sapiens 82ccgattttct gtttgaagca gttcccttct atgttgcagt ctccttgaag
gcaaaggttg 60tgcactgtca tgttttgaag cccagtatcg ctgagaacaa tgacagacac
atgcagtgg 11983137DNAHomo sapiens 83tggctctcag agaaaccgta tttgatcaga
gagctaaagg aagtgaggtt gtgagccaca 60gggttatctt gaagaagagc attccaagga
caggggaaac ttcctcaaag accagtaagc 120cagagtgttc ttggtgc
13784345DNAHomo sapiens 84agcttacaca
gcattcttag agaggacaca gaatttggag tttgagtctt gccaagttat 60agggccttga
gaaacattta gggctttcca tggatccacc ctaacgaagc ataaaattaa 120gcctaggatt
ttagggtcat cagccaaaaa tggaactgcc ttctagaaca aaaaatgaca 180tccttttgag
gaagacagtc atccagagtc tttacaatct tttacccaca ttgcctagta 240cataattaaa
catttctaga tatgaatagg aacaggaaaa tgtgacccat aatcaagaca 300acaagcaata
aatggaaacc tacccttaag tagctaaact gttgc 34585459DNAHomo
sapiens 85tatgtttatt cagggctctg gaacataaaa aggctttgcc acactcttca
catctataag 60gtttctctcc agtatgaatt ttcttatgtt tactcaggta tgcagaccat
ccaaaggctt 120tgccacactc ttcacatttg taaggtttct ctccagtatg aattatctta
tgtttattca 180ggtctgtgga ccatccaaag gctttgccac actcttcaca tttgtggggc
ctctctccag 240tatgaattct cttatgttca ttaagggttg tgaaccgact aaaggctttt
ccacattctt 300cacatgtgta gggtttctct ccagtatgaa tactcttatg tttattaagg
gttgcggatt 360gtctaaaggc tttgccacat tgttcacatt tgtagggctt ctctccagta
tgaattctct 420tatgttcatt cagaactgag gacctactaa aggctttgc
45986229DNAHomo sapiensmisc_feature(78)..(78)n is a, c, g, or
t 86gggagtggac ctgcattagc aagcagagaa tgtccagagc ctagagacag ccagcccatg
60cagagggtag ggcataancc naggcagtgg agagggtgag gagtggtgta tagaagagag
120catggagttt aaggggttat tatggctgag atccagacca tgagcagaga aaagttcagt
180ttatctcacg gaaaacttta atgttaggct taatcctctg ttccttcct
22987351DNAHomo sapiensmisc_feature(80)..(80)n is a, c, g, or t
87ggttgatggt aatttatgta ccctgacagg ggtttggttt acatagctgt attcatttgt
60caaaacttat attaaatggn tgcaattaat attgatgcat ttcattgtat gtaaatttta
120ccctaaaata attttagaca aattgtaaaa cctagttaaa gacatatatg ctgatatttt
180cagggttacc tctcttgatg tctgcaactt actttgaaat gcttcaaaag gaaaatagga
240taatggatgg aaatagggag agagaaatgg atcgatgtgt aaataaaaca aatctatcta
300aatgttaaag cttaattgta gatgatgaat gtaggagtgt tgaatgttaa a
35188482DNAHomo sapiens 88aagagtctat gaagaaccca acggaagttt gtggcacatc
cctaccctca aattcacagt 60gagggtggaa tgacagtaac caaatctgtg aaaatattca
catgagacag gaaagaagtc 120agaatatcca gtgtacaatg agagtgaaag aggatgtcta
aaaggggaca gcccattcac 180aacccacaca caacccacgc acaaatattt ttgggggggc
ctcccatggg catttataat 240cttctaagtg ctccgaagaa catgtgtcac aaaagatgaa
gagaatattt tccagaacat 300agcccaacaa agaacttctt tgacattttt tagtgtaaag
gtaactgacg gtatctacca 360aattagcaat ttgtaaaact ggaatttcta aaagcaaata
cttggagctg agattacctc 420ccacttccca aattcgagtt atatgatctc aagtataata
ccctttggta tagacctagc 480ca
4828937DNAHomo sapiens 89tctgagtggg cctgctctct
gtagactgaa ttcagca 3790394DNAHomo
sapiensmisc_feature(130)..(130)n is a, c, g, or t 90tgcactcaat ggcttctgtt
cgaagtccct attaaatgtt tctttcttaa atactgtatt 60tgtcagcttc ttccttcagc
atcccaactt cctcagactt tggggtactt ttgcacagac 120ctagccaccn caaancactg
tcatagatgc agcaatccac tttcacaaaa ccccatggac 180aatgcagagg gggagaacag
ggactgatta aagaaaggga cagaaatggc atcactatcc 240aagactgaaa aacaggctga
atggattatc actctgaccc aactgcacat ttctaatgtc 300ttcatgtttt caattactcc
atgaattccc ttatctgatg ctgattatgc acaggactgt 360gtaagagtta aacaacacct
gacactggtg actc 39491300DNAHomo
sapiensmisc_feature(175)..(175)n is a, c, g, or t 91cctcatcact atgtcaccaa
agtgttttgg aacttggtat tccagagact tctggaacgc 60cgtgcaaggc ctgcccccag
caagccacaa ccaggaaggt gcaggcacgc ccccactagc 120tcctccccta tttattgcct
cctggaaaac ccaggaccct cttccccatc tccancccct 180acccctgggg gcagcccagg
gagagccagg cacaatgagg gctcccaaca gctgcaagga 240tttatctgaa cctttgagaa
agaggaggag ccatctaagt ttctggaaac ctgagcccca 30092490DNAHomo
sapiensmisc_feature(49)..(49)n is a, c, g, or t 92cccattgcga tctggctctg
ggggaccctg ggatgatatc cctaccccna gggacaggac 60ccaaccccng gggacctgga
gagactctgt gccctgcagg accgatgggg gactcctccc 120tgtatgtacg tgtgcgtggc
cctgccttgt tcttgccccg gacctggcct ggtgaaggag 180gcacgaggaa gattgcagtc
agggacgctc agcctgggag ctgaccctca ggtgaggccc 240taaggaagtt cccagacctc
cctgaacctc agtatgctca tctgtccagc agcaaccctg 300ggccttaagt gagaacatct
atgcggaaga ggcaggtgcc aatcaagccc tctgtaaagt 360tacctcccct tttcccttct
tctcctctca cagagctgaa gaatattttg caaagttcat 420tgtaaacatt aaaataatct
tgggtgttta tcattcgtta aacctgttgg gctgacttta 480ggtctaccgc
49093317DNAHomo sapiens
93gagaaaagta gactccccaa tgcctcgcag taaatgagga cgcctggcgc ctgcggcgag
60gtcaactgag gtcagacgag cttatctctc ctgtcccggg aattaagggc atcctgggga
120cagctgcaga gcaggaggct ccccgtgccc tcctcttcct aagcaagtca ggatcccaag
180aggcgcgtgc ggggaggccc ctccgaaggg ctgctggctt gtgtcttcca ccagcgcaaa
240gggaagctat cggttgcttc tgcagtgagg caagctcagc cggacgccca gaagagagac
300gaggtgtcgc tgtcggg
31794208DNAHomo sapiensmisc_feature(37)..(37)n is a, c, g, or t
94atagatattt tttagtccac ttggctggat aataaantct taataacagg gggaaaaaaa
60gaaagaaaaa ggaggaaaag atttaggaaa gaaaacaaca actttagtat ggaatgtgaa
120gaactggcag gatattcacg ttgagctgtg cagtaagtag cttactggac atgtgaggct
180gaagatacag ttgttcatat ggaagcaa
20895361DNAHomo sapiens 95tccctcgtgt atcttatctt tattattgaa ttttccctca
caatccactg ttaaaagaag 60aaagtatcac acacgtgggt tcttttggct atggaagtgt
ccttgagatc actttttgca 120cgtgactcag ctgaagtgtt caaagcacat ggaaatcact
tgccagtgac aggtggacgt 180tgtatgtgtt ttctctctcc taaggatgcc taaactttct
tttcttcaca ggtaaagtca 240gtgataaatc ttttgtttgc tgcatatact ggagatgtgt
ctgcacttcg aaggtatgtt 300tacaggatgg attagcatgc actttacaga tatttatgaa
gttgcttctg ggcgagcagc 360c
36196377DNAHomo sapiensmisc_feature(32)..(32)n is
a, c, g, or t 96gaaacgccat ggaatgtatt gtatttctct antctatccc ttaaaatgnc
cattgataat 60tattggcaat ggttattgat agtctcaacg taatttcagt agaatttgtt
ttgagatttt 120ttttatgcac ataaaagatt tctttaggga ttattgtaca gagttctagn
aaaatatata 180attttttttt ctgggcttat aactttcttt tctaaaaatt tatttggcag
cctgattaga 240aatgtggtaa aatctgaaca ataaaatagn aaatagacta gttgcataga
atgtttcaaa 300aacaggcatt agattggcgg ctactcggga ggctgaggcg ggagaatcgc
ttgagcctga 360gaggtggagg ttgcggt
37797525DNAHomo sapiensmisc_feature(72)..(72)n is a, c, g, or
t 97cacctttctg ttctgtgacg ggctgtccct gcttgtcctg ctttaggagg taggtaccca
60gtggctcccc gncccctcag cggctcattc ctctcgctct ccccacgttg gtctgtgtga
120gctccgctgt gtggctgcca ttcatccgat ccatctgtgg acttgctggg gctgcgccgt
180gcacggtgtg gtgaatgcta canccanccc caggggcggg gctgagagtg gctgggacct
240ggagcacatg gggatgctgt gtgggaacca acttgccccc caccctgtgt ctctaggggt
300ccgcagcagt agagaagcag acagccagcc ctgtccctgc ggcgtcaccc tccaccccat
360actaacccag cagcgcatgg agagatttcg ggagtgctct aaaggccttt ggagcaattt
420agggcaatta cgggcagttt tagaaatgct gaggggttgt tttgcctgcg gggcggggat
480ggttgcctta tgcccacagt gaagcgggcg agatgcggta gctgg
52598434DNAHomo sapiensmisc_feature(30)..(30)n is a, c, g, or t
98gaagttcaac tcaggaaggt gcaatataan caaatgtgct atattataat gaggaatggt
60actaccgttc cagattttct gtaattgctt ctgcaaagta ataggcttct tgtccctttt
120ttttctggca tgttatggaa tgatcattgt aaatcaggac catttatcaa gcagtacacc
180aactcataag atcaaatttc attgaatggt ttgaggttgt agctctataa atagtagttt
240ttaacatgcc tgtagtattg ctaactgcaa aaacatactc tttgtacaag aagtgcttct
300aagaatttca ttgacattaa tgacactgta tacaataaat gtgtagtttc ttaatcgcac
360tacctatgca acactgtgta ttaggtttat catcctcatg tatttttatg tgacctgtat
420gtatattcta atct
43499412DNAHomo sapiensmisc_feature(47)..(47)n is a, c, g, or t
99gggagacaga tcacaatcag atccataagg aaaagtgtgt ctgtgtntat cttcctctct
60agggaaaaat acagcagggt gaggggattg agtgggagtg caatcaggga agacttcctg
120aaggcagtga ctggtgactg gaatgaagca tgagaatgag ccatgcaggt tgcccagaga
180gagcatccag gcagagggag cngaaagttc catcctcacc cagctctgcc ggcccaggta
240ctttctcctc tgccttctac tcccagtctc actccagtgc aacacacttc agttttctgg
300gaactcctga tggaaagtgg ctgtatttgt tcatccctat agccttgggg cacagccagc
360agcccctgga ggaagccccg caggtnggta aagagacaca gggctcccag cc
412100493DNAHomo sapiens 100actgttttca gacctaacct tggcaaggtc agtcctactt
tgatgttctt gtttcatcac 60acttcttggc atttgtagat ttggaagaat tgggcctttg
gtacctctga tctcttcgtt 120tagcaactta ctgtgcaccc atatgcttag cttttgctgt
tttagctttt tttttttttt 180tttttaacct gccacctagt ggccgaaatg ttgctatact
attgataagg tactcctaat 240tttggcaaaa tagtaagagg caaagcacca aagattatgt
tctctccctt ctccaaatct 300ctcttggtga gaatgatctt taaaacatac cactcagatt
attagcaatc ttggtatgga 360acgtttttaa aaataataat aatgtacttt atgtggtgat
ttatgttatt atttaggccc 420aaagttttga tttaattgtt tccttttagc ttatttttga
gatatgcagt ctgttaggaa 480gctgtctctg tct
493101415DNAHomo sapiens 101gccctcttgt gtagttttca
ttgtgtctag tgcaatgccg taaaccttaa caccatgaga 60cccatatgaa gtgccaacag
tgatgatgga agcgctttca aagaaagaag tcatagacat 120tataagaata aagcgacttg
cttgatatgt acagtagata ggtacagctg tagctgctgg 180ccatttcaga cagatgcttc
atcttgtaaa cagcaacata aatgtatggt accaataaat 240acagtacagt actgtaaatg
tgttttctct tccttatgat tttcttggta catgttcttt 300tctctagttt actttattgt
taagaatata ctatataata cacatacaaa atatgtgtta 360ttgcctgttt atgttgtggg
tagggcttct ggtcaacagt gggctacatt atcga 415102530DNAHomo sapiens
102ggactagcag tcttcttctt cagacgccat gggaccccca ggcgactgct ctactgccag
60cgttccctgc tggacaaggt ctgacgccca ccgccggccc gcccactcct accacaagga
120ctttgcctct gaagaccagt gtcagcaagg tggtggtggg tgggctgctc ccatccgtcc
180ggagccccct ccccgcagcc tccttgcttc tctcagtccc ctggctggcc tccttcaccc
240tcaccgcctg tagcttgtgt ctgtccagcc ccatctgaat gtgttggggg ctctgcactt
300gaaggcagga ccctcagacc tcgctggtaa aggtcaaatg gggtcatctg ctccttttcc
360atcccctgac ataccttaac ctctgaactc tgacctcagg aggctctggg cactccagcc
420ctgaaagccc caagtgtacc cagttggcag cctcccgtca ctctgactaa aaagaatctt
480cagagtgcat atttggaggt ggaaagattg ttcagttacc ctaaagactt
530103509DNAHomo sapiensmisc_feature(47)..(48)n is a, c, g, or t
103taattttagc tccaatccat ctttctcttc tccaaaaccc tacctcnntn nnntcnnnnc
60caccccttaa gtacttagtc atgcntagcc ttatattctt gtttgaattc tnatgtnctg
120nnccncccaa acagattata catttcttgg gtcccatact ttgcatttac catagcagnt
180ttcatagccc atacaaacat taggccttca aaatatttgt caagtatttc ttcaataaaa
240atgaaaacat cccaaatctt gatccnccta anatgtnaaa tgggnactta gttaagcaaa
300ctaacatcat gatatactgg aaacaggtat ctctttcctt tacccttgtg cctgctgang
360atcttattct cagccttgct gttttaaact caggggtgtg tgtacaacat atttaagcaa
420attctggaat accaaagcca agcagtcttc caggggcttc atcctgncac acagcagctt
480acctggtggg tgttgggtag cacacagta
509104338DNAHomo sapiens 104catgatcagt gtattttagg gggactaata tggcaactaa
agctactttg gaagagaaag 60agtggagata catagattgc tattatagtt caggccaata
gagaggaatt gggtttaaga 120gatacattat ggaggcagaa gtgttcattc aacaagcgtt
tgttaaatat ctactatgta 180atcatgatta tacaactaga gagaatatga aaaaaatgaa
ttacgtatgt tagcttatag 240atggatgctc tcagtaccca tccctattaa tcgtcatttc
cctttgttta gtgaaccttc 300tgatatattg gatatcaaat atcctttcca agtattgt
338105279DNAHomo sapiensmisc_feature(26)..(26)n is
a, c, g, or t 105gttccaggtc ccggatagcg agggcngccg cgcnngctcc nagggccatg
aagcccccag 60gaggagaatc gagcaatctt tttggaagtc cagaagaagc tactccttcc
agcaggccta 120ataggatggc atctaatatt tttggaccaa cagaagaacc tcagaacata
cccaagagga 180caaatccccc aggatcatgt tttcttatgt gaaggagaag aaccaaaatc
ggatcttnaa 240ngcttgcaag gagcatcccg gctgggagca gagccaggg
279106395DNAHomo sapiens 106ccaggctact gctaagactc gtacttccca
gtttggtgtg ggcagctttc agactccatc 60ctccttcagc tccatgtccc tccctggtgc
cccaactgca tcgcctggtg ctgctgccta 120ccctagtctc accaatcgtg gatctaactt
tgctcctgag actggacaga ctgcaggaca 180attccagaca cggacagcag agggtgtggg
tgtctggcca cagtggcagg gccagcagcc 240tcatcatcgt tcaagttcta gtgagcaaca
tgttcaacaa ccgccagcac agcaacctgg 300ccagcctgag gtcttccagg agatgctgtc
catgctggga gatcagagca acagctacaa 360caatgaagaa ttccctgatc taactatgtt
tcccc 395107412DNAHomo sapiens
107acatagagag gtgactcatt ctttttaaag gttacattaa gtttgtagta tgtcagaatg
60gcaatactat aattgtttta accagtgacg tttaagttgt ttccagattt tttgatctaa
120caaataatgt gtcatgagta tagaattttt atgttcatgt actagtatag ttataggatg
180actcatattt gaagcaaagt acaaaacgca tgctttctgt agctactcat aaattctggt
240atgagcaaaa tgtcaagatg cttgcttatc accgaccaag tgatgattaa gctcttgcta
300aactgtatca aaggagaaaa agggaaatac aggcttatcc taacaatttc acagtgaaca
360gtaatctctg gcattcagtt aaagctagac ttgttctaat tactttgatt tt
412108531DNAHomo sapiensmisc_feature(121)..(121)n is a, c, g, or t
108gtaagtggta ccagccacaa ctgaatatcc atctgggata aaataaaatt gcactcgtct
60tagagatcca aatcaacttc agatggatta aaactttgaa tgtaaaaaac ataaatgact
120nacagtcctg caaaatatct tggagacaac ctgtgccatc tggagagtgg gaagagcaca
180tgcaaaggcc aaggggtgga gcagcccagc atgttctgga aaaggtaggg ctccccaagg
240ctgggatnat ggtggagacc tgggtgtgtg ggagcacagg ggtgggggcc cgtgggccag
300gaatgcacag agaggggctg gtgctctgcc gcaggcccaa gcccccaaag cccggtcatt
360cccagcacca tcttcacggg tttctgccca ggtctttctg ctgcatctct tcctcccccg
420attccttaat catttttttt aaaatcagtt catgtctttg taaaccaaat tatttctaaa
480aggcaaattt atattactgc cgaaatcaag ggtcagtgag ctagttgtgt a
531109541DNAHomo sapiensmisc_feature(53)..(53)n is a, c, g, or t
109gacttgggat tccggagcag tcgcccctat cgctgctcct gcagttgcgg acnccaccga
60ccccgccgcc ggaggactgg gcactgaaag gcctctangc ctaggcgcgg cccgcggagc
120cagacgtgtt gctgccgtga gtaaaacgag cgccctctcc gcactcgttt acaaattaaa
180atggaggaaa tttcgttggc caacctggat actaacaagc tagaggccat cgctcaggag
240atttacgtag acctgataga ggattcttgt ttgggattct gctttgaggt gcaccgggca
300gtcaagtgtg gctacttcta cctggagttc gcagagactg gtagcgtgaa ggattttggc
360attcagccag tggaagacaa aggagcgtgc cgcctcccgc tttgctccct tcccggagaa
420cctgggaatg ggcctgatca gcagctccag cgctcacctc cggaattcca gtagctgcaa
480aatgagagtc tgaaagtggc caggacaata acatagactg gtcctgtggc ttcgaggagt
540a
541110359DNAHomo sapiens 110ctccctgcaa atgcacatgt caatcaatga ttaatgcacc
caggttatgt acaaggcact 60gggcttagca ccacagggaa cttccttcca gaggctcgct
ttctagttgt gtagacaaga 120atacatgcat gagaagatac aagacaattc acccatgcca
aatgattcat acaggctgtt 180taagtactgc agaaaataaa agaaggaaag gctaccagac
ttttcaataa ggtctacagc 240ttcccaagag catgtctttg ttaaatcagg aaatataaaa
attatgtgtg tatgtgtatg 300tatatatata taccacccta ttaactattt taaaatcgta
ttctattttg ggggttgtg 359111491DNAHomo sapiensmisc_feature(56)..(56)n
is a, c, g, or t 111cagagtggac tgttccctga ggtgggagat gtggaaaagc
caagaggctg cagccnaggc 60cactggcccc tgagatctct gcaggaaatg gctgtggagt
gtggcagttt ggcaaactct 120ccaccacacg taatgaaact tggatttgct ncagtgtctg
gctgcagagc agtgggcctg 180gccagcaggt ccccagcttt ggctatgagg gccttgagtc
ccccaaaaca ccgggttcca 240gcaccacact cagccctcat tggctcttga actgagcttg
gaagcttctg gtgaccttcc 300aagagcctga gagtgaggtg gaattatttt aaaagataaa
tattatatta tatatatata 360tatttccctg aaggaaccaa agcgaatttt aaaagatgca
atgtagaggg gaaaagagat 420gatgaaaata tttaaaggcc ctatctgttt acagtgttcc
gtggttaaac tcgctcactg 480ctaagaatat t
491112287DNAHomo sapiens 112gtgatcatga gaatgctgcc
tttaaagatg tggccctggt cctgactgtt ctgctagagg 60aggaaacatt agaagcaagt
gtaggcccaa gggaaacgga agaaaaagtg agagacttac 120tctgggccaa gtttaccaac
tctgacactc ccacctcctt caaccacatg gactcagaca 180aattgagtgg gctgtggagc
cgaatttcac acctggtact gccagtccag ccaatcttag 240atgctagcgt tacatccaca
aaaccagtgt tgccttgtat aactatt 287113389DNAHomo sapiens
113tagccgatcg ttacctcaag ggagtgggaa ttgggcccag ctccggcccc tcctggtcac
60ctttggccat gatggccggg gccatgcctt gacccgacgc cggagggcca agcgtagccc
120taagcatcac tcacagcggg ccaggaagaa gaataagaac tgccggcgcc actcgctcta
180tgtggacttc agcgatgtgg gctggaatga ctggattgtg gccccaccag gctaccaggc
240cttctactgc catggggact gcccctttcc actggctgac cacctcaact caaccaacca
300tgccattgtg cagaccctgg tcaattctgt caattccagt atccccaaag cctgttgtgt
360gcccactgaa ctgagtgcca tctccatgc
389114499DNAHomo sapiens 114gtacctcgct ggacctggag ttagacctgc aggcgacaag
aacctggcac agccaactga 60cccaggagat ctcggtgctg aaggagctca aggagcagct
ggaacaagcc aagagccacg 120gggagaagga gctgccacag tggttgcgtg aggacgagcg
tttccgcctg ctgctgagga 180tgctggagaa gcggatggac cgagcggagc acaagggtga
gcttcagaca gacaagatga 240tgagggcagc tgccaaggat gtgcacaggc tccgaggcca
gagctgtaag gaacccccag 300aagttcagtc tttcagggag aagatggcat ttttcacccg
gcctcggatg aatatcccag 360ctctctctgc agatgacgtc taatcgccag aaaagtattt
cctttgttcc actgaccagg 420ctgtgaacat tgactgtggc taaagttatt tatgtggtgt
tatatgaagg tactgagtca 480caagtcctct agtgctctt
499115504DNAHomo sapiens 115gagtttcagg accaggcagc
ttgattacag catcaagggc ccctgtgttc tctgttttct 60gcagccatag tattggcttc
ttcccaagac ttatttttcc catcagtgtc acctgtgcta 120caagctcctt cagtcacatc
tatttttgat atttgtgggt acctaggagg tgcatatatt 180tgtgggatac atgagatact
ctgacacaga tgtgcagtgt gcacggatca cagggaaatg 240gggcagccat ccatcccttc
aagcattcat gatttctttg tgttgtgaac attcccgttg 300tgctctctta gttattctga
atgtacaaga aattattgct gactatagtc accctgtcgt 360gctatcaaat actagacctc
attcgtggta tctaactata ttttgtaccc attaaccatc 420cccatctccc accccctacc
tttcccacta tccatcccag cctctggtaa ccatccttcg 480tctatctcca cgagttcaat
tgaa 504116476DNAHomo
sapiensmisc_feature(423)..(423)n is a, c, g, or t 116agcacagtct
ggctggatga gacagggtcg tgcccagatg atggagaaat cgacccagaa 60gcctgaggag
gtgtcctggg tttggctggc tggctcctgc tccagcggcc cggcttcagg 120tgtccggggg
cgtggctgcc tggagcaggt gtgctgaata ccctggatgg gaactgagcg 180aacccgggcc
tccgctcaga gagacgtggc aggaccagcg aggaatccag cctgtccact 240tccagaacag
tgtttcccag gccccgctga gtggaccgga cctctgacac ctccaggttc 300ttgctgactc
cggcctggtg aaagggagcg ccatggtcct ggctgttggg gtcccaggga 360gaggctctct
tctggacaaa cacaccctcc cagcccccag ggctgtgcaa acacatgccc 420ctnccataag
caccaacaag aacttcttgc aggtggagtg gctgtttttt ataagt
476117494DNAHomo sapiens 117atccttgtac ctgatgtctg agccactcag aactcaccaa
aatgttcaac accataacaa 60cagctgctca aactgtaaac aaggaaaaca agttgatgac
ttcacactgt ggacagtttt 120tcccaagatg tcagaataag actccccatc atgatgaggc
tctcacccct cttagctgtc 180cttgcttgtg cctgcctctt tcacttggca ggataatgca
gtcattagaa tttcacatgt 240agtataggag cttctgaggg taacaacaga gtgtcagata
tgtcatctca acctcaaact 300tttacataac atctcaggag gaaatgtggc tctctccatc
ttgcatacag ggctcccaat 360agaaatgaac acagagatat tgcctgtgtg tttgcagaga
agatggtttc tataaagagt 420aggaaagctg aaattatagt agagtcccct ttaaatgcac
attgtgtgga tggctctcac 480catttcctaa gaga
494118553DNAHomo sapiensmisc_feature(191)..(191)n
is a, c, g, or t 118gataacccca atctacgaag actagctatg gaacttccta
cactgagaca actccagtgg 60aactctgata attatcctaa aataaggagg cttcttcagt
agccctcgaa atatgttcaa 120atacatgatt acatttatgt ccttaatatt gctattagtt
tctgatgtta atgtaaaagt 180tggggaaaaa ngtggaaaag ttaaagcagt gcaggttaat
tcaatgccag agtancttct 240cagagggtgt atattcagtg tgaacaattt tcaacagaga
aatgtcaact tctggccaca 300acggcaacca gtaaaatgac tatttttact gtcttatcta
ttaatgaaga ggagattgca 360taatatagat gaaggagcat agtatttgca ggtggaacgc
ctagcagggc ttgagtctca 420actctgctgc ttttactcta attgaccgag acaagtcatt
taaactaata gagcttcaat 480tttctcatat ctaatgtaac ataacaattc acagcctttt
actttgtagt tatcgtgaag 540atctaatcgc agt
553119462DNAHomo sapiens 119ctcctgttca tcctgttcac
agagtggctc ggctgatggt agctcgacca atggctgcaa 60ccatgagagg gctcccctga
aacttctctg tgacaatatg aagtaccaga tcctctccag 120agccttctat ggatggctgg
cctactgcag acacctgtcc accgtgagaa cccacctatc 180agccctggtc aatcacatga
tcgtgtctcc agacttgccc tgcgatgctg gacagggact 240gacagccagg atctgggagc
agtaccttca cgacagcaca agttacgagg agcaggagct 300gctgcgcctc atctactacg
ggggcatcca gcctgagatc cgcaaggccg tgtggccctt 360cctcctgggc cactaccagt
tcgggatgac ggaaacagaa aggaaagagg tggacgagca 420gattcatgcc tgctatgcac
agaccatggc tgagtggctg gg 462120524DNAHomo
sapiensmisc_feature(28)..(28)n is a, c, g, or t 120tctgctgctg aaggcctgtg
attttgtngg ggaagggcct gttctangca actggnaaag 60gcactgccac ctgccgttgg
atgccaggac tcaagagctg gccccagtca ctgtgcgcag 120agctgtctga gaatgtgtga
gtggactggg tccttcggca ctgcctgcat tggctcaggg 180cagtcaaccg tcgcagagga
tgaggggcac actcaggcag cctccccggc cctggaggca 240gaaaggccca ggcagaacca
ctgactggga ggaaacagaa aaagcagagg agagccaggc 300tgcaggcgtg tggatgggac
cagctcaggc agacgctgtc tcatacccac tctcccctct 360cttgccaggg cctggcctgg
tgtctctcag gagcctgggc atgagacaaa agcagagatt 420gttctcttgt ggtaccacag
gctgtaacca gtccacccag tgttgtttta gaaatttaaa 480tcggttgccc atctttttaa
attggcaaca tcgtttacca catt 524121326DNAHomo sapiens
121ccccaagttg gcgggcctga ttgggcggca cgggccccag aacaagcagc ccttcatggt
60ggctttcttc aaggccacgg aggtccactt ccgcagcatc cggtccacgg ggagcaaaca
120gcgcagccag aaccgctcca agacgcccaa gaaccaggaa gccctgcgga tggccaacgt
180ggcagagaac agcagcagcg accagaggca ggcctgtaag aagcacgagc tgtatgtcag
240cttccgagac ctgggctggc aggactggat catcgcgcct gaaggctacg ccgcctacta
300ctgtgagggg gagtgtgcct tccctc
326122372DNAHomo sapiens 122atgcggagtg agaaaagcct gttgcagaag actacataca
acaggatttg acacttgtaa 60ggctccaaaa caaagaaaat taaatgatat tgtttaggtt
ttcatacata ggtgataaaa 120gtgtgtttct ttgtttttaa tgagaaaatt agtcacagaa
tttaagatct tagttacttc 180tatagggaag gcaggggaat gggacaagga ggaagcccac
agcattggtc atgctctcat 240gttgaagttg ggttcaaagg tgttcattat taaaatgctt
cataatgatg accatacatt 300tggtatttct aggacaatct tggtttacat ctattgtctc
aacataatta ttcagtgcaa 360gcctttcctt tc
372123197DNAHomo sapiens 123ctaccttgcc tgctgagaca
tagggctcca cgggtctctc tcctggggcc gggctgactg 60tggcctgcga ggggcagtca
tcgtgttggg ttttcctgcc agaggcagaa accacaaaat 120tacctggaac atacacgccc
caagtgacag attcaattca attccacaaa tattgacctc 180gcgtctaatc cactcgt
197124379DNAHomo sapiens
124ctctgagcct tgcttggttg tcagaggcca tgagaggtgc cagttatagg tggatgtgcc
60aagatgctgg tgaacttggt cttcagctat acccaggctc agaaagggca agagccatgc
120tgcagcgtag gtgactttgg aggtgcactt ggggcccagg gctttgagtg ttgcgggtgt
180gcctgtccct ccagatagtg ctctgtttct ctctgttgtc cccctgcctg gtcctctggg
240gccactgtgc tttctgctgt gtgcatttat aaatgatgtg tattttatat agacctgctt
300gcattggctg atgctcctct aattccctga gtttgattca accacccttg ggttgttttg
360ctatggcctt agcctttga
379125495DNAHomo sapiens 125gaaccagaat ccttggaagc tctaggtcct acctcagaaa
atctcatcgt catggttctg 60cttcaagaac agaatttggg aattaggtat aagttcaatg
ttcccatcac tcgaactggc 120agtggagata atgaagttgg ctttacatgg aatcatcagc
cttggtcaga atgctcagct 180acttgtgctg gaggtaagat gcccactagg cagcccaccc
agagggcaag atggagaaca 240aaacacattc tgagctatgc tttgtgtttg ttaaaaaagc
taattggaaa catttcttgc 300aggtttgctt caagctgtaa tttagcaaaa gaaactttgc
tttaattata ttatattcca 360tttgttttca acctcatgta atttgtgcag atttgttggt
aaaatacatc ttggcacaat 420gagtgtctct gctggtgctt ctcccaagac tatcttgaag
gtgggctgtt tgcctttcgt 480gaacacattc ttggt
495126491DNAHomo sapiens 126atacctcatg cagccttcag
gttcagttct gacaccaggg atggaccatc ccatttctct 60ccagcctgcc tccatgatgg
gaccccttac ccagcaactg ggccatctct ccctcagcag 120cacaggcacg tatatgccga
cggctgcagc tatgcaagga gcttacatct cccagtacac 180ccctgtgcct tcttccagtg
tttcagtcga ggagagcagc ggccaacaga accaagtggc 240agtggacgca ccctcagagc
atggggtcta ttctttccag ttcaacaagt aacagtggga 300ttcccctccc catctttact
gaatagaaat gaattcttgg agatactcat gctcccagat 360tccagagggt taaccaggaa
tggagaccat ccgtcggccc tgctaaggac taacacttag 420ccatcgtttt tcacaggcct
gggcctggaa aaagaaatct ctacgttcct gccctttact 480attgctgatg g
491127391DNAHomo sapiens
127ggtgctgccc tgtgtacata taaatgaatc tggtgttggg gaaaccttca tctgaaaccc
60acagatgtct ctggggcaga tccccactgt cctaccagtt gccctagccc agactctgag
120ctgctcaccg gagtcattgg gaaggaaaag tggagaaatg gcaagtctag agtctcagaa
180actcccctgg gggtttcacc tgggccctgg aggaattcag ctcagcttct tcctaggtcc
240aagcccccca caccttttcc ccaaccacag agaacaagag tttgttctgt tctgggggac
300agagaaggcg cttcccaact tcatactggc aggagggtga ggaggttcac tgagctcccc
360agatctccca ctgcggggag acagaagcct g
391128458DNAHomo sapiens 128tgtatggtcg ctggccagtg attctccttc tgagccgtgt
ttcccctctc cctccctctc 60cacgtgggca gggcaggccc catcgctttc ctctgataac
cacatggaca catcctgaag 120tcagcccagg cgccctgagc atcttggggc acctggaccc
catcacaata ctccttcttc 180cttcaggtcc ctgggtgaag gctttgctga aaccgacccc
ccttttcacg tcccttctgc 240ctctgccccg ttggatgccc tgactggggg caggggaaga
gacagggcac agctggccac 300agggctcagc cactgagcag gctgttccgg gcctttggct
ttgcatcctg gacggggagt 360gtcctgtcag ggaccagatg tgtcctgcct catccctagc
tccaatccct tccccacgtg 420accggggatt ctggttgcaa taaaacatgc tgctgctg
458129496DNAHomo sapiens 129gcagtctcgt ccaatttcta
tagcaccgtg ggcaggaacg gcgtcctgcc acaggctttc 60gaccagtttt tcgagacagc
ctacggcacc ccggaaaacc tcgcctcctc cgactacccc 120ggggacaaga gcgccgagaa
ggggcccccg gcggccacgg cgacctccgc ggcggcggcg 180gcggctgcaa cgggcgcgcc
ggcaacttca agttcggaca gcggcggcgg cggcggctgc 240cgggagacgg cggcggcagc
agaggagaaa gagcggcggc ggcgccccga gagcagcagc 300agccccgagt cgtcttccgg
ccacactgag gacaaggccg gcggctccag tggccaacgc 360acccgcaaaa agcgctgccc
ctataccaag taccagatcc gagagctgga acgggagttc 420ttcttcagcg tctacattaa
caaagagaag cgcctgcaac tgtcccgcat gctcaacctc 480actgatcgtc aagtca
496130538DNAHomo
sapiensmisc_feature(475)..(475)n is a, c, g, or t 130aggtcaccca
gctgtgaatg aacgtggtca gaacacagaa tctgagttgg tcacacttcc 60cactgatcca
tggggccttt aagccctctg gaagcttcca ttaaagatga ttatttgagg 120ataattgtat
tgggatgcct atgatcttat ctagggtttt cctacccatc cccaacattc 180agctcagctg
cctctttctt gaggacaccc tcactgatca ccccagccca gccagagtgg 240ttgctcctgc
tcctgcccct gaacctatga catacccaag tcccaatact ttcgagccat 300ctgccactgc
cttttgacat ctctgccttg gctagattca aatggtgttt cataataaaa 360gtctgagttt
aagcagcttt accgaaaacg caagggaagt ttcattccat ttatacttct 420ccagaccccc
tgccatcctc tgctgctacc cacacaggca gaataaaagg cttanatgtg 480taagtcccat
gaaggcaaag attggtctct tgtgttcact gctgtctgta gtacttag
538131414DNAHomo sapiens 131gtggcaaaag gggattcggc agctgtgatt aagaaccttg
tgatggggag ggattcctgg 60attacccagg tgagtttaat gtaaccacaa agatcctttc
aagagggagg caggaaggtc 120tgaggcagac gaaagagctg tgccaaggga agcaggcggc
agtgggatgc aggtggcctc 180tagaagctgg aaaaggcaag tccatgggtt ctttcctgga
gccttcagaa ggagcacggc 240cttgctgacc catcttagaa cggcaggata atcaatgtgt
gttgtttgag gccactaagt 300ttgtggcaat ttgttacagc agcaatagga aactactaca
ctgtgtctga ttagatcagg 360ccaatgaatg gagaaagtat tggatttcag ttgagtgcta
aaacctggtc tgtt 414132408DNAHomo sapiens 132ccagcttcac
atggtctctc aagtgcttat gcttttcttc tctctgccac ccacattccc 60acatcccgcc
caccccccaa ctttcctccc ttcaccttcc catggagact ttttgcctgg 120gctaaatctg
atcctcagcc cactctcaga atcgataaat gcccctaggt gattgtaagc 180tcacctaaga
tatacttttt ctcctctaga attttagttt attagatttt tctagttgtc 240tttgcaaaag
cgttaacagg ctctgacttc tgacattcaa ctagatgtgg aatatccaac 300ccctagcatt
tcatggaatg tactgaccaa gataaaatgt gttcttatta aacaatgcca 360tttcttgacc
acttctgttt ttaggaattg tggtatctga gtcatggt
408133483DNAHomo sapiensmisc_feature(94)..(94)n is a, c, g, or t
133gcgacaaggt tgtgatccac gtggcaggtg ttcagaaggc tgggggcggg cagcgctggg
60gagagccctg ggtacttcga ggagaccccg aagngnggct gctcccacac ctgcgccagt
120ttccaccctc tctgtgagca gggctgcggt cacctcccac atctgaagag aaccaacctg
180aggatttcac gctggctgcg tgccagacca gtccctgaca ggttgtgcga ggcccttcgc
240tggacagccc attgctggcc actggacgga gaggcagagg gggctgaaat tcgggcccat
300gcctctgtga gcgatgacgg agcaacagct ctccagcacg tgaagctctc cagacagctg
360ttcgtgagaa gccagacaga ggcctggggt ctcagtccag atttctgggg agtggggtgt
420ccaancgtgg gccacgctgc tgggagccac ctagggaagc aggtcgcctg tttctatagt
480gac
483134496DNAHomo sapiens 134gggaaaaccc ttgtacctga agcatgagcc actcagaact
caccaaaata ttcgacacca 60taacaacaga tgctcaaact gtaaaccagg acaacaagtg
gatgacttca cactgtggac 120agtttttccc aagatgtcag aacaagactc cccatcatga
tgaggctctc ccccctctta 180actgtccttg ctcatgcctg cctctttcac ttggcaggat
aatgcagtca ttagaatttc 240acatgtagta gcttctgaga gtaacaacag agtgtcagat
atgtcatctc aacctcaaac 300ttttacataa catctcaggg ggaaatgtgg ctctctccac
cttgcataca gggctcccaa 360tagaaatgaa cacagagata ttgcctgtgt gtttgcagag
aagatggttt gtatgaagac 420gtaggaaagc tgaaattata atagagtccc ctttaaatcc
acattgtgtg gatggctctt 480gccgtttcct aagaga
496135479DNAHomo sapiensmisc_feature(305)..(305)n
is a, c, g, or t 135gagtccgagg atttcagggg cagctgggcg caggagctgg
tgggctgttg ggagtgcccc 60tttactgggc aggcttcctt cctcctggtg atggggggtt
cctcagcaca aaagtgaagg 120ggtggagggg ctggaggagc aggaatctct cttgttgata
ggtatgaggc cttgaagtcc 180ttttctttgt cccaggattc atggacgctt cggggctgat
ctttgagttt tcaagcatgg 240ggtgcagaga cgtttaggta aactcttacc gtcctctctc
ttcgtcaggg cttcccagga 300atcancaatg cccaagaagg aagggattgt agaaatagct
taaccctttc atttaccaac 360gtggaaattg aagcccaggg aagggaaggg accggtcgtg
gaagggagag ccatcagcag 420aaagagaccc tgagatcttc gcctgggatt cccaggaagt
ccagcccgag ctgattcac 479136393DNAHomo
sapiensmisc_feature(101)..(101)n is a, c, g, or t 136tcccaagccc
ttagggaccg cagaggactt ggggaccagc aagcaacccc cagggcacga 60gaagagctct
tgctgtctgc cctgcctcac cctgccccac nccaggcccg gtggccccca 120gctgcatcaa
gtggaggcgg aggaggaggc ggaggagggt ggcaccatgg gcccgggcgg 180tgccctccat
gcccggggga tgaagacact gctgccatgg acagcccgtg ccagccgcag 240cccctaagtc
aggctctccc tcagttacca gggtcttcgt cagagccctt ggagcctgag 300cctggccggg
ccaggatggg agtggagagt tacctgccct gtcccctgct cccctcctac 360cactgtccag
gagtgcctag tgaggcctcg gca
393137377DNAHomo sapiens 137aacctatcgc tgacttagca accaaagcct ccatcgttag
gcaaggaata aaataaaacc 60agcacgcttt ttccactgtg atttttaaaa gtcattaaaa
aatatctttt cccttatgta 120cagaaaaatt ggaacagaaa aatatctaac ttgctgagca
tttgatggga aaaagtaaaa 180gataacttcc atttggtaca caacttattg tacatagagc
tatgatttga ggaggcatct 240aatttctgaa caaattcacc aagaaatacc atcacttaaa
gtcattatcg caatcatgct 300gcagtgaaca ctctatacaa aatggccagg tcattaaaca
tcaaagatgg aaaacaagcc 360agcaatctct tctgttc
377138483DNAHomo sapiens 138tgggcctcac ctatgatggg
atgctgagtg atgtccagag catgcccaag actggcattc 60tcatacttat cctaagcata
atcttcatag agggctactg cacccctgag gaggtcatct 120gggaagcact gaatatgatg
gggctgtatg atgggatgga gcacctcatt tatggggagc 180ccaggaagct gctcacccaa
gattgggtgc aggaaaacta cctggagtac cggcaggtgc 240ctggcagtga tcctgcacgg
tatgagtttc tgtggggtcc aagggctcat gctgaaatta 300ggaagatgag tctcctgaaa
tttttggcca aggtaaatgg gagtgatcca agatccttcc 360cactgtggta tgaggaggct
ttgaaagatg aggaagagag agcccaggac agaattgcca 420ccacagatga tactactgcc
atggccagtg caagttctag cgctacaggt agcttctcct 480acc
483139200DNAHomo sapiens
139ttttgcttgt cttattggcc cagcaaccag cttgacactg gggactatca ggctccaaat
60aataaccaat gtctcactcc aaacagacag gatactacgg agccagggtc agcaaacatt
120ttctgtaaag ggccagatag taaatatttt gggctttgtg ggccctatgg tctctgtcac
180aacgattcaa ctctgctgtt
200140243DNAHomo sapiens 140gagcgcctcc agtctagaag gcataagcca ataggataat
atattcaggg tgcagggtgg 60gtaggttgct ctggggatgg gtttatttaa gggagattgc
aaggaagcta tttaacatgg 120tgctgagcta gccaggactg atggagcccc tgggggtgtg
ggatggagga gggtctgcag 180ccagttcatt cccagggccc catcttgatg ggccaagggc
taaacatgca tgtgtcagtg 240gct
243141554DNAHomo sapiensmisc_feature(63)..(63)n is
a, c, g, or t 141tgagtgggct ttgagagagg gggaagagtg agtctgagca cgagttgcag
ccagggccag 60tgnggagggg gtttgggcca gtgcaccttc cggggcccca tcccttagtt
tccactgcct 120cctgtgacgt gaggcccatt cttcactctt tgaagcgagc agtcagcatt
cttagtagtg 180ggttncngnt ctgtnggang actntngaga ntattcttng ttncctgttg
gagttgntca 240aatgtncctt ttaacggatg gttgnatgng cgtcngcnnc caggtttatg
aatgacagta 300gtcacacata gtgctgttta tatagtttag gagtaagagt cttgtttttt
attcagattg 360ggaaatccat tccattttgt gaattgtgac ataataatag cagtggnaaa
agtatttgct 420taaaattgtg agcgaattag caataacata catgagataa ctcaagaaat
caaaagatag 480ttgattcttg ccttgtacct caatctattc tgtaaaatta aacaaatatg
caaaccagga 540tttccttgac ttct
554142479DNAHomo sapiens 142ggacatggtt atctacagca ctgagataca
ctactcttct aagggcacgc catctaagtt 60tgtgatccca gtgtcatgtg ctgcccccca
aaagtcccca tggctcacca agccctgctc 120catgagagta gccagcaaga gcagggccac
agcccagaag gatgagaaat gctacgaggt 180gttcagcttg tcacagtcca gtcaaaggcc
caactgcgat tgtccacctt gtgtcttcag 240tgaagaagag catacccagg tcccttgtca
ccaagcaggg gctcaggagg ctcaacctct 300gcagccatct cactttcttg atatttctga
ggattggtct cttcacacag atgatatgat 360tgggtccatg tgatcctcag gtttggggtc
tcctgaagat gctatttcta gaattagtat 420atagtgtaca aatgtctgac aaataagtgc
tcttgtgacc ctcatgtgag cacttttga 479143514DNAHomo sapiens
143cagttgctgc cctacatgga gaacaggagg ggtgctgtca tcctggtctc ttccattgca
60gcttataatc cagtagtggc gctgggtgtc tacaatgtca gcaagacagc gctgctgggt
120ctcactagaa cactggcatt ggagctggcc cccaaggaca tccgggtaaa ctgcgtggtt
180ccaggaatta taaaaactga cttcagcaaa gtgtttcatg ggaatgagtc tctctggaag
240aacttcaagg aacatcatca gctgcagagg attggggagt cagaggactg tgcaggaatc
300gtgtccttcc tgtgctctcc agatgccagc tacgtcaacg gggagaacat tgcggtggca
360ggctactcca ctcggctctg agaggagtgg gggcggctgc gtagctgtgg tcccagccca
420ggagcctgag ggggtgtcta ggtgatcatt tggatctgga gcagagtctg ccattctgcc
480agactagcaa tttgggggct tactcatgct aggc
514144265DNAHomo sapiensmisc_feature(74)..(75)n is a, c, g, or t
144gtgtggtgtt tgtgtcttaa ctatgcactg ggcccttgtc tgcgtcggct tgcatacaga
60gggcccctgg ggtnngccnt ccggcctggc ctcagccagt gggatggaca gggccaggca
120ggcctntgaa cttccacctc ctggggcctc ccagacctcc tgtgccccca cctgtgtggg
180caggtgggcc agtcttcggg tgatgggacc aaaccccttc agttcagtag agaaaggcta
240ggtcctctac aaagagctgc aagac
265145419DNAHomo sapiensmisc_feature(53)..(53)n is a, c, g, or t
145ggaggcgcag aagattgatc gcatgatgga ggctttcgct tctcgctact gcntgtncaa
60ncccggggtc ttncagtnca cnaggtcagt gcagagccca cagcctggcc cctnnccagg
120cacagcctcn agctctggag gggncggccc ctgtgggcac agccnagcgt gtgttcntgg
180ggacctgcnn tnccctgagc gaggacgacc tgtgggcngg gcacntcttg caggcgggcc
240cccagcacgc ggggtcccac tgtccactgg aggttctggc tgagcccagc accccggact
300cgttgcagac acgtgctacg tgctgtcatt cgccatcatc atgctcaaca ccagcctcca
360caaccacaac gtgcgtgaca agcccacggc agaacggttc atcgccatga accgcggca
419146492DNAHomo sapiensmisc_feature(411)..(411)n is a, c, g, or t
146tatgagaaac ctctgcgacc attcccagat gatgtctgcg ttgtccctga gaaatttgaa
60ggagacatca agcaggaagg ggtcggtgca tttcgagagg ggccgcccta ccagcgccgg
120ggtgccctgc agctgtggca atttctggtg gccttgctgg atgacccaac aaatgcccat
180ttcattgcct ggacgggccg gggaatggag ttcaagctca ttgagcctga ggaggtcgcc
240aggctctggg gcatccagaa gaaccggcca gccatgaatt acgacaagct gagccgctcg
300ctccgatact attatgagaa aggcatcatg cagaaggtgg ctggtgagcg ttacgtgtac
360aagtttgtgt gtgagcccga ggccctcttc tctttggcct tcccggacaa ntcagcgtcc
420agctctcaag gctgagtttg accggcctgt cagtgaggag gacacagtcc ctttgtccca
480cttggatgag ag
492147527DNAHomo sapiens 147aatattgtct cataagcatc tttctatatt tgttcacatc
gtacataatc atgtttttgc 60acagatacat taatattatc atagtttgtt taactacttg
gctttttcta acagtttttt 120tttttgagat ggtcttgctc tgttgcccag gctggagtgc
agtgacgtga tctcggctca 180ctgcagcctt gacttcctgg gctcaagtga tcatcccacc
tcagcctcct gagtagctgg 240gactacaggt atgcaccacg accagctaat tttttgtatt
ttttttttgt agagagggta 300ttttgccatg ttgcccaggc tagtcttgaa ctcctgggct
caagcgatct gcctgcttca 360gcctcccaga gtgctaggat tacaggcatg agccactgca
cccagcctct taacaaattt 420tgaatataac tcctgtctta aaatctgcag aatattgaat
ttttccagct attttttact 480tttgcttagc ttatagatgc taaaggatac tgtcatttgc
attttta 527148476DNAHomo sapiensmisc_feature(50)..(50)n
is a, c, g, or t 148ctctctcact ttctatagct ttgttggacc agatggtgag
gaaaggaatn ggcctcttcc 60cttctagagg gggctggctg gagtgagacc tnggggcttg
gcctnggaac ccaccacaca 120gccccaaagt caggaagcct ggggaaacca gagctgagac
ctcttcaaca gggtttcttt 180gagatcctac acctccattg ggcccttttt cagtcttcaa
tgggggccca gttggctcta 240gaaggagaag aggtgaagca ggatcctttg ccctggggga
gtctgagggc gcggtccttg 300gactcattca ggccgtcttt gtagttgggg gagttccact
gggcgatccc agcccctccc 360cacccaccct ctaatggacc tcctcataga agccccattt
cacttttgtt ttatctacct 420cttagcaaaa caatagataa attaggtagt ggcagctcca
cttgcttagg ttaggg 476149177DNAHomo sapiens 149gggagtttga
ccagagatgc aaggggtgaa ggagcgcttc ctaccgttag ggaactctgg 60ggacagagcg
ccccggccgc ctgatggccg aggcagggtg cgacccagga cccaggacgg 120cgtcgggaac
cataccatgg cccggatccc caagacccta aagttcgtcg tcgtcat
177150497DNAHomo sapiensmisc_feature(109)..(109)n is a, c, g, or t
150ctaaccactg aggctctcta atcttcctct ggagttttag tgaaaggatt tattgagcag
60cttctggaat ataatgtgca tgtccaaaat gaactcagcg cttcaaaang acnaagtctg
120tagcctggag gggcttgagt ggatgnnagc tgatgctgtg attttgagct gtggttacat
180gcagtcagta aacctgtgag actgctggag gaaatgtagc agacagcatg gaggctggga
240cccagcagct actttgggtc atgtctttac tgtcctgcct ccaacccttt agtctcgtag
300acttttgttc ttgtggaaat ttcttctgta ttccagttgt gtaaatatgt atggaaaact
360gatattacta ggttttacgt tgcatctcca gtattgatct ttggaaactg atgttacatt
420aggttccaat tcgcaatagt agcagagact gacatgcttt tattgagctg ctaagccccg
480tggatgatgg agcgaga
497151529DNAHomo sapiensmisc_feature(195)..(195)n is a, c, g, or t
151gccgacagct cctttaattt catggcgttt ttcttcatct tcggagccca gtttgtcctg
60accgtcatcc aggcgattgg cttctccggc tggggcgcgt gcggctggct gtcggcaatt
120ggattcttcc agtacagccc gggcgctgcc gtggtcatgc tgcttccagc catcatgttc
180tccgtgtcgg ctgcnatgat ggccatcgcg atcatgaagg tgcacaggat ctaccgaggg
240gctggcggaa gcttccagaa ggcacagacg gagtggaaca cgggcacttg gcggaaccca
300ccgtcgaggg aggcccagta caacaacttc tcaggcaaca gcctgcccga gtaccccact
360gtgcccagct acccgggcag tggccagtgg ccnttagagg gangcctgcc ctgcccncac
420cgcccaccac nnncnccccn tnnttcctgc tgctacccct gtgtcccgag ggctgggagt
480acctggggcc ccatcccccc agctgtgatg gtggaagccg gtggtggcc
529152437DNAHomo sapiensmisc_feature(145)..(146)n is a, c, g, or t
152agatgaagcc cttcaagcgc tacgtgaaga agaaagccaa gcccaagaaa tgtgcccggc
60gtttcaccga ctactgtgac ctgaacaaag acaaggtcat ttcactgcct gagctgaagg
120gctgcctggg tgttagcaaa gaagnngacg cctcgtctaa ggagcagaaa acccaagggc
180aggtggagag tccagggagg caggatggat caccagacac ctaaccttca gcgttgccca
240tggccctgcc acatcccgtg taacataagt ggtgcccacc atgtttgcac ttttaataac
300tcttacttgc gtgttttgtt tttggtttca ttttaaaaca ccaatatcta ataccacagt
360gggaaaagga aagggaagaa agactttatt ctctctctta ttgtaagttt ttggatctgc
420tactgacaac ttttaga
43715387DNAHomo sapiens 153ttctttcaca ccctgtcggg agaatgtgtg ccctgcgact
gtaatggcaa ttccaacgag 60tgtttggacg gctcaggata ctgtgtg
87154417DNAHomo sapiens 154cccgctggtg cagtggaaga
gcccggcggc cgccgccgca gccttctcgg cccgcgcccc 60cgccgcctgc acccccatct
gctcttcccc gcgggggccg cgcggcgcgg gctgggggcc 120cgggcagccg cgctcgggca
gcgggggcgc ggggctgccg cctgcgctcg cagctggtgc 180cggtgcgcgc gctcggcctg
ggccaccgct ccgacgagct ggtgcgtttc cgcttctgca 240gcggctcctg ccgccgcgcg
cgctctccac acgacctcag cctggccagc ctactgggcg 300ccggggccct gcgaccgccc
ccgggctccc ggcccgtcag ccagccctgc tgccgaccca 360cgcgctacga agcggtctcc
ttcatggacg tcaacagcac ctggagaacc gtggacc 417155407DNAHomo sapiens
155taagagactg agccgctagc agcgcctggg gaccagacag acgcatgtgg caaagctcac
60catcttcact acaaacacgc ctgagagtgg cactggggaa acataactcc atctacacct
120tggatttgga ctgattctcc attttatcac ctgaaggctt gggccagagc tcaacagcta
180ctcaactgga ggggtgaggg ggataaggtc tgtagtatac agacaggaag atggtaggtt
240tatgccttct gtggccagag tcttggactc atggaaatag aatgaataga ggggcattca
300caaggcacac cagtgcaagc agatgacaaa aaggtgcaga aggcaatctt aaaacagaaa
360ggtgcaggag gtaccttaac tcacccctca gcaaatacct atgtcaa
407156399DNAHomo sapiens 156gagaccagtt cacggggcaa gagatgaacg tggcccagtt
cctcatgcac atgggcttcg 60acatgcagac ggtggcccag ccgcagggac tggagcccag
tgagctgctg gggatgctga 120gcaacggaag ctaggcagac tgtctggagg aggagccggc
actgaggggc ccagacaccc 180gctgccccag tgccacctca ccccccacca gcaggccctc
ccgtctcttc gggacagggc 240cccagccgtc ccccctgtct gggtctgccc actgccctcc
tgccccggct ttccctgccc 300ctctcccaca gcccagccag agacaaggga cctgctgtca
tccccatctg tggcctgggg 360gtccttcctg acaacgaggg ggtagccaga agagaagca
399157422DNAHomo sapiens 157gtgaccagta ccgcaagggg
atcatctcgg gctccgtctg ccaggacctg tgtgagctgc 60atatggtgga gtggaggacc
tgcctctcgg tggccccggg ccagcaggtg tacagcgggc 120tctggcggga caaggatgta
accatcaagt gtggcattga ggagaccctc gactccaagg 180cccggtcgga tgcggccccc
cggcgggagc tggtactgtt tgacaagccc acccggggca 240cctccatcaa ggaattccgg
gagatgaccc tcggcttcct caaggcgaac ctgggagacc 300tgccttccct gccggcgctg
gttggccagg tcctgctcat ggctgacttc aacaaggaca 360accgggtgtc cctggcggaa
gccaagtccg tgtgggccct gctgcagcgt aacgagttcc 420tg
422158414DNAHomo
sapiensmisc_feature(364)..(364)n is a, c, g, or t 158acgcagcccg
cgacaacaaa aagacccgca tcatcccgcg ccacttgcag ctggccatcc 60gcaacgacga
ggagctcaac aagctgcttg gtaaagttac catcgctcag ggcggtgttc 120tgcctaacat
ccaggccgta ctgctcccca agaagactga gagccaccac aaagctaagg 180gcaagtaagg
gctgaacttt aaaaatgtaa acttacaaga caaaaggctc ttttcagagc 240cacccaccat
ttctacggaa gaactgagca ctctgttctc caaacctatc agaaatttgt 300ggccgagttc
aagcactgag gccattactt tcctattggg taaaataaaa gtattgaatc 360aggnctagta
aanannannn aanngctacc ttataacatg aaggaacctc ctta
414159470DNAHomo sapiens 159tatcaagatt gcccctgcgg aaggcccaga cgtcagcgaa
aggatggtca tcatcaccgg 60gccaccggaa gcccagttca aggcccaggg acggatcttt
gggaaactga aagaggaaaa 120cttctttaac cccaaagaag aagtgaagct ggaagcgcat
atcagagtgc cctcttccac 180agctggccgg gtgattggca aaggtggcaa gaccgtgaac
gaactgcaga acttaaccag 240tgcagaagtc atcgtgcctc gtgaccaaac gccagatgaa
aatgaggaag tgatcgtcag 300aattatcggg cacttctttg ctagccagac tgcacagcgc
aagatcaggg aaattgtaca 360acaggtgaag cagcaggagc agaaataccc tcagggagtc
gcctcacagc gcagcaagtg 420aggctcccac aggcaccagc aaaacaacgg atgaatgtag
cccttccaac 470160383DNAHomo sapiens 160agagagactc
agagacccgg gagggccttc ctctgaaagg ccaagccaag ccatgcttgg 60cagggtgagg
ggccagttga gttctgggag ctgggcacta ctctgccagt ccagagttgt 120acagcagaag
cctctctcct agactgaaaa tgaatgtgaa actaggaaat aaaatgtgcc 180cctcccagtc
tgggaggagg atgttgcaga gccctctccc atagtttatt atgttgcatc 240gtttattatt
attattgata atattattat tactattttt ttgtgtcatg tgagtcctct 300ctccttttct
ctttctgaca ttccaaaacc aggccccttc ctacctctgg ggctgcttga 360gtctagaacc
cttcgtatgt gtg
383161474DNAHomo sapiens 161aggatgcccc tttgagaaat gctgttccac agaaccctgc
ctttcaggcc ttggagacgt 60gggcagggga gaagcagcgt ccctcagagc caggcctggc
agtggtgcta gcaggggcca 120aggccaggga gcagggtctc ctgtcggagg gacctgggca
agcccctcca cgcgccagcg 180ggtttctcag caggggaggt ccacaccaca ccgcttggga
acctgggtgc ctaaacgcaa 240caggagccaa ggcacaaatt taaccaaaca ccaaggttgc
gtgaggcccc atttcatgag 300ccgggctcca aggacgtgtc cttaggcggc tctggaaggc
ccagcgccag cccccgtcct 360ctgttaaagg gagccagccc cggcgtccgc ccaggcatgg
tagcctgagc gcgcccccag 420ggtagtaggg ggcacctgag gagcagggtc tgccctggca
tgagcagagc ccag 474162371DNAHomo
sapiensmisc_feature(134)..(134)n is a, c, g, or t 162gatacttgga
tgcttttcct ctgactgatg aagatcctga ataccaaaga gggccgctga 60caggtctagg
agtacacttc tagcacctag cagagagagg cttcactaca tcatgcttcc 120tgacatctct
cccnttgaag agcagtcaga ctcctgcttt gctcttcaga cttaatttgg 180gggtttaaca
ggtgaggttg ctgggggaac tcttttacaa catctctctg aaagaatccg 240ggctgccagt
ttcatttggt ttgggtgtca gtagcatgat ggaaagacaa aaaaacacaa 300cttgacatct
gcagaaatgg gttcaaattt tacctgcaac tcaccaattc tgtggccttg 360gttcagcaat t
371163445DNAHomo
sapiens 163caacaagacg gacctggctg ataagaggca gataaccatc gaggaggggg
agcagcgcgc 60caaagaactg agcgtcatgt tcattgagac cagtgcgaag actggctaca
acgtgaagca 120gctttttcga cgtgtggcgt cggctctacc cggaatggag aatgtccagg
agaaaagcaa 180agaagggatg attgacatca agctggacaa accccaggag cccccggcca
gcgagggcgg 240ctgctcctgc taatgcagag ccgacctgtg gcttcccatg acactccttg
cttgttgtgt 300tgcttcctat tggctagctt cctaaggggg gagggaaccg agttatcaag
atgggaggat 360ttttcttttc tctctgtctt taggagtagg gtgggatggg gagggaggct
gggcatcagg 420gatcacatca ctcttaacgg ctgtt
445164313DNAHomo sapiens 164ggtggcctct ggatcctccg tggaccgaac
cgtcccccca ggaacacacc ttcaggtaga 60ccccgaagcc tcaaggccgg ggctggagcg
gagaccccag ggcctctcag gagacagtga 120ggctgcccct cctaccacct acctcattct
gcctactcac cccaggggcc acagccacag 180cctgctggac tcaggactgt cctgtcaact
ccagacaact gaataaacag gccgggtaca 240gtggctcgca cctgtaatcc tagcactttg
ggaggccgaa gcgggtggac cacttgacgt 300ccgtagttcg aga
313165344DNAHomo sapiens 165aatgtcatgt
ttattcaggc tgggaactgt attcacagta gaagtttcag tggtcaacat 60atctatgact
ctttaggctg ctgtagtttt acagtcaatt atttaaaagt gagtagttac 120atttataaga
gcctgagaat acttagactc agtcatttgt tagtattttt accaaaatct 180cttagtttca
gacatgtcag aagcagctat atagcatatc ttattctatg atatacatca 240ggctatctca
agttcctgtc tcacagttaa ttcaaagaag gattaggatt tctgtatttt 300ttctcatttg
aatctttatg tgcatttggt ttgtgtacat gctt
344166448DNAHomo sapiens 166tcttacccca ctgaaaccaa cagggatcgg gccaggctcc
cagattcttg aggacaggga 60cttcggcatt tactaatggg ggactactgt ggggtaaggg
ggcgcctgct tgcctgatac 120aggatggggt caagggacag tgggcaggtc ctcactcagg
agtggggggt gtaggctggc 180cagcccccag ggcttgtcca ccagtcttct ccccgcaagg
ccctcagagc agcgcctgtg 240ggtgtcagta ttacctgagc ctaggccaaa gctagcccaa
ggctggggaa ggggaggaga 300ctccaggtca gaatgtgagg tctcagtctg tgatttaagg
tgttgcatgt ggactcttaa 360ctgtacgtgt agtttctagt ggagaaatca aggctctgat
cattttgttt ttagtatgaa 420aatgtgattt cctttctgtt tgtaactc
448167334DNAHomo sapiens 167agatgccagt aatcaatatt
gaggacctga cagaaaagga caaattgaag atggaagttg 60accagctcaa gaaagaagtg
acactggaaa gaatgctagt ttccaaatgt tgtgaagaag 120taagagatta cgttgaagaa
cgatctggcg aggatccact ggtaaagggc atcccagagg 180acaaaaatcc cttcaaggag
ctcaaaggag gctgtgtgat ttcataatac aaacaaaaag 240aaaaaaaatt aaacaaattc
ttggaaatat ctcaaatgtt aataacaata tgaatttttc 300tcatgcatac tattactact
aagcatgtac gtga 334168561DNAHomo sapiens
168gcccccgact gaggcggaga cgaaggtgct gcaggcgcga cgggagcggc aagatcgcat
60ctcccggctc atgggcgact atctgctgcg cggttaccgc atgctgggcg agacgtgtgc
120ggactgcggg acgatcctcc tccaagacaa acagcggaaa atctactgcg tggcttgtca
180ggaactcgac tcagacgtgg ataaagataa tcccgctctg aatgcccagg ctgccctctc
240ccaagctcgg gagcaccagc tggcctcagc ctcagagctc cccctgggct ctcgacctgc
300gccccagccc ccagtacctc gtccggagca ctgtgaggga gctgcagcag gactcaaggc
360agcccagggg ccacctgctc ctgctgtgcc tccaaataca gatgtcatgg cctgcacaca
420gacagccctc ttgcagaagc tgacctgggc ctctgctgaa ctgggctcca gcacctccct
480ggagactagc atccagctgt gtggccttat ccgcgcatgt gcggaggccc tgcgcagcct
540gcagcagcta cagcactaag a
561169244DNAHomo sapiensmisc_feature(94)..(94)n is a, c, g, or t
169aatgtgtatg tctgggtaag tgtatagatt ttacaactat tttgaaggcg acctttttaa
60ctttaaacag accactctgg aggagacgcc tganccagag cgctttacct aaagttcggt
120gcctaaantg cacccttcct ctggctggtg tctcccttct gccaagctat gcctcctgca
180gaggtaggct ccgtggtgtc tcccactccg ccccaactgg agaacggtgt aaagaactgt
240cagc
244170408DNAHomo sapiensmisc_feature(262)..(262)n is a, c, g, or t
170caggatggca ttagctctgt gtctgcaggt gctgtgcagc ctgtgtggct ggctctcgct
60ctatatttct ttctgccacc tgaataagca ccgaagctat gagtggagct gccgcctggt
120caccttcacc catggagtcc tctctatagg cctctccgct tatattggct tcattgatgg
180cccatggcct tttacccacc caggctcacc caatacacct ctccaagttc atgtcctgtg
240tctcaccttg ggctacttca tnttcganct tgggctgcat ctggcgcttt gcatggagga
300agagcatcaa gaagtaccat gcttggagaa gcaggcggag tgaggaacgg cagctgaaac
360acaacggaca tctcaaaata cactagccaa ggcttgctcc agattatg
408171359DNAHomo sapiens 171aggacatcga ggctgcggtg aaccatgatt gtaccactgt
attccagcct ggacgactga 60gtgagaccct gtctcaaaca aaacaaaaca aaacaaaaaa
aagtacaaga ggaaaaaaat 120tgatttctga ttgcctcact caagataagg tcaacattga
aggtggaggt ggaagatgca 180gtttatgtag gggtctgaag attttaccat tctggggact
gtctttaaga aagagaatcc 240aaaattaggt agaaaagtga acgtctgacc gggcgcggtg
gctcatccct gtaatcccag 300cacttaagga gtacgagacg ggaggatcac gaggtcaaga
gatcgacagc atgctggcc 359172386DNAHomo
sapiensmisc_feature(182)..(182)n is a, c, g, or t 172gtttctgcct
ttgaacgtgg ctgtgggaag acatgatgct tagtgttgct gcagctatct 60catgaccttg
ggcaaaacat cccaacacac aggagggcca aacaagcagt cagaagaagc 120ctgagtcttg
tgggtgttgt tgagcagctg aacaaaccct aggatggctt ccttccagac 180tncttaggat
tgcgaacaat gaagctctat tgtttaagca aggtatcgat ggctattttc 240acttgccact
gaaagcacca ggacagagaa tcgtctttct aggaatacag ccacaaaagc 300cttcattatg
gtatatgcac ataaagaata taaaagtttc ctttatgttt ctctttaaaa 360tatagctgaa
gtctgcctca ggcaaa
386173408DNAHomo sapiens 173ggttccaggc tttgcatctg gagcctttac cggttgactg
ttgccttcca cacaaacagc 60ctctgaaaag cactttctcc atacataatt ctggagaaga
tgagggatct tgccctccag 120gagccttcct tcctccccca atgaggaaat cagtcactgc
actggtgcaa aggcaagcag 180attggaattt ctgctcttca ccgattttct cagggaaaga
ccccttcccc ttgccagcag 240aggaacctgt agttttttcc atttctttct tcagaaccaa
agtatgtatc actcctcatg 300ctcacaggga ttgacaggag agaattcacc aggatcttag
ctcaaaagac acagcctcag 360aatggccaga tggattgcac gaaacctgac ttggattcac
catcttcc 408174331DNAHomo
sapiensmisc_feature(227)..(227)n is a, c, g, or t 174gtggacgagt
gactgtccct ggtttgggct ggtgccattt agagggcaac cagagtgcag 60ggaagggagg
agcttgggca agagggacat tgctgtcgct ggttgatggt gagatggcac 120ttaatgagaa
cctggtcatt gggaaagccc caagcctgcg tcttgctgtg atgccttccc 180cattatgaag
ggtccattgg catgggagtg gggagacctg gactcanana agctacaagg 240gcaagggtgg
aaaggcatag cttntgcaag ttgatgctga aaaagatcca agactcatat 300tcagcagaca
gcccataacc aagagccaag g
331175260DNAHomo sapiens 175aggtcttcaa agaattggcc agtcttacag ctcaccttgg
ggtgtagatg actctccact 60gtggtgctag gcaattttat tgaacaggtg gccactggtg
gtgatggctg aaccactcat 120taaacaaatt gctctaaatg gcctcagtat caaggtgtgc
tttctgtacc cttaatctga 180ctttaatcct gcagaacctc agtcttacca tgtttaacag
cattgccatg tacgatatgc 240ctttatccta cactgtatat
260176528DNAHomo sapiens 176gctggctatg tacatggtcc
cattccctac ctgcacttct ttatgcctgt cttcaccctg 60ctgaccatcc acagcagcca
gcactaccag gccctcatag tgcctgagct cacccagcag 120atggttgatg ccaagaacat
gatggttccc tgagacccct gccatggcca ctacctaaag 180gtggccacag tgttcacgga
ctacatgtcc atgaaggagt tggatgagca aatgcttaat 240gtccaaaaca agaacagcag
ctactttgtt gagtgaatcc ccaactatgt gaaaacagct 300gtctgtgaca tcccactctt
ggggctataa atgtctgcca ccttcaacat caacagcgtg 360gccatccagg agctgttcaa
gcacatctct gagtggtcat gtttcggtgc aaagcctttc 420tgcactggca catgggcaag
agcatggact agatggagtt caccaaggct gagagcaaca 480tgaacaacct ggtgtcccgg
taccagtaat accaggacac ctcagcca 528177540DNAHomo
sapiensmisc_feature(31)..(31)n is a, c, g, or t 177acttatctgt gctgtaacta
ttgaaatgaa nccncttcaa atatgtannc cncntttctt 60tttnanattt ctaganangg
tttcaatata gactttctga cttttatggt atacatatag 120gncaatattc tattcttctt
tccttttaaa tacttactgt ttcaatttca aataaaaaat 180cagcattcta gtttgtacat
tttagcacag aaatgtttac aaccttcagc acaattgctt 240ttgtaattta ctgacttggc
attttgaggc gtttttaaca aattatgaga aataacacct 300tcagaaagca tgtgactact
ttgatgcaac tatttacaat gtattcataa gaagtcatta 360acctgtagag ttcttagaca
tgtggaacct ttaacaatta tactaaagag tacatacaaa 420atacagagct atgtaataat
aactaatttt aaatcctgac aaattagaag ttaagcctac 480tatctgtaaa aatatgtcct
gattcatttt tttaagtata tacctgagcc tttaaaaagt 540178560DNAHomo
sapiensmisc_feature(460)..(460)n is a, c, g, or t 178gccattttga
gtgccagatc tagttatttt gctgcaatgc tgagtggctg ttgggctgaa 60agctcccaag
agtacgttac tcttcaaggt ataagccatg tagaactgaa tgttatgatg 120cattttatat
atggaggaac tctggacatt ccagacaaaa ctaatgttgg tcagatactc 180aatatggctg
atatgtatgg actagaagga ttaaaagaag tagcaatcta tattttaaga 240agagattact
gtaatttctt tcagaagcct gttcccagaa cattgacgtc tatactagaa 300tgcctgatta
ttgctcattc agttggagtg gaaagtcttt ttgctgactg catgaagtgg 360attgtaaagc
attttgcaag gttttggtct gagagaagct ttgcaaatat acctcctgag 420attcagaaaa
gttgtcttaa tatgttgatt cagtccttan tnnnnntnnc nngannnnnn 480ntnnnnntnn
nnnccnnnnn nnngnnnnnn cnnnnnnnnn nnnnnnnnnn nnnnncaggg 540tgcactcaca
gcacagaaca
560179385DNAHomo sapiens 179gggttcacgt cattttcctg tctcagcctc cccagtagct
gggactccag gcacccacca 60ccactcccgg ctaatttttt gtatttttag tacagacagg
gtttcactgt gttggccagg 120atggtcttga tctcctgacc ttgtgatcca cccacctcgg
cctcccaaag tgctgggatt 180gcaggcatga atgaccgcgc ccagccgcag gcgcaacttt
tttgagtttt cctggccagg 240cgcggtggct caggcctgta gtcccagcat tttgggaggc
cgaggtgggc ggatcacttg 300aggtcaggag ttagaaacca gcctggccaa cgtggtgaaa
ccccgtctcc agtaaacata 360caaagccatt acagggcatg gtggg
385180173DNAHomo sapiens 180gacaacctta gttcacttgg
gtattcccat aatccttgtc tttcagggtt gacctgttac 60agctgcttaa acacatcact
gtatgctagg tattgcctac cttcacttac ttttctaacc 120ttgccgatgt gctgccttca
taaactgggt atatctccgc cacacttcta cgt 173181340DNAHomo
sapiensmisc_feature(167)..(167)n is a, c, g, or t 181ggtaactttg
gccaagactt ttcagtagga aatgcttcaa aatacaaagc aagagctatt 60ttcaagaaag
accttctaaa tttatattag gacatagtga gaagaaagcc atctgaaaac 120caggaagaga
gccctcacca gaatctgacc atgctggtgc cctgatnctt ggactttcag 180cctccagaac
tgcaaaattc tggtgtggtg tgaatgctgt ggctcagtcc gaacatgttt 240ttttctgtaa
ttttatcatt attacacgat tgcaatatca gttttgtttt ttaattggaa 300agcaacattt
tctactgttg aaagacgttt tttgacaaat
340182416DNAHomo sapiens 182acagcttgtc tgtcacagtg cctgttctga ttgcaggctt
tggtgttctc ctggtgttaa 60tcctgacttt tttcctagtg atccaccctc tgggaaactt
ctggctaatt cttagcgtca 120cctcaattga gctgggcgtt ctgggcttaa tgacattatg
gaacgtcgac atggattgca 180tttctatctt gtgccttatc tacaccttga atttcgccat
tgaccactgt gcaccactgc 240ttttcacatt tgtattagca actgagcaca cccgaacaca
atgtataaaa agctccttgc 300aagaccatgg gacagccatt ttgcaaaatg ttacttcttt
tcttattggg ttagtccccc 360ttctatttgt gccttcgaac ctgaccttca cactgttcaa
atgcttgctg ctcact 416183503DNAHomo sapiensmisc_feature(78)..(78)n
is a, c, g, or t 183aggccgggct cagaggcgga gaagcctgcc tggtgcccac
agccgtctgg ctcagggact 60ccaccctggc cccgagtngc cgtntgctgg gcctttcctt
cctggctctg caccccatgc 120tggctgcccg gtctggcttc ccttcttgtc tctgtcttgg
gcgaggcagc tgtgagcatt 180gcacagaggc aaagaccctc ctgcagcctn tgcgctgggc
cgtagaaaca agagcctttg 240taatacngaa cctcattcaa ggattaggag tggtggttag
gtcagggcca cccccagtgc 300tgcaggaacg gcctccaccc agctctgttg gtcagagcct
gggtcatgca cctggagttg 360ggagatcaag ntgggtctca gggcagtgag gtggccatat
ccaccacatc gcatttcgtg 420ggggaagagg tgacctcttt gttttaaact taaggtgtct
gcttatccag ccagaaataa 480aaatctgcca gtggtgttcc caa
503184377DNAHomo sapiensmisc_feature(26)..(26)n is
a, c, g, or t 184gagtcccgtc tcagtgtgga ggaacnggct gcacatggga cctgaaggtg
ccctctgtgt 60ttatgttggg ggtggggggg cagtgctggc tgcctctgtc ctgtgtgtga
ccctaccctc 120gaagggtcct gtcctgtcag tcccgaggga gccacaacca aagctgcgga
gagaaggtgg 180ggaagggtgc ggaatggccg tggggcacag cgtggcagac tgttcagtct
ctgctgggtc 240tttcctaggg acctggaagg ccagtgttgc ttccccctca ctccctttca
ctgnaggcag 300cctctctgct tccccaatgc cttatgcctg ggcacactgc cacagaatat
gcaatatgtg 360tgggtgacca tgccctc
377185390DNAHomo sapiens 185gtcatcctgt gctcagttag cagctcatcc
agctgggtca ggaaagcctt ttggaagcgt 60aggaccttgc cagccagcgc tgggatatgc
aggaggacgg ggacagcatt cagcacctcg 120cgcagaaagc ccgactcctc cttcagtccc
tcctgagcta ggtccagcag cctgaggaag 180cgagggtcgt cgtactcgaa gcggcgcccg
caggtgaggg aggcgatcac gttgctcacg 240gctttgtcca agagaccgtt ggggcgaaag
gggcgtcgga gtggttggcg aaggcggcac 300aaaggcaggc ggcctcctcg gtcacccact
gctccagcga cttcttgccc aggcccaagt 360tgcgcaaggt ggagacggag aagcgcctct
390186188DNAHomo sapiens 186ggctggcaac
ccagaaagat tggatttcag tgccatggtg ctggctgcgg agagcttcac 60ctcagggagg
cactactggg aggtggacgt ggaaaaggca accaggtggc aagtgggcat 120ataccacggc
tctgcagacg cgaagggcag cacggccaga gcttccggag agaaagtctt 180gctcacgg
188187549DNAHomo
sapiensmisc_feature(213)..(213)n is a, c, g, or t 187taggaatgga
gcccgagcag tctcgctctc agggccctgt gtggagtcac tgtgctgtcc 60cagctctgga
gacgcagaat tccacatgag gaatgtggaa ttcagcatgg ggatgacgct 120gcttcaccca
gacttggagg agcgtggtga attgcccgtg cccatgctct gatgtgcctc 180tctggccgct
gcgttcctcc tttctccctg ccntgggtca gtgcctgtaa acactgccct 240aaatcagcag
ggcccccgtc acttctgctt tatgcacctt tttcctcaga cacattaata 300caggggagtt
ttgtttccaa gggaccacat ccagatggag gggctgtttt tggtgatctg 360cactgccaaa
tgcccgagtg tccctgacag tcggagctga tgaggccaag gctgtgtgtg 420gttcctctgg
atggccagaa gaggaaccaa aacactgaat tctgggcctt cttaagagtg 480gtgatcagca
cattgtgata gaagcatatc tgggaatgaa cttggcctca agcttttggc 540cttttaatt
549188459DNAHomo
sapiensmisc_feature(120)..(120)n is a, c, g, or t 188ctactctctg
tctccataaa ctggtctatt ttggacattt cacataagcc tccctggatc 60ccagtttaag
catcctgggg tttgtctgcc tgccagagcc atggtgccac tggggctacn 120tgtcctgtgg
gatgacaagg caggtccaaa cctttgcctg ctctcccatc cattcctttt 180gtgttagtcc
atgtgtctcc cgactgttct ctccaacaac aacacagact gacaaaacct 240actgacttgg
agtcaggaac agactttgct attttctggc tgtgtgatcc tgatgagtcc 300cttgaacctc
ctggacttgt tcctcagcct aaaaaccaag actaataaat caagtctatc 360tcacagcctt
acgtggggat caaaaaacat ggagcatgtg aacacacatt gtacatcacg 420aagctgtgtg
caaataaata tcgtgtaact ccagccctt
459189430DNAHomo sapiensmisc_feature(112)..(112)n is a, c, g, or t
189gcccgaggcc tgctgagaag catgggggcc ttgggatagt tccagaatga ggatgtgcgt
60ttctagctgc tttgcgccct cctcccccaa aaatctgcta ccacaattcc ancccggcgg
120cacgccccca agactccttt gtcgccccag gggcgggacc tgagctgtcg gtttcaggag
180cccttcgtga cttcaaaagt cctgggcact gttgctcatg agtgctgcac aactgtcgcc
240ctctaaagcc acctccatcc ctcactgggc tggcctcctg agccttcggt gaggaaacgg
300ggttccgagt tgcccgcctg agagcttaac agtctgacta gaaaagggct aattcgcttt
360ctgtgcaaat ctcttgagct aattatttaa tctgaaacat ggacaggtaa aggaccattg
420gcgggcgtgg
430190406DNAHomo sapiens 190acatcaagca gctttctcgc tttgctggag cttcaagtaa
gattgctcca gtggaagcac 60cagatgctaa ggtgaggatg gtgatgatcg ctggatcacc
agaggctcgg ttcaaggctc 120agggaagaat tatggaaaaa tgaaagaaga aaacttcgtt
agtcctaaag aagaggtgaa 180acttgaagct catatcagag tgccatcctt tgctgctggc
agttactgga aaaggaggca 240aaacggtgaa tgaacttcag aatttgtcaa gtgcagaagt
tgttgtccct tgtgaccaga 300cacctgatga gaatgaccaa gtggttgtca aaataactgg
tcacttctat gcttgccagg 360ttgcccagag aaaaattcag gaaattctga ctcaggtaaa
gcagca 406191555DNAHomo sapiens 191aatgctgtca
gcccttaggc aagactaaat tggaaagaaa ggtgtctgcc aaagaaaaca 60ggcaggcccc
tgtcctcctt caaacataca gggaatcctg gaatggagaa aacatagaat 120cagtgaaaca
aagccgtagt ccagtttctg tgttttcctg ggacaatgaa aagaatgaca 180aggactcctg
gagtcaactt ttcactgaag attctcaagg ccagcgggtc attgcccaca 240acactagagc
tccttttcaa gatgtaacca ataactggaa ttgggactta gggccgtttc 300ctaacagtcc
ttgggctcag tgccaggagg atgggccaac tcaaaatctg aagcctgatt 360tgctctttac
ccaggactct gaaggtaatc aagttatcag acaccaattc taaatgtttg 420aagctttgtt
tctaaaagta ccttgaaatg atagagatgt aggaaaatat agttgtgggt 480ggagagagga
gtgagtttgt ttaggtggga aggtggcatg ggatgaagtt gtcattactg 540agcatcttct
ctgtg
555192554DNAHomo sapiens 192gccctgctca gaggtcagag ggtctgggca gaggagggac
cacattcccc tgccttgccc 60ctgagcactt ctggagactg cgtcctgtcc tatctgctca
ccatcaccct tcctgcccga 120cggagctgct tctgctccct ggggcatatg gactgaccca
cctcctgctg agaaccttcc 180cctaggccct gtgcagaagg gctactgccc cttaggcctc
agctggggga aaggcagttc 240tggtgctgta gaggccctgg tgcagaaagt gggacgtctt
ttttcctaag gtgtttaagc 300acaggcttga taagtttggt ttttaaaaaa taatctagga
aatgaataat tctaaatcta 360gtaatgagga aactgagcat ttcttttgcc ctccagggtg
ccaagaccct acatatgaca 420gaacccttgg cccttctcca tgcctgtggg atctgtttct
ttaaagcact ttgtactgtt 480attcaggagg ttgataatct ccttgaccca tgtctttcta
ccctaatccc cacttccctg 540cagaatcaat ctga
554193319DNAHomo sapiens 193acgcgtccaa catctcaaac
ttgatctcca tctttggctc cggcttctcg gggctggtga 60gccgacagcc ggactcctcg
gagcagccgc cgccgctcaa cgggcagctg tgcgccaagc 120aggcgctcgc cagcctcggc
gcctggactc gagccattgt cgccttctag ggacccccga 180gggcacaggg acccggggcc
ccgcggggct ggggccagac aaagactcgg caaaggggcg 240agaggaggga acgagcgggc
gccgggccac tcggggctga gctgggggcg agcgggggca 300ggcggctgat gttttataa
319194218DNAHomo sapiens
194gaagactttc taaataatga taatcagagc tgtactctct ctggaggcaa acatcatggt
60cctgttgaag ccctgaaaca aatgttattt aaccttcaag cagtacaaga acgttttaat
120caaaataaga ccacagatcc aaaagaagag attaaacaag tttcagaaga tgatttctct
180aaattacagt tgaaggaaag tatgattcct attactag
218195246DNAHomo sapiens 195ccccacccaa atacaagtcc cagtggaaag gaaaggtagt
acctattctt ctccatgggg 60ttcctaacac cctccattac tctttcagtc tccaagcact
ttgaatccat ttttaaacat 120tcaggttgcc agacctgtca cacagtgggc tctgataggg
ttacggaggg ggcctggctc 180tcagtctcta ctctcctatg tcccatcagt tggttggagg
ccaccttcca gggggtatgg 240gagaca
246196283DNAHomo sapiens 196caccttgacg gttccagtgt
ctgtatttat gttgaaagtc caggtgaatg acatcatcag 60tcgtcagtac ctgagccaag
cagttgtaga agtgtttgta aactacacga agacaaattc 120cacagtaact aaaagcaatg
gagcagtgct gataaaagta ccctacaaat taggacttag 180tttaactatt attgcttaca
aagatggcta cgtgttgacc cctctgcctt ggaaaaccag 240aagaatgcca atatattcat
cagttacact ttcactgttc ccg 283197391DNAHomo sapiens
197cgtccgagtg tgagtcagtc agcgacaagg ctcccagccc tgccaccctg ccagccacct
60cctcctccct gcccagccca gccaccccat cccatggctc tcccagttcc catgggcctc
120cagccaccca ccctacctcc cccactcccc cttcgacagc cagtggggcc accacagctg
180ccaacggggg tagcttgaac tgcctgcaga caccatcctc caccagcagg gggcgcaaga
240tgactgtcaa cggcgctccc gtgcccccct taacttgagg ccagggaccc tctcccttct
300tccagccaag cctctccact ccttccactt tttctgggcc cttttttcca cctcttctac
360tttccccagc tcttcccacc ttgggggtgg g
391198563DNAHomo sapiensmisc_feature(116)..(116)n is a, c, g, or t
198agaggcaggc atagaggctt ctccgccagc ctcctctgga cggcaggctc actgccaggc
60cagcctccga gagggagaga gagagagaga ggacagcttg agccgggccc ctgggnttgg
120cctgctgtga ttccactaca cctggctgag gttcctctgc ctgcnccngc ccccnagtcc
180ccacccctgc ccccagcccc ggggtgagtc cattctccca ggtanccagc tgcgcttgct
240tttctgtatt ttatttagac aagagatggg aatgaggtgg gaggtggaag aagggagaag
300aaaggtgagt ttgagctgcc ttccctagct ttagaccctg ggtgggctct gtgcagtcac
360tggaggttga agccaagtgg ggtgctggga ggagggagag ggaggtcact ggaaagggga
420gagcctgctg gcacccaccg tggaggagga aggcaagagg gggtggaggg gtgtggcagt
480ggttttggca aacgctaaag agcccttgcc tccccatttc ccatctgcac cccttctctc
540ctccccaaat caatacacta gtt
563199591DNAHomo sapiensmisc_feature(60)..(84)n is a, c, g, or t
199ctggagagcc agtgcccatg gcccgctgcg tctccacagg gggtcgcccg ccagcccaan
60nnnnnnnnnn nnnnnnnnnn nnnnggatgc ccaatacgag ccaggtgcca gggttcctgt
120cnnnnnnnnn nnnnnnnnnn nnnnnntgga tattggtgcc ctcaagccag gtggacggca
180annnnnnnnn nnnnnnnnnn gagcacgaga gctttgagaa gcctcagctg ctgactgtga
240acctcaccgt gtactacccc ccagaggtat ccatctctgg ctatgataac aactggtacc
300ttggccagaa tgaggccacc ctgacctgcg atgctcgcag caacccagag cccacaggct
360ataattggag cacgaccatg ggtcccctgc caccctttgc tgtggcccag ggcgcccagc
420tcctgatccg tcctgtggac aaaccaatca acnnnnnnnn nnnnnnnnnn nnnnnnnntg
480ccctaggagc tcgcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagcn
540nnnnnnnnnn nnnncatgtc tcctattcag ctgtgagcag agagaacagc t
591200485DNAHomo sapiens 200catcagattt ctgcagatct gctttaaagc tgtacatttt
tgttacagtc taagatgtgt 60tcttaaatca ccattccttc ctggtcctca ccctccaggg
tggtctcaca ctgtaattag 120agctattgag gagtctttac agcaaattaa gattcagatg
ccttgctaag tctagagttc 180tagagttatg tttcagaaag tctaagaaac ccacctcttg
agaggtcagt aaagaggact 240taatatttca tatctacaaa atgaccacag gattggatac
agaacgagag ttatcctgga 300taactcagag ctgagtactg ctccagggtg gtgtgcaatc
ttatattgat gcttgtgaat 360ctgccatttg atttgtagga taaataaata tgtttaatat
taacaacttc catcaaaact 420ataataataa tattatatct actgttgacc tctaacaaca
atcaggtgct gtattcagag 480tcata
485201432DNAHomo sapiens 201gccctgactg actgtattct
ctggccacat tcaagtcccc cattggtggg ggcagagaag 60taggaccagg ccatccttgg
ctccagagct cgaagacccc aagacagccc tctgctctca 120gcggcgccac agagagcctg
ggctcagcct tctgcatcag gacatggcct cgtccactga 180gggcacgatt taaacatttg
acatcagaag ctttatttgt aaacctcaca cagataagga 240ccaagggctg gcggtgtggc
cagaggacag gggaagctga aggccccgtg cttgagctcg 300gcagtcctgc tccttgcagt
gaagccacca tgggtgaccg tccagcctca cccggtggcc 360tgcacagtga gggaagggct
tcagggccat ctgctcccag ggcaggggac aggccaccaa 420ggacctttgg ca
432202499DNAHomo
sapiensmisc_feature(425)..(425)n is a, c, g, or t 202ggtggagaag
ctggcgcgtg agaacagcag catgcggctg gagctggacg ccctgcgctc 60caagtacgag
gcgctgcaga ccttcgcgcg caccgtggcc cggggacctg tggcgccctc 120caaggtggcc
accaccagcg tcatcaccat cgtcaagtcc accgagctct cctccacctc 180cgtgcccttc
tcggctgcat cctagtgccg gccgggggcg gggggtggcg ggcggcgggc 240ggcgggcagg
cgggtggggg cacacccctc gtacctgtca ctgggatgca gactctcgac 300atccgagtcc
aagcgcaggc ccctcgggcg caggcagctc acaccaggaa gagactgtat 360tgcagggtga
agagtgggct cccgtgggcc cagagctgca cgccggtcca cagacacact 420cacgnccgcc
acctgctccc cgcagatgtg tctgtgtgtg ggaattggta tcttgcaccc 480gtgggagtcg
ggacatata
499203569DNAHomo sapiensmisc_feature(107)..(107)n is a, c, g, or t
203ttccagcacc attcttttcc tattaaatta cactggcaaa tttgattaaa aaaaacaact
60gactatatat gcttgtaaac atttccagat tatgttattc ttttaancta aatatgtgtc
120cttatgccaa taccccactc catctattac tgcagtgtat gataagtctt gaaatctagt
180agtgtaagtt cttcaacgtt gcccttaatt tttaaaatca ctcttgctat ttaaaattgt
240ttgtattaca tggaaatttt ataatcagct tgccaatttc tacaaaagtc ctgctgagat
300tttaattggt attttgcttg ttctgcagct taatgcaaga aaattatctt aacaatattg
360aatttttcaa tctattaaca tgttatatat tactgtttac ttaggatttt ttcacttttc
420ctgccttgtt ttgaactgat attgtggttt taagtaattt tttttatttc tactattggc
480ttagtaacta tgccccactt tttgattttg tagcacagtt gaccattgaa caacacaagt
540ttgaattgtg catgtccaat tgtctatgg
569204266DNAHomo sapiens 204ggagcagaga cagagcgacc catacctggc ccaggccccg
gccccgcagg cagctgaatt 60cctgagccca gtgacaaccc cttccccctg cactctgtcg
tccgcccaag cctcaggccc 120tgaggctgca gatgagactt gtccccagct ggctgtccat
cctcctggtg tcagcaagct 180gggtttgcag tgtcttccaa gcgacggtgt tcagaatgtg
aaccagtgac tctcgggcgc 240ccctgtggta actttgcagg cggccc
266205506DNAHomo sapiensmisc_feature(41)..(41)n is
a, c, g, or t 205gcaagagctt tatccagagc tcccacctga tccgccaccg nccgcatcca
cacgggcaac 60aagccgcaca agtgtgcggg ctgcggcaaa ggcttccgnt atnaaaacgc
acctcgcgca 120gcaccagaag ctgcacctgt gttaggggct gggtccgcgg gaggctgccg
tctggggagc 180ctgtgggggg tagatatcct gggactgacc caggggaagg aagtggggaa
ggggcgggag 240ggacaatctg agagtgactg gggagccttt ggtgtttggg gtttcctgaa
gtgggaggag 300tgttgagtaa gttggtcttt cccggtgcta tacttgcctc ctctccacgg
aagaattgtt 360caggagatgc gcttggggtg atgacttcct taaatacacg ctgtaggggg
tgaagagctt 420ggaggaccag gcactttgag gaagggcagt tcgtgggctg gggtgggaac
aggatggcgg 480gcaatagact agggtaggcc gcgatg
506206439DNAHomo sapiensmisc_feature(53)..(53)n is a, c, g,
or t 206tcatttagcc ggtgtccact aactcagtgt tgtgggccat ttgtaaaccc ttntgnngtn
60nncnccaggc agacgtaggg aaagaaagag aggatctgta tagacaagaa agctggccat
120gtgggaagtc cagagctcaa accatgtgcc ccagaggact ggtgctggca ttaagcctgt
180aaatcaaagg cttctttggc aggaccctgg gctgttagaa tcaccctagg gagcagagcc
240aggggacatt ttggcccctg actagcaagg cacaacccta taatggcaga agcccttctt
300tcccctcccc gtttcccacc agacccactt ccttgatggg cctctagcac ccttccaagc
360tgatggggtc gggaatgtga gctggtaaaa tgggcagtgg aaggggctgt actgtttctt
420tacatctcac ggggactag
439207375DNAHomo sapiens 207aagaaatgct acctcggtgc catgttctgg actccgcaga
acaaggactt tttggagaac 60tccagcctat accctctatt gccatgacca gtacttcagc
cactctggtg tcatctcagg 120ctgatctccc tgaattccac ccttcagatt caatgcaaat
caggcactgt tgcagaggtt 180ataaacatga gataccagcc acgaccttgc cagtaccttc
cttaggcaac caccatactt 240attgtaacct gcctctgacg ctactcaacg gacagctacc
ccttaataac accctgaaag 300atacccagga atttcacagg aacagttctt tgctgccttt
atcctccaaa gagcttagct 360ttaccagtga tattt
375208502DNAHomo sapiens 208gttcttgagt acatagccaa
tgccaatggg agggatccca cttcttaccc atccctgtat 60gaagatgctt tgagagagga
gggagaggga gtctgagcat gagatgcaac cagggccagc 120gggcagggaa atgggccaat
gcatgcttca gggccacacc cagcagtttc cctgtcctgt 180gtgaaatcag gcccattctt
ccctctgtgt ttgatgagag aagtcagtgt tctcagtagt 240agaaggcaca gtgaatggaa
gggaacacat tgtatactgc ctttaggttt ctcttccatc 300gggtgacttg gagatttctt
tttgtttccc tttggtaatt ttcaaatatt gttcctgtaa 360taaaagtttt agttagcttc
aacatctaag tgtatggatg atactgacca cacatgttgt 420tttgcttatc catttcaagt
gcaagtgttt gccattttgt aaaacatttt gggaaatctt 480ccatcttgct gtgatttgca
at 502209250DNAHomo
sapiensmisc_feature(110)..(110)n is a, c, g, or t 209tcccctagct
tggggtccag acagcccagt ggacccaggc gcctgagcag gagggtaacc 60caggccaccc
ggccccttcg gccctctcgg ccccaccccc tgcagccggn gncnnncnnc 120nncnacnana
nngcngcgag aagangacag angngactga gcaaaggggg gtgggctcca 180ggcgacccct
agcccaattc tgcccctcca tcccaagggg cagagaaatt gtctttcttt 240gctgactcct
250210440DNAHomo
sapiensmisc_feature(142)..(144)n is a, c, g, or t 210tttggacatg
tccattttgg aagaaacttt tgtgttaaaa taaactaata tattatgggc 60tagaacataa
aattcaccaa gaatttcaag ataaaaatac taatgttttg cttgtttggg 120ttatttcaaa
caataacttt gnnntctata attttttcac caccgaccct ctacctcctt 180gcatgctcat
tctcctgtgt ggctagatgc atttcgggtg ttttgaatat tatttcagag 240caagtatcat
tccagaaaat aagtttaaag tttgaaatgt ttattttttg taacccatga 300atcttcagct
taagtatctt ctgacataaa agcattttca taattataaa agtgctgata 360ttactctcca
cagtattata tctgatcctg caaagtagtt cagataccag agaatactct 420taaacatttt
gactcacgca
440211573DNAHomo sapiens 211ggactcaggg agtacacact taccagtgcc cttaaagata
gccgttttcc cccaatgaca 60agggatgagc tgccacggct tttctgctca gtgtctctgc
tcactaactt tgaagatgtc 120tgtgattatt tggactggga ggtgggtgta catggcatta
gaatagaatt catcaatgaa 180aaaggatcaa aacgcaccgc cacctaccta ccggaggttg
caaaggagca aggatgggac 240catatacaga ccatagactc cttattgagg aaaggaggat
acaaagctcc gattactaat 300gaattcagga aaaccataaa actgaccagg tatcgtagtg
aaaagatgac cctgagctat 360gctgaatacc ttgctcatcg ccagcatcat catttccaaa
atggcattgg gcatcccctt 420ccgccataca accattattc ctgacactga gccgcacaac
cagtcactgg gcctctctgc 480agacctcttc ccaggagacc ctacaccttc ttggtctagc
tatctctttt actgtaccat 540tttatgatga tagtttccgt tgccatggtg aag
573212514DNAHomo sapiens 212cgtccttgtc atatcctttt
aactaggcat ctcagagaag cagagacagg gcagccttcg 60tcctggggga aaagggaccc
tcaggatggc atgagaggtc ctcaatccca agtgtggaac 120tgtccccctc aacttgttaa
aatgcagatt tctgggtctt gccaatgggg cctgggactc 180catgtgacaa ctggcccagg
agcttctgat gtcacacaga attctgcagt cccaagctcc 240agccccgacc tgctctgctg
ttcctaggtg actgccctca cactgctgac cacagtggat 300ttctccccct gctgctcggg
ctcagctggg gtcagccctg cttataaggt caactgtgca 360aaaccttata ctggccaaga
acaaactagt gctgggggag gagggctggg tgccccggcc 420actggtggag tccccaggaa
atcctcagag ctgttgcgag gatgagacac atttgtggac 480acgtccacct gtcctcctga
ccgtctggag agaa 514213504DNAHomo sapiens
213ccggctatgg gctcgagccg agttccttca acatgcactg cgcgcccttt gagcagaacc
60tctccggggt gtgtcccggc gactccgcca aggcggcggg cgccaaggag cagagggact
120cggacttggc ggccgagagt aacttccgga tctacccctg gatgcgaagc tcaggaactg
180accgcaaacg aggccgccag acctacaccc gctaccagac cctggagctg gagaaagaat
240ttcactacaa tcgctacctg acgcggcggc ggcgcatcga gatcgcgcac acgctctgcc
300tcacggaaag acagatcaag atttggtttc agaaccggcg catgaagtgg aaaaaggaga
360acaagaccgc gggcccgggg accaccggcc aagacagggc tgaagcagag gaggaagagg
420aagagtgagg gatggagaaa gggcagagga agagacatga gaaagggaga ggaagagaag
480cccagctctg ggaactgaat cagg
504214529DNAHomo sapiens 214gaaattattc actccgtata ctgaaacaga aataaacgag
gaagaactta caaagccaag 60actcttgtgg gctctttatt ttaatatgag agattcctcg
ggaatcagca gaagctcgta 120taatggcttg ccttccaatg tttatgtctg ctctgggcct
gactgtggcc tgggaaatga 180gcatgctgtc aagcaagctg aaacactttt ccaggagatc
tttccaactg aagaattctg 240ccctccacct ccaaatccag aagacattat ctttgatggt
gatgataagc agccagaggc 300tcctggaacc aataatgtag taatggccaa actagaatcc
tctgaggaaa gcaaaaacct 360agaaagccca gagaagcacc ttcaaaatta gaaaagagca
atctcgaaat gctgttttgg 420acctccttca tggcatcaga attttctcat ttaaaggaca
gtttcccata tgagtaatta 480gaagtggtta tatatgatga atgctatgca gatgttgtct
ttaactctc 529215480DNAHomo sapiens 215tctttgctct
agtattccac ggtgcctctg acatgagaac aggatggaga ctggcttctg 60atttgacatg
cattttgtag gtatgatcca aaatagcttg gaaactatcc cagtcttcaa 120ccatcccatt
ttttagaggt gaaatggcct ccatattctc cctcggaaca cgcagagcat 180tagtatctat
gtagtaggtg ggaccgcctt gtttgccttt atcgccatct atttccatta 240atgtgcttcc
gtcatctctt tctaccacca taccaatagc tgtaggaaaa tccaccttgg 300ggcagtcctc
accagcataa ccagctctca cagtatagga tccaatgtca aaaacaaggg 360ctccaacttc
atctcccccg taccacgccg ccgctcatgg ctgctgccgg cgcgactcct 420accctaaggg
ctaactggcg aagtgactgc agtggccgcg actgcgagtc tcgaggagcg
480216282DNAHomo sapiens 216tggaagcatt tgttgcctcg atcttccact ttagaaaaat
gaagtttctc cttttctttg 60ggagaggata tatctgaata cttgccttct tggcatttat
acattcaaag ctcagtgcta 120gattagagct attatttgca tagtcttttg gtattgccca
cttttggcat taccatatta 180tttgacaatt agaaggaata gggaaggaat attacatgac
tgtaaaagag ttggttatat 240tttatgttga cttcaagggt tccatttgaa ctattatggg
ca 282217563DNAHomo sapiens 217gcaggaccac
cttgaattct gccctgacac actggattgg agagcagcag aacccagggc 60ctggcccacc
aagctggagt gggaaaggca caagattcgg gccaggcaga acagggccta 120cctggagagg
gactgccctg cacagctgca gcagttgctg gagctgggga gaggtgtttt 180ggaccaacaa
gtgaccactc tacggtgtcg ggccttgaac tactaccccc agaacatcac 240catgaagtgg
ctgaaggata agcagccaat ggatgccaag gagttcgaac ctaaagacgt 300attgcccaat
ggggatggga cctaccaggg ctggataacc ttggctgtac cccctgggga 360agagcagaga
tatacgtgcc aggtggagca cccaggcctg gatcagcccc tcattgtgat 420ctgggagccc
tcaccgtctg gcaccctagt cattggagtc atcagtggaa ttgctgtttt 480tgtcgtcatc
ttgttcattg gaattttgtt cataatatta aggaagaggc agggttcaag 540aggagccatg
gggcactacg tct
563218391DNAHomo sapiensmisc_feature(100)..(100)n is a, c, g, or t
218gccagacaac ctgagtgtga atccagcttc accacttcat tcattcactc acccattcat
60tcaacaacat atttgaagca catactttgt accagggacn tttccaggca cnggactaca
120gctatgaaca agacaaacag tccctagcct cccaagagcc gtcacttcag aagggcagac
180atgacacgca aacaaaatga tgccaggtgg taccaagtgc cttggggaaa cagtgccacc
240tttctgagac cgtttctcca tccgtccatg gagctgataa caccagtccc tcagggtgga
300ggtgaagact aagaggttgc tttgagaggg ggaacttggt ggcttttttt caccacctag
360aacctggcac atactaagct ctcaataaaa g
391219474DNAHomo sapiensmisc_feature(417)..(417)n is a, c, g, or t
219aactacgcct ggtacaagct ggcagaggag gtttctgggc gcacagaagt cactgtgaaa
60cagccagaca gccgcctgag gctcagccaa gcccagggga acctgtcggt tctggagacc
120cggcaggtac agctggagtg tgtggttctc aaccgcacca gcataacctc ccagctcatg
180gtggaatggt ttgtatggaa gcccaaccac cctgagcggg agactgtggc ccgcttgagc
240cgtgacgcca ccttccacta tggagagcag gcagccaaga acaatctgaa ggggcggctg
300catttggaga gtccttcccc cggcgtgtac cgtctcttca tccagaacgt ggctgtgcag
360gacagcggga cctacagctg ccatgtggag gagtggctgc ccagccccag tggcatntgg
420tataagcggg cagaggacac cgctgggcag acagctctga cagtcatgcg acca
474220471DNAHomo sapiensmisc_feature(125)..(125)n is a, c, g, or t
220gggaccttgt aacttccttg caagttaagt gagctatcct gtcacggttt tatgttgagt
60gagtgggaag ctgggactct gttttacagc catctgtact ggagcctgga caaaccactg
120gtctntatgg gangccccag cctcacattt ccctggcaag gagagagagg tttagccatg
180tcctgggtct aggattacag cccagagatg ggcacttaag aagacctggt cattggtcca
240gacttgggcc aaggctctcc tctgtgaggg atgggtttta ctggtgaatt acctgtgtgg
300agaagctatc agggccatgt ttagcacact gaagggacca gtctccacca agcactttaa
360catccctcca gccagcatag attgatctcg tgttacagag agggcaaggt ttttggcccc
420tgtttgcaga ctccatgtct taatcagaga ccacagtttt ctctttgttc c
471221527DNAHomo sapiensmisc_feature(408)..(408)n is a, c, g, or t
221taaataatgt cctctacgtg ccggtgtgga agtagcccgg atgcaattga atgaacaaca
60gacggtgctt tccaggacgg cgctgtgctt tccaggatgg tgctgtgctt tcattcattt
120gggtagctcc tctgtgagcc tcccagcgcc gactgcagag cccccactct ccagcctgca
180agaccccgaa attcaagcca cacaaagaaa ggaggagggg gccgttggca tttactgaac
240cttataaaac tgtcagcaaa acagccctta ggcttggact ccctgctagc cgggttttac
300ggtgctgaag tcagcatctt gattcagctg cataaataat ctcctgcagt cctgcaaggc
360ctggggtagg agagggtatg gggaccaggg cactctgtaa gggctggnat aggaacccca
420gggaataaga cagaccaant gcgggacttc agactccact gcagccggga tcgggttgtt
480gttaatttct taagcaattt ctaaattctg tattgactct ctcatgc
527222310DNAHomo sapiensmisc_feature(43)..(43)n is a, c, g, or t
222atacatgtgg ttatcttttg ccctgttgtg atggataatt tgnaaagaag tgggtttatg
60tcaccttctc accttcttat aagaaagctc tgagaatggg catttttgtn ttttnttgtt
120gttgttgaga tggagtctgc cacccaggct ggagtccagt ggcgtgatca tacctcactg
180cagcttcanc ttcctgggct caagtaatcc tcccacccca gcctcccagg tagctngtac
240tataggtgtg cnccaccacg cccagcaaat ttttaaattt attatagagt gggaggcagg
300gtgcggtggc
310223283DNAHomo sapiensmisc_feature(169)..(169)n is a, c, g, or t
223cactgtctgt gtgagtccat tcacttcaat accagagcca cctctttgtt tcctatttac
60taagaagcca taccagcatg agatctcctt gatagtgtta aatcccactg tggaaagatt
120gaaaaatatc tcccagcctt accagaggtt acgatctagt gtggaggcna aagacattga
180gaagaaaaaa gcaggtgcct cctcctggct ctcctgttag gttaacataa tcataattcc
240cctttgaaat gtctcccaca tttgcccttt aacttcctat tgc
283224499DNAHomo sapiens 224gacgactacg gtctggacaa ctttgacaca cagttcacca
gcgagcccgt gcagctgacc 60ccagacgatg aggatgccat aaagaggatc gaccagtcag
agttcgaagg ctttgagtat 120atcaacccat tattgctgtc caccgaggag tcggtgtgag
gccgcgtgcg tctctgtcgt 180ggacacgcgt gattgaccct ttaactgtat ccttaaccac
cgcatatgca tgccaggctg 240ggcacggctc cgagggcggc cagggacaga cgcttgcgcc
gagaccgcag agggaagcgt 300cagcgggcgc tgctgggagc agaacagtcc ctcacacctg
gcccggcagg cagcttcgtg 360ctggaggaac ttgctgctgt gcctgcgtcg cggcggatcc
gcggggaccc tgccgagggg 420gctgtcatgc ggtttccaag gtgcacattt tccacggaaa
cagaactcga tgcactgacc 480tgctccgcca ggaaagtga
499225562DNAHomo sapiens 225tcttctgtgg aggaatggca
tcccaggcct tcacccctcc aggtcagccg tggctgccgg 60ccaagatggc cgcgtgggca
gcctcacatt ccttctcggc ttttggcccc atgtcctcgg 120cactcaggtc tgcagttcag
cccaagtgtt gagactcagg tatgcagctc agggcggcct 180taattaaccc tcccatgggc
ctgggcaccg cctgcgcctc atcaactctg ggctgctggt 240tttgttcctg acgctgcagc
ctgacactgt gggcgggggt gcagtttgcg atggaaggct 300gcctccgaat cgaggaagcc
ttgaccttgg gaggggcctg ccttttcgct gggcttgcct 360ttctctgggc agcgttcgct
cagcacttca gtgcggccga ttcccctggg actgaattca 420caccagccac gacgacttcc
cggctacttc acgttctcta tgtttgcagc tgttctttgg 480tggcagaaaa agatgatttt
tcttcccccc actcccattc ccttttgtta gtttctctcc 540ctgaaccaca ttttgagctg
ag 56222647DNAHomo sapiens
226ttccagaatt tcttccgagg tagtatggtt ttcttcatag gataaag
47227523DNAHomo sapiensmisc_feature(476)..(476)n is a, c, g, or t
227aggcagcgct gcggagagga gcggcagagt gggttgtctg ccgcaggcaa ccaggcaagt
60gtgtcggggc tggggtgtga atgccagcct gtgagtcccg gaactatgtg ggtaccccta
120cccctcacag aagccaaggg catggaggag gtccctccac agtgacaacg gtgtggggta
180ggggaggtgc attcaggaca ccacccaggg acagtgccta tgtgatcacc tcttaaaggc
240taagcttagg ggcatttccc aaagtgggga cagagggcag gacgcccagg ctgggggctc
300tcctcgcccg ccctggtgtc tgacagcctc aaggaaggag cagtgcctgt gtcagccatg
360gggcccttgg agctgccgct ggtgcctagg gggcctgggt ttctgcccag gcagccagtg
420gctgttggga gcctctgttt cccctgtgct gggggccttg agtgctatgc tagcangggc
480ctggccccaa gtgtgagtga tgagcaataa acgtaccgtc ccc
523228138DNAHomo sapiens 228aagtgcgaag tcagggatgg tctaagaggg ctgagaggag
aattccggaa cctcaggacc 60ttgctcactg gctgctggct ggggctgtga agctgtccag
tctagaactc aaagagtgat 120ggtacaggct ttagagcc
138229396DNAHomo sapiensmisc_feature(198)..(198)n
is a, c, g, or t 229gggctgggta cctcttctgg ttgctgagtg gagtgcacca
gcagccccac cccagagaag 60ccctgttgga agcgctgtgg gaatccccca aggtagggga
gtggacacca taaggaaggg 120gaggagtgcc agctccatat gcggtctccc ccatcagtca
ggccagcagc gggttcagct 180gcctctgggc agccctancc catacagaca gggagacctc
cctcccgatc ttctgtgaat 240agtcccttat acccctgctt atgcctcagg ggctcctcca
cccttttgtc ttcatactgc 300atatgaaaac tgcccttgta tatgtggata tctgaatgtg
tcagtgaagg cctatatgaa 360tgtgcacatg tgggtatgtt ctcagccatg tgtata
396230432DNAHomo sapiensmisc_feature(39)..(39)n is
a, c, g, or t 230gaactaaagg agccatctct ctcccctctc ctccgttcnc gagaggaggg
gtgggtctca 60gacgtttttc ctatggactt atttcttcca tgtccaggac tttgcacaac
tttggtttta 120aaagctgttg aaaaatagga aaacaaaggg cattgttcac agatagggcc
aagtctcccc 180ttgcaagggt gcctctgttc tgtccctgcc cccacctcac cttctctact
cctccagtaa 240gttggcagtt ttggtgccaa accccaaatc tccaaagaga catgccaggc
aagacaaacc 300cccaaacacc tcctttccgg tggccttgga aacagattgc tccgagctgg
agaatgtcgg 360gtgaggtgta tgggagagga ggggagagtt agaacttgtg cctttgggag
taaggggtaa 420ctgcctggag gg
432231549DNAHomo sapiens 231atcagtgcca gaaattcctt acctaaagtg
gcatatgcga cggccatgga ctggttcata 60gccgtctgtt atgcctttgt attttctgca
ctgattgaat ttgccactgt caactatttc 120accaagcgga gttgggcttg ggaaggcaag
aaggtgccag aggccctgga gatgaagaag 180aaaacaccag cagccccagc aaagaaaacc
agcactacct tcaacatcgt ggggaccacc 240tatcccatca acctggccaa ggacactgaa
ttttccacca tctccaaggg cgctgctccc 300agtgcctcct caaccccaac aatcattgct
tcacccaagg ccacctacgt gcaggacagc 360ccgactgaga ccaagaccta caacagtgtc
agcaaggttg acaaaatttc ccgcatcatc 420tttcctgtgc tctttgccat attcaatctg
gtctattggg ccacatatgt caaccgggag 480tcagctatca agggcatgat ccgcaaacag
tagatagtgg cagtgcagca accagagcac 540tgtataccc
549232554DNAHomo sapiens 232gatgagtcca
tctcacttgc tcagaacttt gcctggtgag agcggttaca agcgaacaag 60gtggaaatga
aagaaaccct gactttccca ctaggaagga agagactgtt ccttcttgtg 120atgtactctg
aagaaaaatt ctaggatttg gacagatttc ttgggttata aaacatgatt 180ttcttctctg
tttcttgggc ttttataatg ggtactgttg ttttcttgca aagctttaat 240gattccataa
ggacttgtat aaagtttatg ggagaatttt caatgtagat gtgaatggca 300gaaacccaag
aatctgtgtg aggttgaata agatcctgtg tctccagaga ggtctgatgg 360ggagacacag
atctaaattt taaaggtggt ttgggccttc tcaatcatat attaaggtcc 420ttttatgtta
tagataagta aattaaggcc cagaaagatt aatagcccaa ggtcccaaga 480cctgcttgag
acctgtgccc catttctgac taatattctt catgatattg tatcactctg 540tatcaaaacc
aacc
554233539DNAHomo sapiens 233gatggtgcag tacctgctag cactttgctg cagaatgcct
ctgcactcag ttctgcaaat 60gtactgtttt agtttcattt aaaacccctt tttttgtgag
aagatttcaa acatcaggca 120agtttgtaat gaattcaagc tgagttctct cgagggacaa
acatgtataa ctacagttcc 180agtgtcagtg ccagctgtca ggttttcact gtgcagctag
ggctgcctgc atacccagtc 240atgtaaacca aattcactct agaatcggcc aggtcttacc
aaaatgcaaa tagaatacaa 300agcaactgga aatatatttc gtaatttcat tttatgtgtg
attttaaaag ttaagctact 360tcaaaactca tctgtctaac ttattttcac taataagtgt
aacttgcctg gaatttggca 420gatctaagct gggcttgggc tagatggttt caagcctgag
tcattaagat gtgaaattta 480cagaaacaac agaggattga ggaacaagtt aaaggacact
ctaatggtgc agtctgcat 539234431DNAHomo
sapiensmisc_feature(102)..(102)n is a, c, g, or t 234gtgagcatgg
aagtagatct tccccggtca agcccccaga aggacccagc cctgcggaca 60ccttgaccga
aacctgtgag agctccggaa atagaggaac cnagcattcc ctctggaata 120catcagcact
gttgcctttg aggctggcct gcttgaatgc acacctgagc tccggattca 180cagtggagga
agccagatgc catgtcatga gggtgctcaa gcaacttttt ggagatgtat 240gtatggagag
aaactgaggc ctcctgccaa cagccagcac taacttggca agcatgtttg 300agagccacct
gggaagtgga gccttcagcc ccagttaagc cttcagatga gactgcagtc 360ctggccacca
tctggactgc aacttcacaa gagctcctaa gccagagcca tgcagatgga 420ttcttggccc c
431235403DNAHomo
sapiensmisc_feature(139)..(139)n is a, c, g, or t 235gatctcattg
cctttttatg ccgattaaca tgcttttagc ccctactgag cttatagtta 60acagaagttt
ccaggtcttt cttcacctga actgtgtcta aagcaagttc cctccacctt 120ctgtatttat
acgcttgant ttttaaaacc taaatgttgg gcttcacatt tgttccttgt 180aaatttcatc
ttggtgattg cagtctaccc tctggccttt aaaaattgtc tgagccttga 240ttcgatcatg
aaaccagctt acccttcccc tgtgtgctgg ccccagtttt ctaaccaggt 300gttgaatgaa
ctggatggac tctgccagat ccctccgtgc aaggctggaa tcagtccatt 360gttcaactgt
gccctttggg gctgtggttc atttggctct gat
403236257DNAHomo sapiens 236ctgctggaaa ggcatccttg ctgcagctgt gagtgtgatg
ggacagcaga gtcactcctg 60catgggattc tagggctggg ggtcccagag gggtggcctc
cgcccctcct gggggccgag 120gactgtcacc atgtcactac ggcactctcc agctgctgac
caaagccctc gctaaccgca 180gccctgccat actctgggtc tttcctctgg agcaaggtga
agagactgca gcgaggcgtg 240gaattgggaa gctcttc
257237446DNAHomo sapiens 237actgtgactg cgcgcaggac
gagaactgca agtccaccaa gcgcgccatt gagccgtgcc 60tgccccggac gagcggcggc
ggcgcgggcg gccccggcgc gggcggggtc atgggctgca 120ccgaggcccg gcggcgctgc
gaccgcgaca gccgctgcaa cctggcgctg agccgctacc 180tgacctactg cggcaaagtc
ttcaacgggc tgcgctgcac ggacgaatgc cgcaccgtca 240ttgaggacat gctggctatg
cccaaggcgg cgctgctcaa cgactgcgtg tgcgacggcc 300tcgagcggcc catctgcgag
tcggtcaagg agaacatggc ccgcctgtgc ttcggcgccg 360agctgggcaa cggccccggc
agcagcggct cggacggggg cctggacgac tactacgatg 420aggactacga tgacgagcag
cgcacc 446238340DNAHomo sapiens
238ggaacagagg agagatgccg gctggaggac acagcaaatt tgaaccaaga ggagcttgga
60ggaagcccga gcgacctgga ggggactggc tgaccttcct cattcttttc aagtgtgaat
120aataaccaag cccagtttgg caactccttg agggtgagga cgaagcccca ttctcctttt
180tggaacttgg tggggctcag gaagcaggtt ctctccagtc ggtggctttc ctttctgttg
240cgggtctctt gagggcctgc cttcatgaag gcacatgagt gactcatcat ttgtgaatta
300attgctatat gtgaagggca tctgagaaca aattatcttc
340239560DNAHomo sapiens 239tgaccgccat gtggctgtgt ctgaccgcct gcgatactcg
gccatcatgc atggagggct 60gtgtgctagg ttggccatca catcctgggt cagtggctcc
atcaactctc ttgtgcagac 120tgctatcacc tttcagctgc ccatgtgcac taacaagttt
attgatcaca tatcctgtga 180actcctagct gtggtcaggc tggcttgtgt ggacacctcc
tccaatgagg ctgccatcat 240ggtgtctagc attgttcttc tgatgacacc tttctgcctg
gttctgttgt cctacatccg 300gatcatctcc accatcctaa agatccagtc cagagaagga
agaaagaaag ccttccacac 360gtgtgcctct cacctcacgg tggttgccct gtgctacggc
acaacgattt tcacttacat 420ccagccccac tctggtccct cagtccttca agagaagctg
atctctgtct tctatgccat 480tgttatgcct ctgctgaacc ctgtgattta tagtctaagg
aataaagagg tgaagggggc 540ctggcataaa ctattagaga
560240524DNAHomo sapiens 240ggaaatagtt tgttcatatg
gccaaattat aaagggactt agtaaaagaa agctatgttt 60tctgattacg aaggaaatct
atgctcacag tgggaaaaca agaaaatgtg gcaaagcaca 120ggtaagaaaa taaaaatcaa
taatatcaac attatgaata ttttaggtac ttaggaattt 180ggggtagaat gatggaaagc
aaactgttaa ttatagctgt atatttcagt gtagaggcta 240caggtgcctt gcatttgttt
tcttataaaa tctgttccca tacattttac ttactttatt 300tgaatttagg aaactttcat
taggtagcca tttttatttt ctgtttcttt aatcatttta 360ctttgaaata attttaaatt
tacagaaaat ttgcaaaaat agtgtagaaa tttcccattt 420gcctttatcc agcttcctgt
agtgttgcca ttttatgtaa ccatagtaca attattgaaa 480ccaagacatt aactttgaga
ggctgctact actctaagaa ccat 524241504DNAHomo
sapiensmisc_feature(71)..(72)n is a, c, g, or t 241tcctgtgtct tgacccagaa
aattgtgaca tgtaaaaaga ataaattcct ggtttaagcc 60agtaaggtta nnggtacatt
gttacatctc agataattaa aaccttgaaa aactcatgag 120agatcacaag tagaaccttg
atctgaaaca tggcatgtgg cgatttatat tgagtattag 180gttaaaaatg caagaangga
gcatagttaa tattttacnt taaagctaaa acnataattg 240cctacttaaa attttcagtt
aattaggttg tcactttttg ttcttaacna agaaatcaac 300tagttttant ccataaacag
ttagaactga tgcacacatc cgtttntcct tactcatttt 360aaacagctat ctgaaatagg
aagtgtaatn taatntttaa agaatctgaa aacatgacag 420aaatgtttaa actataaaca
tatattgtat atgttagcat attgtataca ttgnatatta 480acataagcta gaatcattga
cata 504242317DNAHomo sapiens
242cgaaccactc agggtcctgt ggacgctcac ctagctgcaa tggctacaga ggctggaaga
60tggcagcccc cggactgggc agatcttcaa gcagacctac agcaagttcg acacaaactc
120acacaacgat gacgcactac tcaagaacta cgggctgctc tactgcttca ggaaggacat
180ggacaaggtc gagacattcc tgcgcatcgt gcagtgccgc tctgtggagg gcagctgtgg
240cttctagctg cccgggtggc atccctgtga cccctcccca gtgcctctcc tggccttgga
300agttgccact ccagtgc
317243437DNAHomo sapiens 243aatgccggct ggctcagtga tggctctgtg caatatccca
tcacaaagcc cagagagccc 60tgtgggggcc agaacacagt gcccggagtc aggaactacg
gattttggga taaagataaa 120agcagatatg atgttttctg ttttacatcc aatttcaatg
gccgttttta ctatctgatc 180caccccacca aactgaccta tgatgaagcg gtgcaagctt
gtctcaatga tggtgctcag 240attgcaaaag tgggccagat atttgctgcc tggaaaattc
tcggatatga ccgctgtgat 300gcgggctggt tggcggatgg cagcgtccgc taccccatct
ctaggccaag aaggcgctgc 360agtcctactg aggctgcagt gcgcttcgtg ggttttccag
ataaaaagca taagctgtat 420ggtgtctact gcttcag
437244389DNAHomo sapiensmisc_feature(299)..(299)n
is a, c, g, or t 244tagatcatgc cctcattggg cttacatgct gttgaaaaga
taggatataa atccatgaaa 60atttttacaa tgctatttat taacaataca tgacaagagt
actagaaatg ttacttgtga 120ctattttgtc tattctagcc aagctggatg cctggctgtt
tctcagttat actaaatgag 180ttctgctctc agggtcttca tacttgccct tccctctgcc
tgcaacactc ttcctccagt 240tttttttttt tttttttggc tctctccatc actttaggtc
tccattaaaa ctgtcagcnt 300tcagggaagt tgccttccct gaccacaacc acactaattc
aaataccaat ccttccccgc 360ctccgtttgg taactctcta gtctcttat
389245136DNAHomo sapiensmisc_feature(68)..(69)n is
a, c, g, or t 245gccccaaggt ctttaagtat ctctgtcact tattagctca ccagagaaga
cacaggaatg 60agaggccnnt tgtttgtccc gagtgtcaaa naaggcttct tccagatatc
agacctacgg 120gtgcatcaga taattc
136246369DNAHomo sapiens 246ggccctgggc taagtcgggg atgaaggcgg
gagctgctgt gctggactgc agctcagcac 60agagacagtg agcctagatt gcagagctgc
ccagggaggg atgtcacctt gggggatgga 120ggctgcaggt gctcctcaga ccttagggaa
acatttggga gggagcttgt tgaggagata 180caggcacctc agggtggctg ggctggatgg
actttgatga cccttccttt tttgagacct 240gatggttctc taatttggga atcatttcca
aagatgggtc taaaaatcct tgtttcattg 300gaaataatga gtttgctatg atgcttaaga
ccaagcatgt caccatttgt tattactgca 360cttttccct
369247444DNAHomo sapiens 247gaggcttttg
acacagttat tagttaaatc aaatgttcaa aaatacggag cagtgcctag 60tatctggaga
gcagcactac catttattct ttcatttata gttgggaaag tttttgacgg 120tactaacaaa
gtggtcgcag gagattttgg aacggctggt ttaaatggct tcaggagact 180tcagtttttt
gtttagctac atgattgaat gcataataaa tgctttgtgc ttctgactat 240caatacctaa
agaaagtgca tcagtgaaga gatgcaagac tttcaactga ctggcaaaaa 300gcaagcttta
gcttgtctta taggatgctt agtttgccac tacacttcag accaatggga 360cagtcataga
tggtgtgaca gtgtttaaac gcaacaaaag gctacatttc catggggcca 420gcactgtcat
gagcctcact aagc
444248394DNAHomo sapiens 248ggggcggcgg aagcgagtag agtttgtgac atttgtgcca
gcccctccag cccagtcacc 60tgaggagcct gtaggggccc ctgctgtgca gtccatcctt
gtggcaggcg aggaggacat 120ccgctgggtg tgtgaggaca tggggctgaa ggaccctgag
gagcttcgca actacatgga 180gaggatccgg ggcagctcct gaccctccac agccacctgg
tcagccacca gctggggcaa 240cgagggtgga ggtcccactg agcctctcgc ctgcccccgc
cactcgtctg gtgcttgttg 300atccaagtcc cctgcctggt cccccacaag gactcccatc
caggccccct ctgccctgcc 360ccttgtcatg gaccatggtc gtgaggaagg gctc
394249414DNAHomo sapiens 249tttgctttgg gtactgtgat
aactactttt tatactttat cccatttaat tataaaaacc 60actcttgaga agtaattttt
attttcagaa ccattttaca gatttaaaat aaacaggttt 120gaggaattag tttaacttat
ccaaagtttc gtggctatta agttctagta tttggagtca 180aatgcaagtc tgtctaaatc
tagagcccat gttctttaac tgcaacacta taatgtctca 240ccccgtccta gtcccaccaa
ttagtcaact cttttagggc agaagtctgt ctaattcatc 300tttgcttcct gttactttat
atttaattaa aaattttagt gactttttaa cttgtaaatt 360gtagctgatt ttacatttat
cttcctgaag gaaactctgt atcattttgt cttt 414250268DNAHomo sapiens
250cttttattag aatgccatgc ctgcttatgt tatgcatgta ttttataata atttaatcta
60ttttacaatt ttaaactcaa atatgattta gtattatgca cataatacaa acagtagtgg
120tgagcaaacg tgtgtttccc ccacatgtgc agaatatgat ggattttatg aaaataaata
180ttcttaactc caggaaatat gatctatatg gttccttaaa agattttcca atacactgaa
240aatttagttc cttatgttca ttgtataa
268251443DNAHomo sapiensmisc_feature(131)..(132)n is a, c, g, or t
251cgtgcagcag atcccaggag ttggaaaagt taaagctccc cttctcctcc agaagtttcc
60aagcatccag caactgagta atgcttccat tggggaactg gagcaggtgg tcggacaagc
120agtggcacag nnagatccat gccttcttca cgcagnccca ggtgagggct ggcctcaggg
180ccacggnnat cttctcccga gaccacaaac accaggatct tgttttcagn tttaaaaacc
240aagagaatgg gccgggtgca ctggctcacg cctctaatct cagcactttg ggaggccgaa
300gacagcggat catctgaggt caggagttca agaccagcct ggccaacatg gagaaacccc
360taaaaatagg aacaattagc caggcatggt gacaggtgcc tgtaatccca gctacttggg
420aggccgaggc atgagaatca ctt
443252281DNAHomo sapiens 252gagaaattcc cacactaaaa acactacaag tttttggaat
cgtgccagat ggtacccttc 60aactgttaaa ggaagccctt cctcatctac agattaattg
ctcccatttc accaccattg 120ccaggccaac tattggcaac aaaaagaacc aggagatatg
gggcatcaaa tgccgactga 180cactgcaaaa gcccagttgt ctatgaagta tttattgcag
gatggtgtct cttctttaga 240acagggaaaa taggcaggaa gcccaattgc tggagtactt a
281253249DNAHomo sapiens 253ccaaatatct agattctgat
cccttttgag gtcctagacc ctttgagaaa ctgatgaagc 60caggcacctc cttcctcagg
aaaatgctgg tgtacaaata cacacaaagc tcttcaggca 120gctgatagat ttcccccaga
gagctattca aggacttcct aaggtgggtg gactgcaggg 180ttaggacacc tgctatagag
gtgacatttt tccaaggaca agcagggact ttggtcttga 240ctgttctct
249254259DNAHomo sapiens
254agaagagcct gaacctcaac atcttcctga agcaatttaa gtgctccaac gaggaggtcg
60ctgctatgat ccgggctgga gataccacca agtttgatgt ggaggttctc aaacaactcc
120ttaagctcct tcccgagaag cacgagattg aaaacctgcg ggcattcaca gaggagcgag
180ccaagctggc cagcgccgac cacttctacc tcctcctgct ggccattccc tgctaccagc
240tgcgaatcga gtgcatgct
259255535DNAHomo sapiens 255aaattctgca atgaacccta caccgaccgg acagaagaaa
gggaagaatc caaagaggaa 60gaagactggt ccctccgacc tgtcctttcg ggagctgaga
aagatgacga agctgagcgt 120ctcagagaaa caacagaaga cggagaagac ccgccaggct
acaccaccga catgagaaca 180gataaagaag ctgactcaaa tggcagaggg cagcctaaag
gagaaacaac tggcaattat 240cccgggtaat atgatcttgg ctgccttgat ggtaattacc
gcggcggtaa gtctccctgc 300tgtctggact gaagaaaatt ttacatactg gcttctgttc
catttcctcc tttaattagg 360ccagttactt ggatggattc ccctattgaa gtttatacaa
atgatagtat tttggatgcc 420tgggccgatt gatgatggct gtcctgcaca gcctgagaag
gagggtatgt tgatgaatgt 480aactggtatg aataccctcc aatttgttta ggaattgctc
cttgatgttt accat 535256230DNAHomo sapiens 256ggaagtaatg
acttttttgc ccatttactc actgagtccc ataatgtggt aaatgtataa 60tgctgacatt
tgttccgtcc ttatagattg aggatagtac ggccctgaat tttgccttta 120ctttagaaac
ctgattcaac ttaaccgaac tctcaggaat ctgattccta agctgagtat 180cacattttag
attacttact aatttgtgca tctatccacc tagcaaatat
230257532DNAHomo sapiensmisc_feature(97)..(97)n is a, c, g, or t
257taaaaccaac cagctgaacc tttcaggcta caagagaacc cgggtcggta atgtcttttt
60aagaataatt tttaattgct tataacaagc atatttngtg gcatttgaac tatatttact
120gctccaatat ccgttatttt ccaaaggatt tngtatcttt ttgaaaatgt ttacatcatc
180agatgatcca cagaattcac tttatgtgag atctcccgag agtttccatc ccaacataat
240ggactttggt ttgaacacaa ttcgtttttt catttgaatt ggcatttccc aatatttgct
300aaacatttgc tggagaaatc atttttcttt tttctttttt agaaaactca gaatgaaaat
360tcattcccct gaaatattta ggtgtctata ttctatattt tgatctatta agggattagt
420atttttccat gtttattgtg ttatcagagt gcattagaaa gattagtgat tcatcttcac
480agcacatttt taatcaagca gttatttcaa ccagcacatt cgttttgttc at
532258489DNAHomo sapiensmisc_feature(363)..(363)n is a, c, g, or t
258atcaccccct gttcattatg tcaggcctca tgggagcctg gccttctcca gaagctggcc
60ccggcgtcct cccaagctgg accacgtagg ccccagatca cacctggggg tccagatgta
120ggggtcccgt gtgcacgccc aatcagaccg agcacttgtg acactacccc aacacctctc
180ccagggctga atgaggaacg cgccactgga cacatgagga agaggctgcc ctgggagcta
240ctgatgctgt gacctcacct ctctggcttt gggcggcagg tccctgcacc taggatgcct
300gcctggaaat gtccttgcat tcgtggcctc cttcacagcc tcctcctcag agaagcctct
360gcnagtgcac agggagtgtg tgcagccttg tgaagggctg ggaccacttg cccagactgg
420ggcccctcag gcacaggcgt ngggtcctac tgacctgtct ccccagctcc cacacagaaa
480gcatctaaa
489259468DNAHomo sapiens 259cagaaggaaa cggtgtctct cggctgtggc tctgagtgca
aattgcatgg gcggaaaggc 60gggggtggct gctcttcctg gcaggcctgg gccatcagcg
aactgggccc cgtgaggagg 120gcgggagtgt ggaggagggt gggcctctca cccaggcttt
ctcggcccct ctcctcagct 180tgcagagctg gccagccccc tccttagggg gtgggcgagg
agcctctggg cagacccaag 240aaccatgggg actggggtgg gttggtggca ccaatggcag
ccctccccgc ccctctcctt 300caaggagggt tcccgcagct ggggggtgtg cggaggcgca
tggcctcccg ccacggggcc 360gtgctgtgtt tatggctggc agaggcagcc agcgggtggg
ggattctgct gctcgctcac 420ctgcctggct cgctggtctc tcgaattttc ttccctctga
aatcctat 468260531DNAHomo sapiens 260ctgcaccaac
tcatgctgga ctgttggcag aaggaccgca accaccggcc caagttcggc 60caaattgtca
acacgctaga caagatgatc cgcaatccca acagcctcaa agccatggcg 120cccctctcct
ctggcatcaa cctgccgctg ctggaccgca cgatccccga ctacaccagc 180tttaacacgg
tggacgagtg gctgaaggcc atcaagatgg ggcagtacaa ggagagcttc 240gccaatgccg
gcttcacctc ctttgacgtc gtgtctcaga tgatgatgga ggacattctc 300cgggttgggg
tcactttggc tggccaccag aaaaaaatcc tgaacagtat ccaggtgatg 360cgggcgcaga
tgaaccagat tcagtctgtg gaggtttgac attcacctgc ctcggctcac 420ctcttcctcc
aagccccgcc ccctctgccc cacgtgccgg ccctcctggt gctctatcca 480ctgcagggcc
agccactcgc caggaggcca cgggcacggg aagaaccaag c
531261379DNAHomo sapiensmisc_feature(210)..(210)n is a, c, g, or t
261cctcggacac cagagacaat aactgagcgc ggaggacacg cctgccctgc ctgccatctg
60tggcccgaag ccattgccat ccactgcaga cgcctggaga gggacaggcc gcttccgagt
120gcagtcctgg cgcagcaccg actcccacgc acccggggaa ggacaccctc actcccacac
180cccgggaaga acactagaac atcagcagan gggccctgcc cctccgcctg cagccgtgaa
240aggaagctgg gtcatcagcc cagccccgcc caccccagcc cctatgtgtg tttccctcaa
300taaggagatg ccttgttctt ttcaccatgc naataacatg cccagcaaaa acttgcttta
360tgggtctgcc tggagaaaa
379262486DNAHomo sapiens 262aaccacacca gaagacatcc tcaggaacaa aggctgctcc
agctctacca gtgtcctcct 60cacccttgac aacaacgtgg tgaatggttc cagccctgcc
atccgcacta actacattgg 120ccacaagaca aaggacttgc aagccatctg cggcatctcc
tgtgatgagc tgtccagcat 180ggtcctggaa ctcaggggcc tgcgcaccat tgtgaccacg
ctgcaggaca gcatccgcaa 240agtgactgaa gagaacaaag agttggccaa tgagctgagg
cggcctcccc tatgctatca 300caacggagtt cagtacagaa ataacgagga atggactgtt
gatagctgca ctgagtgtca 360ctgtcagaac tcagttacca tctgcaaaaa ggtgtcctgc
cccatcatgc cctgctccaa 420tgccacagtt cctgatggag aatgctgtcc tcgctgttgg
cccagcgact ctgcggacga 480tggctg
486263350DNAHomo sapiens 263tctccgtgga ggctatggct
tcagacaggc cccgaaggtc tgtcaccaat gtgctcggtt 60gtgggtcaca taacgctctc
tggagggctt gcctttcagc ttgggatcat gaaaagatga 120tttgacgctg tttctcatgg
tctccgacct aataaagcaa gataagagaa aacaaatgtt 180attttaaaaa aatcaccctt
tggcaaaaga aacatgtaaa attagaatct ggcacaaaca 240aaacctgaat ctgggttgtg
aactttcacc acccgccgca actctttgat aaaacctcaa 300gtgatatcta ttaccattgt
aaaaataaag cctgccccta tgcttagaat 350264507DNAHomo sapiens
264ggcaaccggg gaagtattgt ggccttggag tttgctaaat ccaaatatga aaatcaaaag
60ctttagtatt cctcatcttc tcttctggaa gatttgcgtt agagtttttg ttgggccttc
120aaaaagctgt gttcagagtt aggagaatat atccaataaa agatggtttc gtctaccaat
180tggggaagtt tcaccctctc cctatctgaa gaaaaaaatc aaaaacaaat gtccccggat
240ctttcgatgc aagtcctgga ggcagggaga tcactgcctg cctggcccac gctgctggga
300cggctcgtcc tccctgcttt ttgtttttca aacctcctgc ttctcccacc ttgggaagga
360gaaatgtgaa acccggcagc ggccgaccta ggcggtcttg tggcccggag ccggcccggc
420ccgaaaacca tagacctggt tgtactgtag cttgttgttt gggggaccaa attttctaga
480gagaactaga gcacttttgt tgtgttt
507265192DNAHomo sapiens 265cacaggcctt cagaggcgat ggctgggcga cagtgacgaa
agcaaagcaa agcagggctg 60tggagacact cctcgcattt gtctcttccc tccaaggatt
atctgagcaa gtcgacttgt 120tcattcaaag gcggggtctg ccaagccctg ctctatccaa
tggggatagc ttctacgtaa 180cggattccaa tt
192266202DNAHomo sapiens 266agagcaacag ctctatatct
ggatcactgc agtgcctaga agatacaaca gcacaattta 60caaatccaaa tttccaggaa
gtctctgcac atacctctag tacaaaagat gtttcagaga 120ctagagggtc agaaggcaaa
gagaggcaat attcaactcc cagttcaggt caaaagggaa 180gaaagcctgg tgttgaaaga
aa 202267278DNAHomo sapiens
267gaaccacgtt ctttgtatgg gcccaatgag ctgtcaagct gccctgtgtt catttcattt
60ggaattgccc cctctggttc ctctgtatac tactgcttca tctctaaaga cagctcatcc
120tcctccttca cccctgaatt tccagagcac ttcatctgct ccttcatcac aagtccagtt
180ttctgccact agtctgaatt tcatgagaag atgccgattt ggttcctgtg ggtcctcagc
240actattcagt acagtgcttg atgcacagca ggcactca
278268392DNAHomo sapiensmisc_feature(302)..(302)n is a, c, g, or t
268ctcctggcct gatactctag ggatgcaggt gggagaagca ggggtcctgg gggctgcctg
60gagctctggg aggcattctg aacggggtct actactgatc tcaggtgagc tctgccctcc
120tctgaaagtc acttttctca tcagttaaat gggggcaagg gtccgtggtc cgaccaaggt
180cttggcttca cagacatcac caggagcctg catgcccctg atcactcctt ctccttcctc
240caggaaactc cagcctggcc tctgacccca gttcaatccg accatgccca agcccaagcg
300gncctttcct ccagaactgc tccggggcct ggctgtgtga ctggagcaag gtgctaaacc
360tctctgtgcc tcgctggtct aatctgtaaa at
392269417DNAHomo sapiensmisc_feature(240)..(240)n is a, c, g, or t
269taatctcatc caaaaccatg ctcacagaca cacccagcat aatgtttgac caagtatctg
60ggcaccttgt ggttcagtca aattaacaca tattaactac cttagcaaga tgaaaagcag
120tgaatgcagg atggtggttg aaattttaaa tacgttggtt atatagtctc attgaaaaag
180gaacatttga gtgaagactt gaaggggtgg tggaataaac catttatttg cttattgccn
240gtctccctct atcagaatga aagcttcatg aagcgagaga cttaattttt atctgttata
300tccctagtgc ctggtgcagg gtaagtactc aaaaatattt gttgagtgaa taagtaatga
360ttgaggatgg ggactggttt gtatctggtt atatctcttg tccttagcac agtacct
417270412DNAHomo sapiens 270ggggccctag ggattatagc caggactcta atctgcctac
catgccattt aacaagagat 60cccactctcc agctgccttg tgtccctagg gtcctggcca
tgtgtttagt gtgctaaact 120ttctcctttg ttctcaggcc ttccaggtag tccccttcct
ggacttaaga gtgcaaactc 180ttctctgtgg ttctagcctt gggcagaatt atatcccaga
gaccacagag caactgtcaa 240gctgcttacc ccctcaccca gggctacagc ctgtgcccag
ccctctaatt tgtgcctctc 300ttgtgttggg ggtggtgggg gttattcctt tccctttcct
gctctggcct ccttgaaagt 360tcagagtacc cagtacaagt cagccaccat gctgacgggt
atttttcctc at 412271372DNAHomo sapiensmisc_feature(76)..(76)n
is a, c, g, or t 271tagccaggta tagtggcagg aacctgtaat cccagctaca
ggggaggctg aggcaggaga 60atcgcttgaa cccggnaggt gtaggttgca gtgagccgag
attgcaccac tgcactccag 120cctgggcgac agagcgagac tctgtctcga aaaaaaaaaa
ggtccgtgcc aagctgctcc 180ctgcccttgc cctttccctt tccctggggt ccaaaccaca
tgtgtcctgc ctctcctggc 240cctaccacat tctggtgctg tcctcactcn cccctggccc
agaggctcct gaagatgctg 300ggcggtcctg gcacagggag gagcagctct gtaaatctgt
gcacatngcc actcttggcc 360taataaagga gg
372272427DNAHomo sapiens 272cctaccaccg tcttcgagag
gatgtcatgc ggctctctcg cctagcactg ggctcagagg 60cctggcgccg agtctggagc
cgcagtctgc agctggcgag ttggccaaac cggggagggg 120cacctggagc tccccagggt
gaccctatga gggtattctc agttaggacc cggagacagg 180acactcttcc tgaagcgggg
cgcagatcag aggcagaaga ggaggaggcc aggaccatca 240gagtgacacc tgtcaggggc
cgagagaggc tcaatgagga ggagcctcca ggtgggcaag 300acccttggaa attgctgaag
gagcaagagg agcggaagaa gtgtgtcatc tgccaggacc 360agagcaagac agtgttgctc
ctgccctgcc ggcatctgtg cctgtgccag gcctgcactg 420aaatcct
427273526DNAHomo sapiens
273gtccacattc ctgcaagcat tgattgagac atttgcacaa tctaaaatgt aagcaaagta
60gtcattaaaa atacaccctc tacttgggct ttatactgca tacaaattta ctcatgagcc
120ttcctttgag gaaggatgtg gatctccaaa taaagattta gtgtttattt tgagctctgc
180atcttaacaa gatgatctga acacctctcc tttgtatcaa taaatagccc tgttattctg
240aagtgagagg accaagtata gtaaaatgct gacatctaaa actaaataaa tagaaaacac
300caggccagaa ctatagtcat actcacacaa agggagaaat ttaaactcga accaagcaaa
360aggcttcacg gaaatagcat ggaaaaacaa tgcttccagt ggccacttcc taaggaggaa
420caaccccgtc tgatctcaga attggcacca cgtgagcttg ctaagtgata atatctgttt
480ctactacgga tttaggcaac aggacctgta cattgtcaca ttgcat
526274429DNAHomo sapiens 274tgtgtccact ggtttcagtc tgagttctct gcactttgag
gatgcagaca gtgaagttct 60cccatggtta tagggggaga gatcatagga atgctatgga
aagaggcctg aagtcagagc 120cagctagtgg ttattattta ttaattgcct gtgaggtgcc
aggcgcacat attagaccat 180atgtgattgc agtgagccac ccggatcccc ttcaagctgc
tgctgcagct gatggaagtc 240ctattggcag acagccttct ctcatcagcc ccttcaggac
ttgcctcagt tgcagagagc 300tgccttcccc aagatcacac ccttccctgg ggactcacaa
ccaatggctg atccagaaga 360atccataaag cccgtatcat ttcagcccaa tttaggacag
ctttgttgag ccattagacc 420tacatgcag
429275434DNAHomo sapiensmisc_feature(376)..(390)n
is a, c, g, or t 275gaagctctac ttgcctggtg gtaattccag gatgacccag
gagaggctgg aaagagcgtt 60caaacggcag ggcagccagc ccgcacctgt caggaaaaat
cagttgctgc cgtctgacaa 120ggtggatggt gagctgggtg ccctgcggct cgaggatgtg
gaggatgagt tgataaggga 180agaggtcatc ctgtcgccag tcccatcagt gctcaagttg
cagacagcat caaaaccaat 240tgacctctca gtagcaaagg aaataaagac ccttctgttt
ggttccagct tttgctgttt 300caatgaagaa tggaaacttc agagtttttc ctttagtaac
acagcctcat taaaatacgg 360catagtgcag aacaannnnn nnnnnnnnnn agtcctggca
gctgtccaag gctgtgtcct 420acagaaactc ctgt
434276189DNAHomo sapiens 276aaaatcactg ccactgactt
ttaccctctt caggaagagg ccaaggagga ggaacgcctc 60atagctttga agaaaatcct
cagctcgggg gtgttctatt tctcatggcc aaacgatggg 120tctcgctttg acctgactgt
ccgcacgcag aagcaggggg atgacagctc tgaatggggg 180aactccttc
189277542DNAHomo sapiens
277gaggagcagg caaggctacg tgggcagctg aaggagcaaa gcgtgcgctg ccggcgcctc
60gctcacctgc tggcctcggc ccagaaggag cctgaggcag cagccccagc cccagggacc
120gggggtgatt ctgtgtgtgg ggagacccac cgggccctgc agggggccat ggagaagctg
180cagagccgct ttatggagct catgcaggag aaggcagacc tgaaggagag gccagggagg
240gttctccccg tgacaacccc actgcacagc agatcatgca gctgcttcgt gagatgcaga
300acccccggga gcgcccaggc ttgggcagca acccctgcat tccttttttt taccgggctg
360acgagaatga tgaggtgaag atcactgtca tctaaaagcc ggctactgtc agcaaagcct
420gaagaagtgg ggctggatac cctgccccca ccatatccct accatccctt ctcagtcaac
480cctttaccct tacagtagca agcatagacc cctgtctaac gggggtagac aggtgcagat
540ga
542278475DNAHomo sapiens 278gacagtctac cgtcacgaga agcgggtgaa actgcagatc
tgggacacag ctgggcagga 60gcggtaccgg accatcacaa cagcctatta ccgtggggcc
atgggcttca ttctgatgta 120tgacatcacc aatgaagagt ccttcaatgc tgtccaagac
tgggctactc agatcaagac 180ctactcctgg gacaatgcac aagttattct ggtggggaac
aagtgtgaca tggaggaaga 240gagggttgtt cccactgaga agggccagct ccttgcagag
cagcttgggt ttgatttctt 300tgaagccagt gcaaaggaga acatcagtgt aaggcaggcc
tttgagcgcc tggtggatgc 360catttgtgac aagatgtctg attcgctgga cacagacccg
tcgatgctgg gctcctccaa 420gaacacgcgt ctctcggaca ccccaccgct gctgcagcag
aactgctcat gctag 475279294DNAHomo
sapiensmisc_feature(225)..(228)n is a, c, g, or t 279ttttttagat
ctaccctctt gttgcccagg gggagtccag tggcgtgatc ttggctcact 60gcaaccgccg
cctcccgggt tcaagcaatt ctcctgcctc agtctcccga gtgtcttctg 120tcttttgtaa
aagtttttca tgcccaagtg agattaattg tttaaaaaaa aaaaaacaag 180aagaaaacaa
catagattta ccgcaagacc tattgatata ttatnnnnca nggtggtata 240cccagggtgg
gtgtgacaca gaccaaaaga ggctgtgtgt tctgttgttg ataa
294280421DNAHomo sapiensmisc_feature(129)..(129)n is a, c, g, or t
280ggaagcgtgt ctgctgggag tttgcgaccg atgactatga cattggcttt ggagtttatt
60ttgactggac ccctgtaact agcactgaca taactgtgca ggtcagtgat tccagtgacg
120atgaggatna agaagnnagg aagagnagga agagattgaa gaacccgttc cagctggaga
180tgtggagaga ggctccagga gctccttgcg gggtcgctat ggggaggtca tgcctgtgta
240ccggcgggac agccaccgag acgtgcaggc tggcagccat gactaccctg gtgagggcat
300ctacctgctc aagttcgaca acncctactc cctgctgcgc aacaagactc tctacttcca
360catctactac accagctgaa ggactgctgt gacaggggca ggctgtattt gctggctgaa
420g
421281544DNAHomo sapiens 281atgagaacgg cgtcttcatg tgcgccgagg gcaccggcaa
gttctgtccc ctgaggtcct 60tcccagacac tgtctacaag aagctggtcc agagagagaa
gactttaaag gttagaggag 120tggaccgcac tccctacctg ggggatgtcg ctgttgtcgt
gcaccctggg aaaaaagaga 180tgggaacccc actcgcagac actcctaccc ggcccgtcac
ccggcatggg ggcatgaggg 240accttcacga atccagcttc agcctctctg gctctcagat
cgatgaccat gttccaaagc 300gagcttcagc tcggatcctc gctcctcccg gaggcaggtc
gagtggcatt tggtaaaggc 360attgccaagc cccccgagtg aggacgcacc gccgccacca
gcccgcaact ctccagccga 420agctgcaggg gcaggagagg ctgggctggg tggcacacca
cccgaggggg gccccgggac 480ccacggagcc ctccctatgt ctgcaaagtg attcactgtg
cttcgagcca actctaacag 540gcac
544282430DNAHomo sapiens 282ctgattctac ttctgcaggg
ttccacagaa gtctccagtc ttcaaatctt cagtgtatga 60aagcacagat tcctgaaaga
atggcctcaa atgaccagga gtaggagctc tctatatccc 120tgctcctgaa aaacaagcta
actggagtct ccatcacctg ccaccagcta tacacactac 180caactaccca actgaactcc
atgactgatt tgccagctaa tcatgcccct gacccagccc 240acatggacat gggaaggaca
tcagtgaact gtgaaaagag gcagagactc actcccgttt 300gtattatgaa aacacacgcc
aataggacat aaaaagaagc aagagtactg ggctttacca 360tgagttcaaa tctcatttct
ggcaattcct atgtctaaaa aaagcttcgt aatctctttt 420gagccctcac
430283219DNAHomo sapiens
283ccagaggatg atagcacctg tcagtgccag gcgtgcgggc ctcaccaagc cgcgggtcca
60gatcttggtt cctctaatga tggctgccct cagctgttcc aggagcggtc agtcatagtg
120gagaactcct caggctctac cagcgcttct gagctcctca aacccatgaa gaagaggaag
180cgcagggaat accagagccc atcagaggag gagtcggag
219284232DNAHomo sapiens 284tttgcctgag gttgactata catacaaata ttgagcattt
cctcctggtc tccgtgataa 60acaaaggttt tgatattgtt cggcgagatg gaaagaaaat
atcaaggagt gagctgaagc 120cactgccctt gagaaccctc tcgaggagtc tggcctcatg
aagatgccag aataaacggc 180agatatatcc tgaatgaatg tgagattttt accctgtgaa
tttcctgtga gg 232285249DNAHomo
sapiensmisc_feature(208)..(208)n is a, c, g, or t 285agtgcttcca
gtggcccaaa aatgcttttt gaagtgtgtt ttgaaacagc ccccaccaac 60atacacccca
ccaggagtac tgatcctgcc tcccttcatg tctaggggaa gcattcgcct 120ttgagcactt
gtttgcaaat ctggggagtt tgagacctcc tagcatctct tcccttcttt 180ccctgcagtc
tattcactcc cgcagccnaa aaatctctgg cgttcaggtt agcagtttct 240gggttggtt
249286510DNAHomo
sapiensmisc_feature(138)..(140)n is a, c, g, or t 286gggaattacc
ttttgtattg cttgaattta ctgctgtctg tatgaactct ttttcagata 60aatttttaag
aaatcagata agtgaagtga aagagagaga tcaaagtgtt gtggcagcac 120aaaggagaga
ctgactannn tnntgctggg gaatctgaaa gagtgctttg gtggaggtaa 180catgagatca
gggccttgaa gggtgagtca agtctgtcaa ggagacaaga gggagagaag 240agcttgccag
aggcccagag accagcgagg aggctgtggt gtcctggaat gagggcgaga 300tacttggtgg
gactggtcaa cacggcaatg aagagggata tggccgagga aaatggagag 360gggcactgga
nctgtgccag caaggactgg gatgcgtgga cttgatcctg tagataacgg 420gaggaagaaa
ggcctggatg cagcgccatg tcatgagcac atctgatcat gacagctcac 480ctatgggagg
attctccctc aacatttttc
510287555DNAHomo sapiensmisc_feature(39)..(39)n is a, c, g, or t
287aggatgtgac agtgactcgg ggcgaccagg ctatgtttnc ttgcatcgta aacttccagc
60tgccaaagga ggagatcacc tattcctgna agttcgcagg aggagnnctc cggactcagg
120acttgtccta tttccgagat atgccgcggg ccgaaggata cctggcgcgg atccggccgg
180ctcagctcac gcaccgcggg acgttctcct gcgtgatcaa gcaagaccag cgccccctgg
240cccggctcta cttctttctt aacgtgacgg gnnngccccc gcgggcggag acagagttgc
300aggcctcgtt ccgggaagtg ctgcgctggg cgccgcggga tgccgagctg atcgagccct
360ggaggcccag cctgggcgag ctgctggcca ggcccgaggc tctgacgccc agcaatctgt
420tcctgcttgc agtcctcggg gccctcgcat cagcgagtgc gacagtgttg gcgtggatgt
480tctttcgatg gtactgcagt ggcaactaac aaaggtatct ttcctccttc cctatcctat
540ttccatcctg aaaat
555288381DNAHomo sapiens 288atgtatccgc tgtcaactac gaatttgagg atgaatactt
cagtaatacc agtgccctag 60ccaaagattt cataagaaga cttctggtca aggatccaaa
gaagagaatg acaattcaag 120atagtttgca gcatccctgg atcaagccta aagatacaca
acaggcactt agtagaaaag 180catcagcagt aaacatggag aaattcaaga agtttgcagc
ccggaaaaaa tggaaacaat 240ccgttcgctt gatatcactg tgccaaagat tatccaggtc
attcctgtcc agaagtaaca 300tgagtgttgc cagaagcgat gatactctgg atgaggaaga
ctcctttgtg atgaaagcca 360tcatccatgc catcaacgat g
381289488DNAHomo sapiens 289cacgctcctg gaacgtcaga
tcattattga ggcaaatgat cgccatctag aatcagcagg 60acagactgag atcttccgaa
agcacccccg caaagcctcc atcctcaaca tgccactagt 120gacaacactt ttctactcct
gcttctatca ctacacagag gctgagggga cattcagcag 180tcccgtcaac ctgaagaaga
catttaagat cccagataaa cagtatgtgc tgacagccct 240ggctgctcgt gccaagcttc
gagcctggaa tgatgtagat gccctattca ccacaaagaa 300ctggctgggc tataccaaga
agagagcacc cattggcttc catcgggttg tcgaaatttt 360gcacaagaac aatgcccctg
tgcagatatt acaggagtat gtcaatctgg tggaagatgt 420ggacacgaag ttgaacttag
ccactaagtt caagtgccat gatgtcgtca ttgataccta 480ccgggacc
488290306DNAHomo sapiens
290tttcatgact tctccttcac ctaagcacct caaaacagat gatagcactt caggattgac
60gcgaagcatc ttcaaatatt tggagagcta acaccatcaa aggtgccaaa atctacattg
120agactgcttt gagaagtttc tagcactgaa agttggaatt gacactccag ccaatgatcc
180ttccttcttt cataatcaat gcaataagat tgcagacaga aattccagtg atttctactg
240cacagctctg gacatctctt ttcctagtat tattccctga attggccact gatttcaatt
300ctgcag
306291348DNAHomo sapiens 291ctcctgggtc cgcagtgtac tgcgagggag cacagatgtc
catcccccgc tggggtggag 60agcggcagca ggcctgatgg atgagggatc gtggcttccc
ggcccagaga catgaggtgt 120ccagggccag gccccccacc ctcagttggg gctgttccgg
gggtgactgt gagcgatccc 180accccaaacc tgagatgggg tagcccgtcc tgtgtcctcc
acagggacaa gcagtgggag 240gagtctgaat ggtcaccagg aagcccgggc tccatcttga
cctccttttt cagggacagg 300agcaacaggc ccctcttccc tgactctaag cccttccctg
taaggtga 348292395DNAHomo
sapiensmisc_feature(343)..(343)n is a, c, g, or t 292tctctgcttt
ccctcttatg aaaatggcag atgccttttt gtgaaggtct caaagcccac 60ttcatcctgg
ctgcagcacc aaaaggacaa aggcccgctt ttgaagtgcc tgataaggca 120ttcctttcac
ccctccatga ggaaggtggc aaatcttgag actccctatt agagagcttc 180gattttcctg
aaattgtgtt aggaaaatag ggtgacttgg tttgatcttg gtttctatac 240ctattatggc
tgcctgactc tggtcatttg gcccctgcag gcctaagcca cttggttttg 300cttcacatat
tggggtttat tagaacagta cgtagggaag canatgccag aggcacccgt 360nccttttccc
tgccttctag gtgctcctgg gaaat
395293557DNAHomo sapiens 293accaagatct ctgcctggca caataacgtg gagaaaaccc
tgccctccac caaggccacg 60cgggtgggcg tgcttctcaa ctgtgaccac ggctttgtca
tcttcttcgc tgttgccgac 120aaggtccacc tgatgtataa gttcagggtg gactttactg
aggctttgta cccggctttc 180tgggtatttt ctgctggtgc cacactctcc atctgctccc
ccaagtaggc aggctgtagg 240cacttgggct gactgcctgc agaagtccca agaccctagt
gaaaatacag caggcagaac 300tctccttgga taattccccc aagaggtccc caaggattgg
gagcatggga ggggagctgg 360cgggagggtg ggaggtggga tttagccagg aaaggggtga
gagtgattgt gttgtgggcg 420aggaggcgtt tccaccccct ggtgcctatc agggcagggt
gacctactcc ccattgttct 480ggaaatctcc aggctgctgg gcagctgggc agctgggcag
agctctggga agtgaagtca 540tgagtgcccg attcctc
557294547DNAHomo sapiens 294ggttcgggtg agggcactct
atgactacgc tggccaggaa gctgatgagc tgagcttccg 60agcaggggag gagctgctga
agatgagtga ggaggacgag cagggctggt gccaaggcca 120gttgcagagt ggccgcattg
gcctgtaccc tgccaactac gtggagtgtg tgggcgcctg 180agtgtcctga cagcccttct
gcaacgttta cccaccctgg ttcagagccc agcttctcct 240ggagagccgg accctcaggg
ccctgaaccg tcgctctctg gctgctcctc tgtcccttga 300gggaggaagt cctgggaccc
agggagggga ggggcctttg tctagggaag ggactggtag 360ggaagggacg agtctaggct
gagggcaaga tgggaggtca gaggtgacag aagcgttcag 420gggtgcctgg gcctccccag
gagctgtgga ctcagttcct gacctctgct ttggggttcc 480tggggtgggc ttggggtgag
tgtagttctg gcctagcagc accctcttgt ggcttgttct 540agcgtgt
547295147DNAHomo sapiens
295tgtatgtgac caaaggtagg tcctgggatg acagcaatgc tgacactggc ctaaggagtt
60actcatccat ttaataagta ttccagcaga tacagatgtg aacagtcaag tctctgccat
120ccacaatgct tgtgttctaa tgcaaga
14729683DNAHomo sapiens 296atgtgttcaa ccaagcggga aactctccgg gtagagtgaa
atccgaagtt gctatgctac 60aagataacct gggccgtgcg ccg
83297545DNAHomo sapiens 297gtctttctga gagtttcatt
gccattatca acaagagaag ttgaaattta caagtcagga 60ggttattttt ccagattgat
aaccatagaa agtgaataaa cacttttaag gtcgcaaaca 120tttgctaggt tgtccttctc
aatgcatgtg caggctgcat cctgtccttg tttttaagcc 180agggtttata aataagtaga
tttataccaa tcttaataga attgtatatt ttatgcaaga 240attaaatgct ttacaacatg
aagtataact caacccattg taaactttgg tggcaatatg 300gatttgaaac tcgacagttc
tcttgtattt gcttcctagg tttctgcatg caagttatga 360caggtaggac tgaaaaaaca
ctgccttttg acttctagca tttagcaacc gagagtcgta 420gagtcaataa agctgtaagt
gtcttcactt aatctgtggt tctcctaaaa ctattatctg 480aaacctacag catcccacca
tgaaatattt ggtaaattta tgttgtgacg tgttgcagca 540tgtaa
545298485DNAHomo sapiens
298aatttgtctg tgacccagat gccctcttct ccatggcttt cccggataac cagcgtccgt
60tcctgaaggc agagtccgag tgccacctca gcgaggagga caccctgccg ctgacccact
120ttgaagacag ccccgcttac ctcctggaca tggaccgctg cagcagcctc ccctatgccg
180aaggctttgc ttactaagtt tctgagtggc ggagtggcca aaccctagag ctagcagttc
240ccattcaggc aaacaagggc agtggttttg tttgtgtttt tggttgttcc taaagcttgc
300cctttgagta ttatctggag aacccaagct gtctctggat tggcaccctt aaagacagat
360acattggctg gggagtggga acagggaggg gcagaaaacc accaaaaggc cagtgcctca
420actcttgatt ctgatgaggt ttctgggaag agatcaaaat ggagtctcct taccatggac
480aatac
485299409DNAHomo sapiensmisc_feature(36)..(36)n is a, c, g, or t
299acagcttagc gatggagaaa atggcatccc tgttgntntn tcaccagata aattgcctgg
60atctctggga cacccccgtc cccaggagaa ggatgtttgg gaagagatgg atgccaacaa
120aaacaagata aagcttggaa tttgtaaggc tgctactgaa gaggagaaca gccatggcca
180ggcaaatggt cttctcaatg ctccaagcct tgggtcacca attcgtgtcc gctcagagat
240tactcagcca gacagagata ttccactggt gcgaaagtta cgttccattc acagctttga
300gctggaaaaa cntctgaccc tggagccaaa gccagacact gacaagttcc ttgagacctg
360gtataaaata gtgtattttt ctttttaaag cttctaaggt accattatt
409300430DNAHomo sapiensmisc_feature(150)..(150)n is a, c, g, or t
300gagggccaag agctagggac agggggaaga gactggccca ggtggtaggg aggaaagaac
60tcccagagtt tcctttagcc aggaaacctg ctctactgac cccgtgactt ggacagtcag
120acatcaccct gagagtgaca agtgtaaaan tgactccctt cctnccccgn ccnncggaag
180tatantnaga tacttgaaag cagtccnttn ctaaaatggn cttacctatg tggcctgaac
240gattaaaaga aagaactcag agttacaagg gaaaaagaaa aagagttaca agggaattgt
300agtctttttc tgaatagaat attagtactg tggtattgca tttcatggga atggaaatgt
360attggtaaag ctacctgatg gaagctttcn ctngnnnncn aanatggagg gtgtattatg
420tgcagttatt
430301536DNAHomo sapiensmisc_feature(68)..(69)n is a, c, g, or t
301atcgaagaac aaagagtgct ccaaaaaata ggtcattctt ttattttcat aaagtatcta
60aactgtanna anannnannn nngtgtttca ttctaaattn gcagctgaaa taaatttatt
120ngcgatagna gaantatctt attattcatc ctcagaaata aaggattnga agggatagag
180attatatgat aaatttatag aagactttca gaattntgaa tgcatttngt ttagtgttat
240gaaatgacaa tagnaaaaaa gtctcgactt caattnaaaa gttacacaaa caaacaaatc
300tacaggcatg tctttatata ccatcaggtc taagttttca aagaaaatgg tagatataac
360tgcagataac tcattacagt cataatctct gcccatgtgt attgagaggg ggcagttgtg
420cacgaaaaaa gaatttatgt ggccatttta ataaattcag tttaaaatag acttgtgtat
480atgcatgaat catcagagat gaaactggtt tgagagactc atgtgaacct tacgaa
536302371DNAHomo sapiens 302ctggtacgct gctgctgcag ttaccagaaa aactcataag
caaatacaac tggatcaagc 60aatggaaact tggactgaaa tttgatggga agaatgagga
cctggttgat aaaattaaag 120agtcccttac tctgctgagg aagaaggttt ggaacctgta
gtgtcctgtc tgataagggt 180gaagctctcg ttcttgcttg ccccagaaga ccagttttta
gtcttcactc agtggatttt 240caaatgctct tggctgattt ttaggcaaaa tggttttaaa
tgaattcaaa ctcttcccac 300gagggcttta gtaaaatggg aagtaccaac attatatatt
cttagagcag atgccatgta 360ctagggtatc a
371303355DNAHomo sapiensmisc_feature(223)..(223)n
is a, c, g, or t 303gaagctgtgt ggagtggaag atggacattg aggaagaagg
gcaggtgtgg tctcacccag 60aatggttcct gctgcttccg cggtgcccag gcttttctca
cggcctctgc tgggttctcc 120cctgggtgct gtggatgcat cctgcctgct ggaaattctg
tgctctctgt ttccatccct 180ttgtcgtggt aatgaccgta tacctctccc ctgtaccctc
ctntgcntgc tctccgtgca 240ggcccctctc cctctggttg tcccatcagc atttccccac
agctcgttgt tcctccttcc 300tcttttctgg tgacctttct actgattgca ttgtacctct
ttccctgata ttaaa 355304362DNAHomo sapiens 304gcctgtgtct
tcgggctgaa tttgatctgg ccatcccagg gggtctcctc cctgagtgcc 60cttgtgcccc
tgaacatgtt cactgaactg ctgatcgagt actatgaaaa gatcttcagc 120accccggagg
cacctgggga gcacggcctg gcaccatggg aacaggggag cagggcagcc 180cctttgcagg
aggctgtgcc acggacacaa gccacgggcc tcaccaagcc taccctacct 240ccgagtcccc
tgatggcagc cagaagacgt ctctagtgtt gcgaacactc tgtatgtttc 300gagctacctc
ccacacctgt ctgtgcactt gtatgttttg taaacttggc atctgtaaaa 360at
362305533DNAHomo
sapiens 305cgaagagcaa gacccactct gttccagaag ccctataagc tggaggtgga
caactcgatg 60taaatttcat gggaaaaccc ttgtacctga catgtgagcc actcagaact
caccaaaatg 120ttcgacacca taacaacagc tactcaaact gtaaaccagg ataagaagtt
gatgacttca 180cactgtggac agtttttcca aagatgtcag aacaagactc cccatcatga
taaggctccc 240acccctctta actgtccttg ctcatgcctg cctctttcac ttggcaggat
aatgcagtca 300ttagaatttc acatgtagta gcttctgagg gtaacaacag agtgtcagat
atgtcatctc 360aacctcaaac ttttacgtaa catctcaggg gaaatgtggc tctctccatc
ttgcatacag 420ggctcccaat agaaatgaac acagagatat tgcctgtgtg tttgcagaga
agatggtttc 480tataaagagt aggaaagctg aaattatagt agagtctcct ttaaatgcac
att 533306434DNAHomo sapiensmisc_feature(131)..(131)n is a, c,
g, or t 306ggaaccctcc tcttggcaag ggctttccga agttaacctg aaaaactggt
tcaggccatg 60acagcaaagg gttggatagc ctcattatcc ctcctccctt cagaactctg
gaacagccag 120cgttaacatc nacacaggcc ttcagtctga tgagaaacat ttaccatcta
ttgtctcgga 180agcctgctac ntggaggctt catcntgatg ataaagcctt ggtctccaca
accccgtata 240acccagacat tcctttctat tgataactct tgcaagcgat tgccaaccag
aagatgttta 300aatccaccta taacctggaa gcccccagtt ccagctgccc acctttctgg
actaaaccaa 360tgtatatctt caatatattt gattgatgtc tcatgtctcc ctaaaatggg
taccatcaag 420ctgtgcactg acca
434307157DNAHomo sapiens 307cctccgcaca ctggatgaga atccatcttc
cattcgagct gggaatagac tttgtgaaag 60atattatgta atggagtctc gggaaccctg
agacctctcc agcgaagctg aagtgaatta 120attaagtgct ttaaacggtc ttggtgctgt
gttacgg 157308367DNAHomo
sapiensmisc_feature(35)..(35)n is a, c, g, or t 308aggtgatgca ctatgcccag
tacgtcctcc tggcnctnnn ctnnnnngcg tnncctgntn 60cnnntcnntg ncntntgcna
antnnngann nanaaccgtg taaaaccatt tttatgtggc 120ttcaacgtca actataaatt
agcttggtta tcttctagga gaaatgctat ttattttgga 180gtagtagtaa aaagggctca
aaggataagg aggccattca ggcctattct gaatccctga 240tgacatcagc tcccaagggc
tctgtgctgc aggaagcaaa actgtaggng ggtaccaggt 300aatgccgtgc gcctccccgc
cccctcccat atcaagtaga atgctggcgg cttacagact 360gaagatg
367309484DNAHomo sapiens
309accccaccac gtaccagatg gatgtgaacc ccgagggcaa atacagcttt ggtgccacct
60gcgtgaagaa gtgtccccgt aattatgtgg tgacagatca cggctcgtgc gtccgagcct
120gtggggccga cagctatgag atggaggaag acggcgtccg caagtgtaag aagtgcgaag
180ggccttgccg caaagtgtgt aacggaatag gtattggtga atttaaagac tcactctcca
240taaatgctac gaatattaaa cacttcaaaa actgcacctc catcagtggc gatctccaca
300tcctgccggt ggcatttagg ggtgactcct tcacacatac tccccctctg gatccacagg
360aactggatat tctgaaaacc gtaaaggaaa tcacaggttt gagctgaatt atcacatgaa
420tataaatggg aaatcagtgt tttagagaga gaacttttcg acatatttcc tgttcccttg
480gaat
484310526DNAHomo sapiens 310ccatggggcc atctgggcca ttcagagact ggagtgagat
ttgggtgtgg agggggaggc 60gccaaggtgg aggagcttcc cactccagga ctgttgatga
aagggacaga ttgaggagga 120agtgggctct gaggctgcag ggctggaagt ccttgcccac
ttcccactct cctgccccaa 180tctatctagt acttcccagg caaataggcc cctttgaggc
tcctgagtgc cctcagatgg 240tcaaaaccca gttttccctc tgggagccta aaccaggctg
catcggaggc caggacccgg 300atcattcact gtgataccct gccctccaga gggtgcgctc
agagacacgg gcaagcatgc 360ctcttccctt ccctggagag aaagtgtgtg atttctctcc
cacctccttc cccccaccag 420acctttgctg ggcctaaagg tcttggccat ggggacgccc
tcagtctagg gatctggcca 480cagactccct cctgtgaacc aacacagaca cccaagcaga
gcaatc 526311319DNAHomo
sapiensmisc_feature(264)..(264)n is a, c, g, or t 311taaattgcct
ggatctctgg gacacccccg tccccaggag aaggatgttt gggaagagat 60ggatgccaac
aaaaacaaga taaagcttgg aatttgtaag gctgctactg aagaggagaa 120cagccatggc
caggcaaatg gtcttctcaa tgctccaagc cttgggtcac caattcgtgt 180ccgctcagag
attactcagc cagacagaga tattccactg gtgcgaaagt tacgttccat 240tcacagcttt
gagctggaaa aacntctgac cctggagcca aagccagaca ctgacaagtt 300ccttgagacc
tggtataaa
319312234DNAHomo sapiensmisc_feature(85)..(87)n is a, c, g, or t
312gcgcttgcgc agtagctgaa cgcgggcgtt tctttcctcc ctttttttcg aattggtttt
60gggggtagat tcgagttaca aaatnnncnn cngnngngtg ttcggcgcgg ttcccccagc
120tgtctctggc tgaaccggcg ctctcgcctc cctgccgaac acagcgtgag gagccccccc
180aggganatgg tgtttgagtc tctgggcttg ccgagcacta agtcctctga gttc
234313125DNAHomo sapiens 313gtactgcaaa aatcaccctc ggcaagacga atgtctgacg
tgccggaagg agtcatacgg 60gtccatgctc cacttctctc caaggtgtcc atggccattc
aactcaacaa tcaaaccaaa 120gccaa
125314446DNAHomo sapiensmisc_feature(53)..(53)n is
a, c, g, or t 314aagtcattcg tttaagcgtg gattattttg ccgaatgaat aatgatgatg
gcngctttca 60tctcttatga agttttcctg gccaagagcc agnagttgga agtttggatc
attctttttt 120cttttttaan catttcttct cttctttctc ttttttatca ctaaatgaat
gacatgtgga 180gaaactattc agcttttaaa gtatnctcca nttacttgtc tcaactacca
ctatttattg 240tgtttatcaa aatcataaaa agctcatttt tggcatttac cttcgtggtt
gagactgctg 300tctgtatgtc tgggaatgga agtcctcttc agggattcag caagggctgt
acttttgctt 360aatactagtg gttccttatt ctaagtgatg acatcatcca cctttcctag
aaatgggtct 420ttgtgcctag tatgatatct ttccaa
446315473DNAHomo sapiensmisc_feature(207)..(207)n is a, c, g,
or t 315tgtttcaggc ccatccacag ttgaagcagt gtgtgcgtca ggcaattgaa cgggctgtcc
60aggagctggt ccatcctgtg gtggatcgat caattaagat tgccatgact acttgtgagc
120aaatagtcag gaaggatttt gccctggatt cggaggaatc tcgaatgcga atagcagctc
180atcacatgat gcgtaacttg acagctngga atggctatga ttacatgcag ggaacctttg
240ctcatgagca tatctaccaa cttnaaaaaa cagttttgcc tcagcccttc gtgtaagttg
300gctatttcct tggtataggt acaaaacgta ttactgcttg tctgtaataa tttttttctt
360tgtctatata tggcnctggg cgttaccact tattnttaat aatcnccata tttgtttgat
420gtcttccatc attttagatt gtaattctgt gaggcaaagc atcatgtctg tgt
473316576DNAHomo sapiensmisc_feature(63)..(63)n is a, c, g, or t
316aggacaccag gctggtggcc acagtgctgc tgtccgtggt cgtgctgctc cacgccctcc
60tgnccatggg ctgtaagttg tacttcttcc agtcgctgcc tccggagaac gtggctcctc
120caccccaaat cacatctctg ccctcaaaca tcgcgctgtc ccctaccttg ccgcagtccc
180tggccccctc ctaggaaggc ccgggtccca caggcaacac ctaagtggac caacccctct
240gcctgtcctg ccccccagac gatgactgaa ggctcctttg acaccttgag atgattctgc
300tactttccag acttttctta caaagcaaac acttttattt tctatgcaaa nntgattcag
360agaatttata taaaggcggg cgaggggcag ccgancaggg agctttggga cagggctggg
420gcccccatat cccccccggg ccacctgctt tccctcctat ggctcccctg gaacaggagg
480gagagccaag ggggcngccc agcctggaca gcgcccgctc ctgcctgggt gcacacacgg
540cgggcctgag ctccagcatc tgagtttggg ggtatg
576317265DNAHomo sapiens 317ccaggagcag ctgcgtgacg tcatgttcta cctggagaca
cagcagaaga tcaaccatct 60gcctgccgag acccggcaga aatccaggag ggacagatca
acatcgccat ggcctcggcc 120tcgagccctg cctcttcggg gggcagtggg aagttgccct
ccaggaaggg ccgcagcaag 180aggggcaagt gaccttcaga gcaacagaca tccctgagac
tgttctccct gacactgtga 240gagtgtgctg ggaccttcag ctaaa
265318515DNAHomo sapiensmisc_feature(108)..(108)n
is a, c, g, or t 318atacgtgggt agtgttgcat ttcaaatgag gctcttctgg
ttgaaatgat atatttataa 60gaccagaata tcacaaatgg gtgatgtata atgtctcttt
agtttttngg tattnggcct 120cttttaaagc ctgtcggatg tatgggagaa aacaatgaac
gtgctttgat ttcctatcag 180tcactcttaa gaacatacat atngtttaag taactcggtc
ttttttatct gattcttgag 240ncactatggg tagcaagtaa ccacttacaa atttaaatgt
aatatacact ccttttctgt 300gtgtcaagtc cttattttta ggtgcatatt gacatttaaa
tgttaattat tgtttggcat 360ataatatcaa aaatctatta tttattttat gctgttacag
ttaaaagatg tgatttatga 420catactgaat caacttgcct tccaatttag tgtgtaatat
ggtaagcatt tatactttta 480gatatgtctt atttttattt ggatgcctgt ctacc
515319541DNAHomo sapiensmisc_feature(136)..(136)n
is a, c, g, or t 319gagttaatgc agcactcgtc attcagaaat attggcgaag
agtcttagca cagagaaaat 60tattaatgtt aaaaaaggaa aagctggaaa aagttcaaaa
taaagcagca tcacttattc 120agggatattg gagaanatat nccactngac aaagatttnc
ngaaatngaa anattattca 180ntcatccngc naatntagga taagaatgat aattgctgtn
acatcttata aacgatatct 240ttgggctaca gttacaattn cagaggcatt ggcgtgctta
tttaagaaga aaacaagatc 300aacaaagata tgaaatgcta aaatcatcaa ctcttataat
ccaatctatg ttcagaaaat 360ggaagcaacg taaaatgcaa tcacaagtaa aagctacagt
aatattgcaa agagctttta 420gagaatggca tttaagaaaa caagctaaag aagaaaattc
tgctattatc atacaatcat 480ggtatagaat gcataaagaa ttacggaant atatttatat
tagatcttgt gttgttatca 540t
541320495DNAHomo sapiensmisc_feature(144)..(145)n
is a, c, g, or t 320cttcggattt ttattgactc aaaatagtgc cattcccctt
aatgaaatag attttgagtc 60tttttttcat tgtaaccccc aaatgagaat catctacctg
attcttgtac caaaaaaaaa 120tttttttcag tctttttttt tttnnagaga gggtctcttg
tcaacgcaag actgggagtg 180gcagtggcac gatcttagct cactacaact tctggcctcc
caggctcaag caattctcct 240gcctcagcct cctgagtagc tggggattac aggcatgcac
caccacgccc agctaatttt 300ggtattttta gtagagacag ggtttcacca ttgtttggcc
aggctggtcc cgaactcctg 360acctcaggtg atccacccac ctcggcctcc caaagtgctg
ggattatagg tgcgagccat 420tgcgcccagc ctcagttatt ttatttaaca gtgtaagtac
ttagaaagta agaaaatggc 480gtgattagtt ttttg
495321429DNAHomo sapiens 321ggctgaggag gctggtctga
acatcactca catttgcctc cctccagata gcagtgaagc 60cgagattata gatgaaatct
taaagatcaa tgaagatacc agagtacatg gccttgccct 120tcagatctct gagaacttgt
ttagcaacaa agtcctcaat gccttgaaac cagaaaaaga 180tgtggatgga gtaacagaca
taaacctggg gaagctggtg cgaggggatg cccatgaatg 240ttttgtttca cctgttgcca
aagctgtaat tgaacttctt gaaaaatcag taggtgtcaa 300cctagatgga aagaagattt
tggtagtggg ggcccatggg tctttggaag ctgctctaca 360atgcctgttc cagagaaaag
ggtccatgac aatgagcatc cagtggaaaa cacgccagct 420tcaaagcaa
429322467DNAHomo sapiens
322tctgagggtg ccttgatgct ggctcatcac acattgagta tcttgggcat tatcatggcc
60cttgtgcttg gggagtctgg cacagaggtc aatgcagtcc tctttggaag tgagcttacc
120aaccccttgc tacagatgcg ctggtttctc cgggaaacag ggcactatca cagtttcact
180ggagatgtag tggacttcct ctttgtggct ctgttcacag gagtgaggat tggtgtggga
240gcttgcctcc ttttctgtga aatggtctcc cccacgccta agtggtttgt gaaggctggg
300ggagtagcga tgtatgctgt gtcttggtgt ttcatgttta gcatctggcg ctttgcatgg
360aggaagagca tcaagaagta ccatgcttgg agaagcaggc ggagtgagga acggcagctg
420aaacacaacg gacatctcaa aatacactag ccaaggcttg ctccaga
467323504DNAHomo sapiens 323ttggcacttc agaagtctcc ccaatcttga caaagccctg
gagaaagggc cgggcctccc 60gttgataaga atatcactgc agataaatgg aggtttcaaa
ttgaaagaaa ggaggagggc 120ctcctgttga taagattatt gtcactgcag gtaaatggag
gcttcaaata gaaatacatt 180tcagttacag aaaaaaaaat tatctttgtt acacatttga
gtttgcaggc ctaaggttac 240tcccgctaca ctatcatctg taaccataac gcactcaaca
ttttaagcta actataagga 300ttgttgcttc actcaaagat cctgaggttt tattcactaa
catttttatt tggtgactat 360agttgacaag aacaaagctg tggggaacca acaaacactg
caatgcctgg cattgtcacc 420tcactagatt gtgagttcct ctgggacagg gtccgtacat
tttcttagaa tccctcactt 480agccattagc ctgcacagtg cttg
504324163DNAHomo sapiens 324catggaggag tgcatttcct
tggctattcc agaagtccta cctcccttct gagattttat 60aatggtattt cttatggtta
tcccaaatat acttggcaag tcgtcttata aaccaccaat 120aatagcctct taaaaattca
aaaattactc ctcttggcta aca 163325441DNAHomo sapiens
325cctccgcgga aggcgtggca gggaggcagt cgccctgcgg tgcaagctgc tgctccagag
60cataccgtgg cccaggtggt atccccaagg cctcgtgccg tggctggggt cctgggaggt
120ggtcgccctg cagtgcaagc tgctgctcca gagcgtaccg tggcccagac tgatcctcga
180ggcctcctgc cgtggctggg gtcatggtcg gctgcgcatg tccagaagca tttccttcct
240gcgaccatcc cggcgcccct agggggagaa gccaggacag cagcttccgc tgtctccaca
300gcagacacgg gacggattcc acagacggga gcctcattcg taccatgcca aacgcattca
360ctcggggcag tattaaccgt tctagaaagc cactgtttta tagcaaaaca ggaaaggaaa
420agctaccagt tttttattca g
441326457DNAHomo sapiens 326tttcccctag ttgacctgtc tataagagaa ttatatattt
ctaactatat aaccctagga 60atttagacaa cctgaaattt attcacatat atcaaagtga
gaaaatgcct caattcacat 120agatttcttc tctttagtat aattgaccta ctttggtagt
ggaatagtga atacttacta 180taatttgact tgaatatgta gctcatcctt tacaccaact
cctaatttta aataatttct 240actctgtctt aaatgagaag tacttggttt tttttttctt
aaatatgtat atgacattta 300aatgtaactt attatttttt ttgagaccga gtcttgctct
gttacccagg ctggagtgca 360gtgggtgatc ttggctcact gcaagctctg ccctccccgg
gttcgcacca ttctcctgcc 420tcagcctccc aattagcttg gcctacagtc atctgcc
457327438DNAHomo sapiensmisc_feature(65)..(65)n is
a, c, g, or t 327ttgtccttta tgtatcttct ttccatagtg cttactggag ccttccaaaa
taatgtctcc 60tcaangtgac agcccctcag gaatttgaag gcaatngtca caccctcacc
cnctttcctg 120agttttttct ggtttattaa cgtcagtctt tacagtcagt gctcattgac
ggtggttttc 180tctggttgtt tcctgaacac gtagtgctct taaagcantg ccctgaggng
aatacaattc 240tccaggggca ttctgattgg caggtgaagc acagtgccat gttcccagca
ctgatttggg 300aagtggcttg tcacatccca cagtgaactc agtcaactgg aatgcctaac
tctctttcat 360aagacctcct gctacattat gtttctccca gactgtactc aggtccaaga
acagaattta 420ctagtctatc cttctcaa
438328535DNAHomo sapiensmisc_feature(40)..(40)n is a, c, g,
or t 328cccttcttgc tgccacagga tgaataaagt gttgagattn gtctatggag aaagctgtgt
60gtctgttttt atctcccctc tcaggaccag tcagccactg gtcaatcagg ctgatcatgg
120aacattagga attctccaat taagggagaa aaagtccagg gacttagtta tatcttcaga
180ccagtgcagc tgggacacac aaagttctcc tgtctcacca tctgatatgg tttggatgct
240cgtcccctcc aaatctcatg ttgaaatgta attcccagtg ttggaagtgg agcctggtgg
300gaagtatttg gatcatgaga gaggatcctt catgaatggc tcagcaccat ctccttggtg
360atgagtgagt tctcactcaa ttcacataga tatggttgtt taaaagagtc tgagacctct
420cccctctttc tcgccatgtg atatgcctgc tcccccttca ccttccgcct ttactgtaag
480cttcctgagg ccctcaccag aagctgagca aatgttggtg ccatgccagt acagc
535329432DNAHomo sapiens 329gccacagact gaactcgcag ggagtgcagc aggaaggaac
aaagacaggc aaacggcaac 60gtagcctggg ctcactgtgc tggggcatgg cgggatcctc
cacagagagg aggggaccaa 120ttctggacag acagatgttg ggaggataca gaggagatgc
cacttctcac tcaccactac 180cagccagcct ccagaaggcc ccagagagac cctgcaagac
cacggaggga gccgacactt 240gaatgtagta ataggcaggg ggccctgcca ccccatccag
ccagacccca gctgaaccat 300gcgtcagggg cctagaggtg gagttcttag ctatccttgg
ctttctgtgc cagcctggct 360ctgcccctcc cccatgggct gtgtcctaag gcccatttga
gaagctgagg ctagttccaa 420aaacctctcc tg
432330234DNAHomo sapiens 330agcaaatcta gctttcagta
ttcctaattt ttacctaagc tcattgctcc aggctttgat 60tacctaaaat aagcttggat
aaaattgaac caacttcaag aatgcagcac ttcttaatct 120ttagctcttt cttgggagaa
gctagacttt attcattata ttgctatgac aacttcactc 180tttcataata tataggataa
attgtttaca tgattggacc ctcagattct gtta 234331317DNAHomo sapiens
331acttaggagt ggtgcttttt ctcagaaaac aggccacggt gtttcataca gaatgtcttc
60atatcatctg aaatggtatg gctgaagttc atttgtttac agggtcggga atgtcttcag
120ttcttgagag tcaacagtaa tgattggttg taagccaagg gacattttaa gctagtgaag
180agttttttct ggaattgatt tttcccaaaa gaatatatta attgaggtta agaagtcagt
240gggaaacaca cagaaatttg ttttaaaatc tttcaggagc tttactgaaa gacttggtta
300tcaagtcttt tggggag
317332415DNAHomo sapiens 332gacttacttt aacaaccagc caatccctac ctaagcctag
tagccatggt ttggctaaga 60ccgcagcgac tgtatttagt aaatcctttg aacaagtcag
tggtgtcaca gtcccacata 120acccgtcatc tgctgttggt tgtggggctg ggacagatgc
caataggttt tccgcttgta 180gtctccaaga agaaaagctt atttacgttt cagaaagaac
tgaacttcca atgaagcatc 240aatcaggtca gcagagacct cctagtatta gcattactct
gtccacagat taattagtaa 300catatttttc tcccataacc tagtgaacct ggaaatacaa
ctttgcttct ttatgaaagt 360accctgggtc tttcatccgt attcctgaca ggagccctga
tgtcttaaat tctga 415333489DNAHomo sapiens 333gacgggtcca
ttaacaaagc gggctttgcc gtcaactttt tcaaagaggt ggacgagtgc 60tctcggccca
accgcggggg ctgtgagcag cggtgcctca acaccctggg cagctacaag 120tgcagctgtg
accccgggta cgagctggcc ccagacaagc gccgctgtga ggctgcttgt 180ggcggattcc
tcaccaagct caacggctcc atcaccagcc cgggctggcc caaggagtac 240ccccccaaca
agaactgcat ctggcagctg gtggccccca cccagtaccg catctccctg 300cagtttgact
tctttgagac agagggcaat gatgtgtgca agtacgactt cgtggaggtg 360cgcagtggac
tcacagctga ctccaagctg catggcaagt tctgtggttc tgagaagccc 420gaggtcatca
cctcccagta caacaacatg cgcgtggagt tcaagtccga caacaccgtg 480tccaaaaag
489334239DNAHomo
sapiens 334cacagataga acctgcacat tgcccattaa tgcacacttg tgtatgccta
ttacagtctg 60tgaagtttgg tttagggtca gatgctgggc agagagctgt gaagccatta
catattcctt 120cccttgcaca gtctgaatca tcccgacact tctcagactt tgacttgaat
gcacactgtg 180ctgtacaaca aggaccttga cttggactgc actgtttccc aggtttcagt
ttgcatttt 239335432DNAHomo sapiens 335gccctgactg actgtattct
ctggccacat tcaagtcccc cattggtggg ggcagagaag 60taggaccagg ccatccttgg
ctacagagct cgaagacccc aagacagccc tctgctctca 120gcggcgccac agagagcctg
ggctcagcct tctgcatcag gacatggcct cgtccactga 180gggcacgatt taaacatttg
acatcagaag ctttatttgt aaacctcaca cagataagga 240ccaagggctg gcggtgtggc
cagaggacag gggaagctga aggccccgtg cttgagctcg 300gcagtcctgc tccttgcagt
gaagccacca tgggtgaccg tccagcctca cccggtggcc 360tgcacagtga gggaagggct
tcagggccat ctgctcccag ggcaggggac aggccaccaa 420ggacctttgg ca
432336380DNAHomo sapiens
336aatgttgaca tatttcctct atctcataga tggtaaaagt gttgctttta aactggcaaa
60tgcactcttc agaaatcctt ttctatctga tccacatgga gaggttaaag gttcaatttc
120atgacctcta tgcaggcagc gctctcattg gatgtaagaa tattacctgc aaggatagaa
180tgcagttgtg caacagagac acattcttat ttcttttttt tcacaatttt gttttgtttt
240taatgaccct tttattgaat attggactga aatataaatt ttaaaaaaca cgttggaaag
300gatgtacaac agaaggctat gtatgtatat acagtatgtc aaaagccttt tatttttata
360cttcaaatgc tctaaattaa
380337544DNAHomo sapiens 337gagtctctgc ttgataagtg cctctatacc aaccgctctc
ctcatcctga catcttgata 60cggacttctg gagaagtgcg gctgagtgac ttcttgctat
ggcagacctc tcactcctgc 120ctggtgttcc aacccgttct gtggccagag tatacatttt
ggaacctctt cgaggccatc 180ctgcagttcc agatgaacca tagcgtgctt cagaaggccc
gagacatgta tgcagaggag 240cggaagaggc agcagctgga gagggaccag gctacagtga
cagagcagct gctgcgagag 300gggctccaag ccagtgggga cgcccagctc cgaaggacac
gcttgcacaa actctcggcc 360agacgggaag agcgagtcca aggcttcctg caggccttgg
aactcaagcg agctgactgg 420ctggcccgtc tgggcactgc atcagcctga atgaggctgg
ccacctgcca ctttgccctg 480ccctctgcct ccagggctcc actccccttc cttttcttgg
tgaaaggcac ctcctttcct 540gata
544338530DNAHomo sapiens 338tcaaagaacg cgtactgcag
accccaaatg accttctggc tgctggcttt gaggagcaca 60agttcagaaa cttcttcaat
gctttttaca gtgtggtgga actggtagag aaggacggct 120cagtgtccag cctgctgaag
gtgttcaacg accagagtgc ctcggaccac atcgtgcagt 180tcctgcgcct gctcacgtcg
gccttcatca ggaaccgagc agacttcttc cggcacttca 240ttgatgagga gatggacatc
aaagacttct gcactcacga agtagagccc atggccacgg 300agtgtgacca catccagatc
acggcgttgt cgcaggccct gagcattgcc ctgcaagtgg 360agtacgtgga cgagatggat
accgccctga accaccacgt gttccctgag gccgccaccc 420cttccgttta cctgctctat
aaaacatccc actacaacat cctttatgca gccgataaac 480attgattaat tttaggccat
gcagtggaac ctgtcaccta atgggactgc 53033975DNAHomo sapiens
339agtcatgcga ccaggtgagg gtccacgtcc ccaagcttcc actccctctg gtgtttccca
60tttaagtata ctgtt
75340376DNAHomo sapiens 340gatgctcacg tcacttggtg taggtttcag gatcgcctct
ttgaggaagg acttcaggac 60caactggggc ctgcataaga aaacttatct cattattaga
gtactcacag cttgtatctc 120ccagctacat cctagaaccc cattgtcctt tattccacca
aaccagctcc aggtgaccag 180actctactca gaaagcaaat tcgtcatcaa agaacagaga
ctggccacca caaggacatg 240caggagaact gtcgggacca ggaagactca ttccaaaaag
cccaggccgg gcacagtcgt 300caagcctgta atcccaacac tttgggagac cgaggtgggg
gtatcgattg agcctcggag 360gtcgagatca gcctgg
376341499DNAHomo sapiens 341ccccgcctgt ggcattttct
atgggctcag gttacacctt cccagctggt gtttctgtcc 60caggaacctt tcttcagcct
acagctcact ctccagcagg aaaccaggtg caagctggga 120aacagtccca cattccttac
agccagcaac ggccctctgg accagggcca atgaaccagg 180gacctcaaca atcacagcca
ccttcccagc aaccccttac atctttacca gctcagccaa 240cagcacagtc tacaagccag
ctgcaggttc aagctctaac tcagcaacaa caatccccta 300caaaagctgt gccggctttg
gggaaaagcc cgcctcacca ctctggattc cagcagtatc 360aacaggcaga tgcctccaaa
cagctgtgga atccccctca ggttcaaggc ccattaggga 420aaattatgcc tgtgaaacag
ccctactacc ttcagaccca agaccccata aaactgtttg 480agccgtcatt gcaacctcc
499342183DNAHomo
sapiensmisc_feature(75)..(75)n is a, c, g, or t 342cacccgagac tgacacactg
aactccactt cctcctctta aatttatttc tacttaatag 60ccactcgtct ctttntttcc
ccatctcatt gctccaagaa tttttttctt cttactcgcc 120aaagtcaggg ttccctctgc
ccgtcccgta ttaatatttc cacttttgga actactggcc 180ttt
183343558DNAHomo
sapiensmisc_feature(72)..(72)n is a, c, g, or t 343tgggccttcc cttaaacatc
agaacaatga gatttgtccc tattttacag gggttagaat 60agactattaa gngacaactg
agaaaggaca gagaagtgac agccagaggt tgagaggggc 120cataaaaaca tacaatcaga
catatatctg ctaccacttt gtagcaagat ggttcctatc 180ataactctgg gtcaaaaaga
tagtaatttg gtttataatg ttgaaagaaa gcagaaagnn 240nnagatgggg tctcactgtc
gttctggagt gtagtggttc aatcatctct cactgcagcc 300ttgaacccct aggctcaaag
gatcctccca cctcagcctc ctgaatagct gggactagag 360gcatgagcca ctatgtcttg
ctgattaaaa attgtttttn caaannnnna nnnnnnactt 420tactgcctaa gctggtcttg
aaatcctggc ttcaagcaat cctttcactt tggcctccca 480aaatgctggg attacaggca
tgagtcaata tgcccagtct cttttctttc ttagttactc 540tagaaaatgg cttgttga
558344526DNAHomo sapiens
344aataatgttc tgtcacgtga aatatttaag tatatagtat atttatactc tagaacatgc
60acatttatat atatatgtat atgtatatat atatagtaac tactttttat actccataca
120taacttgata tagaaagctg tttatttatt cactgtaagt ttattttttc tacacagtaa
180aaacttgtac tatgttaata acttgtccta tgtcaatttg tatatcatga aacacttctc
240atcatattgt atgtaagtaa ttgcatttct gctcttccaa agctcctgcg tctgttttta
300aagagcatgg aaaaatactg cctagaaaat gcaaaatgaa ataagagaga gtagtttttc
360agctagtttg aaggaggacg gttaacttgt atattccacc attcacattt gatgtacatg
420tgtagggaaa gttaaaagtg ttgattacat aatcaaagct acctgtggtg atgttgccac
480ctgttaaaat gtacactgga tatgttgtta aacacgtgtc gataat
526345435DNAHomo sapiensmisc_feature(334)..(334)n is a, c, g, or t
345ttgtgtacac ataatctcat tttgagatat ataactattt ttgtctttca gaagtgaatc
60aaaatatttc aaaatgctgt cttatgaaac tacaatattc tcacagatta gaaaagtttt
120tctgtaaaag tcagatagta aatattttag gttttgcagt gtcttttgca actactcaac
180tttcctactg tagcacaaga gtagctgtgg tactgtgcaa ataaattgct tgtgttccaa
240taaagcttca tttacaaaaa catgccatgg gccatatttg gcctgtacac tgttgtttgc
300caagtcctaa tatagttgct tagcaagtat tgtnagctat ttgaggaaga catgaaagtt
360cattgggttg ctaaaaagta tgtagaaatt caaaggaaaa ttaaaattta ggctaagtta
420taatacactg tttta
435346343DNAHomo sapiensmisc_feature(95)..(95)n is a, c, g, or t
346tctcatttac cttctctctt gagcaacgtc agtaattgat cttgcatctc agagagagag
60aaagagcatg tgtgagagag aaactggttt ctatngccag cactcctgaa accccttact
120gtaaggatat tttctcttac cccttgggat ccaggctctg agtctcttct ctttgggagt
180atccatcaaa atgacttttt ttaaaaacag attttccccc aaccagnaga atctgcacaa
240acttggcagc gtttttactt gtttaatgag tttaagacat tacatggtga aagagaagca
300ttttggactc ctgcattttt atttaccatt cccagactga cga
343347534DNAHomo sapiensmisc_feature(34)..(34)n is a, c, g, or t
347gcctaacaat caaatctctt tcttttaaag cacnaccttc taggcaggga caggagctca
60ttttccacac catnctttgt caactctcat agaaagtttt ccttgtatcg agctcaaatc
120tgcctcctgg aaattcttct tcttcttccc tccctgttgg taccagctct gctgtcagag
180acttcacagt ctgtgctccc tctgccctgt gacgtcttca gactatttga gaacaggaat
240catgactcct gggacttgcc ttttctctag gtcaaatacc tctataattc catctgctgt
300tcttcatagg gtcttctccc tatcctgccc ttttcctcca atccatcttt taactgctct
360tgagcagtct aactgagaag tatgattcaa agcaaaataa atcttaaggt ggcatgactc
420tgaaaaaatt gagaaaattg aactcagaga tcccgatccc aacccctttc tcctgggagt
480gaaaccttag tttctaccag agagtgtggg aaaccacttc tggtggaagc ccct
534348580DNAHomo sapiensmisc_feature(109)..(109)n is a, c, g, or t
348aacattccct tgtcaaccaa gaatactcaa agctacttgt attggaaatg gcagaaggcc
60taaatccaaa tttcttattt tttataattt accatagaag ttttgtgant aaattcttac
120ttctgccagt ggaggtttat gcctgaaagg tcatggggtc ctgtctgtaa atagacctaa
180agagaagtgc agtatttatt ctttgtaggc ataatgtgtt tgtcactgac aagcattcat
240nttcatccca ctagtctttt attgcagtct tttattgtca ttttcagcct tatgttggag
300agctttgctt tctcatcatg ttcacattgt cttaagtttt gtgagcttct gagaaagagc
360ttggtaaagg tttaaagggg actttgttcc accagggagc attttatttg ggcgtctcac
420ccttttctaa tgaaagctgt tgtaagccac ctctgacttg gaaattctga aagtatgaat
480attttttata tcttaattgt aaaatgccag ttctccatta tttagatgaa tagtagaaca
540ctgcaccctt tgtgcagtgt ttttgtttct ctactgcatt
580349541DNAHomo sapiens 349ccagtcttcc tggcaagggt aaacagatcc cctctcctca
tccttcctct ttcctgtcaa 60gtgcctcctt tggtgaaggt gacacatcat gtgacctctt
cagtgaccac tctacggtgt 120cgggccttga actactaccc ccagaacatc accatgaagt
ggctgaagga taagcagcca 180atggatgccg aggagttcga acctaaagac gtattgccca
atggggatgg gacctaccag 240ggctggataa ccttggctgt accccctggg gaagagcaga
gatatacgtg ccaggtggag 300cacccaggcc tggatcagcc cctcattgtg atctggggta
tgtgactgat gagagccagg 360agctgagaaa atctattggg ggttgagagg agtgcctgag
gagagccctc accgtctggc 420accctagtca ttggagtcat cagtggaatt gctgtttttg
tcgtcatctt gttcattgga 480attttgttca taatattaag gaagaggcag ggttcaagag
gagccatggg gcactacgtc 540t
541350415DNAHomo sapiens 350gaataaatct ctgggaccgg
gtctcaccat attgctctgg ctggtttcaa actcctgggc 60tcaagcgatc ctcctgcctc
agccttccaa aaccaggtgt ttaacttggg actaacatga 120agcacttaga agactacgtg
gaacatagca atgactatat atgtactaca acgtaaacag 180cacctcctgg attgaataga
acataactga catgaccagc agagacaggc taaagacact 240gagctgaaaa ccctggactc
tattgctaaa ttgaggctcc tgaatccgtt cgctctgagc 300aactgttgct gtggtgctgc
cttcacaagc actctgctga gcactcagat agaggggctg 360tgctatccgt caacagacaa
gctgcagcca gaactgctca gctgacaaac tggta 415351438DNAHomo sapiens
351gtggggaagc ctgaacacag tcctataaac taaaggccac tgcagacttt tagcacaagg
60agatccttac agggaacatg tgccatcagc tctttggagt gaacaaggaa ttagaccccc
120atcatgccaa aaaactagga tttttaggtg gtctttccat cccttcagat ttaagtattc
180aaagaaagag agacagacct acattccaag ggtcttctga gtgcaaggcc ttgtgttgtt
240tgtttattta ggggagggcc tggtgctctt ctctgtttta tgctttacct tcttttattt
300ctcagatctc atgttagcac tatgttctga attccctaat aatggctctt gagaactgat
360ttacattttg ttggtttgtt tacttcttga gcacataaaa ggaccccaaa ttagagatac
420tatcccttgg gcttctga
438352224DNAHomo sapiens 352gtccactgct ggaaatagaa gttttttcgc tgcagggcaa
ttctgtaaat gtgcttccca 60gctttaggag gtctgaggct actcttctcc aataaccttc
cttcccactg gaccttctca 120ctcacagcac tgctgccctc tggacaagcc acagtggaca
aatatgtcaa gctgaagatg 180cacaaataat ttcaagttca gttctcaggg attcaaagga
catg 224353415DNAHomo
sapiensmisc_feature(177)..(177)n is a, c, g, or t 353tgtcctgggg
atcttggagc ctgaattcat tggcacaaaa ggcagcagca tcctcactgt 60atctgcagtc
catttggact caataaaaac tttgaaagtc acatgtgtta tggaattcct 120tctcagtgac
acattcatct gtgctcagtt gtcccagcaa gggtcagccc ctcatanccc 180tgcagcatcc
gctgctatga agcagagctg taaacgccct ccctgtgtat aggaaaagct 240acatggagca
aatcctcctg cctgaagaag tgcatctcag catcacttca gctgtcgggg 300catttgtggg
gagaaccaga ccacctctgc ggaaggcagc agaccctctt ccagccatgg 360atggagttga
attctctata aacggttcac cagcaaacca ccaatacatt ccatt
415354186DNAHomo sapiens 354gccaggttaa tggtatcgat cctaatgggg attcggcaga
gtttgatttg ttgtttgaaa 60atgcttttga ccagtgggta gccagcacag cgtcagaaaa
atgcaccttc ttccagatcc 120tccaccatac ctgccagagg tacctcacgg acaggaagcc
agagtttatt aactgccaat 180ccaaaa
186355457DNAHomo sapiens 355ctttacccta ggtcaggggt
cagcaaacta ctgcctgtgg gccaaatttg cccaccacct 60gtatctgtaa ataaggtttc
attggaacac agctgtggcc atatgtttgt atattgtgtg 120tggctgcttt tgcattagga
tgacagaggt gaatagttgc aacagagact ggctggtctg 180caaagcctaa aatatgtcct
gtgtggccct ttacagaaaa agttttctaa cccctgctct 240aggttacgga gaaaaaaaaa
tggaataatg ttctctgcta cttttaacct gattttcttt 300gtacctaaat aggcagctag
aatgctgcct atattttaat aaggatttgg atctcacaag 360acaccttagg cctacacaag
ttgttcagat tctttgcccc agttctaatc tagtgacaaa 420ggcatagaat tctcctccca
caggaatgta tttctat 457356373DNAHomo sapiens
356cagtctcctg ctcgtttaga agtaagggat aataatgtat ccatagctaa atgcccagtc
60gttatatttt ctagatcaag atgcttgttg tgtacagttt cacagagcct tcggattttt
120tctttaattt tgttcatgtc tttttcattc agtagcttgg ctgatgaagc atcttgttcc
180agttccaaaa gtcgaatcat tagatccaag ctagctctat caagatccat gttcaaacga
240tctctactca gtatatacat gagggcagct gtacagaggg acagattctg atggtgctgg
300gaatcatcca aggttttaaa gaccattgct accatcccat gtgctctcag gtgcattcgc
360cagtaggcca aca
373357116DNAHomo sapiens 357tttgctccta acttgctctt ggacaggaac cagggaaaat
gtgtagaggg catggtggag 60aggctagaga tcctgatgat tggtctcgtc tggcgctcca
tggatgcagg gagagg 116358522DNAHomo
sapiensmisc_feature(297)..(297)n is a, c, g, or t 358gggcatctgg
aattgacaca ccattacatt ctgtttgcag gatttttttt gtaaccatga 60aattgaacat
ttccaaatta taaactatgt taatacctat aaaatatata gccaggaacc 120atttatcatc
aagaaaagtg taagaaatta tttttgagat gtaatttaag attgttttat 180gtaaaaggaa
aatcttgtat ggcatcgaat agccttaatg aatttaattc tttcacaaaa 240atgatttcaa
attatcctag agtataacat ttttatcaaa gatattattt ccggagntct 300tctttctttc
tttttttttt ttttttagta atttagcaaa aacattactg ttctaatgct 360gaagtgactt
ttgccagtgc catgtccagg gggggaggta taagttactt gctcttanca 420tttgggctgg
attttttggt ttgggggaca cctttgggag tattcccaaa gcatgtctca 480agnggnggcn
cccgagagca tggtttaaaa gcttggaccc ct
522359369DNAHomo sapiensmisc_feature(121)..(121)n is a, c, g, or t
359gctgggccag tgcatctaac agccctgtgc agcagcttcc cttgcctcgt gtaacatgag
60gcccattctt cactctgttt gaagaaaata gtcagtgttc ttagtagtgg gtttctattt
120ngttggatga cttggagatt tatctctgtt tccttttaca attgttgaaa tgttcctttt
180aatggatggt tgaattaact tcagcatcca agtttatgaa tcgtagttaa cgtatattgc
240tgttaatata gtttaggagt aagagtcttg ttttttattc agattgggaa atccgttcta
300ttttgtgaat ttgggacata ataacagcag tggagtaagt atttagaagt gtgaattcac
360cgtgaaata
369360378DNAHomo sapiens 360agatactcag cactagacta acataacagg tcactacacg
ggtgcagaat cactttacaa 60aagaagactc tgttttacga aggggattca ctacagggac
ttagagaaca gtctcttttc 120tgcctttaaa atgagagttc ctccatttac caaaatttga
cacgcacaca ttcttcaggg 180gcatgccaat tgcgtaaagt gaggctcgcc tgcatagcta
atcctgttaa agacaacttc 240tcaaagcaca acgtgcttgt ttcctatcgg gctccctgcg
gggctttctc tcactacaag 300tcaagcttgg gctctcaaag ccctgcgcct gttaccacgg
atgcccacag ggcctgggca 360gttgctgtgg cgacagga
378361291DNAHomo sapiens 361acagtggatc aaatttaggc
ttcttgatgc aggcatggtg tagattacta cttctgtatt 60gtcccaggag ctcagcacat
tccttgccag agatgataag gagctcaatc ttgaatactt 120gttcaagctt ttgaataaaa
aaccacagtt cctcaaagaa gaagaagaat tgcgaaatca 180ccggaaataa ccgaaaactt
ccccctgttt gactttcaac attcttgaat gcaccaagat 240agcctctttc tgtgagatta
ataaatgaat aaatgcctcc atatttttca a 291362313DNAHomo
sapiensmisc_feature(200)..(200)n is a, c, g, or t 362aagccggatg
gcaaaagagc ccagaaccta ttggaactga caaaatcaag tcacggcgcc 60tacaaagatg
aggggcagat tctggctgcc ttttaatttc gtccttcacc tgatatctgt 120gccagagaat
gtcttccagg agttctgcta cagagaagag agtaaccccc atccatcatg 180gccaaagcac
ccagtcaggn tccgctctgg atccagcccg acaaatgcaa cccttgaata 240gggtttgtgc
aagcaaactg gatgacgacc gaagaaaccc tgtcgcttct gagaagacac 300ccaatccaag
aat
313363318DNAHomo sapiens 363cctggaccca actttgttac tgtgagaaag ggtcttcatt
cattcaagat ggcatttgtt 60aagcacctac tgctggagtg cagtggttca atcacggatc
actgcagcct ccacctccca 120gttcaagaaa ttctcatgtc tcagcctcct gagcagctag
gattacagac aaaccttgga 180aatcaagaaa gttctggaat gatgaagctg ttcatgccaa
gaccgaaagt gctggcccag 240tatgagtcca ttcagttcat gccgtgacaa ttttcttgga
actccttttt attgttagtt 300ctcacttgtt tccatatt
318364531DNAHomo sapiensmisc_feature(117)..(117)n
is a, c, g, or t 364ttagcatctt ggttactgga gaactataac ttttatgtag
tcatgcttgg aaaacactaa 60aagggaaatc gagtctgttt gacaatattc tgtcttcact
gttgttcact tcataangng 120tnggaatata aagttctata cagttaatat gangntctct
ttagcattta aaacatgatt 180tgcattttca tgaggcattt tggctaattt tattgatttc
cttatatttc atagtcctta 240nccttatgag aatcttatgt ttctgtgtgt tttctatcat
gtagcacaat ttctgacaca 300caaaacatac aataaacttg tgttaatttt tctatcaaag
tcagaattta ttcataagga 360atctgaagta aggtgtacta agcttgttta tgggttaagt
gatatagcca aattcaaaac 420tttacttttt atgtcagtct agaaatatct cagattaaaa
catatcactt cttagttcca 480attagataag ggaaatcttt tataataatg ccaggattgc
tataatctga t 531365525DNAHomo sapiensmisc_feature(35)..(36)n
is a, c, g, or t 365aggccatagt aatcatcctg ctgatattgc aagtnngtng
ctagaatgag gttatataat 60atatacaaaa acattttntc aactgntaaa gntgccttag
taatataggg taataccagc 120aacattatgg atatataatt atagtctatt gggccacact
taagtttgga gtctaataaa 180gtcacaatca aattctgcaa tttcaattga agataacctt
gtctttatat tatnaattag 240aagctaaagt tgatttttct aagagttctt tatttaaatg
aagtactctg ggactgacct 300tttcggaaat ggaatcttca ttggtcaggt gattcaacat
ttttatacaa tttatccatc 360ctcatctctt caggatttgc ataccttgcc agtttctact
ggccattgtt gaaaatacat 420ttatttggag aagtccaaag ccaaggggct catggggctg
tgaggtcctt cttgctgcat 480cgtcctgtgg tagaaggtgg aggagtcaag agagtgcccc
agagt 525366267DNAHomo sapiens 366gggccaatga
aagcagggtc aaggacagga ccagcgcagg ccaaggaagg gaatatctga 60cagcgcccac
ccagccaaac cctcagccca aggacaggaa tgaggagatg ctggtgaact 120agccatccat
cagtacctgc cttcccccga ggctgcagcc ccactcccag gcgcctggcc 180aggggagttt
tctaggttct gagagccacg ttgtcatccc tgggctttga agttaaacat 240cacacagctg
tctataaaca agatttt
267367199DNAHomo sapiensmisc_feature(67)..(67)n is a, c, g, or t
367gattcaggga ttggatgagt ctctatggtt tgttttgccc tgaagagcag aaggcttctg
60tcccaantgg tgttgccaaa gcaacatatt aattccatgc catgatnctg ggtcaagatn
120tgcacaatct gattgggcat gtcacctcgg atggcaaggg agtggaagtg gtcaaaatca
180tggagtccca gctttcgga
199368372DNAHomo sapiens 368gccccatgtt gcataggtgg cctataacca gtcagacaca
ggagacaaca tgaagcccca 60tctgtgcttc cctttctgac attaccacat ttgcctgatg
gagtggccag ctccctttca 120ctgctggaat gaatacaatc cagaaaacct accttctatt
gctttaccta atggggtaag 180gaaatttaag tagaaattgc taaccgaaga ctttgctaag
caaacccagg tctgcttgat 240gtcagagccc ttgctgttaa ccccatttac tgcttagcct
ccaaagagaa gcaatagcat 300cacatgggga aatgtcaaca gcataagagg actttcataa
tcagaattta aactggctat 360tatccctctg ga
372369296DNAHomo sapiens 369gaccgtgact cctgaagctt
ttcagcgcag gtgtagccgg cttggcgtcg ccgcagtgag 60gtttggagcc gctttggatt
gctgagtcac tttcttcagc cacttaggga aaccgaaagt 120ggaaactcgt ggggcttgaa
atagtgtgtt ctcttgagaa ccaccgaggc agtgagattt 180gggattccgg ggtctggaga
tcgtgctttt tgtggactgc gtttgcagtt cctagggtgc 240tgctgattca caggccttct
ctgtctttaa gtgtgcagat cattgaccgc tcagtt 296370228DNAHomo sapiens
370aaacccagag ccttctggat gtgtgaggta gtaggcttca accctcattc atgcataggt
60cacacttctc caaagttggt atggcctgtc tccttggcat gttcccttgc ttctgcttgt
120ccagttaatc ctttctgaca taccatgcat ctcagggtga agcggttgac atcagtaaac
180tgtctccttc ttctagcttc atctgctaat tccagtgctt gtacaaga
228371206DNAHomo sapiens 371cctcctgatc accatagctt tatgcaacaa caagaaacaa
atttattagc taacctaacc 60actaatgacg caagagacaa ttctaaggac tttcaaaaca
gcaaagtagg agcagctgct 120acctctaggg atgagggatg caattgtcca attattggtg
aaattgtcat ttcatgctat 180tggctatttg aaattcctcc tctaat
206372463DNAHomo sapiensmisc_feature(94)..(94)n is
a, c, g, or t 372ccctgcctgt actaatgatc caaaaattag ccaggtgtgg tggtgcgtgt
ctgtagtgcc 60agctactcgg gaggctgagg caggagaatc tcangaaccc gggaggcgga
ggttgcagtg 120ngccgaggtt gcactactgc agtccagcct ggctctgtct tggtgttcag
ccatgttccc 180atgctcactc ccaaggtgac tctgggaagg tctcagcctt tttgtcttcc
cagttaggat 240ggtcccatgc ccctgttacc atcagacttg gtaagtttcc cgaggagact
ctgcaagagg 300cactgttctg gatggtggag gagagactag ttgttctgct ctcctggcca
cagtgggtgc 360agtggacccc atcatggaga anttcaacac atccagccta cgaccagcac
ctgtgggagg 420tggatattca aggcagcaga gcctacagcc ggggcatgga gaa
463373451DNAHomo sapiensmisc_feature(38)..(38)n is a, c, g,
or t 373agggtctcaa atgaactctg agttaccatc tttggacnga cttttaatat aaagctgtaa
60tccttaaatc tgtgtcagta gtcccannta ctatgtcact ttaattggat gaatgcgtta
120atgaaaagtt tgttttcaaa cctcactaaa ctgctactta agatcacagt taatgtgagt
180cctgcttaat ttggaaagca tttaaaaaat ggaaaagttt cttagggaag naaaaatttt
240gcaactctgc ctacaaggta cagtaattgg ctaggttctt ttgaagagca gtgttgacta
300gagttaagga aaagtcagtt gtgaaaaatg gacattttta atagcaaaat gatgtgcttt
360actgtagaaa caggaggaag ggtgcattat cctggggaaa atgaannntt cttcagttat
420nttttatgct gctctacttt attgcaaaac g
45137446DNAHomo sapiens 374cagtcaccga ccttccctga gattgctacc tggaagctct
ttctat 46375519DNAHomo sapiens 375gaataagtac
acagagtccc caaagactag tgaggccaag atgtgtgagt cattttccat 60cacacacaaa
aaacccaatt gttctaagta tgtattttac caagcagctt tatagaaaga 120aaaacaaaca
aacaaaccaa acaacaacaa caacaaaaaa ccttggccag gcacagtggc 180ttacacctgt
aatcccagca ttttgggaga ttcaggcggg tggatccttt gagcttggga 240gtttgagatc
agcctgggta atgtggcgaa acctcatctc taccaaaaat ataaaaacta 300gccaggtgtg
gtggtgcacg cctgtagtcc cagctgctta ggaaactgag gtgggaagat 360tgcctgagcc
caagaggtag aggtttcagt gagccgtggg aagattgcct gagcccaaga 420ggtagaggtt
tcagtgagcc gtgggaagat tgcctgagcc caagaggtag aggtttcagt 480gagccaagat
tgtatcactg cacaactgtt gcctgggca
519376222DNAHomo sapiens 376cctgctggac agccgcgcag gatgagccgg agaccccgag
ggccgtggcc ttccaggact 60gccccgtgga cctgttcttt gtgctggaca cctctgagag
cgtggccctg aggctgaagc 120cctacggggc cctcgtggac aaagtcaagt ccttcaccaa
gcgcttcatc gacaacctga 180gggacaggta ctaccgctgt gaccgaaacc tggtgtggaa
cg 222377460DNAHomo sapiens 377atagtagggg
caattttgtc tgtagatggc agtatgacaa ttcttgctag agaatatatt 60gaaaaaaact
tcaacacaaa gggttgtagc actgtcctca gtaccattgt gtgcatgagg 120atcagaatag
tctgggctag atacatcaca ttaaagcttt tcagaatctg ataaatagct 180ctaaatacta
atgatattga gaagcctagc ttcacttggg aaaatctgtg gctgttcaca 240gaaattcagc
accaagttat tccccccata ctctaccagg ccttcaggtc ctcataaaga 300aaagtgtcgt
tttcagatta ggaactcaaa attattttgg tgcatcaaat ctacagtcac 360acaatataac
aagaatggga ttagaaaaat gaaagcctac tcattctcat ctttaagcca 420gagaatgaaa
tatatatgag gtctctggat agctatttaa
460378544DNAHomo sapiens 378cgccgcatca agccgtggcg gagatcgacg cgctctacga
cgtgtacctg gacgtgatcg 60acaagtgggg caccgacgac atgctgttcc tgggcgactt
caacgccgac tgcagctatg 120tgcgggcgca ggactgggcc gccatccgtc tgaggagcag
tgaggtcttc aagtggctca 180tccctgacag cgccgacacc acggtgggca actcagactg
cgcctacgac cgcattgtgg 240cctgtggcgc ccgcctgcgc cggagcctga agccccagtc
ggccaccgtg cacgacttcc 300aggaggaatt cggcctggac cagactcagg ctcttgccat
cagcgaccac tttccagtgg 360aggtgaccct caagttccac cgatgactcg aggcctgact
ggggcatgcc acctgcagac 420cctggctctg aggaatggcc caacagtggc cccttcaggg
tggcagccac ccttcagtga 480ggccccaagg cagagtcggc tgggcgtgga ccaggggcat
ggacacgtga tgtgctgctc 540tgta
544379254DNAHomo sapiens 379gaagtttgtc ttcctacaac
cacgtgatcc tctctctggg atttccccac tcaaccaggg 60acaagaggtc aaagttgacc
tgattatgtg tccatcaagg aagtgcccct ggaaggcaaa 120taaagaaggc accatttaca
ttacagtctc ctaagtgcag gcaatgatac cccaaggtgg 180ggctctgcag accctccagc
aaagagcttt tgaaaataaa tgtgaagctg ggcttaggag 240ctcatgcctg caat
254380398DNAHomo
sapiensmisc_feature(140)..(140)n is a, c, g, or t 380aacctgctaa
ccaagaatgc tttacctggc aaagctgtcc ttcagaaatg agggagaaat 60gaaagctttc
tcagacaaac aaaaacaaag gaaacatgta aaagtgaaaa aataaatggt 120ataagtaata
tatagtcccn actcagaatt ctctaatact gttaaggtgg tgtgtgaagc 180aatcttatta
ctactaggag ggttaagaga caaaactatt aaaaacaact gcagctacag 240tatattgtta
aaggacacaa attttaagtt tacatcaaaa tcagaaaaca tgggnaagga 300aggaatgaaa
gtgcagagtt tttgtatgtg attaaaggca aattgttatc agtttaaagc 360ctgttttaag
gataaaatat tttatgtaag cctcatgg
398381276DNAHomo sapiens 381cccgccgcgc gagattaaag gacagaccaa gagggcgcgg
gagctaccag cttggagggg 60aggacagatg gggacccagg gctggccagg gctggtctct
ggagctgttc tgccagagtg 120atgggggcgc ttggcgaggc caaggatttg gttgggtcct
atctctgaga cattttgaag 180tctcacaccc cttccatttg ttgcctattc cacttaactt
tgtatttgtt tgaaatctac 240tgttcggatg ctggactaga agagggacac ttggcc
276382119DNAHomo sapiens 382aaacataaca gaggagttgc
gaattttatg aaatttctga gtcttacaaa cttctcttta 60agactatgag gaaatgctga
cttgtattat ttatatcatt aaatttgctt gtgtatggt 119383490DNAHomo sapiens
383gtccctgctg tttagtatgc tggagtggag gttctgtgac ttcctgttta gtggtgctga
60ttctagttgg tgtgaaacgt cagatttcat cccagtcgcg tggctgattt ttttatgtgt
120ggttctctgt gtttccagcc tggtcctgct ggtcaggatc ctctgtggat cccggaagat
180gccgctgacc aggctgtacg tgaccatcct gctcacagtg ctggtcttcc tcctctgcgg
240cctgcccttc ggcattctgg gggccctaat ttacaggatg cacctgaatt tggaagtctt
300atattgtcat gtttatctgg tttgcatgtc cctgtcctct ctaaacagta gtgccaaccc
360catcatttac ttcttcgtgg gctcctttag gcagcgtcaa aataggcaga acctgaagct
420ggttctccag agggctctgc aggacaagcc tgaggtggat aaaggtgaag ggcagcttcc
480tgaggaaagc
490384458DNAHomo sapiensmisc_feature(72)..(73)n is a, c, g, or t
384gatacctcat tatacatctt acagagagca tcattggtgt ttccaaggtc acagggctag
60gcaagggtgg anncctgagt ctgcttgtct gtttgcccca tgacagccca ggggtggtgg
120cctcactcca cctccaggca cccacaagaa tataaaatct tgtacaagga tgtcgatatt
180actattgcca ttcccaagtg cacctgcacc tgtagtatca ggtggtttnc agccttggct
240gcatagctgc atatgagaat cacctgggaa gcttttaaag atcccagtat ccccacctct
300tccccagtta cagtggagtc ttgcgggtgg tgggggacat cattattttt gaagcttcca
360agtaattctg gtgtgcagtg gggtgaccag ctgtcccagg gacctccttt aaaaaataat
420atcccgggca catgacaggc caattgccct aatgcaac
458385510DNAHomo sapiensmisc_feature(343)..(343)n is a, c, g, or t
385cacctctgca cttttgtagg ctcaacaagt actggggagc ctgccaccac tgtatgcctt
60tgaggcccct gccctgcctc cctggctggc cacggagctc gccctccctg gtagggggtg
120agtttggaag tgagaggctg gtgtgggtct gtcccatgag ctgactcaca cttgcctcac
180cacacatacc atcagaagac ccacgtggtg gagctaccgc tgctgctccc cacagtgcac
240ctaggcaccc tcctgtcctt cccatggcac tcggttgacc tgggggttcc tgtccaacag
300gtgaggcctg gtgtgcacag acactctgcc attgctagaa ggnggctgtg ccccctgcta
360agatatcagt aggtccttca cagcctcacc ttgttcctcc catttgtttt taaaaattgt
420ttcttatata tacagtttat ttagcttacg taaacatttg gtgcacntaa nnnnnntcaa
480agatcatgat gtctcttttg tggttttata
51038692DNAHomo sapiens 386cctctgccat tgcccaaaga aagtacgcag gagggaaggc
gccgggggcg caggagtcgg 60ggggaagtga aatctcggca ttagaacccc cg
92387394DNAHomo sapiens 387aaggcgccgt caagtcaaat
aaataaatgc cctacaacac caacccagga ctgagatctg 60catgctggaa tgacggtggt
ggtggtggct ttcagtattc cccaggtttt gtccggagca 120ccggcacgcc ctctcttgaa
gtccgctctc cgcacagtgg ttagacggga agatccggag 180ctgtccagtg tcttgggtaa
tgcacggcat cgcctgatgt ctgacgctag aacaccacgt 240aaagtcaagc agagggaagt
gaatgcgccc taggcccctg caggccacca agaagagcta 300gagggagttg gtgcaatcct
agagatgccg gcaggtgcac caatctgtgg cacacgtacg 360ctctccaatg gaagacaact
caagaccaca ccaa 394388289DNAHomo sapiens
388actataatgc acttcgcaaa atgtaagggg ccggcttcac gccagcgggg ccttctggga
60ctttgaattc aaccaggtga gcgctccagg tgccccgaca ggcgcactgt agccactggg
120tgttaggggc gggagtctgg aaggtgacgg tagacggcca cttgggccct tctgggggcg
180agcctactgg tggggtcagg gctctccgtg ctcagagcaa ggtagaggag caaggcccta
240cttttggggg gcagggtcca gaccaaggac cctatgcgcg gagggtggc
289389139DNAHomo sapiens 389aggcctgacc gaagagaact ttaaggaact aaagcaagac
atttctagtt tccgctttga 60agtcctggga ttactaagag gaagcaaact ttccacaata
caatctgcga atgcctcgaa 120ggagtcttca aattcggca
139390528DNAHomo sapiens 390caggttcttg aagttctcca
ccatgacatc tcggtacagc ttcctctggg taagatcgag 60cagtcgcagt tcctccctgg
agaagaccac agccacatcc ttgaatgtca cagcctccta 120caatatcaaa cacatgtaac
ctcaatctta caaccaacct tcactagaag aagggtggca 180tcaagaagga aaagagcacc
acaaaaaagt tgttatagat tccaagagat ctcagtcaat 240tttcagctgt tacagttttc
cctgtctcac tatctcctac gctcatcccc ataaagcctg 300tagtttatca ctgttttttg
tttttttctt ttttgagatg gagtctcact ctgtcaccca 360ctgcactcca gcctgggtga
caggggtgag acactgtctt aaaataaata aatttttaga 420attaaaataa atagatcata
aagtgtttga aaggatcaga tgaatgaata tatgtcaagc 480acttagaagt gcctagcaca
ccatacatgc tcaataaact cgaacaac 528391443DNAHomo sapiens
391gccaggggtc gccaatcctg gaaccccact ggcttagagg gctgggggag agaaacatgc
60tgccctcttt gtagcagtca ggcgctgacc caagagaact caccttattc ttcatttcgc
120ctggtaatcc tccaggccct tctctacacc ctgaagggga gggaggaaaa tggatgaatg
180agagagggag ggaacagtgc ccaagcgctt ggcctctcct tctcttcctt cactttgcag
240aggctggaag acggcagccg ccggactggg cagatcctca agcagaccta cagcaagttt
300gacacaaact cacacaacca tgacgcactg ctcaagaact acgggctgct ctactgcttc
360aggaaggaca tggacaaggt cgagacattc ctgcgcatgg tgcagtgccg ctctgtagag
420ggtagctgtg gcttctaggt gcc
443392463DNAHomo sapiens 392tattggcacg tagcagtaca aggatggtga ggggtgggta
gggggcagac agctaggcac 60ttgaaaggaa agctcatctg gaaagattgg atcgtctcaa
atgcacatac tcgtacactc 120gattgaagcg tactctgtgc ctactagatc ttttcacagc
caaaaacacc tggcaaccct 180tggagaagta actattcctt tttttcacaa gtaagaaaat
agagcctcag aaaatttaac 240agttgtctaa gctagaaagt agcaggactg gactttgaag
tagtctttag gttgtgctgt 300acattttgtg gatatgctta aatcacagtt tagcttgtac
acattttcct ttattagaat 360tggaagtaag tattaatgtt tgaaaaaata ttttagcctg
acaatattta ttctatcttc 420atatgttttt gaaattagat attttaaact aggcacggtg
gct 463393376DNAHomo sapiensmisc_feature(26)..(26)n
is a, c, g, or t 393agctcatttt agtctcattt ctctcnctcc cttcttccct
gatgaataaa gtttattggg 60atggntttca gatgctcagc ttttccatat gattaggtna
gtgatccaga acccttccaa 120agnaccctgt ggactcaacc ctctgtttga acaacataca
agataatatg agacatttat 180ttatcgagga ccctctgagc acctggcact gtgccagatt
ctttcagata tataaaattt 240cacttgctcc tgttgattct ggaaaggagc aacggcatct
tatgaagctg tagcagatac 300tgtcctggcc tcgctcatgt gtgtcagatg tgttggagtg
ccctggctgc tgctctgcat 360gtgtagctga ggtcct
376394220DNAHomo sapiens 394tggattcatg ccaaaggaaa
ctgaaagcct gcctttcttt ttttcccagt gcacatctca 60gattatttgg cctttgtccg
aggactgaaa acagttctgt gtccaagtat gtttttaata 120cctgatattt atttcacaaa
aaaactgaaa ttgctttgtg tgtccaggct tgaatgttta 180aggcatactt gattaataca
tgtgtgctga gtgcttcctg 220395553DNAHomo sapiens
395caaccgccac atagtcacat tgtcaaatag cgtattcacc ttctcttata agaaggctca
60gcgagatctg gcgtataagc cactctacag ctgggaggaa gccaagcaga aaacggtgga
120gtgggttggt tcccttgtgg accggcacaa ggagaccctg aagtccaaga ctcagtgatt
180taaggatgac agagatgtgc atgtgggtat tgttaggaga tgtcatcaag ctccaccctc
240ctggcctcat acagaaagtg acaagggcac aagctcaggt cctgctgcct ccctttcata
300caatggccaa cttattgtat tcctcatgtc atcaaaacct gcgcagtcat tggcccaaca
360agaaggtttc tgtcctaatc atataccaga ggaaagacca tgtggtttgc tgttaccaaa
420tctcagtagc tgattctgaa caatttaggg actcttttaa cttgagggtc gttttgacta
480ctagagctcc atttctactc ttaaatgaga aaggatttcc tttcttttta atcttccatt
540ccttcacata gtt
553396357DNAHomo sapiensmisc_feature(90)..(90)n is a, c, g, or t
396ctagaaactc actcagtcct gtggttgcca acctttctcc atctcccgca gacgttttac
60tgcatgccag ataccatgtg cagtaacttn tgaatcctct cancccccta nctnccagaa
120cacnggacta tnagttactt gaaagctgag gcttggtaga gggctggagc caattgcgtt
180aaactaacta acattattgc aaaatatatt ctagggcttt tactctaata aaaatgactc
240ctggaactgc agtactatat tcttggaacc ccaagaaacc aggtgacaac ccataaattt
300accatcactt ttcagatgag gaaggcaaat ctggaaggcc aaattacttg tccaaag
357397423DNAHomo sapiensmisc_feature(184)..(184)n is a, c, g, or t
397gttagcacat accattgaat tcactgagac acatgagaaa atatgggaaa gtcggagagt
60ggaagtaaat gtaaagaccc ccctcctccc caaagagtac gttgtgtagt ggggtagagt
120ggaaaatcaa tccaagaaaa gtagcaaacg gacccaaaga tgaagaggaa gaaaagaaac
180agcnacacga aacgnaaaaa aaaagccacc agatttgttg caacgttgat gtaaacctgg
240ccgtcttcct gaaccagtga cccagggttt ccgcttccct ttgctgtcat cttgctcaag
300tctagaagct gaaatatcat catcaactcg acatgagggg ataacctctn gatccactca
360tcagatgctc atcagacgtt ccaattacaa aactgaacct cttcttagtg ctggggcggt
420tag
423398515DNAHomo sapiensmisc_feature(132)..(132)n is a, c, g, or t
398ggacaaaaac tttcccaagt cagcttttta ctatgattac gtcctagcct cagatgtggt
60ctaccatcac tacttcctgg acaagctgct caccaccatg gtgtaccttt cccagccagg
120gacggtgctg cnttgggcaa acaaattcan ggttcagcac cgactatgaa tttttagata
180aattcaagca agtttttgac acaacactgt tggctgaata tccagagtca tcagtcaaac
240tttttaaggg gatactaaaa tgggactaaa tccaacaaaa tgcctttcac aacgttactg
300tgtcttttga gcaatgtgtt agaaattgct ttggtaatag acttctttca caggattgag
360aaggtagtgc atagaaacaa cttgtatact tggaacaaat gtaacaatac tgcagaaact
420ttctaatttc taagataatt taagattatc tggttaatct aaatatctaa aaagaacaac
480ataaaaacat gaaagtagct ttgttggttc caacg
515399483DNAHomo sapiensmisc_feature(55)..(60)n is a, c, g, or t
399gactgccatc tgatcaacag ctacccttca gccaatttca cctttctgtt acttnnnnnn
60anngngccct tgtgtggata atgtncancn cantaaggaa tgcatgttag gtactttaag
120tcccagtaga atancagttc ataatatcac acatgtggtg aaatctaaaa aggcaatagg
180gctaattttg gcaggagttg gaatagcccc ttgggatggc tttgcatact gtcagacagc
240tttgagaaac tttcaaaccc ttgaaacact ggcaactnag tacaggcaga gccataaaag
300gacntcaagc ctccctgcac tccctagcca atgctgtctt ggataacaga tttgccctgg
360aatatcttct ggctgaacaa gggcgggtat gcacagtaat aaaccacatc tgttgttctt
420acattaacag ttcaggattg gctaaactgc aagttcaaaa gatttaccaa gaccaggcac
480aat
483400555DNAHomo sapiensmisc_feature(483)..(483)n is a, c, g, or t
400ggagcaaaac acttggaacc cacaagactc ccagaaggtg aagttaagag ctcccagact
60cataaggtta ttagaacagc aaactggcac cccaaagaac tttacggaga cttgcaacct
120atcaacaagt tggatgaggg attaaaagcc ttcaacaacc aacaacccca agcatcaaac
180tgaaggaaac attctaacct tcacagacag actggaggct ggatggggac ctggctgaag
240acatctggag aatgaaagtt aagtaccagc ttgcattttt gtgcccctag attatttttg
300cattttaaaa taagaagcat caaattgcgt gtctctgtgt aaaagttcta gcaatttgtt
360ttaaggtgaa cttattttgg cttagggact acaaaaagag aaggtaattc ctagggaagg
420aagaagagaa agaaatgaaa attagagaat aagattattt tgaatgactt caggtagcga
480ggngtgtgtg tttgtgagtg tgtatttgag agacttggct catgcctgtg ggtcttctct
540tctagtatca gtgag
555401327DNAHomo sapiens 401ggctgagaaa ctactggagc accagggaca gtctgtaaag
ttggatggac caccaatggg 60aaaatgagag ctgcccaccc tggccttaca ctccttcaat
taatacataa acagaaagga 120ggatatacag agagccaaag gcccatggga cgtgaccaac
attccactga gtctatacga 180tcaaacagca aactgtttat catgaataca gaatgtgggc
aaactcatga ctgtgcctgc 240cccagaaggt ttgctgaggg caattgcttc ctgacgccaa
gctccttgag gttatctatt 300gggacatcca gagaatgcag tcttgca
327402497DNAHomo sapiens 402gggtggcctg gggatagtgg
cttcatcttt tggggcttca agattctttg tctttaaaat 60caggggttat atcaagatca
tcaaagttcc cattccatta aagaaaaccc tgcatgtatc 120cataatgatg cttcttcctg
ttaaatttac aatgaaggga aacccatcac ttaactgtag 180gaatttccca aaatgaactg
atgaccagtg atctctctat cagaaaatgg cagatttcta 240gccttccaga actttgattt
tcttggacat tcaatggttc ctttttccca aatatttttc 300aactgatgcc aaaccttgga
tttggtttaa tccacctttg gtttaggttt ggggaccctt 360ttcctggacc gtcccagttt
tgggttaaac cgatttggat gaccctgtga gtcgccactg 420gataccgaca gtctgctgtg
gtgcttagaa gccactgaaa cattggtgaa tgtgaagtca 480cttttggggt gcctgcc
497403512DNAHomo sapiens
403gaaagctcca aatccatgac agtcgaagtc tctgctcctt caggaacagg acatcttcct
60ggccttaatc cattatagca gccgtgatgt catttctgta tttcaggaag actggcagac
120agttgctttc attcttcctc aaagtattta ccatcagcta cagtccaaaa ttgctttttg
180ttcaaggaga tttatgaaaa gactctgaca aggactcttg aatacaagtt cctgataact
240tcaagatcat accactggac taagaacttt caaaatttta atgaacaggc tgatacttca
300tgaaattcaa gacaaagaaa aaaacccaat tttattggac taaatagtca aaacaatgtt
360ttcataattt tctatttgaa aatgtgctga ttctttgaat gttttattct ccagatttat
420gcactttttt tcttcagcaa ttggtaaagt atacttttgt aaacaaaaat tgaaacattt
480gcttttgctc cctaagtgcc ccagaattgg ga
512404229DNAHomo sapiens 404caccccattc aaactcaagc acagtatgcc tccccagtct
ttatgcagcc tgtatataat 60cctcaccaac agtactcggt ctatagtatt gtgcctcagt
cttggtctcc aaatcctaca 120ccttactttg aaacaccact ggctcccttt cccaatggta
gttttgtgaa tggctttaat 180tcgccaggat cttataaaac aaatgctgct gctatgaata
tgggtcgac 229405495DNAHomo sapiens 405acagctcagg
ttttatcacc gactgggaat agacaacctc aatgctgaac cgcactggag 60aaaaggggca
aggtacccct gctgaggtgt atgggctgcc atctcaggct gtcttgagga 120cctgggctcc
ctctgctact cccaggaaat gggctcctga cacagcagtc tgccaccaca 180gccccaggag
ggtgtcaaca ccagcaaatg ctgtatttgc agcatgtcca agatgaccct 240tctcccctac
ctctacctag ccactggcag ggaggggaga cagtggtgat agcagcagca 300ctctaggcat
ggtgaacgcc tgggaccaag ccatgtggcg ttttttattt tgcctttctg 360gaagactcaa
gatatgtctc ttcattctct ctcagtattt gtttactttg gtttttttgt 420ttttaatctc
agagagaggt gtgtttagtg ggcacaagct gtaatattca gcaaaacttt 480gtcgactggc
actgt
495406472DNAHomo sapiensmisc_feature(77)..(77)n is a, c, g, or t
406ttcctcttgc tgagaaaacc caccctgctc acctaaaccc tggccttgcc tggtaattcc
60atccatgcgc ctggaangnc ccagacatca aggctctgag gggccaggca cggggagaac
120ccagcagtgc cctgccctgc agtctgagct accagattcc ttgtgaagat aatttgagga
180ccatgactca cccaaccaca tttcctgggg cctcaaattg aaaattcagg atgggctttt
240ctatatgact ggctgatatc caactatgcc atggtcttta catgccatga acattctttc
300ctgccagagt tctaagaatc tgtgttctct gccttagacc ttctgcagat gagcccacag
360gaagctccac gtgtagctga gctacatgca ccaggcctca gtttgcccca agtcccctgt
420gtactctctc atggcctgtg gccaagaaat gtattctctc actttggact ta
472407395DNAHomo sapiens 407agcagatgga ccctactgga agtcagttgg attcagattt
ctctcagcaa gatactcctt 60gcctgataat tgaagattct cagcctgaaa gccaggttct
agaggatgat tctggttctc 120acttcagtat gctatctcga caccttccta atctccagac
gcacaaagaa aatcctgtgt 180tggatgttgt gtccaatcct gaacaaacag ctggagaaga
acgaggagac ggtaatagtg 240ggttcaatga acatttgaaa gaaaacaagg ttgcagaccc
tgtggattct tctaacttgg 300acacatgtgg ttccatcagt caggtcattg agcagttacc
tcagccaaac aggacaagca 360gtgttctggg aatgtcagtg gaatctgctc ctgct
395408397DNAHomo sapiens 408attttcctca taaagcattg
ctccagctaa tcttatctat ttttctccag aatctccatc 60cccttcccgt cagatacatc
taaaactttt tttgtatctt tgtttttcct cgtgttgtat 120catcttccta aaacatgttc
tacttgtgaa aaccctaaga aattctctct gtcttattga 180aattctatct ccactgtgaa
gcattatcat ggtgtggcca tatatgatct atccctatct 240gaagtcactg catttattcc
ctgatcctca tttgcaggtc cagtaccttg tacaagtttc 300tttttgtgcc atattagact
gtaagctcca agagggcagg gcccaagtct tatgaatttg 360tgtctgcata gtgtctagta
cttgtctgag gcccaca 39740948DNAHomo sapiens
409aggacgtacc ttgtgagatg cgagccggcc aacagcttgc aagcatgc
48410459DNAHomo sapiens 410gcaagtcgcg tgatttctac cacacctgct actgcctgag
cggcctgtcc atagcccagc 60acttcggcag cggagccatg ttgcatgatg tggtcctggg
tgtgcccgaa aacgctctgc 120agcccactca cccagtgtac aacattggac cagacaaggt
gatccaggcc actacatact 180ttctacagaa gccagtccca ggttttgagg agcttaagga
tgagacatcg gcagagcctg 240caaccgacta gaggacctgg gtcccggcag ctctttgctc
acccatctcc ccagtcagac 300aaggtttata cgtttcaata catactgcat tctgtgctac
acaagcctta gcctcagtgg 360agctgtggtt ctcttggtac tttcttgtca aacaaaacca
atggctctgg gtttggagaa 420cacagtggct ggttttaaaa ttctttccac acctgtcaa
459411275DNAHomo sapiens 411agagggcaag gggctggatg
caggcagaga atgactttaa gaaaagattc tatgatccct 60tcctttagta tggagctcga
ttttccagct ggcgcttggt gagaaagtac ttgaagaact 120catagacaga ccaagaaatg
gcggtggagg gcatctggta gatgacacgc gcctggatgc 180ctttgaagta gccggccagg
ccgttgagct ggtacaccgt ccggaaggca ttggccatac 240ccgacagccg gccgctgatg
ttggccagcg agagg 275412536DNAHomo sapiens
412gcagataagc tccgtctgca gttccaggcc agccagaaac tcctgtgtcc acatagagct
60gacgtgagaa atatctttca gcccaggaga gaggggtcct gatcttaacc ctttcctggg
120tctcagacaa ctcagaaggt tggggggata ccagagaggt ggtggaatag gaccgccccc
180tccttacttg tgggatcaaa tgctgtaatg gtggaggtgt gggcagagga gggaggcaag
240tgtcctttga aagttgtgag agctcagagt ttctggggtc ctcattagga gcccccatcc
300ctgtgttccc caagaattca gagaacagca ctggggctgg aatgatcttt aatgggccca
360aggccaacag gcatatgcct cactactgcc tggagaaggg agagattcag gtcctccagc
420agcctccctc acccagtatg ttttacagat tacgggggga ccgggtgagc cagtgacccc
480ctgcagcccc cagcttcagg cctcagtgtc tgccagtcaa gcttcacagg cattgt
536413286DNAHomo sapiensmisc_feature(63)..(63)n is a, c, g, or t
413ttaatttctg tgaagagtgc ccctggtgtt tcatcttggc ctgttttgat gagaatgtta
60tcntttgtgt ctggataacg cgtcagcttc ttaaagtaca tataaagata ttctgtcacc
120nccccacatg cacacacttt taaaatctat ttttattctc ttgctaaagt tgtaattatg
180tcaagaattt tccagctcta actgccttct tagtacatgt ctttctgcct ttgaagcata
240tgagtttgcc aaagtcattc tcccctaatg acatattgtg gactta
286414166DNAHomo sapiensmisc_feature(27)..(27)n is a, c, g, or t
414gaaagacgga ggaaacaatc aaaatcncca ttctattgct ttgacacctt tactaggtga
60attggtggca ttcncaaagc taatagggac gtttatatca agaaacattt ctgtatatat
120tgttgaattt tagttgtaca tatactttgt atgtttttgt cttctt
166415552DNAHomo sapiens 415tgcaggctag gggaggagcc acccccgctt ccctattgtg
accaggccta tggggaggag 60ctgtccatac gccaccgtga gacctgggcc tggctctcaa
ggacagacac cgcctggcct 120ggtgctccag gggtgaagca ggccagaatc ctgggggagc
tgctcctggt ttgagctgca 180ttcaggaagt gcgggacatg gtaggggagg caaaaagcct
tgggcactac cctccctgtg 240gagctgttcg gtgtccgtcg agctagccac accctgacac
catgttcaag ggtaccggaa 300gagaagggtg tctgccccca acctcccctg tgggtgtcac
tggccagatg tcatgaggga 360agcaggcctt gtgagtggac actgaccatg agtccctggg
gggagtgatc ccccaggcat 420cgtgtgccat gttgcacttc tgcccaggca gcagggtggg
tgggtaccat gggtgcccac 480ccctccacca catggggccc caaagcactg caggccaagc
agggcaaccc cacacccttg 540acataaaagc at
552416524DNAHomo sapiens 416acgccgcgcg aaggtgatga
gctcgcccgg ctgccctacc tacggacctg gttccgcacc 60cgcagcgcca tcatcctgca
cctcagcaac ggcagcgtgc agatcaactt cttccaggat 120cacaccaagc tcatcttgtg
cccactgatg gcagccgtga cctacatcga cgagaagcgg 180gacttccgca cataccgcct
gagtctcctg gaggagtacg gctgctgcaa ggagctggcc 240agccggctcc gctacgcccg
cactatggtg gacaagctgc tgagctcacg ctcggccagc 300aaccgtctca aggcctccta
atagctgccc tcccctccgg actggtgccc tcctcactcc 360cacctgcatc tggggcccat
actggttggc tcccgcggtg ccatgtctgc agtgtgcccc 420ccagccccgg tggctgggca
gagctgcatc atccttgcag gtgggggttg ctgtataagt 480tatttttgta catgttcggg
tgtgggttct acagacttgt cccc 524417378DNAHomo sapiens
417aaatgactgc attcgtctct tttttaaagg tagagattaa actgtataga cagcataggg
60atgaaaggaa ccaagcgttt ctgtgggatt gagactggta cgtgtacgat gaacctgctg
120ctttgttttc tgagaagagg tttgaagaca ttttattaac agcttaattt ttctctttta
180ctccatagga acttatttta atagtaacat taacaacaag aatactaaga ctgtttggga
240attttaaaaa gctactagtg agaaaccaaa tgataggttg tagagcctga tgactccaaa
300caaagccatc acccgcattc ttcctccttc ttctggtgct acagctccaa gggcccttca
360ccttcatgtc tgaaatgg
378418116DNAHomo sapiens 418agtatggaag ctgagaagag ttattggaat cacccccacc
gttgacagag gaaggcaggg 60ggtgagaatt aactgcttga gggtaggaga gtctgagatg
tgggggccct attccg 116419147DNAHomo sapiens 419cctgagccac
cacgcagaag aggcactttc caagttgttt accaagaatt tacattaaaa 60taacaagcta
ttgtttggct atacattgtt ctttgtatca catattccag gaactacagg 120aaaataatgg
gtgaggcagc tagttag
147420310DNAHomo sapiens 420gaaattccat caatacatct agacagatgt ttgcttgtag
tttttggtat ccaaaacctt 60ttttccacac atcgcacaga tgcctttttt gtaggcacag
ccctggcagt aatgagaacc 120tggttggtgc acagaacttt tacaaattct acaagtggag
aacttattct ttccatatgg 180atcaaatctt gctttttttg aagtcaaagc tttattttca
ttcagctttc ttccaccact 240ttctgtggta ttcctagcac cacctttcca tgtatctgga
gtgataacag taccaagttt 300cttttcacat
310421154DNAHomo sapiensmisc_feature(68)..(68)n is
a, c, g, or t 421agatataact ggtagagcac gtcaaagata tacagaaata accagagaaa
agtttgaggc 60attaaaanaa gaaaatatgg acctaaacaa tatgaatcaa agccttaccc
ttgaactaaa 120cacaatgaaa caagcaatga aagaactaca gtta
154422444DNAHomo sapiensmisc_feature(92)..(92)n is a, c, g,
or t 422tttttgtgca tgattacact ccactgacat cttccaagta ctgcatgtga ttgaataaga
60aacaagaaag tgaccacacc aaagcctccc tnggctggtg tacagggatc aggtccacag
120tggtgcagat tcaaccacca cccagggagt gcttgcagac tctgcataga tgttgctgca
180tgcgtcccat gtgcctgtca gaatggcagt gtttaattct cttgaaagaa agttatttgc
240tcactatccc cagcctcaag gagnccaagg aagagtcatt cacatggaag gtccgggact
300ggtcagccac tctgactttt ctaccacatt aaattctcca ttacatctca ctattggtaa
360tggcttaagt gtaaagagcc atgatgtgta tattaagcta tgtgccacat atttattttt
420agactctcca cagcattcat gtca
444423510DNAHomo sapiensmisc_feature(357)..(357)n is a, c, g, or t
423gctttggact ggctcgcatg gaaaaccagg catttgatcc cgagaaaggg aacttcaaca
60ctttgttttg caggctctgc gtgctgctgc tggtgtgtgc cgcccaggcc tggctcatgt
120ggcgcttcat ccactcccag ctgcggcact ggcgggaata ctggaatgag cagagtgcaa
180agcggagagt cccagccaca cccagactac cagccaggct catcaagagg gaatctggtt
240accatgaaaa tggagtggtg aaggcagaga acggaacctc cccacggact aagaaactca
300agtctcccta aggccaaagt gctaagaaca ggaatcctct tggtgggggc cgagcanggg
360gcaaggagcc caggccccct ccctgcctcc tccttcctgc ctgtgatgct ccgtctcaaa
420cagccgaaac ctgtcttgca atggggggag gggngcgttt cnctttcctt cttcttggct
480tcctcttatt cttccacaaa ccattctcaa
510424191DNAHomo sapiens 424acattgtgcc tcaggatttt gataataatt ctggatattg
gaacagaata gaaatgtact 60gtcgagagct gacagaaagg tttgaagatg tttgggtggt
atctgggcct ttgaccttac 120ctcagactag aggcgatgga aagaaaatag ttagttacca
ggtgattggc gaggacaacg 180tggcagtccc c
191425186DNAHomo sapiens 425gcggtgtgga ccgaggaaca
acttggaaga tctacctgca acacaacatt tgtgtcactg 60tacagttttg tggactgagc
gaggaaaaac aacaaataat ttaagttggc tagagcttct 120gtattttcaa agactgccac
gtgccttagg aatactgttt tatctccata ctttggatga 180cttgtt
186426465DNAHomo sapiens
426gttttggacc aacaagtgcc tcctttggtg aaggtgacac atcatgtgac ctcttcagtg
60accactctac ggtgtcgggc cttgaactac tacccccaga acatcaccat gaagtggctg
120aaggataagc agccaatgga tgccaaggag ttcgaaccta aagacgtatt gcccaatggg
180gatgggacct accagggctg gataaccttg gctgtacccc ctggggaaga gcagagatat
240acgtgccagg tggagcaccc aggcctggat cagcccctca ttgtgatctg ggagccctca
300ccgtctggca ccctagtcat tggagtcatc agtggaattg ctgtttttgt cgtcatcttg
360ttcattggaa ttttgttcat aatattaagg aagaggcagg gttcaagagg agccatgggg
420cactacgtct tagctgaacg tgagtgacac gcagcctgca gactc
465427480DNAHomo sapiens 427tcctttgtgt agcattatca gcctcggtct ggcctctggc
acctcaccct tgccatggct 60gaccccaccc attccaaggc ggggtcacgg taccagcagc
acttggggtg aggcctccaa 120agcttcctca gaattgtggc tgtgccacgc tggaccacag
ggtccccctc aagcatctcg 180gggccctatt ctctctgagc acctggaggg ctggactcag
gcttgtgcca gggcctgact 240tgggcctggg ggccctagaa cactcctcct cctgagccta
ctgccaaacg tcctcagtgt 300tgtctgcacc tgctccgact ccttcagccg ccccattcag
cgcccgctcc gtccagtgcc 360cgccctgtgg ggccaaggcg gccgtgcctt actactctgt
gtcttctgcc tcctctgagg 420aatctggccc tgtctgacag tcccagaccc cccgttctct
cctctttagt tgcatgagtt 480428533DNAHomo sapiens 428ttcattcaca
aacttccgct gtacctgcgt ctaaaaaggc ccaaacccga gagagacctg 60atgccggagc
cccctcactg ttcttctcca ggaagtggct ggggtcgggg aacagatgaa 120tatttcatcc
ggaagccgcc aagtgatttt ctcttcccca aacccaatag gttccagcct 180gaactgtctg
cccctgatct gcggcgattt atcgatggtc caaaccgggc tgtggccctg 240cttccggagc
tacgggaggt cgtctcctct atcagctaca tcgctcgaca gctgcaggaa 300caggaggacc
acgatgcgct gaaggaggac tggcagtttg tggccatggt agtggaccgc 360ctcttcctgt
ggactttcat catcttcacc agcgttggga ccctagtcat cttcctggac 420gccacgtacc
acttgccccc tccagacccc tttccttgaa gactggaggg ttgagaccag 480gccccctgcc
agttgaagtg agagagtttg gtgatactgt caagccctat cct
533429486DNAHomo sapiens 429gtgacctttc acgaacatgg gcatggctgc ggctccctcg
tcatcaggtg catagcaagt 60gaaagcaagt gttcacaacg gtgaaacttg agcgtcattt
ttcttagtgt gccaagagtt 120cgatgttagt gtttccattg tattttctta cagtgtgcca
ttctgttaga tactatcctt 180ataattgatg agcaagacat actgaatgca tatttcggtt
tgtgtatcca tgcacctacg 240tcagaaaaca agtattgtca ggtattctct ccatagaaca
gcactatcct catctctccc 300cagatgtgac tactgagggc agttctgagt gtttaatttc
agactttttc ctctgcattt 360acacacacac acacacacac acgcacacac acacaccaag
taccagtata agcatctccc 420atctgctttt cccattgcca tgcgtcctgg tcaagccccc
ctcactctgt ttcctggtca 480gcatgt
48643097DNAHomo sapiens 430tattagttaa ttagtgattt
cacagtatcc tttcgcaggc cgatccccac tccaaccgtt 60ccctcagcaa ccccaggggt
gtcagacggg gcaccct 97431241DNAHomo
sapiensmisc_feature(88)..(88)n is a, c, g, or t 431gctgcctttg cactggcgaa
gggaggggca ctggttatgt tgtttccatt cgacagtcct 60tccaaaggct tccctccagc
gccactancc aaatccagaa aagcgtcctc ctccagaagg 120taccaccaaa cctttaaaac
ctttaaaggc tcctccagtg tcagattcaa atccaacatt 180tctgcgcttt gctttcttta
tggctctatt cttcaagact tcctcactgg ccatggagaa 240t
241432537DNAHomo sapiens
432tgagcctgtg cgttttgcat actgggttgg tttgctgggg ctgcggtgac agcatatgcc
60gcgagctggg ctttaacaga gatgtgtgct ctcacagctt tgcaggcggg ggtctgagat
120cagggtgtcg cgggtggggg gtcactgctg aggccgtgag gggaatctgc tcaggcctgt
180ccctggcttc tgggggctgc tggtggtatt ttcagttcct tggtgtgtgg atacttcgcc
240ccatctctgc cttcacctgt gtcctccctg tgtgggtgct ggtgtccaaa atttcccctt
300ttcgtagtga caccagctgt gttggattgg ggcccaccct gctccagcat ggcctaatct
360taactaatta catttgcaag gatcttatgt ccacaaaagt cacagtctga ggtgctgggg
420gttaggactt caatatataa attttgcggt tacacaattc aatccatgac agaatccaaa
480ggtttactct ggttataaaa acagtacaat aaaatattgt ttatagcctt ccctgta
537433355DNAHomo sapiensmisc_feature(56)..(56)n is a, c, g, or t
433gaaaacccgt tatgagacac aacttgaatt aaatgatgaa ctagaaaagc aaattnttta
60tctcaaggag aaagtggaaa aaatccatgg aaactcttca gatagactnt cttctattcg
120tgtctatgaa cgaatgccag tggaatcctt aaacacatta cttaaacagc tagaagaaga
180aaagangact cttgaaagtc aagtgaaata ctatgcactt aaactggaac aagaatcaaa
240ggcttaccag aagatcaaca atgaacgccg tacataccta gctgaaatgt ctcagggttc
300tggtttacat caagtttcta aaaggcaaca ggtggatcaa ctgcctagga tgcaa
355434319DNAHomo sapiens 434ggcaagaagc caggtaaggc atgcagtctt tctgttcccc
gttgggggag tggtattaag 60gaactgtgtc ttcaggatac agtgagctgt aaaaatagac
aacaagaaca cggaaactat 120ggtagacgaa tgggctgagg acacagttca tgaaagagaa
atatactcaa gatagaagaa 180cctgcttcat cttagtggtg atttttgtaa aatgtaattt
aaaatattcc ccgatgctgg 240gagctaagta aaaaataaat aagtaaataa aatacaaaat
tacatgtaca tttaaatgtt 300ttttctctat caagtttat
319435511DNAHomo sapiens 435cacgatgacc ccagacatga
gatccatcac taataatagc tcagatcctt tcctcaatgg 60agggccatat cattcgaggg
agcagagcac tgacagtggc ctggggttag ggtgctacag 120tgtccccaca actccggagg
acttcctcag caatgtggat gagatggata caggagaaaa 180cgcaggacaa acacccatga
acatcaatcc ccaacagacc cgtttccctg atttccttga 240ctgtcttcca ggaacaaacg
ttgacttagg aactttggaa tctgaagacc tgatccccct 300cttcaatgat gtagagtctg
ctctgaacaa aagtgagccc tttctaacct ggctgtaatc 360actaccattg taacttggat
gtagccatga ccttacattt cctgggcctc ttggaaaaag 420tgatggagca gagcaagtct
gcaggtgcac cacttcccgc ctccatgact cgtgctccct 480cctttttatg ttgccagttt
aatcattgcc t 511436515DNAHomo
sapiensmisc_feature(89)..(89)n is a, c, g, or t 436taagatccag ggttccagga
ctgccaccaa ctcctgtcca gctgctctat ccagtgtccc 60gattcagcaa tgtcaaatcc
ctccagcanc nntgcnnntn ccggatacga cagctcgtca 120ggatagatca catcccagat
ctcccactgc ctaaacctct gatctcttat atccgaaagt 180tctactacta tgatcctcag
gaagaggtat acctgtctct aaaggaagcg cagctcattt 240ccaaacagaa gcaagaggtg
gaaccctcca cgtagcgagg ggctccctgc tggtcaccac 300caagggcatt tggttgccaa
gctccagctt tgaagaacca aattaagcta ccatgaaaag 360aagaggaaaa gtgagggaac
aggaaggttg ggattctctg tgcagagact ttggttcccc 420acgcagccct ggggcttgga
agaagcacat gaccgtactc tgcgtggggc tccacctcac 480acccacccct gggcatctta
ggactggagg ggctc 515437489DNAHomo sapiens
437gctttgagga aaccactgtg caacttgaga tgtctgtggt tgtggggatg ttccatccct
60ccgttcagtt gtgaagacct ctgctctgcc ctcagcaacc agagcctcgt cactctggac
120ctgggtcaga atcccttggg gtctagtgga gtgaagatgc tgtttgaaac cttgacatgt
180tccagtggca ccctccggac actcaggttg aaaatcgatg actttaatga tgaactcaat
240aagctgctgg aagaaataga agaaaaaaac ccacaactga ttattgatac tgagaaacat
300catccctggg cagaaaggcc ttcttctcat gacttcatga tctgaatccc cccgagtcat
360tcattctcca tgaagtcatc gattttccag gtgttggtga actgcctgtg actcctctcc
420tccccggccc ctacccctca gggataatga gttcattgct gggctagatg ttttagccat
480gattctgcc
489438580DNAHomo sapiensmisc_feature(275)..(275)n is a, c, g, or t
438agcgagaccc agactcgtac aacaaacacc tcttcgtgca cattgggcat gccaaccatt
60cttacagtga cccattgctt gaatcagtgg acattcgtca gatttatgac aaatttcctg
120aaaagaaagg tggcttaaag gaactgtttg gaaagggccc tcaaaatgcc ttcttcctcg
180taaaattctg ggctgattta aactgcaata ttcaagatga tgctggggct ttttatggtg
240taaccagtca gtacgagagt tctgaaaata tgacngtcac ctgnnccacc aanntttgct
300ccnntgggaa gcnngtagta gnnaaantag anncggagta tgcaaggttn nagaatggcc
360gatttgtann ccgaataaac cgctcnccna tgtgtgaata tatgatcnac ttcatccaca
420agctcanaca cttaccagag aaatanatga tgaacagtgt tttggaaaac ttcacaattt
480tattggtggt aacaaacagg gatacacaag aaactctact ctngcatggc ctgtgtgttt
540gaagtttcaa atagtgaaca cggagcacaa catcatattt
580439581DNAHomo sapiens 439gcacggacac ctatgaagac cagcagtgga gaccccccaa
gcccactggt gaaacagctg 60agtgaagtat ttgaaactga agactctaaa tcaaatcttc
ccccagagcc tgttctgccc 120ccagaggcac ctttatcttc tgaattggac ttgcctctgg
gtacccagtt atctgttgag 180gaacagatgc caccttggaa ccagactgag ttcccctcca
aacaggtgtt ttccaaggag 240gaagcaagac agcccacaga aacccctgtg gccagccaga
gctccgacaa gccctcaagg 300gaccctgaga ctcccagatc ttcaggttct atgcgcaata
gatggaaacc aaacagcagc 360aaggtactag ggagatcccc cctcaccatc ctgcaggatg
acaactcccc tggcaccctg 420acactacgac agggtaagcg gccttcaccc ctaagtgaaa
atgttagtga actaaaggaa 480ggagccattc ttggaactgg acgacttctg aaaactggag
gacgagcatg ggagcaaggc 540caggaccatg acaaggaaaa tcagcacttt cccttggtgg a
581440449DNAHomo sapiens 440ggcgtataat tcagccctgt
ttaaatatac ttgcctttca aattcttcaa gtaacatggg 60aagtattctt gaaatgtcac
attttctgcc ttccctctaa gtatgctttc tgaagaagtc 120agggaaagtt agagtctgtg
gcctgaggtg tctgctctgg gtggcgatag tgggcacctc 180aggcaggtcg gtgacgttta
gcacaggtgc cagggctcct gcctgctcct cctgtgttag 240ctctgtgaag ttcatttagg
aatttttttt tcctatgcag tttaagaaat aatcctaatt 300gttttttctt attacctaag
caatatattt ttattatagc aacctcagaa aagaaaaata 360aaaggataat ttaaaaaact
cattcatagt ctcagttacc cagataacct cggttgtcac 420cttggagtat cttgttgtag
tccctttac 449441457DNAHomo sapiens
441agcagaggct catccgggag cagatacgcc aggagcgtga ccagaggttg agaggaaagg
60cagaaaatac tgaaggccaa ggaaccccca aactaaagct aaaatggaag tgcaagaagg
120aggatgagtc aaaaggtggc tactccaaag acgtcctcct acggcttttg cagaagtatg
180gtgaggttct caacctggtg ctttccagta agaagccagg cactgctgtg gtggagtttg
240caaccgtcaa ggcagcggag ctggctgtcc agaatgaagt tggcctggtg gataaccctc
300tgaagatttc ctggttggag ggacagcccc aggatgccgt gggccgcagc cactcaggac
360tgtcaaaggg ctcagtgctg tcagagaggg actacgagag cctcgtcatg atgcgcatgc
420gccaggcggc cgagcggcaa cagctgatcg cacggat
457442498DNAHomo sapiens 442aaggctatta acgacgcgat ttcacaaagt cggcagagtt
ctgcgggaaa tcccctggaa 60agactcaatt aaagagcagt gaagagagtg cagatcccgt
cactggaagt tcggaaaatg 120cagtgtcatc ttcagaactg atgtcccaga ctcccagtga
agttctgggt accaacgaga 180atgagaaact gagccctaca agtaatacct catatagttt
agaaaaaatc tccagtctgg 240cccctcctag catggagtac tgcgttttac tcttctgctg
ttgtatttgt ggttttgaat 300caaccagcaa agaaaacctc ttggatcata tgaaagagca
cgagggtgaa attgtaaaca 360tcatcctgaa taaggaccac aatacagctc taaacacaaa
ttaggtggaa taatgactcg 420agcaggaaag cagtagaaga ggattccttc accacagttt
cacctttacg ctgtcagaca 480acttcctgcc acagaaga
498443476DNAHomo sapiensmisc_feature(73)..(73)n is
a, c, g, or t 443caaccgagag ggccggcagg agcttgaaat cattattgga gatgaacaca
tttcttttac 60aacatcaaaa atnggttccc ttattgatgt cagtcaatcc aaggatccag
aaggcttatg 120agtattttat tatcctgtcc aggaccctga agtgtttggt cttcagtctt
actggattac 180acttcaagat taaaccaatc taaactgaat attgatgtgg acatgggggg
gtgggagtag 240ttntnaatta ccattatcaa gaacatttng tgtcagggca gtatattttt
ataaactata 300tatgattatc tttaataaan tatgtgataa aatttaaaaa aagcaaaaca
aaacttctag 360angaataccn tcaaaacctt ggtgagggan attcttanac agcacaaaaa
tcattaggnn 420aagatcaant ttaacatngt caaattaatc aatgacttct cttcctcaaa
agacat 476444133DNAHomo sapiens 444ttccagagct acccagacca
tatggtgcac ccacagatcc agctgcagct ggtcctttag 60gtccatgggg atccatgtct
tctggacctt gggcgccagg aatgggaggg cagtatccta 120cccctaatat gcc
133445353DNAHomo sapiens
445cgccgctgcg aattctcgga caaaactgtc aacagcccgg gcgcgccttt tggctctgcg
60ggtccctcta tttatgcaaa gccgacctat gctacagccc cccaaccccc gacctggggt
120agggaggaag agggtgccgg ggaagggagt ccgccctgtc caggcactag aggctccctt
180gacgtttggc agatgaaaaa caactaagcc tttttgaggt gtagagattc tcaggtccag
240gcgttaaaaa ataatggtca aaagaataat acaaaaatag taaaggtctt gaagaatgcc
300agcgaagcaa ttctttttta tttgaggaca cttgtctggt gtactttttc atg
353446416DNAHomo sapiensmisc_feature(275)..(278)n is a, c, g, or t
446gaggaagata tcctggctgg cactctttca gttgacagag agtgacctca ggctggggcg
60gctcctcctc cgtgtggccc cggatcagca caccaggctg ctgcctttcg ctttttacag
120tcttctctcc tacttccatg aagacgcggc catcagggaa gaggccttcc tgcatgttgc
180tgtggacatg tacttgaagc tggtccagct cttcgtggct ggggatacaa gcacagtttc
240acctccagct ggcaggagcc tggagctcaa gggtnnnnca gggcaacccc gtggaactga
300taacaaaagc tcgtcttttt ctgctgcagt taatacctcg gtgcccgaaa aagagcttct
360cacacgtggc agagctgctg gctgatcgtg gggactgcga cccagaggtg agcgcc
416447409DNAHomo sapiens 447gctccccaca tgctggtggt gtactctgct aatggagaga
tgtttaaact gagagctgct 60gatgcaaaag agaaacaatt ctgggtgact cagcttcgag
cttgtgccaa ataccacatg 120gaaatgaatt ctaagagtgc tccaagctcc cgaagccgaa
gtctcacttt gctcccacat 180ggaacaccca attctgcgtc tccctgtagc cagagacacc
tcagtgtggg ggcccccggt 240gttgtcacaa tcacgcatca caagtcgcct gcagccgccc
gaagagccaa gagtcagtat 300tccggccagc ttcacgaagt cagagaggta cacactctcc
tgacagagga aagctgtttg 360ctgcactggt ttactggata gattaactgg gttgaggctg
tgtaattta 409448316DNAHomo sapiens 448gaggggcaca
tgcaagtcac caaagtggga agccttcacc aaggccacac ccaaagtcta 60ctgattgtct
gtccaaagtt cgttgattcc tggccatgaa caagcacaat agaaaaagac 120acagggtcct
agtggctaca agtcaatgtg aattggcaca tggtctagca gttttaaaat 180ctgacagtag
agtatggcaa tgggcaaggg ccaagaagtc ctgagatggg aggtcagcgc 240tctaactggg
ctcagtggag gtctgtgacc agtgtctgga cactagctac aggggaccgg 300gcagaggatt
ctgggc
316449473DNAHomo sapiensmisc_feature(241)..(241)n is a, c, g, or t
449gcactttagt gattgctttt attacattag ttaagatgtc ttgagagacc atctcctatc
60ttttatttca ttcatatcct ccgccctttt tgtcctagag tgagagtttg gaaggtgtcc
120aaatttaatg tagacattat cttttggctc tgaagaagca aacatgacta gagacgcacc
180ttgctgcagt gtccagaagc ggcctgtgcg ttcccttcag tactgcagcg ccacccagtg
240naaggacact cttggctcgt ttgggctcaa ggcaccgcag cctgtcagcc aacattgcct
300tgcatttgta ccttattgat ctttgcccat ggaagtctca nagatctttc gttggttgtt
360tctctgagct ttgttactga aatnngcctc gtggggagca tcagagaagg ccaggangan
420tggtgtnttn ccctagactc tgtaaccacc tctctgtctt tgtccttcct gag
473450512DNAHomo sapiensmisc_feature(363)..(363)n is a, c, g, or t
450gggaagtagg tgatgccagc cctcaagtct gtcttcagcc agggacttga gaagttatat
60tgggcagtgg ctccaatctg tggaccagta tttcagcttt ccctgaagat caggcagggt
120gccattcatt gtctttctct cctagccccc tcaggaaaga aggactatat ttgtactgta
180ccctaggggt tctggaaggg aaaacatgga atcaggattc tatagactga taggccctat
240ccacaagggc catgactggg aaaaggtatg ggagcagaag gagaattggg attttagggt
300gcagctacgc tcaccctaaa cttttggtgg cctggggcat gtcttgaggc ccagactgtt
360aancaggctc tgctggcctg tttactcgtc accacctctg cacctgctgt cttgagactc
420catccagccc caggcacgcc acctgctcct gagcctccac tatctccctg tgacgggtga
480acttcgtgta ctgtgtctcg ggtccatata tg
512451397DNAHomo sapiens 451gtgaacattt caaccagcct tatagctgtt ctcatcatca
ccttctgcat tgtgaccgtg 60cttggaaggg aggctctcac caaaggggcg ctgtgggcag
tctttctgct cgcagggtct 120gccctcctct gtgccgtggt cacgggcgtc atctggaggc
agcccgagag caagaccaag 180ctctcattta aggttccctt cctgccagtg ctccccatcc
tgagcatctt cgtgaacgtc 240tatctcatga tgcagctgga ccagggcacc tgggtccggt
ttgctgtgtg gatgctgata 300ggcttcatca tctactttgg ctatggcctg tggcacagcg
aggaggcgtc cctggatgcc 360gaccaagcaa ggactcctga cggcaacttg gaccagt
397452426DNAHomo sapiensmisc_feature(32)..(32)n is
a, c, g, or t 452gactgtaggt gcgtgggaga aactttgcag gntggggacc cggcggctgc
tggccggtag 60tgactggtgg gcgcgctcga ggactccaag gggcgcagcc cgggggcaga
cccttgggtc 120gggcggggat cttacgcttc ccttacccgc ccccttttgt ctttcacctc
agccccgccg 180gctgctgtgg gagcggcggc cgtccctctc ctggaggtcg tctcctggca
tcctcggggc 240cgcaggaagg aagaggaggc agcggccgga gccctggtgg gcggcctgag
gtgagagccc 300gaccggcccc tttgggaata tggcgaccgg tggctaccgg accagcagcg
gcctcggcgg 360cagcaccaca gacttcctgg aggagtggaa ggcgaaacgc gagaagatgc
gcgccaagca 420gaaccc
426453384DNAHomo sapiens 453ctaaagaaag tacacacact ctctcgctct
ctctcggtct tataaaactc gttggtgtct 60tataaaacaa acagtgataa tctcaagtta
gaaaacagta ggtcctgaga accataagaa 120aaatgactgg tgtgatgttg agtaacaagt
tggtacagtt actttagcta tttattaact 180tgctcatctc atagaacatt ttaatagatt
tttcacacac ctcattatta aaaaaaaaca 240aacatgctgg tgtcttggtt acccattatt
cctctgtacc tgaattcagg ttggtttttc 300tatttggaaa agactttata aatgttggct
taaaaagagg ttgagcacca gaatctcaga 360atttaccacc aaagaactca tcca
384454407DNAHomo sapiens 454agcataatga
agcctgcatg tgcccagctt caataattac caatatcttg ccagttttgt 60ttcgtttctc
ctttgattct ctgtattgag caagtcttag acatcatacg tttcccgcgt 120aagtacctta
ttctacatca ttaaccagta aggacttttt aattaaccac aataccacta 180tcacacctaa
taatagtaat tccttatgga tcttttcttt agacctattt ttgaaggcat 240aaaagcagtt
gagtttctgg agaatttttg gatggtgatt aatgacttga ctggctgctc 300ttcccagagc
tgtggcagct ctccccccgt agaagatggg gtttgtattg gcgcaccaag 360atctccaaca
gccagtgtgt gtttcccatt tcctgtaggt tccatca
407455223DNAHomo sapiens 455tagtcagagt gacccatgta tctgggaaga ctctagtctg
gactgtggcc cagcttgggg 60accttgtgtg ctcagatcat cttcaggaag gaaaaggcat
cctggagaca ggagtccatt 120cactcctctg ctctctaccc actcatttgc ttgccaaact
tagctttgcc agtgatagtc 180aatattaaag tgtacttttt tcccctttaa tccaatatag
ttg 223456160DNAHomo sapiens 456tataattata
accttaccgc atggacagtt ttgaatccta tgctaattgg ggtaattaag 60tcaattattt
catatgttat gttctcttca tgtgcatttt tcaatgatat attatgttcc 120attgtgttgg
aatgtgaatg ttcaattact tttccctata
160457465DNAHomo sapiens 457ccacatccat ggcctaggag ctactgggca ggttcccggc
cacacatctg gtgggctgtt 60ttgttttttt ttttcctctt cccccagatg tcttgacggg
atcactgggg ctctttgtga 120gtgagggtgg ccaaactacc gccggaggag atggggtctc
agagcgagag ctgcggaggg 180ggaggggaag aagaaggcct cacttttgct gctgcggggc
ccacacagcc gctgctactt 240tggggggtgg ggaaggggcc aagctgcaga cacacacagt
cattcatttc tgtccacacc 300cctgtgggtg gcgggtgtgc gtgtgtgtgc ttgtgtgtgc
gcacgtgtcg gcgctcacac 360acacatgcta gcccactgat gcacccagcc cagggctggc
agtctttgca gcgtggggcc 420gtctcaccct ggagcctgga gaggatctat gcttgtttgt
ttttg 465458212DNAHomo
sapiensmisc_feature(122)..(122)n is a, c, g, or t 458gtgccgctgg
cacccgggaa gacgctgggg gccggcgctg tagagccggg catgggctgg 60gatgtgtttg
gattccaatc cgggcctgac accagttcag tgacctcggg aagttcccca 120ancctccggg
cctgtttcct ccctctgaag tggcgacnag tagtagaacc gacctcgtag 180gctcatcggg
aggtcctgat gggagaaccc at
212459342DNAHomo sapiensmisc_feature(161)..(162)n is a, c, g, or t
459ggttgtactc aagatgtttt cctggaaaaa ttcattctgc tttctgacca ggatttccag
60aaactctgac ccttctaaga ggtctgggtg gaattgtgat ggtgattctg ctagtagaca
120gtgtaacttc tgcgtctaca aaaagaggat aggccgtcac nnctcacatg gctttgcgtg
180aaagcccaat ggtactgtct ctatggcaga gatgaggaag gaacaccagc gtcctccaac
240tttcctgttc ttcctttggg ttaatggcca ctgtaaggaa acagttttct gccacgtgtg
300gggtgatttg aatgtaaaat gcccaactct catagcaggc tg
342460519DNAHomo sapiens 460aaggggaaga tttgctgctg ctgccgggcc aagttcccgc
tgttctcgtg gccgcccagc 60tgtctcttct gcaagagagc cgtctgcact tcctgtagca
taaagatgaa gatgccttct 120aagaaatttg gacacatccc tgtctacaca ctgggctttg
agagtcctca gagggtatca 180gctgccaaaa ccgcgccaat ccagagaaga gacatctttc
agtctctgca agggccacag 240tggcagagcg tggaggaggc gttcccccac atctactccc
acggctgtgt cctgaaggat 300gtctgcagtg agtgcaccag ctttgtggca gacgtggtgc
gttccagccg caagagcgtg 360gacgtcctca acactacgcc acgacgcagt cgccagaccc
aatccctcta catccctaac 420accaggactc ttgacttcaa gtgacagccc caggtggcca
ggcctccagg aggcaccagg 480caggccctgt atcaggctag gacgctctga gctgtgcat
519461208DNAHomo sapiens 461tcccccctct gaattttact
gatgaagaaa ctgaggccac agagctaaag tgacttttcc 60caaggtcgcc cagcgaggac
gtgggacttc tcagacgtca ggagagtgat gtgagggagc 120tgtgtgacca tagaaagtga
cgtgttaaaa accagcgctg ccctctttga aagccaggga 180gcatcattca tttagcctgc
tgagaaga 208462532DNAHomo sapiens
462ctcagcattt agtgaaggta attccaaaat actggtatca gtactcttat ttataagtgt
60acggaatgca taacatgaac attagtcaaa gaacttttaa tataattcac tttttaagtg
120ttaaaattta aaggtcaagt aaaattgtaa atttgtaata tggaaacatt aagcgtcatt
180atcatacaaa ttattagcag ataaccttaa taaaaataaa cgtttgcggg ttttttttga
240gacagggtct cgctttgtca cctaagctgg agtgcagtgc gcgatctcgg ctcactgcaa
300cttccgcctc ctgggatcaa gtgattctcc tgccttagcc tcctgagtat ctgggtttac
360aggtgtgtac cgccacaccc gtctctacta aaaatacaaa aaacaaaaaa agattagctg
420ggcgtggtgg caggtgcctg tggtcccagc tgctcgggag gctgaggcag gagaatagca
480tggacctggg aggcggagct tgcagtgagc tgaaatggtg ccactgcact cc
532463542DNAHomo sapiens 463attatcgatc atgtctattg ctccccgtcc cttcgctgcg
ttcagactgc acacaatatc 60ttgaaaggtt tacaacaaga aaatcacttg aagatccgtg
tagagcccgg cttatttgag 120tggacaaaat gggttgctgg gagcacatta cctgcatgga
tacctccatc agagttagct 180gcagccaacc tgagtgttga tacaacctac agacctcaca
ttccaatcag caaattagtt 240gtttcagaat cctatgatac ttatatcagt agaagtttcc
aagtaacaaa agaaataata 300agtgaatgta aaagtaaagg aaataacatc ctgattgtgg
cccacgcatc ttcccttgaa 360gcgtgtacct gccaacttca gggcctgtca cctcagaact
ccaaggactt cgtacaaatg 420gtccgaaaga tcccatatct gggattttgt tcctgtgaag
aattaggaga aactggaata 480tggcagctga cagatccacc aatccttcct cttacccatg
gaccaactgg gggcttcaac 540tg
542464451DNAHomo sapiensmisc_feature(368)..(368)n
is a, c, g, or t 464cagccccatg acagcgaagg gacctttctg tccccgcccc
tgtccctgtg ctgggcccac 60gtactcaccc acgtactggt gcccggctcc cctgggcacc
cagagccccc cagataggcc 120ggtggaggag gtggaggagc tgtcccccca aaactactgg
cctgtggtct ggactccagg 180gccccatttc tgatgtcgcc aggtgtgcct gagcccatcg
gggccaggcc tgaggaagtg 240tttcttggga ggatgggatg accccctgtt cccaagagat
ggcagcacag tggaggccat 300ggtggaaaag gccctgccat ggggtccttg agggccagga
cagcctgagg gagggatggt 360ggccactncc cacaaggggc ctggtgggaa cgggtcccag
gacagactca tagctagacc 420ccgttggcgg cctctgtgtt gaaccagaac t
451465467DNAHomo sapiens 465ggccccaggc agttttatga
tgacacctgt gttgtcccag aaaaattcga tggagacatc 60aaacaagagc caggaatgta
tcgggaagga cccacatacc aacggcgagg atcacttcag 120ctctggcagt ttttggtagc
tcttctggat gacccggcaa attctcattt tattgcctgg 180actggtcgag gcatggaatt
taaactgatt gagcctgaag aggtggcccg acgttggggc 240attcagaaaa acaggccagc
tatgaactat gataaactta gccgttcact ccgctattac 300tatgagaaag gaattatgca
aaaggtggct ggagagagat atgtctacaa gtttgtgtgt 360gatccagaag cccttttctc
catggccttt ccagataatc agcgtccact gctgaagaca 420gacatggaac gtcacatcaa
cgaggaggac acagtgcctc tttctca 467466405DNAHomo
sapiensmisc_feature(162)..(162)n is a, c, g, or t 466catacaccta
ttaccataca ggggaagtcc ccaagctctc cggcctcaca gactctcacc 60cacgggcaga
gcattcttgg ctgattgagg ggaagttcca gcaatcagca caagtgttct 120ttatacccca
aatcactaaa acatatagag gggtctatgt cngtttcatc cataactcag 180ccactggtgg
aacaaatctc ataatcaaga ggatcatagt ccctggtaag tggatccctg 240gagcattggc
accatgtttt ccagtaaagt ctatctagct gtcagggaag agccacctgc 300nctctgcaaa
gggagaggga aaatcaaaac ccaggaaagg gaatatgttt ctgctccaaa 360accaccagct
tctgcctgtc cccttcactc tttctagatc attct
405467110DNAHomo sapiens 467gaaagagcga gagaagggga aagacaagtc gggagaggcc
ggtaggcgtg aggcgggcct 60gaagcggcag cgggcggcct tcgtccggcg agagctaggc
cgaggacccg 110468204DNAHomo sapiens 468ctgcccccca
gggctagtga agtggcctct tggataccag ctcaggggac actggcccca 60caggagttgt
gagccctcta gggcagggtg ggagccggga ccctcaggtg tagctgagct 120gtgacattgc
tggtcatcct tggtgctctt gcttttttga aagatgcttt tttttttttt 180aactgacgta
gaatgaagaa ctgc
204469139DNAHomo sapiens 469tcagatagga aggatggata tgtctttatc tacagcagaa
gttagttacc ctttcatgag 60gtgattagtt tacttctagg tggaaaaaga gaggactttg
aacttggtgt tgtcacagga 120gctgctctca tggacaaga
139470115DNAHomo sapiensmisc_feature(81)..(81)n is
a, c, g, or t 470ctcagagatt actcagccag acagagatat tccactggtg cgaaagttac
gttccattca 60cagctttgag ctggaaaaac ntctgaccct ggagccaaag ccagacactg
acaag 115471475DNAHomo sapiens 471cagcgcctcc ggttataagt
tgaagaaata agaccagttt ccaaataaat gacaaagagc 60ttggtattcc tgcaggcatc
agaatcacct ggaggaggag atgctgctgc tggtggtggc 120ccagagacca cacattgaga
accactgctc tagaaaacca tttgtctttg ctgatggaga 180aacctggctc taatagaagg
gcttgtatgt gtccaggaag tctagtgaat tcgaccatga 240atccagacat ggccagtggc
taaatcctgt gggaagacac tgtgcttctc tctgacccat 300gaacactctg ctagtcaagc
tctctgtcac aaagacaact tgaagagaca gagtggacct 360cacagaagat accatcgtca
ctcttaccaa tgcaactgtg gtgaacagga ccactattat 420tccttagatc aaaaggacag
cacattcaac agcatcctca tggcatgcca gcaat 475472446DNAHomo sapiens
472cggcttgttg ggaccaccaa ccaaggggac cagcgcatcc tgcgcagcag cgcccctccc
60tccctggctg gccctgctgt tagtcacaga ggccgcaagg ccaagacgtg agtgggctgc
120ccctccacct aggctttcca ccgtggccac tccctccatg accaggcctg actctgttaa
180ccactacttg aagtcttgag ggggaaagcc tccagggaga cataggggcc ttctcccttc
240ttcccaccaa agtagggggt aggcaactgg ttgtcatgga aatggggatc atcacagtcc
300ccttcccctt caccccacgt ggctgggcag tgttaagggt ggcaagatag tctctgtccc
360cacccccttg tacttgattc cccagctgtc tttcacacag ccccccaccc ttaggggaag
420ggggaggggc ttctctacaa tgaggt
446473443DNAHomo sapiens 473gagacttggt ggtctgagct gtcccaagtc ctccggttct
tcctcgggat tggcgggtcc 60acttgccagg gctctggggg cagatttgtg gggacctcag
cctgcaccct cttctcctct 120ggcttccctc tctgaaatag ccgaactcca ggctgggctg
agccaaagcc agagtggcca 180cggcccaggg agggtgagct ggtgcctgct ttgacgggcc
aggccctgga gggcagagac 240aatcacgggc ggtcctgcac agattcccag gccagggctg
ggtcacagga aggaaacaac 300attttcttga aaggggaaac gtctcccaga tcgctccctt
ggctttgagg ccgaagctgc 360tgtgactgtg tccccttact gagcgcaagc cacagcctgt
cttgtcaggt ggaccctgta 420aatacatcct ttttctgcta acc
443474465DNAHomo sapiens 474cctaattcac acaaagactc
cttgtggact ggctgtgccc ctgatgcagc ctgtggctgg 60agtggccaaa taggagggag
actgtggtag gggcagggag gcaacactgc tgtccacatg 120acctccattt cccaaagtcc
tctgctccag caactgccct tccaggtggg tgtgggacac 180ctgggagaag gtctccaagg
gagggtgcag ccctcttgcc cgcacccctc cctgcttgca 240cacttcccca tctttgatcc
ttctgagctc cacctctggt ggctcctcct aggaaaccag 300ctcgtgggct gggaatgggg
gagagaaggg aaaagatccc caagaccccc tggggtggga 360tctgagctcc cacctccctt
cccacctact gcactttccc ccttcccgcc ttccaaaacc 420tgcttccttc agtttgtaaa
gtcggtgatt atatttttgg gggct 465475443DNAHomo sapiens
475agaatgcaaa gaggccgctt ccctaagagg cttggaggag ctgggctcta tcccacaccc
60acccccaccc cacccccacc cagcctccag aagctggaac catttctccc gcaggcctga
120gttcctaagg aaaccaccct accggggtgg aagggagggt cagggaagaa acccactctt
180gctctacgag gagcaagtgc ctgccccctc ccagcagcca gccctgccaa agttgcatta
240tctttggcca aggctgggcc tgacggttat gatttcagcc ctgggcctgc aggagaggct
300gagaccagcc cacccagcca gtggtcgagc actgccccgc cgccaaagtc tgcagaatgt
360gagatgaggt tctcaaggtc acaggcccca gtcccagcct gggggctggc agaggccccc
420atatactctg ctacagctcc tat
443476458DNAHomo sapiens 476gactcagtgg gcactagaac gcctgaggct gcagctgggc
tccccggggt ccttgcagag 60gaaactcagt ctgctggagc aggaatccca gcagcaggag
ctgcagatcc agggcttcga 120gagtgacctc gccgagatcc gcgccgacaa acagaacctg
gaggccattc tgcacagcct 180gcccgagaac tgtgccagct ggcagtgagg gctgcccaga
tccccggcac acactccccc 240acctgctgtt tacatgaccc agggggtgca cactacccca
caggtgtgcc catacagaca 300ttccccggag ccggctgctg tgaactcgac cccgtgtgga
tagtcacact ccctgccgat 360tctgtctgtg gcttcttccc tgccagcagg actgagtgtg
cgtacccagt tcacctggac 420atgagtgcac actctcaccc ctgcacatgc ataaacgg
458477475DNAHomo sapiensmisc_feature(342)..(342)n
is a, c, g, or t 477agcatcctga accagctgtg ttttattatg cacagatatc
gtaaaaattt gactgccgca 60aagaaaaatg agttggtaca aaagacaaaa tcagagttca
atttcagcag caagacttat 120caagaattta attactattt gacatcaatg gttggttgcc
tgtggacgtc caaacccttt 180gcgaaaggaa tatatattga ccctgaaatc ctagaaaaaa
ctggagtggc tgaatataaa 240aacagtttaa atgtagtcca tcatccttct ttcttgagtt
acgctgtttc ctttttgcta 300caggaaagcc cagaagaaag gacagtaaac gtgagctcta
tncggggaaa gaaatggagc 360tggtatttgg actatttatt ttcacagggg ttacaaggct
tgaaactttt tataagaagt 420agtgttcatc attcttccat tcccagagca gagggcataa
actgcaacaa tcaat 475478490DNAHomo sapiens 478ctcgcagagt
tccgtcgatc aggactggag gaagccacgt ttcaacagat atatagtcaa 60catgtggcac
tgtgcagaat ggagggactg ccgtacccca ccatgtcaga gaccatggcc 120gtgtgttctc
acctgggctc ctgtcgcctc ctgcttgtgg agcccagcag gaacgatctg 180ctccttcggg
tgcggctcaa cgtcagccag gatgatgtgc tgtatgcgct gaaagacgag 240taaaggggct
tcacaagtta aaagactggg gtcttgctgg gttttgtttt ttgagacagg 300gtcttgctct
gtcgcccagg ctggagtgca gtggcacgat catggctcac tgcagccttg 360acttctcagg
cttaggtgac cccccaacct catcctccca ggtggctgaa actacaggca 420catgccacca
tgcccagctg attttttgta gagacagggc ttcaccatgt tgccaagcta 480gtctacaaag
490479460DNAHomo
sapiensmisc_feature(72)..(77)n is a, c, g, or t 479ttttttaggg actctcaacc
tcctggcagg gttaaaggga gagtacttta aacccatata 60ccagctgtgc tnnnnnntct
ctcactttgc cctgggtaag ctgctgtagg gtcagaagta 120accctttctg tgccagttga
gaatgagcct gtgtggtagc tgatgtcaga ggacaaagct 180ctctgcaagg gctggacaca
gagctgcaga gtcctgaaca tccctccttt caggctgcag 240aagggagagg caatgaagac
aggtgctccg gaagcagcat cagggctctt ggaggggact 300ggtggggact caggctgggt
gcagcctcca aacagagaac ggaacttagg tgtgtctcta 360cagnctaggc ccagcctagc
ccagcccaga acaaacaccc ttcagagcct aaccaaagaa 420cataagctgc aaaatgtgca
cccatatttt aagctgcttt 460480492DNAHomo
sapiensmisc_feature(77)..(77)n is a, c, g, or t 480cctgtctcct acatttagcc
aatgaaaaga atctaaaact ggaaggaaca gaggacctct 60ctgatgttct tgtgagncaa
ggagattgag ttcactatgg agaagtcagc agcaggaggc 120ccatccctta ctcagttgcc
gggacatccc cagtctcggg ggaagaagat gccatgggct 180tatacccagg ctgtagccaa
ctaccaacgt gcctgtttgt ttgttgctct ttccttctct 240ccatcatagt ctgggtgcca
gcgccctgaa gctccgtgct caactgatta aactttactg 300ccctatggtg accatctagg
agaggggagg gcagaggggg tgagggtact attctggatt 360gagaaaacct atatccattc
tttatatcaa tgtatagttt tagtctccta aattgatctg 420ttattttcca aactattctc
ttgtagaaaa ttttccagtg ggcacttaat ggtgcccttg 480aagaacttcc ta
492481501DNAHomo
sapiensmisc_feature(197)..(197)n is a, c, g, or t 481ggagggagag
gtccctgcaa ggtcccttcc cgggcagggg agggatggaa atgccgtcac 60agtagtaggg
actggagcgt ctacaaggat ggaggggagc tactcaggcc taacgttagc 120tacaaggaaa
aaggacgcct tccgtgacag atccttgagg tgtctgtgtc tgccccaagt 180ggccggcagt
ggccttncct ccgggcccaa ggcctgcagc cacctgctct aactcttgag 240tgggggngcg
gggggggacc tgcaggggct cggggacagg acagcagcaa gaggcagggg 300ccgaggacgg
aggccttccc gacagtgggg tgggttgtac attcaagtgt gaggtgaacc 360ctttggtggg
gagggggccc ctgaagcctc ggcggggcca cccctccccg cggcgcctct 420gagtctaggg
agaggggctg ctggctcggc ccggccggcc tggcttcaca gagggtctgc 480ggattgacac
tggttctttt c
501482490DNAHomo sapiensmisc_feature(120)..(120)n is a, c, g, or t
482gtgaggagct gttttcatct gtgtctgttg gagatcaaga tgattgctat tccctgttag
60atgatcagga cttcacttct tttgatttat ttcctgaggg gagtgtctgc agtgatgtcn
120cntcttctat tagcacttac tgggattggt cagatagcga gtttgaatgg cagttaccag
180gcagntgaca ttgccagtgg gagtgatgta ctttctgatg tcatacccag tattccaagt
240tcaccttgcc tgcttcctaa aaagaaaaac nagcaccgga atttagatga actcccttgg
300agtgcnatga canatgatga gcaggtggaa tatattgagt atctgagtcg gnnngtnant
360nntgngntgg ncnncnntac tgtcctgtgg tctagtgggc agggacctgg gggccatcag
420tggctgtagg acttttttac ccctctgttc ctggcctaaa tatgtgatgg gtatgcttca
480ccttaagtgg
490483231DNAHomo sapiensmisc_feature(63)..(63)n is a, c, g, or t
483ctttcacact gtggcagccc agtgaagcag actgggccat gaactctcct agccctgggg
60ccnagcctgt tccacaggca cccctgcagg aggcgctgcc aggagagcct tccatctcgg
120ggctctttga ggttccctcc ttctgggtgt tcttcaggct gagcagagag gctcctgtac
180cctctctctc ggaatctgaa gagccagatt taggccgggc aaaggggctc a
231484414DNAHomo sapiens 484ggtgctggaa aaactactat cttgtttaag ttaaaacagg
atgaattcat gcagcccatt 60ccaacaattg gttttaacgt ggaaactgta gaatataaaa
atctaaaatt cactatttgg 120gatgtaggtg gaaaacacaa attaagacca ttgtggaaac
attattacct caatactcaa 180gctgttgtgt ttgttgtaga tagcagtcat agagacagaa
ttagtgaagc acacagcgaa 240cttgcaaagt tgttaacgga aaaagaactc cgagatgctc
tgctcctgat ttttgctaac 300aaacaggatg ttgctggagc actgtcagta gaagaaatca
ctgaactact cagtctccat 360aaattatgct gtggccgtag ctggtatatt cagggctgtg
atgctcgaag tgtt 414485508DNAHomo sapiens 485tcctctgtcc
tctatattca gcatgttcct tgtcagctgc tgggccggcc ctgccttgcg 60ctagcagagc
ctctcctggc agcttctcag gtctccctaa tggagacacc aggctactag 120gacactggct
ggggccaccc cctcctgcct aatgcctcac cttacagctg gggaaactga 180ggcctggaat
ggcccagagt caccaaggca aagttggggc tggtcccagc ctgaggctcc 240agctgatgcc
ctcagctccc agagaggggg tgccccatct agctgggtgc aggggtcact 300gcttgtcagc
tcagggccct gtgcccgctt gcctgttccc ctacatctgt gcctgcacat 360ccagaactgc
ctccttgccg ctgcctccag gaagcccacc ttgagccaga gtcaagggct 420gcagcactgc
ccgatagaac acgcccgccc tcactgctgt tcttgcctta cagccaccat 480gggaaagctg
caacctttct gttttatt
508486555DNAHomo sapiensmisc_feature(400)..(401)n is a, c, g, or t
486tgtcaacttg tcatatacac ctccagggac caaaaacaaa agcagctcgg agtctgtgtt
60gcctgattgg aaagtagaag ctctggtgta tgctacagca cataacacat ttttactaaa
120ggaaaaaagc taattatgtc catgcctctc gtaaaactgg ggggaacctt aaagagaaag
180aactaaggct taagttatct gtagtataat caattagaag taatgaatgg atgcatgtaa
240aatggatgtg attttttttc aagcttattt tgaaatctta aaaatcaggt tacaccatag
300ctactcaaaa gttttacaca cttaaaactc agatcagtaa gtgttggtac cttttagact
360cataaaattg aataaaccat tgcaatgctt taaaaaaaan naaaaaaaan ggttttattg
420ctatgatttt atggcagaca catccaagca aaaccatttt ccaaatgcag accttcctga
480tgttatctga aatctgataa aatgacccta ctctctgctg tggttcattc ttgctccatg
540ctgtccatat ttatg
555487541DNAHomo sapiens 487gtggcactta ggcactatat tattgatatc tacaatggcc
tcctggatgc acaaaagacc 60ctgaagggct tttttgatca gcaaaacaaa aacagaaaag
caaaaaacag ttaatttttg 120tttggtcaag tttactcaac cagaccacct tgataccaac
aatgctggag agcatttggc 180aagagcaggg ccacaatgcc aaattccttg gaaaggtaga
cttcctatga tactttcatg 240gattggcaaa tttgtggggt ttttttggta gtagcttttg
agaatgttag tttctggctg 300gggtagtgac ttacatctgt aatcccagca cttcgggagg
cgaaggcagg tggattgctt 360gtgcccagga gtttgagacc agcctgggta acatggtgag
accccatctc tatttttata 420aaattaaaaa aaaaaaaaaa gatagagaat gttactttcc
tataaagcca tgatacccta 480agtactaaga catgtctgtt gttgtccttt ccttcataac
atttctcata acccgtaatt 540t
541488523DNAHomo sapiensmisc_feature(86)..(86)n is
a, c, g, or t 488cagccctgac gtgaactcat tttattttgg ccaggaccca gaaaggagtc
tactgctaag 60atttcagcat gtcctgtggc tgagtnaatc agagttatga cagganggta
ccgggcacac 120catcgcaatg ctccatcaan gctagtatgt tgtgttcttt ccttcatatc
aagtcaactc 180aagcttgctc tacttacctg gtgtacacag tctaagaact gtaagaagac
tggagcaaaa 240ccactcccct gacagttgag ggtcaagctg ctcctctgac tgaatttgtg
accaaaagag 300agccactctt tttcaaccaa catctggaag ccttcaagtg tcctataaaa
gggatcactg 360agtaactgaa ccagggatgt cacctagggc ataagcagga tggattgtca
ttaattttag 420ttctgaaaaa ggcctattac taagataaaa gcacttcctt ctgatgatag
ctaattcaca 480aatttacctg gacagcaaat ttgttcacta accattccag gat
523489306DNAHomo sapiens 489cggctgtacg actccataat gggcatgggg
actcaagata aggtcctgat cagaatcatg 60gtctcccaca atgaagtgga catgttgaaa
attaggtctg aattcaagag aaagtatagc 120aagtccctgt actactatat ccagcaagac
actaagggtg ctgtacctgt gtggtggaga 180tggctgaagt ccgacacagc acgagcgtcc
agaaatggtg ctccccatgc ttccagctaa 240caggtctaga aaacccgctt gtgactagca
gtccctgtgg ctgttcctgt gaggatgacg 300ttagca
306490170DNAHomo sapiens 490agaagattcc
cttgaagcct tctccttcca aaaagtttcg gtctggctca tctttctctc 60ggcgagcagg
ctccagtggc aactcctgca ttacttacca gccatcggtc tctggggaac 120acaaggcaca
agtgacaaca aaggcagaag tggagccagg cgttcacctt
170491532DNAHomo sapiens 491tgggggtgac tgctgcttat taagatgatt catttcattt
ccactcgtgg ttgtgatttt 60caccttctca aaactgagtc agcaagagaa aatcttgtct
tagaagggcc agataacact 120tcgctgtgag aacaggaggg ataatggatt ggagatggct
atgtgtaaag cagccctgcc 180tgctgattta acacactttc aaaatagatg tgtcagtatt
catttaaagc aagactctga 240tgacagaagg aaccttgaaa actacctgat attgaaatgg
ttgtgccctt tatagccctt 300ttgcatctcc ttgactttcc agtcatgcct cctaaatcag
aagaaaagct gcaaagaaaa 360tgttttgtgt ggttctgggc ttatttgaat aatgttcatg
accacaggct gccatagcac 420aagtgagaat ttcagaccac aagggtttaa ggagcagtgc
tctcttctct caaagctcag 480aacggtctct ggatccatgg tatcgtacac ccagtgtgga
tattaacatt ct 532492559DNAHomo
sapiensmisc_feature(232)..(232)n is a, c, g, or t 492aagggaagtt
agcaccttcc tcttggaggt gctgaggatt aaatgagata atacgtggaa 60agcattaggc
atgtagcaca gttagcagat ggtggttggc tccctctgct tttccatcag 120tctgtggcct
agtttaaatg gtgggaggaa gggtgtgaga tttaaggctg gttgtaaggg 180atcagtcagt
gtagttggaa aaattgtaag atgaagttat aggatataga cncaaacctt 240cctggaaggc
cagaaagtnt gcatagcttc aataaaggat ttggctgaaa gcagcgtaat 300cccctttacc
ttgagttgat agcaatagag caaataacat gggaacgtgg gggagtttat 360tgaatagctt
gtttactcat gtggtcctaa gaccaacctt tgattatcca cgggtgcatg 420attgctctct
actcggtggt cggcaaattt aattacccac aggtgtgttg actcaaagcc 480tctgtcatta
aatctatgct gaataaatgc cgtcaggcca gctagtcaag gtgcacaact 540ctttttgtgc
gtggtgtgg
559493287DNAHomo sapiens 493gtaagtctca gtcctttaaa actcagaaaa aggtgtgttt
tccaaattta atatttcctt 60tctgtaagtc tcagtgtctg cactatttgt cttggagact
taaaattatc ccttgaaagc 120ataagaagta caccccaaac cagctttgtc cttcctgtcc
tcttctagtt tacattttat 180gtggttagta attttgtacc taaaagtatt tgaaattcta
taaatttgga cttgacgtga 240gcaaaagaaa atttctacgt aagcgaaact aataaaacta
cagtcac 287494476DNAHomo sapiens 494ctgtggcatc
tataacctga gttcagtcac ttaataccga ggtcctgcgc tctgctgtgt 60gcctggccct
gggctgggca ctggggacat agcagtgacc gagacagaca ggctcacaag 120gagacatacg
acaaccaggt aaacatggca gacaagagca tgtcagatgc gctgtgaaga 180acactgcggg
gcccctccta ggaggtggca tgagttacat gcagacagag acgatccggg 240ggcagacgga
gttccatgtg gggcagtggt gagggcagac gctctggggc tgggatccct 300gggagtgttc
gagaagcacc gagaaggctt ctgtggctgg agccggccag ctgggggaga 360tggggccagg
gagatggcag gggcctctcc ctgtcccagg acccagagcc aagggaggct 420ttaagcccag
gaccaggggt ctgaaaacga aaagcactca cagtccttga acattg
476495542DNAHomo sapiens 495ggcaacctcc tggacaagga cgacctggcc atcccacccc
ccgattacgg cgccgcctcc 60cgggccttcc ccgcccagac ggccagcggc ttcaagcaga
ggccctacag tgtggccgtg 120cccgccttct cccagggcct ggatgactat ggagcgcggt
ccatgagcag tggcagcggc 180acgctggtgt ccacagtgtg aggacgctga ccccgggcag
ccgctgctct gaagagcttc 240cgcgccttcc ccctggtctc gtccgttttc ctcctcagct
ctcgctggtt tgttcttggg 300ttgtttttct tttccacctg ccccatgcct tttggttggt
gaccccagac tctgtgatcc 360cccagggtcc atggtgctgc tccatccgcc ccccctcccc
tgtgtttacg cgccccatcc 420tgtgtgtccc agccttttga gcagaaactg ccaggcagga
cctgctgggc cgtgcggggc 480accctcggcc tcaccctgca gtgtctgtgg cactcactgc
ttttctaagg ctcgccgtga 540gc
542496438DNAHomo sapiens 496gagaggtatt atcgagacat
tgcaaagatg gcatccatca gcgaccagga catggatgcc 60tacctggtgg agcagtcccg
cctccacgcc agcgacttca gcgtcctgag tgcgctcaac 120gagctgtatt tctatgtcac
caagtaccgc caggagattc tcacggctct ggaccgagat 180gcctcttgtc ggaagcataa
gttgcggcag aaactggaac agatcatcag cctcgtgtcc 240agcgacagct aaggtggtgg
aatcggtgag gagggggctt ctcagtcctg tgccgtcctc 300ccatccaggg gagtggctgg
ctcaagcctg ggtccccggg ctgagccctg gattgggtat 360cgtggggcag gtcaccctgg
ccacgatgcc cccggcacac ccaggccccc ttcattagtg 420ccttgctttg ggccctgc
438497419DNAHomo
sapiensmisc_feature(248)..(251)n is a, c, g, or t 497taagttctca
tccaacattt ctcctggcca tccattctcc atctttaaag gcaatcacca 60ttgccagttt
cttctgtatc cttctggaaa tacaatatat tacataaatg acagcattct 120atattctctc
ttctatatct tacctatttc tgtgaataat ttattttgga cagcatttta 180tgtatgaata
ttcacaaatg tgcttcctta tttcagaggc tgaactaata aaaattttgt 240ttattttnnn
nttgaggcaa tatttttata tggtacccta atctttaata cttaacctgc 300cagactttaa
ccgtaacaca ataatgtatt gccaaatagc accattcttc ttctctcact 360ctcttgccat
gggggctctt aaaaaaaaaa gtatacatct aaggtgtaca acatgctgt
419498477DNAHomo sapiens 498accagtttac ctaggccttg gactgccaaa tagctacaca
actgcttaag ctggcctata 60aggacagacc agagacaaag caagaagatc attggtccag
actgagaaga aagttgccag 120agggatgtct ccactaaggc ctttgagcag ggattaatgc
tgtcaccacc ttggtggaga 180acaagaaagc tcagctggtg gtgactgcac gtgacaatgg
atctcataga gctagctgtc 240ttcctgcctg ccctgcatca taaaatacaa agggaagaga
agactgggat gtctagtcca 300caggaagact tgcaccactg tcgccttcac acagattaac
ttggcagaca aaggagcttt 360ggctaagctg gtggaagcca tcagaaccaa tgacaatgac
agacaggatg agatccactg 420tcactaggga ggcaatatcc tgggtccaaa atctctggct
ctcattgcca agctgga 477499366DNAHomo sapiens 499tgagggaggg
atgtgcctct ggccacgtgg ttaccttgca gtgcacagcc tgtggtcata 60gaaggggcta
cagctcacgc atcgtgggtg gaaacatgtc cttgctctcg cagtggccct 120ggcaggccag
ccttcagttc cagggctacc acctgtgcgg gggctctgtc atcacgcccc 180tgtggatcat
cactgctgca cactgtgttt atgacttgta cctccccaag tcatggacca 240tccaggtggg
tctagtttcc ctgttggaca atccagcccc atcccacttg gtggagaaga 300ttgtctacca
cagcaagtac aagccaaaga ggctgggcaa tgacatcgcc cttatgaagc 360tggccg
366500537DNAHomo
sapiensmisc_feature(193)..(193)n is a, c, g, or t 500gaacaatcgt
cttttgaact tccagtaggc ccacagttgt tggttgttcc tcaaaacagg 60ttgtggctcc
tgttgaataa gatgatccat taaaaactga acaaggttga ggagaaatag 120tgcttacgtt
gaaaaatctt taagtctttg tccccgttct ctaacttcct tacgttttcg 180tttatttagc
tcnatcccca ctatctactn gaatttctca tatttaaacc aagatgggag 240actaggtcat
taggaaaata ttaccgtcta caattttctt atactttgat ctgtctttta 300tttgattgta
agttgctgat ggacagtgat cattagaaac tgaattttgt ataatactag 360ttttatatga
aactagatnt ttattgcgct caggttatgt tccttttacc tccttcctta 420ataaagagac
cacttgaaat aannanannn nttccaagta ctgtctgcac cttatcccac 480ctctttccca
tttatgagat agtgcaaaac cctagcacag tcttttccat ttagtaa
537501332DNAHomo sapiens 501aagtatctcc atacaaaata cggttgaatt acaaaaagaa
aattgtaaca ttagcatgga 60caaacctggc aggtactcct taactctcct aagtaataaa
aactgtaaaa tgcaaataag 120ccttcgatga catttactaa cctttactaa agtatcaatg
atgacttggt tgtttaaaca 180gctgacattt gggcaatttg agtatgtcaa actcaataat
actggttttc atttgcaaga 240tccacttaaa acttaaggag gccaaaaaac atcatttaaa
ataccctata aattataatc 300atacatatga tacgaaaaat atcctacttc ag
332502375DNAHomo sapiens 502agggtaactt ccagtgtcac
aatgagcagt tctgtaagtg ggtgcctctc agcacatttc 60tatgaatata ttatgtagat
aggctgtatt gattttggta gcattgacac cttcttaggc 120aattagttga agaaaactgc
aaaatatttt cttatgtaat agctgtatag agcaatagca 180atcaaagcat gagaaggcac
taacgctggg atgaaagatg agattcagag gtgactgaga 240atcatgtgag tgatggctgt
atattttgtg taaaatatat gtgtgaaaat gaactaagag 300tgagttactc agcactctca
agaattatgc agattctgca tttttcttat gccgtgtgcc 360taaaaaccta cttga
375503468DNAHomo
sapiensmisc_feature(30)..(30)n is a, c, g, or t 503gggacaggat gaccttcccg
aggaactcan tggcctgggg tagtttaaga agtaatgttc 60tttctttctt tctcttttcc
ctacctcctg ctaacccaac cagagatccc cttccttgct 120gagagggttg ggggcaggag
gagatttggc agtgcctgca ggttgcctgg ccaggtggag 180agggggaaag aggaagggca
ccgtgggtgt aagatgcctt tctcctccac ccatcgaaac 240cagccacccc ttccctgtgc
caccaagaca gccttttcca gtggccatcc taaggggaac 300tcccaaatgg gtgttgctgg
tggacacaga tgctcccccc aatggaagcc ccaagctctg 360aggtatgcgg gtagaggctt
tggataggtt ttcttctgct cccctctttt atagatctag 420gctgcttggc tgcctgtctt
tctaggcagt ccccctagag gaaaaatg 468504484DNAHomo sapiens
504accccaccac gtaccagatg gatgtgaacc ccgagggcaa atacagcttt ggtgccacct
60gcgtgaagaa gtgtccccgt aattatgtgg tgacagatca cggctcgtgc gtccgagcct
120gtggggccga cagctatgag atggaggaag acggcgtccg caagtgtaag aagtgcgaag
180ggccttgccg caaagtgtgt aacggaatag gtattggtga atttaaagac tcactctcca
240taaatgctac gaatattaaa cacttcaaaa actgcacctc catcagtggc gatctccaca
300tcctgccggt ggcatttagg ggtgactcct tcacacatac tcctcctctg gatccacagg
360aactggatat tctgaaaacc gtaaaggaaa tcacaggttt gagctgaatt atcacatgaa
420tataaatggg aaatcagtgt tttagagaga gaacttttcg acatatttcc tgttcccttg
480gaat
484505277DNAHomo sapiensmisc_feature(136)..(136)n is a, c, g, or t
505ctgcacagtc tccagtgtgg aaagctgtgg gaaaggaagg agcaggttct aggtcttcag
60gattttctgc atcttaaagc agctcatctc ctttgccctc ctagggagca ggggggccta
120gctttgggat cgtccnccta gcctcagaaa taattgttca agaaataaca tttctcacac
180aaaggataaa tgtttgaggg gatggatacc ccatcttcca tgatttgatt attacacatt
240gcatgcctgt atcaaaaatc tcatatatac acctact
277506515DNAHomo sapiensmisc_feature(380)..(380)n is a, c, g, or t
506gggggtgatt agtatgttgg gacaaacacg ctgttgctaa atggaaacac tgacctcaca
60gtgcatcctc ctgccaacac acacacacac acacctctca cacatgcacg cttacacaca
120cacacacaca cacacacaca cacacacata cacacacaca cacacacgct ctctctctct
180ctctctctct ctctctgtca gtgtgttatc ggtgtggagc ggaggccgcg gaggctcctc
240ggtccttcag cacccctcgg cccgacgcac ccacgcccct caccccccga gagccgaacg
300ctccccgcac cgcccccggt cccttccctc ggccgggagc gacttctgca gctcgttctt
360ccgaatcgca ccagcaatgn cggccagccg tagagggagg aagagcccgg ggagcccgag
420catagcgtaa acggctctct gaccttaatt tcatcctgca tggcgaatct ctgccgtctc
480tctgaacgca gaagggtctg agactggccg tctcc
515507259DNAHomo sapiens 507ttcagtttat actcaaagcc ctgcagtttc ctgacagcac
agagcacacc tgtcacgcga 60gcaggatgaa gcccagaggc tgcctggtga agtgggcggc
gcgctggaaa atccacgtag 120ctttgttccc tccacgggga gcgtgcaagg ccctctcgag
cactacggga gcctcgcctt 180ctgcacagac ttcggagcca ggtgctggag cggcagcaac
tgaggggcgt ggatgtcttt 240gcatggttcc catacgttt
259508285DNAHomo sapiensmisc_feature(189)..(189)n
is a, c, g, or t 508atagcagtgg actgtcactc atcagtatct gcagttctgt
ttaccaaagc ctgcttgcta 60gagacgtttc agggcctcct tccctcaaag cgtccactgt
acctccatct ggatacaatt 120agctggctcc ccacttcctg gactgacggt aaccaccttt
tccaatgacc ctgaagaaaa 180catgcaatnt aagctgcttt aagagtaacc tacaactgag
gacaaatttt ctatcaactc 240ccagtacccc tctctgccgt ggctgatttg ttactggttt
tcctt 285509274DNAHomo sapiens 509gaggtgcatg
ggatcaatgg gacccaatgg ggccagactc tgaggatggg atggtagtag 60tgaaggacat
aggatggggg tagagtgtgg agactttttg aaatagtata gatgaatgcc 120ctgaggggac
tgtgaacaag ctctgcccct cttaggaaat caatggggaa tcaactaaat 180taaataaaaa
atggggtcaa gattaagagg cagggtcacc cagggaatgg tttaggtcct 240ggcaactctg
aaggggttgg aagggctggc agga
274510470DNAHomo sapiens 510gcgtgggttt ttgtatccag agctgtttgg atacagctgc
tttgagctac aggacaaagg 60ctgacagact cactgggaag ctcccacccc actcagggga
ccccactccc ctcacacacc 120cccccccaca aggaaccctc aggccaccct ccacgaggtg
tgactaacta tgcaataatc 180caccccaggt gcagccccag ggcctgcgga ggcggtggca
gactagagtt tagatgcccc 240gagcccaggc agctatttca gcctcctgtt tggtggggtg
gcacctgttt cccgggcaat 300ttaacaatgt ctgaaaaggg actgtgagta atggctgtca
cttgtcgggg gcccaagtgg 360ggtgctctgg tctgaccgat gtgtctccca gaactattct
gggggcccga caggtgggcc 420tgggaggaaa atgtttacat ttttaaaggc acactggtat
ttatatttca 470511193DNAHomo sapiens 511gaaaatgaat
tccatgttct tgaaggaaag actgtaacta tgtacattca tgatgttcct 60ttggtgtgtg
gtttctgtga gtaacaggta gatgtcattt ctggaaatgg tatgtttatg 120tctatacatt
gttttataaa actccatgga gaaagaaggg gtttacttgc tttgtatcac 180atagcaataa
cat
193512452DNAHomo sapiens 512ctggcccacc caggaacagt gagggcgacg agaactacat
ggagttcctc gaggtgctga 60ccgagggcct tgagcgggtg ctgttggtgc gcggtggtgg
ccgtgaagtc atcaccatct 120actcctgagc ccagtgtcat cttgtggcct ggagtcgagg
tcttggccag gacataacaa 180gctgtggtct ggggtaacag cctcttccca gcacccacct
gccagccctg cttgcctggc 240cctgtcctgg acccagcttt gctaggtctc cttggaaacc
aggcctgggc ctcaaaatgg 300agatggatcc caggtcttgt gggaccctgg gatgtttggg
gactttacta tctagcaccc 360cagtaggcct gtcctggcca gagaagactg gtaggggccg
agtggggttt gaaggcagcc 420ggcccggccc agcccaggag cgctatttat tg
452513411DNAHomo sapiens 513ttggaggcct ttgcagcggc
ctacaaaggc acgcggccgt ttgccagtgc caacagcgtg 60ctggacccca tcctcttcta
cttcacccag aagaagttcc gccggcgacc acatgagctc 120ctacagaaac tcacagccaa
atggcagagg cagggtcgct gagtcctcca ggtcctgggc 180agccttcata tttgccattg
tgtccggggc accaggagcc ccaccaaccc caaaccatgc 240ggagaattag agttcagctc
agctgggcat ggagttaaga tccctcacag gacccagaag 300ctcaccaaaa actatttctt
cagccccttc tctggcccag accctgtggg catggagatg 360gacagacctg ggcctggctc
ttgagaggtc ccagtcagcc atggagagct g 411514423DNAHomo
sapiensmisc_feature(110)..(111)n is a, c, g, or t 514tcgtttctct
gaacacacaa cacccatcgt cctcttttat gttacttgaa atatcaaaag 60aattattaca
gctgaaaaca aatctatgta aatcggatct tgaaagagan naagctttct 120ccagttttga
aaggcgccat ttttaacttt gatcttgtaa tgacaaataa gaatgttgaa 180tcggctggct
tttttctatc ctaggtaatg tggactgtgg agctctgtgc tggtcacttt 240caaccctgaa
cctgatgcta cttattttgc agttctaagt gcaaagtcgg cctggtggat 300gcttcccatt
ataatattaa atttgcttct tcgtgaggtc acacctcaca tccccagtgt 360cactttaata
actagtgttt tttacatggt gggccatgac ccattagtgg actctgcatt 420taa
423515230DNAHomo
sapiens 515ccctggcaag gcccgggaca ggaaggccta cacggtcctc ctatacggaa
acggtccagg 60ctatgtgctc aaggacggcg cccggccgga tgttaccgag agcgagagcg
ggagccccga 120gtatcggcag cagtcagcag tgcccctgga cgaagagacc cacgcaggcg
aggacgtggc 180ggtgttcgcg cgcggcccgc aggcgcacct ggttcacggc gtgcaggagc
230516426DNAHomo sapiens 516atgaccttcg aatgcatagg cctttaatgg
tgcagacaga ggaccagtat gttttcctca 60atcagtgtgt tttggatatt gtcagatccc
agaaagactc aaaagtagat cttatctacc 120agaacacaac tgcaatgaca atctatgaaa
accttgcgcc cgtgaccaca tttggaaaga 180ccaatggtta catcgcctaa ttccaaagga
ataacctttc tggagtgaac cagaccgtcg 240cacccacagc gaaggcacat gcccgatgtc
gacatgtttt atatgctaat atcttaattc 300tttgttctgt tttgtgagaa ctaattttga
gggcatgaag ctgcatatca tagatgacaa 360attggggctg tcgggggctg tggatgggtg
gggagcaaat catctgcatt cctgatgacc 420aatggg
426517448DNAHomo sapiens 517gagcaagttg
taaattgtct cttatcggac ttaaaagggt gcctggctct tacttagttg 60attatctcct
ggatctggaa agaaaggaag gaaaacaaag gcggaagggg aatctctata 120gaatgtggat
ttttcccaca agagactttg cagggcaatt tcaaggtatg gcacggaaat 180atattttggg
gttaaatatt tttttccttg tctcataatg ttatgccaga gtcagattga 240aaagtaaatc
acaacatata gggtcaaata aaacccatct gatgagaatg tgtggtttgt 300agggcatgac
ttcctagacc tcttaggtag gaatctgggt aagacagaat atcagactta 360gtcctcaatt
cctaatgcaa agttctgaga tccaaaatgc tccaaaatct aaaacatttt 420ttagcaccga
cataatgcca caagtgga
448518148DNAHomo sapiens 518aattaacacc aggaacagca ccttgaatat tcctttttca
agttcctctt cctcaggaga 60tattcaaggt cgaaacacaa gccccaatgt ttctgtacag
aaatccaatc ccatgaggat 120tactgagagt catgccacca agggccac
148519173DNAHomo sapiensmisc_feature(141)..(141)n
is a, c, g, or t 519gaaaatcaca actctaacca taatcatctg cactatatgc
ctcgcatcag gtaatgtgtc 60taaaataata agtaacattt agcatttctg accttatccc
aaagtatttt aatagtatct 120gttaatgttt taattaatgg nttttgtatt gcatctcctg
gataacaaag tag 173520441DNAHomo sapiensmisc_feature(26)..(26)n
is a, c, g, or t 520catgagtgtg agctgatttg cacccnanca ccctctgtaa
gtgcctgctg tggntttggt 60tttgattatt ccgttaatgc tgagtctgtt tcacaaacga
gattagcaga attaattatt 120gaagatgcag tatgctttat ggttttaata acactgttaa
aaactaaaca aggaagttaa 180atatgttgat gattatcggt gactgctcac cacacagcat
ccctcaggcc gagtcagttg 240gcccagtgac tcccacatca caaactgccc tttcttggtc
agaagaagca gagtggagcc 300ttctcatccc cacgcgcgca gctgtggggc cccgtggtca
cctggccaca tgggagtttg 360catactgagt ggttcatctt ttccaatgtg ttgtgtcctt
taatttacat ttatatttca 420ttgccctttc taatgatcag a
441521488DNAHomo sapiensmisc_feature(456)..(456)n
is a, c, g, or t 521tttgagttct gctctggcca atccccaagc tccacgctgt
cagccacccc gctctcctac 60ctcccagagg agcaggctac actcctgttc cttttagaga
gagaaatatt gcggccgggc 120gcggtggctc acgtctgtaa tcccagcatt ttggcaggcc
aagggttttg ccatgttcgt 180ggggctggtc tcaaactaat tacctcagat gatccgccca
cctcggcctc ccaaagtgct 240gggattacag ccgtcctggg ccgccggaca cccccgctgg
ggccgatgcc caacagtgac 300atcgacttga gcaacctgga gcggctggag aagtaccgga
gcttcgaccg ctaccggcgc 360cgggcagagc aggaggcgca ggccccgcac tggtggcgga
cctaccgaga gtatttcggg 420gagaagacag agttccagct tctaaaatat ttgctnctaa
aatcttgacc acctgacttt 480ccggattg
488522339DNAHomo sapiensmisc_feature(117)..(119)n
is a, c, g, or t 522aaaatggatc ctgtctttct tagccaagga ctggtctctt
ttctccaatg tgtccctaac 60agagtggtga ggctggctct tcccaccagt acaggaagat
cattccttaa aagaaannnc 120catatggctt ataagtgttc tttcctgtat gaagcccaag
ctgtccactt ggagagacat 180ctggccagcc ccccgttgtt ccagccatcc ccagttcagg
catcaganat gtggtgaaga 240agccatccta gatgcccagc cccagctacc atctgatgca
accacactgc tcaccccgag 300caagaactgc ctgcaggagc ctagtattat cctctctca
339523396DNAHomo sapiens 523gcggcagcaa ccggaaccgg
aactcgtcgc ggccaccacc actgagcgct gcggggaggg 60ggagcaagga ccggacgaga
cgctacgcct gaaaacaggc ggcgggcgag ggacgaggct 120taccacggca ccacgcgagt
ggaaagggtc gtctccgcta gcggcggccc acaccagctc 180accgaggggc ggcagcgcgc
ggcccggctg ccggaccgta ccatcccggg cggtggagcc 240gccgcggagg ggcgcgcgcg
agccgaaggc gcacccggga ggcccaggta gcccgggggc 300cggtgctggg gcgccgggca
ggcccggctc ccgcctcgac ccacccggag ccagccccct 360ctgcggacac gacatcccca
tggggacggt ggcgcg 396524194DNAHomo sapiens
524ccccacaggt gttcctctgt gagctggtcg ggcggccggg gccggggccg ggcttcgctg
60ctccgtgcct tccacctccc tggcggtgcg gggcctcagg gtgggcctgg gaagctggaa
120acacctttgg aaacagccgc ctgaggcagc tgtggacaga agaccctgcc cagcagccaa
180gggagctggc ctct
194525526DNAHomo sapiensmisc_feature(424)..(430)n is a, c, g, or t
525caagggcacg aggcagtacc tttgctccat gcctttgctt ggactagtcc taccaccagc
60aattcctgca tttctgtgtt tggcaagttt ctgctcagcc tccaaagcct taaccaagtg
120tcaccttttc tctgcagcat tttctgccac cctccccatt tcttccaata gaaccaggga
180tcttttactt gggatccaga agcactgtgg acatattgcc atcacaacac ctttcatgtc
240acaatggcaa ggtttgcact gtcttggagg agaggaagga agccatattc atccctgaac
300cctcatctcc cagcactggt tgtaaaactg aaacaaaaat ggaaaacctt gatgaaattc
360attgttggtg tggctatggg gaaacagatt ttccatttct gatagtaaat gaaataggca
420ccannnnnnn aaaaaaaaaa aananattat taacactgaa aatgcacaca tctttcaacc
480cagcaatttt atttcttgct ttctagagga atgtttgccc atgtgc
526526197DNAHomo sapiens 526cattattaat tataccaatc ctttcatata tgtagaaaaa
atgtttgagt tggtcatctg 60tcttttattg aagatgcatt tcaaatatca aatatatttg
aaagataaaa tagcatctgt 120gaaattgaat attattttat gtgcgcttgg ctatgcccta
aaatgtcagt ttattgtccc 180taaagacgta tttattg
197527275DNAHomo sapiens 527ggatgaacgg gtgggctgaa
gaacagctga atccaatagc ttggcagaac atgaagacag 60gtttgttttc cagattctta
aaactccaaa cttgatatta ttacagacac aaagtaaatg 120gcacataaca agaggaagga
gatcacagtt tgcaaaactt ttatgtggac cttggtactg 180ggatcttgag atcctttgcc
atggaggtgc atcttcttga gatgtttaca cagagaacag 240actaacagca gaaaagatat
cagggttaca gtaaa 275528496DNAHomo
sapiensmisc_feature(43)..(43)n is a, c, g, or t 528aataaatcct gcgagttcac
gcccgcgtag ttcgccccct ganttntnga ngcgactcct 60ttcgcatggg atctacaaaa
ccgaactgcc ttaaagacct ctttcacacg gacgtgaagt 120cacagaactg acaaaatccc
atcctgtcaa agtgcacggg tctttgaaat ctaacacaaa 180aagccataga aagattctct
aaacaccctg tactaagagg aacacggaca gggcactgcg 240ttctgaagta gaggccaggg
cactggccct tagacacgtc tcgctgtcac cgggctaaca 300acattggcaa gggcggcggc
agcagcactg atatttgcag cccccaaggg ctctggcgaa 360accccctcta ttactctgta
tcctgcctgc ttccaagatg aacctgttgc tgggaaagaa 420caggctaaat tagaaaaggg
agtattttgt caaagttgaa ggtgagtgat agcctgcccg 480cctcaaatag gatggg
496529524DNAHomo sapiens
529agcgcagtgg cgaggcgagt gtggaaggac tcctgaacca gctcgtcctg gagcacctgc
60agctggcgcc tctgcagtgg gatgtgctgg tggacggaca gccatgtgac cgcgaggctg
120tggcggcctg ccaggtgggc gaccccgtgc gcctggaggt gcggctgacc aaccggagcc
180cgcgcagcgt agggcccttc gccctcactg tggtcccctt ccaggaccac cagaacggcg
240tgcacaacta cgacctgcac gacaccgtct ccttcgtggg ctccagcacc ttctacctcg
300acgcggtgca gccgtccggc cagtcggcct gcctcggggc cctcctcttc ctctacacgg
360gagacttctt cctccacatc cggttccacg aggacagcac cagcaaggag ctgccaccct
420cttggttctg cctgcccagt gtgcacgtgt gtgccctgga ggcgcaggcc tgagcccgcc
480tacttccgtc cctctttctg cagggccaga ggtgaccctg cctg
524530497DNAHomo sapiens 530aggtcaatct cgtattctct atgtgatatt gctgacaaag
tcaaagtaag gaaagacata 60tcaagggaag gcaatggaag caccttttct ttatagtaca
ttcacctacc ttaacagacc 120aagataacat aggagagaaa ctggggctta agtccttgat
agagcttctg ggggcacagt 180agttataggg ccaggtcaga aaatgtcctc acacactaag
aaggcatttt aaaatcagaa 240aagacagtca cactcacttt ggtcaccaag tcatttagcc
atcctgtctg gaaagcatgt 300tttcctctgg ggtcttcctc tggggtatct tgggaaaggg
tagagttttg aggagctaga 360gaagagaaag aggtcatgag ggagattagt cctttctgaa
tagcctagga aacccctcac 420caaatagatg cctacacttt cttaaatcga gaagtaagaa
ggaaatcaaa aacagcactc 480ctacttcaaa gcatcag
497531253DNAHomo sapiens 531gtgaaaagca accaaaggca
acagagtcta gctcatggcc accagaccaa aagcatccag 60cttctgtgca cctcctgcaa
agctggcaga ggccctggaa ttccagatca cctgagggga 120aagggttgtc tctctccttt
ctgttggggg agggggatgg gggacttttg ttggtggctc 180ccacccatat atccctcctt
taccatagta ctcccaccca cttccatcac ccatccaata 240aaatgcagcc agg
253532567DNAHomo sapiens
532cacctcggtc accagtgtga accaagccag cacatcccgc ctggagggcc tacagtcaga
60aaaccatcgc ctgcgaatga agatcacaga gctggataaa gacttggaag aggtcaccat
120gcagctgcag gacacaccag aaaagaccac ctacattaaa cagaaccact accaagagct
180caatgacatc ctcaacctgg gaaacttcac tgagagcaca gatggaggaa aggccatttt
240aaaaaatcac ctcgatcaaa atccccagct acagtggaac acaacagagc cctctcgaac
300atgcaaagat cctatagaag atataaactc tccagaacac atccagcgtc ggctgtccct
360ccagctcccc atcctccacc acgcctacct cccatccatc ggaggcgtgg acgccagctg
420tgtcagcccc tgcgtcagcc ccaccgccag cccccgccac agacatgtgc caccctcctt
480ccgagtcatg gtctcgggcc tgtaagggtg gggggcctgg gcccggggcc tcccccgtga
540cagaaccaca ctgggcagag gggtctg
567533402DNAHomo sapiens 533cagtattctg taccatagcg ctgctcttat gccatttgtt
tatttttata tagcttgaaa 60catagaggga gagagggaga gagcctatac cccttactta
gcatgcacaa agtgtattca 120cgtgcagcag caacacaatg ttattcgttt tgtctacgtt
tagtttccgt ttccaggtgt 180ttatagtggt gttttaaaga gaatgtagac ctgtgagaaa
acgttttgtt tgaaaaagca 240gacagaagtc actcaattgt ttttgttgtg gtctgagcca
aagagaatgc cattctcttg 300ggtgggtaag actaaatctg taagctcttt gaaacaactt
tctcttgtaa acgtttcagt 360aataaaacat ctttccagtc cttggtcagt ttggttgtgt
aa 402534279DNAHomo
sapiensmisc_feature(178)..(178)n is a, c, g, or t 534tgcattgtac
ctgtagccat tccattgtga ataacacaaa aagtggagga aatatttttc 60tcgcatttgg
aaattattct gtgattcagc aaagaagttg ttcatgtcat taacaagttc 120agaaatacat
gctgccaaag ccaaaaagag tcttcagttt aataaaaata attaacanga 180aggtgagaaa
tggtttacca gctgttcact tactggattt aaggttactt gttggggaaa 240gagcagagta
agatgcaact ctgtcaaatc atggctgaa
279535354DNAHomo sapiens 535tagcaaagga catggaagcc tggaaagatg taaccagtgg
aaatgctaaa atttaccagc 60ttccaggggg tcacttttat cttctggatc ctgcgaacga
gaaattaatc aagaactaca 120taatcaagtg tctagaagta tcatcgatat ccaattttta
gatattttcc ctttcacttt 180taaaataatc aaagtaatat catactcttc tcagttattc
agatatagct cagttttatt 240cagattggaa attacacatt ttctactgtc agggagattc
gttacataaa tatatttacg 300tatctgggga caaaggtcaa gccagtaaag aatacttctg
gcagcacttt ggga 354536497DNAHomo
sapiensmisc_feature(302)..(302)n is a, c, g, or t 536ttccctgatg
actcacttac aagttagtga actccttgtt taagtattac aaactgcaca 60ccttctccct
tctcaatcta gcttcacatc aggccttcct gccaaagcgg caaacttgcc 120acatggggca
aggtactccc caagcagaca aggcccatct gtgtcatgag tgatacccaa 180tgctaatgcc
atgctctgaa atgtagtgcc caccttggct tcccaaagtg ctgggattgc 240agacgtgagt
cactgcgccc agccattcca tgtctcttaa gtctcagaat ctcccctagc 300tncnnncnng
nnncnnnagt ggttgtcccc tcaaagctgt cccacaccct cctncgagga 360ncctttgtgt
atctcctcca gctaccgcag agcccacaaa cccaggcatc tatcaaagtc 420cctcattcat
gagggtggtg aggacacaga ctgcgaccag aacagaaata tgaaaatgtg 480aatgacagcg
tcccccg
497537340DNAHomo sapiensmisc_feature(68)..(68)n is a, c, g, or t
537tggagttagc aaaccttttc atgcctgtga cctcactgga gttctttgat gttgggactt
60cataagtntg ccaaatcctg ncacttactg ctttatgacc ttgaccattt accgnttntn
120tntggacctc agtgttctca ggatgcaaaa ggagggtcag gggtaaaata gcgactttcg
180aactgtcagg ggtaaaatag cgactttcaa acttttcaaa cttctgggac aagggtgaag
240ggcaggactc tgcctctctc cttcccttca ccttattcca cttaaattgt gtgattctac
300aagcttatgt ttaaaggaat atgttcctcc attacaaaga
340538527DNAHomo sapiensmisc_feature(133)..(133)n is a, c, g, or t
538tgggaccacg ggcatttttg gcatgtacaa gggtataggt gcctcctact tccgcctcgg
60cccccacacc atcctctccc tcttcttctg ggaccagctg cgctccctct actacacaga
120cactaaataa canccgcttt cccagtcntc caccaaatga gcactccttg gccacttgtg
180cctccaccac tatgtcctgg tgactactga ttaggtgacc tttcatccat ccatggggga
240cagccaaccc cactccccat ctgttctcag ggttgaatca ctacaagaga tgagtttccc
300ttctttcctt gggtgttgct ttaaaccttc cctacccatt ccctgggtaa ctcacacccc
360tctctcaggg ctgaacgagt catcccaaag tgtatttcct cccactcacc actgccaccc
420ttgagtccct cctgctccca tgcacagttt taaactcctc cctccaaaac caaagggaat
480tgagagaccc aattcccagg cgtctgggac ccaggtgtcc tgttaga
527539532DNAHomo sapiens 539gacatgtttt ctagccttag ttccccatct acaaaatggg
cctcatggaa tggaatgtct 60ccacttcact ccagcatcaa caagtgggga attctgatgg
attcaattcg acttctttcc 120atgggcgtgt tctaagcagc ctctttgttc cagaagctgc
cctcagccag agttggataa 180gccaatcctc actccccagc ctcctctgga tagggatgaa
gaccccactg gggttggaag 240tgcagaggca gacaggtgta tggagtcacc tgtaaattga
ttcaagtgag ccaggaaagc 300agcaaaggaa agagaaacct gagtgacgac gtggtggagg
aacagggctg gaaagaggct 360gctggctgtc tggcttcgca gctctggcct cctaatcagc
ctcgctcttg tctctggtgt 420tctctggctc ttgtccatct gtctgtgttt ctttttgcca
gctattgact aatctttgct 480gaagctgagc tagaattctg gtgtttataa gcaggtaact
agctgagcac ta 532540811DNAHomo sapiens 540ctttgggagg
ctgaggcagg cagatcacct aaggccagga attcgacacc agcctggcca 60acgtggcaaa
acccgtctct actaaaaata caaaaattag ccgggcgtgg tggtgtgcgc 120ctggaatccc
agctacccag gaggctgagg caggagaaat gctggaaccc gggaggcaga 180ggctgcagtg
agctgagatc atgccactac tgcactccag cctgggtgac acagcaagac 240tccctctaaa
aaaagaaaaa aagaaaagaa aagaaaagaa aatgatatat ccatgatgaa 300ttaaaatgga
gtggaaccca ctgatgggaa agccacagaa ggtaccagtt atccactcac 360tgacttaggt
gcctccacta gaattctcag cacgtttttg cagaacctgg gcaacaagag 420cgaaacccca
tctcaaaacc acaacaacaa caacaggaca acagagatgg acgacggatc 480gggaaagcca
accagacagc gtgaggccag gacggaaaga ggcacaggga gctctgctca 540gtgtcgctac
aggggatctc tcaggctcac aacgggccac tcctctaggg aagttctggt 600ctcatcatga
tccttgtttg gtctcactcc ccatgtcctt ctctgtccct cctccaactg 660ccatttattt
atttaactga aaaagtacca atcacccaca taggcatgac atactcatcc 720atgtacccat
ttcttaaaat tgatcattgt taacatttgg tgtaatttgc tttatttatt 780tttaatgaaa
taaataaaac tttacagaaa a
8115413874DNAHomo sapiens 541aaaaaggtac aactaccttg ctgatgctgt acatatggct
cacttgtgcc cagagagaga 60ataaagccat gtcgaaacta tctacgattc cttgagtgtt
tttccagcta cctgccactt 120gcccacccac tcccctcaga tctcagttag aacatgacaa
ttgggctcat gaacaggatc 180ctgagtggtt gcaggtgaac aagcagttgg cacaagggca
aagtgatcac atcctgattg 240agtggctatg gacagccata cagactgtgt ggaacaacgc
tggtgaaata cccaaaccat 300ttagaagcag taatgcctca cttgcctggg actgggatgg
tgtggctgag actgccttac 360tggcagccag gtgcgctatt cagcagccac aagccctaca
agtaattaac caggggcacc 420tgtttgagct ggaggtgcat gtggccacag acggttttgg
ttgaggcttg tggcaatgca 480cagagcgcct aagaatgcca gtaggctttt ggtcccaact
atggaaagga gctgaactcc 540ggtattcatt gatagagaaa cagctagcag ctgtatatgc
tggccttcgg gctcatgaga 600gcatgacagg acaggctgca gtcatcatat ggacaactta
cccaataaca ggatggatgc 660gtctatgtgt aatgaccacc tggagtggga tagcacagat
gtccactttg gcaaaatggg 720gcgactcctt gcagcagtgg agtaagctga gtacaagtcc
catagcagca gagttgcaag 780aggtcttggg acgtgtagtc ctaatgcaag ataaggccat
gcggcctgag gcacccctag 840atcctgagtc ttcaccattt aaggaagggc atcccaggat
tcctgagggg gcatggtaca 900cagtagatga ggtgctactg ctgcctggac cactgttgca
gtccaaccta gtactgacac 960catatggttt gaaactgggt gcggacaaag tagccaatgg
gctgaactca gagcagtgtg 1020gatggtaatc accaaggagg tgacacctgt ggtaatctgt
actgatagct gggcagtcta 1080ctgaggctta accttgtggt taactacttg gaaaatacag
aattggctag tgagccacag 1140acccatttga ggccaagcca tatggcaaga cctttgggga
ataggtcatc aaaaagaggt 1200aactatttat catgtgtcag gccatatgcc tttggccacc
cctagtaatg atgaggcaga 1260tgccttggct aaggtcagat ggtcagagtc agcaccaaca
caagatgtga ccttgtggct 1320acaccggaaa ctgggacatg cagggggtaa actgatgtaa
caattcaata agtgttgggg 1380tctgtccctt cccaagcaag acatttgtga ggcttgtcag
aaatgcctgg catgtgttca 1440gacatatcct aaaaagaggc agctgcccgg tgttatacaa
caagtaacaa tagggtgagt 1500gcccttgacc aggtgggaag tagactacat cgggccgccg
ccaaagtcgc gagggtatac 1560gcatgcacta acggctgtag acatggccac aggcctgttg
ttcacctacc cttgcagggt 1620ggccaaccaa cagaacacca tccaggccct gcaacactta
tgttccctgt atggttgtcc 1680tctggccatt gagagtgata ggggaacaca tttcactgga
caacaggtac aatgatgggc 1740acagcaaatg gacataaagt ggggattcca tgtgccatac
agcccacaag ctgaggtatt 1800attgaatgat ataatgggat cttgaagaat ggattacgct
tgcatgtcaa acccctgtct 1860ttgcggagct ggagttccag gctggacctg gtgctccaaa
ccttaaatga atggccacag 1920aaaggtggcc cggccccagt ggaggctttg tttcactagg
ccaccacccc cattcaattg 1980gagatacata ccaaggatga cctcctccga tcaggtatgg
ggacaaatgg taacctgttg 2040ttgcctgccc caacaaccct gaaggcaggg gaacagaaaa
cctggctgtg gccatggacc 2100ctccaagctc tccactgctg gtggttggcc atcatagctc
cctgtgggga gggcctacag 2160tatgacttgc atgtcacttt ttgagtgttc aatacatggc
ttccaaggtt gactgtttgt 2220agaggaacag ccagggaagg aaccctcctc tgagggacat
atgtactatc tgatgggcct 2280attatgagct acgctgtgac tttggcatgg atacaggatt
ctaaggaacc atggagattt 2340gagaaggtgt ggtaccatca cccagggcaa aagcccttgg
tggctgcatt gttatccagg 2400gatggaaagt tagcctatat tttgcctgag ggatgtgatt
tacctctgtt agtacctgtg 2460cctgctctgt catttcaacc gtaggttaac atgctccaat
tgcattgtgg actgacccca 2520cacctatgct gaggtgacca atgtttccaa ctgttggacc
tgcactgcct ttccagcagc 2580agctgcagac agcttgcccc gacacataca tcccgtgtct
gcagagaact ggacatgcct 2640ggagacttga gatcccatgg ctgatgcctg gaacacaatg
tggcaagctt tggacaaagg 2700acacagcaag acccatggct ggaccgtagc attcgtgatg
agtggggctg gctagtaggg 2760gaacatgtag tagcccaagc ccaggcattg cagtgcacag
agcaacattg gggtaacagg 2820atgggtacct gtcacggcct gtgcaaacat aacatgtgtc
accacactga aggtatggta 2880gaacaagtgg cctcaccaag gtcggacccc aatggacttt
ttgcctcttg ggagcttatg 2940ggtctatgag gacacagtag cctttcctat cagcaaactg
gagtggatgt tgtatctggg 3000ggtggcctta tgtacctgct actgttctcc ccacattgcc
cagatgcctg tataactggg 3060aggcactgtg ctctcagttt ttgcgaatgt gatgagcccc
ctggtgtttc taccctttgg 3120caatgactat ccctggagca ggtgtcaaaa ctgtagaagc
acaatttact gctcttgcgg 3180agcacaccgc tcaggctctg aattacacct gagtgtccct
cctcctgtta atgaatgagg 3240ttgatcagat caaaaagtgg tgttgcaaaa ccaagtggcc
ttagacataa ctgctgccca 3300aggagccacc tgtgcccttt taggaacaca atgttgtacc
ttatccctga caatcagcag 3360aacataacag cagccctgca aaggggtctt ccaggagatt
aaggtgactg agagcctcac 3420tgtcaacccc ctgcagagat ggtgagcatc cctaggttct
ggcgtacatt gggccctaat 3480agtcataagt atcatagctg agatcctagt agtgagctgt
tgctctctgt attgttgttg 3540tgggttatgg actcagggct ccgccatata ggcatgtgtc
cctgcctgga ggacgccctc 3600agcctagggg gtgtagtgta agggaaatgg ctgtgcttta
gtcaggagta ggctgaggca 3660gccttctggt gcagcatgac tcagtgggtt tggagtgcaa
gcacacaacc ttgctcgtta 3720tgtaaccaca ccacatgagg cccattaggt aacaactcac
atgagctcgt gtttggctca 3780gagccactat tgtctgtaaa aggtatacct tgctgatgct
gcacatatgg ctcgcttgtg 3840cccagagaga gagtaaagcc atgttgaaac tgtc
3874542177PRTHomo sapiens 542Met Pro Val Gly Phe
Trp Ser Gln Leu Trp Lys Gly Ala Glu Leu Arg 1 5
10 15 Tyr Ser Leu Ile Glu Lys Gln Leu Ala Ala
Val Tyr Ala Gly Leu Arg 20 25
30 Ala His Glu Ser Met Thr Gly Gln Ala Ala Val Ile Ile Trp Thr
Thr 35 40 45 Tyr
Pro Ile Thr Gly Trp Met Arg Leu Cys Val Met Thr Thr Trp Ser 50
55 60 Gly Ile Ala Gln Met Ser
Thr Leu Ala Lys Trp Gly Asp Ser Leu Gln 65 70
75 80 Gln Trp Ser Lys Leu Ser Thr Ser Pro Ile Ala
Ala Glu Leu Gln Glu 85 90
95 Val Leu Gly Arg Val Val Leu Met Gln Asp Lys Ala Met Arg Pro Glu
100 105 110 Ala Pro
Leu Asp Pro Glu Ser Ser Pro Phe Lys Glu Gly His Pro Arg 115
120 125 Ile Pro Glu Gly Ala Trp Tyr
Thr Val Asp Glu Val Leu Leu Leu Pro 130 135
140 Gly Pro Leu Leu Gln Ser Asn Leu Val Leu Thr Pro
Tyr Gly Leu Lys 145 150 155
160 Leu Gly Ala Asp Lys Val Ala Asn Gly Leu Asn Ser Glu Gln Cys Gly
165 170 175 Trp
54321DNAArtificial SequenceOligonucleotides for PCR 543catgaacagg
atcctgagtg g
2154424DNAArtificial SequenceOligonucleotides for PCR 544tgaggcatta
ctgcttctaa atgg
245452863DNAHomo sapiens 545cacgccaaac acgcagcccc ctcccgctgg agtgacaact
ggccagcata ctctaggctg 60ttgtcccttt aaaacttgaa tccaaggggg taatgattta
tcaaacttgt attatcaaga 120aaatgtcaaa ccaagggcac cttgctttgc actgacgcaa
acccggcctt tcccaaggag 180atatagaaag cgcctctcct gcctgagcca aacccagtct
tgtcaatagc gggtttcacc 240ctccaccagt tcagtctgtt gcctgtgtca gacatggatt
gcagtgctcc caaggaaatg 300aataaactgc cagccaacag cccggaggcg gcggcggcgc
agggccaccc ggatggccca 360tgcgctccca ggacgagccc ggagcaggag cttcccgcgg
ctgccgcccc gccgccgcca 420cgtgtgccca ggtccgcttc caccggcgcc caaactttcc
agtcagcgga cgcgcgagcc 480tgcgaggctg agcggccagg agtggggtct tgcaaactca
gtagcccgcg ggcgcaggcg 540gcctctgcag ctctgcggga cttgagagag gcgcaaggcg
cgcaggcctc gccccctccc 600gggagctccg ggcccggcaa cgcgctgcac tgtaagatcc
cttttctgcg aggcccggag 660ggggatgcga acgtgagtgt gggcaagggc accctggagc
ggaacaatac ccctgttgtg 720ggctgggtga acatgagcca gagcaccgtg gtgctggcca
cggatggaat cacgtccgtg 780ctcccgggca gcgtggccac cgttgccacc caggaggacg
agcaagggga tgagaataag 840gcccgaggga actggtccag caaactggac ttcatcctgt
ccatggtggg gtacgcagtg 900gggctgggca atgtctggag gtttccctac ctggccttcc
agaacggggg aggtgctttc 960ctcatccctt acctgatgat gctggctctg gctggattac
ccatcttctt cttggaggtg 1020tcgctgggcc agtttgccag ccagggacca gtgtctgtgt
ggaaggccat cccagctcta 1080caaggctgtg gcatcgcgat gctgatcatc tctgtcctaa
tagccatata ctacaatgtg 1140attatttgct atacactttt ctacctgttt gcctcctttg
tgtctgtact accctggggc 1200tcctgcaaca acccttggaa tacgccagaa tgcaaagata
aaaccaaact tttattagat 1260tcctgtgtta tcagtgacca tcccaaaata cagatcaaga
actcgacttt ctgcatgacc 1320gcttatccca acgtgacaat ggttaatttc accagccagg
ccaataagac atttgtcagt 1380ggaagtgaag agtacttcaa gtactttgtg ctgaagattt
ctgcagggat tgaatatcct 1440ggcgagatca ggtggccact agctctctgc ctcttcctgg
cttgggtcat tgtgtatgca 1500tcattggcta aaggaatcaa gacttcagga aaagtggtgt
acttcacggc cacgttcccg 1560tatgtcgtac tcgtgatcct cctcatccga ggagtcaccc
tgcctggagc tggagctggg 1620atctggtact tcatcacacc caagtgggag aaactcacgg
atgccacggt gtggaaagat 1680gctgccactc agattttctt ctctttatct gctgcatggg
gaggcctgat cactctctct 1740tcttacaaca aattccacaa caactgctac agggacactc
taattgtcac ctgcaccaac 1800agtgccacaa gcatctttgc cggcttcgtc atcttctccg
ttatcggctt catggccaat 1860gaacgcaaag tcaacattga gaatgtggca gaccaagggc
caggcattgc atttgtggtt 1920tacccggaag ccttaaccag gctgcctctc tctccgttct
gggccatcat ctttttcctg 1980atgctcctca ctcttggact tgacactatg tttgccacca
tcgagaccat agtgacctcc 2040atctcagacg agtttcccaa gtacctacgc acacacaagc
cagtgtttac tctgggctgc 2100tgcatttgtt tcttcatcat gggttttcca atgatcactc
agggtggaat ttacatgttt 2160cagcttgtgg acacctatgc tgcctcctat gcccttgtca
tcattgccat ttttgagctc 2220gtggggatct cttatgtgta tggcttgcaa agattctgtg
aagatataga gatgatgatt 2280ggattccagc ctaacatctt ctggaaagtc tgctgggcat
ttgtaacccc aaccatttta 2340acctttatcc tttgcttcag cttttaccag tgggagccca
tgacctatgg ctcttaccgc 2400tatcctaact ggtccatggt gctcggatgg ctaatgctcg
cctgttccgt catctggatc 2460ccaattatgt ttgtgataaa aatgcatctg gcccctggaa
gatttattga gaggctgaag 2520ttggtgtgct cgccacagcc ggactggggc ccattcttag
ctcaacaccg cggggagcgt 2580tacaagaaca tgatcgaccc cttgggaacc tcttccttgg
gactcaaact gccagtgaag 2640gatttggaac tgggcactca gtgctagtcc agtggtgtgg
gatggtccag acttgatcct 2700gtttttcctc tctgcctcct cctaatgttt tccatagctc
tcctcccatt tttcttcatc 2760tttcttccta catcttggtt cacatccacg catgagagtg
attatgtaga aaagtaggca 2820tagtgtcgca tgctgcagta aagagctaca tagaccacct
gaa 2863546797PRTHomo sapiens 546Met Asp Cys Ser Ala
Pro Lys Glu Met Asn Lys Leu Pro Ala Asn Ser 1 5
10 15 Pro Glu Ala Ala Ala Ala Gln Gly His Pro
Asp Gly Pro Cys Ala Pro 20 25
30 Arg Thr Ser Pro Glu Gln Glu Leu Pro Ala Ala Ala Ala Pro Pro
Pro 35 40 45 Pro
Arg Val Pro Arg Ser Ala Ser Thr Gly Ala Gln Thr Phe Gln Ser 50
55 60 Ala Asp Ala Arg Ala Cys
Glu Ala Glu Arg Pro Gly Val Gly Ser Cys 65 70
75 80 Lys Leu Ser Ser Pro Arg Ala Gln Ala Ala Ser
Ala Ala Leu Arg Asp 85 90
95 Leu Arg Glu Ala Gln Gly Ala Gln Ala Ser Pro Pro Pro Gly Ser Ser
100 105 110 Gly Pro
Gly Asn Ala Leu His Cys Lys Ile Pro Phe Leu Arg Gly Pro 115
120 125 Glu Gly Asp Ala Asn Val Ser
Val Gly Lys Gly Thr Leu Glu Arg Asn 130 135
140 Asn Thr Pro Val Val Gly Trp Val Asn Met Ser Gln
Ser Thr Val Val 145 150 155
160 Leu Ala Thr Asp Gly Ile Thr Ser Val Leu Pro Gly Ser Val Ala Thr
165 170 175 Val Ala Thr
Gln Glu Asp Glu Gln Gly Asp Glu Asn Lys Ala Arg Gly 180
185 190 Asn Trp Ser Ser Lys Leu Asp Phe
Ile Leu Ser Met Val Gly Tyr Ala 195 200
205 Val Gly Leu Gly Asn Val Trp Arg Phe Pro Tyr Leu Ala
Phe Gln Asn 210 215 220
Gly Gly Gly Ala Phe Leu Ile Pro Tyr Leu Met Met Leu Ala Leu Ala 225
230 235 240 Gly Leu Pro Ile
Phe Phe Leu Glu Val Ser Leu Gly Gln Phe Ala Ser 245
250 255 Gln Gly Pro Val Ser Val Trp Lys Ala
Ile Pro Ala Leu Gln Gly Cys 260 265
270 Gly Ile Ala Met Leu Ile Ile Ser Val Leu Ile Ala Ile Tyr
Tyr Asn 275 280 285
Val Ile Ile Cys Tyr Thr Leu Phe Tyr Leu Phe Ala Ser Phe Val Ser 290
295 300 Val Leu Pro Trp Gly
Ser Cys Asn Asn Pro Trp Asn Thr Pro Glu Cys 305 310
315 320 Lys Asp Lys Thr Lys Leu Leu Leu Asp Ser
Cys Val Ile Ser Asp His 325 330
335 Pro Lys Ile Gln Ile Lys Asn Ser Thr Phe Cys Met Thr Ala Tyr
Pro 340 345 350 Asn
Val Thr Met Val Asn Phe Thr Ser Gln Ala Asn Lys Thr Phe Val 355
360 365 Ser Gly Ser Glu Glu Tyr
Phe Lys Tyr Phe Val Leu Lys Ile Ser Ala 370 375
380 Gly Ile Glu Tyr Pro Gly Glu Ile Arg Trp Pro
Leu Ala Leu Cys Leu 385 390 395
400 Phe Leu Ala Trp Val Ile Val Tyr Ala Ser Leu Ala Lys Gly Ile Lys
405 410 415 Thr Ser
Gly Lys Val Val Tyr Phe Thr Ala Thr Phe Pro Tyr Val Val 420
425 430 Leu Val Ile Leu Leu Ile Arg
Gly Val Thr Leu Pro Gly Ala Gly Ala 435 440
445 Gly Ile Trp Tyr Phe Ile Thr Pro Lys Trp Glu Lys
Leu Thr Asp Ala 450 455 460
Thr Val Trp Lys Asp Ala Ala Thr Gln Ile Phe Phe Ser Leu Ser Ala 465
470 475 480 Ala Trp Gly
Gly Leu Ile Thr Leu Ser Ser Tyr Asn Lys Phe His Asn 485
490 495 Asn Cys Tyr Arg Asp Thr Leu Ile
Val Thr Cys Thr Asn Ser Ala Thr 500 505
510 Ser Ile Phe Ala Gly Phe Val Ile Phe Ser Val Ile Gly
Phe Met Ala 515 520 525
Asn Glu Arg Lys Val Asn Ile Glu Asn Val Ala Asp Gln Gly Pro Gly 530
535 540 Ile Ala Phe Val
Val Tyr Pro Glu Ala Leu Thr Arg Leu Pro Leu Ser 545 550
555 560 Pro Phe Trp Ala Ile Ile Phe Phe Leu
Met Leu Leu Thr Leu Gly Leu 565 570
575 Asp Thr Met Phe Ala Thr Ile Glu Thr Ile Val Thr Ser Ile
Ser Asp 580 585 590
Glu Phe Pro Lys Tyr Leu Arg Thr His Lys Pro Val Phe Thr Leu Gly
595 600 605 Cys Cys Ile Cys
Phe Phe Ile Met Gly Phe Pro Met Ile Thr Gln Gly 610
615 620 Gly Ile Tyr Met Phe Gln Leu Val
Asp Thr Tyr Ala Ala Ser Tyr Ala 625 630
635 640 Leu Val Ile Ile Ala Ile Phe Glu Leu Val Gly Ile
Ser Tyr Val Tyr 645 650
655 Gly Leu Gln Arg Phe Cys Glu Asp Ile Glu Met Met Ile Gly Phe Gln
660 665 670 Pro Asn Ile
Phe Trp Lys Val Cys Trp Ala Phe Val Thr Pro Thr Ile 675
680 685 Leu Thr Phe Ile Leu Cys Phe Ser
Phe Tyr Gln Trp Glu Pro Met Thr 690 695
700 Tyr Gly Ser Tyr Arg Tyr Pro Asn Trp Ser Met Val Leu
Gly Trp Leu 705 710 715
720 Met Leu Ala Cys Ser Val Ile Trp Ile Pro Ile Met Phe Val Ile Lys
725 730 735 Met His Leu Ala
Pro Gly Arg Phe Ile Glu Arg Leu Lys Leu Val Cys 740
745 750 Ser Pro Gln Pro Asp Trp Gly Pro Phe
Leu Ala Gln His Arg Gly Glu 755 760
765 Arg Tyr Lys Asn Met Ile Asp Pro Leu Gly Thr Ser Ser Leu
Gly Leu 770 775 780
Lys Leu Pro Val Lys Asp Leu Glu Leu Gly Thr Gln Cys 785
790 795 54724DNAArtificial
SequenceOligonucleotides for PCR 547ggatttgcaa gttgtgtagt gtgc
2454821DNAArtificial
SequenceOligonucleotides for PCR 548aagcagatgg tcatcttcca g
215492426DNAHomo sapiens 549ctctttcaac
tcaagagctc agtcctgtgt ctctcatgga ggcgtctcta accaggaggc 60tactctttaa
agacaggcat tttacttgca gcaaaataat aggaaggaga ttcgcttgct 120ttgcacagag
gctgagccac aggagaaagc aaagccaatg tgatttattg aatgaaagca 180ctggacaatt
accaacaact tgttcctctg ctgcctcgaa cagcataaac tggaattgtc 240gtgtgaaaat
gacgcaacaa atgcaaaatt tacatctctg tcagtcaaaa aaacatagtg 300ctccctcatc
tcccaacgca gccaaacgcc tgtacaggaa cctctctgag aaactgaaag 360ggagccactc
ttccttcgat gaggcctatt ttaggacaag aactgatcgg ctgagtctca 420ggaagacctc
ggtgaatttc cagggcaatg aagccatgtt tgaggcagtc gaacagcagg 480acatggatgc
tgtgcagatc ctcctgtatc agtacacacc agaagaactt gacctcaaca 540cacctaacag
cgagggcttg acacccctgg atattgccat catgaccaac aatgtgccca 600ttgcaaggat
tcttctgagg acaggggccc gagaaagtcc acactttgtc agcctggaaa 660gccgagcaat
gcacctcaac acactggtcc aggaagccca ggagagggtg agtgaactgt 720ctgcccaggt
ggagaatgaa ggattcactc tggacaacac agagaaagag aagcagctga 780aagcttggga
gtggaggtat cggctctaca gacgcatgaa aacaggcttt gagcatgcca 840gagcccctga
gatgccaacc aatgtctgtc tcatggtaac cagcagcaca tcactcactg 900tcagcttcca
agagcctctt agcgtcaatg cagctgtagt aaccaggtat aaagtggaat 960ggagtatgtc
cgaagacttt tctcctttgg ctggagaaat catcatggat aatctgcaga 1020ctctgagatg
cacaatcaca ggacttacaa tgggccaaca gtattttgtt caagtctcgg 1080cttacaatat
gaaaggatgg ggacctgctc agaccacgac accggcatgt gcctctcctt 1140ctaactggaa
agactatgac gacagagagc ccagacacaa gggacagagt gaagttttgg 1200aaggtctgct
gcagcaggtc cgagcccttc atcagcatta cagttgccgg gaaagcacaa 1260aattacaaac
cacaggccgc aagcagtcag tctcaagaag cctgaaacac ctgttccatt 1320cctcgaacaa
gtttgtgaag accttaaaac ggggactcta catagccgtt atattttatt 1380acaaagacaa
tatcttagtc accaatgaag atcaagtacc aattgttgaa atagatgact 1440ctcacaccag
ttctattaca caagattttc tgtggttcac gaagctgtct tgtatgtggg 1500aagatataag
gtggctgagg caaagcatac caatatcctc atcctcatcc acagtgctgc 1560aaactcggca
gaagatgctc gcagcaacag cacagctaca gaatttactt gggacacaca 1620acttgggaag
agtttactat gagcccatta aagatcgaca tggaaacata ctcatagtca 1680ccatcaggga
ggtggagatg ctttattcat tttttaatgg caaatggatg cagatctcaa 1740agctgcaaag
ccagagaaag tctctatcaa cacctgagga gccaacagct ttagacattc 1800tactgataac
catccaggat attctatcct atcacaaaag gagtcatcag cgtctctttc 1860ctggattata
tctgggttac ctaaagctct gtagctctgt ggatcaaatc aaagttcttg 1920ttacccaaaa
gttgcccaac attctctgcc acgtgaagat ccgtgaaaac aataatattt 1980ctagagagga
atgggaatgg atccaaaagc tttctggctc tgaatctatg gaaagtgtgg 2040atcatacttc
tgactgcccc atgcaattgt tcttctacga gctccagatg gcagtgaaag 2100ctctccttca
gcagatcaat atacctctac accaggcaag gaacttccgc ctctacacac 2160aggaggtgtt
ggaaatgggt cacaatgtgt cctttcttct cctgctccct gcctcagacg 2220acgtctgtac
agccccagga cagaataatc cttacacccc acactcaggg tttcttaacc 2280tccctcttca
gatgtttgaa cttggtatag tagcttgttt cacctagaaa tattaaccca 2340gcctccttat
aataaaatca caaagttata tctgttcccc cttgtcccag tggagggtca 2400ataaatcaca
tgatggcttt ggcaac
2426550763PRTHomo sapiens 550Met Glu Ala Ser Leu Thr Arg Arg Leu Leu Phe
Lys Asp Arg His Phe 1 5 10
15 Thr Cys Ser Lys Ile Ile Gly Arg Arg Phe Ala Cys Phe Ala Gln Arg
20 25 30 Leu Ser
His Arg Arg Lys Gln Ser Gln Cys Asp Leu Leu Asn Glu Ser 35
40 45 Thr Gly Gln Leu Pro Thr Thr
Cys Ser Ser Ala Ala Ser Asn Ser Ile 50 55
60 Asn Trp Asn Cys Arg Val Lys Met Thr Gln Gln Met
Gln Asn Leu His 65 70 75
80 Leu Cys Gln Ser Lys Lys His Ser Ala Pro Ser Ser Pro Asn Ala Ala
85 90 95 Lys Arg Leu
Tyr Arg Asn Leu Ser Glu Lys Leu Lys Gly Ser His Ser 100
105 110 Ser Phe Asp Glu Ala Tyr Phe Arg
Thr Arg Thr Asp Arg Leu Ser Leu 115 120
125 Arg Lys Thr Ser Val Asn Phe Gln Gly Asn Glu Ala Met
Phe Glu Ala 130 135 140
Val Glu Gln Gln Asp Met Asp Ala Val Gln Ile Leu Leu Tyr Gln Tyr 145
150 155 160 Thr Pro Glu Glu
Leu Asp Leu Asn Thr Pro Asn Ser Glu Gly Leu Thr 165
170 175 Pro Leu Asp Ile Ala Ile Met Thr Asn
Asn Val Pro Ile Ala Arg Ile 180 185
190 Leu Leu Arg Thr Gly Ala Arg Glu Ser Pro His Phe Val Ser
Leu Glu 195 200 205
Ser Arg Ala Met His Leu Asn Thr Leu Val Gln Glu Ala Gln Glu Arg 210
215 220 Val Ser Glu Leu Ser
Ala Gln Val Glu Asn Glu Gly Phe Thr Leu Asp 225 230
235 240 Asn Thr Glu Lys Glu Lys Gln Leu Lys Ala
Trp Glu Trp Arg Tyr Arg 245 250
255 Leu Tyr Arg Arg Met Lys Thr Gly Phe Glu His Ala Arg Ala Pro
Glu 260 265 270 Met
Pro Thr Asn Val Cys Leu Met Val Thr Ser Ser Thr Ser Leu Thr 275
280 285 Val Ser Phe Gln Glu Pro
Leu Ser Val Asn Ala Ala Val Val Thr Arg 290 295
300 Tyr Lys Val Glu Trp Ser Met Ser Glu Asp Phe
Ser Pro Leu Ala Gly 305 310 315
320 Glu Ile Ile Met Asp Asn Leu Gln Thr Leu Arg Cys Thr Ile Thr Gly
325 330 335 Leu Thr
Met Gly Gln Gln Tyr Phe Val Gln Val Ser Ala Tyr Asn Met 340
345 350 Lys Gly Trp Gly Pro Ala Gln
Thr Thr Thr Pro Ala Cys Ala Ser Pro 355 360
365 Ser Asn Trp Lys Asp Tyr Asp Asp Arg Glu Pro Arg
His Lys Gly Gln 370 375 380
Ser Glu Val Leu Glu Gly Leu Leu Gln Gln Val Arg Ala Leu His Gln 385
390 395 400 His Tyr Ser
Cys Arg Glu Ser Thr Lys Leu Gln Thr Thr Gly Arg Lys 405
410 415 Gln Ser Val Ser Arg Ser Leu Lys
His Leu Phe His Ser Ser Asn Lys 420 425
430 Phe Val Lys Thr Leu Lys Arg Gly Leu Tyr Ile Ala Val
Ile Phe Tyr 435 440 445
Tyr Lys Asp Asn Ile Leu Val Thr Asn Glu Asp Gln Val Pro Ile Val 450
455 460 Glu Ile Asp Asp
Ser His Thr Ser Ser Ile Thr Gln Asp Phe Leu Trp 465 470
475 480 Phe Thr Lys Leu Ser Cys Met Trp Glu
Asp Ile Arg Trp Leu Arg Gln 485 490
495 Ser Ile Pro Ile Ser Ser Ser Ser Ser Thr Val Leu Gln Thr
Arg Gln 500 505 510
Lys Met Leu Ala Ala Thr Ala Gln Leu Gln Asn Leu Leu Gly Thr His
515 520 525 Asn Leu Gly Arg
Val Tyr Tyr Glu Pro Ile Lys Asp Arg His Gly Asn 530
535 540 Ile Leu Ile Val Thr Ile Arg Glu
Val Glu Met Leu Tyr Ser Phe Phe 545 550
555 560 Asn Gly Lys Trp Met Gln Ile Ser Lys Leu Gln Ser
Gln Arg Lys Ser 565 570
575 Leu Ser Thr Pro Glu Glu Pro Thr Ala Leu Asp Ile Leu Leu Ile Thr
580 585 590 Ile Gln Asp
Ile Leu Ser Tyr His Lys Arg Ser His Gln Arg Leu Phe 595
600 605 Pro Gly Leu Tyr Leu Gly Tyr Leu
Lys Leu Cys Ser Ser Val Asp Gln 610 615
620 Ile Lys Val Leu Val Thr Gln Lys Leu Pro Asn Ile Leu
Cys His Val 625 630 635
640 Lys Ile Arg Glu Asn Asn Asn Ile Ser Arg Glu Glu Trp Glu Trp Ile
645 650 655 Gln Lys Leu Ser
Gly Ser Glu Ser Met Glu Ser Val Asp His Thr Ser 660
665 670 Asp Cys Pro Met Gln Leu Phe Phe Tyr
Glu Leu Gln Met Ala Val Lys 675 680
685 Ala Leu Leu Gln Gln Ile Asn Ile Pro Leu His Gln Ala Arg
Asn Phe 690 695 700
Arg Leu Tyr Thr Gln Glu Val Leu Glu Met Gly His Asn Val Ser Phe 705
710 715 720 Leu Leu Leu Leu Pro
Ala Ser Asp Asp Val Cys Thr Ala Pro Gly Gln 725
730 735 Asn Asn Pro Tyr Thr Pro His Ser Gly Phe
Leu Asn Leu Pro Leu Gln 740 745
750 Met Phe Glu Leu Gly Ile Val Ala Cys Phe Thr 755
760 55121DNAArtificial SequenceOligonucelotides
for PCR 551agctctgtag ctctgtggat c
2155221DNAArtificial SequenceOligonucleotides for PCR
552aggcggaagt tccttgcctg g
215532281DNAHomo sapiens 553gcgggccgca gccagcgcac ccagaccctg cgctgccctc
ggacggccgg gcgcggagcc 60ccagctgcgg aggccgacgg cacccggccc cgagcgcctc
gacgccgagc cgcgcgcgcc 120ttctccgcca ggcccggcgg gcgggagcgg gggcgaggga
gcaggagcgg ccagtgcccc 180cgacaccccc ggcccggcac ccccggcccg gcatcccccg
ccgccgccgc cgccgcctca 240aggccgcccg ctccccgcag gtggacgcgg ccatgggccg
aggggtgcgc gtgctgctgc 300tgctgagcct gctgcactgc gccgggggca gcgagggcag
gaagacctgg cggcgccggg 360gtcagcagcc gcctcctccc ccgcggaccg aggcggcgcc
ggcggccgga cagcccgtgg 420agagcttccc gctggacttc acggccgtgg agggtaacat
ggacagcttc atggcgcaag 480tcaagagcct ggcgcagtcc ctgtacccct gctccgcgca
gcagctcaac gaggacctgc 540gcctgcacct cctactcaac acctcggtga cctgcaacga
cggcagcccc gccggctact 600acctgaagga gtccaggggc agccggcggt ggctcctctt
cctggaaggc ggctggtact 660gcttcaaccg cgagaactgc gactccagat acgacaccat
gcggcgcctc atgagctccc 720gggactggcc gcgcactcgc acaggcacag ggatcctgtc
ctcacagccg gaggagaacc 780cctactggtg gaacgcaaac atggtcttca tcccctactg
ctccagtgat gtttggagcg 840gggcttcatc caagtctgag aagaacgagt acgccttcat
gggcgccctc atcatccagg 900aggtggtgcg ggagcttctg ggcagagggc tgagcggggc
caaggtgctg ctgctggccg 960ggagcagcgc ggggggcacc ggggtgctcc tgaatgtgga
ccgtgtggct gagcagctgg 1020agaagctggg ctacccagcc atccaggtgc gaggcctggc
tgactccggc tggttcctgg 1080acaacaagca gtatcgccac acagactgcg tcgacacgat
cacgtgcgcg cccacggagg 1140ccatccgccg tggcatcagg tactggaacg gggtggtccc
ggagcgctgc cgacgccagt 1200tccaggaggg cgaggagtgg aactgcttct ttggctacaa
ggtctacccg accctgcgct 1260gccctgtgtt cgtggtgcag tggctgtttg acgaggcaca
gctgacggtg gacaacgtgc 1320acctgacggg gcagccggtg caggagggcc tgcggctgta
catccagaac ctcggccgcg 1380agctgcgcca cacactcaag gacgtgccgg ccagctttgc
ccccgcctgc ctctcccatg 1440agatcatcat ccggagccac tggacggatg tccaggtgaa
ggggacgtcg ctgccccgag 1500cactgcactg ctgggacagg agcctccatg acagccacaa
ggccagcaag acccccctca 1560agggctgccc cgtccacctg gtggacagct gcccctggcc
ccactgcaac ccctcatgcc 1620ccaccgtccg agaccagttc acggggcaag agatgaacgt
ggcccagttc ctcatgcaca 1680tgggcttcga catgcagacg gtggcccagc cgcagggact
ggagcccagt gagctgctgg 1740ggatgctgag caacggaagc taggcagact gtctggagga
ggagccggca ctgaggggcc 1800cagacacccg ctgccccagt gccacctcac cccccaccag
caggccctcc cgtctcttcg 1860ggacagggcc ccagccgtcc cccctgtctg ggtctgccca
ctgccctcct gccccggctt 1920tccctgcccc tctcccacag cccagccaga gacaagggac
ctgctgtcat ccccatctgt 1980ggcctggggg tccttcctga caacgagggg gtagccagaa
gagaagcact ggattcctca 2040gtccaccagc tcagacagca cccaccggcc ccacccatca
agccctttta tattatttta 2100taaagtgact tttttattac tttaattttt taaaaaaagg
aaaataagaa tatatgatga 2160atgatattgt tttgtaactt tttaaaaatg attttaaaga
gacaaaaaag aacctcaaaa 2220aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa 2280a
2281554496PRTHomo sapiens 554Met Gly Arg Gly Val
Arg Val Leu Leu Leu Leu Ser Leu Leu His Cys 1 5
10 15 Ala Gly Gly Ser Glu Gly Arg Lys Thr Trp
Arg Arg Arg Gly Gln Gln 20 25
30 Pro Pro Pro Pro Pro Arg Thr Glu Ala Ala Pro Ala Ala Gly Gln
Pro 35 40 45 Val
Glu Ser Phe Pro Leu Asp Phe Thr Ala Val Glu Gly Asn Met Asp 50
55 60 Ser Phe Met Ala Gln Val
Lys Ser Leu Ala Gln Ser Leu Tyr Pro Cys 65 70
75 80 Ser Ala Gln Gln Leu Asn Glu Asp Leu Arg Leu
His Leu Leu Leu Asn 85 90
95 Thr Ser Val Thr Cys Asn Asp Gly Ser Pro Ala Gly Tyr Tyr Leu Lys
100 105 110 Glu Ser
Arg Gly Ser Arg Arg Trp Leu Leu Phe Leu Glu Gly Gly Trp 115
120 125 Tyr Cys Phe Asn Arg Glu Asn
Cys Asp Ser Arg Tyr Asp Thr Met Arg 130 135
140 Arg Leu Met Ser Ser Arg Asp Trp Pro Arg Thr Arg
Thr Gly Thr Gly 145 150 155
160 Ile Leu Ser Ser Gln Pro Glu Glu Asn Pro Tyr Trp Trp Asn Ala Asn
165 170 175 Met Val Phe
Ile Pro Tyr Cys Ser Ser Asp Val Trp Ser Gly Ala Ser 180
185 190 Ser Lys Ser Glu Lys Asn Glu Tyr
Ala Phe Met Gly Ala Leu Ile Ile 195 200
205 Gln Glu Val Val Arg Glu Leu Leu Gly Arg Gly Leu Ser
Gly Ala Lys 210 215 220
Val Leu Leu Leu Ala Gly Ser Ser Ala Gly Gly Thr Gly Val Leu Leu 225
230 235 240 Asn Val Asp Arg
Val Ala Glu Gln Leu Glu Lys Leu Gly Tyr Pro Ala 245
250 255 Ile Gln Val Arg Gly Leu Ala Asp Ser
Gly Trp Phe Leu Asp Asn Lys 260 265
270 Gln Tyr Arg His Thr Asp Cys Val Asp Thr Ile Thr Cys Ala
Pro Thr 275 280 285
Glu Ala Ile Arg Arg Gly Ile Arg Tyr Trp Asn Gly Val Val Pro Glu 290
295 300 Arg Cys Arg Arg Gln
Phe Gln Glu Gly Glu Glu Trp Asn Cys Phe Phe 305 310
315 320 Gly Tyr Lys Val Tyr Pro Thr Leu Arg Cys
Pro Val Phe Val Val Gln 325 330
335 Trp Leu Phe Asp Glu Ala Gln Leu Thr Val Asp Asn Val His Leu
Thr 340 345 350 Gly
Gln Pro Val Gln Glu Gly Leu Arg Leu Tyr Ile Gln Asn Leu Gly 355
360 365 Arg Glu Leu Arg His Thr
Leu Lys Asp Val Pro Ala Ser Phe Ala Pro 370 375
380 Ala Cys Leu Ser His Glu Ile Ile Ile Arg Ser
His Trp Thr Asp Val 385 390 395
400 Gln Val Lys Gly Thr Ser Leu Pro Arg Ala Leu His Cys Trp Asp Arg
405 410 415 Ser Leu
His Asp Ser His Lys Ala Ser Lys Thr Pro Leu Lys Gly Cys 420
425 430 Pro Val His Leu Val Asp Ser
Cys Pro Trp Pro His Cys Asn Pro Ser 435 440
445 Cys Pro Thr Val Arg Asp Gln Phe Thr Gly Gln Glu
Met Asn Val Ala 450 455 460
Gln Phe Leu Met His Met Gly Phe Asp Met Gln Thr Val Ala Gln Pro 465
470 475 480 Gln Gly Leu
Glu Pro Ser Glu Leu Leu Gly Met Leu Ser Asn Gly Ser 485
490 495 55521DNAArtificial
SequenceOligonucleotides for PCR 555gagatcatca tccggagcca c
2155621DNAArtificial
SequenceOligonucleotides for PCR 556tagcttccgt tgctcagcat c
21557522DNAHomo sapiens 557atgtaagcaa
agtagtcatt aaaaatacac cctctacttg ggctttatac tgcatacaaa 60tttactcatg
agccttcctt tgaggaagga tgtggatctc caaataaaga tttagtgttt 120attttgagct
ctgcatctta acaagatgat ctgaacacct ctcctttgta tcaataaata 180gccctgttat
tctgaagtga gaggaccaag tatagtaaaa tgctgacatc taaaactaaa 240taaatagaaa
acaccaggcc agaactatag tcatactcac acaaagggag aaatttaaac 300tcgaaccaag
caaaaggctt cacggaaata gcatggaaaa acaatgcttc cagtggccac 360ttcctaagga
ggaacaaccc cgtctgatct cagaattggc accacgtgag cttgctaagt 420gataatatct
gtttctacta cggatttagg caacaggacc tgtacattgt cacattgcat 480tatttttctt
caagcgttaa taaaagtttt aaataaatgg ca
52255821DNAArtificial SequenceOligonucleotides for PCR 558actacggatt
taggcaacag g
2155924DNAArtificial SequenceOligonucleotides for PCR 559gagatctcga
gatctcgatc gtac
245602383DNAHomo sapiens 560cttttctctt gttgagtgca aatggagaac agctgctcac
gctcgtcgtc tgacatcagc 60tatttctcag gatgaccctg cgagacaggc cagggtcatt
agacccaatt tggttctcag 120caaatatgtg tttattcctg catgcgtggg ccacaggctg
gtttcttggg tgcaatgaat 180agctgcaggt ttattagggt gtctttttag atggatgtat
gtttcccgat gtctatagaa 240cactccggac cccggagagt gaagactctg cctgtcggac
ttgctttgag aagatccttc 300tccacctccc catggcagaa gttgcttcac agaggggaac
agttttatgg atgtggctga 360gaccttaaac ttgaggcaac ccatctgagg tggcatccag
aggagactgg ctggcccctc 420cttcaccttg gatgtagtgc tgtttctagg atctcttttc
aatcagcaaa acaggggatg 480ttccaagagg gtgtggattc cctgccatcc cacatggtca
agtggagggg acgggaaaaa 540gctatgaagg gtttgtgacc acacagactc tcctggcccc
ctgtcctttt ggaaagaaga 600cagggatgaa atataatcaa gcaattaacc acccccatca
tcaccaagaa caacagtatc 660aacaagaaga acagggacaa caaaacccac ggatgaaaca
ttcctttctc agctcagatc 720ttatctggtg cgttctctct ctgctctgtc ttggtgtgtg
gtttagagaa acatggacaa 780cgctgtttgg aagaacaggt gagcgagggt ggggaatttc
agaggcctgg gcccaccgcc 840tccacccctt ccccagttta acctttgaca ggatcttcac
ctctctctga tcagcattgc 900ttcttgttca aaggcctcag ccacccagct gtgtcccttt
ccccagaaag caagggcaga 960tggcagtggg tctgttgatg agagaacttt aagggcccaa
tcagtccctg ggcaccccct 1020cctgggctcg ttttctccag gaggctgcat tctgatccat
aaaccttctc ctcggggttt 1080agggtcgagc tgttcctgat gtttatcgga gactgggatc
aaagctatcc aggtcataaa 1140tctctctctg tggctgttgg gccccagggc agctgaagag
ggttgacagc cctttggacc 1200tcaaaggaaa aaatgtgctc tactccaccc actcccagct
ctgccaagaa gctgtcctct 1260gagaagccat ggctgggccg ttccattctg gggagctgct
gaaaagagct gggaggccga 1320gaagaacttg cgtgtgctgg gggagaggaa gcctggcctt
gagggagggg tgcaggtgtg 1380gctcctctgt gtgtgggggc tgggggacct tgtgtgcctt
ttccttgtgg ctgtgaaatg 1440ctttatgagt acttccatag gaggatggac agggagtcgg
ggagataaac tcagccacaa 1500ggccccaggg cctcaggaaa cttgcaccca accctctcat
tttacagaag aaaactgtgc 1560ctggaaggtt gaagggtttg ttcccagtca cacaaccagg
gatccttagg acagccagac 1620caggaaacca tttccaaact gccaagccat ggcagagtat
caagacctca ggaaccatcg 1680agacaccatg gaagcattgg gaaaagcctc cttagctttt
gaagctcctc attgttcttg 1740agtgtgcatg gagcccatga ctgcggggtt ttgtagacac
ctcagggatt acatgactgg 1800tacccctgac aaagtcaagg ctgctggaca aaatgagtcc
gaggatttca ggggcagctg 1860ggcgcaggag ctggtgggct gttgggagtg cccctttact
gggcaggctt ccttcctcct 1920ggtgatgggg ggttcctcag cacaaaagtg aaggggtgga
ggggctggag gagcaggaat 1980ctctcttgtt gataggtatg aggccttgaa gtccttttct
ttgtcccagg attcatggac 2040gcttcggggc tgatctttga gttttcaagc atggggtgca
gagacgttta ggtaaactct 2100taccgtcctc tctcttcgtc agggcttccc aggaatcaac
aatgcccaag aaggaaggga 2160ttgtagaaat agcttaaccc tttcatttac caacgtggaa
attgaagccc agggaaggga 2220agggaccggt cgtggaaggg agagccatca gcagaaagag
accctgagat cttcgcctgg 2280gattcccagg aagtccagcc cgagctgatt cacagaacaa
atgcatgcaa accttgctat 2340caataaatta cacatgcact tacgtaaaaa aaaaaaaaaa
aaa 238356121DNAArtificial SequenceOligonucleotides
for PCR 561cagagacgtt taggtaaact c
2156221DNAArtificial SequenceOligonucleotides for PCR
562taccaacgtg gaaattgaag c
215632336DNAHomo sapiens 563aaaaaaacct atctaaggag gagccaagat ggccgcatag
gaacagctcc agtccacagc 60tcccagcgtg agcgacgcag aagacgggtg atttctgcat
ttccatctga ggtaccgggt 120tcttctcact agggagttcc agacagtggg cgcagcaggc
ttttcttctt ggagctgaaa 180caggcgcact ggcgagggtt gtgaggaaag gtttgcagcc
ccctgccttg gtctgctata 240gccagtgaca cgatcaatac cagggtgagc agcagaggaa
gctcagggaa gattatgaag 300atgagaattc attacctatc aaagaaaatg cgtaaactct
agaagtattg cttgctcctt 360tgccacaaag tgtacattca agagtaaatt gtttaaagcc
aaagggcctt gcgccacgtc 420cttagcctca gctttctact tgctaaaatg gaaataacaa
cagtacctac cttacagacc 480tggcgtgagg attaaattat accagcaaag tgcctggcac
ctagaatttg cctgagttct 540gatcaatgct aaaaacacca tttaacagtg ccctttcctg
cctgggaagc ccacaaagat 600ttcgcttttc acttattact catcaaactg actctggttc
acagtggaaa agagaagaaa 660agctgatgga gaaatggcag tgaagaagga gaacaaaatg
tcagagcaat acttttgagc 720gacttatcct ctgttctgca atatttgcaa aagtcagtct
atagacgtga agccaacagg 780gaccctcagg cctgtgcatc aggatggtgg gctgcctaag
tcctctgggg acagcagtgc 840cagaaggagt attggacaca gtgacccgac tgtatgagat
gagaaaaaac aaaaacagga 900ggtccgcagt actgatgaac taatctgtca ctcacaagct
caggtctgca aaaaaaagaa 960acgaagcact aaacatggcc ataagagatg gaaatgcaag
tcttcatttc taaatgataa 1020tcaaaccaac gatcagaaac tgattaactg tgtaattgaa
ttgaattgaa aatcatccca 1080tgaataacaa tccatcctac cttcaagggg ttaggaagct
aactacaggt aattgctatc 1140agaaatctga tttgatttcc aaaaattgtg tgaatgaacc
aagtttcttc atcttgatat 1200actaggcagg gagtttgttc ttccaagtac tagactgctt
aattgcttgc ttgggggagg 1260agaaatccta ggggaaaggc atatatgagc aatttctact
ctgtgaagcc agcgctgtgt 1320cctgagctgg atcatggcca gaaacagaaa agtctactct
tccctacagt ggaagcaact 1380gtggatattt catcctagga gtgaatgaaa aaacctaaag
ctcatacttc atgggaatct 1440ttcaatattc tgactgaaaa ctggttattt gctcctccaa
cccaaagcca tctaggaaca 1500gcactcagaa caggaaaaaa aaaagacaaa aataataatt
attccaaaac gtatttgagc 1560agaaacaaac acaaacattt gcattattaa atgggcttgt
tcacacctgc tgagtagata 1620taagacgata tttaagacaa gagctaaaaa ataaaccatc
cctttctggt tttgagtgac 1680agcagagcaa taaaaattat tttcacattc ttttccctat
tgttagaagt aatcatttga 1740gtaaatacac ttatctgtgc tgtaactatt gaaatgaatc
cacttcaaat atgtatacca 1800cctttctttt ttatatttct agatatggtt tcaatataga
ctttctgact tttatggtat 1860acatatagga caatattcta ttcttctttc cttttaaata
cttactgttt caatttcaaa 1920taaaaaatca gcattctagt ttgtacattt tagcacagaa
atgtttacaa ccttcagcac 1980aattgctttt gtaatttact gacttggcat tttgaggcgt
ttttaacaaa ttatgagaaa 2040taacaccttc agaaagcatg tgactacttt gatgcaacta
tttacaatgt attcataaga 2100agtcattaac ctgtagagtt cttagacatg tggaaccttt
aacaattata ctaaagagta 2160catacaaaat acagagctat gtaataataa ctaattttaa
atcctgacaa attagaagtt 2220aagcctacta tctgtaaaaa tatgtcctga ttcatttttt
taagtatata cctgagcctt 2280taaaaagtat atgcctttac aattgatttc caataaacaa
tactgaataa catact 233656424DNAArtificial SequenceOligonucleotides
for PCR 564gcaatacttt tgagcgactt atcc
2456523DNAArtificial SequenceOligonucleotides for PCR
565gatagcaatt acctgtagtt agc
235661187DNAHomo sapiens 566gaacaacatg gcttccccca gcgtgagact cgcttgtcct
cccaccgcct gctctctcct 60gatgaccagg ttccaggagt tatcaaagaa cagcctctga
gctgcgtgga gaaagccatg 120gagacagcca gaagaaaacg gcaggattag attaacctgt
gattcctggc tggccacgag 180gtcacccatg gcatggagct gcccacaacg cccttctcag
catgaagcat cctgaaagat 240ccagggccag gttccccagg attggggagt tggaagctca
ttggcactgt caaatttgaa 300gaagaggcgt gctctgactg cctggacagg acccggaatc
aaaccgcagg ccctgggtca 360ccgctgccgg aaagagccag ttcctgtccg tccatgcacc
caccaccaaa acccaggcct 420tcctggaggt gctaggggag gccatgcccc ttttctgagt
gcttggaagt gactgctgca 480agtgacaagt gaccacgcct tttcccccgc gggtataaat
tcagaggcgc tgcgctccga 540ttctggcagt gcagctgtgg gaacctctcc acgcgcacga
actcagccaa cgatttctga 600tagatttttg ggagtttgac cagagatgca aggggtgaag
gagcgcttcc taccgttagg 660gaactctggg gacagagcgc cccggccgcc tgatggccga
ggcagggtgc gacccaggac 720ccaggacggc gtcgggaacc ataccatggc ccggatcccc
aagaccctaa agttcgtcgt 780cgtcatcgtc gcggtcctgc tgccagtgag tccccgccgc
ggtccctggc tggggaagag 840cgcacctggc gccgggaggg ggcagggaga cggggacacg
gcagggatgc ctggccctgg 900tcacctgcgg ccgggcatgt ccgggcagga cgaactcgcc
gtcggagtca ggggaagaac 960tgggtccccg ggctgggcag gagggacccg gccgcgaggg
agcagagagg cggtccccct 1020ggctgccccg agcccgcgaa gggagggaag ttccagaatc
gagagaggga gggagtcaag 1080gtggaaccca tagagtgagc ctcctgaaga cacagagcgg
ttgcctctct cattaattaa 1140ttaattagtt aataaaatta accccatgtt taaaaaaaaa
aaaaaaa 1187567155PRTHomo sapiens 567Met Gln Gly Val Lys
Glu Arg Phe Leu Pro Leu Gly Asn Ser Gly Asp 1 5
10 15 Arg Ala Pro Arg Pro Pro Asp Gly Arg Gly
Arg Val Arg Pro Arg Thr 20 25
30 Gln Asp Gly Val Gly Asn His Thr Met Ala Arg Ile Pro Lys Thr
Leu 35 40 45 Lys
Phe Val Val Val Ile Val Ala Val Leu Leu Pro Val Ser Pro Arg 50
55 60 Arg Gly Pro Trp Leu Gly
Lys Ser Ala Pro Gly Ala Gly Arg Gly Gln 65 70
75 80 Gly Asp Gly Asp Thr Ala Gly Met Pro Gly Pro
Gly His Leu Arg Pro 85 90
95 Gly Met Ser Gly Gln Asp Glu Leu Ala Val Gly Val Arg Gly Arg Thr
100 105 110 Gly Ser
Pro Gly Trp Ala Gly Gly Thr Arg Pro Arg Gly Ser Arg Glu 115
120 125 Ala Val Pro Leu Ala Ala Pro
Ser Pro Arg Arg Glu Gly Ser Ser Arg 130 135
140 Ile Glu Arg Gly Arg Glu Ser Arg Trp Asn Pro 145
150 155 56821DNAArtificial
SequenceOligonucleotides for PCR 568ggattgggga gttggaagct c
2156921DNAArtificial
SequenceOligonucleotides for PCR 569agaaatcgtt ggctgagttc g
21570857DNAHomo sapiens 570catccctctg
gctccagagc tcagagccac ccacagccgc agccatgctg tgcctcctgc 60tcaccctggg
cgtggccctg gtctgtggtg tcccggccat ggacatcccc cagaccaagc 120aggacctgga
gctcccaaag ttggcaggga cctggcactc catggccatg gcgaccaaca 180acatctccct
catggcgaca ctgaaggccc ctctgagggt ccacatcacc tcactgttgc 240ccacccccga
ggacaacctg gagatcgttc tgcacagatg ggagaacaac agctgtgttg 300agaagaaggt
ccttggagag aagactgaga atccaaagaa gttcaagatc aactatacgg 360tggcgaacga
ggccacgctg ctcgatactg actacgacaa tttcctgttt ctctgcctac 420aggacaccac
cacccccatc cagagcatga tgtgccagta cctggccaga gtcctggtgg 480aggacgatga
gatcatgcag ggattcatca gggctttcag gcccctgccc aggcacctat 540ggtacttgct
ggacttgaaa cagatggaag agccgtgccg tttctaggtg agctcctgcc 600tggtcctgcc
tcctggctca cctccgcctc caggaagacc agactcccac ccttccacac 660ctccagagca
gtgggacttc ctcctgccct ttcaaagaat aaccacagct cagaagacga 720tgacgtggtc
atctgtgtcg ccatcccctt cctgctgcac acctgcacca cggccatggg 780gaggctgctc
cctgggggca gagtctctgg cagaggttat taataaaccc ttggagcatg 840aaaaaaaaaa
aaaaaaa
857571180PRTHomo sapiens 571Met Leu Cys Leu Leu Leu Thr Leu Gly Val Ala
Leu Val Cys Gly Val 1 5 10
15 Pro Ala Met Asp Ile Pro Gln Thr Lys Gln Asp Leu Glu Leu Pro Lys
20 25 30 Leu Ala
Gly Thr Trp His Ser Met Ala Met Ala Thr Asn Asn Ile Ser 35
40 45 Leu Met Ala Thr Leu Lys Ala
Pro Leu Arg Val His Ile Thr Ser Leu 50 55
60 Leu Pro Thr Pro Glu Asp Asn Leu Glu Ile Val Leu
His Arg Trp Glu 65 70 75
80 Asn Asn Ser Cys Val Glu Lys Lys Val Leu Gly Glu Lys Thr Glu Asn
85 90 95 Pro Lys Lys
Phe Lys Ile Asn Tyr Thr Val Ala Asn Glu Ala Thr Leu 100
105 110 Leu Asp Thr Asp Tyr Asp Asn Phe
Leu Phe Leu Cys Leu Gln Asp Thr 115 120
125 Thr Thr Pro Ile Gln Ser Met Met Cys Gln Tyr Leu Ala
Arg Val Leu 130 135 140
Val Glu Asp Asp Glu Ile Met Gln Gly Phe Ile Arg Ala Phe Arg Pro 145
150 155 160 Leu Pro Arg His
Leu Trp Tyr Leu Leu Asp Leu Lys Gln Met Glu Glu 165
170 175 Pro Cys Arg Phe 180
57221DNAArtificial SequenceOligonucleotides for PCR 572agttcaagat
caactatacg g
2157321DNAArtificial SequenceOligonucleotides for PCR 573tagaaacggc
acggctcttc c
215744415DNAHomo sapiens 574agagaagcaa catctttaag gtactgaggg caggagaagt
taatgtagaa tactatgcca 60gaaaaaataa attcccaaaa gtggaagtga aataaggaca
tttagagatg tacaaaagct 120gaccgaattc actaccagtc aacccacact acaagaaaca
tcaaatgagt cctccaagca 180gaaggaaccc aataccagat gaaaatccag atctccacga
ggaaatgaag aacaccagaa 240atggatcggc cctttcttca aataagagca gttggaataa
caaagctgtt cagttgtacc 300cttggaatcc actgaaatcc tgggtaggga agctccagta
ccaccaactg gaaagactgg 360gaatgcctaa tagctggtac tggccattgt cgtaggcttt
gtccactctg acaaactgaa 420gatggggact cgactcacct tcgccagcca caggaggacc
tccagacgag gttaggtcga 480cttcccgata actttagatc ctgaaacctc acgggatttt
tcttctcttc cctttgatct 540ctcttccgct tgctcaacag gacaggactc gctgcctttc
tttcccgtca gaaagggatc 600ccttgcggac aggacctaag tgagtagctg gtttccccta
cttgtccttc cgggcctggg 660tgtctcggga gctcaggctg acgggagacc taactaccgg
cgagtgagac cagcaggagc 720ctggaggggc gcgcaccagg gtggaggttt ggtgccgggg
gttgagaaca acagtcaaac 780cctcttcttc ccctggcacc acgcacctgc cccccgggac
gccgaacgaa gtggtcccta 840aagctcctct gcaggcccaa ccgaaacagg cctgaagctc
caggatgggc gagaggatcc 900tctttgagcg aaaccagcct tctgcctggc tggccctggt
caacaccctg ggaagaggcc 960gatttggcgg acagaacgga agaaaagacc taaaggtaga
atctcatgat gtcgagatgt 1020taaaacactc aaattttaag gttcgactgt gagggggaga
tagggggtct cgagcaggat 1080cgacccctga gccttcatct gcagagtcct gtgcaccagc
tcagaggaca ggactatgtg 1140caccaatggt tctcatcagg cggcaacttc accctcacat
gcctccccca tccctgctgg 1200tacacaagac cacgactagg ggaagcccgg agggagaatg
ttaacccctg gcatctatct 1260agtcagcaga ggtgagggat gctgctaaac accttacaat
ccaccggagg acacccgccc 1320ccaccgaccc cgaagtagcc attccctgga ggtggggaaa
ctcgcctgta gatcaatgcc 1380cacgcacttg gcggacagga aatcacgaat tggccactaa
ctggatcttg gatctgagga 1440aaaaattcca gcgtcagagg gaactctcgg agatttgccc
agagcataag gaacgtactc 1500cttccctcag tgatggatca tcacatctgg gggaaatcat
agacaatttc ttttgtaggg 1560cgaactctgc tatacagttt atgatgtcag agtgaatact
ttctttgagt tgcagtcaga 1620aactgtagat ttttaaaaat ttaaaattca ttattctctg
tcagtatttc aaagtgtata 1680cagaaagcta ttgcactgtt caggagatgg cgcctaacat
tttggaaatt caaggtgatg 1740aatgtccaga taagactatc tctcctggta caaagtttga
caatgctgaa catttttaaa 1800ggttcttttt gatatacaaa gtgcaccaat gagtgctttt
taattcttac aataattctg 1860ggtgaggtag gtatttttcc aattcccatt ttatgcttcg
gtagcccttt gtatttatac 1920ttcaaaacac ttggctctct tgtaattatt taagaaatta
gttgtgatta tttgtttaat 1980gtgcaggagt tacaaaaggc aagcgttaga acaagacaga
cctggttatg attcctggct 2040ctgaaagctg tacaccctgt gaccctagac aggtgtttta
atgcctcgct gcctctgttt 2100cttgctctgt aaaatgtgaa caataacagt attggcctca
tgcttttttt gggttttaaa 2160agtaataatg tggacaaaga tcagtggagt gcctggcatg
ctgaacccat tccatgactg 2220atagctatag ttgttatgat ttgtatcaat ccattttcac
actgctataa ggaactacct 2280gagactgggt aatctatgaa gaaaagaggt ttaattgact
cacagttctg catggctggg 2340agtcctcagg aacttacaat catggcagaa ggggaagcaa
gacatttctt atatggcagc 2400aggacagaaa gagagagagt gaagggggaa gtgccacaca
cttccaaaca accatatctt 2460gtggttaatt aaaaagtact cattggtgtg ccttgtatag
aaaaaaatat acactcacta 2520tcatgagaac agcaaggagg gagtctgccc ccaaggttca
atcacctccc agtagacccc 2580tcccctgaca tgtggggatt acaattcaag atgagatttg
ggtggggaaa cagagtcaaa 2640ccatatcgtg attgttctat aataaagaga tgcccacatg
tgtttcatca gggacagtgc 2700tcattaacca gttgtcctgc cgtaattatt aatagtatcc
cctttgcttt caaaagtgtc 2760ctagtttaca aaaagtatag aaatggagga cagaatagtg
gttgcccaag attggaaaag 2820ggtaagggta aagggtgcag aggtggatgt ggttataaaa
ggcaacatgg gagatcctcg 2880tagtgaagga accgtttagt atctccactg tggtggtaga
tacccgaacc taaacatgtg 2940aaaaattgca tgaaactaaa cacacacacc aacaagtaca
agttaagtta ggaaaatcca 3000aataagattt ctacattgta tcaataggta tatcttgatt
atgatattgc aagatggtac 3060tattcaagga aactgggtag aggctacatg agactcccct
gtattatttc ctataactcc 3120atgtgaatct acaaggatct caggattaag gaagatatcc
tagtttggaa gataaaaaat 3180atatcccagt agtaatatcc actgtcccac cagggcctga
ctaccttcta taaaaagaag 3240tgcctttgtt cccctcaagt tcctttattt ggttttattc
ttcttcacag tacctacctc 3300cacttggcag attacattta ttttttcatc tttcaacagc
tatttactga atgcctacta 3360gatgccaggc ttgagatcta gcaatgaaca agatctctgt
gaaacttaca ttccaggagg 3420agaaataaat aataaaccaa aaatataatc agtaaattat
ttaatatgct gggaaacaat 3480atgtgtaatg gaagaaatat gtaaagtgat ggattagggt
tctccagaga aacagaacca 3540acaattgact catgtgatta tggaggctga gaagtctcaa
gatcacagtt ggcaagcttg 3600agacacagga gagcccatgg tgtgtttctg atttgagtcc
aaaggcctga gaaccaggag 3660agatgatggt gtgattacag ttcaaaagct ggcaggcttg
aggcccagga agagccagtg 3720ttgcagttca attccaaagg caggaaaagg ctgatatctt
agctgaagca atcaggcaga 3780aggagctctc tcttactcat gggcaggtca gacttttggt
tctattcagg cctttaagtg 3840attggatgag gatcatctac tgtggaaaga aataagcttt
attcagtgta ctgattcaaa 3900tgttaatctc atccaaaacc atgctcacag acacacccag
cataatgttt gaccaagtat 3960ctgggcacct tgtggttcag tcaaattaac acatattaac
taccttagca agatgaaaag 4020cagtgaatgc aggatggtgg ttgaaatttt aaatacgttg
gttatatagt ctcattgaaa 4080aaggaacatt tgagtgaaga cttgaagggg tggtggaata
aaccatttat ttgcttattg 4140cctgtctccc tctatcagaa tgaaagcttc atgaagcgag
agacttaatt tttatctgtt 4200atatccctag tgcctggtgc agggtaagta ctcaaaaata
tttgttgagt gaataagtaa 4260tgattgagga tggggactgg tttgtatctg gttatatctc
ttgtccttag cacagtacct 4320ggcacatcct aagccatcca aaagagttgg ttatatgatt
gtctttgaat tctatgactg 4380tttataatat acagtaaact tcactgaaga cactg
441557522DNAArtificial SequenceOligonucelotides for
PCR 575gaagaacacc agaaatggat cg
2257621DNAArtificial SequenceOligonucleotides for PCR 576cttcagtttg
tcagagtgga c
21577484DNAHomo sapiens 577tgtcagcttt gtctgtgcct cgcaaatcag aggcaaggga
gaggttgtta ccaggggaca 60ctgagaatgt acatttgatc tgccccagcc acggaagtca
gagtaggatg cacagtacaa 120aggagggggg agtggaggcc tgagagggaa gtttctggag
ttcagatact ctctgttggg 180aacaggacat ctcaacagtc tcaggttcga tcagtgggtc
ttttggcact ttgaaccttg 240accacaggga ccaagaagtg gcaatgagga cacctgcagg
aggggctagc ctgactccca 300gaactttaag actttctccc cactgccttc tgctgcagcc
caagcaggga gtgtccccct 360cccagaagca tatcccagat gagtggtaca ttatataagg
atttttttta agttgaaaac 420aactttcttt tctttttgta tgatggtttt ttaacccagt
cattaaaaat gtttataaat 480caaa
48457821DNAArtificial SequenceOligonucleotides for
PCR 578cagccacgga agtcagagta g
2157924DNAArtificial SequenceOligonucleotides for PCR 579ccactcatct
gggatatgct tctg
24580592DNAHomo sapiensmisc_feature(62)..(62)n is a, c, g, or t
580ggtgcatgtt cattgggcat cttccattcg acccctttgc ccacgtggtg accgctgggg
60anctgtgaga gtgtgagggg cacgttccag ccgtctggac tctttctctc ctactgagac
120gcagcctata ggtccgcagg ccagtcctcc caggaactga aatagtgaaa tatgagttgg
180cgaggaagat caacatatag gcctaggcca agaagaagtt tacagcctcc tgagctgatt
240ggggctatgc ttgaacccac tgatgaagag cctaaagaag agaaaccacc cactaaaagt
300cggaatccta cacctgatca gaagagagaa gatgatcagg gtgcagctga gattcaagtg
360cctgacctgg aagccgatct ccaggagcta tgtcagacaa agactgggga tggatgtgaa
420ggtggtactg atgtcaaggg gaagattcta ccaaaagcag agcactttaa aatgccagaa
480gcaggtgaag ggaaatcaca ggtttaaagg aagataagct gaaacaacac aaactgtttt
540tatattagat attttacttt aaaatatctt aataaagttt taagcttttc tc
59258121DNAArtificial SequenceOligonucleotides for PCR 581attggggcta
tgcttgaacc c
2158221DNAArtificial SequenceOligonucleotides for PCR 582tttcccttca
cctgcttctg g
215832514DNAHomo sapiens 583actggggtct tctccatgcg gctcgggcta tgacagcctc
cgtgctcctc cacccccgct 60ggatcgagcc caccgtcatg tttctctacg acaacggcgg
cggcctggtg gccgacgagc 120tcaacaagaa catggaaggg gcggcggcgg ctgcagcagc
ggctgcagcg gcggcggctg 180ccggggccgg gggcgggggc ttcccccacc cggcggctgc
ggcggcaggg ggcaacttct 240cggtggcggc ggcggccgcg gctgcggcgg cggccgcggc
caaccagtgc cgcaacctga 300tggcgcaccc ggcgcccttg gcgccaggag ccgcgtccgc
ctacagcagc gcccccgggg 360aggcgccccc gtcggctgcc gccgctgctg ccgcggctgc
cgctgcagcc gccgccgccg 420ccgccgcgtc gtcctcggga ggtcccggcc cggcgggccc
ggcgggcgca gaggccgcca 480agcaatgcag cccctgctcg gcagcggcgc agagctcgtc
ggggcccgcg gcgctgccct 540atggctactt cggcagcggc tactacccgt gcgcccgcat
gggcccgcac cccaacgcca 600tcaagtcgtg cgcgcagccc gcctcggccg ccgccgccgc
cgccttcgcg gacaagtaca 660tggataccgc cggcccagct gccgaggagt tcagctcccg
cgctaaggag ttcgccttct 720accaccaggg ctacgcagcc gggccttacc accaccatca
gcccatgcct ggctacctgg 780atatgccagt ggtgccgggc ctcgggggcc ccggcgagtc
gcgccacgaa cccttgggtc 840ttcccatgga aagctaccag ccctgggcgc tgcccaacgg
ctggaacggc caaatgtact 900gccccaaaga gcaggcgcag cctccccacc tctggaagtc
cactctgccc gacgtggtct 960cccatccctc ggatgccagc tcctatagga gggggagaaa
gaagcgcgtg ccttatacca 1020aggtgcaatt aaaagaactt gaacgggaat acgccacgaa
taaattcatt actaaggaca 1080aacggaggcg gatatcagcc acgacgaatc tctctgagcg
gcaggtcaca atctggttcc 1140agaacaggag ggttaaagag aaaaaagtca tcaacaaact
gaaaaccact agttaatgga 1200ttaaaaatag agcaagaagg caacttgaag aaacgcttca
gaactcgttg ctttgcccag 1260ataatgataa taatgcttaa taataattga agaatgggaa
agagaaagag acagagactg 1320gcattttcct ctcccgaagg agatctcttt ctctttaatg
gaatctacaa ctgttttaaa 1380actttaagaa aggtaaagac tgccagttct tccgccaacc
ccatcagccc agcccgttaa 1440atgtcaaacg tcaaccccca aaatacgcaa tttcagataa
gttacgcagt tactgaaatc 1500ttgtaagtat ttaagtgatc gttacatttt aggacactgc
gttagatggt aataatctgg 1560aagttggtta caaacgcaag aggccattgt aaacatctgc
ttgtccttct taggtcgcca 1620ttccctttgc atgttaagcg tctgctcagg taaatcttag
tgaaattcct accgttgttg 1680tacgttctgc aaaacatttt atgtatagat ttagagggga
aacgagaagg tactgaaata 1740atgatcttgg aatatttgct gtgaagggag aaagggagag
aaaactcttc tgaggatcat 1800ttgtcttggt agtatagtaa aaccaaccag ctgaaccttt
caggctacaa gagaacccgg 1860gtcggtaatg tctttttaag aataattttt aattgcttat
aacaagcata ttttgtggca 1920tttgaactat atttactgct ccaatatccg ttattttcca
aaggattttg tatctttttg 1980aaaatgttta catcatcaga tgatccacag aattcacttt
atgtgagatc tcccgagagt 2040ttccatccca acataatgga ctttggtttg aacacaattc
gttttttcat ttgaattggc 2100atttcccaat atttgctaaa catttgctgg agaaatcatt
tttctttttt cttttttaga 2160aaactcagaa tgaaaattca ttcccctgaa atatttaggt
gtctatattc tatattttga 2220tctattaagg gattagtatt tttccatgtt tattgtgtta
tcagagtgca ttagaaagat 2280tagtgattca tcttcacagc acatttttaa tcaagcagtt
atttcaacca gcacattcgt 2340tttgttcata ttcactatag aatgatatct tgtaaataaa
gacattcagc acactgtgaa 2400aatgtatttg tgcacctgct ttttaaatat ttctactaaa
aatgaaaaaa aaaaaccctt 2460agacctgtag atagtgatat cgtaatatta attgttaata
aaatagtcac tgcc 2514584388PRTHomo sapiens 584Met Thr Ala Ser Val
Leu Leu His Pro Arg Trp Ile Glu Pro Thr Val 1 5
10 15 Met Phe Leu Tyr Asp Asn Gly Gly Gly Leu
Val Ala Asp Glu Leu Asn 20 25
30 Lys Asn Met Glu Gly Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala
Ala 35 40 45 Ala
Ala Ala Gly Ala Gly Gly Gly Gly Phe Pro His Pro Ala Ala Ala 50
55 60 Ala Ala Gly Gly Asn Phe
Ser Val Ala Ala Ala Ala Ala Ala Ala Ala 65 70
75 80 Ala Ala Ala Ala Asn Gln Cys Arg Asn Leu Met
Ala His Pro Ala Pro 85 90
95 Leu Ala Pro Gly Ala Ala Ser Ala Tyr Ser Ser Ala Pro Gly Glu Ala
100 105 110 Pro Pro
Ser Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 115
120 125 Ala Ala Ala Ala Ala Ser Ser
Ser Gly Gly Pro Gly Pro Ala Gly Pro 130 135
140 Ala Gly Ala Glu Ala Ala Lys Gln Cys Ser Pro Cys
Ser Ala Ala Ala 145 150 155
160 Gln Ser Ser Ser Gly Pro Ala Ala Leu Pro Tyr Gly Tyr Phe Gly Ser
165 170 175 Gly Tyr Tyr
Pro Cys Ala Arg Met Gly Pro His Pro Asn Ala Ile Lys 180
185 190 Ser Cys Ala Gln Pro Ala Ser Ala
Ala Ala Ala Ala Ala Phe Ala Asp 195 200
205 Lys Tyr Met Asp Thr Ala Gly Pro Ala Ala Glu Glu Phe
Ser Ser Arg 210 215 220
Ala Lys Glu Phe Ala Phe Tyr His Gln Gly Tyr Ala Ala Gly Pro Tyr 225
230 235 240 His His His Gln
Pro Met Pro Gly Tyr Leu Asp Met Pro Val Val Pro 245
250 255 Gly Leu Gly Gly Pro Gly Glu Ser Arg
His Glu Pro Leu Gly Leu Pro 260 265
270 Met Glu Ser Tyr Gln Pro Trp Ala Leu Pro Asn Gly Trp Asn
Gly Gln 275 280 285
Met Tyr Cys Pro Lys Glu Gln Ala Gln Pro Pro His Leu Trp Lys Ser 290
295 300 Thr Leu Pro Asp Val
Val Ser His Pro Ser Asp Ala Ser Ser Tyr Arg 305 310
315 320 Arg Gly Arg Lys Lys Arg Val Pro Tyr Thr
Lys Val Gln Leu Lys Glu 325 330
335 Leu Glu Arg Glu Tyr Ala Thr Asn Lys Phe Ile Thr Lys Asp Lys
Arg 340 345 350 Arg
Arg Ile Ser Ala Thr Thr Asn Leu Ser Glu Arg Gln Val Thr Ile 355
360 365 Trp Phe Gln Asn Arg Arg
Val Lys Glu Lys Lys Val Ile Asn Lys Leu 370 375
380 Lys Thr Thr Ser 385
58524DNAArtificial SequenceOligonucleotides for PCR 585tctggaagtc
cactctgccc gacg
2458621DNAArtificial SequenceOligonucleotides for PCR 586tgtgacctgc
cgctcagaga g
215878769DNAHomo sapiens 587atttagaggc ggcgccaggg cggccgcgga gaaacgtgac
acaccagccc tctcggaggg 60gtttcggacc gaagggaaga agctgcgccg tgtcgtccgt
ctccctgcgc gccgcgggca 120cttctcctgg gctctccccg aactctcccg cgacctctgc
gcgccctcag gccgccttcc 180ccgccctggg ctcgggacaa cttctggggt ggggtgcaaa
gaaagtttgc ggctcctgcc 240gccggcctct ccgcctcttg gcctaggagg ctcgccgccc
gcgcccgctc gttcggcctt 300gcccgggacc gcgtcctgcc ccgagaccgc caccatgaac
aagctttaca tcggcaacct 360caacgagagc gtgacccccg cggacttgga gaaagtgttt
gcggagcaca agatctccta 420cagcggccag ttcttggtca aatccggcta cgccttcgtg
gactgcccgg acgagcactg 480ggcgatgaag gccatcgaaa ctttctccgg gaaagtagaa
ttacaaggaa aacgcttaga 540gattgaacat tcggtgccca aaaaacaaag gagccggaaa
attcaaatcc gaaatattcc 600accccagctc cgatgggaag tactggacag cctgctggct
cagtatggta cagtagagaa 660ctgtgagcaa gtgaacaccg agagtgagac ggcagtggtg
aatgtcacct attccaaccg 720ggagcagacc aggcaagcca tcatgaagct gaatggccac
cagttggaga accatgccct 780gaaggtctcc tacatccccg atgagcagat agcacaggga
cctgagaatg ggcgccgagg 840gggctttggc tctcggggtc agccccgcca gggctcacct
gtggcagcgg gggccccagc 900caagcagcag caagtggaca tcccccttcg gctcctggtg
cccacccagt atgtgggtgc 960cattattggc aaggaggggg ccaccatccg caacatcaca
aaacagaccc agtccaagat 1020agacgtgcat aggaaggaga acgcaggtgc agctgaaaaa
gccatcagtg tgcactccac 1080ccctgagggc tgctcctccg cttgtaagat gatcttggag
attatgcata aagaggctaa 1140ggacaccaaa acggctgacg aggttcccct gaagatcctg
gcccataata actttgtagg 1200gcgtctcatt ggcaaggaag gacggaacct gaagaaggta
gagcaagata ccgagacaaa 1260aatcaccatc tcctcgttgc aagaccttac cctttacaac
cctgagagga ccatcactgt 1320gaagggggcc atcgagaatt gttgcagggc cgagcaggaa
ataatgaaga aagttcggga 1380ggcctatgag aatgatgtgg ctgccatgag cctgcagtct
cacctgatcc ctggcctgaa 1440cctggctgct gtaggtcttt tcccagcttc atccagcgca
gtcccgccgc ctcccagcag 1500cgttactggg gctgctccct atagctcctt tatgcaggct
cccgagcagg agatggtgca 1560ggtgtttatc cccgcccagg cagtgggcgc catcatcggc
aagaaggggc agcacatcaa 1620acagctctcc cggtttgcca gcgcctccat caagattgca
ccacccgaaa cacctgactc 1680caaagttcgt atggttatca tcactggacc gccagaggcc
caattcaagg ctcagggaag 1740aatctatggc aaactcaagg aggagaactt ctttggtccc
aaggaggaag tgaagctgga 1800gacccacata cgtgtgccag catcagcagc tggccgggtc
attggcaaag gtggaaaaac 1860ggtgaacgag ttgcagaatt tgacggcagc tgaggtggta
gtaccaagag accagacccc 1920tgatgagaac gaccaggtca tcgtgaaaat catcggacat
ttctatgcca gtcagatggc 1980tcaacggaag atccgagaca tcctggccca ggttaagcag
cagcatcaga agggacagag 2040taaccaggcc caggcacgga ggaagtgacc agcccctccc
tgtcccttcg agtccaggac 2100aacaacgggc agaaatcgag agtgtgctct ccccggcagg
cctgagaatg agtgggaatc 2160cgggacacct gggccgggct gtagatcagg tttgcccact
tgattgagaa agatgttcca 2220gtgaggaacc ctgatctctc agccccaaac acccacccaa
ttggcccaac actgtctgcc 2280cctcggggtg tcagaaattc tagcgcaagg cacttttaaa
cgtggattgt ttaaagaagc 2340tctccaggcc ccaccaagag ggtggatcac acctcagtgg
gaagaaaaat aaaatttcct 2400tcaggtttta aaaacatgca gagaggtgtt ttaatcagcc
ttaaaggatg gttcatttct 2460tgaccttaat gtttttccaa tcttcttccc cctacttggg
taattgatta aaatacctcc 2520atttacggcc tctttctata tttacactaa tttttttatc
tttattgcta ccagaaaaaa 2580atgcgaacga atgcattgct ttgcttacag tattgactca
agggaaaaga actgtcagta 2640tctgtagatt aattccaatc actccctaac caataggtac
aatacggaat gaagaagagg 2700ggaaaatggg gagaaagatg gttaaaatac ataataatcc
acgtttaaaa ggagcgcact 2760tgtggctgat ctatgccaga tcaccatctt caaattggca
caactgaaat ttccccactc 2820tgttggggct tccccaccac attcatgtcc ctctcccgtg
taggtttcac attatgtcca 2880ggtgcacata ggtggtattg aatgctcagc agggtagggg
ctgaccactg tccctgattc 2940ccatcgttct caggcggatt ttatattttt ttaaagtcta
ttttaatgat tggatatgag 3000cactgggaag gggacgctaa ctccccttga taaagtctcg
gttccatgga ggacttgagt 3060ggccccaaag gctgccacgg tgccctcacc ccagcccatg
tgctcccata agggctggtt 3120cctagaggca ggggttgtgg ggcactccca gccacggcac
tgttaccttg gtggtgggac 3180ttggaaccca accctgagct cccgataaag ctaaagtcca
tcatctggca aattcagtaa 3240attggagagt acttgcttct gtttgtatct gagaggaatt
tttaactgac ggcttctgtc 3300tccatgaatc attatcagca tgatgaaagg tgtgtctaaa
aaacaattca gaataccagc 3360agcattgtac agcaaggggt aaataagctt aatttattaa
tttaccaggc ttaattaaga 3420tcccatggag tgtttagccc ttgtgggaga cagaagccat
cagttaaatg aggttaggcc 3480tctcctccta atatactgat tgacaatgca tattagccag
gtaatgcact ttagctaccc 3540tggacaatgc tatcaagtgt gctgggaagg gaggaaggcc
tctctacata tggaaaagcc 3600catgcgtgga gttcccctcc tttcaacatt gcaacaacag
taacaacaag acaaccgcaa 3660catgtgggcg tagtcaggca atgctgtgtg cgaagtaaac
tacctcaagg tatgaagtta 3720cctcagcaat tattttcctt tttgttcccc ccaaccccat
taaaaaaatt tttttttgat 3780ttttgttttt ttgcagcttg ctgatatttt atataaaaaa
gaaaagcaaa gcaaaagaga 3840agctgatagt cttgaatatt ttattttttt aatgaaaaga
aaaaacaaga aagttatgtt 3900tcataatttc ttacaacatg agccagtaac cctttaggaa
ctctctatgg agaacaggcc 3960tggtgggaaa ggctttgggg gctgccccct taggaggagg
ctagtgctaa gagggaaggc 4020ccaggtttga gagagcccag aggggcagag cccagagcct
tgtttggccc tgatctctga 4080cttctagagc cccagctgct ggcggctgct ggaatatcct
acctgatagg attaaaaggc 4140ctagtggagc tgggggctct cagtggttaa acaatgccca
acaaccaacc agctggccct 4200tggtctcctc tctttcctcc tttggttaaa gagcatctca
gccagctttt cccaccagtg 4260gtgctgttga gatattttaa aatattgcct ccgttttatc
gaggagagaa ataataacta 4320aaaaatatac cctttaaaaa aacctatatt tctctgtcta
aaaatatggg agctgagatt 4380ccgttcgtgg aaaaaagaca aggccaccct ctcgccctca
gagaggtcca cctggtttgt 4440cattgcaatg cttttcattt tttttttttg ttattgtttc
atttcagttc cgtcttgcta 4500ttcttcctaa tctatatcca tagatctaag gggcaaacag
atactagtta actgccccca 4560cctctgtctc cctgtcttct ttagatcggt ctgattgatt
ttaaaagtgg acccaaactt 4620agggaattct tgatttaggg tggctggtgg caaggagggg
caggggatat ggggacgtga 4680ctgggacagg ttcctgcctt atcattttct ccctaggaca
ttcccttgta gcccccagaa 4740ttgtctggcc caaattgaat agaagcagaa aaacatttag
ggataacatc aggccagtag 4800aattaagcct ctccacctgt cccaaccata aaaagggtct
cccagctttc catctctggc 4860tctatatgct ttatcccaaa acaaagcaga taacgttcag
acgtcggcca tttagtaatt 4920taaagcgaat ttccagcagc aagcatgctt tgatatctgg
ttcagactat catcaggaag 4980aaaaaaaaat cccacagtac ctgaaatgtg attgttgcag
tgttcagttt ccttgggggc 5040ctgctccctt cacaccttga gcccaagtcc ttttccgttg
gctgattcag ctcccagaag 5100agacgaggaa gtgtgtggca agggactgga aaacttcact
tgcttggatt aggcaaggct 5160ccactcattg ttgatatttg cccagcagga aaatcatgta
agttatacca ccagaaagca 5220aaaggagcat ggtttggtgg ttaaggttta gtgggatgaa
ggacctgtct tggtgggccg 5280ggccctcttg tgccccgtag gctaggtctt agggcaactc
cttgccctcc tgctcagcac 5340ctccatttcc ccatccttgg tgagataaca agctatcgcg
aaaagcactt gggagatttg 5400gatgatttga gaagagtgac ttaaaaaaaa tgcttctgtg
ctctaagata tatatgtgtg 5460tgtgtgtgct acatatatat ttttaagaaa ggaccatctc
tttaggatat atttttaaat 5520tctttgaaac acataaccaa aatggtttga ttcactgact
gactttgaag ctgcatctgc 5580cagttacacc ccaaatggct ttaatcccct ctcgggtctg
gttgcctttt gcagtttggg 5640ttgtggactc agctcctgtg aggggtctgg ttaggagaga
gccattttta aggacaggga 5700gttttatagc ccttttctac tttcctcccc tcctcccagt
ccttatcaat cttttttcct 5760ttttcctgac cccctccttc tggaggcagt tgggagctat
ccttgtttat gcctcactat 5820tggcagaaaa gaccccattt aaaacccaga gaacactgga
gggggatgct ctagttggtt 5880ctgtgtccat tttcctctgt gccaaagaca gacagacaga
ggctgagaga ggctgttcct 5940gaatcaaagc aatagccagc tttcgacaca tacctggctg
tctgaggagg aaggcctcct 6000ggaaactggg agctaagggc gaggcccttc ccttcagagg
ctcctggggg attagggtgt 6060ggtgtttgcc aagccaaggg gtagggagcc gagaaattgg
tctgtcggct cctggttgca 6120ctttggggaa ggagaggaag tttggggctc caggtagctc
cctgttgtgg gactgctctg 6180tcccctgccc ctactgcaga gatagcactg ccgagttccc
ttcaggcctg gcagacgggc 6240agtgaggagg ggcctcagtt agctctcaag ggtgccttcc
cctcctccca acccagacat 6300accctctgcc aaactgggaa ccagcagtgc tagtaactac
ctcacagagc cccagagggc 6360ctgcttgagc cttcttgctc cacaggagaa gctggtgcct
ctaggcaacc ccttcctccc 6420acctctcatc aggggtgggg gttctccttt ctttcccctg
aagtgtttat ggggagatcc 6480tagtggcttt gccattcaaa ccactcgact gtttgcctgt
ttcttgaaaa ccagtagaag 6540ggaaacagca cagcctgtca cagtaattgc aggaagattg
aagaaaaatc ctcatcaatg 6600ccaggggaca taaaagccat ttcccttcca aatactcgac
aatttagatg cagaacattt 6660ctctgtattc agacttagag taacaccagc tgaaaactgc
agtttctttc ctttggatac 6720ataaggcttc tctatcgggg tacgggacag ggaggaggcc
tcatgtctga agggggattt 6780aggggcgaga gccccagccc tgaccctcgg tcctgtgcac
cgctttgggg cacagtctga 6840tggcgccttt gctggcgcct tagtatggtt gactccggat
ggacaaaaga aaaaaaattt 6900tttttcttga atgaaatagc aggaagctcc tcgggagcat
gtgttttgat taaccgcagg 6960tgatggatgc tacgagtata aatggattaa ctacctcaat
ccttacagta agattggaac 7020taagggcagg gactcatgca taagggtatg aatcccagcc
aggacaagtg agttgaggct 7080tgtgccacaa aaggtttgtc cttggggaac aggcaggcct
gccaggatcc cccccatatc 7140gattgggctg ggagggctgg ccatgaggtc cccactttct
gctttccttg cccatgtgtc 7200acccctttgg cctccagctt gtccctctct cactttctat
agctttgttg gaccagatgg 7260tgaggaaagg aatggcctct tcccttctag agggggctgg
ctggagtgag acctggggct 7320tggcctggaa cccaccacac agccccaaag tcaggaagcc
tggggaaacc agagctgaga 7380cctcttcaac agggtttctt tgagatccta cacctccatt
gggccctttt tcagtcttca 7440atgggggccc agttggctct agaaggagaa gaggtgaagc
aggatccttt gccctggggg 7500agtctgaggg cgcggtcctt ggactcattc aggccgtctt
tgtagttggg ggagttccac 7560tgggcgatcc cagcccctcc ccacccaccc tctaatggac
ctcctcatag aagccccatt 7620tcacttttgt tttatctacc tcttagcaaa acaatagata
aattaggtag tggcagctcc 7680acttgcttag gttagggggg gaaaaagatt tctttttcca
aaggaaaaaa atattacctt 7740gagaatactt tccaaaaaat aaaattaaaa aaaaaaaaac
caaaaaaaaa aatttttttt 7800taaaagggag acattttcca gtgaccactg gattgtttta
atttcccaag cttttttttc 7860ccccataaat aagtttcact ctttggcgat tttcttcact
tgtttaagat aacgtgctag 7920ctattccaac aggtaacagc tttcacagtc tgcccctggc
ctgtctcacc ccatccccca 7980ccctattcct gccagtgagt ccttcctgtg cttctctccc
ttctcccctc ccagccagct 8040gacttcagtc acccctgtcc cccctcccct gccaataagc
tcccccagga ataaaggctt 8100tgttttgggg atgcttaaat cttgactggc acttcccggc
tgtgggggct ggggagccac 8160ttgtaacatt tctgtgcaga ttttatgtta gccactgcta
tgtaaaagca cgttcaaaat 8220gaatttcagc agattatgtg ttaccataat gaataaacgt
cctctatcac catttggagt 8280ctcccttttc tccaggatct tgatcctggt ccccaaaacc
agagtgaatc aaaagagctt 8340cctcccctga ggcaaagtgg atttgtaagc agttctgaaa
catcacttac tcagaagagg 8400gaacgatgta ttttgatgag tgcaaattgg gaagagctgg
aggcctactg cttgggacag 8460tttttttttt tttttttttt ttaaatatga gtgctagctt
attctgtaat tgcggcaact 8520ttgaaaattg tattttactg gaaatctgcc agccatcacc
acccgatttt gattgtatcc 8580ttcctcccat cctttaatct gttcattgct ttgggggagg
tggggcagct ggctcacacg 8640ttggagtttg ttctttgatg gatgaacgaa cactccagtt
ttctttcccg tgaaggttgt 8700ttcagccaca aaccacttca ttttgctgtt tcaatttcaa
aataaaagga aacttatatt 8760gaaagacaa
8769588577PRTHomo sapiens 588Met Asn Lys Leu Tyr
Ile Gly Asn Leu Asn Glu Ser Val Thr Pro Ala 1 5
10 15 Asp Leu Glu Lys Val Phe Ala Glu His Lys
Ile Ser Tyr Ser Gly Gln 20 25
30 Phe Leu Val Lys Ser Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu
His 35 40 45 Trp
Ala Met Lys Ala Ile Glu Thr Phe Ser Gly Lys Val Glu Leu Gln 50
55 60 Gly Lys Arg Leu Glu Ile
Glu His Ser Val Pro Lys Lys Gln Arg Ser 65 70
75 80 Arg Lys Ile Gln Ile Arg Asn Ile Pro Pro Gln
Leu Arg Trp Glu Val 85 90
95 Leu Asp Ser Leu Leu Ala Gln Tyr Gly Thr Val Glu Asn Cys Glu Gln
100 105 110 Val Asn
Thr Glu Ser Glu Thr Ala Val Val Asn Val Thr Tyr Ser Asn 115
120 125 Arg Glu Gln Thr Arg Gln Ala
Ile Met Lys Leu Asn Gly His Gln Leu 130 135
140 Glu Asn His Ala Leu Lys Val Ser Tyr Ile Pro Asp
Glu Gln Ile Ala 145 150 155
160 Gln Gly Pro Glu Asn Gly Arg Arg Gly Gly Phe Gly Ser Arg Gly Gln
165 170 175 Pro Arg Gln
Gly Ser Pro Val Ala Ala Gly Ala Pro Ala Lys Gln Gln 180
185 190 Gln Val Asp Ile Pro Leu Arg Leu
Leu Val Pro Thr Gln Tyr Val Gly 195 200
205 Ala Ile Ile Gly Lys Glu Gly Ala Thr Ile Arg Asn Ile
Thr Lys Gln 210 215 220
Thr Gln Ser Lys Ile Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala 225
230 235 240 Glu Lys Ala Ile
Ser Val His Ser Thr Pro Glu Gly Cys Ser Ser Ala 245
250 255 Cys Lys Met Ile Leu Glu Ile Met His
Lys Glu Ala Lys Asp Thr Lys 260 265
270 Thr Ala Asp Glu Val Pro Leu Lys Ile Leu Ala His Asn Asn
Phe Val 275 280 285
Gly Arg Leu Ile Gly Lys Glu Gly Arg Asn Leu Lys Lys Val Glu Gln 290
295 300 Asp Thr Glu Thr Lys
Ile Thr Ile Ser Ser Leu Gln Asp Leu Thr Leu 305 310
315 320 Tyr Asn Pro Glu Arg Thr Ile Thr Val Lys
Gly Ala Ile Glu Asn Cys 325 330
335 Cys Arg Ala Glu Gln Glu Ile Met Lys Lys Val Arg Glu Ala Tyr
Glu 340 345 350 Asn
Asp Val Ala Ala Met Ser Leu Gln Ser His Leu Ile Pro Gly Leu 355
360 365 Asn Leu Ala Ala Val Gly
Leu Phe Pro Ala Ser Ser Ser Ala Val Pro 370 375
380 Pro Pro Pro Ser Ser Val Thr Gly Ala Ala Pro
Tyr Ser Ser Phe Met 385 390 395
400 Gln Ala Pro Glu Gln Glu Met Val Gln Val Phe Ile Pro Ala Gln Ala
405 410 415 Val Gly
Ala Ile Ile Gly Lys Lys Gly Gln His Ile Lys Gln Leu Ser 420
425 430 Arg Phe Ala Ser Ala Ser Ile
Lys Ile Ala Pro Pro Glu Thr Pro Asp 435 440
445 Ser Lys Val Arg Met Val Ile Ile Thr Gly Pro Pro
Glu Ala Gln Phe 450 455 460
Lys Ala Gln Gly Arg Ile Tyr Gly Lys Leu Lys Glu Glu Asn Phe Phe 465
470 475 480 Gly Pro Lys
Glu Glu Val Lys Leu Glu Thr His Ile Arg Val Pro Ala 485
490 495 Ser Ala Ala Gly Arg Val Ile Gly
Lys Gly Gly Lys Thr Val Asn Glu 500 505
510 Leu Gln Asn Leu Thr Ala Ala Glu Val Val Val Pro Arg
Asp Gln Thr 515 520 525
Pro Asp Glu Asn Asp Gln Val Ile Val Lys Ile Ile Gly His Phe Tyr 530
535 540 Ala Ser Gln Met
Ala Gln Arg Lys Ile Arg Asp Ile Leu Ala Gln Val 545 550
555 560 Lys Gln Gln His Gln Lys Gly Gln Ser
Asn Gln Ala Gln Ala Arg Arg 565 570
575 Lys 58921DNAArtificial SequenceOligonucleotides for
PCR 589atttctatgc cagtcagatg g
2159021DNAArtificial SequenceOligonucelotides for PCR 590gtgggcaaac
ctgatctaca g
215915016DNAHomo sapiens 591taacattctg ttcttccgcg tgatggattt tcttttggag
attcgaactg aagcctgtac 60ggaggaaatg ttgtttttaa gggaaatgaa tagaaacaat
ccactttgaa gaagccatgg 120cgaaatcaaa gacaaaacat agactttgtt ctcaggaatc
ttcagtatct gccctgctgg 180caagctgcac cctgagtggt agtaattcct ctaattctga
tggctcgttt cactataaag 240ataagctgta cagatctgct tctcaagctc tacaggctta
tattgatgat tttgatctag 300gccaaatata tcctggtgca agcactggaa aaattaacat
tgatgaggat tttactaata 360tgtcacagtt ctgcaactat atttacaaac caaacaatgc
ttttgaaaac cttgatcacg 420aaaagcactc aaacttcata tcctgtagaa gacacatcgt
taatgacata gactccatga 480gcctaacaac tgatgatcta ttaagactcc cagcagatgg
atcattttct tatacttatg 540ttggaccgag tcaccgaacg agcaagaaaa acaagaaatg
ccgtggaagg ctgggttcat 600tggacattga gaagaatcca cattttcaag gaccctacac
ttccatgggc aaggataact 660ttgttactcc tgttatacgc tcaaatataa atggaaagca
atgtggtgac aaaattgaat 720tgcttatctt gaaggccaag agaaatctag agcagtgtac
tgaagaatta ccaaagtcca 780tgaaaaagga tgacagtcct tgctcattag ataaacttga
agcagacaga tcatgggaaa 840atattcctgt tactttcaaa tctcctgttc ccgttaactc
tgatgatagt cctcaacaaa 900cttcaagggc aaagagtgct aaaggggttc ttgaagactt
tctaaataat gataatcaga 960gctgtactct ctctggaggc aaacatcatg gtcctgttga
agccctgaaa caaatgttat 1020ttaaccttca agcagtacaa gaacgtttta atcaaaataa
gaccacagat ccaaaagaag 1080agattaaaca agtttcagaa gatgatttct ctaaattaca
gttgaaggaa agtatgattc 1140ctattactag gtcacttcag aaggctttgc accatttatc
tcgcctgaga gacctggttg 1200atgatacgaa tggagaacgg tcaccgaaaa tgtgaagagg
aaaatgaaac tgtcaccacg 1260ataaatagtc accacagaac aaataggcat tttttctatt
acttaaactg acaaagtaaa 1320tataagccat acattatttt gtggttggtt caaggattat
atatttctaa aacactaaac 1380ttgaaaatac ccataggttt tgggacctat ctttattttg
tgccaacata ctagaatgtg 1440aactgcaagg acccacaatg tatcctgaag tcttactttc
gccttctggc cagcaaatgt 1500ctaatattta aagatggatg acttctgttc ttgaagctta
cctggattta accttcttca 1560gcatcctcaa cattttatta cctggttcag gatcattaag
aaacttactg gtttttatcc 1620aaaatctttt acgttaaata gactttttta aagatatagt
tagcatcact tttaaacagc 1680ttaaaggaat atcaaaattg ttattgtgta tctcatctat
aaggaagtct gttactttga 1740aattttcata aatttaatat ttaagataca ttgtatttga
aaattgcatt aatagtgggg 1800tgatactgtg ttaaaaggaa cgttgtgttg tgacattcaa
gagaacctcc tcatttaatt 1860agtactttga ttctgtgtaa gataatcttg gtagtgcttg
acagtttcca aacctttttt 1920tggagagata tttaagaatt taatattttg atattagatt
gtttcccaga ttttaatttt 1980ggggttggct caaactagtg aaaactatga ctcaatggcc
aattgcttta tcaaatttga 2040taactaaaac ttaaaatgaa tatggaaaat cagaaagcaa
ctctatttta gagctatttt 2100gtaagagttg tgctttcttt aacaccatct gtagtcttaa
gtttgtctct agctagaact 2160gaacaaagct ctataatttt taccaagcac ttattattaa
tacttcttat aagtagtaag 2220catctttact aacacaactg agaattaagt cataaaacat
aactaataca gcacattact 2280gcctgacaaa attaaagagt actgtgtgta tgtataacta
ctacaggtta acacttcacc 2340caaatgatag cgtttttcct cagtagatta ttgtcaaata
ggaatttcta agcacattga 2400gtcaaagcat tttttccagg ttaataaagt gttatttact
atctttgtta gaggtgacat 2460gtcaaacact acagtgagct ctgtggggtt tttttttttt
tttttgcccg tgagtttttt 2520accatgctgc tctgaccagt ttgagtggca attaccaata
gatttgtttt ctttattcta 2580tggagatgtt tttaccactg acactgtttt ctgattatag
tctgcttcat agaaaatagc 2640ctgcataatc aaacaaggag ttactttgaa attaaagtat
gcctggctat taaaaatgca 2700gattttaggt gggtaaacat caggtaggtc tgggtgggtc
atgttctagg cctagaaaaa 2760tacactatta gacaagttct aaagaaggca aggagataaa
ggcatcaggt ggtaacttct 2820aattgaatat tatatgttga tcatacataa tatatactat
gcctggaaat tatgactgaa 2880aagcacctat tcggttagtg ctcctattca tgagaacata
tctccaatac taaatgagat 2940aagcctgttc taaaatctta tagccagtat tttaagaaac
ttgattatac ttaccaaagg 3000aacattgttt gttttctctt gttttaaata tggagaggtt
taatccttta cataacaaag 3060gaattaattt tagcaaaatg attcattcca accttcttat
aagaaatatc taggagagtc 3120aagtaagaaa aataacgaat ctaagtgata aacattcaag
aaattctcta aataagagat 3180ttatttataa ttttaatatc tcagggttct ttttaggttt
ccaggggaaa agagcaggat 3240aacagtgtgg agactgctaa gttgagaatt taaaacaaat
gagaacataa gatttttaaa 3300attgcattgt gaatgtaaaa tttttatcaa tcctttgctc
ttttagacat attgagaaaa 3360tgttaaatag aaaaaattaa gaaattttaa taagatgttt
cagatctttg agtatgaaaa 3420acataacaaa aaagcctaat ttcaaaaaac tatttgagat
caagggacaa tggtgtgacc 3480aatatgaagg gtcaagactg aaatgtattg tctttactat
caagaactct actttcagtt 3540gtttctcaga cagttaattt cagcttcata gagatttctg
agcaaattaa gaaacactgt 3600tttcctgggt ttgttttggg tatatgtcat tatagttatg
ttatttcttg ttgaaattta 3660taattgtagg ttttttgtat tgttttggta tttaatggtg
tataatgtgt tattacatta 3720tatgtagtta taccaaaata ttgcctgaag agaaatcatg
acaaggtccc ctgtttattc 3780ctgtgttaca gacgcatgga attgctcctg tagatttgaa
tttttgtttc atttttttct 3840gtcccaccct tcactctctc tgtttcagaa catttttggt
agaagtgcta tccagaagtg 3900aacttgtcaa aaggcaagta gcatgaaaga agacagaaga
agcaaaaggc taatacagtg 3960gataatttct gagcacttga agtttcttca aatgtgcaag
actgtgtgtc ttcctattag 4020atgtataaat tggatatttc atgcctaatt aaatgttgcg
ttggattgca gtgcctatca 4080tacagtgatt ggagtaaatt gaggcctaat cctgaacaca
tatagagcat attgttagat 4140atttttcctg tgacatttga agttattatt ctcccatttc
ctttttcttt ttttgtttat 4200aatcatatgt ccctaagatt gttttccttt tttggaccaa
aaaaaagaaa aaaaaaatct 4260tagcttttca tcctcccagt gtattctgca ttgtccttac
cctagatcag ccccttctgt 4320gtaacagttt ttctcacaat gtagcaactt ttatccaccc
ttcaggacct tcactgggac 4380tagttcattc attttcaaat agctatttca acctttaaca
tctactgtct tagtctttta 4440cacagaagcc agagtgactg gtcttggcaa gactctgttg
tgtatcacca ctctaacctt 4500actgatttgt ttcagcaaat ttgctttagt taaattgctt
tactcagatt cccccaaact 4560ttatatgtgt attgtcatct ttgtgcatat tatttctcat
gcatgaaata ctcaattttt 4620attcttttat ctaacgctta ctcttacatt tctttaaagc
tctggccaag tattttattt 4680cgtccctaaa cattctaact atccaccaaa ctggtaagtt
ggcttttctt tttcctcccc 4740ctgtcattca tttagctgtt atatttcatt ttaatgtttt
gggtggtgcc tcttatacta 4800tgttgtattc ctagacaagg aaatgtatat caaaatatgt
tagatgattg attgttttat 4860ctccttgatg atagcacctc ttatactgct ttacagaatc
aggaaaaagt aaactgcatt 4920ttacatagtg gttttaaata ttgattgatt gatattctaa
acctggtttc ctatataaag 4980ttgtaagttc aagataaaaa aaaaaaaaaa aaaaca
5016592372PRTHomo sapiens 592Met Ala Lys Ser Lys
Thr Lys His Arg Leu Cys Ser Gln Glu Ser Ser 1 5
10 15 Val Ser Ala Leu Leu Ala Ser Cys Thr Leu
Ser Gly Ser Asn Ser Ser 20 25
30 Asn Ser Asp Gly Ser Phe His Tyr Lys Asp Lys Leu Tyr Arg Ser
Ala 35 40 45 Ser
Gln Ala Leu Gln Ala Tyr Ile Asp Asp Phe Asp Leu Gly Gln Ile 50
55 60 Tyr Pro Gly Ala Ser Thr
Gly Lys Ile Asn Ile Asp Glu Asp Phe Thr 65 70
75 80 Asn Met Ser Gln Phe Cys Asn Tyr Ile Tyr Lys
Pro Asn Asn Ala Phe 85 90
95 Glu Asn Leu Asp His Glu Lys His Ser Asn Phe Ile Ser Cys Arg Arg
100 105 110 His Ile
Val Asn Asp Ile Asp Ser Met Ser Leu Thr Thr Asp Asp Leu 115
120 125 Leu Arg Leu Pro Ala Asp Gly
Ser Phe Ser Tyr Thr Tyr Val Gly Pro 130 135
140 Ser His Arg Thr Ser Lys Lys Asn Lys Lys Cys Arg
Gly Arg Leu Gly 145 150 155
160 Ser Leu Asp Ile Glu Lys Asn Pro His Phe Gln Gly Pro Tyr Thr Ser
165 170 175 Met Gly Lys
Asp Asn Phe Val Thr Pro Val Ile Arg Ser Asn Ile Asn 180
185 190 Gly Lys Gln Cys Gly Asp Lys Ile
Glu Leu Leu Ile Leu Lys Ala Lys 195 200
205 Arg Asn Leu Glu Gln Cys Thr Glu Glu Leu Pro Lys Ser
Met Lys Lys 210 215 220
Asp Asp Ser Pro Cys Ser Leu Asp Lys Leu Glu Ala Asp Arg Ser Trp 225
230 235 240 Glu Asn Ile Pro
Val Thr Phe Lys Ser Pro Val Pro Val Asn Ser Asp 245
250 255 Asp Ser Pro Gln Gln Thr Ser Arg Ala
Lys Ser Ala Lys Gly Val Leu 260 265
270 Glu Asp Phe Leu Asn Asn Asp Asn Gln Ser Cys Thr Leu Ser
Gly Gly 275 280 285
Lys His His Gly Pro Val Glu Ala Leu Lys Gln Met Leu Phe Asn Leu 290
295 300 Gln Ala Val Gln Glu
Arg Phe Asn Gln Asn Lys Thr Thr Asp Pro Lys 305 310
315 320 Glu Glu Ile Lys Gln Val Ser Glu Asp Asp
Phe Ser Lys Leu Gln Leu 325 330
335 Lys Glu Ser Met Ile Pro Ile Thr Arg Ser Leu Gln Lys Ala Leu
His 340 345 350 His
Leu Ser Arg Leu Arg Asp Leu Val Asp Asp Thr Asn Gly Glu Arg 355
360 365 Ser Pro Lys Met 370
59321DNAArtificial SequenceOligonucleotides for PCR 593aatggaaagc
aatgtggtga c
2159421DNAArtificial SequenceOligonucleotides for PCR 594tccagagaga
gtacagctct g
215952061DNAHomo sapiens 595atgaggcgga caggccccga ggaggaggcc tgcggcgtgt
ggctggacgc ggcggcgctg 60aagaggcgga aagtgcagac acatttaatc aaaccaggca
ccaaaatgct aacactcctt 120cctggagaaa gaaaggctaa tatttatttt actcaaagaa
gagctccatc tacaggcatt 180caccagagaa gcattgcttc cttcttcacc ttgcagccag
gaaagacaaa tggcagtgac 240cagaagagtg tttcatctca tacagaaagt cagatcaaca
aagagtccaa gaaaaatgcg 300acccagctag accatttgat cccaggctta gcacacgatt
gcatggcatc ccctttagcc 360acttcaacca ctgcggacat ccaggaagct ggactctctc
ctcagtccct ccagacttct 420ggccaccaca gaatgaaaac cccattttca actgagctat
ctttgctcca gcctgatact 480ccagactgtg ctggagatag tcatacccca ctggcttttt
ccttcaccga ggacttggaa 540agttcttgtt tgctagaccg aaaggaagaa aaaggggatt
ctgccaggaa atgggaatgg 600cttcatgagt ctaagaagaa ctatcagagt atggagaaac
acaccaaact acctggggac 660aaatgctgtc agcccttagg caagactaaa ttggaaagaa
aggtgtctgc caaagaaaac 720aggcaggccc ctgtcctcct tcaaacatac agggaatcct
ggaatggaga aaacatagaa 780tcggtgaaac aaagccgtag tccagtttct gtgttttcct
gggacaatga aaagaatgac 840aaggactcct ggagtcaact tttcactgaa gattctcaag
gccagcgggt cattgcccac 900aacactagag ctccttttca agatgtaacc aataactgga
attgggactt agggccgttt 960cctaacagtc cttgggctca gtgccaggag gatgggccaa
ctcaaaatct gaagcctgat 1020ttgctcttta cccaggactc tgaaggtaat caagttatca
gacaccaatt ctaaatgttt 1080gaagctttgt ttctaaaagt accttgaaat gatagagatg
taggaaaata tagttgtggg 1140tggagagagg agtgagtttg tttaggtggg aaggtggcat
gggatgaagt tgtcattact 1200gagcatcttc tctgtgtaaa taaagggcag taccattgtt
aagacagtgg gattggcatc 1260atggctttcc ctcaggaagg tggtggctgg taaattccct
gaatgagtct atgatgaaca 1320ctgaggcagc acagtgggta tttatctcta tgaaagtgcc
ttttactcag cctgcacaga 1380gccatctctt tgcccttcca gatgtctgac tgggaccttg
cttatggatg tgtttttttt 1440tttttttttt tgagatggag tctcgctctg tcgccaggct
ggagtgcagt ggtgcgacct 1500cagctcactg caccctctgt gtcccggatt caagcgattc
tcctgcctca gcctcccgaa 1560tagcagggac tacaggcatg cgccaccacg cccagctaat
tttttttgga tttttagtag 1620agacgaggtt tcaccatatt agccaggatg gtctccatct
cctgacctcc tgatccgccc 1680acctcagcct cccaaagtgc tgagattaca ggcataagcc
accgcgccca gccagatgtg 1740tgagctttta atctctggct gatcttaacc cacatcagcc
taagcttggg atgattactc 1800ttgacccttt tttttcagtg attagcaaat ctccccacaa
cccaggtgtg gagagaagag 1860aggtagaatg gtgctagttt cctattttat ttttgtggta
actgtacagc actttaaagt 1920tatatactct atgtttaaat atctccctta aaaagcctga
gctgtacaac aatctggatg 1980tgactctgtt acccttttcc cacaagatag gagggaatcc
cctttgtaaa actatgaatc 2040caaataaatg tttacaaagt g
2061596357PRTHomo sapiens 596Met Arg Arg Thr Gly
Pro Glu Glu Glu Ala Cys Gly Val Trp Leu Asp 1 5
10 15 Ala Ala Ala Leu Lys Arg Arg Lys Val Gln
Thr His Leu Ile Lys Pro 20 25
30 Gly Thr Lys Met Leu Thr Leu Leu Pro Gly Glu Arg Lys Ala Asn
Ile 35 40 45 Tyr
Phe Thr Gln Arg Arg Ala Pro Ser Thr Gly Ile His Gln Arg Ser 50
55 60 Ile Ala Ser Phe Phe Thr
Leu Gln Pro Gly Lys Thr Asn Gly Ser Asp 65 70
75 80 Gln Lys Ser Val Ser Ser His Thr Glu Ser Gln
Ile Asn Lys Glu Ser 85 90
95 Lys Lys Asn Ala Thr Gln Leu Asp His Leu Ile Pro Gly Leu Ala His
100 105 110 Asp Cys
Met Ala Ser Pro Leu Ala Thr Ser Thr Thr Ala Asp Ile Gln 115
120 125 Glu Ala Gly Leu Ser Pro Gln
Ser Leu Gln Thr Ser Gly His His Arg 130 135
140 Met Lys Thr Pro Phe Ser Thr Glu Leu Ser Leu Leu
Gln Pro Asp Thr 145 150 155
160 Pro Asp Cys Ala Gly Asp Ser His Thr Pro Leu Ala Phe Ser Phe Thr
165 170 175 Glu Asp Leu
Glu Ser Ser Cys Leu Leu Asp Arg Lys Glu Glu Lys Gly 180
185 190 Asp Ser Ala Arg Lys Trp Glu Trp
Leu His Glu Ser Lys Lys Asn Tyr 195 200
205 Gln Ser Met Glu Lys His Thr Lys Leu Pro Gly Asp Lys
Cys Cys Gln 210 215 220
Pro Leu Gly Lys Thr Lys Leu Glu Arg Lys Val Ser Ala Lys Glu Asn 225
230 235 240 Arg Gln Ala Pro
Val Leu Leu Gln Thr Tyr Arg Glu Ser Trp Asn Gly 245
250 255 Glu Asn Ile Glu Ser Val Lys Gln Ser
Arg Ser Pro Val Ser Val Phe 260 265
270 Ser Trp Asp Asn Glu Lys Asn Asp Lys Asp Ser Trp Ser Gln
Leu Phe 275 280 285
Thr Glu Asp Ser Gln Gly Gln Arg Val Ile Ala His Asn Thr Arg Ala 290
295 300 Pro Phe Gln Asp Val
Thr Asn Asn Trp Asn Trp Asp Leu Gly Pro Phe 305 310
315 320 Pro Asn Ser Pro Trp Ala Gln Cys Gln Glu
Asp Gly Pro Thr Gln Asn 325 330
335 Leu Lys Pro Asp Leu Leu Phe Thr Gln Asp Ser Glu Gly Asn Gln
Val 340 345 350 Ile
Arg His Gln Phe 355 59721DNAArtificial
SequenceOligonucleotides for PCR 597caccttgcag ccaggaaaga c
2159821DNAArtificial
SequenceOligonucleotides for PCR 598cagcacagtc tggagtatca g
215991907DNAHomo sapiens 599aatgcacacg
agcagacaga gaagcaacat ctttaaggta ctgagggcag gagaagttaa 60tgtagaatac
tatgccagaa aaaataaatt cccaaaagtg gaagtgaaat aaggacattt 120agagatgtac
aaaagctgac cgaattcact accagtcaac ccacactaca agaaacatca 180aatgagtcct
ccaagcagaa ggaatccaat accagatgaa aatccagatc tccacgagga 240aatgaagaac
accagaaatg ggtaactata ctagatcggc cctttcttca aataagagca 300gttggaataa
caaagctgtt cagttgtacc cttggaatcc actgaaatcc tgggtaggga 360agctccagta
ccaccaactg gaaagactgg gaatgcctaa tagctggtac tggccattgt 420cgtaggcttt
gtccactctg acaaactgaa gatggggact cgactcacct tcgccagcca 480caggaggacc
tccagacgag gacaggactc gctgcctttc tttcccgtca gaaagggatc 540ccttgcggac
aggacctaag caccacgcac ctgccccccg ggatgccgaa cgaagtggtc 600cctaaagctc
ctctgcaggc ccaaccgaaa caggcctgaa gctccaggat gggcgagagg 660atcctctttg
agcgaaacca gccttctgcc tggctggccc tggtcaacac cctgggaaga 720ggccgatttg
gcggacagaa cggaagaaaa gacctaaagg tagaatctca tgatgtcgag 780atgttaaaac
actcaaattt taaggttcga ctgtgagggg gagatagggg gtctcgagct 840ggatcgaccc
ctgagccttc atctgcagag tcctgtgcac cagctcagag gacaggacta 900tgtgcaccaa
tggttctcat caggcggcaa cttcaccctc acatgcctcc cccatccctg 960ctggtacaca
agaccacgac taggggaagc ccggagggag aatgttaacc cctggcatct 1020atctagtcag
cagaggtgag ggatgctgct aaacacctta caatccaccg gaggacaccc 1080gcccccaccg
accccgaagt ggccattccc tggaggtggg gaaactcgcc tgtagatcaa 1140tgcccacgca
cttggcggac aggaaatcac gaattggcca ctaactggat cttggatctg 1200agaaaaaaat
tccagcgtca gagggaactc tcggagattt gcccagagca taaggaacgt 1260actccttccc
tcagtgatgg atcctcacat ctgggggaaa tcatagacaa tttcttttgt 1320agggcgaact
ctgctataca gtttatgatg tcagagtgaa tactttcttt gagttgcagt 1380cagaaactgt
agatttttaa aaatttaaaa ttcattattc tctgtcagta ttccaaagtg 1440tatacagaaa
gctattgcac tgttcaggag atggcgctta acattttgga aattcaaggt 1500gatgaatgtc
cagataagac tatctctcct ggtacaaagt ttgacaatgc tgaacatttt 1560taaaggttct
ttttgatata caaagtgcac caatgagtgc tttttaattc ttacaataat 1620tctgggtgag
gtaggtattt ttccaattcc cattttatgc ttcggtagcc ctttgtattt 1680atacttcaaa
acacttggct ctcttgtaat tatttaagaa attagttgtg attatttgtt 1740taatgtgcag
gagttacaaa aggcaagctt tagaacaaga cagacctggt tatgattcct 1800ggctctgaaa
gctgtacacc ctgtgaccct agacaggtgt tttaatgcct cgctgcctct 1860gtttcttgct
ctgtaaaatg tgaacaataa cagtattggc ctcatgc
190760021DNAArtificial SequenceOligonucleotides for PCR 600ttgcggacag
gacctaagca c
2160121DNAArtificial SequenceOligonucleotides for PCR 601tagtcctgtc
ctctgagctg g
216022553DNAHomo sapiens 602tgcgtgtcgg ggtccgctcg tgcgcgcctc tccggggtct
gtgcgcgtgg ccctccgctc 60gcgccggagg gcgtgggcgt ggcctcggcg tgggtgtggc
cgctcgggga ggggcctccc 120gggggcgggg ccggcctggt ccgcgcggtg acgcgccctg
cagccccgag cgagcgagcg 180agcgagcgag ttgccgagcg cgccccgtcc ctcgcgcgcg
atgctcccct ggacggcgct 240cggcctggcc ctgagcttgc ggctggcgct ggcgcggagc
ggcgcggagc gcggtccacc 300agcatcagcc ccccgagggg acctgatgtt cctgctggac
agctcagcca gcgtctctca 360ctacgagttc tcccgggttc gggagtttgt ggggcagctg
gtggctccac tgcccctggg 420caccggggcc ctgcgtgcca gtctggtgca cgtgggcagt
cggccataca ccgagttccc 480cttcggccag cacagctcgg gtgaggctgc ccaggatgcg
gtgcgtgctt ctgcccagcg 540catgggtgac acccacactg gcctggcgct ggtctatgcc
aaggaacagc tgtttgctga 600agcatcaggt gcccggccag gggtgcccaa agtgctggtg
tgggtgacag atggcggctc 660cagcgaccct gtgggccccc ccatgcagga gctcaaggac
ctgggcgtca ccgtgttcat 720tgtcagcacc ggccgaggca acttcctgga gctgtcagcc
gctgcctcag cccctgccga 780gaagcacctg cactttgtgg acgtggatga cctgcacatc
attgtccaag agctgagggg 840ctccattctc gacgcgatgc ggccgcagca gctccatgcc
acggagatca cgtccagcgg 900cttccgcctg gcctggccac ccctgctgac cgcagactcg
ggctactatg tgctggagct 960ggtgcccagc gcccagccgg gggctgcaag acgccagcag
ctgccaggga acgccacgga 1020ctggatctgg gccggcctcg acccggacac ggactacgac
gtggcgctag tgcctgagtc 1080caacgtgcgc ctcctgaggc cccagatcct gcgggtgcgc
acgcggcccg gtgaggcagg 1140gccgggggct tcgggcccgg agtcgggggc tgggccggcc
cccacgcagc tcgccgccct 1200ccccgcccca gaggaggccg ggccagagcg catcgtcatc
tcccacgccc ggccgcgcag 1260cctccgcgtg agttgggccc cagcgctggg ctcagccgcg
gcgctcggct accacgtgca 1320gttcgggccg ctgcggggcg gggaggcgca gcgggtggag
gtgcccgcgg gccgcaactg 1380caccacgctg cagggcctgg cgccgggcac cgcctacctg
gtgaccgtga ccgccgcctt 1440ccgctcgggc cgcgagagcg cgctgtccgc caaggcctgc
acgcccgacg gcccgcgccc 1500gcgcccacgc cccgtgcccc gcgccccgac cccggggacc
gccagccgtg agccgtaagc 1560cggcgtcccc gcccagccga gagggccggc gcctacctga
gggcccctgt gtcccgaacc 1620cggagcggag gcgcccaacc cggcagacgg gtgcaggccc
ggcctttccc cacgcggact 1680ccgcgcgacc ccggccctct ccctgcggcc gcagggcttc
cccgcctggc gcctgccctc 1740cagggctggg gcctcgcctg gcgggacccc gcagcagccc
cggccccatc cccgcccaga 1800gccgggcgtc gtgtgggtcc gtgggtgata attgagagcg
tcagacccag gactgttcag 1860ggaggagccc cggtcagact cccacgtgtg aagaccgggc
cccaagtggc aagggctggc 1920ctggggcggg cagcttgggt cctggacgtt gataggaagc
ggaaggggaa tcgcgggaag 1980ctggcccagg tcaggtccgc aaaggcttct gaagaagagg
aagggcgagt aggggcacct 2040ggacgctgat ggtggccagg atgctcagct ggccaggagg
gcagcacctg ctggggacgg 2100tggccctgcc ttcatgccca ggacaccagc tgggtccagc
tagcagccac tgggaatcag 2160aggaatgggg cagagctggg cattcaggac cttgaggaca
cgtgacccca cccgcccacc 2220gccactatca ggccccggga ccgcactgac aggaaacctt
ccgtcgtgag ggagcacttc 2280ccaggggccg cagggacgac actctccagg gaggccccag
caaccacacc atcttcttgc 2340tgtgagaggt ctcaccccgg gctacctcct gtcactactc
actgccctgg ggtccgtggg 2400caagttgccc agggtggggg tgcctagcca ggtgcagtcc
ccgccccgcc tagtcctcgg 2460cgtcacgcaa tgctcacctc gcctcttccc cactaacatc
ccagacttta aaattcagta 2520aatcagatgt acaccgaaaa aaaaaaaaaa aaa
2553603445PRTHomo sapiens 603Met Leu Pro Trp Thr
Ala Leu Gly Leu Ala Leu Ser Leu Arg Leu Ala 1 5
10 15 Leu Ala Arg Ser Gly Ala Glu Arg Gly Pro
Pro Ala Ser Ala Pro Arg 20 25
30 Gly Asp Leu Met Phe Leu Leu Asp Ser Ser Ala Ser Val Ser His
Tyr 35 40 45 Glu
Phe Ser Arg Val Arg Glu Phe Val Gly Gln Leu Val Ala Pro Leu 50
55 60 Pro Leu Gly Thr Gly Ala
Leu Arg Ala Ser Leu Val His Val Gly Ser 65 70
75 80 Arg Pro Tyr Thr Glu Phe Pro Phe Gly Gln His
Ser Ser Gly Glu Ala 85 90
95 Ala Gln Asp Ala Val Arg Ala Ser Ala Gln Arg Met Gly Asp Thr His
100 105 110 Thr Gly
Leu Ala Leu Val Tyr Ala Lys Glu Gln Leu Phe Ala Glu Ala 115
120 125 Ser Gly Ala Arg Pro Gly Val
Pro Lys Val Leu Val Trp Val Thr Asp 130 135
140 Gly Gly Ser Ser Asp Pro Val Gly Pro Pro Met Gln
Glu Leu Lys Asp 145 150 155
160 Leu Gly Val Thr Val Phe Ile Val Ser Thr Gly Arg Gly Asn Phe Leu
165 170 175 Glu Leu Ser
Ala Ala Ala Ser Ala Pro Ala Glu Lys His Leu His Phe 180
185 190 Val Asp Val Asp Asp Leu His Ile
Ile Val Gln Glu Leu Arg Gly Ser 195 200
205 Ile Leu Asp Ala Met Arg Pro Gln Gln Leu His Ala Thr
Glu Ile Thr 210 215 220
Ser Ser Gly Phe Arg Leu Ala Trp Pro Pro Leu Leu Thr Ala Asp Ser 225
230 235 240 Gly Tyr Tyr Val
Leu Glu Leu Val Pro Ser Ala Gln Pro Gly Ala Ala 245
250 255 Arg Arg Gln Gln Leu Pro Gly Asn Ala
Thr Asp Trp Ile Trp Ala Gly 260 265
270 Leu Asp Pro Asp Thr Asp Tyr Asp Val Ala Leu Val Pro Glu
Ser Asn 275 280 285
Val Arg Leu Leu Arg Pro Gln Ile Leu Arg Val Arg Thr Arg Pro Gly 290
295 300 Glu Ala Gly Pro Gly
Ala Ser Gly Pro Glu Ser Gly Ala Gly Pro Ala 305 310
315 320 Pro Thr Gln Leu Ala Ala Leu Pro Ala Pro
Glu Glu Ala Gly Pro Glu 325 330
335 Arg Ile Val Ile Ser His Ala Arg Pro Arg Ser Leu Arg Val Ser
Trp 340 345 350 Ala
Pro Ala Leu Gly Ser Ala Ala Ala Leu Gly Tyr His Val Gln Phe 355
360 365 Gly Pro Leu Arg Gly Gly
Glu Ala Gln Arg Val Glu Val Pro Ala Gly 370 375
380 Arg Asn Cys Thr Thr Leu Gln Gly Leu Ala Pro
Gly Thr Ala Tyr Leu 385 390 395
400 Val Thr Val Thr Ala Ala Phe Arg Ser Gly Arg Glu Ser Ala Leu Ser
405 410 415 Ala Lys
Ala Cys Thr Pro Asp Gly Pro Arg Pro Arg Pro Arg Pro Val 420
425 430 Pro Arg Ala Pro Thr Pro Gly
Thr Ala Ser Arg Glu Pro 435 440
445 60421DNAArtificial SequenceOligonucleotides for PCR 604aaccaacctg
aggatttcac g
2160521DNAArtificial SequenceOligonucleotides for PCR 605agacagctgt
tcgtgagaag c
216061999DNAHomo sapiens 606cactctgtaa gttcaccgcc ggtcgggtcc ggccgccgcg
ctgtccagct cctgagacct 60tgctgtccgc cggtctgccg tctgcgcgcc tcacgctcct
cagccctgga ccggggacaa 120gtaaccctcg gtgacaagac caaagtgcac tgctgcccac
acagttccta cctttctggc 180ttcaattctt cagaagagtt tgccgtcctt tggggagaac
gtgatttttg ttatctcagc 240ccactgactt cattgatctc taatcttttt taattccttg
ggccaacttt gttcgtgccc 300ccacactgta gccagaagcc cgttggcgag ctctggcacc
tgcaaaccac cccgtggaac 360gagtgtttcc tctggctgag ggttggagag gaggtgtggt
ctcagcaggc ggcccgtagc 420ctcacagcca ggcctggtgg tgaggtcacc atgtccacca
aggtgcccat ctatctgaag 480cgtggcagtc gcaagggcaa gaaggagaag cttcgggacc
tgctgtcctc ggacatgatc 540agcccaccgc tgggggactt ccgccacacc attcatattg
gcagtggcgg cggcagtgac 600atgtttggcg acatctcctt cctgcagggc aagttccacc
tcctgccggg gaccatggtg 660gaggggcctg aagaagatgg caccttcgac ctccccttcc
agttcacccg caccgccacc 720gtgtgtgggc gggagctccc ggacggccca tcccctctgc
tcaagaacgc catctccctc 780ccggttatcg gtggacccca ggctctcacc ctgcccacag
cccaggctcc acccaagccc 840cctcgcctgc acctggagac ccctcagcct tccccacagg
agggagggag tgtggacatc 900tggaggattc cagagactgg ctcccccaac agtggactga
ccccggagtc aggggccgag 960gagcccttcc tgtccaatgc cagctccctg ctgtccctgc
acgtggacct ggggccttcc 1020atcctggatg atgtcctgca gatcatggat caggacctgg
acagcatgca gatccccaca 1080taggacacga ggctgcctag gctggggtcc caggtggggc
ccagccagga ggtggggtgt 1140ggacccggcc ctggcggcgg agtcagggtc ccaagatccc
acctgtatgg tcgctggcca 1200gtgattctcc ttctgagccg tgtttcccct ctccctccct
ctccacgtgg gcagggcagg 1260ccccatcgct ttcctctgat aaccacatgg acacatcctg
aagtcagccc aggcgccctg 1320agcatcttgg ggcacctgga ccccatcaca atactccttc
ttccttcagg tccctgggtg 1380aaggctttgc tgaaaccgac cccccttttc acgtcccttc
tgcctctgcc ccgttggatg 1440ccctgactgg gggcagggga agagacaggg cacagctggc
cacagggctc agccactgag 1500caggctgttc cgggcctttg gctttgcatc ctggacgggg
agtgtcctgt cagggaccag 1560atgtgtcctg cctcatccct agctccaatc ccttccccac
gtgaccgggg attctggttg 1620caataaaaca tgctgctgct ggtggcggag ctccctgtcc
ctttgcccca ggtttcctcc 1680cggaggcaga cagtctccca gagctgaggg cttgcctctg
gagaccccag ccccagaggg 1740ctttgtggag gacaggcctt gccctcaaga acgtcgtacc
tgacgctgag cctgtcatga 1800gaatgcaaca ggagcaaacc aagtgttgct gtgacattga
ttcagatgtt tggcaagagg 1860tggctgagca ctggggtggg cttggcactg tgccaagcct
ggggccaatc cctgcccagt 1920cagctggggt ctggtggggg acacccaaga ataaaagaat
aaccacaaag tgtgcaaggg 1980aaaaaaaaaa aaaaaaaaa
1999607210PRTHomo sapiens 607Met Ser Thr Lys Val
Pro Ile Tyr Leu Lys Arg Gly Ser Arg Lys Gly 1 5
10 15 Lys Lys Glu Lys Leu Arg Asp Leu Leu Ser
Ser Asp Met Ile Ser Pro 20 25
30 Pro Leu Gly Asp Phe Arg His Thr Ile His Ile Gly Ser Gly Gly
Gly 35 40 45 Ser
Asp Met Phe Gly Asp Ile Ser Phe Leu Gln Gly Lys Phe His Leu 50
55 60 Leu Pro Gly Thr Met Val
Glu Gly Pro Glu Glu Asp Gly Thr Phe Asp 65 70
75 80 Leu Pro Phe Gln Phe Thr Arg Thr Ala Thr Val
Cys Gly Arg Glu Leu 85 90
95 Pro Asp Gly Pro Ser Pro Leu Leu Lys Asn Ala Ile Ser Leu Pro Val
100 105 110 Ile Gly
Gly Pro Gln Ala Leu Thr Leu Pro Thr Ala Gln Ala Pro Pro 115
120 125 Lys Pro Pro Arg Leu His Leu
Glu Thr Pro Gln Pro Ser Pro Gln Glu 130 135
140 Gly Gly Ser Val Asp Ile Trp Arg Ile Pro Glu Thr
Gly Ser Pro Asn 145 150 155
160 Ser Gly Leu Thr Pro Glu Ser Gly Ala Glu Glu Pro Phe Leu Ser Asn
165 170 175 Ala Ser Ser
Leu Leu Ser Leu His Val Asp Leu Gly Pro Ser Ile Leu 180
185 190 Asp Asp Val Leu Gln Ile Met Asp
Gln Asp Leu Asp Ser Met Gln Ile 195 200
205 Pro Thr 210 60821DNAArtificial
SequenceOligonucleotides for PCR 608aacgtcgtac ctgacgctga g
2160921DNAArtificial
SequenceOligonucleotides for PCR 609ccaagtgttg ctgtgacatt g
21610986DNAHomo sapiens 610attcaagatg
atgttagaga gatgacagag tctaggttag ggagggcctg agtccttgta 60gactctgagt
acggtgctga gcagggaaga gacaggctct ggcttagggt ttagaaggaa 120cacaggctac
tgcgatgagg attgtctgaa ggggaacaaa ggccaagatg ttgtttgaag 180ccgtgagact
gagtgagatc atggaggaag tgaatgtaaa tagaaaaggg aagaagtctg 240aagacggagc
cctgagacac tccattgtaa actggaagat gaggaagagc cagcaaagga 300gactgagaag
gggcagccag tgaagatgga gccagggcta gagtaagagc cccttctggg 360atgctgtgac
ccccaagttt gaagactgct gataacccca atctacgaag actagctatg 420gaacttccta
cactgagaca actccagtgg aactctgata attatcctaa aataaggagg 480cttcttcagt
agccctcgaa atatgttcaa atacatgatt acatttatgt ccttaatatt 540gctattagtt
tctgatgtta atgtaaaagt tggggaaaaa gtggaaaagt taaagcagtg 600caggttaatt
caatgccaga gtaacttctc agagggtgta tattcagtgt gaacaatttt 660caacagagaa
atgtcaactt ctggccacaa cggcaaccag taaaatgact atttttactg 720tcttatctat
taatgaagag gagattgcat aatatagatg aaggagcata gtatttgcag 780gtggaacgcc
tagcagggct tgagtctcaa ctctgctgct tttactctaa ttgaccgaga 840caagtcattt
aaactaatag agcttcaatt ttctcatatc taatgtaaca taacaattca 900cagcctttta
ctttgtagtt atcgtgaaga tctaatcgca gtgaaatata tttatatatc 960tgtctgccga
taaaaaaaaa aaaaaa
98661121DNAArtificial SequenceOligonucleotides for PCR 611gactctgagt
acggtgctga g
2161221DNAArtificial SequenceOligonucleotides for PCR 612tcttactcta
gccctggctc c
21613899DNAHomo sapiens 613cccgagcgcc ggccgggcca tgacccccgc tgctctgtct
tgcaggctcg tcgccgcggc 60cccccgagcc cgaccgccgc cgccaccacc accagcgccc
gggcgggcct cgcgcgcctc 120gggcgcggct ccgcagtgag cccaccaaga aggaagcggc
ctgcagaggt gccgacatgg 180ggcttaagat gtcctgcctg aaaggctttc aaatgtgtgt
cagcagcagc agcagcagcc 240acgacgaggc ccccgtcctg aacgacaagc acctggacgt
gcccgacatc atcatcacgc 300cccccacccc cacgggcatg atgctgccga gggacttggg
gagcacagtc tggctggatg 360agacagggtc gtgcccagat gatggagaaa tcgacccaga
agcctgagga ggtgtcctgg 420gtttggctgg ctggctcctg ctccagcggc ccggcttcag
gtgtccgggg gcgtggctgc 480ctggagcagg tgtgctgaat accctggatg ggaactgagc
gaacccgggc ctccgctcag 540agagacgtgg caggaccagc gaggaatcca gcctgtccac
ttccagaaca gtgtttccca 600ggccccgctg agtggaccgg acctctgaca cctccaggtt
cttgctgact ccggcctggt 660gaaagggagc gccatggtcc tggctgttgg ggtcccaggg
agaggctctc ttctggacaa 720acacaccctc ccagccccca gggctgtgca aacacatgcc
cctgccataa gcaccaacaa 780gaacttcttg caggtggagt ggctgttttt tataagttgt
tttacagata cggaaacagt 840ccaaaatggg atttataatt tcttttttgc attataaata
aagatcctct gtaacaaaa 89961476PRTHomo sapiens 614Met Gly Leu Lys Met
Ser Cys Leu Lys Gly Phe Gln Met Cys Val Ser 1 5
10 15 Ser Ser Ser Ser Ser His Asp Glu Ala Pro
Val Leu Asn Asp Lys His 20 25
30 Leu Asp Val Pro Asp Ile Ile Ile Thr Pro Pro Thr Pro Thr Gly
Met 35 40 45 Met
Leu Pro Arg Asp Leu Gly Ser Thr Val Trp Leu Asp Glu Thr Gly 50
55 60 Ser Cys Pro Asp Asp Gly
Glu Ile Asp Pro Glu Ala 65 70 75
61521DNAArtificial SequenceOligonucleotides for PCR 615tgtcctgcct
gaaaggcttt c
2161621DNAArtificial SequenceOligonucleotides for PCR 616catccagggt
attcagcaca c
21617432DNAHomo sapiens 617ttatgtgcct gaagtcgcac agtgaataag ctaaaacacc
tgcttttaac aatggtacca 60tacaaccact actccattaa ctccacccac ctcctgcacc
cctccccaca cacacaaaat 120gaaccacgtt ctttgtatgg gcccaatgag ctgtcaagct
gccctgtgtt catttcattt 180ggaattgccc cctctggttc ctctgtatac tactgcttca
tctctaaaga cagctcatcc 240tcctccttca cccctgaatt tccagagcac ttcatctgct
ccttcatcac aagtccagtt 300ttctgccact agtctgaatt tcatgagaag atgccgattt
ggttcctgtg ggtcctcagc 360actattcagt acagtgcttg actcacagca ggcactcaga
aaatactgga ggaaataaaa 420caccaaagat at
43261821DNAArtificial SequenceOligonucleotides for
PCR 618ctgctccttc atcacaagtc c
2161924DNAArtificial SequenceOligonucleotides for PCR 619gagatctcga
gatctcgatc gtac
246202575DNAHomo sapiens 620gagaacgggg tagcccggcg cttacacatg tcacatgtgc
tttttaagac ggccgggagc 60gcctgcgagc tggatctggt ggaggatgct gcggcaggtg
cttcgcagag ggctccagtc 120gttctgccac aggctgggtt tgtgcgtgag ccggcacccg
gtctttttcc tcaccgtgcc 180cgcagtcctg acaatcacct tcggcctcag cgcgctcaac
cgcttccagc ccgagggcga 240cctggagcgc ctggtcgctc ccagccacag cctggccaag
atcgagcgca gcctggccag 300cagccttttc cccctggacc agtccaaaag ccagctctat
tcggacttac acacccctgg 360gaggtatggc agggtgatcc tcctctcccc aaccggggac
aatattttgc tccaggctga 420ggggatcctg cagacccacc gagccgtgct ggaaatgaag
gatgggagga acagttttat 480tggacaccaa ctgggcgggg tagtggaagt gccaaacagc
aaagatcagc gggtcaagtc 540agccagagcc attcaaatca cctactacct ccagacctat
ggctctgcca cccaagacct 600cataggggag aagtgggaga atgagttctg taagcttata
aggaagctcc aggaggagca 660tcaagaactc cagctctact ctttagcatc ctttagcctc
tggagggact ttcataagac 720cagcatcctg gccagaagca aggtcctggt gagcctcgtg
ctgatcctga ccacagccac 780cctctccagc tccatgaagg actgcttgcg cagtaagccc
ttcctgggcc tcctgggggt 840gctcacagta tgcatctcca tcatcacagc agcagggatc
ttcttcatca ccgatggaaa 900gtacaactcc accctgctgg gaatcccgtt cttcgccatg
ggtcatggaa ctaaaggagt 960gtttgagctt ctgtccggat ggcggagaac caaagagaac
ttgcccttca aagacaggat 1020agcagatgcc tattctgatg tgatggtcac ctataccatg
accagctccc tgtacttcat 1080cacttttggc atgggtgcca gcccattcac aaacatagag
gctgtgaagg tcttctgtca 1140aaacatgtgt gtctctattc tgttgaacta cttctacatt
ttctccttct ttggctcctg 1200tctggtcttt gctggccaac tagagcaaaa ccgctaccac
agcatctttt gctgtaagat 1260cccttctgca gaatacctgg atcgcaaacc tgtgtggttc
cagacagtga tgagtgatgg 1320gcatcaacag acgtcccatc atgagacgaa cccctaccag
caccacttca ttcagcactt 1380cctccgtgaa cattataatg aatggattac caatatatat
gtgaagccat ttgttgtcat 1440cctctatctc atttatgcct ccttctcctt catggggtgc
ttacagatca gtgacggagc 1500caacatcatc aatctactag ccagtgattc gccaagtgtt
tcctatgcca tggttcagca 1560gaaatatttc agcaactata gccctgtgat aggattctac
gtctatgagc ccctagagta 1620ctggaacagc agcgtccagg atgacctaag aagactctgt
agtggattca ctgcagtgtc 1680ctgggtggag cagtactacc agttcctgaa agtcagcaac
gtcagtgcca ataacaaaag 1740tgacttcatc agtgtcctgc aaagctcatt tttaaaaaag
ccagaattcc agcattttcg 1800aaatgatatc atcttctcca aggcagggga tgaaagcaat
atcattgctt ctcgcttgta 1860tctggtggcc aggactagca gagacaagca gaaagaaatc
acagaagtgt tggaaaagct 1920gaggccccta tccctctcaa agagcatccg attcatcgtg
ttcaacccct cctttgtctt 1980catggaccat tacagcttgt ctgtcacagt gcctgttctg
attgcaggct ttggtgttct 2040cctggtgtta atcctgactt ttttcctagt gatccaccct
ctgggaaact tctggctaat 2100tcttagcgtc acctcaattg agctgggcgt tctgggctta
atgacattat ggaacgtcga 2160catggattgc atttctatct tgtgccttat ctacaccttg
aatttcgcca ttgaccactg 2220tgcaccactg cttttcacat ttgtattagc aactgagcac
acccgaacac aatgtataaa 2280aagctccttg caagaccatg ggacagccat tttgcaaaat
gttacttctt ttcttattgg 2340gttagtcccc cttctatttg tgccttcgaa cctgaccttc
acactgttca aatgcttgct 2400gctcactggg ggttgcacac ttctgcactg ttttgttatt
ttacctgtgt tcctaacgtt 2460tttcccccct tccaaaaagc accacaagaa aaagaaacgt
gccaagcgaa aggagagaga 2520ggaaattgaa tgcatagaaa ttcaagagaa cccggatcac
gtcaccacag tatga 2575621829PRTHomo sapiens 621Met Leu Arg Gln Val
Leu Arg Arg Gly Leu Gln Ser Phe Cys His Arg 1 5
10 15 Leu Gly Leu Cys Val Ser Arg His Pro Val
Phe Phe Leu Thr Val Pro 20 25
30 Ala Val Leu Thr Ile Thr Phe Gly Leu Ser Ala Leu Asn Arg Phe
Gln 35 40 45 Pro
Glu Gly Asp Leu Glu Arg Leu Val Ala Pro Ser His Ser Leu Ala 50
55 60 Lys Ile Glu Arg Ser Leu
Ala Ser Ser Leu Phe Pro Leu Asp Gln Ser 65 70
75 80 Lys Ser Gln Leu Tyr Ser Asp Leu His Thr Pro
Gly Arg Tyr Gly Arg 85 90
95 Val Ile Leu Leu Ser Pro Thr Gly Asp Asn Ile Leu Leu Gln Ala Glu
100 105 110 Gly Ile
Leu Gln Thr His Arg Ala Val Leu Glu Met Lys Asp Gly Arg 115
120 125 Asn Ser Phe Ile Gly His Gln
Leu Gly Gly Val Val Glu Val Pro Asn 130 135
140 Ser Lys Asp Gln Arg Val Lys Ser Ala Arg Ala Ile
Gln Ile Thr Tyr 145 150 155
160 Tyr Leu Gln Thr Tyr Gly Ser Ala Thr Gln Asp Leu Ile Gly Glu Lys
165 170 175 Trp Glu Asn
Glu Phe Cys Lys Leu Ile Arg Lys Leu Gln Glu Glu His 180
185 190 Gln Glu Leu Gln Leu Tyr Ser Leu
Ala Ser Phe Ser Leu Trp Arg Asp 195 200
205 Phe His Lys Thr Ser Ile Leu Ala Arg Ser Lys Val Leu
Val Ser Leu 210 215 220
Val Leu Ile Leu Thr Thr Ala Thr Leu Ser Ser Ser Met Lys Asp Cys 225
230 235 240 Leu Arg Ser Lys
Pro Phe Leu Gly Leu Leu Gly Val Leu Thr Val Cys 245
250 255 Ile Ser Ile Ile Thr Ala Ala Gly Ile
Phe Phe Ile Thr Asp Gly Lys 260 265
270 Tyr Asn Ser Thr Leu Leu Gly Ile Pro Phe Phe Ala Met Gly
His Gly 275 280 285
Thr Lys Gly Val Phe Glu Leu Leu Ser Gly Trp Arg Arg Thr Lys Glu 290
295 300 Asn Leu Pro Phe Lys
Asp Arg Ile Ala Asp Ala Tyr Ser Asp Val Met 305 310
315 320 Val Thr Tyr Thr Met Thr Ser Ser Leu Tyr
Phe Ile Thr Phe Gly Met 325 330
335 Gly Ala Ser Pro Phe Thr Asn Ile Glu Ala Val Lys Val Phe Cys
Gln 340 345 350 Asn
Met Cys Val Ser Ile Leu Leu Asn Tyr Phe Tyr Ile Phe Ser Phe 355
360 365 Phe Gly Ser Cys Leu Val
Phe Ala Gly Gln Leu Glu Gln Asn Arg Tyr 370 375
380 His Ser Ile Phe Cys Cys Lys Ile Pro Ser Ala
Glu Tyr Leu Asp Arg 385 390 395
400 Lys Pro Val Trp Phe Gln Thr Val Met Ser Asp Gly His Gln Gln Thr
405 410 415 Ser His
His Glu Thr Asn Pro Tyr Gln His His Phe Ile Gln His Phe 420
425 430 Leu Arg Glu His Tyr Asn Glu
Trp Ile Thr Asn Ile Tyr Val Lys Pro 435 440
445 Phe Val Val Ile Leu Tyr Leu Ile Tyr Ala Ser Phe
Ser Phe Met Gly 450 455 460
Cys Leu Gln Ile Ser Asp Gly Ala Asn Ile Ile Asn Leu Leu Ala Ser 465
470 475 480 Asp Ser Pro
Ser Val Ser Tyr Ala Met Val Gln Gln Lys Tyr Phe Ser 485
490 495 Asn Tyr Ser Pro Val Ile Gly Phe
Tyr Val Tyr Glu Pro Leu Glu Tyr 500 505
510 Trp Asn Ser Ser Val Gln Asp Asp Leu Arg Arg Leu Cys
Ser Gly Phe 515 520 525
Thr Ala Val Ser Trp Val Glu Gln Tyr Tyr Gln Phe Leu Lys Val Ser 530
535 540 Asn Val Ser Ala
Asn Asn Lys Ser Asp Phe Ile Ser Val Leu Gln Ser 545 550
555 560 Ser Phe Leu Lys Lys Pro Glu Phe Gln
His Phe Arg Asn Asp Ile Ile 565 570
575 Phe Ser Lys Ala Gly Asp Glu Ser Asn Ile Ile Ala Ser Arg
Leu Tyr 580 585 590
Leu Val Ala Arg Thr Ser Arg Asp Lys Gln Lys Glu Ile Thr Glu Val
595 600 605 Leu Glu Lys Leu
Arg Pro Leu Ser Leu Ser Lys Ser Ile Arg Phe Ile 610
615 620 Val Phe Asn Pro Ser Phe Val Phe
Met Asp His Tyr Ser Leu Ser Val 625 630
635 640 Thr Val Pro Val Leu Ile Ala Gly Phe Gly Val Leu
Leu Val Leu Ile 645 650
655 Leu Thr Phe Phe Leu Val Ile His Pro Leu Gly Asn Phe Trp Leu Ile
660 665 670 Leu Ser Val
Thr Ser Ile Glu Leu Gly Val Leu Gly Leu Met Thr Leu 675
680 685 Trp Asn Val Asp Met Asp Cys Ile
Ser Ile Leu Cys Leu Ile Tyr Thr 690 695
700 Leu Asn Phe Ala Ile Asp His Cys Ala Pro Leu Leu Phe
Thr Phe Val 705 710 715
720 Leu Ala Thr Glu His Thr Arg Thr Gln Cys Ile Lys Ser Ser Leu Gln
725 730 735 Asp His Gly Thr
Ala Ile Leu Gln Asn Val Thr Ser Phe Leu Ile Gly 740
745 750 Leu Val Pro Leu Leu Phe Val Pro Ser
Asn Leu Thr Phe Thr Leu Phe 755 760
765 Lys Cys Leu Leu Leu Thr Gly Gly Cys Thr Leu Leu His Cys
Phe Val 770 775 780
Ile Leu Pro Val Phe Leu Thr Phe Phe Pro Pro Ser Lys Lys His His 785
790 795 800 Lys Lys Lys Lys Arg
Ala Lys Arg Lys Glu Arg Glu Glu Ile Glu Cys 805
810 815 Ile Glu Ile Gln Glu Asn Pro Asp His Val
Thr Thr Val 820 825
62221DNAArtificial SequenceOligonucleotides for PCR 622cagctctact
ctttagcatc c
2162321DNAArtificial SequenceOligonucleotides for PCR 623cctttagttc
catgacccat g
216246035DNAHomo sapiens 624ggcggcggcg gcggcggccc cgggcgctga gcgggtgccc
ggcgcggaga gcggcgagcg 60cagccatgcc ccaggccgcc tccggggcag cagcagcggc
ggccggggcc gaggcgcggg 120ccgggggcgc cggggggccg gcggcggccc gggcgggacg
atgaagcggc agaacgtgcg 180cacgctggcg ctcatcgtgt gcaccttcac ctacctgctg
gtgggcgccg cggtcttcga 240cgcgctggag tcggagcccg agctgatcga gcggcagcgg
ctggagctgc ggcagcagga 300gctgcgggcg cgctacaacc tcagccaggg cggctacgag
gagctggagc gcgtcgtgct 360gcgcctcaag ccgcacaagg ccggcgtgca gtggcgcttc
gccggctcct tctacttcgc 420catcaccgtc atcaccacca tcggctacgg gcacgcggcg
cccagcacgg atggcggcaa 480ggtgttctgc atgttctacg cgctgctggg catcccgctc
acgctcgtca tgttccagag 540cctgggcgag cgcatcaaca ccttggtgag gtacctgctg
caccgcgcca agaaggggct 600gggcatctcg tggccttcgc ttcgtctcat ccttacgggc
ctcacggtca tcggcgcctt 660cctcaacctc gtggtgctgc gcttcatgac catgaacgcc
gaggacgaga agcgcgacgc 720cgagcaccgc gcgctgctca cgcgcaacgg gcaggcgggc
ggcggcggag ggagtggcag 780cgcgcacact acggacaccg cctcatccac ggcggcagcg
ggcggcggcg gcttccgcaa 840cgtctacgcg gaggtgctgc acttccagtc catgtgctcg
tgcctgtggt acaagagccg 900cgagaagctg cagtactcca tccccatgat catcccgcgg
gacctctcca cgtccgacac 960gtgcgtggag cagagccact cgtcgccggg agggggcggc
cgctacagcg acacgccctc 1020gcgacgctgc ctgtgcagcg gggcgccacg ctccgccatc
agctcggtgt ccacgggtct 1080gcacagcctg tccaccttcc gcggactcat gaagcgcagg
agctccgtgt gactgccccg 1140aggggcctgg agcacctggg ggcgcgggcg ggggacccct
gctgggaggc caggagactg 1200cccctgctgc cttctgccca gtgggacccc gcacaacatc
cctcaccact ctcccccagc 1260acccccatct ccgactgtgc ctgcttgcac cagccggcag
gaggccgggc tctgaggacc 1320cctggggccc ccatcggagc cctgcaaatt ccgagaaatg
tgaaacttgg tggggtcagg 1380gaggaaaggc agaagctggg agcctccctt ccctttgaaa
atctaagaag ctcccagtcc 1440tcagagaccc tgctggtacc cagaccccca ccttcggagg
ggacttcatg ttccgtgtac 1500gtttgcatct ctatttatac ctctgtcctg ctaggtctcc
caccttccct tggttccaaa 1560agccagggtg tctttgtcca agtcacccct actcagcccc
actccctttc ctcatcccca 1620gctgtgtctc ccaacctccc ttcgtgttgt tttgcatggc
tttgcagtta tggagaaagt 1680ggaaacccag cagtccctaa agctggtccc cagaaagcag
gacagaaaga aggagggaca 1740ggcaggcagc aggaggggcg agctgggagg caggaggcag
cggcctgtca gtctgcagaa 1800tggtcgcact ggaggttcaa gctaactggc ctccagccac
attctcatag caggtaggac 1860ttcagccttc cagacactgc ccttagaatc tggaacagaa
gacttcagac tcaccataat 1920tgctgataat tacctactct taaatttgtc gagtgatttt
tagcctctga aaactctatg 1980ctggccactg attcctttga gtctcacaaa accctactta
ggtcatcagg gcaggagttc 2040tcactcccat tttacagatg agaatactga ggcctggaca
ggtgaagtga ccagagagca 2100aaaggcaaag gggtgggggc tgggtgcagt ggctcacacc
tgtattccca acacttttgg 2160aggctgaggt tagaggattg cttgagccca ggaattcgag
accagcctag gcgacatagt 2220gagaccccat ctctacaaaa aataaaaaat ttaccaggtg
tggtggcacg tgcctgggag 2280tcccagcgac ttgggaggct gaggtgggag gattgtttga
gcctgggagg tcaaggctgt 2340agtgagccct gattacacca ctgtactcca gcctgggtga
cagggcaaga ccctgtctca 2400aaaaaaaaaa aaaaatggca aagggagaca agagcccagc
ctacttgttc ctagccaaag 2460tgttctttcc ttccagcttg gcctgctctt aaaagcaaag
ctcctgcagt gtacatcctg 2520gcattgtgtg gctacctggg ttttaaacca gaatcagaag
tcccgggtca gagggcactg 2580ctgaggctca gcctcttctc ttcttggcca ggaggcagca
gctctgaatg ggcccctgag 2640gctgcacagg ggcctttgtc actggggtgc atgcttacaa
acagtgcagt tcttggcacc 2700gaggtaagca gggctgggtc tcatggcaga aaggccagga
tctggggctc taggaatttg 2760ggaattgggc agagtggcca agaaagctgg caggcatatc
ctatgggaca tcacacctgg 2820caccattgtc attgttggtg cctgtgtccc aagtagctag
tgataagctg aggctgcagc 2880aagaaacacc cttcccaggt gggggagttt ggaccagagg
tgccctctgc ccaccacacc 2940tgcaacccag aagcccagat ggaacgcagc tgatgaaggt
gatgcttgag gctcactttt 3000ggggccccac agctggagcc ggtatagtga ctgggacaac
atcaaggggt ggatgagggg 3060cctctcctcc cgcaacactg ccttcccatg ctgttcccct
gccagctcct taacactgcc 3120gaccaaggcc agacctggca ttcaggaaag ttggagggca
gcacccatag ggtggccagc 3180ctcaggcccc accccagctg tgtcctctag tctctgggga
cccctggggg gaagaagtct 3240accctgcttg tgagtcccgt ctcagtgtgg aggaactggc
tgcacgtggg acctgaaggt 3300gccctctgtg tttatgttgg gggggggggg gcagtgctgg
ctgcctctgt cctgtgtgtg 3360accctgccct cgaagggtcc tgtcctgtca gtcccgaggg
agccacaacc aaagctgcgg 3420agagaaggtg gggaacggtg cggagtggcc gtggggcaca
gcgtggcaga ctgttcagtc 3480tctgctgggt ctttcctagg gacctggaag gccagtgttg
cttccccctc actccctttc 3540actgcaggca gcctctctcc ttccccaatg ccttatgcct
gggcacactg ccacagaata 3600tgcaatatgt gtgggtgacg atgccctcac gaccacaccc
ccaccccggg cagcccccgg 3660actccaaagg tcgtggctgc cacagcctcc ctcagctctt
cctgcctatc tgtcttcaca 3720ctgagaatgg cgcccaataa atgctatcca cggagaccag
gctcaggctc cagctgcctc 3780tgtcatcgta tgcccttgct gctgccaggg aggggccatc
tcccaccccc tcccctgccg 3840gggtctacaa acatacctag ctgctgggtg ccgtggctca
cacctataat cacagcacta 3900ggcgggcaga tcacctgagg tcagaagttc aagaccagcc
tggccaacat ggtaaaaccc 3960cgtctctact aaaaatacaa aaattagctg agcgtggtgg
cgcctgtctg tagtcccagc 4020tactcggcta ctcaggaggc tgacgcacga gaatcgcttg
aacccgggag gcggaggttg 4080cagtgagctg agatcgcgcc actgcactcc agcctgagcg
atagagtgag accctgtcta 4140aaaaaaacaa taataataaa ataaaataac atacctagct
gactcgccat gggctcgctg 4200gcctgtgggc gacactggct tcccttttgg gatttcccag
aagatccaga ttttcttaag 4260tccccttgga acagactaag aaagaaacac cttagaaatc
acctggtcct attgtccccc 4320cgtacatgag taactgaggc ccacagagag caaatcgcct
gcctgagtca cacagcagtg 4380agtggcagac ctaggctagg aactagaact ggggattgct
attccagtgc tccccatcct 4440cacacagcct gtggagtccg cctggacaca ccccagctga
cagtggtacc tcccagtcag 4500ccaggagaat ggattccttc tcctgcagta ggggccccct
ggctgagtgg cctgattgac 4560taaaacatat gtctttgaag gagagtgcat cacaagcacc
tttctttggg gtagattttt 4620ctctgggtct agagggacac ctcaggcttg ggactgggcc
tcagaaccta ggacagaccc 4680tgagagcaga cccaccttat ccatctggtg ccagctcccc
aggtcagcta cagcaacccc 4740cgaacttcat agagtacaat ccacagtaat agcacacagc
tctgtaccta tctagctcca 4800tgcctatcta tctgcctacc tttcacaaaa taattcttag
caaccctgct acagccaatg 4860attctaatac gttctgttct attacatgtt ataaaatgct
ggtcacgatc cactaaattg 4920atgtctctac ctgctaatgg tttaatacct gcagattgaa
atatactgga gaaataaaga 4980gagtaggagt agggacactt tctcccagtg cccacaccgc
ccctcgttac ccgcataggt 5040caactgaaag atacagagag ggaagctttg atggggggtt
cagagttcaa aggaagaaat 5100gatggcacct gcactccctg cccccagagg caggacacag
ccagccctcc tgtgacagca 5160ctcctggcag ctccttgttg gcctgcagcc cttagttgcc
attgactcac ccactcctaa 5220ggccaccaca tcaaaatctg aggcttactg ccctggccca
cctgcctctg tctttcttaa 5280aacagctaaa tgcaacgata gcagaaatta gcttgttttt
gaggttggca atgaccagtt 5340caactcttat tttcttaagc agtgcttgca ggacataaat
gtgatgacac ttgccctcct 5400ttctttatcg cctggggcag actttacaaa cagacctggg
aggagtcccc taaggggctg 5460catttatccc catctcccta ggggtgatca gcattgtgac
agctgggcag agcagtggtg 5520aactgcaccc atgtccctgc tcacatctcc taagatctca
gaattgcctg aggttctagc 5580gtgggctcct tctctccaga tgatgccatc cccacccccc
tcatttccac acagcatctg 5640aggcatcctg cactaaaaga tatatgtaca gcaaaacaaa
aatagaaaac cagcacagca 5700gagtggaggt ggggtataaa tatacccaga tccccgctga
tttggttact cggggtgagc 5760atcagatgga aatagaagtt tccgggggcc aagagagaaa
gagggatgta acgacaattc 5820ttttcaaaac gtgtcccatg gtatgcctcg tggaaaaaat
ggttcgttgg tcaaatgaat 5880ttgggaaaat gctgtcaata tcaccgactc atggagcttc
gcaaggcatc ttagcttaat 5940aaaggttatg aaaagtcttg cagcaaagat gctgtttacc
ccacttaatc cagcactgcc 6000caaactcatt ccaaatacca gagcctctgt ttgca
6035625323PRTHomo sapiens 625Met Lys Arg Gln Asn
Val Arg Thr Leu Ala Leu Ile Val Cys Thr Phe 1 5
10 15 Thr Tyr Leu Leu Val Gly Ala Ala Val Phe
Asp Ala Leu Glu Ser Glu 20 25
30 Pro Glu Leu Ile Glu Arg Gln Arg Leu Glu Leu Arg Gln Gln Glu
Leu 35 40 45 Arg
Ala Arg Tyr Asn Leu Ser Gln Gly Gly Tyr Glu Glu Leu Glu Arg 50
55 60 Val Val Leu Arg Leu Lys
Pro His Lys Ala Gly Val Gln Trp Arg Phe 65 70
75 80 Ala Gly Ser Phe Tyr Phe Ala Ile Thr Val Ile
Thr Thr Ile Gly Tyr 85 90
95 Gly His Ala Ala Pro Ser Thr Asp Gly Gly Lys Val Phe Cys Met Phe
100 105 110 Tyr Ala
Leu Leu Gly Ile Pro Leu Thr Leu Val Met Phe Gln Ser Leu 115
120 125 Gly Glu Arg Ile Asn Thr Leu
Val Arg Tyr Leu Leu His Arg Ala Lys 130 135
140 Lys Gly Leu Gly Ile Ser Trp Pro Ser Leu Arg Leu
Ile Leu Thr Gly 145 150 155
160 Leu Thr Val Ile Gly Ala Phe Leu Asn Leu Val Val Leu Arg Phe Met
165 170 175 Thr Met Asn
Ala Glu Asp Glu Lys Arg Asp Ala Glu His Arg Ala Leu 180
185 190 Leu Thr Arg Asn Gly Gln Ala Gly
Gly Gly Gly Gly Ser Gly Ser Ala 195 200
205 His Thr Thr Asp Thr Ala Ser Ser Thr Ala Ala Ala Gly
Gly Gly Gly 210 215 220
Phe Arg Asn Val Tyr Ala Glu Val Leu His Phe Gln Ser Met Cys Ser 225
230 235 240 Cys Leu Trp Tyr
Lys Ser Arg Glu Lys Leu Gln Tyr Ser Ile Pro Met 245
250 255 Ile Ile Pro Arg Asp Leu Ser Thr Ser
Asp Thr Cys Val Glu Gln Ser 260 265
270 His Ser Ser Pro Gly Gly Gly Gly Arg Tyr Ser Asp Thr Pro
Ser Arg 275 280 285
Arg Cys Leu Cys Ser Gly Ala Pro Arg Ser Ala Ile Ser Ser Val Ser 290
295 300 Thr Gly Leu His Ser
Leu Ser Thr Phe Arg Gly Leu Met Lys Arg Arg 305 310
315 320 Ser Ser Val 62621DNAArtificial
SequenceOigonucleotides for PCR 626agactttaca aacagacctg g
2162723DNAArtificial
SequenceOligonucleotides for PCR 627gcttgcagga cataaatgtg atg
2362821DNAArtificial
SequenceOligonucleotides for PCR 628tgacactggc aaaacaatgc a
2162921DNAArtificial
SequenceOligonucleotides for PCR 629ggtccttttc accagcaagc t
2163021DNAArtificial SequencesiRNA
630ccacagaagg uaccaguuau u
2163121DNAArtificial SequencesiRNA 631uaacugguac cuucuguggu u
2163221DNAArtificial SequencesiRNA
632cagcaagacu cccucuaaau u
2163321DNAArtificial SequencesiRNA 633uuuagaggga gucuugcugu u
2163421DNAArtificial
SequenceOligonucleotides for PCR 634tgacactggc aaaacaatgc a
2163521DNAArtificial
SequenceOligonucleotides for PCR 635ggtccttttc accagcaagc t
2163621DNAArtificial SequencesiRNA
636nnccacagaa gguaccaguu a
2163721DNAArtificial SequencesiRNA 637ccacagaagg uaccaguuau u
2163821DNAArtificial SequencesiRNA
638uaacugguac cuucuguggu u
2163921DNAArtificial SequencesiRNA 639nncagcaaga cucccucuaa a
2164021DNAArtificial SequencesiRNA
640cagcaagacu cccucuaaau u
2164121DNAArtificial SequencesiRNA 641uuuagaggga gucuugcugu u
21
User Contributions:
Comment about this patent or add new information about this topic: