Patent application title: METHOD OF CLASSIFYING GENE EXPRESSION STRENGTH IN LUNG CANCER TISSUES
Inventors:
Takashi Takahashi (Nagoya-City, JP)
Shuta Tomita (Nagoya-City, JP)
Tetsuya Mitsudomi (Nagoya-City, JP)
Yasushi Yatabe (Nagoya-City, JP)
Nobuhiko Ogura (Ashigarakami-Gun, JP)
Masato Some (Ashigarakami-Gun, JP)
Assignees:
FUJIFILM CORPORATION
AICHI PREFECTURE
IPC8 Class: AC12Q168FI
USPC Class:
506 9
Class name: Combinatorial chemistry technology: method, library, apparatus method of screening a library by measuring the ability to specifically bind a target molecule (e.g., antibody-antigen binding, receptor-ligand binding, etc.)
Publication date: 2013-11-14
Patent application number: 20130303389
Abstract:
The present invention provides a method of confirming the gene
expression, useful in the decision of a five year survival rate of a
patient with lung cancer and the use of a DNA probe kit in the method. A
method useful in the decision of a survival rate of a patient with
non-small cell lung cancer comprising confirming the expression strength
of at least one gene in lung cancer tissues isolated from the patient.
Claims:
1. (canceled)
2. A method for predicting a survival rate of a patient with non-squamous cell lung cancer comprising confirming the expression strength of at least one gene selected from the group consisting of SEQ ID NO: 9, SEQ ID NO: 1, SEQ ID NO: 6, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 20, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 11, SEQ ID NO: 61, SEQ ID NO: 7, SEQ ID NO: 62, SEQ ID NO: 2, SEQ ID NO: 63, SEQ ID NO: 14, SEQ ID NO: 21, SEQ ID NO: 64, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 71, SEQ ID NO: 5, SEQ ID NO: 72, SEQ ID NO: 18, SEQ ID NO: 73 and SEQ ID NO: 14 in lung cancer tissues isolated from the patient.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This is a divisional of U.S. patent application Ser. No. 11/008,265, filed Dec. 10, 2004 (presently allowed). The entire disclosure of the prior application is considered part of the disclosure and is hereby incorporated by reference.
TECHNICAL FIELD
[0002] The present invention relates to a method of confirming the expression of a specific gene in lung cancer tissues, used in a technique of predicting a five year survival rate of a patient with lung cancer with high accuracy.
BACKGROUND OF THE INVENTION
[0003] When various therapies are applied to patients with cancer (carcinoma), a five year survival rate is often used as a measure of cure. That is, a five year survival rate is the probability that a patient who underwent a cancer diagnosis or therapy will survive for five years or more thereafter. This probability is used to represent the progressive level (stage) of a cancer, a therapeutic effect and the like.
[0004] Until now, the TNM classification, which combines the size of the tumor (tumor diameter, represented by T), the range over which metastasis to lymph nodes is observed (represented by N) and the presence or absence of distant metastasis (represented by M), each determined by clinical methods, has mainly been used ("Cancer of the lung," written by Robert Ginsberg et al., 5th edition, pp. 858 to 910, Lippincott-Raven (1997)). For example, patients judged to be in stage I under the TNM classification are those at a progressive level such that a little over 60% of the patients could survive for five years if the cancer is resected by surgery. Patients judged to be in stage III are those at a progressive level such that at most 20% of the patients could survive for five years even under the same condition.
[0005] Recently, focusing on one or two genes specifically expressed in cancer patients or cancer tissues, a therapeutic effect is often predicted by determining the difference in the expression of said gene(s) between patients showing superior therapeutic effect and patients showing poor therapeutic effect (Horio et al, Cancer Research, Vol. 54, pp. 1 to 4, Jan. 1, 1993).
SUMMARY OF THE INVENTION
[0006] However, the TNM classification cannot be applied unless the outcomes of many clinical tests are accumulated. Thus, this classification cannot be said to be simple, and its accuracy is not satisfactory. Moreover, for methods of predicting a therapeutic effect by confirming the expression of a specific gene, the correlation between gene expression in patients with lung cancer and the five year survival rate of those patients has not been reported.
[0007] An object of the present invention is to accurately decide a survival rate of patients, especially those with lung cancer. In the present invention, the expression of a specific gene in lung cancer tissues is confirmed.
[0008] Accordingly, the present invention relates to a method useful in the decision of a survival rate of a patient with non-small cell lung cancer comprising confirming the expression strength of at least one gene selected from the group consisting of WEE1 (AA039640), MYC (AA464600), TITF1 (T60168), FOSL1 (T82817), LYPLA1 (H00817), SSBP1 (R05693), SFTPC (AA487571), THBD (H59861), NICE-4 (AA054954), PTN (AA001449), SNRPB (AA599116), NAP1L1 (R93829), CTNND1 (AA024656), CCT3 (R60933), DSC2 (AA074677), SPRR1B (AA447835), COPB (AA598868), ARG1 (AA453673), ARCN1 (AA598401), MST1 (T47813), SERPINE1 (N75719), SERPINB1 (AA486275), EST fragment (N73201), ACTR3 (N34974), PTP4A3 (AA039851), ISLR (H62387), ANXA1 (H63077), GJA1 (AA487623), HSPE1 (AA448396) and PSMA5 (AA598815) in lung cancer tissues isolated from the patient.
[0009] And, the present invention provides a method useful in the decision of a survival rate of a patient with squamous cell lung cancer comprising confirming the expression strength of at least one gene selected from the group consisting of FLJ20619 (R74480), SPC12 (R19183), EST fragment (R96358), KRT5 (AA160507), PTP4A3 (AA039851), SPRR1B (AA447835), LOC339324 (W23522), MYST4 (AA057313), SPARCL1 (AA990699), IGJ (T70057), EIF4A2 (H05919), EST fragment (AA115121), ID2 (H82706), THBD (H59861), MGC15476 (W72525), ZFP (H53499), COPB (AA598868), ZYG (AA453289), CACNA1I (N52765), FLJ4623 (N71473), CSTB (H22919), EPB41L1 (R71689), MGC4549 (AA455267), EST fragment (T64878), DSC2 (AA074677), EST fragment (H79007), EST fragment (W84776), IFI30 (AA630800), EST fragment (T81155) and IL1RN (T72877) in lung cancer tissues isolated from the patient.
[0010] Further, the present invention provides a method useful in the decision of the survival rate of a patient with non-squamous cell lung cancer comprising confirming the expression strength of at least one gene selected from the group consisting of NICE-4 (AA054954), WEE1 (AA039640), SSBP1 (R05693), WFDC2 (AA451904), ACTA2 (AA634006), G22P1 (AA486311), MST1 (T47813), PHB (R60946), DRPLA (H08642), SNRPB (AA599116), GJA1 (AA487623), SFTPC (AA487571), ACTR1A (R40850), MYC (AA464600), RAD23B (AA489678), CCT3 (R60933), SERPINE1 (N75719), LAMP1 (H29077), IRAK1 (AA683550), BIRC2 (R19628), LMAN1 (H73420), HSPE1 (AA448396), TMSB4X (AA634103), EEF1G (R43973), EST fragment (H05820), LYPLA1 (H00817), SOD1 (R52548), ARG1 (AA453673), KRT25A (W73634) and FOSL1 (T82817) in lung cancer tissues isolated from the patient.
[0011] Another aspect of the present invention relates to the use in the above method of a DNA probe comprising a nucleic acid sequence specifically hybridizing to at least one gene targeted in this method.
[0012] All genes whose expression is to be confirmed in the present invention are known genes. The nucleotide sequence of each gene is registered in "UniGene", one of the public databases provided by NCBI, with its abbreviated name and its accession number, represented by a combination of letters (such as AA) and numerals. In the present specification including the claims, all of the genes to be confirmed in the method of the present invention are represented with the abbreviated names and the accession numbers registered in "UniGene" on Nov. 19, 2003. Since a gene can be specified with the abbreviated name and the accession number registered in "UniGene", those skilled in the art can easily confirm a gene in question and its detailed nucleotide sequence by referring to "UniGene" and carry out the present invention. Similarly, as to the nucleic acid sequence of a DNA probe specific for each gene used in the method of the present invention, those skilled in the art can easily determine candidate sequences for each gene based on the nucleic acid sequence registered in the above database using a homology searching program or the like. In particular, the nucleic acid sequence of the probe of the present invention is not limited so long as it is selected such that the probe can specifically hybridize to the corresponding gene. It need not be restricted to a single nucleic acid sequence. Such a selection can be made by those skilled in the art without any special effort.
[0013] The present inventors searched for genes specifically expressed in lung cancer tissues of patients who underwent non-small cell lung cancer diagnosis or therapy and who either died within five years thereafter or survived for five or more years thereafter. As a result, they found that there is a specific relationship between five year survival and the gene expression pattern.
[0014] Focusing on genes whose expression amounts were specifically increased or decreased in cancer tissues of the group of patients who died within five years after operation or diagnosis as compared with the group of patients who survived for more than five years after operation or diagnosis, the present inventors selected predictive genes capable of distinguishing the two groups efficiently using a signal-to-noise metric (Golub et al., Science, Vol. 286, pp. 531 to 537 (1999)). Briefly, if a prognosis favorable patient and a prognosis fatal patient are defined to belong to class 0 and class 1 respectively, a signal-to-noise statistic (Sx) for gene x is calculated as follows:
Sx=(μclass 0-μclass 1)/(δclass 0+δclass 1)
For each gene, μclass 0 is the average of the data on total expression strength of patients belonging to class 0 (the group of prognosis favorable patients) and δclass 0 is the standard deviation of those data; μclass 1 and δclass 1 are defined likewise for class 1 (the group of prognosis fatal patients). Genes ranked higher by the absolute value of the thus-calculated Sx, i.e. genes showing a significant difference in expression strength between the group of prognosis favorable patients and the group of prognosis fatal patients, were selected.
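The signal-to-noise ranking described above can be illustrated with a minimal Python sketch. The function names `signal_to_noise` and `rank_genes`, the dictionary layout of the expression data, and the use of the population standard deviation (`statistics.pstdev`) are assumptions made for illustration only; the specification does not state which form of the standard deviation was used.

```python
from statistics import mean, pstdev


def signal_to_noise(class0, class1):
    """Sx = (mean of class 0 - mean of class 1) / (sd of class 0 + sd of class 1).

    class0: expression strengths of prognosis favorable patients for one gene.
    class1: expression strengths of prognosis fatal patients for the same gene.
    The population standard deviation is an assumption of this sketch.
    """
    spread = pstdev(class0) + pstdev(class1)
    if spread == 0:
        raise ValueError("both classes have zero spread; Sx is undefined")
    return (mean(class0) - mean(class1)) / spread


def rank_genes(expression, labels, top_n):
    """Rank genes by |Sx| and return the top_n (gene, Sx) pairs.

    expression: dict mapping gene name -> list of strengths, one per patient.
    labels: list with 0 (favorable) or 1 (fatal) for each patient.
    """
    scores = {}
    for gene, values in expression.items():
        c0 = [v for v, y in zip(values, labels) if y == 0]
        c1 = [v for v, y in zip(values, labels) if y == 1]
        scores[gene] = signal_to_noise(c0, c1)
    ranked = sorted(scores, key=lambda g: abs(scores[g]), reverse=True)
    return [(g, scores[g]) for g in ranked[:top_n]]
```

With this sign convention, a gene whose expression is higher in prognosis fatal patients receives a negative Sx, and only the absolute value is used for ranking.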
[0015] In order to assay the statistical significance of a marker gene specific for each type of cancer, the sample label (prognosis favorable or fatal) of each patient used in the analysis was randomly reassigned with respect to the set of data on gene expression strength, and the signal-to-noise value (Sx value) was then recalculated in accordance with the randomized labels. This procedure was repeated 10,000 times. A P value was assigned to each gene based on how often the Sx value obtained with randomized labels was better than the Sx value actually obtained.
[0016] When genes judged to be significantly related to the survival rate of patients with each type of lung cancer, i.e. predictive genes, were searched for among genes expressed in cancer tissues of the patients, the following correlations became clear.
[0017] Thus, the following expression pattern was observed: in many lung cancer tissues of patients who underwent non-small cell lung cancer diagnosis or therapy and died within five years thereafter, the expression of each of WEE1 (AA039640), MYC (AA464600), FOSL1 (T82817), LYPLA1 (H00817), SSBP1 (R05693), THBD (H59861), NICE-4 (AA054954), PTN (AA001449), SNRPB (AA599116), NAP1L1 (R93829), CTNND1 (AA024656), CCT3 (R60933), DSC2 (AA074677), SPRR1B (AA447835), COPB (AA598868), ARG1 (AA453673), ARCN1 (AA598401), MST1 (T47813), SERPINE1 (N75719), SERPINB1 (AA486275), ACTR3 (N34974), PTP4A3 (AA039851), ISLR (H62387), ANXA1 (H63077), GJA1 (AA487623), HSPE1 (AA448396) and PSMA5 (AA598815) was significantly increased and the expression of each of TITF1 (T60168), SFTPC (AA487571) and EST fragment (N73201) was significantly lowered. Hereinafter, the group comprising the above genes is referred to as gene group 1.
[0018] Accordingly, by extracting total RNAs from cancer tissues of a patient diagnosed with non-small cell lung cancer and confirming the expression strength of at least one gene belonging to gene group 1, it is possible to predict whether the patient will die within five years or survive for five or more years.
[0019] For example, when PTP4A3 (AA039851, fatal) is selected as a gene and a five year survival rate is predicted based on the outcome obtained by confirming the expression strength of this gene, an accuracy of 64% can be expected. When WEE1 (AA039640, fatal) or ACTR3 (N34974, fatal) is selected as a gene in addition to PTP4A3 (AA039851, fatal) and a five year survival rate is predicted based on the outcomes obtained by confirming the expression strength of these genes, the accuracy will be 66% or 74%, respectively. And, based on the outcomes obtained by confirming the expression strength of all genes constituting gene group 1, the accuracy will reach 82%. The above outcomes have reliability higher than that of the prior method.
[0020] Although non-small cell lung cancer is further classified into squamous cell cancer (SQ) and non-squamous cell cancer (non-SQ), gene group 1 is useful as a gene group selected when a five year survival rate is decided without subdividing the type of lung cancer cells.
[0021] On the other hand, the present inventors confirmed the gene expression strength separately for squamous cell cancer (SQ) and non-squamous cell cancer (non-SQ) and, as a result, found that a five year survival rate can be decided more accurately by using gene groups different from gene group 1 as targets.
[0022] Thus, the following expression pattern was observed: in many lung cancer tissues of patients who underwent squamous cell cancer diagnosis or therapy and died within five years thereafter, the expression of each of KRT5 (AA160507), PTP4A3 (AA039851), SPRR1B (AA447835), MYST4 (AA057313), SPARCL1 (AA490694), IGJ (T70057), EST fragment (AA115121), ID2 (H82706), THBD (H59861), MGC15476 (W72525), COPB (AA598868), ZYG (AA453289), CACNA1I (N52765), CSTB (H22919), EPB41L1 (R71689), MGC4549 (AA455267), DSC2 (AA074677), IFI30 (AA630800), EST fragment (T81155) and IL1RN (T72877) was significantly increased and the expression of each of FLJ20619 (R74480), SPC12 (R19183), EST fragment (R96358), LOC339324 (W23522), EIF4A2 (H05919), ZFP (H53499), FLJ4623 (N71473), EST fragment (T64878), EST fragment (H79007) and EST fragment (W84776) was significantly lowered. Hereinafter, the group comprising the above genes is referred to as gene group 2.
[0023] Accordingly, by extracting total RNAs from cancer tissues of a patient diagnosed with squamous cell cancer and confirming the expression strength of at least one gene belonging to gene group 2, it is possible to predict whether the patient will die within five years or survive for five or more years.
[0024] For example, when CACNA1I (N52765, fatal) is selected as a gene and a five year survival rate is predicted based on the outcome obtained by confirming the expression strength of this gene, an accuracy of 81% can be expected. When FLJ20619 (R74480, favorable) is selected as a gene in addition to CACNA1I (N52765, fatal) and a five year survival rate is predicted based on the outcomes obtained by confirming the expression strength of these genes, an accuracy will be 75% or 81%. And, based on the outcomes obtained by confirming the expression strength of all genes constituting gene group 2, the accuracy will reach 100%.
[0025] And, the following expression pattern was observed: in many lung cancer tissues of patients who underwent non-squamous cell cancer diagnosis or therapy and died within five years thereafter, the expression of each of NICE-4 (AA054954), WEE1 (AA039640), SSBP1 (R05693), G22P1 (AA486311), MST1 (T47813), PHB (R60946), DRPLA (H08642), SNRPB (AA599116), GJA1 (AA487623), ACTR1A (R40850), MYC (AA464600), RAD23B (AA489678), CCT3 (R60933), SERPINE1 (N75719), BIRC2 (R19628), LMAN1 (H73420), HSPE1 (AA448396), EEF1G (R43973), EST fragment (H05820), LYPLA1 (H00817), SOD1 (R52548), ARG1 (AA453673), KRT25A (W73634) and FOSL1 (T82817) was significantly increased and the expression of each of WFDC2 (AA451904), ACTA2 (AA634006), SFTPC (AA487571), LAMP1 (H29077), IRAK1 (AA683550) and TMSB4X (AA634103) was significantly lowered. Hereinafter, the group comprising the above genes is referred to as gene group 3.
[0026] Accordingly, by extracting total RNAs from cancer tissues of a patient diagnosed with non-squamous cell cancer and confirming the expression strength of at least one gene belonging to gene group 3, it is possible to predict whether the patient will die within five years or survive for five or more years.
[0027] For example, when SFTPC (AA487571, favorable) is selected as a gene and a five year survival rate is predicted based on the outcome obtained by confirming the expression strength of this gene, an accuracy of 56% can be expected. When NICE-4 (AA054954, fatal) or GJA1 (AA487623, fatal) is selected as a gene in addition to SFTPC (AA487571, favorable) and a five year survival rate is predicted based on the outcomes obtained by confirming the expression strength of these genes, the accuracy will be 79% or 76%, respectively. And, based on the outcomes obtained by confirming the expression strength of all genes constituting gene group 3, the accuracy will reach 91%.
[0028] As mentioned above, although a single gene may be freely selected from each gene group and used, it is preferable to select two or more genes, and more preferably all genes belonging to each gene group, as targets.
[0029] Further, the present invention provides a way to decide, based on the above correlation, whether new patients will survive or die, using information about samples γ obtained from cancer tissues of those patients.
[0030] In order to decide whether a new patient with lung cancer (test sample γ) will be prognosis favorable or fatal after five years, Vx may be calculated for each gene contained in a set of predictive genes from the equation: Vx=Sx(Gx^γ-bx), wherein Sx is the above-mentioned signal-to-noise statistic; Gx^γ represents the expression strength in sample γ of each gene x contained in the set of predictive genes; and bx is calculated from the equation: bx=(μclass 0+μclass 1)/2. When the sum of Vx (ΣVx) over the genes contained in the set of predictive genes is positive (+), the patient in question is decided to be "prognosis favorable". When ΣVx is negative (-), the patient in question is decided to be "prognosis fatal".
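The weighted-voting decision described above can be sketched in a few lines of Python. The function name `weighted_vote`, the layout of the `model` dictionary, and the handling of a sum of exactly zero (reported here as "undetermined") are assumptions for illustration; the specification only defines the positive and negative cases.

```python
def weighted_vote(sample, model):
    """Classify one test sample by the sign of the summed votes.

    sample: dict mapping gene name -> expression strength in the test sample.
    model:  dict mapping gene name -> (Sx, bx), where Sx is the signed
            signal-to-noise statistic and bx = (mean class 0 + mean class 1) / 2.
    Returns (decision, total), where total is the sum of Vx = Sx * (Gx - bx).
    """
    total = sum(sx * (sample[gene] - bx) for gene, (sx, bx) in model.items())
    if total > 0:
        return "prognosis favorable", total
    if total < 0:
        return "prognosis fatal", total
    # A sum of exactly zero is not covered by the text; this label is an assumption.
    return "undetermined", total
```

For example, with a hypothetical two-gene model `{"g1": (-2.0, 4.0), "g2": (1.0, 3.0)}` and a sample `{"g1": 6.0, "g2": 3.5}`, the votes are -4.0 and +0.5, so the sum is -3.5 and the sample is decided to be "prognosis fatal".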
BRIEF DESCRIPTION OF DRAWINGS
[0031] FIG. 1 represents the outcomes obtained by predicting patients with non-small cell lung cancer using 25 predictive genes in a weighted-voting model.
[0032] FIG. 2 is a survival curve showing the prognosis "favorable" or "fatal" of patients with non-small cell lung cancer.
[0033] FIG. 3 represents the outcomes obtained by predicting patients with non-squamous cell lung cancer using 12 predictive genes in a weighted-voting model.
[0034] FIG. 4 represents the outcomes obtained by predicting patients with squamous cell lung cancer using 19 predictive genes in a weighted-voting model.
[0035] FIG. 5 is a survival curve showing the prognosis "favorable" or "fatal" of patients with non-squamous cell lung cancer.
[0036] FIG. 6 is a survival curve showing the prognosis "favorable" or "fatal" of patients with squamous cell lung cancer.
EFFECT OF THE INVENTION
[0037] By using the method of the present invention, a five year survival rate of patients with lung cancer can be predicted with high accuracy. Therefore, according to the present invention, it is possible to predict with high accuracy whether or not a patient with a given type of lung cancer will survive for five or more years by confirming the expression of a specified gene group in cancer tissues of the patient.
DISCLOSURE OF THE INVENTION
[0038] The expression strength of each gene belonging to the gene groups specified in the present invention can be confirmed by providing a specific probe for each nucleotide sequence and conducting PCR or hybridization. The nucleotide sequence of each gene can be easily confirmed from the database "UniGene". And, conditions such as the design of a probe specifically hybridizing to each gene, its synthesis, hybridization and the like can be suitably determined by those skilled in the art without any special effort.
[0039] The probe can be synthesized as a set of probes that can be used in a PCR reaction for each gene, i.e. PCR primers. The expression strength may be confirmed by conducting a PCR reaction using these primers.
[0040] In practicing the present method, the expression of a gene is preferably confirmed using a so-called microarray. As microarrays, a glass substrate on which probe DNAs are spotted; a membrane on which probe DNAs are spotted; beads on which probe DNAs are spotted; a glass substrate on which probes are directly synthesized; and the like have been developed. Examples of the microarray include a membrane microarray available from Invitrogen (GeneFilters®, Mammalian Microarrays; Catalog #GF200 or GF201). This membrane microarray contains 11168 spots in total of probe DNA corresponding to 8644 independent genes. It has been confirmed by BLAST search that the sequence of each probe does not cause so-called cross hybridization even when one or more genes closely related to that sequence are present; otherwise, the expression of such genes would be detected erroneously.
[0041] Examples of the microarray available in the present invention include cDNA or oligo-arrays available from Affymetrix, Agilent and other companies, in addition to the membrane microarray available from Invitrogen.
[0042] In the present invention, it is desirable to immediately freeze cancer tissues isolated from a patient with lung cancer during thoracotomy or by biopsy with an endoscope or the like, prepare a slice, prepare a tissue section by finely excising regions rich in cancer cells from the slice, extract RNAs from the tissue section according to any standard method, and convert all mRNAs expressed in the tissue into cDNA by the action of a reverse transcriptase. In this case, the targeted gene group can be labeled by incorporating into the cDNA a suitable radioisotope such as 33P or a fluorochrome such as Cy3 or Cy5 during the preparation of the cDNA by the reverse transcriptase reaction.
[0043] According to the present invention, based on the information about the nucleotide sequence of the gene contained in each gene group, the expression strength of the gene to be detected can be confirmed by hybridization or real time PCR using an oligoDNA specific for each gene to be detected. Preferably the expression of each gene group to be detected is confirmed more easily by combining cDNAs prepared with a reverse transcriptase and a suitable label with a microarray.
[0044] The expression strength of a gene group targeted in the present invention can be confirmed easily by hybridizing a labeled cDNA and a microarray under suitable conditions and then confirming the expression of the genes and their amounts as an index of the label. The expression strength is confirmed by quantifying the strength of a signal produced from the label by a suitable method.
[0045] For example, when a radioactive label is used, the signal strength can be quantified by exposing a hybridized array to an imaging plate (Fuji Photo Film), scanning and imaging using a bioimaging analyzer BAS 5000 (Fuji Photo Film), processing images of the hybridized array using L Process (Fuji Photo Film) and then analyzing them using the analysis software Array Gauge (Fuji Photo Film). Alternatively, the strength of a radioactive label can be quantified using a phospho-imager (Amersham). And, the strength of a fluorescent label can be quantified using a microarray reader (Agilent) or the like.
[0046] The thus-obtained data on label strength are each converted to data on hybridization strength by using, for example, the method of Tseng et al. (Nucleic Acids Res., Vol. 29, pp. 2549 to 2557). Thereafter, the reproducibility of expression is evaluated after normalization, preparation of scatter plots for each gene and the like. Thus, a significant increase or decrease in the expression amount of a targeted gene may be evaluated.
EXAMPLES
[0047] The present invention will be described in more detail by referring to the following examples which are not to be construed as limiting the scope of the invention.
Example 1
[0048] In the following example, all procedures using commercially available kits were conducted under the conditions recommended by the manufacturers unless otherwise stated.
1) Extraction of Total RNAs from Lung Cancer Tissue
[0049] From each of 50 patients (15 females and 35 males; between the ages of 43 and 76, average age of 63) with non-small cell lung cancer, specifically 30 patients with glandular lung cancer, 16 patients with squamous cell lung cancer and 4 patients with large cell lung cancer (23 patients with stage I, 11 patients with stage II and 16 patients with stage III), lung cancer tissues (0.5 g on average) were isolated. The tissues were embedded in OCT compound and frozen at -80° C., and a frozen sample 7 μm in thickness was prepared. Then, a region rich in cancer cells was carefully excised from the sample to obtain a section in which cancer cells accounted for 75.4% on average of the cells contained therein. From this section, total RNAs (12 μg on average) were extracted using RNeasy (Qiagen) and their purity was confirmed using the RNA 6000 Nano assay kit and 2100 Bioanalyzer (Agilent).
2) Hybridization to Microarray
[0050] 5 micrograms of the total RNAs as prepared in the above 1) was transformed into cDNA using oligo-dT primer (Invitrogen) and Superscript II reverse transcriptase (Invitrogen) by adding 10 μCi of [32P] dCTP. GeneFilters (Invitrogen) was prehybridized in 10 ml of AlkPhos DIRECT hybridization buffer (Amersham) containing 0.5 μg/ml of poly-dA (Invitrogen) and 0.5 μg/ml of Cot-1 DNA (Invitrogen) at 51° C. for 2 hours and then hybridized with a modified radiolabeled probe cDNA for 17 hours.
[0051] After hybridization, the microarray was washed successively twice with a solution containing 2M urea, 0.1% SDS, 50 mM sodium phosphate buffer solution (pH 7.0), 150 mM NaCl, 1 mM MgCl2 and 0.2% AlkPhos DIRECT blocking reagent (Amersham), twice with a solution containing 2 mM MgCl2, 50 mM Tris and 100 mM NaCl, and twice with a solution containing 2 mM MgCl2, 50 mM Tris and 15 mM NaCl. The microarray was exposed to an imaging plate (Fuji Photo Film) for 2 hours and then the imaging plate was scanned and imaged using a bioimaging analyzer BAS 5000 (Fuji Photo Film) at a resolution of 25 μm. The image of the hybridized array was processed with L Process (Fuji Photo Film) and then the signal strength was quantified using the analysis software Array Gauge (Fuji Photo Film).
3) Data Processing
[0052] The data on signal strength obtained in the above 2) were each converted to data on hybridization strength. First, the method of Tseng et al. (Nucleic Acids Res., Vol. 29, pp. 2549 to 2557) was employed for selecting the genes used in fitting a non-linear normalization curve. After normalization, scatter plots of the 50 sets of replicate data on each gene were prepared and the reproducibility of expression between replicate pairs was evaluated. Genes showing a Pearson correlation coefficient of 0.85 or higher were selected. The average of the first hybridization and the second hybridization was used for further analysis. In addition, genes not showing at least a two-fold increase or decrease in expression level were excluded. Genes having a median intensity of less than 0.3 were also excluded from the following analysis.
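The replicate-based filtering described above can be sketched as follows. This is only an illustration under stated assumptions: the function names `pearson` and `filter_and_average` and the dictionary layout are hypothetical, the median-intensity filter is applied to the averaged replicates, a gene with zero variance in either replicate is treated as having a correlation of 0, and the fold-change filter is omitted because the text does not specify the reference against which the change is measured.

```python
from math import sqrt
from statistics import mean, median


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return 0.0  # assumption: a constant profile is treated as not reproducible
    return cov / (sx * sy)


def filter_and_average(rep1, rep2, min_r=0.85, min_median=0.3):
    """Keep reproducible, sufficiently intense genes and average the replicates.

    rep1, rep2: dicts mapping gene name -> list of normalized strengths,
                one value per patient, for the first and second hybridization.
    Returns a dict mapping each retained gene to its per-patient average.
    """
    kept = {}
    for gene in rep1:
        first, second = rep1[gene], rep2[gene]
        if pearson(first, second) < min_r:
            continue  # poor reproducibility between replicate pairs
        averaged = [(a + b) / 2 for a, b in zip(first, second)]
        if median(averaged) < min_median:
            continue  # median intensity too low
        kept[gene] = averaged
    return kept
```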
4) Isolation of Gene for Five Year Survival
[0053] Predictive genes that most efficiently distinguish patients who would die within five years after operation or diagnosis (prognosis fatal patients) from patients who would survive for more than five years after operation or diagnosis (prognosis favorable patients) were selected using a signal-to-noise metric (Golub et al., Science, Vol. 286, pp. 531 to 537 (1999)). Briefly, if a prognosis favorable patient and a prognosis fatal patient are defined to belong to class 0 and class 1 respectively, a signal-to-noise statistic (Sx) is calculated as follows:
Sx=(μclass 0-μclass 1)/(δclass 0+δclass 1)
As to each gene, μclass 0 means an average of data on total expression strength of patients belonging to class 0 (the group of prognosis favorable patients) and δclass 0 means a standard deviation of data on total expression strength of patients belonging to class 0 (the group of prognosis favorable patients).
[0054] Genes ranked higher based on the absolute value of Sx were selected. In order to predict the outcomes using the thus-selected genes, a weighted-voting classification algorithm was employed. The thus-obtained outcome classifiers were tested using leave-one-out cross validation. In this scheme, in addition to calculating Sx, the algorithm finds for each gene the decision boundary midway between the class averages, bx=(μclass 0+μclass 1)/2.
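A leave-one-out cross validation of a weighted-voting classifier of this kind can be sketched as below. The names `train`, `predict` and `loocv_accuracy` are hypothetical. The sketch assumes that gene selection is repeated inside every fold, that the population standard deviation is used, and that a vote sum of exactly zero is counted as class 1; none of these details is stated in the text.

```python
from statistics import mean, pstdev


def train(expression, labels, n_genes):
    """Return {gene: (Sx, bx)} for the n_genes with the largest |Sx|."""
    model = {}
    for gene, values in expression.items():
        c0 = [v for v, y in zip(values, labels) if y == 0]
        c1 = [v for v, y in zip(values, labels) if y == 1]
        spread = pstdev(c0) + pstdev(c1)
        if spread == 0:
            continue  # Sx undefined for this gene; skip it
        m0, m1 = mean(c0), mean(c1)
        model[gene] = ((m0 - m1) / spread, (m0 + m1) / 2)
    top = sorted(model, key=lambda g: abs(model[g][0]), reverse=True)[:n_genes]
    return {g: model[g] for g in top}


def predict(model, sample):
    """Return 0 (favorable) if the summed votes are positive, else 1 (fatal)."""
    total = sum(sx * (sample[g] - bx) for g, (sx, bx) in model.items())
    return 0 if total > 0 else 1


def loocv_accuracy(expression, labels, n_genes):
    """Fraction of patients predicted correctly when each is held out in turn."""
    n = len(labels)
    correct = 0
    for i in range(n):
        # Train on every patient except patient i.
        train_expr = {g: v[:i] + v[i + 1:] for g, v in expression.items()}
        train_labels = labels[:i] + labels[i + 1:]
        model = train(train_expr, train_labels, n_genes)
        held_out = {g: v[i] for g, v in expression.items()}
        if predict(model, held_out) == labels[i]:
            correct += 1
    return correct / n
```

Calling `loocv_accuracy` for increasing values of `n_genes` gives an error curve from which the number of predictive genes with the highest accuracy can be chosen.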
5) Permutation Test
[0055] In order to assay the statistical significance of a marker gene specific for each type of cancer, the sample label (survival or dead) of each patient used in the analysis was randomly reassigned with respect to the set of data on gene expression strength, and the signal-to-noise value (Sx value) for each gene was then recalculated in accordance with the randomized labels. This procedure was repeated 10,000 times. A P value was assigned to each gene based on how often the Sx value obtained with randomized labels was better than the Sx value actually obtained.
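The permutation test can be illustrated with the following minimal sketch for a single gene. The function names `s2n` and `permutation_p_value`, the fixed random seed, the comparison on the absolute value of Sx with "greater than or equal to" as the meaning of "better", and the treatment of a degenerate split as Sx = 0 are all assumptions made for illustration.

```python
import random
from statistics import mean, pstdev


def s2n(values, labels):
    """Signal-to-noise statistic for one gene under the given 0/1 labels."""
    c0 = [v for v, y in zip(values, labels) if y == 0]
    c1 = [v for v, y in zip(values, labels) if y == 1]
    spread = pstdev(c0) + pstdev(c1)
    if spread == 0:
        return 0.0  # assumption: a degenerate split is treated as uninformative
    return (mean(c0) - mean(c1)) / spread


def permutation_p_value(values, labels, n_perm=10000, seed=0):
    """Fraction of label shuffles whose |Sx| is at least the observed |Sx|."""
    rng = random.Random(seed)
    observed = abs(s2n(values, labels))
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # class sizes are preserved, only assignment changes
        if abs(s2n(values, shuffled)) >= observed:
            hits += 1
    return hits / n_perm
```

A small P value indicates that the observed separation between prognosis favorable and prognosis fatal patients is rarely matched when the labels are assigned at random.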
6) Construction of Model Predicting Survival Rate of Patients with Non-Small Cell Cancer
[0056] In order to develop an outcome prediction classifier for each patient, a signal-to-noise metric was employed for selecting the genes distinguishing prognosis favorable patients from prognosis fatal patients most clearly. As the outcome of an unsupervised hierarchical clustering algorithm using the top 100 ranked spots, corresponding to 98 unique genes, two major branches representing prognosis favorable patients and prognosis fatal patients were obtained. Among the 21 patients with non-small cell cancer in the favorable branch (left frame), 19 patients survived for more than five years after operation. On the other hand, among the 29 patients with non-small cell cancer in the fatal branch (right frame), 15 patients died within five years after operation. The Kaplan-Meier survival curves reveal a statistically significant difference.
[0057] Since our final goal was to develop outcome classifiers at the patient level, a supervised learning method was employed. Thus, weighted-voting outcome classifiers were constructed based on the predictive genes preselected using the signal-to-noise metric. The learning error of each model was calculated by leave-one-out cross validation while increasing the number of predictive genes used. Among the 30 genes constituting the outcome classifiers for non-small cell cancer (Table 1), the weighted-voting model using the top 25 ranked predictive genes showed the highest accuracy, such that the outcomes of 41 patients (82%) of 50 patients were predicted correctly (FIG. 1).
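For reference, a Kaplan-Meier survival curve of the kind mentioned above can be estimated with the standard product-limit formula, as in the following minimal sketch. The function name `kaplan_meier` and the input layout (follow-up times with a 1/0 flag for death or censoring) are assumptions for illustration; the text does not describe how the curves or the significance test were computed.

```python
def kaplan_meier(times, events):
    """Product-limit estimate of the survival function.

    times:  follow-up time of each patient (for example, in months).
    events: 1 if the patient died at that time, 0 if the patient was censored.
    Returns a list of (time, survival probability) at each time with a death.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all patients whose follow-up ends at the same time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= removed
    return curve
```

Computing one curve for each branch of the clustering gives the two survival curves to be compared.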
TABLE 1 Non-small cell cancer

| Rank | Gene | Description | Accession No. | Expression in lung cancer | P | bx | Sx | SEQ ID NO. |
|---|---|---|---|---|---|---|---|---|
| 1 | WEE1 | WEE1 homolog | AA039640 | Up | 0.0027 | 0.483 | 0.483 | SEQ ID NO: 1 |
| 2 | MYC | v-myc viral oncogene homolog | AA464600 | Up | 0.0057 | 0.479 | 0.441 | SEQ ID NO: 2 |
| 3 | TITF1 | thyroid transcription factor 1 | T60168 | Down | 0.0085 | 0.452 | 0.416 | SEQ ID NO: 3 |
| 4 | FOSL1 | FOS-like antigen 1 (Fra-1) | T82817 | Up | 0.0062 | 0.330 | 0.411 | SEQ ID NO: 4 |
| 5 | LYPLA1 | lysophospholipase 1 | H00817 | Up | 0.0081 | 0.460 | 0.408 | SEQ ID NO: 5 |
| 6 | SSBP1 | single-stranded DNA binding protein | R05693 | Up | 0.0199 | 0.495 | 0.406 | SEQ ID NO: 6 |
| 7 | SFTPC | surfactant, pulmonary-associated protein C | AA487571 | Down | 0.0113 | 0.322 | 0.405 | SEQ ID NO: 7 |
| 8 | THBD | thrombomodulin | H59861 | Up | 0.0099 | 0.466 | 0.403 | SEQ ID NO: 8 |
| 9 | NICE-4 | NICE-4 protein | AA054954 | Up | 0.0099 | 0.514 | 0.403 | SEQ ID NO: 9 |
| 10 | PTN | pleiotrophin (heparin binding growth factor 8) | AA001449 | Up | 0.0100 | 0.500 | 0.401 | SEQ ID NO: 10 |
| 11 | SNRPB | small nuclear ribonucleoprotein polypeptides B and B1 | AA599116 | Up | 0.0115 | 0.657 | 0.394 | SEQ ID NO: 11 |
| 13 | CTNND1 | catenin delta 1 | R93829 | Up | 0.0120 | 0.513 | 0.393 | SEQ ID NO: 12 |
| 12 | NAP1L1 | nucleosome assembly protein 1-like 1 | AA024656 | Up | 0.0131 | 0.483 | 0.384 | SEQ ID NO: 13 |
| 14 | CCT3 | chaperonin containing TCP1, subunit 3 | R60933 | Up | 0.0186 | 0.566 | 0.378 | SEQ ID NO: 14 |
| 15 | DSC2 | desmocollin 2 | AA074677 | Up | 0.0160 | 0.533 | 0.374 | SEQ ID NO: 15 |
| 16 | SPRR1B | small proline-rich protein 1B (cornifin) | AA447835 | Up | 0.0209 | 0.421 | 0.370 | SEQ ID NO: 16 |
| 17 | COPB | coatomer protein complex, subunit beta | AA598868 | Up | 0.0195 | 0.466 | 0.369 | SEQ ID NO: 17 |
| 18 | ARG1 | arginase type I (liver) | AA453673 | Up | 0.0193 | 0.581 | 0.369 | SEQ ID NO: 18 |
| 19 | ARCN1 | archain 1 (coatomer protein complex, subunit delta) | AA598401 | Up | 0.0169 | 0.412 | 0.367 | SEQ ID NO: 19 |
| 20 | MST1 | macrophage stimulating 1 | T47813 | Up | 0.0193 | 0.462 | 0.366 | SEQ ID NO: 20 |
| 21 | SERPINE1 | serine (or cysteine) proteinase inhibitor, clade E member 1 | N75719 | Up | 0.0194 | 0.495 | 0.366 | SEQ ID NO: 21 |
| 22 | SERPINB1 | serine (or cysteine) proteinase inhibitor, clade B member 1 | AA486275 | Up | 0.0205 | 0.556 | 0.362 | SEQ ID NO: 22 |
| 23 | ESTs | | N73201 | Down | 0.0205 | 0.494 | 0.360 | SEQ ID NO: 23 |
| 24 | ACTR3 | actin-related protein 3 homolog (ARP3) | N34974 | Up | 0.0229 | 0.496 | 0.358 | SEQ ID NO: 24 |
| 25 | PTP4A3 | protein tyrosine phosphatase type 4A, member 3 | AA039851 | Up | 0.0199 | 0.478 | 0.357 | SEQ ID NO: 25 |
| 26 | ISLR | immunoglobulin superfamily containing leucine-rich repeat | H62387 | Up | 0.0228 | 0.478 | 0.356 | SEQ ID NO: 26 |
| 27 | ANXA1 | annexin A1 | H63077 | Up | 0.0262 | 0.367 | 0.354 | SEQ ID NO: 27 |
| 28 | GJA1 | gap junction protein, alpha 1 | AA487623 | Up | 0.0230 | 0.406 | 0.354 | SEQ ID NO: 28 |
| 29 | HSPE1 | heat shock 10 kD protein 1 | AA448396 | Up | 0.0273 | 0.444 | 0.352 | SEQ ID NO: 29 |
| 30 | PSMA5 | proteasome (prosome, macropain) subunit, alpha type, 5 | AA598815 | Up | 0.0265 | 0.545 | 0.346 | SEQ ID NO: 30 |
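The leave-one-out cross validation of a weighted-voting classifier described in [0057] can be sketched as follows. This is a minimal sketch under stated assumptions: the signal-to-noise form, the choice to reselect genes inside each fold, the convention that class 0 means "prognosis favorable", and all names and toy data are illustrative and are not the exact procedure of this study.

```python
import statistics


def snr_and_boundary(values, labels):
    """Return (Sx, bx) for one gene; the Sx form is an assumption."""
    c0 = [v for v, lab in zip(values, labels) if lab == 0]
    c1 = [v for v, lab in zip(values, labels) if lab == 1]
    spread = statistics.stdev(c0) + statistics.stdev(c1)
    s = (statistics.mean(c0) - statistics.mean(c1)) / spread if spread else 0.0
    b = (statistics.mean(c0) + statistics.mean(c1)) / 2
    return s, b


def loocv_accuracy(samples, labels, n_genes):
    """samples: one list of expression strengths per patient."""
    correct = 0
    for held in range(len(samples)):
        train = [row for i, row in enumerate(samples) if i != held]
        train_labels = [lab for i, lab in enumerate(labels) if i != held]
        stats = []
        for g in range(len(samples[0])):
            s, b = snr_and_boundary([row[g] for row in train], train_labels)
            stats.append((abs(s), g, s, b))
        top = sorted(stats, reverse=True)[:n_genes]  # highest |Sx| first
        total = sum(s * (samples[held][g] - b) for _, g, s, b in top)
        predicted = 0 if total > 0 else 1  # positive sum -> class 0
        if predicted == labels[held]:
            correct += 1
    return correct / len(samples)


# Hypothetical data: six patients, three genes (illustrative values only).
toy_samples = [
    [0.90, 0.50, 0.20], [1.00, 0.40, 0.30], [1.10, 0.60, 0.25],
    [0.10, 0.50, 0.80], [0.20, 0.45, 0.90], [0.15, 0.55, 0.85],
]
toy_labels = [0, 0, 0, 1, 1, 1]
accuracy = loocv_accuracy(toy_samples, toy_labels, n_genes=2)
```

Repeating the call for increasing values of `n_genes` gives the kind of accuracy-versus-gene-count comparison from which the best model size is chosen.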
[0058] With these classifiers, 27 of the 33 patients (82%) who actually survived five or more years after operation were classified as "prognosis favorable" and 14 of the 17 patients (82%) who actually died within five years after operation were classified as "prognosis fatal". The survival curves of the patients predicted to be "prognosis favorable" or "prognosis fatal" are shown in FIG. 2. This figure reveals the difference between the two groups ($P = 6.0 \times 10^{-6}$).
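The percentages in [0058] follow directly from the stated patient counts. As a small check (the function name is hypothetical; the counts are those given above):

```python
def outcome_summary(favorable_correct, favorable_total, fatal_correct, fatal_total):
    """Per-group and overall fractions of correctly classified patients."""
    total = favorable_total + fatal_total
    correct = favorable_correct + fatal_correct
    return {
        "favorable_rate": favorable_correct / favorable_total,
        "fatal_rate": fatal_correct / fatal_total,
        "overall_accuracy": correct / total,
    }


# 27 of 33 survivors and 14 of 17 non-survivors were classified correctly.
summary = outcome_summary(27, 33, 14, 17)
```

The overall figure, 41 of 50 patients, agrees with the 82% accuracy reported for the 25-gene model.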
[0059] Other supervised learning algorithms, including support vector machines and k-nearest neighbors, were also applied while increasing the number of the above genes. The accuracy of these models was comparable with that of the weighted-voting outcome classifiers, but the latter showed the highest accuracy.
[0060] In order to decide whether a new patient with lung cancer (test sample γ) will be prognosis favorable or prognosis fatal after five years, $V_x$ may be calculated for each gene x contained in the set of predictive genes from the equation $V_x = S_x (G_x^{\gamma} - b_x)$, wherein $S_x$ is the above-mentioned signal-to-noise statistic; $G_x^{\gamma}$ represents the expression strength of each gene x contained in the set of predictive genes in the test sample γ; and $b_x$ is calculated from $b_x = (\mu_{\text{class 0}} + \mu_{\text{class 1}})/2$. When the sum of $V_x$ ($\sum V_x$) over the genes contained in the set of predictive genes is positive (+), the patient in question is decided to be "prognosis favorable". When $\sum V_x$ is negative (-), the patient in question is decided to be "prognosis fatal".
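The decision rule of [0060] can be sketched as follows. The gene names, the signed Sx and bx values, and the expression strengths below are hypothetical; the handling of a sum of exactly zero is also an assumption, since the text specifies only the positive and negative cases.

```python
def weighted_vote(expression, predictive_genes):
    """Sum Vx = Sx * (Gx - bx) over the predictive genes of one test sample.

    expression       : gene name -> expression strength Gx in the test sample
    predictive_genes : gene name -> (Sx, bx)
    """
    total = sum(s * (expression[gene] - b)
                for gene, (s, b) in predictive_genes.items())
    if total > 0:
        label = "prognosis favorable"
    elif total < 0:
        label = "prognosis fatal"
    else:
        label = "undetermined"  # assumption: the zero case is not specified
    return total, label


# Hypothetical predictive genes as (Sx, bx) pairs; illustrative only.
genes = {"gene_a": (0.5, 0.4), "gene_b": (-0.4, 0.6)}

total_1, label_1 = weighted_vote({"gene_a": 0.8, "gene_b": 0.3}, genes)
total_2, label_2 = weighted_vote({"gene_a": 0.1, "gene_b": 0.9}, genes)
```

Each gene votes with a weight proportional both to its Sx and to how far the sample's expression lies from the class midpoint bx, so strongly discriminating genes dominate the sum.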
7) Construction of Model Predicting Survival Rate Specific for Each of Squamous Cell Cancer and Non-Squamous Cell Cancer
[0062] Squamous cell cancer and non-squamous cell cancer are recognized as clinicopathologically distinct diseases. Thus, using predictive genes selected for each subtype with the signal-to-noise metric and the weighted-voting algorithm, outcome prediction classifiers for each type of cancer were constructed.
[0063] Among the 30 genes constituting the outcome classifiers for each type of cancer (Tables 2 and 3), 12 genes (Table 2) for non-squamous cell cancer and 19 genes (Table 3) for squamous cell cancer showed the highest accuracy in leave-one-out cross validation performed while increasing the number of higher-ranked predictive genes.
TABLE 2 Non-squamous cell cancer

| Rank | Gene | Description | Accession No. | Expression in lung cancer | P | bx | Sx | SEQ ID NO. |
|---|---|---|---|---|---|---|---|---|
| 1 | NICE-4 | NICE-4 protein | AA054954 | Up | 0.0036 | 0.567 | 0.604 | SEQ ID NO: 9 |
| 2 | WEE1 | WEE1 homolog | AA039640 | Up | 0.0039 | 0.485 | 0.567 | SEQ ID NO: 1 |
| 3 | SSBP1 | single-stranded DNA binding protein | R05693 | Up | 0.0122 | 0.466 | 0.500 | SEQ ID NO: 6 |
| 4 | WFDC2 | WAP four-disulfide core domain 2 | AA451904 | Down | 0.0155 | 0.544 | 0.489 | SEQ ID NO: 56 |
| 5 | ACTA2 | actin, alpha 2, smooth muscle, aorta | AA634006 | Down | 0.0149 | 0.684 | 0.487 | SEQ ID NO: 57 |
| 6 | G22P1 | thyroid autoantigen 70 kDa (Ku70) | AA486311 | Up | 0.0176 | 0.519 | 0.482 | SEQ ID NO: 58 |
| 7 | MST1 | macrophage stimulating 1 | T47813 | Up | 0.0153 | 0.462 | 0.481 | SEQ ID NO: 20 |
| 8 | PHB | prohibitin | R60946 | Up | 0.0219 | 0.419 | 0.472 | SEQ ID NO: 59 |
| 9 | DRPLA | dentatorubral-pallidoluysian atrophy | H08642 | Up | 0.0238 | 0.478 | 0.455 | SEQ ID NO: 60 |
| 10 | SNRPB | small nuclear ribonucleoprotein polypeptides B and B1 | AA599116 | Up | 0.0192 | 0.615 | 0.455 | SEQ ID NO: 11 |
| 11 | GJA1 | gap junction protein, alpha 1 | AA487623 | Up | 0.0268 | 0.332 | 0.446 | SEQ ID NO: 61 |
| 12 | SFTPC | surfactant, pulmonary-associated protein C | AA487571 | Down | 0.0313 | 0.350 | 0.445 | SEQ ID NO: 7 |
| 13 | ACTR1A | actin-related protein 1 homolog A | R40850 | Up | 0.0256 | 0.626 | 0.444 | SEQ ID NO: 62 |
| 14 | MYC | v-myc viral oncogene homolog | AA464600 | Up | 0.0294 | 0.385 | 0.434 | SEQ ID NO: 2 |
| 15 | RAD23B | RAD23 homolog B | AA489678 | Up | 0.0276 | 0.495 | 0.434 | SEQ ID NO: 63 |
| 16 | CCT3 | chaperonin containing TCP1, subunit 3 | R60933 | Up | 0.0305 | 0.548 | 0.431 | SEQ ID NO: 14 |
| 17 | SERPINE1 | serine (or cysteine) proteinase inhibitor, clade E member 1 | N75719 | Up | 0.0338 | 0.473 | 0.424 | SEQ ID NO: 21 |
| 18 | LAMP1 | lysosomal-associated membrane protein 1 | H29077 | Down | 0.0374 | 0.382 | 0.418 | SEQ ID NO: 64 |
| 19 | IRAK1 | interleukin-1 receptor-associated kinase 1 | AA683550 | Down | 0.0355 | 0.199 | 0.414 | SEQ ID NO: 65 |
| 20 | BIRC2 | baculoviral IAP repeat-containing 2 | R19628 | Up | 0.0362 | 0.359 | 0.412 | SEQ ID NO: 66 |
| 21 | LMAN1 | lectin, mannose-binding, 1 | H73420 | Up | 0.0339 | 0.409 | 0.411 | SEQ ID NO: 67 |
| 22 | HSPE1 | heat shock 10 kD protein 1 | AA448396 | Up | 0.0411 | 0.406 | 0.410 | SEQ ID NO: 68 |
| 23 | TMSB4X | thymosin, beta 4, X chromosome | AA634103 | Down | 0.0440 | 0.585 | 0.404 | SEQ ID NO: 69 |
| 24 | EEF1G | eukaryotic translation elongation factor 1 gamma | R43973 | Up | 0.0450 | 0.638 | 0.404 | SEQ ID NO: 70 |
| 25 | ESTs | | H05820 | Up | 0.0492 | 0.570 | 0.403 | SEQ ID NO: 71 |
| 26 | LYPLA1 | lysophospholipase I | H00817 | Up | 0.0488 | 0.456 | 0.401 | SEQ ID NO: 5 |
| 27 | SOD1 | superoxide dismutase 1 | R52548 | Up | 0.0477 | 0.609 | 0.397 | SEQ ID NO: 72 |
| 28 | ARG1 | arginase type I (liver) | AA453673 | Up | 0.0454 | 0.541 | 0.396 | SEQ ID NO: 18 |
| 29 | KRT25A | type I inner root sheath specific keratin 25 irs1 | W73634 | Up | 0.0534 | 0.584 | 0.394 | SEQ ID NO: 73 |
| 30 | FOSL1 | FOS-like antigen 1 (Fra-1) | T82817 | Up | 0.0366 | 0.309 | 0.391 | SEQ ID NO: 4 |
TABLE 3 Squamous cell cancer

| Rank | Gene | Description | Accession No. | Expression in lung cancer | P | bx | Sx | SEQ ID NO. |
|---|---|---|---|---|---|---|---|---|
| 1 | FLJ20619 | hypothetical protein | R74480 | Down | 0.0068 | 0.507 | 0.882 | SEQ ID NO: 31 |
| 2 | SPC12 | signal peptidase 12 kDa | R19183 | Down | 0.0087 | 0.521 | 0.859 | SEQ ID NO: 32 |
| 3 | ESTs | | R96358 | Down | 0.0034 | 0.448 | 0.835 | SEQ ID NO: 33 |
| 4 | KRT5 | keratin 5 | AA160507 | Up | 0.0046 | 0.841 | 0.789 | SEQ ID NO: 34 |
| 5 | PTP4A3 | protein tyrosine phosphatase type 4A, member 3 | AA039851 | Up | 0.0104 | 0.438 | 0.753 | SEQ ID NO: 25 |
| 6 | SPRR1B | small proline-rich protein 1B | AA447835 | Up | 0.0147 | 0.695 | 0.730 | SEQ ID NO: 16 |
| 7 | LOC339324 | hypothetical protein LOC339324 | W23522 | Down | 0.0171 | 0.536 | 0.693 | SEQ ID NO: 35 |
| 8 | MYST4 | MYST histone acetyltransferase 4 | AA057313 | Up | 0.0188 | 0.573 | 0.691 | SEQ ID NO: 36 |
| 9 | SPARCL1 | SPARC-like 1 | AA490694 | Up | 0.0210 | 0.454 | 0.682 | SEQ ID NO: 37 |
| 10 | IGJ | immunoglobulin J polypeptide | T70057 | Up | 0.0143 | 0.385 | 0.681 | SEQ ID NO: 38 |
| 11 | EIF4A2 | eukaryotic translation initiation factor 4A, isoform 2 | H05919 | Down | 0.0233 | 0.750 | 0.679 | SEQ ID NO: 39 |
| 12 | ESTs | | AA115121 | Up | 0.0226 | 0.412 | 0.672 | SEQ ID NO: 40 |
| 13 | ID2 | inhibitor of DNA binding 2 | H82706 | Up | 0.0214 | 0.608 | 0.670 | SEQ ID NO: 41 |
| 14 | THBD | thrombomodulin | H59861 | Up | 0.0077 | 0.636 | 0.669 | SEQ ID NO: 8 |
| 15 | MGC15476 | Thymus expressed gene 3-like | W72525 | Up | 0.0231 | 0.412 | 0.665 | SEQ ID NO: 42 |
| 16 | ZFP | zinc finger protein | H53499 | Down | 0.0217 | 0.632 | 0.659 | SEQ ID NO: 43 |
| 17 | COPB | coatomer protein complex, subunit beta | AA598868 | Up | 0.0272 | 0.527 | 0.648 | SEQ ID NO: 17 |
| 18 | ZYG | ZYG homolog | AA453289 | Up | 0.0237 | 0.349 | 0.647 | SEQ ID NO: 44 |
| 19 | CACNA1I | calcium channel, voltage-dependent, alpha 1I subunit | N52765 | Up | 0.0312 | 0.495 | 0.636 | SEQ ID NO: 45 |
| 20 | FLJ4623 | hypothetical protein | N71473 | Down | 0.0309 | 0.457 | 0.632 | SEQ ID NO: 46 |
| 21 | CSTB | cystatin B | H22919 | Up | 0.0286 | 0.762 | 0.631 | SEQ ID NO: 47 |
| 22 | EPB41L1 | erythrocyte membrane protein band 4.1-like 1 | R71689 | Up | 0.0482 | 0.690 | 0.613 | SEQ ID NO: 48 |
| 23 | MGC4549 | hypothetical protein | AA455267 | Up | 0.0327 | 0.410 | 0.606 | SEQ ID NO: 49 |
| 24 | ESTs | | T64878 | Down | 0.0406 | 0.457 | 0.600 | SEQ ID NO: 50 |
| 25 | DSC2 | desmocollin 2 | AA074677 | Up | 0.0407 | 0.656 | 0.592 | SEQ ID NO: 15 |
| 26 | ESTs | | H79007 | Down | 0.0415 | 0.363 | 0.590 | SEQ ID NO: 51 |
| 27 | ESTs | | W84776 | Down | 0.0364 | 0.665 | 0.587 | SEQ ID NO: 52 |
| 28 | IFI30 | interferon, gamma-inducible protein 30 | AA630800 | Up | 0.0415 | 0.336 | 0.587 | SEQ ID NO: 53 |
| 29 | ESTs | | T81155 | Up | 0.0552 | 0.633 | 0.583 | SEQ ID NO: 54 |
| 30 | IL1RN | interleukin 1 receptor antagonist | T72877 | Up | 0.0431 | 0.573 | 0.578 | SEQ ID NO: 55 |
[0064] These outcomes show that among the 34 patients with non-squamous cell cancer, the five-year survival after operation of 31 patients (91%) was accurately predicted (FIG. 3). Specifically, among the 25 patients who were predicted to be "prognosis favorable", 23 patients (92%) actually survived more than five years after operation. Among the 9 patients who were predicted to be "prognosis fatal", only one patient survived more than five years. The difference between the survival curve of the 25 patients predicted to be "prognosis favorable" and that of the 9 patients predicted to be "prognosis fatal" was highly significant.
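The figure of 31 correctly predicted patients can be recovered from the counts in [0064], assuming that the 8 remaining patients in the "prognosis fatal" group died within five years, which is implied but not stated explicitly:

```latex
\begin{align*}
\text{correct predictions} &= 23 + (9 - 1) = 31,\\
\text{overall accuracy} &= 31/34 \approx 0.91,\\
\text{accuracy in the favorable group} &= 23/25 = 0.92.
\end{align*}
```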
Sequence CWU
1
1
7314232DNAHomo sapiens 1aaaattgcgt ttgagtttgc cgcgagccgg gccaatcggt
tttgccaacg catgcccacg 60tgctggcgaa caaatgtaaa cacggagatc gtgtgccggg
cacttggttt cgtggtgggc 120aactgtgctg ctgtttcttt tggccgcgga caaggtcggc
agaggtggac ccctgcttgg 180gagagctctt ctcgctgtgc tgacacccgc ccctaacagt
cacccacccc ggggaaataa 240tggggctcgg aggcctcctc ccagccagtg tccagcctaa
gcacatcggc tcccgcagtt 300cagaaaggtc ccgaggcccg agtcaccatt tccggctcag
acctcgaccc ggaacgtggc 360tgcccactgc cacgcccact acgccccagt ggctcgcccc
aggggacgag gggcaagaag 420cggcctccga gggcagcggc cgaaggccat tcggtccctg
gctcttccca gctcgcagag 480acccggaagc gctgcccggc cgcctgcccc tcttcagatc
ccccagcacc ggaggagcag 540cgagggggct gcgtccaggc cggctttcgg gtcggcttag
gcgaatccag ctctcttttg 600cccctcccag aaggcccagc cccgtccggg cggtgttcgg
gcggcgccgg gccgggcccc 660ccgccgcccc aggctcgctc ataggcccgg aacaccacag
cccgcccaga cttggctggc 720gccgagccgg gggtggagcc agcgggttcc cgccaaaatc
gcgtagctgg tccttccccc 780gcgggctacg tcgcgccctc cttttttttt caaacccgga
gctgcactgg gattggtgga 840ctgggcactc acgtggttaa cggtcgcggg aagccgcgga
gcccgaacct gagactggac 900ctgaggagac ctcagcctcg gtgctcgggc cgccccgcct
ctgccggaaa gtccgcgccg 960ccgctgccgc caccgtccgc agcccgagcg ccccggagcc
gcaggccgcc gccgcgcaga 1020gacgccgcgg ctgcgactag gcgcgcccag ccgcacgtgg
cggacccgcc cccaggcccg 1080cagtgtcctg gaccccgcag gcctccgctc tcctgtcctc
ggccccgtcc ccagggccgc 1140gatgagcttc ctgagccgac agcagccgcc gccaccccgc
cgcgccgggg cggcctgcac 1200cttgcggcag aagctgatct tctcgccctg cagcgactgt
gaggaggagg aagaagagga 1260ggaggaggag ggcagcggcc acagcaccgg ggaggactcg
gcctttcaag agcccgactc 1320gccgctgccg cccgcgcgga gccccacgga gcccgggccc
gagcgccgcc gctcgcccgg 1380gccggccccc gggagccccg gcgagctgga ggaggacctg
ttgctgcccg gcgcctgccc 1440gggcgcggac gaggcgggcg gtggggcgga gggcgactcg
tgggaggagg agggcttcgg 1500ctcctcgtcg ccggtcaagt cgccggcggc cccctacttc
ctgggtagct ctttctcgcc 1560ggtgcgctgc ggcggcccag gagatgcgtc gccgcggggt
tgcggggcgc gccgggcggg 1620cgaaggccgc cgctcgccgc ggccggacca cccgggcacc
ccgccacaca agaccttccg 1680caagctgcga ctcttcgaca ccccgcacac gcccaagagt
ttgctctcca aagctcgggg 1740aattgattcc agctctgtta aactccgggg tagttctctc
ttcatggata cagaaaaatc 1800aggaaaaagg gaatttgatg tgcgacagac tcctcaagtg
aatattaatc cttttactcc 1860ggattctttg ttgcttcatt cctcaggaca gtgtcgtcgt
agaaagagaa cgtattggaa 1920tgattcctgt ggtgaagaca tggaagccag tgattatgag
cttgaagatg aaacaagacc 1980tgctaagaga attacaatta ctgaaagcaa tatgaagtcc
cggtatacaa cagaatttca 2040tgagctagag aaaatcggct ctggagaatt tggttctgta
tttaagtgtg tgaagaggct 2100ggatggatgc atttatgcca ttaagcgatc aaaaaagcca
ttggcgggct ctgttgatga 2160gcagaacgct ttgagagaag tatatgctca tgcagtgctt
ggacagcatt ctcatgtagt 2220tcgatatttc tctgcgtggg cagaagatga tcatatgctt
atacagaatg aatattgtaa 2280tggtggaagt ttagctgatg ctataagtga aaactacaga
atcatgagtt actttaaaga 2340agcagagttg aaggatctcc ttttgcaagt tggccgaggc
ttgaggtata ttcattcaat 2400gtctttggtt cacatggata taaaacctag taatattttc
atatctcgaa cctcaatccc 2460aaatgctgcc tctgaagaag gagacgaaga tgattgggca
tccaacaaag ttatgtttaa 2520aataggtgat cttgggcatg taacaaggat ctccagtcca
caagttgaag agggcgatag 2580tcgttttctt gcaaatgaag ttttacagga gaattatacc
catctaccaa aagcagatat 2640ttttgcgctt gccctcacag tggtatgtgc tgctggtgct
gaacctcttc cgagaaatgg 2700agatcaatgg catgaaatca gacagggtag attacctcgg
ataccacaag tgctttccca 2760agaatttaca gagttgctaa aagttatgat tcatccagat
ccagagagaa gaccttcagc 2820aatggcactg gtaaagcatt cagtattgct gtccgcttct
agaaagagtg cagaacaatt 2880acgaatagaa ttgaatgccg aaaagttcaa aaattcactt
ttacaaaaag aactcaagaa 2940agcacagatg gcaaaagctg cagctgagga aagagcactc
ttcactgacc ggatggccac 3000taggtccacc acccagagta atagaacatc tcgacttatt
ggaaagaaaa tgaaccgctc 3060tgtcagcctt actatatact gagctactcc tttcccacct
ccccctgaac actgtgacaa 3120gaggaagcta ggttgaaatc actgatagaa tccagtttgc
aattactttc tcgattggtg 3180tcagtagttt tactgattag gacttttatt gtgaattaca
gttgaaagct gtattttgat 3240gattgctatg tcaggctttc atctaatctt accagtctgt
cttctgtagg atgtgtcact 3300gttggatgtt acaccagcct ttccagggtt aaccactgtg
gtggtgtgct gcttatagtt 3360tgctgttgca ttgtaataaa aggtgtcttt ccctgtagtg
acctgtaaaa agtactcaag 3420ggctttatta cagacatacc ctccctttga aaagggacat
gctaaaagac tcattactac 3480tcagccttca atgtacctgt gtgtccatct tatatttctt
tttttttttt aattgtgaat 3540tagacttgta tatcccactg ggagcacttt gtaggcattg
catgaaccat gggatgatga 3600ttctgtggag gtattgcctt gtgaatttgc tgctatttta
gttttgtctt tgctgtaaac 3660ttgtagcatt aaacaatcat tgttgttaat aggtcttctt
tttgaaacaa ttatgtgaaa 3720tgtatagctg cttttgatga aaagcagcta tttgcctttt
ttttttttcc tttgaacttt 3780gaagctagtg cattggaaaa atgcaccctt tccctccttt
ggaatgctgt attaatgtag 3840tataataatt actggttttg taacttgttc tggtaatgtc
cttcccggac tctttttaaa 3900tgtctccccc taagttttat acttgattgt attattagtc
tgtttttaaa tgttttgccc 3960ggtttttctc ttcaatattt gtgtatataa accgatcttc
gtgatactgt acatagctgt 4020ttgaaatgcc agaatgactt ctgacattcc aagtttttca
caaaatatat tttatctgtg 4080attagccatt tgactaataa tactggctaa cagatgttga
aaaaaattgt ctgtttgttt 4140tctcattaat tttggtctaa aacatgtttg cacttgtctt
tgacttgtgt tttattaaca 4200ttgattggca tattaaaagt cactctgagc tt
423222189DNAHomo sapiens 2gcagagggag cgagcgggcg
gccggctagg gtggaagagc cgggcgagca gagctgcgct 60gcgggcgtcc tgggaaggga
gatccggagc gaataggggg cttcgcctct ggcccagccc 120tcccgctgat cccccagcca
gcggtccgca acccttgccg catccacgaa actttgccca 180tagcagcggg cgggcacttt
gcactggaac ttacaacacc cgagcaagga cgcgactctc 240ccgacgcggg gaggctattc
tgcccatttg gggacacttc cccgccgctg ccaggacccg 300cttctctgaa aggctctcct
tgcagctgct tagacgctgg atttttttcg ggtagtggaa 360aaccagcagc ctcccgcgac
gatgcccctc aacgttagct tcaccaacag gaactatgac 420ctcgactacg actcggtgca
gccgtatttc tactgcgacg aggaggagaa cttctaccag 480cagcagcagc agagcgagct
gcagcccccg gcgcccagcg aggatatctg gaagaaattc 540gagctgctgc ccaccccgcc
cctgtcccct agccgccgct ccgggctctg ctcgccctcc 600tacgttgcgg tcacaccctt
ctcccttcgg ggagacaacg acggcggtgg cgggagcttc 660tccacggccg accagctgga
gatggtgacc gagctgctgg gaggagacat ggtgaaccag 720agtttcatct gcgacccgga
cgacgagacc ttcatcaaaa acatcatcat ccaggactgt 780atgtggagcg gcttctcggc
cgccgccaag ctcgtctcag agaagctggc ctcctaccag 840gctgcgcgca aagacagcgg
cagcccgaac cccgcccgcg gccacagcgt ctgctccacc 900tccagcttgt acctgcagga
tctgagcgcc gccgcctcag agtgcatcga cccctcggtg 960gtcttcccct accctctcaa
cgacagcagc tcgcccaagt cctgcgcctc gcaagactcc 1020agcgccttct ctccgtcctc
ggattctctg ctctcctcga cggagtcctc cccgcagggc 1080agccccgagc ccctggtgct
ccatgaggag acaccgccca ccaccagcag cgactctgag 1140gaggaacaag aagatgagga
agaaatcgat gttgtttctg tggaaaagag gcaggctcct 1200ggcaaaaggt cagagtctgg
atcaccttct gctggaggcc acagcaaacc tcctcacagc 1260ccactggtcc tcaagaggtg
ccacgtctcc acacatcagc acaactacgc agcgcctccc 1320tccactcgga aggactatcc
tgctgccaag agggtcaagt tggacagtgt cagagtcctg 1380agacagatca gcaacaaccg
aaaatgcacc agccccaggt cctcggacac cgaggagaat 1440gtcaagaggc gaacacacaa
cgtcttggag cgccagagga ggaacgagct aaaacggagc 1500ttttttgccc tgcgtgacca
gatcccggag ttggaaaaca atgaaaaggc ccccaaggta 1560gttatcctta aaaaagccac
agcatacatc ctgtccgtcc aagcagagga gcaaaagctc 1620atttctgaag aggacttgtt
gcggaaacga cgagaacagt tgaaacacaa acttgaacag 1680ctacggaact cttgtgcgta
aggaaaagta aggaaaacga ttccttctaa cagaaatgtc 1740ctgagcaatc acctatgaac
ttgtttcaaa tgcatgatca aatgcaacct cacaaccttg 1800gctgagtctt gagactgaaa
gatttagcca taatgtaaac tgcctcaaat tggactttgg 1860gcataaaaga acttttttat
gcttaccatc tttttttttt ctttaacaga tttgtattta 1920agaattgttt ttaaaaaatt
ttaagattta cacaatgttt ctctgtaaat attgccatta 1980aatgtaaata actttaataa
aacgtttata gcagttacac agaatttcaa tcctagtata 2040tagtacctag tattataggt
actataaacc ctaatttttt ttatttaagt acattttgct 2100ttttaaagtt gatttttttc
tattgttttt agaaaaaata aaataactgg caaatatatc 2160attgagccaa aaaaaaaaaa
aaaaaaaaa 218932352DNAHomo sapiens
3gaaacttaaa ggtgtttacc ttgtcatcag catgtaagct aattatctcg ggcaagatgt
60aggcttctat tgtcttgttg ctttagcgct tacgccccgc ctctggtggc tgcctaaaac
120ctggcgccgg gctaaaacaa acgcgaggca gcccccgagc ctccactcaa gccaattaag
180gaggactcgg tccactccgt tacgtgtaca tccaacaaga tcggcgttaa ggtaacacca
240gaatatttgg caaagggaga aaaaaaaagc agcgaggctt cgccttcccc ctctcccttt
300tttttcctcc tcttccttcc tcctccagcc gccgccgaat catgtcgatg agtccaaagc
360acacgactcc gttctcagtg tctgacatct tgagtcccct ggaggaaagc tacaagaaag
420tgggcatgga gggcggcggc ctcggggctc cgctggcggc gtacaggcag ggccaggcgg
480caccgccaac agcggccatg cagcagcacg ccgtggggca ccacggcgcc gtcaccgccg
540cctaccacat gacggcggcg ggggtgcccc agctctcgca ctccgccgtg gggggctact
600gcaacggcaa cctgggcaac atgagcgagc tgccgccgta ccaggacacc atgaggaaca
660gcgcctctgg ccccggatgg tacggcgcca acccagaccc gcgcttcccc gccatctccc
720gcttcatggg cccggcgagc ggcatgaaca tgagcggcat gggcggcctg ggctcgctgg
780gggacgtgag caagaacatg gccccgctgc caagcgcgcc gcgcaggaag cgccgggtgc
840tcttctcgca ggcgcaggtg tacgagctgg agcgacgctt caagcaacag aagtacctgt
900cggcgccgga gcgcgagcac ctggccagca tgatccacct gacgcccacg caggtcaaga
960tctggttcca gaaccaccgc tacaaaatga agcgccaggc caaggacaag gcggcgcagc
1020agcaactgca gcaggacagc ggcggcggcg ggggcggcgg gggcaccggg tgcccgcagc
1080agcaacaggc tcagcagcag tcgccgcgac gcgtggcggt gccggtcctg gtgaaagacg
1140gcaaaccgtg ccaggcgggt gcccccgcgc cgggcgccgc cagcctacaa ggccacgcgc
1200agcagcaggc gcagcaccag gcgcaggccg cgcaggcggc ggcagcggcc atctccgtgg
1260gcagcggtgg cgccggcctt ggcgcacacc cgggccacca gccaggcagc gcaggccagt
1320ctccggacct ggcgcaccac gccgccagcc ccgcggcgct gcagggccag gtatccagcc
1380tgtcccacct gaactcctcg ggctcggact acggcaccat gtcctgctcc accttgctat
1440acggtcggac ctggtgagag gacgccgggc cggccctagc ccagcgctct gcctcaccgc
1500ttccctcctg cccgccacac agaccaccat ccaccgctgc tccacgcgct tcgacttttc
1560ttaacaacct ggccgcgttt agaccaagga acaaaaaaac cacaaaggcc aaactgctgg
1620acgtctttct ttttttcccc ccctaaaatt tgtgggtttt tttttttaaa aaaagaaaat
1680gaaaaacaac caagcgcatc caatctcaag gaatctttaa gcagagaagg gcataaaaca
1740gctttggggt gtcttttttt ggtgattcaa atgggttttc cacgctaggg cggggcacag
1800attggagagg gctctgtgct gacatggctc tggactctaa agaccaaact tcactctggg
1860cacactctgc cagcaaagag gactcgcttg taaataccag gatttttttt tttttttgaa
1920gggaggacgg gagctgggga gaggaaagag tcttcaacat aacccacttg tcactgacac
1980aaaggaagtg ccccctcccc ggcaccctct ggccgcctag gctcagcggc gaccgccctc
2040cgcgaaaata gtttgtttaa tgtgaacttg tagctgtaaa acgctgtcaa aagttggact
2100aaatgcctag tttttagtaa tctgtacatt ttgttgtaaa aagaaaaacc actcccagtc
2160cccagccctt cacatttttt atgggcattg acaaatctgt gtatattatt tggcagtttg
2220gtatttgcgg cgtcagtctt tttctgttgt aacttatgta gatatttggc ttaaatatag
2280ttcctaagaa gcttctaata aattatacaa attaaaaaga ttctttttct gattaaaaaa
2340aaaaaaaaaa aa
23524431DNAHomo sapiensmisc_feature(353)..(353)n is a, c, g, or t
4cagcagcgga gacccatcct ctgaccccct tggctctcca accctcctcg ctttgtgagg
60cacccgagcc ttactccctg caggtgccac cctaagcaac gtctgctccc cttcccccac
120cagtccagct ggcctggaca gtatcccata cccaactcca gcagctgctt ctccatccct
180ctaatgagac taaccatatt gtgcttcaca gtagagccag cttggggcca ccaaagctgc
240ccattgtttc tctaggagct gggcctctct aggcacaatt tggcactaaa tcaggaggac
300aaaatatttt cccatttctg gccggaggaa ttccggggga ggcccaggag gantttgtta
360ggattcctta ggagggtcct ctggggaggc cctaaaccct ttccagattc attggccaca
420tttttcccnt c
4315292DNAHomo sapiensmisc_feature(286)..(286)n is a, c, g, or t
5gtttttgatg cagacataaa aatagcaatc attttaaatt gtcaaaattt ccagattact
60ggtaaaaatt atttgaaaac aaacttatgg gtaataaagg ctagtcagaa ccctatacca
120taaagtgtag ttaccataca gattaatatg tagcaaaaat gtatgcttga tatttctcaa
180ctgtgttaat ttttctgctg tattccagct gaccaaaaca atattaagaa tgcatcttta
240taaatggggt gctaattgat aatgggaaat aatttaggta atgggnctat ac
2926400DNAHomo sapiensmisc_feature(1)..(1)n is a, c, g, or t 6ngaagggata
gccagcgcga aggaagtnct ggagtcgtgt gttttggctg cgcgtgatcc 60tgcgtgggtc
gggaggtgtt tctgtgtagg tntctggccc tttnatcagt cgtgcggagg 120accgcgtgat
ttccttccag ttctnctcgg ntttcangaa aagcctaaag attagactnt 180aagaaaagan
aatagaagcc atgtttcgaa gacctgtatt acaggtactt cgtcagtttc 240taagacatga
gtcccganac aactaccagt ttggttcttn gaaagatccc tggaatgcac 300tttnctttng
gcccaggtng ggtcagggac cctgtctttt taggacaggn tcggaaggga 360aaaaaatccc
agttcacaat antttttntc ttaggcaact 4007859DNAHomo
sapiens 7acaggagagc atagcacctg cagcaagatg gatgtgggca gcaaagaggt
cctgatggag 60agcccgccgg actactccgc agctccccgg ggccgatttg gcattccctg
ctgcccagtg 120cacctgaaac gccttcttat cgtggtggtg gtggtggtcc tcatcgtcgt
ggtgattgtg 180ggagccctgc tcatgggtct ccacatgagc cagaaacaca cggagatggt
tctggagatg 240agcattgggg cgccggaagc ccagcaacgc ctggccctga gtgagcacct
ggttaccact 300gccaccttct ccatcggctc cactggcctc gtggtgtatg actaccagca
gctgctgatc 360gcctacaagc cagcccctgg cacctgctgc tacatcatga agatagctcc
agagagcatc 420cccagtcttg aggctctcaa tagaaaagtc cacaacttcc agatggaatg
ctctctgcag 480gccaagcccg cagtgcctac gtctaagctg ggccaggcag aggggcgaga
tgcaggctca 540gcaccctccg gaggggaccc ggccttcttg ggcatggccg tgaacaccct
gtgtggcgag 600gtgccgctct actacatcta ggacgcctcc ggtgagcagg gtcagtggaa
gccccaacgg 660gaaaggaaac gccccgggca aagggtcttt tgcagctttt gcagacgggc
aagaagctgc 720ttctgcccac accgcaggga caaaccctgg agaaatggga gcttggggag
aggatgggag 780tgggcagagg tggcacccag gggcccggga actcctgcca caacagaata
aagcagcctg 840atttgaaaag caaaaaaaa
85984050DNAHomo sapiens 8cttgcaatcc aggctttcct tggaagtggc
tgtaacatgt atgaaaagaa agaaaggagg 60accaagagat gaaagagggc tgcacgcgtg
ggggcccgag tggtgggcgg ggacagtcgt 120cttgttacag gggtgctggc cttccctggc
gcctgcccct gtcggccccg cccgagaacc 180tccctgcgcc agggcagggt ttactcatcc
cggcgaggtg atcccatgcg cgagggcggg 240cgcaagggcg gccagagaac ccagcaatcc
gagtatgcgg catcagccct tcccaccagg 300cacttccttc cttttcccga acgtccaggg
agggagggcc gggcacttat aaactcgagc 360cctggccgat ccgcatgtca gaggctgcct
cgcaggggct gcgcgcacgg caagaagtgt 420ctgggctggg acggacagga gaggctgtcg
ccatcggcgt cctgtgcccc tctgctccgg 480cacggccctg tcgcagtgcc cgcgctttcc
ccggcgcctg cacgcggcgc gcctgggtaa 540catgcttggg gtcctggtcc ttggcgcgct
ggccctggcc ggcctggggt tccccgcacc 600cgcagagccg cagccgggtg gcagccagtg
cgtcgagcac gactgcttcg cgctctaccc 660gggccccgcg accttcctca atgccagtca
gatctgcgac ggactgcggg gccacctaat 720gacagtgcgc tcctcggtgg ctgccgatgt
catttccttg ctactgaacg gcgacggcgg 780cgttggccgc cggcgcctct ggatcggcct
gcagctgcca cccggctgcg gcgaccccaa 840gcgcctcggg cccctgcgcg gcttccagtg
ggttacggga gacaacaaca ccagctatag 900caggtgggca cggctcgacc tcaatggggc
tcccctctgc ggcccgttgt gcgtcgctgt 960ctccgctgct gaggccactg tgcccagcga
gccgatctgg gaggagcagc agtgcgaagt 1020gaaggccgat ggcttcctct gcgagttcca
cttcccagcc acctgcaggc cactggctgt 1080ggagcccggc gccgcggctg ccgccgtctc
gatcacctac ggcaccccgt tcgcggcccg 1140cggagcggac ttccaggcgc tgccggtggg
cagctccgcc gcggtggctc ccctcggctt 1200acagctaatg tgcaccgcgc cgcccggagc
ggtccagggg cactgggcca gggaggcgcc 1260gggcgcttgg gactgcagcg tggagaacgg
cggctgcgag cacgcgtgca atgcgatccc 1320tggggctccc cgctgccagt gcccagccgg
cgccgccctg caggcagacg ggcgctcctg 1380caccgcatcc gcgacgcagt cctgcaacga
cctctgcgag cacttctgcg ttcccaaccc 1440cgaccagccg ggctcctact cgtgcatgtg
cgagaccggc taccggctgg cggccgacca 1500acaccggtgc gaggacgtgg atgactgcat
actggagccc agtccgtgtc cgcagcgctg 1560tgtcaacaca cagggtggct tcgagtgcca
ctgctaccct aactacgacc tggtggacgg 1620cgagtgtgtg gagcccgtgg acccgtgctt
cagagccaac tgcgagtacc agtgccagcc 1680cctgaaccaa actagctacc tctgcgtctg
cgccgagggc ttcgcgccca ttccccacga 1740gccgcacagg tgccagatgt tttgcaacca
gactgcctgt ccagccgact gcgaccccaa 1800cacccaggct agctgtgagt gccctgaagg
ctacatcctg gacgacggtt tcatctgcac 1860ggacatcgac gagtgcgaaa acggcggctt
ctgctccggg gtgtgccaca acctccccgg 1920taccttcgag tgcatctgcg ggcccgactc
ggcccttgcc cgccacattg gcaccgactg 1980tgactccggc aaggtggacg gtggcgacag
cggctctggc gagcccccgc ccagcccgac 2040gcccggctcc accttgactc ctccggccgt
ggggctcgtg cattcgggct tgctcatagg 2100catctccatc gcgagcctgt gcctggtggt
ggcgcttttg gcgctcctct gccacctgcg 2160caagaagcag ggcgccgcca gggccaagat
ggagtacaag tgcgcggccc cttccaagga 2220ggtagtgctg cagcacgtgc ggaccgagcg
gacgccgcag agactctgag cggcctccgt 2280ccaggagcct ggctccgtcc aggagctgtg
cctcctcacc cccagctttg ctaccaaagc 2340accttagctg gcattacagc tggagaagac
cctccccgca ccccccaagc tgttttcttc 2400tattccatgg ctaactggcg agggggtgat
tagagggagg agaatgagcc tcggcctctt 2460ccgtgacgtc actggaccac tgggcaatga
tggcaatttt gtaacgaaga cacagactgc 2520gatttgtccc aggtcctcac taccgggcgc
aggagggtga gcgttattgg tcggcagcct 2580tctgggcaga ccttgacctc gtgggctagg
gatgactaaa atatttattt tttttaagta 2640tttaggtttt tgtttgtttc ctttgttctt
acctgtatgt ctccagtatc cactttgcac 2700agctctccgg tctctctctc tctacaaact
cccacttgtc atgtgacagg taaactatct 2760tggtgaattt ttttttccta gccctctcac
atttatgaag caagccccac ttattcccca 2820ttcttcctag ttttctcctc ccaggaactg
ggccaactca cctgagtcac cctacctgtg 2880cctgacccta cttcttttgc tcatctagct
gtctgctcag acagaacccc tacatgaaac 2940agaaacaaaa acactaaaaa taaaaatggc
catttgcttt ttcaccagat ttgctaattt 3000atcctgaaat ttcagattcc cagagcaaaa
taattttaaa caaagggttg agatgtaaaa 3060ggtattaaat tgatgttgct ggactgtcat
agaaattaca cccaaagagg tatttatctt 3120tacttttaaa cagtgagcct gaattttgtt
gctgttttga tttgtactga aaaatggtaa 3180ttgttgctaa tcttcttatg caatttcctt
ttttgttatt attacttatt tttgacagtg 3240ttgaaaatgt tcagaaggtt gctctagatt
gagagaagag acaaacacct cccaggagac 3300agttcaagaa agcttcaaac tgcatgattc
atgccaatta gcaattgact gtcactgttc 3360cttgtcactg gtagaccaaa ataaaaccag
ctctactggt cttgtggaat tgggagcttg 3420ggaatggatc ctggaggatg cccaattagg
gcctagcctt aatcaggtcc tcagagaatt 3480tctaccattt cagagaggcc ttttggaatg
tggcccctga acaagaattg gaagctgccc 3540tgcccatggg agctggttag aaatgcagaa
tcctaggctc caccccatcc agttcatgag 3600aatctatatt taacaagatc tgcagggggt
gtgtctgctc agtaatttga ggacaaccat 3660tccagactgc ttccaatttt ctggaataca
tgaaatatag atcagttata agtagcaggc 3720caagtcaggc ccttattttc aagaaactga
ggaattttct ttgtgtagct ttgctctttg 3780gtagaaaagg ctaggtacac agctctagac
actgccacac agggtctgca aggtctttgg 3840ttcagctaag ctaggaatga aatcctgctt
cagtgtatgg aaataaatgt atcatagaaa 3900tgtaactttt gtaagacaaa ggttttcctc
ttctattttg taaactcaaa atatttgtac 3960atagttattt atttattgga gataatctag
aacacaggca aaatccttgc ttatgacatc 4020acttgtacaa aataaacaaa taacaatgtg
40509466DNAHomo
sapiensmisc_feature(155)..(155)n is a, c, g, or t 9tttttttttt tttttttttt
taagtctcct tctttattat taggaaaaca acaacaacaa 60caaacaaaaa aatggcgtca
tgaatatgaa cagcattgtc agatgaatta gttgaagtgg 120tttttttttt gttttttttt
ttttttttgt actgngtcct caaatttaat ggattaatgt 180gtcttgtata tataaaaaga
aaacctctac cttcagcctc tgcctattct tgctccgtct 240aggacatccn caatttcgtc
gatgaccagc ttggtgaata agtattactg taccaactgg 300gcctcctcta gcaggcccct
gaaggcagtg gaataaaatg aaatcttcgc cctttaagaa 360ctcctgacct taatgtggta
gtagtatctt gtccttgagg ggatttcctt cccctcaccc 420ctaagacttt cacaacctgg
tgactggaaa gaaccaccac naatcc 46610470DNAHomo
sapiensmisc_feature(208)..(208)n is a, c, g, or t 10aacaaatgct tctgccaaag
tgaaagaatt ttatgtctta atgcttttct ttaaaaaaaa 60aaaaagtcaa cattgaacta
ggacatgctc tgcttcccca cccccatttt gctgactaca 120ttttaaaaaa tctattggca
gaaaacaaga tattttcttc aaatagagtg attatgtttt 180attgctattt tgtttagtat
atattttnct caattgggaa aaaaatctag gtgaaaaaaa 240ttacctaaca agagaagtag
tttacatagt cataacattt aaatttgctg cccaaaaaat 300gtaaaanaat ttnaatgtaa
aatgtcacat antttcaaaa aacttacctc aattgtctat 360catttatcat gtactataag
tcaacttcct aaataagatt cagtccttta ttataagccc 420ctactggtac catngtatac
attaaaaacg ctnctccaaa atttcctggc 470111007DNAHomo sapiens
11aactccaggg ctagtgagct ggaccggaag taggtttcta cccgaccgca ttttacgtgg
60tgctgcattt ccggtagcgg cggcgggaaa tcggctgtgg gagagaggct aggcctctga
120ggaggcgaat ccggcgggta tcagagccat cagaaccgcc accatgacgg tgggcaagag
180cagcaagatg ctgcagcata ttgattacag gatgaggtgc atcctgcagg acggccggat
240cttcattggc accttcaagg cttttgacaa gcacatgaat ttgatcctct gtgactgtga
300tgagttcaga aagatcaagc caaagaactc caaacaagca gaaagggaag agaagcgagt
360cctcggtctg gtgctgctgc gaggggagaa tctggtctca atgacagtag agggacctcc
420tcccaaagat actggtattg ctcgagttcc acttgctgga gctgccgggg gcccagggat
480cggcagggct gctggcagag gaatcccagc tggggttccc atgccccagg ctcctgcagg
540acttgctggg ccagtccgtg gggttggcgg gccatcccaa caggtgatga ccccacaagg
600aagaggtact gttgcagccg ctgcagctgc tgccacagcc agtattgccg gggctccaac
660ccagtaccca cctggccgtg ggggtcctcc cccacctatg ggccgaggag caccccctcc
720aggcatgatg ggcccacctc ctggtatgag acctcctatg ggtcccccaa tggggatccc
780ccctggaaga gggactccaa tgggcatgcc ccctccggga atgcggcctc ctccccctgg
840gatgcgaggg ccccctcccc cgggaatgcg cccaccaagg ccctagactc atcttggccc
900tcctcagctc cctgcctgtt tcccgtaagg ctgtacatag tccttttatc tccttgtggc
960ctatgaaact ggtttataat aaactcttaa gagaacatta taattgc
1007123582DNAHomo sapiens 12ctgctcgcgg cgccgcctcc tgctcctccc gctgctgctg
ccgctgccgc cctgagtcac 60tgcctgcgca gctccggccg cctggctccc catactagtc
gccgatattt ggagttctta 120caacatggca gacattgaca acaaagaaca gtctgaactt
gatcaagatt tggatgatgt 180tgaagaagta gaagaagagg aaactggtga agaaacaaaa
ctcaaagcac gtcagctaac 240tgttcagatg atgcaaaatc ctcagattct tgcagccctt
caagaaagac ttgatggtct 300ggtagaaaca ccaacaggat acattgaaag cctgcctagg
gtagttaaaa gacgagtgaa 360tgctctcaaa aacctgcaag ttaaatgtgc acagatagaa
gccaaattct atgaggaagt 420tcacgatctt gaaaggaagt atgctgttct ctatcagcct
ctatttgata agcgatttga 480aattattaat gcaatttatg aacctacgga agaagaatgt
gaatggaaac cagatgaaga 540agatgagatt tcggaggaat tgaaagaaaa ggccaagatt
gaagatgaga aaaaggatga 600agaaaaagaa gaccccaaag gaattcctga attttggtta
actgttttta agaatgttga 660cttgctcagt gatatggttc aggaacacga tgaacctatt
ctgaagcact tgaaagatat 720taaagtgaag ttctcagatg ctggccagcc tatgagtttt
gtcttagaat ttcactttga 780acccaatgaa tattttacaa atgaagtgct gacaaagaca
tacaggatga ggtcagaacc 840agatgattct gatccctttt cttttgatgg accagaaatt
atgggttgta cagggtgcca 900gatagattgg aaaaaaggaa agaatgtcac tttgaaaact
attaagaaga agcagaaaca 960caagggacgt gggacagttc gtactgtgac taaaacagtt
tccaatgact ctttctttaa 1020cttttttgcc cctcctgaag ttcctgagag tggagatctg
gatgatgatg ctgaagctat 1080ccttgctgca gacttcgaaa ttggtcactt tttacgtgag
cgtataatcc caagatcagt 1140gttatatttt actggagaag ctattgaaga tgatgatgat
gattatgatg aagaaggtga 1200agaagcggat gaggaagggg aagaagaagg agatgaggaa
aatgatccag actatgaccc 1260aaagaaggat caaaacccag cagagtgcaa gcagcagtga
agcaggatgt atgtggcctt 1320gaggataacc tgcactgtaa tagcctaaac acaactctta
tttacttaca gccttatgtt 1380tttgtatttt cttggtagac taggtaattt ttttttaaag
gacaggaaac tgatatttta 1440aagaccaatt tgttctacct agcattttaa ctagtttttc
tgccagctat gttgaatgca 1500caaattctgt cacgcatgtt cattcattgc tacataattt
ggttcttctg gaatattttt 1560atgtagctct tggagtacag ctatgaaaat taacaactgt
taaaggaaat accttttttt 1620tttttttgta attttttcct tgaagaacca aagtattttt
tcagctggtt gttgaatagg 1680gttaagtccg cttggattag ctgtgccttt cattactttg
ttacagaaat gcagtgactt 1740atactaagac aatttattgt ttaaaaaaaa aattggcaag
acaactatat ggttaagaat 1800ttccagtatg accacaccca ataactgtta ttagagtgtt
aatggattat tgtgttttag 1860gtgacatagt taactgtaaa gtaacctgac tcagtatagt
tactggtacc acagtgaggt 1920gaataaaacg ggattttcag aagttagcct gaatttaact
gtatttttaa atttaacctc 1980cattaactaa gcatcttttc tttgtggtag ggtctacctt
ctgcttccct ggaaaggatg 2040aatttacatc atttgacaag cctattttca agttatttgt
tgtttgtttg cttgtttttg 2100tttttgcagc taaaataaaa atttcaaata caattttagt
tcttacaaga taatgtctta 2160attttgtacc aattcaggta gaagtagagg cctaccttga
attaagggtt atactcagtt 2220tttaacacat tgttgaagaa aaggtaccag ctttggaacg
agatgctata ctaataagca 2280agtgtaaaaa aaaaaaaaaa agaggaagaa aatcttaagt
gattgatgct gttttctttt 2340aaaaaaaaaa aaaaaaattc attttctttg ggttagagct
agagagaagg ccccaagctt 2400ctatggtttc ttctaattct tattgcttaa agtatgagta
tgtcacttac ccgtgcttct 2460gtttactgtg taattaaaat gggtagtact gtttacctaa
ctacctcatg gatgtgttaa 2520ggcatattga gttaaatctc atataatgtt tctcaatctt
gttaaaagct caaaattttg 2580ggcctatttg taatgccagt gtgacactaa gcattttgtt
cacaccacgc tttgataact 2640aaactggaaa acaaaggtgt taagtacctc tgttctggat
ctgggcagtc agcactcttt 2700ttagatcttt gtgtggctcc tatttttata gaagtggagg
gatgcactat ttcacaaggt 2760ccaagatttg ttttcagata tttttgatga ctgtattgta
aatactacag ggatagcact 2820atagtattgt agtcatgaga cttaaagtgg aaataagact
atttttgaca aaagatgcca 2880ttaaatttca gactgtagag ccacatttac aatacctcag
gctaattact gttaattttg 2940gggttgaact ttttttgaca gtgagggtgg attattggat
tgtcattaga ggaaggtcta 3000gatttcctgc tcttaataaa attacattga attgattttt
agaggtaatg aaaacttcct 3060ttctgagaag ttagtgttaa ggtcttggaa tgtgaacaca
ttgtttgtag tgctatccat 3120tcctctcctg agattttaac ttactactgg aaatccttaa
ccaattataa tagctttttt 3180tctttatttt caaaatgatt tcctttgctt tgattagaca
ctatgtgctt ttttttttta 3240accatagttc atcgaaatgc agctttttct gaacttcaaa
gatagaatcc catttttaat 3300gaactgaagt agcaaaatca tctttttcat tctttaggaa
atagctattg ccaaagtgaa 3360ggtgtagata atacctagtc ttgttacata aaggggatgt
ggtttgcaga agaattttct 3420ttataaaatt gaagttttaa gggacgtcag tgtttatgcc
atttttccag ttccaaaatg 3480attccattcc attctagaaa tttgaagtat gtaacctgaa
atccttaata aaatttggat 3540ttaattttat aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
aa 3582
SEQ ID NO: 13, length 6232, DNA, Homo sapiens
13 ctgccagatc agtttgtcac
cacccaggct cccttgcctt tggctgggtg caacttccat 60tttaggtgtt ggatctgagg
gggaaaaaaa agagagaggg agagagagag aaagaagagc 120aggaaagatc ccgaaaggag
gaagaggtgg cgaaaaatca actgccctgc tggatttgtc 180tttctcagca ccttggcgaa
gccttgggtt tctttcttaa aggactgatt tttagaactc 240cacatttgag gtgtgtggct
tttgaagaaa atgtatgtac tgacgggaaa aggaagataa 300gcaagtcgaa tttttgtctt
acgctctctc cttcctgctt cctccttgct gtggtggctg 360ggatgctcct tccatgattt
tttgaatcta gactgggctg ttctctgtgt taaaccaatc 420agttgcgacc ttctcttaac
agtgtgaagt gagggggtct ctctccctcc ttctccttcc 480tctgtgattc accttccttt
ttaccctgcc ctgcggcggc tccgcccctt accttcatgg 540acgactcaga ggtggagtcg
accgccagca tcttggcctc tgtgaaggaa caagaggccc 600agtttgagaa gctgacccgg
gcgctggagg aggaacggcg ccacgtctcg gcgcagctgg 660aacgcgtccg ggtctcacca
caagatgcca acccactcat ggccaacggc acactcaccc 720gccggcatca gaacggccgg
tttgtgggcg atgctgacct tgaaagacag aaattttcag 780atttgaaact caacggaccc
caggatcaca gtcaccttct atatagcacc atccccagga 840tgcaggagcc ggggcagatt
gtggagacct acacggagga ggatcctgag ggagccatgt 900ctgtagtctc tgtggagacc
tcagatgatg ggaccactcg gcgcacagag accacggtca 960agaaagtagt gaagactgtg
acaacacgga cagtacagcc agtcgctatg ggaccagacg 1020ggttgcctgt ggatgcttca
tcagtttcta acaactatat ccagactttg ggtcgtgatt 1080tccgcaagaa tggcaatggg
ggacctggtc cctatgtggg gcaagctggc actgctaccc 1140ttcctaggaa cttccactac
cctcctgatg gttatagtcg ccactatgaa gatggttatc 1200caggtggcag tgataactat
ggcagtctgt cccgggtgac ccgcattgag gagcggtata 1260ggcccagcat ggaaggctac
cgggcaccta gtagacagga tgtgtatggg ccccaacccc 1320aggttcgggt aggtgggagc
agcgtggatc tgcatcgctt tcatccagag ccttatgggc 1380tagaggatga ccagcgtagt
atgggctatg atgacctgga ttatggtatg atgtctgatt 1440atggcactgc ccgtcggact
gggacaccct ctgaccctcg tcggcgcctc aggagctatg 1500aagacatgat tggtgaggag
gtgccatcgg atcaatacta ctgggctcct ttggcccagc 1560atgagcgagg aagtttagca
agcttggata gcctgcgcaa aggagggcct ccacctccta 1620attggagaca gccagagctg
ccagaggtga tcgccatgct tggattccgc ttggatgctg 1680tcaagtccaa tgcagctgca
tacctgcaac acttatgcta ccgcaatgac aaggtgaaga 1740ctgacgtgcg gaagctcaag
ggcatcccag tactggtggg attgttagac catcccaaaa 1800aggaagtgca ccttggagcc
tgtggagctc tcaagaatat ctcttttgga cgtgaccagg 1860ataacaagat tgccataaaa
aactgtgatg gtgtgcctgc ccttgtgcga ttgcttcgaa 1920aggctcgtga tatggacctt
actgaagtta ttaccggaac cctgtggaat ctttcatccc 1980atgactcaat caaaatggag
attgtggacc atgcactgca tgccttgaca gatgaagtga 2040tcattcctca ttctggttgg
gagcgggaac ctaatgaaga ctgtaagcca cgccatattg 2100agtgggaatc ggtgctcacc
aacacagctg gctgccttag gaatgtaagc tcagagagga 2160gtgaagctcg ccggaaactt
cgggaatgtg atggtttagt tgatgccctc attttcattg 2220ttcaggctga gattgggcag
aaggattcag acagcaagct tgtagagaac tgtgtttgcc 2280ttcttcggaa cttatcatat
caagttcacc gggagatccc acaggcagag cgttaccaag 2340aggcagctcc caatgttgcc
aacaatactg ggccacatgc tgccagttgc tttggggcca 2400agaagggcaa agggaaaaaa
cctatagagg atccagcaaa cgatacagtg gatttcccta 2460aaagaacgag tccagctcga
ggctatgagc tcttatttca gccagaggtg gttcggatat 2520acatctcact tcttaaggag
agcaagactc ctgccatcct agaagcctca gctggagcta 2580tccagaactt gtgtgctggg
cgctggacgt atggtcgata catccgctct gctctgcgtc 2640aagagaaggc tctttctgcc
atagctgacc tcctgactaa tgaacatgaa cgggtggtga 2700aagctgcatc tggagcactg
agaaacctgg ctgtggatgc tcgcaacaaa gaattaattg 2760gtaaacatgc tattcctaac
ttggtaaaga atctgccagg aggacagcag aactcctctt 2820ggaatttctc tgaggacact
gtcatctcta ttttgaacac tatcaacgag gttatcgctg 2880agaacttgga ggctgccaaa
aagcttcgag agacacaggg tattgagaag ctggtgttga 2940tcaacaaatc agggaaccgc
tcagaaaaag aagttcgagc agcagcactt gtattacaga 3000caatctgggg atataaggaa
ctgcggaagc cactggaaaa agaaggatgg aagaaatcag 3060actttcaggt gaatctaaac
aatgcttccc gaagccagag cagtcattca tatgatgata 3120gtactctccc tctcattgac
cggaaccaaa aatcagataa caactattcc acaccaaatg 3180agagaggaga ccacaataga
acactggatc gatcggggga tctaggcgac atggagccat 3240tgaagggaac aacacccttg
atgcaggacg aggggcagga atctctggag gaagagttgg 3300atgtgttggt tttggatgat
gaggggggcc aagtgtctta cccctccatg cagaagattt 3360agcaccacta tctccgttcc
atctgggctt atatgtactt ttattttttg gtggtgaaat 3420tgactgatga ttttcctttt
tcttcgctgg actattgtgc caactgccag gctgcctcct 3480gcccttacag ccctaagtgg
ctgccttctt tccatcaact cccaacttct tcctgtgaag 3540tttaattgtc tcaacgcctc
cccctccccc attccctcca tttttctccc aagaaacctg 3600actcaattat ttgcatattt
tgagaaactg ctgcagatta gttctttttg ccagttttcc 3660ctggaactcc tggccttttg
tggaggggag ggatggagag aataggaatc ttcactagaa 3720gccgtgggaa gaattggaag
ttacatgctg tatatgcaat gtccagcagt ctgataaact 3780gacgattctt aatcaagatt
tttttcctga tggggaaggg acttttattt tcttttagag 3840aggggaaagt gtgagctctt
cccttattcc taatggctat ttttgaagca aagaaggcca 3900gcaacattgg cacatgccac
ctggcaaagg acccttgagt aagtgaaggt ctcctaaaac 3960tgggattaag aaaccttgct
ctcctcatct ccaaggcagg gaccatcaag aacctacaga 4020ctccatctct tctgcaagcc
tcatgccaac cctgggctat tgctgctgcc ccttaaacac 4080aggctgtcct taacccacct
ctcctgccct gtgatatgtc tgctgagttg gcctggccat 4140ttccaagagg ctgtagaaag
gggagaatgt caaggaagac ttttggtaga gaaggagcag 4200aaagatgtgt ttttgggaag
aagaagacct ctaggaggag ctagtaggaa tgtacatgaa 4260gcaattagtc tgaaactggc
ttccccactc ccccgtttct ccttttccta tccttatagg 4320cctgtccctt gcctctgccc
tggattggtt ggcaaactat aggacttgat gtacataact 4380cctgtccctt ttcccttaca
aggtggggat tgcccctggc tttgcctctt ctttgtgcct 4440ttggcctggg gtgcatctcc
tcccgccctt ccatgtgcct ttctttgcct ctgcagtctc 4500atttctcata attttgcaaa
ttatattttg ttgctttctt acctactatt ggccctaaat 4560agcagaaaga agagaagtga
ccgagagaac ctcagattct tcattgagga ttggtatagc 4620catgatttca gtcatagcaa
gcttttgctc aacagcatat gggtgggatt tggcaaaaat 4680cctattctga tgaatctcaa
agtaaggctg gtaagagaag tgagtggtgt gactcttact 4740ccttaggtgc ccagaattta
ccatcatctc tgaaggagtt acagggaagt ggtctcccca 4800attctcccct ccctccagta
ttgccccctc tcactttagc atatattaat tagcaggttg 4860ggctagagaa atcagctgct
atgcgggttg attattatta ttatttctaa tccttttcct 4920tatttgcctt ctactcccct
taatctaatc taaaagctct gttccatgca actggagttc 4980cttatccctc tcttcccctt
cccttatata ttgaggctat ggggtaggag aaaagtgcac 5040aacccaccac cccctctact
cgtgcattaa aatttcttat ttaccctttt cccccttccc 5100atttcttccc actttcatct
accttttctg gcaaaaagga gccttttgct ctctgtgacc 5160ctaagagcac actgcacagg
gaaaattgcc ccatccagac ctggctccac tcttgatctc 5220tcttgtcctc ttctgctctt
ttcctggtgc tcttttttct cggtggggtg tgggtaatag 5280aacagccgtg ggcttttggg
gacctttaac ttttttttct ctcttttgtt tataaaaaac 5340actaaacatt caattccaga
gaacccaaaa tcccaccttc ccaccgaaca ctactaaggg 5400gcttgtgttc tgctccatac
cttttctctt ttctttctgt cttgttaatg cttttaaaaa 5460caaatgagtt ttttatataa
ataaagtttt taaagtgtgt atgtgggggg tctgtgtcat 5520ttcttcactt caagctgtta
tttcttccct gctttgcatc tttgttactt ccttatgtat 5580cagtgtcctt tccagagcaa
ccagaaggag gttataccag gatttatttt gagctcagcc 5640ccaactcttt atcaagcaac
attcttgtta actatatgtg aaacattttt tcttctgaag 5700attcttaaaa attgaatgtg
gctgaagttg aacatgggag cttattgcta atttagagat 5760aggaaactga agcataaaga
attaatgact tactttaatt actggaattc ttctgcaaca 5820tttgacaaaa ctaaccttga
ataaggccca ctgtaatacg tagctctctt aaatataaca 5880cttaggacta gaagattaga
aactaccaat cccaactacg taataggaaa atgtaggatc 5940aaaaggccca tgtatataag
tactgaccac tgggccataa tgttgcttct caggctatat 6000gcagtccttt agtcagaagt
caataggcct atttattaat attttacaga ccatattacc 6060tggattacca gggactatct
ttgctgcaga gatcaagggt taagatctat gggaagatac 6120ttatttttct gaggtcctta
tgtcctgtca tataattaaa gactcaagag aatttatgtg 6180aaatgctttc tgtatgcccc
aatctttaga ttaaaattat atacctgctc ct 6232
SEQ ID NO: 14, length 1965, DNA, Homo sapiens
14 gtctggttct ctctctccag aaggttctgc cggttccccc agctctgggt acccggctct
60gcatcgcgtc gccatgatgg gccatcgtcc agtgctcgtg ctcagccaga acacaaagcg
120tgaatccgga agaaaagttc aatctggaaa catcaatgct gccaagacta ttgcagatat
180catccgaaca tgtttgggac ccaagtccat gatgaagatg cttttggacc caatgggagg
240cattgtgatg accaatgatg gcaatgccat tcttcgagag attcaagtcc agcatccagc
300ggccaagtcc atgatcgaaa ttagccggac ccaggatgaa gaggttggag atgggaccac
360atcagtaatt attcttgcag gggaaatgct gtctgtagct gagcacttcc tggagcagca
420gatgcaccca acagtggtga tcagtgctta ccgcaaggca ttggatgata tgatcagcac
480cctaaagaaa ataagtatcc cagtcgacat cagtgacagt gatatgatgc tgaacatcat
540caacagctct attactacca aagccatcag tcggtggtca tctttggctt gcaacattgc
600cctggatgct gtcaagatgg tacagtttga ggagaatggt cggaaagaga ttgacataaa
660aaaatatgca agagtggaaa agatacctgg aggcatcatt gaagactcct gtgtcttgcg
720tggagtcatg attaacaagg atgtgaccca tccacgtatg cggcgctata tcaagaaccc
780tcgcattgtg ctgctggatt cttctctgga atacaagaaa ggagaaagcc agactgacat
840tgagattaca cgagaggagg acttcacccg aattctccag atggaggaag agtacatcca
900gcagctctgt gaggacatta tccaactgaa gcccgatgtg gtcatcactg aaaagggcat
960ctcagattta gctcagcact accttatgcg ggccaatatc acagccatcc gcagagtccg
1020gaagacagac aataatcgca ttgctagagc ctgtggggcc cggatagtca gccgaccaga
1080ggaactgaga gaagatgatg ttggaacagg agcaggcctg ttggaaatca agaaaattgg
1140agatgaatac tttactttca tcactgactg caaagacccc aaggcctgca ccattctcct
1200ccggggggct agcaaagaga ttctctcgga agtagaacgc aacctccagg atgccatgca
1260agtgtgtcgc aatgttctcc tggaccctca gctggtgcca gggggtgggg cctccgagat
1320ggctgtggcc catgccttga cagaaaaatc caaggccatg actggtgtgg aacaatggcc
1380atacagggct gttgcccagg ccctagaggt cattcctcgt accctgatcc agaactgtgg
1440ggccagcacc atccgtctac ttacctccct tcgggccaag cacacccagg agaactgtga
1500gacctggggt gtaaatggtg agacgggtac tttggtggac atgaaggaac tgggcatatg
1560ggagccattg gctgtgaagc tgcagactta taagacagca gtggagacgg cagttctgct
1620actgcgaatt gatgacatcg tttcaggcca caaaaagaaa ggcgatgacc agagccggca
1680aggcggggct cctgatgctg gccaggagtg agtgctaggc aaggctactt caatgcacag
1740aaccagcaga gtctcccctt ttcctgagcc agagtgccag gaacactgtg gacgtctttg
1800ttcagaaggg atcaggttgg ggggcagccc ccagtccctt tctgtcccag ctcagttttc
1860caaaagacac tgacatgtaa ttcttctcta ttgtaaggtt tccatttagt ttgcttccga
1920tgattaaatc taagtcattt gaaaaaaaaa aaaaaaaaaa aaaaa
1965
SEQ ID NO: 15, length 3454, DNA, Homo sapiens
15 cgccaaagga aaagcccctt ggatgagagg caggcgcttc
agagaagcta agaaaagcac 60ctctccgcgc gccccacctc ctccgcctcg cgctcctcct
gagcagcggg cccagactgc 120gctccggccg cggccctcgc cccgcggagc cctcctaccc
cggcccgacg ctcggcccgc 180gacctgcccc gagccctctc catggaggca gcccgcccct
ccggctcctg gaacggagcc 240ctctgccggc tgctcctgct gaccctcgcg atcttaatat
ttgccagtga tgcctgcaaa 300aatgtgacat tacatgttcc ctccaaacta gatgccgaga
aacttgttgg tagagttaac 360ctgaaagagt gctttacagc tgcaaatcta attcattcaa
gtgatcctga cttccaaatt 420ttggaggatg gttcagtcta tacaacaaat actattctat
tgtcctcgga gaagagaagt 480tttaccatat tactttccaa cactgagaac caagaaaaga
agaaaatatt tgtctttttg 540gagcatcaaa caaaggtcct aaagaaaaga catactaaag
aaaaagttct aaggcgcgcc 600aagagaagat gggctccaat tccttgttcg atgctagaaa
actccttggg tccttttcca 660cttttccttc aacaggttca atctgacacg gcccaaaact
ataccatata ctattccata 720agaggtcctg gagttgacca agaacctcgg aatttatttt
atgtggagag agacactgga 780aacttgtatt gtactcgtcc tgtagatcgt gagcagtatg
aatcttttga gataattgcc 840tttgcaacaa ctccagatgg gtatactcca gaacttccac
tgcccctaat aatcaaaata 900gaggatgaaa atgataacta cccaattttt acagaagaaa
cttatacttt tacaattttt 960gaaaattgca gagtgggcac tactgtggga caagtgtgtg
ctactgacaa agatgagcct 1020gacacgatgc acacacgcct gaagtactcc atcattgggc
aggtgccacc atcacccacc 1080ctattttcta tgcatccaac tacaggcgtg atcaccacaa
catcatctca gctagacaga 1140gagttaattg acaagtacca gttgaaaata aaagtacaag
acatggatgg tcagtatttt 1200ggtctacaga caacttcaac ttgtatcatt aacattgatg
atgtaaatga ccacttgcca 1260acatttactc gtacttctta tgtgacatca gtggaagaaa
atacagttga tgtggaaatc 1320ttacgagtta ctgttgagga taaggactta gtgaatactg
ctaactggag agctaattat 1380accattttaa agggcaatga aaatggcaat tttaaaattg
taacagatgc caaaaccaat 1440gaaggagttc tttgtgtagt taagcctttg aattatgaag
aaaagcaaca gatgatcttg 1500caaattggtg tagttaatga agctccattt tccagagagg
ctagtccaag atcagccatg 1560agcacagcaa cagttactgt taatgtagaa gatcaggatg
agggccctga gtgtaaccct 1620ccaatacaga ctgttcgcat gaaagaaaat gcagaagtgg
gaacaacaag caatggatat 1680aaagcatatg acccagaaac aagaagtagc agtggcataa
ggtataagaa attaactgat 1740ccaacagggt gggtcaccat tgatgaaaat acaggatcaa
tcaaagtttt cagaagcctg 1800gatagagagg cagagaccat caaaaatggc atatataata
ttacagtcct tgcatcagac 1860caaggaggga gaacatgtac ggggacactg ggcattatac
ttcaagacgt gaatgataac 1920agcccattca tacctaaaaa gacagtgatc atctgcaaac
ccaccatgtc atctgcggag 1980attgttgcgg ttgatcctga tgagcctatc catggcccac
cctttgactt tagtctggag 2040agttctactt cagaagtaca gagaatgtgg agactgaaag
caattaatga tacagcagca 2100cgtctttcct atcagaatga tcctccattt ggctcatatg
tagtacctat aacagtgaga 2160gatagacttg gcatgtctag tgtcacttca ttggatgtta
cactgtgtga ctgcattacc 2220gaaaatgact gcacacatcg tgtagatcca aggattggcg
gtggaggagt acaacttgga 2280aagtgggcca tccttgcaat attgttgggc atagcattgc
tcttttgcat cctgtttacg 2340ctggtctgtg gggcttctgg gacgtctaaa caaccaaaag
taattcctga tgatttagcc 2400cagcagaacc taattgtatc aaacacagaa gctcctggag
atgacaaagt gtattctgcg 2460aatggcttca caacccaaac tgtgggcgct tctgctcagg
gagtttgtgg caccgtggga 2520tcaggaatca aaaacggagg tcaggagacc atcgaaatgg
tgaaaggagg acaccagacc 2580tcggaatcct gccggggggc tggccaccat cacaccctgg
actcctgcag gggaggacac 2640acggaggtgg acaactgcag atacacttac tcggagtggc
acagttttac tcagccccgt 2700cttggtgaaa aagtgtatct gtgtaatcaa gatgaaaatc
acaagcatgc ccaagactat 2760gtcctgacat ataactatga aggaagagga tcggtggctg
ggtctgtagg ttgttgcagt 2820gaacgacaag aagaagatgg gcttgaattt ttggataatt
tggagcccaa atttaggaca 2880ctagcagaag catgcatgaa gagatgagtg tgttctaata
agtctctgaa agccagtggc 2940tttatgactt ttaaaaaaaa ttacaaacca agaatttttt
aaagcagaag atgctatttg 3000tgggggtttt tctctcatta tttggatgga atctctttgg
tcaaatgcac atttacagag 3060agacactata aacaagtaca caaatttttc aatttttaca
tatttttaaa ttacttatct 3120tctatccaag gaggtctaca gagaaattaa agtctgcctt
atttgttaca tttgggtata 3180atgacaacag ccaatttata gtgcaataaa atgtaattaa
ttcaagtcct tattatagac 3240tatttgaagc acaacctaat ggaaaattgt agagaccttg
ctttaacatt atctccagtt 3300aattaagtgt tcatgtggtg cttggaaact gttgttttcc
tgaacatcta aagtgtgtag 3360actgcattct tgctattatt ttattcttgt aatgtgacct
tttcactgtg caaagggaga 3420tttctagcca ggcattgact attacaattt catt
3454
SEQ ID NO: 16, length 619, DNA, Homo sapiens
16 agcagttcta agggaccata
cagagtattc ctctcttcac accaggacca gccactgttg 60cagcatgagt tcccagcagc
agaagcagcc ctgcatccca ccccctcagc ttcagcagca 120gcaggtgaaa cagccttgcc
agcctccacc tcaggaacca tgcatcccca aaaccaagga 180gccctgccac cccaaggtgc
ctgagccctg ccaccccaaa gtgcctgagc cctgccagcc 240caagcttcca gagccatgcc
accccaaggt gcctgagccc tgcccttcaa tagtcactcc 300agcaccagcc cagcagaaga
ccaagcagaa gtaatgtggt ccacagccat gcccttgagg 360agccggccac cagatgctga
atcccctatc ccattctgtg tatgagtccc atttgccttg 420caattagcat tctgtctccc
ccaaaaaaga atgtgctatg aagctttctt tcctacacac 480tctgagtctc tgaatgaagc
tgaaggtctt agtaccagag ctagttttca gctgctcaga 540attcatctga agagagactt
aagatgaaag caaatgattc agctccctta tacccccatt 600aaattcactt tcaattcca
619
SEQ ID NO: 17, length 3528, DNA, Homo sapiens
17 agccaaggac tctggagccg ccgccgccgc tgctgcggtt catatccgga gtagacggag
60ccgcagtaga cggatccgcg gctgcaccaa accactgccc ctcggagcct ggtagtgggc
120cacaagcccc cagtcccaga ggcgtggtgg gtcgggcaga gtcggaagaa ctggctttct
180agctggaaga tgcggaaggg gagcgactag gccgcttgcg tctgggcctg gcagaaggga
240ccggattttc tggcatcctt aaatcttgtg tcaaggattg gttataatat aaccagaaac
300catgacggcg gctgagaacg tatgctacac gttaattaac gtgccaatgg attcagaacc
360accatctgaa attagcttaa aaaatgatct agaaaaagga gatgtaaagt caaagactga
420agctttgaag aaagtaatca ttatgattct gaatggtgaa aaacttcctg gacttctgat
480gaccatcatt cgttttgtgc tacctcttca ggatcacact atcaagaaat tacttctggt
540attttgggaa attgttccta aaacaactcc agatgggaga cttttacatg agatgatcct
600tgtatgtgat gcatacagaa aggatcttca acatcctaat gaatttattc gaggatctac
660tcttcgtttt ctttgcaaat tgaaagaagc agaattgcta gaacctttaa tgccagctat
720tcgtgcatgt ttggagcatc gacacagcta tgttagaaga aatgctgttt tggccatcta
780taccatctat agaaattttg aacatcttat acctgatgct cctgaactga tacatgattt
840tctggtgaat gagaaggatg caagttgcaa aaggaatgca tttatgatgc taattcatgc
900agatcaggat cgagctttgg attacttaag tacttgcatt gatcaagttc aaacatttgg
960agacattctg cagctggtta ttgttgaact gatttataag gtctgtcatg ctaatccatc
1020agaaagagct cgttttattc gctgcatcta taacttatta cagtcatcca gccctgctgt
1080aaaatatgaa gctgctggga cattagtgac actctctagt gcaccaactg caatcaaggc
1140tgctgctcag tgttacattg atttaattat taaggagagc gacaacaatg taaaactcat
1200agttttggat cgcttgatag aattaaaaga gcatcctgct catgaacgag tactacagga
1260tctggttatg gatatcctaa gagtattgag cacaccagac ttagaagtac gaaagaaaac
1320tctgcagtta gcactggatc ttgtctcttc tagaaatgtt gaagagctgg ttattgtcct
1380gaagaaggaa gtgataaaaa caaataatgt gtctgagcat gaagatactg acaaatacag
1440acaactccta gtgcgaacat tgcattcctg ttctgtccga tttccagata tggctgcaaa
1500tgttattcct gtgttaatgg aatttctcag tgacaacaac gaagcagcag ctgctgatgt
1560cttggagttt gttcgtgaag ccattcagcg ctttgataac ctgagaatgc ttattgttga
1620gaagatgctt gaagtctttc atgctattaa atctgtcaag atttaccgag gagcattatg
1680gatcctggga gaatactgta gtaccaagga agacattcag agtgtgatga ctgagatccg
1740caggtccctt ggagagatcc caattgtaga gtcagaaata aagaaagaag ctggtgaatt
1800aaaacctgaa gaagaaataa ctgtagggcc agttcagaaa ttggttactg aaatgggtac
1860ctatgcaact cagagtgccc ttagcagttc tagacccacc aagaaagagg aagacagacc
1920tcccttgaga ggattccttc tggatggaga tttctttgtt gctgcctccc ttgccacaac
1980tctgaccaag attgcattgc gctatgtagc tttggttcag gagaagaaaa agcaaaattc
2040ttttgttgct gaggctatgt tgctcatggc tactatcctg catttgggaa aatcctctct
2100tcctaagaag ccaattactg atgatgatgt ggatcgaatt tccctgtgcc tcaaggtctt
2160gtctgaatgt tcacctttaa tgaatgacat tttcaataag gaatgcagac agtccctttc
2220tcacatgtta tctgctaaac tagaagaaga gaaattatcc caaaagaaag aatctgaaaa
2280gaggaatgtg acagtacagc ctgatgaccc catttccttc atgcaactaa ctgctaagaa
2340tgaaatgaac tgcaaggaag atcagtttca gctgagttta ctggcagcaa tgggtaacac
2400acagaggaaa gaggcagcag atcccctagc atctaaactt aacaaggtca cccaattgac
2460aggtttctca gatcctgtat atgcagaagc ttacgttcat gtcaaccaat atgatattgt
2520cctggatgta cttgttgtga accaaaccag tgatactttg cagaattgca cattagaact
2580agctacacta ggggatctga aacttgtgga aaagccgtct cctttgactc ttgctcctca
2640tgacttcgca aatattaaag ctaacgtcaa agtagcatca acagaaaatg gaataatttt
2700tggtaatata gtttatgatg tctctggagc agcaagtgac agaaattgtg tggttctcag
2760tgatattcac atcgacatca tggactatat ccagcctgca acttgcactg atgcagaatt
2820ccgtcagatg tgggccgaat ttgaatggga aaacaaagtg acagttaaca ccaacatggt
2880tgatttaaat gactacttac agcacatatt aaagtcaacc aatatgaaat gcctgactcc
2940agaaaaggcc ctttctggtt actgtggctt tatggcagcc aacctttatg ctcgttccat
3000atttggtgaa gatgcacttg caaatgtcag cattgagaag ccaattcacc agggaccaga
3060tgctgctgtt accggccata taagaattcg tgcaaagagc cagggaatgg ccttaagtct
3120tggagataaa atcaacttgt cacagaagaa aactagtata taaaaataaa caaaaagtcc
3180ttgaagcttt acagttaatt taggtatggg cttactggac tccaacatct tttgtactct
3240ttcatgctta tatagaatct gagttcatgc tgaatacttt tcagccaata atttatagcc
3300tttcccttaa atcaagattg agtttaaaat tatagtttgt cttttgtctt aacagttctg
3360aatgctgtcc tcaaagtata taatgtttca tgtaccaaga cccttttcac agtacaataa
3420acagatctat tcataaattt ttgttatttt ataaataaat gattacataa ttttagttat
3480aaaaaaaaaa aaaaaaaaaa agaaaaaaaa aaaaaaaaaa aaaaaaaa
3528
SEQ ID NO: 18, length 1447, DNA, Homo sapiens
18 tgtcactgag ggttgactga ctggagagct caagtgcagc
aaagagaagt gtcagagcat 60gagcgccaag tccagaacca tagggattat tggagctcct
ttctcaaagg gacagccacg 120aggaggggtg gaagaaggcc ctacagtatt gagaaaggct
ggtctgcttg agaaacttaa 180agaacaagag tgtgatgtga aggattatgg ggacctgccc
tttgctgaca tccctaatga 240cagtcccttt caaattgtga agaatccaag gtctgtggga
aaagcaagcg agcagctggc 300tggcaaggtg gcagaagtca agaagaacgg aagaatcagc
ctggtgctgg gcggagacca 360cagtttggca attggaagca tctctggcca tgccagggtc
caccctgatc ttggagtcat 420ctgggtggat gctcacactg atatcaacac tccactgaca
accacaagtg gaaacttgca 480tggacaacct gtatctttcc tcctgaagga actaaaagga
aagattcccg atgtgccagg 540attctcctgg gtgactccct gtatatctgc caaggatatt
gtgtatattg gcttgagaga 600cgtggaccct ggggaacact acattttgaa aactctaggc
attaaatact tttcaatgac 660tgaagtggac agactaggaa ttggcaaggt gatggaagaa
acactcagct atctactagg 720aagaaagaaa aggccaattc atctaagttt tgatgttgac
ggactggacc catctttcac 780accagctact ggcacaccag tcgtgggagg tctgacatac
agagaaggtc tctacatcac 840agaagaaatc tacaaaacag ggctactctc aggattagat
ataatggaag tgaacccatc 900cctggggaag acaccagaag aagtaactcg aacagtgaac
acagcagttg caataacctt 960ggcttgtttc ggacttgctc gggagggtaa tcacaagcct
attgactacc ttaacccacc 1020taagtaaatg tggaaacatc cgatataaat ctcatagtta
atggcataat tagaaagcta 1080atcattttct taagcataga gttatccttc taaagacttg
ttctttcaga aaaatgtttt 1140tccaattagt ataaactcta caaattccct cttggtgtaa
aattcaagat gtggaaattc 1200taactttttt gaaatttaaa agcttatatt ttctaacttg
gcaaaagact tatccttaga 1260aagagaagtg tacattgatt tccaattaaa aatttgctgg
cattaaaaat aagcacactt 1320acataagccc ccatacatag agtgggactc ttggaatcag
gagacaaagc taccacatgt 1380ggaaaggtac tatgtgtcca tgtcattcaa aaaatgtgat
tttttataat aaactcttta 1440taacaag
1447
SEQ ID NO: 19, length 3916, DNA, Homo sapiens
19 gcttggggcc gccatcttgg
caagaggcga agcggcagcg gttcctgtca agggggcagc 60aggtccagag ctgctggtgc
tcccgttccc cagaccctac ccctatcccc agtggagccg 120gagtgcgggc gcgccccacc
accgccctca ccatggtgct gttggcagca gcggtctgca 180caaaagcagg aaaggctatt
gtttctcgac agtttgtgga aatgacccga actcggattg 240agggcttatt agcagctttt
ccaaagctca tgaacactgg aaaacaacat acgtttgttg 300aaacagagag tgtaagatat
gtctaccagc ctatggagaa actgtatatg gtactgatca 360ctaccaaaaa cagcaacatt
ttagaagatt tggagaccct aaggctcttc tcaagagtga 420tccctgaata ttgccgagcc
ttagaagaga atgaaatatc tgagcactgt tttgatttga 480tttttgcttt tgatgaaatt
gtcgcactgg gataccggga gaatgttaac ttggcacaga 540tcagaacctt cacagaaatg
gattctcatg aggagaaggt gttcagagcc gtcagagaga 600ctcaagaacg tgaagctaag
gctgagatgc gtcgtaaagc aaaggaatta caacaggccc 660gaagagatgc agagagacag
ggcaaaaaag caccaggatt tggcggattt ggcagctctg 720cagtatctgg aggcagcaca
gctgccatga tcacagagac catcattgaa actgataaac 780caaaagtggc acctgcacca
gccaggcctt caggccccag caaggcttta aaacttggag 840ccaaaggaaa ggaagtagat
aactttgtgg acaaattaaa atctgaaggt gaaaccatca 900tgtcctctag tatgggcaag
cgtacttctg aagcaaccaa aatgcatgct ccacccatta 960atatggaaag tgtacatatg
aagattgaag aaaagataac attaacctgt ggacgagacg 1020gaggattaca gaatatggag
ttgcatggca tgatcatgct taggatctca gatgacaagt 1080atggccgaat tcgtcttcat
gtggaaaatg aagataagaa aggggtgcag ctacagaccc 1140atccaaatgt ggataaaaaa
cttttcactg cagagtctct aattggcctg aagaatccag 1200agaagtcatt tccagtcaac
agtgacgtag gggtgctaaa gtggagacta caaaccacag 1260aggaatcttt tattccactg
acaattaatt gctggccctc ggagagtgga aatggctgtg 1320atgtcaacat agaatatgag
ctacaagaag ataatttaga actgaatgat gtggttatca 1380ccatcccact cccgtctggt
gtcggcgcgc ctgttatcgg tgagatcgat ggggagtatc 1440gacatgacag tcgacgaaat
accctggagt ggtgcctgcc tgtgattgat gccaaaaata 1500agagtggcag cctggagttt
agcattgctg ggcagcccaa tgacttcttc cctgttcaag 1560tttcctttgt ctccaagaaa
aattactgta acatacaggt taccaaagtg acccaggtag 1620atggaaacag ccccgtcagg
ttttccacag agaccacttt cctagtggat aagtatgaaa 1680ttctgtaata ccaagaagag
ggagctgaaa aggaaaattt tcagattaat aaagaagacg 1740ccaatgatgg ctgaagagtt
tttcccagat ttacaagcca ctggagaccc cttttttctg 1800atacaatgca cgattctctg
cgcgcaagga ccctcgactc acccccatgt ttcagtgtca 1860cagagacatt ctttgataag
gaaatggcac aaacataaag ggaaaggctg ctaattttct 1920ttggcagatt gtattggcca
gcaggaaagc aagctctcca gagaatgccc ccagttaaat 1980acctcctcta cctttaccta
agttgctcct ttatttttat tttattatta ttattattat 2040tattattttt tgagatggag
tctcactttg taacccaggc tggaatgcaa tggcatgatc 2100tcagctcact gcaacctccg
cctcctgggt tcaagcaagt ctcctgcctc agcctccgag 2160tagctgggac tacaggtgca
cgccaccacg cctggctaat tttttgtatt ttagtagaga 2220cggggtttca ccgtgttgcc
caggctggtc gcgaactcct gagctcaggc aatccgccca 2280cctcagcctc ccaaagtgtt
gggattacag gcatgagcca ccatgcccag ctgctccttt 2340attttaatcc ctaaatataa
tccctaaata tagttatatt tcatacttag tttgttttta 2400aaaagttttc tctgtagaaa
attttaatca ttcataccct ttacctttag gtttttcttt 2460ctatacattc agtcaggcac
tgggatcatc tgtttacagg cattatattt atttggcact 2520cctggaacaa gtatatctaa
cccattcttg atttttggac tattcaggtg aactatttga 2580ggggtatggg gtctagaagt
taaaagatac gcatgtcttc tgttcttttc ccgtatcaat 2640tcattccttc atctctttgc
caagttgttt tcctttcagg gcctgtcctt ccagtttaga 2700acagtaccat gaatcccact
tgtgtcaata ttaaagatag ctgagaagca cctttcaaat 2760ggcacagtcc ctcttcaaga
tgtctaaaag aatggttatg tctgtccagt tagggatttc 2820acatccacat gtaatcatgt
ctgctgctgt tgctacccaa attttcattt ctccacattt 2880tgggtactta agctaaaacg
taatggccac agtctgtaat ccattcacat tcctcagttt 2940caccacctcc ctcttccaga
ctgcactctc tgtcatcagt cccctccttt ctaacagaaa 3000tggggttatg attttgaagg
ctgtgggttc agggagtctt tgccaatcct gttggcccta 3060aactatcaag gaggctccat
ttcaccattt gattttttgc atttcaggag gcaactgatt 3120gtttcgatat gtacatatta
ctcacgtata ccccatttcc ttccagtcag cccaacattt 3180tccaccagtc tgtccccatc
tctgaaatcc ttccttctct ttccccctaa gtcttttgag 3240tgtcatcatg tactggtggt
ttctcggttc catctcatcc atttcctttt caatggagac 3300tacagcgtca gccagctcag
ccttggcttt taactcaata ttccagtcca taggggtggt 3360taaaagttgc tgcaaggctg
caggcactgg cagtgggaag aggcagacga ctagatgact 3420tctgcacttt tagctggttg
aaaagtacca ctcccactct gaacatctgg ccgtccctgc 3480aaagagtgta ctgtgcttga
agcagagcac tcacacataa atggctgtgt gtggaattgc 3540ttgccaaaga agtttctagc
ctttcccttt cccctaactg catcagggaa gaattcttat 3600ctctagcttg gtttccacat
gaggtttttc tgagaagggc ttgggacaag aagtctgtca 3660tgttagttaa gcaggcaaga
aatcctacta atccagtttt gtttgaaagt tgtttgtccg 3720tatgattttt taaaagtcaa
gtttaatttc aaaaaacctt ttttttctga gattactttt 3780ggggtaatat ttaaaatgag
agacattttg taaccctgta aaatacatag ggaatataac 3840attccagtgt atacaaagaa
ggcaaattct ttaatcaaat aaagcgtatt ataaaatgag 3900aaaaaaaaaa aaaaaa
3916
SEQ ID NO: 20, length 2280, DNA, Homo sapiens
20 tccagccaga aggatggggt ggctcccact cctgctgctt ctgactcaat gcttaggggt
60ccctgggcag cgctcgccat tgaatgactt ccaagtgctc cggggcacag agctacagca
120cctgctacat gcggtggtgc ccgggccttg gcaggaggat gtggcagatg ctgaagagtg
180tgctggtcgc tgtgggccct taatggactg ccgggccttc cactacaacg tgagcagcca
240tggttgccaa ctgctgccat ggactcaaca ctcgccccac acgaggctgc ggcgttctgg
300gcgctgtgac ctcttccaga agaaagacta cgtacggacc tgcatcatga acaatggggt
360tgggtaccgg ggcaccatgg ccacgaccgt gggtggcctg ccctgccagg cttggagcca
420caagttcccg aatgatcaca agtacacgcc cactctccgg aatggcctgg aagagaactt
480ctgccgtaac cctgatggcg accccggagg tccttggtgc tacacaacag accctgctgt
540gcgcttccag agctgcggca tcaaatcctg ccgggaggcc gcgtgtgtct ggtgcaatgg
600cgaggaatac cgcggcgcgg tagaccgcac ggagtcaggg cgcgagtgcc agcgctggga
660tcttcagcac ccgcaccagc accccttcga gccgggcaag ttcctcgacc aaggtctgga
720cgacaactat tgccggaatc ctgacggctc cgagcggcca tggtgctaca ctacggatcc
780gcagatcgag cgagagttct gtgacctccc ccgctgcggg tccgaggcac agccccgcca
840agaggccaca actgtcagct gcttccgcgg gaagggtgag ggctaccggg gcacagccaa
900taccaccact gcgggcgtac cttgccagcg ttgggacgcg caaatccctc atcagcaccg
960atttacgcca gaaaaatacg cgtgcaaaga ccttcgggag aacttctgcc ggaaccccga
1020cggctcagag gcgccctggt gcttcacact gcggcccggc atgcgcgcgg ccttttgcta
1080ccagatccgg cgttgtacag acgacgtgcg gccccaggac tgctaccacg gcgcagggga
1140gcagtaccgc ggcacggtca gcaagacccg caagggtgtc cagtgccagc gctggtccgc
1200tgagacgccg cacaagccgc agttcacgtt tacctccgaa ccgcatgcac aactggagga
1260gaacttctgc cggaacccag atggggatag ccatgggccc tggtgctaca cgatggaccc
1320aaggacccca ttcgactact gtgccctgcg acgctgcgct gatgaccagc cgccatcaat
1380cctggacccc ccagaccagg tgcagtttga gaagtgtggc aagagggtgg atcggctgga
1440tcagcggcgt tccaagctgc gcgtggttgg gggccatccg ggcaactcac cctggacagt
1500cagcttgcgg aatcggcagg gccagcattt ctgcgggggg tctctagtga aggagcagtg
1560gatactgact gcccggcagt gcttctcctc ctgccatatg cctctcacgg gctatgaggt
1620atggttgggc accctgttcc agaacccaca gcatggagag ccaagcctac agcgggtccc
1680agtagccaag atggtgtgtg ggccctcagg ctcccagctt gtcctgctca agctggagag
1740atctgtgacc ctgaaccagc gtgtggccct gatctgcctg ccccctgaat ggtatgtggt
1800gcctccaggg accaagtgtg agattgcagg ctggggtgag accaaaggta cgggtaatga
1860cacagtccta aatgtggcct tgctgaatgt catctctaac caggagtgta acatcaagca
1920ccgaggacgt gtgcgggaga gtgagatgtg cactgaggga ctgttggccc ctgtgggggc
1980ctgtgagggt gactacgggg gcccacttgc ctgctttacc cacaactgct gggtcctgga
2040aggaattata atccccaacc gagtatgcgc aaggtcccgc tggccagctg tcttcacgcg
2100tgtctctgtg tttgtggact ggattcacaa ggtcatgaga ctgggttagg cccagccttg
2160atgccatatg ccttggggag gacaaaactt cttgtcagac ataaagccat gtttcctctt
2220taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaataaaaaa aaaaaaaaaa aaaaaaaaaa
2280
SEQ ID NO: 21, length 2876, DNA, Homo sapiens
21 gaattcctgc agctcagcag ccgccgccag agcaggacga
accgccaatc gcaaggcacc 60tctgagaact tcaggatgca gatgtctcca gccctcacct
gcctagtcct gggcctggcc 120cttgtctttg gtgaagggtc tgctgtgcac catcccccat
cctacgtggc ccacctggcc 180tcagacttcg gggtgagggt gtttcagcag gtggcgcagg
cctccaagga ccgcaacgtg 240gttttctcac cctatggggt ggcctcggtg ttggccatgc
tccagctgac aacaggagga 300gaaacccagc agcagattca agcagctatg ggattcaaga
ttgatgacaa gggcatggcc 360cccgccctcc ggcatctgta caaggagctc atggggccat
ggaacaagga tgagatcagc 420accacagacg cgatcttcgt ccagcgggat ctgaagctgg
tccagggctt catgccccac 480ttcttcaggc tgttccggag cacggtcaag caagtggact
tttcagaggt ggagagagcc 540agattcatca tcaatgactg ggtgaagaca cacacaaaag
gtatgatcag caacttgctt 600gggaaaggag ccgtggacca gctgacacgg ctggtgctgg
tgaatgccct ctacttcaac 660ggccagtgga agactccctt ccccgactcc agcacccacc
gccgcctctt ccacaaatca 720gacggcagca ctgtctctgt gcccatgatg gctcagacca
acaagttcaa ctatactgag 780ttcaccacgc ccgatggcca ttactacgac atcctggaac
tgccctacca cggggacacc 840ctcagcatgt tcattgctgc cccttatgaa aaagaggtgc
ctctctctgc cctcaccaac 900attctgagtg cccagctcat cagccactgg aaaggcaaca
tgaccaggct gccccgcctc 960ctggttctgc ccaagttctc cctggagact gaagtcgacc
tcaggaagcc cctagagaac 1020ctgggaatga ccgacatgtt cagacagttt caggctgact
tcacgagtct ttcagaccaa 1080gagcctctcc acgtcgcgca ggcgctgcag aaagtgaaga
tcgaggtgaa cgagagtggc 1140acggtggcct cctcatccac agctgtcata gtctcagccc
gcatggcccc cgaggagatc 1200atcatggaca gacccttcct ctttgtggtc cggcacaacc
ccacaggaac agtccttttc 1260atgggccaag tgatggaacc ctgaccctgg ggaaagacgc
cttcatctgg gacaaaactg 1320gagatgcatc gggaaagaag aaactccgaa gaaaagaatt
ttagtgttaa tgactctttc 1380tgaaggaaga gaagacattt gccttttgtt aaaagatggt
aaaccagatc tgtctccaag 1440accttggcct ctccttggag gacctttagg tcaaactccc
tagtctccac ctgagaccct 1500gggagagaag tttgaagcac aactccctta aggtctccaa
accagacggt gacgcctgcg 1560ggaccatctg gggcacctgc ttccacccgt ctctctgccc
actcgggtct gcagacctgg 1620ttcccactga ggccctttgc aggatggaac tacggggctt
acaggagctt ttgtgtgcct 1680ggtagaaact atttctgttc cagtcacatt gccatcactc
ttgtactgcc tgccaccgcg 1740gaggaggctg gtgacaggcc aaaggccagt ggaagaaaca
ccctttcatc tcagagtcca 1800ctgtggcact ggccacccct ccccagtaca ggggtgctgc
aggtggcaga gtgaatgtcc 1860cccatcatgt ggcccaactc tcctggcctg gccatctccc
tccccagaaa cagtgtgcat 1920gggttatttt ggagtgtagg tgacttgttt actcattgaa
gcagatttct gcttcctttt 1980atttttatag gaatagagga agaaatgtca gatgcgtgcc
cagctcttca ccccccaatc 2040tcttggtggg gaggggtgta cctaaatatt tatcatatcc
ttgcccttga gtgcttgtta 2100gagagaaaga gaactactaa ggaaaataat attatttaaa
ctcgctccta gtgtttcttt 2160gtggtctgtg tcaccgtatc tcaggaagtc cagccacttg
actggcacac acccctccgg 2220acatccagcg tgacggagcc cacactgcca ccttgtggcc
gcctgagacc ctcgcgcccc 2280ccgcgccccc cgcgcccctc tttttcccct tgatggaaat
tgaccataca atttcatcct 2340ccttcagggg atcaaaagga cggagtgggg ggacagagac
tcagatgagg acagagtggt 2400ttccaatgtg ttcaatagat ttaggagcag aaatgcaagg
ggctgcatga cctaccagga 2460cagaactttc cccaattaca gggtgactca cagccgcatt
ggtgactcac ttcaatgtgt 2520catttccggc tgctgtgtgt gagcagtgga cacgtgaggg
gggggtgggt gagagagaca 2580ggcagctcgg attcaactac cttagataat atttctgaaa
acctaccagc cagagggtag 2640ggcacaaaga tggatgtaat gcactttggg aggccaaggc
gggaggattg cttgagccca 2700ggagttcaag accagcctgg gcaacatacc aagacccccg
tctctttaaa aatatatata 2760ttttaaatat acttaaatat atatttctaa tatctttaaa
tatatatata tattttaaag 2820accaatttat gggagaattg cacacagatg tgaaatgaat
gtaatctaat agaagc 2876
<210> 22
<211> 1310
<212> DNA
<213> Homo sapiens
<400> 22
gctcggagcc cggagcgtgc
ctcggcggcc tgtcggtttt caccatggag cagctgagct 60cagcaaacac ccgcttcgcc
ttggacctgt tcctggcgtt gagtgagaac aatccggctg 120gaaacatctt catctctccc
ttcagcattt catctgctat ggccatggtt tttctgggga 180ccagaggtaa cacggcagca
cagctgtcca agactttcca tttcaacacg gttgaagagg 240ttcattcaag attccagagt
ctgaatgctg atatcaacaa acgtggagcg tcttatattc 300tgaaacttgc taatagatta
tatggagaga aaacttacaa tttccttcct gagttcttgg 360tttcgactca gaaaacatat
ggtgctgacc tggccagtgt ggattttcag catgcctctg 420aagatgcaag gaagaccata
aaccagtggg tcaaaggaca gacagaagga aaaattccgg 480aactgttggc ttcgggcatg
gttgataaca tgaccaaact tgtgctagta aatgccatct 540atttcaaggg aaactggaag
gataaattca tgaaagaagc cacgacgaat gcaccattca 600gattgaataa gaaagacaga
aaaactgtga aaatgatgta tcagaagaaa aaatttgcat 660atggctacat cgaggacctt
aagtgccgtg tgctggaact gccttaccaa ggcgaggagc 720tcagcatggt catcctgctg
ccggatgaca ttgaggacga gtccacgggc ctgaagaaga 780ttgaggaaca gttgactttg
gaaaagttgc atgagtggac taaacctgag aatctcgatt 840tcattgaagt taatgtcagc
ttgcccaggt tcaaactgga agagagttac actctcaact 900ccgacctcgc ccgcctaggt
gtgcaggatc tctttaacag tagcaaggct gatctgtctg 960gcatgtcagg agccagagat
atttttatat caaaaattgt ccacaagtca tttgtggaag 1020tgaatgaaga gggaacagag
gcggcagctg ccacagcagg catcgcaact ttctgcatgt 1080tgatgcccga agaaaatttc
actgccgacc atccattcct tttctttatt cggcataatt 1140cctcaggtag catcctattc
ttggggagat tttcttcccc ttagaagaaa gagactgtag 1200caatacaaaa atcaagctta
gtgctttatt acctgagttt ttaatagagc caatatgtct 1260tatatcttta ccaataaaac
cactgtccag aaaaaaaaaa aaaaaaaaaa 1310
<210> 23
<211> 495
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> (488)..(488)
<223> n is a, c, g, or t
<400> 23
tttgaatatt tatgtcaaat
tacaaaccag tttaaagctg cctatttggc aaaatgatct 60gctgcagaat tttcattttc
tgtctctaga atgcagaaaa atgtcttaaa gttccttaat 120ttgcttaatt taatgtggtt
tccagaagat gtgaaaacct cctttatttt taaaatacct 180gattccacat tggtcaatag
tttcctcttt aatttacctc tctcctctca ctttatctat 240aataagcagg gagaaatgaa
gacacaccat caacacgttt gcttagatat gtcctcaact 300aaatttctag tgtcacttac
taattctaat ttcatccaat ataacataat taagataaat 360tctataacaa gctacacata
ctttccagtt ctaataccat gtttgtgatg gaaacaaagc 420aggagtgccc tctgcaaggt
gatcatctga gggtccaaga tgaaggggca cacaggtatt 480ttatctgncc cacac
495
<210> 24
<211> 488
<212> DNA
<213> Homo sapiens
<400> 24
ctgatatttt gtatattaat gaattatcca agattcgatg ggatttatca gtgtgtagat
60agctctataa tgcttgaatt gtacacttct aagtgtgcag tgcaagagct tgtttatatt
120tcatactttt tatactttga ggaaaaaaag tcaaagaaaa attgtatttg agggaaaaaa
180ccatgaccaa gtaaaggata aattcaaaaa atagcctcat gagacttggc atacacactc
240atgggattcc agttattatg gagtgcttcc atccctctcc accccttccc cccaaaaggt
300tttctttgca agtgcttttg gaactaagag ctagtatctt ggattaactg atgcctgcta
360gtgctttctg attactcgca ttctgtttct tgctttaaaa gaagagtaaa gacaagagtg
420ttggaccagt attgcagttc tgtagtgtca tttcttataa aaaacaaaac aacaacaata
480atttatca
488
<210> 25
<211> 1396
<212> DNA
<213> Homo sapiens
<400> 25
tgactatcca gctctgagag acgggagttt ggagttgccc
gctttacttt ggttgggttg 60gggggggcgg cgggctgttt tgttcctttt cttttttaag
agttgggttt tcttttttaa 120ttatccaaac agtgggcagc ttcctccccc acacccaagt
atttgcacaa tatttgtgcg 180gggtatgggg gtgggttttt aaatctcgtt tctcttggac
aagcacaggg atctcgttct 240cctcattttt tgggggtgtg tggggacttc tcaggtcgtg
tccccagcct tctctgcagt 300cccttctgcc ctgccgggcc cgtcgggagg cgccatggct
cggatgaacc gcccggcccc 360ggtggaggtg agctacaaac acatgcgctt cctcatcacc
cacaacccca ccaacgccac 420gctcagcacc ttcattgagg acctgaagaa gtacggggct
accactgtgg tgcgtgtgtg 480tgaagtgacc tatgacaaaa cgccgctgga gaaggatggc
atcaccgttg tggactggcc 540gtttgacgat ggggcgcccc cgcccggcaa ggtagtggaa
gactggctga gcctggtgaa 600ggccaagttc tgtgaggccc ccggcagctg cgtggctgtg
cactgcgtgg cgggcctggg 660ccgggctcca gtccttgtgg cgctggcgct tattgagagc
gggatgaagt acgaggacgc 720catccagttc atccgccaga agcgccgcgg agccatcaac
agcaagcagc tcacctacct 780ggagaaatac cggcccaaac agaggctgcg gttcaaagac
ccacacacgc acaagacccg 840gtgctgcgtt atgtagctca ggaccttggc tgggcctggt
cgtcatgtag gtcaggacct 900tggctggacc tggaggccct gcccagccct gctctgccca
gcccagcagg ggctccaggc 960cttggctggc cccacatcgc cttttcctcc ccgacacctc
cgtgcacttg tgtccgagga 1020gcgaggagcc cctcgggccc tgggtggcct ctgggccctt
tctcctgtct ccgccactcc 1080ctctggcggc gctggccgtg gctctgtctc tctgaggtgg
gtcgggcgcc ctctgcccgc 1140cccctcccac accagccagg ctggtctcct ctagcctgtt
tgttgtgggg tgggggtata 1200ttttgtaacc actgggcccc cagcccctct tttgcgaccc
cttgtcctga cctgttctcg 1260gcaccttaaa ttattagacc ccggggcagt caggtgctcc
ggacacccga aggcaataaa 1320acaggagccg tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa 1380aaaaaaaaaa aaaaaa
1396
<210> 26
<211> 2294
<212> DNA
<213> Homo sapiens
<400> 26
aagcagttgt tttgctggaa
ggagggagtg cgcgggctgc cccgggctcc tccctgccgc 60ctcctctcag tggatggttc
caggcaccct gtctggggca gggagggcac aggcctgcac 120atcgaaggtg gggtgggacc
aggctgcccc tcgccccagc atccaagtcc tcccttgggc 180gcccgtggcc ctgcagactc
tcagggctaa ggtcctctgt tgctttttgg ttccacctta 240gaagaggctc cgcttgacta
agagtagctt gaaggaggca ccatgcagga gctgcatctg 300ctctggtggg cgcttctcct
gggcctggct caggcctgcc ctgagccctg cgactgtggg 360gaaaagtatg gcttccagat
cgccgactgt gcctaccgcg acctagaatc cgtgccgcct 420ggcttcccgg ccaatgtgac
tacactgagc ctgtcagcca accggctgcc aggcttgccg 480gagggtgcct tcagggaggt
gcccctgctg cagtcgctgt ggctggcaca caatgagatc 540cgcacggtgg ccgccggagc
cctggcctct ctgagccatc tcaagagcct ggacctcagc 600cacaatctca tctctgactt
tgcctggagc gacctgcaca acctcagtgc cctccaattg 660ctcaagatgg acagcaacga
gctgaccttc atcccccgcg acgccttccg cagcctccgt 720gctctgcgct cgctgcaact
caaccacaac cgcttgcaca cattggccga gggcaccttc 780accccgctca ccgcgctgtc
ccacctgcag atcaacgaga accccttcga ctgcacctgc 840ggcatcgtgt ggctcaagac
atgggccctg accacggccg tgtccatccc ggagcaggac 900aacatcgcct gcacctcacc
ccatgtgctc aagggtacgc cgctgagccg cctgccgcca 960ctgccatgct cggcgccctc
agtgcagctc agctaccaac ccagccagga tggtgccgag 1020ctgcggcctg gttttgtgct
ggcactgcac tgtgatgtgg acgggcagcc ggcccctcag 1080cttcactggc acatccagat
acccagtggc attgtggaga tcaccagccc caacgtgggc 1140actgatgggc gtgccctgcc
tggcacccct gtggccagct cccagccgcg cttccaggcc 1200tttgccaatg gcagcctgct
tatccccgac tttggcaagc tggaggaagg cacctacagc 1260tgcctggcca ccaatgagct
gggcagtgct gagagctcag tggacgtggc actggccacg 1320cccggtgagg gtggtgagga
cacactgggg cgcaggttcc atggcaaagc ggttgaggga 1380aagggctgct atacggttga
caacgaggtg cagccatcag ggccggagga caatgtggtc 1440atcatctacc tcagccgtgc
tgggaaccct gaggctgcag tcgcagaagg ggtccctggg 1500cagctgcccc caggcctgct
cctgctgggc caaagcctcc tcctcttctt cttcctcacc 1560tccttctagc cccacccagg
gcttccctaa ctcctcccct tgcccctacc aatgcccctt 1620taagtgctgc aggggtctgg
ggttggcaac tcctgaggcc tgcatgggtg acttcacatt 1680ttcctacctc tccttctaat
ctcttctaga gcacctgcta tccccaactt ctagacctgc 1740tccaaactag tgactaggat
agaatttgat cccctaactc actgtctgcg gtgctcattg 1800ctgctaacag cattgcctgt
gctctcctct caggggcagc atgctaacgg ggcgacgtcc 1860taatccaact gggagaagcc
tcagtggtgg aattccaggc actgtgactg tcaagctggc 1920aagggccagg attgggggaa
tggagctggg gcttagctgg gaggtggtct gaagcagaca 1980gggaatggga gaggaggatg
ggaagtagac agtggctggt atggctctga ggctccctgg 2040ggcctgctca agctcctcct
gctccttgct gttttctgat gatttggggg cttgggagtc 2100cctttgtcct catctgagac
tgaaatgtgg ggatccagga tggcttcctt cctcttaccc 2160ttcctccctc agcctgcaac
ctctatcctg gaacctgtcc tccctttctc cccaactatg 2220catctgttgt ctgctcctct
gcaaaggcca gccagcttgg gagcagcaga gaaataaaca 2280gcatttctga tgcc
2294
<210> 27
<211> 1399
<212> DNA
<213> Homo sapiens
<400> 27
agtgtgaaat cttcagagaa gaatttctct ttagttcttt gcaagaaggt agagataaag
60acactttttc aaaaatggca atggtatcag aattcctcaa gcaggcctgg tttattgaaa
120atgaagagca ggaatatgtt caaactgtga agtcatccaa aggtggtccc ggatcagcgg
180tgagccccta tcctaccttc aatccatcct cggatgtcgc tgccttgcat aaggccataa
240tggttaaagg tgtggatgaa gcaaccatca ttgacattct aactaagcga aacaatgcac
300agcgtcaaca gatcaaagca gcatatctcc aggaaacagg aaagcccctg gatgaaacac
360ttaagaaagc ccttacaggt caccttgagg aggttgtttt agctctgcta aaaactccag
420cgcaatttga tgctgatgaa cttcgtgctg ccatgaaggg ccttggaact gatgaagata
480ctctaattga gattttggca tcaagaacta acaaagaaat cagagacatt aacagggtct
540acagagagga actgaagaga gatctggcca aagacataac ctcagacaca tctggagatt
600ttcggaacgc tttgctttct cttgctaagg gtgaccgatc tgaggacttt ggtgtgaatg
660aagacttggc tgattcagat gccagggcct tgtatgaagc aggagaaagg agaaagggga
720cagacgtaaa cgtgttcaat accatcctta ccaccagaag ctatccacaa cttcgcagag
780tgtttcagaa atacaccaag tacagtaagc atgacatgaa caaagttctg gacctggagt
840tgaaaggtga cattgagaaa tgcctcacag ctatcgtgaa gtgcgccaca agcaaaccag
900ctttctttgc agagaagctt catcaagcca tgaaaggtgt tggaactcgc cataaggcat
960tgatcaggat tatggtttcc cgttctgaaa ttgacatgaa tgatatcaaa gcattctatc
1020agaagatgta tggtatctcc ctttgccaag ccatcctgga tgaaaccaaa ggagattatg
1080agaaaatcct ggtggctctt tgtggaggaa actaaacatt cccttgatgg tctcaagcta
1140tgatcagaag actttaatta tatattttca tcctataagc ttaaatagga aagtttcttc
1200aacaggatta cagtgtagct acctacatgc tgaaaaatat agcctttaaa tcatttttat
1260attataactc tgtataatag agataagtcc attttttaaa aatgttttcc ccaaaccata
1320aaaccctata caagttgttc tagtaacaat acatgagaaa gatgtctatg tagctgaaaa
1380taaaatgacg tcacaagac
1399
<210> 28
<211> 3088
<212> DNA
<213> Homo sapiens
<400> 28
acaaaaaagc ttttacgagg tatcagcact tttctttcat
tagggggaag gcgtgaggaa 60agtaccaaac agcagcggag ttttaaactt taaatagaca
ggtctgagtg cctgaacttg 120ccttttcatt ttacttcatc ctccaaggag ttcaatcact
tggcgtgact tcactacttt 180taagcaaaag agtggtgccc aggcaacatg ggtgactgga
gcgccttagg caaactcctt 240gacaaggttc aagcctactc aactgctgga gggaaggtgt
ggctgtcagt acttttcatt 300ttccgaatcc tgctgctggg gacagcggtt gagtcagcct
ggggagatga gcagtctgcc 360tttcgttgta acactcagca acctggttgt gaaaatgtct
gctatgacaa gtctttccca 420atctctcatg tgcgcttctg ggtcctgcag atcatatttg
tgtctgtacc cacactcttg 480tacctggctc atgtgttcta tgtgatgcga aaggaagaga
aactgaacaa gaaagaggaa 540gaactcaagg ttgcccaaac tgatggtgtc aatgtggaca
tgcacttgaa gcagattgag 600ataaagaagt tcaagtacgg tattgaagag catggtaagg
tgaaaatgcg aggggggttg 660ctgcgaacct acatcatcag tatcctcttc aagtctatct
ttgaggtggc cttcttgctg 720atccagtggt acatctatgg attcagcttg agtgctgttt
acacttgcaa aagagatccc 780tgcccacatc aggtggactg tttcctctct cgccccacgg
agaaaaccat cttcatcatc 840ttcatgctgg tggtgtcctt ggtgtccctg gccttgaata
tcattgaact cttctatgtt 900ttcttcaagg gcgttaagga tcgggttaag ggaaagagcg
acccttacca tgcgaccagt 960ggtgcgctga gccctgccaa agactgtggg tctcaaaaat
atgcttattt caatggctgc 1020tcctcaccaa ccgctcccct ctcgcctatg tctcctcctg
ggtacaagct ggttactggc 1080gacagaaaca attcttcttg ccgcaattac aacaagcaag
caagtgagca aaactgggct 1140aattacagtg cagaacaaaa tcgaatgggg caggcgggaa
gcaccatctc taactcccat 1200gcacagcctt ttgatttccc cgatgataac cagaattcta
aaaaactagc tgctggacat 1260gaattacagc cactagccat tgtggaccag cgaccttcaa
gcagagccag cagtcgtgcc 1320agcagcagac ctcggcctga tgacctggag atctagatac
aggcttgaaa gcatcaagat 1380tccactcaat tgtggagaag aaaaaaggtg ctgtagaaag
tgcaccaggt gttaattttg 1440atccggtgga ggtggtactc aacagcctta ttcatgaggc
ttagaaaaca caaagacatt 1500agaataccta ggttcactgg gggtgtatgg ggtagatggg
tggagaggga ggggataaga 1560gaggtgcatg ttggtattta aagtagtgga ttcaaagaac
ttagattata aataagagtt 1620ccattaggtg atacatagat aagggctttt tctccccgca
aacaccccta agaatggttc 1680tgtgtatgtg aatgagcggg tggtaattgt ggctaaatat
ttttgtttta ccaagaaact 1740gaaataattc tggccaggaa taaatacttc ctgaacatct
taggtctttt caacaagaaa 1800aagacagagg attgtcctta agtccctgct aaaacattcc
attgttaaaa tttgcacttt 1860gaaggtaagc tttctaggcc tgaccctcca ggtgtcaatg
gacttgtgct actatatttt 1920tttattcttg gtatcagttt aaaattcaga caaggcccac
agaataagat tttccatgca 1980tttgcaaata cgtatattct ttttccatcc acttgcacaa
tatcattacc atcacttttt 2040catcattcct cagctactac tcacattcat ttaatggttt
ctgtaaacat ttttaagaca 2100gttgggatgt cacttaacat tttttttttt tgagctaaag
tcagggaatc aagccatgct 2160taatatttaa caatcactta tatgtgtgtc gaagagtttg
ttttgtttgt catgtattgg 2220tacaagcaga tacagtataa actcacaaac acagatttga
aaataatgca catatggtgt 2280tcaaatttga acctttctca tggatttttg tggtgtgggc
caatatggtg tttacattat 2340ataattcctg ctgtggcaag taaagcacac tttttttttc
tcctaaaatg tttttccctg 2400tgtatcctat tatggatact ggttttgtta attatgattc
tttattttct ctcctttttt 2460taggatatag cagtaatgct attactgaaa tgaatttcct
ttttctgaaa tgtaatcatt 2520gatgcttgaa tgatagaatt ttagtactgt aaacaggctt
tagtcattaa tgtgagagac 2580ttagaaaaaa tgcttagagt ggactattaa atgtgcctaa
atgaattttg cagtaactgg 2640tattcttggg ttttcctact taatacacag taattcagaa
cttgtattct attatgagtt 2700tagcagtctt ttggagtgac cagcaacttt gatgtttgca
ctaagatttt atttggaatg 2760caagagaggt tgaaagagga ttcagtagta cacatacaac
taatttattt gaactatatg 2820ttgaagacat ctaccagttt ctccaaatgc cttttttaaa
actcatcaca gaagattggt 2880gaaaatgctg agtatgacac ttttcttctt gcatgcatgt
cagctacata aacagttttg 2940tacaatgaaa attactaatt tgtttgacat tccatgttaa
actacggtca tgttcagctt 3000cattgcatgt aatgtagacc tagtccatca gatcatgtgt
tctggagagt gttctttatt 3060caataaagtt ttaatttagt ataaacat
3088
<210> 29
<211> 403
<212> DNA
<213> Homo sapiens
<400> 29
tttcattagt tatcattagt
ttattataaa agagaaatat ggaaattatt tacatgacga 60aagatttcag aacttcagtg
gaatgggcag catcatgttg atgccatttc aatagtgact 120tatttcagtc tacgtacttt
ccaagaatgt caccatctct aaataggaaa taatccttgt 180catctagaac tactttggtg
cctccatatt ctgggagaag aactttatct ccaactttca 240cgctaactgg ttgaatctct
ccaccctttc ctttagaacc cgatccaaca gcgactactg 300ttgcttgcaa tacttttcct
tgagattttt ctggaagcat aatgcctcct ttggttacag 360tttcagcagc actcctttca
accaatactc ggtcaaagag tgg 403
<210> 30
<211> 1023
<212> DNA
<213> Homo sapiens
<400> 30
gttggctgcc ggtgagttgg gtgccggtgg agtcgtgttg gtcctcagaa tccccgcgta
60gccgctgcct cctcctaccc tcgccatgtt tcttacccgg tctgagtacg acaggggcgt
120gaatactttt tctcccgaag gaagattatt tcaagtggaa tatgccattg aggctatcaa
180gcttggttct acagccattg ggatccagac atcagagggt gtgtgcctag ctgtggagaa
240gagaattact tccccactga tggagcccag cagcattgag aaaattgtag agattgatgc
300tcacataggt tgtgccatga gtgggctaat tgctgatgct aagactttaa ttgataaagc
360cagagtggag acacagaacc actggttcac ctacaatgag acaatgacag tggagagtgt
420gacccaagct gtgtccaatc tggctttgca gtttggagaa gaagatgcag atccaggtgc
480catgtctcgt ccctttggag tagcattatt atttggagga gttgatgaga aaggacccca
540gctgtttcat atggacccat ctgggacctt tgtacagtgt gatgctcgag caattggctc
600tgcttcagag ggtgcccaga gctccttgca agaagtttac cacaagtcta tgactttgaa
660agaagccatc aagtcttcac tcatcatcct caaacaagta atggaggaga agctgaatgc
720aacaaacatt gagctagcca cagtgcagcc tggccagaat ttccacatgt tcacaaagga
780agaacttgaa gaggttatca aggacattta aggaatcctg atcctcagaa cttctctggg
840acaatttcag ttctaataat gtccttaaat tttatttcca gctcctgttc cttggaaaat
900ctccattgta tgtgcatttt ttaaatgatg tctgtacata aaggcagttc tgaaataaag
960aaaattttaa aataaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
1020aaa
1023
<210> 31
<211> 313
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> (1)..(1)
<223> n is a, c, g, or t
<400> 31
ntcttgggct caagcaancc tcctgccctg gcttcccaaa gtgttcagat tacaagtgtg
60agccactgca cccagaccaa gaaattttaa ccctaactaa atacccaaaa aaagngtata
120tatgttccac aaaggacatg ggtaagaatg tttatagcag cagtatttgt aatagccaga
180aactggaaac aagccaaaca tctatctaca gcagaagaga ctattgttta tttatacaat
240aaactacaat ataggcaata aaatgantga ggctacaaca acaggaaatc aatttcacaa
300acatantact gag
313
<210> 32
<211> 358
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> (205)..(205)
<223> n is a, c, g, or t
<400> 32
tgttaagtac ttaagattta ttgaatgaga actgcattgt acaatatggt gccactagac
60acgtctattt aatttaaatt aaaatataaa actctaaaac tagccatgat tcaaaggttc
120aatagctata tgtgactagt ggctaccata taaaacattt ccatcacaaa gttccattta
180tcagatctta tataggaacc ttgantaaaa tttaatagac aagtgatttt gtatttaaca
240tttcaccttt attgaatgcc ctatagggcc atttgaatac gggtcatgtn caaggcacag
300gggaaaaaaa aactgcagcn ggtaagggtt ttncaggggg gttttccagg tcccctcc
358
<210> 33
<211> 326
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> (3)..(4)
<223> n is a, c, g, or t
<400> 33
ttnnatatta nttatttttt attatacttt aagttttagg gtacatgtgc acaatgtcag
60ggtttgttac atatgtatgg gcaaggactt catgtctaaa acaccaaaag caatggcaac
120aaaagccaaa attgacaaaa gtagtatcat tctattatag ctgcatggaa aaagttaatt
180tattaataca atggatgcct aaggncagaa gtactcaaac ttttggtctc agtactcctt
240tacattctta aaaatcatta nggnccccaa ngantgtttg tttacaaggg ttacttacat
300tgataattac cacatttgaa atgaaa
326
<210> 34
<211> 2301
<212> DNA
<213> Homo sapiens
<400> 34
tcgacagctc tctcgcccag cccagttctg gaagggataa
aaagggggca tcaccgttcc 60tgggtaacag agccaccttc tgcgtcctgc tgagctctgt
tctctccagc acctcccaac 120ccactagtgc ctggttctct tgctccacca ggaacaagcc
accatgtctc gccagtcaag 180tgtgtccttc cggagcgggg gcagtcgtag cttcagcacc
gcctctgcca tcaccccgtc 240tgtctcccgc accagcttca cctccgtgtc ccggtccggg
ggtggcggtg gtggtggctt 300cggcagggtc agccttgcgg gtgcttgtgg agtgggtggc
tatggcagcc ggagcctcta 360caacctgggg ggctccaaga ggatatccat cagcactaga
ggaggcagct tcaggaaccg 420gtttggtgct ggtgctggag gcggctatgg ctttggaggt
ggtgccggta gtggatttgg 480tttcggcggt ggagctggtg gtggctttgg gctcggtggc
ggagctggct ttggaggtgg 540cttcggtggc cctggctttc ctgtctgccc tcctggaggt
atccaagagg tcactgtcaa 600ccagagtctc ctgactcccc tcaacctgca aatcgacccc
agcatccaga gggtgaggac 660cgaggagcgc gagcagatca agaccctcaa caataagttt
gcctccttca tcgacaaggt 720gcggttcctg gagcagcaga acaaggttct ggacaccaag
tggaccctgc tgcaggagca 780gggcaccaag actgtgaggc agaacctgga gccgttgttc
gagcagtaca tcaacaacct 840caggaggcag ctggacagca tcgtggggga acggggccgc
ctggactcag agctgagaaa 900catgcaggac ctggtggaag acttcaagaa caagtatgag
gatgaaatca acaagcgtac 960cactgctgag aatgagtttg tgatgctgaa gaaggatgta
gatgctgcct acatgaacaa 1020ggtggagctg gaggccaagg ttgatgcact gatggatgag
attaacttca tgaagatgtt 1080ctttgatgcg gagctgtccc agatgcagac gcatgtctct
gacacctcag tggtcctctc 1140catggacaac aaccgcaacc tggacctgga tagcatcatc
gctgaggtca aggcccagta 1200tgaggagatt gccaaccgca gccggacaga agccgagtcc
tggtatcaga ccaagtatga 1260ggagctgcag cagacagctg gccggcatgg cgatgacctc
cgcaacacca agcatgagat 1320cacagagatg aaccggatga tccagaggct gagagccgag
attgacaatg tcaagaaaca 1380gtgcgccaat ctgcagaacg ccattgcgga tgccgagcag
cgtggggagc tggccctcaa 1440ggatgccagg aacaagctgg ccgagctgga ggaggccctg
cagaaggcca agcaggacat 1500ggcccggctg ctgcgtgagt accaggagct catgaacacc
aagctggccc tggacgtgga 1560gatcgccact taccgcaagc tgctggaggg cgaggaatgc
agactcagtg gagaaggagt 1620tggaccagtc aacatctctg ttgtcacaag cagtgtttcc
tctggatatg gcagtggcag 1680tggctatggc ggtggcctcg gtggaggtct tggcggcggc
ctcggtggag gtcttgccgg 1740aggtagcagt ggaagctact actccagcag cagtgggggt
gtcggcctag gtggtgggct 1800cagtgtgggg ggctctggct tcagtgcaag cagtggccga
gggctggggg tgggctttgg 1860cagtggcggg ggtagcagct ccagcgtcaa atttgtctcc
accacctcct cctcccggaa 1920gagcttcaag agctaagaac ctgctgcaag tcactgcctt
ccaagtgcag caacccagcc 1980catggagatt gcctcttcta ggcagttgct caagccatgt
tttatccttt tctggagagt 2040agtctagacc aagccaattg cagaaccaca ttctttggtt
cccaggagag ccccattccc 2100agcccctggt ctcccgtgcc gcagttctat attctgcttc
aaatcagcct tcaggtttcc 2160cacagcatgg cccctgctga cacgagaacc caaagttttc
ccaaatctaa atcatcaaaa 2220cagaatcccc accccaatcc caaattttgt tttggttcta
actacctcca gaatgtgttc 2280aataaaatgc ttttataata t
2301
<210> 35
<211> 448
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> (437)..(437)
<223> n is a, c, g, or t
<400> 35
gatcatatta ttaaataata tatgcacaga catggagaga attagttttt
actaaaacat 60ttatcagaaa ttttaatact ctgcataacc agtattagca ttagaaatta
gccactttta 120aaatgagaaa actgtgtcac tcttcaattt ttttataagc cattgaggaa
aacattaact 180cctggatttc agcttcactt ttaacctgca gactaaattt ctttctcaat
tatgtcagac 240acacccaagt caatcccaac ccccttgtta ccttgggaag acccgtgctg
aaaaaggaga 300tcttccacct aaacacgtgt tctcttattt gaagcaaatc tttttgagaa
tttgtttact 360tgatttcttt ccacaataaa ctgacagaga acgctactaa tgattttttt
ttttttttgg 420agacggggtt ttgttcntgg ttggccca
448
<210> 36
<211> 219
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> (39)..(39)
<223> n is a, c, g, or t
<400> 36
tgtttttttg aagtgactga ctaaaaagag aacagatana tacaagagtg tcgctggatc
60ctattttata caaggattac gcctctcctg cttggccctt actgtcaccc tgtacaggta
120caaaggctac aaaaaaggaa gcaatataaa cagacacaaa taactttttt gcttttttac
180atgcgatttg taagcttagt ttgagctatt cacaagcta
219
<210> 37
<211> 2808
<212> DNA
<213> Homo sapiens
<400> 37
cggcatgaga ggccagcctg ccagggaaat ccaggaatct
gcaacaaaaa cgatgacagt 60ctgaaatact ctctggtgcc aacctccaaa ttctcgtctg
tcacttcaga cccccactag 120ttgacagagc agcagaatat caactccagt agacttgaat
gtgcctctgg gcaaagaagc 180agagctaacg aggaaaggga tttaaagagt ttttcttggg
tgtttgtcaa acttttattc 240cctgtctgtg tgcagagggg attcaacttc aattttctgc
agtggctctg ggtccagccc 300cttacttaaa gatctggaaa gcatgaagac tgggcctttt
ttcctatgtc tcttgggaac 360tgcagctgca atcccgacaa atgcaagatt attatctgat
cattccaaac caactgctga 420aacggtagca cctgacaaca ctgcaatccc cagtttatgg
gctgaagctg aagaaaatga 480aaaagaaaca gcagtatcca cagaagacga ttcccaccat
aaggctgaaa aatcatcagt 540actaaagtca aaagaggaaa gccatgaaca gtcagcagaa
cagggcaaga gttctagcca 600agagctggga ttgaaggatc aagaggacag tgatggtcac
ttaagtgtga atttggagta 660tgcaccaact gaaggtacat tggacataaa agaagatatg
attgagcctc aggagaaaaa 720actctcagag aacactgatt ttttggctcc tggtgttagt
tccttcacag attctaacca 780acaagaaagt atcacaaaga gagaggaaaa ccaagaacaa
cctagaaatt attcacatca 840tcagttgaac aggagcagta aacatagcca aggcctaagg
gatcaaggaa accaagagca 900ggatccaaat atttccaatg gagaagagga agaagaaaaa
gagccaggtg aagttggtac 960ccacaatgat aaccaagaaa gaaagacaga attgcccagg
gagcatgcta acagcaagca 1020ggaggaagac aatacccaat ctgatgatat tttggaagag
tctgatcaac caactcaagt 1080aagcaagatg caggaggatg aatttgatca gggtaaccaa
gaacaagaag ataactccaa 1140tgcagaaatg gaagaggaaa atgcatcgaa cgtcaataag
cacattcaag aaactgaatg 1200gcagagtcaa gagggtaaaa ctggcctaga agctatcagc
aaccacaaag agacagaaga 1260aaagactgtt tctgaggctc tgctcatgga acctactgat
gatggtaata ccacgcccag 1320aaatcatgga gttgatgatg atggcgatga tgatggcgat
gatggcggca ctgatggccc 1380caggcacagt gcaagtgatg actacttcat cccaagccag
gcctttctgg aggccgagag 1440agctcaatcc attgcctatc acctcaaaat tgaggagcaa
agagaaaaag tacatgaaaa 1500tgaaaatata ggtaccactg agcctggaga gcaccaagag
gccaagaaag cagagaactc 1560atcaaatgag gaggaaacgt caagtgaagg caacatgagg
gtgcatgctg tggattcttg 1620catgagcttc cagtgtaaaa gaggccacat ctgtaaggca
gaccaacagg gaaaacctca 1680ctgtgtctgc caggatccag tgacttgtcc tccaacaaaa
ccccttgatc aagtttgtgg 1740cactgacaat cagacctatg ctagttcctg tcatctattc
gctactaaat gcagactgga 1800ggggaccaaa aaggggcatc aactccagct ggattatttt
ggagcctgca aatctattcc 1860tacttgtacg gactttgaag tgattcagtt tcctctacgg
atgagagact ggctcaagaa 1920tatcctcatg cagctttatg aagccaactc tgaacatgct
ggttatctaa atgagaagca 1980gagaaataaa gtcaagaaaa tttacctgga tgaaaagagg
cttttggctg gggaccatcc 2040cattgatctt ctcttaaggg actttaagaa aaactaccac
atgtatgtgt atcctgtgca 2100ctggcagttt agtgaacttg accaacaccc tatggataga
gtcttgacac attctgaact 2160tgctcctctg cgagcatctc tggtgcccat ggaacactgc
ataacccgtt tctttgagga 2220gtgtgacccc aacaaggata agcacatcac cctgaaggag
tggggccact gctttggaat 2280taaagaagag gacatagatg aaaatctctt gttttgaacg
aagattttaa agaactcaac 2340tttccagcat cctcctctgt tctaaccact tcagaaatat
atgcagctgt gatacttgta 2400gatttatatt tagcaaaatg ttagcatgta tgacaagaca
atgagagtaa ttgcttgaca 2460acaacctatg caccaggtat ttaacattaa ctttggaaac
aaaaatgtac aattaagtaa 2520agtcaacata tgcaaaatac tgtacattgt gaacagaagt
ttaattcata gtaatttcac 2580tctctgcatt gacttatgag ataattaatg attaaactat
taatgataaa aataatgcat 2640ttgtattgtt cataatatca tgtgcacttc aagaaaatgg
aatgctactc ttttgtggtt 2700tacgtgtatt attttcaata tcttaatacc ctaataaaga
gtccataaaa atccaaaaaa 2760aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
aaaaaaaa 2808
<210> 38
<211> 416
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> (8)..(9)
<223> n is a, c, g, or t
<400> 38
tttatttnnt tgaatctatt taattgctca gactgtgcta gagaatacgt
accatgaaat 60acatatattt cataaggttc agttacaaaa tggattgttt caaatggcaa
tttcttacac 120taacctgatt atgaaaaaaa gaagtctgta tcatctgctt ccaagtctgt
tatgtccaaa 180tatattttaa ttatgcattt attttgctac ttttataaat attagagatt
tcaccntaaa 240ttatttttgt aactagttct agaacatgtt tnccaattat tattnnccta
atgggagaca 300tataattgac cnatggttta tggcatatat ggtcctctac acagnggaac
ctntttttaa 360aaggaatagg taaaggaaaa tgcgggacgg cctgggctct ccagggccaa
gggcca 416
<210> 39
<211> 471
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> (6)..(6)
<223> n is a, c, g, or t
<400> 39
tttttntttt tttaaagtga atatacaatt tatttaacat tcaaacttca ttaagacatg
60tgcaatatgg caattttact ggggattaaa ccctacctag gattgcttgc tggggcttag
120caacagggtc cagttcacac ttagcactaa ttaaatactt tattgaataa atacaatacc
180angcaaaatg cattcaaatg ctttctaaaa aaattttaaa ggcctttcta ctcaggctaa
240tgacaaacac aataaaggca gatatgctag tttaacataa ttgggctgat tttatacagg
300cacttatatc ttttagtcca caaggtatat tattaaatga taggggaaca tctnatacaa
360ccatttctac agnactaggg gaattaaatt tctatgggaa ggaagggttt ttacagaccc
420catctttttt tacccncccc aacagttcta actctaaggg ggttatagcc a
471
<210> 40
<211> 525
<212> DNA
<213> Homo sapiens
<400> 40
tttttttttt tttttttttg aaattttaac attttatatg
catataaagc tgaacacatg 60actaacaatc tagtggatgt gtatagaacc caacaattgc
agaatatata ttcttttcaa 120gcacacattg aatatttata aaaactgatc atatactgtg
ccgtaagttt catctcagca 180aatttcaaag ttttgatgcc atgaatgaaa tgaaacctga
catttcaaaa ttataaacag 240aatatgccct ggagtaactt gtggtattgt ttggggatga
ggagagccat ccgaatagtg 300ttttaaggaa agtctctatt cattgatctg gggtaacaag
gcaggaacca ttccaatgca 360gaagctttgg ctaagcagtt gagcgttcag tagtgcatgt
aaattcctgt gtgaaggctg 420tggtgtcatg gctaaaggca tagcccctgg aacccagact
gtttgggttc aaatctcagt 480tctgctgctt aactcactgt gtgatggtgg gcaagttgcc
taacc 525
<210> 41
<211> 1402
<212> DNA
<213> Homo sapiens
<400> 41
ggggacgaag ggaagctcca
gcgtgtggcc ccggcgagtg cggataaaag ccgccccgcc 60gggctcgggc ttcattctga
gccgagcccg gtgccaagcg cagctagctc agcaggcggc 120agcggcggcc tgagcttcag
ggcagccagc tccctcccgg tctcgccttc cctcgcggtc 180agcatgaaag ccttcagtcc
cgtgaggtcc gttaggaaaa acagcctgtc ggaccacagc 240ctgggcatct cccggagcaa
aacccctgtg gacgacccga tgagcctgct atacaacatg 300aacgactgct actccaagct
caaggagctg gtgcccagca tcccccagaa caagaaggtg 360agcaagatgg aaatcctgca
gcacgtcatc gactacatct tggacctgca gatcgccctg 420gactcgcatc ccactattgt
cagcctgcat caccagagac ccgggcagaa ccaggcgtcc 480aggacgccgc tgaccaccct
caacacggat atcagcatcc tgtccttgca ggcttctgaa 540ttcccttctg agttaatgtc
aaatgacagc aaagcactgt gtggctgaat aagcggtgtt 600catgatttct tttattcttt
gcacaacaac aacaacaaca aattcacgga atcttttaag 660tgctgaactt atttttcaac
catttcacaa ggaggacaag ttgaatggac ctttttaaaa 720agaaaaaaaa aatggaagga
aaactaagaa tgatcatctt cccagggtgt tctcttactt 780ggactgtgat attcgttatt
tatgaaaaag acttttaaat gccctttctg cagttggaag 840gttttcttta tatactattc
ccaccatggg gagcgaaaac gttaaaatca caaggaattg 900cccaatctaa gcagactttg
ccttttttca aaggtggagc gtgaatacca gaaggatcca 960gtattcagtc acttaaatga
agtcttttgg tcagaaatta cctttttgac acaagcctac 1020tgaatgctgt gtatatattt
atatataaat atatctattt gagtgaaacc ttgtgaactc 1080tttaattaga gttttcttgt
atagtggcag agatgtctat ttctgcattc aaaagtgtaa 1140tgatgtactt attcatgcta
aactttttat aaaagtttag ttgtaaactt aaccctttta 1200tacaaaataa atcaagtgtg
tttattgaat ggtgattgcc tgctttattt cagaggacca 1260gtgctttgat ttttattatg
ctatgttata actgaaccca aataaataca agttcaaatt 1320tatgtagact gtataagatt
ataataaaac atgtctgaag tcaaaaaaaa aaaaaaaaaa 1380aaaaaaaaaa aaaaaaaaaa
aa 1402
<210> 42
<211> 2544
<212> DNA
<213> Homo sapiens
<400> 42
ctcactcaga cccatgaggc cctgcctggt ctcgtctggg acctgggaca gcagctggga
60gacctgagcc tggagtctgg gggcctggaa caggagagcg ggcgtagctc gggcttctat
120gaagatccca gctctacagg aggtccagat tcaccaccct caaccttctg tggggacagt
180ggcttctctg gatccagctc ctatggtcgc ctgggtccct ctgagccccg gggcatctat
240gccagtgaga ggcccaagtc cctaggagac gccagtccca gcgctccgga ggtggtgggc
300gcgcgggcag cggtgccgcg gtccttctca gcgccctacc cgacggcagg tgggtcgccg
360gcccggaggc ctgctcctcg gcggagcggc gggcccgcgc cgggcccttt ctgacgccca
420gccccctgca cgccgtggcg atgcgcagcc cgcggccctg cggccgccct cccaccgact
480cgcccgacgc ggggggcgca gggcggcccc tggacggcta catctcggcg ctcctgcgca
540ggcgccgccg ccggggggcg ggccagcccc ggaccagtcc tgggggcgcg gacggcggcc
600cgcggcgcca gaacagcgtg cgccagcggc cgcccgacgc gtctccgtcc cccggcagcg
660cgcgacccgc gcgggagccc tcgttggagc gcgtcggggg ccaccccacc agccctgccg
720ccttgagccg cgcctgggcg tcgtcgtggg agtcggaggc ggcacccgag cccgctgcgc
780cgcccgccgc cccctcaccc cccgacagcc cggctgaggg ccgcttggtg aaggcgcagt
840acatcccggg cgcgcaggcg gccacccgag gcctccctgg ccgcgccgcc cgccgcaaac
900cgccgccact gacccgcggc cgcagcgtgg agcagtcacc accccgggag cgtccccggg
960ccgccggccg ccgtggacgc atggccgagg cttcgggccg ccgcggctcg cccagggccc
1020gcaaggcctc gcgctcccag tctgagacca gcctgctggg ccgcgcctcc gcggtccctt
1080cggggccccc taagtacccc acggcggagc gggaagagcc tcggcctcca cggccacgcc
1140gcggcccagc gcccacgctg gcggcccagg ccgcagggtc ctgccgtcgc tggcgctcca
1200ctgcggagat cgacgctgcc gatgggcgcc gcgtgcggcc ccgagcccct gcggcgcgtg
1260ttcccggccc cggcccgtcc ccgtcagctc cccagcgtcg tctgctttac ggctgcgcgg
1320gcagcgactc cgagtgctcg gctgggcgcc tggggcccct gggacgccgg gggcctgcgg
1380gaggcgtcgg cgggggttac ggggagagcg aatcgagcgc cagcgaggga gaatcgcctg
1440ccttcagctc tgcctccagc gactcagacg gcagcggtgg cctcgtgtgg ccgcagcagc
1500tggtggcggc caccgcggcc tctgggggtg gagcaggtgc aggggcgccc gcaggccccg
1560ccaaagtctt cgtgaaaatc aaagcttccc acgcgctcaa gaaaaagata ctgcgtttcc
1620gttcgggttc tctcaaggtc atgactacag tgtgagtttg gggatttgct tgggctcccc
1680cttcatggcc tctgcacctc cacactccca accactgacc cttccacatc taccttccaa
1740agaccatcgt tttctctgct tccaaagacc cccctcactc tccccactcc taacagtctt
1800ggttgaaaag gctcccccac caccaccgag aggaatgggg aggagccctg tttgacccag
1860ttcagcttct agcttggaag cccttgggca agacagttcc ccttctctgg gcgtcacttt
1920cctcatctgt acagtaagtg tccatgtatg caaaaggggt aattcggttt gaatttcccc
1980gttttagttt agaagcctag tctgtttgtt ccccttcacc gctctccctc tcattcctga
2040tgagccctct cattcctcct ttccttgccc agctatggcc ccctctcatt cacaaagtgc
2100cccctccatg tccctggacc cttaagatat ccccttggca ccctggtcag agactctgtg
2160tctgactcag gtggtccctg cagagtgccc tgggaaggga aggagcactg atttgggggt
2220tttgagggtc aagtaggggt tggtaacacc tggaaagaag gactctttca cttcgatccc
2280tggacaatta tggaggattc ggaggtagaa gaggggaagg aagatggttt ctatctcatg
2340acccccactc cctgtgagag ggaatggggg aagcctgatg accctcagct gttccaatct
2400agtatttttt ttctttttta aaattactgt atttattatg acgatggtga ctccccagtg
2460caaagggggg ccagattctg tgtgtttctc taacctcttt gtaaataaat gcacagtgta
2520acataaaaaa aaaaaaaaaa aaaa
2544
<210> 43
<211> 374
<212> DNA
<213> Homo sapiens
<220>
<221> misc_feature
<222> (345)..(345)
<223> n is a, c, g, or t
<400> 43
aagcattaga gaagcatcag gccgccattc tagactcaac tgctcacctc ctctgatcca
60ctgaggtgtc tctggaaatc ctccaccaca gccacagcct cctcaccact ctcagggtga
120tgcagctgca cccaggtccg gagctcccca gggaggatgc tcaggaactg ctcaagcacc
180agcagctcca ggatctgctc cttggtgtgc acctctgggc atgagccacc agcggcagag
240cttcccgaag ccggctcaat gctttcctgc ggcccagaca tctcgtgggt aacacaattg
300cctgaagtgt aggccggaag attttcgcag acaggaggat agttnttttt gggagattgt
360tggccttgnc ccca
374
<210> 44
<211> 4299
<212> DNA
<213> Homo sapiens
<400> 44
tgacagcgga ggcggcggcg gctgcaggct ccgagccgta
ggagccggat cgggggaggg 60gccgggccca ggagcctcag ccccgccggc agccctaagg
gcaaggtaac cgccacgggg 120tccccgtcgc gaccccctcc ctcccggagc tcccgtcccc
gggatcccaa gctccgcccc 180gccgaccccc gtctcccctg gaccccggct ctagcctgac
gagatcccca acctcctgag 240gtgctctggc cccggattct cccgggctgc attctctgct
cctcctcgcc tgcgaagcat 300cacgtccgct tcccgacgct gagggcagcc ccgtccaggg
cagtggctct gccaatgatc 360ctgtgagtat tcaggaatca ctgttgcccc tggggatcct
tgtcctggag tggcccacct 420gcttgccccc agcatggcgt ccgacactcc cgagtcgctg
atggccctct gtactgactt 480ctgcttgcgc aacctggatg gcaccctggg ctacctgctg
gacaaggaga ccctgcggct 540acatccggac atcttcttgc ccagcgagat ctgtgaccgg
ctcgtcaatg agtatgtgga 600gctggtgaac gctgcctgta acttcgagcc acacgagagc
ttcttcagcc tcttttcgga 660cccccgcagc acccgcctca cgcggatcca cctccgtgag
gacctggtgc aggaccagga 720cctggaggcc atccgcaagc aggacctggt ggagctgtac
ctgactaact gcgagaagct 780gtccgccaag agcctgcaga cactgaggag cttcagccac
accctggtgt ccttgagcct 840cttcggctgt acaaacattt tctatgagga ggagaaccca
gggggctgtg aagatgagta 900cctcgtcaac cccacctgcc aggtgctggt taaggatttc
accttcgagg gcttcagccg 960cctccgcttc ctcaacttgg gccgcatgat tgattgggtc
cctgtggagt ccctgctgcg 1020gccgcttaac tccctggctg ccttggacct ctcaggcatt
cagacgagcg acgccgcctt 1080cctcacccag tggaaagaca gcctggtgtc cctcgtcctc
tacaacatgg acctgtccga 1140cgaccacatc cgggtcatcg tgcagctgca caagctgcga
cacctggaca tctcccgaga 1200ccgcctctcc agctactaca agttcaagct gactcgggag
gtgctgagcc tctttgtgca 1260gaagctgggg aacctaatgt ccctggacat ctctggccac
atgatcctag agaactgcag 1320catctccaag atggaagagg aagcggggca gaccagcatt
gagccttcca agagcagcat 1380catacctttc cgggctctga agaggccgct gcagttcctc
gggctctttg agaactctct 1440gtgccgcctc acgcacattc cagcctacaa agtaagtggt
gacaaaaacg aagagcaggt 1500gctgaatgcc atcgaggcct acacggagca ccggcctgag
atcacctcgc gggccatcaa 1560cttgcttttt gacatcgccc gcatcgagcg ttgcaaccag
ctgctgcggg ccctgaagct 1620ggtcatcacg gccctcaagt gccacaaata tgacaggaac
attcaagtga caggcagcgc 1680cgctctcttc tacctaacaa attccgagta ccgctcagag
cagagtgtga agctgcgccg 1740gcaggttatc caggtggtgc tgaatggcat ggaatcctac
caggaggtga cggtgcagcg 1800gaactgctgc ctgacgctct gcaacttcag catccccgag
gagctggaat tccagtaccg 1860ccgggtcaac gagctcctgc tcagcatcct caaccccacg
cggcaggacg agtctatcca 1920gcggatcgcc gtgcacctgt gcaatgccct ggtctgccag
gtagacaacg accacaagga 1980ggccgtgggc aagatgggct ttgtcgtgac catgctgaag
ctgattcaga agaagctgct 2040ggacaagaca tgtgaccagg tcatggagtt ctcctggagt
gccctgtgga acatcacaga 2100tgaaactcct gacaactgcg agatgttcct caatttcaac
ggcatgaagc tcttcctgga 2160ctgcctgaag gaattcccag agaagcagga actgcatagg
aatatgctag gacttttggg 2220gaatgtggca gaagtgaagg agctgaggcc tcaactaatg
acttcccagt tcatcagcgt 2280cttcagcaac ctgttggaga gcaaggccga tgggatcgag
gtttcctaca atgcctgcgg 2340cgtcctctcc cacatcatgt ttgatggacc cgaggcctgg
ggcgtctgtg agccccagcg 2400tgaggaggtg gaggaacgca tgtgggctgc catccagagc
tgggacataa actctcggag 2460aaacatcaat tacaggtcat ttgaaccaat tctccgcctc
cttccccagg gaatctctcc 2520tgtcagccag cactgggcaa cctgggccct gtataacctc
gtgtctgtct acccggacaa 2580gtactgccct ctgctgatca aagaaggggg gatgcccctt
ctgagggaca taattaagat 2640ggcgaccgca cggcaggaga ccaaggaaat ggcccgcaag
gtgattgagc actgcagtaa 2700ctttaaagag gagaacatgg acacgtctag atagaggcct
ccgtccccat ggccgccacc 2760gctctggacc acaggcgggg aggaagcatg ctcaagcagc
ccagcgggcg ggccccttcc 2820gagggagcct cccacggagt gaagagacat gggggacttt
tgcacaaccg acgcttttcc 2880ttaatgttag tgagatatat atatattata tatatatatt
ttttttttgg ttaggaagtg 2940tgaagttttg tgtgtatgat ttctgtgcaa aaacaaaagc
aacactcctg agtccttgca 3000gcttccttgg ccattctcaa acccactcag ccttcatcgc
tgacacacac actcctaccc 3060caaccagact aaatgcctat aacgctgtga gtgtccagtc
cttgtccagg aaactcagat 3120cccggcctgg cttcctttca tgagaggagc aggccttgga
cagcgtatcg agcatcctga 3180cccactgccc ctgcctgaga acgccatctc ggctcccggg
cacagctgat ggggtttggg 3240gattagaact taccccactg ggtctcccaa aagccttggt
gctcccggct gtgggccatc 3300tggggcagga aagtgagcca ttcctaggct gaggtccagg
cagccctgcc cctgaagacc 3360ctctaggagc agggcaccca gtggccctgc tgctgtccag
ccaggcctgc ctgaggccac 3420gctgctatgg aggctgcctc ctagtctccc accaggtccc
aggctgtgga aagccccagc 3480ccagggatgg tcagaactcg ggggcagatt ccactgcccc
ttctgccaaa cacatccaga 3540acctgccctc agccctggaa gctagcatct tctggggcca
ggggcttgct tcctcgctcc 3600atagccctca actgcccagg cgctcccacc agcagaactg
agcctgcctc ctcctcccag 3660cctgccccgc tgcccagagg accccacgcc tctcagaggc
agaggtccca tgccagcctt 3720tgacccacaa cggccacaca gccgcctcca gaccagcact
cggactgccc tgcagtggcc 3780gcttgggcct ccctggcggt cccgccctgc cctaggcttt
accttggaag cctgagaggc 3840gccggctctc ttgctcctcc atcgatggac actgcattgc
ttctcatcgg acacttgtgg 3900agcgcagggg cctggggagc agcgctaacc ctggaggcag
cctttgggtg atggcttttt 3960cttccctttt cctcccgcgg gcctgttttc aggtgttcct
agcatttctg cctccaggca 4020ggacggcagg ggtgagcagc tttgggagag acacctggcc
tttttctcct ggagcctctc 4080cctcccggcc ctgggaagtg ggcgcagccc tgtgttcccc
cagcttggca gatgggctgc 4140atgcggcgct cccttccttc ccacgctcag cggccccggc
cagaccctgg cagacttcac 4200acctcattgc tttaccccct ggggcctggg gaaatgtctg
tactttggga agtcacagaa 4260atacattttt gtgcaaaatg gaaaaaaaaa aaaaaaaaa
4299
45 6990 DNA Homo sapiens
45atggctgaga gcgcctcccc
gccctcctca tctgcagcag ccccagccgc tgagccagga 60gtcaccacgg agcagcccgg
accccggagc cccccatcct ccccgccagg cctggaggag 120cctctggatg gagctgatcc
tcatgtccca cacccagacc tggcgcctat tgccttcttc 180tgcctgcgac agaccaccag
cccccggaac tggtgcatca agatggtgtg caacccgtgg 240tttgaatgtg tcagcatgct
ggtgatcctg ctgaactgcg tgacacttgg catgtaccag 300ccgtgcgacg acatggactg
cctgtccgac cgctgcaaga tcctgcaggt ctttgatgac 360ttcatcttta tcttctttgc
catggagatg gtgctcaaga tggtggccct ggggattttt 420ggcaagaagt gctacctcgg
ggacacatgg aaccgcctgg atttcttcat cgtcatggca 480gggatggtcg agtactccct
ggaccttcag aacatcaacc tgtcagccat ccgcaccgtg 540cgcgtcctga ggcccctcaa
agccatcaac cgcgtgccca gtatgcggat cctggtgaac 600ctgctcctgg acacactgcc
catgctgggg aatgtcctgc tgctctgctt ctttgtcttc 660ttcatctttg gcatcatagg
tgtgcagctc tgggcgggcc tgctgcgtaa ccgctgcttc 720ctggaggaga acttcaccat
acaaggggat gtggccttgc ccccatacta ccagccggag 780gaggatgatg agatgccctt
catctgctcc ctgtcgggcg acaatgggat aatgggctgc 840catgagatcc ccccgctcaa
ggagcagggc cgtgagtgct gcctgtccaa ggacgacgtc 900tacgactttg gggcggggcg
ccaggacctc aatgccagcg gcctctgtgt caactggaac 960cgttactaca atgtgtgccg
cacgggcagc gccaaccccc acaagggtgc catcaacttt 1020gacaacatcg gttatgcttg
gattgtcatc ttccaggtga tcactctgga aggctgggtg 1080gagatcatgt actacgtgat
ggatgctcac tccttctaca acttcatcta cttcatcctg 1140cttatcatag tgggctcctt
cttcatgatc aacctgtgcc tcgttgtcat agcgacccag 1200ttctcggaga ccaagcaacg
ggagcaccgg ctgatgctgg agcagcggca gcgctacctg 1260tcctccagca cggtggccag
ctacgccgag cctggcgact gctacgagga gatcttccag 1320tatgtctgcc acatcctgcg
caaggccaag cgccgcgccc tgggcctcta ccaggccctg 1380cagagccggc gccaggccct
gggcccggag gccccggccc ccgccaaacc tgggccccac 1440gccaaggagc cccggcacta
ccatgggaag actaagggtc agggagatga agggagacat 1500ctcggaagcc ggcattgcca
gactttgcat gggcctgcct cccctggaaa tgatcactcg 1560ggaagagagc tgtgcccgca
acatagcccc ctggatgcga cgccccacac cctggtgcag 1620cccatccccg ccacgctggc
ttccgatccc gccagctgcc cttgctgcca gcatgaggac 1680ggccggcggc cctcgggcct
gggcagcacc gactcgggcc aggagggctc gggctccggg 1740agctccgctg gtggcgagga
cgaggcggat ggggacgggg cccggagcag cgaggacgga 1800gcctcctcag aactggggaa
ggaggaggag gaggaggagc aggcggatgg ggcggtctgg 1860ctgtgcgggg atgtgtggcg
ggagacgcga gccaagctgc gcggcatcgt ggacagcaag 1920tacttcaacc ggggcatcat
gatggccatc ctggtcaaca ccgtcagcat gggcatcgag 1980caccacgagc agccggagga
gctgaccaac atcctggaga tctgcaatgt ggtcttcacc 2040agcatgtttg ccctggagat
gatcctgaag ctggctgcat ttgggctctt cgactacctg 2100cgtaacccct acaacatctt
cgacagcatc attgtcatca tcagcatctg ggagatcgtg 2160gggcaggcgg acggtgggct
gtcggtgctg cggaccttcc ggctgctgcg cgtgctgaaa 2220ctggtgcgct tcatgcctgc
cctgcggcgc cagctcgtgg tgctcatgaa gaccatggac 2280aacgtggcca ccttctgcat
gctgctcatg ctcttcatct tcatcttcag catccttggg 2340atgcatattt ttggctgcaa
gttcagcctc cgcacggaca ctggagacac ggtgcccgac 2400aggaagaact tcgactccct
gctgtgggcc atcgtcactg tgttccagat cctcacccag 2460gaggactgga acgtcgttct
ctacaatggc atggcctcca cttctccctg ggcctccctc 2520tactttgtcg ccctcatgac
cttcggcaac tatgtgctct tcaacctgct ggtggccatc 2580ctggtggagg gcttccaggc
ggagggtgac gccaatcgct cctactcgga cgaggaccag 2640agctcatcca acatagaaga
gtttgataag ctccaggaag gcctggacag cagcggagat 2700cccaagctct gcccaatccc
catgaccccc aatgggcacc tggaccccag tctcccactg 2760ggtgggcacc taggtcctgc
tggggctgcg ggacctgccc cccgactctc actgcagccg 2820gaccccatgc tggtggccct
gggctcccga aagagcagtg tcatgtctct agggaggatg 2880agctatgacc agcgctccct
gtccagctcc cggagctcct actacgggcc atggggccgc 2940agcgcggcct gggccagccg
tcgctccagc tggaacagcc tcaagcacaa gccgccgtcg 3000gcggagcatg agtccctgct
ctctgcggag cgcggcggcg gcgcccgggt ctgcgaggtt 3060gccgcggacg aggggccgcc
gcgggccgca cccctgcaca ccccacacgc ccaccacatt 3120catcacgggc cccatctggc
gcaccgccac cgccaccacc gccggacgct gtccctcgac 3180aacagggact cggtggacct
ggccgagctg gtgcccgcgg tgggcgccca cccccgggcc 3240gcctggaggg cggcaggccc
ggcccccggg catgaggact gcaatggcag gatgcccagc 3300atcgccaaag acgtcttcac
caagatgggc gaccgcgggg atcgcgggga ggatgaggag 3360gaaatcgact acaccctgtg
cttccgcgtc cgcaagatga tcgacgtcta taagcccgac 3420tggtgcgagg tccgcgaaga
ctggtctgtc tacctcttct ctcccgagaa caggttccgg 3480gtcctgtgtc agaccattat
tgcccacaaa ctcttcgact acgtcgtcct ggccttcatc 3540tttctcaact gcatcaccat
cgccctggag cggcctcaga tcgaggccgg cagcaccgaa 3600cgcatctttc tcaccgtgtc
caactacatc ttcacggcca tcttcgtggg cgagatgaca 3660ttgaaggtag tctcgctggg
cctgtacttc ggcgagcagg cgtacctacg cagcagctgg 3720aacgtgctgg atggctttct
tgtcttcgtg tccatcatcg acatcgtggt gtccctggcc 3780tcagccgggg gagccaagat
cttgggggtc ctccgagtct tgcggctcct gcgcacccta 3840cgccccctgc gtgtcatcag
ccgggcgccg ggcctgaagc tggtggtgga gacactcatc 3900tcctccctca agcccatcgg
caacatcgtg ctcatctgct gtgccttctt catcatcttt 3960ggcatcctgg gagtgcagct
cttcaagggc aagttctacc actgtctggg cgtggacacc 4020cgcaacatca ccaaccgctc
ggactgcatg gccgccaact accgctgggt ccatcacaaa 4080tacaacttcg acaacctggg
ccaggctctg atgtccctct ttgtcctggc atccaaggat 4140ggttgggtga acatcatgta
caatggactg gatgctgttg ctgtggacca gcagcctgtg 4200accaaccaca acccctggat
gctgctgtac ttcatctcct tcctgctcat cgtcagcttc 4260tttgtgctca acatgtttgt
gggtgtcgtg gtggagaact tccacaagtg ccggcagcac 4320caggaggctg aagaggcacg
gcggcgtgag gagaagcggc tgcggcgcct ggagaagaag 4380cgccggaagg cccagcggct
gccctactat gccacctatt gtcacacccg gctgctcatc 4440cactccatgt gcaccagcca
ctacctggac atcttcatca ccttcatcat ctgcctcaac 4500gtggtcacca tgtccctgga
gcactacaat cagcccacgt ccctggagac agccctcaag 4560tactgcaact atatgttcac
cactgtcttt gtgctggagg ctgtgctgaa gctggtggca 4620tttggtctga ggcgcttctt
caaggaccga tggaaccagc tggacctggc cattgtgcta 4680ctgtcagtca tgggcatcac
cctggaggag atcgagatca atgcggccct gcccatcaat 4740cccaccatca tccgcatcat
gagggttctg cgcattgccc gagtgctgaa gctgttgaag 4800atggccacag gaatgcgggc
cctgctggac acggtggtgc aagctttgcc ccaggtgggc 4860aacctgggcc tcctcttcat
gctgctcttc ttcatctatg ctgctctcgg ggtggagctc 4920tttgggaagc tggtctgcaa
cgacgagaac ccgtgcgagg gcatgagccg gcatgccacc 4980ttcgagaact tcggcatggc
cttcctcaca ctcttccagg tctccacggg tgacaactgg 5040aacgggatca tgaaggacac
gctgcgggac tgcacccacg acgagcgcag ctgcctgagc 5100agcctgcagt ttgtgtcgcc
gctgtacttc gtgagcttcg tgctcaccgc gcagttcgtg 5160ctcatcaacg tggtggtggc
tgtgctcatg aagcacctgg acgacagcaa caaggaggcg 5220caggaggacg ccgagatgga
tgccgagctc gagctggaga tggcccatgg cctgggccct 5280ggcccgaggc tgcctaccgg
ctccccgggc gcccctggcc gagggccggg aggggcgggc 5340ggcgggggcg acaccgaggg
cggcttgtgc cggcgctgct actcgcctgc ccaggagaac 5400ctgtggctgg acagcgtctc
tttaatcatc aaggactcct tggaggggga gctgaccatc 5460atcgacaacc tgtcgggctc
catcttccac cactactcct cgcctgccgg ctgcaagaag 5520tgtcaccacg acaagcaaga
ggtgcagctg gctgagacgg aggccttctc cctgaactca 5580gacaggtcct cgtccatcct
gctgggtgac gacctgagtc tcgaggaccc cacagcctgc 5640ccacctggcc gcaaagacag
caagggtgag ctggacccac ctgagcccat gcgtgtggga 5700gacctgggcg aatgcttctt
ccccttgtcc tctacggccg tctcgccgga tccagagaac 5760ttcctgtgtg agatggagga
gatcccattc aaccctgtcc ggtcctggct gaaacatgac 5820agcagtcaag cacccccaag
tcccttctcc ccggatgcct ccagccctct cctgcccatg 5880ccagccgagt tcttccaccc
tgcagtgtct gccagccaga aaggcccaga aaagggcact 5940ggcactggaa ccctccccaa
gattgcgctg cagggctcct gggcatctct gcggtcacca 6000agggtcaact gtaccctcct
ccggcaggcc accgggagcg acacgtcgct ggacgccagc 6060cccagcagct ccgcgggcag
cctgcagacc acgctcgagg acagcctgac cctgagcgac 6120agcccccggc gtgccctggg
gccgcccgcg cctgctccag gaccccgggc cggcctgtcc 6180cccgccgctc gccgccgcct
gagcctgcgc ggccggggcc tcttcagcct gcgggggctg 6240cgggcgcatc agcgcagcca
cagcagcggg ggctccacca gcccgggctg cacccaccac 6300gactccatgg acccctcgga
cgaggagggc cgcggtggcg cgggcggcgg gggcgcgggc 6360agcgagcact cggagaccct
cagcagcctc tcgctcacct ccctcttctg cccgccgccc 6420ccgccgccag cccccggcct
cacgcccgcc aggaagttca gcagcaccag cagcctggcc 6480gcccccggcc gcccccacgc
cgccgccctg gcccacggcc tggcccggag cccctcgtgg 6540gccgcggacc gcagcaagga
cccccccggc cgggcaccgc tgcccatggg cctgggcccc 6600ttggcgcccc cgccgcaacc
gctccccgga gagctggagc cgggagacgc cgccagcaag 6660aggaagagat gagggtcgca
ggggcccccg gccgcccacc gcccgccccg tctcaccttc 6720tttacctcag gagccaggag
cagacagcaa tacttcgtcc acacctggga tcgcgcaggg 6780cccgcagggc acaggcgccc
gacagccggg ctgagcggag tctgggttag ccaggcctgc 6840gtggcccatg gtggcccttc
cagtgcatat acatacatat atatatatat atgcatatat 6900atatatatat atatatatat
gtgtatacac acacacatag acagacatat atatatatat 6960ttattttttt tactgagagc
ttatgacttc 6990
46 139 DNA Homo sapiens misc_feature (138)..(138) n is a, c, g, or t
46ctaatatttg catgtacaca
atgagttatc ttagggaggg gatccaagtg gaaacacaaa 60atttattttt gtgtgtatac
acacatacac acatcactta tatacatagc cttaaggtaa 120ttttataccg tatttttng
139
47 674 DNA Homo sapiens
47ccccttggtt ccgcccgcgc gtcacgtgac cccagcgcct acttgggctg aggagccgcc
60gcgtcccctc gccgagtccc ctcgccagat tccctccgtc gccgccaaga tgatgtgcgg
120ggcgccctcc gccacgcagc cggccaccgc cgagacccag cacatcgccg accaggtgag
180gtcccagctt gaagagaaag aaaacaagaa gttccctgtg tttaaggccg tgtcattcaa
240gagccaggtg gtcgcgggga caaactactt catcaaggtg cacgtcggcg acgaggactt
300cgtacacctg cgagtgttcc aatctctccc tcatgaaaac aagcccttga ccttatctaa
360ctaccagacc aacaaagcca agcatgatga gctgacctat ttctgatcct gactttggac
420aaggcccttc agccagaaga ctgacaaagt catcctccgt ctaccagagc gtgcacttgt
480gatcctaaaa taagcttcat ctccgggctg tgccccttgg ggtggaaggg gcaggattct
540gcagctgctt ttgcatttct cttcctaaat ttcattgtgt tgatttcttt ccttcccaat
600aggtgatctt aattactttc agaatatttt caaaatagat atatttttaa aatccttaaa
660aaaaaaaaaa aaaa
674
48 6276 DNA Homo sapiens
48agtcggcatc catcagcggg cgggggtgtc gccgaacagg
ctgctccgca gagcccgccg 60cgaccccgcg ccgccccgcc ccgcggcctg cctgccagag
gagccgaggg ggccgcccct 120cgcccaacct gcccgacatg gggaaccccg ggcccaggcg
tgctggtcac catgacaaca 180gagacaggcc ccgactctga ggtgaagaaa gctcaggagg
aggccccgca gcagcccgag 240gctgctgccg ctgtgaccac ccctgtgacc cctgcaggcc
acggccaccc agaggccaac 300tccaatgaga agcatccatc ccagcaggac acgcggcctg
ctgaacagag cctagacatg 360gaggagaagg actacagtga ggccgatggc ctttcggaga
ggaccacgcc cagcaaggcc 420cagaaatcgc cccagaagat tgccaagaaa tacaagagtg
ccatctgccg ggtcactctg 480cttgatgcct cggagtatga gtgtgaggtg gagaaacatg
gccggggcca ggtgctgttt 540gacctggtct gtgaacacct caacctccta gagaaggact
acttcggcct gaccttctgt 600gatgctgaca gccagaagaa ctggctggac ccctccaagg
agatcaagaa gcagatccgg 660agtagcccct ggaattttgc cttcacagtc aagttctacc
cgcctgatcc tgcccagctg 720acagaagaca tcacaagata ctacctgtgc ctgcagctgc
gggcagacat catcacgggc 780cggctgccat gctcctttgt cacgcatgcc ctactgggct
cctacgctgt gcaggctgag 840ctgggtgact atgatgctga ggagcatgtg ggcaactatg
tcagcgagct ccgcttcgcc 900cctaaccaga cccgggagct ggaggagagg atcatggagc
tgcataagac atataggggg 960atgaccccgg gagaagcaga aatccacttc ttagagaatg
ccaagaagct ttccatgtac 1020ggagtagacc tgcaccatgc caaggactct gagggcatcg
acatcatgtt aggcgtttgt 1080gccaatggcc tgctcatcta ccgggaccgg ctgagaatca
accgctttgc ctggcccaag 1140atcctcaaga tctcctacaa gaggagtaac ttctatatca
agatccggcc tggggagtat 1200gagcaatttg agagcacaat tggctttaag ctcccaaacc
accggtcagc caagagactg 1260tggaaggtct gcatcgagca tcatacattc ttccggctgg
tgtcccctga gcccccaccc 1320aagggcttcc tggtgatggg ctccaagttc cggtacagtg
ggaggaccca ggcacagact 1380cgccaggcca gcgccctcat tgaccggcct gcacccttct
ttgagcgttc ttccagcaaa 1440cggtacacca tgtcccgcag ccttgatgga gcagagttct
cccgcccagc ctcggtcagc 1500gagaaccatg atgcagggcc tgacggtgac aagcgggatg
aggatggcga gtctgggggg 1560caacggtcag aggctgagga gggagaggtc aggactccaa
ccaagatcaa ggagctaaag 1620ccggagcagg aaaccacgcc gagacacaag caggagttct
tagacaagcc agaagatgtc 1680ttgctgaagc accaggccag catcaatgag ctcaaaagga
ccctgaagga gcccaacagc 1740aaactcatcc accgggatcg agactgggaa cgggagcgca
ggctgccctc ctcccccgcc 1800tccccctccc ccaagggcac ccctgagaaa gccaatgaga
gagcagggct gagggagggc 1860tccgaggaga aagtcaaacc accacgtccc cgggccccag
agagtgacac aggcgatgag 1920gaccaggacc aggagaggga cacggtgttc ctgaaggaca
accacctggc cattgagcgc 1980aagtgctcca gcatcacggt cagctctacg tctagcctgg
aggctgaggt ggacttcacg 2040gtcattggtg actaccatgg cagcgccttc gaagacttct
cccgcagcct gcctgagctc 2100gaccgggaca aaagcgactc ggacactgag ggcctgctgt
tctcccggga tctcaacaag 2160ggggccccca gccaggatga tgagtctggg ggcattgagg
acagcccgga tcgaggggcc 2220tgctccaccc cggatatgcc ccagtttgag cccgtgaaaa
cagaaaccat gactgtcagc 2280agtctggcca ttagaaagaa gattgagccg gaggccgtac
tgcagaccag agtctccgct 2340atggataaca cccagcaggt tgatgggagt gcctcagtgg
ggagggagtt catagcaacc 2400actccctcca tcaccacgga gaccatatcg accaccatgg
agaacagtct caagtccggg 2460aagggggcag ctgccatgat cccaggccca cagacggtgg
ccacggaaat ccgttctctt 2520tctccgatca tcgggaaaga tgtcctcacc agcacctacg
gcgccactgc ggaaaccctc 2580tcaacctcca ccaccaccca tgtcaccaaa actgtgaaag
gagggttttc tgagacaagg 2640atcgagaagc gaatcatcat tactggggat gaagatgtcg
atcaagacca ggccctggct 2700ttggccatca aggaggccaa actgcagcat cctgatatgc
tggtaaccaa agctgtcgta 2760tacagagaaa cagacccatc cccagaggag agggacaaga
agccacagga atcctgacct 2820ctgtgaagag atcctggcat ttctggtcca acccaagcca
gagaaccatt aagaaggggc 2880cttcattctg gattctccga cgcaacactg acgtcccagc
tgcgacgtac tgtcactgat 2940gagagactgg gaagggaaaa gcatatatat atagatatat
agagatatag atatatatac 3000aggaaacacc gcatccttgc actgctgctg gggctggcag
agcagttggc tgacagcaac 3060aaccgacatc tgaacaccta catttccttt gcagacaaat
tgaagaactg gtgggatttt 3120tttcaagaaa aaaaattata taataactat aatcccttgc
tcaccccttt cccccgccaa 3180ataagaaacg caagccagac cacgatgatt gtagaagtcc
ctcccgccct ggttctgcac 3240gttacagtta gcagacgagc aattccattt gttcttctcc
agcatctcta aggcccactt 3300gaatgcaaag gaaaacactt gcacagcaaa gcaagagaag
tcacagcagc aagacacgca 3360cagtcaacca ttttccgaga aaaaaagaaa attccccact
tggaaagaaa gaggaggaac 3420actggattct tactttctgg atcttgacac tgggctgcaa
aacctacctt cctctctccc 3480gcctcccctc accctcaact ctcaatgtct tgctgtcatt
ttctgtctcg gctccctcct 3540cccccttccc ccttccccca ccccacaccc ttcaccctct
gtgtcctggt ccttctgagg 3600gccactgcag atgactctcc tttgaaatga gaaaaagaaa
agaaagcaag aacagaaaac 3660gaagccacag gaagggaagt agacattgta tgcttatggt
ttctcattat gaaggtgcag 3720cttgtaggag gtttgtacgg atgtgctttg aagttatgta
tattacatat aacaggaaaa 3780aatattaaaa taaacagtgc tggtaagtat gaagctgaca
ttctaaaatt ataattatct 3840gactgtgatt gatgtatcct gaggttccta gatctcactg
aactggccca gctaaggaga 3900cctggactct gggtgtgggt tggctcacag taggggctga
cgggttcagt gtagtaatac 3960tgtgtgtggt gtttgtaatt ggttgattgg tggggagggg
tggggggccc taatggagag 4020gtgtgggttt ggcaagaaag aagcaacaca gatgtcgtcc
ccaaaatgcc agttcaagac 4080accttctccc tgcccccctg gtagtaacag tcagggcctg
gtctgtgctc aggtactggg 4140tcccagtctg ggactctgct gctgaagttg ccacagtaga
ggtccctggc ttagtcctta 4200tctccctacg gggcttgcct tggttttcag tcttctctct
ctttctctct tttttttttt 4260tttgccacat tctgcccttc cctgacccca ttgtaataac
caactccata tccaaaggga 4320ggtggtgctc tcagccattg tagaagatgg tggctttaac
ctgactgtct aaaaattccc 4380agctaagcct tttcctctac tctcttcctt gttctgaatc
atttcttctt ctcaggccaa 4440agtagccatg gtaaggaggc ttcatggggc agaccctgaa
agatcaaaac tgcatttgca 4500aagccctccc ctgtcccagg acaaagctga gactgacggg
tgatgttgct cataggctcc 4560agctctgcat aagaccttgg cttggagacc tccctctcag
tcaacagctg aactctgagc 4620ttgtgcccag aaattacccc aagaccacag gaacccttca
agaagctccc atcacaagct 4680tggcattgct ctctgccaca cgtgggcttc ctcaggcttg
tctgccacaa gctacttctc 4740tgagctcaga aagtgcccct tgatgaggga aaatgtccca
ctgcactgcg aatttctcag 4800ttccatttta cctcccagtc ctccttctaa accagttaat
aaattcattc cacaagtatt 4860tactgattac ctgcttgtgc cagggactat tctcaggctg
aagaaggtgg gaggggaggg 4920cggaacctga ggagccacct gagccagctt tatatttcaa
ccatggctgg cccatctgag 4980agcatctccc cactctcgcc aacctatcgg ggcatagccc
agggatgccc ccaggcggcc 5040caggttagat gcgtcccttt ggcttgtcag tgatgacata
caccttagct gcttagctgg 5100tgctggcctg aggcagggca ggaaatcaga atagcatttg
cttctctggg caaatgggaa 5160gttcagcggg gcagcagaat cagtggcatt ccccctggtg
caggccggtg ggtccactcc 5220aactccccct gagtgtagca gcacactttc catacaccag
gttctttcta caatcctggt 5280ggaaaagcca cagaaccttc ttcctgccct tcttgagagt
tccccctctt tctgggtcaa 5340gagctggagt ggtggctcca tcctctctgg gccacttcgg
tctaggaact catctttgca 5400ggaaccagga gtcctgagca cactgaacac acctcagagg
gaggatcctt gttgtggatt 5460ttgcacctgg ctttggggca ggggtgaagt gaccaggctt
agcttgtgga gtttatgggc 5520caccagggtt tggggaaatc accatcccgc ggatgctgtg
acctcccttc tacggagatg 5580caggcagtgc cacgagggag gaggggacct gcaaagctag
aatctagggc actgtttcct 5640ccccatcctt ctctttgtag agaatagaga cgtttgtctt
gtctgtcttc aacctacttt 5700tccttttctc ttttttgttt ctcatcctct ctgtgccacc
tctccaccca ggaggccatg 5760tagcatagtg gaaaaagtcc ctgagggcgg ttaggagttc
tgggtgacca tcctggctca 5820gctcctaact caccatgtga catcaggcta tccccattcc
ccctcttggg cctcagtttc 5880ccgacttgca aaataagcag aaagaaccag atgctctcca
gggtcttttt ctactttgct 5940atctcatggg tcttcatttt ctcttatttt gttttctctg
gatcttttcc atctgagggt 6000acaggaagta ccaggacctg tttcagtttt tgaatcctgc
aagcacattc caagactggc 6060ctgaaactgc atgagcaaca tcactcgaaa taattttttt
tttcaaaagc accttaacaa 6120ccaattgcga tgctgtcctg ttccttttta ctcacaccct
tctctccttt ctcgtcccca 6180tgctccccca cctcagtgct ccgtgctgta tgcgtgtgct
ctctgttctt gtatactcaa 6240tataagtgaa ataaatgtgt ttgatgctga accata
6276
49 982 DNA Homo sapiens
49ggctcatcca cctgcagaca
tggggcgcag aaagtcaaaa cgaaagccgc ctcccaagaa 60gaagatgaca ggcaccctcg
agacccagtt cacctgcccc ttctgcaacc acgagaaatc 120ctgtgatgtg aaaatggacc
gtgcccgcaa caccggagtc atctcttgta ccgtgtgcct 180agaggaattc cagacgccca
taacgtatct gtcagaaccc gtggatgtgt acagtgattg 240gatagacgcc tgcgaggcgg
ccaatcagta gcgacacaga ggacccgccc cctgagcagc 300cccgcgtact gtggatccag
ctgttcggtt ctggtccaga gacattccag gggtccaggg 360tgtgggtcct gggctgtcac
agccgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 420gtgtgtgtag tgggtgtgcg
tgtgggtgtg ggtgtgagtg agtgtgggtg tgtgtggctg 480cacgtgtcac tggggtggcc
gtgagtgtgt gctcacaggt acgcggtggt gtcgggttcc 540tgggcctgag gggcctgaac
tgatctcact tggctccgaa agcctttgct gtgttccctg 600cagcccctgg ccccccagcc
ttggggctct ggctcccccc ggcggaattg ggggactgtt 660tcctgacatc ctggacaagg
gaagcccact agaggctgga acaggacctc tccagcctcc 720tcaccagcac cgtgcccatc
tcaactggac ttcccgccct ccttctccac cttctagtgc 780ccgtggccgg ggattcaaag
ccgccgttcc ccaggtccct gggctgggcc ctgacaggga 840gccgcccccc tccccatggt
aaccaggaag cccgtttcat gttcagttgc ttttgtagag 900gaagcaaggg ctgggatggg
gacagctgtc aatcacaagc ccttaaataa agcagccagc 960gcacaaaaaa aaaaaaaaaa
aa 982
50 1767 DNA Homo sapiens
50gaaaggagca agccaggaag ccagacaaca acagcatcaa aacaaggctg tttctgtgtg
60tgaggaactt tgcctgggag ataaaattag acctagagct ttctgacagg gagtctgaag
120cgtgggacat ggaccgttca ctgggatggc aagggaattc tgtccctgag gacaggactg
180aagctgggat caagcgtttc ctggaggaca ccacggatga tggagaactg agcaagttcg
240tgaaggattt ctcaggaaat gcgagctgcc acccaccaga ggctaagacc tgggcatcca
300ggccccaagt cccggagcca aggccccagg ccccggacct ctatgatgat gacctggagt
360tcagaccccc ctcgcggccc cagtcctctg acaaccagca gtacttctgt gccccagccc
420ctctcagccc atctgccagg ccccgcagcc catggggcaa gcttgatccc tatgattcct
480ctgaggatga caaggagtat gtgggctttg caaccctccc caaccaagtc caccgaaagt
540ccgtgaagaa aggctttgac tttaccctca tggtggcagg agagtctggc ctgggcaaat
600ccacacttgt caatagcctc ttcctcactg atctgtaccg ggaccggaaa cttcttggtg
660ctgaagagag gatcatgcaa actgtggaga tcactaagca tgcagtggac atagaagaga
720agggtgtgag gctgcggctc accattgtgg acacaccagg ttttggggat gcagtcaaca
780acacagagtg ctggaagcct gtggcagaat acattgatca gcagtttgag cagtatttcc
840gagacgagag tggcctgaac cgaaagaaca tccaagacaa cagggtgcac tgctgcctgt
900acttcatctc acccttcggc catgggctcc ggccattgga tgttgaattc atgaaggccc
960tgcatcagcg ggtcaacatc gtgcctatcc tggctaaggc agacacactg acacctcccg
1020aagtggacca caagaaacgc aaaatccggg aggagattga gcattttgga atcaagatct
1080atcaattccc agactgtgac tctgatgagg atgaggactt caaattgcag gaccaagccc
1140taaaggaaag catcccattt gcagtaattg gcagcaacac tgtagtagag gccagagggc
1200ggcgagttcg gggtcgactc tacccctggg gcatcgtgga agtggaaaac ccagggcact
1260gcgactttgt gaagctgagg acaatgctgg tacgtaccca catgcaggac ctgaaggatg
1320tgacacggga gacacattat gagaactacc gggcacagtg catccagagc atgacccgcc
1380tggtggtgaa ggaacggaat cgcaacaaac tgactcggga aagtggtacc gacttcccca
1440tccctgctgt cccaccaggg acagatccag aaactgagaa gcttatccga gagaaagatg
1500aggagctgcg gcggatgcag gagatgctac acaaaataca aaaacagatg aaggagaact
1560attaactggc tttcagccct ggatatttaa atctcctcct cttcttcctg tccatgccgg
1620cccctcccag caccagctct gctcaggccc cttcagctac tgccacttcg ccttacatcc
1680ctgctgactg cccagagact cagaggaaat aaagtttaat aaatctgtag gtggctaaaa
1740aaaaaaaaaa aaaaaaaaaa aaaaaaa
1767
51 339 DNA Homo sapiens misc_feature (1)..(1) n is a, c, g, or t
51naaatgttaa tagtaacttt tatttgaaag ttagggagat gaaaatacat ttccaaattc
60ttccaaagat atagctaaat gacaaaataa aaacttcact atgggccagg cgcggtgact
120cacgcctgta atcctagcac tttgggaggc cgaggcaggt ggatcacctg agagcaggag
180attgagacca gcctggccaa cttggtgaaa accctatctc tactaaaaaa tacaaaaact
240agccgngcat gatggcgtat gtttgtaaat ccccagctac ttngggacat taagggcaga
300agggatccgc tttgaacctc agggnggcca gaggtttac
339
52 453 DNA Homo sapiens misc_feature (453)..(453) n is a, c, g, or t
52ggtggggggg gggggtgttt aaaaaatccc tcaaatataa caatgaagca tgcttttcta
60acacaaagag taccaaaatg aatgtgctac tttctgttaa agttttattt ccagagcttg
120cccaagcaag aatctacttg ccctgtaaaa ttctgcttat acagaattaa aactccttta
180ttatcccaca aatacattat atatttccat agctttcttt agcccataca cttcttctta
240agtgttcaac tttcaaatct ctgataaaat gaaactcatc atgaagacca gtcaaaatgc
300taaaggaaac cttccttaat ctactttgca attactgttc ctttcagtta ctccctacct
360gcgcctgcca tgaatttttg tttttgtgtt ggtctattct ggactagtgg gctctacaat
420gagggatgcg tatctggaat accgagagct ttn
453
53 1051 DNA Homo sapiens
53ggaccgccgc ctggttaaag gcgcttattt cccaggcagc
cgctgcagtc gccacacctt 60tgcccctgct gcgatgaccc tgtcgccact tctgctgttc
ctgccaccgc tgctgctgct 120gctggacgtc cccacggcgg cggtgcaggc gtcccctctg
caagcgttag acttctttgg 180gaatgggcca ccagttaact acaagacagg caatctatac
ctgcgggggc ccctgaagaa 240gtccaatgca ccgcttgtca atgtgaccct ctactatgaa
gcactgtgcg gtggctgccg 300agccttcctg atccgggagc tcttcccaac atggctgttg
gtcatggaga tcctcaatgt 360cacgctggtg ccctacggaa acgcacagga acaaaatgtc
agtggcaggt gggagttcaa 420gtgccagcat ggagaagagg agtgcaaatt caacaaggtg
gaggcctgcg tgttggatga 480acttgacatg gagctagcct tcctgaccat tgtctgcatg
gaagagtttg aggacatgga 540gagaagtctg ccactatgcc tgcagctcta cgccccaggg
ctgtcgccag acactatcat 600ggagtgtgca atgggggacc gcggcatgca gctcatgcac
gccaacgccc agcggacaga 660tgctctccag ccaccacacg agtatgtgcc ctgggtcacc
gtcaatggga aacccttgga 720agatcagacc cagctcctta cccttgtctg ccagttgtac
cagggcaaga agccggatgt 780ctgcccttcc tcaaccagct ccctcaggag tgtttgcttc
aagtgatggc cggtgagctg 840cggagagctc atggaaggcg agtgggaacc cggctgcctg
cctttttttc tgatccagac 900cctcggcacc tgctacttac caactggaaa attttatgca
tcccatgaag cccagataca 960caaaattcca ccccatgatc aagaatcctg ctccactaag
aatggtgcta aagtaaaact 1020agtttaataa gcaaaaaaaa aaaaaaaaaa a
1051
54 340 DNA Homo sapiens misc_feature (49)..(49) n is a, c, g, or t
54ggcacgagca taccccattt ttgagctttc tttgagggcc aactttttnc
tctaaaacca 60gccagggcat gcttttccct caccagctct ganttcttcc aggctaggca
actggaaaag 120cctggnctta gaaactgctt tnttggctta cggcccagct gagctgacca
aaatagccaa 180gagaaagact gtttgcacag tgtgaaattc ctccagggga aataccatag
ncaaaaagcc 240aaganagcca gnacccacgn atggncaggg aacccacagg gcaaaaaaag
gccgagttac 300ccccaaggnc cggggtttgt gggagatggg aggcctaggt
340
55 1760 DNA Homo sapiens
55atttctttat aaaccacaac tctgggcccg
caatggcagt ccactgcctt gctgcagtca 60cagaatggaa atctgcagag gcctccgcag
tcacctaatc actctcctcc tcttcctgtt 120ccattcagag acgatctgcc gaccctctgg
gagaaaatcc agcaagatgc aagccttcag 180aatctgggat gttaaccaga agaccttcta
tctgaggaac aaccaactag ttgctggata 240cttgcaagga ccaaatgtca atttagaaga
aaagatagat gtggtaccca ttgagcctca 300tgctctgttc ttgggaatcc atggagggaa
gatgtgcctg tcctgtgtca agtctggtga 360tgagaccaga ctccagctgg aggcagttaa
catcactgac ctgagcgaga acagaaagca 420ggacaagcgc ttcgccttca tccgctcaga
cagcggcccc accaccagtt ttgagtctgc 480cgcctgcccc ggttggttcc tctgcacagc
gatggaagct gaccagcccg tcagcctcac 540caatatgcct gacgaaggcg tcatggtcac
caaattctac ttccaggagg acgagtagta 600ctgcccaggc ctgcctgttc ccattcttgc
atggcaagga ctgcagggac tgccagtccc 660cctgccccag ggctcccggc tatgggggca
ctgaggacca gccattgagg ggtggaccct 720cagaaggcgt cacaagaacc tggtcacagg
actctgcctc ctcttcaact gaccagcctc 780catgctgcct ccagaatggt ctttctaatg
tgtgaatcag agcacagcag cccctgcaca 840aagcccttcc atgtcgcctc tgcattcagg
atcaaacccc gaccacctgc ccaacctgct 900ctcctcttgc cactgcctct tcctccctca
ttccaccttc ccatgccctg gatccatcag 960gccacttgat gacccccaac caagtggctc
ccacaccctg ttttacaaaa aagaaaagac 1020cagtccatga gggaggtttt taagggtttg
tggaaaatga aaattaggat ttcatgattt 1080ttttttttca gtccccgtga aggagagccc
ttcatttgga gattatgttc tttcggggag 1140aggctgagga cttaaaatat tcctgcattt
gtgaaatgat ggtgaaagta agtggtagct 1200tttcccttct ttttcttctt tttttgtgat
gtcccaactt gtaaaaatta aaagttatgg 1260tactatgtta gccccataat tttttttttc
cttttaaaac acttccataa tctggactcc 1320tctgtccagg cactgctgcc cagcctccaa
gctccatctc cactccagat tttttacagc 1380tgcctgcagt actttacctc ctatcagaag
tttctcagct cccaaggctc tgagcaaatg 1440tggctcctgg gggttctttc ttcctctgct
gaaggaataa attgctcctt gacattgtag 1500agcttctggc acttggagac ttgtatgaaa
gatggctgtg cctctgcctg tctcccccac 1560cgggctggga gctctgcaga gcaggaaaca
tgactcgtat atgtctcagg tccctgcagg 1620gccaagcacc tagcctcgct cttggcaggt
actcagcgaa tgaatgctgt atatgttggg 1680tgcaaagttc cctacttcct gtgacttcag
ctctgtttta caataaaatc ttgaaaatgc 1740ctaaaaaaaa aaaaaaaaaa
1760
56 584 DNA Homo sapiens
56cacctgcacc
ccgcccgggc atagcaccat gcctgcttgt cgcctaggcc cgctagccgc 60cgccctcctc
ctcagcctgc tgctgttcgg cttcacccta gtctcaggca caggagcaga 120gaagactggc
gtgtgccccg agctccaggc tgaccagaac tgcacgcaag agtgcgtctc 180ggacagcgaa
tgcgccgaca acctcaagtg ctgcagcgcg ggctgtgcca ccttctgcct 240tctctgccca
aatgataagg agggttcctg cccccaggtg aacattaact ttccccagct 300cggcctctgt
cgggaccagt gccaggtgga cagccagtgt cctggccaga tgaaatgctg 360ccgcaatggc
tgtgggaagg tgtcctgtgt cactcccaat ttctgaggtc cagccaccac 420caggctgagc
agtgaggaga gaaagtttct gcctggccct gcatctggtt ccagcccacc 480tgccctcccc
tttttcggga ctctgtattc cctcttgggc tgaccacagc ttctcccttt 540cccaaccaat
aaagtaacca ctttcagcaa aaaaaaaaaa aaaa
584
57 1330 DNA Homo sapiens
57gcagcccagc caagcactgt caggaatcct gtgaagcagc
tccagctatg tgtgaagaag 60aggacagcac tgccttggtg tgtgacaatg gctctgggct
ctgtaaggcc ggctttgctg 120gggacgatgc tcccagggct gttttcccat ccattgtggg
acgtcccaga catcaggggg 180tgatggtggg aatgggacaa aaagacagct acgtgggtga
cgaagcacag agcaaaagag 240gaatcctgac cctgaagtac ccgatagaac atggcatcat
caccaactgg gacgacatgg 300aaaagatctg gcaccactct ttctacaatg agcttcgtgt
tgcccctgaa gagcatccca 360ccctgctcac ggaggcaccc ctgaacccca aggccaaccg
ggagaaaatg actcaaatta 420tgtttgagac tttcaatgtc ccagccatgt atgtggctat
ccaggcggtg ctgtctctct 480atgcctctgg acgcacaact ggcatcgtgc tggactctgg
agatggtgtc acccacaatg 540tccccatcta tgagggctat gccttgcccc atgccatcat
gcgtctggat ctggctggcc 600gagatctcac tgactacctc atgaagatcc tgactgagcg
tggctattcc ttcgttacta 660ctgctgagcg tgagattgtc cgggacatca aggagaaact
gtgttatgta gctctggact 720ttgaaaatga gatggccact gccgcatcct catcctccct
tgagaagagt tacgagttgc 780ctgatgggca agtgatcacc atcggaaatg aacgtttccg
ctgcccagag accctgttcc 840agccatcctt catcgggatg gagtctgctg gcatccatga
aaccacctac aacagcatca 900tgaagtgtga tattgacatc aggaaggacc tctatgctaa
caatgtccta tcagggggca 960ccactatgta ccctggcatt gccgaccgaa tgcagaagga
gatcacggcc ctagcaccca 1020gcaccatgaa gatcaagatc attgcccctc cggagcgcaa
atactctgtc tggatcggtg 1080gctccatcct ggcctctctg tccaccttcc agcagatgtg
gatcagcaaa caggaatacg 1140atgaagccgg gccttccatt gtccaccgca aatgcttcta
aaacactttc ctgctcctct 1200ctgtctctag cacacaactg tgaatgtcct gtggaattat
gccttcagtt cttttccaaa 1260tcattcctag ccaaagctct gactcgttac ctatgtgttt
tttaataaat ctgaaatagg 1320ctactggtaa
1330
58 2743 DNA Homo sapiens
58gcgggccgtt atccatttgt
gttgttcgcc agctaggcct ggcctcgtcc cgcttcgctc 60ggtcggtctc gcgcgccccc
atagccttgc tagagggtta gcgttagcct taagtgtgcg 120aatccgagga gcagcgacag
actcgagacc acgctccttc ctcgggaagg aggcggcacc 180tcgcgtttga ggcccgcctg
cgtttgaggc ccgcctgcgc ttgcggcccg cctgcgcttg 240aggcctgtct gcgtttgaga
tctcattggg cgtgattgag gaatttgggg aggtttttgg 300gcggtattga ggacgagggg
gtccgttagt cagcatagaa tcctggagcg ggaatccctc 360accgtctaaa tggcgtcggg
ggcgggacct ccgggatctg gcttccgcgg gccgccgccg 420gccctgaaac gtgagggata
gctgagatga ggcagctact gggatggccc ccatgcgcat 480ttacatgcag tccgactgcc
gagctttcga ggcagcagga tttaccgtcc acattcctca 540ctactaacca agcttttaga
acagatctca caagaaccta gaggtcggta ttttttcgat 600ttaaatttgc ctgttactga
cgttaacgtc tttcgcctag tgagcagtag ccaacatgtc 660agggtgggag tcatattaca
aaaccgaggg cgatgaagaa gcagaggaag aacaagaaga 720gaaccttgaa gcaagtggag
actataaata ttcaggaaga gatagtttga tttttttggt 780tgatgcctcc aaggctatgt
ttgaatctca gagtgaagat gagttgacac cttttgacat 840gagcatccag tgtatccaaa
gtgtgtacat cagtaagatc ataagcagtg atcgagatct 900cttggctgtg gtgttctatg
gtaccgagaa agacaaaaat tcagtgaatt ttaaaaatat 960ttacgtctta caggagctgg
ataatccagg tgcaaaacga attctagagc ttgaccagtt 1020taaggggcag cagggacaaa
aacgtttcca agacatgatg ggccacggat ctgactactc 1080actcagtgaa gtgctgtggg
tctgtgccaa cctctttagt gatgtccaat tcaagatgag 1140tcataagagg atcatgctgt
tcaccaatga agacaacccc catggcaatg acagtgccaa 1200agccagccgg gccaggacca
aagccggtga tctccgagat acaggcatct tccttgactt 1260gatgcacctg aagaaacctg
ggggctttga catatccttg ttctacagag atatcatcag 1320catagcagag gatgaggacc
tcagggttca ctttgaggaa tccagcaagc tagaagacct 1380gttgcggaag gttcgcgcca
aggagaccag gaagcgagca ctcagcaggt taaagctgaa 1440gctcaacaaa gatatagtga
tctctgtggg catttataat ctggtccaga aggctctcaa 1500gcctcctcca ataaagctct
atcgggaaac aaatgaacca gtgaaaacca agacccggac 1560ctttaataca agtacaggcg
gtttgcttct gcctagcgat accaagaggt ctcagatcta 1620tgggagtcgt cagattatac
tggagaaaga ggaaacagaa gagctaaaac ggtttgatga 1680tccaggtttg atgctcatgg
gtttcaagcc gttggtactg ctgaagaaac accattacct 1740gaggccctcc ctgttcgtgt
acccagagga gtcgctggtg attgggagct caaccctgtt 1800cagtgctctg ctcatcaagt
gtctggagaa ggaggttgca gcattgtgca gatacacacc 1860ccgcaggaac atccctcctt
attttgtggc tttggtgcca caggaagaag agttggatga 1920ccagaaaatt caggtgactc
ctccaggctt ccagctggtc tttttaccct ttgctgatga 1980taaaaggaag atgcccttta
ctgaaaaaat catggcaact ccagagcagg tgggcaagat 2040gaaggctatc gttgagaagc
ttcgcttcac atacagaagt gacagctttg agaaccccgt 2100gctgcagcag cacttcagga
acctggaggc cttggccttg gatttgatgg agccggaaca 2160agcagtggac ctgacattgc
ccaaggttga agcaatgaat aaaagactgg gctccttggt 2220ggatgagttt aaggagcttg
tttacccacc agattacaat cctgaaggga aagttaccaa 2280gagaaaacac gataatgaag
gttctggaag caaaaggccc aaggtggagt attcagaaga 2340ggagctgaag acccacatca
gcaagggtac gctgggcaag ttcactgtgc ccatgctgaa 2400agaggcctgc cgggcttacg
ggctgaagag tggtctgaag aagcaggagc tgctggaagc 2460cctcaccaag cacttccagg
actgaccaga ggccgcgcgt ccagctgccc ttccgcagtg 2520tggccaggct gcctggcctt
gtcctcagcc agttaaaatg tgtttctcct gagctaggaa 2580gagtctaccc gacataagtc
gagggacttt atgtttttga ggctttctgt tgccatggtg 2640atggtgtagc cctcccactt
tgctgttctt tactttactg cctgaataaa gagccctaag 2700tttgtactaa aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa aaa 2743
SEQ ID NO: 59, length 1826, DNA, Homo sapiens
agtatgtgtg gttggggaat tcatgtggag gtcagagtgg aagcaggtgt gagagggtcc
60agcagaagga aacatggctg ccaaagtgtt tgagtccatt ggcaagtttg gcctggcctt
120agctgttgca ggaggcgtgg tgaactctgc cttatataat gtggatgctg ggcacagagc
180tgtcatcttt gaccgattcc gtggagtgca ggacattgtg gtaggggaag ggactcattt
240tctcatcccg tgggtacaga aaccaattat ctttgactgc cgttctcgac cacgtaatgt
300gccagtcatc actggtagca aagatttaca gaatgtcaac atcacactgc gcatcctctt
360ccggcctgtc gccagccagc ttcctcgcat cttcaccagc atcggagagg actatgatga
420gcgtgtgctg ccgtccatca caactgagat cctcaagtca gtggtggctc gctttgatgc
480tggagaacta atcacccaga gagagctggt ctccaggcag gtgagcgacg accttacaga
540gcgagccgcc acctttgggc tcatcctgga tgacgtgtcc ttgacacatc tgaccttcgg
600gaaggagttc acagaagcgg tggaagccaa acaggtggct cagcaggaag cagagagggc
660cagatttgtg gtggaaaagg ctgagcaaca gaaaaaggcg gccatcatct ctgctgaggg
720cgactccaag gcagctgagc tgattgccaa ctcactggcc actgcagggg atggcctgat
780cgagctgcgc aagctggaag ctgcagagga catcgcgtac cagctctcac gctctcggaa
840catcacctac ctgccagcgg ggcagtccgt gctcctccag ctgccccagt gagggcccac
900cctgcctgca cctccgcggg ctgactgggc cacagccccg atgattctta acacagcctt
960ccttctgctc ccaccccaga aatcactgtg aaatttcatg attggcttaa agtgaaggaa
1020ataaaggtaa aatcacttca gatctctaat tagtctatca aatgaaactc tttcattctt
1080ctcacatcca tctacttttt tatccacctc cctaccaaaa attgccaagt gcctatgcaa
1140accagcttta ggtcccaatt cggggcctgc tggagttccg gcctgggcac cagcatttgg
1200cagcacgcag gcggggcagt atgtgatgga ctggggagca caggtgtctg cctagatcca
1260cgtgtggcct ccgtcctgtc actgatggaa ggtttgcgga tgagggcatg tgcggctgaa
1320ctgagaaggc aggcctccgt cttcccagcg gttcctgtgc agatgctgct gaagagaggt
1380gccggggagg ggcagagagg aagtggtctg tctgttacca taagtctgat tctctttaac
1440tgtgtgacca gcggaaacag gtgtgtgtga actgggcaca gattgaagaa tctgcccctg
1500ttgaggtggg tgggcctgac tgttgccccc cagggtccta aaacttggat ggacttgtat
1560agtgagagag gaggcctgga ccgagatgtg agtcctgttg aagacttcct ctctaccccc
1620caccttggtc cctctcagat acccagtgga attccaactt gaaggattgc atcctgctgg
1680ggctgaacat gcctgccaaa gacgtgtccg acctacgttc ctggccccct cgttcagaga
1740ctgcccttct cacgggctct atgcctgcac tgggaaggaa acaaatgtgt ataaactgct
1800gtcaataaat gacacccaga ccttcc
1826
SEQ ID NO: 60, length 4322, DNA, Homo sapiens
cccccagagg cgccggagcc cggaatcccg ctcggagcca
gccagccgtc ccgagctacc 60agcaggtttc attgaaaaca gatcctgcaa aagttccagg
tgcccacact ggaaacttgg 120agatcctgct tcccagacca cagctgtggg gaacttgggg
tggagcagag aagtttctgt 180attcagctgc ccaggcagag gagaatgggg tctccacagc
ctgaagaatg aagacacgac 240agaataaaga ctcgatgtca atgaggagtg gacggaagaa
agaggcccct gggccccggg 300aagaactgag atcgaggggc cgggcctccc ctggaggggt
cagcacgtcc agcagtgatg 360gcaaagctga gaagtccagg cagacagcca agaaggcccg
agtagaggaa gcctccaccc 420caaaggtcaa caagcagggt cggagtgagg agatctcaga
gagtgaaagt gaggagacca 480atgcaccaaa aaagaccaaa actgaggaac tccctcggcc
acagtctccc tccgatctgg 540atagcttgga cgggcggagc cttaatgatg atggcagcag
cgaccctagg gatatcgacc 600aggacaaccg aagcacgtcc cccagtatct acagccctgg
aagtgtggag aatgactctg 660actcatcttc tggcctgtcc cagggcccag cccgccccta
ccacccacct ccactctttc 720ctccttcccc tcaaccgcca gacagcaccc ctcgacagcc
agaggctagc tttgaacccc 780atccttctgt gacacccact ggatatcatg ctcccatgga
gccccccaca tctcgaatgt 840tccaggctcc tcctggggcc cctccccctc acccacagct
ctatcccggg ggcactggtg 900gagttttgtc tggaccccca atgggtccca aggggggagg
ggctgcctca tcagtggggg 960gccctaatgg gggtaagcag caccccccac ccactactcc
catttcagta tcaagctctg 1020gggctagtgg tgctccccca acaaagccgc ctaccactcc
agtgggtggt gggaacctac 1080cttctgctcc accaccagcc aacttccccc atgtgacacc
gaacctgcct cccccacctg 1140ccctgagacc cctcaacaat gcatcagcct ctccccctgg
cctgggggcc caaccactac 1200ctggtcatct gccctctccc cacgccatgg gacagggtat
cggtggactt cctcctggcc 1260cagagaaggg cccaactctg gctccttcac cccactctct
gcctcctgct tcctcttctg 1320ctccagcgcc ccccatgagg tttccttatt catcctctag
tagtagctct gcagcagcct 1380cctcttccag ttcttcctcc tcttcctctg cctccccctt
cccagcttcc caggcattgc 1440ccagctaccc ccactctttc cctcccccaa caagcctctc
tgtctccaat cagcccccca 1500agtatactca gccttctctc ccatcccagg ctgtgtggag
ccagggtccc ccaccacctc 1560ctccctatgg ccgcctctta gccaacagca atgcccatcc
aggccccttc cctccctcta 1620ctggggccca gtccaccgcc cacccaccag tctcaacaca
tcaccatcac caccagcaac 1680agcaacagca gcagcagcag cagcagcagc agcagcatca
cggaaactct gggccccctc 1740ctcctggagc atttccccac ccactggagg gcggtagctc
ccaccacgca cacccttacg 1800ccatgtctcc ctccctgggg tctctgaggc cctacccacc
agggccagca cacctgcccc 1860cacctcacag ccaggtgtcc tacagccaag caggccccaa
tggccctcca gtctcttcct 1920cttccaactc ttcctcttcc acttctcaag ggtcctaccc
atgttcacac ccctcccctt 1980cccagggccc tcaaggggcg ccctaccctt tcccaccggt
gcctacggtc accacctctt 2040cggctaccct ttccacggtc attgccaccg tggcttcctc
gccagcaggc tacaaaacgg 2100cctccccacc tgggccccca ccgtacggaa agagagcccc
gtccccgggg gcctacaaga 2160cagccacccc acccggatac aaacccgggt cgcctccctc
cttccgaacg gggaccccac 2220cgggctatcg aggaacctcg ccacctgcag gcccagggac
cttcaagccg ggctcgccca 2280ccgtgggacc tgggcccctg ccacctgcgg ggccctcagg
cctgccatcg ctgccaccac 2340cacctgcggc ccctgcctca gggccgcccc tgagcgccac
gcagatcaaa caggagccgg 2400ctgaggagta tgagaccccc gagagcccgg tgcccccagc
ccgcagcccc tcgccccctc 2460ccaaggtggt agatgtaccc agccatgcca gtcagtctgc
caggttcaac aaacacctgg 2520atcgcggctt caactcgtgc gcgcgcagcg acctgtactt
cgtgccactg gagggctcca 2580agctggccaa gaagcgggcc gacctggtgg agaaggtgcg
gcgcgaggcc gagcagcgcg 2640cgcgcgaaga aaaggagcgc gagcgcgagc gggaacgcga
gaaagagcgc gagcgcgaga 2700aggagcgcga gcttgaacgc agcgtgaagt tggctcagga
gggccgtgct ccggtggaat 2760gcccatctct gggcccagtg ccccatcgcc ctccatttga
accgggcagt gcggtggcta 2820cagtgccccc ctacctgggt cctgacactc cagccttgcg
cactctcagt gaatatgccc 2880ggcctcatgt catgtctcct ggcaatcgca accatccatt
ctacgtgccc ctgggggcag 2940tggacccggg gctcctgggt tacaatgtcc cggccctgta
cagcagtgat ccagctgccc 3000gggagaggga acgggaagcc cgtgaacgag acctccgtga
ccgcctcaag cctggctttg 3060aggtgaagcc tagtgagctg gaacccctac atggggtccc
tgggccgggc ttggatccct 3120ttccccgaca tgggggcctg gctctgcagc ctggcccacc
tggcctgcac cctttcccct 3180ttcatccgag cctggggccc ctggagcgag aacgtctagc
gctggcagct gggccagccc 3240tgcggcctga catgtcctat gctgagcggc tggcagctga
gaggcagcac gcagaaaggg 3300tggcggccct gggcaatgac ccactggccc ggctgcagat
gctcaatgtg actccccatc 3360accaccagca ctcccacatc cactcgcacc tgcacctgca
ccagcaagat gctatccatg 3420cagcctctgc ctcggtgcac cctctcattg accccctggc
ctcagggtct caccttaccc 3480ggatccccta cccagctgga actctcccta accccctgct
tcctcaccct ctgcacgaga 3540acgaagttct tcgtcaccag ctctttgctg ccccttaccg
ggacctgccg gcctcccttt 3600ctgccccgat gtcagcagct catcagctgc aggccatgca
cgcacagtca gctgagctgc 3660agcgcttggc gctggaacag cagcagtggc tgcatgccca
tcacccgctg cacagtgtgc 3720cgctgcctgc ccaggaggac tactacagtc acctgaagaa
ggaaagcgac aagccactgt 3780agaacctgcg atcaagagag caccatggct cctacattgg
accttggagc acccccaccc 3840tccccccacc gtgcccttgg cctgccaccc agagccaaga
gggtgctgct cagttgcagg 3900gcctccgcag ctggacagag agtgggggag ggagggacag
acagaaggcc aaggcccgat 3960gtggtgtgca gaggtgggga ggtggcgagg atggggacag
aaagcgcaca gaatcttgga 4020ccaggtctct cttccttgtc ccccctgctt ttctcctccc
ccatgcccaa cccctgtggc 4080cgccgcccct cccctgcccc gttggtgtga ttatttcatc
tgttagatgt ggctgttttg 4140cgtagcatcg tgtgccaccc ctgcccctcc ccgatccctg
tgtgcgcgcc ccctctgcaa 4200tgtatgcccc ttgccccttc cccacactaa taatttatat
atataaatat ctatatgacg 4260ctcttaaaaa aacatcccaa ccaaaaccaa ccaaacaaaa
acatcctcac aactccccag 4320ga
4322
SEQ ID NO: 61, length 3088, DNA, Homo sapiens
acaaaaaagc ttttacgagg
tatcagcact tttctttcat tagggggaag gcgtgaggaa 60agtaccaaac agcagcggag
ttttaaactt taaatagaca ggtctgagtg cctgaacttg 120ccttttcatt ttacttcatc
ctccaaggag ttcaatcact tggcgtgact tcactacttt 180taagcaaaag agtggtgccc
aggcaacatg ggtgactgga gcgccttagg caaactcctt 240gacaaggttc aagcctactc
aactgctgga gggaaggtgt ggctgtcagt acttttcatt 300ttccgaatcc tgctgctggg
gacagcggtt gagtcagcct ggggagatga gcagtctgcc 360tttcgttgta acactcagca
acctggttgt gaaaatgtct gctatgacaa gtctttccca 420atctctcatg tgcgcttctg
ggtcctgcag atcatatttg tgtctgtacc cacactcttg 480tacctggctc atgtgttcta
tgtgatgcga aaggaagaga aactgaacaa gaaagaggaa 540gaactcaagg ttgcccaaac
tgatggtgtc aatgtggaca tgcacttgaa gcagattgag 600ataaagaagt tcaagtacgg
tattgaagag catggtaagg tgaaaatgcg aggggggttg 660ctgcgaacct acatcatcag
tatcctcttc aagtctatct ttgaggtggc cttcttgctg 720atccagtggt acatctatgg
attcagcttg agtgctgttt acacttgcaa aagagatccc 780tgcccacatc aggtggactg
tttcctctct cgccccacgg agaaaaccat cttcatcatc 840ttcatgctgg tggtgtcctt
ggtgtccctg gccttgaata tcattgaact cttctatgtt 900ttcttcaagg gcgttaagga
tcgggttaag ggaaagagcg acccttacca tgcgaccagt 960ggtgcgctga gccctgccaa
agactgtggg tctcaaaaat atgcttattt caatggctgc 1020tcctcaccaa ccgctcccct
ctcgcctatg tctcctcctg ggtacaagct ggttactggc 1080gacagaaaca attcttcttg
ccgcaattac aacaagcaag caagtgagca aaactgggct 1140aattacagtg cagaacaaaa
tcgaatgggg caggcgggaa gcaccatctc taactcccat 1200gcacagcctt ttgatttccc
cgatgataac cagaattcta aaaaactagc tgctggacat 1260gaattacagc cactagccat
tgtggaccag cgaccttcaa gcagagccag cagtcgtgcc 1320agcagcagac ctcggcctga
tgacctggag atctagatac aggcttgaaa gcatcaagat 1380tccactcaat tgtggagaag
aaaaaaggtg ctgtagaaag tgcaccaggt gttaattttg 1440atccggtgga ggtggtactc
aacagcctta ttcatgaggc ttagaaaaca caaagacatt 1500agaataccta ggttcactgg
gggtgtatgg ggtagatggg tggagaggga ggggataaga 1560gaggtgcatg ttggtattta
aagtagtgga ttcaaagaac ttagattata aataagagtt 1620ccattaggtg atacatagat
aagggctttt tctccccgca aacaccccta agaatggttc 1680tgtgtatgtg aatgagcggg
tggtaattgt ggctaaatat ttttgtttta ccaagaaact 1740gaaataattc tggccaggaa
taaatacttc ctgaacatct taggtctttt caacaagaaa 1800aagacagagg attgtcctta
agtccctgct aaaacattcc attgttaaaa tttgcacttt 1860gaaggtaagc tttctaggcc
tgaccctcca ggtgtcaatg gacttgtgct actatatttt 1920tttattcttg gtatcagttt
aaaattcaga caaggcccac agaataagat tttccatgca 1980tttgcaaata cgtatattct
ttttccatcc acttgcacaa tatcattacc atcacttttt 2040catcattcct cagctactac
tcacattcat ttaatggttt ctgtaaacat ttttaagaca 2100gttgggatgt cacttaacat
tttttttttt tgagctaaag tcagggaatc aagccatgct 2160taatatttaa caatcactta
tatgtgtgtc gaagagtttg ttttgtttgt catgtattgg 2220tacaagcaga tacagtataa
actcacaaac acagatttga aaataatgca catatggtgt 2280tcaaatttga acctttctca
tggatttttg tggtgtgggc caatatggtg tttacattat 2340ataattcctg ctgtggcaag
taaagcacac tttttttttc tcctaaaatg tttttccctg 2400tgtatcctat tatggatact
ggttttgtta attatgattc tttattttct ctcctttttt 2460taggatatag cagtaatgct
attactgaaa tgaatttcct ttttctgaaa tgtaatcatt 2520gatgcttgaa tgatagaatt
ttagtactgt aaacaggctt tagtcattaa tgtgagagac 2580ttagaaaaaa tgcttagagt
ggactattaa atgtgcctaa atgaattttg cagtaactgg 2640tattcttggg ttttcctact
taatacacag taattcagaa cttgtattct attatgagtt 2700tagcagtctt ttggagtgac
cagcaacttt gatgtttgca ctaagatttt atttggaatg 2760caagagaggt tgaaagagga
ttcagtagta cacatacaac taatttattt gaactatatg 2820ttgaagacat ctaccagttt
ctccaaatgc cttttttaaa actcatcaca gaagattggt 2880gaaaatgctg agtatgacac
ttttcttctt gcatgcatgt cagctacata aacagttttg 2940tacaatgaaa attactaatt
tgtttgacat tccatgttaa actacggtca tgttcagctt 3000cattgcatgt aatgtagacc
tagtccatca gatcatgtgt tctggagagt gttctttatt 3060caataaagtt ttaatttagt
ataaacat 3088
SEQ ID NO: 62, length 2828, DNA, Homo sapiens
gcgctacggc ggacccggct gggcagttcc ttccccagaa ggagagattc ctctgccatg
60gagtcctacg atgtgatcgc caaccagcct gtcgtgatcg acaacggatc cggtgtgatt
120aaagctggtt ttgctggtga tcagatcccc aaatactgct ttccaaacta tgtgggccga
180cccaagcacg ttcgtgtcat ggcaggagcc cttgaaggcg acatcttcat tggccccaaa
240gctgaggagc accgagggct gctttcaatc cgctatccca tggagcatgg catcgtcaag
300gattggaacg acatggaacg catttggcaa tatgtctatt ctaaggacca gctgcagact
360ttctcagagg agcatcctgt gctcctgact gaggcgcctt taaacccacg aaaaaaccgg
420gaacgagctg ccgaagtttt cttcgagacc ttcaatgtgc ccgctctttt catctccatg
480caagctgtac tcagccttta cgctacaggc aggaccacag gggtggtgct ggattctggg
540gatggagtca cccatgctgt gcccatctat gagggctttg ccatgcccca ctccatcatg
600cgcatcgaca tcgcgggccg ggacgtctct cgcttcctgc gcctctacct gcgtaaggag
660ggctacgact tccactcatc ctctgagttt gagattgtca aggccataaa agaaagagcc
720tgttacctat ccataaaccc ccaaaaggat gagacgctag agacagagaa agctcagtac
780tacctgcctg atggcagcac cattgagatt ggtccttccc gattccgggc ccctgagttg
840ctcttcaggc cagatttgat tggagaggag agtgaaggca tccacgaggt cctggtgttc
900gccattcaga agtcagacat ggacctgcgg cgcacgcttt tctctaacat tgtcctctca
960ggaggctcta ccctgttcaa aggttttggt gacaggctcc tgagtgaagt gaagaaacta
1020gctccaaaag atgtgaagat caggatatct gcacctcagg agagactgta ttccacgtgg
1080attgggggct ccatccttgc ctccctggac acctttaaga agatgtgggt ctccaaaaag
1140gaatatgagg aagacggtgc ccgatccatc cacagaaaaa ccttctaatg tcgggacatc
1200atcttcacct ctctctgaag ttaactccac tttaaaactc gctttcttga gtcggagtgt
1260ttgcgaggaa ctgcctgtgt gtgagtgcgt gtgtggatat gagtgtgtgc gcacatgcga
1320gtgccgtgtg gccctgggac cctgggccca gaaaggacga tgaactaccc gcagtggtga
1380tgcctgaggc ctggggttga ccactaactg gctcctgaca gggaagagcg ctggcagagg
1440ctgtgctccc tcctcaggtg gcctctggct ggctgtgggg gactccgttt actaccacag
1500ggagacagag ggaggtaagc catcccccgg gagaccttgc tgctgaccat cctaggctgg
1560gctggcccac cctcaccccc acccccaggg tgccctgagg ccccaggcag ctgctgcctc
1620cactatcgat gcctcctgac tgcacactga ggactgggac tggggttgag ttctgtctgg
1680ttttgttgcc attttggttt gggaggctgg aaaagcaccc caagaagcta ttacagagac
1740tggagtcagg agagagcagg aggccctcat gttcaccagg gaacaggacc acaccggcca
1800ctgaaggagg gcaggagcag tcctccctct gaatggctgc agagttaatg ttcccagccc
1860agtccccttt cgggggcctt gggagagttt aaggcacctg ctggttccag gacctcgctt
1920tccatctgtt cttgttgcaa tgccatcttc aaaccgtttt atttattgaa gtgtttgttc
1980agttaggggc tggagagagg gagcttgctg cctcctgcct tgctacacta atgtttacag
2040cacctaagct tagcctccag ggccccacct ctcccagctg atggtgagct gacagtgtcc
2100acaggttcca ggaccatttg agattggaag ctacactcaa agacactccc accaggctct
2160ttctcccttt tcctcttctc actgccctgg aatcaacagg ctggttgctg gttagatttt
2220ctgaaacagg aggtaaaatt tttctttggc agaggcccct aagcaaggga ggggtgttgg
2280agagccagtg cccttaagac tggagaaagc tgcaatttac caagttgcct tttgccactg
2340tagctgacca ggggactagg ttgtagaggt gggaaggccc ctctgggctg atcttgtgcc
2400attcttgacc ttggacctgc ttggttaagg agggagtggg ccagaccaga gtgccaggag
2460ctaatggagc caggcctgac acctaggagt ggtccaaagc cttcagccta gatggtgcaa
2520agctggggcc agcctgtctt caccggcacc ctcacctgtg acaccaagac ccaccccaat
2580ccagacttca cacagtattc tcccccacgc cgtctatgac caaaggcccc tgccaggtgt
2640gggtccacag cagcaggtat gtgtgaaagc aacgtagcgc cccgcggact gcagtgcgct
2700taaccaactc acctcccttc tcttagccca agcctgtccc tcgcacagcc tcgcacaaac
2760cacattgcct ggtggggccc agtgtactga aataaagtcg ttccgataga cacgtcaaaa
2820aaaaaaaa
2828
SEQ ID NO: 63, length 415, DNA, Homo sapiens
ttttttttat tgctattaag atttttcttt taatatgcca
tgagatatct tgattgtata 60ttttccaaag tactttccag ccacatctcc caacccatcc
aaaagacttt gccagtcttt 120ccaatgcaat aaaagatgct ggattatagt tttgtctacc
atttcttttt gaaagcaata 180ttatactaat gactttaatg gtaatacact cttatctaat
aaagaaacac atttacaaat 240atcagaaacc cagttttgga acaatttgca taaattttga
actgaatcag cattttgtgg 300gttttttaaa aggcagcagt ttgactcacg acttgctgat
aaacacgttt ctgctgaggg 360aaggggaaaa gacagggaga gtgaatgctg catttctcca
ttggccccaa aagtg 415
SEQ ID NO: 64, length 2455, DNA, Homo sapiens
gaattcgggc gggcttcttc
gctgccgacg tacgacgagt ggccgggctc ttgcgtctgg 60taacgcgctg tctctaacgc
cagcgccgtc tcgcgcgcac tgcgcacaga ccacccgcag 120acgcccggca gtccgcaggc
ccaaacgcgc acgcgacccc gctctccgca ccgtacccgg 180ccgcctcggc atggcgcccc
gcagcgcccg gcgacccctg ctgctgctac tgcctgttgc 240tgctgctcgg cctcatgcat
tgtcgtcagc agccatgttt atggtgaaaa atggcaacgg 300gaccgcgtgc ataatggcca
acttctctgc tgccttctca gtgaactacg acaccaagag 360tggccccaag aacatgacct
ttgacctgcc atcagatgcc acagtggtgc tcaaccgcag 420ctcctgtgga aaagagaaca
cttctgaccc cagtctcgtg attgcttttg gaagaggaca 480tacactcact ctcaatttca
cgagaaatgc aacacgttac agcgttcagc tcatgagttt 540tgtttataac ttgtcagaca
cacacctttt ccccaatgcg agctccaaag aaatcaagac 600tgtggaatct ataactgaca
tcagggcaga tatagataaa aaatacagat gtgttagtgg 660cacccaggtc cacatgaaca
acgtgaccgt aacgctccat gatgccacca tccaggcgta 720cctttccaac agcagcttca
gcaggggaga gacacgctgt gaacaagaca ggccttcccc 780aaccacagcg ccccctgcgc
cacccagccc ctcgccctca cccgtgccca agagcccctc 840tgtggacaag tacaacgtga
gcggcaccaa cgggacctgc ctgctggcca gcatggggct 900gcagctgaac ctcacctatg
agaggaagga caacacgacg gtgacaaggc ttctcaacat 960caaccccaac aagacctcgg
ccagcgggag ctgcggcgcc cacctggtga ctctggagct 1020gcacagcgag ggcaccaccg
tcctgctctt ccagttcggg atgaatgcaa gttctagccg 1080gtttttccta caaggaatcc
agttgaatac aattcttcct gacgccagag accctgcctt 1140taaagctgcc aacggctccc
tgcgagcgct gcaggccaca gtcggcaatt cctacaagtg 1200caacgcggag gagcacgtcc
gtgtcacgaa ggcgttttca gtcaatatat tcaaagtgtg 1260ggtccaggct ttcaaggtgg
aaggtggcca gtttggctct gtggaggagt gtctgctgga 1320cgagaacagc acgctgatcc
ccatcgctgt gggtggtgcc ctggcggggc tggtcctcat 1380cgtcctcatc gcctacctcg
tcggcaggaa gaggagtcac gcaggctacc agactatcta 1440gcctggtgca cgcaggcaca
gcagctgcag gggcctctgt tcctttctct gggcttaggg 1500tcctgtcgaa ggggaggcac
actttctgca aacgtttctc aaatctgctt catccaatgt 1560gaagttcatc ttgcagcatt
tactatgcac aacagagtaa ctatcgaaat gacggtgtta 1620attttgctaa ctgggttaaa
tattttgcta actggttaaa cattaatatt taccaaagta 1680ggattttgag ggtgggggtg
ctctctctga gggggtgggg gtgccgctgt ctctgagggg 1740tgggggtgcc gctgtctgag
gggtgggggt gccgctctct ctgagggggt gggggtgccg 1800ctttctctga gggggtgggg
gtgccgctct ctctgagggg gtgggggtgc tgctctctcc 1860gaggggtgga atgccgctgt
ctctgagggg tgggggtgcc gctctaaatt ggctccatat 1920cattgagttt agggttctgg
tgtttggttt cttcattctt tactgcactc agatttaagc 1980cttacaaagg gaaacctctg
gccgtcacac gtaggacgca tgaaggtcac tcgtgtgagg 2040ctgacatgct cacacattac
aacagtagag agggaaaatc ctaagacaga ggaactccag 2100agatgagtgt ctggagcggc
ttcagttcag ctttaaaggc caggacgcgc gacacgtggc 2160tggcggcctc gttccagtgg
cggcacgtcc ttggcgtctc taatgtctgc agctcaaggg 2220ctggcacttt tttaaatata
aaaatggtgt tatttttatt tttttttgta aagtgatttt 2280tggtcttctg ttgacattcg
ggtgatcctg ttctgcgctg tgtacaatgt gagatcggtg 2340cgttctcctg atgttttgcc
gtggcttggg gattgtacac gggaccagct cacgtaatgc 2400attgcctgta acaatgtaat
aaaaagcctc tttctttcaa aaaaaccccg aattc 2455
SEQ ID NO: 65, length 3583, DNA, Homo sapiens
cgcggacccg gccggcccag gcccgcgccc gccgcggccc tgagaggccc cggcaggtcc
60cggcccggcg gcggcagcca tggccggggg gccgggcccg ggggagcccg cagcccccgg
120cgcccagcac ttcttgtacg aggtgccgcc ctgggtcatg tgccgcttct acaaagtgat
180ggacgccctg gagcccgccg actggtgcca gttcgccgcc ctgatcgtgc gcgaccagac
240cgagctgcgg ctgtgcgagc gctccgggca gcgcacggcc agcgtcctgt ggccctggat
300caaccgcaac gcccgtgtgg ccgacctcgt gcacatcctc acgcacctgc agctgctccg
360tgcgcgggac atcatcacag cctggcaccc tcccgccccg cttccgtccc caggcaccac
420tgccccgagg cccagcagca tccctgcacc cgccgaggcc gaggcctgga gcccccggaa
480gttgccatcc tcagcctcca ccttcctctc cccagctttt ccaggctccc agacccattc
540agggcctgag ctcggcctgg ttccaagccc tgcttccctg tggcctccac cgccatctcc
600agccccttct tctaccaagc caggcccaga gagctcagtg tccctcctgc agggagcccg
660cccctctccg ttttgctggc ccctctgtga gatttcccgg ggcacccaca acttctcgga
720ggagctcaag atcggggagg gtggctttgg gtgcgtgtac cgggcggtga tgaggaacac
780ggtgtatgct gtgaagaggc tgaaggagaa cgctgacctg gagtggactg cagtgaagca
840gagcttcctg accgaggtgg agcagctgtc caggtttcgt cacccaaaca ttgtggactt
900tgctggctac tgtgctcaga acggcttcta ctgcctggtg tacggcttcc tgcccaacgg
960ctccctggag gaccgtctcc actgccagac ccaggcctgc ccacctctct cctggcctca
1020gcgactggac atccttctgg gtacagcccg ggcaattcag tttctacatc aggacagccc
1080cagcctcatc catggagaca tcaagagttc caacgtcctt ctggatgaga ggctgacacc
1140caagctggga gactttggcc tggcccggtt cagccgcttt gccgggtcca gccccagcca
1200gagcagcatg gtggcccgga cacagacagt gcggggcacc ctggcctacc tgcccgagga
1260gtacatcaag acgggaaggc tggctgtgga cacggacacc ttcagctttg gggtggtagt
1320gctagagacc ttggctggtc agagggctgt gaagacgcac ggtgccagga ccaagtatct
1380gaaagacctg gtggaagagg aggctgagga ggctggagtg gctttgagaa gcacccagag
1440cacactgcaa gcaggtctgg ctgcagatgc ctgggctgct cccatcgcca tgcagatcta
1500caagaagcac ctggacccca ggcccgggcc ctgcccacct gagctgggcc tgggcctggg
1560ccagctggcc tgctgctgcc tgcaccgccg ggccaaaagg aggcctccta tgacccaggt
1620gtacgagagg ctagagaagc tgcaggcagt ggtggcgggg gtgcccgggc atttggaggc
1680cgccagctgc atcccccctt ccccgcagga gaactcctac gtgtccagca ctggcagagc
1740ccacagtggg gctgctccat ggcagcccct ggcagcgcca tcaggagcca gtgcccaggc
1800agcagagcag ctgcagagag gccccaacca gcccgtggag agtgacgaga gcctaggcgg
1860cctctctgct gccctgcgct cctggcactt gactccaagc tgccctctgg acccagcacc
1920cctcagggag gccggctgtc ctcaggggga cacggcagga gaatcgagct gggggagtgg
1980cccaggatcc cggcccacag ccgtggaagg actggccctt ggcagctctg catcatcgtc
2040gtcagagcca ccgcagatta tcatcaaccc tgcccgacag aagatggtcc agaagctggc
2100cctgtacgag gatggggccc tggacagcct gcagctgctg tcgtccagct ccctcccagg
2160cttgggcctg gaacaggaca ggcaggggcc cgaagaaagt gatgaatttc agagctgatg
2220tgttcacctg ggcagatccc ccaaatccgg aagtcaaagt tctcatggtc agaagttctc
2280atggtgcacg agtcctcagc actctgccgg cagtgggggt gggggcccat gcccgcgggg
2340gagagaagga ggtggccctg ctgttctagg ctctgtgggc ataggcaggc agagtggaac
2400cctgcctcca tgccagcatc tgggggcaag gaaggctggc atcatccagt gaggaggctg
2460gcgcatgttg ggaggctgct ggctgcacag acccgtgagg ggaggagagg ggctgctgtg
2520caggggtgtg gagtagggag ctggctcccc tgagagccat gcagggcgtc tgcagcccag
2580gcctctggca gcagctcttt gcccatctct ttggacagtg gccaccctgc acaatggggc
2640cgacgaggcc tagggccctc ctacctgctt acaatttgga aaagtgtggc cgggtgcggt
2700ggctcacgcc tgtaatccca gcactttggg aggccaaggc aggaggatcg ctggagccca
2760gtaggtcaag accagccagg gcaacatgat gagaccctgt ctctgccaaa aaatttttta
2820aactattagc ctggcgtggt agcgcacgcc tgtggtccca gctgctgggg aggctgaagt
2880aggaggatca tttatgcttg ggaggtcgag gctgcagtga gtcatgattg tatgactgca
2940ctccagcctg ggtgacagag caagaccctg tttcaaaaag aaaaaccctg ggaaaagtga
3000agtatggctg taagtctcat ggttcagtcc tagcaagaag cgagaattct gagatcctcc
3060agaaagtcga gcagcaccca cctccaacct cgggccagtg tcttcaggct ttactgggga
3120cctgcgagct ggcctaatgt ggtggcctgc aagccaggcc atccctgggc gccacagacg
3180agctccgagc caggtcaggc ttcggaggcc acaagctcag cctcaggccc aggcactgat
3240tgtggcagag gggccactac ccaaggtcta gctaggccca agacctagtt acccagacag
3300tgagaagccc ctggaaggca gaaaagttgg gagcatggca gacagggaag ggaaacattt
3360tcagggaaaa gacatgtatc acatgtcttc agaagcaagt caggtttcat gtaaccgagt
3420gtcctcttgc gtgtccaaaa gtagcccagg gctgtagcac aggcttcaca gtgattttgt
3480gttcagccgt gagtcacact acatgccccc gtgaagctgg gcattggtga cgtccaggtt
3540gtccttgagt aataaaaacg tatgttccct aaaaaaaaaa aaa
3583
SEQ ID NO: 66, length 3496, DNA, Homo sapiens
gaattctatg gagtgtaatt ttgtgtatga attatatttt
taaaacattg aagagttttc 60agaaagaagg ctagtagagt tgattactga tactttatgc
taagcagtac ttttttggta 120gtacaatatt ttgttaggcg tttctgataa cactagaaag
gacaagtttt atcttgtgat 180aaattgatta atgtttacaa catgactgat aattatagct
gaatagtcct taaatgatga 240acaggttatt tagtttttaa atgcagtgta aaaagtgtgc
tgtggaaatt ttatggctaa 300ctaagtttat ggagaaaata ccttcagttg atcaagaata
atagtggtat acaaagttag 360gaagaaagtc aacatgatgc tgcaggaaat ggaaacaaat
acaaatgata tttaacaaag 420atagagttta cagtttttga actttaagcc aaattcattt
gacatcaagc actatagcag 480gcacaggttc aacaaagctt gtgggtattg acttccccca
aaagttgtca gctgaagtaa 540tttagcccac ttaagtaaat actatgatga taagctgtgt
gaacttagct tttaaatagt 600gtgaccatat gaaggtttta attacttttg tttattggaa
taaaatgaga ttttttgggt 660tgtcatgtta aagtgcttat agggaaagaa gcctgcatat
aattttttac cttgtggcat 720aatcagtaat tggtctgtta ttcaggcttc atagcttgta
accaaatata aataaaaggc 780ataatttagg tattctatag ttgcttagaa ttttgttaat
ataaatctct gtgaaaaatc 840aaggagtttt aatattttca gaagtgcatc cacctttcag
ggctttaagt tagtattact 900caagattatg aacaaatagc acttaggtta cctgaaagag
ttactacaac cccaaagagt 960tgtgttctaa gtagtatctt ggaaattcag agagatactc
atcctacctg aatataaact 1020gagataaatc cagtaaagaa agtgtagtaa attctacata
agagtctatc attgatttct 1080tttggtggta aaaatcttag ttcatgtgaa gaaatttcat
gtgaatgttt tagctatcaa 1140acagcactgt cacctactca tgcacaaaac tgcctcccaa
agacttttcc caggtccctc 1200gtatcaaaac attaagagta taatggaaga tagcacgatc
ttgtcagatt ggacaaacag 1260caacaaacaa aaaatgaagt atgacttttc ctgtgaactc
tacagaatgt ctacatattc 1320aactttcccc gccggggtgc ctgtctcaga aaggagtctt
gctcgtgctg gtttttatta 1380tactggtgtg aatgacaagg tcaaatgctt ctgttgtggc
ctgatgctgg ataactggaa 1440actaggagac agtcctattc aaaagcataa acagctatat
cctagctgta gctttattca 1500gaatctggtt tcagctagtc tgggatccac ctctaagaat
acgtctccaa tgagaaacag 1560ttttgcacat tcattatctc ccaccttgga acatagtagc
ttgttcagtg gttcttactc 1620cagcctttct ccaaaccctc ttaattctag agcagttgaa
gacatctctt catcgaggac 1680taacccctac agttatgcaa tgagtactga agaagccaga
tttcttacct accatatgtg 1740gccattaact tttttgtcac catcagaatt ggcaagagct
ggtttttatt atataggacc 1800tggagatagg gtagcctgct ttgcctgtgg tgggaagctc
agtaactggg aaccaaagga 1860tgatgctatg tcagaacacc ggaggcattt tcccaactgt
ccatttttgg aaaattctct 1920agaaactctg aggtttagca tttcaaatct gagcatgcag
acacatgcag ctcgaatgag 1980aacatttatg tactggccat ctagtgttcc agttcagcct
gagcagcttg caagtgctgg 2040tttttattat gtgggtcgca atgatgatgt caaatgcttt
tgttgtgatg gtggcttgag 2100gtgttgggaa tctggagatg atccatgggt agaacatgcc
aagtggtttc caaggtgtga 2160gttcttgata cgaatgaaag gccaagagtt tgttgatgag
attcaaggta gatatcctca 2220tcttcttgaa cagctgttgt caacttcaga taccactgga
gaagaaaatg ctgacccacc 2280aattattcat tttggacctg gagaaagttc ttcagaagat
gctgtcatga tgaatacacc 2340tgtggttaaa tctgccttgg aaatgggctt taatagagac
ctggtgaaac aaacagttca 2400aagtaaaatc ctgacaactg gagagaacta taaaacagtt
aatgatattg tgtcagcact 2460tctaaatgct gaagatgaaa aaagagagga ggagaaggaa
aaacaagctg aagaaatggc 2520atcagatgat ttgtcattaa ttcggaagaa cagaatggct
ctctttcaac aattgacatg 2580tgtgcttcct atcctggata atcttttaaa ggccaatgta
attaataaac aggaacatga 2640tattattaaa caaaaaacac agataccttt acaagcgaga
gaactgattg ataccatttt 2700ggttaaagga aatgctgcgg ccaacatctt caaaaactgt
ctaaaagaaa ttgactctac 2760attgtataag aacttatttg tggataagaa tatgaagtat
attccaacag aagatgtttc 2820aggtctgtca ctggaagaac aattgaggag gttgcaagaa
gaacgaactt gtaaagtgtg 2880tatggacaaa gaagtttctg ttgtatttat tccttgtggt
catctggtag tatgccagga 2940atgtgcccct tctctaagaa aatgccctat ttgcaggggt
ataatcaagg gtactgttcg 3000tacatttctc tcttaaagaa aaatagtcta tattttaacc
tgcataaaaa ggtctttaaa 3060atattgttga acacttgaag ccatctaaag taaaaaggga
attatgagtt tttcaattag 3120taacattcat gttctagtct gctttggtac taataatctt
gtttctgaaa agatggtatc 3180atatatttaa tcttaatctg tttatttaca agggaagatt
tatgtttggt gaactatatt 3240agtatgtatg tgtacctaag ggagtagtgt cactgcttgt
tatgcatcat ttcaggagtt 3300actggatttg ttgttctttc agaaagcttt gaatactaaa
ttatagtgta gaaaagaact 3360ggaaaccagg aactctggag ttcatcagag ttatggtgcc
gaattgtctt tggtgctttt 3420cacttgtgtt ttaaaataag gatttttctc ttatttctcc
ccctagtttg tgagaaacat 3480ctcaataaag tgcttt
3496
SEQ ID NO: 67, length 2764, DNA, Homo sapiens
ctctaaagct tagagccaag
atggcgggat ccaggcaaag gggtctccgg gccagagttc 60ggccgctgtt ctgcgccttg
ctgctgtcac tcggtcgctt cgtccggggc gacggcgtgg 120gaggagaccc cgcggtcgcg
ttgccacatc gccgtttcga gtacaaatac agcttcaagg 180ggccgcacct ggtgcagagc
gacgggaccg tgcccttctg ggcccacgcg gggaatgcta 240ttccaagttc agatcaaatt
cgagtagcac catctttaaa aagccaaaga ggctcagtgt 300ggacaaagac aaaagcggcc
tttgagaact gggaagttga ggtgacattt cgagtgactg 360gaagaggtcg aattggagct
gatggcctag caatttggta tgcagaaaat caaggcttgg 420agggccctgt gtttggatca
gctgatctgt ggaatggtgt tggaatattt tttgattctt 480ttgacaatga tggaaagaaa
aataatcctg ctatagtaat tataggcaac aatggacaaa 540tccattatga ccatcaaaat
gacggggcta gtcaagcttt ggcaagttgc cagagggact 600tccgcaacaa accctatcct
gtccgagcaa agattaccta ttaccagaac acactgacag 660taatgatcaa taatggcttt
acaccagata aaaatgatta tgaattttgt gccaaagtgg 720aaaatatgat tatccctgca
caagggcatt ttggaatatc tgctgcaact ggaggtcttg 780cagatgacca tgatgtcctt
tcttttctga ctttccagtt gactgaacct ggaaaagagc 840cgcccacacc agataaagaa
atttcggaaa aggaaaaaga aaagtatcag gaggaatttg 900agcactttca acaagaattg
gataaaaaaa aagaggaatt ccagaagggc caccccgacc 960tccaagggca gcctgcggag
gaaatatttg agagtgtagg agatcgagag ctaagacaag 1020tctttgaagg acagaatcgt
attcatcttg aaatcaagca gctgaaccgg cagttagata 1080tgattcttga tgaacagaga
agatatgtct cttccttaac agaggaaatc tctaaaagag 1140gagcaggaat gcctgggcag
catgggcaga ttactcaaca agaactggat actgttgtga 1200aaactcagca tgagattctg
agacaagtaa atgaaatgaa aaattccatg agtgaaaccg 1260tcagactggt cagtggaatg
cagcaccctg gctctgctgg aggcgtctat gagacaacac 1320agcacttcat tgacatcaaa
gagcacctgc acatagtaaa gagggacata gataacttag 1380tgcagcgaaa tatgccatca
aatgaaaagc cgaaatgccc agaactacca ccatttccat 1440catgtttgtc tacggtccac
ttcattatat ttgttgtggt gcaaactgta ttattcattg 1500gttatatcat gtataggtct
cagcaagaag cagctgccaa aaaattcttt tgactaccat 1560tttcctgtgt acttcatcta
tttgtgtaca aaatgatgtc gttttgaggg aatttaagta 1620tttaaattgc ttcatagtct
aaattattaa ttttcttaat aaaataactg tttaaacatt 1680gatttgcagt taagaataaa
ccttaaagca aagacaacca cattttaatt tgttcacagt 1740atgtaaatct gtctaaattt
cagtgaattt ctggtcagta tgatgcagcc tctgagcaga 1800tattgaccag taagagggta
aataaagtgg gggcaacccc tggatatgaa tgttaccccc 1860taagtctcca atattgcagg
tttccctgta taacgtaaac acacttgccc tcatgcctcc 1920cagaatatga ggtctaatta
agaagtccca tcaggtttat tttgtaacca aagtcttttt 1980tagaggtcag acttcctaat
caaaggcctg ggcctgcagt cctttcatct taatgcaact 2040tcctttgaaa tcaaagaata
ttttgtctga gagctttaag gatctggtaa tagacttcaa 2100aatgttaagt gaaatttttt
ttcctctatt tatcaatgat atatttcact tttaaaggaa 2160attttggagg aaaatatagc
tgctttttgc ctaaaaaacc ttgtgggtgg aaatattcct 2220ctgagaatgg cttttatagg
tattttgcct ggtaatgtat tcattcatga ttgcccatat 2280tcttgaatgt ttcttcattc
caatggggtc aggtcaatat tatgaaaata atttttatat 2340ttatatttgt aactaagaat
ttatttctcc ctttactaca cgatgtaaat tcacgtcaaa 2400ttcgatgatc tgaggattta
aattcacaaa acctgccact acattctggt ttacattagt 2460tacttcatgc tggctggggt
tagtgaccat ttgcatactc ttttaaatca aggaggctgt 2520agtagaggca gttttaagat
tcttgaaggc aaaatttgaa aaacagtgaa tacttctaat 2580tgtttccttt tagtgccaga
actaagacat tgtgaagcac ttgttagtaa acttaacctt 2640gaaatgtcag actggaagga
gtttttatgt ctttgtgcat acttctgggt attacagaaa 2700cagtctgtaa ataacatttt
aagatgcaaa tttaattctg ttcacagctg atttatactg 2760attt
2764
SEQ ID NO: 68, length 403, DNA, Homo sapiens
tttcattagt tatcattagt ttattataaa agagaaatat ggaaattatt tacatgacga
60aagatttcag aacttcagtg gaatgggcag catcatgttg atgccatttc aatagtgact
120tatttcagtc tacgtacttt ccaagaatgt caccatctct aaataggaaa taatccttgt
180catctagaac tactttggtg cctccatatt ctgggagaag aactttatct ccaactttca
240cgctaactgg ttgaatctct ccaccctttc ctttagaacc cgatccaaca gcgactactg
300ttgcttgcaa tacttttcct tgagattttt ctggaagcat aatgcctcct ttggttacag
360tttcagcagc actcctttca accaatactc ggtcaaagag tgg
403
SEQ ID NO: 69, length 656, DNA, Homo sapiens
acaactcggt ggtggccact gcgcagacca gacttcgctc
gtactcgtgc gcctcgcttc 60gcttttcctc cgcaaccatg tctgacaaac ccgatatggc
tgagatcgag aaattcgata 120agtcgaaact gaagaagaca gagacgcaag agaaaaatcc
actgccttcc aaagaaacga 180ttgaacagga gaagcaagca ggcgaatcgt aatgaggcgt
gcgccgccaa tatgcactgt 240acattccaca agcattgcct tcttatttta cttcttttag
ctgtttaact ttgtaagatg 300caaagaggtt ggatcaagtt taaatgactg tgctgcccct
ttcacatcaa agaactactg 360acaacgaagg ccgcgcctgc ctttcccatc tgtctatcta
tctggctggc agggaaggaa 420agaacttgca tgttggtgaa ggaagaagtg gggtggaaga
agtggggtgg gacgacagtg 480aaatctagag taaaaccaag ctggcccaag gtgtcctgca
ggctgtaatg cagtttaatc 540agagtgccat tttttttttt gttcaaatga ttttaattat
tggaatgcac aattttttta 600atatgcaaat aaaaagttta aaaacttaaa aaaaaaaaaa
aaaaaaaaaa aaaaaa 65670361DNAHomo sapiens 70tttttttttc aatgttcagt
ttcctttaat gacccccatc tccctgaagg gcaggtgcag 60gcagctaggt gatggcaaga
gatgttcact tgaagatctt gccctgattg aaggctttgc 120cacatgctgg aaggccccct
cccaggaaaa gtactctcga accagcgtct gggtctcctc 180gctgccagga tccagtttcc
gccatgtgta tgactcgtag tccacctgcc aatctggact 240cagcggaaag gcaagctcct
ggcctcggaa gacccagact ccagaaatgg agtctgctat 300tgttggttcc aaaaaggatg
acactgggcg aaggcatttc ttcctcagct tgtccagttc 360g
36171455DNAHomo
sapiensmisc_feature(376)..(376)n is a, c, g, or t 71ttttttttga taatttatga
ttttattgtc tttcctttgt ccggccttta acatgtttct 60gtaatttaaa taaaaatcta
tttactttct ccattttagc aaatggtttc tttacccaaa 120taggttgcac tatagtcccc
atatggtttt ctactgttcc acaaccacta tttcacaaag 180attgacaaaa ctttaataaa
agttaaattt acaggacatc ttaaggataa cttggggaaa 240tatgtaggta aaaaaggaat
cgagtccaca aattaaggaa tattttgcta atatggccca 300acaccaattt caggcaaatc
caatctactt aactcatata tttaatgtgg ggtaattttt 360cttaaccaaa atttangggg
gggtatggan tggatattat ttatggccct tggacaaggg 420tggacngtgt ggntttgttg
tggactaggg ngggg 45572645DNAHomo sapiens
72
ctcctgcagc gtctggggtt tccgttgcag tcctcggaac caggacctcg gcgtggccta 60
gcgagttatg gcgacgaagg ccgtgtgcgt gctgaagggc gacggcccag tgcagggcat 120
catcaatttc gagcagaagg aaagtaatgg accagtgaag gtgtggggaa gcattaaagg 180
actgactgaa ggcctgcatg gattccatgt tcatgagttt ggagataata cagcaggctg 240
taccagtgca ggtcctcact ttaatcctct atccagaaaa cacggtgggc caaaggatga 300
agagaggcat gttggagact tgggcaatgt gactgctgac aaagatggtg tggccgatgt 360
gtctattgaa gattctgtga tctcactctc aggagaccat tgcatcattg gccgcacact 420
ggtggtccat gaaaaagcag atgacttggg caaaggtgga aatgaagaaa gtacaaagac 480
aggaaacgct ggaagtcgtt tggcttgtgg tgtaattggg atcgcccaat aaacattccc 540
ttggatgtag tctgaggccc cttaactcat ctgttatcct gctagctgta gaaatgtatc 600
ctgataaaca ttaaacactg taatcttaaa aaaaaaaaaa aaaaa 645
73 1684 DNA Homo sapiens
73
gctttcacaa atacagctct gcaacgcgtt tgccctgata ccatgtctct tcgactttcc 60
agtgcatcca ggaggtcctg tcctcgtccc accactggat cactcagact ctatggtggg 120
ggaaccagct ttggtactgg aaattcttgt ggcatttcag ggattggaag tggcttctct 180
agtgccttcg gaggcagctc atcgggagga aacacagggg gaggtaatcc ctgtgctggc 240
ttcactgtga atgagcgggg gctcctttct ggcaatgaga aggtgaccat gcagaacctc 300
aatgaccgcc tggcatccta cctggacagt gtgcatgctc tggaggaggc caacgctgac 360
ctggagcaga agatcaaggg ctggtatgag aaatttgggc ctggctcttg ccgtggtctt 420
gatcatgact atagcagata tttcccaata attgatgacc ttaaaaatca gatcatcgca 480
tccaccacca gcaatgctaa tgctgttctg cagatcgata atgccaggct tacagctgat 540
gatttcagac tcaagtatga aaatgagctg gctcttcacc agagtgtaga ggctgatgtc 600
aatgggttac gaagagtttt ggatgaaata accctgtgca gaacagatct ggagattcag 660
tatgaaaccc tgagtgagga gatgacttac ctcaaaaaga accataaaga ggaaatgcaa 720
gttctgcagt gcgcagctgg aggcaacgtg aacgtggaga tgaacgcagc ccccggggtg 780
gacctcacag ttctgctgaa caacatgcga gctgagtacg aagcccttgc agagcagaac 840
cgcagggacg cggaggcctg gttcaacgag aagagcgcct ccctgcagca gcagatctct 900
gaggatgtcg gagccacaac ctcagcccgg aatgagctga ctgaaatgaa gcgcactctt 960
caaaccctgg aaattgaact tcagtctctc ctagccacga aacactccct ggagtgctcc 1020
ttgacagaga ccgagagcaa ctactgtgcg cagctggcgc agatccaggc tcagatcggg 1080
gccctggagg agcagctgca ccaggtcaga accgagaccg agggccagaa gctggagtat 1140
gagcagctcc tggacatcaa gctccacctg gaaaaagaaa ttgagaccta ctgtctcctt 1200
ataggaggag atgatggagc ctgtaagtct gggggttaca agtctaaaga ttatggatct 1260
ggaaatgtgg gaagtcaagt caaagaccca gccaaagcca tagtggttaa gaaagttctt 1320
gaggaggtag accaacgcag caaaatactt accaccaggc tccactccct ggaagagaaa 1380
tctcaaagca attaatttga gatgcaacag agaacgtatg ccacatagcc cctgcgaaga 1440
aaaggcatta tgtatctgtc cagaaaaatg tgcatgtcta agaaaaatgt ctaacctgtt 1500
gtctttctgt tactttcttt ctgggcaatc aatgacagca tctccccatt catctagaag 1560
aatgccacac acaaatatga ctcatttgat tatcctacag aaatctgttg tcaattcttt 1620
gtattcaata aacctcttct ttagcaagtt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1680
aaaa 1684