Patent application title: USING PHAGE EPITOPES TO PROFILE THE IMMUNE RESPONSE
Inventors:
Arul M. Chinnaiyan (Plymouth, MI, US)
Xiaoju Wang (Ann Arbor, MI, US)
Alex Tsodikov (Ann Arbor, MI, US)
Jeanne Ohrnberger (Northville, MI, US)
Assignees:
THE REGENTS OF THE UNIVERSITY OF MICHIGAN
Armune Biosciences, Inc.
IPC8 Class: AC40B4002FI
USPC Class:
506 14
Class name: Combinatorial chemistry technology: method, library, apparatus library, per se (e.g., array, mixture, in silico, etc.) library contained in or displayed by a micro-organism (e.g., bacteria, animal cell, etc.) or library contained in or displayed by a vector (e.g., plasmid, etc.) or library containing only micro-organisms or vectors
Publication date: 2011-09-29
Patent application number: 20110237461
Abstract:
The present disclosure provides compositions and methods for using one or
more polypeptide probes to profile an immune response. The polypeptide
probe can be used to detect one or more antibodies from a sample.
Furthermore, the present disclosure provides methods and compositions for
characterizing a cancer based on the detection of one or more antibodies,
such as autoantibodies.
Claims:
1. An antibody profiling panel comprising: a plurality of polypeptide
probes, wherein at least one of said polypeptide probes comprises: a
full-length or fragment of a protein listed in Table 1 or polypeptide
sequence selected from SEQ ID NO: 56, 57, 58, 59, 60, 61, 62, 63, 64, 65,
66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122,
123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136,
137, 138, 139, 140, or 141, wherein each of said probes in said plurality
of polypeptide probes is capable of being specifically bound by an
antibody.
2. The antibody profiling panel of claim 1, wherein at least one of said polypeptide probes comprises a polypeptide sequence selected from SEQ ID NO: 1, 2, 3, 4, 5, 6, or 7.
3. The antibody profiling panel of claim 1, wherein said panel further comprises a full-length or fragment of a protein listed in Tables 2, 3, or 4.
4. The antibody profiling panel of claim 3, wherein at least one of said polypeptide probes comprises a polypeptide sequence selected from SEQ ID NO: 8, 9, 10, 11, 12, 13, or 14.
5. The antibody profiling panel of claim 1, wherein at least one of said polypeptide probes comprises a full-length or fragment of a protein that is CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, or Deaminase Domain.
6. The antibody profiling panel of claim 5, wherein at least one of said polypeptide probes comprises a polypeptide sequence selected from SEQ ID NO: 2, 5, 56, 57, 58, 59, 61, 62, 63, 64, 65, 66, 67, 68, or 69.
7. The antibody profiling panel of claim 1, wherein at least one of said polypeptide probes comprises a full-length or fragment of a protein that is FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789.
8. The antibody profiling panel of claim 7, wherein said full-length or fragment of a protein that is FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789 comprises a polypeptide sequence selected from SEQ ID NO: 9, 11, 14, or 60.
9. The antibody profiling panel of claim 1, wherein each of said probes is displayed by a phage.
10. The antibody profiling panel of claim 1, wherein each of said probes is attached to a substrate.
11. The antibody profiling panel of claim 9, wherein each of said probes is attached to a substrate via said phage.
12. The antibody profiling panel of claim 10, wherein said substrate is an array.
13. The antibody profiling panel of claim 1, wherein said panel comprises at least 5 polypeptide probes.
14. The antibody profiling panel of claim 1, wherein said panel screens a subject for a cancer with greater specificity and sensitivity as compared to a panel with less than said plurality of probes.
15. The antibody profiling panel of claim 14, wherein said cancer is prostate, breast or lung cancer.
16. The antibody profiling panel of claim 1, wherein said antibody is an autoantibody.
17. The antibody profiling panel of claim 16, wherein said autoantibody is a human autoantibody.
18. A method for screening a subject for a cancer comprising: detecting in a sample obtained from a subject an expression level of one or more antibodies with at least one polypeptide probe comprising: a full-length or fragment of a protein listed in Table 1 or a polypeptide sequence selected from SEQ ID NO: 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, or 141; wherein said expression level is indicative of the presence, absence, or stage of said cancer.
19. The method of claim 18, wherein said screening is with greater specificity and sensitivity as compared to a panel with less than said plurality of probes.
20. The method of claim 18, wherein said cancer is prostate, breast or lung cancer.
21. A method of recommending a biopsy be obtained comprising: (a) contacting a biological sample obtained from a subject with one or more probes for an antibody, wherein said subject has a PSA level greater than about 2.5 ng/mL; (b) detecting an expression level of an antibody; and (c) recommending a biopsy be obtained based on said expression level of said antibody.
22. The method of claim 21, wherein said PSA level is between about 2.5 ng/mL and about 10 ng/mL.
23. The method of claim 21, further comprising: (a) contacting a biological sample obtained from said subject with one or more probes for a second antibody when said biopsy provides a positive result for cancer; (b) detecting an expression level for said second antibody; and (c) providing a prognosis or theranosis based on said expression level of said second antibody.
24. A method of screening a subject for a cancer comprising: (a) contacting a biological sample obtained from said subject with one or more probes for an antibody, wherein said subject has a positive biopsy result for cancer; and (b) detecting an expression level for said antibody, wherein said expression level is indicative of the presence, absence, or stage of said cancer.
25. The method of claim 24, wherein said cancer is aggressive or indolent.
26. The method of claim 21 or 24, wherein said detecting is with at least one polypeptide probe comprising: a full-length or fragment of a protein listed in Table 1, 2, 3, or 4; or a polypeptide sequence selected from SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, or 141.
27. The method of claim 21 or 24, wherein said cancer is prostate cancer, lung cancer or breast cancer.
28. The method of claim 21 or 24, further comprising selecting a treatment for said cancer.
29. The method of claim 21 or 24, wherein said detecting said expression level is by an immunoassay.
30. The method of claim 21 or 24, wherein said subject is a human.
31. The method of claim 21 or 24, wherein said antibody is an autoantibody.
32. The method of claim 31, wherein said autoantibody is a human autoantibody.
Description:
CROSS-REFERENCE
[0001] This application claims the benefit of U.S. provisional application Ser. No. 61/314,750, filed Mar. 17, 2010, which is incorporated herein by reference in its entirety.
BACKGROUND
[0002] It is desirable to improve cancer detection, prognostic prediction, monitoring, and therapeutic decisions. For example, when cancer is identified at the earliest stages, the probability of cure is very high and therefore diagnostic screening tests that can detect these early stages are crucial.
[0003] One example in which early detection can be beneficial is prostate cancer (PCA). PCA is a leading cause of male cancer-related death, second only to lung cancer (Abate-Shen and Shen, Genes Dev 14:2410 (2000); Ruijter et al., Endocr Rev, 20:22 (1999)). Prostate cancer is typically diagnosed with a digital rectal exam and/or prostate specific antigen (PSA) screening. An elevated serum PSA level can indicate the presence of PCA. PSA is used as a marker for prostate cancer because it is secreted only by prostate cells. A healthy prostate will produce a stable amount--typically below 4 nanograms per milliliter (ng/ml), or a PSA reading of "4" or less--whereas cancer cells produce escalating amounts that correspond with the severity of the cancer. A level between 4 and 10 ng/ml may raise a doctor's suspicion that a patient has prostate cancer, while amounts above 50 ng/ml may show that the tumor has spread elsewhere in the body.
[0004] The advent of prostate specific antigen (PSA) screening has led to earlier detection of PCA and significantly reduced PCA-associated fatalities. However, a major limitation of the serum PSA test is a lack of prostate cancer sensitivity and specificity, especially in the intermediate range of PSA detection (4-10 ng/ml). Elevated serum PSA levels are often detected in patients with non-malignant conditions such as benign prostatic hyperplasia (BPH) and prostatitis, and provide little information about the aggressiveness of the cancer detected. Coincident with increased serum PSA testing, there has been a dramatic increase in the number of prostate needle biopsies performed (Jacobsen et al., JAMA 274:1445 (1995)). This has resulted in a surge of equivocal prostate needle biopsies (Epstein and Potter, J. Urol., 166:402 (2001)).
[0005] Thus, development of biomarkers that detect cancer with improved sensitivity and specificity is advantageous.
SUMMARY
[0006] Provided herein are methods and compositions for screening for, or characterizing, a cancer in a subject. In one embodiment, an antibody profiling panel comprising: a plurality of polypeptide probes, wherein at least one of the polypeptide probes comprises a full-length or fragment of a protein encoded by a gene listed in Tables 1, 2, 3, or 4; and each of the probes in the plurality of polypeptide probes is capable of being specifically bound by an antibody, is disclosed herein. In another embodiment, an antibody profiling panel comprising: a plurality of polypeptide probes, wherein at least one of the polypeptide probes comprises a sequence listed in Tables 1, 2, 3, or 4 or a sequence encoded by a sequence listed in Tables 1, 2, 3, or 4; and each of the probes in the plurality of polypeptide probes is capable of being specifically bound by an antibody, is disclosed herein. In one embodiment the subject is a human. In one embodiment the antibody is an autoantibody. In another embodiment the antibody is a human autoantibody. In one embodiment the presence of a human autoantibody that binds to a polypeptide probe is indicative of cancer (e.g. an expression level for one or more autoantibodies is indicative of the presence, absence, or stage of the cancer). In another embodiment the quantity or level of a human autoantibody that binds to a polypeptide probe is indicative of cancer. In one embodiment the cancer is a prostate, lung, breast or colon cancer.
[0007] In one embodiment, the polypeptide probe comprises a polypeptide sequence selected from SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof. In another embodiment, the polypeptide probe comprises a polypeptide sequence encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof.
[0008] In yet another embodiment, the polypeptide probe comprises the full-length or a fragment of a protein that is encoded by DCHS1 (SEQ ID NO: 29), Centrosomal Protein (CEP 164) (SEQ ID NO: 30), KBTBD6 (SEQ ID NO: 31), RPS19 (SEQ ID NO: 32), RPL34 (SEQ ID NO: 33), Hemk1 (SEQ ID NO: 34), eIF4G1 (SEQ ID NO: 35), BMI1 (SEQ ID NO: 36), BRD2 (SEQ ID NO: 37), RP3-323M22 (Nucleolin) (SEQ ID NO: 38), SFRS14 (SEQ ID NO: 39), LOC388789 (SEQ ID NO: 40), RNA binding motif protein 6 (genomic DNA sequence) (SEQ ID NO: 41), BRMSL1 (SEQ ID NO: 42), NKX3-1 (SEQ ID NO: 43), RPSA (SEQ ID NO: 44), Cytochrome C Oxidase 5 subunit (SEQ ID NO: 45), FAM53B (SEQ ID NO: 46), a fragment of the UTR region of chromosome 11 (Homo sapiens genomic DNA, chromosome 11 clone: CTD-2579L12, NTs 149521-151500) (SEQ ID NO: 47), MAPKKK9 (SEQ ID NO: 48), cDNA clone XR_113641.1 (Homo sapiens hypothetical LOC643783, transcript variant 2 (LOC643783), partial miscRNA) (SEQ ID NO: 49), PSA (SEQ ID NO: 50), H2aa4 (SEQ ID NO: 51), UBE2I (SEQ ID NO: 52), TIMP2 (SEQ ID NO: 53), WDR77 (SEQ ID NO: 54), a fragment of Deaminase Domain Cont 1 (Human DNA sequence from clone RP1-20N2 on chromosome 6q24 Contains the gene for a novel protein similar to yeast and bacterial cytosine deaminase, NTs 48121-50100) (SEQ ID NO: 55), Lamin A/C (SEQ ID NO: 85), Lsm3 (SEQ ID NO: 86), a fragment of cDNA clone Chromosome 19, which encompasses the nucleic acid sequence for DAZ associated protein (Homo sapiens chromosome 19 clone CTB-25B13, NTs 20521-22500) (SEQ ID NO: 87), ADAM metallopeptidase domain 9 (SEQ ID NO: 88), AZGP1 (SEQ ID NO: 89), Desmocollin 3 (SEQ ID NO: 90), PERP (SEQ ID NO: 91), Chromosome 3 UTR region ropporin/RhoEGF (Homo sapiens 3 BAC RP11-783D3 (Roswell Park Cancer Institute Human BAC Library) NTs 178621-180600) (SEQ ID NO: 92), Cox5a (SEQ ID NO: 93), a Mitochondrion sequence (Homo sapiens isolate PD047 mitochondrion, NTs 4801-6780) (SEQ ID NO: 94), MYH9 (SEQ ID NO: 95), ASND1 (SEQ ID NO: 96), Cathepsin F (SEQ ID NO: 97), Mastermind-like 2 (Homo sapiens genomic DNA, chromosome 11q clone:RP11-82212, NTs 157801-159780) (SEQ ID NO: 98), CSNK2A2 (SEQ ID NO: 99), AURKAIP1 (SEQ ID NO: 100), a fragment of Chromosome 4 (Homo sapiens BAC clone RP11-327017 from 4, NTs 107401-109380) (SEQ ID NO: 101), ARF6 (SEQ ID NO: 102), JAG1 (Human DNA sequence from clone RP1-278O22 on chromosome 20 Contains two novel genes, NTs 26161-26140) (SEQ ID NO: 103), a Mitochondrion sequence (Homo sapiens isolate PD047 mitochondrion, NTs 2041-4020) (SEQ ID NO: 104), a fragment of Chromosome 20 (Human DNA sequence from clone RP1-278O22 on chromosome 20 Contains two novel genes, NTs 25321-27300) (SEQ ID NO: 105), a fragment of Chromosome 6 UTR region (Human DNA sequence from clone RP3-523G1 on chromosome 6p22.3-24.1, NTs 34621-36600) (SEQ ID NO: 106), a fragment of MAPKKK5 (SEQ ID NO: 107), RASA1 (SEQ ID NO: 108), Hsp90b (SEQ ID NO: 109), ribosomal protein S6 (RPS6) (SEQ ID NO: 110), or a fragment of Homo sapiens chromosome 3 (Homo sapiens 3 BAC RP13-616I3 (Roswell Park Cancer Institute Human BAC Library) NTs 22921-24900) (SEQ ID NO: 111).
[0009] In one embodiment, the antibody profiling panel comprises a plurality of polypeptide probes, wherein at least one of the polypeptide probes comprises a full-length or fragment of a protein listed in Table 1, or a polypeptide sequence selected from SEQ ID NO: 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, or 141, and each of said probes in said plurality of polypeptide probes is capable of being specifically bound by an antibody. In one embodiment, one or more of the polypeptide probes can comprise SEQ ID NO: 1, 2, 3, 4, 5, 6, or 7. In another embodiment, one or more of the polypeptide probes can comprise a polypeptide encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, or 21. In one embodiment, the antibody profiling panel can further comprise a full-length or fragment of a protein listed in Tables 2, 3, or 4. In another embodiment of the antibody profiling panel, one of the polypeptide probes can comprise SEQ ID NO: 8, 9, 10, 11, 12, 13, or 14. In another embodiment, one or more of the polypeptide probes can comprise a polypeptide encoded by SEQ ID NO: 22, 23, 24, 25, 26, 27, or 28. In one embodiment the antibody is an autoantibody. In another embodiment the antibody is a human autoantibody. In one embodiment the presence of a human autoantibody that binds to a polypeptide probe is indicative of cancer (e.g. an expression level for one or more autoantibodies is indicative of the presence, absence, or stage of the cancer). In another embodiment the quantity or level of a human autoantibody that binds to a polypeptide probe is indicative of cancer. In one embodiment the cancer is a prostate, lung, breast or colon cancer.
[0010] In one embodiment, the plurality of probes comprises a polypeptide probe comprising a full-length or fragment of a protein encoded by CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, Deaminase Domain, FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789. In one embodiment, the polypeptide probe comprises SEQ ID NO: 2, 5, 9, 11, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, or a fragment thereof. In another embodiment, the polypeptide probe comprises a polypeptide sequence encoded by SEQ ID NO: 16, 19, 70, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 83, 84, or a fragment thereof.
[0011] In one embodiment, the plurality of probes comprises a polypeptide probe comprising a full-length or fragment of a protein encoded by CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, or Deaminase Domain. In one embodiment, the plurality of probes comprises a polypeptide probe comprising a polypeptide sequence selected from SEQ ID NO: 2, 5, 56, 57, 58, 59, 61, 62, 63, 64, 65, 66, 67, 68, or 69. In one embodiment, the plurality of probes comprises a polypeptide probe comprising a polypeptide sequence encoded by SEQ ID NO: 16, 19, 70, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 83, or 84.
[0012] In one embodiment, the plurality of probes comprises a polypeptide probe comprising a full-length or fragment of a protein encoded by FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789. In one embodiment, the plurality of probes comprises a polypeptide probe comprising a polypeptide sequence selected from SEQ ID NO: 9, 11, 14, or 60. In one embodiment, the plurality of probes comprises a polypeptide probe comprising a polypeptide sequence encoded by SEQ ID NO: 23, 25, 28, 71, or 75.
[0013] In another embodiment, an antibody profiling panel comprising: a plurality of polypeptide probes, wherein at least one of the polypeptide probes comprises a full-length or fragment of a protein that is DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, or Hemk1; and each of the probes in the plurality of polypeptide probes is capable of being specifically bound by an antibody, is disclosed herein. In another embodiment, the plurality of probes further comprises a polypeptide probe comprising a full-length or fragment of a protein encoded by eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789. In one embodiment, the polypeptide probe comprises a sequence listed in Table 1 or 2, such as SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or a fragment thereof. In one embodiment the antibody is an autoantibody. In another embodiment the antibody is a human autoantibody. In one embodiment the presence of a human autoantibody that binds to a polypeptide probe is indicative of cancer (e.g. an expression level for one or more autoantibodies is indicative of the presence, absence, or stage of the cancer). In another embodiment the quantity or level of a human autoantibody that binds to a polypeptide probe is indicative of cancer. In one embodiment the cancer is a prostate, lung, breast or colon cancer.
[0014] In another embodiment, one or more of the probes is displayed by a phage. In one embodiment, the one or more probes is attached to a substrate, such as attached via a phage. In another embodiment, the substrate is an array. In yet another embodiment, the panel comprises at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 different probes. In one embodiment, the panel characterizes a cancer, such as prostate cancer, with at least 80% sensitivity and specificity. In another embodiment, the panel screens for a cancer, such as prostate cancer, with at least 80% sensitivity and specificity.
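As an illustration of the sensitivity and specificity figures recited above, the following is a minimal sketch, not a method taken from this disclosure, of how the two quantities are conventionally computed from paired outcomes and compared against an 80% threshold. The function names and the example data are hypothetical.

```python
def sensitivity_specificity(has_cancer, test_positive):
    """Return (sensitivity, specificity) from two equal-length boolean sequences.

    has_cancer[i]    -- reference diagnosis for subject i (hypothetical input)
    test_positive[i] -- panel call for subject i (hypothetical input)
    """
    if len(has_cancer) != len(test_positive):
        raise ValueError("inputs must have the same length")
    pairs = list(zip(has_cancer, test_positive))
    tp = sum(1 for truth, call in pairs if truth and call)          # true positives
    fn = sum(1 for truth, call in pairs if truth and not call)      # false negatives
    tn = sum(1 for truth, call in pairs if not truth and not call)  # true negatives
    fp = sum(1 for truth, call in pairs if not truth and call)      # false positives
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one diseased and one non-diseased subject")
    return tp / (tp + fn), tn / (tn + fp)


def meets_threshold(sensitivity, specificity, threshold=0.80):
    """True when both values reach the threshold (0.80 mirrors 'at least 80%')."""
    return sensitivity >= threshold and specificity >= threshold
```

Under these assumptions, a panel that correctly calls 9 of 10 cancer subjects and 8 of 10 non-cancer subjects has a sensitivity of 0.9 and a specificity of 0.8, and so satisfies the 80% criterion.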
[0015] Also provided herein is a method of characterizing or screening a subject for a cancer, such as prostate cancer, lung cancer, breast cancer or colon cancer. In one embodiment, the method comprises detecting in a sample obtained from a subject a presence or level of one or more antibodies to one or more polypeptide probes comprising a full-length or a fragment of a protein encoded by DCHS1, CEP164, KBTBD6, RPS19, RPL34, SFRS14, RNA binding protein 6, or Hemk1; and characterizing or identifying the prostate cancer based on a presence or level of the one or more antibodies. In one embodiment, the method further comprises detecting a presence, absence or level of one or more antibodies to one or more polypeptide probes comprising a full-length or a fragment of a protein encoded by eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789. In one embodiment the antibody is an autoantibody. In another embodiment the antibody is a human autoantibody.
[0016] In another embodiment, the method comprises detecting in a sample obtained from a subject a presence or level of one or more antibodies to one or more polypeptide probes comprising a full-length or a fragment of a protein encoded by CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, or Deaminase Domain; and characterizing the prostate cancer based on a presence or level of the one or more antibodies. In one embodiment, the method further comprises detecting a presence, absence or level of one or more antibodies to one or more polypeptide probes comprising a full-length or a fragment of a protein encoded by FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789. In one embodiment the subject is a human. In one embodiment the antibody is an autoantibody. In another embodiment the antibody is a human autoantibody. In one embodiment the presence of a human autoantibody that binds to a polypeptide probe is indicative of cancer (e.g. an expression level for one or more autoantibodies is indicative of the presence, absence, or stage of the cancer). In another embodiment the quantity or level of a human autoantibody that binds to a polypeptide probe is indicative of cancer. In one embodiment the cancer is a prostate, lung, breast or colon cancer.
[0017] Also provided herein is a method of obtaining a biopsy, wherein a determination of whether a biopsy should be obtained is based on detecting an expression level for an antibody. In one embodiment, a subject suspected of having cancer based on an expression level of an antibody is recommended to have a biopsy obtained. In another embodiment, a biological sample is obtained from a subject with a PSA level of greater than about 2.5 ng/mL, and the sample is contacted with one or more probes for an antibody, and based on the expression level of an antibody, a biopsy is obtained or recommended for the subject. In one embodiment, the subject has a PSA level between about 2.5 ng/mL and about 10 ng/mL. In one embodiment the subject is a human. In one embodiment the antibody is an autoantibody. In another embodiment the antibody is a human autoantibody.
[0018] In one embodiment, the method further comprises contacting a biological sample obtained from the subject with one or more probes for a second antibody when the biopsy provides a positive result for a cancer, such as prostate cancer, and based on the expression level of the second antibody, a prognosis or theranosis is provided. In one embodiment the subject is a human. In one embodiment the second antibody is an autoantibody. In another embodiment the second antibody is a human autoantibody.
[0019] Also provided herein is a method of characterizing, identifying, or screening for a cancer in a subject. In one embodiment, the method comprises detecting an expression level for one or more antibodies, wherein the expression level of the one or more antibodies is indicative of the presence, absence, or stage of the cancer. In another embodiment, the indication is whether the cancer is aggressive or indolent. In one embodiment, the method of identifying a cancer as aggressive or indolent comprises: obtaining a positive biopsy result for cancer from the subject; contacting a biological sample obtained from the subject with one or more probes for an antibody; detecting an expression level for the antibody; and characterizing or identifying the cancer as aggressive or indolent based on the expression level of the antibody. In one embodiment the subject is a human. In one embodiment the antibody is an autoantibody. In another embodiment the antibody is a human autoantibody. In one embodiment the presence of a human autoantibody that binds to a polypeptide probe is indicative of cancer (e.g. an expression level for one or more autoantibodies is indicative of the presence, absence, or stage of the cancer). In another embodiment the quantity or level of a human autoantibody that binds to a polypeptide probe is indicative of cancer. In one embodiment the cancer is a prostate, lung, breast or colon cancer.
INCORPORATION BY REFERENCE
[0020] All publications and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.
BRIEF DESCRIPTION OF THE DRAWINGS
[0021] The novel features of the disclosure are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present disclosure will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the disclosure are utilized, and the accompanying drawings of which:
[0022] FIG. 1 is a schematic depicting detection of the expression of one or more autoantibodies in a sample from a subject with a PSA level greater than 2.5 ng/mL ("Autoantibody Test I"). If the result of Autoantibody Test I is negative, obtaining a biopsy from the subject for further analysis is not recommended. If the result of Autoantibody Test I is positive, a biopsy is obtained. If the biopsy is positive for prostate cancer, expression of one or more autoantibodies is detected in a sample from the subject to characterize the cancer as aggressive or indolent, and a prognosis or theranosis is provided.
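The decision flow described for FIG. 1 can be sketched as a small function. This is an illustrative reading only, assuming that a first autoantibody test gates the biopsy and that a second autoantibody detection follows a positive biopsy; the function name, argument names, and returned strings are hypothetical, and branches the description does not cover are reported as outside the schematic rather than filled in.

```python
from typing import Optional

# "greater than 2.5 ng/mL" as stated in the FIG. 1 description
PSA_THRESHOLD_NG_PER_ML = 2.5


def fig1_next_step(psa_ng_per_ml: float,
                   first_test_positive: Optional[bool] = None,
                   biopsy_positive: Optional[bool] = None) -> str:
    """Return the next step suggested by the FIG. 1 description.

    None means the corresponding result is not yet available.
    """
    if psa_ng_per_ml <= PSA_THRESHOLD_NG_PER_ML:
        # The description only covers subjects above the PSA threshold.
        return "outside schematic: PSA not above 2.5 ng/mL"
    if first_test_positive is None:
        return "run first autoantibody test"
    if not first_test_positive:
        return "biopsy not recommended"
    if biopsy_positive is None:
        return "obtain biopsy"
    if not biopsy_positive:
        # A negative biopsy is not addressed in the description.
        return "outside schematic: negative biopsy not described"
    return "run second autoantibody detection to characterize cancer as aggressive or indolent"
```

For example, `fig1_next_step(4.0, True)` returns `"obtain biopsy"`, while `fig1_next_step(4.0, False)` returns `"biopsy not recommended"`.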
[0023] FIG. 2 lists the nucleic acid sequence for DCHS1 (SEQ ID NO: 29).
[0024] FIG. 3 lists the nucleic acid sequence for Centrosomal Protein (CEP 164) (SEQ ID NO: 30).
[0025] FIG. 4 lists the nucleic acid sequence for KBTBD6 (SEQ ID NO: 31).
[0026] FIG. 5 lists the nucleic acid sequence for RPS19 (SEQ ID NO: 32).
[0027] FIG. 6 lists the nucleic acid sequence for RPL34 (SEQ ID NO: 33).
[0028] FIG. 7 lists the nucleic acid sequence for Hemk1 (SEQ ID NO: 34).
[0029] FIG. 8 lists the nucleic acid sequence for eIF4G1 (SEQ ID NO: 35).
[0030] FIG. 9 lists the nucleic acid sequence for BMI1 (SEQ ID NO: 36).
[0031] FIG. 10 lists the nucleic acid sequence for BRD2 (SEQ ID NO: 37).
[0032] FIG. 11 lists the nucleic acid sequence for RP3-323M22 (Nucleolin) (SEQ ID NO: 38).
[0033] FIG. 12 lists the nucleic acid sequence for SFRS14 (SEQ ID NO: 39).
[0034] FIG. 13 lists the nucleic acid sequence for LOC388789 (SEQ ID NO: 40).
[0035] FIG. 14 lists the nucleic acid sequence for RNA binding motif protein 6 (genomic DNA sequence) (SEQ ID NO: 41).
[0036] FIG. 15 lists the nucleic acid sequence for BRMSL1 (SEQ ID NO: 42).
[0037] FIG. 16 lists the nucleic acid sequence for NKX3-1 (SEQ ID NO: 43).
[0038] FIG. 17 lists the nucleic acid sequence for RPSA (SEQ ID NO: 44).
[0039] FIG. 18 lists the nucleic acid sequence for Cytochrome C Oxidase 5 subunit (SEQ ID NO: 45).
[0040] FIG. 19 lists the nucleic acid sequence for FAM53B (SEQ ID NO: 46).
[0041] FIG. 20 lists the nucleic acid sequence for a fragment of the UTR region of chromosome 11 (Homo sapiens genomic DNA, chromosome 11 clone: CTD-2579L12, NTs 149521-151500) (SEQ ID NO: 47).
[0042] FIG. 21 lists the nucleic acid sequence for MAPKKK9 (SEQ ID NO: 48).
[0043] FIG. 22 lists the nucleic acid sequence for cDNA clone XR_113641.1 (Homo sapiens hypothetical LOC643783, transcript variant 2 (LOC643783), partial miscRNA) (SEQ ID NO: 49).
[0044] FIG. 23 lists the nucleic acid sequence for PSA (SEQ ID NO: 50).
[0045] FIG. 24 lists the nucleic acid sequence for H2aa4 (SEQ ID NO: 51).
[0046] FIG. 25 lists the nucleic acid sequence for UBE2I (SEQ ID NO: 52).
[0047] FIG. 26 lists the nucleic acid sequence for TIMP2 (SEQ ID NO: 53).
[0048] FIG. 27 lists the nucleic acid sequence for WDR77 (SEQ ID NO: 54).
[0049] FIG. 28 lists the nucleic acid sequence for a fragment of Deaminase Domain Cont 1 (Human DNA sequence from clone RP1-20N2 on chromosome 6q24 Contains the gene for a novel protein similar to yeast and bacterial cytosine deaminase, NTs 48121-50100) (SEQ ID NO: 55).
[0050] FIG. 29 lists the nucleic acid sequence for Lamin A/C (SEQ ID NO: 85).
[0051] FIG. 30 lists the nucleic acid sequence for Lsm3 (SEQ ID NO: 86).
[0052] FIG. 31 lists the nucleic acid sequence for a fragment of cDNA clone Chromosome 19, which encompasses the nucleic acid sequence for DAZ associated protein (Homo sapiens chromosome 19 clone CTB-25B13, NTs 20521-22500) (SEQ ID NO: 87).
[0053] FIG. 32 lists the nucleic acid sequence for ADAM metallopeptidase domain 9 (SEQ ID NO: 88).
[0054] FIG. 33 lists the nucleic acid sequence for AZGP1 (SEQ ID NO: 89).
[0055] FIG. 34 lists the nucleic acid sequence for Desmocollin 3 (SEQ ID NO: 90).
[0056] FIG. 35 lists the nucleic acid sequence for PERP (SEQ ID NO: 91).
[0057] FIG. 36 lists the nucleic acid sequence for Chromosome 3 UTR region ropporin/RhoEGF (Homo sapiens 3 BAC RP11-783D3 (Roswell Park Cancer Institute Human BAC Library) NTs 178621-180600) (SEQ ID NO: 92).
[0058] FIG. 37 lists the nucleic acid sequence for Cox5a (SEQ ID NO: 93).
[0059] FIG. 38 lists the nucleic acid sequence for a Mitochondrion sequence (Homo sapiens isolate PD047 mitochondrion, NTs 4801-6780) (SEQ ID NO: 94).
[0060] FIG. 39 lists the nucleic acid sequence for MYH9 (SEQ ID NO: 95).
[0061] FIG. 40 lists the nucleic acid sequence for ASND1 (SEQ ID NO: 96).
[0062] FIG. 41 lists the nucleic acid sequence for Cathepsin F (SEQ ID NO: 97).
[0063] FIG. 42 lists the nucleic acid sequence for Mastermind-like 2 (Homo sapiens genomic DNA, chromosome 11q clone:RP11-82212, NTs 157801-159780) (SEQ ID NO: 98).
[0064] FIG. 43 lists the nucleic acid sequence for CSNK2A2 (SEQ ID NO: 99).
[0065] FIG. 44 lists the nucleic acid sequence for AURKAIP1 (SEQ ID NO: 100).
[0066] FIG. 45 lists the nucleic acid sequence for a fragment of Chromosome 4 (Homo sapiens BAC clone RP11-327017 from 4, NTs 107401-109380) (SEQ ID NO: 101).
[0067] FIG. 46 lists the nucleic acid sequence for ARF6 (SEQ ID NO: 102).
[0068] FIG. 47 lists the nucleic acid sequence for JAG1 (Human DNA sequence from clone RP1-278O22 on chromosome 20 Contains two novel genes, NTs 26161-26140) (SEQ ID NO: 103).
[0069] FIG. 48 lists the nucleic acid sequence for a Mitochondrion sequence (Homo sapiens isolate PD047 mitochondrion, NTs 2041-4020) (SEQ ID NO: 104).
[0070] FIG. 49 lists the nucleic acid sequence for a fragment of Chromosome 20 (Human DNA sequence from clone RP1-278O22 on chromosome 20 Contains two novel genes, NTs 25321-27300) (SEQ ID NO: 105).
[0071] FIG. 50 lists the nucleic acid sequence for a fragment of Chromosome 6 UTR region (Human DNA sequence from clone RP3-523G1 on chromosome 6p22.3-24.1, NTs 34621-36600) (SEQ ID NO: 106).
[0072] FIG. 51 lists the nucleic acid sequence for a fragment of MAPKKK5 (SEQ ID NO: 107).
[0073] FIG. 52 lists the nucleic acid sequence for RASA1 (SEQ ID NO: 108).
[0074] FIG. 53 lists the nucleic acid sequence for Hsp90b (SEQ ID NO: 109).
[0075] FIG. 54 lists the nucleic acid sequence for ribosomal protein S6 (RPS6) (SEQ ID NO: 110).
[0076] FIG. 55 lists the nucleic acid sequence for a fragment of Homo sapiens chromosome 3 (Homo sapiens 3 BAC RP13-616I3 (Roswell Park Cancer Institute Human BAC Library) NTs 22921-24900) (SEQ ID NO: 111).
DETAILED DESCRIPTION
[0077] The present disclosure relates to compositions and methods for characterizing a cancer or screening for a cancer. Provided herein are tests which can be used to analyze a presence or absence of an antibody from a subject, such as a subject being tested or screened for a cancer. In one embodiment, an antibody is an autoantibody. In another embodiment, the test comprises a single antigen, thus detecting only an antibody that binds to that antigen. In another embodiment, a panel of antigens is constructed such that the panel tests for a presence of one or more antibodies which specifically bind to two or more antigens derived from proteins associated with a specific cancer, such as lung cancer, prostate cancer, or ovarian cancer. By detecting an antibody to a protein associated with a disease state, the compositions and methods provided herein allow for the characterization of a cancer.
[0078] A cancer is characterized for a subject using a composition or method disclosed herein. In one embodiment, a subject is an individual or patient. In one embodiment, a subject is a human. In another embodiment, a subject is a patient, such as a cancer patient. In one embodiment, a subject exhibits no symptom of cancer, such as no symptoms of prostate cancer. In another embodiment, a subject has no detectable symptom of cancer, such as no detectable symptoms of prostate cancer. In yet another embodiment, a subject exhibits a symptom of cancer, such as a symptom of prostate cancer.
[0079] Characterizing a cancer, or screening for a cancer, can include detecting the cancer (including pre-symptomatic early-stage detection), determining the prognosis, diagnosis, or theranosis of the cancer, or determining the stage or progression of the cancer. In one embodiment, a prognosis is predicting or giving a likelihood of outcome of a disease or condition, such as an extent of malignancy of a cancer, a likelihood of survival, or life expectancy, such as in an individual with prostate cancer. In another embodiment, a prognosis is a prediction or likelihood analysis of cancer progression, cancer recurrence, or metastatic spread or relapse.
[0080] In one embodiment, a diagnosis is a prediction or likelihood that an individual or subject has a disease or condition, such as prostate cancer. In one embodiment, the individual is an asymptomatic individual. In another embodiment, the individual is a symptomatic individual.
[0081] In one embodiment, a theranosis is a therapy selected based on an outcome of determining a binding of one or more antibodies from a sample from a subject to an antigen or polypeptide probe as described herein. In one embodiment, a theranosis is identifying an appropriate treatment or treatment efficacy for a cancer. In one embodiment, a theranosis is modifying a treatment. In another embodiment, a theranosis is selecting a treatment regimen. In yet another embodiment, a theranosis is discontinuing or not selecting a particular treatment regimen. In one embodiment, a treatment regimen or therapeutic agent is selected based on the presence or absence of an autoantibody that binds to polypeptide probes described herein. In one embodiment, the autoantibody is a human autoantibody. In one embodiment, a treatment regimen or therapeutic agent is excluded based on the presence or absence of an autoantibody that binds to polypeptide probes described herein. In one embodiment, the autoantibody is a human autoantibody.
[0082] In yet another embodiment, characterizing or screening for a cancer is detecting the cancer, such as pre-symptomatic early-stage detection. In one embodiment, characterizing a cancer is determining the stage or progression of the cancer, such as early-stage, late-stage, or advanced-stage cancer. Characterizing or screening for a cancer can also be determining the likelihood or possibility that an individual has a cancer. Characterizing or screening for a cancer can also be identification of a cancer, such as determining whether expression of one or more antibodies is indicative of the cancer.
[0083] In one embodiment, an antigen panel is used to detect a presence of one or more antibodies to one or more proteins, antigens, mimotopes, or epitopes. In one embodiment, one or more polypeptide probes described herein is a protein or fragment thereof. In another embodiment, one or more polypeptide probes described herein comprises an antigen, mimotope, or epitope. A "mimotope" can mimic the epitope of a protein or peptide. In one embodiment, the mimotope is structurally similar to an antigen or epitope of an expressed protein, but is unrelated or weakly related at the protein sequence level.
[0084] In one embodiment, the antigen panel comprises one or more polypeptide probes comprising a polypeptide sequence selected from SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof. In another embodiment, the antigen panel comprises one or more polypeptide probes comprising a sequence encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof. In yet another embodiment, the polypeptide probe comprises the full-length or a fragment of a protein that is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
[0085] In one embodiment, the antigen panel comprises one or more polypeptide probes derived from one or more proteins encoded by one or more genes selected from: CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, Deaminase Domain, FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789. In one embodiment, detection of one or more antibodies is used to detect a presence of prostate cancer in a subject.
[0086] In one embodiment, the antigen panel comprises one or more polypeptide probes derived from one or more proteins encoded by one or more genes selected from: DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, Hemk1, eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, and LOC388789. In one embodiment, detection of one or more antibodies is used to detect a presence of prostate cancer in a subject.
[0087] A cancer can also be characterized by determining a presence or absence, or level, of one or more antibodies in a sample. In one embodiment, a sample is obtained from a subject. The subject can be a mammal, including, but not limited to, humans, non-human primates, rodents, and the like. In another embodiment, a sample is a biological fluid. The biological fluid can be, but is not limited to, peripheral blood, sera, or plasma. The sample can be ascites, urine, cerebrospinal fluid (CSF), sputum, saliva, bone marrow, synovial fluid, aqueous humor, amniotic fluid, cerumen, breast milk, bronchoalveolar lavage fluid, semen, prostatic fluid, Cowper's fluid or pre-ejaculatory fluid, female ejaculate, sweat, fecal matter, hair, tears, cyst fluid, pleural and peritoneal fluid, pericardial fluid, lymph, chyme, chyle, bile, interstitial fluid, menses, pus, sebum, vomit, vaginal secretions, mucosal secretion, stool water, pancreatic juice, lavage fluids from sinus cavities, or bronchopulmonary aspirates.
[0088] In one embodiment, the level, presence, or absence of an antibody can be determined by detecting the binding of one or more antibodies to a polypeptide probe. In one embodiment, an antibody is an autoantibody. An autoantibody refers to an antibody produced by a host (with or without immunization) and directed to a host antigen (such as a tumor antigen). Tumor-associated antigens recognized by humoral effectors of the immune system are an attractive target for diagnostic and therapeutic approaches to human cancer.
[0089] The binding of an antibody with a polypeptide probe can be specific, such that the interaction of the autoantibody with the polypeptide probe is dependent upon a presence of a particular structure (i.e., the antigenic determinant or epitope) of the polypeptide probe. Antigenic determinants or epitopes can comprise amino acids in linear or non-linear sequence in a polypeptide probe and can also comprise one or more amino acids which are in proximity to each other via protein folding (e.g., conformational epitopes). Thus, a single polypeptide or protein can potentially be bound by multiple antibodies which recognize different epitopes. In some instances, known epitopes of a particular polypeptide can be used as a probe to detect a presence, absence or level of autoantibodies which bind a particular epitope.
[0090] The polypeptide probe can be an antigen identified through serologic identification of antigens, for example by recombinant expression cloning (SEREX), such as described by Kim et al., Biotech. Lett. (2004); 26: 585-588. Generally, in this method, an antigen can be identified by screening expression cDNA libraries from human solid tumors with sera of autologous patients. This type of screening of a cDNA expression library by conventional methods typically requires the preparation of a large number of membrane filters blotted with bacteriophage plaques that are then searched with a specific probe. In the case of the SEREX experiments, the screening is performed using sera from cancer patients, which can be in very limited quantities.
[0091] A polypeptide probe for detecting an antibody can also be identified by phage-display technology, which can be based on the insertion of foreign nucleotide sequences into genes encoding various capsid proteins of T7 phage, resulting in a heterogeneous mixture of phages, each displaying the different peptide sequence encoded by a corresponding insert. A physical link between a displayed fusion protein and the DNA encoding it makes this phage target selectable. The phage target can express or display a polypeptide probe, which can be used to detect antibodies that are produced by a subject, or autoantibodies, which can then be used to detect or characterize a cancer. The polypeptide probe can be displayed by a phage and used to detect an antibody from a sample obtained from a subject. In one embodiment, an antibody is an autoantibody.
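The correspondence between an insert and the peptide it displays can be illustrated with a short Python sketch. This is a minimal illustration under stated assumptions, not the method of the disclosure: it assumes the standard genetic code, and the helper name `translate` and the reading-frame offset of 4 are hypothetical choices made for this example (the offset was chosen by inspection so that the translation matches the peptide listed for clone 37A8 in Table 1). The input is the first 31 nucleotides of that clone's DNA sequence (SEQ ID NO: 17), and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate dna starting at offset, stopping at the first stop codon.

    Codons containing an ambiguous base (for example N) are reported as X.
    """
    peptide = []
    for i in range(offset, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3].upper(), "X")
        if residue == "*":  # stop codon ends the displayed peptide
            break
        peptide.append(residue)
    return "".join(peptide)


# First 31 nucleotides of the clone 37A8 insert (SEQ ID NO: 17, Table 1).
INSERT_37A8 = "GAATTCGTCATTCTCACCTTTGAATTAAAGC"

# The offset of 4 is an assumption made for illustration only.
displayed = translate(INSERT_37A8, offset=4)
print(displayed)  # SSFSPLN (SEQ ID NO: 3)
```

Because each phage carries the DNA that encodes the peptide it displays, sequencing a selected clone and translating the insert in this way recovers the sequence of the polypeptide probe.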
Polypeptide Probes
[0092] Provided herein are compositions and methods for detecting one or more antibodies in a sample using one or more polypeptide probes. Polypeptide is used in its broadest sense and can include a sequence of subunit amino acids, amino acid analogs, or peptidomimetics. The subunits can be linked by peptide bonds. The polypeptides can be naturally occurring, processed forms of naturally occurring polypeptides (such as by enzymatic digestion), chemically synthesized or recombinantly expressed. The polypeptides for use in the methods of the present invention can be chemically synthesized using standard techniques. The polypeptides can comprise D-amino acids (which are resistant to L-amino acid-specific proteases), a combination of D- and L-amino acids, β amino acids, or various other designer or non-naturally occurring amino acids (e.g., β-methyl amino acids, Cα-methyl amino acids, and Nα-methyl amino acids) to convey special properties. Synthetic amino acids can include ornithine for lysine, and norleucine for leucine or isoleucine. In addition, the polypeptides can have peptidomimetic bonds, such as ester bonds, to prepare polypeptides with novel properties. For example, a polypeptide can be generated that incorporates a reduced peptide bond, i.e., R1-CH2-NH-R2, where R1 and R2 are amino acid residues or sequences. A reduced peptide bond can be introduced as a dipeptide subunit. Such a polypeptide can be resistant to protease activity, and can possess an extended half-life in vivo. A polypeptide can also include a peptoid (N-substituted glycines), in which one or more side chains are appended to nitrogen atoms along the molecule's backbone, rather than to the α-carbons, as in amino acids. Polypeptide and peptide are intended to be used interchangeably throughout this application, i.e., where the term peptide is used, it can also include polypeptide, and where the term polypeptide is used, it can also include peptide.
[0093] In one embodiment, a polypeptide probe can be a fragment or portion of a larger protein. A fragment can range in size from two amino acid residues to the entire amino acid sequence minus one amino acid. In one embodiment, a polypeptide probe is a fragment encoded by an untranslated region (UTR) of a gene, such as a fragment encoded by a nucleic acid sequence from the 5' or 3' UTR of the gene.
[0094] The fragment can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 amino acids in size. In one embodiment, the fragment is less than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 amino acids in size. A polypeptide probe useful in the compositions and methods herein, regardless of size, is capable of specific interaction with an antibody, such as an autoantibody.
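The fragment size range described above (from two residues up to the full sequence minus one residue) can be made concrete with a short Python sketch. This is an illustration only: the function name `fragments` is hypothetical, and the sketch assumes that a fragment is a contiguous run of residues, which the text does not state explicitly. The example peptide is SEQ ID NO: 3 from Table 1.

```python
def fragments(peptide, min_len=2, max_len=None):
    """Yield contiguous fragments of peptide, shortest first.

    By default the sizes run from two residues up to the full length
    minus one residue. Contiguity is an assumption of this sketch.
    """
    n = len(peptide)
    if max_len is None:
        max_len = n - 1
    max_len = min(max_len, n - 1)  # never return the full-length sequence
    for size in range(min_len, max_len + 1):
        for start in range(n - size + 1):
            yield peptide[start:start + size]


# SEQ ID NO: 3 (clone 37A8, Table 1) has seven residues.
frags = list(fragments("SSFSPLN"))
print(len(frags))  # 20 fragments of 2 to 6 residues
print(frags[:3])   # ['SS', 'SF', 'FS']
```

Passing `max_len` restricts the enumeration to an upper size bound, in the spirit of the "less than about" embodiments above.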
[0095] In one embodiment, a polypeptide probe can be a fragment of a protein encoded by a gene, or a region upstream or downstream of a coding sequence, such as a UTR region, of a gene listed in Table 1, Table 2, Table 3 or Table 4. In one embodiment, the polypeptide probe comprises a polypeptide sequence selected from SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof. In another embodiment, the polypeptide probe comprises a polypeptide sequence encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof. In yet another embodiment, the polypeptide probe comprises the full-length or a fragment of a protein that is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
[0096] In one embodiment, a polypeptide probe is a fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene.
[0097] In one embodiment, the gene can be CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, or Deaminase Domain. In another embodiment, the gene is FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789.
[0098] In another embodiment, a polypeptide probe comprises SEQ ID NO: 2, 5, 9, 11, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, or a fragment thereof. In another embodiment, a polypeptide probe comprises a polypeptide encoded by SEQ ID NO: 16, 19, 23, 25, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, or a fragment thereof.
[0099] In one embodiment, the gene can be DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, or Hemk1. In another embodiment, the gene is eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789. A polypeptide probe can comprise a peptide sequence, or fragment thereof, such as those listed in Tables 1, 2, 3 or 4.
[0100] In one embodiment, a polypeptide probe comprises SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or a fragment thereof. In another embodiment, a polypeptide probe comprises a polypeptide encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, or a fragment thereof.
TABLE-US-00001 TABLE 1
Clone ID | Gene | NCBI Designation | Gene Sequence | Peptide Sequence | Clone DNA Sequence (Encoding Peptide Sequence)
2E11 | DCHS1 (protocadherin-16 precursor) | AB384634.1 | FIG. 2 (SEQ ID NO: 29) | PQTTAPRRARPRRS (SEQ ID NO: 1) | AGCTTTCGCTAGAGACGCCTCCATA AGTCACTTGCCCGTTGGCCCCCACG ATCGGGGTCGGTTGCTCGCAGGGC TGAGCAGAGATGTGCCAGGAGGGT TGTTCTCACGCAAGAGGACGCTGT ACTCCTGCTGCTGGAAAGTAGGCG CCTCGTCGTTGACGTCAGCGACACT GACGGTCAGGACCTGCGTGGCCGA GCGCGGCGGGGAGCCGTGGTCTGA GG (SEQ ID NO: 15)
1B4A | Centrosomal Protein (CEP 164) (Minus strand) | NM_014956.4 | FIG. 3 (SEQ ID NO: 30) | PVSSSGSYSTPIRKSLRRAAPPFRA (SEQ ID NO: 2) | TGGAGGAGAGGCTGGGCTGCCCCA AGCCCCTGCTCAGGGCCTCAGAAG CCATACACCTTCACTCTGATTGTGC TCATCAAGGCCCAGCATGCAGGAG GCTCAAAGTAGCTTTTGGCTTGGGT GTTGACGAGAAGAGAGGTAACCTG GGGTCATTCTTGACACGTTCCAGCC ACCTCCGGTTGGCCTCAATTATGCC CTGAAAGGTGGTGCTGCCCGCCTC AGGGACTTGCGAATGGGAGTGCTG TAGGAGCCGGAGCTGCTCACTGG (SEQ ID NO: 16)
37A8 | KBTBD6 | NM_152903.4 | FIG. 4 (SEQ ID NO: 31) | SSFSPLN (SEQ ID NO: 3) | GAATTCGTCATTCTCACCTTTGAAT TAAAGCTTAGACTAAATAGTAATA TATCGTGGGAAGGATTTTGGTTTTG TGATATTTCTGTGAATTAAGGAATA GATGTTAACCATTATTTTGTAGAAA AGTGATTTGTATGTGGTTAATTATA AATAAAACTGGTACCAGAA (SEQ ID NO: 17)
4H10 | RPS19 | NM_001022.3 | FIG. 5 (SEQ ID NO: 32) | AARRPHDAWSYCKRREPAGVXQSSGSLPQKVREAESPRMGGYRQAGQAQRACSLR (SEQ ID NO: 4) | TTTATTAACCCAGCATGGTTTGTTC TAATGCTTCTTGTTGGCAGCTGCCA CCTGTCCGGCGATTCTGTCCAGATC TCTTTGTCCCTGAGGTGTCAGTTTG CGGCCGCCATCTTGGTCCTTTTCCA CCATTTTCAGCCCCTCCAGGGCTTG GAGGACCCGGCGGGCCACACTCTT GGAGCCTCGGCTGAAGTGGCTGGG CATGACGCCGTTTCTCTGACGTCCC CCATAGATCTTGGTCATGGAGCCA ACCCCAGCGCCACCCCGGAGGTAC AGGTGCCGCGCTGTGNAAGCAGCT CGCGTGTAGAACCAGTTCTCATCGT AGGGAGCAAGCTCTTTGTGCTTGGC CAGCTTGACGGTATCCACCCATTCG GGGACTTTCAGCTTCCCGGACTTTT TGAGGAAGGCTGCCAGAGCTCTGA CNAACTCCTGCTGGTTCACGTCTTT TACAGTAACTCCAGGCATCGTGCG GCCTCCGCGCTGC (SEQ ID NO: 18)
3D10 | RPL34 | NM_033625.2 | FIG. 6 (SEQ ID NO: 33) | QARLFIFITQKSFIFLFSFLTLCLCLQHFHNDFLLLDKESTLDPVTNTFSTHGTKTLLLTSLFL (SEQ ID NO: 5) | TTCTCGAGTGCGGCCGCAGCTTGGG TATGGAGACATATCATATAAGTAA TGCTAGGGTCNGTGGTAGGAAGTT TTTTCATAGGAGGTGTATGAGTTGG TCGTAGCGGAATCGGGGGTATGCT GTTCGAATTCATAAGAACAGGGAG GTTAGAAGTAGGGTCTTGGTTCCAT GTGTGCTAAATGTGTTCGTGACAGG ATCAAGCGTGCTTTCCTTATCGAGG AGCAGAAAATCGTTGTGAAAGTGT TGAAGGCACAAGCACAGAGTCAGA AAGCTAAATAAAAAAATGAAACTT TTTTGAGTAATAAAAATGAAAAGA CGCGCTTGA (SEQ ID NO: 19)
40A3 | RNA binding protein 6 (Minus strand) | NT_022517.18 | FIG. 14 (SEQ ID NO: 41) | LRGITKNDRNFNRKIHLNWISK (SEQ ID NO: 6) | CTCTGAGGGGCATCACCAAAAATG ACAGGAATTTCAACAGGAAGATAC ATCTGAATTGGATCTCGAAATAAG GAGTTTGTGTAAGAGAAAAGGAGG ACACAAGCAAGGAGACACAAAAG ACAATTTGTCCAAGAGAGTAGTAG TAGAAACTGACAAAGGTAAGGCTG CTTGGTGGCCGGGTGCAGTGACTC ACGCCTGTAATCCCAGCACTTTGGG AGGCCAAGGCGGGTGGATCACCTG AGGTCAGGAGTTCGAGACCACCCT GACCAACAGGTGAAACCCCTCTCT ACTAAAAATACAAACATTAGCCCA TAGTCCCAGCTACTGGGGAGGCTG AGGCAGGAGAATCGCTTGAACCTG GGAGGCGGAGGTTGCAGTGAGCCA AGATCGTGCCATTGCACTCCAGCCT GGGCGACAGAATGAGACTGTCTCA AAACAAAAGGAAAAAAAAAA (SEQ ID NO: 20)
25C4 | Hemk1 (minus strand) | NM_016173.3 | FIG. 7 (SEQ ID NO: 34) | RGCCAGIRCT (SEQ ID NO: 7) | CACTTCTTCAAGCTCCAACACAAAT GCTGCCTCCTTTAGGATGCCTGCTC TGTGCTCTCCCTGCCTCCCCTAGCC CATACCTCTGCTGGCACCTTCTGTA CCATGCCTTCAGAAACCTTCTTATC CCCCTCATCTCTGGGGCCCCCTGTG GATCTGGCATACCCAAGTTCAGTA AATGTCTATCAGTAAGCTGATGGTA CATGCATTTTCTAGAATAGAGCTGG GACTTCCCATGTGGCCCACATCTGA CCTGGCAGCCCATGTATTCCGGTCA TTAGGGATGGGAAGCCATGAGGAC CTGGCCTTCTGCCCGACCCAGGCAG CCATTCAAGTTGAGCAATGGCCACT TCGAAGACTCAAGTGCACCTGATC CCTGCGCAACAGCCAC (SEQ ID NO: 21)
TABLE-US-00002 TABLE 2
Clone | Gene | NCBI Designation | Gene Sequence | Peptide Sequence | Clone DNA Sequence (Encoding Peptide Sequence)
24E1 | eIF4G1 | NM_182917.3 | FIG. 8 (SEQ ID NO: 35) | IRDPNQGGKDITEEIMSGARTASTPTPPQTGGGLEPQANGETPQVAVIVRPDDRSQGAIIADRPGLPGPEHSPSESQPSSPSPTPSPSPVLEPGSEPNLAVLSIPGDTMTTIQMSVEE (SEQ ID NO: 8) | TTCTTCTACAGACATTTGTATAGT TGTCATAGTGTCCCCAGGAATAG AGAGGACTGCGAGATTAGGCTCA GACCCCGGTTCCAAGACTGGGGA TGGTGATGGGGTCGGAGAAGGCG ACGAAGGCTGGGATTCTGAAGGG CTATGCTCTGGGCCAGGCAGCCC TGGCCGGTCAGCAATGATTGCTC CCTGTGACCGGTCATCTGGCCGG ACAATGACAGCAACCTGGGGCGT CTCCCCATTAGCTTGAGGCTCCAG ACCGCCTCCCGTCTGGGGAGGGG TGGGTGTGGAGGCAGTGCGGGCC CCAGACATGATCTCCTCTGTGATA TCCTTTCCTCCTTGGTTTGGATCT CGAATTCGGATC (SEQ ID NO: 22)
3C4 | 5'-UTR BMI1 | BC011652.2 | FIG. 9 (SEQ ID NO: 36) | GGGRGAGGGRGAGAGGGRPEAA (SEQ ID NO: 9) | ATCACAAATAGGACAATACTTGC TGGTCTCCAGGTAACGAACAATA CACGTTTTACAGAAGGAATGTAG ACATTCTATTATGGTTGTGGCATC AATGAAGTACCCTCCACAAAGCA CACACATCAGGTGGGGATTTAGC TCAGTGATCTTGATTCTCGTTGTT CGATGCATTTCTGCTTGATAAAAA ATCCCGGAAAGAGCAGCCGGCGC GAGGCGATCGAAGCGGGCGGAA AAGACAATGAAAGTTAAAAGTCG TTCAGCAGAAAATGAATGCGAGC CAAGCGGCCATCTTGAAGCGAGC TGCAGACGCCGCTGTCAATGGGC AACCAGCGCGGCCCCGAGCAGCC GCGGCCGCCACGCTCGTCTCATG CCGCCTCCGGCCGGCCTCCTCCTG CTCCGGCGCCTCGGCCTCCTCCGG CGCCTCGGCCTCCTCCTCCTCCGC CTCCGCCTCGACCTCCAACGCCTC CTCCTCCGGGGCCTCCTCCTCCTC CTCCTCGGC (SEQ ID NO: 23)
8A6 | BRD2 | BX908719.9 | FIG. 10 (SEQ ID NO: 37) | ESRPMSYDEKRQLSLDINKLPGEKLGRVVHIIQAREPSLRDSNPEEIEIDFETLKPSTLRELERYVLSCLRKKPRKPYSTYEMRFISWF (SEQ ID NO: 10) | TGTAGGGCTTCCGGGGTTTCTTAC GTAGGCAGGAAAGGACATAGCGC TCAAGCTCTCTAAGTGTGGATGG CTTGAGTGTTTCAAAATCAATCTC AATCTCTTCTGGGTTTGAATCACG TAAAGAGGGCTCCCTGGCTTGGA TTATATGCACAACTCGGCCCAGCT TCTCCCCAGGTAATTTGTTGATGT CCAGGCTCAGCTGCCGCTTCTCAT CGTAACTCATGGGCCTGCTCTC (SEQ ID NO: 24)
15F1 | RP3-323M22 (Nucleolin) | NM_005381.2 | FIG. 11 (SEQ ID NO: 38) | LVSILLTKTIY (SEQ ID NO: 11) | TTACTGTTACCTGATCAATGACAG AGCCTTCTGAGGACATTCCAAGA CAGTATACAGTCCTGTGGTCTCCT TGGAAATCCGTCTAGTTAACATTT CAAGGGCAATACCGTGTTGGTTTT GACTGGATATTCATATAAACTTTT TAAAGAGTTGAGTGATAGAGCTA ACCCTTATCTGTAAGTTTTGAATT TATATTGTTTCATCCCATGTACAA AACCATTTTTTCCTACAAATAGTT TGGGTTTTGTTGTTGTTTCTTTTTT TTGTTTTGTTTTTGTTTTTTTTTTTT TTGCGTTCGTGGGGTTGTAAAAG AAAAGAAAGCAGAATGTTTTATC ATGGTTTTTGCTTCAGCGGCTTTA GGACAAATTAAAAG (SEQ ID NO: 25)
6E2 | SFRS14 | NM_001017392.3 | FIG. 12 (SEQ ID NO: 39) | KAECFKNLIVKKQKSLCSGFKEHLNEASILAQVSVSSSKRVWKSWENLISSFMVWNPAHLIISIPNLEKTSDLSMMSKLAAALE (SEQ ID NO: 12) | AAGCAGAGTGCTTTAAAAATTTG ATAGTAAAAAAGCAAAAATCTCT GTGCTCTGGTTTTAAGGAACATTT GAATGAGGCAAGCATTTTAGCAC AGGTTTCTGTTTCAAGTTCAAAGA GAGTCTGGAAAAGTTGGGAAAAT TTAATATCATCTTTTATGGTGTGG AATCCTGCCCATTTGATTATTTCT ATCCCAAATCTTGAAAAAACATC AGACTTATCTATGATGTCAAAGCT (SEQ ID NO: 26)
12B2 | 5'-UTR BMI1 | BC011652.2 | FIG. 9 (SEQ ID NO: 36) | QRSGRDNGDVGAGAPFRLSSTSQPRRIKPIAPPPRAPSPEXGAGGGGGGRGGGGGGPGGGGVGGRGGGGGGGGRGAGGGRGAGAGGGRPEAA (SEQ ID NO: 13) | AAGCTTATTATCTCATCATCAGTT ATAATTCTCTTATCTTCATCTGCA ACCTCTCCTCTATCTTCATTAGAG CCATTGGCAGCATCAGCAGAAGG ATGAGCTGCATAAAAATCCCTTCT TCTCTTCATTTCATTTTTGAAAAG CCCTGGAACTAATTTGTATACAAT ATCTTGGAGAGTTTTATCTGACCT TATATTCAGTAGTGGTCTGGTCTT GTGAACTTGGACATCACAAATAG GACAATACTTGCTGGTCTCCAGGT AACGAACAATACACGTTTTACAG AAGGAATGTAGACATTCTATTAT GGTTGTGGCATCAATGAAGTACC CTCCACAAAGCACACACATCAGG NGGGGATTTAGCTCAGTGATCTT GATTCTCGTTGTTCGATGCATTTC TGCTTGATAAAAAATCCCGGAAA GAGCAGCCGGCGCGAGGCGATCG AAGCGGGCGGAAAAGACAATGA AAGTTAAAAGTCGTTCAGCAGAA AATGAATGCGAGCCAAGCGGCCA TCTTGAAGCGAGCTGCAGACGCC GCTGTCAATGGNCAACCAGCGCG GCCCCGAGCAGCCGCGGCCGCCA CGCTCGTCTCATGCCGCCTCCGGC CGGCCTCCTCCTGCTCCGGCGCCT CGGCCTCCTCCGGCGCCTCGGCCT CCTCCTCCTCCGCCTCCGCCTCGA CCTCCAACGCCTCCTCCTCCGCTT GAATTCGGATCCCCGAGCATCAC ACCTGACTGGAATACGAACAGCT CCACATNCNGT (SEQ ID NO: 27)
21D10 | Homo sapiens hypothetical LOC388789 (LOC388789) | BC150559.1 | FIG. 13 (SEQ ID NO: 40) | PASASILAGVPMYRNEFTAWYRRMSVVYGIGTWSVLGSLLYYSRTMAKSSVDQKDGSASEVPSELSERPSLRPHSSN (SEQ ID NO: 14) | TTGGGCGTTCAGAGAGTTCACTG GGTACTTCACTTGCTGAGCCATCC TTTTGGTCTACTGACGACTTCGCC ATTGTCCGGCTATAGTAAAGCAG TGAGCCCAACACAGACCAGGTGC CGATCCCGTAGACCACCGACATC CGCCGGTACCAGGCCGTGAACTC ATTTCGATACATGGGTACGCCAG CGAG (SEQ ID NO: 28)
TABLE-US-00003 TABLE 3
Clone | Gene | NCBI Designation | Gene Sequence | Peptide Sequence | Clone DNA Sequence (Encoding Peptide Sequence)
8E10 | BRMSL1 | NM_032352.3 | FIG. 15 (SEQ ID NO: 42) | APRTRTLRARRSPRMEIAQKWMMKTVKEEEWNVWMKCPILKNSLPISKINFIKND (SEQ ID NO: 56) | TCGTCGAGGCTCCTGCTCCTGTGA CTCTCGAGCAGCCAGAGGCTCCT ACCTCTATCGAGTCTTTACCTACT ACTTCTGACACTTTCTTCTTCTTA CCTTACAAACCTACTTTACAGGTT AGAACTTTTTGTCAAATGGCTAG AGTTTCTAGTTGAAATATTTCTTG CTAATTCAGTCCACCTACGTTTTG ATGTTCTTCAGTATCGACCTTTTC GTGGTCTTATGAACCTTGGCGACC GTTGAAATGTCCTTTTATACGTTT AAGCATGTTTCCATCGTCCTTAGA TATCTCTCGAGACGAATCTTAGAC ATTTCTTGTTTATACTTACACTTT AAGTTCGAA (SEQ ID NO: 70)
1D10 | 5'-UTR-BMI1 | NM_005180.5 | FIG. 9 (SEQ ID NO: 36) | GGRGGGGGGGGRGAGGGRGAGAGGGRPEAA (SEQ ID NO: 9) | GGAGGTCGAGGCGGAGGCGGAG GAGGAGGAGGCCGAGGCGCCGG AGGAGGCCGAGGCGCCGGAGCA GGAGGAGGCCGGCCGGAGGCGG CATGAGACGAGCGTGGCGGCCGC GGCTGCTCGGGGCCGCGCTGGTT GNCCATTGACAGCGGCGTCTGCA GCTCGCTTCAAGATGGCCGCTTG GCTCGCATTCATTTTCTGCTGAAC GACTTTTAACTTTCATTGTCTTTTC CGCCCGCTTCGATCGCCTCGCGCC GGCTGCTCTTTCCGGGATTTTTTA TCAAGCAGAAATGCATCGAACAA CGAGAATCAAGATCACTGAGCTA AATCCCCNCCTGATGTGTGTGCTT TGTGGAGGGTACTTCATTGATGCC ACAACCATAATAGAATGTCTACA TTCCTTCTGTAAAACGTGTATTGT TCGTTACCTGGAGACCAGCAAGT ATTGTCCTATTTGTGATGTCCAAG TTCACAAGACCAGACCACTACTG AATATAAGGTCAGATAAAACTCT CCAAGATATTGTATACAAATTAG TTCCAGGGCTTTTCAAAAATGAA ATGAAGAGAAGAAGGGATTTTTA TGCAGCTCATCCTTCTGCTGATGC TGCCAATGGCTCTAATGAAGATA GAGGAGGACGGTTGCAGATGAAG ATAAGAGAATTATAANCTGATGA TGAGATAATAAGGCTTGCGGCCG CACTCGAGAAACAGT (SEQ ID NO: 71)
1H2 | NKX3-1 | NM_0067167.3 | FIG. 16 (SEQ ID NO: 43) | GTNQRREGKSSGIFQHFV (SEQ ID NO: 57) | GGAGAGAGGGAAAATCAAGTGGT ATTTTCCAGCACTTTGTATGATTT TGGATGAGTTGTACACCCAAGGA TTCTGTTCTGCAACTCCATCCTCC TGTGTCACTGAATATCAACTCTGA AAGAGCAA (SEQ ID NO: 72)
4H9 | RPSA | NM_002295.4 | FIG. 17 (SEQ ID NO: 44) | GKWCHACAELPEPASTTSNPLSELPCCCMGWQCPHSAEENLCYTAQW (SEQ ID NO: 58) | CGGGAAATGGTGCCACGCATGCG CAGAACTTCCCGAGCCAGCATCC ACCACATCAAACCCACTGAGTGA GCTCCCTTGTTGTTGCATGGGATG GCAATGTCCACATAGCGCAGAGG AGAATCTGTGTTACACAGCGCAA TGGTAGGTAGGTTAACATAAGAT GCCTCCGTGAGAGGCTGGTGGTC AGCCCTGGGGTCAGTAACCACAA GAAGCCGTGGCTCCCGGAAGGCT GCCTGGATCTGGTTAGTGAAGGT TCCAGGAGTGAAGCGGCCAGCAA TTGGAGTGGCTCCAGTGGCAGCA GCAAACTTCAGCACAGCCCTCTG GCCAGTATTCCTGGAGGATATAA CACTGACATCAGCAGGGTTTTCA ATGGCAACAATTGCACGAGCTGC CAGCAGAAGCTT (SEQ ID NO: 73)
5B1 | Cytochrome C Oxidase 5 Subunit | NM_004255.3 | FIG. 18 (SEQ ID NO: 45) | INTLVTYDMVPEPKIIDAALRACRRLNDFASTVRILEVVKDKAGPHKEIYPYVIQELRPTLNELGISTPEELGLDKV (SEQ ID NO: 59) | GATAAACACACTTGTTACCTATG ATATGGTTCCAGAGCCCAAAATC ATTGATGCTGCTTTGCGGGCATGC AGACGGTTAAATGATTTTGCTAGT ACAGTTCGTATCCTAGAGGTTGTT AAGGACAAAGCAGGACCTCATAA GGAAATCTACCCCTATGTCATCCA GGAACTTAGACCAACTTTAAATG AACTGGGAATCTCCACTCCGGAG GAACTGGGCCTTGA CAAAGTGTAAACCGCATGGATGG GCTTCCCCAAGGATTTATTGACAT TGCTACTTGAGTGTGAACAGTTAC CTGGAAATACTGATGATAACATA TTACCTTATTTGAACAAGTTTTCC TTTATTGAGTACCAAGCCATGTAA TGGTAACTTGGACTTTAATAAAA GGGAAATGAGTTTGAACTGAAA (SEQ ID NO: 74)
17B8 | FAM53B | NM_014661.3 | FIG. 19 (SEQ ID NO: 46) | EVHIKKKTKQTLTNFQMGLLVRGREWPCPGCAACLSKLP (SEQ ID NO: 60) | GGGAAGTCCACATTAAAAAGAAA ACAAAACAAACCCTAACTAACTT CCAAATGGGTCTCCTGGTGCGGG GGCGTGAGTGGCCGTGCCCTGGG TGTGCTGCCTGTCTGAGCAAGCTT CCCTAGCTGTGGAACCCCGGGCC CCCTGCTGCGGGCTCTGCCTTGGT GTCATGCCTGCTGCACCCCCGTTT CCACTGACGTGCCGTCTGTGGCTA TGGGGGTGGTCACTGGAATGACG GTCACTCCAGACGTCAGCCGGCA GGGATGCAGCAGGCTGGCCGCGC A (SEQ ID NO: 75)
3C11 | UTR-Region Chromosome 11 | AP003173.4 | FIG. 20 (SEQ ID NO: 47) | DHSMVEFPRIIVYPQFGVGNEG (SEQ ID NO: 61) | ATTCTATGGTGGAATTTCCAAGA ATAATTGTTTATCCTCAGTTTGGA GTAGGAAATGAAGGATAATTTTT TCCATTTCACCTCTATTGCAAATT TATTTTTTCAAGCCACACAAAAA ATTGTCTAAGATAAAATGAGAAT TATTCAGATCAATTCTGCAATGAT ACAGGGAAGATGTGAAAGGAGG GCTCAATGCAGAGTTGTGAAGTT GAAAACCACTATTTCTGTTCTAAA GACACAGTAAGCAGAGATCCATC TCTCTTCAGGCATCCTGCTTCTCT GCAGGTTACTTCTGCTTTAAGGAA AGTACATTTTTAGAACAAAGCTT (SEQ ID NO: 76)
3F6 | MAPKKK9 | NM_033141.2 | FIG. 21 (SEQ ID NO: 48) | SSGSGESRLQHSPSQSYLCIPFPRGEDGDGPSSDGIHEEPTPVNSATSTPQLTPTNSLKRGGAHHRRCEVALLGCGAVLAATGLGFDLLEAGKCQLLPLEEPEPPAREEKKRREGLFQRSSRPRRSTSPPSRKLFKKEEHQACGRTRVTS (SEQ ID NO: 62) | TCAAGCGGGAGTGGAGAGAGTCG CCTACAGCATTCACCCAGCCAGT CCTACCTCTGTATCCCAT TCCCTCGTGGAGAGGATGGCGAT GGCCCCTCCAGTGATGGAATCCA TGAGGAGCCCACCCCAGTCAACT CGGCCACGAGT ACCCCTCAGCTGACGCCAACCAA CAGCCTCAAGCGGGGCGGTGCCC ACCACCGCCGCTGCGAGGTGGCT CTGCTCGGCTG TGGGGCTGTTCTGGCAGCCACAG GCCTAGGGTTTGACTTGCTGGAA GCTGGCAAGTGCCAGCTGCTTCC CCTGGAGGAGC CTGAGCCACCAGCCCGGGAGGAG AAGAAAAGACGGGAGGGTCTTTT TCAGAGGTCCAGCCGTCCTCGTC GGAGCACCAGC CCCCCATCCCGAAAGCTTTTCAAG AAGGAGGAGCACCAAGCTTGCGG CCGCACTCGAGTAACTAGTTAAC CCCTTGGGGC CTCTAAACGGGTCTTGAGGGGGT TANCTNGTTACTCGNGTGCGGCC GCNNGCTTGGTGCTCNNCNTTN (SEQ ID NO: 77)
21H4 | cDNA clone | XR_113641.1 | FIG. 22 (SEQ ID NO: 49) | QKLCQAKEKGMCMKKLRMLWECQKLYSLGF* (SEQ ID NO: 63) | ATCCCAGCACGGAGGCCCAGAAA ACTTTAAGATTTGAGTATTAATGT CTCAAGGTCAGGAGCAACCTCAA GGCTAAAACTCAGATCTCAGGAC TCAATTTCACAGAAGTTCCACTAT AAAGGCAATAATCTAAAGCTTTA AATGATATGAAAATTTTGTAATA AGAGTTCAGTATTTCTGCCAACAT TGGCGCATGGATTGCAAAGTTCA CAGGATTGAAAACACCATCGACA TAATGGAAATTGAACAGCATCTG ATTACTGAGTGCTATATCAGCAA GTTAAAAGGATCTTTTGCATACCT TTTAATGGTATATATCCTAAAACT GAAGTGTTCAATATAGACATCCA GATTGAAA (SEQ ID NO: 78)
4C4 | PSA | M27274.1 | FIG. 23 (SEQ ID NO: 50) | SEGRTVTNKVSRKYTG (SEQ ID NO: 64) | TGTGTGGGTATGAGGGTATGAGA GGGCCCCTCTCACTCCATTCCTTC TCCAGGACATCCCTCCACTCTTGG GAGACACAGAGAAGGGCTGGTTC CAGCTGGAGCTGGGAGGGGCAAT TGAGGGAGGAGGAAGGAGAAGG GGGAAGGAAAACAGGGTATGGG GGAAAGGACCCTGGGGAGCGAA GTGGAGGATACAACCTTGGGCCT GCAGGCCAGGCTACCTACCCACT TGGAAACCCACGCCAAAGCCGCA TCTACAGCTGAGCCACTCTGAGG CCTCCCCTCCCCGGCGGTCCCCAC TCAGCTCCAAAGTCTCTCTCCCTT TTCTCTCCCACACTCTATCATCCC CCGGATTCCTCTCTACTTGGTTCT CATTCTTCCTTTGACTTCCTGATC CTGTGTATTTTCGGCTCACCTTGA TTTGTCACTGTTCTCCCCTC (SEQ ID NO: 79)
5A1 | H2aa4 | NM_001040874.1 | FIG. 24 (SEQ ID NO: 51) | QRGSGQQEDAHHPSSPPAGHPQRRGTEQAAGQSHHRPGRRLA (SEQ ID NO: 65) | ACGCGGCTCGGGGACAACAAGAA GACGCGCATCATCCCTCGTCACCT CCAGCTGGCCATCCGCAACGACG AGGAACTGAACAAGCTGCTGGGC AAAGTCACCATCGCCCAGGGCGG CGTCTTGCCTAACATCCAGGCCGT ACTGCTCCCTAAGAAGACGGAGA GTCACCACAAGGCAAAGGGCAAG TGAGGCTGACGTCCGGCCCAAGT GGGCCCAGCCCGGCCCGCGTCTC GAAG (SEQ ID NO: 80)
1B4 | UBE2I | NM_194259.1 | FIG. 25 (SEQ ID NO: 52) | ILYPETLLKLLISLRRFWAEMMEFSRYTIMSSENRDNLTSSFPN* (SEQ ID NO: 66) | TGTGGCATCGTCAAAAGGAAGGG ATTGGTTTGGCAAGAACTTGTTTA CAACATTTTTGCAAATCTAAAGTT GCTCCATACAATGACTAGTCACCT GGGGGGGTTGGGCGGGCGCCATC TTCCATTGCCGCCGCGGGTGTGCG GTCTCGATTCGCTGAATTGCCCGT TTCCATACAGGGTCTCTTCCTTCG GTCTTTTGTATTTTTGATTGTTATG TAAAACTCGCTTTTATTTTAATAT TGATGTCAGTATTTCAACTGCTGT AAAATTATAAACTTTTATACTTGG GTAAGTCCCCCAGGGGCGAGTTC CTCGCTCTGGGATGCAGGCATGC TTCTCACCGTGCAGAGCTGCACTT GGCCTCAGCTGGCTGTATGGAAA (SEQ ID NO: 81)
18D3 | TIMP2 | NM_003255.4 | FIG. 26 (SEQ ID NO: 53) | CSKHSSLLLFSSCKQLKIFKIKFTL (SEQ ID NO: 67) | ATGTTCTAAGCACAGCTCTCTTCT CCTATTTTCATCCTGCAAGCAACT CAAAATATTTAAAATAAAGTTTA CATTGTAGTTATTTTCAAATCTTT GCTTGATAAGTATTAAGAAATAT TGGACTTGCTGCCGTAATTTAAAG CTCTGTTGATTTTGTTTCCGTTTG GATTTTTGGGGGAGGGGAGCACT GTGTTTATGCTGGAATATGAAGTC TGAGACCTTCGGTGCTGGGAACA CACAAGAGTTGTTGAAAGTTGAC AAGCAGACTGCGCATGTCTCTGA TGCTTTGTATCATTCTTGAGCAAT CGCTCGGTCCGTGGACAATAAAC AGTATTATCAAAGAGAAAAAAAA (SEQ ID NO: 82)
2B10 | WDR77 | NM_024102.2 | FIG. 27 (SEQ ID NO: 54) | NSLPLFPPQNSMGPDIFCPGPLSLDVESLNAVFIDF* (SEQ ID NO: 68) | GCCACTTTTCCCACCCCAAAACA GCATGGGGCCTGACATCTTCTGCC CTGGTCCCCTTTCTCTTGATGTGG AAAGTCTGAATGCAGTATTTATA GACTTCTAAGGTTTTAAAATCCAG TATCAAGAAGAAAATCAGAAATA CTGGTTGGTGAAATAAAGAGTTT AGGCATTGTTGGCCTGTCTTTTTT GAAGCATGTGTGTTATGTGTAGTT AGATATATTTCACTTATGTGAGTC ATCATGGTGTTGGTCTTGTAGCCC ATTATTTTTCCTGTGCTTCCCCAG CTTCCCAAAGTAGCTAGTTAGAA CTTAAGGTAAATATTTATTCTTGG GTTGGTGGAGTGGATATTGCCAG TTAGGAGTCATGGATCAATTACT GATTATATTGAAAGTAAATATAA TCAATTATGTACTTTTGAGCTTTG CAGGTTCAATTTAGGTAAAAATC ACATTATGAAACTGGGAAAGTCT GAAGGAATATGGGCAAAATATTT CTCAGTAAAGCTT (SEQ ID NO: 83)
5F4 | Deaminase Domain Cont 1 | AL031320.1 | FIG. 28 (SEQ ID NO: 55) | VSGSQRVKYLLVNPLQKKFINPCYRGF (SEQ ID NO: 69) | GAGATGTAAGCGGCTCACAAAGG GTGAAATATTTACTAGTTAACCCC CTTGCAGAAAAAGTTATCAACCC TTGCTACAGAGGATTTTAAAAAA TAAAATACAGCTTGTTCTATCTTT AGCATCTAACTGGGGAAAAGAAT CATAACATGTGAAAGAATAAATA AGAAATTGTGCTAACAGTAAGGA GTGTTATATGAAATATTACCTGAA GAACATGAAACTTGAACTTGCTA GAGATAGAGAATATTTAAAGAGG CTAAGCAGAGCATTTCAGGGAAA GGGCAAGAAGAAGCCTGGGTTGT GTGTGAGGAAATCAGCTGACAGA GGAGGAGACTATTAAGGAAGCAT AAGGAAAGAAAGACAAAAAATT GGGGTAAAAATATGTACGGCTTT GAAAGCTT (SEQ ID NO: 84)
TABLE-US-00004 TABLE 4 NCBI Gene Clone DNA Sequence Clone Gene Designation Sequence Peptide Sequence (Encoding Peptide Sequence) 1G7 Lamin A/C BC014507.1 FIG. 29 SCGPSMRTRWS AAGCTTCGCCTCCTTGGCTGCCAG (SEQ ID SIRRSWRRLILPS CTGCTTCTGGAGCTGGCTGAGCTG NO: 85) WTMPGSLLRGT GGCAGAGAGGCTGTCGATGCGGA ATWWGLPTRSC TGCGCGACTGCTGCAGCTCCTCGT SSRASASTASLP GGGCAGCCCCCACCAGGTTGCTG SSASSRSSWQPR TTCCTCTCAGCAGACTGCCTGGCA RRSLRPHSS TTGTCCAGCTTGGCAGAATAAGTC (SEQ ID NO: 112) TTCTCCAGCTCCTTCTTATACTGC TCCACCTGGTCCTCATGCTGGGCC CGCAG (SEQ ID NO: 142) 1B10 Lsm3 AJ238095.1 FIG. 30 MRNDRAASRQIT AATGAGAAATGACCGAGCAGCTT (SEQ ID (SEQ ID NO: 113) CGAGGCAGATTACATGACTTATG NO: 86) ATCTACATTTAAATATGATCTTGG GAGATGTGGAAGAAACTGTGACT ACTATAGAAATTGATGAAGAAAC ATATGAAGAGATATATAAATCAA CGAAACGGAATATTCCAATGCTC TTTGTCCGGGGAGATGGCGTTGTC CTGGTTGCCCCTCCACTGAGAGTT GGCTGAAACAAAGAATTTGTCCT GTATGGAAAACGGGAGACTTTGT ACAGTGGCCTCTCTAAAAGTACA AAACATTCATAAGAGAAACCTGC ATACATTTTGATATTAAGAAATAA TTCCGGGGATTCTCCACTCCTGAA ATGAGTTGATTTGCAGATAACTCT ACAACTTCTTAAGCTAAATGGTAT TTTCATTTTTCTCAAGCTCTCCAA TAAATATGACCACCAA (SEQ ID NO: 143) 2D7 cDNA AC027307.5 FIG. 31 LAHRPPCAEPDP GGAGTTTCACTTTTGTTGCCCAGG clone (SEQ ID GQRMELPAPVP ATTGAGTGCAGTGCCCCGATCTTG Chromo 19 NO: 87) RPRGASKPRDG GCTCACTACAACCTCTGCCTCCTG TSSHCDMPNCQ GGTTCAAGCGACTCTCCTGCCTCA HPQGPGPAGEIR GTGTCCTGAGTAGCTGGGATTAC SRCRSCWLRAV AGGCGTCTGCCACCACGCCCGGC RCNPWLGR TAATTTTGTATTTTTAGTAGAGAA (SEQ ID NO: 114) CAGGTTTCACTATGTTGGTCAGGC TGGTCTTGAACTCCTGACCTCAGC GCATCCAGAATTTTAGACGGGGC CCCCAGGGTGAGGTCTTGGCACC CTCCAGTAGAGAAGAAGGGACAT GGGCCATACGTGGGGTGTCCTTTC TGGGAGCCTTGCGTCCCTTACCTG CCTAGCCAGGGATTGCACCTCAC AGCACGCAGCCAGCAGGAACGGC ACCGTGATCTGATTTCACCTGCGG GCCCTGGGCCCTGGGGGTGTTGA CAATTGGGCATATCACAGTGTGA GCTAGTCCCGTCTCGGGGTTTGGA GGCTCCACGTGGCCGTGGTACAG GAGCAGGCAGTTCCATCCTCTGG CCTGGATCAGGCTCTGCACACGG AGGCCTGTGGGCCAG (SEQ ID NO: 144) 1H3 ADAM NR_027878.1 FIG. 
32 NSGASGSRNFSS TCGGCATAAAGTACCTCCTGGAA metallo- (SEQ ID CSAEDFEK GGAACCGACAGTCTTTACAACAG peptidase NO: 88) (SEQ ID NO: 115) TCACCATATGCACACTCAGCAAA domain 9 TGATTTAAGCTTACAGGTACTTCC TTCGCAGCAAGGGTCCAATTCAC ATTCCTTTGGAGTACCACAGTCAC ACTCTTCCCCAGCGTCCACCAACT TATTACCACAGGAGGGAGCACTA TAGGCTTCATCAGGCTTTGGAATA TTAAGAAGGCAGTTTCCTCCTTTA TTTAAAGTTACTTCTCAAAGTCCT CTGCACTGCAACTGCTAAAGTTTC TGGAACCCGATGCTCCTGAATTC (SEQ ID NO: 145) 3F5 alpha-2 NM_001185.3 FIG. 33 SSVPPQDTAPYS TCAAGCGTGCCCCCGCAGGACAC glyco- (SEQ ID CHVQHSSLAQPL AGCCCCCTACTCCTGCCACGTGCA protein1 NO: 89) VVPWEAS GCACAGCAGCCTGGC (AZGP1) (SEQ ID NO: 116) CCAGCCCCTCGTGGTGCCCTGGG AGGCCAGCTAGGAAGCAAGGGTT GGAGGCAATGTGGGATCTCAGAC CCAGTAGCTGCCCTTCCTGCCTGA TGTGGGAGCTGAACCACAGAAAT CACAGTCAATGGATCCACAAGGC CTGAGGAGCAGTGTGGGGGGACA GACAGGAGGTGGATTTGGAGACC GAAGACTGGGATGCCTGTCTTGA GTAGACTTGGACCCAAAAAATCA TCTCACCTTGAGCCCACCCCCACC CCATTGTCTAATCTGTAGAAGCCG GAAGCTTGCGGCCGCACTCGAGT AACTAGTTAACCCCTTGGGGCCTC TAAACGGGTCTTGAGGGGTTANC TNGTTNCTCGNGTGCGGCCGCNN GCTTCCGGCTTCTNCNGNTTNGNC NNTG N (SEQ ID NO: 146) 5F3 Hemk1 NM_016173.3 FIG. 7 VAVAQGSGALE GTGGCTGTTGCGCAGGGATCAGG (minus (SEQ ID SSKWPLLNLNG TGCACTTGAGTCTTCGAAGTGGCC strand) NO: 34) CLGRAEGQVLM ATTGCTCAACTTGAATGGCTGCCT ASHP GGGTCGGGCAGAAGGCCAGGTCC (SEQ ID NO: 117) TCATGGCTTCCCATCCCTAATGAC CGGAATACATGGGCTGCCAGGTC AGATGTGGGCCACATGGGAAGTC CCAGCTCTATTCTAGAAAATGCAT GTACCATCAGCTTACTGATAGAC ATTTACTGAACTTGGGTATGCCAG ATCCACAGGGGGCCCCAGAGATG AGGGGGATAAGAAGGTTTCTGAA GGCATGGTACAGAAGGTGCCAGC AGAGGTATGGGCTAGGGGAGGCA GGGAGAGCACAGAGCAGGCATCC TAAAGGAGGCAGCATTTGTGTTG GAGCTTGAAGAAGTG (SEQ ID NO: 147) 5F8 Desmocol- NG_016782.1 FIG. 
34 SAFRGYLANNK TAAGCTTTCATCTTCCCCAACCCT lin 3 (SEQ ID (SEQ ID NO: 118) GATGTCTTCCTATTCTCACTGATC NO: 90) CCCCTACTGACTCAGCTTCACGCT TCTTGATTATACCTCTCTCCTGTA GAAAAGCCTTGGCTGGCTCTCCTT TAGGATGAGAATAAATCCGAAAT CCTTAGTGTAGCATTTAGAAGTCC TATCTCCCACTTGTTTCTTAATATT CTCTTCTCTAACACCGAACTTGTT TCAAGCCTCTTTTCCAACACATGA TTTCTTCTATTCTAAATCAATTTAT TTATTATTTGCTAAATAGCCCCTA AAC (SEQ ID NO: 148) 1G12 DAZ Asso- AC027307.5 FIG. 31 SLAHRPPCAEPD GGCTAATTTTGTATTTTTAGTAGA ciated (this is for (SEQ ID PGQRMELPAPV GAACAGGTTTCACTATGTTGGTCA protein a chromosome NO: 87) PRPRGASKPPRRD GGCTGGTCTTGAACTCCTGACCTC 19 clone, not (SEQ ID NO: 119) AGCGCATCCAGAATTTTAGACGG the specified GGCCCCCAGGGTGAGGTCTTGGC gene) ACCCTCCAGTAGAGAAGAAGGGA CATGGGCCATACGTGGGGTGTCC TTTCTGGGAGCCTTGCGTCCCTTA CCTGCCTAGCCAGGGATTGCACCT CACAGCACGCAGCCAGCAGGAAC GGCACCGTGATCTGATTTCACCTG CGGGCCCTGGGCCCTGGGGGTGT TTGACAATTGGGGCATATCACAG TGTGAGCTAGTCCCGTCTCGGGG GTTTGGAGGCTCCACGTGGCCGT GGTACAGGAGCAGGCAGTTCCAT CCTCTGGCCTGGATCAGGCTCTGC ACACGGAGGCCTGTGGGCCAG (SEQ ID NO: 149) 1G5 RPL34 NM_033625.2 FIG. 6 LFIFITQKSFIFLF GTCTTTTCATTTTTATTACTCAAA (Minus (SEQ ID SFLTLCLCLQHF AAAGTTTCATTTTTTTATTTAGCTT strand) NO: 33) HNDFLLLDKEST TCTGACTCTGTGCTTGTGCCTTCA LDPVTNTFSTHGT ACACTTTCACAACGATTTTCTGCT (SEQ ID NO: 120) CCTCGATAAGGAAAGCACGCTTG ATCCTGTCACGAACACATTTAGCA CACATGGAACCAA (SEQ ID NO: 150) 3C9 PERP NM_022121.4 FIG. 35 PYQIYQVMIN CTTACCAGATCTATCAGGTCATGA (Minus (SEQ ID (SEQ ID NO: 121) TAAATTAGACCCAGTCCATCTTTC strand) NO: 91) AATCCAGTCTACTCTGGTTCTGAA CATATAAACACAAAACACTACAG ATTTATTAATATAGCATTTTCCCA CACCCTAACCCTATAAAGAACTTT AAAAGAGAAAATTTCATCTAAAT ATTTCACACTTAAAGGAAAGCCTT ACCAACTATGGCAACAGGTTTGG ACCATGAAATAGTACTTTCCTAGA TGACATATCGAGTCAACATGAAG CCTTAGCTGAAATGAATGATTCA GGATATTAATGAGAAATTCTCAC AAATGATATGCATTTAGGAAATG ATTTTGCTTTCCTTAAATAGTTCG AAGGCTTGAAAATAAACTTTTTTT TTGCATTTCTTTTAAAAGTT (SEQ ID NO: 151) 3D11 Chromo- AC117381.5 FIG. 
36 VSTFLSRVGRVS GTTTCCACATTCTTGTCAAGGGTT some 3 (Homo (SEQ ID LLNFLPF GGTAGGGTCAGTCTTTTAAATTTC UTR region sapiens 3 NO: 92) (SEQ ID NO: 122) TTGCCATTTTAGTGACTGTGCATT ropporin/ BAC RP11- GGTATTTCATTGTGGTTTATTTGC RhoEGF 783D3) ATGATGACTAATGCTCAACACCA ACTAATCATGTTGAGTATTTTTAA TGTGCTTATTTGCCACTCATATAT CTTCTTTGATGAAGTGTCTCTTCA AATATTTTGCCCATTTAAAAACTG TATTGATTCTTATTATTGAATTGC AATAATTCTTTCTATCCGGATATA TATCCTTTGCCAGATATGTGTATT ACAAATGTTTTCTCCTAGCCTTCC ACCTCAGCCTCCCAAGTAGCTGG GAATGCAGGTGTGCACCACCACT CCAGGGTTTTTTGTTGTTGTTGTT GTTGTTTTTCTGTAGAGACAGGGT CTTGCCATGCTGCCGAGGCTGCTC TCAAACTCCTGGGATCAAGAAAT CCTCCTGCCTCGGCCTCCCAAAGT GCTGACATTACAAGCATGAGCCA CTGTGCCTGGCTAACTTTTCATCT TTTAAAGTAGTGTCTTGCAAAGA ACAACATTTTAATGAAGTCCATTT ATCAACTTTTTGATTCATTGTCCA TGCTTTTTGCATAATAAGAAATCT TTGCCTGCCTCAAAATTGCAAAGC TT (SEQ ID NO: 152) 3E4 Cox5a NM_004255.3 FIG. 37 NTLVTYDMVPE AACACACTTGTTACCTATGATATG (SEQ ID PKIIDAALRACR GTTCCAGAGCCCAAAATCATTGA NO: 93) RLNDFASTVRIL TGCTGCTTTGCGGGCATGCAGAC EVVKDKAGPHK GGTTAAATGATTTTGCTAGTACAG EIYPYVIQELRPT TTCGTATCCTAGAGGTTGTTAAGG LNELGISTPEELG ACAAAGCAGGACCTCATAAGGAA LDKV ATCTACCCCTATGTCATCCAGGAA (SEQ ID NO: 123) CTTAGACCAACTTTAAATGAACTG GGAATCTCCACTCCGGAGGAACT GGGCCTTGACAAAGTGTAACCGC ATAATAAAAGGGAAATGAGTTTG AACTG (SEQ ID NO: 153) 4B11 Mito- HQ113226.2 FIG. 38 PPSHHIPNLSLTK GCCCCCATCTCATCATATACCAAA chondrion (SEQ ID RKPSPHSLNLIH TCTCTCCCTCACTAAACGTAAGCC sequence NO: 94) HSRQLRWIKPNP TTCTCCTCACTCTCTCAATCTTATC ATQNLSILLNYP CATCATAGCAGGCAGTTGAGGTG HRMNNSSSTVQP GATTAAACCAAACCCAGCTACGC (SEQ ID NO: 124) AAAATCTTAGCATACTCCTCAATT ACCCACATAGGATGAATAATAGC AGTTCTACCGTACAACCCTAACAT AACCATTCTTAATTTAACTATTTA TATTATCCTAACTACTACCGCA (SEQ ID NO: 154) 4B3 MYH9 NM_002473.4 FIG. 
39 SAGSCSSA GGGTTCGTGTTCCTCAGCGTAGCC (Minus (SEQ ID (SEQ ID NO: 125) ATCAGGCTTGGCCAGCTGCTCCTT strand) NO: 95) GTAAAGCTGCCCCACAGTGCGGA ACATGCCCTTCCGCGTCTTGAAGG CCCCGGGCAGTGCGGTCTCCGAC ATGCCGGCCACCTGGTCCAGGCC GATGATGCGGTCCACATCCTTCCA CAGCTCCGAGACAAACTTGTCAG AGGACTGGTGGAGCAGTGTGGCG ATGTTGTCATTCAGGGGATCCATG TTCTTCATCAGCCACTCGTCAGCT
TTGTAATCCACCTTGCCGGCATAG TGGATAATGCAGAAATCAGCTTT GTCCTTCAGCTGCTTGGGCTTCTG GA (SEQ ID NO: 155) 4D10 ASND1 NM_019048.2 FIG. 40 KLLFALQLWNL AAATTACTTTTCGCCTTGCAGCTG (SEQ ID VLQPLLFCPNGP TGGAACTTGGTCTTACAGCCTCTG NO: 96) CSLDQELQKWK CTCTTCTGCCCAAACGGGCCATGC KLMKRHLINVD AGTTTGGATCAAGAATTGCAAAA GSKSCP ATGGAAAAAATTAATGAAAAGGC (SEQ ID NO: 126) ATCTGATAAATGTGGACGGCTCC AAATCATGTCCTTAGAAAATCTTT CTATTGAAAAGGAGACTAAATTG TAATGTGATTCACAATGTAACAAT ATAAAAATAAGTTTTTATATAATT ATATAAAAGTAAGATACTCTGCT GCTTTACTATTGTATAATAT (SEQ ID NO: 156) 4D9 Cathepsin NM_003793.3 FIG. 41 EDDYSYQGHMQ CAGAGGATGACTACAGCTACCAG F (SEQ ID SCNFSAEKAKV GGTCACATGCAGTCCTGCAACTTC NO: 97) YINDSVELSQNE TCAGCAGAGAAGGCCAAGGTCTA QKLAAWLAKRG CATCAATGACTCCGTGGAGCTGA PISVAINAFGMQ GCCAGAACGAGCAGAAGCTGGCA FYRHGISRPLRP GCCTGGCTGGCCAAGAGAGGCCC LCSPWLIDHAVL AATCTCCGTGGCCATCAATGCCTT LVGYGNRSDVP TGGCATGCAGTTTTACCGCCACGG FWAIKNSWGTD GATCTCCCGCCCTCTCCGGCCCCT WGEKGYYYLHR CTGCAGCCCTTGGCTCATTGACCA GSGACGVNTMA TGCGGTGTTGCTTGTGGGCTACGG SSAVVD CAACCGCTCTGACGTTCCCTTTTG (SEQ ID NO: 127) GGCCATCAAGAACAGCTGGGGCA CTGACTGGGGTGAGAAGGGTTAC TACTACTTGCATCGCGGGTCCGGG GCCTGTGGCGTGAACACCATGGC CAGCTCGGCGGTGGTGGACTGAA GAGGGGCCCCCAGCTCGGGACCT GGTGCTGATCAGAGTGGCTGCTG CCCCAGCCTGACATGTGTCCAGG CCCCTCCCCGGGAGGTACAGCTG GCAGAGGGAAAGGCACTGGTACC TCAGGGTGAGCAGAGGGCACTGG GCTGGGGCACAGCCCCTGCTTCCC TGCACCCCATTCCCACCCTGAAGT TCTGCACCTGCACCTTTGTTGAAT TGTGGTAGCTTAGGAGGATGTCA GGGTGAAGGGTGGTATCTTGGCA GTTGAAGCTGGGGCAAGAACTCT GGGCTTGGGTAATGAGCAGGAAG AAAATTTTCTGATCTTAAGCCCAG CTGTGTTCTGCCCCCGCTTTCCTC TGTTTGATACTATAAATTTTCTGG TTCCCTTGGATTTAGGGATAGTGT CCCCCTCCATGTCCAGGAAACTTG TAACCACCCTTTTCTAACAGCAAT AAAGAGGGTCCTTGTCCCGAAAA AAAAAAAAA (SEQ ID NO: 157) 4F1 Master- AP000779.4 FIG. 
42 GTNQRQTMENH GGCAGACAATGGAAAACCATTGA mind-like (Homo sapiens (SEQ ID (SEQ ID NO: 128) AAAGGATTAAACTGGGAAGTGAT 2 genomic DNA, NO: 98) ATGTTCTCTTTTGCATTTAAAAAG Chromosome ATCACCAATGGGGATATGGAGAA 11q) TGGTCTGGATAGGTCTTAAGACTA GAGCCAGGAAGACATGTTAGAAG GCTATCAATTGACCCTAAAGACA CTGCTTCAATCCCTTTGATGACAG TGAGTTTGCTTTCCCCAGAGATAG CTTATTGGACCTCAGGACTGCTGT GAGAAACAGAAAATGCTCCTTTA CGTGTTGCCTGAAGTTAGGCTCAC CGATTTGGGGCATGTTCTAATTCT ACCAGCTAGGAACACACAGAATC GCTTGTCAAACATTCTGAGTCAGA TATGTCCTCCCTATGTCTTTTCTG AGAAAGGCATACAGAAATTCCCA GCTAAACATCACCAGTTCCCTCAT TTGTTCCTCAGATGATATGGTCCA TTCAAGTTTTGTAATCATCATGGG GGTAGATGGAGGGTCCCAGTCCT CACAACCATTCTGGTAATTTACTC TTGAATTTACTGGTTCACATGTAT CTATTTTGTAGTGTGGCTCCAGAA A (SEQ ID NO: 158) 5D11 CSNK2A2 NM_001896.2 FIG. 43 SSCSEYNVRVAS TCATCCTGCTCGGAGTACAATGTT (SEQ ID RYFKGPELLVD CGTGTAGCCTCAAGGTACTTCAA NO: 99) YQMYDYSLDM GGGACCAGAGCTCCTCGTGGACT WSLGCMLASMI ATCAGATGTATGATTATAGCTTGG FRREPFFHGQDN ACATGTGGAGTTTGGGCTGTATGT YDQLVRIAKVL TAGCAAGCATGATCTTTCGAAGG GTEELYGYLKK GAACCATTCTTCCATGGACAGGA YHIDLDPHFNDI CAACTATGACCAGCTTGTTCGCAT LGQHSRKRWEN TGCCAAGGTTCTGGGTACAGAAG LSIVRTDTLSAL AACTGTATGGGTATCTGAAGAAG RP TATCACATAGACCTAGATCCACA (SEQ ID NO: 129) CTTCAACGATATCCTGGGACAAC ATTCACGGAAACGCTGGGAAAAC TTATCCATAGTGAGAACAGACAC CTTGTCAGCCCTGAGGCCCTAGAT CTTCTGGACAAACTTCTGCGATAC GACCATCAACAGAGACTGACTGC CAAAGAGGCCATGGAGCACCCAT ACTTCTACCCTGTGGTGAAGGAG CAGTCCCAGCCTTGTGCAGACAA TGCTGTGCTTTCCAGTGGTCTCAC GGCAGCACGATGAAGACTGGAAA GCGACGGGT (SEQ ID NO: 159) 7A9 AURKAIP1 NM_001127230.1; FIG. 
44 AARLGPSLECW CGGCCGCCCGCCTTGGCCCGTCTC NM_001127229.1; (SEQ ID AAGSAGPFTAH TGGAGTGCTGGGCAGCCGGGTCT NM_017900.2 NO: 100) RRPAQVGRPLSL GCGGGCCCCTTTACAGCACATCG (transcript ARGPSWSWRRC CCGGCCGGCCCAGGTAGGGCGGC variants) WSPGRCPSAPW CTCTCTCCCTCGCAAGGGGGCCCA RAGSRPAASCPD GCTGGAGCTGGAGGAGATGCTGG WIPGPQGLWLH TCCCCAGGAAGATGTCCGTCAGC RNPTSVRPAR CCCCTGGAGAGCTGGCTCACGGC (SEQ ID NO: 130) CCGCTGCTTCCTGCCCAGACTGGA TACCGGGACCGCAGGGACTGTGG CTCCACCGCAATCCTACCAGTGTC CGCCCAGCCAGATAGGGGAAGGG GCCGAGCAGGGGGATGAAGGCGT CGCGGATGCGCCTCAAATTCAGT GCAAAAACGTGCTGAAGATCCGC CGGCGGAAGATGAA (SEQ ID NO: 160) 3C1 Chromo- AC096741.3 FIG. 45 GKERENIRTNT GGCAGGGAAGGGAGAACATTAGG some 4 (Homo (SEQ ID (SEQ ID NO: 131) ACAAATACCTAATGCACGCCAGG sapiens BAC NO: 101) CCCTANTAATCGTAGATGATGGG clone RP11- TTGATGGGTGTAGCAAACCACCA 327017) TGGCACATGTATATCTATGTAACA AACCTGCACATTCTGTACATGTAT CCCAGAACTTCAAGTAAAATTTTA AAAAATTCAAAAAAAGTAATAGG AAAAGGGGAAACATCCACGTGAG CAGTCCAGTTTCCCAATCTGGAAC TTGGAGCTGTTCACCTGGTGGGTG TTTGTGACTATTCAGACACAGACA ACAAAGGCTACTCCAGATTGAAG TGCACTGCTTACTTTCAGTGACCT CATAGAACTACTCAACATTGTTTT TGGTGATTCCTGTGCTATGGTTTG AATGGCTCCGCTCCAAAACTCAG GTGTTGCCAATGNGATGGTATTA AGAAGTAGGGCATTTAAAAAACA ACAACAGGCCTGGCGCGGTGGCC CACGCCTGTAATCCCAGCACTTTG GGAGGCTAAGGCGGGCGGATCAC CGGAGGTCAGGAATTCAAAACCA GCCTGGCCAACATGGCGAAACCC TGTCTCTACTAAAAATACAAAAA TTAGCCAGGCATGGTTGCGGGCG CCTGTAATCCCGGCTACTCGGGA GGCTGAGGCAGGGGAATCCTTGA ACCCGGGA (SEQ ID NO: 161) 3C3 ARF6 NM_001663.3 FIG. 
46 PKCRLQRQYTG GAAATGTAGACTGCAAAGGCAGT (SEQ ID KGGVGFVYEGV ATACAGGAAAAGGTGGAGTGGGT NO: 102) (SEQ ID NO: 132) TTTGTTTATGAGGGTGTCTGAAAA CTAAAATTGAGCGGGATATCATG GTATAGTTGGACAGTATTGGTCCT TCACACTTTGGCCATATTGTATAA TGGAGCTTTTACCAAAGATGTATG AGAAGTGTAAGACTATAAAAAAA TGAACTATTCAAAGTAAAACTCTT AACAAACATTTTACTTAAAGCAG ATGCAAAAGGGTATTCTCATGTA GGCTCCTGTTGGTGCAGAGGGAT TTTTTTGATTTCAGGATACAACTA AAGTACGAAGTTCTCAGTTTCACT TTAGTAGAAAGAGCTCTAGAAAT GAGGCTGATAAACACATCTAAGA ACACTGGTTGCTTTCTAAAATTTC CAAAGCTCCACCATAAATGTAAT TTTTAGTGTTTCAAATGATTGCAT TTTAAAGTATATAAATATGGGTTA TCCAATATCAATGCTATAGTAACA TCCTGAAACAAAACAAGCACAAA GGTATAAATGCCTAAACTGGAGG AAGCTTG (SEQ ID NO: 162) 3D1 3'UTR AL135937.2 FIG. 47 QTQTHTSAPLKC CTCAGACTCAAACACACACCTCC region 2 (Human DNA (SEQ ID QPWSFVEARICH GCTCCCTTGAAGTGCCAGCCCTGG JAG1 sequence NO: 103) GSQLVRCPVQH AGCTTTGTTGAGGCTCGCATCTGC from clone PSRIS CACGGGAGTCAGCTAGTACGTTG RP1-278022 on (SEQ ID NO: 133) CCCAGTTCAACATCCATCCAGGAT chromosome 20) TTCATAGGAACTTGAGAATCATTG TTTTTGGCTTGAATCCTGGGTTTG AGGTTTCTTCGTGTAGGAATCTGA AAAAAGGATTTGGAAACGTTGTT GTCTCTAATCCCAAAGTATGTATC TGGGAGGCTGCCTTCGCCATCACC CACCTAATAACTCAGG (SEQ ID NO: 163) 5A5 Mito- HQ113226.2 FIG. 48 PRLHQXKANYI AGACTTCACCAGTCAAAGCGAAC chondrion (SEQ ID YSIDPIT TACATATACTCAATTGATCCAATA sequence NO: 104) (SEQ ID NO: 134) ACTTGACCAACGGAACAAGTTAC CCTAGGGATAACAGCGCAATCCT ATTCTAGAGTCCATATCAACAATA GGGTTTACGACCTCGATGTTGGAT CAGGACATCCCGATGGTGCAGCC GCTATTAAAGGTTCGTTTGTTCAA CGATTAAAGTCCTACGTGATCTGA GTTCAGACCGGAGTAATCCAGGT CGGTTTCTATCTACTTCAAATTCC TCCCTGTACGAAAGGACAAGAGA AATAAGGCCTACTTCACAAAGCG CCTTCCCCCGTAAATGATATCATC TCAAGCTT (SEQ ID NO: 164) 3E1 Chromo- AL135937.22 FIG. 
49 P Q T T A P R R A CTCGCTCAAACACACACCTCCGCT some 20 (SEQ ID R P R R S (SEQ ID CCCTTGAAGTGCCAGCCCTGGAG NO: 105) NO: 135) CTTTGTTGAGGCTCGCATCTGCCA CGGGAGTCAGCTAGTACGTTGCC CAGTTCAACATCCATCCAGGATTT CATAGGAACTTGAGAATCATTGTT TTTGGCTTGAATCCTGGGTTTGAG GTTTCTTCGTGTAGGAATCTGAAA AAAGGATTTGGAAACGTTGTTGT CTCTAATCCCAAAGTATGTATCTG GGAGGCTGCCTTCGCCATCACCC ACCTAATAACTCAGGC (SEQ ID NO: 165) 5A9 Chromo- AL034375.23 FIG. 50 G T I S I V C C W ATTGTTTGTTGTTGGGGGTGTCTT some 6 (SEQ ID G C L C Q H L V TGTCAGCATCTAGTACAGTGCCTG clone NO: 106) Q C L A D G C S I GCAGATGGATGCTCAATAAATAT UTR region N I D L M G Y E TGATTTAATGGGTTATGAGGGTGT (Minus G V N I K L A F I TAATATAAAATTAGCATTTATTCA strand) Q Q L L (SEQ ID GCAACTACTATGAGTCAGCCACT NO: 136) GGGCTAAGTGGCTTACATGTTAA GAACCTCACAGAAGCCAGGTGTG GTGGCTCACGCCTGTAATCCCAGC ACTTTGGGAGGCTGAAGCGGGCA GATCACCTGAGGTCAGGAGTTTG AGTCCAGGCTGGCCAACGTGGTG AAACCCCATCTCTACTAAAAATA CAAAAATTAGCCAGTTGTGGTGG CAGGCGCCTGTAGTCCCAGCCAC TCAGGAGGCTAAGGCAGGAGAAT AGCTGGAACCCGGGAGGTGGAGA
TTGCAGTGAGCCAAGATTGCACC ACTGCACTCCAGCCTGGGTGACA GAGTGAGACTCTGTCTCCAAAAA AAAAAGAAAAAGAAAAAGAACC TCCAGCAACCTAGTAGGTGAGCC CGGTTACTCTTGTTTTACAGGTGA GAAAATTGAGCCCTAGAGAAATA AAGTAACTTGCTTCAGGTCTCATG GTTAAGGGGAACCTGGGCCCTAA CAGTCCACTTCCTGTACCTTCAAC CACGGTTCTACCGCCTCCGCTAGG AAATGGCCCGAGGACATTCCTTA GCTGGCTTCAGCTTGCTCTTTTTC CCCTGCGGTCCACCCCTG (SEQ ID NO: 166) 5H2 MAPKKK5 NG_011965.1 FIG. 51 G M S H H A W P AGAGGGAGTATAGGGCTGTGCAC (SEQ ID R P S F F N T E Y AGAGACTATGATGGCCGTGCTAA NO: 107) F (SEQ ID NO: 137) GGTAAGAGTATTGATAATGTAAG CATACTTCCTCTATCAACAATAAT TGTTAACAGCTGCTTCAAGCACTT GATATTACCACTAGTTGTTAACTG AATCAAGCATGTGCTCCAAGTTC ACATTAATGTGAATTGAACAGCA TTGTGTACGTACGAGGAGCTTCAT GCAAGTGTTATACACTGCACTCAC AAGTATTATGATCTTACTAAGCAT TAGAAATACTCTGTGTTAAAGAA GCTTGGTCTAGGCCAAGCGTGGT GGCTCATGCCT (SEQ ID NO: 167) 1H5 RAS p21 BC020761.1 FIG. 52 D R R P G S F V L GATCGGAGGCCAGGGTCCTTTGT Protein (SEQ ID S F L S Q Met N V ACTTTCATTTCTTAGCCAGATGAA activator NO: 108) V T H F R I I A TGTTGTCACCCATTTTAGGATTAT (RASA1) Met C G D Y Y I TGCTATGTGTGGAGATTACTACAT G G R R F S S L S TGGTGGAAGACGTTTTTCTTCACT D L I G Y Y S H V GTCAGACCTAATAGGTTATTACA S C L L K G E K L GTCATGTTTCTTGTTTGCTTAAAG L Y P V A P P E P GAGAAAAATTACTTTACCCAGTT V E D R R R V R GCACCACCAGAGCCAGTAGAAGA A I L P Y T K V P TAGAAGGCGTGTACGAGCTATTC D T D E I S F L K TACCTTACACAAAAGTACCAGAC G D Met F I V H N ACTGATGAAATAAGTTTCTTAAA E L E D G W Met AGGAGATATGTTCATTGTTCATAA W V T N L R T D TGAATTAGAAGATGGATGGATGT E Q G L I V E D L GGGTTACAAATTTAAGAACAGAT V E E V G R E E GAACAAGGCCTTATTGTTGAAGA D P H E G K I W F CCTAGTAGAAGAGGTGGGCCGGG H G K I S K Q E A AAGAAGATCCACATGAAGGAAAA (SEQ ID NO: 138) ATATGGTTCCATGGGAAGATTTCC AAACAGGAAGCTT (SEQ ID NO: 168) 18H9 Hsp90b Ay359878.1 FIG. 
53 YFAYLISEQNEE TGAAGTGGCAGCAGAGGAACCCA (SEQ ID NKINHNTQHPIL ATGCTGCAGTTCCTGATGAGATCC NO: 109) LSRVREGMGLD CCCCTCTCGAGGGCGATGAGGAT TLSLLPSTQGQE GCGTCTCGCATGGAAGAAGTCGA REKNTRHQQGE TTAGGTTAGGAGTTCATAGTTGGA PGGTGALEAAV AAACTTGTGCCCTTGTATAGTGTC GAHGDTIQGHK CCCATGGGCTCCCACTGCAGCCTC FSNYELLT (SEQ GAGTGCCCCTGTCCCACCTGGCTC ID NO: 139) CCCCTGCTGGTGTCTAGTGTTTTT TTCCCTCTCCTGTCCTTGTGTTGA AGGCAGTAAACTAAGGGTGTCAA GCCCCATTCCCTCTCTCACTCTTG ACAGCAGGATTGGATGTTGTGTA TTGTGGTTTATTTTATTTTCTTCAT TTTGTTCTGAAATTAAGTATGCAA AATAA (SEQ ID NO: 169) 4D7 ribosoma1 NM_001010.2 FIG. 54 C I V D A N L S V GTTGCATTGTGGATGCAAATCTGA protein (SEQ ID L N L V I V K K G GCGTTCTCAACTTGGTTATTGTAA S6 (RPS6) NO: 110) E K D I P G L T D AAAAAGGAGAGAAGGATATTCCT T T V P R R L G P GGACTGACTGATACTACAGTGCC K R A S R I R K L TCGCCGCCTGGGCCCCAAAAGAG F N L S K E D D CTAGCAGAATCCGCAAACTTTTCA V R Q Y V V R K ATCTCTCTAAAGAAGATGATGTCC P L N K E G K K GCCAGTATGTTGTAAGAAAGCCC P R T K A P K I Q TTAAATAAAGAAGGTAAGAAACC R L V T P R V L Q TAGGACCAAAGCACCCAAGATTC H K R R R I A L K AGCGTCTTGTTACTCCACGTGTCC K Q R T K K N K TGCAGCACAAACGGCGGCGTATT E E A A E Y A K GCTCTGAAGAAGCAGCGTACCAA L L A K R Met K GAAAAATAAAGAAGAGGCTGCAG E A K E K R Q E AATATGCTAAACTTTTGGCCAAG Q I A K R R R L S AGAATGAAGGAGGCTAAGGAGA S L R A S T S K S AGCGCCAGGAACAAATTGCGAAG E S S Q K (SEQ AGACGCAGACTTTCCTCTCTGCGA ID NO: 140) GCTTCTACTTCTAAGTCTGAATCC AGTCAGAAATAAGATTTTTTGAGT AACAAATAAATAAGATCAGA (SEQ ID NO: 170) 36C4 Homo AC128709 FIG. 55 L I C I S L M A N CCTGGGCAGTGATTAGGTCATAA sapiens (Homo sapiens (SEQ ID D V E H L F M F I AGGTGGAGTCCTCATGGATGGGA chromo- 3 BAC RP13- NO: 111) C H L S (SEQ ID TTAGTGTCTTTATAAAAGAGACCT some 3 616I3) NO: 141) TTGCCATGTGAGGTTACAGTGAG genomic AAGACATCTGTCTATGAAGAAAG contig TGGGCCCTCACCAAACACAGTCT GCTGGCACTTTGCACTTCAACTCC CCAGCTTCCAGAACTGTAAGGAA TATAAGTCTGTTGTTGGTAAGCCA CCCGGTCTATGATATTTTGTTATA GCAGCCCAAACAGACTAAGACAG GTGACAAATAAACATGAAAAGAT GTTCAACATCATTAGCCATTAGGG AAATGCAGATTAAAA (SEQ ID NO: 171)
[0101] An antibody, such as an autoantibody, to one or more proteins, or fragments of proteins, encoded by a gene such as one listed in Tables 1, 2, 3 or 4, or to a polypeptide encoded by a UTR sequence of a gene such as one listed in Tables 1, 2, 3 or 4, can be detected according to one or more methods described herein and used to characterize a cancer, such as prostate cancer. Many of the proteins may have a role in various cancers, including prostate cancer. For example, the human DCHS1 protein (protocadherin-16 precursor) is believed to be a calcium-dependent cell adhesion protein found in the cell membrane of fibroblast cells. Without being bound by theory, DCHS1 is a cadherin, a class of type-1 transmembrane proteins. Cadherins typically play important roles in cellular adhesion, for example, by binding cells expressing similar cadherins to each other. Structurally, DCHS1 is thought to contain 27 cadherin repeats (extracellular calcium ion-binding domains). DCHS1 expression has been associated with certain cancers, potentially playing a role in tumor adherence (see, e.g., Sjoblom et al., Science, (2006); 314:268-274).
[0102] Another of the proteins, CEP164, is believed to be a centrosomal protein which binds chromatin and plays a role in the DNA damage-activated signaling cascade. It is known to interact with ataxia telangiectasia mutated (ATM) and ATM/Rad3-related (ATR) kinases, which phosphorylate CEP164 upon replication stress, ultraviolet radiation (UV), and ionizing radiation (IR). CEP164 also plays a role in cell cycle regulation, specifically at the G2/M checkpoint and in nuclear division (see, e.g., Sivasubramaniam et al., Genes & Dev. (2008); 22(5):587-600). As CEP164 plays a role in genome stabilization, misregulation or mutation of this gene and/or protein can play a role in certain cancers.
[0103] In a further example, the human KBTBD6 (kelch repeat and BTB (POZ) domain containing 6) is a protein expressed in a wide variety of normal tissues. Its expression and/or misregulation has also been noted in multiple cancer types, including prostate, ovarian, kidney and lung tumors. The function of the protein is not currently known; however, the presence of the kelch repeat and BTB domain suggests that the protein is involved in protein-protein interactions and actin filament organization.
[0104] Certain ribosomal proteins, such as RPS19 and RPL34, have also been associated with certain cancers. The RPS19 gene encodes ribosomal protein S19, a component of the 40S subunit that is located in the cytoplasm as part of the ribosomal complex. Mutations in this gene are associated with Diamond-Blackfan anemia, suggesting a non-ribosomal function for the protein in erythropoietic differentiation. RPS19 protein is also known to interact with fibroblast growth factor-2 (see, e.g., Soulet et al., Biochem. Biophys. Res. Commun. (2001); 289:591-596). Increased expression of RPS19 has been associated with some cancers, but the role of RPS19 in cancer development is unknown. RPL34 (60S ribosomal protein L34) is a ribosomal protein that is a component of the 60S subunit and is located in the cytoplasm. Expression of the gene encoding the RPL34 protein is known to be regulated by c-MYC and has been shown to be increased in primary invasive and metastatic breast cancer cells and colorectal cancer cells (see, e.g., Zucchi et al., Proc. Nat'l Acad. Sci., (2004); 101:18147-18152; Sjoblom et al., Science, (2006); 314:268-274).
[0105] Certain nucleic acid-binding proteins, such as RBM6 and HEMK1, have also been associated with certain cancers when misregulated and/or mutated. RBM6 (RNA binding protein 6) is a cytosolic protein that binds to poly-G homopolymers in vitro, but its function in vivo is not currently known. The protein is thought to be phosphorylated (potentially by ATM or ATR) in its active form. Without being bound by theory, the gene encoding the protein is located in a portion of the genome whose modification is associated with cancerous transformation, such as in lung carcinomas. Additionally, translocations of the gene which result in aberrant fusion proteins have been reported to be associated with cancer cells (see, e.g., Gu et al., Blood, (2007); 110:323-333). The human HEMK1 (HEMK methyltransferase family protein 1) protein is an S-adenosylmethionine-dependent methyltransferase and is also thought to bind nucleic acids. HEMK1 is considered a tumor suppressor, misregulation of which is associated with various cancers, including prostate cancer, pancreatic cancer and liver cancer (see, e.g., U.S. Pat. App. Pub. No. 2008/0213791).
[0106] Thus, one or more polypeptide probes, such as a fragment of a protein encoded by a gene, or a polypeptide encoded by a sequence of a UTR region of a gene, such as a gene listed in Tables 1, 2, 3 or 4, can be used to detect one or more antibodies, such as autoantibodies, from a sample from a subject. In one embodiment, the polypeptide probe comprises a polypeptide sequence selected from SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof. In another embodiment, the polypeptide probe comprises a polypeptide sequence encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof. In yet another embodiment, the polypeptide probe comprises the full-length or a fragment of a protein that is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
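The relationship between an encoding nucleotide sequence and the peptide it encodes, as paired in Tables 1 through 4, can be illustrated with a minimal Python sketch. It assumes the standard genetic code and a reading frame that starts at the first listed base; the function name `translate` and its parameters are hypothetical and are not part of the disclosure. The example input is the first 24 bases listed for clone 3E4 (Cox5a, SEQ ID NO: 153) in Table 4, whose listed peptide (SEQ ID NO: 123) begins NTLVTYDM.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, frame=0, to_stop=True):
    """Translate a DNA string into one-letter amino acid codes.

    frame   -- 0, 1 or 2: offset of the first codon (hypothetical parameter).
    to_stop -- if True, stop at the first stop codon; otherwise emit '*'.
    Codons containing ambiguous bases (e.g. 'N') are reported as 'X'.
    """
    dna = "".join(dna.upper().split())  # drop the spaces used in the tables
    peptide = []
    for i in range(frame, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        peptide.append(aa)
    return "".join(peptide)


# First 24 bases listed for clone 3E4 (Cox5a) in Table 4.
print(translate("AACACACTTGTTACCTATGATATG"))  # NTLVTYDM
```

Entries marked "minus strand" in the tables would need the listed strand and frame confirmed before applying such a translation; the sketch does not attempt that.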
[0107] In one embodiment, a polypeptide probe is a fragment of a protein encoded by CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, or Deaminase Domain, or may be a polypeptide encoded by a UTR sequence of the gene, such as the 5' or 3' UTR sequence of CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, or Deaminase Domain. In one embodiment, a polypeptide probe can be a fragment of a protein encoded by FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789. In one embodiment, a polypeptide probe comprises a peptide sequence, or fragment thereof, such as those listed in Tables 1, 2, 3, and 4. The polypeptide probe can comprise SEQ ID NO: 2, 5, 9, 11, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, or a fragment thereof. In another embodiment, the polypeptide probe comprises a polypeptide encoded by SEQ ID NO: 16, 19, 23, 25, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, or a fragment thereof.
[0108] In another embodiment, a polypeptide probe is a fragment of a protein encoded by DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, or Hemk1, or may be a polypeptide encoded by a UTR sequence of the gene, such as the 5' or 3' UTR sequence of DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, or Hemk1. In one embodiment, a polypeptide probe can be a fragment of a protein encoded by eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789. In one embodiment, a polypeptide probe comprises a peptide sequence, or fragment thereof, such as those listed in Tables 1 and 2. The polypeptide probe can comprise SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14, or a fragment thereof. In another embodiment, the polypeptide probe comprises a polypeptide encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, or a fragment thereof.
Antibody Profiling Panel
[0109] Also provided herein is an antibody profiling panel. A panel as provided herein can be used to analyze one or more antibodies to a plurality of polypeptide probes, such as one or more autoantibodies. A panel allows for the simultaneous analysis of multiple antibodies, such as autoantibodies, to a plurality of polypeptide probes correlating with carcinogenesis and/or metastasis. For example, a panel can include markers identified as correlating with cancerous tissue, metastatic cancer, localized cancer that is likely to metastasize, pre-cancerous tissue that is likely to become cancerous, and pre-cancerous tissue that is not likely to become cancerous. Depending on the subject, panels may be analyzed alone or in combination in order to provide the best possible diagnosis and/or prognosis.
[0110] In one embodiment, an antibody profiling panel can comprise a plurality of polypeptide probes, wherein one or more of the probes is capable of binding an antibody. In another embodiment an antibody profiling panel can comprise a plurality of probes, wherein one or more of the probes is capable of binding an antibody that targets a foreign antigen. In another embodiment an antibody profiling panel can comprise a plurality of probes, wherein each of the probes is capable of binding an autoantibody.
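As a purely illustrative sketch of how a panel of polypeptide probes might be represented and read out in software, the Python below models each probe by a label and its peptide sequence and reports which probes show signal at or above a cutoff. The class `Probe`, the function `detected_probes`, the signal values, and the threshold are all hypothetical assumptions made for illustration; the disclosure does not specify a data format or a cutoff. The three peptide sequences are those listed in Table 4 for clones 1B10 (Lsm3, SEQ ID NO: 113), 5F8 (Desmocollin 3, SEQ ID NO: 118), and 3C9 (PERP, SEQ ID NO: 121).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Probe:
    """One polypeptide probe on a panel (hypothetical representation)."""
    name: str
    peptide: str


def detected_probes(panel, signals, threshold):
    """Return names of probes whose measured signal meets the threshold.

    panel     -- iterable of Probe objects
    signals   -- dict mapping probe name to a measured binding signal
    threshold -- assumed cutoff; probes with no measurement count as 0.0
    """
    return [p.name for p in panel if signals.get(p.name, 0.0) >= threshold]


# Peptides taken from Table 4; signal values below are invented for the example.
panel = [
    Probe("Lsm3", "MRNDRAASRQIT"),
    Probe("Desmocollin 3", "SAFRGYLANNK"),
    Probe("PERP", "PYQIYQVMIN"),
]
signals = {"Lsm3": 2.4, "Desmocollin 3": 0.3}

print(detected_probes(panel, signals, threshold=1.0))  # ['Lsm3']
```

A real readout would depend on the assay platform and on how signals are normalized, neither of which this sketch addresses.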
[0111] In one embodiment, an antibody profiling panel comprises 2-100 probes, 50-200 probes, 100-500 probes, 200-750 probes, 200-1000 probes, 2-5,000 probes or 2-10,000 probes. In one embodiment, an antibody profiling panel comprises at least about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 polypeptide probes. In another embodiment, an antibody profiling panel comprises at least about 50, 100, 150, 200, 250, 500, 750, 1000, 5000, 10,000, 15,000, 20,000, 25,000, 30,000, 40,000, 50,000, 60,000, 70,000, 75,000, or 100,000 polypeptide probes. In one embodiment, the probes are polypeptide probes. In another embodiment, the probes are molecules that mimic an epitope bound by a particular antibody.
[0112] An antibody profiling panel can comprise at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, or 14 polypeptide probes, wherein the polypeptide probes are a fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, such as genes listed in Tables 1, 2, 3, or 4. In one embodiment, the polypeptide probe comprises a polypeptide sequence selected from SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof. In another embodiment, the polypeptide probe comprises a polypeptide sequence encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof. In yet another embodiment, the polypeptide probe comprises the full-length or a fragment of a protein that is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
[0113] In one embodiment, an antibody profiling panel comprises a plurality of polypeptide probes, wherein at least a subset of the polypeptide probes is a fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, or Deaminase Domain. In one embodiment, the polypeptide probe can comprise a fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789.
[0114] In another embodiment, an antibody profiling panel comprises a plurality of polypeptide probes, wherein at least a subset of the polypeptide probes is a fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, or Hemk1. In one embodiment, the polypeptide probe can comprise a fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789.
[0115] In one embodiment, an antibody profiling panel comprises a plurality of polypeptide probes, wherein at least a subset of the polypeptide probes is a peptide sequence, or fragment thereof, as listed in Tables 1, 2, 3, or 4. In one embodiment, an antibody profiling panel comprises a plurality of polypeptide probes, wherein at least a subset of the polypeptide probes comprises a polypeptide sequence selected from SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof. In another embodiment, an antibody profiling panel comprises a plurality of polypeptide probes, wherein at least a subset of the polypeptide probes comprises a polypeptide sequence encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof. In yet another embodiment, the polypeptide probe comprises the full-length or a fragment of a protein that is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101,102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
[0116] In another embodiment, an antibody profiling panel comprises a plurality of polypeptide probes, wherein at least a subset of the polypeptide probes comprises SEQ ID NO: 2, 5, 9, 11, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, or a fragment thereof. In another embodiment, an antibody profiling panel comprises a plurality of polypeptide probes, wherein at least a subset of the polypeptide probes is encoded by SEQ ID NO: 16, 19, 23, 25, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, or a fragment thereof.
[0117] In one embodiment, an antibody profiling panel comprises a plurality of polypeptide probes, wherein at least a subset of the polypeptide probes comprises SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14, or a fragment thereof. In another embodiment, an antibody profiling panel comprises a plurality of polypeptide probes, wherein at least a subset of the polypeptide probes is encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, or a fragment thereof.
[0118] In one embodiment, an antibody profiling panel can also comprise one or more polypeptide probes of the protein PSA, or fragment of PSA, in combination with one or more of the polypeptide probes discussed herein.
[0119] In one embodiment, an antibody profiling panel can comprise polypeptide probes including a full-length protein or fragment of PSA and one or more polypeptide probes comprising SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof. In another embodiment, an antibody profiling panel can comprise polypeptide probes including a full-length protein or fragment of PSA and one or more polypeptide probes comprising a polypeptide sequence encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof. In yet another embodiment, the polypeptide probe comprises the full-length or a fragment of a protein that is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101,102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
[0120] In another embodiment, an antibody profiling panel can comprise polypeptide probes including a full-length protein or fragment of PSA and a full-length protein encoded by a gene, fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, or Deaminase Domain. In yet another embodiment, an antibody profiling panel can comprise a plurality of polypeptide probes, wherein the probes include a full-length protein or fragment of PSA and a full-length protein encoded by a gene, fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789.
[0121] In another embodiment, an antibody profiling panel can comprise polypeptide probes including a full-length protein or fragment of PSA and a full-length protein encoded by a gene, fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, or Hemk1. In yet another embodiment, an antibody profiling panel can comprise a plurality of polypeptide probes, wherein the probes include a full-length protein or fragment of PSA and a full-length protein encoded by a gene, fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789.
[0122] In another embodiment, an autoantibody profiling panel can comprise a plurality of polypeptide probes, wherein the probes include a full-length protein or fragment of PSA and one or more probes comprising a peptide sequence, or fragment thereof, as listed in Tables 1, 2, 3 and 4. In one embodiment, an autoantibody profiling panel can comprise a plurality of polypeptide probes, wherein the probes include a full-length protein or fragment of PSA and one or more probes comprising SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof. In another embodiment, an autoantibody profiling panel can comprise a plurality of polypeptide probes, wherein the probes include a full-length protein or fragment of PSA and one or more probes comprising a polypeptide sequence encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof. In yet another embodiment, an autoantibody profiling panel can comprise a plurality of polypeptide probes, wherein the probes include a full-length protein or fragment of PSA and one or more probes comprising the full-length or a fragment of a protein that is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
[0123] In another embodiment, an autoantibody profiling panel can comprise a plurality of polypeptide probes, wherein the probes include a full-length protein or fragment of PSA and one or more probes comprising SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, or a fragment thereof.
[0124] In another embodiment, an autoantibody profiling panel can comprise a plurality of polypeptide probes, wherein the probes include a full-length protein or fragment of PSA and one or more probes comprising SEQ ID NO: 2, 5, 9, 11, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, or a fragment thereof; or a polypeptide sequence encoded by a sequence selected from SEQ ID NO: 16, 19, 23, 25, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, or a fragment thereof.
[0125] In one embodiment, a PSA polypeptide probe can be combined with any two or more of the polypeptide probes described herein, such as a polypeptide probe derived from a protein encoded by a gene, fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, Deaminase Domain, FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789.
[0126] In another embodiment, a PSA polypeptide probe can be combined with any two or more of the polypeptide probes described herein, such as a polypeptide probe derived from a protein encoded by a gene, fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, Hemk1, eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789.
[0127] In yet another embodiment, a PSA polypeptide probe can be combined with at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14 of the polypeptide probes disclosed herein, such as those listed in Tables 1, 2, 3, and 4. In one embodiment, a polypeptide probe comprises SEQ ID NO: 2, 5, 9, 11, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, or a fragment thereof. In one embodiment, a polypeptide probe comprises SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14, or a fragment thereof. In another embodiment, a polypeptide probe comprises SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof.
[0128] In another embodiment, a polypeptide probe comprises a polypeptide encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, or a fragment thereof. In another embodiment, a polypeptide probe comprises a polypeptide encoded by SEQ ID NO: 16, 19, 23, 25, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, or a fragment thereof. In yet another embodiment a polypeptide probe comprises a polypeptide encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof.
[0129] In one embodiment, a polypeptide probe disclosed herein is attached to a substrate (e.g., glass slide chip or nanowell chip). A polypeptide probe can be directly or indirectly attached to the substrate. In one embodiment, a polypeptide probe is attached to a substrate via a phage. The substrate can be any physically separable solid to which a polypeptide probe can be directly or indirectly attached including, but not limited to, surfaces provided by microarrays and wells, particles such as beads, columns, optical fibers, wipes, glass and modified or functionalized glass, quartz, mica, diazotized membranes (paper or nylon), polyformaldehyde, cellulose, cellulose acetate, paper, ceramics, metals, metalloids, semiconductive materials, quantum dots, coated beads or particles, other chromatographic materials, magnetic particles; plastics (including acrylics, polystyrene, copolymers of styrene or other materials, polypropylene, polyethylene, polybutylene, polyurethanes, TEFLON®, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses, plastics, ceramics, conducting polymers (including polymers such as polypyrole and polyindole); micro or nanostructured surfaces such as nucleic acid tiling arrays, nanotube, nanowire, or nanoparticulate decorated surfaces; or porous surfaces or gels such as methacrylates, acrylamides, sugar polymers, cellulose, silicates, or other fibrous or stranded polymers.
[0130] The polypeptide probe can be bound to a planar surface or to a particle, such as a bead or microsphere. In one embodiment, the polypeptide probe is attached to a bead. The bead can be a polystyrene, brominated polystyrene, polyacrylic acid, polyacrylonitrile, polyacrylamide, polyacrolein, polydimethylsiloxane, polybutadiene, polyisoprene, polyurethane, polyvinyl acetate, polyvinylchloride, polyvinylpyridine, polyvinylbenzylchloride, polyvinyltoluene, polyvinylidene chloride, polydivinylbenzene, polyglycidylmethacrylate, polymethylmethacrylate, or copolymers, blends, composites, or a combination thereof. The bead can have a diameter of between about 1 nm-1000 μm, 1 nm-500 μm, 5 nm-500 μm, or 10 nm-100 μm. In one embodiment, the bead has a diameter of between about 10 nm and 100 μm. In yet another embodiment, the bead has a diameter of less than about 1000 μm, 500 μm, 400 μm, 300 μm, 200 μm, or 100 μm.
[0131] In one embodiment, the bead is labeled or stained with more than one dye, such as at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 different dyes. In one embodiment, the bead is labeled or stained with two dyes. In another embodiment, the two dyes are hydrophobic. In another embodiment, the two dyes are fluorescent dyes, such as squaric acid-based dyes. In yet another embodiment, the squaric acid-based dyes are selected from cyclobutenedione derivatives, symmetrical and unsymmetrical squaraines, substituted cephalosporin compounds, fluorinated squaraine compositions, alkylalkoxy squaraines, or squarylium compounds. In another embodiment, the squaric acid-based dyes are selected from a red fluorescent dye and an orange fluorescent dye, such as the red fluorescent dye comprising 1,3-bis[(1,3-dihydro-1,3,3-trimethyl-2H-indol-2-ylidene)methyl]-2,4-dihydroxycyclobutenediylium, bis(inner salt) and the orange fluorescent dye comprising 2-(3,5-dimethylpyrrol-2-yl)-4-(3,5-dimethyl-2H-pyrrol-2-ylidene)-3-hydroxy-2-cyclobuten-1-one.
[0132] In one embodiment, the substrate is coated using passive or chemically-derivatized coatings with any number of materials, including polymers, such as dextrans, acrylamides, gelatins or agarose. Such coatings can facilitate the use of the array with a biological sample.
Cancer Screening
[0133] A presence of an immune response to a specific protein expressed in cancerous cells can be indicative of a presence of cancer. Accordingly, the present invention provides a method (e.g., diagnostic or screening method) for detecting a presence of an antibody, such as an autoantibody, to a tumor or tumor-associated antigen. In one embodiment, the presence of an antibody in cancerous but not non-cancerous cells is indicative of the presence of cancer. In one embodiment, the antibody is an antibody to a tumor antigen.
[0134] A method or composition disclosed herein can find utility in the diagnosis, screening, or characterization of a cancer. In one embodiment, a presence of an antibody, such as an autoantibody, to a specific protein can be indicative of a cancer. In another embodiment, detection of an antibody in a sample, such as an autoantibody, can be indicative of a specific stage or sub-type of the same cancer. The information obtained by detecting an antibody as described herein can be used to determine a prognosis or theranosis, wherein an appropriate course of treatment can be determined. In another embodiment, a subject with a specific antibody or stage of cancer can respond differently to a given treatment than individuals lacking the antibody. The information obtained from a method disclosed herein can thus provide for the personalization of diagnosis and treatment.
[0135] In one embodiment, a cancer is characterized by detecting the level or presence or absence of an antibody, such as an autoantibody, in a sample. The cancer can be, but is not limited to, breast cancer, ovarian cancer, lung cancer, colon cancer, hyperplastic polyp, adenoma, colorectal cancer, high grade dysplasia, low grade dysplasia, prostatic hyperplasia, prostate cancer, melanoma, pancreatic cancer, brain cancer (such as a glioblastoma), hematological malignancy, hepatocellular carcinoma, cervical cancer, endometrial cancer, head and neck cancer, esophageal cancer, gastrointestinal stromal tumor (GIST), renal cell carcinoma (RCC) or gastric cancer. The colorectal cancer can be CRC Dukes B or Dukes C-D. The hematological malignancy can be B-Cell Chronic Lymphocytic Leukemia, B-Cell Lymphoma-DLBCL, B-Cell Lymphoma-DLBCL-germinal center-like, B-Cell Lymphoma-DLBCL-activated B-cell-like, or Burkitt's lymphoma. The cancer can also be a premalignant condition, such as Barrett's Esophagus.
[0136] In one embodiment, a method for screening or characterizing a prostate cancer is provided. In one embodiment, the method can comprise detecting in a sample obtained from a subject a presence and/or level of one or more autoantibodies to one or more polypeptide probes comprising a polypeptide probe that is a fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, Deaminase Domain, FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789. A polypeptide probe can also comprise a polypeptide sequence, or a fragment thereof, selected from Table 1, 2, 3 and 4, such as a polypeptide probe comprising SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14, or a fragment thereof, or a polypeptide probe comprising a polypeptide encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, or a fragment thereof. A polypeptide probe can also comprise SEQ ID NO: 12, 5, 9, 11, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, or a fragment thereof, or a polypeptide encoded by SEQ ID NO: 16, 19, 23, 25, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, or a fragment thereof.
[0137] In one embodiment, the method can comprise detecting in a sample obtained from a subject a presence and/or level of one or more autoantibodies to one or more polypeptide probes comprising a polypeptide probe that is a fragment of a protein encoded by a gene, or a fragment encoded by a sequence of a UTR region of a gene, wherein the gene is DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, Hemk1, eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789. A polypeptide probe can also comprise a polypeptide sequence, or a fragment thereof, selected from Table 1 or Table 2, such as a polypeptide probe comprising SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 or 14, or a fragment thereof, or a polypeptide probe encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, or a fragment thereof.
[0138] In yet another embodiment, the method can comprise detecting in a sample obtained from a subject a presence and/or level of one or more autoantibodies to one or more polypeptide probes comprising SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof; or a polypeptide probe encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof; or a polypeptide probe comprising the full-length or a fragment of a protein that is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
[0139] Depending on the results, a cancer (or absence of cancer) can be characterized. For example, in a sample from a subject a presence or level of DCHS1, CEP164 and/or RPS19 autoantibodies is detected, indicating a presence of prostate cancer in the subject. Alternatively, a method further comprises detecting a presence or level of one or more autoantibodies to one or more polypeptide probes comprising a fragment of eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789. The fragment of a protein encoded by eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789 can comprise a polypeptide sequence selected from Table 2.
[0140] A method disclosed herein can comprise detecting a plurality of antibodies, such as through the detection of binding of one or more antibodies that bind to a plurality of polypeptide probes. In one embodiment, the antibodies are autoantibodies. In another embodiment, the antibodies are antibodies to foreign antigens. In one embodiment, the method comprises detecting in a sample one or more antibodies that bind to a panel of polypeptide probes, wherein the panel comprises 2-100 probes, 50-200 probes, 100-500 probes, 200-750 probes, 200-1000 probes, 2-5,000 probes or 2-10,000 probes. In another embodiment, the panel of polypeptide probes comprises at least about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 polypeptide probes. In another embodiment, the panel comprises at least about 50, 100, 150, 200, 250, 500, 750, 1000, 5000, 10,000, 15,000, 20,000, 25,000, 30,000, 40,000, 50,000, 60,000, 70,000, 75,000, or 100,000 polypeptide probes. In one embodiment, the panel comprises a plurality of polypeptide probes, wherein a subset of the probes comprises fragments of the same full-length protein, such that autoantibodies to different epitopes bind to the different probes and indicate a presence of an immune response, or antibody, to the full-length protein.
[0141] A panel comprising multiple polypeptide probes allows for the simultaneous analysis of multiple markers correlating with carcinogenesis and/or metastasis. In one embodiment, a panel includes markers identified as correlating with cancerous tissue, metastatic cancer, localized cancer that is likely to metastasize, pre-cancerous tissue that is likely to become cancerous, pre-cancerous tissue that is not likely to become cancerous, or any combination thereof. Depending on the subject, a panel can be analyzed alone or in combination in order to provide a diagnosis, prognosis, or theranosis. One or more markers for inclusion on a panel can be selected by screening for their diagnostic, prognostic, or theranostic value.
[0142] Any of the proteins listed in Tables 1, 2, 3 or 4, or proteins encoded by the genes listed in Tables 1, 2, 3 or 4, in any combination, can be utilized to detect a presence of an antibody, such as an autoantibody, in a subject. In one embodiment, the protein is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
[0143] In one embodiment, detection of an autoantibody to a protein encoded by a gene, a fragment encoded by a sequence of a UTR region of a gene, or fragment of a protein encoded by a gene, wherein the gene is CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, Deaminase Domain, FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789, or any combination thereof, is indicative of a presence of prostate cancer in a subject. In another embodiment, any combination of two or more proteins (e.g., cancer markers) or fragments thereof is used to detect one or more autoantibodies (e.g., a panel consisting of one or more full-length or fragments of the polypeptides listed in Tables 1, 2, 3, and/or 4).
[0144] In another embodiment, detection of an autoantibody to a protein encoded by a gene, a fragment encoded by a sequence of a UTR region of a gene, or fragment of a protein encoded by a gene, wherein the gene is CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, Hemk1, eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, LOC388789, or any combination thereof, is indicative of a presence of prostate cancer in a subject. In another embodiment, any combination of two or more proteins (e.g., cancer markers) or fragments thereof is used to detect one or more autoantibodies (e.g., a panel consisting of one or more full-length or fragments of the polypeptides listed in Tables 1 and 2).
[0145] In one embodiment, the method comprises detecting one or more antibodies that bind to at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, or 14 polypeptide probes, wherein the polypeptide probes are full-length or fragments of proteins encoded by the genes listed in Tables 1, 2, 3, and/or 4, or polypeptides encoded by the UTR sequence of the gene. In one embodiment, the antibody profiling panel comprises a plurality of polypeptide probes, wherein one or more polypeptide probes is a protein or fragment of a protein encoded by CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, Deaminase Domain, FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789, or any combination thereof. In another embodiment, the antibody profiling panel comprises a plurality of polypeptide probes, wherein one or more polypeptide probes is a protein or fragment of a protein encoded by DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, Hemk1, eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, LOC388789, or any combination thereof.
[0146] The cancer can be characterized with increased accuracy, such as with increased specificity, sensitivity, or both. The sensitivity can be determined by: (number of true positives)/(number of true positives+number of false negatives), whereas the specificity can be determined by: (number of true negatives)/(number of true negatives+number of false positives).
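The two ratios above can be illustrated with a minimal Python sketch. The function names and the example counts are hypothetical and chosen only for illustration; they are not taken from the disclosure.

```python
def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Return TP / (TP + FN), the fraction of true cases that are detected."""
    total = true_positives + false_negatives
    if total == 0:
        raise ValueError("sensitivity is undefined without any true cases")
    return true_positives / total


def specificity(true_negatives: int, false_positives: int) -> float:
    """Return TN / (TN + FP), the fraction of non-cases correctly called negative."""
    total = true_negatives + false_positives
    if total == 0:
        raise ValueError("specificity is undefined without any non-cases")
    return true_negatives / total


# Hypothetical counts: 45 of 50 cancer samples and 80 of 100 control samples
# are classified correctly.
example_sensitivity = sensitivity(true_positives=45, false_negatives=5)
example_specificity = specificity(true_negatives=80, false_positives=20)
```

With these hypothetical counts the sketch gives 45/50 for sensitivity and 80/100 for specificity.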
[0147] In one embodiment, the cancer can be characterized (e.g., detected, prognosed, etc.) with at least 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55,60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% sensitivity. In another embodiment, the cancer can be characterized (e.g., detected, prognosed, etc.) with at least 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55,60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% specificity.
[0148] Specificity or sensitivity of detection can be altered by altering the polypeptide probe make-up of a panel. In one embodiment, sensitivity of a diagnostic, prognostic, or theranostic assay (e.g., an antibody detection assay, such as an autoantibody detection assay) can be increased by increasing the number of probes, increasing the diversity of probes (e.g., utilizing probes comprising distinct epitopes from the same and/or different markers), or tailoring the probes to a particular subject or cancer to be diagnosed/prognosed. Furthermore, the confidence level for determining the specificity, sensitivity, or both, may be at least 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99% confidence.
[0149] A method and system disclosed herein can also comprise detecting a plurality of antibodies, such as through the detection of antibodies binding to a plurality of polypeptide probes, and characterizing or screening for a cancer with increased or greater specificity as compared to a characterization based on detection of antibodies that bind to less than the plurality of polypeptide probes. In one embodiment, the antibodies are autoantibodies. In another embodiment, the antibodies are to foreign antigens.
[0150] Two or more polypeptide probes can be used to diagnose a particular cancer. For example, a cancer can be diagnosed by measuring the binding of autoantibodies to two polypeptide probes. The number of polypeptide probes useful for diagnosing a cancer includes, but is not limited to, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, and 20 polypeptide probes. In another embodiment, prostate cancer is diagnosed with 5 or more polypeptide probes. In one embodiment, prostate cancer is diagnosed with 5 polypeptide probes, which provides a diagnosis that has a higher sensitivity as compared to using fewer than 5 polypeptide probes. In another embodiment, prostate cancer is diagnosed with 10 or more polypeptide probes. In another embodiment, a prostate cancer is diagnosed with 10 polypeptide probes, which provides a diagnosis that has a higher specificity as compared to using fewer than 10 polypeptide probes.
Antibody Detection
[0151] The level, presence or absence of an antibody can be determined by detecting the binding of one or more autoantibodies to a polypeptide probe. Detection of an antibody can be either quantitative or qualitative. For quantitative assays, the amount of antibody detected can be compared to a control or reference to determine whether an antibody is overexpressed or underexpressed in a sample. For example, the control or reference can be a normal sample or a sample from a known disease state, such as a cancer sample.
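As a non-limiting illustration of comparing a quantitative result to a control or reference, the following Python sketch classifies a sample signal relative to the mean of reference signals. The function name, the use of a mean, and the two-fold cutoff are assumptions made for this example only; the disclosure does not prescribe a particular comparison rule.

```python
from statistics import mean


def classify_level(sample_signal: float,
                   reference_signals: list[float],
                   fold_cutoff: float = 2.0) -> str:
    """Compare a sample signal with the mean of reference signals.

    The fold_cutoff of 2.0 is a hypothetical value, not one from the disclosure.
    """
    reference = mean(reference_signals)
    if reference <= 0:
        raise ValueError("reference signal must be positive")
    ratio = sample_signal / reference
    if ratio >= fold_cutoff:
        return "elevated"      # more antibody binding than the reference
    if ratio <= 1.0 / fold_cutoff:
        return "reduced"       # less antibody binding than the reference
    return "comparable"


# Hypothetical signals for one probe, in arbitrary fluorescence units.
normal_reference = [100.0, 120.0, 80.0]
call = classify_level(500.0, normal_reference)
```

The reference values here stand in for normal samples; signals from a known disease state could be supplied in the same way.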
[0152] Antibody binding to a polypeptide probe can be detected by techniques known in the art, such as, but not limited to, radioimmunoassay, ELISA (enzyme-linked immunosorbant assay), "sandwich" immunoassays, immunoradiometric assays, gel diffusion precipitation reactions, immunodiffusion assays, in situ immunoassays (e.g., using colloidal gold, enzyme or radioisotope labels, for example), Western blots, precipitation reactions, agglutination assays (e.g., gel agglutination assays, hemagglutination assays, etc.), complement fixation assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays. Any of the assays used can be quantitative or qualitative, as desired.
[0153] An antibody bound to a polypeptide probe can be detected using labeling technology. For example, one or more antibodies in a sample collected from a subject to be tested can be directly labeled (e.g., with a fluorescent or radioactive label) and exposed to a polypeptide probe or probe panel. Detection of a signal from the interaction can be achieved using methodology appropriate to the type of label used (e.g., fluorescence microscopy can be used to detect binding of a fluorescently labeled autoantibody to a polypeptide probe). In one embodiment, an autoantibody is detected by detecting binding of a labeled secondary antibody or other antibody-binding reagent which specifically binds to the antibody bound to the polypeptide probe (e.g., a "sandwich immunoassay"). Many methods are known in the art for detecting binding in an immunoassay and are within the scope of the present invention. In one embodiment, the immunoassay described in U.S. Pat. Nos. 5,599,677, 5,672,480, or both, each of which is herein incorporated by reference, is used.
[0154] In one embodiment, automation is utilized to detect binding of one or more autoantibodies to a polypeptide probe or probe panels. Methods for the automation of immunoassays include those described in U.S. Pat. Nos. 5,885,530, 4,981,785, 6,159,750, and 5,358,691, each of which is herein incorporated by reference. Analysis and/or presentation of results can also be automated. In one embodiment, a computer with software that analyzes raw data and generates a prognosis, diagnosis, or theranosis based on the level, presence or absence of antibody binding to one or more polypeptide probes is used. A computer-based analysis program can be used to translate the raw data generated by the detection assay (e.g., a presence, absence, or amount of antibody binding to one or more polypeptide probes) into data of predictive value for a clinician. The clinician can access the predictive data using any suitable means. In one embodiment, the data is transmitted over a network. In another embodiment, the data is accessible by a clinician.
[0155] Any method capable of receiving, processing, and transmitting the information to and from a laboratory conducting the assay, medical personnel, and a subject can be used. In one embodiment, a sample (e.g., a biopsy or a serum or urine sample) is obtained from a subject and submitted to a profiling service (e.g., clinical lab at a medical facility, genomic profiling business, etc.), located in any part of the world (e.g., in a country different than the country where the subject resides or where the information is ultimately used) to generate raw data. In one embodiment, the sample comprises a tissue or other biological sample and the subject visits a medical center to have the sample obtained and sent to the profiling center. In another embodiment, a subject collects the sample themself (e.g., a buccal swab) and directly sends it to a profiling center. In another embodiment, the sample comprises previously determined biological information. The information can be directly sent to the profiling service by the subject (e.g., an information card containing the information may be scanned by a computer and the data transmitted to a computer of the profiling center using an electronic communication system). Upon being received by the profiling service, a sample can be processed and a profile produced (i.e., antibody level, presence or absence of antibody). A profile generated can be specific for the diagnostic, prognostic, or theranostic information desired for a subject. 
In one embodiment, a sample from a subject is analyzed for a presence or expression level of one or more antibodies to one or more proteins encoded by a gene, fragment of one or more proteins encoded by a gene, or fragment encoded by a UTR region of a gene, wherein the gene is CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, Deaminase Domain, FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789. In one embodiment, the antibodies are autoantibodies. In another embodiment, a sample from a subject is analyzed for a presence or expression level of one or more antibodies to one or more proteins encoded by a gene, fragment of one or more proteins encoded by a gene, or fragment encoded by a UTR region of a gene, wherein the gene is DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, Hemk1, eIF4G1, BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789. In one embodiment, the antibodies are autoantibodies.
[0156] Profile data can be prepared in a format suitable for interpretation by a treating clinician. In one embodiment, rather than providing raw expression data, the prepared format represents a diagnosis, screening or risk assessment (e.g., likelihood of metastasis or PSA failure or the development of high prostate specific antigen levels in a patient following prostate cancer therapy (e.g., surgery)) for the subject, along with recommendations for particular treatment options. The data can be displayed to the clinician by any suitable method. In one embodiment, the profiling service generates a report that is printed for the clinician (e.g., at the point of care). In another embodiment, the report is displayed to the clinician on a computer monitor.
[0157] In one embodiment, the information is first analyzed at the point of care or at a regional facility. The raw data is then sent to a central processing facility for further analysis. In one embodiment, further analysis comprises converting the raw data to information useful for a clinician or subject, such as a patient. The central processing facility can provide the advantage of privacy (all data is stored in a central facility with uniform security protocols), speed, and uniformity of data analysis. The central processing facility can also control the fate of the data following treatment of a subject. In one embodiment, using an electronic communication system, the central facility provides data to the clinician, the subject, researchers, or any other individual. In one embodiment, a subject is able to directly access the data using the electronic communication system. In another embodiment, a subject chooses further intervention or counseling based on the result. In one embodiment, the data is used for research use. The data can be used to further optimize the inclusion or elimination of markers as useful indicators of a particular condition or stage of disease.
Antibody Test
[0158] The detection of one or more antibodies from a sample, such as described herein, can be used in conjunction with one or more other tests used for detecting or screening for cancer. The antibody detection can be used prior to, concurrent with, or subsequent to one or more other tests. In one embodiment, a genetic test for a mutation or expression level of one or more genes can be used in conjunction with determining the antibody profile of a subject.
[0159] Antibody detection can provide a non-invasive, inexpensive means for detecting or screening for a cancer. Thus, in one embodiment, the detection of a level, presence or absence of one or more antibodies can be used to determine whether a second sample or additional analysis of a sample from a subject is to be performed. In one embodiment, after detecting an expression level of one or more antibodies in a sample obtained from a subject to one or more polypeptide probes comprising a fragment of a protein encoded by, or a polypeptide encoded by a UTR sequence of, CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, Deaminase Domain, FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789, a biopsy can be recommended for the subject. In another embodiment, after detecting an expression level of one or more antibodies in a sample obtained from a subject to one or more polypeptide probes comprising a fragment of a protein encoded by, or a polypeptide encoded by a UTR sequence of, DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, Hemk1, eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789, a biopsy can be recommended for the subject.
[0160] In another embodiment, an expression level for one or more antibodies from a subject can be detected, and based on the expression level of the one or more antibodies, the subject can be identified as suspected of having cancer. In one embodiment, the subject is characterized as having a high probability or likelihood of having cancer. Based on the detection or expression level of the one or more antibodies, a recommendation that a biopsy be obtained can be made for the subject. In another embodiment, if there is a lack of detection or expression of the one or more antibodies, further analysis is not recommended and a biopsy is not obtained (see, for example, FIG. 1, "Autoantibody Test I").
[0161] In another embodiment, prior to detecting one or more antibodies from a subject, the subject is suspected of having cancer. The subject can have had a genetic test for a mutation or gene expression analysis, image analysis (such as magnetic resonance imaging (MRI), positron emission tomography (PET) scan, computerized tomography (CT) scan, nuclear magnetic resonance (NMR)), or biopsy, and have inconclusive or uncertain results. Thus, prior to further analysis and treatment for a suspected cancer, the subject can seek further verification of their likelihood of having a cancer, or their diagnosis, prognosis, or theranosis of a cancer.
[0162] In one embodiment, an antibody profiling panel described herein can be used in conjunction with a separate test which determines a presence or level of PSA (e.g., a serum PSA test). In one embodiment, the panel is utilized to diagnose or prognose a presence of a cancer (e.g., prostate cancer) in a subject. In one embodiment, a subject is suspected of having prostate cancer based on their PSA level, age, or both. A subject can be male and over 30, 35, 40, 45, 50, 55, 60, 65, 70 or 75 years of age. In another embodiment, the subject is between 30-80, 40-75, 45-75, or 50-75 years of age. In another embodiment, the subject had a PSA blood test, digital rectal exam, or both. In yet another embodiment, the subject may have a PSA level of at least about 1.0, 1.5, 2.0, 2.5, or 4.0 ng/ml. The subject can have a PSA level of between about 1.0-15 ng/ml, 2.0-15 ng/ml, or 2.5-10 ng/ml.
[0163] In one embodiment, a biological sample from a subject, such as a subject with a PSA level greater than about 2.5 ng/ml, is contacted with one or more probes for an antibody, such as one or more probes for an autoantibody. Based on the expression level of the antibody, a biopsy for the subject can be recommended (see for example FIG. 1, "Autoantibody Test I"). The antibody test can comprise detecting one or more antibodies in a sample that bind to a polypeptide probe as described herein. In another embodiment, the antibody test is an autoantibody test.
[0164] In one embodiment, the antibody binds a polypeptide probe comprising SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, or a fragment thereof. In another embodiment, the antibody binds a polypeptide probe comprising a polypeptide sequence encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, or a fragment thereof. In yet another embodiment, the antibody binds a polypeptide probe comprising a full-length protein or a fragment of a protein that is encoded by SEQ ID NO: 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or a fragment thereof.
[0165] In one embodiment, the antibody binds a polypeptide probe comprising a full-length or fragment of a protein encoded by, or a polypeptide encoded by a UTR of, CEP164, RPL34, BRMSL1, NKX3-1, RPSA, Cytochrome C oxidase 5 Subunit, UTR-region of chromosome 11, MAPKKK9, cDNA clone XR_113641.1, PSA, H2aa4, UBE2I, TIMP2, WDR77, Deaminase Domain, FAM53B, 5'UTR BMI1, RP3-323M22, or LOC388789. In one embodiment, a polypeptide probe comprises SEQ ID NO: 2, 5, 9, 11, 14, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, or a fragment thereof. In another embodiment, a polypeptide probe comprises a polypeptide encoded by SEQ ID NO: 16, 19, 23, 25, 28, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, or a fragment thereof.
[0166] In another embodiment, the antibody binds a polypeptide probe comprising a full-length or fragment of a protein encoded by, or a polypeptide encoded by a UTR of, DCHS1, CEP164, KBTBD6, RPS19, RPL34, RNA binding protein 6, Hemk1, eIF4G1, 5'UTR BMI1, BRD2, RP3-323M22, SFRS14, or LOC388789. In one embodiment, a polypeptide probe comprises SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or a fragment thereof. In another embodiment, a polypeptide probe comprises a polypeptide encoded by SEQ ID NO: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, or a fragment thereof.
[0167] If a biopsy is recommended and the biopsy is positive for a cancer such as prostate cancer, a biological sample obtained from the subject can be contacted with one or more probes for an antibody, which can be the same as or different from those used in deciding whether to obtain a biopsy. Based on the expression level of antibodies in the sample, a prognosis for the cancer can be provided (see, for example, FIG. 1, "Autoantibody Test II").
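The two-stage flow described above (a first autoantibody test informing the biopsy decision, and a second test after a positive biopsy) can be sketched as a simple decision function. This is an illustrative sketch only: the function name, the returned strings, the default 2.5 ng/ml cutoff (taken from the example PSA level mentioned above), and the handling of a negative biopsy are assumptions, and the sketch is not a clinical decision rule.

```python
def recommend_next_step(psa_ng_ml, test1_positive=None, biopsy_positive=None,
                        psa_cutoff=2.5):
    """Return the next step in a hypothetical two-stage autoantibody workflow."""
    if psa_ng_ml <= psa_cutoff:
        return "PSA at or below cutoff: autoantibody test not triggered"
    if test1_positive is None:
        return "run Autoantibody Test I"       # no first-stage result yet
    if not test1_positive:
        return "no biopsy recommended"         # antibodies not detected
    if biopsy_positive is None:
        return "recommend biopsy"              # Test I positive, biopsy pending
    if biopsy_positive:
        return "run Autoantibody Test II for prognosis"
    return "biopsy negative: Test II not triggered"  # assumption, not from the text


step_initial = recommend_next_step(3.1)
step_after_test1 = recommend_next_step(3.1, test1_positive=True)
step_after_biopsy = recommend_next_step(3.1, test1_positive=True, biopsy_positive=True)
```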
[0168] Thus, in one embodiment, a method of characterizing or screening for a cancer from a subject with a positive biopsy result is provided. In another embodiment, the subject has not yet provided a sample for detecting one or more antibodies. In yet another embodiment, the subject has provided an initial sample for detecting one or more antibodies and detection of the one or more antibodies is used in deciding whether a biopsy is obtained. Furthermore, in one embodiment, detection of one or more antibodies is used for a diagnosis, prognosis or theranosis of a cancer, such as prostate cancer. In one embodiment, the method comprises detecting an expression level for one or more antibodies, wherein the expression level of the one or more antibodies is indicative of the presence, absence, or stage of the cancer. In another embodiment, the indication is whether the cancer is aggressive or indolent.
[0169] In one embodiment, a cancer is classified based on the detection of one or more antibodies to one or more polypeptide probes disclosed herein. In one embodiment, the cancer is classified as aggressive or malignant. In another embodiment, the cancer is classified as indolent or benign. Furthermore, after classification, detection of one or more antibodies from a sample from the subject can be used to select a treatment or therapeutic for the cancer.
[0170] The present disclosure is not limited to the embodiments described above, but is capable of modification within the scope of the appended claims. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents of the specific embodiments of the present disclosure described herein.
EXAMPLES
Example 1
Probe Selection
[0171] Construction of T7 Phage Display Prostate Cancer cDNA Library
[0172] mRNA was isolated from total RNA following Novagen's Straight A's mRNA isolation protocol. The OrientExpress cDNA synthesis and cloning systems were used for the construction of T7 phage prostate cancer cDNA libraries.
[0173] To eliminate the 3' bias inherent in oligo(dT)-primed libraries, two libraries were constructed in parallel, one using a directional oligo(dT) primer and one using a random primer. After amplification, the two libraries were combined at equal titers.
[0174] Enrichment of Cancer Specific T7 Phage Library.
[0175] Protein A/G agarose beads (Pierce Biotechnology, Rockford, Ill.) were used to purify IgGs from the serum of prostate cancer patients. To enhance the selection of epitopes binding to IgGs specifically associated with prostate cancer, a dual procedure was performed.
[0176] First, a pre-clearing step was used to remove nonspecific clones by pre-absorbing the phage epitope libraries onto IgGs purified from a normal serum pool from 10 control men. Next, the pre-cleared phage libraries were selected on the pool of IgGs purified from the serum of 6 localized prostate cancer patients. In essence, the protein-A/G agarose beads serve to purify IgGs from the serum. Fifty μl of protein-A/G agarose beads were placed into a 1.5 ml Eppendorf tube and washed two times with 1×PBS. Washed beads were blocked with 4% nonfat milk at 4° C. for 1 hr. The beads were then incubated at 4° C. with 15 μl of pooled control sera at 1:30 dilution with 4% nonfat milk. After at least 2 hrs of incubation, the beads were washed three times with 1×PBS and then incubated with the phage library (~10^10 phage particles) at 4° C. for at least 2 hrs. The mixture was centrifuged at 3000 rpm for 2 min. The beads with nonspecifically bound phage particles were discarded and the supernatant was collected for further immunoscreening.
[0177] Fifty μl of fresh protein-A/G agarose beads were washed and blocked as described above. The beads were then incubated at 4° C. for 3 hrs with 500 μl of PBS containing 15 μl of the patient sera pool at a 1:30 dilution. This amount of serum provides a three-fold molar excess of IgG over the calculated protein-A/G binding capacity. The beads were washed three times with 1×PBS and then incubated with the phage library supernatant from above, which was allowed to react with the antibodies on the beads at 4° C. overnight. The mixture was centrifuged at 3000 rpm for 2 min and the supernatant was discarded. The beads were then washed three times with 1×PBS.
[0178] To elute the bound phage, 100 μl of 1% SDS was used to break up the antibody-antigen reaction without disrupting the T7 phage particles. The mixture of phage and elution buffer was incubated at room temperature for 10 min. The bound phages were removed from the beads by centrifugation at 8000 rpm for 8 min. Eluted phages were transferred to 10 ml of BLT5403 bacterial cells at OD600 = 0.6-0.8 for amplification. Four or five cycles of affinity selection and biopanning were carried out, with amplification of phage particles after each round of biopanning.
[0179] High Throughput Epitope Detection Using Phage Microarrays.
[0180] Random phage colonies were picked and amplified in 96-well plates. Fresh phage lysates were spotted onto FAST™ nitrocellulose-coated glass slides (Schleicher & Schuell, Keene, N.H.). Extra T7 empty phage spots were spotted in quadruplicate as a negative reference for normalizing the signal values from different slides. The arrays were dried overnight at room temperature. Before processing with serum, the arrays were rinsed briefly in 4% nonfat milk/PBS with 0.1% Tween-20 to remove unbound phage, then transferred immediately to 4% nonfat milk/PBS as a blocking solution for 1 hr at room temperature. Without allowing the array to dry, 2 ml of PBS containing human serum and T7-tag antibody (Novagen) at dilutions of 1:500 and 1:5000, respectively, was applied to the surface in a screw-top slide hybridization tube.
[0181] The arrays were incubated at room temperature for 1 hour, and then washed gently three times in PBS/0.1% Tween-20 solution for 10 min each. All washes were performed at room temperature. After washing, the arrays were incubated with 2 ml of PBS containing Cy3-labeled goat anti-mouse antibody and Cy5-labeled goat anti-human antibody (Jackson ImmunoResearch), each at a dilution of 1:5,000, for 1 hr in the dark. Three washes were performed using PBS/0.1% Tween-20 solution for 10 min each. The arrays were then dried using a stream of compressed air and scanned using 532 nm and 635 nm lasers (Axon Laboratories).
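The two-channel readout described above lends itself to a per-spot ratio. The following minimal sketch assumes that the ratio is taken as the Cy5 signal (anti-human, reporting serum antibody binding) over the Cy3 signal (anti-mouse, reporting the T7-tag antibody and hence the amount of phage on the spot); the direction of the ratio, the floor value, the function name, and the example intensities are assumptions and are not stated in the text.

```python
def spot_ratio(cy5_signal, cy3_signal, floor=1.0):
    """Cy5/Cy3 ratio for one spot; the floor guards against division by ~0."""
    return cy5_signal / max(cy3_signal, floor)


# Hypothetical background-corrected intensities: (Cy5, Cy3) per spotted clone.
spots = {"clone_A": (5200.0, 1300.0), "clone_B": (800.0, 1600.0)}
ratios = {name: spot_ratio(cy5, cy3) for name, (cy5, cy3) in spots.items()}
```

Dividing by the phage-capsid channel is one way to keep a spot with more deposited phage from appearing more reactive than it is.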
[0182] Building Predictor and Validation of Biomarker Profile.
[0183] The arrays were quantified using GenePix software (Axon Laboratories). The median ratio of the negative control spots was subtracted from the raw ratios of each array, based on the observation that the signal for the negative T7 empty phage on each chip correlates very well with the signal intensity of the whole array. A Z-transformation was then applied to each clone so that, across arrays, the mean of each clone is zero and the standard deviation is 1. Because the presence of antibodies specific to cancer was being tested, epitopes with high reactivity in controls and low reactivity in patients were not expected. A GA/KNN algorithm, a machine learning method, was employed to calibrate the system. Briefly, the data set was randomly separated into a training set and a test set. In the training set, a genetic algorithm (GA) was used to select optimized solutions (here, subsets of clones) with good fitness. The fitness of a solution was assessed by its ability to classify the training samples using k-nearest neighbor (KNN) analysis (k=3 here). The fitness score was defined as the number of correctly classified training samples divided by the total number of training samples. The fitness score was required to be equal to or greater than 0.95. After 4000 optimized solutions were obtained, clones were ranked by their frequency in the solutions and the top clones were used to predict the test samples. This cycle of sample partition, solution searching, clone ranking and test sample prediction was repeated 10 times, and high-ranked clones were selected as the optimized classifier.
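The normalization, fitness scoring, and clone ranking steps described above can be sketched as follows. This is an illustrative sketch under stated assumptions, not the implementation used in the Example: the genetic algorithm search itself is omitted, the fitness is computed by leave-one-out classification (an assumption, since the text does not state how training samples were held out), the population standard deviation is used for the Z-transformation, and all function names and example values are hypothetical.

```python
from collections import Counter
from statistics import mean, median, pstdev


def subtract_negative_control(ratios, control_ratios):
    """Subtract the median ratio of the empty-phage control spots on one array."""
    offset = median(control_ratios)
    return [r - offset for r in ratios]


def z_transform(values):
    """Scale one clone's values across arrays to mean 0 and standard deviation 1."""
    mu, sd = mean(values), pstdev(values)
    if sd == 0:
        return [0.0 for _ in values]  # constant clone carries no information
    return [(v - mu) / sd for v in values]


def knn_fitness(samples, labels, clones, k=3):
    """Leave-one-out k-nearest-neighbor accuracy using only the chosen clones."""
    correct = 0
    for i, sample in enumerate(samples):
        # Squared Euclidean distance to every other sample, restricted to `clones`.
        distances = sorted(
            (sum((sample[c] - other[c]) ** 2 for c in clones), labels[j])
            for j, other in enumerate(samples) if j != i
        )
        votes = Counter(label for _, label in distances[:k])
        if votes.most_common(1)[0][0] == labels[i]:
            correct += 1
    return correct / len(samples)


def rank_clones(solutions):
    """Rank clones by how often they appear across the selected solutions."""
    counts = Counter(clone for solution in solutions for clone in solution)
    return [clone for clone, _ in counts.most_common()]


# Hypothetical normalized data: rows are samples, columns are clones.
samples = [[2.0, 0.1], [1.8, -0.2], [2.2, 0.0],
           [-2.0, 0.1], [-1.9, -0.1], [-2.1, 0.2]]
labels = ["cancer"] * 3 + ["control"] * 3
fitness = knn_fitness(samples, labels, clones=[0], k=3)

# Stand-in for solutions that a GA search would return (clone index subsets).
ranking = rank_clones([[0, 3], [0, 1], [3, 0]])
```

A solution would be kept in such a scheme only if its fitness met the 0.95 threshold stated above; in this toy data set the single informative clone separates the two groups completely.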
Sequence CWU
1
172114PRTArtificial SequenceSynthetic 1Pro Gln Thr Thr Ala Pro Arg Arg Ala
Arg Pro Arg Arg Ser1 5 10225PRTArtificial
SequenceSynthetic 2Pro Val Ser Ser Ser Gly Ser Tyr Ser Thr Pro Ile Arg
Lys Ser Leu1 5 10 15Arg
Arg Ala Ala Pro Pro Phe Arg Ala 20
2537PRTArtificial SequenceSynthetic 3Ser Ser Phe Ser Pro Leu Asn1
5455PRTArtificial SequenceSynthetic 4Ala Ala Arg Arg Pro His Asp
Ala Trp Ser Tyr Cys Lys Arg Arg Glu1 5 10
15Pro Ala Gly Val Xaa Gln Ser Ser Gly Ser Leu Pro Gln
Lys Val Arg 20 25 30Glu Ala
Glu Ser Pro Arg Met Gly Gly Tyr Arg Gln Ala Gly Gln Ala 35
40 45Gln Arg Ala Cys Ser Leu Arg 50
55564PRTArtificial SequenceSynthetic 5Gln Ala Arg Leu Phe Ile Phe
Ile Thr Gln Lys Ser Phe Ile Phe Leu1 5 10
15Phe Ser Phe Leu Thr Leu Cys Leu Cys Leu Gln His Phe
His Asn Asp 20 25 30Phe Leu
Leu Leu Asp Lys Glu Ser Thr Leu Asp Pro Val Thr Asn Thr 35
40 45Phe Ser Thr His Gly Thr Lys Thr Leu Leu
Leu Thr Ser Leu Phe Leu 50 55
60622PRTArtificial SequenceSynthetic 6Leu Arg Gly Ile Thr Lys Asn Asp Arg
Asn Phe Asn Arg Lys Ile His1 5 10
15Leu Asn Trp Ile Ser Lys 20710PRTArtificial
SequenceSynthetic 7Arg Gly Cys Cys Ala Gly Ile Arg Cys Thr1
5 108118PRTArtificial SequenceSynthetic 8Ile Arg Asp
Pro Asn Gln Gly Gly Lys Asp Ile Thr Glu Glu Ile Met1 5
10 15Ser Gly Ala Arg Thr Ala Ser Thr Pro
Thr Pro Pro Gln Thr Gly Gly 20 25
30Gly Leu Glu Pro Gln Ala Asn Gly Glu Thr Pro Gln Val Ala Val Ile
35 40 45Val Arg Pro Asp Asp Arg Ser
Gln Gly Ala Ile Ile Ala Asp Arg Pro 50 55
60Gly Leu Pro Gly Pro Glu His Ser Pro Ser Glu Ser Gln Pro Ser Ser65
70 75 80Pro Ser Pro Thr
Pro Ser Pro Ser Pro Val Leu Glu Pro Gly Ser Glu 85
90 95Pro Asn Leu Ala Val Leu Ser Ile Pro Gly
Asp Thr Met Thr Thr Ile 100 105
110Gln Met Ser Val Glu Glu 115922PRTArtificial SequenceSynthetic
9Gly Gly Gly Arg Gly Ala Gly Gly Gly Arg Gly Ala Gly Ala Gly Gly1
5 10 15Gly Arg Pro Glu Ala Ala
201089PRTArtificial SequenceSynthetic 10Glu Ser Arg Pro Met Ser
Tyr Asp Glu Lys Arg Gln Leu Ser Leu Asp1 5
10 15Ile Asn Lys Leu Pro Gly Glu Lys Leu Gly Arg Val
Val His Ile Ile 20 25 30Gln
Ala Arg Glu Pro Ser Leu Arg Asp Ser Asn Pro Glu Glu Ile Glu 35
40 45Ile Asp Phe Glu Thr Leu Lys Pro Ser
Thr Leu Arg Glu Leu Glu Arg 50 55
60Tyr Val Leu Ser Cys Leu Arg Lys Lys Pro Arg Lys Pro Tyr Ser Thr65
70 75 80Tyr Glu Met Arg Phe
Ile Ser Trp Phe 851111PRTArtificial SequenceSynthetic
11Leu Val Ser Ile Leu Leu Thr Lys Thr Ile Tyr1 5
101284PRTArtificial SequenceSynthetic 12Lys Ala Glu Cys Phe Lys
Asn Leu Ile Val Lys Lys Gln Lys Ser Leu1 5
10 15Cys Ser Gly Phe Lys Glu His Leu Asn Glu Ala Ser
Ile Leu Ala Gln 20 25 30Val
Ser Val Ser Ser Ser Lys Arg Val Trp Lys Ser Trp Glu Asn Leu 35
40 45Ile Ser Ser Phe Met Val Trp Asn Pro
Ala His Leu Ile Ile Ser Ile 50 55
60Pro Asn Leu Glu Lys Thr Ser Asp Leu Ser Met Met Ser Lys Leu Ala65
70 75 80Ala Ala Leu
Glu1392PRTArtificial SequenceSynthetic 13Gln Arg Ser Gly Arg Asp Asn Gly
Asp Val Gly Ala Gly Ala Pro Phe1 5 10
15Arg Leu Ser Ser Thr Ser Gln Pro Arg Arg Ile Lys Pro Ile
Ala Pro 20 25 30Pro Pro Arg
Ala Pro Ser Pro Glu Xaa Gly Ala Gly Gly Gly Gly Gly 35
40 45Gly Arg Gly Gly Gly Gly Gly Gly Pro Gly Gly
Gly Gly Val Gly Gly 50 55 60Arg Gly
Gly Gly Gly Gly Gly Gly Gly Arg Gly Ala Gly Gly Gly Arg65
70 75 80Gly Ala Gly Ala Gly Gly Gly
Arg Pro Glu Ala Ala 85
901477PRTArtificial SequenceSynthetic 14Pro Ala Ser Ala Ser Ile Leu Ala
Gly Val Pro Met Tyr Arg Asn Glu1 5 10
15Phe Thr Ala Trp Tyr Arg Arg Met Ser Val Val Tyr Gly Ile
Gly Thr 20 25 30Trp Ser Val
Leu Gly Ser Leu Leu Tyr Tyr Ser Arg Thr Met Ala Lys 35
40 45Ser Ser Val Asp Gln Lys Asp Gly Ser Ala Ser
Glu Val Pro Ser Glu 50 55 60Leu Ser
Glu Arg Pro Ser Leu Arg Pro His Ser Ser Asn65 70
7515221DNAArtificial SequenceSynthetic 15agctttcgct agagacgcct
ccataagtca cttgcccgtt ggcccccacg atcggggtcg 60gttgctcgca gggctgagca
gagatgtgcc aggagggttg ttctcacgca agaggacgct 120gtactcctgc tgctggaaag
taggcgcctc gtcgttgacg tcagcgacac tgacggtcag 180gacctgcgtg gccgagcgcg
gcggggagcc gtggtctgag g 22116267DNAArtificial
SequenceSynthetic 16tggaggagag gctgggctgc cccaagcccc tgctcagggc
ctcagaagcc atacaccttc 60actctgattg tgctcatcaa ggcccagcat gcaggaggct
caaagtagct tttggcttgg 120gtgttgacga gaagagaggt aacctggggt cattcttgac
acgttccagc cacctccggt 180tggcctcaat tatgccctga aaggtggtgc tgcccgcctc
agggacttgc gaatgggagt 240gctgtaggag ccggagctgc tcactgg
26717168DNAArtificial SequenceSynthetic
17gaattcgtca ttctcacctt tgaattaaag cttagactaa atagtaatat atcgtgggaa
60ggattttggt tttgtgatat ttctgtgaat taaggaatag atgttaacca ttattttgta
120gaaaagtgat ttgtatgtgg ttaattataa ataaaactgg taccagaa
16818481DNAArtificial SequenceSynthetic 18tttattaacc cagcatggtt
tgttctaatg cttcttgttg gcagctgcca cctgtccggc 60gattctgtcc agatctcttt
gtccctgagg tgtcagtttg cggccgccat cttggtcctt 120ttccaccatt ttcagcccct
ccagggcttg gaggacccgg cgggccacac tcttggagcc 180tcggctgaag tggctgggca
tgacgccgtt tctctgacgt cccccataga tcttggtcat 240ggagccaacc ccagcgccac
cccggaggta caggtgccgc gctgtgnaag cagctcgcgt 300gtagaaccag ttctcatcgt
agggagcaag ctctttgtgc ttggccagct tgacggtatc 360cacccattcg gggactttca
gcttcccgga ctttttgagg aaggctgcca gagctctgac 420naactcctgc tggttcacgt
cttttacagt aactccaggc atcgtgcggc ctccgcgctg 480c
48119326DNAArtificial
SequenceSynthetic 19ttctcgagtg cggccgcagc ttgggtatgg agacatatca
tataagtaat gctagggtcn 60gtggtaggaa gttttttcat aggaggtgta tgagttggtc
gtagcggaat cgggggtatg 120ctgttcgaat tcataagaac agggaggtta gaagtagggt
cttggttcca tgtgtgctaa 180atgtgttcgt gacaggatca agcgtgcttt ccttatcgag
gagcagaaaa tcgttgtgaa 240agtgttgaag gcacaagcac agagtcagaa agctaaataa
aaaaatgaaa cttttttgag 300taataaaaat gaaaagacgc gcttga
32620453DNAArtificial SequenceSynthetic
20ctctgagggg catcaccaaa aatgacagga atttcaacag gaagatacat ctgaattgga
60tctcgaaata aggagtttgt gtaagagaaa aggaggacac aagcaaggag acacaaaaga
120caatttgtcc aagagagtag tagtagaaac tgacaaaggt aaggctgctt ggtggccggg
180tgcagtgact cacgcctgta atcccagcac tttgggaggc caaggcgggt ggatcacctg
240aggtcaggag ttcgagacca ccctgaccaa caggtgaaac ccctctctac taaaaataca
300aacattagcc catagtccca gctactgggg aggctgaggc aggagaatcg cttgaacctg
360ggaggcggag gttgcagtga gccaagatcg tgccattgca ctccagcctg ggcgacagaa
420tgagactgtc tcaaaacaaa aggaaaaaaa aaa
45321388DNAArtificial SequenceSynthetic 21cacttcttca agctccaaca
caaatgctgc ctcctttagg atgcctgctc tgtgctctcc 60ctgcctcccc tagcccatac
ctctgctggc accttctgta ccatgccttc agaaaccttc 120ttatccccct catctctggg
gccccctgtg gatctggcat acccaagttc agtaaatgtc 180tatcagtaag ctgatggtac
atgcattttc tagaatagag ctgggacttc ccatgtggcc 240cacatctgac ctggcagccc
atgtattccg gtcattaggg atgggaagcc atgaggacct 300ggccttctgc ccgacccagg
cagccattca agttgagcaa tggccacttc gaagactcaa 360gtgcacctga tccctgcgca
acagccac 38822361DNAArtificial
SequenceSynthetic 22ttcttctaca gacatttgta tagttgtcat agtgtcccca
ggaatagaga ggactgcgag 60attaggctca gaccccggtt ccaagactgg ggatggtgat
ggggtcggag aaggcgacga 120aggctgggat tctgaagggc tatgctctgg gccaggcagc
cctggccggt cagcaatgat 180tgctccctgt gaccggtcat ctggccggac aatgacagca
acctggggcg tctccccatt 240agcttgaggc tccagaccgc ctcccgtctg gggaggggtg
ggtgtggagg cagtgcgggc 300cccagacatg atctcctctg tgatatcctt tcctccttgg
tttggatctc gaattcggat 360c
36123499DNAArtificial SequenceSynthetic
23atcacaaata ggacaatact tgctggtctc caggtaacga acaatacacg ttttacagaa
60ggaatgtaga cattctatta tggttgtggc atcaatgaag taccctccac aaagcacaca
120catcaggtgg ggatttagct cagtgatctt gattctcgtt gttcgatgca tttctgcttg
180ataaaaaatc ccggaaagag cagccggcgc gaggcgatcg aagcgggcgg aaaagacaat
240gaaagttaaa agtcgttcag cagaaaatga atgcgagcca agcggccatc ttgaagcgag
300ctgcagacgc cgctgtcaat gggcaaccag cgcggccccg agcagccgcg gccgccacgc
360tcgtctcatg ccgcctccgg ccggcctcct cctgctccgg cgcctcggcc tcctccggcg
420cctcggcctc ctcctcctcc gcctccgcct cgacctccaa cgcctcctcc tccggggcct
480cctcctcctc ctcctcggc
49924235DNAArtificial SequenceSynthetic 24tgtagggctt ccggggtttc
ttacgtaggc aggaaaggac atagcgctca agctctctaa 60gtgtggatgg cttgagtgtt
tcaaaatcaa tctcaatctc ttctgggttt gaatcacgta 120aagagggctc cctggcttgg
attatatgca caactcggcc cagcttctcc ccaggtaatt 180tgttgatgtc caggctcagc
tgccgcttct catcgtaact catgggcctg ctctc 23525373DNAArtificial
SequenceSynthetic 25ttactgttac ctgatcaatg acagagcctt ctgaggacat
tccaagacag tatacagtcc 60tgtggtctcc ttggaaatcc gtctagttaa catttcaagg
gcaataccgt gttggttttg 120actggatatt catataaact ttttaaagag ttgagtgata
gagctaaccc ttatctgtaa 180gttttgaatt tatattgttt catcccatgt acaaaaccat
tttttcctac aaatagtttg 240ggttttgttg ttgtttcttt tttttgtttt gtttttgttt
tttttttttt tgcgttcgtg 300gggttgtaaa agaaaagaaa gcagaatgtt ttatcatggt
ttttgcttca gcggctttag 360gacaaattaa aag
37326235DNAArtificial SequenceSynthetic
26aagcagagtg ctttaaaaat ttgatagtaa aaaagcaaaa atctctgtgc tctggtttta
60aggaacattt gaatgaggca agcattttag cacaggtttc tgtttcaagt tcaaagagag
120tctggaaaag ttgggaaaat ttaatatcat cttttatggt gtggaatcct gcccatttga
180ttatttctat cccaaatctt gaaaaaacat cagacttatc tatgatgtca aagct
23527761DNAArtificial SequenceSynthetic 27aagcttatta tctcatcatc
agttataatt ctcttatctt catctgcaac ctctcctcta 60tcttcattag agccattggc
agcatcagca gaaggatgag ctgcataaaa atcccttctt 120ctcttcattt catttttgaa
aagccctgga actaatttgt atacaatatc ttggagagtt 180ttatctgacc ttatattcag
tagtggtctg gtcttgtgaa cttggacatc acaaatagga 240caatacttgc tggtctccag
gtaacgaaca atacacgttt tacagaagga atgtagacat 300tctattatgg ttgtggcatc
aatgaagtac cctccacaaa gcacacacat caggngggga 360tttagctcag tgatcttgat
tctcgttgtt cgatgcattt ctgcttgata aaaaatcccg 420gaaagagcag ccggcgcgag
gcgatcgaag cgggcggaaa agacaatgaa agttaaaagt 480cgttcagcag aaaatgaatg
cgagccaagc ggccatcttg aagcgagctg cagacgccgc 540tgtcaatggn caaccagcgc
ggccccgagc agccgcggcc gccacgctcg tctcatgccg 600cctccggccg gcctcctcct
gctccggcgc ctcggcctcc tccggcgcct cggcctcctc 660ctcctccgcc tccgcctcga
cctccaacgc ctcctcctcc gcttgaattc ggatccccga 720gcatcacacc tgactggaat
acgaacagct ccacatncng t 76128190DNAArtificial
SequenceSynthetic 28ttgggcgttc agagagttca ctgggtactt cacttgctga
gccatccttt tggtctactg 60acgacttcgc cattgtccgg ctatagtaaa gcagtgagcc
caacacagac caggtgccga 120tcccgtagac caccgacatc cgccggtacc aggccgtgaa
ctcatttcga tacatgggta 180cgccagcgag
190
29 9911 DNA Homo sapiens
29 gcgatcgcca tgcagaagga
gctgggcatt gtgccttcct gccctggcat gaagagcccc 60aggccccacc tcctgctacc
attgctgctg ctgctgctgc tgctgctggg ggctggggtg 120ccaggtgcct ggggtcaggc
tgggagcctg gacttgcaga ttgatgagga gcagccagcg 180ggtacactga ttggcgacat
cagtgcgggg cttccggcag gcacggcagc tcctctcatg 240tacttcatct ctgcccaaga
gggcagcggc gtgggcacag acctggccat tgacgaacac 300agtggggtcg tccgtacagc
ccgtgtcttg gaccgtgagc agcgggaccg ctaccgcttc 360actgcagtca ctcctgatgg
tgccaccgta gaagttacag tgcgagtggc tgacatcaac 420gaccatgctc cagccttccc
acaggctcgg gctgccctgc aggtacctga gcatacagct 480tttggcaccc gctacccact
ggagcctgct cgtgatgcag atgctgggcg tctgggaacc 540cagggctatg cgctatctgg
tgatggggct ggagagacct tccggctgga gacacgcccc 600ggtccagatg ggactccagt
acctgagctg gtagttactg gggaactgga ccgagagaac 660cgctcacact atatgctaca
gctggaggcc tatgatggtg gttcaccccc ccggagggcc 720caggccctgc tggacgtgac
actgctggac atcaatgacc atgccccggc tttcaatcag 780agccgctacc atgctgtggt
gtctgagagc ctggcccctg gcagtcctgt cttgcaggtg 840ttcgcatctg atgccgatgc
tggtgtcaat ggggctgtga cttacgagat caaccggagg 900cagagcgagg gtgatggacc
cttctccatc gacgcacaca cggggctgct gcagttagag 960cggccactgg actttgagca
gcggcgggtc catgaactgg tggtgcaagc acgagatggt 1020ggggctcacc ctgagctggg
ctcggccttt gtgactgtgc atgtgcgaga tgccaatgac 1080aatcagccct ccatgactgt
catctttctc agtgcagatg gctcccccca agtgtctgag 1140gccgccccac ctggacagct
cgttgctcgc atctctgtgt cagacccaga tgatggtgac 1200tttgcccatg tcaatgtgtc
cctggaaggt ggagagggcc actttgccct aagcacccaa 1260gacagcgtca tctatctggt
gtgtgtggct cggcggctgg atcgagagga gagggatgcc 1320tataacttga gggttacagc
cacagactca ggctcacctc cactgcgggc tgaggctgcc 1380tttgtgctgc acgtcactga
tgtcaacgac aatgcacctg cctttgaccg ccagctctac 1440cgacctgagc ccctgcctga
ggttgcgctg cctggcagct ttgtagtgcg ggtgactgct 1500cgggatcctg accaaggcac
caatggtcag gtcacttata gcctagcccc tggcgcccac 1560acccactggt tctccattga
ccccacctca ggcattatca ctacggctgc ctcactggac 1620tatgagttgg aacctcagcc
acagctgatt gtggtggcca cagatggtgg cctgccccct 1680ctagcctcct ctgccacagt
tagcgtggcc ctgcaagatg tgaatgataa tgagccccaa 1740ttccagagga ctttctacaa
tgcctcactg cctgagggca cccagcctgg aacttgcttc 1800ctgcaggtga cagccacaga
cgcggatagt ggcccatttg gcctcctctc ctattccttg 1860ggtgctggac ttgggtcctc
cggatctccc ccattccgca ttgatgccca tagcggtgat 1920gtgtgcacaa cccggaccct
ggaccgtgac caggggccct caagctttga cttcacagtg 1980acagctgtgg atgggggagg
cctcaagtcc atggtatatg tgaaggtgtt tctgtcagac 2040gagaatgaca accctcctca
gttttatcca cgggagtatg ctgccagtat aagtgcccag 2100agtccaccag gcacagctgt
gctgaggttg cgtgcccatg accctgacca gggatcccat 2160gggcgactct cctaccatat
cctggctggc aacagccccc cactttttac cttggatgag 2220caatcagggc tgttgacagt
agcctggccc ttggccagac gggccaattc tgtggtgcag 2280ctggagatcg gggctgagga
cggaggtggc ctacaggcag aacccagtgc ccgagtggac 2340atcagcattg tgcctggaac
ccccacacca cccatatttg agcaactaca gtatgttttt 2400tctgtgccag aggatgtggc
accaggcacc agtgtgggca tagtccaggc acacaaccca 2460ccaggtcgct tggcacctgt
gaccctttcc ctatcaggtg gggatccccg aggactcttc 2520tccctagatg cggtatcagg
actgttgcaa acacttcgcc ctctggaccg ggagctactg 2580ggaccagtgt tggagctgga
ggtgcgagca ggcagtggag tgcccccagc tttcgctgta 2640gctcgggtgc gtgtgctgct
ggatgatgtg aatgacaact cccctgcctt tcctgcacct 2700gaagacacgg tattgctacc
accaaacact gccccaggga ctcccatcta tacactgcgg 2760gctcttgacc ccgactcagg
tgttaacagt cgagtcacct ttaccctgct tgctgggggt 2820ggtggagcct tcaccgtgga
ccccaccaca ggccatgtac ggcttatgag gcctctgggg 2880ccctcaggag ggccagccca
tgagctggag ctggaggccc gggatggggg ctccccacca 2940cgcaccagcc actttcgact
acgggtggtg gtacaggatg tgggaacccg tgggctggct 3000ccccgattca acagccctac
ctaccgtgtg gacctgccct caggcaccac tgctggaact 3060caggtcctgc aagtgcaggc
ccaagcacca gatgggggcc ctatcaccta tcaccttgca 3120gcagagggag caagtagccc
ctttggcctg gagccacaga gtgggtggct atgggtgcgg 3180gcagcactag accgtgaggc
ccaggaattg tacatactga aggtaatggc agtgtctggg 3240tccaaagctg agttggggca
gcagacaggc acagccaccg tgagggtcag catcctcaac 3300cagaatgaac acagtccccg
cttgtctgag gatcccacct tcctggctgt ggctgagaac 3360cagcccccag ggaccagcgt
gggccgagtc tttgccactg accgagactc aggacccaat 3420ggacgtctga cctacagcct
gcaacagctg tctgaagaca gcaaggcctt ccgcatccac 3480ccccagactg gagaagtgac
cacactccaa accctggacc gtgagcagca gagcagctat 3540cagctcctgg tgcaggtgca
ggatggaggg agcccacccc gcagcaccac aggcactgtg 3600catgttgcag tgcttgacct
caacgacaac agccccacgt tcctgcaggc ttcaggagct 3660gctggtgggg gcctccctat
acaggtacca gaccgcgtgc ctccaggaac actggtgacg 3720actctgcagg cgaaggatcc
agatgagggg gagaatggga ccatcttgta cacgctaact 3780ggtcctggct cagagctttt
ctctctgcac cctcactcag gggagctgct cactgcagct 3840cccctgatcc gagcagagcg
gccccactat gtgctgacac tgagtgctca tgaccaaggc 3900agccctcctc gaagtgccag
cctccagctg ctggtgcagg tgcttccctc agctcgcttg 3960gccgagccgc ccccagatct
cgcagagcgg gacccagcgg caccagtgcc tgtcgtgctg 4020acggtgacag cagctgaggg
actgcggccc ggctctctgt tgggctcggt ggcagcgcca 4080gagcccgcgg gtgtgggtgc
actcacctac acactggtgg gcggtgccga tcccgagggc 4140accttcgcgc tggatgcggc
ctcagggcgc ttgtacctgg cgcggcccct ggacttcgaa 4200gctggcccgc cgtggcgcgc
gctcacggta cgcgctgagg ggccgggagg cgcgggcgcg 4260cggctgctgc gagtgcaggt
gcaagtgcag gacgagaatg agcatgcgcc cgcctttgcg 4320cgcgacccgc tggcgctggc
gctgccagag aacccggagc ccggcgcagc gctgtacact 4380ttccgcgcgt cggacgccga
cggccccggc cccaatagcg acgtgcgcta ccgcctgctg 4440cgccaggagc cgcccgtgcc
ggcgcttcgc ctggacgcgc gcaccggggc gctcagcgct 4500ccgcgcggcc tggaccgaga
gaccactccc gcgctgctgc tgctggtgga agccaccgac 4560cggcccgcca acgccagccg
ccgtcgtgca gcgcgcgttt cagcgcgcgt cttcgtcacg 4620gatgagaatg acaacgcgcc
tgtcttcgcc tcgccgtcac gcgtgcgcct cccagaggac 4680cagccgcctg ggcccgcggc
cctgcacgtg gtagcccggg acccggatct gggcgaggct 4740gcacgcgtgt cctatcggct
ggcatctggc ggggacggcc acttccggct gcactcaagc 4800actggagcgc tgtccgtggt
gcggccgttg gaccgcgaac aacgagctga gcacgtactg 4860acagtggtgg cctcagacca
cggctccccg ccgcgctcgg ccacgcaggt cctgaccgtc 4920agtgtcgctg acgtcaacga
cgaggcgcct actttccagc agcaggagta cagcgtcctc 4980ttgcgtgaga acaaccctcc
tggcacatct ctgctcaccc tgcgagcaac cgaccccgac 5040gtgggggcca acgggcaagt
gacttatgga ggcgtctcta gcgaaagctt ttctctggat 5100cctgacactg gtgttctcac
gactcttcgg gccctggatc gagaggaaca ggaggagatc 5160aacctgacag tgtatgccca
ggacaggggc tcacctcctc agttaacgca tgtcactgtt 5220cgagtggctg tggaggatga
gaatgaccat gcaccaacct ttgggagtgc ccatctctct 5280ctggaggtgc ctgagggcca
ggacccccag acccttacca tgcttcgggc ctctgatcca 5340gatgtgggag ccaatgggca
gttgcagtac cgcatcctag atggggaccc atcaggagcc 5400tttgtcctag accttgcttc
tggagagttt ggcaccatgc ggccactaga cagagaagtg 5460gagccagctt tccagctgag
gatagaggcc cgggatggag gccagccagc tctcagtgcc 5520acgctgcttt tgacagtgac
agtgctggat gccaatgacc atgctccagc ctttcctgtg 5580cctgcctact cggtggaggt
gccggaggat gtgcctgcag ggaccctgct gctgcagcta 5640caggctcatg accctgatgc
tggagctaat ggccatgtga cctactacct gggcgccggt 5700acagcaggag ccttcctgct
ggagcccagc tctggagaac tgcgcacagc tgcagccttg 5760gacagagaac agtgtcccag
ctacaccttt tctgtgagtg cagtggatgg tgcagctgct 5820gggcccctaa gcaccacagt
gtctgtcacc atcacggtgc gcgatgtcaa tgaccatgca 5880cccaccttcc ccaccagtcc
tctgcgccta cgtctgcccc gcccaggccc cagcttcagt 5940accccaaccc tggctctggc
cacactgaga gctgaagatc gtgatgctgg tgccaatgct 6000tccattctgt accggctggc
aggcacacca cctcctggca ctactgtgga ctcttacact 6060ggtgaaatcc gcgtggcccg
ctctcctgta gctctaggcc cccgagatcg tgtcctcttc 6120attgtggcca ctgatcttgg
ccgtccagct cgctctgcca ctggtgtgat cattgttgga 6180ctgcaggggg aagctgagcg
tggaccccgc tttccccggg ctagcagtga ggctacgatt 6240cgtgagaatg cgcccccagg
gactcctatt gtctccccca gggccgtcca tgcaggaggc 6300acaaatggac ccatcaccta
cagcattctc agtgggaatg agaaagggac attctccatc 6360cagcctagta caggtgccat
cacagttcgc tcagcagagg ggctagactt cgaggtgagt 6420ccacggctgc gactggtgct
gcaggcagag agtggaggag cctttgcctt cactgtgctg 6480accctgaccc tgcaagatgc
caacgacaat gctccccgtt tcctgcggcc ccattatgtg 6540gccttccttc ctgagtcccg
gcccttggag gggcccctgc tgcaggtgga ggcggatgac 6600ctggatcaag gctctggagg
acagatttcc tacagtctgg ctgcatccca gccggcacgt 6660ggattgttcc acgtagaccc
aaccacaggc actatcacta ccacagccat cctggaccgt 6720gagatctggg ctgaaacacg
gttggtgctg atggccacag acagagggag cccagccctg 6780gtgggctcag ctaccttgac
ggtgatggtc atcgacacca atgacaatcg ccccaccatc 6840ccccaaccct gggagctccg
agtgtcagaa gatgcgttat tgggctcaga gattgcacag 6900gtaacaggga atgatgtgga
ctcaggaccc gtgctgtggt atgtgctaag cccatctggg 6960ccccaggatc ccttcagtgt
tggccgctat ggaggccgtg tctccctcac ggggcccctg 7020gactttgagc agtgtgaccg
ctaccagctg cagctgctgg cacatgatgg gcctcatgag 7080ggccgtgcca acctcacagt
gcttgtggag gatgtcaatg acaatgcacc tgccttctca 7140cagagcctct accaggtaat
gctgcttgag cacacacccc caggcagtgc cattctctcc 7200gtctctgcca ctgatcggga
ctcaggtgcc aacggtcaca tttcctacca cctggcttcc 7260cctgccgatg gcttcagtgt
tgaccccaac aatgggaccc tgttcacaat agtgggaaca 7320gtggccttgg gccatgacgg
gtcaggagca gtggatgtgg tgctggaagc acgagaccac 7380ggggctccag gccgggcagc
acgagccaca gtgcacgtgc agctgcagga ccagaacgac 7440cacgccccga gcttcacatt
gtcacactac cgtgtggctg tgactgaaga cctgccccct 7500ggctccactc tgctcaccct
ggaggctaca gatgctgatg gaagccgcag ccatgccgct 7560gtggactaca gcatcatcag
tggcaactgg ggccgagtct tccagctgga acccaggctg 7620gctgaggctg gggagagtgc
tggaccaggc ccccgggcac tgggctgcct ggtgttgctt 7680gaacctctag actttgaaag
cctgacacag tacaatctaa cagtggctgc agctgaccgt 7740gggcagccac cccaaagctc
agtcgtgcca gtcactgtca ctgtactaga tgtcaatgac 7800aacccacctg tctttacccg
agcatcctac cgtgtgacag tacctgagga cacacctgtt 7860ggagctgagc tgctgcatgt
agaggcctct gacgctgacc ctggccctca tggcctcgtg 7920cgtttcactg tcagctcagg
cgacccatca gggctctttg agctggatga gagctcaggc 7980accttgcgac tggcccatgc
cctggactgt gagacccagg ctcgacatca gcttgtagta 8040caggctgctg accctgctgg
tgcacacttt gctttggcac cagtgacaat tgaggtccag 8100gatgtgaatg atcatggccc
agccttccca ctgaacttac tcagcaccag cgtggccgag 8160aatcagcctc caggcactct
cgtgaccact ctgcatgcaa tcgacgggga tgctggggct 8220tttgggaggc tccgttacag
cctgttggag gctgggccag gacctgaggg ccgtgaggca 8280tttgcactga acagctcaac
aggggagttg cgtgcgcgag tgccctttga ctatgagcac 8340acagaaagct tccggctgct
ggtgggtgct gctgatgctg ggaatctctc agcctctgtc 8400actgtgtcgg tgctagtgac
tggagaggat gagtatgacc ctgtatttct ggcaccagct 8460ttccacttcc aagtgcccga
aggtgcccgg cgtggccaca gcttgggtca cgtgcaggcc 8520acagatgagg atgggggtgc
cgatggcctg gttctgtatt cccttgccac ctcttccccc 8580tattttggta ttaaccagac
tacaggagcc ctgtacctgc gggtggacag tcgggcacca 8640ggcagcggaa cagccacctc
tgggggtggg ggccggaccc ggcgggaagc accacgggag 8700ctgaggctgg aggtgatagc
acgggggcct ctgcctggtt cccggagtgc cacagtgcct 8760gtgaccgtgg atatcaccca
caccgcactg ggcctggcac ctgacctcaa cctgctatta 8820gtaggggccg tggcagcctc
cttgggagtt gtggtggtgc ttgcactggc agccctggtc 8880ctaggacttg ttcgggcccg
tagccgcaag gctgaggcag cccctggccc aatgtcacag 8940gcagcacccc tagccagtga
ctcactgcag aaactgggcc gggagccacc tagtccacca 9000ccctctgagc acctctatca
ccagactctt cccagctatg gtgggccagg agctggagga 9060ccctaccccc gtggtggctc
cttggaccct tcacattcaa gtggccgagg atcagcagag 9120gctgcagagg atgatgagat
ccgcatgatc aatgagttcc cccgtgtggc cagtgtggcc 9180tcctctctgg ctgcccgtgg
ccctgactca ggcatccagc aggatgcaga tggtctgagt 9240gacacatcct gcgaaccacc
tgcccctgac acctggtata agggccgaaa ggcagggctg 9300ctgctgccag gtgcaggagc
cactctctac agagaggagg ggcccccagc cactgccaca 9360gccttcctgg ggggctgtgg
cctgagccct gcacccactg gggactatgg cttcccagca 9420gatggcaagc catgtgtggc
aggtgcgctg acagccattg tggccggcga ggaggagctc 9480cgtggcagct ataactggga
ctacctgctg agctggtgcc ctcagttcca accactggcc 9540agtgtcttca cagagatcgc
tcggctcaag gatgaagctc ggccatgtcc cccagctccc 9600cgtatcgacc caccacccct
catcactgcc gtggcccacc caggagccaa gtctgtgccc 9660cccaagccag caaacacagc
tgcagcccgg gccatcttcc caccagcttc tcaccgctcc 9720cccatcagcc atgaaggctc
cctgtcctca gctgccatgt cccccagctt ctcaccctct 9780ctgtctcctc tggctgctcg
ctcacccgtt gtctcaccat ttggggtggc ccagggtccc 9840tcagcctcag cactcagcgc
agagtctggc ctggagccac ctgatgacac ggagctgcac 9900atcgtttaaa c
9911
30 5634 DNA Homo sapiens
30 ttgcgcgctg cagggcaaca ccccggcgtc cctggaagct gggggagcgg gagaaataac
60tttatttgga ctgagagctg gagaatgaga ataggacctg agagtatatt gggctaagga
120ggagaggtgt ttgagcccag atgagtcatg gctggacgac ccctccgcat aggagatcag
180ctggttctgg aagaagatta tgatgagacc tacattccta gtgagcaaga aattcttgaa
240tttgcccggg agattggtat tgatcccatc aaggaaccag aactgatgtg gctggcgcga
300gagggcatcg tggccccact gcctggagag tggaaaccat gccaggacat cacaggtgac
360atttactatt tcaacttcgc caacgggcag tctatgtggg accatccatg tgacgaacac
420tatcggagct tggtgatcca agagcgggca aagctgtcaa cttctggggc cattaagaag
480aagaaaaaaa aaaaggaaaa gaaagacaag aaggacagag acccccccaa aagttcgctg
540gccttgggtt cctcattagc cccagttcat gttcctcttg ggggcctggc tcctttacga
600ggtcttgtgg ataccccacc ctctgctctt cgtggatctc aaagcgtgag cctggggagc
660tcagtggagt ctggacgtca gcttggagaa ctcatgctgc cttcacaggg tctcaagacc
720tctgcttata caaagggtct cttgggctcc atatatgagg acaagactgc tctcagcctc
780ttgggtttag gagaagaaac caatgaggag gatgaggagg aaagtgacaa ccagagtgtc
840cacagctcaa gtgagcctct taggaaccta cacctggaca ttggggcact ggggggtgac
900tttgagtatg aggagtctct gagaacaagc cagccagagg agaagaagga tgtttctctg
960gattcagatg ctgccggtcc ccctactccc tgcaagccct ccagcccagg tgcagacagc
1020agtctgagca gtgctgttgg caaagggcga cagggaagtg gagcaagacc tggtcttcca
1080gaaaaagagg aaaatgagaa gagtgaacct aagatttgca ggaatctggt gacccccaag
1140gcagacccta caggcagtga gcctgccaaa gcctctgaaa aggaagcacc agaggacaca
1200gtagatgcag gagaggaggg ttccaggagg gaagaggcag ccaaggagcc aaagaagaag
1260gcttctgctc tggaagaggg cagttcagac gccagccaag aactggaaat tagtgaacac
1320atgaaggaac cacagctctc agactccata gcttctgacc ccaagtcctt ccatggcctg
1380gacttcggtt ttcgcagccg gatctcggag cacctgctgg atgttgatgt gctttcccca
1440gtcctgggtg gagcttgtcg gcaggcccag caaccactgg gaatagaaga caaggatgac
1500agccagtcca gccaagatga gctgcagagc aagcagtcca aaggcctgga ggagaggtta
1560tctcctccac ttccacacga ggagcgggcc cagagtcccc ctcgcagcct ggccactgaa
1620gaagagcctc cccagggccc cgaggggcag cccgagtgga aggaggcaga ggagcttggg
1680gaggactctg cagccagcct cagcctgcag ctgtccctcc agagggagca ggccccaagc
1740ccacctgctg cctgtgagaa gggcaaggag cagcattccc aggccgagga gctgggccct
1800gggcaggaag aggcagagga tcctgaggag aaggtggcgg tcagccccac cccgccagtc
1860tctccagagg tgcgatccac agagcctgtg gctcccccag agcagctctc agaggctgca
1920ctaaaggcca tggaagaggc agtggcccaa gtactcgagc aagaccagag gcacctgctg
1980gaatccaagc aagagaagat gcagcaactg cgggagaagc tgtgccaaga ggaggaagag
2040gagatcctcc ggcttcacca gcagaaagag caatctctca gttccttgag ggagcggctg
2100cagaaagcca ttgaggagga ggaggcccgg atgagagagg aggaaagcca gaggctatcc
2160tggctccgag ctcaggtcca gtccagcaca caagcagatg aggaccaaat cagggctgag
2220caagaggctt ccctgcagaa actgagagaa gagttggagt ctcaacagaa ggctgagagg
2280gccagcttgg aacagaaaaa taggcaaatg ctggagcagc tcaaggaaga gatagaggct
2340tcggagaaga gcgagcaggc tgccctgaat gctgcaaagg agaaggctct gcagcagctg
2400agggagcagc tggaagggga gaggaaagaa gctgtggcaa cgctggagaa ggagcacagt
2460gctgagctgg agcggctctg ctcctcattg gaggccaagc accgggaggt ggtctccagc
2520ctccagaaga agatacagga agctcaacag aaagaggagg cccagctgca gaagtgcctt
2580gggcaagtgg agcacagagt tcaccagaag tcttatcacg tggctgggta tgagcacgag
2640ctcagcagtc tcctgcgaga gaagcgccag gaagtggaag gggagcatga gaggaggttg
2700gacaagatga aggaggagca ccagcaagtg atggctaagg ccagagagca gtatgaagct
2760gaggagagga agcagcgggc tgagcttctg gggcacctga ccggagagct ggagcgcctg
2820cagagggccc atgaacgaga actggagact gtgaggcagg agcaacacaa gcgtcttgag
2880gacttgcggc gccggcacag ggagcaggaa aggaagctcc aggatttaga gttggacctt
2940gaaaccagag ctaaagatgt caaggccaga ttggctctgc tggaggtcca ggaggagacc
3000gcccggaggg agaagcagca gctgcttgat gtgcagaggc aggttgctct gaagagtgag
3060gaagccacag ccacccatca gcagctggag gaggcacaga aggagcacac ccacctgttg
3120cagtcaaacc agcagctccg agaaattctt gatgagctgc aggcccgcaa gctgaagctg
3180gagtcccaag tggatctgct gcaggctcag agccagcaac tgcagaaaca cttcagcagc
3240ctggaggctg aagctcaaaa gaagcagcac ctgttgagag aagtgacagt tgaggaaaat
3300aatgcttccc cacattttga gccagatctc catattgagg acctgaggaa atcccttgga
3360acaaaccaga ccaaagaggt gtcttcttct ctctcccaga gcaaggagga cttatacttg
3420gacagcctgt cctcccacaa tgtctggcac ctcctctctg ctgagggggt agccctccgt
3480agtgccaagg agttccttgt gcagcagaca cgctccatgc ggaggcggca gacagctctg
3540aaagctgccc agcagcattg gcgccatgag ctggccagtg cgcaggaggt ggccaaagac
3600ccaccaggca tcaaggccct ggaagatatg cgcaagaacc tggagaagga gaccaggcac
3660ctggatgaga tgaagtcggc catgcggaaa ggccacaacc tgctgaagaa gaaagaggag
3720aagctgaatc agttggagtc ctctctttgg gaagaggcct cagatgaggg cactctggga
3780ggatccccca ccaagaaggc agtaaccttc gacctcagtg acatggacag cctgagcagt
3840gaaagttctg aatctttttc cccgcctcac cgtgagtggt ggcggcagca gaggatcgac
3900tcaaccccga gtctcacctc ccgcaagatc cacgggctta gccactccct ccggcagatc
3960agcagccagc tgagcagtgt cctcagcatc ctggacagcc tcaaccctca gtcgccgccg
4020ccgctcctcg cctccatgcc agcccagctc cctccccggg accctaagag cacccccacc
4080cccacctact atggctccct ggccaggttc tcagccttat catctgctac acccacgtcc
4140acccaatggg cctgggattc agggcagggg cccaggctcc cctcctctgt ggctcaaacg
4200gtggacgact tcctgttgga gaagtggcgc aagtattttc catctggcat cccgctgctc
4260agcaacagcc ccaccccgct ggagagcagg ctgggttaca tgtctgccag tgagcagctc
4320cggctcctac agcactccca ttcgcaagtc cctgaggcgg gcagcaccac ctttcagggc
4380ataattgagg ccaaccggag gtggctggaa cgtgtcaaga atgaccccag gttacctctc
4440ttctcgtcaa cacccaagcc aaaagctact ttgagcctcc tgcagctggg ccttgatgag
4500cacaacagag tgaaggtgta tcgcttctga ggccctgagc aggggcttgg ggcagcccag
4560cctctcctcc acccagacca agtgcctgag gagctgcctg ccttcttcca tctgagaaag
4620caccctcctt ccccctttga cttgcaggag ccaccaggga ccagggggtt gagtggaaca
4680gtaaagccac acattctgtg actatataac ctatctcagg ctaaaatgtg tggactcgta
4740cgagctcttg tcattgacat ggcaagctga tggcgtgcgg tggctgcggg gtatcagggc
4800cgggagccct ttgggaggaa gggaggcgtt agaggagctg ccttcggagg ctcagggagt
4860ccctttggag ctggttgttt ccttggccct gcagcgcact gctcggggct cccaaggagg
4920ttgtgtgtat ggttcttaat tcatcaggac aaagaccccc agcatgtgtg taccctggga
4980cccgatttct ctgggcccac atctatctcc aatacctcag cctcagatca gaccctttct
5040tttttgtctt tcttctctta atttttaaat gcctcttttc ttgagcattc catctctctt
5100tttgaccctc tcaggactgg gcttagctgt ccagagccct gccggagggt gctgggggct
5160gtccctctgc aggcactgtg ttttcctcag gggctgtcct cagaacaccc ctcctgctcc
5220ctggggctcc tcagggagcc atttcagctg gagtctcagg tctcaaaaac aacttctcca
5280ggaggccaaa aaaagactgg gttggcttct ggtcctcatg atggctttta tcctcctggg
5340acactttggg tatattcatg ggcattgttt ccatctgtct tttctacctg tgccacccct
5400gccctgattc cacggctgcc tcaggcaggc aggcaaggag ctaggccggt gcccggccct
5460ggcagcaagg ggtctttgtg cagttggaga tgctgccgtt gtggcagagc gtcctgcagc
5520cccgcttcca tcagcaggct ctggggtggg ggctttgcag gggatgctct ctgatgtttg
5580ttccgttgtt taaataaaat gcacttattt ttgttttttt ttttgcaaaa aaaa
5634
31 5228 DNA Homo sapiens
31 cattgtcgcc cacgctgcag tagcggcttc tgcggctcca
agccagcggg tcctgtgaag 60gcgagcagac gcggagaaag gacgcgggag tgagagaggg
tgagtcagcc actgtctaaa 120cgataacggg aggcggctct gcggggtagg gttgaattca
gtaaatgggc tcgtgctgct 180gtctcttcgg agacgctgct atcttagcgt cagcgaggga
aggttgagga ggagccagag 240ccgggtcctg cagcgtttct cgccatcagc gcccgtcgcc
atctccacca tgcagtcccg 300ggaagacgcc ccgcgctctc gccgcctagc cagtccccgt
ggtgggaagc ggcccaagaa 360gattcacaaa cccacagttt cggccttttt cacgggtcca
gaggaattaa aggacacggc 420ccattctgca gccctgctgg cacagctcaa gtccttctac
gatgcgcggc tgctgtgtga 480tgtgaccatc gaggtggtga cgcctggcag cgggcctggc
acgggtcgcc tgttcccctg 540caaccgcaat gtgctggccg cggcatgtcc ctacttcaag
agcatgttca caggtggcat 600gtacgagagc cagcaggcca gcgtgaccat gcacgatgtg
gacgccgagt ccttcgaggt 660gttggtcgac tactgctaca cgggtcgtgt gtctctcagt
gaggccaacg tggagcgcct 720gtacgcggcc tccgacatgc tacagctgga atatgtgcgg
gaagcctgtg cctccttctt 780agcccgacgt cttgacctga ccaactgcac cgccatcctc
aagtttgcag atgcctttgg 840ccatcgcaag ctgcgatccc aggcccagtc ctatatagct
cagaacttca agcaactcag 900ccacatgggt tcaattcggg aggagactct agcagatctg
accctggccc agctgctggc 960tgtcctgcgc ttggatagtc tggacgtgga gagtgagcag
acagtgtgcc atgtggcagt 1020gcagtggctg gaggctgctc ccaaagagcg gggtcccagt
gctgcagaag tcttcaagtg 1080cgtgcgctgg atgcacttca ctgaagaaga tcaggactac
ttagaagggc tgctgaccaa 1140gcccatcgtg aagaagtact gcctggacgt tattgaaggg
gccctgcaga tgcgctatgg 1200tgacctgttg tacaagtctc tggtgccagt gccaaacagc
agcagcagca gtagcagcag 1260caactctctt gtatctgcag cagaaaatcc accccagaga
ctgggtatgt gtgccaagga 1320gatggtgatc ttctttggac accccagaga tccctttctc
tgctgtgatc catactcggg 1380ggacctttac aaagtgccgt cacctttgac ctgtctggct
cacactagga ctgtcaccac 1440tctagctgtc tgtatctctc ctgaccatga catctatcta
gctgctcagc ccaggacaga 1500cctctgggtg tataaaccag ctcagaatag ttggcagcaa
cttgcagatc gcttgctgtg 1560tcgtgagggc atggatgtgg catatctcaa tggctatatc
tacattttgg gggggcgaga 1620ccctattact ggagttaagt tgaaggaagt ggaatgctac
aatgttaaga gaaaccagtg 1680ggcattggtg gctccactgc cccattcttt tttatccttt
gacctaatgg taattcgaga 1740ctatctctat gctctcaaca gtaagcgcat gttctgttat
gatcctagcc acaatatgtg 1800gctgaagtgc gtttctctga agcgcaatga ctttcaggaa
gcctgcgtct tcaatgagga 1860gatctattgt atctgtgata tcccagtcat gaaggtctac
aacccagtta gggcagaatg 1920gaggcaaatg aataatattc ccttggtctc agagaccaac
aactacagaa ttatcaagca 1980tggccaaaaa ttgttgctca tcacctctcg caccccacag
tggaaaaaga accgggtgac 2040tgtgtatgaa tatgatatta ggggagacca atggattaat
ataggtacca cattaggcct 2100cttgcagttt gattctaact ttttttgcct ctctgctcgt
gtttatcctt cctgccttga 2160acctggtcag agtttcctca ctgaagaaga agaaatacca
agtgagtcta gcactgaatg 2220ggacttaggt ggattcagtg agccagactc tgagtcagga
agttcaagtt ctctttctga 2280tgatgatttt tgggtgcgtg tagcgcctca gtgaaatgca
caggatcaac agggtttgtt 2340gtaactagat tgaaacacta agttgttttt actgttttgg
aaaatatctt aaatatcctt 2400tttgttccta aaggagagga aaagttgatt aacttctggt
ttggtttaga aaaagtaatg 2460tttgaaatac gaaggtaatt taatgttaca aattttaaca
ctcaaatcaa ccttttaata 2520attttctgtg ctaagggtcc agtatttatt tgattattta
gtatgtttat gtttcatgac 2580actaatttag tcttttgata cattttacat tctgtttact
gccacaagca ctgtggcaat 2640aacttttgaa ttttaatttt tataatagaa aaatgattag
gaattgctag atagtgtttt 2700gaaagcatat ctttttcttc agaacaatgt agacttccaa
aatggttaac ctaaggggtc 2760tttacaaaat gtgttataag ttaaacataa tttgggaagt
tttacttttg ttttcttcta 2820tgaagaaaaa aatgcaggct gggcgcggtg gctcacgcct
gtaatcctag cactttggga 2880ggccgaggca ggtggatcac ctgaggtcag ttcaagacca
gcctggccaa catggtgaaa 2940ccccgtctct actaaaaata caaaaattag ctgggcgtgg
tggcatgcgc ctgtaatccc 3000agctacccag gaggctgagg caggagaatt gctgaaaccc
gggagtcaga ggctgcagag 3060agccgagact gggccactgc actccagcct ggatgacaga
gtgagactcc gtctcaaaaa 3120aaaaaaaaaa aaaaaaagga aaaaaaaaaa agaaaaaaaa
ccatatgtgt attagggtga 3180ctgagtggtg acttcattta taataataca gagaatagct
ataagctcat tgacagtaaa 3240aacaacaaac caggattcta ctgtttgaaa agaagtttcg
ttttaatttt ggaatttaga 3300atgtgtattt gcaaagtcac caattttcat ctaaaaggtt
atattctagt tgtgtcacca 3360aatcatcaaa aaaccttaaa aaagaagtaa cttgctttgt
aggtttgtat tgttgatcta 3420aacctgatac atgcttcatt taatcaggaa taatcctttt
ttttctgctg gacatgtata 3480aatttcactg gattgtataa atttttatct attgccttaa
acatttacat gattctcaat 3540atgttttagc tgtacagttt tggtgttcat cttagaggat
tcttcagcag aagtgatatt 3600tctttactgt tttgtgaggt aatactgatt ttgaaaatat
atataagcta aaaacagtat 3660ttcgttgata tcagtagtca ttgtgttaac tataaagtca
agtgccagca aagaacttta 3720aaactgtaaa gctgtgtata gaactgtttt gtgtagcatg
gaaatattct gtcagctttt 3780taaagtcact aaatgttctt gattatcagc ttgaaggtat
ttttgtatta caagttgaca 3840gttgctgggt gtagtggctc atgcctgtaa tcctagcaac
tcggggctga ggtgggagga 3900ttgcttcagc ccaggagttt gagaccagcc tgggcaacat
agcaaaaccc catctctaca 3960aaaataaaaa atatgtctgg gcatggtggc ccaagtctga
gtcccagtta cttgggagga 4020tcacttgaat gtaggatcac ttgagtctag gagttcgggg
ctgcagctat catctgcagc 4080tataatcata gctcactgca gctatgatca tgtctcagca
ctccagcttt ggcaacagaa 4140cgagatccca tctcttagaa aaacaaagtt gatagttaaa
gaacataagt ggatgatggc 4200atttgaggcc actagtgaaa gtatgttttc tctaaaatat
ttctctaata gtgatataaa 4260tggctatttt attatgatgt ttgtatgtgt tttgtatttc
tctgtaaacc atgctccagt 4320ctttgttttt ctgttaccat aatgtaagag aaggtcctgg
aacagagact aaatcccacg 4380aaactgacat tgttaaacac actaaaacag aagtacttac
ctcttgaaga tttaatatat 4440aatggttgac atgatacatg tacatgatga atgaccagat
gcttatggtc tacattttcc 4500tttatcctgt tagtattacc ttccttaatc tttgttcatt
aacatgctaa ttcctcttca 4560gtgtttattt tctagtgaca gaatgctaac atttcttaca
ccctggcaga agggagagaa 4620atgtgttttg gggtgggtaa ctaaattttt gagtgaaata
tcataagatg agaatggaaa 4680gagggagaca caaagagtta taacaaaaaa acaatggttt
ttttagccat ttgactggct 4740ctttaaatag tctacaagac attcacgttt aacatcactt
ttagtgaaat aaaatgtgcc 4800atactagtat gtgcttcaaa agggcaaatg tgctttagtg
ccctaaggct aaattttggt 4860catttgacat cagagatgtt gtaagtattg cacttaatac
gcacctattt ctcaatagtg 4920ttattttttg gctagcattt tctttaccac tatcttgttg
atagcttttt gttctctaag 4980gttgaaacat gacagtgctt atctcaaaca gattacccat
ctgcagaact aaggaaagca 5040atttatgtat gaaagaaatt cttgaattcg tcattctcaa
cctttgaatt aaagcttaga 5100ctaaatagta atatatcgtg ggaaggattt tggttttgtg
atatttctgt gaattaagga 5160atagatgtta accattattt tgtagaaaag tgatttgtat
gtggttaatt ataaataaaa 5220ctggtacc
5228
32 872 DNA Homo sapiens
32 gtactttcgc catcatagta
ttctccacca ctgttccttc cagccacgaa cgacgcaaac 60gaagccaagt tcccccagct
ccgaacagga gctctctatc ctctctctat tacactccgg 120gagaaggaaa cgcgggagga
aacccaggcc tccacgcgcg accccttggc cctccccttt 180acctctccac ccctcactag
acaccctccc ctctaggcgg ggacgaactt tcgccctgag 240agaggcggag cctcagcgtc
taccctcgct ctcgcgagct ttcggaactc tcgcgagacc 300ctacgcccga cttgtgcgcc
cgggaaaccc cgtcgttccc tttcccctgg ctggcagcgc 360ggaggccgca cgatgcctgg
agttactgta aaagacgtga accagcagga gttcgtcaga 420gctctggcag ccttcctcaa
aaagtccggg aagctgaaag tccccgaatg ggtggatacc 480gtcaagctgg ccaagcacaa
agagcttgct ccctacgatg agaactggtt ctacacgcga 540gctgcttcca cagcgcggca
cctgtacctc cggggtggcg ctggggttgg ctccatgacc 600aagatctatg ggggacgtca
gagaaacggc gtcatgccca gccacttcag ccgaggctcc 660aagagtgtgg cccgccgggt
cctccaagcc ctggaggggc tgaaaatggt ggaaaaggac 720caagatggcg gccgcaaact
gacacctcag ggacaaagag atctggacag aatcgccgga 780caggtggcag ctgccaacaa
gaagcattag aacaaaccat gctgggttaa taaattgcct 840cattcgtaaa aaaaaaaaaa
aaaaaaaaaa aa 872
33 1003 DNA Homo sapiens
33 gtctgcaggt atggatgttg ttctcttttc cctgtcttta tttccttacc aatcggctgc
60catccgagga gctgaggaag cctagagctc tcagaagcag tcctttgagc tggtgtaggg
120gcactcagaa tggtccagcg tttgacatac cgacgtaggc tttcctacaa tacagcctct
180aacaaaacta ggctgtcccg aacccctggt aatagaattg tttaccttta taccaagaag
240gttgggaaag caccaaaatc tgcatgtggt gtgtgcccag gcagacttcg aggggttcgt
300gctgtaagac ctaaagttct tatgagattg tccaaaacaa agaaacatgt cagcagggcc
360tatggtggtt ccatgtgtgc taaatgtgtt cgtgacagga tcaagcgtgc tttccttatc
420gaggagcaga aaatcgttgt gaaagtgttg aaggcacaag cacagagtca gaaagctaaa
480taaaaaaatg aaactttttt gagtaataaa aatgaaaaga cgctgtccaa tagaaaaagt
540tggtgtgctg gagctacctc acctcagctt gagagagcca gttgtgtgca tctctttcca
600gttttgcatc cagtgacgtc tgcttggcat cttgagattg ttatggtgag agtatttaca
660cctcagcaaa tgctgcaaaa tcctgttttc ccccagagag ctggaggtta aatactacca
720gcacatccct agatactact caagttacag tatatgatca ctaatatagt atgctcttgg
780taccaggagc tctgatatat atctggtaca tgtttgataa tgacttgatt gttattataa
840gtacttatta atacttcgat tctgtaaaga gtttagggtt tgattttata aaatccaaaa
900tgagcctttt attgaatcca gttctctatg tgaccagttc tctgtatgaa tggaagggaa
960aagaattaaa aatcttgcaa aggggaaaaa aaaaaaaaaa aaa
1003
34 5862 DNA Homo sapiens
34 gcgtccgagg gagcgcgcga cgggccacgc acgtccgggc
gtccagttcg gggcagcttc 60tccggctggt gggtgggtgg ggcagccttt caggcagggt
ggcaaccaac tatatctgag 120gaccagagcc attttggggc accagagctt gtgacctctc
catctccacc cagctgggtc 180caggggccac tctcagcact cacctcagca gctgacatca
taaagcagac ttgggaacct 240ggaagcactc tggagaacct ttccctgaga catggagctt
tggggccgaa tgctgtgggc 300cctcctgtct ggcccaggga ggaggggaag tacccggggc
tgggccttca gctcatggca 360accccaacca cctctggctg ggttatccag tgccatagaa
ctggtcagcc actggactgg 420ggtctttgag aagaggggta tccctgaggc ccgggaatcc
agtgagtaca tcgtggctca 480tgtccttgga gccaaaacat ttcagagcct gaggccggca
ctttggaccc agcccttgac 540ctctcagcaa ctacagtgta tccgggagct gagtagccgt
cgattgcaga ggatgccggt 600gcagtacatc cttggagagt gggacttcca ggggctcagc
ctaaggatgg tgcccccagt 660gtttattcct cggccagaaa cagaggaact ggttgagtgg
gtgctggaag aggtggccca 720gaggtcccat gctgtgggat ccccaggcag ccccctcatt
ctggaggtgg gctgcggatc 780aggagccatc tccctcagcc tgctgagcca gctcccccag
agccgagtca ttgctgtgga 840taagcgggaa gctgctatct ctctgaccca tgagaatgct
cagaggcttc ggttgcagga 900caggatttgg atcatccacc tcgacatgac ctcagaaagg
agctggacac acctgccctg 960gggccccatg gacctgattg tcagcaaccc tccctacgtc
ttccaccagg acatggagca 1020gctggcccct gagatccgca gctatgaaga ccccgcggcc
ctggatggtg gggaggaggg 1080catggacatc attacccaca ttctggcctt ggcaccccgg
ctcctgaaag actctggtag 1140tatcttctta gaagtggacc caaggcaccc ggagcttgtc
agcagctggc ttcagagccg 1200gcctgacctg taccttaatc ttgtggctgt gcgcagggac
ttctgtggga ggccccggtt 1260cctgcatatc cggaggtctg ggccatagca tggctgccct
gtggatgcct tgtcagtgcc 1320gccagcctga ccagagggga ggtggatggc actttccaga
gcccaggttc ttatggcatt 1380tcccagggtt ctgtgatttc cccatgctct gcatttctag
gatatttcta ggacacctgg 1440attggctcca tcacatcaga gtggctgagg gcagttgctc
tgtgttggtg aaattgctgt 1500gggggtatcg ggggatatgg ccagtaaagt attgagagac
taacaaatgg tgacctaatg 1560ttttgtccat gacttgcagg tcccctgacc cccttactcc
caggtagcac tggggcaagg 1620gtttccttct gccccagcag ggctggccgt cagtcccctg
cttggtagtg gtgtgggggt 1680gcagtgtgga ggaaggcacg tgagtcctca ctcctggcct
tggataccat gggtcctggc 1740atagagcagc tcactcccag ggattgatta gtcctccact
gccctgggtg catgcgtaca 1800caattccctg gccaagcctg gctcgagcac aggaagctca
tctgcgtttt ggctcaagga 1860tgactgcctg ctttctggag gggagggtct ggaggtcttt
gctgcacagt tcctgggtcg 1920cacatccacg ttcatttaac tgaaggcttg agccagtgag
gggtgtttcc tttttatccc 1980catagctttt agctaaaaca tccctcccga gttgaccccc
tggggtttca aataacccat 2040gtgtccctgg ttggggctgg ggagagtgag aagctgagat
actgggcaca gggttgtggc 2100ctccacccca gctctggtct gtgcagactc atggccacca
ggaggcctgc agatccagcc 2160ttcctgtcaa cagcgacagg aaatctctag gttggtgagt
gctggtgatg tgagcctaca 2220tcagggtggg tcctaagaaa catggcaaac caggctgtct
cattccacta gactgccccc 2280tgccaccctg gcacttccca gggcctggca gtatggtctg
atgggcagta tggtccaata 2340ggcagcatcc tctgctgcag ctgggagagc tgagttccag
ggctgtgtcc tgcagtggga 2400ccttgggcaa ctcctttccc tatgagaagc tggctcttct
gagtccaggg ccaacgccaa 2460ctggcaacct ctttactctt agtcaagtgg aatgtgcatg
ctggcatctg aatgtccatt 2520cgccaggcat ggagagcaag agaaggtatg tactgcctga
ggtcacatga cagtgaccaa 2580gtggagacag taagttagat ccctcccttt ggggagccta
tattgctgga gtcataccca 2640gcctaagtgt tgccctgcac tatggctgga ggacacattt
ggtagaggtc acactgcagc 2700tcccagtgcc ccagtgtcct gccctgtgcc cagccccagc
tgcatggact ctgagctgcc 2760cctggcttcc tttaaggagg ctgctccaga aggaacctgg
gtggggaggg cgaagggggt 2820gcacaaccag ggcaaggctc cccacttcct tagtccccca
tgctcacaga cctttgcctg 2880ctaaggtcct caccagtatt gccctttctg tctttctcct
tgtgcccttt ggctcttgct 2940gtcttcagca gcatctcagg gtagctgccc tgacctcgga
gcagtctgtc gcccccctac 3000acctcagcca gtcctggctt ccctgatggt ctctccctcc
tggcctcagg cccattcctg 3060aggaagggcc ttggcgagct tgtggatgtt gcaccagaag
agagtgcagt gttggagagt 3120gacactgtcg gggcagctgg ggccacaagc aggagccggc
ctcgggcaca actttctgcc 3180cagaaaaatg tgcagcttga ctctgctgag gaaaaggtcc
aagccaagag gactggcagg 3240cggggcctca agcctgcagc cactggcttg attgggccct
ggacgttgag cccagatgtt 3300ggagccacac cagcctggat ttcaatccca gaatctgccc
ctcaccagga tgtgaccttg 3360ggcagatgac ttcacctcac tcagccttgg cttctaaggc
tgagaaatgg gacttaatgc 3420tttattttat aggatgcatg tgaggagccc atggaatgtg
cctggcttgg cacattgtgg 3480catttttcct tgccttcctc ggagggcaga cacagggagg
aaggacccag tgccctcagg 3540cgtccatctg atgcatggga ccaacataag gcaggcaggg
atacaaggca gtctggaaag 3600aagggaaggc aggagtttca gtcttgggct cttgactcct
cactgttgtc tagagatgga 3660gccagcaggc tggtagcctg gcagcctaca tctcccctca
gcctctcctc actatggccc 3720cagtgccttg aggcccaggc cagggcagcc agtggctcta
gctcagggaa agccaggccc 3780acctgcccta tcccctccct tgctcctgag gccaaagcca
gagactcgaa cagcctcccc 3840accaccacca gcatatgtca aggagcactt gcaggcagaa
tgggaggagg acatggagct 3900gatggagtcc aggctgtgca agcccctgag gtcttgagag
atgtgcccac tgcccgtgca 3960gcctccttca gccagagccc agagcataga caggagtgta
ggagtccctg tttgatgtac 4020tctgggagag taattctatc tcctcttctg atagttgggg
aaactgaggc cttgtctcac 4080agttggatgc ttttcccagt tgtcagtggg tttctccatg
ggtctcatac agctgcctta 4140ttgaaatagg ccccgaaccc cctaaatgca aaaaatactc
ttttttgctc ctttaccccc 4200acctggaccc tgggctattg gctgctccca atccttgccc
caaacactta gctggctccc 4260catgacttaa gtgtgttctc ttgtgtccta tggaatccag
ttctgaagag gtgggggagg 4320acaactgtgg gaaaagccct gggggcccct cccaaggccc
catcagtgct ctgagtaggc 4380tgtcatcaga acaaagggct ccactgctga caaggtttga
gaactgctgg cttgaggtga 4440gaaccccttt aacctctgcg ggacagcatg tctttcccta
tccaccttcg attcttttct 4500cttttttttc ttcattggct ccttcttagt ggattctctt
ctctactgcc ctgggcttca 4560gcctttgtgc agtactctcg atgccctgaa cacacacctt
ccctttgccc aggcggtgca 4620aacaatccac ttcttcaagc tccaacacaa atgctgcctc
ctttaggatg cctgctctgt 4680gctctccctg cctcccctag cccatacctc tgctggcacc
ttctgtacca tgccttcaga 4740aaccttctta tccccctcat ctctggggcc ccctgtggat
ctggcatacc caagttcagt 4800aaatgtctat cagtaagctg atggtacatg cattttctag
aatagagctg ggacttccca 4860tgtggcccac atctgacctg gcagcccatg tattccggtc
attagggatg ggaagccatg 4920aggacctggc cttctgcccg acccaggcag ccattcaagt
tgagcaatgg ccacttcgaa 4980gactcaagtg cacctgatcc ctgcgcaaca gccacaccag
gagaacaggc tgtccttggc 5040ggcagtagga gcaggcgcca ggtttcctgg agctcttggc
ttcagccagc ccccagccag 5100agtcctggct aggacagtga cctgatctcc tcctcatgac
cttctgccct ggacaagccc 5160cctgaactgg atttgggact gtcaaagcaa ctctacccct
gctctggtag gctgaacagt 5220gaccccccaa aatggcagtg tcttaatcac ctaaacctta
acatgtgact atattacctt 5280cacatagcaa aatggacttt gcagatgtga ttaaggatct
tgagatggaa ggagtatcct 5340ggatttttca ggtaaactga gtataatcac aagggcctct
gtaaaggagg caggagtgtc 5400agagtgacgg aagaaaatgt atgtaacaat ggaagcagag
gtcagagtga tgcaattgct 5460ggaggaagag ccatgagccg aggaatgcag acagcctctt
ctcctctggg gcctcaagaa 5520gaatgcagtc ctgccaatac cttgatttta agccctgtga
aactgatttc agattgctga 5580cctccagaac agtaagatca taaatttgtg ttgttttcac
atgtgtgaaa acacatgtgt 5640gataatttgt tacagcagcc acgggaaacg aatatagatt
gtggtgccca aattagagtg 5700ctgctgtaac acacgcctac tgattgaagt ggctttggaa
ttgcaacgtg gaaatgggca 5760gaggctggaa gaattttgag agtcatgata aattgcctta
accacctctc ttctgatagg 5820tgatgtggcc aggggaactc ttcctcaacc ttcagaccta
aa 5862
SEQ ID NO: 35 (length 5662, DNA, Homo sapiens)
tcctcgacgg ccgccgcccg
cctggccttt tagggcctga ctcccgccct tcctggccta 60cactcctggg cggcggcagg
cctagcttct ggcccagtgc gggttccccg gcggcaggcg 120tatcctgtgt gcccctgggc
caggcccgaa cccggtgtcc ccgggtgggg ggtggggacg 180ccacggccga agcagctagc
tccgttcgtg atccgggagc ctggtgccag cgagacctgg 240aatttccggt ctggttggtc
tggggccccg cggagccagg ttgataccct cacctcccaa 300ccccaggccc tcggatgccc
agaacctgta ggccgcaccg tggacttgtt cttaatcgag 360ggggtgctgg ggggaccctg
atgtggcacc aaatgaaatg aacaaagctc cacagtccac 420aggcccccca cccgccccat
cccccggact cccacagcca gcgtttcccc cggggcagac 480agcgccggtg gtgttcagta
cgccacaagc gacacaaatg aacacgcctt ctcagccccg 540ccagcacttc taccctagcc
gggcccagcc cccgagcagt gcagcctccc gagtgcagag 600tgcagcccct gcccgccctg
gcccagctgc ccatgtctac cctgctggat cccaagtaat 660gatgatccct tcccagatct
cctacccagc ctcccagggg gcctactaca tccctggaca 720ggggcgttcc acatacgttg
tcccgacaca gcagtaccct gtgcagccag gagccccagg 780cttctatcca ggtgcaagcc
ctacagaatt tgggacctac gctggcgcct actatccagc 840ccaaggggtg cagcagtttc
ccactggcgt ggcccccgcc ccagttttga tgaaccagcc 900accccagatt gctcccaaga
gggagcgtaa gacgatccga attcgagatc caaaccaagg 960aggaaaggat atcacagagg
agatcatgtc tggggcccgc actgcctcca cacccacccc 1020tccccagacg ggaggcggtc
tggagcctca agctaatggg gagacgcccc aggttgctgt 1080cattgtccgg ccagatgacc
ggtcacaggg agcaatcatt gctgaccggc cagggctgcc 1140tggcccagag catagccctt
cagaatccca gccttcgtcg ccttctccga ccccatcacc 1200atccccagtc ttggaaccgg
ggtctgagcc taatctcgca gtcctctcta ttcctgggga 1260cactatgaca actatacaaa
tgtctgtaga agaatcaacc cccatctccc gtgaaactgg 1320ggagccatat cgcctctctc
cagaacccac tcctctcgcc gaacccatac tggaagtaga 1380agtgacactt agcaaaccgg
ttccagaatc tgagttttct tccagtcctc tccaggctcc 1440cacccctttg gcatctcaca
cagtggaaat tcatgagcct aatggcatgg tcccatctga 1500agatctggaa ccagaggtgg
agtcaagccc agagcttgct cctcccccag cttgcccctc 1560cgaatcccct gtgcccattg
ctccaactgc ccaacctgag gaactgctca acggagcccc 1620ctcgccacca gctgtggact
taagcccagt cagtgagcca gaggagcagg ccaaggaggt 1680gacagcatca atggcgcccc
ccaccatccc ctctgctact ccagctacgg ctccttcagc 1740tacttcccca gctcaggagg
aggaaatgga agaagaagaa gaagaggaag aaggagaagc 1800aggagaagca ggagaagctg
agagtgagaa aggaggagag gaactgctcc ccccagagag 1860tacccctatt ccagccaact
tgtctcagaa tttggaggca gcagcagcca ctcaagtggc 1920agtatctgtg ccaaagagga
gacggaaaat taaggagcta aataagaagg aggctgttgg 1980agaccttctg gatgccttca
aggaggcgaa cccggcagta ccagaggtgg aaaatcagcc 2040tcctgcaggc agcaatccag
gcccagagtc tgagggcagt ggtgtgcccc cacgtcctga 2100ggaagcagat gagacctggg
actcaaagga agacaaaatt cacaatgctg agaacatcca 2160gcccggggaa cagaagtatg
aatataagtc agatcagtgg aagcctctaa acctagagga 2220gaaaaaacgt tacgaccgtg
agttcctgct tggttttcag ttcatctttg ccagtatgca 2280gaagccagag ggattgccac
atatcagtga cgtggtgctg gacaaggcca ataaaacacc 2340actgcggcca ctggatccca
ctagactaca aggcataaat tgtggcccag acttcactcc 2400atcctttgcc aaccttggcc
ggacaaccct tagcacccgt gggcccccaa ggggtgggcc 2460aggtggggag ctgccccgtg
ggccggctgg cctgggaccc cggcgctctc agcagggacc 2520ccgaaaagaa ccacgcaaga
tcattgccac agtgttaatg accgaagata taaaactgaa 2580caaagcagag aaagcctgga
aacccagcag caagcggacg gcggctgata aggatcgagg 2640ggaagaagat gctgatggca
gcaaaaccca ggacctattc cgcagggtgc gctccatcct 2700gaataaactg acaccccaga
tgttccagca gctgatgaag caagtgacgc agctggccat 2760cgacaccgag gaacgcctca
aaggggtcat tgacctcatt tttgagaagg ccatttcaga 2820gcccaacttc tctgtggcct
atgccaacat gtgccgctgc ctcatggcgc tgaaagtgcc 2880cactacggaa aagccaacag
tgactgtgaa cttccgaaag ctgttgttga atcgatgtca 2940gaaggagttt gagaaagaca
aagatgatga tgaggttttt gagaagaagc aaaaagagat 3000ggatgaagct gctacggcag
aggaacgagg acgcctgaag gaagagctgg aagaggctcg 3060ggacatagcc cggcggcgct
ctttagggaa tatcaagttt attggagagt tgttcaaact 3120gaagatgtta acagaggcaa
taatgcatga ctgtgtggtc aaactgctta agaaccatga 3180tgaagagtcc cttgagtgcc
tttgtcgtct gctcaccacc attggcaaag acctggactt 3240tgaaaaagcc aagccccgaa
tggatcagta tttcaaccag atggaaaaaa tcattaaaga 3300aaagaagacg tcatcccgca
tccgctttat gctgcaggac gtgctggatc tgcgagggag 3360caattgggtg ccacgccgag
gggatcaggg tcccaagacc attgaccaga tccataagga 3420ggctgagatg gaagaacatc
gagagcacat caaagtgcag cagctcatgg ccaagggcag 3480tgacaagcgt cggggcggtc
ctccaggccc tcccatcagc cgtggacttc cccttgtgga 3540tgatggtggc tggaacacag
ttcccatcag caaaggtagc cgccccattg acacctcacg 3600actcaccaag atcaccaagc
ctggctccat cgattctaac aaccagctct ttgcacctgg 3660agggcgactg agctggggca
agggcagcag cggaggctca ggagccaagc cctcagacgc 3720agcatcagaa gctgctcgcc
cagctactag tactttgaat cgcttctcag cccttcaaca 3780agcggtaccc acagaaagca
cagataatag acgtgtggtg cagaggagta gcttgagccg 3840agaacgaggc gagaaagctg
gagaccgagg agaccgccta gagcggagtg aacggggagg 3900ggaccgtggg gaccggcttg
atcgtgcgcg gacacctgct accaagcgga gcttcagcaa 3960ggaagtggag gagcggagta
gagaacggcc ctcccagcct gaggggctgc gcaaggcagc 4020tagcctcacg gaggatcggg
accgtgggcg ggatgccgtg aagcgagaag ctgccctacc 4080cccagtgagc cccctgaagg
cggctctctc tgaggaggag ttagagaaga aatccaaggc 4140tatcattgag gaatatctcc
atctcaatga catgaaagag gcagtccagt gcgtgcagga 4200gctggcctca ccctccttgc
tcttcatctt tgtacggcat ggtgtcgagt ctacgctgga 4260gcgcagtgcc attgctcgtg
agcatatggg gcagctgctg caccagctgc tctgtgctgg 4320gcatctgtct actgctcagt
actaccaagg gttgtatgaa atcttggaat tggctgagga 4380catggaaatt gacatccccc
acgtgtggct ctacctagcg gaactggtaa cacccattct 4440gcaggaaggt ggggtgccca
tgggggagct gttcagggag attacaaagc ctctgagacc 4500gttgggcaaa gctgcttccc
tgttgctgga gatcctgggc ctcctgtgca aaagcatggg 4560tcctaaaaag gtggggacgc
tgtggcgaga agccgggctt agctggaagg aatttctacc 4620tgaaggccag gacattggtg
cattcgtcgc tgaacagaag gtggagtata ccctgggaga 4680ggagtcggaa gcccctggcc
agagggcact cccctccgag gagctgaaca ggcagctgga 4740gaagctgctg aaggagggca
gcagtaacca gcgggtgttc gactggatag aggccaacct 4800gagtgagcag cagatagtat
ccaacacgtt agttcgagcc ctcatgacgg ctgtctgcta 4860ttctgcaatt atttttgaga
ctcccctccg agtggacgtt gcagtgctga aagcgcgagc 4920gaagctgctg cagaaatacc
tgtgtgacga gcagaaggag ctacaggcgc tctacgccct 4980ccaggccctt gtagtgacct
tagaacagcc tcccaacctg ctgcggatgt tctttgacgc 5040actgtatgac gaggacgtgg
tgaaggagga tgccttctac agttgggaga gtagcaagga 5100ccccgctgag cagcagggca
agggtgtggc ccttaaatct gtcacagcct tcttcaagtg 5160gctccgtgaa gcagaggagg
agtctgacca caactgaggg ctggtggggc cggggacctg 5220gagccccatg gacacacaga
tggcccggct agccgcctgg actgcagggg ggcggcagca 5280gcggcggtgg cagtgggtgc
ctgtagtgtg atgtgtctga actaataaag tggctgaaga 5340ggcaggatgg cttggggctg
cctgggcccc cctccaggat gccgccaggt gtccctctcc 5400tccccctggg gcacagagat
atattatata taaagtcttg aaatttggtg tgtcttgggg 5460tggggagggg caccaacgcc
tgcccctggg gtcctttttt ttattttctg aaaatcactc 5520tcgggactgc cgtcctcgct
gctgggggca tatgccccag cccctgtacc acccctgctg 5580ttgcctgggc agggggaagg
gggggcacgg tgcctgtaat tattaaacat gaattcaatt 5640aagctcaaaa aaaaaaaaaa
aa 5662
SEQ ID NO: 36 (length 3251, DNA, Homo sapiens)
cagcaactat gaaataatcg tagtatgaga ggcagagatc ggggcgagac aatggggatg
60tgggcgcggg agccccgttc cggcttagca gcacctccca gccccgcaga ataaaaccga
120tcgcgccccc tccgcgcgcg ccctcccccg agtgcggagc gggaggaggc ggcggcggcc
180gaggaggagg aggaggaggc cccggaggag gaggcgttgg aggtcgaggc ggaggcggag
240gaggaggagg ccgaggcgcc ggaggaggcc gaggcgccgg agcaggagga ggccggccgg
300aggcggcatg agacgagcgt ggcggccgcg gctgctcggg gccgcgctgg ttgcccattg
360acagcggcgt ctgcagctcg cttcaagatg gccgcttggc tcgcattcat tttctgctga
420acgactttta actttcattg tcttttccgc ccgcttcgat cgcctcgcgc cggctgctct
480ttccgggatt ttttatcaag cagaaatgca tcgaacaacg agaatcaaga tcactgagct
540aaatccccac ctgatgtgtg tgctttgtgg agggtacttc attgatgcca caaccataat
600agaatgtcta cattccttct gtaaaacgtg tattgttcgt tacctggaga ccagcaagta
660ttgtcctatt tgtgatgtcc aagttcacaa gaccagacca ctactgaata taaggtcaga
720taaaactctc caagatattg tatacaaatt agttccaggg cttttcaaaa atgaaatgaa
780gagaagaagg gatttttatg cagctcatcc ttctgctgat gctgccaatg gctctaatga
840agatagagga gaggttgcag atgaagataa gagaattata actgatgatg agataataag
900cttatccatt gaattctttg accagaacag attggatcgg aaagtaaaca aagacaaaga
960gaaatctaag gaggaggtga atgataaaag atacttacga tgcccagcag caatgactgt
1020gatgcactta agaaagtttc tcagaagtaa aatggacata cctaatactt tccagattga
1080tgtcatgtat gaggaggaac ctttaaagga ttattataca ctaatggata ttgcctacat
1140ttatacctgg agaaggaatg gtccacttcc attgaaatac agagttcgac ctacttgtaa
1200aagaatgaag atcagtcacc agagagatgg actgacaaat gctggagaac tggaaagtga
1260ctctgggagt gacaaggcca acagcccagc aggaggtatt ccctccacct cttcttgttt
1320gcctagcccc agtactccag tgcagtctcc tcatccacag tttcctcaca tttccagtac
1380tatgaatgga accagcaaca gccccagcgg taaccaccaa tcttcttttg ccaatagacc
1440tcgaaaatca tcagtaaatg ggtcatcagc aacttcttct ggttgatacc tgagactgtt
1500aaggaaaaaa attttaaacc cctgatttat atagatatct tcatgccatt acagctttct
1560agatgctaat acatgtgact atcgtccaat ttgctttctt ttgtagtgac attaaatttg
1620gctataaaag atggactaca tgtgatactc ctatggacgt taattgaaaa gaaagattgt
1680tgttataaag aattggtttc ttggaaagca ggcaagactt tttctctgtg ttaggaaaga
1740tgggaaatgg tttctgtaac cattgtttgg atttggaagt actctgcagt ggacataagc
1800attgggccat agtttgttaa tctcaactaa cgcctacatt acattctcct tgatcgttct
1860tgttattacg ctgttttgtg aacctgtaga aaacaagtgc tttttatctt gaaattcaac
1920caacggaaag aatatgcata gaataatgca ttctatgtag ccatgtcact gtgaataacg
1980atttcttgca tatttagcca ttttgattcc tgtttgattt atacttctct gttgctacgc
2040aaaaccgatc aaagaaaagt gaacttcagt tttacaatct gtatgcctaa aagcgggtac
2100taccgtttat tttactgact tgtttaaatg attcgctttt gtaagaatca gatggcatta
2160tgcttgttgt acaatgccat attggtatat gacataacag gaaacagtat tgtatgatat
2220atttataaat gctataaaga aatattgtgt ttcatgcatt cagaaatgat tgttaaaatt
2280ctcccaactg gttcgacctt tgcagatacc cataacctat gttgagcctt gcttaccagc
2340aaagaatatt tttaatgtgg atatctaatt ctaaagtctg ttccattaga agcaattggc
2400acatctttct atactttata tacttttctc cagtaataca tgtttacttt aaaaattgtt
2460gcagtgaaga aaaaccttta actgagaaat atggaaaccg tcttaatttt ccattggcta
2520tgatggaatt aatattgtat tttaaaaatg catattgatc actataattc taaaacaatt
2580ttttaaataa accagcaggt tgctaaaaga aggcatttta tctaaagtta ttttaatagg
2640tggtatagca gtaattttaa atttaagagt tgcttttaca gttaacaatg gaatatgcct
2700tctctgctat gtctgaaaat agaagctatt tattatgagc ttctacaggt atttttaaat
2760agagcaagca tgttgaattt aaaatatgaa taaccccacc caacaatttt cagtttattt
2820tttgctttgg tcgaacttgg tgtgtgttca tcacccatca gttatttgtg agggtgttta
2880ttctatatga atattgtttc atgtttgtat gggaaaattg tagctaaaca tttcattgtc
2940cccagtctgc aaaagaagca caattctatt gctttgtctt gcttatagtc attaaatcat
3000tacttttaca tatattgctg ttacttctgc tttctttaaa aatatagtaa aggatgtttt
3060atgaagtcac aagatacata tatttttatt ttgacctaaa tttgtacagt cccattgtaa
3120gtgttgtttc taattataga tgtaaaatga aatttcattt gtaattggaa aaaatccaat
3180aaaaaggata ttcatttaga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
3240aaaaaaaaaa a
3251
SEQ ID NO: 37 (length 36230, DNA, Homo sapiens)
accatatcct cctacactct gagcaatctc acggggtaga
ccgcaggtta acacctctca 60gactccttga aaaatagctg gtgacgggtc agtgcccaga
gctcacctgc ctttcgccaa 120actctaaaca cccctgtgtg tttcccctac tataccctgt
tccctggggg caggtccctg 180cattatgaag ccactaggaa aatgagataa agctttccta
cttttcttcc cctgaaaaga 240cagattttgt tttttatttt ttgagaatac caagtaagat
tttatttttt atttatttta 300aattatttta acctttgttt taggttcaag ggtacacatg
caggtttgtt atataggtaa 360attgtgtgtc atcgggattt ggcgtaaaaa tttatttcat
cacccaggta ataagtatag 420tatctgatag gtagtgtttt gatcctctcc ctcctcccat
cctccaccct caagtagggc 480ccagtgtcta ttattccctt ttttgtgtcc atgtgtactc
aatgtttagc tcccacttat 540aaaagtgaga acatgcagta tttcattttc tgctcctgtg
ttagtttgcc taggataaca 600gcccccagct ccatccatga tgctgcaaaa gacgtgatct
cgtccttttt tgtctgtgga 660gtattccatg gtgtatatgt accacatttt ctttatacag
tctactgttg gtgggcattt 720aggctgattc catgtctttg ctattatgaa tactgctgca
gtgagcattc atgtgcatgt 780gtccttatgg tagaacaatg tatactcctt tgggtatatg
cctaataatg ggattcctgg 840gacgaatggt agctctgttt taaggttctt gagaaattgc
caaactgctt tcctcaatgg 900ctgaactaat ttatgttccc accagcagtg tataagcctt
ccgttttctc tgcaacctct 960ccaacatttg ttattttttg actttttaat aatagccatt
ctgactggtg tgagacggta 1020tctcattatg attttgattt gcatttttct aatcattagt
aatgttgaac attgtttcat 1080atgcttcttg gtcacgtgtg tgtcttgaaa aggcagattt
tatgtatttg cgtatttatt 1140tttttcacag gttttttttt tgaaagtctc actctgtcgc
ctaggctgga gtacagtggg 1200ataatctcgg ctcactgcaa tcttcgcctc ctgggttcaa
atgactctca tgcctcagcc 1260acttgagtag ctggggttac agtcatgtgc caccactcct
ggttagtttt tgtctttttt 1320ttttttttgg tagagacagg gtttcatcat gttggccagg
ctgttcttga actcctgacc 1380tcaagtgatc cacccacctc agcctcctaa agtgctagga
ttacaggcat gagccatcgt 1440gcctggcctg aaaaagcaga ttttaaacgg caattcattc
ttctatccca ttgtgaacta 1500tacagttgat ggattttcca tcactaactt gaaactctaa
attggcttcc ttctgctccc 1560cagtaggttt cagggctgcc tcttcacatc ttagtttctg
agaactcttg gattttatta 1620aatagtgagc taaacaaaac aggattgtgg aaggggcccc
ttgacaccac acttacctgc 1680cctccctcaa agtccctgat ctcaggaaaa tctaacacct
atgaagaaaa tggggataaa 1740aaatgcatac aaagattatt accaaaaacg aaagattcgt
tgtgtaacta attgagatta 1800actgaagctc tgccatagct cccagccact gcccccactc
accttgctta tatactctaa 1860ctctgctaac gaactgtcaa gtgtgttgga atgggcagaa
tatggggtgg ggagtgcata 1920atctgtagag cttctacaga tacagtgcta ggtaggtcct
ttctataata tctcatctca 1980tcttaaaaga cttgttggcc gggcatggtg gctcacgctt
gtaatcccag cactttggga 2040ggctgaggaa ggcatatcac ctgaggtcag gagtttgaga
ccagcctggc aaacatggtg 2100aaaccccgtc tctacaaaaa atacaaaaat tagctgggtg
tggtggcgcg tgcctgtaat 2160cccagctact ctggaggctg aggcaggaga atcgattgaa
cctgggaggt ggaggttgca 2220gtgagccgag atcgtgccac tgcactccag cctgggtgac
agaatgagac tgtctcaaaa 2280aaaaaaaaaa aaaaaaaaaa cttgttaatt gtcctcattt
cccaggttgg aaaacaggtc 2340caaagattca cacccaaggt ctaaaggctg taactcctct
tcttatacag ctgttacaca 2400tgcacgtgtg tacacacaca cacacataca cactctcttg
agcatgccca cacactcact 2460acatcttgga actgggatgg ctcaaataaa gggagttagt
gaggcctccg ctgagaaaga 2520gagaaagaga agagtcacaa tccataaccc aattcaccca
agtcttatct ttcctgtcct 2580cagagttcct tctgctctga gaaccaccgt cccttccact
ttctcttttg acaagtttca 2640aaactgaatt ttcccccaca cccccccaat acatttcccc
ctcacattcc tccccatcct 2700gcccaggtaa gctgttagcc taaccttata ggaaccaagt
cctgggatcc ttttcaatgt 2760ctacaaagcc tagccctggc aagggagcac tggctgtgtg
gtcctgtgcc agcactgaac 2820atggccctag ccagtaacag tggggctgaa tgtagttccc
tcttatgtct agatctctgc 2880tccggcagtc aaaggagatg tgaaaccttc tgtgaggcca
caacaggaaa tggtaggaga 2940ggatttcact tctctattaa ttcaaacact gagggagctt
tttagaataa agaaggacag 3000aaaacccaga cacctgtgct cagcagtgtt ttccttcctc
tcctcctccc aacccttcca 3060tttttacaga tatagctctg tctttccacc tctagccaat
tcaaaataac atttcagttg 3120ctctgtccat tgttacttat ttgttaatta ttgatatagc
accgggaccg aagaggtatg 3180gagccccaac caggttccca catgttgcct ttcttttatt
gcctctacac aaccacccaa 3240agagtgagtc ctctcctttc ccattgcctc tgcccttagc
ctgaccacca catgcctgca 3300gtaaactagt cccagggttt gtgtgcaaag cattactggg
aaaatacaga gtgagaagat 3360atggattctg cccccatatc gctttgcttg tacgtcaatt
ggggagtgag aacaaacact 3420ttaaatagtt tatattaaag taagtaagca ataaggccag
tggtcttaaa agagaagaga 3480gaaatcacca tggacatggt agacagggag tactctcagt
cgagagggcc tggaatgagc 3540cttgaatact gggctggatt tgtgttggag aggaggaagg
cagttggcat tgtaggtctg 3600gtgtatagct ccacaagctt gacaatgctg tgaggtgcca
tcagggagga ggtgtcctac 3660gagagcctgg gttagctaaa acaaagacaa gctacaataa
cgtcactggc actgcacgtt 3720ggaggaagtc acaaatgtga tttcttgttt ttttctgaga
gtatggccat aataataaat 3780ctcttctagg cacttcctaa agttgctcca tgtcagttcg
caggttcttg gggcagacgg 3840ttttaactga agtctccatt ttataaacac aaaattgctc
aaccagttaa tcacgcctca 3900tagcataaga ccacattcgt gacttcagtg tcttttcaaa
actacacaca cctacatcct 3960gccaagatta tattacttgc ccaatctgtc caatccccac
cccacccctg ccatctaccc 4020cttacctcac ctccgcccac acacacaccc tcctaccctg
tcaggattca ctgctctaga 4080ccctgacctt tggattatag tttctgtagt cagttcacca
tccttccaac ctacagtcaa 4140attatttgaa ctactaggga tagtctatct gatttgccac
aactattttt ccttttttaa 4200ttttattttt tgccaccaca actattgaag aatgctatct
tcatcttacc cacgagaaaa 4260tggaggcaga gggaggttaa gtggttgccc agatttaccc
agatactaag taataaaacc 4320attacttgaa ctcaggattt attactttaa atcctgtatt
gccaataatc aattggaaaa 4380taactgaaaa ttgcctacta tttataataa caataaaaac
catagcatat ttatgaatta 4440acatatcaaa tataagaatt ttaagaaaaa agaaaacttt
attgaagtgc acaaagacct 4500gagaggtgta gagatatacc atattcatgg ataggccatg
ctaacataat gacaacctct 4560ccccacatct ctaacctaaa tgctacccca attaaagtaa
cagtaggatt tcaggagaat 4620ttaacaaact gattatagaa tgtacatgga aataaagtcc
aagagtatct tagaatattt 4680tgataaagaa aaggaaaata aattttttgg gaaggtggtg
aaggaatgga gactagttct 4740actaaatagt aacacatatt aaaaagccaa aataatcaaa
caatatgata ctgattagta 4800atgagagaaa agcaaattaa aacaacaaaa taccactcta
cacccaccat gttgccaaca 4860tttgaaagtc aaataattac aagcattagt gagcataaag
ggaaatgtga actatcttgc 4920tttgttgatg ggagtgtaaa ctgtttatga tccctgaatt
atagaaatta taaactagtt 4980gggcgaaaaa attaacatag gaaataaagc ggcatatccc
aatccttagg ttgagtgctt 5040taagtcttgg aagatttcaa taaagagaaa ttaggggcag
gttcatggaa taagttgaac 5100tggagttgga cctatggagt gggttaagac aggaacaaga
tgagcagaat aaagaaagca 5160ttcttgtgag aggaaagagc ctgggcaaat gccctaaacc
aaaatcagat ataatacctc 5220aaggaagagt gaggaaaaaa gatttattca agaatagcat
tcctgctggg aatagtgagt 5280aatatttttt attagaaaag gggcaccaga ctagagagga
tactgagtgc ttctagagta 5340cttaagtaac agtatcatag aaggtttcat cagagagcat
ctaatctaag cccatcattt 5400tacagatgaa gactttgagg cccagagagg ggaagtgact
tgtctaaagt cacacagcat 5460aataaagcac ttttaagtct tgcctgacag gaaatatcta
gataagttgg aaaacagaga 5520gacagagaaa ttaggaagaa ctagaaagca ccacatctag
aattactaac atgagaataa 5580aaagaaaaac atctaaaatg gagaaaatac aatacttgaa
gctagtattg aggtatattt 5640cagaaaagag aaagaagtct acgaggcaac taagttctcc
tctgaagatc aagaccaata 5700atgataaggt taggttattc agcacatttt ctatgtgcca
aacactattt taagcattct 5760gtaggtatta acttatttaa gcttcacagc atgaggatat
gctgccttat ttcctatatt 5820aactttttca ctcaactagt tcataatttc tgtaattcgg
gcatcataaa cagtttacat 5880tcccaccaac agaccaagat attacagttc acattttcct
ttatcctcgc taatacttat 5940ttgactttca aatgttggca acatggtggg tgtagagtgg
taagggggac accattgtta 6000tcatcatcct tttacagaaa atgacaccaa agcacaagtt
aagtaacttg cccaagggct 6060cacagctaaa cgctgacagt tacgattgaa tccccagcag
tcaggttcca gagcccatgc 6120ttcttaaccg gtacacatga tgctgttaga aatgagatgg
ttcagagaca gtgcaacttc 6180tcttagggag aatttaatat tttcttttag attagactct
agtacaatgc caagaacaga 6240aactccctca ccaaataatt gccctctcaa ctttattgcc
accctgtcat ccaaagcaac 6300tcccagaccc taaggaatgc aagaaagaaa gcatatgcaa
agcaatttac caccagtggt 6360catgtgctgc cacctttcgt tatcttccca ggacagcacc
tgtgcagttc tccttggaca 6420gttcactcag gccaaggaac agattgtcag gaaagacatg
tgaattcttt gcccttccag 6480gctgttttca cttcatgtta ggggcttcat gatactgttt
tcccagaact gacataactg 6540attggtatag cacttgggag cttattcttc ccatccctga
gcttctgttt ctcagttacg 6600gtgagggttg aagggagtta tatgttcctc agggcagcct
atacgagaca taaacatttt 6660cacaaacagt aaaatacaca acacacacac acacgcacaa
aacacacaag cagcttcctt 6720aaccattttg taagcagatt attagaaaat aactctgcct
tcgtttctca catattttgc 6780acaaaccgat agatggaaaa acatcatgta ccgccaagac
cagggaataa gagctcagct 6840ggcaaattag gggttttccc tatttccctc cctaacgagg
tcaagctgtg ttcaggttaa 6900ggcatgctga atttgaaacg acaacccact caagttgaga
tatccagaaa caaataccat 6960gagttaagaa agaagccaca ctgatataaa gaaatgagat
ttattgcctt gtggggggaa 7020gggatgtggt tgtgataggc aggccactct gggatccctg
ggatgcaagc ccagggacag 7080cagagtcccc aggtgggaaa tctacacaca caccccaggg
atgtcccaga gacttcttct 7140accctaagag gagatcctgg gcaggatgtg agaaatctga
gcatcctctg tttggatggc 7200cgaagctgct ggcatcaaac tctggtctgg aagaatcagt
ctgggggaga gacagggatg 7260gaggaaaggc atcaggggat ccatcctcct cctccttctc
ctcctcctcc tcccccacaa 7320aggccttgct cgccctgcct gcaccacacc ctgcagaagt
tgatctctcc ttgttcccaa 7380atcatctcca agcacccttc ctacagcacc ccatgattcc
ttttttcact caaagcaatt 7440cttgtgaccc ataactgtgt gtgtgtaact gggtccccaa
ctgggaagat gtgcccccat 7500ggtgctggat acaggccccc acacccaagg gcctgaggat
cgctatatgt ccccccatgc 7560cacaaaataa tcctgacaca tgcacgcatg caccactgta
tctggctccc acaggctcac 7620ccgccccctc cagatgacat accacctgag caaggcttcc
ggaagtagat gatgagaaca 7680atgcccacga tgatgcccag cacacccagg ccaaaggcca
cgccacacag cacattctcc 7740agcagatctg agggcagtgc gttccggggt actggaggaa
atgagtggct cagcctgggg 7800acctagttag ggagcctccc acccagggaa atgacgtggg
tgtctgggat gacatgggag 7860actgggatgg gcttagggta ggaatggact aaacaaggta
ccagtggaga aagaagcctc 7920ctcccatgga tctatccctt tttgccccca aaaggaccag
aattccaggg agaaagcctc 7980accccaatag gcaattgctg tgtagcggtc aatttcgtga
gtcacaatgc aggagaaaat 8040gtcagaaggt tctggtgtga agtttaagta agaaaaggcc
tggaagctga gtccatcgac 8100agctgagaca aaagtaggcc caaatccttc cacagggacg
gaatgatgct gccagttcac 8160tgtcagcatg ggtgggaaga gattactgac aaaacagacc
aaagtgttgg gcttgccaaa 8220ctccaggggc ttcagcgtga acacttcagc gataggaaac
cctggtgggg ggattgaagt 8280gtagggggaa aaagagacta gtttagatgg tatctctgtg
tttggagggg ccatggcata 8340tggaggggag ggcagagaag aacacagtgg gtcaggcttt
gggagacaga gatgagcgag 8400gagctgggct ctgaagggag gtcttcttcc aggcaaggac
tgcagctaga cgtagaagca 8460gagccagatc caggctactc tggacccctc caccatgact
tccttcagca cttcctgtct 8520agagctcaca ttgatgtcta accatgcact gtcttctcac
taagacatag tcacgtcatc 8580agatatttcc actcttccca tccatcttgc tgggcatagt
agcacaagtg ttaatattca 8640gtaggtatca gttggtacct gttgaattca tcacattcaa
tacatagttc tgaatgccta 8700ctacatgcta ggtacttcgg cccaccaaaa gaacacaggg
tgcagaccaa ggctggtgga 8760aaaattaagg tgatgaagag aaccagaaag tatttgagat
ggggagctgg tatcaagggg 8820aattattcag tgtacagatc aatgaggtta atgcagccct
cctcccttca ctccccagaa 8880aactcctgac ctctggacac cgggattttc ccatcaagtt
ttggccctat ttgctggatc 8940atccactcgc agaactcttt gtcaaataaa atggcaggag
catctccctg ttcctgagcc 9000cagtcagcaa attcgggcag gcgaggcacc cgagtgttct
gggaaaagtc gaagaagaaa 9060agctggtcct cgtcgtaggc ctcagagagt cccacactgg
gactcccatc ctggcagtac 9120actgtgtgca ggaatgtgtg gttttgcagg tcatctggcc
acattggagt aggagctgca 9180aaggacacag ggtgaggttc agggaggtgg gagccttctc
ctccaactta aaaaacagca 9240aggtggggct aggcgcagtg gctcatgcct gtaatcccag
cactttggga ggccaaggtg 9300ggtggatcat gaggtcagga gtttgagacc agcctggcca
gcatggtgaa actccatctc 9360tactaaaaat acaaaaaagt agctgggcat gttggcatgc
gcctgtagct actcgggagg 9420ctgagggagg agaattgctt gaaccaggga ggcagaggtt
gccgggagct aagattaagc 9480cactgcactc cagcctgggt gacagagtga gactctgtct
caaaacaaaa caacaaaaac 9540aagcaaggcc tgcttaagga gcgtgggctg aggtgagacc
ctttcctgtg tctgttattt 9600agactccccc tcccaaaggg ggtgaagaac aaattatggc
atctctccaa gcttcccctg 9660cctataaaaa ggccagttgg caaaagtaaa gagttctact
ttctaaagtg acagattcag 9720gccaggcatg gtggctcatg cctgtaatcc cagcactttg
ggaggctgag gcaggcagat 9780tgcttgagcc caggagttca agaccaacct gggcaacaca
gcgagaccgt ctctacaaaa 9840aatacaaaaa cttagccagg tgtggtggca aacacctgtg
gtctcagcta ctctggaggc 9900tgaggcagga ggattgcttg tgcctaggaa gttggggctg
cagtgagcca tgattgtgcc 9960actggactcc agcccaggtg acagaatgag cccgtctcaa
aaaatatata tataaaggcc 10020gggcgcggtg gctcaagctt gtaatcccag cactttggga
ggccaaggcg ggtggatcac 10080ctgaggtcag gagtttgaga ccagcctggc aaacataatg
aaaccccatc tctactaaaa 10140atacaaaaat cagctgggtg tggtggcatg cgcctgtaat
cccagctact tgggaggctg 10200aggcaggaga gtctcttgaa ccccagaggc aggggttgca
gggagccgag atcacgtcac 10260tgcactctag cctgggtgac agagcgagat gccgtgtcaa
aaaaaataaa ttaaatcaaa 10320taaaaaattt aaaaatgtat atatataaaa taaagtgaca
gattcagagt cactgttcat 10380tgtgtgtttg ggggctgcac aaagacacct agccaaagaa
gcaagtgaaa gcctgcattc 10440tgctcaccat gccatacatc ctggcatagg gctgtatcct
cccaaagggg attcctttgt 10500ctaattcata ccaggccact gtattgacta gagaaggcca
tggatgggtt tctcactctt 10560agaagggaaa gaggaggaat ggctacagcc tccccaagcc
atagatggga ctgcctccca 10620ctatccccag acacaaatgg taaattggaa aacctgtatc
cagacatttc ttcagccact 10680tcattggcac caagcgtctc tcaaaatgtc ttctgttcct
taacctacca ggcctcccaa 10740agacagcaat gggagaagtg accccataac tgcataaaat
aatccctctt ctttgaagct 10800cttggcagga atcgctcagc cagcaggaaa cctttaaccc
aatacccaga aaaacagaca 10860tttggaggaa gagggatctt ccagattatt cttccattct
gccccatcct ctacagagaa 10920ggaaactaag acacttttca agaatcacaa gataagttaa
tgatagaaag cagagtagaa 10980tcttgagtgg aggagtgaaa ataacattca ctttgttcaa
atcccagctc taccactttc 11040caatggtgtg aacttgcaca aataactctg agtctcattt
tcttcatttg taaaatggag 11100agaacaatct ccgcttcaag agattgtctt aaatggaaca
tgcaaagcat cactgatatc 11160gtttaccaac cacacatagc agctgtcttt ccccactccc
ctgttgtttc cactgcctca 11220taagacttcc caccactcac aaagcacagc gcttttcctc
acaaagctga gtgggctccc 11280taggttcagg atggaagtaa ataggagtac catcttacct
tcagggacgg cccaggagtg 11340gggtagcagc cacagaagtg gtaacatctg tagcagcgca
gctccttggt tctgttcatg 11400acccatacct tcttgccaca cagtaggtag gagctaccaa
cccagccaac ccagcttccc 11460caactccctc cccgagaggg tggccttaga tcatgttttg
ccagatcatt tccaataggt 11520gcccttgtca ttttgtctaa accaatcaga gaagcgtagg
gtttaacatc atcagtcact 11580ggggagacgc ctggggccag taacctcctg aagacttggc
tgtttgacca gggcagagta 11640tggcatgtaa ctgggctggg aagcccagtg gaggaatgtt
gcttcctggt ggagttccct 11700ctttggtttc aagctgtcag cctcagtctg taagcgacca
gctggctctt cagagcagtg 11760ccacctcctg gcagaatgct gcaatgggga accgcatctt
ccccaagtaa acccccaggg 11820ctcttcggac cctgccttct cctccctcct ggctcttcct
ctttctcaaa aaaacttatt 11880ctccttcagg cattagctct aattcatttg gcagacatat
attgaaaata caagaaattc 11940tgggtgttgg gcccagggct agaaatacaa agatgaatag
gcatagtctg ccttcaaaga 12000gcttagagtc tagtgctggg ggagggggcc aagggataat
tacacaacaa tgtaatgtat 12060tcaaataaga atgtgccaag tgttttggaa gtcgcagtaa
ttttatgagg atgcggaata 12120ggaggaacat aatcaggcag gctcctaaga cttgaaggaa
aaacaatttg gccagcagaa 12180catgaaggaa gagaaaaaca cgccagggca aagggtaggc
agaagtacaa agatcacagg 12240catccagagg tcctctttgg agaccctgtg tactagttga
tatgaatgtt gtgaaggtcg 12300cttgggtgtt cctgtataat aggaggtaat ggggggtaga
aggatgttgt gataagctac 12360aaattcgggc aagggccaga tcacgtgggc cctgctacgc
cacaaggagg agcttgcttt 12420tacttagcag atgatagaga tattaaaact ggggaatgac
aatcatttta gcattttgga 12480aaaaatgttc tgattgatat ttcaaacaat gaactggagc
ttttaaagaa ttgaggcaaa 12540actgctgggc aagagtctat agcataccaa gatgaacagt
tgcacatata cacaccactc 12600ctgtagcaat acagcaataa tttaaatgac agataataag
agcctgaatt aagtcataat 12660tagaggaggc agaggagata gaatatcaag ataattagga
agtagaatct aaagggtttg 12720gctactgatt agctgtggga gtgggaaggt ggaggagtca
aagatatctc agatttccag 12780catgggtggc tgggtgggtg gtcagggatg gactgaattg
aagcagaaaa gaatgccatg 12840ggagcaggtt tacagagaga aagagcttga ttttgtacat
gttgaatttg aaatgccagt 12900ggaacagcca gctgaaactg catgggagcg cagtgaggcg
tgtgggtatg gaccccaggt 12960atggtctgaa gaccctgatt tgagagtcat cagcacaaat
gtcgaagcag aggccatgaa 13020taagatcacc caagtaaact gtgcagaagg agtgggaagt
gaaacaagga caaaagcatg 13080catgggctca aaccccaaac ctcataccag ttatccagga
tccagtcagg agcatttaac 13140tactttatgt gcttcagact gaaagaattt aatatagaga
attggttaca aaggtgttaa 13200aagggcaaga agtacaaaaa aaaaaaaaaa ggagagtcct
agaaatgtac attttaaaaa 13260aagattgcta tctggaaatc agaagctgcc atcatccctg
agctggaatc tgtaaatcta 13320ctcattgcct tgtgagagac actgtcatag tcagttccaa
tctactagaa aggtgccacc 13380tccttcaagg ctagaatcct tgagaaggta cttctgctca
ggaggctgga gtcctgagtc 13440tcccattctt cctgctgcta cagctacagc caatagctac
cagctattgc cagccaccgc 13500cactgtttag aggctgaagc aggatgcttc tcagtttctc
ttgccttctg atctcccatc 13560agtgcctcct actggcagaa tcaaaaagga agccagatgt
ccaggaaggc tgggaaatac 13620acacctggct gactcctaag ctaagcagtt caaaacacag
tagaggaggg tgtgtgtgtc 13680actgagacaa agataataac gagtacactg aaataccctg
gtttgtaaga atctggtggc 13740acgaggacca tccagagcac taagaaaaga ccaaggtaga
agcagatcag agaaataaaa 13800aagaggtgtg ccatgaagga gggcaaggtc agcattttta
aatgctactc aaaagtcaag 13860aaaggattga aaagtgtcct tagatttggt gattatgaga
tggctgacaa atttattgag 13920agcagtttca gtgttgtagt gggagtcaac tccagattgt
ggtgggctga gaagtaagtg 13980ggaggtgagg aagaaactgt cagtgtacat gcttcaagtt
tgttagacaa aagaaagaga 14040aagacagaag gggtggggga agaggcagtg agaaagctct
aatgtggcaa tcaagtaatc 14100tgagaaatta atatatgtga atattgtcca acagtgtttc
tgaggctttc aaaattcata 14160ccttccacct tttttttttt tttttttaag acaaagtttc
ccctgttgcc cagactggag 14220tgcagtggct acttacaggt gcaatcataa ctcactccag
tcttgaaccc ccgagttcaa 14280gcgatcctcc cgcctcagta gctggggact ataggcacat
gccactgtgc ctggcttcat 14340atcctctttt gataaacaag taatagcagc agtaatagcc
aaaaacaaaa acaactctat 14400gacctcctag atattctgga acagcaatgt gtatatatgt
gtgtgtgtct gtgtggtgga 14460ggcagggtgc cagggaagga ctagggtttg gaaatcatgg
taaccctcca gaaaacaaaa 14520gaacatttcc cagtatccca acatttatgc actaacccat
cagcggttct ggcagtgggg 14580agattcaggc ccctggacag tagaaaagaa gtttatgaga
ctaccagtgg ggagacatat 14640gggacacagc cacctagagt cctaaaccag gggttagcaa
actttttctg taaagggcca 14700gatggcaaat attttagaca ttgtgggcta tcagatctct
gtcatgagta ctcaactgtg 14760gcacgaaagc ctccatgcac aatatgtaaa tgaaggagag
tggctgtgtt cctagtttcc 14820tcctagcttt tcctcccact tcttgagcat ctccttctca
gtctccttca tagactcctt 14880cctttcagct actctttaaa tactggtgtt ccctggagtt
tttgtcctca accctctttt 14940tatttatgga cactaaaatt caaatttcat gtaattttca
tgtgtcacga aatattcttc 15000atttgctttt tttttcccta accatttaaa aatgtgaaga
ccattcttag cttttaggcc 15060atttaaaaac aggtggtagg caagattgtg ctcacagccc
atagtgtgct gaatgatgct 15120ctacacgtgg tcagaattgg tacgaaagcc ccaaattaaa
cccacccttc aaagaggaac 15180ctcagtcccc ttattattgg attggcaatc agttaacaaa
cactttgtgc cagttacacc 15240agtctatttg gaaggagatc tggggaagaa caggagaaac
tagactgggt ggaagggcat 15300aggaataggt acagcagaca ctgcaatttc tctgggtgag
aggaacaagg cagaggggtc 15360caagttctcc atagggagca cagtgtagac aagaccaagg
tgaggacaaa cataaccatc 15420cctcaccaag actgtggtga ggggtggtta actccattct
ccccttctat aatctcagtt 15480taaatggtaa caagttcaaa cacttataac tactcttccc
tccatgtaat ccttccccac 15540caggacctcc caactacctc catcataagt atctcaggaa
tagtctctca tcagtttgga 15600aagtaataat tgtgggcaag agatgagcaa ggcagccagt
tctgctttgc agtagttcac 15660tgtctacttt gtcattagct atgaatgcct ctgaaaataa
tggcacagca ccggtaaatc 15720caggaggctc tggctttcta acactcagct ctgccatccc
tttctagcat ttaaaaatgg 15780actctatttg gccaggcgca gtgattcacg cctgtaatcc
cagcactttg ggaggccgag 15840gggggtggat cacgaggtca ggagatcaag gccatcctgg
ttaatggtaa aatcccatct 15900ctactaaaaa tacaaaaaaa aaaaaaaatt agccaggcgt
gatggcgggt gcctgtaatc 15960caagctactc aggaggctga ggcaggagaa tcacttgaat
tcgggaggtg gaggttgcag 16020tgagctgaga tcgtgccatt gcactccagc ctgggtgaca
gagcaagact ccatctcaaa 16080aaataaataa ataaatatat aaaaaggact ctattttttt
tcccctagca gagtcagatt 16140tcttggaaaa gtcatgggca actgtggccc cgctcccatt
cttaccattt aatcttttaa 16200ctctcaacaa tgcaattgtt caccaatact tttgtgttgc
caaatcaaat gaactagtct 16260ctgcaacatc tgacactgtt ggccataccc tatctcctaa
attggtcaaa tttctggcat 16320ccctgatggc actctctcct agttttccct cctacttttc
tggcgtcccc ttttcagtcc 16380ctttgggact cctttctttc agcaaccctt taagtattgg
tgttccctgg agttttgtcc 16440tcaaccttta ctcttcttag actatacact tgccctggat
ggtcctctca tttactccca 16500catgccttct gttaccaccc atttgctaat gtcttccaag
cttacctctt cagctcagat 16560cttgctctga gttccacact acccatatct gaaccacttc
tggtcaaatc cacttggatg 16620ctatgcaata gcagtttttt gtttttgttt tttttttaaa
tatggaacgc ttcatgaatt 16680tgcatgttct taaactgtat tcttcacaat agcgttcctc
aagaaataaa aaaagtaagt 16740ttgatgatag caatcattta tttttgaatt tatttccaca
tagacataat gcaacatcaa 16800acacatttat ataatatttt ttattatgta acaatttatt
atatttaata agtctattta 16860ttgcaagcaa tagaaaccaa ttctggctaa cttacatttt
aaaaatgagg atttattgga 16920aagatactga tctaactcat gaaatgaaag taatagttga
ataagctagc ctcaggtaga 16980atagccacag ggaccttaga agcaggggtt gagttgccat
taatatgctc acctgcaaag 17040gcctcctgcc tctttatctt tcaagttttg ctttgctggg
agagcctctc tcactggctc 17100agcttgtatt aggtgtgtac cactggattc attggttgtg
gccaggtaca gtattacctc 17160tatggattag agctattcct agagaaggga gaatcatatg
aaaagtaacc acctcaatac 17220agctattttc aacatatggc atctcagaca attgtatgag
atcatctgag gcataaacat 17280aaggttaaat ctgtgtatta atgctcaaac agcatttcct
aactactcag gtgacatatg 17340tcatctgctt gatgatctct ggtcggtcac ttgtcttatc
acatattcaa attacattta 17400tcatgtgatt caatattgat ttattaattt aaaattatat
attccacgaa tttcctttga 17460atctctgact aaaaaggttt ttttaatttt actttgaaaa
gctccaagca cacacagaag 17520agaagaatct aataaactcc aatgtactct catgaatgtc
aacaattttc aacatttaac 17580attcttccat tcttgtttca tctattgttc tgcatttttt
ggagtatttt aaacaaattc 17640tgtcattaca tttcaccagt aaatactttt aggcatatct
ataatagata ataacctttc 17700ccttaacata actataatgc catcaccaca accaacaaaa
ttaaaaatta cttaacttca 17760tttgacccaa tctgttcatt tctcctagtt atctcaaaaa
tgtgtaagag aatgaagttt 17820taaatgaaaa gcagtgtctt ataattttca aaccgtgcca
ttagtttaaa aaaattggtg 17880agttttctat tttatgtttc ataagctatt gatggttcaa
taatgaattc taattaggta 17940ttccataggc aaataaagtt agcaattgtt actctgaatg
tatctccatc tcaagattac 18000aagagtacac tcatcacttt cccttcccaa tatattccaa
ctcctctctt atatttaaga 18060cttcagtgaa taacaagatg tccacccgag ctacaaatgt
gggtcatcgt tgatgacccc 18120atcttcctca aaccttccca ttcaattgtc ctaacaattc
tacctttcta atagctcttg 18180aatcttcctt tcttttcctt ccattcctac tggtccaggc
cttcaatggt tggttttcac 18240tgattattgc aactttcttt ataattggtc tctctctctc
caatcttatt attttccaca 18300gtgctgccag aaggatattt ttattatgct tagttgatca
tattatactt ctgcatgaaa 18360accttccatg attgttaatg atctactttc cttgtcatga
cccataatga cctgaagtct 18420acttacctac ttctatatgt cttttcaggt gaaatctcac
tcctctcagg aagccttcct 18480tgaacccaga gttgagatta atagcctctt cagtacgttt
ccaaagcacc ctgtgttggc 18540cattatcact gttttaattg tattattctc ttccatttat
atgtctgttt catagtcacc 18600tcatctctac tgcaaggtcc ttaggggagg gtgtactata
tatatatata tctccaccaa 18660gaggcccact aagtgacctt tcactcgatg aacaaatggg
ctaccagtct ctgaaggtgc 18720tgaactgaga atggaagagc cttcaggtat tagatgatga
tggattgtcc cttctaacag 18780atgtttcaaa ggtaaatctt atcaggttta tctataagcc
attctttttt tttttttttt 18840gagatggagt ttcactctgt tgccaaggct ggagtgcagt
ggtacggtgt ccgctcactg 18900caacctccac ctcccaggtt caagtgattc tcctgcctca
gcctctggag tatctgggac 18960tacgggcacg tgccaccata cccggctaat tttttttttt
tttttttgta tttttagtag 19020agatggggtt tcactgtgtt agccaggata atcttgatct
cctgacctcg tgatccacct 19080ggctcggcct ccctaagtgc tttgattaca ggcatgagca
accacaccca gtctctatga 19140gccattttac acctccacag ccttccctat atactctact
acccttccaa ttccattcta 19200ggcccttccc aagctccttg ccaactacca ttttcttcct
actccctgcc acctcctgtt 19260tcagagagca aacctagcca tccagctccc acatttactc
ttatttctac ctcagtacat 19320ttctccatac ccatattcat cctccctttt agtgacatta
ctatgatgca gcaatcctta 19380caactactct acaaggttat aatttattat ccccattata
taaacaagaa aactgggact 19440cagaaaggtt catttattta gcaaatattt attggccacc
ttctgtgtct agcagtatgc 19500tctgtatcag atacctgcca tcatcacact taaagtctaa
tgaaaataaa gagacattaa 19560acaagaaaac atacaaattt ataaactaaa aggtccacac
acacacacac acaaaatctc 19620ttagaattga taaattcagt acagttgcag gatacaaaat
tatcatataa aaattaatgg 19680tgcttctgga tacaaacagt aaactagtgg gaaaagaaat
caaagaaagt aatcccattt 19740acaatagcta caacccctcc ccccaccaaa aaaacaaaat
agaataccta gaataaacca 19800aggaggtgaa agatctctac aaggaaaact atgagacact
gaggaaaaaa actgaagagg 19860tcacaaaaaa atagaaagac atcctatgtc ttcggaagaa
ttcgtatcgt gaaaatgact 19920gtactaccaa aagcaatcta cagatttgtt gcaattccta
tcaaaataca aagatattcc 19980ttgcagaaac agaaaaaaca aacctaaaat taatatggaa
ccacagaaaa cacaaatagt 20040caaggtaatt ctgaacaaaa agaacaaagc tgtagacatc
ataccaccca acttcaaaat 20100atactacaaa gctacagtaa ctaaaagagc acggtactgg
cataaaaaca gatacacaga 20160ccaatagaac cgaataaagg acccagaaat aatagatcca
catcttaaca gccaactgat 20220tttcaacaaa ggtaccaaga tattcaatgg gaaaaggaca
cactcttcat taaatggtgc 20280tgggaacact gaataacaat atgcagaaaa atacaactac
acccccatct ctcatcaaat 20340acaaaaatta aatcaaaatg gattaaaaac ttaaatgtaa
gacctgaaac tataaaagtt 20400actgtaagaa aatactgggg aaatgctcaa gactttgagc
aaacattttt tggtttaaga 20460cttcaaaagg agaggcaatg aaagcaaaaa tacacaaatg
ggattacatc aagctaaaag 20520gcttctgcca cagcaaagga aacaatcaac agagtgaaga
gacaaccttc agaatgggaa 20580aaaatatgtg caaactatcc atctgataag ggattaataa
ccagaatata taaggaactc 20640aaactcaaca gcaaaaatcc tccaaataat cccatttgaa
aatgggcaaa tgatctgaat 20700agacatttct caaaagacat acaaatggcc aacaggcata
tgaaaaaatt ctcaacgtta 20760ctaaccatca gggatatgca aatcaaaacc acaatgagat
atcatctgaa tctaattaaa 20820atggctatta tcaaaaagac acagataaga gatactggtg
aggatgcaaa gaaaggggaa 20880tgctcatata ctgatggtag aaatgtaaat taacatagcc
actatggaaa acagcataaa 20940ggttcctcaa acaactaaaa atagatctac tagatgattc
agcaatccca ctgctgggta 21000tatatccaaa agaaaggaaa tcagtgtatc aaagagatgt
gtacatgccc atgtttattt 21060cagcactacc cacagtagcc aagacatgga atcaatctaa
gtgtctatca agtgactgga 21120taaagaaaat gtggtgtata tatatacaat ggatactagt
cagccataaa aaagaatgaa 21180atcctgtcat ttccagcaac atggatggaa ctggaagtca
ttatgttaat gaaataagtc 21240agacacagaa aaaaaaatat cacgttctca taagtgggag
ctaaaaaagt tgatcttatg 21300gaggtagagg gtagaatgat ggttaccaga gactgggaaa
gggagggggt ggagggggga 21360tgaagagaga ttcattaatg gttacaaaaa tatagttaaa
ttgaaggaat aaattctata 21420gtgtttgata gcacagctgg gtgactacag ttaacattaa
tttactgtat attccaaaat 21480agctagtaga tttgaagtgc tcccaacaga aggaaataat
aaatgtttga ggtgatggat 21540atcctaatta tcctgatttg atcattacac atcgtatgca
tgtatcaaaa tatcatatgt 21600accccataaa tatgtacaat tattatgtat caataaaaaa
taaaaaaaaa caattcagaa 21660gtccataaac ttggatggaa taaaaaaaag tcaactttat
tttcaaaaaa ctctcactga 21720aatctaattt tatgaatgta gaaaataaat ctttgtagta
ccagccagca gctgtaacac 21780tgtcatcaat agaaaacacc atcaattaat attttcatat
cacattatag ttgttacaga 21840catcttaaaa tatcacttac aattatggga gctgttaaac
ttgccaaaaa atcatgcttt 21900ttaatgtatt agtaaagaaa cactgtattg tattaataca
gaaacacata ctactagatc 21960atcacacgtt tctttgaata tagtagtgtc ccccacacag
caccaaatgt gattatacag 22020tttattccta tccatagata tacctatgat aaagtttaat
ttataaattt gcacaggaag 22080agattaacaa caaaatagga caattatatt gtaataaaag
ttatgtgaat atggtccttc 22140tgtctcatac acaaagtatc ttattgtact tattttcaga
ccaggttgac cttgggtaac 22200tgaaatcaca gaaattgaaa ctgcagttaa ggggggacca
ctgtattttg ataactatag 22260tttatatttt attttatgca tttacaaata ttatcagaca
agatccaaag gcttcaccaa 22320actgccaaaa aagctaatgg cacataaaaa gcttaaggag
tcctgattta atcagtcatt 22380caatgaacat gacatccttc ctggaaccat ctcctgttct
agcttcctca cattatgttg 22440ctctgcttct ccttgagatc ttccattggt tccacttcct
attcttgctt cctgtatgaa 22500gatgtaaccc aaagctcaat ccttcaccct aaattgtttt
tataccccct cttttacaaa 22560cctcagctac cttcgtggct gattcaaaca tcacctcaaa
ggtgactctc aaatctgctt 22620ttcctaatct tttttctcta acttcaatct tggatcttaa
actccctgct gtgcctagta 22680aacagaataa tatgccaccc agagtcagct gggttcaaat
cccagttctg ctacttacta 22740aaggtgtgac attaggtaaa tattacctgc tatggtttga
atctctcctc caaaactctt 22800gttgaaaata attgccattt tgacagtttt aagaagtggg
acctttaaga gttaattagg 22860tcatgagggc tctgctctca tgaatggatt aatgctacta
atgtaggtat gggttcccat 22920ttaaaagggg acattctgag gccgggcaca gtggctcaca
cctgtaatcc cagcactttg 22980ggaggccgag gcaggtggat catgaggtca ggagatggag
accatcctgg ctaacacggt 23040gaaaccccgt ccctactaaa aatacaaaaa attagccagg
cttggtggcg ggcacctgta 23100gtcctagcta cttgggaggc tgaggcagga gaatggtgtg
aacccgggag gaggagcttg 23160cagtgagtca agattgcact actgcactcc agtctgggcg
acagagcgag actccgcctc 23220aaaacaaaca aacaaacaaa caaacaaagg gtacattctg
gcctctattc tctctccatc 23280tcatgtgctt gtttgccttt ctgccgtggg atgatgcagc
acaaggctct caccagatgc 23340caatgccatg ctcttggact tccaagcaac tggaactgag
ccaaataaac tactgtttat 23400aaattaccca gtctgtggta ttctgtgata gcatcagaaa
acagactaag acgtcctttg 23460cttctgttgt ttcatttgaa aactgagggt gataatatta
gtattgactt tatagggtta 23520taaggattaa aagagttact acatgtactc attgcagtac
ctgacacatt ttaactactc 23580aataaatgtt ttgtatcacc aatcacatct ccttccaacc
ccgacatttt aatttgatgt 23640ttattaacat ggacggtgcc agccactgga agacagagtt
tctatctaac aacataattc 23700tgatcaagtc attagtcaaa aaatttcagt ggttccccac
tgattccaaa cttaacagca 23760ctggaaacct tctataatgt gttctctaat ataaatttac
ctcccatttt ctcttctcct 23820gctctacttc ttgtagctta tgttctggcc agactggact
agactactct ctgtgacaat 23880aacctgtgct gttctatgtc tgtctttcct cacataattc
taatgtctca ggtttgaagg 23940caataatttt gtctatgatt attcccctat acatggcacc
ccataaaaca tacacatttc 24000aatcttacct aagtcacata cttacttaca catcaattca
cctccatatt tgctcaattt 24060gtgagaacct aatattggcc agatactgtg ctaggaccta
gggatattaa aaaaaaaaaa 24120aaagcaaagc aagaaaaaga atgcataatg gccctgctct
caaaatcaag gtctagtact 24180agagagaaac atgtaatcac ataaatgcca ttcactgtgg
aaagtaaaat cataagggga 24240agggacacca aagaatgagc agttagctca acttgaacag
taacattaag cttttcagag 24300atgttatttg ggcgtacata gattggggaa aagtctactc
catatagaaa gtgcacatgt 24360gtaaaacaca gaggcatgaa acaaaatgat gtgtctggga
aacagttcaa tacagctgga 24420atatagggcc caagaggaag tggttagaca tgaggctgga
aagctaggca gactgttttg 24480gcaaacatag gaatttggac tttatcacat agccaataag
gaataacaca gagttttaaa 24540aagagctatg gccagggcta tattttggaa agctctctcc
tggcagtatt gtggcagagg 24600cagagaggaa agtctaaagc agcactgtcc aacagaactt
cttgtaatga ggccgcgcgc 24660agtggctcac gcctgtaatc ccagcacttt gggaggctga
ggcgggcgga tcacgaggtc 24720aggaattcga gactaatttg gccaacatgg tgaaaccccg
tgtctactaa aaatacagac 24780actagccggg tgtggtggca ggcgcctgta atcccagcta
ctcgggaggc tgaggcagaa 24840ttgcttgaac ccgggaggca gaggttgcag taagccaaga
ctgcgccact gcactccatc 24900ctaggccaca gagcaagact ccgtatcagg gaaagaaaaa
aacaacttct tgcaatgaca 24960caaatgttca ataatctgtg ctttcccata tgacagccac
tagtcacatg tggctattga 25020gaacttaaaa tgtggctagt gtattgaggc actaaattta
aaattgtatt aatttaaatc 25080caaatagcca tgtgtctagc aaataattta ggagactgtt
ggtatagctc aggtgataga 25140attaggacag aagggtgagt tgatggatag ttaagaggca
aaattatgag tctgtaaggg 25200tgtgagaaaa ggaaatcaag aacaggctcc cagattacag
actttgtggt taaacagcca 25260ccattactca ggacaacaga agagaaagag caggtctaga
gtgtatagtg atttcatcaa 25320ttttgaacat actggtgtct gagagttatc ccagtgggaa
tatttagtag aaagtttagc 25380ttagagagct gtctgaacta aagattcaga cttcagaggc
tttgagccat ggagtcagat 25440tacctagaga agttgaacaa aattagaagc aaacaagaat
cacagcaaat atcaacacat 25500aaaaaggggc taaggaagaa aaatctactg agactggaga
ggaacagtta cacaaatagg 25560aaaagaaaca agtgagagtg gtatagaagt caagggtaga
gagaatgtca ggaaggaaac 25620atgatcaaat gtcgaatgcc tcagaggtca aataaagtga
gaactgtaaa gtgcttcctg 25680actttgccag ttaggaggtt cttggtgaca tctgccagaa
aagttttggt ggtagcagcc 25740tgacagaggt agcttgaaga gtggggatgg ggaaagagaa
tgtgaccaag aattgagata 25800gtaaggataa tttcaatttc aggtcttggc tgtgcaagga
agccgagaga catgagtctc 25860taagagggca cgatattgag agggttgtta tctttctgtc
agcggggaaa ccaagagaaa 25920agtttaaaaa ggtcaaaagg gggagaaggg aagacagctt
ccgggtaaca gagaaggttg 25980accaggtcaa tagtaaagga tttcctcaaa ccgaagggag
gacctctagt gaaatgagaa 26040aggaatacac aattgaccca gtttgcaggt gggaaatggg
aagccagttc tgcaaattgg 26100cctttctgtt ctgtgaagtg ccatctgtcg gtgaggagag
attagggtct gcagcgtgaa 26160aatctggacc atactctggg taatcaaggg agaggttatc
ggctaatgac aaattaaagg 26220cttacttttt agctggcaac tgaatcacca taacatttta
tgttaccagt tccaaaattt 26280tggggggaat tcactcaagc ttgggagagg agagatcata
actttaagag tataagaggt 26340ttaaacggtc cactacgaaa taaatagaga aggaaaagtt
atcagctggt aaatatcgta 26400gaaggtagag cggtccaggg actcacaggt ctcactaaag
aaaagtctag cgtaggttca 26460cggcacggag agattttaag gctgcctaag actaaagcca
aatacgaagt ccacatctgc 26520ggtccgcacc ttatctctcc gcgcggcagg cgcgacgagg
gcgagaaact ccctctccag 26580tggtcgcacc acacgacacc agggaagggg cccctctctc
cagaccctca tatctccagg 26640tccaggcccc attttcctcc gctgacagct cagcagcgtg
cgcttccgct ggattcaggc 26700caggaccagc gaagccgcac cttacaccca ccgaggagga
aacaagcctg gccacccgag 26760gctaccccgc taggccgcgg gtagtggggg agggggcgct
gaggcaggag gtcagcaccc 26820gggcgcgggc tcccgcccca cgaaatgcgc gcgctccaag
ccccgccgcc ggagatgcgg 26880ttccggtccg gacgcctgcg cactacggct ctccccgcag
cctctggccc tccttccccc 26940tcccccagtc agggcgcacc cttgcgcctg cgctgtgtgt
gttcctggtc tgcggcagcc 27000atgctgaact cgtatggaga ggcgagtggg ggggacagag
tccaggactg cgggatagga 27060agctggggat atggacaagc agcagcgtta tagcgctctg
ggtttcggga cataggcctg 27120ggccatgcgg cccccttggc cccttggcgc gacccccagg
aacgttcgga aagctggtcc 27180tcgtggctgg gggaaaggcg gggggtgggg gggaagcggg
cacgtgaccc cggtcagcca 27240atctgggtgc tgctgacgtg gccgcgcggc cccgatgctc
tccccacccc cccagcccgt 27300tcgggaaggg aggggctggg ggctacgccc cctcccccag
cacggcttcg ttttctgggg 27360gggggttgac accccggatt acataccccg taccaagccg
agggcaactt tggaggcccc 27420ctggaaggct ttaggatcca ggtgagaagg ggcccttgtg
gggcggagat gtcagtcaag 27480tgcttaacca atggtgggga gtccgggagg gggattcttg
gggttcagga aagaatcctg 27540agagtgggaa gatttgtcct tcaaaccttt tacagccaat
gggagcgtgg agggggggcg 27600agcgggagag ggccatgggg ggggagggga atggccagcc
tcatgcctcc gtacccattg 27660gagggcaaag gggttagggg gcggtgtggc cccccctatt
ccattcgtcc cctgggggta 27720cagcagccgg gagccaggtg agaagggatc catcggcggc
cgagggaggg gtgacctggc 27780ggtgggctga ggagtggtgg ctgtggcccc tacccgtgga
tgtgaatgct ttaggagttg 27840gccacccatg ttgtgaactg aggttgttcc caggcgccaa
cttcctttct ccccagagcc 27900tctggaggga gcattgctgt gcgccctttg tgtccgcggt
aggggagctc cagtcgtcac 27960accgcaggct ggaggttacg cttcgagtcg cttaccgaat
ttgtgtgcat tcacgtggac 28020acggcctgtg gggccttttg cccctgtagg gtctttactg
agcacgtgtc tactccaggc 28080tggggtgctt acaagctgaa agcttgaggt ctgcttagga
acagaaacca ggcccaaggt 28140gggtgctggc agtagggggt ctagacagca tggtctgaga
tgcgagggag gctcgggacc 28200tggaatgatt tcacagctcc caaggtttcg ggtttctcca
gggtggcctc ttccatcgcc 28260tccctcatcc cctcccccag tcctgaacag ttctctcctt
gtgtactgcg ggggagggaa 28320cggaaaggag gaaagagtta ctttcccaaa ttactgagta
gcagtagcct ccctggtgac 28380tcatgtgggg gaagggagga tagaggatcg ggaggcagtg
attttccgga atgcagggaa 28440taaacgagag caatgtctgg ctgccctttt cctaaggcct
agtattttct cagcctccta 28500agtttttact ccatggccgg ccccctgatg ggcctctgtc
ctggcctgca gagccccggt 28560ggagaaaagc agatttggga ggttgggccg ctagggggag
gggaaaaggc ctctgcaaag 28620ttgctgtgtc attgccctcc atgctgcagc cacccaaacg
gggccgcttg tacttttggg 28680ggccagggcc tgatccctgg ctgggggaag gggactctgc
tctcctgacg ctcattttcc 28740cccgccctcc cggggtttgc cctactcggg gggtcagaag
acaggagatt ggcggccatt 28800ttagacgcag taaccgaggt tggagttgaa gggctactgc
agaggaggga gggtggcgtg 28860gttgcagctc aaggacctag gcccttacga gcccttcccg
ggcgaggggg aatcttaccg 28920tatatttgtt cacctacgtt gattattttt cccagatacg
tacacaagtt tgttttctcc 28980ctggtagcga agaaagggga aacgggggag gggacgcccc
accaaagccc aggttttctc 29040gggtggggga gatcctttca ctctcttgta agggggcggg
gacggcccca gagatgctct 29100ggagatcctg actctgggct ctggttgatt cacagagtct
gcacccttat ttagataacc 29160aagttaggag gaagacttaa gagtaagttg gggggagggg
gcgaaactga gctcccaaaa 29220tggctcctgc ccctcctcgg aggcggacgg ccggggggag
gggaggaggg gaggaggggg 29280agggctagtc tgagccgcag ccgccgcctc ctccgctcgc
cctcctccct ggcgctgacc 29340gatggaccag ccgctccgtg gggaggactc cggaccctgg
tgggggggcg ggggggttct 29400ttcgcccccg tggcggaggg cccctgagag gcggatacgg
gtgtgccttt gggggtgatg 29460tggcgtgtgg ggggaaaggt ccgagctcgc ctggaggggg
agggtttttc ccttaagtca 29520tccctcccag gacttgcttt ttctgctctg agccggacgc
cggaatggag tttgaggaag 29580aggtgaggtg tgttgcattg tatagggtag atggatgcgt
ttggagattt taatcccact 29640tttagggttg ccgaggattt ttcgaacgag cagaaatgta
ttggtaactg taggtgtgag 29700tggggaggga ttagaaaggt gcttggacgt gcaaatttgg
gagacgtatt ttagcttttg 29760tggtctttgg gactaaacag tagtaaataa tgttttgctc
gtctttccat cgtttggctt 29820gagggaggga gtggagtatt ataagactct ggcaacactg
ttttagactg tggggcatgg 29880gaacgttaga tcccctcatc gccgttctga agcccgtagc
tgttcgccat agaggagcag 29940gccgcggctt ctaagatggc gtctttttcc tcgtttcaga
ttcttcgctg ctgctgcctt 30000accgccgaga accaccaccc gccaggcgtc ttgcggccac
acccctggcg ggttcaggca 30060ggctacgccc acgcgacccc tcccgtttcc ctgctttggc
caatggagga gctacgaatg 30120gcacgacctg ctcgagcttg gcagtctcca gttgggctgt
gcatggaagc ttgggaagac 30180tttgttggaa ggggaggcgg ggagagagtg ctggaggctc
tggggcgatg gcttccgcac 30240ctcttccaac caccctcttt ccctggagtc ggcggaccac
agctcagcca attggcttgg 30300agatgtggcg ggttgccact tccctgtggg tctctgcggc
actcttctgc ctggtgactg 30360acaccttgga aatgaagttt atgacgtcat cgttgcggct
ggccaataga aaaagctccc 30420gcggagaggt gttccttccc cttcgactca gcttcttcac
ccgcgtgagc gagcgcgcgc 30480gcgcggaggg ggtggggaaa atctcaagca gggtggcgcg
catgagcggc gaagctcctc 30540ctccccgcct atatataaag ggctggcgcg gggctcggcg
gcgccatttc gtgctggagt 30600ggagcagcct ctagaacgag ctggaggatt ctgcctaccg
atacagagcc ttcgagtcgt 30660ccggggccgc cattacaatc cacctccatc cgcttggaaa
tggccttcgt cccggcctat 30720gactggtccc agcgggcagt acagaccccc tagaagcccc
tggagctccc ctttttcggg 30780ccccgcccaa tcctcggagt ctgtccaccc cctctactcc
gccctcaaga ggatttcaaa 30840gatggaggcg gcggctccct aaaccacttt tcgtgttcat
ccgcctccat ccgagatcga 30900aacgggacct cgtcggcccc gtaggggccc gacaagaaga
gggaatccct gcagaccaac 30960agcgggctat attgacgacg gtgtctgaga tcggggaccg
tcttttgaag agtcagtccc 31020tccttagttg cccgcctcag ctgaggccgc cgccattttc
ttgctgtccg ccgtctgcag 31080agcgcgccaa gctgcccgga gctctccgag aggccccaaa
gagactgctt tcgtgccggc 31140caggcagggg gtttgtcgcc tggaggccca agaggaacgg
cctcccccca acttagcggg 31200ttatgctgga ccgggcggtg aggggaaccg aggccacccg
gactttccgc ggctgagggc 31260agcgccggtt ccttgcggtc aagatgctgc aaaacgtgac
tccccacaat aagtacgttt 31320ccgcgagccg cgtgtgggaa ggggatgttg cagggcggcg
gcacaggggt gtggggcgcc 31380gtgttgggag tactgagcgg ccccggcgcg ctgctgttgc
ggcgcagctg tcgactcggt 31440cgcgcggagg gaattgagcg acggttttgg aacggtggtg
gcggctcggc tactgctcgt 31500ggaggggaat acaggttgtc aatttatacg ctattaatgc
cgccgtggcc cagtcttaac 31560cgagtcaggc agagctagtt tgacggtgga gtggagtgag
gttgaacagc aggtttggcg 31620tttggtgggt ctggtatcta gcggcggtct gttagccttt
taggggggat tcacggacac 31680ctctagcgcc ctgtagggtt gccatggtga cggagcgctt
aagggactgg caacggggat 31740tcccagagaa gggtaaaggg atcactctcc cgtgtgtgca
ggttcctaat gcccagggca 31800tgtcattaaa tcttttgctt tctttgggtg ggtgggttgt
gtgtggtgtt tgttggtgca 31860gggattgttt tttcctaaca ttaaaagttt gattcagggc
aggagggtag agctaaggtt 31920cctagttcag ctctgcgatg taaacaatga gattcccata
tgatgtttta attcttaggt 31980ggtaggaaag actgatcgga ggagcaccag agggactgta
aatgaaccac tgttagcgtt 32040tggtgtccgg agttggtgct acagggggaa ctggtagtgg
aatcgtgttg tgtagtgggt 32100gggtggaagg gggctatcac ttggtgacct tgactgtttt
gtacggcttt ttgacttcct 32160tggagtgagg agactctgat ttggtgcgaa taattttgag
ggcctggaag ttacgggctg 32220tgaagtctga caaattcttc cttgtctgaa tttgttttta
agttgatatg gttcttcctc 32280tgggtttcta gtctatgttc tgttgtggcg tgaactaccc
agaccttgtg gaagatggtg 32340ctctctcttc tatctaggtg gattattctg tgtcttatca
gcattttatg gaatttttta 32400tagccataat ttgttctttt cctccttacc ggcgctcaac
caccatggca accaccaaac 32460ccctagtgag gaggaagctt ggggtttgag tttcttaact
ccacccattt tgcttaatcc 32520ccatccccat agggctgtag ttctgagatg tcgtgccttg
tcagaaacaa tttgggagtt 32580ttttaaaata tgaaaaagaa cagatagagc ctatcagact
taagaaggtg ggatctagat 32640agtatactaa aaatattaat aaaaggaagg cggggccagc
aataaaagct ccacagattg 32700tttggatatt gtttctgctt aagaagcact tggcataagc
ttaaccacct cactagggcc 32760agcacctgga ttcatcagac tattgtgcag atgcactttt
tcctcatttg gacgatattg 32820ccctaatttt gttcccatct ttacaggctc cctggggaag
ggaatgcagg gttgctgggg 32880ctgggcccag aagcagcagc accagggaaa aggattcgaa
aaccctctct cttgtatgag 32940ggctttgaga gccccacaat ggcttcggtg cctgctttgc
aacttacccc tgccaaccca 33000ccacccccgg aggtgtccaa tcccaaaaag ccaggacgag
ttaccaacca gctgcaatac 33060ctacacaagg tagtgatgaa ggctctgtgg aaacatcagt
tcgcatggcc attccggcag 33120cctgtggatg ctgtcaaact gggtctaccg gtgagtagag
acattggagc cggggaggtg 33180tgggatgagc aagaatgcgt gtgaatgggg gtggtctgcc
tagtgtagat gctgcggccc 33240ctagggagtt cccatttctc ccctgtaggg cagttagcta
ccagatttct gggtatcttg 33300gtcctttgtg attgatccga ccgcttgctg taactatctt
ggcatctttc cttgtgccct 33360ccatgtgtcc ttccttaact tttgtgccct ggctccattt
tacagattcc cacctcgggt 33420tgggagagga ccacggtggc caaaattctt agcttcttcc
tttccctcat gcagcccatg 33480gatagccagc cccagaggta atgtcacagg atgggaagtt
tccagagtgg gtgggaggtg 33540ggtggttaga gaaaggcagc aggggcctcc ctgtggatgt
caagaatctt ttttatttat 33600ttatttattt tgtcccacag tttaattggg gccgcagttt
aactgttcct ttgatgcata 33660gggggtgtgt gtgtgtgtgt gtgtgtgtgt gagagtcggg
gatcggtagt ctccctataa 33720gcatttattt ttctgtggtt ctgacctaac atttctttat
ttaggattat cacaaaatta 33780taaaacagcc tatggacatg ggtactatta agaggagact
tgaaaacaat tattattggg 33840ctgcttcaga gtgtatgcaa gattttaata ccatgttcac
caactgttac atttacaaca 33900aggtgagttt ttctgtgtgt tcatttagta ggtggggaga
aacagtaatt tctattattg 33960ctggatatgt tgtctacata aagtttaaat cctttgctac
tgaaggtgtt atccaggtag 34020ggtagtcgga gtcttaaaaa cctgactcta gatggtacta
ttgaacacag tgatgtgact 34080tcagagctct agttgaaggt tatttagaac acttcatact
tgggggtggt ggtcctgttt 34140cttagaaatc accagagacc tgagtagacc agggatctgt
tttcttgtca gctctcaagt 34200tttttcttct ttcgaatttt gggagacagt taggagaaag
tggaaattag tagtggcctg 34260gagtagaaat tttctttaag atttgatgac aagatgactg
gtgggggtat ggtaatggcc 34320tagggcctga atgcctctga gaaagatggt gtgtatctat
cttctgttgg cattttttaa 34380ctttctttat tgctgtctgt gttctcatag cccactgatg
atattgtcct aatggcacaa 34440acgctggaaa agatattcct acagaaggtt gcatcaatgc
cacaagaaga acaagagctg 34500gtagtgacca tccctaagaa cagccacaag aagggggcca
agttggcagg taggaagagt 34560gggagttttg caaatggaca acttaaagat ggggaagaga
atcaaactac acttttttcc 34620ttttttctag cgctccaggg cagtgttacc agtgcccatc
aggtgcctgc cgtctcttct 34680gtgtcacaca cagccctgta tactcctcca cctgagatac
ctaccactgt cttcaacatt 34740ccccacccat cagtcatttc ctctccactt ctcaagtcct
tgcactctgc tggacccccg 34800ctccttgctg ttactgcagc tcctccagcc cagccccttg
ccaaggtatg atctgtggat 34860ttcctctggg cagcagggag gcaagggtct taagtaaagt
gggcttggag tgacaggttc 34920cctatcttgt ttctttctgc agaaaaaagg cgtaaagcgg
aaagcagata ctaccacccc 34980tacacctaca gccatcttgg ctcctggttc tccagctagc
cctcctggga gtcttgagcc 35040taaggcagca cggcttcccc ctatgcgtag agagagtggt
cgccccatca agcccccacg 35100caaagacttg cctgactctc agcaacaaca ccagagctct
aagaaaggaa agctttcaga 35160acagttaaaa cattgcaatg gcattttgaa ggagttactc
tctaagaagc atgctgccta 35220tgcttggcct ttctataaac cagtggatgc ttctgcactt
ggcctgcatg actaccatga 35280catcattaag caccccatgg acctcagcac tgtcaaggta
cccactgcat ggggcagatg 35340ggatgctcaa gcagtgatgg gagcctaggt gcaaaacaat
aagtctcctt atgtgggcac 35400acagcagtct ttggttcttg gcattttact tttataaaat
aatagtggaa cagaaggtct 35460ggtgttttga gaatttgtat ttcttggagt ttgaaacagt
agggtggggt ttctttgtct 35520tgagaaaaat actgtctata attaagtact aatgtggcag
tgttgggtta aggaagttat 35580agggtggaaa gacaggcata ggccacctct ctgtcactta
gaaatgattt ctttttctag 35640acataaatat ttcttcaacc cacccaaatt cctttgactt
caaacttgaa ccccagggca 35700cagatcctta aggtcatccc cactgtgctc tcaagagagg
gctcttcttg tggtgtctgg 35760ggttggcagg gaaaggtgag tcttcctgcc tgtgcagctt
ctgatgctgc ctccttctgc 35820agcggaagat ggagaaccgt gattaccggg atgcacagga
gtttgctgct gatgtacggc 35880ttatgttctc caactgctat aagtacaatc ccccagatca
cgatgttgtg gcaatggcac 35940gaaagctaca ggtgagtgga aaggttggag tttgaaaaat
aaatggtatg gggagttatt 36000ttgtcatgtg tgctgcatag cctcaacgtg agggtctcac
tgttctgtac agttgtaaat 36060tggagctata tcacttggtg gctgggtatg tagggcactg
tttatcagca tagttttgag 36120tttgtgcctc tttctaggat gtatttgagt tccgttatgc
caagatgcca gatgaaccac 36180tagaaccagg gcctttacca gtctctactg ccatgccccc
tggcttggcc 36230
38 2732 DNA Homo sapiens 38
ctttcgcctc agtctcgagc
tctcgctggc cttcgggtgt acgtgctccg ggatcttcag 60cacccgcggc cgccatcgcc
gtcgcttggc ttcttctgga ctcatctgcg ccacttgtcc 120gcttcacact ccgccgccat
catggtgaag ctcgcgaagg caggtaaaaa tcaaggtgac 180cccaagaaaa tggctcctcc
tccaaaggag gtagaagaag atagtgaaga tgaggaaatg 240tcagaagatg aagaagatga
tagcagtgga gaagaggtcg tcatacctca gaagaaaggc 300aagaaggctg ctgcaacctc
agcaaagaag gtggtcgttt ccccaacaaa aaaggttgca 360gttgccacac cagccaagaa
agcagctgtc actccaggca aaaaggcagc agcaacacct 420gccaagaaga cagttacacc
agccaaagca gttaccacac ctggcaagaa gggagccaca 480ccaggcaaag cattggtagc
aactcctggt aagaagggtg ctgccatccc agccaagggg 540gcaaagaatg gcaagaatgc
caagaaggaa gacagtgatg aagaggagga tgatgacagt 600gaggaggatg aggaggatga
cgaggacgag gatgaggatg aagatgaaat tgaaccagca 660gcgatgaaag cagcagctgc
tgcccctgcc tcagaggatg aggacgatga ggatgacgaa 720gatgatgagg atgacgatga
cgatgaggaa gatgactctg aagaagaagc tatggagact 780acaccagcca aaggaaagaa
agctgcaaaa gttgttcctg tgaaagccaa gaacgtggct 840gaggatgaag atgaagaaga
ggatgatgag gacgaggatg acgacgacga cgaagatgat 900gaagatgatg atgatgaaga
tgatgaggag gaggaagaag aggaggagga agagcctgtc 960aaagaagcac ctggaaaacg
aaagaaggaa atggccaaac agaaagcagc tcctgaagcc 1020aagaaacaga aagtggaagg
cacagaaccg actacggctt tcaatctctt tgttggaaac 1080ctaaacttta acaaatctgc
tcctgaatta aaaactggta tcagcgatgt ttttgctaaa 1140aatgatcttg ctgttgtgga
tgtcagaatt ggtatgacta ggaaatttgg ttatgtggat 1200tttgaatctg ctgaagacct
ggagaaagcg ttggaactca ctggtttgaa agtctttggc 1260aatgaaatta aactagagaa
accaaaagga aaagacagta agaaagagcg agatgcgaga 1320acacttttgg ctaaaaatct
cccttacaaa gtcactcagg atgaattgaa agaagtgttt 1380gaagatgctg cggagatcag
attagtcagc aaggatggga aaagtaaagg gattgcttat 1440attgaattta agacagaagc
tgatgcagag aaaacctttg aagaaaagca gggaacagag 1500atcgatgggc gatctatttc
cctgtactat actggagaga aaggtcaaaa tcaagactat 1560agaggtggaa agaatagcac
ttggagtggt gaatcaaaaa ctctggtttt aagcaacctc 1620tcctacagtg caacagaaga
aactcttcag gaagtatttg agaaagcaac ttttatcaaa 1680gtaccccaga accaaaatgg
caaatctaaa gggtatgcat ttatagagtt tgcttcattc 1740gaagacgcta aagaagcttt
aaattcctgt aataaaaggg aaattgaggg cagagcaatc 1800aggctggagt tgcaaggacc
caggggatca cctaatgcca gaagccagcc atccaaaact 1860ctgtttgtca aaggcctgtc
tgaggatacc actgaagaga cattaaagga gtcatttgac 1920ggctccgttc gggcaaggat
agttactgac cgggaaactg ggtcctccaa agggtttggt 1980tttgtagact tcaacagtga
ggaggatgcc aaagctgcca aggaggccat ggaagacggt 2040gaaattgatg gaaataaagt
taccttggac tgggccaaac ctaagggtga aggtggcttc 2100gggggtcgtg gtggaggcag
aggcggcttt ggaggacgag gtggtggtag aggaggccga 2160ggaggatttg gtggcagagg
ccggggaggc tttggagggc gaggaggctt ccgaggaggc 2220agaggaggag gaggtgacca
caagccacaa ggaaagaaga cgaagtttga atagcttctg 2280tccctctgct ttcccttttc
catttgaaag aaaggactct ggggttttta ctgttacctg 2340atcaatgaca gagccttctg
aggacattcc aagacagtat acagtcctgt ggtctccttg 2400gaaatccgtc tagttaacat
ttcaagggca ataccgtgtt ggttttgact ggatattcat 2460ataaactttt taaagagttg
agtgatagag ctaaccctta tctgtaagtt ttgaatttat 2520attgtttcat cccatgtaca
aaaccatttt ttcctacaaa tagtttgggt tttgttgttg 2580tttctttttt ttgttttgtt
tttgtttttt ttttttttgc gttcgtgggg ttgtaaaaga 2640aaagaaagca gaatgtttta
tcatggtttt tgcttcagcg gctttaggac aaattaaaag 2700tcaactctgg tgccagaaaa
aaaaaaaaaa aa 2732
39 4768 DNA Homo sapiens 39
ggcggcttgc gcctgcgcgg cgcggcgctg cggagaccgt tggttcattt gcatgtcccc
60gcctcgcgcg gcggcggcgg cgggtgagga gcctgaggcg gcggcggggg tggctccgcg
120cgcggtggtc tcgggggcaa aataacatgg cagccagacg aattacacag gagacttttg
180atgctgtatt acaagaaaaa gccaaacgat atcacatgga tgccagtggt gaggctgtaa
240gcgaaactct tcagtttaaa gctcaagatc tcttaagggc agtcccaaga tccagagcag
300agatgtatga tgacgtccac agcgatggca gatactccct cagtggatct gtagctcact
360ctagagatgc cggaagagaa ggcctgagaa gtgacgtatt tccagggcct tccttcagat
420caagcaaccc ttccatcagt gatgacagct actttcgcaa agaatgtggc cgggatctgg
480aattttctca ctctgattct cgggaccagg tcattggcca ccggaaattg gggcatttcc
540gttctcagga ctggaaattt gcgctccgtg gttcttggga acaagacttt ggccatccag
600tttctcaaga gtcctcttgg tcacaggagt atagttttgg tccctctgca gttttggggg
660actttggatc ttccaggctg attgagaaag agtgtttgga gaaggagagt cgggattatg
720acgtggacca tcctggggag gctgactctg tgcttagggg cggcagtcaa gtccaggcca
780gaggtcgagc tctaaacatc gttgaccagg aaggttccct cctaggaaag ggggagactc
840agggcctgct cacagctaag gggggtgttg ggaaacttgt cacattgaga aatgtgagca
900caaaaaaaat acccaccgtg aatcgtatta ctcccaaaac tcagggcact aaccaaatcc
960agaaaaacac tccaagtcct gatgtgaccc tggggacaaa cccagggaca gaagatatcc
1020agttccccat tcagaagatc cctctggggc tggatctgaa gaatcttcgg ctccccagaa
1080gaaagatgag ctttgacatc atagataagt ctgatgtttt ttcaagattt gggatagaaa
1140taatcaaatg ggcaggattc cacaccataa aagatgatat taaattttcc caacttttcc
1200agactctctt tgaacttgaa acagaaacct gtgctaaaat gcttgcctca ttcaaatgtt
1260ccttaaaacc agagcacaga gatttttgct tttttactat caaattttta aagcactctg
1320ctttgaaaac acccagagtt gataatgagt ttttaaacat gcttttagac aaaggtgctg
1380tgaagaccaa aaattgcttt tttgaaatca taaagccttt tgacaagtac ataatgagac
1440ttcaagaccg gcttctgaag agtgtcacac ctttgcttat ggcctgcaat gcctacgagc
1500taagtgtcaa gatgaagacc ctcagtaacc ccctggactt ggctcttgcc ctagaaacca
1560ccaactctct ctgccggaag tctttggccc ttttgggaca gacattttcc ttggcctctt
1620ctttccggca ggagaaaatc ttagaagctg tcggcctgca agatatagct ccctcacctg
1680ctgcgtttcc aaacttcgaa gactccactt tgtttgggcg agagtacata gaccacctga
1740aggcctggct agtcagcagc ggatgtcccc tccaggttaa gaaagccgaa ccagagccga
1800tgcgagagga ggagaaaatg attcctccta cgaaacctga aattcaggcc aaggctccaa
1860gtagtctgag tgatgctgtc ccccagcgag cagatcacag ggtagtgggc accatcgacc
1920agcttgtgaa acgtgtcatc gaaggcagcc tgtctcccaa agagagaact cttctcaaag
1980aggaccctgc ttactggttt ttgtctgatg aaaatagtct ggagtataaa tattacaagc
2040tgaagttggc agaaatgcag cggatgagcg agaacttgcg aggagccgac cagaagccga
2100cctcagcaga ctgtgcagtg agggccatgc tgtactcccg ggctgtccgc aacctcaaga
2160agaaactcct tccgtggcag cggcgggggc tcctccgtgc tcaagggctc cggggctgga
2220aggcgaggag agcgaccacc gggacccaga ccctcctatc ctcaggcacc aggctgaaac
2280accacggccg gcaggctcca ggcctctcac aggcaaaacc atccctgcca gacagaaatg
2340atgctgccaa ggactgcccg ccagacccag ttggaccttc tcctcaggac cccagcttag
2400aagcctcagg cccatccccc aagccagcag gagtggacat ctctgaagca cctcagacct
2460cttctccctg cccatctgct gacattgaca tgaagacaat ggagactgca gagaaactgg
2520ctagatttgt tgctcaggtg ggaccagaga tcgaacaatt cagcatagaa aacagcaccg
2580ataaccctga cctgtggttt ctacatgacc aaaatagttc tgctttcaaa ttctatcgaa
2640agaaagtgtt tgaactatgt ccatcaattt gtttcacgtc atctccgcac aaccttcaca
2700ctggtggtgg tgacaccacg ggttctcagg agagccccgt ggacctcatg gaaggggaag
2760cagagtttga agacgagccc cctccgcggg aggctgagct ggagagccca gaggtgatgc
2820ctgaggagga ggacgaggac gatgaggatg ggggagagga ggcccccgct cctggagggg
2880cgggcaagtc tgagggcagc acccctgccg acggccttcc cggcgaggct gccgaggacg
2940acctggctgg agcacctgcc ttgtcacagg cctcctcagg tacctgcttc cctcggaaga
3000ggatcagcag caagtcattg aaggttggca tgattccagc tcccaagaga gtgtgtctca
3060tccaggagcc aaaagtccat gaaccagttc gaattgccta tgacaggcct cggggtcgtc
3120ccatgtccaa aaagaagaaa cccaaggact tggacttcgc ccagcagaag ctgaccgata
3180agaacctggg cttccagatg ctgcagaaga tgggctggaa ggagggccat ggcctgggct
3240ccctcggaaa gggcatcagg gagccggtca gcgtgggaac cccctcggaa ggggaagggt
3300tgggtgctga cgggcaggag cacaaagaag acacattcga tgtgttccga cagaggatga
3360tgcagatgta cagacacaag cgggccaaca aatagatcaa aaccactgat gtgaaagata
3420agccttgaag cagcaattgc ccttaaaaca tcatccctgc cctggatcgg cctggagcca
3480gtgcccaagt acggtttggt gtgtacatga aaacaaacgt ctctgcagtc tctggggcgg
3540aggtttcgct ggcttttctt tctctcaaag aaaaaaacat gcaccatttt caatgtgctt
3600ttgcctctcc tctctgttca catgctttta gcagcaagtc ccctccaaat ctgtcttggt
3660tccccttcag aaggtggcgc tgcccccgaa aggcacctca gcctgtgagt gctgaggaac
3720cagctcctct ggctgatttt ccagttggac tggccattgc tctccagaag tgctctgtta
3780gcaaacgtga tgtggaaacg atcacagatg gtgttttctc gttgttcgcc agaatttata
3840cgggggagac aaattcccgg taattaccaa gtctgcactc gggtaccaaa gctctgaagc
3900tctctgaaca gttgccatac ttgagttgat gaatgtgtta ttcatggtgt ctcatctcat
3960caatgcatct tgagagactt aatgaaattt tagcaacagt atagaatagc tctatcgggt
4020ggggagtaat cattaaacag atgaaatcgg ccccagattt acatgtctct ttagaatcca
4080cagtgtaagc aaactacagt tacaaaggga tgggggttgt aaaccctctg agactctgca
4140cttttcgcac gtatggcatc gtcaagtgct gtcttattac agcctttgta aggagaggca
4200ggctcctcct ggggtgggct ctgcagctgc tctatttcca ggcatgtgat cgcccccgct
4260ctccagattc cccagcactc tgctgcgtgt aactccactc aattctccac tcatccttcc
4320ttgtgaagca ggatcgttga agttttaagt atgggcaaaa atctggaaaa cttaggatcc
4380ctctgacacc ccaggattag gggacacagc agtggctagg gcatcagcca cagaactgag
4440cgggaaatgc cacttgtatt ggctgtaaag aaatcctggc tttgggccag gcacagtggc
4500tcaagcctgt aatcccagca ctttaggagg ttgaggcgga tggatcacct gaggtcagga
4560gtttgagacc agcctggcca acatggtgta accccgtctc tactaaaaat acaaaaaaat
4620tagccaggcg tggtagcggg cacctgtaat cccagctact caggaggctg aggcaggaga
4680atcacttgaa ccggggaggc agaggttgca gtgagctgag atcatgccac tccactccag
cctgggcgac agagcaagac tccatctc 4768
40 395 DNA Homo sapiens 40
ctgcgagaat cgaggcactc gctggcgtac ccatgtatcg
aaatgagttc acggcctggt 60accggcggat gtcggtggtc tacgggatcg gcacctggtc
tgtgttgggc tcactgcttt 120actatagccg gacaatggcg aagtcgtcag tagaccaaaa
ggatggctca gcaagtgaag 180tacccagtga actctctgaa cgcccaaaag gattttatgt
ggaaacagtt gtcacatata 240aagaagattt tgttccaaat acagaaaaga tcctcaacta
ttggaaatca tggactggtg 300gccctggtac agaaccatga ctggctgctg aattctgaaa
accaggactt ggttcaacat 360ttaaatttga tagttgccct gattcccatt ttggt
39541137091DNAHomo sapiens 41cgcggcgctg ggtcggtggc
ggaggctgag gagaaggagg agcgggccgt ggaggcttcg 60ccgcctaggt aagggcccgg
gactggaggg gaggcgtgcc agagcctgcc agggaatagc 120cagcagacag gcccgctcta
gacatcgcag gcccgcgcag cctgaaagct gtggcttcag 180tgtcgcgggg cggctgcggc
ctcgctcggg aagaagacca agcaacggtg agatgaggga 240ggcgccgccc gtggcaggaa
cgccccggaa ccgtcgcggg cctggggcgg ggcccggcgc 300ggcagtagat taccggtccc
gccgcggagc ggccagctgt gaggctgggg ccggcgcgtg 360gttgcggctc tgtgctccta
ctcttcggag ctgtaagcgg gctgttcttg cggttttcct 420gtttcagatc caattctgtg
gcatcactag gaagggagct cttgtgctta gcacgtagcc 480tcgtcctcag acttggacag
acacaaggga ggctccgctg gaccggaggg cacaagagct 540ccgagcccgg tcgtcggggc
ggtagaacct ggaagcggga gagtggtctg gtgggttctg 600cgcccgttag gcaatgaagg
agaaggatgt tttatcgtat tcacgcttta gattccatta 660gcggtgtaaa tagatgtttt
tctctttatt ttagaattga cgttaggcga atgggttcaa 720ctttgggaat gccttttttt
tttttttttt tttgaaggaa gggccctgtt tcgtagggta 780cataaaccgt gagcgtaatt
gtattttttg catattccag gtttgcttgt gaaggtcaga 840gtagccggat ttaagtgaag
gagttcagta gacatgcaga catggtcacc tggttcattt 900tctgaaccct ggattgtgcc
ctcggcttgc tagtttccac cttcctattg agaaatgcca 960ccagcgtgaa tgatttaaat
atgtcaccat tactgaattt gtgaggtctc taacgagagg 1020tgtcaagagc tggtgcgtga
tggtaggact ggcagtgaag aaagtaacta aataatatgt 1080taccattttg gtgaaacaca
aaagttgaat ttgaaccttg tctcagaaac tagcatctaa 1140ctagatacct aacctgcagg
acaggtccca ggtctctctg gatagttgta gcacctttcc 1200ttatagaatt ctattaccag
gccgagcctg gtggctcaca cctgtaatcc cagcactttg 1260ggaggctgag gtggggagtt
cgagaccagc ctgactaaca tggagaaacc gcgtctctac 1320taaaagtaca aaattagccg
ggcatggtgg cacatgcctg taatcccagc tacttgggag 1380gctgaggcag gagaatcgct
tgaacctggg aggcggaggt tgcggtgagc tgagattgct 1440ccattgcact ccagcctggg
ccacaagagt gaaactctgt ctcaaaaaaa aaaaaaaaaa 1500aaaaaaaaaa aaacccgcaa
aactcaacaa aaaccaacat agtagaggca gcgtttcgcc 1560ttatgcccag ctaatttttt
gtattttttt agtagaggcg gagtttcgct atgttggcca 1620ggctggtctt gaactactga
cctcaggtga tccacctgcc ttggcctccc aaagtgctgg 1680gattacaggc gtgagccacc
gtgcccggcc ctgttatagt atttctaaaa caaattgtga 1740gcctgggcaa catcgcaaaa
ccctgtctct acaaaaaata caaaaaaaaa aaattagcca 1800ggcgtggtgg catgctcctg
ttagccctaa ctactcagga ggctgagatg gaaaaatcgc 1860ttgagccggg gaggtagagg
ttgtagtaag gggagatagt gccactgcac tccaacctgg 1920gccacagaac aagactgtct
caaaaaaaaa aaaatcaatt aaataaattg tggtaaatat 1980atatttttat gtatgtttat
gtatatttta acaaaatttg ctctttaaac cattgttaag 2040tatacaattc agccaggcac
ggtggctcac gcctgtaatc ccagcacttt gggacgccga 2100ggtgggcgga tcacgaggtc
aggagatcga gaccatcctg gctaacacgg tgaaacccca 2160tctctactaa aaatacaaaa
aatgagccgt gcgtggtggt gggcgcctgt agtcccaggt 2220actcaggagg ctgaggcagg
agaatggtgt gaacccgaga ggcggagctt gcagtgagcc 2280gagattgcgc cactgcactc
cagcctgggc aacagagcga gactccgaga ctccatctca 2340aaaaaaaaaa aaaaaagtat
acaattcaat ggtattaatt acattcacaa tgtagtacaa 2400gcaataccac tatttctgaa
actttagtat ctcaaacaga aactctgtaa ccagggaggg 2460catggtggct cacgcctgta
atcccagcac tttgggaggt caacgtgggc agatcacttg 2520agttcaggag ttcaagaaca
gcctggccaa catggtgaaa ccccgtatct actaaaaata 2580caaaaattag ccatgcatgg
tggcatgcat ctgtaacacc agctactaag gaggcagagg 2640ttgcagtgag ctgaggtcat
gccattgcac ttcagcctgg gctgcacagc cagactccat 2700ctcaaaaaaa aagaaaaaaa
gaaactgtaa ccattaaaca agttaacttc ccatttcctc 2760ctcttaatct ctaatctact
ttgtgtctgt ctgtgagtgt gcttgttcta ggtactgcaa 2820atactaaatg gaatcataca
gtattgtcct tttttgtgtc tggtttattt cacttagtgt 2880aatggtttca aggttgatcc
atgttgtact gtgtatcaga atttcattcc tttttaaggc 2940ttaatccgtt gtgtgtgtac
actacatttt gtttattaat tcatttgtag cagacacttg 3000ggttgcttct gccttttgac
tattgtaaat aatgatgctg tgatcattgg tgtacaaata 3060tctctttgag tccctgcttt
gaattctttt gggtatatac ccagaaggga aattgctata 3120tggtaattat tattattatt
aattatttta tttatttttt tttgagacag ggtcttgctc 3180tgttgcccag gctggagttc
agtggcacag tcatggctca ctgcagcctc gaactcgagc 3240tcaagcagtc atcccgcctc
agcttcctga gtagctggga ctacaggcat gggctgccac 3300aaccagctaa tttttttgtt
taatttttat tttttgtgat gaagtcttgc cttgttgtct 3360agggtggtct cgaactcctg
agttcaagtg atcctcctgt cttggcctcc cgaagtgctg 3420gcattacagg catgagccac
cacatctggc ccataatttt tttattttaa tttttttgtg 3480gagacagggt ctccctatgt
tgcctatgct ggtctcaaac tcctggcctc aagccatttt 3540ccctccttgg cctctcaagg
tactggtatt acaggcatga gccactgcac ccagttgata 3600cttggttatt atatgtttag
ctttttgagg acccaccata ctgttttcct caatggctgc 3660atcgttttac attcccacca
gtaatacaca agggttccaa ttttcccaca tcctccccaa 3720cacttatttt ctgtttttcc
ttttttgata aatttgtgtg tgtatatgtg gttttttatt 3780tgtgtgtttt gatgatagcc
accctaatgg gtgtgaagtg gtatctcgtt gtgttttctt 3840ggtttttgct tgtttgtacc
tttttaccca tttttaagtg tgctacttag tagtagtaag 3900tacattcttc tttttgtgca
accataataa aaatccagct tcagaacttt tttcatcttc 3960ccaaactgag tttctgtacc
cattgaatag taactcccta ttctctcctc ccctgacaat 4020caccattctg ctttctgtct
ctatgaattt gactactcta ggtatctcat gtaagtggaa 4080tcatataata tttgatcttt
tgtgtatggc ttattttact tagcataata tcttcaggat 4140tcatccatct tgtagtatgt
atcagaattt tattcctttt taaggctgag taatattcca 4200ttatacatat ataccacatt
ttgtttatcc atttatctat tgatggacat ttgggttgtt 4260tccacctttt tgctcttgtg
aatataatgg tgctatgaat atcagtgtac aaatatcttt 4320tttttttttt tttttgtgag
acagtatcgc tcttgtcacc caggctggag tgcagtggcg 4380cgaccttggc tcactgcaac
ctctgcctcc tgggttcaag gcattctcct gcctcagcct 4440cccgagtagc tgggattaca
gatgtgcgcc accatgccta gctaattttt ttatttttag 4500tagagaagga gtttcgccat
gttgggcagg ctggtcttga acttctgacc tcaggtgatc 4560aacctgcctc ggcctcccaa
agtggtggaa ttacaggtgt cagccaccgc gcccagccac 4620aaatatcaag tctttacttt
catttctttt gggaatatat atactcagaa atggaatcga 4680caattgacag agcaaatggt
aattctatgt gtaatttttt tttaaatttt tttttttgag 4740acggattctt gctctgtcgc
ccaggctgga gtgcagtggc gtgatctcgg ctcactccaa 4800gctccgcctc ctgggttctt
gccattctcc tgcctcagcc tcccgagtag ctgggactac 4860agcatccgcc accacgcccg
gctaattttt tgtattttta gtagagacgg ggtttcaccg 4920tgttagccag gatagtctcc
atctcctgac ctcatgatct gcccgccttg gcctcccaaa 4980gtgctgggat tacaggcgtg
agccaccgcg cccggccaat tttttttttt ttttttttta 5040gacagggtct tgctctgttg
tccaggctgg agtgcagtgg tgcagtcaca gttctctgca 5100gccctgacct tctcagttca
agctatcctc tcacctcacc ctcttaagta gctgagacta 5160caggtgcatg ccaccatgcc
taactaattt ttttattttt ttgtagctgt gggatttcgc 5220taggttgccc aggctttatg
tatcattttt tgaggaactg ccttactgtt ttccacactg 5280gttgcaccat tttacattct
gttagcagtg tacaaaggtt ttgttataga ctgaattgtg 5340tccccctgaa aattcacgtg
ttgaagccct aagccccagt gtgactgtat ttggaaatag 5400gacctttaca gagaaattaa
aaagttagaa gatatcataa ggggctgggc gcgggtggct 5460catgcctgta atcccagcac
tttgggaggc tgaggcaggc ggatcacaag gtcaggagat 5520cgagaccatc ctggccaaca
cggtgaaacc ccgtctctac taaaaataca aaacattaac 5580cgggcgtggc ggcatgcacc
tgtagtccca gctgctgggg aggctggggc aggagaatgg 5640cgtgaacccg ggaggcacag
cttgcagtga gccaaaatcg cgccactgca ctccagcctg 5700ggcgacagag cgagactcca
tctcaaaaaa aaaaaaaaaa aaaaaaaaag aagatataag 5760gatgagacct taatccagca
ggactgctgt cttcgtaaga aaaggactgg ataccaggag 5820tgcgtgtaca gagagaaaaa
gctgcatgag gacagaggta gaagggggct gcctgcaagc 5880caaggagaga gacctcacct
aaaacaaacc ttgctgacac cttgatcttg gactcccagc 5940ctccagagct gtgagaataa
tttctgtggc ttaagccttc cactccatgg tattttgtta 6000tggcagtcct agcatactgt
gtaatatagg tttcaattca gtttctctgc atcctccaca 6060tcctggccaa cacttgttat
tttctttctt tttttttttt ggagacagat tctcgctctg 6120tcacgcaggc tggagtgcag
tggcacaatc ttggctcact gcaacctcca cctcccgggt 6180tcaagcgatt ctcctgcctc
agcctcccga gtaactggga ttacaggcag ccgccaccgt 6240gcccagctaa tttttgcatt
ttagttgaga tggtgtttct ccatgttggc caggctggtc 6300ttgaactcct gacgtcaggt
gacccgccag ccttggcctc ccaaagtgtt gggattataa 6360gcatgagcca ccgcgcctgg
cattttcttt ttttttgaga cagagtctca ctctgttgcc 6420caggccggag tgaagtggca
tgatctcggc tcactgcaac ctctgcctcc cagattgaag 6480caattcttgt gcctcagcct
cccgggtagc tgggattaca ggcgtgtgcc accacgcctg 6540gctaattttt gtattttagt
agagacaggg tttcaccata ttagccaggc tggtcttgaa 6600ctcctgacct caagtgatct
gtccaccttg gcctcccaag gtgctgggat tacaggtgtg 6660agccatctca cccggcctat
tttctgtttc gttttttttt ttttcattag tagctatcct 6720agtggatgtg aagtggtatc
ttattgtggt ttctgatttg catttccctg atgataagtg 6780atgttgagcg tctgttcatg
ttcttattgg ctatttgcat attctctttt ggagaagtat 6840ctattcatgt cttttgttga
ccattttaaa atggggtttt tcatcctggc taacacggtg 6900aaaccctgtc tctactaaaa
atacaaaaaa aaaaaaaaaa aaaaattagc cgggcgcagt 6960ggcaggcgcc tgtagtccca
gctactcggg aggctgaggc agaaggatgg tgtgaacctg 7020ggaggcagag ctcgcagtga
gcagagattg agccactgca ctccagcctg ggcgacagag 7080cgagactccg tctcaaaaaa
aaaaaggggg gggggggggg ttttgagctg ggtgtgcagg 7140tgcacacctg tattcccagc
tgctcaggag gctgaggtag gtggatctct tgagcccagg 7200tgtttgaggg tgcagtgagc
tgtgattgca ccactggact ctaccctggg tgacagagtg 7260gcccagtctc taaaaataaa
ataaaattag gtttttgtct gtgttgttga gttttaggag 7320tcctttatac actctagata
ttaattcctt gtcagatatt tgacttacaa atattttctc 7380tctgtggttg tctttatact
ctgttgatag tgtcttttga tgcacagagg ttttcatttt 7440gatgaagtcc aatttatctt
ctttttttaa aatctgtgcc tcatctgcaa atattaccaa 7500tcgaaagtca tgaaattttt
cccctaagat tttatagttt tagcgcttac gtttgggtct 7560ttgatccaat ttgagttaat
tttttatata tttttgttgt gtaagagtcc cactttattg 7620ttatgcatgt ggatattcag
ttttcggagt accattttcc atttgggaaa aagattgtac 7680tttccccatt ggatggtctt
gacacctttg ttgaaaatca gttgactaaa gttcaagact 7740agccttgcca acatggcaat
atcccgtctg tactaaaaat accacaatta gctgggcatg 7800gtggtgcctg gctgtaatcc
cagctactcg ggaggctgag gcaggagaat cgcttgaact 7860gggaggtgga ggctgcagtg
agctgagatt gcgccactgc cctccagcct gggcgacaga 7920gcgagacatg agaatctgtc
ttaaaaaaaa aagaaaattg accatatatg tgaggattta 7980tttctggtct ctccattcag
ttgattggtc tttatgtcta tctttatgtc cttactgcac 8040tgttgtgatg gctgtagcta
aatggtacac ttaaaaatgg ttaaaatagg ccaggcacgg 8100tggctcacgc ctataatctt
agcactttag gaggctgagg tgggcagatt gcctgtgctc 8160aggggttcga gaccagccta
ggcaacatag tgaaaccccg attttactaa aatacaaaaa 8220ttagctgggt gtggtgtgtg
cctgtaattc cagctactca ggaggctaag gcacaagaat 8280tgcttgaggc ctggtgctgt
ggctcacgcc tgtaatccca gcactttggg aggccgaggc 8340aggtcaagag atcgagacca
tcctggccaa tgtcatgaaa cctagtctct actaaaaata 8400caaaaaatta gctgggtgtg
gtggcgcgca cctagtccca gctacttggg aggctgaggc 8460aggaggatca cttgaaccca
ggaggtggag gttgcagtga gccaagattg cgccactgca 8520ctctagattg gcagcagagt
gagactctgt ctcaaaaaag aaaaaaaaaa aaaagaattg 8580cttgaaccca ggaggtagag
gttgcagtga gctgagattg caccctgcac tccagcctgg 8640gcaacagagt gagactattt
acatacccaa tttttttttt tttttttttt tgggatggtg 8700tcttgcactg tcgcccaggc
tggagtgctg tggcgtgatc ttggctcact gcaacctctg 8760cctcctgggt tcaagcaatt
ctcctgcctc aggctctcaa gtagctgggt tacaggtacc 8820tgccaccacg cctggctaat
ttcttgtatt tttagtagag atggagtttc actatattgg 8880ccaggctggt ctcaaatttc
tgaccttgtg atccgctggc ctcagcctcc caaagtgctg 8940ggactacagg tgtgagccac
cacgcctggt catacccaaa tattttacca taattataca 9000agaatttatt atttttattt
ttttcttttt aaattcttta atcttcttca tttgttaatg 9060ctttgctgaa tcataaaaaa
ttatgaaata aaaagaatag gtcttgttga ttcttctttt 9120tacttacctc cccctactta
ccccctctta ctttatcaaa gaaaacactt catttgaaac 9180ttaacggaag tacattctcc
cagagaggaa aatccttcag gacaacattt ttttttgttt 9240gcttgttttt tttgagacgg
agtctcactc tgtccccgag gctggagtgc agtggtgtga 9300tcgcagctca ttgcaacctc
tgcctcccgg gttcaagcga ttctcctgcc tcagcctccc 9360gagtagctgg gactacaggc
gcctgtcacc atgccctgct aatttctgta tttttagtag 9420aaacagttgg ccaggatggt
ttcaatctta tgactttgtg atctgaccac tttggcctcc 9480caaagtgctg ggaatacagg
cgtgagccac agtgctcagc caattttttg tatttttagt 9540ggagaaaagg tttcaccgtc
tttgccagga tggtcttgat ctcctgacct cgtgatccgc 9600ccgcctccca aagtgctggg
attacaggcc tgagctacca cgcccagcct ttttattttt 9660ttattttatt tattttattc
tcagccttct gggtaactgg gactacaggt gtataccacc 9720acgctcagct aatttatgta
tttttagtag aaatggggtt tcgccatatt ggccaggctg 9780gttttgaatt cctggtctca
agtgatctgc ctgcctccgc ctcctaaagt gctgagatta 9840caggcatgag ccactggccc
agactacact taaaattttc aaatcgagat attttggggg 9900gcaagggtgc ttctagcagc
cactaattcc agttcttgag tgcatattaa agttgctact 9960gtttaaaagc ttgtagttgg
atccagggag tgggtaggcg gtcagagtaa cccttgcttc 10020ttggtgtctc cttgatgctc
ttagctgaat gtcctgtgta gcccacaaca tttactttgg 10080gaaaaaatta agagtgttta
aagcaggatc aagctgctgc ataccacagc taaaactact 10140agaataagac ccctggttct
gtttcattgt tttttggagc taaagtcatg attaagaagg 10200atggcctggg atattggtac
tgtgctgcta gaggtgcaat tcctggttct ttgcaagata 10260gaccagagtg aaagcatttg
ttaggaatgt ttttattaat caagagtgaa aggcaaggcc 10320aggcgtggtg actcaggctt
gtaatcccag cactttggga ggccaaggtg tgggatcatt 10380tgaggtcagg agttcaagac
cagcctggcc aacatggtga aaccccgtct ctactaaaaa 10440tacaaaaatt ggctgggtgt
ggtggtgcat gcctgtaatc ccagctactc gggagactga 10500ggcaggagaa tcgcttgaat
ccgggagacg gaggttgcag taagctgaga tcatgtcact 10560gtggtacagt ctgggtgaca
gagggagact gtttcaaaaa aaaaaaacag aaagaatgaa 10620aggcaaaaca ttaaaaatag
aattaccatg tgatctaaca attttacttc tggatatata 10680tccaaaataa ttgaaaacaa
agaaaaagaa aaacagagtc tcgatgagat atttgtaccc 10740atgttcataa cagcgttatt
cacattagct aaaatgtgga agcaacccaa ctattcattg 10800atggatgaat agataaggaa
aatgtggtat gtacatataa ctgaaaaatt attcagtgtt 10860aggaaggaag gtaattctga
catatgctac aacatggatg aaccttgagg atattatgct 10920aagtgaaata agccagtcat
gtaaaagaca aataccatat aatttcactt agacactttg 10980agtagtgaaa atcatagaaa
cagaaaatag ttgtcaggga tggtgtgagg gatgaatcag 11040cagttactat ttctttttgt
ttgtttgttt tttgagatgg ggtcttgctc tgttgcccag 11100gctggagtac agtggtgtga
tcttggctca ctgcaacctc tgcctcccag gcacaagcca 11160tcttcccacc tcagcgtcct
cagtagctgg gactacagat gtgttccacc ttgtccggct 11220gatttgtgtg tgtgtatatg
tgtgtgtgtg tgtggagaca aggttttgcc atgttgccca 11280ggctggtctc gaactcctga
gctcaagcat caagcaatct acctttttca gctttccaaa 11340gtgctggcat tacagacaag
ggccactgtg cctggccttt actatatttt attttattta 11400ttatttattt atttatttat
ttatgtattt tgagatgaag tctcactctg ttgcccaggc 11460tggagtgcag tggcacgatc
ttggctcact gcatcctctg cctcccaagt tcaagtgatt 11520ctcctgcctc agcctccagt
tattattatt attattatta tttttttgtt gttctgtttt 11580tttgaggtgg agtctcgccc
tgtcgcccag gctggagtgc agtggcacaa actcggctca 11640ctgcaacctc catctcccag
gttcaagtga ttcttctgcc tcaacctccc aagtagctgg 11700gaatacaggt gcccgccacc
acgcctggct aatttttgta tttttagtag agacggggtt 11760tcaccacatt ggtcaggctg
gtcttgatct cctgatcttg tggtccacct gcctcggcct 11820cccaaagtgc tgggattata
ggtgtgagcc cccatgccct gccttgttat tattattatt 11880tttatttttt tgtctgagac
ggagtcttgc tctgtcaccc aggctagaat gcagtggcac 11940gatcttggct tagtacaacc
tctgcctccc gagttcaagt gattctcctg cctcagcctc 12000ccgagtatat aggactacag
gtgtgtgcca ccatggctaa tttttgtatt tttagtagag 12060atggggtttc accatgttgg
tcaggatggt ctagatctct tgacctcgtg atctacccgc 12120cttggcctcc caaagtgctg
ggattacagg catgagccac tgcgcctggc cccagttttt 12180gtatttttaa tagagacagg
gttttggcat gttggccagg ctggtctcag actcctgacc 12240tcaagtgatc tgcccacttc
agccttctga agtgctggga ttaaagacat gagcactgtg 12300cccagccact tttactatat
tttaaattag gttacttatc ctttgttttt tttttttttt 12360gagacgaagt tttgctcttg
ttgcccaggc tggtgtgcaa tggtgcatct cgactcaacg 12420caacctctgt ctcccgggtt
caagtgattc tcctgcctca gcctcccgag tagctgggat 12480tacaggcatg catcaccacg
ccagctaatt ttgtattttt agtagagaca gggtttctcc 12540atgttggtca ggctggtctc
aaactcccga cctcaggtga tccacctgcc ttggcctccc 12600aaagtgttgg gattacaggc
gtgtgccact gctcctggct tatttttctt tttgttactg 12660agttgaaatc attttttata
tattttagat acaagtcact taccaaatat gtaatttgca 12720caaattttct cccattctgt
gggatgtctt ttcatttaaa ccaaaaaatt gtagagatgg 12780gggttttgct gtgttgccca
ggttggtctt gaactcctgg tcttaagtga tcctctgacc 12840ttggcctcaa aaagtgctgc
gattataggc atgagccaat gtgcgcagct taccttttct 12900tcttttcttt ttttttgagg
cagggtcttg ctctgttgcc caggctggag tgcagtggtg 12960caatcatggc ttactgcagg
ctgaaactcc catgctcaag tgatcctccc actttagcct 13020cctaggtaac tgggacctta
ggggcgtgcc atcacacctt gctaattttt tttttttttt 13080gagatggagt cttgccctgt
cgcccaggtt ggagtgcagt ggagcgatct tggctcactg 13140caaattccac ctcccggatt
caagtgattc tcctccctca gcctcctgag tagctgggac 13200tacaggcgtg tgccaccacg
cccagctaat ttttgtattc tgagtagaga cgggatttca 13260ccacattggc caggctggtc
tcgatctctt gacctcgtga tctgcccgcc ttggcctccc 13320aaagtgctgg gattacaggt
gtgagtgtga gccaccgaac ctggcctttt tttttttttt 13380tgagaccgtc tctgtcaccc
aggctggagt gcagtaacat gacacaatct ccgctcactg 13440caacctctgc cttctgggtt
caagtgatcc ttctgccaca gcctcctgag tagctgggat 13500tgcaggcatg tgctaccacg
cctggctaat ttttgtattt ttagtagaga cggggtttca 13560ccatgttggc ctacctggtc
ttgaattcct gacctcagat gatctgcccg catcagcctc 13620ccaaagtgct ggggttacaa
gcgtgagcca ccacgcctag ctggacctga ctaattaaaa 13680aaaaaatttt gtaggctggg
cagggtggct cacacctgta atcccagtac tttgagaggt 13740ggaggcgggg taatcgcctg
aatcaggagt ttgagaccag cccgggcaac ataacgaaac 13800cctaggtcta ccagaaatac
acaaaaaaat tagccgagca tggtagtgca catttgtagt 13860cccagctact caggaggctg
aggtgggagg atggctggag ccagggaagc agtggttaca 13920gtgagccgag aatgtgccac
tgcactcccg cttgggtgac agagtgagat aaggtctcag 13980aaaaaaaaaa aaaatttata
ggccgggcgc aatggctcac gcctgtaatc ccagcacttt 14040gggaggacca ggcgggcgga
tcacaaggtc aggagatcga gaccaccctg gccaacatgg 14100tgaaaccccg tctccactaa
aaaaatacaa aaattagctg ggcgtggtgg cacgtgcctg 14160tagtcccagc tacttggcag
gctgaggcag aagaattgct tgaaccctgg aggcggaggt 14220tgcagtgagc cgagattgca
ccattgcact ccagcctggg cgacagagcg agactccatc 14280tcaaaaaaaa aaaaaaaaaa
aatttgtgaa gacaaggtct caatatttgc ccaggatggt 14340ctgaaacttc tgggctcaag
ccatccttct gcctcagcct cccaaagtat tggaattaca 14400ggtgtgagcc actgtgtctg
gcctatttat agactcttaa ttctgtttcc ttggtctgta 14460tgtctatact atgtcagtgc
cacactgtct tgattactgt agctttgtgg tgagttttgg 14520aattgggaag tgtcagtcct
ctaactttgt tgtatatatt cttttctgtt ttgcacagat 14580atcaggttac aaatattttg
cacacttttt tttttttttg aaatggagtc ttactctgtc 14640acccaggctg gagtgcagtg
gcgcgatctc agcccactgc aagctccgcc tcccaggttc 14700acaccattct cctgcctcag
cctccccagc agctgggact gcaggcgcac actgccatgc 14760ccagctaatt tttttgtatt
ttaagtagag acagggtttc actgtgttag ccaggatggc 14820ctcgatctcc tgacctcgtg
atccgcctgc ctaggcctcc caaagtgctg ggattacagg 14880cgtgagccac cgcacccggc
cttgcacatg tttttaaaac ttaatacata atagcttatc 14940ctgtatcaat taacatagct
actttattta ttcttagggc cgcatagtat ttttttttct 15000ttcttttttt tttttttttt
tttttttgag actgagtctc gctctgttgc ccaggctgga 15060gtgcagtggt gtgatcttgg
cttaagcaac ctctgcctcc tgggatcaag cctcgggatc 15120ctcctacctc aacctctgca
gtatttggga ctacagacac ctgctaccac acccagttaa 15180ttttcgtatt tttttgtaga
gatagggtct ctattgatgt gcatttaggc tttataatat 15240ttatatatat attttttgaa
acaaagtttt gctcttgttg cccaggctgg agtgcagtgg 15300catgatcttg gctcactgca
accttcgcct cccaggttca agtgattctc ctgccttaga 15360ctcccgagta gctgggatta
cagtttttaa aaaatgtatc ctaggctggg cgcagtggct 15420cacgcctgta atcccagccc
tttgggaggc tgaggcgggt ggatcacctg aggtttggag 15480tttgagacca gcctggccaa
catggtgaaa cctcgtctgt actaaaaata caaaaattag 15540ctgggtgtac tggcgggcac
ctgtaatctc agcttcttgg gaggctgaga caggagaatc 15600tcttgaactt gagaggcggt
ggttgcagtg agccattgca ctccagcctg ggtgtcaagc 15660aaaactctgt ctctctctct
ctctgtgtct ctctctctct ctctgtgtgt gtgtgtgtgt 15720gtgtgtgtat atgtatatat
attctgccaa tattttgtga ttagagagtt taaagtattt 15780acatttaaag taattactga
taaggacttt tgccattttg ctactacttt tatgtttagc 15840tgattttttt ttttttggta
gtgaaaaaaa attttttttt tttgagagca tgagactgtt 15900gcctaggctt tggtgagcaa
aatagtgcag tgccacaatc tcagctcact gcaactttgg 15960gctcaagtga tcctcctgtc
ccagtctcct gagtagctgg tagtataggt gtgccaccac 16020catgcctggc taatttttgt
attttttgta gagatagggt tttgccatgt tgcccaggct 16080ggtctcaaac tgggttcaaa
caatctacct gccttagcct tccaaagtgt tgggattaca 16140ggcattagcc actttctgcc
ccctcccccg cttttttttt tttttttttt tttttgagac 16200ggagtttcac tcttgttgcc
caggctggag tgcagtggca tgatttcagc tcactgcaac 16260ctccgcctcc cgggttcagg
cattttcctg cctctgcctc ccaagtagct gggattacag 16320gcttgccacc atgcctggct
aattttgtat ttttaataga gatggggttt ctctatgttg 16380gtcaggctgg tctcgaactc
ctgacctcag gtgatcctcc tgccttggct tcccaaagtg 16440ctgggattat aggcgtaagc
catcacgcct ggcccacgct ttattttttt atttttattt 16500tttattattt atttatttat
tttttgagac ggagtttcgt tcttgttgcc caggctggag 16560tgcaatggca taatctcagc
tcaccgcagc ctccgcctcc tgggttcaag tgattctcct 16620gcctcagcct cctgagtagc
tgaatttaca ggcatgcgcc accatgccca gctaattttg 16680tatttttagt agagacgggg
tttctccatg ttggtcaggc tggtctcgaa ctccagacct 16740caggtgatcc tcccgcctcg
gcctcccaaa gtgctgggat tacaggcgta agccaccagg 16800cctggcctgc ttttttaatt
ttttatttat tttttctttt taagagggag ggtcttgctg 16860tgttgtccag attggagaac
agtgatgaga tcatagctca ctgcagactt ggattcctgg 16920actcaagcaa tcctcccgct
tcattctttg caagtaactg gaagtgcaga catgtgccac 16980ctgccttttt tgttttttaa
atttttcata gagatggggt cttgctatat tgcctaggct 17040ggtctcaaac tcctggcctc
aagcaatcgg cttcctgaag tgctgggatt acagatgtta 17100gccactggcc tgttgtgaaa
atgttttgac tttcttctca ttttctttct ttcttttttt 17160ttttttttga agtagagaga
gtctcactat atggccaatg gtggtttcaa acccctgagc 17220ccaaggaatc ctcctgcctc
agcctcccag tgcttgtcgt gctaggacaa caagcatgag 17280ccactgtgcc tagccccttc
tcattttctt tttctttcta gtgcataagc aggcaacctt 17340attttcttat gtgtatattc
taaagatatg ttctttgcag ttaccatggg aattacactt 17400aacatctcac agttataatc
taatttgaat ttatactaac ttaagttcca tagtatacaa 17460atctctgctc ctatccagct
cctttctctt cccttttctg ttaagtcatg gattacatct 17520ttgtaaatcg tatctcagga
acctagatta ataatttttt atgcatctgt cttttagatc 17580acattgaaag tgaaaagtag
gagttacaaa gcaaaattgc aataatgcta gtttttacag 17640ttgcccctgt atttgccttt
accagagatc tttctttctt tttttttttt ttgggatgga 17700gtctcgctct ttcgcccagg
ctggagtgca atggcgcaat ctcagctgac tgtaacctct 17760gcctcccggg ttcaaaagat
tttcttgcct caggctcctg agtagctggg actgtagttg 17820tacgccacca cacgtggctg
atttttgtat ttttagtaga gatggggttt tgccatgttg 17880gccaggctgg tcttgaactc
ctgacctcag gtgtgagcca ccgcacctgg ccgagatctt 17940tatttcttca catggcttca
cgtctagctt ttaaaaattc attctgggcc gggcgcagtg 18000gctcacgcct gtaatcccga
cactttggga ggctaaggcg ggcggatcac gaggtcagga 18060gatcgagacc atcctggtta
acacagtgaa accccgtctc tactaaaaac acaaaaggcc 18120gggtgcggtg gctcacgcct
gtaatcccag cactttggga ggctgaggtg ggtggatcac 18180gaggtcagga gatcgagacc
atcctggcta acatggtgaa accccgtctc cactaaaaat 18240acaaaaaaca aaacaaaaca
aaaaaaacta ttagctggca ttgcggtggg cacctgtagt 18300cccagctact cgggaggctg
aggcaggaga atggcgtcaa cccaggaggc ggagcttgca 18360gtgagccaag atcacgccac
tgcactccag cctgggagac agcaagactc tgtctcaaaa 18420acaaaaaaca aaaaaccaca
aaaattagcc gggcgtggtg gcgggcgcct gtagtcccag 18480ttactcggga agctgaggca
ggagaatggc atgaacccag gaggtggagc ttgcagtgag 18540ccgagatcgc tcaactgcat
tccagccttg gcaacagagc gagactccat ttcaaaaaaa 18600aaaaaaattc attctgaaga
attccttttt tttttttttt ttttgtaaaa atggagtctc 18660actctgttgc cctggctgga
gtgctgagtg ccatggcatg atctcagctc actgcaacca 18720acccccactc caagttgaag
cgatactcct gcctcagcct cctgactagc tgggattagg 18780ggtgcctgct actgcacctg
gctaattttt gtatttttag tagagacggg tttcaccatc 18840ttggccaggc tggtgtcgaa
ctcctgacct cgtgaccaac ccacttcggc ctcccaaagt 18900gctgggatta caggcgtgag
ccactgtgcc cggactgaag aattcccttt tagcatttct 18960tacaaggtct gtatagtggt
aatgagcctc cctcagcttt tgtttatctg agaatgtctt 19020gatttttttc cttttttttt
tttttttttg agatggagtc tcgctctgtc gcccaggctg 19080gagtgcagtg gcgtgatctc
agctcactgc aagctccgcc tcctgggttc acaccattct 19140cctgcctcag cctcgtgagt
agctgggact acaggtgccc gccaccacgc ctggctaatt 19200tttttttttt tttttgtatt
tttagtagag acggggtttc actgtgttag ccaggatggt 19260ctcaatctcc tgaccttgtg
atccgcccgc ctcggcctcc caaagtgctg ggattacagg 19320tgtgagccgc ctcgcccggc
caatgttttt ccctattttt tgaaagacag tgttgccatt 19380tacagaattc ttggttggca
atttatattt agggtttttt tttttttttt tgagacagag 19440tcttgctctg ttgcccaggc
tggagtgcag tggtgtgacc tcggctcact gcaacctccg 19500cctccagggt tcaagtcatt
ctcctgcctc agcctcccaa gtagctggga ctacaggtgc 19560ccgccactac gcctggctaa
ttttttgtat ttttagtaga gacggggtgt caccatgttg 19620gccaggctgg tctcgaactc
ctgacctcaa gtgatccaca cgcctcagcc tcccaaagtg 19680cagggattac agacatgagc
ccccacgccc ggcctaggtc ttgtatgatc atacattttg 19740ccttggcatt catatggctt
tctaaatttc accatataca tgttgctttg gaatgtccta 19800atttgccaaa gagtttcacc
tcaacttctg tgggcatcta tctgtaatct cttgccccaa 19860gtgcctgtta gtctgtagtc
tgctttgcag ctttcattag caatacctgc tgctttctct 19920gcctgagttt tgtattaggt
tgaaatagaa acatgcacct tatgtctgtc cttcaaatac 19980ccccgcagac agggtagaac
agatatgtac gataatttgc aaataaggtc tgctttgctc 20040tttgagggag ggagctggga
attgggcttc tactgcttta agacaaaaaa cactgccatg 20100ctggagaggg ggtagggcaa
ggttgagtaa aacaccacag aactttcctt ctgttttgaa 20160gatggctttt tcttcattgg
atatttgctt gtaaaccttt gactcttttc taaaactgtc 20220aaatttggtt cagacagtta
ctacttgttt ttctgatgtt tctatgaagg aatgagacct 20280tgaaacttcc tagtctgcca
ttttgatgac ctatgggctg tctttgtact ctcttgatag 20340tgtcctttga tacacagaag
tttttaattt tggtgaagtc cctttatcta ctttttcttt 20400taaagttcct tgtgctgtag
gggtcatatt taagaaatca ttgccaaatc caaggtcatg 20460aagatttgcc tctttttcag
tagctataac aaaggtcctg gaaataactt cttatcttga 20520cttgagttac atgtctgtct
tcaaagcaat gactgtggtg agggtaatag attattccga 20580ttgctcatgc tggatggtgt
ccgatcaggt ctgagacagt ggggttgata ctacagtgct 20640gtttccaaaa aggaagggct
agtgagcgct agaaaaatca gtaaatactt acttcatgta 20700gtaaatgtga agcattcata
gcacattgaa aagtttatgg tgcccagagt accttttttt 20760tttttttttt ttgagacagc
ctcactctgt ttcctgaact ggaatgcagt ggtgcgatct 20820tggctcactg cagcctcaac
ctcctgggtt caagcgatcc tcccccactt cagccttcca 20880agaagctgag actacacata
gtcatcatgc ctgactaatt tttgtatata tattttttaa 20940gatggagtct cgctctgtca
cccaggctgg agtgcagtgg catgatcttg gctgactgta 21000gcctccgcct cccggtttca
agcgtttctc ctgcctcagc ctcctgcata gctgggatta 21060caggtgcctg ccaccacacc
tggctaattt ttgtattttt agtagagatg agatttcacc 21120atgttgccta ggctggtctc
gaactcctga cctcaggtga tccacctgcc tagcctccca 21180aagttctggt aatttttgta
ttttttgtag agatggcatt ttgctatgtt gcccaggctg 21240gtctcaaact ccttggctca
agcggtctgc ctgccttggc ctcccaaagt gttgaggtta 21300caggtatgag ccaccgtgcc
cgaccccaga gtacacattt taattaaaaa cttatttttc 21360tggccgggca cggtggctca
cgcctgtaat cccagcactt tgggaggccg aggtgggtgg 21420atcacaatgt taggagttcg
agaccagcct ggccaatatg gtgaaacccc atctctacta 21480aaaatacaaa aattagccgg
gcatggtgac gcgtgcctgt agtcccagct actcgggagg 21540ctgaggcaga agaatcgctc
gaaccgggga ggcagaggtt gtggtgggct gagatagtgc 21600cactggactc cagcctgggc
gacagagaga gattctgtct ttaaaaaaaa aaaaaaagta 21660tttttcttat tataaattta
atatgtaagt gatgtaagtg tttgaaagtg acttccagct 21720ggatgcggtg gctcatgcct
gtaatcctag cactttggga ggccgaggcg ggcggactgc 21780ttgagctcag gagtttgaga
ccagcctggg taacacagtg aaaacccgtc tctactaaaa 21840tacaaaaaaa ttagctgggc
ggccggcgtg cgcctgtagt tctagctact tgggaggctg 21900aggcaggaga attgcttgaa
cccggaggtt gcagtgggct gagatcgtgc ctttgcactt 21960cagcctgggc aacaaagcaa
gactccatct cttaaaaaaa aaaaaaaaaa agaaggccgg 22020gtgcagtggc tcacgcctgt
aatctcacac tttgggaggc ctaggtgggc ggatcatgag 22080gtcaggagat ctagaccaca
gtaaaccccg tctctactaa aaatacaaaa aattagctag 22140gcgtggtggc gggcgcctgt
agtcctagct actcgggagg ctgaggcagg agaattgctt 22200gaacccggag gttgcaatgg
gctgagatca tgcctttgca ctccagcctg ggcgacagag 22260cgagactcca tctcaaaaaa
aaaaaagaaa agaaagaaaa gaaagacctt caaaattatt 22320gctgctgatg tggtccctca
taaaccaagc agtgggaaac tggtttagct tttagttcac 22380attctaaagt actaattttt
gtggtttatt ttgtacaggt actgctataa ccagaatttg 22440gtagaaaaag gatttacttg
ttggggccct cttgataaaa agagatgtgg ggggattctc 22500gacctgctaa cagaactgga
ccttttcggt aagttctcaa atttgaatat tgaaattgcc 22560agtattttaa ttataaatgt
gtaacatttt cgcctactat aaatgaagat attttctctg 22620tggagaaata gtttctgatt
ttttaaaaat agaaatttgg ctgggcgcgg tggctcacgc 22680ctgtaatccc agcactttgg
gaggctgagg cgggcagatc atgaggtcag gagatcgaga 22740ccatcctggc tatcacggtg
aaaccccgtc tctactaaaa aatacaaaaa aaactagccg 22800ggcgtggtgg cggctgcctg
tagtcccagc tactcgggag gctgaagcag gagaatggtg 22860tgaacctggg aggcggagct
tgcagtgagc cgagatcgtg ccactgcact ccagcttggg 22920cgacagagga agactctgtc
tcaaaaacaa aaacaaaaaa aaaaaaagaa aaaaaaagaa 22980aaatagaaac tcaatttgga
aaataatttc gaaaatgatt gtgagcctga atacccagca 23040tgccaaatgt tttgtcacat
agcattttaa aattttattt atttgtttgt tttttgagac 23100aagtctctct ctgtctccca
ggctggagtg cagtggtgcg atcttgactt actgcaacat 23160ccgcctcccg tgttcaagtg
attctcctgc ctcagccttc tgagtagctg ggattacagg 23220cgcgtgccac tatgcctggc
taatttcatt attttaatat taaaaaatac ccaaatattt 23280tatttctttt tgtctcttag
cgaaggaata catatttggc tagtaaggaa agctagcaaa 23340atttacataa atgtttataa
aagttgtatt gagttcacta atttatgtct agaattcaga 23400gctgtgcctt gtctgtggca
tgttgacgca gtttgctaag ccacctctca attttagggg 23460ttacttggta ccaagaagag
tggagaaagt ggtagcattt agttgtaaat agattgtatt 23520ttaaatttgt agggaattaa
tttttttata gctagtatca tacacactgt attttaacta 23580gtatttaaac atttttcgta
ttgtgtttac aattaatgag atgctatatg aatgtgactt 23640ttttggtttt acttggtaca
tagcaaataa atctgacctt taaatgtatg cattcataag 23700tattgttgct ccagttgaaa
cttctattaa ctagtacatt ttcctttttt tacctttttt 23760caaaatggag tctcactctg
ttgcccatgc tggagtgcag gggtatgatc tcagctcact 23820gcagcctttg cctcctaggt
tcaagtgatt ctcctccctt agcctcctga gtagctggga 23880ctacaggtgt atgccaccat
gcctggctaa ttattgtatt ttttttttta gtagagatgg 23940cgtttcacca tgttggccag
gctgatctca aactcctgac ctcaagtgat ccacctacct 24000cagcctccca aagtgctggg
actataagtg tgagccaccg cacctgccat ttggattggc 24060aatctgcaag attttattac
ttaaatgcaa cagatgttct cattcattgt tctgaagctt 24120ggagttccaa tgaaaaattt
aggtggagaa ctgagtttag aaaatccata taatgtttag 24180taaaactagt atttcataaa
tgctgaatga cagagattgg tctttaaatt aaaacaacag 24240tgtgatgttg ggtatttttt
ttctttcaaa atactaagga ttagatcagt ggtcagcaaa 24300ctacagctga tagcctgttt
ttgtaaataa agttttactg gaaaacagcc actcttactc 24360atttgcagat tgtgtatggc
tgctttcatg ctatgatggc agagttgaat agttgtaaca 24420gagattgtat aacccacaaa
atccgatatg tttacgaact ggctcttcat ggaaaaagtt 24480tcctgacctc tcatctagat
caatggggtt gtacgttacc atttaaaaat atttaggttg 24540taatctatcc tcttattact
tgtatttatg ggtaactatt ttgtaagtaa ggctgtttcg 24600tatagaatta acgtggttta
ggtaagcatt cagaaatgtt aggttaattt agctttattg 24660tctaactttt ttcaaattta
gaacatttgt ctttgactcg tttaaactta tttaaaatta 24720tattttccca ccttaatttt
agtttaaatg taagtcatta tatgctgttt tttaacatct 24780ttgactagga gggagacagt
ttttgggaac taatttgaac caaaacagat ataggaaaat 24840gattttgtta catttccttt
gaacttttct tttaaaattt gtttttattt ggttgaaaat 24900aattttcata actactgata
ttttatatta gtagaatggt ttcttgattc gtctgtataa 24960aatacaaatc taagaaccct
gctacagtaa gttactctaa atctatttga tcttaattta 25020gaagagtaag ataatcttta
ggccatgttg gatgtgttct ggtcagaaaa catgtagatt 25080tcatacctca gtcctcatcc
catgagtgtc tgatgaagct taaatcttcc tgcaagaaag 25140acttgaatga ttttaaacat
gagagacact gtatttagtg gtaacatctt aattttagtg 25200ttaaattgta ttgcctaaga
agaacatcta gggcgggcgt ggcggctcac gcctgtaatc 25260ccagcacttt gggaggccga
ggcgggtgga tcacgaggtc aggagatcaa gaccatcctg 25320gctaacacgg tgaaaccccg
cctctacaaa aaatacaaaa aaattagctg ggcgtggtag 25380cgggcgcctg tagtcccagc
cccttgggaa gctgaggcag gagaatggcg tgaacccggg 25440aggcggagct tgcagtgagc
caatatcgcg ccactgcact ccagcctggg cgacagagcg 25500agactccgtc tcaaaaaaaa
aaaaaagaag aacatctaaa cttgctcctc ttatgatgaa 25560ccacatagac ataactagtg
ttaatggggg tcagtggaag tcatcatgtt ctgaaaatcc 25620attaaatgta catcattcta
gtgtttaggt taatgctgtt aaattcctgt tactttaaga 25680aagggttggc cgggcatggt
ggctcacgcc tgtaacccta accttgggga gacagagatg 25740ggtggctcac ctgaggtcaa
gagttcaaga ccagcctggg cagcatggta aaaccccatc 25800tctgctaaaa ataaaaaaat
tagctgggca tggtggcgca tgcctgtaat cccagctact 25860ctggaggctg aggcatgaga
attgcttgaa cccaggaggc agaggctgca gtgaaccgag 25920atcatgccat tgcactccag
cctgggcaac agagcgagac tccgtctcaa aaaaaaagaa 25980aaagagaaag aaaaggtttg
gcattgcaac tatttctctt gaactgagtg acccagaatc 26040agttgtcctt tgaattttag
tatagtagca tagtctgagc tcagaagggc cttatgatag 26100accctgtatg ttctgggagg
caagaattga gttggtatta atatcttaat gcttttgttt 26160tactgctgaa taacagatga
cccttcaggt cttttcatgt tttccttttt catgtctccc 26220tgcctaggat cctaggtgcc
taattgccta cttaaactag tttagggaat cttggactga 26280agccaaaaca tgtaaaatgc
cctgaaggtt aggcaaaggg aagaagttgg gtagtatgaa 26340agattaggtc acatcttgtt
tatctcttga gttctataaa ttgagaatgt aaatttaata 26400ctatgtctat ttttaaaatg
tattttattg ccatgaaaaa gtagcatgag acattggaat 26460atggaatatc agcttcttca
tttgggtcat ggggatcatg cttgaagacc taatgctctc 26520tctaggtcta tctcagcatt
gagcccctgg atgctgttgc gtggcttaga tgacttatac 26580atgctttgtg gcatgattca
tactaccttc taccttctgt gatacccttg ggtagttata 26640ataggaccca ggttagagtg
cttcttggtg gagccactgt agaactggga tttagatgca 26700gccagggctg atgctcagct
ggtgaacact ggtgtgcttg ttcctactgg tgatttacaa 26760ccagtgtttc ttctttttgg
gcctgcatcc attttgattg ggtggtgtcc atgctgtatc 26820tgtaataaaa tatttttgaa
tgttaccgct ggatgcagcg tgagaaagat acctcctgaa 26880acttactgta agaaatttac
agtgcattga tttttctgat atataggaat cgtcatgttg 26940accttggaat tcttaagttc
cctggctgta ggaaatggaa atttttgtag tatgtcacca 27000ttgttagctt atttggtatt
gcggattttc cctgttgcag gactgggtga aagctttttc 27060tgcagcagtc atgttgaaaa
ccttgtgttg actttcctcg tgttctgaaa tgggagcata 27120aaagtttact ccgccacttc
gtcttaaaat agcaaaactt tgctgttttc tgcagatcta 27180ggaccttgtt acagaactct
gccaaaaaaa aaatgtttac agaagaatgt gctgtgatta 27240gagaagaata tgctggtgtg
tagatttcaa actctctgga caatatgaat aacactgtct 27300ttgtttctac agtgggagcc
aagaagaaag gtttgctccc gggtggaaca gggattatcc 27360tcctcctccc cttaagagtc
atgctcaaga gagacactct ggcaactttc ctggcagaga 27420ttcacttccc tttgatttcc
aggggcattc ggggcctcct tttgcaaatg tagaggagca 27480ttctttcagc tatggagcta
gagacggacc gcatggtgac tatcgaggag gggagggacc 27540tggacatgat ttcagggggg
gagatttttc gtcttctgat ttccagagca gagattcatc 27600acagttggac ttcaggggta
gggacataca ttctggggat tttcgggata gagaaggacc 27660acctatggac tataggggtg
gagatggtac ttctatggat tatagaggta gggaggcacc 27720tcatatgaac tacagagaca
gggatgctca cgctgttgac ttcagaggta gggatgctcc 27780tccatctgac ttcaggggcc
ggggcactta tgatttagat tttagaggcc gggatggatc 27840ccatgcagat tttaggggaa
gggatttatc agatttggat tttagggcca gagaacagtc 27900ccgttctgat tttaggaata
gagatgtatc tgatttggac tttagagaca aagacggaac 27960acaagtagac tttagaggcc
gaggttcagg tactactgat ctagacttta gggacaggga 28020tacgccacat tcagatttca
gaggtagaca ccgatctagg actgatcagg attttagggg 28080cagagagatg ggatcttgta
tggaatttaa agatagggag atgccccctg tggatccaaa 28140tattttggat tacattcagc
cctctacaca agatagagaa cattctggta tgaatgtgaa 28200caggagagaa gaatccacac
atgaccatac gatagaaagg cctgcttttg gcattcagaa 28260gggagaattt gagcattcag
aaacaagaga aggagaaaca caaggtgtag cctttgaaca 28320tgagtctcca gcagactttc
agaacagcca aagtccagtt caagaccaag ataagtcaca 28380gctttctgga cgtgaagagc
agagttcaga tgctggtctg tttaaagaag aaggcggtct 28440ggactttctt gggcggcaag
acaccgatta cagaagcatg gagtaccgtg atgtggatca 28500taggctgcca ggaagccaga
tgtttggcta tggccagagc aagtcttttc cagagggcaa 28560aactgcccga gatgcccaac
gggaccttca ggtatgttga tggggtggat tgcttttttt 28620tttttttttt tttttttttt
tgagacggag tctcgctctg ttgcccagcc tggagtgcag 28680tggtgcgatc tctgctcatg
caagctccgc ctcctgggtt catgccattc tcctgcctca 28740gcctcctgag tagctgggac
tgactacagg cgcccaccac cacgcctggt gtgagccacc 28800gcgcccggcc tgcttttttt
ttttttcttt aaataagact tttgtgaagg atgacattta 28860tttatttatt tatttattta
tttttgaaac ggagtcttgc tctgtcaccc aggctagagt 28920gcagtgacat aatctcagct
cactgcaacc tccgcctccc agggtcaagc aattttcctg 28980cctcaacctc ctgagtagca
gggattgcag gcatgtgcca ccatgcccag ttaatttttg 29040tatttttagt gcagatgggg
tttcaccatg ttggccaggc tggtctcgaa ctcctgacct 29100cgtgatccgc ccacctcggc
ctcccaaagt gctggaatta caggcatgag ccaccgtgcc 29160tggccagttt tttttttttt
ttttcatttt atttttatct ttgcataacc attagaaagc 29220aaaatttgta ttcaggagtg
gaatgtagga atgtaaatct ctagagaaaa ggtcctcagc 29280tcagatcata tatatgtgtg
tgtgtgtgta tatatatata tgaatatata tgtatatata 29340tgaatatata tttatatata
tatatttctt ttttctttta ttcttttctt cctgcttcac 29400tttccatttg tgtatatatg
tgtgtgtata tatgaaggaa ctatatatat atatatttga 29460gacacggtct tgctctgtca
ctcgggctga agtgcggtgg tgtaattatg gctccttgca 29520gccttgacct cccaggctca
agcgatcctc ccacctcagc cttctgagta gctggaacta 29580cagatgtgcg ccagccacta
tgcctggcta gttttttttt ttttcctttg agaatgagtc 29640ttgctctgtc gctcaggctg
aagtgcagtt gtgcgatctc agctcactgc aacctctacc 29700tcctgggttc aaggggttcc
cccgcctcag ccttccagga agctgggact acaggtatat 29760ttcaccattc ctagttagtt
gtgttttttt ttttcttttt tgagatggag cctcaccgtg 29820ttgcctaggc tggagtgcag
tggcacgatc ttggctcaca gcaacctccg cctcccgtgt 29880tcaagcagtc ttcctgcctc
agcctcctga gtagttggga ctgtagttgt gcaccaccaa 29940atctgactaa tttttgtatt
ttttgtagag atgaagttta ggcatgttac ctaggctggg 30000ctggaacccc tgatctcaaa
tgatccaccc ttctcagctt cccaaagagc tgggatttca 30060ggcatgcacc accatgcctg
gccagcaatt tttgtatttt tttgtagaca gaaggttgca 30120acatatttcc caggctggtt
tcaaattcct gggttcaagc agtcccccca ccttagcttc 30180ccaaagtgct gggattacag
caatgagcca ctgcccctac ccttttgatg tgtgtttatt 30240cattattttg ttttatgatg
ctgatttaca tgccttggga taatttagtt tgaaagtata 30300tgtctttggg agttgactct
tgcaactctc gcttagttag acctgtgatt gtttagggat 30360cattttctta tttaaattca
ttgagagaat acttaggagt ctccctagtt gtgaagagct 30420gatattaatg ttgcaactat
cctcttgcag ctaacgtaat taacttaaat gttaaacttc 30480ttgaatatat gatttaagca
aggagggtta tatttgtaat tttacaatga aggtattctc 30540ttttaaagta gatttggctg
ggtacagtgg cctatgcttg taatttcagt gctttaggag 30600gctgaggtgg gaggatcact
tgaggccagg aacttgagac cagtgtggtg caacctcagg 30660agagaatgtg agggtgggga
agaaaaataa ggccaggcac agtggctcat gcctgtaatc 30720ccaacacttt gggaggcaaa
ggtgggcaga tcatttgagg tcaggatttc aagaccagcc 30780tggtcaacat ggtgaaaccc
catctctact aaaaataaca aaaattaggc caggcgtggt 30840ggttcttgcc tgtaatccca
acactttggg aagctgaggc aggtggatca tttgaggtcg 30900tgggtttgag accagcctga
ccaacacgga gaaaccccat ttctactaaa aatacaaaat 30960tagctgggcg tagtgatgca
tgtgtgtaat cccagctact cgggaggctg aggcaggaga 31020atcccttgaa cctgggaggc
agaggttgcg gggaggcaga ggttgcacta ttacactcca 31080gcctgggcag caagagcgaa
actccatctg aaaaaaaaaa aaaaaacgaa aaccaaaacc 31140agccaggtgt ggaggtgggc
gcctgtaatc ccaactactt gggaggctga ggcaggagaa 31200ttgcttgaac ctggggggcg
gaggctgcag tgggctgaga ttgtgccact gcactccagc 31260ctgggcgaca gagcgacact
ctgtctcaaa aaaaaaaaga cattatctag tcatcttctc 31320tcaccagagg tatgaagtac
tgctagttta cagcccattc tccagctctc agaccaggga 31380aatttttctt tttttttgag
acgggggtct cgctctgtca cccaggctgg agtgcagtgg 31440cacaatcttg gctcactgaa
acctctgcct cccaggttca agtgattctt ccgcctcagc 31500ctcctgagta gctgggacca
caggcgtgca cagcacagtt ggctaatttt tgtattttta 31560gtagagacgg ttttaccatg
ttggctaggc tgagaaaatt actgttttga gactatgtta 31620gtgtgtcttt ctggttatta
aagtcttact cagtcttgtc tctcgtaatg ttttgcttta 31680ctttgaagac tctttcagtg
agacttggtc ttagcacatt tacattctta tgatttgaag 31740tcacattctg gcactcagaa
caatagagaa aattgtaatt ttttatatct tcacgtgaca 31800tgtcattatc atttttgatc
ctgagtggct aaatttcatg ttgatttgtg ttttgtgcag 31860taaagtatat ttgtgaaata
atttttcatt ctcaatttaa ggatcaagat tataggaccg 31920gcccaagtga ggagaaaccc
agcaggctta ttcgattaag tggggtacct gaagatgcca 31980caaaagaaga ggtaaggcat
gtcttctctc ctgtttctct gtgtcaatta aaaattaaaa 32040aaacctttta atttgaaaaa
ttgtagattc acaagaaggt gcaaagaaat gcacagagaa 32100gtcttgtgta tttttttccc
atcttccctc agtgttaata ttttgcacaa ctgtggtata 32160gtatctaaac caggaaattg
accctggtat aatacataaa gtttattcag atttcaccat 32220ttatacatgc actcactgag
gtgaggttaa aaaaaattat gacaaatgat tgctctcttt 32280agacctgatc acatccttta
gagcatatta tttctggagt atgtacataa ggatgcagtt 32340tatttacaat agtaaaaact
agaaactgcc taactgccct gtatcaaagg attggctgac 32400taaattaagt ctgaacttat
ggcagtgctc gctctgtgcc aggcattgtg tgatacttac 32460aagcattagt tcatttaatt
atcacatatt taatataatc actctaaata ttaagcatta 32520ctgtatgtaa ttgttctaga
tactgagtga cacagcagtg tatattatca agtcactgcc 32580tccatggata atgaaaaagc
aagcaaaagg attacacaat tttagtcagc aaataaatac 32640tctgaagaaa actaaagtac
aggcggggca tggtagctcg gcctgtaact cggagacaga 32700gtcttgcttt gtcgcccagg
ctggagtgtg tggcgcgacc ttggtgcact gcaacctcca 32760cctccccagt tcaagcagtt
ctcctgccgc agcctcccga gtagctggga ctacaggcac 32820acaccaccac gcccagctaa
tttttgtact tttagtagag acggagtttc accacattgg 32880tcaggctggt cttgaactcc
tgacctcagg ttatctccct gcttctgcct cccaaagtac 32940tgccattaca ggcatgagcc
accaagccca gcccattttt gatttttttg aggcagcgtc 33000tcactttgtt gcccaggctg
gagtgcagtg gcacaatcac ggctcactgc agcttctacc 33060tcttgggctc aatcgatcct
accacctcag cctcctgagt agctgggacc acgggcatgc 33120atgctaatgg ggctgttttt
tgtattgtgt agttagggag acatcactga ggaagaggca 33180ttcgagccca ggcttgaatg
ccgtgagaga acagtttata tgaatatggg gaaatgaact 33240gcccaggcag ttcatgctga
ggaagtgctg tggccctgga ctgtaatgaa cccagtacat 33300cattttatat ttaacacatg
agaaactgga cactaaaagg ttacacagca agtgagcaga 33360gagcttggaa tgcacacagt
atgatttcag agcttaagcc tttgaaggtt atgctcttct 33420gcttttcttt tttttttttt
ttttttgaga cagagtctca ctctgtcacc caggctggag 33480tgcagtggcg cgatctcggc
tcactgcaac ctctgccgcc agggttcaag agattctcct 33540gcctcagcct cccaagtagc
tgggattaca agcacctgcc actgcaccca gctgattttt 33600gtatttttag tagagatggg
gtttcaccat cttggtcagg ctgatcttga actcctgacc 33660tcaagtgatc cacccgcctc
ggcctctcaa agtgctgaga ttacacgcat gagccaccgc 33720gcccagcatt ttgtttgttt
gtttgtttgt ttgtttttga gacagagtct tgctctgtca 33780cccaggctgg agtgcagtgg
cacaatcttg ggtcactgca acctccgcct ctcgggttca 33840aatggttctc ctgcctcagc
ctcctgagta gctgggacta caggcatgtg ccaccacgcc 33900cggctaagtt tttgtatttt
tagtagagac ggggtttcac cgtgttagct aggatggtct 33960cgatcccctg acgtcatgat
ccgcctgtct cggcctccca aagtgctagg attacagatg 34020tgagccaccg cttctggccc
tgcttttcct atgtacctga gaatttttaa atatttattt 34080atttattttt gagacagggt
actccagact ggagtgcaat ggcccaatca aggctcacta 34140cagcctcaaa ctcctgggct
caaactatcc tcccgagtag ctgggattat aggtgtgagc 34200cagtactcct ggctaatttt
tttttttttt ttgagatgga gtctcgctct gttgcccagg 34260ctggaatgca gtggtgcgat
cttggctcac tgcaagctcc ttctcccggg ttcacgccat 34320tcttctgcct cagcctccca
agtagctggg actacaggtg cccgccacca cgcctggcta 34380atttcttgta ttttttagta
gaaacggggt tttaccgtgt tagccaggat ggtctcaatc 34440tcctgacctt gtgatctgcc
cacctcggcc tcccaaagtg ctgggattac aggcgtgagc 34500caccgtgccc ggccaatttt
tttttttttt tttttttttt ttttttaaag atagtgtctc 34560gctctgttgc ccaggctgga
gtgcagtgtc atgatctcag ctcactgcag cctcagcctt 34620ccaggttcaa gtgattctcc
tgcctcagcc ttccaagtag ctgggattac aggtgtgtgc 34680caccacacca ggctaatttt
tgtattttta gtagaaatgg ggtttcacca tgttagccag 34740gctggtctcg aactcctgac
ctcaggttat ccacccgcct tggattccca aagtgctggg 34800attacatgtg tgagccacca
cgcccggtct ctcctggcta attaagaatt tttttttttt 34860ttttagagat agggtctcac
tatgttgccc aggcttgtct caaacatgtg gctttaagca 34920atcctctcac cttggcctcc
caaagtgctg ggattatagg caggagccac tgcatcccac 34980caatttttga ataattatgt
tctactcatt caatatgtga atgccttgag tgttcatagt 35040ttaactttgc ttttccaaag
taatcatggc tttaaattat gtatgataaa aactgttagg 35100gaaaatctga tattcagtgt
ttgattatga tttgtatcat ttgtataaat gccatatttt 35160tgcagattct taatgctttt
cggactcctg atggcatgcc tgtaaagaac ttgcagttga 35220aggagtataa cacaggtgag
tttcttgact tgcatatggc cttgggttag gaagggtctt 35280tgtcagatct ctgcatcatg
tgctacttaa aatttgtttc aagaaaccac aattaaaatt 35340tccagaagcc tcccgttggt
gcctccaaat aacaaccagc tttagtttta gctgtggttc 35400tttgtggatg tttgtccaca
catgggtgat gaggatgcat gttccagttc ttctgaatgc 35460ctgtgatata tagagtgttg
cagcaattgc cttgaatata ttttatataa ttattaaact 35520tgctatgcat gttcttcatg
gtggtggaat gtttatgctt gagcctaata ggatttaata 35580agcttgttgt atgtaaaatt
ttacattcat tgcttcagta aaatttatga cttcccagag 35640aaattgtaca aattagtggt
ttaattttca gttttgcttt gagaatggag tcctgttaca 35700gttattttgt tgaaatccat
gaatagaccc agaagagctt tccctttgac atctgttctg 35760tggtctgaat ggtagattaa
acttttcaga atatcctcct agttgtattt cacagtacca 35820atttcagtca tttcctttaa
atcttactac agtaaaagta ggcaaaggtg aaatgccaag 35880aactcaaggt ttttgaccaa
tatttttaga actatgtata ataataagtt tatttattta 35940aaaataaagg taatctttag
gtgacctatt ttgcagaatt ttaaatggaa gggaatagag 36000catgagtctt cacagaactt
agaatttcag taattcagtt aaagacatct tcaagtaaga 36060acatgtcata ttttgaggat
ataatttact attagcagtt tatcatggga taaaaatttt 36120gcattaacta gataacttct
tcagaatgct tctgcagagg aaaattatcc acaaaataaa 36180ttttggtgct tgaaagaata
tggtgttaag ttcagaaata atttgttctg taatttgaga 36240acaagctcag aagtattatt
tctcagagag ccaattattt atttgtttta aaaacatcaa 36300ccctgaattt gtggaagcat
gagtaagagt agatatatta ttattcttgg tatctcactt 36360atgttggtta tatttatttt
ttgcatatgc cttatacatg ctttctttgg gaactcaagg 36420tagaatttac aggctggaga
tgctttttaa ctctcaggat aataacctca gtctggtttc 36480atgaactgtg ctttcattaa
gtattgatat gtttaggaaa ggagatgtct taatatttaa 36540atagcagttc aaactccagt
ttctttagta ttcattgact ttctaattgt caaatttgtc 36600aggacagtaa aaattgtatt
aacatatagt gtctagagag gaagttctta aatttgccga 36660ttgtggtagc tgttagaatt
ggcagactga agacattgat acacatggga aatcattcag 36720ggcagtgctt aaaaataaaa
cgaaaaatac ctttcagcaa atacaatctt ttcttggcat 36780tctgttaagt tgtgtttttt
atttttgttt tttagtgaaa gaattggatt gctagtttca 36840tgttatttat attacatctc
tatgtgacaa ataggatgaa cttttgacaa tatcagccag 36900atcatgttac tcccatgtct
aaaaccctct tagggccttc atcttcactt ggaagaaatt 36960cccagcttct tcttttgtct
tacaaaccca tgcgtgagct gacccttggc tgtttgatct 37020cattcagtac tgccctccac
ctaccctatt ttgctgtagc cacactgagc ttttctcttg 37080tctttgacca atacaaactt
ctttctgtgt cagggtcttt gcactactct tctctctgat 37140ctttacttgt cttctggggt
ttagttcttg gcttcagttt cacgtctctg aggccttgtg 37200tcactctcaa atctaaaatc
atcgggcagt tgttttccat catatccttg tttggatcta 37260tcactgattg gatatttcta
tcactggtat ttttcagttg gatctatcac tgatctatca 37320ctggtcactg attggattga
atctgtcagt ggtattggat ctatcactga tatttttctc 37380cgtggttttg tgtatcttat
ttctctcact agagaggaat gtcagcagga gccttattcc 37440ttcttgtttc caccagtgct
tgacactcgg taggttccct atatgcatgg aatagattat 37500tatttatggt gtatgtgaag
agcagctgtg atttcccctc aggtgaggaa cataaaaggg 37560tagtgtaggt ttcacagcag
tgcagcttag gtcttacata tctgttgaag aatatgtctt 37620ggaacaatca gatgttctaa
gaactatagt gtttactgtt aaaagatcat atgtggtagt 37680caggcatggt gttgcacacc
tgtagtccta gctacttggg agtctgagat gggagaattt 37740tttgagcctg agaatttgag
atcagcctga gcaacatagc aagaccttgt ctcttaaaaa 37800gaaaaagaaa aaaaaatgtg
aatcttagta gtaacagtga cttaaaaatt tttttttata 37860agagaaaggg tcttactctg
ttgcccaggt tggagtgcat tggtacgatc atagcttact 37920gtaacctcaa acccctcggc
tcaagtgatc cttctgtctc aacctccaga gtatttggga 37980ctacaggtgc gtaccaccat
ggcaggctaa tttttaaact ttttgtagag gcgcggtctc 38040actatgtttc ccaggctggt
cttgaactcc tgggttcaag tgattctcct gcctcatcct 38100cccacagtgc tgggattaca
gatgtgaacc agtatgcaca gacaaaaagg tgacattcat 38160aggtgaaaac tggtaataaa
tattttaggc tgagtgatga cctgcagaga ccatgcagga 38220tggatattgc tcataagagg
ggaattgtgg agtacagtct gtcctgttag ttgatgtaat 38280ggagggctga tctataacac
aggagagaag attaacgcct cttcgttgac tctagtaatg 38340tattagtgta atttttgtct
cctctagagc tgtataagta cagggtcaca attttatcta 38400gaacctgtga ggttaaatga
gcttatgaat ttttcaagtt atagaaatgt agtttacata 38460gatcatatgg gaattatatc
tcccagggga atgtgtactc agacataata cttacgctgc 38520aaaattatta atattctcac
taacaggagt aaataaagtc tcacagtata ggccaggatt 38580tgcctcaaaa tgagtttgtt
gaattttacc aaaaaacttg acatttatgg gattttggaa 38640ttgtagataa gagattttgg
acctatatat gttgtgtata tttgaatttt tcatttgcca 38700tttacaaata cattataacc
ccatgaattg taaattatct tgaattatat gattatttct 38760ggaaaaagta ccaggagtaa
aatgtctttt ggtgactaga caaactctag tatatatata 38820aaatggaata cttctcagca
atgaagaaga aactactcat gcacctaaca acatggatga 38880atctcaatgg caatatgctg
agtgaaagaa actagactca taaggatata tacactacca 38940taaggaggaa tgaaatactg
atgtatgcta caagttggat gaaccttgaa aacattataa 39000aagaagccag acacaaaaga
ccaaatattg tgcaattcag tttatatgaa atatctagag 39060aaggcacacc cgtagagata
gaaagcagat tggtggttgc caggggctaa gcgggaatgg 39120ggaacgactc cctaatggtt
atggtacttc ttttgggctg atagaagtgt tctggaacta 39180ggtagtagtg atggttgcat
gacattgtga atgtacttaa tgctcctgaa ttgtacactt 39240taaaatgatg cattttattt
gatgtgtatt tgcttacttt gttttttttt tttttttttg 39300agatgaaatc ttgctcccgt
tgtgtaggca ggagtgcagt ggcatgacct cggctcactg 39360caacctccat ctcccgggtt
caaacgattc tccttcctca gcctcccaag taactgggat 39420tacaggtgtg tgccaccaca
cctggctaat tttttgtatt tttagtagag acggggtttc 39480gccatgttgg ccaggctggt
cttgaactcc cgacctcagg ttatctacct gcctgggcct 39540cccaaagagc tagcattaca
ggagtgagcc actgtgccca gccagcttac aattttttaa 39600aaaggctaca tactatatgt
gtatgtgtga tttcacttat gtgacattct ggaagggaca 39660aaattttagg gattggaaat
agtggtggcc agggtattgg gggaggagtt aactataaag 39720cggaagcatg agggaatttt
tgggtataat ggaattgttc tatatcttga ttgtggtgat 39780gatgtatcaa tgttaaattc
cccgagttga taactactgt ggttatgtta gagaacatct 39840tttttctttt cttttttttt
ttaaacggag tctcgtttgg tcacccaagc tggagcgtaa 39900tggcgcgatc tcagcttact
gcaacctctg cctcctggat tcaagcaatt ctgcctgcct 39960taacttcctg agtagctggg
attacaggcg cctgccccta ctcctagcta atttttgtat 40020tttttttagt agcgacaggg
ttgcgccatg ttgaccaggc tggtcttgaa cacctgacct 40080caggtgatct gcccaccttg
gcctcccaaa gtgctggaat tacagacgtg agccaccatg 40140cccggctgag agtatcttta
ttcttagaaa atacataatg aagtttttag aagtaaagta 40200ctgtgatgta tgcagctttc
tctcatggtt tcgaaaataa tacttgctat aaatggagaa 40260ggaaggaaga gagtattgat
aaagtagatg gatcacaatg ttattaatag ttgaatctgg 40320ggccacacgc ggtggctcac
gcctgtaatc ccagcacttt gggaggccaa ggcaggtaga 40380tcatctgagg tcaggagttt
gagaccagcc tggccaacat ggcgaacgaa acctgtctac 40440taaaaaatac aaaaattagc
cgggcgtggt ggcgggtgcc tgtaatccca gctactcggg 40500aggctaaggc aggagaatca
cttgaactcg ggaggcggag gttgcagtga gccaagatca 40560cgccattgca ctccagcctg
ggcgacagag caagaattca tcttaaaaaa aaaaaaaaaa 40620aaagttgaac ctgggtaaag
catatatgaa tcttttccct gtactattat tattgcaatt 40680tttttgtaac ttggaaatta
tttccaataa aaagttgaaa aactgacaaa actgatttat 40740tttattttat tttttatttt
tttgagacgg agtcttgcac tgtcaccagt gctggagtgc 40800agtggcgcga tatcggctca
ctgcaacctc cgcctcctgg gttcaagcga ttctcctgcc 40860tcagccatcg gagtagctgg
gattataggc gcctgccacc atgcccagct aattttttgt 40920attttttagt agagacgggg
tttcaccatg ttggccagcc tggtctcaaa ctgacctcat 40980gattcgtcca cctctgcctc
ccaaagtgct gggattacag gcatgagcca ctgcgtccgg 41040cctatatttt atctttaaat
gatcagcaga aaccttgtaa gctgaagact gcaatcaaca 41100gcttatgtca agtaaactat
agagcagtgg ttctcagagt ggatcctgga ccatcatcat 41160ctctttaccc cttgggaact
tgttggaatc caaattctta agccccatcc taaacctact 41220gaatcagaaa ctctggggtg
gggcccagta gcctgtgctt ttaagaagtc ctccagatat 41280ttttaatgta ccctgaggac
cactggcagt agataaagtg tttgtttaga ttctttattc 41340tagaactttt gtatagttta
aaagtgactt aataataagc aagtggacct tttgtaagta 41400gacaaagcta atgcttatgt
gctttaggag ccagtgctga tcacatgcct tgcctaccta 41460atatcagttc tcctgctctg
catagcagga gaaggagctg gagtagtgtt ggtactatct 41520tatgacttta gttatatgta
actaaggaca tataacttag ttgttttttc tgtttatata 41580tagtatactt cctccagaga
tcttggaatg gttgtagatc ttctcattca cacagtgttt 41640ctgtgacata tgaatgcagg
cagaattgct tttgattttt aggtttgttt gcatactacg 41700tagtatataa gcttgctgtg
atatttttcc aaaagggatt tatatcattt aagcaaaaat 41760gatacagctt ctggattatg
tttcctaata aggctcaaac atagaaagta attatagtaa 41820ctgaagtgct acagaattac
tttagtactg gtttattaac taatgtcaca aagttagagg 41880attactaagg tggtgttagt
aggaagaagc aatatcttgc tttagcccgt cagtgttcat 41940gtggtgaatg gacagtctct
gtattcttgg gaaggaaaat tcttcttgga aagtgagtat 42000ttgcaatgac taggtcagtc
acttggtctg ttgcctggca ttttgggtct actgaaagtg 42060acgttgtagc aaaggccctg
taccttctgc atttcttttc ttttcttttt tttttttttt 42120tttttttttt tttggtagaa
acaaggtctt gctttgttgc ccaggctgcc cttgacctcc 42180tgtcaagcag tcctcccacc
ttagcttcct gagtagctgg gactacaggc gtgtgccacc 42240atgcctggtt aatgtaaatt
tgtttggttt ttttgagaca gagtttcact cttgttgccc 42300aggttggagt gcagtgacgt
gatctcagct cactacagtc tctgcctcct gggttcaagc 42360gattctcctg cctcagtctc
ccaagtagct gggcttacag gcacccgcca ccacgcccag 42420ctaatttttt gtattttttt
agtagagacg gggtttcatc atgttggcca ggctggtctt 42480gaactcccga gctcaggtga
tccacccacc tcggcctccc aaagtgctgg gattacaggt 42540gtgagccacc gtgtctggcc
tatttttaaa ttttttttga gacagagtct ctctcagtca 42600cccaggctgg agtgcagtgg
tgcaatctca gctcactgca gtctctgcct cctgagttca 42660attctcctgc ctcagcctcc
ctagtagctg ggattacagg cctgccatcg tgcccagcta 42720atttttgtat ttttagtaga
gacagggttt caccatgttg gccaggctgg tttcaatctc 42780ctgacttcaa gcaatccacc
tgcctcggcc tcccaaagtg ctgggattac aggcatgaac 42840caccacgcct ggcctaaatt
ttttttttgt agagacaggg tctcacgctg ttgctcaggc 42900tggtcttaca ctccaaggct
caagcaatcc tcctgccttg gactcccaaa atgctgagat 42960tacaagtgta agacactgag
gccagctgcc ctttacattt cttaagggta acaggctcat 43020gtcctttcat tattcacaat
ttaaatattt tgagtcttta cttctgtgtc aatataacag 43080aagtaacttc cttacgaaga
aaattccaga gggaatcttt caatgtaggg atagaaatcc 43140attgtgaaac tcgagaattg
acactgatga tataaaacat gcacagtagc cgagtgtggt 43200gatgtgtgcg tgtagtctta
gctactcaac agtccgagac atgagctcag gagtttgtga 43260ccagcatggg caatatagtg
agactctgtc tcaaaaaaag gaaaaaaaaa agtgcatagt 43320ttatggtatc ccaactggag
gagctaaaga cagaatagct taacatcatt tagaaaaaaa 43380attataattg aaaagtgcaa
atacacattt tgcagtgttt ttggcattta caaaatatgt 43440aaacactttt agtttcttag
ggaaaagatg acgataggct gattgaaaaa tatcattttt 43500acttgtcaca tctctaaaac
agcagaagtt cttgttttta accaggagtc ctatcaggtt 43560tgatacaacc ttcggggagg
atgtggcagt tgaaatttaa ggaaacttag tttccttaag 43620gtggctgagc ttaaaaaatc
aaaatgttta ggaaggcagg agacactaat agggctgggc 43680tagtcttgtg gaggcagtgg
atggacgctt tggctggcct agggaagaat ctgtgattca 43740gtgctgcagg gatcaggtga
tcctggtgag agaggtcctg gaacaagggt taatttggtc 43800atttttggaa tgacctggga
tttggcttat ttattttatt tttaaaattt cccgctgggc 43860acagtggctc aaacctgtaa
ttccagcact ttggaacgcc aaggccagtg gatcactcga 43920gctcaggagt tcgagaccac
cctgggcaac atggtgaaac tctatctctc caaaaaaaat 43980acaaaaaaaa ttagctggat
gtggtggtgc atgcttgtag tcccagctac ttaggaggct 44040aaagcaggaa gatcacttga
gctagggagg tgagggtgga ggttgcagtg agccaagatc 44100atgccactgc actccagcat
gggcaacaga gagagacctt gtctcaaaaa aataaaatgg 44160tgaatgtaaa ataaaatggt
agctcacgcc tataatcctg gtactttggg aggccgagat 44220gggtggatca cttgaggcca
ggagttacag accagcctgg tcaatatggc aaaactccca 44280tctctactaa aaatacaaaa
actagctggg ctggtggtgt atgcctataa tcccagttac 44340tcaggaggct gaggcagagg
tcacagtgag ctgagatcac accactgcac tccaggctgg 44400atgacagagt gagaccctgt
ctaacgtgac atcacatcac atcacatcac atcacatcgc 44460atcgcatcgc atcgcatcgc
atcgcatcgc atcgcatcgc atcgcattgc atcacatcac 44520atcacaacat aacataaatt
ttcaaggcag aaatcttgta gtcagcctta ctgtttgttg 44580acaaggacac ggccctgagc
acagaaatct cggcagttga taaagccaag aagaaggata 44640ctaattaaag aaattttcag
attttgcatc ttctggcatc tcagctaaat agctctgagg 44700aggaggatgc cacttaccag
ttttgagaca caggcaggtt atattatttt cctgaaaacc 44760atttagctga gatggaattt
gcctctctga ggttggggaa ggtgtttgaa ctctgtttac 44820agccctctgt cagttccact
gccttgctga gttccctcac ccttctttag atagaattgc 44880tgttggcttc tatagtcctc
acttacctct tttgccaaat gctcaggtag ccttggctga 44940gtcttccagg tttgataagg
ctgtatgggg cttcctatgc cttttggtag ttagaagtca 45000ctgaagaggt acttctgcta
cagtgacaag aagaaaaggg cattactcag cttgtatagt 45060gcaagggctg cttgactccc
agcttcagtc taggcagggg aatttattta tacaattacc 45120ttaaatgagc accagataga
ggccatctat aaaaactgtt tacaggattt aaaaatacgt 45180tgacattggc ttcttccttt
aactttctgc ttgcaacaga acatctgatg cgacctatgc 45240tgctcactgt ttctaggtta
cattctctac ccttgcagtg taaattaatt tttgcctggt 45300tccatgtttc ttgcttaggt
tatctcttag gtcttttgtc tgatttaaat ataagccttc 45360ttaggactag atagtggtga
tggttgcact actttgtcaa tataccactg aattgtatgt 45420attcactctt ttaagaatga
gtttattttt atttttattt ttattttgag atagagcctc 45480actctgtcgc ccaggctgga
gtgcagtggc gtgatctcag ctcactgcaa cctccacctc 45540ccgggttcac gccattctcc
tgcctcagcc tcccgagtag ctgggactac aggcgcctgc 45600caccacgccc ggctaatttt
ttgtattttt agcagagacg gggtttcact gtgttagcca 45660ggatggtctc gatctcctga
ccttgtgatc cgcccacctc ggcctcccaa agtgctgggt 45720ttacaggcgt gagccaccat
gcctggcctt aagaatgagt tgattgttct tagtctcagt 45780tgagtacatt gtgttatgta
tagaaaatgt tatattttca tttttaaaaa ttattattat 45840tattttgaga tggggtctca
ctttgtcacc caggctggag tgcagtggca cggtcttggt 45900tcactggcaa cctccacctc
ccaggtacaa gtgattcttc tgcatcagcc tcctgaatag 45960cgggaattac aggcgcctgc
caccaagcct aagtaatttt tgtatttttt ttttttagta 46020gagacggggt ttcaccatgt
tagccaggct gatcttaaac tcctgacctc aagtgatcca 46080ttcgtctcag actcccaaag
tgctgggatt acagatgtga gccattgcgc ccagcccatt 46140ttaaaaaatt aaactggcct
ggtgcggtgg ctcacgcgtg tgatcccagc actttgggag 46200gccgaggcaa gcggatcatg
aggtcaggag attgagacca tcctggctaa catggtgaaa 46260ccccatctgt actaaaaaat
acaaaaaatt agccgggcat ggtggcgggc tcctgtagtc 46320ccagctaatt gggaggctga
gacaggagaa tggcatgaac ccgggaggca gagcttgcag 46380tgagccgaga tagcgccaat
gcactccagc ctgggcaaca gagcaagact ccgtctcaaa 46440aaaaaaaaaa aacaaaacaa
aaaaaaacca aaacattaaa ccatactctc taactgtgaa 46500gaagttgtga tttattcttt
agtgttacct gccattcttt ttgtctcttt ctctctcttc 46560tcttctcctc tcttctcttc
tctcttcttc cctcccttcc cctcccctcc cctccccttc 46620tcttttcttc tcttctcttt
tcttttcttt cagagttttg ctctgttgcc caggatggag 46680tgcattggca tgctcacggc
tcactgcagt gtcaacctcc caggttcaag ctgtcctcct 46740acctcaccct ccctagtagc
tgggactata gacatgcacc accatgccta attattttgt 46800attttttgta gagacgaggt
tttgccatgt tgcccaggct ggtcttgaac tcctgagctc 46860aagtgagcta cctgcctcag
cctcccaaaa tgctgtgatt acaggtgtga gccttatttt 46920attatttttt tttgggacag
agtctctctc tgtcctccag gctggagtgc agtggcacga 46980tcttggctca ctgcaacctc
tgcttctcgg gttcaagcaa ttctcctgcc tcagcctccc 47040aagtagcctc ccaaagtgct
gggattacag gcatgagcca ccatgccagg cctctgatgc 47100atatattttt taaaaatagt
attttccacc ttacagtgta tttaagagtt tgtaaatttc 47160cttttttgtt ttctttttgg
aacagtgttg ctctgttgcc caggctggag tgcagtgaca 47220tgatcttggc tcattgcaac
ctccacctcc cagattcaag tgattctcct gcctcagctt 47280cccgagtagc tgggattaca
ggtgcccgcc actacgccca gctaaatttt ttgtaatttt 47340agtagagaca ggtttcacca
tgttggccag gcaggtcttg atctcctgac ctcaagtgat 47400ccgcccacct cgacctccca
aagtcctggg attacggaca taagatactg tgcctggctg 47460agtttgtaaa tttctttctt
tcttttttct tttttttttg agacagagtc ttactctgtc 47520acctgggcta gaatgcaata
atgcgatctc tgctcactgc aacctctgcc tcctgggttc 47580aaacaattcc cctgcctcag
cctcctgagt agctgggatt acagccgcct gccactatgc 47640ccagctaatt tttgtatttt
ttgtagagat ggggttttgc cgtgtaggcc aggctggtct 47700agaactcctg acttcaggtg
atccacccac cttggcctcc caagcgtggg gattacaggt 47760atgagccacc acgcccggtc
atcaaagata atgtttttaa tgatcaggag cactttgaga 47820tgtttagaac aatctgaaac
ctgatttcca agccatctca aaatatactt tggtaatcaa 47880gacagggaaa tgatggtgtt
atatcatttg tgggactcaa ctgattttgt tgagtattga 47940ttttgctgtg ggattccttg
ttctcttggt tgtgttgggc ctactgcttt ttaaaaaagt 48000attttgagac agggtcttac
tctgttgctc aggctggagt gtagtggcgc agtctcttgt 48060ctctgcaacc tcaatctcct
gggctcaagg gatcctccca cctcagcctc ccaagtagct 48120gggaccacag gtacccacca
tcacacctgg ctaatttttg tattttttgt agacatgggg 48180gtcactgtct tgcccaggca
ggtcttgaac tcctaagctc aaacaaccgt cctgccttgc 48240cctcccaaat tgctggaatt
acaggtgtga gccagtgcgc ctggccttct tttttttttt 48300taaccactat tttttagaac
tagatttggc ctggaaagag aaaaaagata ttcctcgact 48360tgatctatat attttatggt
tcattcattt gctttagagg tagaaggagc aggaaaaagt 48420acaacaaaac aaaatcttac
ctttggtgtt taatttgaat gcccacagat gcttttgcat 48480ttattagtag tgagttttca
taattatcaa atatgtagta gaaaaatctg gctgtgcatg 48540gtggctaatg cctgtaaatc
cctatatgct gggaggctga ggcaggtgga ttttctgagc 48600tcaggagttc aagaccagcc
tgggcaacat ggcaaaaccc catctctgcc aaaaataagc 48660tgggtgtggt ggcacacgcc
tgtggtacca ggtactccgg aggctgagct gagaagattg 48720tggaggtttc agtgagccaa
gattgcacca ctgcactcca acctgggtga cagagtgaga 48780ctccatctca aaaaaaagaa
aaaaaaatct ccttgtccag gagctgtgtt gagtgggctg 48840tggactagca ggaattcata
gctctggtga aagatgacta gataatgtca tttttttttt 48900aaaagtccct gaatgattgt
gacagggtag gaaaatcatc acatagcaaa atcttcatta 48960gattttccct aatgacttat
caactgggtt tgtgcaccaa acgaaacaac ttcctgcctt 49020tgtttgtctg aaagtcaaag
aaaatattat tcaggtatat tatattgtac tccatgctac 49080agaagtttct ggcagcaata
taggttatat gccaatcggt taaataatat ttgtgggcca 49140ggcccggtgg ctcatgcctg
taatgccagc actttgggag gccgaggcgg gtggatcact 49200tgaggtcagg agttcaagac
cagccagggc aacatggtga aaccccatct ctactaataa 49260aacaaaaatt agcctagtgt
ggtggcacac gcctgtaatc ccagctactc aggaggctga 49320ggtaggagaa tcgcttgaac
ccggaaggtg gaggttgcag ctgagattgt gccattgcac 49380tctagcctgg ggccacaaga
gtgaaactgt ctcaaaataa ataaataaat aaataaaata 49440ataataatat ttgtgtaagt
acagggatat gtttcttcaa ctccaaagta tgagttaatg 49500tgcatatgcc aactctagaa
ataaagtatt aagtcaaaac tcccaagaaa atttccccaa 49560aaagttgcta acagacgtta
ttttatttta tttatttatt ttgatacaga gtctctccca 49620ctgtcaccca ggctggagtg
gtgcagtggc atcatctcga ttcactgtag cctccgcctc 49680ccagattcaa gccattctcg
tgcctcagcc tcccttgtag ctgggattac agttgcccac 49740caccacgcct ggctgatttt
tgtattttta gtagagatga ggtttcacca tgttggccac 49800gctggtctcg aactcctgac
ctcaagtgat ctgcccgcct tggcttccca aagtgctggg 49860attacagttg tgagccactg
cacctggcct ttaattttaa tttctaaaac tatggagtaa 49920tactacattg agggaacaga
attttctatt ccttcatttg tattattatt aaatacagtc 49980atgcattgca taatgacagg
aatacatttt gagaaatgga tcaagtgatt ttttcattgt 50040gtaaacatca tagggtatat
ttacacaaac tagatgttat agcctactat acagctaggc 50100tatattgtat agcctgttac
tcttcggcca caaaactgta cagtgtgtta ctgtattgaa 50160caccataggc aattgagaca
caacggcatt tgtgtatcta aatatagaaa aggtaatgca 50220ttgtgccacc aaatcaacaa
cagctatgat gtcactgggt gataggaatt tttcagtgcc 50280attataatct tatggaacca
ttgtttcata tgcatgcagt ttgctgttga tcaaaatgta 50340gttaagcagc acatggctgt
aattaaaaca ctattgtttg ttataataga aaataaaatt 50400tttcttttta gcctctgtat
taataaagag cactagaaag tactttgttt atcagataat 50460gaatatgttt gacagatgta
catacgtatt tatcaaatga atcttttttt gtgggggaaa 50520ccttaactaa gaataggcct
gtgttttaaa atggctgcct ggaggacaag tgctataagg 50580aaatttcagt ggtatttgct
tgacctggca ttaagtgggg ggaaaaacaa gccccaggtg 50640aattgataga tggatgtctg
aacatgttca ggaatgatgt tttgaacaat gtttgcctcc 50700tgtgtcatgt aggcagagag
atgataaaag tttttttccc ctcttgatac caggtaattc 50760tgataccgac taccagaagt
tagcttcaga ctccgcaggt tgaagggctt tgtcccataa 50820gaccattctt acttcagaca
ccaattgcaa tgatcagtta tcaggtccca aggttaccta 50880cacttatgtc tgatttggct
acaaaattgg aggttcccac agtctacccc ttcagatttg 50940ataactttct aatatggctg
caaaaaactc agagaaatac ttatgtttat cagtttttta 51000taaaggatac aattagccag
atgaggagat agatagggca aagtccagga gggtcctgag 51060tgttgagtgt aggagtctct
gtcctgtgga atatgccacc gtcccagcat gtagatgtat 51120tcaccaatca ggaagctctc
tgagcccttt tgtgtagttg tttttatgga ggtctcatta 51180tgtaggcagg attgattaaa
tcattgacag tgggtgattt gctcaagccc ctctcccctc 51240atcagaagtt ggtgggtggt
actgaaagtt ctgaacttct ggtcaaggct ttgtctttct 51300aggtagccct catcctgaag
ctatctaggg gctttccaag agttgtctta ttagaacaaa 51360gaacactcct atcaccctta
tcactcagga aattccaagg gttttaggag ctgtatgcca 51420ggaacctggg acagaccaag
tatctttctg tgataccaca gaatgggacc ccaaaagcca 51480gctccagctg gtgtctagtg
cctttagttg ggcactggat atcggttaca gggcataagt 51540ggcccagtgg ggttgccgtt
taacccatct ctgctgtatt aacctcatgt accttagctc 51600atggctaggt cgtttcaagt
ctcacctaat gtcagttgtt tcatccttct ctggatgcat 51660gttcacttct ggaataggtg
aatatctggg ccactatgtt tgctgtcatc ctgagcaaac 51720ttccagctta gaaaccagct
ttatggaatc atcccagagc ctttatttta ttttatttta 51780ttttatttta tttatttatt
tatttatttt ttgaggcaga gtcttgctct gtagcccagg 51840ctgaaatgca gtggcaaagt
catggctcac tgcagcttca acctcccagg ctcaagcaat 51900ccttccgttt cagcctccca
agtagctgag attacaggtg tgtaccacga cacctggctg 51960atttaaaacc ttttgtagag
atagtgtccc agtgtgtttg cccaggctgg tctcagactc 52020ctggggttaa gcgatcctct
tgcctcagcc tcccaaaatg ttgggattac gggcgtgagc 52080cactgaactt ggtcccagag
ccttttagaa cagtgttgag ttgcccttta tttgcaccag 52140ggctaaggca gtagaaaaaa
aatgtttatg ggccatgttt ttcttcctag tcaaaataaa 52200aatagccatg taatctatgg
aggcagcaga tatgttgtta gtatacacta gaagtcagga 52260aattcgtact gcctttcagc
tgctaaagta ctgggacata tttgagaagc agtaatgcag 52320aggcagctgt ctgatctttg
atctctgata atgcttattt cattgcatcc ctgaaaccac 52380cctgcaaagg atttatcatc
tttgctgctt tgcatatgga atagcatagg cccagagaga 52440cgtagcttga ctgcaatcac
atggtgagtt agttgtagct tctgcaaatg tacagaacta 52500agaagctact tttcttgtgt
gttattctag tgatgatggt cattataatt gatgtacctg 52560atattatgct aggtttaggg
atacagaaat gaaagaagat cacagtccct catctgggac 52620ctctgttttt ttggtgtcac
ctctctgcat agacagttct gcagtattga tgctgctgtt 52680ctggttgatc cttctgtcat
gcctgcacca tcttttctgc cagactgaag tgttcttgct 52740tggggaaaag cagatttgca
aaggttctct ttttcctgat tgttgctttg cagattgagt 52800atatttgttt gtttgttttt
aagtgaacaa aagttgaatg agattgatta ctggctcttt 52860aaagaataat tactccccct
tttgacttat gtagcatctt gaggtgatct atgaccgttt 52920gtacttgtca tgacttccat
tagattaaac tctggggcaa agacgttgct cttcattgtg 52980ctcatatgac accattactg
ccagtggaat tgaaataaat tgagtaaggg cgagtgtttc 53040ctaacaaatg ttatcctggg
cctgaggaac catcatcaag atggagtggc cctgcgatta 53100attttggact taaagcaaaa
aacaaacaaa tttttttctt taaataacca gttggcacag 53160atacagaata aaataagata
gatccacgtg tagtttttga aaatttaggt caggtggctc 53220actcctataa tcccagcact
ttgggaggcc aaggcgtgtg gataactcga ggttaggagt 53280ttaagaccag tctggccatc
atgatgaaac cccatctcta ctaaaagtac aaaaattagc 53340tgggcatggt ggcgcatgcc
tgtaaaccta gctactcagg aggctaaggc aggagaattg 53400cttgaacctg gtaggcggta
gttgcaatga gccgagattg cgccactgca ctccagcctg 53460ggtgacagag tgagactctg
tctcaaaaga aaaaaaaatt taagaaataa tcatcagtgt 53520atatcttcct tttttcattt
ttctttaaaa aaaaaaacaa cccttgtatg catagctgaa 53580ggagaaataa ttgaaagtgt
ttataagatt tcaaggtgat gggctggaca cagttgctca 53640tgcctaataa tctgcacgcc
tgtaatccca gctattcggg agcctgaggc aggagaatca 53700cttgaaccca ggaggcagag
gttgcagtgg gccgagatag tgccattgca ctctagcctg 53760ggcgacaaag gtgaaactcc
atctcaaata aaaaaaaaga tttcaaggtg atgggtttca 53820tgtggaccaa ttttatcctt
ccctgatgat aatttgacat atgagtcaga tattttccta 53880attttcgtaa ttcgagtggg
attgtgtgtt tgtttgtttg ttttgagaca gggtctcact 53940ctgttgttca ggctggagtg
cagtaggcca gtcatggctc actgtagcct tggcttctca 54000ggctcaagtg agcctcccac
ctcagcctct taagtagctg ggactatagg tgcgtgctcc 54060cacacctggc taattttttc
tgtttttttt tgtagagaca aggtctcatt atattgccga 54120ggctgggact cctgagctca
agtaatcctc ctaccttggt ctcctaaagt gctgggatta 54180tatccacgag ccaccacacc
cagcctcgca tgagatttta acagagcaaa gtacctgttg 54240gaaatcttgc gcacaaagcc
tcctttattc tgttattccc actgacagga attcagatac 54300ctggatcaat tctgtttcgg
ttttgctaaa atctctaact tgatatttta cttttctaaa 54360aacctgtatt atcaatgaaa
tggaattagg aaaacaggac ctatagaagt taagacctct 54420tcaatctatt gatgtttcat
ggtgcctttt atattcaaaa tgctttgttc tcacaaaaat 54480aatacttttt gtttggagaa
aaaggctgtg gggtgtgtgt gtgtgtgtgt gtgtgtgttt 54540tcctctcaaa gatagcagta
aaataaactc cttctgacaa aggcttctta aaagaaagga 54600gaaaaaaaaa ccttcctgct
aattgtgttc tttaaaatcc tgattccccg ttttactttc 54660tggatgtgta ttctgggctt
tttcaatgtc aaccaatact ctcttgatgg gaaattcagc 54720tggatttggg tatgttcatt
gggttttcct agaacagttt gaagatccat ctcatttacc 54780taaacaaata ttccttataa
ttattatgaa aattgggcct gttatagact aataattgac 54840ttaaaccata cagggttatg
tttgtcagta tctcgtgagt cagcttttct aggggcagag 54900attgaagagt tagttctgag
attgaatact atttatcagg gttttgtttt gtgtacctta 54960ttctcctgta accacctggt
tggcttttat catagataca tttttgggaa acaggcaacc 55020acatggttaa tgaagataga
gaagacgtga aatttgttac ctttatagat tttttcccct 55080tgccctgttc tcattcttct
catttgccta aaaaaaaata taaggaggcc gggtgcggtg 55140gctcacgcct gtaatcccag
cactgaggca ggcagatcac ctgagctcag gagttcgaga 55200ccagcctggc caacgtggcg
aaactccgtc tctactgaaa atacaaaaat tagccgggcg 55260tggtagtccc agctactgca
ggtacctgag gcaggagaat tgcttgagcc tgagaggcag 55320aggttgcaat gagccgagat
tgtgtgccat tgcattccag cctgggtgac aaagcaagac 55380tctgtctcaa aaaaaaaaaa
aaaagtataa ggagtattca catttctatg agatctgtaa 55440atttaggtta gaaaatttag
ttaactgtgt tttgtaatag tcatataaat aagcacaaag 55500accctccaga cttcttccca
gcatgtgaca gtggaagaaa ggggtaataa agtagatttt 55560tttgttactc tcattggtaa
aaataagtct gtccatggga aggttaacac tgagtttacc 55620atcttgatga ttccatatgg
ttcctagcaa ttctaatctc aaagttggtt ggcagaatgt 55680ttaggtcttt gggtagaata
tcttctgtgc cttttctgtg aattgtaaaa ttacatttgg 55740gaaataaaga aaaaaatccc
tgattatccc actatagcaa tacaaccact gtaaacattt 55800tggtatacag ttgtgttgca
ttatatgcat tttctgattt ttgtatgtcc acctgtgctt 55860atttgaactg tatccccctc
cccacttccc acaccctgtt ttctcactcc tggagtgagc 55920atgggcagtg gggatgagac
tcgcctgtgg cttcagtttg tctccttttc taagtttctc 55980tgagtgggca ttcactgtgc
tggctgtgat tctgttattt aaagcaatat attttcatac 56040cttatggccc ttaaatgcaa
gccaacctct tcatctggtg tcaaccaaag gaaaagtgat 56100ctgttgcagc gctggaggaa
aaactggcaa tgttggactt acctaaattg aaagatggta 56160tgttgttctt caccttgggg
tcttcaagta tgatttttga cagtgcatgg tttttatctt 56220acatgctgac ttttgtctct
aacccttgag ttagatgcaa tttaattcca gccctttttc 56280cctatataca ttttacataa
ttatccataa gggtatattc atttttaagg ctcttaaaat 56340atactatatc aagtatcttt
ccatattgct atacaatttt tgtagctgtc atttataata 56400agacatttta gttttcgtct
tttcagaaga attttgggag ctagtataat cagctcctta 56460gaatgctttc taatttgcat
actcaggtct actcacaata gttctgccat agatatttaa 56520aatagaagca actgttatgc
tgctaaattg aatatttctt aactaggctt atttcttaac 56580aggggcatag atgtatgttt
tcaggcatat gggacccttt ctgtaactag gcttctagag 56640tttagaatta agattattta
aattggtcta tgatcttatt gaagagtgag aggctagagt 56700gtagtggtta aaaacatcaa
cttgaatctg gactgcttgc atttaaagct cagtattggt 56760acttacttgg ttactttgat
cagtttacct atcctttctt tgccgccttc tacatggcta 56820aaatcaggtt aataatattt
acctcttaag atagtattgt gaatattaaa tcagtatata 56880caaagtattt agaataaaat
cttgaaccaa caagttttat gtaaatatta cttactttca 56940taggctagtt tgctaattgc
tgaaaatcct tatggcacaa ccatgagtct tgaacacaca 57000gaataccttt ttttttttta
acgttttagg cagtatagtt aaaccttaaa tttctgttct 57060tgtttgatag ctaaagtttc
agtcagaata aatttagtgt tgggcttgtg aatataatat 57120taaatctgaa gtatgttgtc
aacatatagt attgcagggt tgatgtctag aaatgctata 57180ttagatgctc ataatgtttt
ctgtatcttt ttcttcccaa tgtctacttg tcctttagca 57240aagtatgaac gttgtcatga
atcttttctc tctgtccaca gattctgtgt gctcctctgg 57300gccacagtag ttacttcttt
aagcacagaa aggaaactta gggctttgcc agttttagta 57360ttagttctct atgtttttca
tccgggcagc tatggagagg gttgctttcc acacacctgg 57420gtactccatt gatatgttct
gtagaggtaa tcaacacact agagagtaca cctgtttgtt 57480ccatggctaa ccctttctga
ttgtagacat gcatttgagt gtttgcagtg gatatttggt 57540gctaacaggt gtcttagttc
cttctgttat tctgtaatgt ttcccaagaa tattcagagc 57600tgtttataaa atgcagagtg
atttatgtat taggtgttca atgagtttga tcaagaagat 57660tcacttgaaa ggaattaact
aacaaagcag tttccattgt taataggata tgcatgctgt 57720ttctctaaag tatttttatt
tcttcaaaga gttattaagc agaggagact gattttgtgg 57780taagtttgga ggggttagtt
ttaatccacg ttggtcaaaa ctaaaagtag attagaaaat 57840ctatttctca tcctacagta
gtgctgaggt ttctagtaga ttgtttttct tcttcctagt 57900cattttctga aactcaaaac
aagaatcaac ctataccatt gtaatgtttc acagttaact 57960tggagtattt aacaagtcta
aaatcaaagt ttattgttat tagtaaaaca ttttgaagcc 58020attcatttca tgtgacaagg
aatcttattt caccaaatgt ggtatgtttt taaacttata 58080ctttctattg ttctagtttt
gttgatcttt gatattgatg cagtgatatc agtttcctat 58140tgttatatat actttgtggt
caaaattatc atagggtttt gtgtttttct ttgcctgagt 58200tttgcttctc atctgaacat
cacatctttt tcttgcggtt ccatttacac agtgtgattc 58260ctagagatga gcttctttat
cctctgaggc agtggaggaa gcatggaagc cacttgggga 58320gcactgcatt gacaagtgta
tttttgtagt cacatggtgg cagtgtcagg gaaattatag 58380gactggttag attctagttc
agcaacctat aaatcccagg acttcccagt ctcgtgagtc 58440atacaactct caggcctgtg
ctgcaaatga cacttgctta ggagtaaggt gaagggtatt 58500ttatagctct aatggtttgt
acagttctta aacatgtatt gattgctaac aactgctgtc 58560tttctcccag cttgccccac
caccagtctt tgtgcataag cacaattttg gacatagtta 58620tttgtactta tttatgcttt
tacaccttct tccttttata aagattttag ctggtttatg 58680acttgagttg aaacaaggaa
aaaaagagga gacctgaaat ggtctgtccc ctgccaacca 58740gaagcctcct gtggtatcca
aacagaatag ttgcctcagt ctgtcagcac ttctgtcttt 58800gaaggtggtt tctgcttgaa
aagtggtgac tattagcata gcctggggat aattgctttt 58860ttttcttctc tcgggatacc
tttttttttt tttttttcca gatactttct tgctcttgtc 58920gactttgttt ttccagaaga
tttagcctgt ggttaaaatg tttcgggtcc ccacgtgaac 58980tctctgtggg attacccaat
tctggggtac cttcaccaga tcaccagtgc taaagagggc 59040aaaggatctt cttggttaat
agaaaaggct gttttggaat gaatctcaaa gtccagaaac 59100atcgagactt ttcttcaata
cttttttcta tttggggtag caactttacc tagtgtaggg 59160gagggagggg ttagttggga
gggcttgtgt ttaaggggtt cagaaacagg ggatttaagt 59220gtgtcttttg tgtttgcaag
gcactaacac cactcccgtc tgtatttaaa tgctgtcccc 59280aggttacgac tatggctatg
tctgcgtgga gttttcactc ttggaagatg ccatcggatg 59340catggaggcc aaccaggttg
ctttatactt cggtcaaatg atgctggaag gatatatttt 59400tttatatatg gggagggagg
gtttcaaatg attttacttt ggaaaggtac aagaagtcta 59460tctgtggagc atactgtatt
ccaaccatcg gttgtgagga aaatctttaa aaaggctgga 59520aagctttctc tacaaaactt
aatgggcaca gagtgcattt taaaagctag agcccagttg 59580cttttggact agattccaaa
gacaatagtt ggaaaaaaaa aaaaaagaca catctggagt 59640gtttcctttt ggagtgtgac
tgagatggta atcctgatgc aaagaatgat ccttgattgt 59700ctgtgacccc aaggatctgc
ctagcacaga aattctaggt caatagttac acccagacct 59760agggtgaaga cctctgatgg
tgacttctgt ggcatcagat cctgcctgca ggggctactt 59820ccaaaagaga gctatcaggg
aagagagagg agtggattgt tggtgtctat tgcattcatc 59880attgtttttt gccaattgga
gttgcatact caagtccttg gctgcgtata gtcagagctg 59940gtgaatcaga atctgtactc
accttacgtt tgaactatct ggagttactc agcttgccac 60000ctagattttt catctatgtc
tttaatagaa ccctacctgg tagttttgag aggaattaat 60060aaataggtag aatccttctt
gttatggtgc ttccttgggg aaagttgttt tctttgggtt 60120gtttcagttc ctccatctgt
aaagtaggaa aagaaactta ggaatatagt ttgatgtgtt 60180ttttttttct tttttttttt
ttttaatgta cccactgcct atacttaaca gtgtgaatac 60240agtgggccca gaatctttct
ttctttcttt tttttttttt gagacggagt tttgctcttg 60300ttgcccaggc tggattgcaa
tggtgcgatc tcggctcact gctacctcca cctccctggt 60360tcaagcgatt ctcctgtctc
agccccctga gtagctggga ttacaggcat gcgccaccac 60420gccggctaat tttgtatttt
tagtagagat ggggtttctc catgttggtc aggctgctct 60480cgaactccag acctcaggtg
atctgcctgc ctcggcctcc caaagtcctg ggattacagg 60540catgagccac cgtgcccagc
caggcccaga atcttaaaag aaggctctgc cagagaagag 60600tagttattag atgagaactc
ttcttcttct gtagcctgat gctttgttca gctttgttta 60660actcagtgtg gctcattata
cgtacttttc tcttcttggc caagttctcc tcttatgggt 60720atggagatga catgctctaa
atgctttggg agcaagcact cattagagaa gacttttgat 60780gtatccttat cttgttagta
gtttaagctt gtcagatcct taaagaatga caggcttagg 60840accatatccc ctagacttaa
gaggattctc attgaccatt tgttcagtgt ccatcactga 60900atcacttacc aaatacagtt
gacactctgt atccacaggt tccacaccca tagattcaac 60960caaatgctga ttggacatat
tcaggaaaaa aatgcattaa cactgcaaca ataaaaaata 61020atacaggcca ggagtggtgg
cttactctgt aatcccaaca ttttgggagg cccgggtggg 61080aggattgctt gaggccagga
gtttgagacc agcctgggca acacagggag accccatctc 61140tacaaaaaat aaaagtgaaa
aaattagcca agtgtggtgg ctatcaactt gggaggctaa 61200gatgagagga ttacttgagt
ctggattgag actgcagtga gctgtgatca ctctgctgca 61260ctctagcctg gggtgacaga
gtgagacccc gtctcaaaaa acaaaaaagt acagttaact 61320atttatatag tctttattag
gtattagata taagtaatct agagatggtt taaagtatgt 61380tggaggatgt gtgtaggttg
tatgcaaata ccatgtgatt ttatataagg gacttgagca 61440tcctgagatt tttgtgtcct
tgtgggtcct ggaaccaatc ccctgtggac accaagggac 61500aactgtacta accatgtgtc
agaaactgct acatgccaat tttggagaga agaaaaaagc 61560ttccaatctg tgtgctttcg
gtggatccta ttctgacagt ctgtccaatt ttgagaacac 61620tcattaattc ataagcagtg
aatgtgatta agtcgttcgc ctctgtgcta aatactcaat 61680gtaatagctg atagctgagt
gctataaaga aaatgaagca gggtattggg agaatgcatc 61740atggtggcaa ttttagaggg
gtggtcaggg aaacttcttg aggagtgaca tacatttaag 61800ttgtgactct tggcgaataa
tgtatccaga acacttacta tagtacctag cacttggtag 61860catttgaatt aatttgaaat
tcagtgtcct tctttctctc tcttaccctc ctccacatgt 61920caagtaattt ccaattataa
attttgtgtg tgtgtgtgag acggagtcca gccaggctgg 61980agtgcagtgg cgtaatcttg
gctcactgca acctccgcca cccgggttcc agagatcctc 62040ctgtctcagc ctcccaggta
gctgggacta cagatatgcg ccaccatgct tgggtaaatt 62100ttttttcttt tttttttttt
ttgagatgga gtctcgctct gttgcccagg ctggagtgcg 62160gtggcacgat ctcagctcat
tgcaacctct acctcctggg ttcaagtgat tctcctgcct 62220cagcctccca aatagctggg
attacaggtg cccgccacca cacctggcta atttttgtat 62280ttttagtaga gatggggttt
caccatgttt gccaggctgg tctggaactc ctgacctcag 62340gtgatccgac tgccttggcc
tcccaaagtg ctgggactgc aggcgtgagc caccatgtcc 62400tgccaatttt tgtattatta
gtagagatgg ggtttcacta tgttggccag gctggtcttg 62460aactgcagac cttaggtgat
ctgcccacct tggcctccca aagtgctggg atgacacgca 62520cgagtcaccg tgcctggcct
tcaattataa ttataagaaa ataaatttat ttttatatct 62580gaagtttaat aaaactaatt
ctttaaggaa atggatgtgg attaaactcc ttatgacata 62640gtaaacaatc ttatgagaga
cataagaatg tgagggaaga agtcctgtct cctcagggtg 62700aataaagtaa atattttggg
aggctgaggc gagcggatca tgaggtcagg agagcaagac 62760catcctgacc aacaaggtga
aaccccgtct ctactaaaat acaaaaaaat tagccaggtg 62820tggtggcgca cgcctgtagt
cccagctact tgggaggctg gggcaggata attgcttgaa 62880cccaggaggt ggaggttgca
gtgagccaag attgcaccac tgcactccag cctgctgaca 62940gagcaagact ctgtctcaag
aaaacaataa aattgaataa ataaataaat aaataaaata 63000aatatttgtg gaagataaaa
tgtgtttgta ggccgggcac tatggctcaa gcttataatc 63060ccaccacttt gggagaccaa
ggctggagga tcacttgagc ccaggagttt gaaatgagca 63120tggggtaaat agtgagaccc
tgtctaaatt taaaaaaaaa aaaaaaaaaa aaaaagtctt 63180tgtctatcct ttcccccagt
tttacttaca gaccaaattg gtatggattc tgagtcacca 63240cgatctgctt ggcaactctt
agtagagcct gagtgtgtgt gtgcctctga gaaggttact 63300ccgaagtact ttgagttttt
ttgtaactct ttgctattcc gactcttgat gtgaaatgtc 63360ttttatttat cattggctgg
tacttgtagg cctaggggat ggaaataaag gaattttctg 63420ctagcttgct ttgtcaaata
ttgttgggta tgtgtgcctt cgtgaagttg ctcaagatga 63480taaccaaggt ccctctagcc
ttttcctggt gcctagatca agctgttaaa cagtaggatg 63540ctctgcagca gtactgagct
ttgtggctgt ggtgaccgat cagggtatca cttaggcagc 63600agctgtctat ctggagaaat
aatttccaac aggtatgaag gtatgaatct gttagtctgt 63660accatcacca tttctgtcta
ggagaagggg gcagccagca agcactgtca ggcagagcct 63720ttcgttccac ccttcctgca
aagtgtattt ctagccctgt catatgccct tggctttctt 63780tgttgtcaag tctctgggag
attgagggta catattattt ccttctgctt tgtgtgccct 63840tgcactggga cttggggagg
ggagtaagaa gtattgtgtt aaaatgttaa tccctttcat 63900tggttgccca gttgtgagta
ctagccctct cagactgttg gcatttggta tgcagggatt 63960agcattttat gttctcaagt
atgctggtgt gatgcttatt gtctattatt tggccaaatt 64020agtcactaaa gtgcccttat
agaagataac tctgggagag gtatttattt ctctgaaatt 64080tttattctcc tttccccttt
cctttccttt cctttttctt tttctttttt tctttccttt 64140ttctcccctc ccccccctcc
cctctcctct tattggagac aaggtctccc tctgtcacct 64200acgctggagt gtagtggtac
aatcatggct cactgcggcc tcgatctctt gtgccgaagt 64260gatcctccca actcagttct
ctttagtagc tggaactacc accaccacag ctggctattt 64320tttttttttt ttttttgtag
aggcagggtt ttgcaacatt ccccaggctg gtcttgaact 64380cctggactca agcaatttac
ctatctcggc ctcccaaagc actgggattc caggtgtgag 64440ccactatgcc tggcctattt
ttaaattttt atttttttga gacttagggt tctgttctgt 64500tgctcaggct ggagtacagt
ggtacgatga gagctcattg cagctttgaa ctcctgggct 64560taagcaatcc tctcacctca
gccttctgag tagctggact acaggcacct gccaccatgt 64620tcggctaatt aaaaaaataa
caaactctgt tcgtaaagat ggggtcttgc tgtgttgctc 64680aggctgctct tgaactcctt
gcctcaagtg agcctcccac ctggacctgc caaattgctg 64740ggattataag catgagccac
tgcgcccagc cttactcacc tttttgtatg acactatcag 64800tctttctaaa gtgcaaagaa
aaagggttct gttatcatct gatgtgaaaa ttcctttaaa 64860cattgacttt ttctggtgtg
aggaatgaaa gctgtggaat acgtgaagtt ttatgaaata 64920gtgttttttt gtgtgtgtgt
caacaaaatt aagagagttt gggttattga agatacaaga 64980gtgtttttga aggtatatat
aggaaaccaa atctcaaatg tggtctgtcc ttgtgattaa 65040aattagagca atagggaagc
caggtgtgat ggctcacacc tgtaattcca gcacttttgc 65100aggctgtgac aggaggatca
cttgagccca ggagttgagt ccagcctggg taacatagca 65160agacctcatc tctacaaaac
attgttaaaa attagctggg tgtagtggca catgcctatt 65220gtcccagcta tttggaaggc
taaagtggga ggattgcttg agcctgggag gtcaaagcta 65280cagtgagccg tgattgtgcc
actgcactgc aacctgggcg acagagagat cctgcctcaa 65340aaaaaaaaaa aaaagcaaca
gagaaagctt atgtttttag tgatgagaat gctatttgtg 65400aggccatgat ggaaaaaatt
gaagaaccta gtttgttgga aacttaaatt ggtagtaaag 65460acataatact atctgaaaca
ctttagtact taaattgtgt gcattccaag caacaaaacc 65520aataatctgt aggttgaagg
ttgtagtgtt acctaaacaa ctatcacccc aaaaacactt 65580cattgaggag tatccagcat
cctagccaga gctcaactgt ataacttatg gctggaatca 65640tgccattctt gctggaaact
tcaatttcag tactttttcc ttatcaccct cagaagggta 65700gtagtagaaa catggggaac
tgcattctaa aatgagtgta taggttcata acctagctag 65760aaaaaaaaat taaaacaatt
aatgagtaca aaccaagggt tattgaagag tctcgctctc 65820aagagagttg gggtattcaa
gaaaattgaa agtgagttta aggatcgatg acttgattac 65880acattttggc tatttatcca
ctgattgaga cttttttttt tgagatggag tctcactggt 65940tcgcccaggc tgtagcgcag
gggtgcgatt tatccactga ttgagacttt tttttttttt 66000tttttcagat ggagtctcgc
tgtgtcgccc aggctgtagc acagaggtgc tcactgcaac 66060ctccgcctcc tgggttcaag
tgattctcct gccttagcct cccgagtaac tgggattaca 66120agcatgtgcc accacgcctg
gctaattttt gtattttcag tagaaatggg gtttcaccat 66180gttggccagg ctggtcttga
actcctcacc tcaggtgatc cgcccgcctc ggcctcccag 66240agtgctggga ttacacatgt
gagccactgt gcccagccca gtgattgaga ctcgactgga 66300catgaagcag tataatgtag
cagtataaca tagtattctg gaagcagact accgggggtt 66360gcatttcggc tccatcactt
tctaaggtgt acttgaacaa gtggcttaac ctctctgtgt 66420tttaacgtac tctcacacac
atctagggat taaataagtt aatgcatgta aggtgattag 66480aactggggct ggtggccggg
tgcggtggct catgcctgta atcctagcaa gttgggaggc 66540caagacgggc ggatcacgag
gtcaggagat ggagaccatc ctggctaaca tggtgaaacc 66600ccgtctctac taaaaataca
aaaaaattag ctgggcgtgg tggcgggcgc ctgtagtccc 66660agctacttgg gaggctgagg
caggagaatg gcgtgaactg ggaggcggag cttgcagtga 66720gccgagatcg caccactgca
ctccagcctg ggcgacagag tgagactcca tctcaaaaaa 66780aaaaaaaaag aactggggct
ggcacaaagt gaatgttgag tgcatctttg ttgttttcac 66840acaacttctc atctgaaaca
aagtcttaag ttacagcagc tctggtcttg gcttaatgga 66900gtatatggca aaaagaggat
ttggtggcag tgcctaggag gatttttttt tttcccatca 66960acaatacttc tcatttagcc
tgttgattga tacggattat caggggactc cttccagctt 67020ccctagttgg agtttttttt
tttttttttc cttttttgag acagggtctc attctgtctc 67080ctaggctgga gtgcagtggt
gcgatctcgg ctcactgcaa cctccgtttt tggggctcaa 67140gccactctca tgcctcagcc
tcccaagtag ctgtggctac agacacgtgc ctggctaatt 67200ttgtattttt gtagagacgg
ggttttgcca tattgcccag gctgatctcg aactcctgag 67260gtcaaagcga tctgcctacc
tcagcctccc aaagtgctgg attacaggag tgagctacca 67320tgtccggccc ttagtaggag
tttctgctgc cttagccttc aagagagaat cttaaatttt 67380cttttttttt tttgagacag
agtctggctc tgtcgcccag gttggagtgc ggtggcgtga 67440tctcggctca ctgcatgctc
cgcctcccgg gttcacacca ttctctcgcc tcagcctcct 67500gagtagctgg gactacaggc
gcctgccacc acacccggct aatttttttg tatttttagt 67560agagacgggg tttcaccatg
ttagccagga tggtctcgat ctcctgacct cgtgatccac 67620ccgcctcggc ctcccaaagt
gctgggatta caggcgtgag ccacctctcc cggccataag 67680aatcttaaat tttctaaaga
gaaagagcag gagacagaca gtaccacatg gagtatgttt 67740aggccatgta ggaaatctag
cctgtggctt taaaaccgta agttctaaat tagctgggta 67800tggtggtgca cacctgtagt
cctagctact ctggaggctg aggtaggagg atcacttgtg 67860cccaggagtt caaggttgca
gtgagctgtg atggtgtcac cgcactccag cctgggcaac 67920agaatgagat gctgtctctc
aaagcaaaac accctaagct ctgataacca gcccattatt 67980tgccacatct caggctcttt
aattatgaga ggtgctctaa acgactcatt ttaattctct 68040cgaatttgaa aaataaacat
ttatcatttg gcagttttaa gggaaccttc tgatatgtgt 68100cctacaatgg gtttataatt
atttttgtca caaatcatgg tttatttcta tggattaaag 68160tagtttagtt cttaatttgt
tctaaattgg aaatatacct atatgtttta acctcgtgct 68220tcagtgttgt cacatctcat
tagttcaggg gtcgtacaaa ggcatagttc agttagccat 68280cttgattata actttggttt
atgaccttat gtatgttcag atggtatagg gttcgtagca 68340cagaaagatt tagaattcca
gcttcattac ctcctggctc ttttgtaact tttttttttt 68400tttttttttt tttttgagac
ggatcttgct ctgttgtcca agctggagtg cagtggtgtg 68460atctgggctc aatgcaacct
ccacctcccg ggttaaagcg attctcctgc cttggcctcc 68520cgagtagctg ggattacggg
catacaccac cacgcccagc taatgtttat tttagtagag 68580atggggtttc accatgttgg
ccaggctgga cttgaactcc tgacctcagg tgatccaccc 68640accttggcct ttcaaagtgt
tgggattata ggcgtgagcc accgtgcctg gcctctcttt 68700tgtaacttct gaacctcagt
tttctcatct gtaaaatgag aggatgatca taataccacc 68760catagtgcag ttgtgaggtt
agagtatgta gtatatgtaa agtgatcagc atgataactg 68820gcatgtggta agtgctctgt
agtaaagggt gattcataac actggactct gcttggttgt 68880accaacttct cattttccct
ggctccttat ccacctcttg ggattcagag ttggctgaaa 68940gtggcaggca gtgctgcttt
gggtggcagc ttgattttag acagccagtt cacatagtgc 69000ttttgttcag gacctctcgg
gatttctaga cagacagcaa gagagttggg ctaacacctg 69060tcatgaagtg tctaaggaat
gagtgcacaa gcattcaggc atgtgagggc agaagaccat 69120gaccatacct gccttcctac
agtaaacagc ctgttgtttc tgcaggtagc attgcaggta 69180gttcttttat cagaaaattc
ttgtaggctg caggtgacat tgagtgttat taggtatctt 69240cttcattcaa gttgaacttg
gaggttacag tatatcttta tgtccccctc tccacaggtg 69300tttaagtgtt gtcattcatc
ctctagtgca tagattatgt gtgcacattt cttgttaagg 69360atattgatga actgatagtt
tatctagaat aatgtttatt ttatatttta ttttattgag 69420acagggtctt gctctatcac
ccaagctgga gtgcagcggc atgatcatgg ctcactgcag 69480cctcaacctc ctgggttcaa
gccatcctcc ctacctcagc cttctgaata gttgggacta 69540caggtgtgcg ccaccacacc
tggctaattt tgagggggta gaggggaggt acagatgaga 69600tctcactgtg ttgtccaggc
tggccttttg ctcctggact caagcagtcc tgcctcagac 69660tcacaaagtt ctggaattac
agatgtgagc cactgtaccc agcctagaat aattattatt 69720tatttttatt tttatttatt
tattttttga gacagagttt tgctcttgtt acccaggctg 69780gagtgcgatg gcacagtctt
ggctcactgc aacctctgcc tcccgggttc cagtgattct 69840cctgcctcag cctcccatgt
agctggaatt acaggcacac caccacacct ggctaatttt 69900tgtattttta gtagagacag
ggtttcacca tgttggccag gctgctctcg aactcctgac 69960ctcaggcaat ccacccgtct
cggcctccca aagtgctggg attacaggcg tgagtgatgg 70020cacccagcca gaataattag
ttttaatctc acagggtgag atttgtgagg ttaattttgt 70080atattaatga tgtatatatt
accaaaatct gtggtcaagt gaaatttgtg cttaatcttt 70140gcaaatgcta tttccaaagg
aaaatatgta ggagaaaagg tggtgtatca caggatgtag 70200agtagtggtt actgggcaca
agggtggccg gggagtcggg gggtggcagg agaggataga 70260gaatgataac tgattgatac
agggtctctt ttttgggatg aggaaaatat tttagaatta 70320aatagtgagg atggttgacc
aagcttgtgc atgtactaaa agccattaaa ttgtatatac 70380tttaaaacag tggattttat
ggtatgtgaa ttttatctca attttaaaaa aagtctttaa 70440atgtagtatg aaactttttt
taaggccagg cagggtggct cacacctgta atcccagcac 70500tttgggaggc tgaggcgggc
agatcacctg aggtcaggag ttctagacta gcctggccaa 70560catgatgaaa ccctgtctct
accaaaaata cgaaaattag cccagcatgg tggtgtgttc 70620ctgtagtccc agctactcgg
gaggctgagg caggagaatt gcttgaactc aggaggcaga 70680ggttgcagtg agctgagatt
gtaccactgc actccagcct gggcgacaga gcaagactgt 70740ctcaaaaaaa aaaaaaaaaa
aaaaaaagtt tttttagggt tccagcacaa tgggaatgag 70800tccagatcta aaataaagta
cagattcatt taccaccctc caccctaccc caacccccca 70860aaaagattgt ctatcagttt
gtcaggaagt tagagtaaaa tggtcttaaa atgcatcaag 70920agggctgggc acagtggctg
atgcctgtag tttcagctac tcaggaggct gagataggag 70980gatcacttga gcccaggaat
tcgagtgagc catgattaga tcactgcact ctagcctgaa 71040tgacagagca ataccttgtc
tcttaaaaaa aaaaaggcat gaagaatttt tttgctaatg 71100gtatctactt accacagagg
aacatttaag ctaaacatct gaaagattat ggatggagtt 71160ggtaacaggc tccatttgaa
ctggttatgt agtttatgct cagtaaggtt gaacggactt 71220tctgctttga gttattcaca
gttaaaaata aaggactatt ttgaagtaga ccgaaaatga 71280aaataacatt aagaaatcct
tggactaatt tttaggggag attcctgtaa tcggatggtt 71340tgtagttgtc aatgtagacc
tttcctggtt tcctgaaatt gctaatcaaa gctcaaagcc 71400atgggaaaag actggattgc
agctagaatg tgtgctctcc acatatgtct ttcttagagg 71460cctctttcaa gcagcattga
cactatggct atcatctttg accctcttag tatacagaga 71520gttgtaggtt ttcttttttt
aagggggaaa acattattga cataaattat atatcataaa 71580agtcactcat tttaactgta
caattcaatg attttttagt aaatttacca agttgtaaca 71640tttattatta taattagttt
tacaacattt ttcttttctt tctttttttt tttttctttt 71700tctttttttc tgggacacag
gatcttgctc tgttgcccaa gctgagtgca gtggcatgac 71760catggctcac tgcagcctcc
acctcccggg ctcaagcaat tctcccacct caacctcctg 71820agtagctgga actataagtt
ggaaccatcg tgcccagcta attttttatt ttttgtagag 71880agaaggtctt gctatattgt
ccaggttggt cttgaacttc taaactcaag caatccttcc 71940tgcctcacct tcccaaagtg
ctgggattac aggtgtgaac catcatgcct ggtctagaac 72000attttcatta cctcaatcgg
atccccgttt ggggatacat ttacattttt aattttttaa 72060tttttatttt ttttagagac
gaggtctcaa tctattgcca aggtggtctt gaactcctgg 72120tttcaagtga tcctcccacc
ttggtttccc gaagtgctgg gattacaggc atgaaccacc 72180atgcccagtc cattccaatt
ttttttttct ttttttttga gatagagcct cactctgtcg 72240cccaggctgg agtgcagtgg
cgtgatctca gctcactgca acctccacct cccgggttca 72300cgccattctc ctgcctcagc
ctcccgagta gctgggacta caggtgcctg ccaccacgcc 72360cggctaagtt tttgtatttg
tagtagagac ggggtttcac cgtgttagcc aggatggtct 72420caatctcctg accctgtgat
ccgcccgtct cagcctccca aagtgctgag attacaggcg 72480tgagccaccg tgcctggccc
attccaattt tttacaaaag tgatttcaga cttataaaaa 72540agctgcaaaa attcctgtgt
tcttttcacc tagattctac cttttttttt tttttttttt 72600ttgaggcgga gttttgctct
tgtttcccag gctggagtgc aatggcgcaa tctcggctca 72660ccacaacctc cccgtcccgg
gttcaagcaa ttctcctgcc tcagcctccc aagtaattgg 72720gattacagcc atgcgccacc
acgcctggct aattttatat tttttagtgg agaccaggtt 72780cctccatgtt ggtcaggctg
gtattgaact cccgacctca ggtgatctga ccacctgggc 72840ctcctaaagt gctgggatta
caggcgtgag ccaccgtgcc aggcccaccc agattcttct 72900tagcacattt gaatgcagat
ttttgaatag ttatgatcta ttctcattga aaaagggaca 72960tcatttgact tgacctccca
ccagactctt cctttgaggt tggatggagg tgcttaatgg 73020atgctgtgga tggtgtgtga
atttccattg ggttgagtgg atgatgtatg tggaaggcga 73080ttgggattta ctttgtcggt
gtctccaaga ggtcccccac tgggctttgt caggtgctgg 73140ggttggaggt caagaagtag
ggcaacatct aaagcttcta ctcctgggca ctgtgaggtt 73200tttataggtc ttttaaaaaa
aacagtgaat aggccgaacg cggtggctca cacctgtaat 73260cccagcactt tcagaggccg
agggaggcgg atcacgaggt caagagatca agaccatcct 73320ggcctcgtgg tgaaacccca
tctctactaa aaatacaaaa attagctggg catggtggca 73380catgtctgta gtcccagcta
ctcgggaggc tggagcagga taatcgcttg aaccctggag 73440gtggaggttg cagtgagccc
agatttcacc actgcactcc agcctggcga cagcgaggct 73500ctgtctcaaa aatatgttct
tccatgagac agcgggcatt tggatgcctg atacaaaaag 73560aggagggact atgtgctagt
cagctttaga ctgagaagca gcagcaacca tggcaaaggg 73620gaagcaaact ttcctgagtg
gccttaataa tgttattcgt caggcagtgg ctcttaaaca 73680ggggcttcaa gcagtgattt
ttgacatgct cttctcctcc ccaaccactg gacatttggc 73740aatgtctgga gacatttttg
gttgtcacca ctgggagagg gtgctactgg tatctagtga 73800atagagccag ggatgctgct
aaacatccta cagtgcaaag ggcagctctc cacacaaaga 73860atcatctggc ccaaaaatct
ctattgctga ggttgaaaaa tactggtgta aggagacaag 73920agttgtggtt agtcagaaag
gatgacctgg cttgccgtgg attgtcttat aataatcagt 73980tatctctttc cttgccttat
tcctggtccc aacagagtga ggattggcaa gggggtttgg 74040gaatatagtg ggaatgctgt
gtagtgagag tgcaggcacg gcactccaga ctaccagtca 74100cgagcttagc ctgtgtcctt
ggggtaggag ctgtagaata agacctattt tgatatgtgg 74160accagaataa gttctttaaa
taatcaaagg taataaacat tcttaaaata tactatcact 74220aaggtagtct gtcatccagc
agaatgaggg agtagtcaga agattacaca tatttggcag 74280caattactag aaaaaacaaa
caagttgaga gttttcaaaa tagatgttac ttcatatttc 74340agatagtttt ccagggaata
ttgaaaatgc aagtgcagat tttcacatcc ttctttatac 74400tgattaaaac atttgaatct
attggatcat cttttcatta ggctttactt cacagggcca 74460tctactggat cctgtatgct
gatatagtta aggggactga cctcaaagta aaagatgcat 74520atattttatc ttaatacaat
atcactttgc tgtgaagggg agctgctgtg tatatagaat 74580gctgtgtaat agtgattggg
ctgttgggaa tcacattgga aatatcagta agcaactcat 74640tttaactttt gttaacacag
ttaagtgctg agcacctctt gtgtttgaag ctctgtgcta 74700ggtaatatgt gttcattaat
gaatgaaaaa acaatacaaa aattagccag gcatggtggc 74760gtacacctgc agtcccagct
actcaggagg ctgaggcaca agaattgctt gaacccagaa 74820ggtggaggtt gcggtgagcc
gagatcacgc cactgtactc cagcctggcc aacagagtga 74880gactgtctca aaaaaaaaaa
aaaaaaaaaa aaaagttttt tatttttaaa ttttttgttt 74940tatttctttt ttactttttt
ttcttttgag acagagtcac gctctgtcac ccaagctgga 75000gtgcagtagc accatcttgg
ctcactgcaa ccccccgcct gccaggttca agtggttgtc 75060ctgcttcagc ctcccaagta
gctgggacta caggtaccca ccaccacgcc cggctaattt 75120ttgtattttt agcagaggcg
gggtttcacc atattggcca ggctggtctc aaactcctga 75180ccttatggtc tgcccgcctc
agcctcccaa agtgctggga ttacaagcat gagccactgt 75240gcctggcaaa atttttattt
tattattatt attatttttt tttttttttt tgagatggag 75300cctcgctctg ttgcccaggc
tggagtgcag tggcgcgatc tcggatcact gcaagctccg 75360cctcctgggt tcatgccatt
ctcctgcctc agcctcctga gtagctggga ctacaggcgc 75420gtgccaccac gcccggctaa
ttttttgaat ttttttagta gaggcggggt ttcaccatgt 75480tagccaggat ggtctccatc
tcctgacctc gtgatccacc tgcctcagcc tcccaaagtg 75540ctgggattac aggcgtgagc
caccgctccc ggccaatttt tattttattt ttaattgata 75600attgtacatg tttatggagt
acccatgtta tgatacatgt gcacattgta gaataatttt 75660taattgataa ttgtatacgt
ttatggagta cccacgttat gatacatgtg tacattgtag 75720aatgattgaa tcagactagt
taacatatcc atcacctcat gtagttattt ctttgtagtg 75780agaacattta aaatctcttt
tagcaatttt gaaatagata caatacattg ttattaacta 75840tagtcaccat gctgtgcaat
agataactaa aacttcttcc tcctgtctga ctgaaacttt 75900atactctttg actaacattc
tcccgttctc ctccacccgc cttctccacc cacggcctct 75960ggtaaaccac cattctgctc
tctacttctg tctgaatatt tgattttttt agattgcaca 76020tgtgagatca tgcagtattt
gtctttctgt acctagttta taatacactt agctaagtgt 76080ccttcatgtt tttccacatg
tcgcaaatgg cagaatttcc ttctttttta aggccaaata 76140gtatttcatt gtgcttacat
accacatttt cattatccat tcattcattg atgggcaatg 76200gatgaatgga tatcatggct
attgtgaata gtactgcagt gaacatggga atgcaggtat 76260ctctcagaca taatgatttc
agtttcattg gatatatact gtacccaaaa gtgggactgc 76320tagatcatat ggtgattctc
gttttagttt tttttttttt aagaacctcc atacagtttc 76380caaaatatct gtactaattt
acattcccac agtgtaaagg gttccctttt ctccatatcc 76440tcactaacac ttgttaccgt
tcatcttttt tatagtaacc atgctaacaa gtatgaggtg 76500acatctcatt atggttttgt
ttgtttgttt gagacagtgt cttgctgcat cacacaggct 76560ggagttcagt ggcgtgatcc
cagctcattt gcagccttaa cttcctgcac tcaagcagtc 76620ctcccacctc agcctcccag
gtagctggtg tgtcaccatg cctagcgttt tttttttttt 76680ttttttgaga cagagtctcg
ctgtgttgcc caggctggag tgcagtggta tgacctcggc 76740ttactgcaat ctctgcctcc
cgggttcaag taattctcat gcctcagcct cctgagtagt 76800tgagattaca ggcatgtgcc
accacaccca gttaactttt gtatttttag tagagatgag 76860gtttcattat gttgtccggg
ctggtcttga actcctaggc tcaagtgatc ctcccacctt 76920ggtttctgaa agtgctggga
ttaccagcat gaaccactat gcccagctcc ttatggtttt 76980aatttgtaat tctctgataa
ttattgatgt tgaacatttt gtcatatatt ttttggcaat 77040tttttttctt cttttaaaaa
ttttgttttt agccataagg ccaggaatgc acgtatgtct 77100tctttcaaga aatgtctggg
ctgggcacag tggctcacgc ctgtaatccc aacactttgg 77160gaggccgagg cgggtggatc
acgaggtcag gagatcgaga ccatcctggc taacatggtg 77220aaaccccgtt tctactaaaa
atacaaaaaa attagctggg tgtggtggtg ggcgcctgaa 77280gtcccagcta tgtgggaggc
tgaggcagga gaatggcgtg aacccaggag gtggagcgtg 77340cagtgagcca agatcgcgcc
actgcactcc agcctgggcg acagagcaag actctgtctc 77400aaaaaaaaaa aaaagaaaaa
gaaaaaaaaa tgtctattca ggtcctttgc ccatttttta 77460atagggttat ttgttttcat
tattgagtag tttgagttct ttgtacattt tggatattag 77520ccctttatca gatggaagat
ttgtaagtat tttctctcaa tctgtgcatt gtttcttcac 77580tttgttaatt gtttccttgc
tttgcagaag ctttttagtt tgacgcaatt ccatttgtct 77640gtttttgctt ttgttgcctg
gcctttgggg gtcatgcaca agaaatcatt gcctagacca 77700gtgttgtgga gctttccaac
tatagtttct tctagtagtt ttacaatttc tgttcttaca 77760tgaagctatg aacagttcct
gtatagttat ccctgccacc cttctcccaa cattacatac 77820acagcctccc caactatcag
catcctgcag tgtagtgtat atgttacaat cagtgaagca 77880acattgatac atcattatca
agggttcact ctgggtgttg taccttctat gggtttccac 77940aaatgtatgt catatatcca
ccattatagt atcatacaga atagtttcat tgccctagaa 78000accctctttt ctccacctgt
ttgttctttc ctcttgcaaa cccctgcaac cactgaactt 78060tttattgtcc gtgtagtttt
gccttttgca gaattttata tagttggaat tggacaatat 78120gtagcctttt cagattggct
tctttcattt agtagtacat ttctctatgt agtctcattc 78180ctctatgtct ttttgtggtt
tgatagctca tttcttttta gcactgaata atatcccatt 78240gtatggatat atcacagttt
attcattcac ctactaaatg acattttggt tgcttccatg 78300ttttgacagt tacgaataaa
gctgcaataa atatccatat gcatgttttt gtacggacat 78360acgttttcaa ctagtttggg
taaatacaag gggcatgatt actggatcgt atggtaggag 78420tgtgtttttt tttttttttt
tttttttttt ttttgacacg gagccttgct ctgtcaccag 78480ctggagtgca gtggtgcgat
ctcggttcat tgcaacctct gcctcccagg ttcaagtgat 78540tcttctgcct cagcctccca
agtagctggg actacaggtg catgaccatg cccagctaat 78600tttttgtatt tttagtagag
acagggtttc aacatgttgg ccaggatggt cttgatcttg 78660tgacctcgtg attcgtccac
ctcggcctcc caaagtgttg ggattacagg cgtaagccac 78720tgcacccagc ctgtagagta
tgtttaattt tgtaagaaac tgtcaaacag tttttccaaa 78780gtagcgatta caatttgcat
tgctaccagc aatgaattag agttctgttg ctctgtatcc 78840ttgccagcat ttggatggta
gccattttta tttttattta tttatttttt ttttttgaga 78900caaggtcttg ctctttcacc
caggctggag tacagttgga cgatctcagc tcactgcagc 78960ctccgcctcc caggttcaag
ttattctcct gcctcagcgt tctgcatagc tgggattaca 79020ggcacgcacc accacaccca
gctaattttt gtatttttag tttcaccatg ttggctaaga 79080tggtcttgaa ctcctgacct
taggtgatct gccccgcctt ggcctcctga attgctggga 79140ttacaggcat gagccaccat
gcctggcctc ctttgggtat ttctattgga cagtcatgtc 79200attcatgaat aaagacaatt
ttatttcttc ctttctaatc catatacctt ttatgtcctt 79260ttcttggctt attgcactag
ctaggatttc tagtacaatg ctgaaaggag ctgtctttct 79320cttcttttct ctcctttcct
tgccttttcc ttttcttctt tttctttctt ttcttcctat 79380agagataggg tctcgctatg
ttgccaaaac tggtctccag ctcttgggcc caggtgatcc 79440tcccacctca gcctcccaaa
gtgctgggat tacaggtgtg agccaccaca cctagctgaa 79500aaggagctgt tgagaataca
tccttgtctt gttcctgatg ttagtgggaa gaaagcatct 79560agtctctcac cataagtgtg
atgttagcta taggtttatc aagttgagga ggttcccctc 79620tgttcctagt ttgctgagag
gttttttttt ttaaatcatg aaaggggatt ggatttttgt 79680caaatgattt ttctgcatct
attggtatgt tcatgttaat ttcttcttca gcatgtcgat 79740gtgatggatt acattaattg
attttttttt ttttttttag atgcagggtc tcactctgtt 79800gcccaggcta gagtgcagtg
gcacaatcac agctcactat aacctcaagt tcctcagctc 79860aagcaacttt cccatctcag
ctttccaagt agctaggact acaggcacat accaccatac 79920ccatctagtt ttttaaaaca
ttatttgtaa agatgaagtc tctctatttt gtccaggctg 79980gtctggaact cctgggcggg
ctcaagcagt cttcaccttg gcctcccaat ttgtttggat 80040tacaggtgtg agccactatg
cccagcctca tttttgttat tagtaatttg tatcttcttt 80100ctttttttct tagactggtt
aaatgtttat caattttatt gatcttttca aagaaccaac 80160ttttggtttc actgatttat
ctctattgat ttactgtttt caatttcatt gacttcagct 80220ctaattttta ttattttctt
ctgcttactt ttgatttaat ttgctctttt actggtttcc 80280taaagtggaa gctcagatta
ttgattttta gatttttctt ctcttttaat atatgcattc 80340agtgctataa atttccctct
cagcactgct ttttgtgtat cgcacaaatt ttgataagtt 80400gtgtttttca ttatcgttta
cagttgtgtg ttaatcccca tacagttaat gatggggata 80460aattctgaga aatgcactct
taggcaattt tgtctttgtg caaataccat ggagtgtaca 80520tacacaaacc taaatggtat
agcctgctac ccacctaggc tatatcattt agcctattgc 80580tccttaactg caaacctgta
caacttgtta ccatattgta tatgataggc agttgtgaca 80640cagtagtatc taaagataga
aacggtacag tgaaaataca gtatttcagt attttgggac 80700caccatcata tatgcaagcc
cattgttgac tgagatgtca ttatacagca tctgaccata 80760attcggaata tttttaaatt
cctcttgaga tttcttcttt agcttgtgtg ttatttagaa 80820gtatgttttt aaatctccat
atactttggg atttttacaa ctatattact gttactgact 80880tctagtttaa ttctattgtg
atctgagagc atatattatt ttttctgtca ttttaaactg 80940gaaaaggtat gttttatggc
ccataatgtg ctgcgtgagc ttgaagagaa tatgtagttc 81000gctgttgctg gatgaaatag
tctacaaatg ttgattagat tgctgctgtt attttgatgc 81060gtatccttcc agatttttct
atgcatgtat catctatctg tgtatctatc tgtaggatag 81120gagagtcttg tacaaatggt
tttataactc tttaacttca aatattgtgg acttacttcc 81180ttgtcattaa atacatttaa
ggctgggtgc agtggctcat acctgtaatc ctagcacttt 81240gggaggccga aacaggcaga
tcacctgagg tcaggagttt gagaccagcc tagccaacat 81300gttgaaaccc cgtctctact
aaaaatacaa aaattagctg ggtgtggtgg cacacgcctg 81360taatcccagc tgctcaagag
gctgaggcac gaaaatcggt tgaacccaag gaggcggagg 81420ttgcggtgaa ccaagattgc
gccagtgcac tccagcctgg gtgacagagc aaaactttgt 81480ctctaaataa ataaataaac
aaataaaata catacctatg tacatacata cattttaaga 81540atcattttga tatattcatc
tccatactga ggaatttaag tgcttttttt tttttttttt 81600tttttttttt tttgagacag
agtctcactt tgttgcccag gctggagtgt ggcggcacga 81660tcttggctca ctgcaacctc
tctacctcct gggttcagga aattctcctg cctagccggg 81720tgagatttcc tctttagctt
gtgtgttatt tagaagcatg tttttgtacc tatcgtagct 81780tctctagaga agggaggtag
gagaatcgct tgagcccggg aggtcaaggc tgcagtgact 81840gacccatgac catgccactg
cactgtagcc tgggtgacag agtgagcccc tgtctcaaaa 81900aggaaaaaaa agaaatcagc
atattttatg acttaataaa tgtattcaaa ttccatccag 81960atatttccta atttattatt
ttactaacag tgtttgagag cacttgtctc ccctgccttc 82020caaccagtgt caagtgtatt
ttaacaaaat acttgtattg ggtagtagta catggttggt 82080tgttactctc taatcgcctg
ttgtgtttga aatatttaat aattttttta atgttgctag 82140tgtagtgaag aagataatga
tttagttttt cttctttctt tttttttttt gagatggagt 82200ttcacccttg ttgcccaggc
tagagtgcaa tggtgcgatc tcagctcacc aaaacctctg 82260cctcccgggt tcaagtgatt
ctcctgcctc agcttcccga gtagctggga ttataggctc 82320atgtcaccac gcctggctaa
ttttgtattt ttagtagaga cagggtttct ccatgttggt 82380caggctggtc gcgaactccc
gatcttaggt gatctgccta ctttggcctc ccaaagtgct 82440gggattacag gcgtgagcca
ccgcacctga caaatgatgt agtttttctc ccttaggtta 82500ttagtaggca gaatagtttt
acatttgatt attagttatt catatttctt ttgtgacttg 82560ttggttctta atatatctat
tcagccaaaa atgaaaaata ggatatctta gcctgtctag 82620tcttaaggta aatatatgtg
ggatataagg gagtttgggg gctgggcgca gtgactcaca 82680cctgtaatcc cagcacgttg
ggaagctgag gtgggctgat cacttgagcc caggagttca 82740agaccagcct gggcaatgta
gcaaaacccc atctctacca aaagtacaaa aattagccag 82800gtacagtggc acatacctgt
attcccagct actagggagg ctgagatgga aggatagctt 82860gagcccaaga ggttgaggct
gcagtgagct ataagcatgc cccactacat tccagcctgg 82920gtgacagagc gagaccctgt
ctcaaaaaaa agattttttt gaaaagttga aaatgagtat 82980attcgctgaa tacgagatga
gttttcccaa gaatttatcc ctcagaatct ttcacgttct 83040tcctcctcct tctcctcctc
ctgctttctt cttcttcttt cttctttttc tgtttcttct 83100tcttgctttt ataaagtctt
agctcctgtg gagttttctc tcagttactt cttatttatt 83160tatttgagac agagtttcac
tcttgttgcc caggctggag tacagtggcg cgatctcggc 83220tgactgcaac ctccgcctcc
tgggttcaag ctattctcct gtttcagcat cccaagtagc 83280tgggattaca ggtgcctgcc
accacacctg actaatttct gttacttctt ttgagccaca 83340aagtatttga aaaagatgca
ttaagtagtg accgcagtcc gtgctagtat tgggtgctta 83400cagaggtcta gtagaatacc
gtgttttaaa aggaggtgaa tttaataatt gctgtgatta 83460ctctggcatt atacgctcac
aaataaaatg tttggtgatt tttttttttt ttttttttgg 83520agacagattc ttgctctgtc
acccaggctg tgcaatgatg tgatctcagc ttactgcaac 83580ctccgagttc aagtgattct
cgtgcctcag cctctcgagt agctgggatt acaggcaccc 83640gccatcatgc ctggctaatt
tttgtatttt tgtagagatg gggtttcacc atgttggcca 83700ggctggtctt gaactcctga
cttcaggtga tccacccatc tcagcctccc aaagtgctgg 83760gattacaggt gtgagccact
gctcccagcc gggtgtgata tttttaataa aacaagtatt 83820caaattcact tacaggacca
atgaaagaat cgtttgtcgt aattttatgc caaagggtac 83880ttgtggctta agataaactt
cccataatga cattatccac agattcaaaa agtagtttat 83940cttaaacaac ttctgtgaca
ttttaaaatg atgtggctta gaaaattgct aggttatcta 84000aaatggctct attgatgatg
taaatgtagc acatgaagag cttgaataaa atagactttt 84060gaagtgtgca aatggaaaga
acagtccttc taaataatta tttcccctcc cttttattga 84120cgtatacata cagaaaagat
atcatgtcgt aagtgtattg cttagtgaat tactccaaag 84180ttggatatac ctggttaacc
accacctgaa tgaaaaaaac agaacactgc ttcatatgga 84240gaagcccctc ctgcccctcc
tggtcattgt ccttttcatc cctcccacag gtagtcactg 84300agttctaata ccacagagtc
ttttgacttt cttttgagcc ttatgtaatt agaatcacaa 84360aagatgtatt cttttgcctg
acttttatac ttagtattgt ttttgaaatt catcttgtgt 84420gtaactgcga tttgttcatt
ttcattgctt agtgaattat tccaaagttg gatatacctg 84480gttaaccacc acccgaatga
aaaaaacagt ttttggccgg gcacgatggc tcacgcctgt 84540tatcccagca ctttgggagg
ctgaagcgtg cagattacga ggtcaggaga tcaagaccat 84600cctggctaac acggtgaaac
cccgtctcta ctaaaaatac aaaaaattag ctgggcgtgg 84660tgacgggccc ctgtagtccc
agctactcag gaggctgagg caggacacct gtaatcccag 84720ctacttgaga tgctgaaaca
ggagagtggc gtgaacttgg gagatggagc ttgcagtgag 84780ccgagattgc gccactgcac
tccagcctgg gcgacagagc aagactccgt ctcaaaaaac 84840aaaaaacaaa aaacaagaaa
acagttttcc agtctaagaa tgtattacaa tttattcaaa 84900ttccactcta gatggactgt
gggttttttt ttttccccca tttggagcta tggcaaatga 84960tgttttttca aagttgttat
ttctcagcca ggcgcggtgg ctcacgcctg taatcccagt 85020actttgggag actgaggtgg
gcagatcacc tgaggtcaga agcaagacca gcctggctaa 85080catggcgaaa ccccgtcttt
tctaaaaata caaaaattag ccaggtgtgg tgatgggcac 85140ctgtaatccc agctacacag
gaggctgagg caggataatc acttgaaccc aggaggtaga 85200ggttgcagtg agctgagatc
acaccactgc actccagcct gggtgacaga gcgagactct 85260atctcaaaaa agaaaacaaa
acaccacgga attgttattt ctcttggcga ataggtagat 85320gcacttattc ctgttaatat
atacctacct gtgaatgtgc ttgttggatt ttctatgtat 85380cttctgtctg ccacctagaa
atttaacctt ttatatatat acaactttaa tttttttttt 85440ttttttttta agagacaggg
tgtcactatg ttgcccaggc tggttgggaa ctcctggcct 85500taagccgtcc tcctgcttca
gtctcccaaa gtgttgggaa tataggcgtg agccactgtg 85560ccccactgtt caagttttca
ttgattgctg cctacatata gttgttcaac agctattgat 85620tccccctgct ctgtatatat
gtctcctagt gtaggtatca gggttacagc agtaattaag 85680accacattat ttcattttat
catttaaata tataagacta attgataaat taagtataga 85740actttgacca acatggtgaa
accccatctc tactagaaat acaaaaatta gctgggtgtg 85800gtggcagacg cctgtaatcc
cagctactca ggaggccgag gcagaactgc ttggagatgg 85860aggttgcagt gaaccaatat
cagaccacta tactccagct tggatgacag agggagactt 85920tgtctctttt tttttttctt
tttttttgag acggaatctc gccgtcttcc aggctggagt 85980gcagtggcac gatctcggct
cactgcagcc tccgcctccc gggttcaagc gattcttcta 86040cctcagcctt ccgagtagct
gggattacag gcacccacca ccatgcccgg ctaatttttg 86100tatttttagt agacagggtt
tcaccatgtt ggccaggctg gtctcaaacc cctgacctca 86160agggatcaac ctgctttggt
ctcccaaagt gctaggatta taggcgtgag ccactgtgcc 86220cggccctttt tttttttttt
ggagacagaa tttcgcccag ttgccagact ggagtgcagt 86280ggcacgatct cagctcactg
caacctctgc ttcatgggtt caagccattt tcctgcctca 86340gcctcccaaa tagctgggac
tacaggcatg caccaccacg tctggctaat tttttgtatt 86400tttagtaaag ccagagtccc
aaagtgctgg gactaggcag gcgtgaacca ccacgcctgg 86460ccaagactct gtctctcaaa
aaaaaaaaaa agaaaaaaaa atataggact ttgggaggcc 86520gaggcaggca gatcacctga
ggtcaaaagt ttgagaccag cctgactaac atggtgaatc 86580cccatatcta ccaaaaaata
caaaaattag gcaggtgtgg tggcgtgcac ctgtagtccc 86640agctattggg gaagccgagg
tgggagattg tacctgggag gcagtgagca gagatcgcac 86700cactgcactc cagcctgggt
gacagagtga gaccttgtct caccaaaaaa aaaaaaaaaa 86760aaaaaatagc ataggtaggc
atttgatgat ttgatgattt cattcgcatc cctaaaagtt 86820tatttgttcc tgggtcgtca
gatagctttt tggccatctt cctgttgaga aaattgatgt 86880acccttctgg agtcctccaa
ttttccatta taatatggta agtgggagct agagctttgg 86940gtaagaattg ggatgtgata
aggaggatga gttttgcagt ggtgtgcatg gttaggagga 87000gaaaaagctg gaggcagagt
gttcacttag aggcttgggg taggaggggt aggtttaagt 87060ggtgctcatc tgggccagaa
tagggcaaaa agggaagaat gaaataacca gatgtctttg 87120ctttgtcagt agtcttgcag
ccctgaaagc tttttttgtt gtgttatatt tgttgtaatt 87180gaggtataat ccacataaca
taaaacttac ctctttcaag tgtacaattt agtagttttt 87240agtatattca taaaattgtg
caactatcac cactgatacc agaacatttc tgggaacaaa 87300aagaaactat atatccatta
agagtcactc tccattttct cctacttcct tctctacccc 87360cagtcatctg ctagtcggct
ttctgtctct atagatttgc ctgctctgga tatttcatat 87420aaatggaatc atataccata
tggtcttttg tgactggctt cttttactta gcctaatgtt 87480tttaaggttc atccatgtta
tatgaatcag tacttaaatc atttataggg ttgaataata 87540ttccatcata tggatatacc
acattgtctt tatctgctca ttaattggta gacatttagg 87600ttgtttccac ttttgtttat
tatgaataat actattcaca ttcatgtaca aggttttgtg 87660tggacacatt ttcagttctc
ttcgatatat accaaagagc cacaatgcta aaacttccag 87720ctttttacca gctatcccca
gatgcgtagc ctagtaagcc ccatgttgga gtggtgtagt 87780gttgaaaaca tggcatactc
atacattaga taaccaggtt tcaattctgg tttggaagcc 87840tttggatatt tgcattaccc
atttgaattc tctcttgggc tgtgtttggt ttggggtttt 87900gtacttgttt tttttttttt
aactagatgt tttgaggcac ttggtactgt ggacatgtgt 87960cagtcttaaa tatttgggtt
ttgagcatat caagggcttg gtttgcagtt gacagttgaa 88020tagcagtctt cttccttcca
ttccttacag attctcctgt tcagagtcaa ccattgaata 88080gcatatttat tgtttctgcc
tgtgtgtctg ttagtgctca tatggtctag ttcctgagtt 88140aagaagtata gggtagtggt
catctttttt ctttgacttg attcctgcgt actgtgaatg 88200cagagcaatg caggatatgt
tgggttttct acaaacagag catcagccca gagacatgtt 88260tgcatttgtt tctgtcaggt
ttcctggctc aactggcacc ctttaaggcc agagaacgtt 88320agtttaggca cttttcctag
taaaatactt cttgtggctc ttcctgtgta cttggaataa 88380aggaggcatt ccattgttag
acatgcttgg gtagttcagg gtaatcttag agtcatgaga 88440gatatgatat aaaggaataa
ctagctaaac cagaaaaaat gcctgggtaa tgactagcaa 88500ataggtggtc aacagatgtc
ctcattagat tgaaaggtcc atgaaagcag ggactatttc 88560ttttctttac tgcttaaaaa
ggttagaact ggacctggca acatatgatg agctaaataa 88620atacatattt gtgaattggg
ttaacacata ttgcataaag tggttttggc tctgttttat 88680tcttcataag ccctagtgat
ctttttaatt tctgtaaaat gtggtcttga cccccccaac 88740ccaagtgacc tccttatttg
ctaggctctg atatttctgt taggtttcta ctgtattttc 88800tgagatagca attagtagat
actatttctc ctttgatgga gctagccata tattcttgtt 88860tgttcatttt agctttcaaa
tttctgtctg attcttgttc ttttactctg gaatgtagtg 88920aatggaatga cttggaaggt
acaaggtagg tcagtttagg ttgtctaggg ccttgcattt 88980aaaagtttaa tttgatgaca
tggtggatta caagaatgta acagtatcaa aatgatacta 89040tcttcttgtg gtggtatgta
gacttaaaaa gagaaactgc agagaaaagg gtcccttagg 89100atgtagagca gcagttgata
tgtgagaagt tgatgccttg cattagggat taggagtaga 89160tgtggaagga agagatcagg
tttgaaagag tttaaacaaa gaatctctag gatttgataa 89220cactggatat cagaggggaa
ggtacaagag agagggcaga atcaaaggcc actcagaggt 89280taaaggaatc ataccggttt
ggcatggtgg ctcacgcctg tcatcccagc actttgggag 89340gctgaggcgg gcagatcacg
aggtcaggag ttcgagacca gcctagccaa tatggcgaaa 89400ccccgtctct actaaaaata
caaaaattag ctgggcgtgg tggcgtgtac ctgtaggccc 89460agctactcag gagactgagg
cagaagaatc acttgaaccc aggaggcaga ggttgcagtg 89520agccgagatc gtgccactgc
actccagcca gggcgacaga gcgagactct gtctcaaaaa 89580ataataataa taataaataa
ataaaggagt aattccaaca cttgggaggc cgaggcagga 89640ggattgcttg agcccaggag
ttcaagacca gcctgggcaa catagtaaaa cctcatcgct 89700ataaaaattt tttaaaaaga
aatttagcca ggcatggtgg tgtgcccctg tagttcccat 89760tactagagag gttgaggtgg
aaggatctct tgaacccaag aggtcgagag tacagtgagc 89820catgatgcac cagggcactc
cagcatgggc aacagagtga gactttggga ggccatggca 89880gaaggattgc ttgagcccag
gagttcgaga ccagcctggg caatgtagtg ggaccttgtc 89940tctataaaaa ttttacaaat
atatataaaa gctgggcatg ggggcacgtg cctgtagtcc 90000cagtgactgg tgggtggggc
gggggtgagg tgggagaatc acttgggccc aggaagtcga 90060gattgcagtg agccatgatc
atgccactgc tctctagcct gggtgacaga gtgagactct 90120ttttgtctta aaaaaaaaaa
aaaaaaaaaa aaaaaatggt tgtaccttga acagatacaa 90180agcatgtaga agaggaaagc
atttgggagg gagaataatt ggttggatac attaagtgtc 90240aagtgacagt aggacctcta
gaaatacaca aacagagctc cacaggtttt ttcattgtca 90300tttcttatac cttttgttcc
actacctact tttttcctac aactttctgt ttattttata 90360gtttatgaat tttaagcaaa
atacttcctt ctgcctctta ccagtaattt tcaaaagcgt 90420ctgtattggt taggattaga
tttggctggg aatgacagaa aactaaaaat aaaagcagtt 90480taaacaagtt tatttctctc
taatgcaaat gaagtttgag ctgtccaggc tttcttatgg 90540tggtttggtc atgatcaggg
acccaggttc tttcaaccat gtagccccat cttaacatgt 90600gatttctatc ttattgttca
agatggctat ttgagtgtca gttatcagtt ttatttagca 90660accaatggga aggaaggggg
atgaaaatgg gccctgtctt taaggatact tcctggacat 90720agtgagtaga aggatggtta
ccagagtatg ggaagggtag ttagggggct ggggggaagg 90780tgggaatggt aaaggggtat
aaaaaaggta gaatgagtaa gaccatcaga gaaatgcaaa 90840tcaaaaccac aatgatatag
gtggctcacg cctatatgta tctcacacca gttagaatag 90900tgatcagtaa aaagccagga
aacaacaggt gctggagagg atgtggagaa acaggaacac 90960ttttacactg ttggtgggac
tgtaaactag ttcagccatt gtggaagaca gtgtggcgat 91020tcctcaagga tctagaacta
gaaataccat ttgacccagc catcccatta ctgggtatat 91080acccaaagga ttataaatca
tgctgctata aagacacatg cacatgtatg tttattgcgg 91140cactattcac aatagcaaag
acttggaacc aatccaaatg tccatcaatg atagactgga 91200ttaagaaaat gtggcacata
tacaccatgg aatactatgc agcaataaaa aaggatgagt 91260tcatgtcctt tgtagggaca
tggatgaagc tgtaaaccat cattctgagc aaactatcta 91320agggcagaaa accggacacc
acatgttctc acttatacgt gggaattgaa caatgagaac 91380acttggacac agagcgggga
acatcacaca ctggggcctg tcgtggggtg ggggaggggg 91440gagtgatagc attaggagat
atacttaatg taaatgacga gttaatgggt gcagcacacc 91500aacatggcac atgtatacat
gtgtaacaaa cctgcacatt gtgcaccatg taccctagaa 91560cttaaagtat aaaaaaaaag
acctactatt tgataccaca atagggtgag tatagtcaat 91620aatgacttaa ttgtacattt
taaaataaca taaaaagaaa aaaataaaat aatgcagagt 91680ataatttgat tggttgtaac
tcaaaagata aatgcatgag gggatggata ctctattccc 91740catgatatgc ttatttcaca
ttgcatgcct gtatcaaaac atctcctgta ctccataaat 91800aaatacacct actatgtatc
cacaaaaatt tcttaaaaaa ggatactttt gagcgtttca 91860agcattactt ctagttatgt
tcagttgatc agaatttagt catagccaca cttcagcttc 91920aaggagggct gcagaacgtc
tttattttag gcagctatgt gcccagttaa aaagcagatt 91980ttctcccaag gtaaagagag
cagataggca ttaggagact actagtagtc ttttaatttt 92040ccaggccggg cacggtggct
cacacctgta atcccagcac tttgggaggt cgaggcaggc 92100ggatcatgag atcaagagat
ggagaccatc ctggccaaca tggtgaaacc ccatctctac 92160taaaaaaaat acaaaaatta
gctgggcgtg gtggtgcgtg cctgtagtcc aagctactca 92220ggaggctgag gcaggagaat
tggttgaacc caggaggtgg aggttgcagt gagcgaaggt 92280cgtgccattg cgctccagcc
tggcaacagg gcgagactcc atctcaaaaa aaaaaaaaaa 92340aaagcaggga tttgctccca
aggtaagaga gcaaatagac attgggagac tattagtagt 92400ctcttaattt cccagaatga
gaaccagatt ctttccggtt acagaactcg tttctccaaa 92460cattaattat tcttataata
attttaaaaa atactaaata tataattatc accagccaaa 92520tgcttctttt aagaaataga
gacagggggc cgggcacggt ggctcacgcc tataatccca 92580gcactttggg aggccgaggc
aggtggatca cctaaggtca gagttcgaga ctagcctggc 92640caacatgggg aaaccctgtc
tctactaaaa atacaaaatt agccgggcat ggtggtgcat 92700gcctgtaatt ccagctattc
gggaggctga ggcaggagaa ccgcttgaaa caaggaggca 92760gaggttgcag tgagccgaga
tcgtgccatt gcactccaac ctgggcaaca agagcaaaac 92820tccatctcaa aaaaaaaaga
aaaagaaata gagaagagac agggaagcca agctcatgcc 92880tgtaatcaca gcacttcggg
aggccaaggt gggcagatca cctgaggtca ggagtttgag 92940accagcctgg ccaacatgga
gaaacccagt ctctactaaa aatacaaaaa ttagctgggc 93000atggtggtgc ataccggtaa
tcccagctac tcaggaggct cagacaggag aagtgcttga 93060acccgggagg cagaggttgc
agtgagccaa gactgtgcca ctgcactcca gcctgggtga 93120cagagtgaga ctctgtctcg
aaaagaaaaa aaagaaaaag agacggggcc tcacatatgt 93180acagtggtat gatccgtagt
tcactataat cttgagctcc tgaaacctga tgctttaaaa 93240caaaacagta caaaactact
aaatttataa ttaaatatat aaataaaata taataaaaat 93300gttcacttct gtttttatat
tctttaaaat gacccatagg ctggtgatta gtaactaaag 93360catatgctgt ggaacatcca
gcactgatgt aagtatatga agtttgaatg ccaggtcagt 93420agattcagaa gctaagttac
tgtatggtaa agaccatgtt ttgcctgagc agctttggat 93480atggtttttt cttttttttc
tttttttgag atggagtctc gctctgtcac caggtggagt 93540gcagtggcgt aatctcagct
cactgcaagc tctgcctccc aggttcaagt aattctgcct 93600cagcctcccg agtagctggg
gctacaggtg cataccacca cgcccagcta atttttgtat 93660ttttagtaga gatggggttt
taccatgtag gccaggatgg tctcaatctc ccgacctcgt 93720gatccccctg ccttggcctc
ccaaagtggt aggattacag gactgagcca cagcacttgg 93780ccggatatag tttttctatg
tgtgtttttc ctaaacctta ttatacataa acatacaagg 93840acagagatca aatgccccct
gtctagaaac accatttctg ccaggcccat cttaataaga 93900ctatgtcttc tttttatttg
tttctatact tccttttttt tttttttttt tctgagacag 93960ggtttcactc ttgttgccac
cacactcagc taatttttgt gtttttagta gagacaaggt 94020ttcatcatgt tagccaggct
ggtctggaac tcctgacctg aagtgatccc cccacctcgg 94080catcccgaag tgctgggatt
acaagcgtga gccatcacgc tcagcctaga cttcttagtg 94140tggtgtttca ttttcttttc
tctggttccc atccagcttt gttcattgta catgctcacg 94200gtgcacttta tatgacctgt
tggcatattt tctcactctc tttttgtctc tcttcacttc 94260cagcagtgtt aaataactct
ttccattctg cagttttcct gataagaatt tcagatggtg 94320gtggccaggt gcggtggctc
acgcctgtaa tcccagcact ttgggaggcc aaggcggcag 94380atcacttgag gtcaggagtt
tgagaccagc ctggccaaca tggcgaaacc ccatctctac 94440taaaaataca aaagctagcc
gggtgtagta gcgcatgctt gtaatcccag ctactaggga 94500ggctgagtca ggagaattgc
ttgaacccgg gaggcggaag ttgcagtgag ccgagatcac 94560aacactgcac tccagcctgg
gcgacagagc gagactccgt ctccaaaaaa aaaggcaatg 94620aataattgga caaggaacca
aaacttttat tctgaaaaga gaaaattcca gtctatagca 94680agggcagttt tccttctaag
gaacagtact gatatatcat ggctaaagaa gcaggctcag 94740cttctttgtc cctttcacta
atttgctatg gcttctaaca taggctagga aaagaaaaaa 94800atctgtttct ctttctcctc
tcctctcctc tcctcttccc tctcctctcc tctcctctca 94860tcttccctcc cctcccctcc
cctctcctcc cctactcccc tctcctcccc tcccctctct 94920ttatctgtct atctgctaag
ggcagcaaat ctgtatccat acaggtctgc agcaacttca 94980attcttgcct cctcagaaga
aacaatttga ctgagggtca taaggcagaa ggagagacca 95040aggcaagttt tacaacagga
gagagtttat ttaaaagctt tagaacagga atgaaaggaa 95100ggaaagtaca cttggaagag
ggccaagcag gtgacctgaa agacaagtgc accaacacat 95160agcctttcaa caggatagag
agcagttaaa actgccctgg aaaagccaga cttacaggct 95220actctgtata atagaaactt
caggacaggg tgcggtggct cacacctgta atctcagcac 95280tttgggaggc cgaggtgggc
ggatcacgag gtcagaagat cgagaccatc ctggctaata 95340cggtgaaacc ccgtctctac
taaaaataca aaaaattagc cgggcatggt ggcgggtgcc 95400tgtagtccca gctacttggg
aggctgaggc aggagaatgg tgtgaacctg ggaagcggag 95460cttgcagtga gctgagatca
tgccattgca ctccagcctg gtcgacagag ccagactccg 95520tctcaaaaaa aaataaataa
aaaagaaact tcagcatgct tcctaatact gttcaaaggt 95580ctcccttttt atgattttat
ttaaaaaaat tttttttttt tgagacagag tctcactctg 95640ttgcccaggc tggaacgcag
tggcgtgatt tcggctcact gcaacctccc ctcccaggtt 95700caagcaattc tcgtgcctca
gcctcctgag tagctgggat tacaggtgcc caccaccatg 95760tctggctaat ttttttgtat
ttttaataga gacagggttt caccatcttg gccaggctag 95820tcttgaactc cacaccttgt
gatccaccca ccttggcctc ccaaagtgct gggattacag 95880acgtgagcca ctgcgcccag
ctcaattttt atatttttgg tacagaccag gtttcactat 95940attggccagg ctgttctcaa
actcctgacc tcagttgatt cgcccacctc agctcccaaa 96000gtgctgggat tacaggcatg
agccactgcg cccagcaggg tctccctttt taaacgtatt 96060ttctttttat agcctacaaa
ctacaagaga tgccttttaa taaactggat ggtatgtctt 96120aacgtctgat ggagtttaaa
ggcatccaag ggttacgtct gtgatagatt gccaaggcat 96180acaggtctga tcaggagagt
ttcttgatga ctagctatgg gctatgcctt tgtagcacat 96240gatcccaact ccagcaggga
tatagttagt gacatgctgg ctttgtcttc tccctaactc 96300ctggattact acaaatttct
tcttcgtgca ggaatcattc cctcactcta tacatatctg 96360ctgttaaaaa aaaaaaagtt
aagatattat agccattata ttgtagcagc catgatatta 96420tagctcagta aatgctgctt
tccaaatatt ggctaattta accatagcat gtcttcaatg 96480ttagaagcca gccctcattt
ttatcaaggg ctgaagtttg ataattcttt gtgttatttg 96540cttgtgaaaa taagtagaac
aaaaaggatt agggacctaa ccttgtatcc catgtatccc 96600agtgaacctt ttctgactta
aagcttcctt tctttttttt tggagatggg agtcttgctc 96660tgtcgcgagg ctagagtgca
gtggcgcgat cttggctcac tgcagcctcc gcctcctggg 96720ttcaagtgat tctcctgcct
cagcctccca agtaattggg actacaggct catgccacca 96780tgcccagcta attttttttt
taatttttag tagagacggg gcttcaccat gttggccagt 96840atggtctcga tctcttgacc
tcgtgatcca tccaccttgg cctcccaaaa agcttccatt 96900cttagtcttg gtacttctaa
gtggcattgg gtcaatagct ttctgcctaa gaagagaatt 96960ggctgggcat gatggctaac
acctgtaatt ccagcccttt gggaggctgt ggcaggagga 97020tcatttgagc ccaggagttc
aagaccagcc ggggcatcat aggaagaccc catgtctgca 97080taaaataaaa taaattagcc
agacttggtg acatgcacgt attgtcccag cttgtcagga 97140agctgaggtg ggatgattgc
ttgagctcag gagatcaagg ctacaatgag ctatgatcat 97200acaacaccag tgcactctag
cctgagtgac agagcaagac cctgtctcaa aaaaagcagg 97260ggggcatagt cacctcccta
aaatattagt tgaacagtat gtattcagaa gtccagaggc 97320tctgtatttt attaatattt
tcaaggcact atttctgcag aaatcaagtc agcaagactc 97380tttgaggacg ttacaggcag
aggggctaaa gatacctttg aggaagctca agtacttggg 97440tgggaggtga tagataaagg
gtcagtagaa ataatgtctc tttttatttt ttttcccatt 97500aaaaaatttt gttttaatag
caatggagat ggggtctcac tgtgttccct gagctggtct 97560ggtctcgagc tcctgggttc
aagcagttct cccaccttga ccttctaaag tgtagggatt 97620atagacatga gccaccatgc
gtggcaaatt tcttttcttt cctttttttt tttttttttt 97680tgagacagag ttttgctctt
gttgcccagg ctggagtgtg gtggcacgat cttggttcac 97740tgcaccctcc acctcccagg
ttcaggtgat tctcttgcct cagcctcctg agtagctggg 97800attacaggcg cccgccacca
tgcccgggta atttttgtat ttttagtaga gatgggattt 97860caccatgttg gccaggctgg
tcttgaactc ctgacctcag gtgatccacc cgcctcagcc 97920tcccaaagtg ctgggattac
aggtgtgagc caccgctgcc ggttccaatg tctcttttgg 97980atggtggatc ctgaagaata
gctgctggtt ctttggggat gcctggggaa tactgtgcag 98040gctttgtgat gggctcagca
gtgaggcctg tacagtatct taggtcttgt gggcctcagt 98100ctgctctctt ggctgttctc
taccacctcc tgccattaag tttttaagaa aaaggaatag 98160ttttattata ttctttggta
aacaaagcaa attaagaagc tttatatttt ccacatttat 98220ttaccaaact ccctatttgt
ttttctctat agtgattcag tttagagacc tattcaatga 98280agcatgcctt gatgttgaat
ttagagtcta ctttttccag aagaaaagag ccagggagct 98340ccaatagtag tcatctcaga
atataaaagt gttatagaaa tgatgtaaat caggccgggt 98400acaggggctc acgcctgtaa
tcccagcact ttgggaggcc gaggcgggcg gatcatgagg 98460tccggagatc gagaacatcc
tggctaacag ggtgaaaccc cgtctctact aaaaatacaa 98520aaaaaatcag ccaggtgtgg
tggccggcac ctgtagtccc aactactcag gaggctgaga 98580caggagaatg gcgtgaaccc
aggaggaaga gcttgcagtg agccgagatc gcgccactgc 98640actccagcct aggcaacaga
gcaagactcc gtccccaaaa aaagaagaaa aagaagaaaa 98700gaaatgatgt aaatcagctg
cccttcactc tgtgttgagg tgggggatgt ccctaattgc 98760agtaggagag agcctctctt
ttatctggga ctaaaagccc ttgccctaca tacctcataa 98820ttattttagg gttaactgat
tcaattgtca gaaaagaaca agctgtatct tgtttctgta 98880catattctac tttgtgagta
tttttatttc attgctatgt gattggaatc aactcaggaa 98940agaggaaaaa aataagatag
aggttataga attctgaatt ctgaagggaa ttctgagaat 99000tatcagtaaa atatgtcaaa
atgtgatatt ttacttccac caagaattag gccatatctt 99060tgtgtgaaaa taaattatta
ttatttattt atttattttg agatggagtc tcgctctttt 99120cacccaggct ggagtgcaat
cacacaatct cggctcgctg caacctccac ctcccaggtt 99180caagcgatgc tcctgcctca
gcctcccgag tagctgggat tagaagcgcc cattaccaca 99240cccagctaat tttgtacttg
tagtagagac agggtttcac catgttggcc aggctggtct 99300cgaactcctg acctcaggtg
atccaccccc ccccccccca cccttggtct cccaaagtgc 99360tgggattaca ggcatgggcc
accgcaccca gcatacggaa ataaattatt aaccagagaa 99420attttgacta aggtttttat
aaatgttagg tgaaccattg ctctaaaaga tacaaaatta 99480taacaagctg aaaagttttt
taaaaatctg cattttagtg gttcagtttt tcagttgttc 99540tgagtgctaa tagttggagt
ttataaattg taagaagcaa tctacggaga ttctgtgatg 99600aaggaatttg ttgaatgccc
tgtctgcctc acagtctcag tctttatgat agagtcttgt 99660cttctcacaa ggagagaaaa
gatttgaggc tcttttgatt acttacttac ttgcttattt 99720atatattttg cctctttgtt
tttgccgcaa atacaaatgt aatggaacct tagaatagga 99780gagacgtgtg gatcccctgg
taggcactgt tctttctatg ttcctggagc caagttcatg 99840gaattacctc caagactacg
gatccctggt tttctttcat catgatagga ggcattttct 99900agaacctgaa tcttacttta
aaatgcatgt aagacctgca aggagtggta gtgaagtggg 99960tggaatatat tcttagcacc
agacaccttt aaaatattta agttctcggc cgggtgccct 100020ggctcacgcc tgtaatccca
acactttggg aggccgaggt gggcagctca cgaggtcagg 100080agaccgagac catcctggct
aacacggtga aaccccatct ctactaaaaa tacaaaaaat 100140tagccaggcg tggtgatggg
tgcctgtagt cccagctact cgggaggctg aggcaggaaa 100200attgcatgaa cccgggaggc
agagcttgca gtgagctgag atcgcaccac tgcactccag 100260cctgggtgac agagcaaggc
tccttctcaa aaaaaaaaaa aaaaaaaaaa aaaaatatat 100320atatatatat atatatacac
acacacacac acacacacac gtgtgtatat atatacacac 100380acacatgcat atatatatac
acacacatgt atatctatag atatatacat atatatgtgt 100440atatttacat tttcttatgt
cagggtctgg cttggagtgt attgtgttcc cagagcagaa 100500ttcttttttt tttttttgag
attgggtctt actttgtcac ccaggctgga atgcaatggc 100560gtgagcttgg ctcactgcag
cctcgacctc acaggttcaa gcaaccctcc cacctcagcc 100620cctggagtag ttaggataac
aggcgcacac taccattttg tattttttgt agaggcgggg 100680ttttatcaca ttgcccaggc
tggtctcgaa ctcctgagct caagcaatcc acctgccttg 100740acctccccaa atgctggggt
tacaggcgtg agccactgtg cccagccgca gagttcatct 100800tgagaccctg acttctgcca
gctctgatcc tagtgggtgg ggctctgggg ctcagtgaaa 100860cagtcagccg ttttgcttca
gagaacacaa ataagatttt ggcttgatgc tggttgttgc 100920tggcgtcata tagtctaaaa
cgtttgctgt caagaacatt ttagtaaaag tttttgttgt 100980gctttcatct agtcaagaaa
agataggaag tggcagctga cagggcagtg tcttcatgcc 101040cctcaacctt acattggaca
ctgaagtagg attgtgtttt cactggaagt cccagtgggg 101100ccttatctcc tggatgctca
aagtgcagct cagatcctgt tgggtaaaaa gtctagtcaa 101160aatggaggac atggagaagg
ccaacaggca gagctataga gctgacatag ggcattcttt 101220gtacttccct tagccactgt
actttctttc ttcctccatc tcctccttcc ctcttctatc 101280tcattttggt ttggcctttg
ggaatagtgg gttttaaaaa atatttgaac tataacatat 101340ccttgtacca taaagaatga
gcctgactgc tttacaaagg atttctataa aaagtaatct 101400tttatactaa gagaaatgac
acatctgttt taaacctgtt acttttcttc cccgggcttt 101460gctctttctg caggtccgtt
tgacatggtt cttgaaactc ctggtagcag ccatttacta 101520gtagcactct ttatcttaga
cacagcacct aaagcaattg taggtgtttt aagaacagaa 101580agcccatctt aagcagacca
gtttgaggga ttggcagtgc tgtcaagaaa caagggcttt 101640gtggcagtct ctctaaaaac
tccctatgag tccatttctt gcaaacttct ttagactcta 101700ctgtatcttt tcatcagaag
ctacctcttt gatgtgggaa gtgtcatgaa tggactgact 101760ctctggaatt taaaaacaaa
gacaatatgg caaaaagaaa acctgacttt tagtactgta 101820tgtgttgcta attagctctg
tattcttggg cagactactc catgtatccc agccatccat 101880atgccctatt tgtaaggatc
taatgagatg atattgtgaa gaatgccttt gtaaactgta 101940aattgctttg tgaataaaga
tactatctct gataaacagt accagttctc agccaccaat 102000aacctgatac tcccatactg
tgtttggaag aaacacaaaa caatgaagag taattgtgac 102060ttttcaatgt gagttgtatt
cacaaagctc atatactttt tccctgcctt ttgatactgt 102120ttatcgcttt ctgtgttgta
atgggaagat cacacagcaa tcattttctc agtacaaagt 102180ataactacaa ctgagcttgc
attgaagatc tttaacaaag atgcaaagct gctgtccaga 102240aatgttttct ttccattttc
tcttgtacct cccagtattt taagaatcct tgaggctggg 102300caccataact cacgcctgta
atctcaacac tttgggaagc tgaggcagga ggatcacttg 102360ggcccaggag tttgagacca
gcctgggcaa catagtgaga cccccatctc tacaaaaaaa 102420tttaaaaatt agctgggcat
ggtggtgtgc acctgtggtc tcagctactt agggaggctg 102480aggtaggagg attgcttgag
cctgggaggt caaggctgca gtcagtcatg attgcaccac 102540tgtgctctat ctagcctcca
acctgggcaa cagaagcgag accctgtctt ttttttaaaa 102600aaaaaagact atccttgatg
attggttttg agccaacgga atgggagcat atggtagagt 102660ttcaacactc tgaccctagt
ccttctgaca ggcagtcaca aaatgagatc atgaagtctc 102720taagagcagc tgatgaaaaa
ggaaatggga atgtagatgt tcaatcagca gccctccaga 102780cccagagttt gctcctctgt
ggtgtctcta ggtggagaat aaggacttga tttgccattc 102840tggagtgcaa atatctagct
ttttgcagct tcatattaag atttcttgaa atgtacttag 102900taatatccat gtgtgacttt
gccaagtgat ggctttgggc tggaaaggat tttagcaggt 102960tttagtctaa tttaagccta
atctaacact gctgagaaag gaggagatgt ctttggtttt 103020actttctaat atatggtacc
tcttagccgg gtgcagtggc tcatgcctgt aatcccagca 103080cttcgggagg ccgaggcagg
cgatcacttt aggccaggag ttcaagacca gcctggccaa 103140catggtgaaa ccccatctct
actaaaaata caaaaattat cccggtgtag tggcgcacac 103200ctgtaatccc agctacttgg
gaggcagaaa caggagaatc gcttgaacct gggaggcaga 103260ggttgcagtg tgccaagatc
atgccactgc atgccactcc agcctgggca acagagcaag 103320accctgtctc aaaaaaaaaa
aagagagatc tatctctctt ctttttatat acatatacat 103380atacatacat acatacatat
atgtatgtac acacatatat atatatggcc cctctttttt 103440tatttgagtc ggaatctggc
tctcttgcca ggctagagtg cagtggcatg atcttggctc 103500actgcaacct ctgacttcct
ggttcaaacg gttctcctgc ctcagcctcc cgagtagctg 103560ggattacaag catgtgccac
cacacccagc tcacttttgt atttttagta gagacgggat 103620ttcaccatgt tggcagggat
ggtcttgatc tcctgacctt gtgatcctcc cacctcagcc 103680tcccaaagtg ctgggattac
aggcatgggc caccgtgccc agcctttttt tttttttttt 103740taaagagacg gagtctcact
ctgtcaccca ggctggagtg cagtggcgtg atcttggctc 103800agtgcaacct ccacctcccg
ggttctagca attctgcctc agtcttccga ctggctggga 103860ctgcaggtgt atatcaccgc
aaccagctaa ttttttgtat tttagtagag acagggtttc 103920actgtgttgc ccaggctggt
ctcgaactga gctcaggcag tccacccgcc tcggcctccc 103980aaagtgctag gattacaggc
gtgagccacc gtgcctggcc tatatggtac ctctttagga 104040gccagacctg gttaatcaga
cacatggctt tcatgactcc tttgcttgag tagcttaata 104100actcaataaa tcaaaagatg
aataaatatt ctaatgtgtg aagatactct aatagataat 104160aggcaattaa gaatggacat
ccacggctgg gcgctggggc tcatgcctgt aatcccagca 104220ctttgggagg ctgaggcggg
tggatcatga ggtcaggagg tagagcccat cctggccaac 104280atggtgaaac cccatctctg
ctaaaataca agctactcga gaggccgagg caggagaatt 104340gctcgaactt gggaggcgga
ggttgcagtg agccaaaatc gcatcactgc actccagcct 104400ggcgacagag cgagactccg
tctcaaaaaa aaaaaaaaag aatggacatc tactgaaggt 104460gattgcatca tcctacccat
tcattaatct aactccctac aggatacttt cctaggagac 104520actgacaggt ctgttttctg
aaatccagag aaaggcagca atggggaggg gtgcagtgta 104580tgtatgtcat acctgtgctt
ggtatatctg agttgcctgt gtatgatagc agctggggaa 104640tcaaatcata gataaattgt
tctcatacag gtttgtccta tgactaccta ttcttattaa 104700acaattggct atattgaccc
tttttggttt tggaaaaata ataataattt ttttaagaga 104760gaaaaagaaa caattggcta
cccttcaaca gtgatgttaa aaccatttca cattctttag 104820cagtggtcac tgtcctatgt
ctaactatgt gcaggttgag aaaaaggact gcccgagtta 104880tagatgattc tgtgagaata
agaaatcatt gcttttgtaa cacatgaggt aaaagtaatc 104940tcaaagttga catgctgatg
gggactcctg gcaaggggag ttccctgccc tcaacaaaag 105000gtcatccaca gctactggaa
catttttgtt gtctgagaag tataaagtgc cttagaaata 105060cctgaatcca ttaatgcctc
cagttggtga aatcagaatt tgcaggtgac tgaaattgac 105120agtagtgcct tgttcttact
cactgttcaa atgacaaccc acatgtttta tggattgggt 105180atacagatgt atgctctaac
agcagtatct ccctccagag ccactgtgta ccaagcacca 105240ggtcctccag ggatagttgg
ctctattcag tctttgattc attcaacaag agcttactaa 105300gctccttttt ggtaccagat
actctttgtt gctgaaaata aataaaaggc cagcaagatt 105360aagtagactg tgagatctgg
accagtaatt tgacaacaca aagtactgtc gtaaagatac 105420agtttctgat gtgtagtgac
cattccgtat gaaagcttag tctttcagga gattaaaatg 105480ggtggtggaa tattcctacc
tagcaagcaa gcaaggtgaa atgagtggct gtttgactcc 105540cacctgctga tgctggtctt
ttttggttcc tagggcttat aatgatcaac atttcttgag 105600ccctcactat attctatgct
aagctcttta catgtatgaa tttacttaat cttcacaacc 105660accctaagaa ataggtactg
ttgtccttac tttacagatg aggaaatgga agcacaaaga 105720agttaaggac cttgctgaag
gtcatggagt agaggcagga ttcaaattta gggaactcag 105780cctacagtcc atgctcttaa
agatgttata tcctgtctct gggcttagaa ggggttcatc 105840ttaggccgga cacagtggct
cacgtctgta atcccagcac tttgggaggc caaagcgggc 105900agatcacgag gtcaggagtt
cgagaccagc ctgaccaaca tagtgaaacc ccatctctac 105960taaaaataca aaaattagcc
aggcatggtg gtgtgcgcct gtagtcccag ctactcggga 106020ggctgaggca ggagaattgc
ttgaacctgg gaggcggagg ttgtggtgag ccgagatcgt 106080gccactgtac ttgagagtga
gtgacagagc aagactctgt ctcaaaaaaa aaaaaaagac 106140ggccaggcgc agtggcttac
gcctgtaatc ccagcacttt gggaggccga ggtgggcgga 106200ttacctaagg ttgggaattc
gagaccagcc tgaccaacgt ggagaaaccc cgtctctact 106260aaaaatacaa aattagccaa
gcgtggtggc atatatctat aatcccagct actcgggagg 106320ctgaggcagg agactcgctt
gaacctggga ggcggaggtt gcagtgagcc gagatcacgc 106380catagcactc cagcctgggc
aacaagagcg aaactctgtc tcagggaaaa aaaaaaaaaa 106440aaaaggaggg ggcgcttcat
cttgactaac ttcctgcatt ggtggagctt gatagagtgg 106500tccttcccag atccttccct
gcatacagag cctgtctctt ttctgattgg tccctaaggc 106560cagattacct gtccctaata
ctgagcagaa gctggtgaat gaaacaggag atccctcagt 106620caaaacaaaa ggaaaaagaa
aaatgaaaca ggagatccct tctctacagc ccagatgtaa 106680gtccagctgt gcccttcacc
acctgggtga ccccacctct gtgaacatag gtcctcatct 106740gtaaagtgta gataatgtta
tttcatcgga tcatttaggg gattaaataa gataatgtac 106800ttcgtggttt ctggctctta
gtaagtgctt aataaatgtt agcgattttt attatcattg 106860tccttagcct tgagaacaag
ccagggaata gtgtctcaga ccagatgcta agacctaggt 106920agatgggcaa ttttccttgg
ttttgacaag acaataattt tatcctgtgt atttctcttg 106980acttttttga tgtgaaaagc
agagaggtaa agcattattt gacagatgta tggattcaag 107040caagaaactg aggtccaatt
gcaaagaaat ggcttgtata actcagagcc ctgtctgagg 107100aaacacagag gaccctagag
ggcggagaat gaacacagcg caggggctag ttccagagtc 107160gcattctcgg ttagttcact
ttcaagtgtg ggtgagggtc ccttgtcagt aggcagagaa 107220tttttttccc ctgcaccaac
acatacctgc tgcctagtgt ttattaaaca aaactttatt 107280ttaatgtgaa atagaattca
tgacttgtcc aaaatggaga ggcaagggag ctctttaaca 107340ggcttgttga gccccttttc
ccacctgttc ctgtgccaga ctttcccaaa ggcttacttg 107400ccaatggttg ctcctcagat
ctcagggcta gctcactcta taggctccaa gccagagtga 107460taccgccgcc gccgctgttg
ctcccaccag ccaatcagtt tcctgctgta aggatgtaac 107520ttgctgtgaa gctttcacct
tcctcctttc ttcctgtctt caatgttgta tgtctttgtc 107580ctggtgcttt tgccatacag
ccagtgtttc aaagaaaatt ttcaggcact aaagttatag 107640cccttactac ctttccaagg
agatgtgaga tagctgtgga aaagaagagg gctcctctgc 107700ctctgtgcag aaggaacagt
ttacttcttg atagtgtgct agctcctgag ctaggtgggg 107760gacttgctgg gattcaagag
agtgcattac ctgacctctg gacaagtaga ctgggcatag 107820cctgcccaag gacagcaccc
taacctgcag gaaccaaggc cgaagactga tttcaccttc 107880tcgtactccc ctttcctaag
ctaaagcttg ctctgtaaca ctgccccagg tctgtggctt 107940aaaacagcca tttcctttca
ccagtgaatt aagctcactc tttataaaat gtttcagctt 108000ggggattgga aaggctctct
gtgcctttct gtctctgtct gtttctccaa gggttgatgt 108060tgatggcttc tgtctttgtc
tttacaggga actctaatga tccaggacaa agaagttacc 108120ctggagtatg tatcaagcct
ggatttttgg tactgcaaac gagtaagtac caagaatccc 108180tttctttaga agtaagtatc
tggaataaca gctcctccat atctctagga aggctgcctg 108240ctaacatgca ttcccaagga
caaagctctt cttcctcagg tcacttcagt tgaacaggag 108300gaggtcaaga caaggtcatt
cataatttct ccttcccagc tgctacatgt ggccatagag 108360agttctggac ctgcaattgg
agacactttc ccaaggacat gtgccattat ttctatcagt 108420tataaaaata acagttcctt
gacatataat atcttctcac ctctcctggg ggtggtcata 108480aaggaattct tggttggaaa
agtaggtttg gagagactag ttctttggga gtcgtacatt 108540ttttggatat tcttgggttt
ccaagggtat agaacttcag acaccatggc attttacctc 108600tattaaactc catattctct
tagagtggga tatttaaaat tttaggctat actctttttt 108660tttgaaacgg aatctcattc
tgttgcccag gctagagtgc aatggcgtga tttccactca 108720ctacaacctc tggctcctgg
gttccagtga ttctgctgcc tcagcctccc gagtagctgg 108780gattacaggc acttgccacc
tcacctgcct gatttttgta tttttagtag agatggggtt 108840tcaccatgtt ggccaggctg
gtcttgaact ccgacctcaa gtgatccacc tgcctcagcc 108900tcccaaagtg ctgggattat
aggcatgagc caccgcgctg gcctgtttat ttatttattt 108960atttattttg agacagagtc
ttgctctgtc gcccaggctg gagtgcagtg gcgcgatctc 109020agctcactac aacctctgcc
acccgggttc aaacgattct cctgccccag cctcccgagt 109080agctaggatt acagttgtgt
gccaccatgc tcagctaatt tttttgtagt ttttagtaga 109140gatggggttt caccatcttg
gccaggctgg tcttgaactc ctgacctcat tatccaccca 109200cctcggcctc ccaaagtatt
gagattacag gcttgagcca cggcacccag ccggctatac 109260tctttaaagg tccagtttga
ttgcagtgag catgaaaata taatttgttt tcattgctac 109320tacttagtat caaaaataat
tatgaaaaat atataaagtt tctgagcccc gacacactaa 109380aaatgttaca gtacttgaaa
aaatttagta aagactttag cttgacattt gttagtctcg 109440gtagaattga cattgtgtta
gtctcggtag aatacaactt gaagagctat gattgttatt 109500agccaaagta ctcatatttc
atggatatac tcccttatgg tgtcatttta ggaagatatt 109560tcgtttcctt ttattgagat
aaaatacatg taacattaca tttgccattt taaccatttt 109620gaagcattaa ttcagtgaca
ttaagtacct tcacaatgtt gtgcagctat caacactact 109680tcctagaact tctttttttt
ttttttttaa ataagagatg ggatctcact atgttgccca 109740ggctggtctc acagtccctg
gctcaagtca tcctctcacc tcaacctccc aaatagctgg 109800gactataggt gccatcatgt
ccaggttagt tccagaaatt ttttttttct gtcttttttt 109860tgagacagga tctcactctt
gtttctcaag ctggagtaca gtgatgtgat catggctcac 109920tgtacccttg acctcctgtg
ctcaagcgat cctctcacct tggcctcccg aagttctggg 109980attacaggtg tgagctgcca
tatctagcct cagatctttt ttaaaccctc aaaaggaaac 110040ctcttatcat taatcagtaa
cttcccactt cttcttcccc cagtccccag aaaccattaa 110100tcttttttct atctccatgg
atttgcctat tccggatatt tcatataaat ggaatcaaaa 110160tatgtaaact tttctgttgg
cctttcacct agcatgtttt cagagttcat gtatgttgca 110220gtatttatca gtacctcatt
tctttttgtg gctaaataat atgaatatat cacattttgt 110280tcatccattc ctcaattgat
ggacatttgg gttgtttcta ccctgacttt ggtgaataat 110340agaacctttg tgtgctagtt
tttgtttgaa cagctgtttt cagttatttg ggggtatgta 110400tccaggagtg gaattgctga
gtcatatggt aattttatat ttaactcttt gaggaaccat 110460caaactgtat ttcttttatt
ttattagcaa accttttcat agaccacagc tgtaccattt 110520tatattccag caatatgtaa
gggcttcatt tctccacctg cttgccaaca tttgttcttt 110580tccctttatt tgataatagc
catcctaatg ggtatgaaat aatatctcat tgtggttttg 110640atttgcattt tcctaatgac
tttgagggtt ttttttcatg tgtttgttgg ccatttgtat 110700acctcctttg gagaaatgtt
caaccaagtc ctctgccctt tggaattgat ttgcatgtat 110760ttttgttgtt gagttataag
agtactttat attttctgga tattaatccc ttatcagata 110820tatgatttat aaatattttc
tatgtgttat ctttcacttt cttgagagta tcctttctaa 110880agaaaaaaaa agagagagag
agagataagg tgtggctcat ggctgtaatc ccaacacttt 110940gggaggctaa agtgggcaga
tcacttgagc ccaggagttc gagaccagcc tgggcaacat 111000ggcaaaaccc catctctaca
aaaaatacaa aatttaactg ggtgtggtgg tgcatgccta 111060tgatcgcggc tactaagcag
gctgaggtgg gaggatcacc tgagcccagg aggtcgaggc 111120atcagtgagc tatgatagtg
ccactgtact tcagtactcc atcctgggtg acagagcaag 111180accttgtctc aaaatttttt
ttagctgggt gtggtggctc acgcctataa tcccagcact 111240ttgggaggcc gaggcaggcg
gatcatctga ggtcgggagt tggagatcag cctgaccaac 111300atggagaaac cccatctcta
ttaaaaatac aaagttagct gggcatggtg gcacatgcct 111360gtaatcccag ctacttggga
ggccgaggca ggagaatcac ttgaacctgg gaggcagagg 111420ttgcggtgag ctgaaattgc
actattgcac tccagcctgg acatcaagag tgaaactcca 111480tctcaaaaac aaaaaagaaa
aattttaagt ttattatgta cctattaaaa tttttttgta 111540attaaaacaa atgctaatgg
cggtattatt cataatagcc aaaaaatgga aataaccaaa 111600atgtccattg gctgatggat
ggatgaacaa gttggcatat ccatacaatg aaatgctatt 111660tgacaatgaa aaggaatgaa
gtactgatgc atgttacaac ctagatgaac cttgaaaata 111720ctatgccaga cacagaagac
catacattgc acaattccat gtccctaggg gtaagaatgg 111780gggaggtaac tccactagat
ttcttttggg gtgatgaaaa tgtttcagaa ttagattatg 111840gtgatggttg cactatacat
ttactaaaaa tcattgaatt gtacacataa aataggtaaa 111900ctttatgggg tttgtttttg
tttttaagag agagtcttgg tttgtcaccc aggctggatt 111960gcagtggcac aatctcggct
cacgacaacc tccacctccc aggttcaagt gattctcgtg 112020cctcagcctc ccaagtagct
gggattacag gcgtgtgcca ccatccccag ctaatttttg 112080tatttttaat agagatgagg
tttctccatg ttggctaggc tggtcttgaa ctcctggccc 112140gaaatgatcc aacttcctcg
gcctcccaaa gtactgggat tactggcatg agccatcatg 112200ccaggcctgt tttatgctat
ttaaattata cctactaagg ttaggatcct aactgccact 112260cactaactga agtgtcacat
actttattcg ttggcatgta tatactcagt tgtcccagca 112320ccatttgttg aagagactat
tctttcccca ttggcacttt ccccattgtt agaaatcagt 112380tgaccataat ctataggttt
attcctagat tctcagtttt attctgttga tctatatgtt 112440tacaaatagc accagttacc
acagcagctc tcctgtagta acaactctcc aatcccagta 112500gcttaaaaca gcaagcatat
tcttcactca cattacatgt cagggactat gggttgtttg 112560ctacagttct gttccacgtg
gcttctcatc ccaggaccca ggcggaagaa acagtctcaa 112620tatggggcag tgtccctctg
gctaagggag agagaggttc attcacgcaa gcagtggctc 112680ctaaggcttc tcttagacct
agtgtaggtc atgttcactc atgttttatt ggtgaaagca 112740aggaaggcac atggccaagg
ctgacaatgg agagaggaag tatactcacc ctgtgggaag 112800gcataacagc catttggcag
tgggcagggg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 112860tgtgtgtgtg tgtgtgtgtt
tataatctgt ttatagggaa gggaacaatg aaataactga 112920ctgtagtgat cttcctcaag
tgagttaacc tctctaggcc tcagtttcct catctacaaa 112980atgaggagat aagagtaccc
atttcatgaa gtttattggg gttgtcagga tcaataagtg 113040atgacatata cagtaggcca
aatacatggt atgtactatt taagaattag ccggctgggc 113100gcagtgactc acacctataa
tcccagcaat ttgggaggcc gaggcgggca gatcacctga 113160ggtcgggagt tcgagaccag
cctgaccaac atggagaaac cctgcctcta ctaaaaatac 113220aaaattagcc aggtgtagtg
gcacatgcct gtaatcccgg ctactcggga ggctgaggca 113280ggagaatcgc ttgaacccgg
gaggtggagg ttgtggtaag ccgagatcat gccgttgcac 113340tccagcctgg gcaacaagag
tgaaactccg tctcaaaaga aaaaaaaaaa aagaattagc 113400cactgctact attgttattg
ttttctcctc aactccatct ggcagacctt tactcgccct 113460ataaggccct cctcaaatac
catcctcttt atagttctta ctcttttatt tcctgccaac 113520caagtttctg cccccatggc
atttggaagc tcagtggcaa aagttcaggg atttcggggt 113580tgggcagtgt gcttgacttt
ttgttcacat gttcagacaa aaataattac attcacatta 113640aaaatgtctc ttaccttatt
ctgggctagt gaatgttccc tttcaatgtc ttttagatag 113700ctgccagaga cactatctgt
atctcttcct cctaccttgt acctcattat cagtgtttga 113760gaaaggagtt gataactgaa
ttctcagttc tagccaaatg tgaatgggga tctcatagtc 113820agttcaggcc caagttttgg
gtgcagactg taaatggctt tgggacaata atattctata 113880aaccatgtaa cagtagtttt
ctaggcatat ttcctatagg aatctttatc cagggcaaag 113940gcatttgggc tgcaccaaag
tcccagatgc cttgttataa ggtagctctc aaacagtagc 114000tcatcagatc ccatctgcca
gctctaatca gtggggaata tcagattctt tttttaagct 114060ttgaggggat ctgggatatg
gcttgtttct ttcatttttg gggggtttca ctttgttaga 114120tatacataag atttttaaaa
atgttttcag tcaaattgat ttccttcttc cttacagtgt 114180aaggcaaaca ttggtgggca
ccgatcttcc tgttcattct gcaagaaccc aagagaaggt 114240gagtggcgaa agtggtagca
gtttttatct cgtgcattga gcaaaacaaa tttcatgttt 114300tccttggctt tgaagaatta
tcatccctaa atccaagttg atctacaaac cttttttttt 114360ttttttgaga tggagtctcg
ctgtgttgcc caggctggag tgcagtggca ccatcttggc 114420tcactgcaac ctccagctcc
caggttcaag cgattcccct gccctagcct cctgattagc 114480tgggattcca ggcatgtgcc
accacgccct gtagcccggc taattttttt gtatttttag 114540tagagacggg gtttcaccat
gttggtcagg ctggccttga actcctgacc ttgtgacccg 114600acccaccttg gcctcccata
gtgctgggat tacaggtgtg aatcactgca caaggcctgc 114660aaacctttat ttatttattt
atttttgaga cagagtctcg tactcaccca ggctggagtg 114720cagtggcgca atctcggatc
actgcaagct ccgcctccca ggttcacgct gttctcctgc 114780ctcagcctct ctagtagctg
ggactatagg cgcccaccac catgcccagc taattatttg 114840tattttagta gagacggagt
ttcaccgtgt tagccaggat ggtctcgatc tcctgacctc 114900gtgatctgtc tgcctcagcc
tcccaaagtg ctgcgattac aggcgtgaac caccacgacc 114960ggcccaaacc tttcaaaagt
gcaatttgag ctaggcatgg tggctaacgc ttgtaatccc 115020agcactttgg gaggccaagg
caggtggatc acctgtggtc aggagttcaa gaccagcctg 115080accaacatgc cgaaaccctg
tctctactaa aattacaaaa attagccaca ggtgtggtgg 115140cacatgcttg gaatcccagc
tccttgggag gctgagacac tagaatcgct tgaacccagg 115200agtcagaggt tgcagtgagc
tgagatctcg ccactgcact ccagcctagg caacagagtg 115260agaaaaaaaa aattgcagtt
tggtgcccaa cttaacgtaa cctgttagta aatgatttca 115320gatcttattt tcaccagagg
aaagagatag ggttgtgggc tcctaggcta aagtggctaa 115380gtgggcagct gagcagaggt
cagtatattg ttatttggaa tacatttaag gattaaggat 115440gttaggttga aaaagagtct
ttatgacatc agtctgtgtg gcaaaccttt ctcccactcc 115500tacttctttg aagttattgg
gaatcatttg ctctattgtt ttctctttta cattctgtaa 115560gcatttcagg attttcaaga
gaaaaacatt tgttaaaata acagtaaaaa cataaatagg 115620agaaaataat caggatgtgg
ggaacatttt attattttag aggaataaaa ctaccagctt 115680ctcaagcact tatctttaat
gtaaatttct ttagagaaat ttcaggtagg caacttcgaa 115740gagtcagaca catgcatcca
taacaacagt cctgtagtca tcccttaagg aaagccacag 115800catgaccata aaatatagtt
cagtgcaggg attcaggtag ccttctgttt gttgcaaggt 115860tagagtttaa tgtgcctaca
aggagtttct taggtgggct tttgtcctct tgtggagatt 115920ttactctggt gaagactgaa
aggcaggtgt tctgaaaatc tttaggggaa ggctgtgtat 115980gttctagaaa ccaaaccaaa
atgtgggaag gaggatgaac aactgagatt tttgcttgtt 116040aggtcacttc aggttaggca
aagttgtgtt tttttccccc cacaagaaac actttttttc 116100aaagctattc cagcaaatga
atagatagtt ttttgttttt tttctttttt tttttgagac 116160ggagtcttgc tctgtcaccc
aggctgaagt gcagtggcgc aatctcggct cactgcaagc 116220tctgcctccc aggttcacgc
cattctcctg cctcagcctc ccaagtagct gggattacag 116280gcacccgcca ccgtgcccag
ctaatttttt gtattttcag tagagacagg gtttcactgt 116340gttagccagg atggtctcga
tctcctgacc tcgtgatctg cccgcctcag tctcccaatg 116400tgctgggatt acaggcgtga
gccaccgctc ccggccatga atagatagtg tatgaaaacc 116460actgggcacc ataccactaa
gatgagacag ctttaatctg gaaacctgtc actgctatta 116520tgtaatctct atattgctct
catataatac ctctttttga gccacatgga ttccagtgaa 116580ccctccaaga atgaattagt
tacaagaatg tgcccctaat tataaaacaa actataaaga 116640caaattatcc tgctgtagta
ggacatttga aataaatcat ttatattttg aaggacgtct 116700gcccattatg tttatttgca
tataaaggag ctacgtgcag atagggtctg ttcctagctt 116760cactggagga gggcctgtgg
tcttacagga tatgagtagc tgtttgagca ctgtaacact 116820ggaagaagca aggcttctag
atgtgtgttt gggatatgtg tttctactaa accttaagta 116880agggccatat cttcggtaat
tttgtcccca gatgtgttgt tatcattgat tatgatagtc 116940aggttcaagg tgtcatgaag
gatttgttat atttaaatgt ttagtaggtg atatagagat 117000ttcataagat tacatttttt
aaatgcttgg atagtttctt ctgtgaacta tttcatgtcc 117060tgtctcagct tcacttaaaa
tattttgtca ggaactgtca gaggactttt tattagatat 117120ttctgagata atattaaaag
cattccaggc cgggcgtgtt tgctcacacc tgtaatccca 117180gcactctggg aggccgaggc
aagtggatca cctgaggtca ggagttcgag accagcctgg 117240ccaacatggt gaaacctcgt
ttctactaaa aatacaaaaa attacctggg cgtggtggtg 117300ggcacctgtg atcccagcta
ctctgaaggc tgaggcagga gaatcgcttg aacccgggag 117360gcagaggttg cagtgagcca
agatcatgcc attgcacttc agctgggcaa caagagcaaa 117420actccgtctc aaaaaaaaaa
aaaaaaaaaa aaggcattcc agtatgagta tttgctggca 117480ggtaaggaga aattacagta
gcagtgtttt ttcttttttt ttttttttga taaagctttc 117540tagagattct ctttgtttct
gttccactag tgacagaggc caagcaagaa ttaataacct 117600accctcagcc tcagaaaaca
tccataccag caccattgga aaaacagccc aaccagcccc 117660taagaccagc tgataaggaa
cctgaaccca ggaagaggga agaaggccaa gagtcacgct 117720taggacatca aaagagagaa
gcagaaaggt atctgcctcc ttctcgaagg gaagggccaa 117780ctttccgaag agaccgagag
agggagtcat ggtctggaga gacacgccag gatggagaga 117840gcaaaagtaa gtagtttgtc
agggcacata ccagactgtg atcatcacaa tggagcatag 117900atggccaatg ttatgtccgg
gagctatctg ctttccagta ccctgagaga tctgtgcatg 117960acctgatgac agaggccatt
gctgtctgtg gaccttcctg tactgcttaa aggaatctat 118020gcccttcaaa tagtaaattg
ctatatgaat gcagtaaggc atgattttag atttctaagt 118080attggtgaag aaaagtatgc
agtatttatt tgtttagcat ttttttacag aaccagcctt 118140gctagtagca tctatagtaa
aaaatgacag tcagattctt gggacttcaa aaatttatct 118200ttctctccct tgtgttgccc
ttctcccatt tatggttgat tcagctatca tgctaaagcg 118260tatctatcgt tccacaccac
ctgaggtgat agtggaagtg ctggagccct atgtccgcct 118320tactactgcc aacgtccgta
tcatcaagaa cagaacaggc cctatggggc atacctatgg 118380ctttattgac ctcgactccc
atgcggtgag tttcctccac cttggattgg cctagagaca 118440gatggctaaa gaaccttcaa
gaaggtttga ctgggggccg ggcctggtgg cttacgcctg 118500taatcccagc actttgggag
gccgaggtgg gtggatcacg aggtcaggaa atcaagacca 118560tcctggctaa cacggtgaaa
ccctgtctct actaaaaaat acagaaaaat tagctgggcg 118620tggtggcagg cgcctgtagt
cgcagctact cgggaggctg aggcaggaga atggcgtgaa 118680ctccggaggc ggagcttgca
gtgagccgag atcgcgccac tgcacttcag cctgggtgac 118740agagcgagac tctgtctcaa
aaaaaaaaaa aaaaaatttg agggacttct tgatcatttg 118800aattcttgtg tgctacctga
tatcataatc cctcttgctc tctcctttgg gtttattgtt 118860cattcaggtc aggtgacagc
cctcaaaagt taggatcccg tctggttttc taggttcatt 118920tttttcttgt gtcatttact
gtttccaact tactcgcttg tgaagaatct gagtactgaa 118980tccttcatga ttttagtgaa
ctttctgatt tattttgtcc agccacagat ggttttatat 119040ttgatgataa aacatttcct
ctttttcctc aaagtattta tagattcctg tggcttaaat 119100ttttagttgc ggggcctttt
tctatggaag taaggtgaag ataatgaaag tcattggtat 119160ttcttagatt tttcatgctc
aaaagtcaca agggactttg taaactgaat ctgattgatg 119220ataattgcaa cctaaaagaa
gaggatttga atttctgaag tttatgccag aactgacatc 119280tattctgatt cctgttccaa
tcagtccttc attaaaagtt gcctgtttct gccagtatgc 119340tcttactgtt aaaattttga
cagaatataa tgtagtaaat ttatcctctg agaaggaaaa 119400tccacgttca cttctctttc
aaaggagaat ttttctgtct ttgggttctg gcattttctg 119460tctctgggtt caagtgtgtc
tggttctata ggaagctctt cgtgtggtga agatcttaca 119520gaaccttgat ccgccattta
gcattgatgg gaagatggta gctgtaaacc tggccactgg 119580aaaacgaagg taaggcagaa
gggtgaggat ctcttgtgct gcccccactt gtgtttttga 119640gaggaaactc cttttcctgg
ctggaaaaac agtaaagcat gatgttttcc taacatggac 119700tgcttcagat aggtgtttat
tacagtttct ttctgaagcc tgacttgtcc tgactctcga 119760attgttttct ttcttgaata
atactaggta cttttgtcct ttcccttttg actgtctggt 119820atctttgggt cccaaatggc
ctggcgtggt agcacatatc tctattccaa gctactaaag 119880aggctgaggc gagatgggga
gcgggttaca tgagcccagg agttctaggc catagtgtgc 119940aatgaagatg cctgtgaata
accactgtac tctaccctgg gcaacacagc aagaccctat 120000ctcttaacaa aaaaatgatg
gtacagtttt ggatgtgcag acacatgtca atacattctt 120060gccccttgca atcctaggaa
aatgctgtcc tggcttttcc ttcccctgac cttgtgcata 120120tttccatagc actgggaaat
ctaatttctc tttcctcctt cactcatctt gacccaggag 120180tggtaacttg gaaatggcca
tgtcagagaa acaggcttac caatatgggg catatcttgc 120240tctagcaccc tccacttaat
ggctgttttg ctccaccact tggctttgta agagtcttac 120300tgctcattgg gcaggcgtgg
tggctcacgc ctgtaatctc agcacctggg gaggccgagg 120360cgggcagatc atgaggtcag
gagattgaga tcatcctggc taacacggta aaaccccgtc 120420tctactaaaa atacaaaaaa
aaaaaaatta gctgggcgtg gtggtgggca cctgtagtcc 120480cagctacttg ggaggctgag
gcaggagaat ggtgtgaacc caggaggcgg agcttgcagt 120540gagctgagat cacgccaccg
cactccagcc tgggcgacag agcaagactc cgtctcaaaa 120600aaaaaaaaaa aaaaaaaaag
agtcttactg ctcattcttt caggagtgtc tggaccaccc 120660aacctgcttg ctgtctaggt
tggttccttt ccctgcaaaa tgaggaacag aggatttctc 120720gataggaact gtaggattaa
gtactcgtca aatgccactt ggtagcagcc ttaagaattg 120780ttgtgttatc tgttgcagaa
atgattctgg ggaccattct gaccacatgc attactatca 120840ggtaggctgt aacaggtggg
gagtgctcta ttaaaatcct caggtgacta taagggtgat 120900cttgaatttt ctttagtggg
tgactgttaa ggtgaatgac cattggatag ttctgtaatt 120960ttaacttgcc tttctgtgat
agggtaaaaa atatttccga gataggaggg gaggtggcag 121020aaattcagac tggtcttcag
atacaaatcg acaaggacaa cagtgtaagt aacctttgtt 121080ttatttctgt tgctcttttt
tgcttgactt gctactcatt acttgacatc tgtgtgatca 121140cagttggcaa gatacactgt
tgactgaggg tgctcatcca gagagaggca tctgtagatg 121200cacctatttg tgttggtcac
cctaattctt gggttcttga tgagtctcca gtaagggctt 121260cattggacag agactaacat
tggctctgat cttgttacct ttagcatcat ctgactgcta 121320catatatgat tctgctactg
gctactatta tgaccccttg gcaggaactt attatgaccc 121380caatacccag gtgagtttgg
ggcttttttt tttttttttt tttttttacc tctgtcaatg 121440attcttttga gaaaagcacc
cataatttgc tacttgagga ttttattccc tggattctct 121500ggatgctcat tgcatgaaaa
gtggaaaagt ttagatctat ggaaacagaa ctgttgccta 121560tatggaaaat cagtgccttg
tggcaataca ggtaagaaca gtgttgctct tgaaaaagtg 121620gacagtgggt ggtctgaatg
tgtcctggtc cctggagtgg gtttttagat tgatgtggac 121680tcttcttaga cttgtaagta
aaaaagttgt ttcttcccct aaaagggaac tgtgcgcctt 121740agacctggaa ttgctgggaa
actgaaacat tctgtagact tacttgtttc caactgtatc 121800gcagcaagaa gtctatgtgc
cccaggatcc tggattacct gaggaagaag agatcaagga 121860aaaaaaaccc accagtcaag
gaaagtcaag tagcaagaag gaaatgtcta aaagagatgg 121920caaggagaaa aaagacagag
gagtgacgag ggtaagagga attgttaatt tgctgtcttt 121980tgccacatag ttattaaaat
gttggaggta cgaacagagg atatctatgt ttgcaagtgt 122040aaagtaactt taaaaatact
ctgtcagccg ggcgtggtgg ctaacgcctg taatcccagc 122100actttgggag gccaaggcgg
gcggatcatg aggtcaggag atcgagacca tcctggccaa 122160catggtgaaa cccctgtctc
tactaaaaat acaaaaatta gctgcgtgtg gtggtacacg 122220cctgtagtcc cagctactca
ggaggctgag gcaggagaat tgcttgaacc ctggaggcag 122280aggttgcagt gagccgagat
cgcgccacta cactccagcc tggcaacaga gcaagactct 122340gtatcaaaaa aaaaaaaaaa
acctctgtta atgagtattt ttacctggtg taggcaattc 122400cctcacctct tatatcccaa
ctctctcttt tacaaatggg aaaactatgg atggtagaac 122460aaagtggccc agctcaaatc
ccaacacctc agctccatac attttcactt ttctacattc 122520cttttttagt gtttgacttt
atacacattt ctctagttgt aattatagca ggagatactg 122580tttagtcact ttttatccta
agtatttttt ccatgtttct atatactcta ttatttttaa 122640tgcccacatg gtaaaaattc
acggtataac tgtaccttca ttttcttcat ctctcctaca 122700ttatttgtct tctctttcta
atcttttctt tttccttttt tttttttttt ttctgagaca 122760aagtcttcct ctgtctccca
ggttggagtg cagtggcatg atcatagctc acttctacgt 122820caaacccatg ggcttaagca
gtcctcccac ctcagcctcc caagtagcgg ggactacagg 122880catgagccac catgaccagc
taatttttgc ttttttgtag agacaggatc ttgctagatt 122940gaccaggctg atctcgaact
tctggcctca agtaagcttc ctgtctcagt ctcccaaagt 123000gcttcagtta caggcaagac
ccaccttgct cgcctctttc taatcttata ctgtcataat 123060atataacatt tagcattttg
tttcttcttt taaattactc cctatgacac attttcagaa 123120tcagagatga tgaacatttt
tacatctaat acaaaatcaa attattaggc agggtgcagt 123180ggctcacacc tgtaatccca
gcacgttggg aggccaagac aggtggatgc ctgagtttag 123240gagtttgaca ccagcaacat
ggtgaaactc catctctacc aaaaatacaa aaaaaattag 123300cctactgtgg tgatgcatgc
ctgtagtcca agctacttgg gagactgagt taagaggatc 123360gcttgagccc aggagattgc
agtgagctgt gattgcgcca ctgcactcca gcatggacaa 123420cagagccaga cttgtctcaa
aaaaaaaaaa aaaagaaaat ctgccgggca tggtggctca 123480tgcctgtaat cccagcactt
tgagaggcca aggcaggcgg attactttag gtcaggagtt 123540tgagaccgcc tagccaatat
ggtgaaaccc ccatctctac taaaaagaca aaaattagct 123600ggacgtggtg gcgcaagcct
gtagtcccag ctactcagga ggctgaggca ggagaatctc 123660ttgaacctga gaggcagagg
ttgcagtgag ccaagatcac acctaccttg atatcagtta 123720tgcattagtg aaaatggatg
aatttgcttg tgattcaatt cataacacct ttttttccct 123780tttttttctt ttgagacgga
gccgctctgt cgcccaggct ggagtgcagt ggcgtgatct 123840atctcggctc actgcaacct
ccgccttcca ggctcaaggg attctcctgc ctcagcctcc 123900tgagtagctg ggatatcagg
cgctgccaca acgcccagct aatttttgta tttttagtag 123960agacgcggtt tcaccatgtt
ggtcaagctg gtctcgaact cctgaccttg tgatccgccc 124020acctcagcct accaaagtgc
tgggattaca ggcatgagcc actgcgccca gccttttttt 124080ccccttctaa cactgttagt
tgtttagaga tacagaaaag aggagagaga gtgtgtgtgt 124140gtgtttaaaa acttagagtc
atactgattt aatatttgga ctctgcttca gccacttaat 124200ctgtcaaact atattcccaa
tcatttgtaa aattaagata gtaaagctta cataggagga 124260tcatagtaaa gtctgaagaa
gacaatgttt atatatacat gcctcatctg gtctgacata 124320cagtaatcat gcaatatata
ctaacgtttt attttatttt attttatttt ttgagacaga 124380gtctctctct gtcacccagg
ctggaatgga gtggcacgat ctcggctcac tgcaacctct 124440gcctcccagg ttccagcagt
tcttctacct cagcctccca agtagctggg attacaggcc 124500aaaaccacca cacccagcta
atttttgtat ttttactaga gacggggttt caccatgttg 124560gccaggctgg agcacagtgg
cacaatcttg gctcactgca agctccgcct ctcgggttca 124620ttctcctgcc tcagcctccc
tactaactgg gactacaggt gcccgccacc acgcccagct 124680aattttttgt atttttagta
gagatggagt ttcactgcat tagccagggt ggtctcgatc 124740tcctgacgtt gtgatccacc
tgccttgacc tcccagagtg ctgggattat aggcgtgagc 124800caccgcaccc agcccagcct
ttatcagtta ttatgagtga atatcatgtg agagttacct 124860ctggtttgat cagtttcagg
aaaatgccag tgaagggaag gcccctgcag aagacgtctt 124920taagaagccc ctgcctccta
ctgtgaagaa ggaagagagt ccccctccag taagaccaac 124980attgatcccc tggacctagg
gctggggctg gggatggttc cgagtagaag aggaagcgca 125040aaggctgatg ccttcctctg
gtgttggtct tttacctcac tatgtctccc gaataaggat 125100tcccatttct tttgagtaca
agcatgagat aaagttttct gtctgctaat gggggtatta 125160ctggagaacc agaggcagtt
atctggactc tttctctctg ccctgtgcca ttcttaccag 125220acgagatgcc tagccctttt
tatcatcttg ttcttgtcag ttctctaaat caccaaggaa 125280acccgttttc tcagcctcaa
tctttcctgc cttttggcat cacacaagaa tctcttagat 125340atggagtgca tgcgtggtca
tttttttata gtttctgcct gttcagagtg aatgatgcta 125400atattggtgc ccatttttta
gatgccttca agcagtagtc tcaacctaat caccagtgat 125460tctgattgaa tgcaggtata
taacaatagt gaccatgcat tatttattta ttttgagtga 125520tcatagacca atgattatgc
atcattattt aacagttctt ataaggtacc cttttcctgc 125580tccgcattat taattcagct
cattgtggca tctgtcttaa ccatgctttg cctttacctt 125640acatgtgagc tggatctgtc
tacccaagtg cctattaatg cagttgcttt tagtttactt 125700cctaaatcct ctttgctaga
gtcttaatga aagtcatctt ttcttccctc catgagttac 125760agtaatttgg aggtatttat
ctcttcctct ttgtaatttg taacctttta ctattttcta 125820tgtttatttt cctttctctt
ccttctcctc acattctgtt gctagagtca cttctaaagg 125880aatctttctt gtttattctt
aatgaacaag gagcaaagcc aagctctggc catgttgctt 125940tcatctggga aatgagcagc
atggctagtg agtttatttt gaacccaatt caatgaaatg 126000agatgcccat atcagaatat
caaaaaaaat ggaccccaaa atataggttg aatttggtat 126060tgatccctgg ccttctcctt
ccagcctaaa gtggtaaacc cactgatcgg cctcttgggt 126120gaatatggag gagacagtga
ctatgaggag gaagaagagg aggaacagac ccctccccca 126180cagccccgca cagcacagcc
ccagaagcga gaggagcaaa ccaagaagga gaatgaagaa 126240gacaaactca ctgactggaa
taaactggct tgtctgcttt gcagaaggca gtttcccaat 126300aaagaagttc tgatcaaaca
ccagcagctg tcagacctgc acaaggtatt aggggaagga 126360gctatgccct ttcaaactgt
tgactcttgg ccgggctttg tggctcatgc ctgtaatcct 126420agcactttgg gaggccgagg
cgggtggatt gcctgggctc agaagtacaa gaccagtctg 126480ggcaacatgg tgaaaccccc
tttgtactaa aatacaaaaa attagccagg tgtggtgttg 126540tgtgcctgta gtcccagcca
ctcgggaggc tgaggcagga gaattgctag aacctgggag 126600gcagaggttg cagtgagccg
agatcgtgcc actgcactcc agcctgggta acagagcaag 126660actccatctc ttaaaaaaca
aaacaaaaca aaactgttga ctcatattat tgatggggat 126720tatggggaat aaaaaagatt
atttaggccg ggcctagtgg tttacacctg taatcccagc 126780actttgggag gccaaggcac
ctaggtagat cacttgagat caggagtttg agaccagctt 126840ggccaacatg gtgaaactgt
ctctactaaa aatacaaaaa ttacctggat gtggtggcgc 126900atgcctgtaa tcccaactac
ttgggaggtt gaggcaggag aatcgcttga acctgggagg 126960caaaggttgc agtgaaccga
gatcacacca ctgcactcca gcctgggtga cagaccaaga 127020ctctatctca aaaaaaaaaa
aaaaaaaaaa aaaaagccgc agcagcttat acaatccttc 127080ctcagtgtat atcagcccca
gttcctatca ttaaaacagt ccaattcaag aatgaattgc 127140tctggattaa ggttatgcct
accctcaaag aacttccatg tataggccga agccaagcat 127200tatgactgtg gctagggtgc
caaatatgga ggatgggtag gaagagaaag ggttgtggaa 127260taggacatta cttgctgggt
ttctcatctt agctgtgtca ttaacgttac agttggacct 127320cagataagcc ccttttcttc
tttggtcctt gtaacttcat ctgattctat ccagctctga 127380cagtgtgcag ttttcaccat
aggtgagtca aattctgcca tttcttcatg tagtgaatat 127440tgttatgagc cacagcacaa
catctatact tgggatgtta aaccgacata cattggtctt 127500cccctgtagt attcccattt
atatgaactg accaaggatc caaattatgg acaaataaag 127560tccctaaatg gactcacatt
ctcagagcaa tttgtttcac accccttctc tagtagatgt 127620tgcaagagca ggtgatggaa
ctagattcag actttctctg aatacagagc tcaaagtttt 127680atttagctaa aagctgagaa
gttctgcttt tggtaatagg tacactactt ttcccagcca 127740tctctgtgga ggctttgcaa
agataggact ctgaaaagct cctgataatc cctggaacag 127800actacctccc atgtcctttg
acctgaagtt gtgagttgtc agactgacac attgaaattt 127860cacccatctg atgtaaatac
taataaatgg ctaaagagat aaaaagtaat cgtcaggaaa 127920gaggagccac aggtctggtg
aattcacaaa ctgaactggt cataggacag tggaaagtag 127980actgtagtac ttttcctttc
cttaaggtcg tctgctacaa agaaccacca cttcatgtaa 128040gagctgcttt ggactcctta
agtttcatac atatgtctga gggcttgtgt agtagagcca 128100tgcgtgagga atttgcaact
ctcagagcag tctcttggaa ccctggggct cctttccatg 128160tttctctggg ggctgaaaga
gtgactcatg tctgggaatg gtatgtatgg cagagtatgt 128220gggcatttgg ttttcttcac
tggtgtgccc acatcctctg tcccatgatt ttcaacttag 128280ataaagagat agatatttgt
ttcccacatc ttggagataa gtaaaatgat attcctctta 128340tgccatacca cataactaat
ctgcatgaca agaccagtta gggattgttg gttgcaggat 128400acagtgatca tttagtagat
ctgatcaatc aaaagagcta caatccaaaa gcaactattg 128460ggaaaggcct agaagcatct
ctaggaccat tgtttcttag acctatactc atagaattgc 128520ctctcttctc agcaaaacct
ggaaatccac cggaagataa aacagtctga gcaggagcta 128580gcctatctgg aaaggagaga
acgagaggta aactttggtg acctattact cccttgacct 128640cagctctttt tgctttctga
tatagacttc ataggctgtg ctgatccctc cttataagaa 128700gatggagaac aaaagcagcc
tcaaaagata gtgcatacat ttgccaaatt atataataca 128760atcaaaatag gtgcttttta
ttatttgtaa gtttatactt caatgaagtt gatatctttt 128820ttaaaaggtg gtgttagggt
ctctaggtag ataacactcc tctttcctgc ttagctttta 128880aattagttga gttaatgaac
aagtgttgaa tagcgctgct gaaatagcat cttttactat 128940taaaggctaa gctggaggaa
gtagcttagt gtcagagtca aatggacttg ctacctcaac 129000cacacagtta gggtgaatta
cccagtcata ggcttcactg gcctctctca tgatggttaa 129060gaacccacct atgggtcagg
cacggtggct cacgcctata atcccagtac tttgggaggc 129120tgagacgggc ggatcacttg
agctcacaag tttgaaacca gcctgggcga catggcgaaa 129180tcctatctct acaaaaaata
taaaaattag gtggacatgg ggtgtgtgcc tgtagtccca 129240gctacttgag aggctgaggg
aggatcgcat gagctgggag gcagaggttg cagtgagctg 129300agtttgtgcc actgcgctcc
agcctgggtc atagagccag accttgtctc aaaaaaaaaa 129360aaaaaaaagg aagccacctg
tggagagcca ggcacagtgg cacatgcatg taatcccagc 129420agtttaggag gctgaggtgg
gagaattgct tgagcccaag agttccaggc tgcagtgagc 129480tatgatcaca gccctgtact
ccagcctggg tcacagagta agtccctgtc tcaaaaccaa 129540acaaaagaat ccacctatgg
aggactgtta gagatagtga attcacaaac tgaactggcc 129600ataggacagt ggaaagtaga
ttgtagtatt tttcctttcc ttagagttgt ctactacaaa 129660gaaccacctc tccatgtaag
agctgctttg gactccttaa gttttatatt atatgcccga 129720gggcttgtat agtggagggc
ttgtgtactt tcccctgctt ctcagaaggg gaaaagacag 129780cggaaccaag cgtgccaact
tattctttcc aaatgtttaa gttaggaagt cactgctttc 129840tctagaagaa cgtgtaaagg
agtgagagat tccaggagtt accaagtgag ctactttcac 129900tttaaaagaa ataacaaggc
cgggtgcggt ggctcacacc tgtaatccca gcactttggg 129960aggccgaggc tggtggatca
tgaggtcagg agttcgagac tagcctgact aacatagtga 130020aaccccgtct ctactaaaaa
tagaaaaatt agctgggcat tgtggcactc acctgtagtc 130080ccagctactt gggaggctga
ggcaggagaa tcgcttgaac ctgggaggcg gaggttgcag 130140tgagctgaga tcacgccagt
gtactccagc ctgggcaaca gagtgagact ctgtctcaag 130200aaaaaaataa taataataac
agcaatgggg tagaatttcc ccactcccca attccctcag 130260gtggcaatct caggtctgct
cttctgctta ccaacaggga aagtttaaag gaagaggaaa 130320tgatcgcagg gaaaagctcc
agtcttttga ctctccagaa aggaaacgga ttaagtactc 130380cagggaaact gacaggtaag
ccaggaactc ttcattcagc ctaggcctca agcctaatga 130440taaaaccacc tcctccttca
actgtactgc tgttttctgt ctcagggaga tgatattatg 130500agtagattct gtctgaactg
ctaaaacatg aggtctatgc cagccttttt actatctgtc 130560tttatacggg gagtgtacat
ggaaggttgg ctggcagctt cgccttccca aagccagggc 130620tggagtagcc atgatcggga
accctttctg tcttcatcag taatactgca ccctctttac 130680gggcctgata agaatgtcac
actcttgggc tttttctcta gggaacctcc attctcacac 130740ataggtgcta aataaatggt
tggctgctga tggagatgta tgatatctag cttcctatac 130800ttgttttcag tcagctagtt
cccaagttgt aagcccagag ttatatagaa tttgttgata 130860acccactgtt tacaggtgtc
aagtgcaaga aatactcagg tggacaagac atagattatc 130920cttgactgaa cacagaatag
acaagactta ggtgatggtg cgtctcatag ggcagacaca 130980gaaatcagtg gggaagggaa
gggcatttca gggaatttca tataccaggg atataagagc 131040ttatgatgtg tttgaggagt
tgcaaatagt ttgatggtcc tgaacactgc aggtatattg 131100ttgagtgaca gtagataagc
ctggtccaaa agatgcaggc cagttcatga agtttaaaca 131160ccttgaacac cttgctaagg
ctttatctta aaggcagtgg acggtcatgg aataatttta 131220agcagggtat tgacttagct
ttgcattttg gagagattac taatcatgtg gaagatgagt 131280ttgtagagag actaatgcat
tatgcaaatt ctatagtaat tcaagtgaaa gatcatgatt 131340gcctgagtga aggtgatgag
tctagaaagg agagtggcct ataatcccaa cacagagagg 131400ctgaggaagg aggatctctt
gagcctagga gttccaggcc agcctaggca acatagggag 131460aagggagacc ctgcctctat
ttaaaaaaag aaaagaaaag gagtgtggct tagagagagg 131520tgtcagatct gccagtcttt
gtgatcacct ggggaaaggg agaagtcact gatggtgttc 131580aggtctctgg tctctggata
gctaggagga gaagggacag taaagtcctt gaaaaggaaa 131640aatgggggcc aggcgtggtg
gcttacgcct gtaatcccag cactttggga ggccgaggcg 131700ggtggatcac aaggtcagga
gttcgagacc agcctggcca agatggtgaa accccttctc 131760tactaaaaat ataacaatta
gctgggcgct gtggcaggcg cctgtaatcc cagctactca 131820ggaggctggg gcagaagaat
cgctcaaacc tgggaggcag aggttgcagt gagctgagat 131880catgccactg cactctagcc
tgggtgacag agcaagactc tgtctcaaaa aaaaaaaaaa 131940aaaaaaaaga aaaggaaagg
aaaaatgggg ccaggtgtgg tggctcacac ctgtaatccc 132000agcactttgg gaggctgagg
caggtggatc acttaaggtc aggagttcga gaccagcctg 132060gccaacatgg tgaaaccctg
tctctaccaa aaatataaaa aaattagcca ggcgtggtgg 132120tgggtacctg taatcccagc
tactcgggag actggggcag gagaatcgct tgaacatggg 132180aggtggaggt tgcagtgagc
caagattgca ccactgtact ctagcctggg taatagagcg 132240agactccaaa tcaaaaaaaa
aaaagaaaag aaaagaaaag gaaaagtggg taacaagtgg 132300atgcatgagc agaaggaaag
ggagataatt gacagagcaa ggcccttgag gaggctggac 132360aggttttggg gctctggcat
tccagcttat ttgatccaac ccacaataag agaagtattt 132420ttgtatcatg gcccaataat
aaagtgtgtg tgtgcacaac tgaaaaagtt ttcatctaaa 132480atactttctt accaggtaca
gtgaaccctg atatttttat tcaagtctag tctctcttca 132540tttttatgag ttgttacagt
gggaccattt agtgtgacat tccattgggt cattctctgc 132600aatttgaaat acagtggatt
aggactaggt gaaggagtca gccatcagga ggaaggacac 132660cttggccttg agtcttctgg
gacaaggctt aggtggggtg cggaaagaga cccttcttta 132720ttctcagcac cctttatacc
acattctcct ggctcttctc ctttccagtc accttttctg 132780ctcctcttcc ttttctggat
aaatccaggt gttctctagg actcttctct cagtgcttct 132840ttggtcttgc tgctctaccc
tcttgacctg ggctttctaa ggtacccatg gcctcaacca 132900ccaccacagt ctaacaagtc
caaatctcct gtattattat ttcagagtag cagcatcata 132960gcatcactgt ctacatggtc
tgatccatcc tcctccttta tcccctgtgt ccaattagtg 133020accaaatccc taattaagtc
ttgccccctg tcttagtctg ttttatgata ccataactga 133080ataccacaga ctgggtaatt
tataatgaac agaaatttat ttggctcatg cttctggagg 133140ctgggaggtc caagattgag
gagctgcatc tggtgagggc cttcttgctg tgtcacctca 133200tggtggaaag taaaagaaca
agagagctta ggcaaaagag ggggttggga gaaagaaacc 133260agactaatca ttttatcagg
agaacccact cctgcaataa cagcattaat ccatttgtga 133320gggcagagct ctcatgacct
aatcacttcc tgaagtttca cctctcaata ctgttgcatt 133380ggggattatg tttccaacat
atgtactttg agggacacat ttaaaccaca gcatctccca 133440ttctattcca cctccacact
ggactcctac tcccagtctt tgctcccact gttcttcagt 133500ccattctcta ccctgccacc
aaaatgactt ttgtaaagag aaatctactc ttataacttg 133560tctttttaca aaccgtatac
cttgcctaca gggaggcctg agctccaact tttgccagaa 133620ggatgaggtt cagagacatg
atttagctta ataagttcaa ggttttttac agtctgaccc 133680catgcagcct tttttttttt
tccttttgtt ttgagacagt ctcattctgt cgcccaggct 133740ggagtgcaat ggcacgatct
tggctcactg caacctccgc ctcccaggtt caagcgattc 133800tcctgcctca gcctccccag
tagctgggac tatgggctaa tgtttgtatt tttagtagag 133860aggggtttca cctgttggtc
agggtggtct cgaactcctg acctcaggtg atccacccgc 133920cttggcctcc caaagtgctg
ggattacagg cgtgagtcac tgcacccggc caccaagcag 133980ccttaccttt gtcagtttct
actactactc tcttggacaa attgtctttt gtgtctcctt 134040gcttgtgtcc tccttttctc
ttacacaaac tccttatttc gagatccaat tcagatgtat 134100cttcctgttg aaattcctgt
catttttggt gatgcccctt cagagttttc gttccttcta 134160ctgcatttct tttttttttt
tgaaacagag tttcactctt gttgcccagg ctggagtgca 134220atggcgcgat atcagctaac
cacaacctcc acctcctggg ttcaagcgat tctcctgcct 134280cagcctcccg agtagctagg
attacaggca tgcgccacca cacccggcta attttgtatt 134340tttagtagag acagggtttc
tccatgttgg tcaggctggt cacgaactcc caacctcagg 134400tgatctgccc acctcagcct
cccaaagtga ttccttctac tgtatttcta tagcagacat 134460ctactgttgc tacatccatg
gttgagctct cttcaatgtt ctataagcat ctcttgacat 134520aatgtttgag acctttcttg
tgaacagggc catatcttag tagtctgtgt acccagcaac 134580aaaacatagc tatcaggcac
tcagaggtac tgttaaatat acttacttaa taagaggcag 134640atatgaatca agaggacaga
gattttatat taggcttata agcaggtctt catcaaaatg 134700atggtgtcag gttgggcatg
gtggctcatg cctgtaatcc agcactttgg gaggccaagg 134760catgcggatt acctgaggtc
aggagtttga gagcagcctg gccaacacag tgaaactctg 134820tctctactga aaaaaaaaaa
aattaaaaat tagccaggtg tggtggcggg cacctgcaat 134880cccagctaat cgggaggctg
aggcaggaga atcgcctgaa cccaggaggc agaggttgca 134940gtaagctgag ttcgagccat
tgcactccag cctgggcaaa aagagtgaaa ctccgtctca 135000aaaaaaaaaa aaaaggaagt
gatggtgtct gcttcttttg cagtgatcgt aaacttgttg 135060ataaagaaga tatcgacact
agcagcaaag gaggctgtgt ccaacaggct actggctgga 135120ggaaagggac aggcctggga
tatggccatc ctggattggc ttcatcagag gaggtaaaat 135180ggtttccatc ttttgggggg
tgacatgaac ctggaatgta attaactttc actttctggc 135240ctagagtgat gtctttgcca
ttttgctggg ctttctctac tgctgggata ggacatgaga 135300gttgaacact ttagccttga
atactgggtt atagcttggc aggctgggcc ctttgcagtt 135360tggagttagg aagagaagga
aggagttgga atggatttca tcatactttt acatggagta 135420aatagtagag cagtatctga
ggcagtttga gactgaagaa tcatttgggc aaaagaacca 135480gggaatcagc aatgaaaggt
acagaggcat ctctgagagg gactgtcagc ggaagtcttt 135540ggtggctaaa atttaaggag
catgttgttc tggttcccat gaaggacttt gcccctcata 135600tttcaagagc ctctagaaaa
ggtgataaga ggaaacatta cccattttgt gttggcttgc 135660ttctcctctg aaaatgccaa
ccataagaga ttggcttatt tctctcctac cgagtttctc 135720atatctctgg tattaaagcc
tgtatcttgc aatcatagca tcaccaccca ccttaattca 135780tcttgggtat ttgtttaata
atgaaagatt cttttctttt tttttttttg agacagagtc 135840ttgctctgtc gcccaggctg
gaatgcagtg gtgcgatctc agctcactgc aacctcctcc 135900tcccaggttc aagcaattct
cccaccccaa cctcctgagt agctgggatt acaggtgcat 135960accaccatac ccagctaatt
tttgtgtttt tagtagagac agagttttgc catgttggcc 136020aggctggtct cgaactcctg
gcctcaagtg atccgcccac ctcagcctcc caaagtgttg 136080ggattacagg cgtgagccac
tgtgcccggc caaaagattc tttaaaaaaa ttatcctgcc 136140agggtccggg cgcagtggct
tatgcttgta atcccagcac tttgggaggc cgaggtgggt 136200ggatcacaag gtcaggagtt
cgagaccagc ctgaccaata tgatgaaacc cctgtctcta 136260ctaaaaatac aaaaattagc
tgggtgcagt ggcgcgcgcc tgtaatcaca gctactcagg 136320aggctgaggc agaagaatcg
cttgtaccgg ggaggcagag gttgcagtga gccaagatct 136380tgatcgtgcc actgcactcc
agcctgggtg acagagcgag actctgtctc aaaaaaaaaa 136440ttattctgcc aggtgtggtg
gctcacatct gtaatcccaa cactttggga ggccaaggtg 136500ggcggatcac ttgaggccag
gagttcgaga ccagcctggc caacatggcg aaaccctgtc 136560tctactaaaa atacaaaaat
tagccgggcg tggtggcagg cgcctgtagt cccagctact 136620cagaggctga ggcacaagaa
ttgcttgaac cggggaggca gacttgcagt gagcccagat 136680cgcaccactg cactctagcc
cgggcgacag agcatgactc catctaaaaa aaaaaaaaaa 136740attatcctat atactgcttc
ttactagtcc agaaatgcct gtggtcaaag accagcgctg 136800aggctaatta atctataggg
cccacttcat agtttgtctt tgttttacag gctgaaggcc 136860ggatgagggg ccccagtgtt
ggagcctcag gaagaaccag caaaagacag tccaacgaga 136920cttaccgaga tgctgttcga
agagtcatgt ttgctcgata taaagaactc gattaagaaa 136980ggagacaagt tccatgggat
acaacctccc tcttgttttg tttgtctctc cttttctttt 137040gttactgttc ttgctgctag
aactttttta aataaacttt ttttcaatgt g 137091
42 2604 DNA Homo sapiens
42 ggggaggagc caagggggcg agcaagctcg gtggctgggt gggttggggc gttccgcgcg
60cccttcattg aagcggcggt ggccgggctg ggcgccggta gtggaaagcg acggcgcggc
120tggaaaatgc cagtccattc ccgaggggat aagaaggaga ccaaccatca cgatgagatg
180gaggtggact acgccgaaaa tgaggggagc agctccgagg acgaggacac tgagagctcg
240tcggtctccg aggatggaga tagctcagaa atggatgatg aagactgtga aagaagaaga
300atggaatgtt tggatgaaat gtccaatctt gaaaaacagt ttaccgatct caaagatcaa
360ctttataaag aacgattaag tcaggtggat gcaaaactac aagaagtcat agctggaaaa
420gcaccagaat acttggaacc gctggcaact ttacaggaaa atatgcaaat tcgtacaaag
480gtagcaggaa tctatagaga gctctgctta gaatctgtaa agaacaaata tgaatgtgaa
540attcaagctt ctcgccagca ttgtgagagc gaaaagctgt tgctatatga tacagtccag
600agtgaactag aggagaagat aagaaggctt gaagaggata ggcacagcat tgatattacc
660tcagagctgt ggaatgatga gcttcagtca agaaaaaaga ggaaggatcc tttcagtcct
720gacaaaaaga agccagttgt tgtttcaggt ccatatatag tttatatgct acaagatctt
780gatattcttg aagactggac aacaattagg aaggcaatgg ctacattggg gccacacaga
840gtgaaaacgg aaccacctgt gaaactggaa aaacatctgc acagtgctag atctgaagag
900ggaagactat attatgatgg tgaatggtat atacgtggac aaacaatatg tattgataaa
960aaagatgaat gtcctacaag tgctgtaatt acaacaatta accatgatga agtttggttt
1020aagaggcctg atggaagcaa atctaagctt tacatttcac agctacagaa aggaaaatat
1080tcaattaaac attcataatc atgatttaag tgttatctaa atttacctta ttagtgttac
1140caaatgtaag tgccatgaga gtaaaaaaat gtattcaata acttaatatt ctcactgaat
1200catgagagaa tgtgtatttg taggtagtac tctaaataga tctcattgat atgttattaa
1260aagaaacagt aataaaaatt ttatcacgat ccttacgttg atttgcctct taggtccgat
1320gaccaatagg tattctgtat atggtagggg tttctttcta aacatttttc tttggtttta
1380aaaaaagtta tgcaaatttg tcttatcttt agtaaactat gactacattt atctgcaatt
1440tttaaaattt tccatatctt tgtcattcat tgtgtgtttg taaataaggc cgatagaatg
1500tttcctataa atggtttgta ctagtacatt agtgttaaac cagaactgaa atttaaacat
1560atatatatat gaggatgtat atatggcatc atcagcttat ttagaactga tggccatacc
1620ttacaatctt gttttaccca aaattaagct attggggttg aaagctaaaa ggagcacttt
1680tgtagaatag caacttttct tttcctcttt cttgattgta tggtggggtg gtgacctatt
1740tttacaaatt atacctaatg agtaaaatta gtgtaaagtg ataacatgct tctacctgta
1800tttctagtga ccctttagcg gcaggtattt atacctggta tttatgatgc agtatataag
1860tggtgaacaa taactgacag tattgtgctt gctgtacatg tctggtcttt tgaaacagat
1920tttagtaagc attttccaga ggtaaaactg tgtccttatt ctaattttat tcctagggca
1980aagtagacag ggattatttc cttgaatcta tttccaaatt aatatttttt tctttggtat
2040ttctacactt taaggccatt tggtgcaatt tagaaagtgt tggcctccct tccgctagcc
2100acattcaaaa ttaacttcca aaacctcagg aacagtacaa agaattgaaa ccctcaatat
2160ggcagcacag ccggctgtag tgtatattta gggtacacca aatcaggtat tcctggtggt
2220cttgtgcact ttaatttctg ttacaatgag ttaagaggat gaggaagaaa tctacttatt
2280aacacttact gcagaaatgt ctgcattatt ccgtttgttt tcttattatt ttacctctcc
2340aaacatcttc ctgtgcagat cactacttca tagttgccaa attttaaaac acttaactgc
2400tgaaattcag tgtcagcaaa gtgatattac gttgttctgt ttctaattaa ccttagcaaa
2460tgtacataat gtcaaaaccc aatagtattt gacagtactt atgtatacaa tgtttgataa
2520gcatttttaa taagatttgt atttttaaat ttagtatata ataaaaagat gtgtttcagt
2580gtgaaaaaaa aaaaaaaaaa aaaa
2604
43 3281 DNA Homo sapiens
43 gcggtgcggg ccgggcgggt gcattcaggc caaggcgggg
ccgccgggat gctcagggtt 60ccggagccgc ggcccgggga ggcgaaagcg gagggggccg
cgccgccgac cccgtccaag 120ccgctcacgt ccttcctcat ccaggacatc ctgcgggacg
gcgcgcagcg gcaaggcggc 180cgcacgagca gccagagaca gcgcgacccg gagccggagc
cagagccaga gccagaggga 240ggacgcagcc gcgccggggc gcagaacgac cagctgagca
ccgggccccg cgccgcgccg 300gaggaggccg agacgctggc agagaccgag ccagaaaggc
acttggggtc ttatctgttg 360gactctgaaa acacttcagg cgcccttcca aggcttcccc
aaacccctaa gcagccgcag 420aagcgctccc gagctgcctt ctcccacact caggtgatcg
agttggagag gaagttcagc 480catcagaagt acctgtcggc ccctgaacgg gcccacctgg
ccaagaacct caagctcacg 540gagacccaag tgaagatatg gttccagaac agacgctata
agactaagcg aaagcagctc 600tcctcggagc tgggagactt ggagaagcac tcctctttgc
cggccctgaa agaggaggcc 660ttctcccggg cctccctggt ctccgtgtat aacagctatc
cttactaccc atacctgtac 720tgcgtgggca gctggagccc agctttttgg taatgccagc
tcaggtgaca accattatga 780tcaaaaactg ccttccccag ggtgtctcta tgaaaagcac
aaggggccaa ggtcagggag 840caagaggtgt gcacaccaaa gctattggag atttgcgtgg
aaatctcaga ttcttcactg 900gtgagacaat gaaacaacag agacagtgaa agttttaata
cctaagtcat tcctccagtg 960catactgtag gtcatttttt ttgcttctgg ctacctgttt
gaaggggaga gagggaaaat 1020caagtggtat tttccagcac tttgtatgat tttggatgag
ttgtacaccc aaggattctg 1080ttctgcaact ccatcctcct gtgtcactga atatcaactc
tgaaagagca aacctaacag 1140gagaaaggac aaccaggatg aggatgtcac caactgaatt
aaacttaagt ccagaagcct 1200cctgttggcc ttggaatatg gccaaggctc tctctgtccc
tgtaaaagag aggggcaaat 1260agagagtctc caagagaacg ccctcatgct cagcacatat
ttgcatggga gggggagatg 1320ggtgggagga gatgaaaata tcagcttttc ttattccttt
ttattccttt taaaatggta 1380tgccaactta agtatttaca gggtggccca aatagaacaa
gatgcactcg ctgtgatttt 1440aagacaagct gtataaacag aactccactg caagaggggg
ggccgggcca ggagaatctc 1500cgcttgtcca agacaggggc ctaaggaggg tctccacact
gctgctaggg gctgttgcat 1560ttttttatta gtagaaagtg gaaaggcctc ttctcaactt
ttttcccttg ggctggagaa 1620tttagaatca gaagtttcct ggagttttca ggctatcata
tatactgtat cctgaaaggc 1680aacataattc ttccttccct ccttttaaaa ttttgtgttc
ctttttgcag caattactca 1740ctaaagggct tcattttagt ccagattttt agtctggctg
cacctaactt atgcctcgct 1800tatttagccc gagatctggt cttttttttt tttttttttt
ttttttttcc gtctccccaa 1860agctttatct gtcttgactt tttaaaaaag tttgggggca
gattctgaat tggctaaaag 1920acatgcattt ttaaaactag caactcttat ttctttcctt
taaaaataca tagcattaaa 1980tcccaaatcc tatttaaaga cctgacagct tgagaaggtc
actactgcat ttataggacc 2040ttctggtggt tctgctgtta cgtttgaagt ctgacaatcc
ttgagaatct ttgcatgcag 2100aggaggtaag aggtattgga ttttcacaga ggaagaacac
agcgcagaat gaagggccag 2160gcttactgag ctgtccagtg gagggctcat gggtgggaca
tggaaaagaa ggcagcctag 2220gccctgggga gcccagtcca ctgagcaagc aagggactga
gtgagccttt tgcaggaaaa 2280ggctaagaaa aaggaaaacc attctaaaac acaacaagaa
actgtccaaa tgctttggga 2340actgtgttta ttgcctataa tgggtcccca aaatgggtaa
cctagacttc agagagaatg 2400agcagagagc aaaggagaaa tctggctgtc cttccatttt
cattctgtta tctcaggtga 2460gctggtagag gggagacatt agaaaaaaat gaaacaacaa
aacaattact aatgaggtac 2520gctgaggcct gggagtctct tgactccact acttaattcc
gtttagtgag aaacctttca 2580attttctttt attagaaggg ccagcttact gttggtggca
aaattgccaa cataagttaa 2640tagaaagttg gccaatttca ccccattttc tgtggtttgg
gctccacatt gcaatgttca 2700atgccacgtg ctgctgacac cgaccggagt actagccagc
acaaaaggca gggtagcctg 2760aattgctttc tgctctttac atttctttta aaataagcat
ttagtgctca gtccctactg 2820agtactcttt ctctcccctc ctctgaattt aattctttca
acttgcaatt tgcaaggatt 2880acacatttca ctgtgatgta tattgtgttg caaaaaaaaa
aaaaaagtgt ctttgtttaa 2940aattacttgg tttgtgaatc catcttgctt tttccccatt
ggaactagtc attaacccat 3000ctctgaactg gtagaaaaac atctgaagag ctagtctatc
agcatctgac aggtgaattg 3060gatggttctc agaaccattt cacccagaca gcctgtttct
atcctgttta ataaattagt 3120ttgggttctc tacatgcata acaaaccctg ctccaatctg
tcacataaaa gtctgtgact 3180tgaagtttag tcagcacccc caccaaactt tatttttcta
tgtgtttttt gcaacatatg 3240agtgttttga aaataaagta cccatgtctt tattagattt a
3281
44 1155 DNA Homo sapiens
44 cgcctgtctt ttccgtgcta
cctgcagagg ggtccatacg gcgttgttct ggattcccgt 60cgtaacttaa agggaaattt
tcacaatgtc cggagccctt gatgtcctgc aaatgaagga 120ggaggatgtc cttaagttcc
ttgcagcagg aacccactta ggtggcacca atcttgactt 180ccagatggaa cagtacatct
ataaaaggaa aagtgatggc atctatatca taaatctcaa 240gaggacctgg gagaagcttc
tgctggcagc tcgtgcaatt gttgccattg aaaaccctgc 300tgatgtcagt gttatatcct
ccaggaatac tggccagagg gctgtgctga agtttgctgc 360tgccactgga gccactccaa
ttgctggccg cttcactcct ggaaccttca ctaaccagat 420ccaggcagcc ttccgggagc
cacggcttct tgtggttact gaccccaggg ctgaccacca 480gcctctcacg gaggcatctt
atgttaacct acctaccatt gcgctgtgta acacagattc 540tcctctgcgc tatgtggaca
ttgccatccc atgcaacaac aagggagctc actcagtggg 600tttgatgtgg tggatgctgg
ctcgggaagt tctgcgcatg cgtggcacca tttcccgtga 660acacccatgg gaggtcatgc
ctgatctgta cttctacaga gatcctgaag agattgaaaa 720agaagagcag gctgctgctg
agaaggcagt gaccaaggag gaatttcagg gtgaatggac 780tgctcccgct cctgagttca
ctgctactca gcctgaggtt gcagactggt ctgaaggtgt 840acaggtgccc tctgtgccta
ttcagcaatt ccctactgaa gactggagcg ctcagcctgc 900cacggaagac tggtctgcag
ctcccactgc tcaggccact gaatgggtag gagcaaccac 960tgactggtct taagctgttc
ttgcataggc tcttaagcag catggaaaaa tggttgatgg 1020aaaataaaca tcagtttcta
aaagttgtct tcatttagtt tgctttttac tccagatcag 1080aatacctggg attgcatatc
aaagcataat aataaataca tgtctcgaca tgagttgtac 1140ttctaaaaaa aaaaa
1155
45 784 DNA Homo sapiens
45 gcccacgcgc cagagtcgca gtgggcgggc ctacgtgctc cgcccgctgt gagcctgtcc
60ggcccccgcc cgctccggag caacccgcga gcttacaccg gcttctctct gtcctcagcc
120cgcgcgccgc catcgccgtc atgctgggcg ccgctctccg ccgctgcgct gtggccgcaa
180ccacccgggc cgaccctcga ggcctcctgc actccgcccg gacccccggc cccgccgtgg
240ctatccagtc agttcgctgc tattcccatg ggtcacagga gacagatgag gagtttgatg
300ctcgctgggt aacatacttc aacaagccag atatagatgc ctgggaattg cgtaaaggga
360taaacacact tgttacctat gatatggttc cagagcccaa aatcattgat gctgctttgc
420gggcatgcag acggttaaat gattttgcta gtacagttcg tatcctagag gttgttaagg
480acaaagcagg acctcataag gaaatctacc cctatgtcat ccaggaactt agaccaactt
540taaatgaact gggaatctcc actccggagg aactgggcct tgacaaagtg taaaccgcat
600ggatgggctt ccccaaggat ttattgacat tgctacttga gtgtgaacag ttacctggaa
660atactgatga taacatatta ccttatttga acaagttttc ctttattgag taccaagcca
720tgtaatggta acttggactt taataaaagg gaaatgagtt tgaactgaaa aaaaaaaaaa
780aaaa
784
46 5740 DNA Homo sapiens
46 cgccgccgcc gcacgccgcc tgcctcctgc acgccgccgc
cgcgcctagc gcccgggccc 60gcgacaccgc ccgctaagcg ccgggccgag ttcacgcagc
cgcggtctgg cggctccgcg 120gcggcggcgg gtgcgggcgg cctggccggt gccggttaaa
gggacgagtt gcaaacactt 180caggaagtga caagtcgatt tcctcctccc cgggagtcgc
tcgtacaaag cgctcggcgc 240cggcaggcga gcgtgcgcgc ggcggacgcg cggcgggcac
cccggacgac ttggcgagcg 300ctggcggtga cggcgcgggg tccgcgcccg gagcgccccg
ccgcgcacag gagttgacca 360catttggcca tttcccagaa gggccccacc ccaagggtga
gtggccaatg gggagctgtt 420tctgctgaca tcaattcccc aggaggtact caccccaagt
ctgcccaagt gaagatggct 480gatacccacc ctgggatgga gcccagcgcc tgaggccctt
atcatggtga tggtcctaag 540tgaaagcctc agcacccggg gagctgactc cattgcatgt
gggaccttca gccgtgaact 600gcacacgcca aagaagatga gtcaaggacc tacacttttc
tcttgtggaa ttatggaaaa 660tgacagatgg cgagacctgg acaggaaatg ccctcttcag
attgaccaac cgagcaccag 720catctgggaa tgcctgcctg aaaaggacag ctcactatgg
caccgggagg cagtgaccgc 780ctgcgctgtg accagtctga tcaaagacct cagcatcagc
gaccacaacg ggaacccctc 840agcaccccct agcaagcgcc agtgccgctc actgtccttc
tccgatgaga tgtccagttg 900ccggacatca tggaggccct tgggctccaa agtctggact
cccgtggaaa agagacgctg 960ctacagcggg ggcagcgtcc agcgctattc caacggcttc
agcaccatgc agaggagttc 1020cagcttcagc ctcccttccc gggccaacgt gctctcctca
ccctgcgacc aggcaggact 1080ccaccaccga tttggagggc agccctgcca aggggtgcca
ggctcagccc cgtgtggaca 1140ggcaggtgac acctggagcc ctgacctgca ccccgtggga
ggaggccggc tggacctgca 1200gcggtccctc tcttgctcac atgagcagtt ttcctttgtg
gaatactgtc ctccctcagc 1260caacagcaca cctgcctcaa caccagagct ggcgagacgc
tccagcggcc tttcccgcag 1320ccgctcccag ccgtgtgtcc ttaacgacaa gaaggtcggt
gttaaaaggc ggcgccctga 1380agaagtgcaa gagcagaggc cttctctaga ccttgccaag
atggcacaga actgtcagac 1440cttcagcagc ctcagctgcc tgagcgcagg gacagaggac
tgcggtcccc agagcccctt 1500cgcccgccac gtcagcaaca ccagggcctg gaccgccctg
ctctcagcct ccggcccagg 1560gggcaggacc cccgctggga ccccggtccc tgagcctctt
cccccttcct tcgacgacca 1620cctcgcctgc caggaggacc tgtcctgtga ggagtcagac
agctgcgccc tggacgagga 1680ttgtggcagg agagcggagc cggctgcagc ctggcgggac
cgcggggccc ctgggaacag 1740cctctgctcc ctggacggcg agttggacat tgagcagata
gagaagaact gagggggtgt 1800gggcccaggc agggctgggg tgtgctggca tcgacagccc
ccactctggg cactaggtgg 1860gcccttgaag gggagcccaa ctcgtgggcc tgatgaaagc
ttcctgagtg gtgtcgggtc 1920ccagagaggg agcccacctg ctgcctgggg gagagcctgg
cctggccgcg tcatacagcg 1980ggtgtgtcag cctctcaccg gctccccgag cgtggcagcc
accaggtcca cagaactact 2040gcagcccaga ggacagcttt gaagtttgcg tcttttctgc
ctctttccct gtgggatgtt 2100gggcagtctc tgttgtcccc ggcagagctg ggcaccgctc
tgtatccccc tggtggtggg 2160ggctgtcagg gagggcctgg ggtgggggcc aggggccatc
tgctatgtca gggcccttct 2220tggcctcact caggttcact tctggggagt cggccccgca
gcttctttca ctcagtttta 2280ctccgtgcct tctctcccag gtctccctgc ttcaggcttg
ggaaggttcg ggagatgctt 2340ccttctgtaa caccagaacc atttggcctt aattccaatg
tgagagacag aatccctggg 2400gtgctggact ggccctccag agggtaagcc atgtccggag
tctcgggccc aaggaacgat 2460ttggagggtg cttgttaggg cctcccgtgt tgggtagaaa
tttggtggat ctgttggctg 2520aaaagacgga cttgcttgcc tctcctacag catggagagg
ctgaccccat ggctctgcca 2580ccgttggggc agggttagca gatggcagcc cttctctgtg
gctgacaggt cactgagtga 2640taagcatggt tggttccggt gagtgtaggg atggcacgat
accagggcag cctcttgaaa 2700acggcctcgg gagacgggag ctgcgagcag gtgggcagat
gagggcccta tgcgcactca 2760ggggtgaagg gcgtccgctg gccactctgc aggggcccct
gcaggattcc aggcacctcc 2820cgtttgtcct tgaggactgc tggctgtaac cagggcacat
cacccacctc aagacaagcc 2880cacgcccttg tcagcttagg gggagcccag tcctgagggc
tgcatctctg ttgtaggccc 2940agccaccggc acaaagctgg attcatgctc cctgccccta
ccccaccctg gctcctcacc 3000ctggggcatc cgaggagcct agccccctga gggtttgctc
tcctctcaag gtttgtagct 3060cctctccggc tgccttgcag acaccaccac atgggctctg
ctctatggga atctggcttt 3120tagcgaatgt ggcgtcttct gcaaacaata gcaattgggc
tggcttagga gcaagtggct 3180cattttccca taaggctaaa aataactggt gcgctccctt
gtgttggctg acacgcgcgt 3240tcaaagcact tttgtagtca ctttgctttt gctcgtcttc
atggacgagt gaacgcctcg 3300cttctgcagg ttgagtccag atgcttctca ccttctttct
cctcaagaaa gatgcttttt 3360gggaaacgtt gtttaaatct tattttttta ctacatcaaa
aggatggtgg ttcaagttcc 3420caatatgtgg gtggcacttc ttaaaaatca gctttaagga
gctggcagaa agcccccagc 3480cccacagccc tgagagatgg tgttgctagc tcaggtggct
gacacatggg gtatgccggg 3540cactgggcag gtcccagagc cggggaacca gctcacctct
ggttgctgta gctcctgccg 3600gaggcatgtc tacttgtgat cccggacagc cgaacccaag
agctggtggc tctgagcaga 3660cagagacatc ttggcctgtc cctgcctggg ggtcatggag
accatgtctt cttagagcaa 3720atgtggaggc ggccagggca gttgttgggt gaatgtggag
agcacatggc catgtcttgc 3780ccccggagta ccactgggcg tggggggtcc tggcaccaca
tgcccggtgt ggccgagggc 3840acacagcctc tatagcaggc cttcctgtgg aaggcagagg
cagtgaggga ggtggacggt 3900gccagctgag gctgaggcat gcagcagccc ccagctacct
ttgcttaggg ctggggtggg 3960aggcacatgg tgacaggtat atgtcgtggg actggggtgt
gggtgacctg ccctcaaacc 4020ttgcctgcca cctccccatt caggcctggt ggcaggaagg
gacaagctgt ggagctggct 4080gagtcacagc cacctcccca cctccccgca agctggtccc
atcgaccagc aagcccagcc 4140ccagggcgct tagggagaaa tgacccagcc tcctcagacc
ccgcctgcct gtcctgtgcc 4200caccacgcag cagtcagggg agaaaatggt ggctatccct
tctgcttaga gaaagaaatg 4260gcctttagct ggtttcatgt ttgtgttttg actggaggga
gtagacccta tctataaggt 4320gccaccccat catccaagct gccacactgc ccggagcagc
ctgttcctgc actccaccct 4380gctggcccca ggacttctga tctcagtcct ctgggaggga
ggttcgccta ggaggtgccc 4440cccacattgg tgtccccatg ggcagcaggc agacagctca
cccccaccag catgatggcc 4500ccagctgggg gcagtggcag gagccttact tttgtcacag
ccttgcccac aaaccctgcc 4560tctgagggga gactgaggaa gggcagagcc agaagcaagc
cgtgccaggc catctgcctg 4620ctcatggggt cctaaagcgc gggctaagcc tgcaggaaag
ccggggcggt ggggggggct 4680tagtgccaca tgcaccccac tcattccaaa gccaccaaac
tgccaggggc tgccgtccac 4740ccgtggggcc caggggctgg ggccacagcc ttgccatttt
cgttgccata ccctcttgcc 4800ttactcgcgg tggaggccgg atttgcacgg gcagacgtgc
acctgggccc gtggggagct 4860tgttctgacc agacgtacag attttcattc tcagaaagcc
ttacttttca accaaatttt 4920tgtagccagt tttgtgaatt tgtacactga aagaaaattt
aaataaaggg gaagtccaca 4980ttaaaaagaa aacaaaacaa accctaacta acttccaaat
gggtctcctg gtgcgggggc 5040gtgagtggcc gtgccctggg tgtgctgcct gtctgagcaa
gcttccctag ctgtggaacc 5100ccgggccccc tgctgcgggc tctgccttgg tgtcatgcct
gctgcacccc cgtttccact 5160gacgtgccgt ctgtggctat gggggtggtc actggaatga
cggtcactcc agacgtcagc 5220cggcagggat gcagcaggct ggccgcgcac cggggctcgg
gcaccctctg gccccacact 5280ggcaatgatg ccacaccttg ccatgtccac gctgttggtc
aaacccctct gtcatgcctc 5340tttaaagaga aaagaagaga aagatttttt ttttttttaa
tggcagaccg aagtggagat 5400cttgtagcct agataggata gtctgacctt ctagcatagt
ctttttggca aatgatttgt 5460gttttcagtg tgtggggaag ctgtcctggg ggctggggcg
acagatagca cataggctgt 5520ttctggggct gcaggggctt ccctgagctg gatgttgtgg
gtgttgccgt gcttcaggaa 5580gtgtggcgac cagaaagcgt agacccgggg cccagggtct
gcccgcccct gcagcctggc 5640ctccccgcac aggctgtggc ttgcactcca gccgctctag
tctctcagga atttgcttgt 5700tacttgtact gtgtaaataa agcttcctgg ttcaataccc
5740
47 1980 DNA Homo sapiens
47 gcaatggaca agtcttggtt
aaatgtgctt tggaggaact tcctgaaatg gggaagagga 60tcatctgaaa atgagataga
gatccacatc tgatttgtaa ttttgaacct aatagtttat 120tatttatatt tgagagtatc
ctaaatctgc tattagcagc caaaaatgaa tacaagaaag 180tacaatcgtt atttaaaaga
agcaagttat agttgacaaa gattaaaatg ttaaaagttg 240tttgaagttt aggcaactga
caataacaga acaacttatt aataacagta atgaagttaa 300aaattataga gcatttgcta
taacctaagt atgtccgttt aaacttcacc actttcttag 360attaggaagc tgaccttcag
ataagtaaaa ttatatcgga aaggtcctct taattcacag 420tgccaaatcc agattttccc
tgacttcccc aaatgccact tataagataa tttaattatt 480attcatcccc tgatgactgc
aggaaaacct ctgtgggtaa gtagagataa atgtgaagag 540cagaagcaaa gaaaagagct
agcagtagtg aatgttgaac ttcatgtgct aattggtgtg 600tgtccatttc tgatacagcc
actttgagac aagggctata tcatccatga attggatctt 660aatgtccatt gctgtatttt
tacttctcta gtttttaaga aatttaggct gtggttcaca 720ttgtgtattc gaaagataga
atacctcgct aactagacaa acaaaagctt tgttctaaaa 780atgtactttc cttaaagcag
aagtaacctg cagagaagca ggatgcctga agagagatgg 840atctctgctt actgtgtctt
tagaacagaa atagtggttt tcaacttcac aactctgcat 900tgagccctcc tttcacatct
tccctgtatc attgcagaat tgatctgaat aattctcatt 960ttatcttaga caattttttg
tgtggcttga aaaaataaat ttgcaataga ggtgaaatgg 1020aaaaaattat ccttcatttc
ctactccaaa ctgaggataa acaattattc ttggaaattc 1080caccatagaa ttgaattcat
tgtacgtgtg aattgcacct tttaagcttt taaatgatgt 1140ggcattttta tttagcagca
ttccaaaagg gaccacgaaa taaatgagct ccctggtttt 1200gcagcatttt ataattccaa
tatgaaagtt ttagcattat tactaactga agaatcagaa 1260aggaaattca tagactatca
cttctgggtt ttcaagtatt tttaatccat gcaactcttc 1320ctccaaactt tttcttcaac
ttctcatgag aaagtcagca tataaagttc ttaaaagctg 1380tgctcccctg accgaaatgg
agatgagtac catggtggga gaatgcatct ttccccctcg 1440agagtcctct agcacctgcg
gtggtctctg gaagaactca gcagaactcc caagtgccaa 1500ggaacacata ttacagaaca
acggactgca gaaattcaga tagatgaaaa ctatagatca 1560ttctaggtac tttgttccca
gacttataat actcccaata gcttctctaa tgtatgatca 1620agtggctgtc tgctgtaata
ttttcagagc tataatgttt atatctaacc tcttatattt 1680atgtccaaat cagctggtat
attttggctt attctgagca gtagctgcta gatctatctt 1740gtggtacaca ttaagcctat
tccttcttcc acagttcttc ttgacattat gctacttaaa 1800aagtcatccc ttatcaaaat
caaatttcat tattttagtt atatcacatc caatatttaa 1860ttgtgtaaac cactctttac
tctagctatt cgtcctcaga attgcttctg ttataaatgc 1920tctttttgaa cagacttcct
agagtagaag agaaagctcc agatatgatc tgatgggggt 1980
48 5602 DNA Homo sapiens
48 atggagccct ccagagcgct tctcggctgc ctagcgagcg ccgccgctgc cgccccgccg
60ggggaggatg gagcaggggc cggggccgag gaggaggagg aggaggagga ggaggcggcg
120gcggcggtgg gccccgggga gctgggctgc gacgcgccgc tgccctactg gacggccgtg
180ttcgagtacg aggcggcggg cgaggacgag ctgaccctgc ggctgggcga cgtggtggag
240gtgctgtcca aggactcgca ggtgtccggc gacgagggct ggtggaccgg gcagctgaac
300cagcgggtgg gcatcttccc cagcaactac gtgaccccgc gcagcgcctt ctccagccgc
360tgccagcccg gcggcgagga ccccagttgc tacccgccca ttcagttgtt agaaattgat
420tttgcggagc tcaccttgga agagattatt ggcatcgggg gctttgggaa ggtctatcgt
480gctttctgga taggggatga ggttgctgtg aaagcagctc gccacgaccc tgatgaggac
540atcagccaga ccatagagaa tgttcgccaa gaggccaagc tcttcgccat gctgaagcac
600cccaacatca ttgccctaag aggggtatgt ctgaaggagc ccaacctctg cttggtcatg
660gagtttgctc gtggaggacc tttgaataga gtgttatctg ggaaaaggat tcccccagac
720atcctggtga attgggctgt gcagattgcc agagggatga actacttaca tgatgaggca
780attgttccca tcatccaccg cgaccttaag tccagcaaca tattgatcct ccagaaggtg
840gagaatggag acctgagcaa caagattctg aagatcactg attttggcct ggctcgggaa
900tggcaccgaa ccaccaagat gagtgcggca gggacgtatg cttggatggc acccgaagtc
960atccgggcct ccatgttttc caaaggcagt gatgtgtgga gctatggggt gctactttgg
1020gagttgctga ctggtgaggt gccctttcga ggcattgatg gcttagcagt cgcttatgga
1080gtggccatga acaaactcgc ccttcctatt ccttctacgt gcccagaacc ttttgccaaa
1140ctcatggaag actgctggaa tcctgatccc cactcacgac catctttcac gaatatcctg
1200gaccagctaa ccaccataga ggagtctggt ttctttgaaa tgcccaagga ctccttccac
1260tgcctgcagg acaactggaa acacgagatt caggagatgt ttgaccaact cagggccaaa
1320gaaaaggaac ttcgcacctg ggaggaggag ctgacgcggg ctgcactgca gcagaagaac
1380caggaggaac tgctgcggcg tcgggagcag gagctggccg agcgggagat tgacatcctg
1440gaacgggagc tcaacatcat catccaccag ctgtgccagg agaagccccg ggtgaagaaa
1500cgcaagggca agttcaggaa gagccggctg aagctcaagg atggcaaccg catcagcctc
1560ccttctgatt tccagcacaa gttcacggtg caggcctccc ctaccatgga taaaaggaag
1620agtcttatca acagccgctc cagtcctcct gcaagcccca ccatcattcc tcgccttcga
1680gccatccagt tgacaccagg tgaaagcagc aaaacctggg gcaggagctc agtcgtccca
1740aaggaggaag gggaggagga ggagaagagg gccccaaaga agaagggacg gacgtggggg
1800ccagggacgc ttggtcagaa ggagcttgcc tcgggagatg aaggatcccc tcagagacgt
1860gagaaagcta atggtttaag taccccatca gaatctccac atttccactt gggcctcaag
1920tccctggtag atggatataa gcagtggtcg tccagtgccc ccaacctggt gaagggccca
1980aggagtagcc cggccctgcc agggttcacc agccttatgg agatggcctt gctggcagcc
2040agttgggtgg tgcccatcga cattgaagag gatgaggaca gtgaaggccc agggagtgga
2100gagagtcgcc tacagcattc acccagccag tcctacctct gtatcccatt ccctcgtgga
2160gaggatggcg atggcccctc cagtgatgga atccatgagg agcccacccc agtcaactcg
2220gccacgagta cccctcagct gacgccaacc aacagcctca agcggggcgg tgcccaccac
2280cgccgctgcg aggtggctct gctcggctgt ggggctgttc tggcagccac aggcctaggg
2340tttgacttgc tggaagctgg caagtgccag ctgcttcccc tggaggagcc tgagccacca
2400gcccgggagg agaagaaaag acgggagggt ctttttcaga ggtccagccg tcctcgtcgg
2460agcaccagcc ccccatcccg aaagcttttc aagaaggagg agcccatgct gttgctagga
2520gacccctctg cctccctgac gctgctctcc ctctcctcca tctccgagtg caactccaca
2580cgctccctgc tgcgctccga cagcgatgaa attgtcgtgt atgagatgcc agtcagccca
2640gtcgaggccc ctcccctgag tccatgtacc cacaaccccc tggtcaatgt ccgagtagag
2700cgcttcaaac gagatcctaa ccaatctctg actcccaccc atgtcaccct caccaccccc
2760tcgcagccca gcagtcaccg gcggactcct tctgatgggg cccttaagcc agagactctc
2820ctagccagca ggagcccctc cagcaatggg ttgagcccca gtcctggagc aggaatgttg
2880aaaaccccca gtcccagccg agacccaggt gaattccccc gtctccctga ccccaatgtg
2940gtcttccccc caaccccaag gcgctggaac actcagcagg actctacctt ggagagaccc
3000aagactctgg agtttctgcc tcggccgcgt ccttctgcca accggcaacg gctggaccct
3060tggtggtttg tgtcccccag ccatgcccgc agcacctccc cagccaacag ctccagcaca
3120gagacgccca gcaacctgga ctcctgcttt gctagcagta gcagcactgt agaggagcgg
3180cctggacttc cagccctgct cccgttccag gcagggccgc tgcccccgac tgagcggacg
3240ctcctggacc tggatgcaga ggggcagagt caggacagca ccgtgccgct gtgcagagcg
3300gaactgaaca cacacaggcc tgccccttat gagatccagc aggagttctg gtcttagcac
3360gaaaaggatt ggggcgggca agggggacag ccagcggaga tgaggggagc tggcgggcac
3420agccctttct cagggttgga ccccctgaga tccagcccta cttcttgcac tgataatgca
3480ctttgaagat ggaagggatg gaaacagggc cacttcagag ggtctcctgc cctgcagggc
3540ctttctaccc gtgtccactg gaggggctgt ggccatcagc tctggctgtg taggggagga
3600aggggtgcat gcatgtcccc caccctccac agtcttcctt gcctttagag tgaccctgca
3660gagtcactca gccaaatctg tctgctgctc cctctcctca gccagttggg tgtgcgcaga
3720gctgtcatag ggtccctttg tcagccccga gttcagcttc ccaaacacca gtgttggata
3780ttctgtgatt gattttggtc ctcctccgct gtcccccaac acccaggaat gggaatctgg
3840cttggttcga gataggagct tttctgtgtc ctaagccctt tcatgctagc aggaagactg
3900aaagcaaggt ggcccagtgt ggggtcatag ggcttgatag acctggcact gcctatctgc
3960acttccaggt gccccaccta tttatctgag cccacaggtg gaaaggggaa ctgcctcagt
4020gagaacgggg ggacggggat gttaggaaaa atacagtaaa gttgcaatga agaggttcat
4080gaagtatgtc cttgttcttt ttggaaactc tcggcaaagg gcaaaccagc aagtattgag
4140ggtacccatc tagctacttg gggtcaggac ctcgtcagac caggttcgga tacaatcatc
4200tgctcatccc aggaatagtt tcttggggga ctcactcact ggtgccagtt ctaagtcaga
4260gacaaaattc cactgtctgt tccttttgct gtctgaactt tatgtgttac tcccttcctt
4320tggtcttcac tctaatccct ggagtttgtg ggcttttggt tatgtttggt tagtagatat
4380caccgcaatg ccctagaaca gctatgaagc agaataccat atggccacct ggacattggg
4440acttgggaat tcactctcaa ctgggccatc catgttgtga tgcccttgaa gtaaaatgga
4500gccagcagga gtaccttctg taaatgcatg tggcaaagtg ctatttatag ggtgcccagg
4560gagccgctga tgtacaataa ccttgaggtc ccccatactg aaaactgacc aaggcctgtg
4620cacaggtagc ccctcatgct gggctctgga ccatgagctg agtaggaagg atagcagagg
4680ccaaccctga ccttcctgga agttgtttcc ttaacttgaa tgttgagctt cctctaaagc
4740tttctcgtgt atgtcttctc catgccacta ctctgaggcc tcctgtgtta tgtgtgaaca
4800gttgtcttta tgtgggaatg acgacttgat tgggagtaga gtctcaaggt cattcccctc
4860ttccctcaag actctctgaa tgctgctcca ctgtcttttg tcttggaggt cactcagcag
4920gttccttgca tttgctgcct ggatgtgcag ctggcaacag tgatgaattg gtcactgctc
4980tttctctata actgggatag atgtcctgcc ttggggtcac taaaggggtg accttgttcc
5040ttgctttatg agcccattag cactttggtt caaggggccc accaagtctt ggacgggaag
5100gcgctactgg ttttattgcc caaggttttg ttattgcttc tcttctgtgt ccttctcttt
5160gttcagtgaa gccaatatgt aagatactgt ttttgtcccc attcccctac tcctgagcta
5220ggaggaaaaa atgtgaatct taccagcagt tccagccaac caagtgattc ttcttcattc
5280ttgatgggga gaagtacata caaagtttgt tctgacaggg cgcggtggct cacgcctgta
5340atcccagcgc tttgggaggc agaggcaggt ggatcacctg aggtcgggag ttcgagacca
5400gcctgaccaa catggagata tcctgtctct actaaaaata caaaaaaatt agccaggcat
5460ggtggcacgt gcctgtaatc ccagctactc gcaaggctga ggcaggagaa tcgcttgaac
5520ctgggaggcg gaggttgcag tgagccaaga ttgcgccatt gcactccagc ctgggcaaca
5580agagagaaac tctgtctcaa aa
5602
49 824 DNA Homo sapiens
49 ccgggccccg ccgcgccgcc tccttcccag ctcgcccgcc
caggcctggc ctcctgcttt 60tccatttgat tccctgcctc tttctattcg gactggaatg
ccgggccagg ctccggggcg 120cgccgctgcg gcagccgcac ctcgcaggtc ccccggccga
ccccgacgcg gaagcggcgg 180ccctcctcgc cgtcggggag ccagggagcc ggggacgatc
agtcacataa ggcttagagg 240atcaaggatc ctgcccagat gacttaccga aatgttacag
attaagttgg tgtggtaacc 300tgggctgagc actctgggag aggaagagaa gagagaagac
aggaaacaac tgaactatga 360ccaatcccag cacggaggcc cagaaaactt taagatttga
gtattaatgt ctcaaggtca 420ggagcaacct caaggctaaa actcagatct caggactcaa
tttcacagaa gttccactat 480aaaggcaata atctaaagct ttaaatgata tgaaaatttt
gtaataagag ttcagtattt 540ctgccaacat tggcgcatgg attgcaaagt tcacaggatt
gaaaacacca tcgacataat 600ggaaattgaa cagcatctga ttactgagtg ctatatcagc
aagttaaaag gatcttttgc 660atacctttta atggtatata tcctaaaact gaagtgttca
atatagacat ccagattgaa 720actcaggcag tgaattacat acacaacaaa tcagttgaac
atggcagagc ttgtcagact 780tatgaaagat taaatacatt ttacatttcc acaagtgtgg
tatt 824
50 7130 DNA Homo sapiens
50 gaattccaca ttgtttgctg
cacgttggat tttgaaatgc tagggaactt tgggagactc 60atatttctgg gctagaggat
ctgtggacca caagatcttt ttatgatgac agtagcaatg 120tatctgtgga gctggattct
gggttgggag tgcaaggaaa agaatgtact aaatgccaag 180acatctattt caggagcatg
aggaataaaa gttctagttt ctggtctcag agtggtgcag 240ggatcaggga gtctcacaat
ctcctgagtg ctggtgtctt agggcacact gggtcttgga 300gtgcaaagga tctaggcacg
tgaggctttg tatgaagaat cggggatcgt acccaccccc 360tgtttctgtt tcatcctggg
catgtctcct ctgcctttgt cccctagatg aagtctccat 420gagctacaag ggcctggtgc
atccagggtg atctagtaat tgcagaacag caagtgctag 480ctctccctcc ccttccacag
ctctgggtgt gggagggggt tgtccagcct ccagcagcat 540ggggagggcc ttggtcagcc
tctgggtgcc agcagggcag gggcggagtc ctggggaatg 600aaggttttat agggctcctg
ggggaggctc cccagcccca agcttaccac ctgcacccgg 660agagctgtgt caccatgtgg
gtcccggttg tcttcctcac cctgtccgtg acgtggattg 720gtgagagggg ccatggttgg
ggggatgcag gagagggagc cagccctgac tgtcaagctg 780aggctctttc ccccccaacc
cagcacccca gcccagacag ggagctgggc tcttttctgt 840ctctcccagc cccacttcaa
gcccataccc ccagcccctc catattgcaa cagtcctcac 900tcccacacca ggtccccgct
ccctcccact taccccagaa ctttctcccc attgcccagc 960cagctccctg ctcccagctg
ctttactaaa ggggaagttc ctgggcatct ccgtgtttct 1020ctttgtgggg ctcaaaacct
ccaaggacct ctctcaatgc cattggttcc ttggaccgta 1080tcactggtcc atctcctgag
cccctcaatc ctatcacagt ctactgactt ttcccattca 1140gctgtgagtg tccaacccta
tcccagagac cttgatgctt ggcctcccaa tcttgcccta 1200ggatacccag atgccaacca
gacacctcct tcttcctagc caggctatct ggcctgagac 1260aacaaatggg tccctcagtc
tggcaatggg actctgagaa ctcctcattc cctgactctt 1320agccccagac tcttcattca
gtggcccaca ttttccttag gaaaaacatg agcatcccca 1380gccacaactg ccagctctct
gattccccaa atctgcatcc ttttcaaaac ctaaaaacaa 1440aaagaaaaac aaataaaaca
aaaccaactc agaccagaac tgttttctca acctgggact 1500tcctaaactt tccaaaacct
tcctcttcca gcaactgaac ctggccataa ggcacttatc 1560cctggttcct agcacccctt
atcccctcag aatccacaac ttgtaccaag tttcccttct 1620cccagtccaa gaccccaaat
caccacaaag gacccaatcc ccagactcaa gatatggtct 1680gggcgctgtc ttgtgtctcc
taccctgatc cctgggttca actctgctcc cagagcatga 1740agcctctcca ccagcaccag
ccaccaacct gcaaacctag ggaagattga cagaattccc 1800agcctttccc agctccccct
gcccatgtcc caggactccc agccttggtt ctctgccccc 1860gtgtcttttc aaacccacat
cctaaatcca tctcctatcc gagtccccca gttccccctg 1920tcaaccctga ttcccctgat
ctagcacccc ctctgcaggc gctgcgcccc tcatcctgtc 1980tcggattgtg ggaggctggg
agtgcgagaa gcattcccaa ccctggcagg tgcttgtggc 2040ctctcgtggc agggcagtct
gcggcggtgt tctggtgcac ccccagtggg tcctcacagc 2100tgcccactgc atcaggaagt
gagtaggggc ctggggtctg gggagcaggt gtctgtgtcc 2160cagaggaata acagctgggc
attttcccca ggataacctc taaggccagc cttgggactg 2220ggggagagag ggaaagttct
ggttcaggtc acatggggag gcagggttgg ggctggacca 2280ccctccccat ggctgcctgg
gtctccatct gtgtccctct atgtctcttt gtgtcgcttt 2340cattatgtct cttggtaact
ggcttcggtt gtgtctctcc gtgtgactat tttgttctct 2400ctctccctct cttctctgtc
ttcagtctcc atatctcccc ctctctctgt ccttctctgg 2460tccctctcta gccagtgtgt
ctcaccctgt atctctctgc caggctctgt ctctcggtct 2520ctgtctcacc tgtgccttct
ccctactgaa cacacgcacg ggatgggcct ggggggaccc 2580tgagaaaagg aagggctttg
gctgggcgcg gtggctcaca cctgtaatcc cagcactttg 2640ggaggccaag gcaggtagat
cacctgaggt caggagttcg agaccagcct ggccaactgg 2700tgaaacccca tctctactaa
aaatacaaaa aattagccag gcgtggtggc gcatgcctgt 2760agtcccagct actcaggagg
ctgagggagg agaattgctt gaacctggga ggttgaggtt 2820gcagtgagcc gagaccgtgc
cactgcactc cagcctgggt gacagagtga gactccgcct 2880caaaaaaaaa aaaaaaaaaa
aaaaaaaaaa agaaaagaaa agaaaagaaa aggaatcttt 2940tatccctgat gtgtgtgggt
atgagggtat gagagggccc ctctcactcc attccttctc 3000caggacatcc ctccactctt
gggagacaca gagaagggct ggttccagct ggagctggga 3060ggggcaattg agggaggagg
aaggagaagg gggaaggaaa acagggtatg ggggaaagga 3120ccctggggag cgaagtggag
gatacaacct tgggcctgca ggccaggcta cctacccact 3180tggaaaccca cgccaaagcc
gcatctacag ctgagccact ctgaggcctc ccctccccgg 3240cggtccccac tcagctccaa
agtctctctc ccttttctct cccacacttt atcatccccc 3300ggattcctct ctacttggtt
ctcattcttc ctttgacttc ctgcttccct ttctcattca 3360tctgtttctc actttctgcc
tggttttgtt cttctctctc tctttctctg gcccatgtct 3420gtttctctat gtttctgtct
tttctttctc atcctgtgta ttttcggctc accttgtttg 3480tcactgttct cccctctgcc
ctttcattct ctctgtcctt ttaccctctt cctttttccc 3540ttggtttctc tcagtttctg
tatctgccct tcaccctctc acactgctgt ttcccaactc 3600gttgtctgta tttttggcct
gaactgtgtc ttccccaacc ctgtgttttt ctcactgttt 3660ctttttctct tttggagcct
cctccttgct cctctgtccc ttctctcttt ccttatcatc 3720ctcgctcctc attcctgcgt
ctgcttcctc cccagcaaaa gcgtgatctt gctgggtcgg 3780cacagcctgt ttcatcctga
agacacaggc caggtatttc aggtcagcca cagcttccca 3840cacccgctct acgatatgag
cctcctgaag aatcgattcc tcaggccagg tgatgactcc 3900agccacgacc tcatgctgct
ccgcctgtca gagcctgccg agctcacgga tgctgtgaag 3960gtcatggacc tgcccaccca
ggagccagca ctggggacca cctgctacgc ctcaggctgg 4020ggcagcattg aaccagagga
gtgtacgcct gggccagatg gtgcagccgg gagcccagat 4080gcctgggtct gagggaggag
gggacaggac tcctgggtct gagggaggag ggccaaggaa 4140ccaggtgggg tccagcccac
aacagtgttt ttgcctggcc cgtagtcttg accccaaaga 4200aacttcagtg tgtggacctc
catgttattt ccaatgacgt gtgtgcgcaa gttcaccctc 4260agaaggtgac caagttcatg
ctgtgtgctg gacgctggac agggggcaaa agcacctgct 4320cggtgagtca tccctactcc
caagatcttg aggggaaagg tgagtgggga ccttaattct 4380gggctggggt ctagaagcca
acaaggcgtc tgcctcccct gctccccagc tgtagccatg 4440ccacctcccc gtgtctcatc
tcattccctc cttccctctt ctttgactcc ctcaaggcaa 4500taggttattc ttacagcaca
actcatctgt tcctgcgttc agcacacggt tactaggcac 4560ctgctatgca cccagcactg
ccctagagcc tgggacatag cagtgaacag acagagagca 4620gcccctccct tctgtagccc
ccaagccagt gaggggcaca ggcaggaaca gggaccacaa 4680cacagaaaag ctggagggtg
tcaggaggtg atcaggctct cggggaggga gaaggggtgg 4740ggagtgtgac tgggaggaga
catcctgcag aaggtgggag tgagcaaaca cctgccgcag 4800gggaggggag ggccctgcgg
cacctggggg agcagaggga acagcatctg gccaggcctg 4860ggaggagggg cctagagggc
gtcaggagca gagaggaggt tgcctggctg gagtgaagga 4920tcggggcagg gtgcgagagg
gaagaaagga cccctcctgc agggcctcac ctgggccaca 4980ggaggacact gcttttcctc
tgaggagtca ggaactgtgg atggtgctgg acagaagcag 5040gacagggcct ggctcaggtg
tccagaggct gccgctggcc tccctatggg atcagactgc 5100agggagggag ggcagcaggg
atgtggaggg agtgatgatg gggctgacct gggggtggct 5160ccaggcattg tccccacctg
ggcccttacc cagcctccct cacaggctcc tggccctcag 5220tctctcccct ccactccatt
ctccacctac ccacagtggg tcattctgat caccgaactg 5280accatgccag ccctgccgat
ggtcctccat ggctccctag tgccctggag aggaggtgtc 5340tagtcagaga gtagtcctgg
aaggtggcct ctgtgaggag ccacggggac agcatcctgc 5400agatggtcct ggcccttgtc
ccaccgacct gtctacaagg actgtcctcg tggaccctcc 5460cctctgcaca ggagctggac
cctgaagtcc cttccctacc ggccaggact ggagccccta 5520cccctctgtt ggaatccctg
cccaccttct tctggaagtc ggctctggag acatttctct 5580cttcttccaa agctgggaac
tgctatctgt tatctgcctg tccaggtctg aaagatagga 5640ttgcccaggc agaaactggg
actgacctat ctcactctct ccctgctttt acccttaggg 5700tgattctggg ggcccacttg
tctgtaatgg tgtgcttcaa ggtatcacgt catggggcag 5760tgaaccatgt gccctgcccg
aaaggccttc cctgtacacc aaggtggtgc attaccggaa 5820gtggatcaag gacaccatcg
tggccaaccc ctgagcaccc ctatcaactc cctattgtag 5880taaacttgga accttggaaa
tgaccaggcc aagactcaag cctccccagt tctactgacc 5940tttgtcctta ggtgtgaggt
ccagggttgc taggaaaaga aatcagcaga cacaggtgta 6000gaccagagtg tttcttaaat
ggtgtaattt tgtcctctct gtgtcctggg gaatactggc 6060catgcctgga gacatatcac
tcaatttctc tgaggacaca gataggatgg ggtgtctgtg 6120ttatttgtgg gatacagaga
tgaaagaggg gtgggatcca cactgagaga gtggagagtg 6180acatgtgctg gacactgtcc
atgaagcact gagcagaagc tggaggcaca acgcaccaga 6240cactcacagc aaggatggag
ctgaaaacat aacccactct gtcctggagg cactgggaag 6300cctagagaag gctgtgagcc
aaggagggag ggtcttcctt tggcatggga tggggatgaa 6360gtaaggagag ggactggacc
ccctggaagc tgattcacta tggggggagg tgtattgaag 6420tcctccagac aaccctcaga
tttgatgatt tcctagtaga actcacagaa ataaagagct 6480cttatactgt ggtttattct
ggtttgttac attgacagga gacacactga aatcagcaaa 6540ggaaacaggc atctaagtgg
ggatgtgaag aaaacaggga aaatctttca gttgttttct 6600cccagtgggg tgttgtggac
agcacttaaa tcacacagaa gtgatgtgtg accttgtgta 6660tgaagtattt ccaactaagg
aagctcacct gagccttagt gtccagagtt cttattgggg 6720gtctgtagga taggcatggg
gtactggaat agctgacctt aacttctcag acctgaggtt 6780cccaagagtt caagcagata
cagcatggcc tagagcctca gatgtacaaa aacaggcatt 6840catcatgaat cgcactgtta
gcatgaatca tctggcacgg cccaaggccc caggtatacc 6900aaggcacttg ggccgaatgt
tccaagggat taaatgtcat ctcccaggag ttattcaagg 6960gtgagccctg tacttggaac
gttcaggctt tgagcagtgc agggctgctg agtcaacctt 7020ttactgtaca ggggggtgag
ggaaagggag aagatgagga aaccgcctag ggatctggtt 7080ctgtcttgtg gccgagtgga
ccatggggct atcccaagaa ggaggaattc 7130
SEQ ID NO: 51 (534 nt, DNA, Homo sapiens)
cgactttccc gatcgccagg caggagtttc tctcggtgac tactatcgct gtcatgtctg 60
gtcgtggcaa gcaaggaggc aaggcccgcg ccaaggccaa gtcgcgctcg tcccgcgctg 120
gccttcagtt cccggtaggg cgagtgcatc gcttgctgcg caaaggcaac tacgcggagc 180
gagtgggggc cggcgcgccc gtctacatgg ctgcggtcct cgagtatctg accgccgaga 240
tcctggagct ggcgggcaac gcggctcggg acaacaagaa gacgcgcatc atccctcgtc 300
acctccagct ggccatccgc aacgacgagg aactgaacaa gctgctgggc aaagtcacca 360
tcgcccaggg cggcgtcttg cctaacatcc aggccgtact gctccctaag aagacggaga 420
gtcaccacaa ggcaaagggc aagtgaggct gacgtccggc ccaagtgggc ccagcccggc 480
ccgcgtctcg aaggggcacc tgtgaactca aaaggctctt ttcagagcca ccca 534
SEQ ID NO: 52 (1478 nt, DNA, Homo sapiens)
gggtcctcgg agctgctctg gctgcgcgcg gagcgggctc
cggagggaag tcccgagaca 60aagggaagcg ccgccgccgc cgccccgctc ggtcctccac
ctgtccgcta cgctcgccgg 120ggctgcggcc gcccgaggct gccctgagga tctgtgtttg
gtgaaaagga gccaaattca 180cctgcagggc aggcggctct agcagcttca gaagcctggt
gccctggcga cactggacct 240gccttggctt ctttgatccc aaccccaccc ccgatttctg
ctctgctgac tggggaagtc 300atcgtgccac ccagaacctg agtgcgggcc tctcagagct
ccttcgtccg tgggtctgcc 360ggggactggg ccttgtctcc ctaacgagtg ccagggactt
tgaacatgtc ggggatcgcc 420ctcagcagac tcgcccagga gaggaaagca tggaggaaag
accacccatt tggtttcgtg 480gctgtcccaa caaaaaatcc cgatggcacg atgaacctca
tgaactggga gtgcgccatt 540ccaggaaaga aagggactcc gtgggaagga ggcttgttta
aactacggat gcttttcaaa 600gatgattatc catcttcgcc accaaaatgt aaattcgaac
caccattatt tcacccgaat 660gtgtaccctt cggggacagt gtgcctgtcc atcttagagg
aggacaagga ctggaggcca 720gccatcacaa tcaaacagat cctattagga atacaggaac
ttctaaatga accaaatatc 780caagacccag ctcaagcaga ggcctacacg atttactgcc
aaaacagagt ggagtacgag 840aaaagggtcc gagcacaagc caagaagttt gcgccctcat
aagcagcgac cttgtggcat 900cgtcaaaagg aagggattgg tttggcaaga acttgtttac
aacatttttg caaatctaaa 960gttgctccat acaatgacta gtcacctggg ggggttgggc
gggcgccatc ttccattgcc 1020gccgcgggtg tgcggtctcg attcgctgaa ttgcccgttt
ccatacaggg tctcttcctt 1080cggtcttttg tatttttgat tgttatgtaa aactcgcttt
tattttaata ttgatgtcag 1140tatttcaact gctgtaaaat tataaacttt tatacttggg
taagtccccc aggggcgagt 1200tcctcgctct gggatgcagg catgcttctc accgtgcaga
gctgcacttg gcctcagctg 1260gctgtatgga aatgcaccct ccctcctgcc gctcctctct
agaaccttct agaacctggg 1320ctgtgctgct tttgagcctc agaccccagg tcagcatctc
ggttctgcgc cacttccttt 1380gtgtttatat ggcgttttgt ctgtgttgct gtttagagta
aataaactgt ttatataaag 1440gttttggttg cattattatc attgaaagtg agaggagg
1478
SEQ ID NO: 53 (3670 nt, DNA, Homo sapiens)
cgcagcaaac acatccgtag
aaggcagcgc ggccgccgag aaccgcagcg ccgctcgccc 60gccgcccccc accccgccgc
cccgcccggc gaattgcgcc ccgcgcccct cccctcgcgc 120ccccgagaca aagaggagag
aaagtttgcg cggccgagcg gggcaggtga ggagggtgag 180ccgcgcggga ggggcccgcc
tcggccccgg ctcagccccc gcccgcgccc ccagcccgcc 240gccgcgagca gcgcccggac
cccccagcgg cggcccccgc ccgcccagcc ccccggcccg 300ccatgggcgc cgcggcccgc
accctgcggc tggcgctcgg cctcctgctg ctggcgacgc 360tgcttcgccc ggccgacgcc
tgcagctgct ccccggtgca cccgcaacag gcgttttgca 420atgcagatgt agtgatcagg
gccaaagcgg tcagtgagaa ggaagtggac tctggaaacg 480acatttatgg caaccctatc
aagaggatcc agtatgagat caagcagata aagatgttca 540aagggcctga gaaggatata
gagtttatct acacggcccc ctcctcggca gtgtgtgggg 600tctcgctgga cgttggagga
aagaaggaat atctcattgc aggaaaggcc gagggggacg 660gcaagatgca catcaccctc
tgtgacttca tcgtgccctg ggacaccctg agcaccaccc 720agaagaagag cctgaaccac
aggtaccaga tgggctgcga gtgcaagatc acgcgctgcc 780ccatgatccc gtgctacatc
tcctccccgg acgagtgcct ctggatggac tgggtcacag 840agaagaacat caacgggcac
caggccaagt tcttcgcctg catcaagaga agtgacggct 900cctgtgcgtg gtaccgcggc
gcggcgcccc ccaagcagga gtttctcgac atcgaggacc 960cataagcagg cctccaacgc
ccctgtggcc aactgcaaaa aaagcctcca agggtttcga 1020ctggtccagc tctgacatcc
cttcctggaa acagcatgaa taaaacactc atcccatggg 1080tccaaattaa tatgattctg
ctcccccctt ctccttttag acatggttgt gggtctggag 1140ggagacgtgg gtccaaggtc
ctcatcccat cctccctctg ccaggcacta tgtgtctggg 1200gcttcgatcc ttgggtgcag
gcagggctgg gacacgcggc ttccctccca gtccctgcct 1260tggcaccgtc acagatgcca
agcaggcagc acttagggat ctcccagctg ggttagggca 1320gggcctggaa atgtgcattt
tgcagaaact tttgagggtc gttgcaagac tgtgtagcag 1380gcctaccagg tccctttcat
cttgagaggg acatggccct tgttttctgc agcttccacg 1440cctctgcact ccctgcccct
ggcaagtgct cccatcgccc cggtgcccac catgagctcc 1500cagcacctga ctccccccac
atccaagggc agcctggaac cagtggctag ttcttgaagg 1560agccccatca atcctattaa
tcctcagaat tccagtggga gcctccctct gagccttgta 1620gaaatgggag cgagaaaccc
cagctgagct gcgttccagc ctcagctgag tctttttggt 1680ctgcacccac ccccccaccc
cccccccccc gcccacatgc tccccagctt gcaggaggaa 1740tcggtgaggt cctgtcctga
ggctgctgtc cggggccggt ggctgccctc aaggtccctt 1800ccctagctgc tgcggttgcc
attgcttctt gcctgttctg gcatcaggca cctggattga 1860gttgcacagc tttgctttat
ccgggcttgt gtgcagggcc cggctgggct ccccatctgc 1920acatcctgag gacagaaaaa
gctgggtctt gctgtgccct cccaggctta gtgttccctc 1980cctcaaagac tgacagccat
cgttctgcac ggggctttct gcatgtgacg ccagctaagc 2040atagtaagaa gtccagccta
ggaagggaag gattttggag gtaggtggct ttggtgacac 2100actcacttct ttctcagcct
ccaggacact atggcctgtt ttaagagaca tcttattttt 2160ctaaaggtga attctcagat
gataggtgaa cctgagttgc agatatacca acttctgctt 2220gtatttctta aatgacaaag
attacctagc taagaaactt cctagggaac tagggaacct 2280atgtgttccc tcagtgtggt
ttcctgaagc cagtgatatg ggggttagga taggaagaac 2340tttctcggta atgataagga
gaatctcttg tttcctccca cctgtgttgt aaagataaac 2400tgacgatata caggcacatt
atgtaaacat acacacgcaa tgaaaccgaa gcttggcggc 2460ctgggcgtgg tcttgcaaaa
tgcttccaaa gccaccttag cctgttctat tcagcggcaa 2520ccccaaagca cctgttaaga
ctcctgaccc ccaagtggca tgcagccccc atgcccaccg 2580ggacctggtc agcacagatc
ttgatgactt ccctttctag ggcagactgg gagggtatcc 2640aggaatcggc ccctgcccca
cgggcgtttt catgctgtac agtgacctaa agttggtaag 2700atgtcataat ggaccagtcc
atgtgatttc agtatataca actccaccag acccctccaa 2760cccatataac accccacccc
tgttcgcttc ctgtatggtg atatcatatg taacatttac 2820tcctgtttct gctgattgtt
tttttaatgt tttggtttgt ttttgacatc agctgtaatc 2880attcctgtgc tgtgtttttt
attacccttg gtaggtatta gacttgcact tttttaaaaa 2940aaggtttctg catcgtggaa
gcatttgacc cagagtggaa cgcgtggcct atgcaggtgg 3000attccttcag gtctttcctt
tggttctttg agcatctttg ctttcattcg tctcccgtct 3060ttggttctcc agttcaaatt
attgcaaagt aaaggatctt tgagtaggtt cggtctgaaa 3120ggtgtggcct ttatatttga
tccacacacg ttggtctttt aaccgtgctg agcagaaaac 3180aaaacaggtt aagaagagcc
gggtggcagc tgacagagga agccgctcaa ataccttcac 3240aataaatagt ggcaatatat
atatagttta agaaggctct ccatttggca tcgtttaatt 3300tatatgttat gttctaagca
cagctctctt ctcctatttt catcctgcaa gcaactcaaa 3360atatttaaaa taaagtttac
attgtagtta ttttcaaatc tttgcttgat aagtattaag 3420aaatattgga cttgctgccg
taatttaaag ctctgttgat tttgtttccg tttggatttt 3480tgggggaggg gagcactgtg
tttatgctgg aatatgaagt ctgagacctt ccggtgctgg 3540gaacacacaa gagttgttga
aagttgacaa gcagactgcg catgtctctg atgctttgta 3600tcattcttga gcaatcgctc
ggtccgtgga caataaacag tattatcaaa gagaaaaaaa 3660aaaaaaaaaa
3670
SEQ ID NO: 54 (2428 nt, DNA, Homo sapiens)
cgtccagttt gagtctaggt tggagttgga accgtggaga tgcggaagga aaccccaccc
60cccctagtgc ccccggcggc ccgggagtgg aatcttcccc caaatgcgcc cgcctgcatg
120gaacggcagt tggaggctgc gcggtaccgg tccgatgggg cgcttctcct cggggcctcc
180agcctgagtg ggcgctgctg ggccggctcc ctctggcttt ttaaggaccc ctgtgccgcc
240cccaacgaag gcttctgctc cgccggagtc caaacggagg ctggagtggc tgacctcact
300tgggttgggg agagaggtat tctagtggcc tccgattcag gtgctgttga attgtgggaa
360ctagatgaga atgagacact tattgtcagc aagttctgca agtatgagca tgatgacatt
420gtgtctacag tcagtgtctt gagctctggc acacaagctg tcagtggtag caaagacatc
480tgcatcaagg tttgggacct tgctcagcag gtggtactga gttcataccg agctcatgct
540gctcaggtca cttgtgttgc tgcctctcct cacaaggact ctgtgtttct ttcatgcagc
600gaggacaata gaattttact ctgggatacc cgctgtccca agccagcatc acagattggc
660tgcagtgcgc ctggctacct tcctacctcg ctggcttggc atcctcagca aagtgaagtc
720tttgtctttg gtgatgagaa tgggacagtc tcccttgtgg acaccaagag tacaagctgt
780gtcctgagct cagctgtaca ctcccagtgt gtcactgggc tggtgttctc cccacacagt
840gttcccttcc tggcctctct cagtgaagac tgctcacttg ctgtgctgga ctcaagcctt
900tctgagttgt ttagaagcca agcccacaga gactttgtga gagatgcgac ttggtccccg
960ctcaatcact ccctgcttac cacagtgggc tgggaccatc aggtcgtcca ccacgttgtg
1020cccacagaac ctctcccagc ccctggacct gcaagtgtta ctgagtagat tggatttaag
1080acaaaaagca agtcccccat gagtgtccac ttctttgccc tgccctctca gcttgtgaga
1140caacacagga gccttctata gtatgttgat atgctagatc tgtgccgtta ataggcatcg
1200tctctcagcc tgagggaggc tggattctgg gttcctgtag tcacagggag gaaaagcttt
1260cttaaaaatg gacatgtatg tgcgtgtgag tgtgtgtgta gatttatagt ttttggtagt
1320ggcaggaata aaaaaaatcc atcctacatc ttccctaagc actgcctctc tctcaccccc
1380caaaacaagt tgacgaaagg gttttatgta gctgtctatg aggaattggc cgtgtctggg
1440tgggttatgg gatgtgggca tccctgggtt cttggaagca gctcttatgc tactcataga
1500gatgggattg actttatttt tttatagtgc ttaattcacc attatgagaa atgcttccag
1560tcacaaaaat gcagcccagc tcactctgag gaagaagcag gacttggtac ggttttacac
1620aactccttac cattaaactg aatcagaaat ccattttctg gctgaataaa aagtttggct
1680tgcctgtgta atgcccactc ccttccccct ggctccctag tgatgggaca tatatgagag
1740agaagtgttt ttctatcata gacaccatag gggaaagttt ggggatgaag gagagcttaa
1800aggtgtttca attaagttag aaaactgaca caggctgttg agaattcttt gccacttttc
1860ccaccccaaa acagcatggg gcctgacatc ttctgccctg gtcccctttc tcttgatgtg
1920gaaagtctga atgcagtatt tatagacttc taaggtttta aaatccagta tcaagaagaa
1980aatcagaaat actggttggt gaaataaaga gtttaggcat tgttggcctg tcttttttga
2040agcatgtgtg ttatgtgtag ttagatatat ttcacttatg tgagtcatca tggtgttggt
2100cttgtagccc attatttttc ctgtgcttcc ccagcttccc aaagtagcta gttagaactt
2160aaggtaaata tttattcttg ggttggtgga gtggatattg ccagttagga gtcatggatc
2220aattactgat tatattgaaa gtaaatataa tcaattatgt acttttgagc tttgcaggtt
2280caatttaggt aaaaatcaca ttatgaaact gggaaagtct gaaggaatat gggcaaaata
2340tttctcagta aagcttccat gcttcaccct tgacatgatt acccttgagt aaaacatggg
2400aatttgtaaa aaaaaaaaaa aaaaaaaa
2428
SEQ ID NO: 55 (1980 nt, DNA, Homo sapiens)
ttcgattttg gtgctgtgaa aagaatagaa aagaaaaaga
aaatgaagag gtaagctcat 60agcagattct ctttgtatgg atttaaggga aggacattat
ccacaacaga aaactgacca 120tttggatttt cttgtttgta gaaggtcttt aacatttcca
ctgcttcctc agcccgatat 180ccagggatac actgatggaa tgagaaagtt gagaataaac
ataggcctat gaaaatgtgt 240gctgtatccc ataaaaacaa catatatata catgattatg
taaacagatt tcagatgtta 300ataaactttg gggatattag taacatgggt aaggaggtac
acttccaaaa gatgtttgat 360atatcatctt tttcattact cccaatcaac tgttattagg
catcactccc aatcaactgt 420tattcatcca ttaactatta tagaagttac cagctttgtg
atcttgggtt aggcacttaa 480actctccatg ccttatttat acaatgctgg caataatagc
acttacttca ggggattttg 540tgaggattaa gtgagataat acctgttaaa taccaggcac
atcataagtg ctcattaagc 600attagttatt tttatctgct cctatttact agtggtccat
taagcattcc atgctataga 660gctagggttg gcaaattata cttggtggac caaatctgtt
ccatagctga gaactgtgag 720ctaagaatgg tttttatatc ttaaaagctt tgttaaagaa
aaaaaaagac taggtgacag 780agatgtaagc ggctcacaaa gggtgaaata tttactagtt
aaccctttgc agaaaaagtt 840tatcaaccct tgctacagag gattttaaaa aataaaatac
agcttgttct atctttagca 900tctaactggg gaaaagaaat cataacatgt gaaagaataa
ataagaaatt gtgctaacag 960taaggagtgt tatatgaaat attacctgaa gaacatgaaa
cttgaacttg ccttagagat 1020agagaatatt taaagaggct aagcagagca tttcagggaa
agggcaagaa gaagcctggg 1080ttgtgtgtga ggaaatcagc tgacagagga ggagactatt
aaggaagcat aaggaaagaa 1140agacaaaaaa ttggggtaaa aatatgtacg gctttgaaag
cttgtcagaa gagtttggac 1200ttaaaaccaa gcacccttct gaagtgcatg aagtgacaca
atgagcatct ggaaggaagg 1260agccagaaag cataggcaca gaggacagga ggaccagcta
ctgtgagatg ctgttcagaa 1320cgaacctccc attctcctgt gtcttcagtc tgcccttgcc
tgggcctccg acacctgcat 1380aaaccttcgc cataacaaat aaccttccat ccaccctgtc
ccgtcaaagg ctgacaccct 1440gctcctgcct tcactcctca gtggcctcat cttcactggc
ttgagttccc agcacttcac 1500tgagtctgcc ctctcagaaa tccccaggtc cctactgacc
aaaacacttg cctcctttca 1560gattcctcaa ctctgcagtc ctggaggcaa ctggccacac
ctgctctgtc tgaccgctct 1620tgcctccctt ggcttctcag cattttacca tcctaaccac
tgccagccag tcccgtcaca 1680gctgccccct gcttcctgct gtgttaagtg ctggagctcc
ccagaggtcc ccctccactc 1740cactcgcaca ctcagagccc tctcctctta cgtgggatga
gagcagtggt tctcaaccat 1800tgctgctcag gagaaccagt tggaactctc tggaaacaca
gcactgttgg ccccctgcct 1860tctgattcag atggtctggg gcagggactg agcagagtca
ggcacagaag cctccaggtg 1920attctaacgg gcagtccggg atgagaactg ctgagttaca
ggcctcgaag gaaactgcac 1980
SEQ ID NO: 56 (55 aa, PRT, Artificial Sequence, Synthetic)
Ala Pro Arg Thr Arg Thr Leu Arg Ala Arg Arg Ser Pro Arg Met Glu 16
Ile Ala Gln Lys Trp Met Met Lys Thr Val Lys Glu Glu Glu Trp Asn 32
Val Trp Met Lys Cys Pro Ile Leu Lys Asn Ser Leu Pro Ile Ser Lys 48
Ile Asn Phe Ile Lys Asn Asp 55
SEQ ID NO: 57 (18 aa, PRT, Artificial Sequence, Synthetic)
Gly Thr Asn Gln Arg Arg Glu Gly Lys Ser Ser Gly Ile Phe Gln His 16
Phe Val 18
SEQ ID NO: 58 (47 aa, PRT, Artificial Sequence, Synthetic)
Gly Lys Trp Cys His Ala Cys Ala Glu Leu Pro Glu Pro Ala Ser Thr 16
Thr Ser Asn Pro Leu Ser Glu Leu Pro Cys Cys Cys Met Gly Trp Gln 32
Cys Pro His Ser Ala Glu Glu Asn Leu Cys Tyr Thr Ala Gln Trp 47
SEQ ID NO: 59 (77 aa, PRT, Artificial Sequence, Synthetic)
Ile Asn Thr Leu Val Thr Tyr Asp Met Val Pro Glu Pro Lys Ile Ile 16
Asp Ala Ala Leu Arg Ala Cys Arg Arg Leu Asn Asp Phe Ala Ser Thr 32
Val Arg Ile Leu Glu Val Val Lys Asp Lys Ala Gly Pro His Lys Glu 48
Ile Tyr Pro Tyr Val Ile Gln Glu Leu Arg Pro Thr Leu Asn Glu Leu 64
Gly Ile Ser Thr Pro Glu Glu Leu Gly Leu Asp Lys Val 77
SEQ ID NO: 60 (39 aa, PRT, Artificial Sequence, Synthetic)
Glu Val His Ile Lys Lys Lys Thr Lys Gln Thr Leu Thr Asn Phe Gln 16
Met Gly Leu Leu Val Arg Gly Arg Glu Trp Pro Cys Pro Gly Cys Ala 32
Ala Cys Leu Ser Lys Leu Pro 39
SEQ ID NO: 61 (22 aa, PRT, Artificial Sequence, Synthetic)
Asp His Ser Met Val Glu Phe Pro Arg Ile Ile Val Tyr Pro Gln Phe 16
Gly Val Gly Asn Glu Gly 22
SEQ ID NO: 62 (150 aa, PRT, Artificial Sequence, Synthetic)
Ser Ser Gly Ser Gly Glu Ser Arg Leu Gln His Ser Pro Ser Gln Ser 16
Tyr Leu Cys Ile Pro Phe Pro Arg Gly Glu Asp Gly Asp Gly Pro Ser 32
Ser Asp Gly Ile His Glu Glu Pro Thr Pro Val Asn Ser Ala Thr Ser 48
Thr Pro Gln Leu Thr Pro Thr Asn Ser Leu Lys Arg Gly Gly Ala His 64
His Arg Arg Cys Glu Val Ala Leu Leu Gly Cys Gly Ala Val Leu Ala 80
Ala Thr Gly Leu Gly Phe Asp Leu Leu Glu Ala Gly Lys Cys Gln Leu 96
Leu Pro Leu Glu Glu Pro Glu Pro Pro Ala Arg Glu Glu Lys Lys Arg 112
Arg Glu Gly Leu Phe Gln Arg Ser Ser Arg Pro Arg Arg Ser Thr Ser 128
Pro Pro Ser Arg Lys Leu Phe Lys Lys Glu Glu His Gln Ala Cys Gly 144
Arg Thr Arg Val Thr Ser 150
SEQ ID NO: 63 (30 aa, PRT, Artificial Sequence, Synthetic)
Gln Lys Leu Cys Gln Ala Lys Glu Lys Gly Met Cys Met Lys Lys Leu 16
Arg Met Leu Trp Glu Cys Gln Lys Leu Tyr Ser Leu Gly Phe 30
SEQ ID NO: 64 (16 aa, PRT, Artificial Sequence, Synthetic)
Ser Glu Gly Arg Thr Val Thr Asn Lys Val Ser Arg Lys Tyr Thr Gly 16
SEQ ID NO: 65 (42 aa, PRT, Artificial Sequence, Synthetic)
Gln Arg Gly Ser Gly Gln Gln Glu Asp Ala His His Pro Ser Ser Pro 16
Pro Ala Gly His Pro Gln Arg Arg Gly Thr Glu Gln Ala Ala Gly Gln 32
Ser His His Arg Pro Gly Arg Arg Leu Ala 42
SEQ ID NO: 66 (44 aa, PRT, Artificial Sequence, Synthetic)
Ile Leu Tyr Pro Glu Thr Leu Leu Lys Leu Leu Ile Ser Leu Arg Arg 16
Phe Trp Ala Glu Met Met Glu Phe Ser Arg Tyr Thr Ile Met Ser Ser 32
Glu Asn Arg Asp Asn Leu Thr Ser Ser Phe Pro Asn 44
SEQ ID NO: 67 (25 aa, PRT, Artificial Sequence, Synthetic)
Cys Ser Lys His Ser Ser Leu Leu Leu Phe Ser Ser Cys Lys Gln Leu 16
Lys Ile Phe Lys Ile Lys Phe Thr Leu 25
SEQ ID NO: 68 (36 aa, PRT, Artificial Sequence, Synthetic)
Asn Ser Leu Pro Leu Phe Pro Pro Gln Asn Ser Met Gly Pro Asp Ile 16
Phe Cys Pro Gly Pro Leu Ser Leu Asp Val Glu Ser Leu Asn Ala Val 32
Phe Ile Asp Phe 36
SEQ ID NO: 69 (27 aa, PRT, Artificial Sequence, Synthetic)
Val Ser Gly Ser Gln Arg Val Lys Tyr Leu Leu Val Asn Pro Leu Gln 16
Lys Lys Phe Ile Asn Pro Cys Tyr Arg Gly Phe 27
SEQ ID NO: 70 (343 nt, DNA, Artificial Sequence, Synthetic)
tcgtcgaggc tcctgctcct gtgactctcg
agcagccaga ggctcctacc tctatcgagt 60ctttacctac tacttctgac actttcttct
tcttacctta caaacctact ttacaggtta 120gaactttttg tcaaatggct agagtttcta
gttgaaatat ttcttgctaa ttcagtccac 180ctacgttttg atgttcttca gtatcgacct
tttcgtggtc ttatgaacct tggcgaccgt 240tgaaatgtcc ttttatacgt ttaagcatgt
ttccatcgtc cttagatatc tctcgagacg 300aatcttagac atttcttgtt tatacttaca
ctttaagttc gaa 343
SEQ ID NO: 71 (711 nt, DNA, Artificial Sequence, Synthetic)
ggaggtcgag gcggaggcgg aggaggagga ggccgaggcg
ccggaggagg ccgaggcgcc 60ggagcaggag gaggccggcc ggaggcggca tgagacgagc
gtggcggccg cggctgctcg 120gggccgcgct ggttgnccat tgacagcggc gtctgcagct
cgcttcaaga tggccgcttg 180gctcgcattc attttctgct gaacgacttt taactttcat
tgtcttttcc gcccgcttcg 240atcgcctcgc gccggctgct ctttccggga ttttttatca
agcagaaatg catcgaacaa 300cgagaatcaa gatcactgag ctaaatcccc ncctgatgtg
tgtgctttgt ggagggtact 360tcattgatgc cacaaccata atagaatgtc tacattcctt
ctgtaaaacg tgtattgttc 420gttacctgga gaccagcaag tattgtccta tttgtgatgt
ccaagttcac aagaccagac 480cactactgaa tataaggtca gataaaactc tccaagatat
tgtatacaaa ttagttccag 540ggcttttcaa aaatgaaatg aagagaagaa gggattttta
tgcagctcat ccttctgctg 600atgctgccaa tggctctaat gaagatagag gaggacggtt
gcagatgaag ataagagaat 660tataanctga tgatgagata ataaggcttg cggccgcact
cgagaaacag t 711
SEQ ID NO: 72 (126 nt, DNA, Artificial Sequence, Synthetic)
ggagagaggg aaaatcaagt ggtattttcc agcactttgt atgattttgg atgagttgta 60
cacccaagga ttctgttctg caactccatc ctcctgtgtc actgaatatc aactctgaaa 120
gagcaa 126
SEQ ID NO: 73 (404 nt, DNA, Artificial Sequence, Synthetic)
cgggaaatgg tgccacgcat
gcgcagaact tcccgagcca gcatccacca catcaaaccc 60actgagtgag ctcccttgtt
gttgcatggg atggcaatgt ccacatagcg cagaggagaa 120tctgtgttac acagcgcaat
ggtaggtagg ttaacataag atgcctccgt gagaggctgg 180tggtcagccc tggggtcagt
aaccacaaga agccgtggct cccggaaggc tgcctggatc 240tggttagtga aggttccagg
agtgaagcgg ccagcaattg gagtggctcc agtggcagca 300gcaaacttca gcacagccct
ctggccagta ttcctggagg atataacact gacatcagca 360gggttttcaa tggcaacaat
tgcacgagct gccagcagaa gctt 404
SEQ ID NO: 74 (412 nt, DNA, Artificial Sequence, Synthetic)
gataaacaca cttgttacct atgatatggt tccagagccc
aaaatcattg atgctgcttt 60gcgggcatgc agacggttaa atgattttgc tagtacagtt
cgtatcctag aggttgttaa 120ggacaaagca ggacctcata aggaaatcta cccctatgtc
atccaggaac ttagaccaac 180tttaaatgaa ctgggaatct ccactccgga ggaactgggc
cttgacaaag tgtaaaccgc 240atggatgggc ttccccaagg atttattgac attgctactt
gagtgtgaac agttacctgg 300aaatactgat gataacatat taccttattt gaacaagttt
tcctttattg agtaccaagc 360catgtaatgg taacttggac tttaataaaa gggaaatgag
tttgaactga aa 412
SEQ ID NO: 75 (281 nt, DNA, Artificial Sequence, Synthetic)
gggaagtcca cattaaaaag aaaacaaaac aaaccctaac taacttccaa atgggtctcc 60
tggtgcgggg gcgtgagtgg ccgtgccctg ggtgtgctgc ctgtctgagc aagcttccct 120
agctgtggaa ccccgggccc cctgctgcgg gctctgcctt ggtgtcatgc ctgctgcacc 180
cccgtttcca ctgacgtgcc gtctgtggct atgggggtgg tcactggaat gacggtcact 240
ccagacgtca gccggcaggg atgcagcagg ctggccgcgc a 281
SEQ ID NO: 76 (327 nt, DNA, Artificial Sequence, Synthetic)
attctatggt ggaatttcca
agaataattg tttatcctca gtttggagta ggaaatgaag 60gataattttt tccatttcac
ctctattgca aatttatttt ttcaagccac acaaaaaatt 120gtctaagata aaatgagaat
tattcagatc aattctgcaa tgatacaggg aagatgtgaa 180aggagggctc aatgcagagt
tgtgaagttg aaaaccacta tttctgttct aaagacacag 240taagcagaga tccatctctc
ttcaggcatc ctgcttctct gcaggttact tctgctttaa 300ggaaagtaca tttttagaac
aaagctt 327
SEQ ID NO: 77 (532 nt, DNA, Artificial Sequence, Synthetic)
tcaagcggga gtggagagag tcgcctacag cattcaccca
gccagtccta cctctgtatc 60ccattccctc gtggagagga tggcgatggc ccctccagtg
atggaatcca tgaggagccc 120accccagtca actcggccac gagtacccct cagctgacgc
caaccaacag cctcaagcgg 180ggcggtgccc accaccgccg ctgcgaggtg gctctgctcg
gctgtggggc tgttctggca 240gccacaggcc tagggtttga cttgctggaa gctggcaagt
gccagctgct tcccctggag 300gagcctgagc caccagcccg ggaggagaag aaaagacggg
agggtctttt tcagaggtcc 360agccgtcctc gtcggagcac cagcccccca tcccgaaagc
ttttcaagaa ggaggagcac 420caagcttgcg gccgcactcg agtaactagt taaccccttg
gggcctctaa acgggtcttg 480agggggttan ctngttactc gngtgcggcc gcnngcttgg
tgctcnncnt tn 532
SEQ ID NO: 78 (358 nt, DNA, Artificial Sequence, Synthetic)
atcccagcac ggaggcccag aaaactttaa gatttgagta ttaatgtctc aaggtcagga 60
gcaacctcaa ggctaaaact cagatctcag gactcaattt cacagaagtt ccactataaa 120
ggcaataatc taaagcttta aatgatatga aaattttgta ataagagttc agtatttctg 180
ccaacattgg cgcatggatt gcaaagttca caggattgaa aacaccatcg acataatgga 240
aattgaacag catctgatta ctgagtgcta tatcagcaag ttaaaaggat cttttgcata 300
ccttttaatg gtatatatcc taaaactgaa gtgttcaata tagacatcca gattgaaa 358
SEQ ID NO: 79 (439 nt, DNA, Artificial Sequence, Synthetic)
tgtgtgggta tgagggtatg
agagggcccc tctcactcca ttccttctcc aggacatccc 60tccactcttg ggagacacag
agaagggctg gttccagctg gagctgggag gggcaattga 120gggaggagga aggagaaggg
ggaaggaaaa cagggtatgg gggaaaggac cctggggagc 180gaagtggagg atacaacctt
gggcctgcag gccaggctac ctacccactt ggaaacccac 240gccaaagccg catctacagc
tgagccactc tgaggcctcc cctccccggc ggtccccact 300cagctccaaa gtctctctcc
cttttctctc ccacactcta tcatcccccg gattcctctc 360tacttggttc tcattcttcc
tttgacttcc tgatcctgtg tattttcggc tcaccttgat 420ttgtcactgt tctcccctc
43980236DNAArtificial
SequenceSynthetic 80acgcggctcg gggacaacaa gaagacgcgc atcatccctc
gtcacctcca gctggccatc 60cgcaacgacg aggaactgaa caagctgctg ggcaaagtca
ccatcgccca gggcggcgtc 120ttgcctaaca tccaggccgt actgctccct aagaagacgg
agagtcacca caaggcaaag 180ggcaagtgag gctgacgtcc ggcccaagtg ggcccagccc
ggcccgcgtc tcgaag 236
SEQ ID NO: 81 (380 nt, DNA, Artificial Sequence, Synthetic)
tgtggcatcg tcaaaaggaa gggattggtt tggcaagaac ttgtttacaa catttttgca 60
aatctaaagt tgctccatac aatgactagt cacctggggg ggttgggcgg gcgccatctt 120
ccattgccgc cgcgggtgtg cggtctcgat tcgctgaatt gcccgtttcc atacagggtc 180
tcttccttcg gtcttttgta tttttgattg ttatgtaaaa ctcgctttta ttttaatatt 240
gatgtcagta tttcaactgc tgtaaaatta taaactttta tacttgggta agtcccccag 300
gggcgagttc ctcgctctgg gatgcaggca tgcttctcac cgtgcagagc tgcacttggc 360
ctcagctggc tgtatggaaa 380
SEQ ID NO: 82 (352 nt, DNA, Artificial Sequence, Synthetic)
atgttctaag cacagctctc
ttctcctatt ttcatcctgc aagcaactca aaatatttaa 60aataaagttt acattgtagt
tattttcaaa tctttgcttg ataagtatta agaaatattg 120gacttgctgc cgtaatttaa
agctctgttg attttgtttc cgtttggatt tttgggggag 180gggagcactg tgtttatgct
ggaatatgaa gtctgagacc ttcggtgctg ggaacacaca 240agagttgttg aaagttgaca
agcagactgc gcatgtctct gatgctttgt atcattcttg 300agcaatcgct cggtccgtgg
acaataaaca gtattatcaa agagaaaaaa aa 352
SEQ ID NO: 83 (506 nt, DNA, Artificial Sequence, Synthetic)
gccacttttc ccaccccaaa acagcatggg gcctgacatc
ttctgccctg gtcccctttc 60tcttgatgtg gaaagtctga atgcagtatt tatagacttc
taaggtttta aaatccagta 120tcaagaagaa aatcagaaat actggttggt gaaataaaga
gtttaggcat tgttggcctg 180tcttttttga agcatgtgtg ttatgtgtag ttagatatat
ttcacttatg tgagtcatca 240tggtgttggt cttgtagccc attatttttc ctgtgcttcc
ccagcttccc aaagtagcta 300gttagaactt aaggtaaata tttattcttg ggttggtgga
gtggatattg ccagttagga 360gtcatggatc aattactgat tatattgaaa gtaaatataa
tcaattatgt acttttgagc 420tttgcaggtt caatttaggt aaaaatcaca ttatgaaact
gggaaagtct gaaggaatat 480
gggcaaaata tttctcagta aagctt 506
SEQ ID NO: 84 (401 nt, DNA, Artificial Sequence, Synthetic)
gagatgtaag cggctcacaa agggtgaaat atttactagt taaccccctt gcagaaaaag 60
ttatcaaccc ttgctacaga ggattttaaa aaataaaata cagcttgttc tatctttagc 120
atctaactgg ggaaaagaat cataacatgt gaaagaataa ataagaaatt gtgctaacag 180
taaggagtgt tatatgaaat attacctgaa gaacatgaaa cttgaacttg ctagagatag 240
agaatattta aagaggctaa gcagagcatt tcagggaaag ggcaagaaga agcctgggtt 300
gtgtgtgagg aaatcagctg acagaggagg agactattaa ggaagcataa ggaaagaaag 360
acaaaaaatt ggggtaaaaa tatgtacggc tttgaaagct t 401
SEQ ID NO: 85 (2407 nt, DNA, Homo sapiens)
gagcgccgca cctacaccag ccaacccaga tcccgaggtc
cgacagcgcc cggcccagat 60ccccacgcct gccaggagca agccgagagc cagccggccg
gcgcactccg actccgagca 120gtctctgtcc ttcgacccga gccccgcgcc ctttccggga
cccctgcccc gcgggcagcg 180ctgccaacct gccggccatg gagaccccgt cccagcggcg
cgccacccgc agcggggcgc 240aggccagctc cactccgctg tcgcccaccc gcatcacccg
gctgcaggag aaggaggacc 300tgcaggagct caatgatcgc ttggcggtct acatcgaccg
tgtgcgctcg ctggaaacgg 360agaacgcagg gctgcgcctt cgcatcaccg agtctgaaga
ggtggtcagc cgcgaggtgt 420ccggcatcaa ggccgcctac gaggccgagc tcggggatgc
ccgcaagacc cttgactcag 480tagccaagga gcgcgcccgc ctgcagctgg agctgagcaa
agtgcgtgag gagtttaagg 540agctgaaagc gcgcaatacc aagaaggagg gtgacctgat
agctgctcag gctcggctga 600aggacctgga ggctctgctg aactccaagg aggccgcact
gagcactgct ctcagtgaga 660agcgcacgct ggagggcgag ctgcatgatc tgcggggcca
ggtggccaag cttgaggcag 720ccctaggtga ggccaagaag caacttcagg atgagatgct
gcggcgggtg gatgctgaga 780acaggctgca gaccatgaag gaggaactgg acttccagaa
gaacatctac agtgaggagc 840tgcgtgagac caagcgccgt catgagaccc gactggtgga
gattgacaat gggaagcagc 900gtgagtttga gagccggctg gcggatgcgc tgcaggaact
gcgggcccag catgaggacc 960aggtggagca gtataagaag gagctggaga agacttattc
tgccaagctg gacaatgcca 1020ggcagtctgc tgagaggaac agcaacctgg tgggggctgc
ccacgaggag ctgcagcagt 1080cgcgcatccg catcgacagc ctctctgccc agctcagcca
gctccagaag cagctggcag 1140ccaaggaggc gaagcttcga gacctggagg actcactggc
ccgtgagcgg gacaccagcc 1200ggcggctgct ggcggaaaag gagcgggaga tggccgagat
gcgggcaagg atgcagcagc 1260agctggacga gtaccaggag cttctggaca tcaagctggc
cctggacatg gagatccacg 1320cctaccgcaa gctcttggag ggcgaggagg agaggctacg
cctgtccccc agccctacct 1380cgcagcgcag ccgtggccgt gcttcctctc actcatccca
gacacagggt gggggcagcg 1440tcaccaaaaa gcgcaaactg gagtccactg agagccgcag
cagcttctca cagcacgcac 1500gcactagcgg gcgcgtggcc gtggaggagg tggatgagga
gggcaagttt gtccggctgc 1560gcaacaagtc caatgaggac cagtccatgg gcaattggca
gatcaagcgc cagaatggag 1620atgatccctt gctgacttac cggttcccac caaagttcac
cctgaaggct gggcaggtgg 1680tgacgatctg ggctgcagga gctggggcca cccacagccc
ccctaccgac ctggtgtgga 1740aggcacagaa cacctggggc tgcgggaaca gcctgcgtac
ggctctcatc aactccactg 1800gggaagaagt ggccatgcgc aagctggtgc gctcagtgac
tgtggttgag gacgacgagg 1860atgaggatgg agatgacctg ctccatcacc accacggctc
ccactgcagc agctcggggg 1920accccgctga gtacaacctg cgctcgcgca ccgtgctgtg
cgggacctgc gggcagcctg 1980ccgacaaggc atctgccagc ggctcaggag cccaggtggg
cggacccatc tcctctggct 2040cttctgcctc cagtgtcacg gtcactcgca gctaccgcag
tgtggggggc agtgggggtg 2100gcagcttcgg ggacaatctg gtcacccgct cctacctcct
gggcaactcc agcccccgaa 2160cccagagccc ccagaactgc agcatcatgt aatctgggac
ctgccaggca ggggtggggg 2220tggaggcttc ctgcgtcctc ctcacctcat gcccaccccc
tgccctgcac gtcatgggag 2280ggggcttgaa gccaaagaaa aataaccctt tggttttttt
cttctgtatt tttttttcta 2340agagaagtta ttttctacag tggttttata ctgaaggaaa
aacacaagca aaaaaaaaaa 2400aaaaaaa
2407
SEQ ID NO: 86 (567 nt, DNA, Homo sapiens)
gcgcagggtt tgaaacatgg
cggacgacgt agaccagcaa caaactacca acactgtaga 60ggagcccctg gatcttatca
ggctcagcct agatgagcga atttatgtga aaatgagaaa 120tgaccgagag cttcgaggca
gattacatgc ttatgatcaa catttaaata tgatcttggg 180agatgtggaa gaaactgtga
ctactataga aattgatgaa gaaacatatg aagagatata 240taaatcaacg aaacggaata
ttccaatgct ctttgtccgg ggagatggcg ttgtcctggt 300tgcccctcca ctgagagttg
gctgaaacaa agaatttgtc ctgtatggaa aacgggagac 360tttgtacagt ggcctctcta
aaagtacaaa acattcataa gagaaacctg catacatttt 420gatattaaga aataattccg
gggattcttc cactcctgaa atgagttgat ttgcagataa 480ctcacaactt cttaagctaa
atggtatttt catttttctc aagctctcca ataaatatga 540ccaccaagaa aaaaaaaaaa
aaaaaaa 567
SEQ ID NO: 87 (1980 nt, DNA, Homo sapiens)
tttaagggtg tacaagctct aattgttttt tttttttttt tgagatggag tttcactctg
60tagcccaggc tggagtgcag tggcgcaatc gcggctcact gcaagctccg cctcctgggt
120tcacaccatt ctcctgcctc agtctcccga gtagctggga ctacaggcgc tcgccaccac
180gcccggctaa tttttttgta tttttagtag agacggggtt tcaccatgtt agccagggtg
240gtctcgatct cctgaccttg tgatccgcct gcctcggcct cccaaagtgc tgggattaca
300ggcgtgagtg actgcgccca gcctcacagg ctctaattct tgactaattt tcctgtacac
360gtcacttgta attgaaaagc tgagtgtaag atcagccgac acacccagag ttttatttta
420ttttatttat ttatttatgg tttttttttg agatggagtc tcactctgtc gcccaggcta
480gagtgcagtg gcgccatctc ggcttactgc aagctccacc tcctgggttc acgccattct
540cctacctcag tctcctgagt agctgggact acaggcgccc accaccacgc ctggctaatt
600tttttgtatt tttagtagag acagggtttc accgtgttag gcaggatggt ctcgatctcc
660tgacctcgtg attcgcccgc ctcggcctcc caaagcgctg ggattagaag cgtgagccac
720cgcgcccgga ctattttatt tatttttttg agatggagtt tcacttttgt tgcccaggat
780tgagtgcagt gccccgatct tggctcacta caacctctgc ctcctgggtt caagcgactc
840tcctgcctca gtgtcctgag tagctgggat tacaggcgtc tgccaccacg cccggctaat
900tttgtatttt tagtagagaa caggtttcac tatgttggtc aggctggtct tgaactcctg
960acctcagcgc atccagaatt ttagacgggg cccccagggt gaggtcttgg caccctccag
1020tagagaagaa gggacatggg ccatacgtgg ggtgtccttt ctgggagcct tgcgtccctt
1080acctgcctag ccagggattg cacctcacag cacgcagcca gcaggaacgg caccgtgatc
1140tgatttcacc tgcgggccct gggccctggg ggtgtttgac aattggggca tatcacagtg
1200tgagctagtc ccgtctcggg ggtttggagg ctccacgtgg ccgtggtaca ggagcaggca
1260gttccatcct ctggcctgga tcaggctctg cacacggagg cctgtgggcc agatgactga
1320caggagggga gttgggtgga acctcggcct gcctgatatc cagcaacaga gggcaagggc
1380ggcagcacct ccagcatgac agtcccttcc aagcacgtca ggatgctccc ttgcctgtgc
1440tggcagcttc ctaaacatgg ggactgggca tggtggcagg tttttgtcct tctgaaagag
1500caattttgct gtgaggttac ttgctccttg agttcttgtc tgaggcccac ctggcggctg
1560ctccgtgagg aacgaggtgg ccctgctgca gctcagcatc ccgccacgct cccaggagtg
1620tgtgtttcct ggggggagcg gcccgggacc gtggctctgt ggtccattct gtggatgtcc
1680acaaggcctg ggcgttctgt gggtttgggt ggcagtcccg tctgggcagc tcctgctggg
1740ctgggtgtgg gtctcctgct ggtctgcccc cagctgcaca acgtgtcttg tgccttgccc
1800tcttgtacct ctgcaggttt tggctacggg cctccacctc caccgccaga tcagtttgcc
1860cctccggggg ttcctcctcc accagccact cccggggcag cacctctggc tttcccaccg
1920cctccgtctc aggctgcccc ggacatgagc aagcccccga cagctcagcc agacttcccc
1980
SEQ ID NO: 88; length: 4005; type: DNA; organism: Homo sapiens
cggcagggtt ggaaaatgat ggaagaggcg gaggtggagg
cgaccgagtg ctgagaggaa 60cctgcggaat cggccgagat ggggtctggc gcgcgctttc
cctcggggac ccttcgtgtc 120cggtggttgc tgttgcttgg cctggtgggc ccagtcctcg
gtgcggcgcg gccaggcttt 180caacagacct cacatctttc ttcttatgaa attataactc
cttggagatt aactagagaa 240agaagagaag cccctaggcc ctattcaaaa caagtatctt
atgttattca ggctgaagga 300aaagagcata ttattcactt ggaaaggaac aaagaccttt
tgcctgaaga ttttgtggtt 360tatacttaca acaaggaagg gactttaatc actgaccatc
ccaatataca gaatcattgt 420cattatcggg gctatgtgga gggagttcat aattcatcca
ttgctcttag cgactgtttt 480ggactcagag gattgctgca tttagagaat gcgagttatg
ggattgaacc cctgcagaac 540agctctcatt ttgagcacat catttatcga atggatgatg
tctacaaaga gcctctgaaa 600tgtggagttt ccaacaagga tatagagaaa gaaactgcaa
aggatgaaga ggaagagcct 660cccagcatga ctcagctact tcgaagaaga agagctgtct
tgccacagac ccggtatgtg 720gagctgttca ttgtcgtaga caaggaaagg tatgacatga
tgggaagaaa tcagactgct 780gtgagagaag agatgattct cctggcaaac tacttggata
gtatgtatat tatgttaaat 840attcgaattg tgctagttgg actggagatt tggaccaatg
gaaacctgat caacatagtt 900gggggtgctg gtgatgtgct ggggaacttc gtgcagtggc
gggaaaagtt tcttatcaca 960cgtcggagac atgacagtgc acagctagtt ctaaagaaag
gttttggtgg aactgcagga 1020atggcatttg tgggaacagt gtgttcaagg agccacgcag
gcgggattaa tgtgtttgga 1080caaatcactg tggagacatt tgcttccatt gttgctcatg
aattgggtca taatcttgga 1140atgaatcacg atgatgggag agattgttcc tgtggagcaa
agagctgcat catgaattca 1200ggagcatcgg gttccagaaa ctttagcagt tgcagtgcag
aggactttga gaagttaact 1260ttaaataaag gaggaaactg ccttcttaat attccaaagc
ctgatgaagc ctatagtgct 1320ccctcctgtg gtaataagtt ggtggacgct ggggaagagt
gtgactgtgg tactccaaag 1380gaatgtgaat tggacccttg ctgcgaagga agtacctgta
agcttaaatc atttgctgag 1440tgtgcatatg gtgactgttg taaagactgt cggttccttc
caggaggtac tttatgccga 1500ggaaaaacca gtgagtgtga tgttccagag tactgcaatg
gttcttctca gttctgtcag 1560ccagatgttt ttattcagaa tggatatcct tgccagaata
acaaagccta ttgctacaac 1620ggcatgtgcc agtattatga tgctcaatgt caagtcatct
ttggctcaaa agccaaggct 1680gcccccaaag attgtttcat tgaagtgaat tctaaaggtg
acagatttgg caattgtggt 1740ttctctggca atgaatacaa gaagtgtgcc actgggaatg
ctttgtgtgg aaagcttcag 1800tgtgagaatg tacaagagat acctgtattt ggaattgtgc
ctgctattat tcaaacgcct 1860agtcgaggca ccaaatgttg gggtgtggat ttccagctag
gatcagatgt tccagatcct 1920gggatggtta acgaaggcac aaaatgtggt gctggaaaga
tctgtagaaa cttccagtgt 1980gtagatgctt ctgttctgaa ttatgactgt gatgttcaga
aaaagtgtca tggacatggg 2040aaatgaatac tgcattgagg gacggacttc tggtcttctt
cttcctaatt gttcccctta 2100ttgtctgtgc tatttttatc ttcatcaaga gggatcaact
gtggagaagc tacttcagaa 2160agaagagatc acaaacatat gagtcagatg gcaaaaatca
agcaaaccct tctagacagc 2220cggggagtgt tcctcgacat gtttctccag tgacacctcc
cagagaagtt cctatatatg 2280caaacagatt tgcagtacca acctatgcag ccaagcaacc
tcagcagttc ccatcaaggc 2340cacctccacc acaaccgaaa gtatcatctc agggaaactt
aattcctgcc cgtcctgctc 2400ctgcacctcc tttatatagt tccctcactt gattttttta
accttctttt tgcaaatgtc 2460ttcagggaac tgagctaata cttttttttt ttcttgatgt
tttcttgaaa agcctttctg 2520ttgcaactat gaatgaaaac aaaacaccac aaaacagact
tcactaacac agaaaaacag 2580aaactgagtg tgagagttgt gaaatacaag gaaatgcagt
aaagccaggg aatttacaat 2640aacatttccg tttccatcat tgaataagtc ttattcagtc
atcggtgagg ttaatgcact 2700aatcatggat tttttgaaca tgttattgca gtgattctca
aattaactgt attggtgtaa 2760gatttttgtc attaagtgtt taagtgttat tctgaatttt
ctaccttagt tatcattaat 2820gtagttcctc attgaacatg tgataatcta atacctgtga
aaactgacta atcagctgcc 2880aataatatct aatatttttc atcatgcacg aattaataat
catcatactc tagaatcttg 2940tctgtcactc actacatgaa taagcaaata ttgtcttcaa
aagaatgcac aagaaccaca 3000attaagatgt catattattt tgaaagtaca aaatatacta
aaagagtgtg tgtgtattca 3060cgcagttact cgcttccatt tttatgacct ttcaactata
ggtaataact cttagagaaa 3120ttaatttaat attagaattt ctattatgaa tcatgtgaaa
gcatgacatt cgttcacaat 3180agcactattt taaataaatt ataagcttta aggtacgaag
tatttaatag atctaatcaa 3240atatgttgat tcatggctat aataaagcag gagcaattat
aaaatcttca atcaattgaa 3300cttttacaaa accacttgag aatttcatga gcactttaaa
atctgaactt tcaaagcttg 3360ctattaaatc atttagaatg tttacattta ctaaggtgtg
ctgggtcatg taaaatatta 3420gacactaata ttttcataga aattaggctg gagaaagaag
gaagaaatgg ttttcttaaa 3480tacctacaaa aaagttactg tggtatctat gagttatcat
cttagctgtg ttaaaaatga 3540atttttacta tggcagatat ggtatggatc gtaaaatttt
aagcactaaa aattttttca 3600taacctttca taataaagtt taataatagg tttattaact
gaatttcatt agttttttaa 3660aagtgttttt ggtttgtgta tatatacata tacaaataca
acatttacaa taaataaaat 3720acttgaaatt ctcttttgtg tctcctagta gcttcctact
caactattta taatctcatt 3780aattaaaaag ttataatttt agataaaaat tctagtcaaa
tttttacaga tattatctca 3840ctaattttca gacttttgcc aaagtgtgca caatggcttt
ttgttaataa agaacagatt 3900agttttgaag aaggcaaaaa tttcagtttt ctgaagacag
catgttattt taacaatcaa 3960gtatacatat taaaaattgt gagcaatctc aaaaaaaaaa
aaaaa 4005
SEQ ID NO: 89; length: 1278; type: DNA; organism: Homo sapiens
ccattggcct gtagattcac
ctcccctggg cagggcccca ggacccagga taatatctgt 60gcctcctgcc cagaaccctc
caagcagaca caatggtaag aatggtgcct gtcctgctgt 120ctctgctgct gcttctgggt
cctgctgtcc cccaggagaa ccaagatggt cgttactctc 180tgacctatat ctacactggg
ctgtccaagc atgttgaaga cgtccccgcg tttcaggccc 240ttggctcact caatgacctc
cagttcttta gatacaacag taaagacagg aagtctcagc 300ccatgggact ctggagacag
gtggaaggaa tggaggattg gaagcaggac agccaacttc 360agaaggccag ggaggacatc
tttatggaga ccctgaaaga catcgtggag tattacaacg 420acagtaacgg gtctcacgta
ttgcagggaa ggtttggttg tgagatcgag aataacagaa 480gcagcggagc attctggaaa
tattactatg atggaaagga ctacattgaa ttcaacaaag 540aaatcccagc ctgggtcccc
ttcgacccag cagcccagat aaccaagcag aagtgggagg 600cagaaccagt ctacgtgcag
cgggccaagg cttacctgga ggaggagtgc cctgcgactc 660tgcggaaata cctgaaatac
agcaaaaata tcctggaccg gcaagatcct ccctctgtgg 720tggtcaccag ccaccaggcc
ccaggagaaa agaagaaact gaagtgcctg gcctacgact 780tctacccagg gaaaattgat
gtgcactgga ctcgggccgg cgaggtgcag gagcctgagt 840tacggggaga tgttcttcac
aatggaaatg gcacttacca gtcctgggtg gtggtggcag 900tgcccccgca ggacacagcc
ccctactcct gccacgtgca gcacagcagc ctggcccagc 960ccctcgtggt gccctgggag
gccagctagg aagcaagggt tggaggcaat gtgggatctc 1020agacccagta gctgcccttc
ctgcctgatg tgggagctga accacagaaa tcacagtcaa 1080tggatccaca aggcctgagg
agcagtgtgg ggggacagac aggaggtgga tttggagacc 1140gaagactggg atgcctgtct
tgagtagact tggacccaaa aaatcatctc accttgagcc 1200cacccccacc ccattgtcta
atctgtagaa gctaataaat aatcatccct ccttgcctag 1260cataaaaaaa aaaaaaaa
1278
SEQ ID NO: 90; length: 1980; type: DNA; organism: Homo sapiens
tcatttcaaa atttaggagt taatttatat ttttaattga atcagatttc ataggcatag
60atattgtctg tcaatattca tatgtttata tagtggtaat ttattaaact tcttaatcca
120gatgtattat tttagttatc ttttttccac tctagtgtca tagtttaaac ttgttctttg
180atgttgagta tttattataa caatagtttt ttttgcctgc actctacaat gtatatttcc
240agatataatt tgtttatgta acttgttgac catttataat ggggaaaaaa gcttgctaaa
300agttctcaag atagctagga aaatatcaat gagatatatc taaaagaaag ggagaggggt
360ttggaagatt actgccactc tctttcctta tatatttctt aggacttctg aggtgctttt
420atgcttcttg ttttgtgtaa agtatatata tatatatata tatatacaca cacacaaagt
480atatataaac acaaagtata tatatacaca cacatataca caaagtatat atatatacac
540acaaagtata tatatatgta cacaaaatat atatatatac acaaaagtac ttacaaggca
600tgttcttacc tcaaaaagat gccaacttat ttatgagaaa tagatcctac tttatggaaa
660agcaaaatag gaacatgaca ataaaccaat atgataaagc actgtcagag ttcaaaaaca
720cctatgatac ctaaatgtac tcatgtagtt tggatcaacc agaaaggctg gtgacaagag
780gtacagctta cttggtaact taaagaataa gaagggtttg aaagtgaaga gacggtgaga
840atagctaaag aagaggaaaa cagcatagcc tacaagacag gagatgataa agtttagggg
900ctatttagca aataataaat aaattgattt agaatagaag aaatcatgtg ttggaaaaga
960ggcttgaaac aagttcggtg ttagagaaga gaatattaag aaacaagtgg gagataggac
1020ttctaaatgc tgcactaagg atttcggatt tattctcatg gtaaaggaga gccagccaag
1080gcttttctac aggagagagg tataatcaag cagcgtgaag ctgagtcagt agggggatca
1140gtgagaatag gaagacatca gggttgggga agatgaaagc ttagtttaag catgagttaa
1200ttctaccagg atgatggtaa ttgttatatt aagataggga tgaataagaa atatttcaaa
1260ggtataaagg ataagcttgt tgactgactg aacttaagga acaaagtaaa aagcagagtc
1320aaagtggcag aggctatagc cagggacaac gactacatat ccagcctttt ctatgtctcg
1380gggtgaagat gcctttctta ttcactattt ctctcttcaa ctcctccaca ccaccatgca
1440aaatcatagc ccatctatgc ttgacgtgcc tacatgtaga aacctgtgat gatctctcca
1500gcgagaaagc aggtttaatc ccttgacagt ccttgactca tagtaagttc ttattttatt
1560tttaagaccg gcatggatga cttttactta atatctgttc tttgccattt aatgctagag
1620ctgatgatat tgagtggcca tttcacaata tgtacctgtt ctgtgttagg aacacttcta
1680aaaggggctt ggaattatta atttatacaa aaacataaaa tttcatcttg aatctataaa
1740cttgctttaa tacaatgagt aaaagtgatc attttagctt tggatctgaa tttcacttga
1800aggcatgcac atgggattag gagttgggtg aataatcagg actggaaaag taaacctaga
1860aattattgac atggataaag agttgttgat accctgtgag aaggaacttt gggaaatgtg
1920gatggaggag gacagaaagg agcagagaat aaaagtatga aagctagccc tgtaggctca
1980
SEQ ID NO: 91; length: 4319; type: DNA; organism: Homo sapiens
ctctgagtca ccggaatcta ggtggggccg cccggagcgg
cgtcctcggg agccgcctcc 60ccgcggcctc ttcgcttttg tggcggcgcc cgcgctcgca
ggccactctc tgctgtcgcc 120cgtcccgcgc gctcctccga cccgctccgc tccgctccgc
tcggccccgc gccgcccgtc 180aacatgatcc gctgcggcct ggcctgcgag cgctgccgct
ggatcctgcc cctgctccta 240ctcagcgcca tcgccttcga catcatcgcg ctggccggcc
gcggctggtt gcagtctagc 300gaccacggcc agacgtcctc gctgtggtgg aaatgctccc
aagagggcgg cggcagcggg 360tcctacgagg agggctgtca gagcctcatg gagtacgcgt
ggggtagagc agcggctgcc 420atgctcttct gtggcttcat catcctggtg atctgtttca
tcctctcctt cttcgccctc 480tgtggacccc agatgcttgt cttcctgaga gtgattggag
gtctccttgc cttggctgct 540gtgttccaga tcatctccct ggtaatttac cccgtgaagt
acacccagac cttcaccctt 600catgccaacc ctgctgtcac ttacatctat aactgggcct
acggctttgg gtgggcagcc 660acgattatcc tgattggctg tgccttcttc ttctgctgcc
tccccaacta cgaagatgac 720cttctgggca atgccaagcc caggtacttc tacacatctg
cctaacttgg gaatgaatgt 780gggagaaaat cgctgctgct gagatggact ccagaagaag
aaactgtttc tccaggcgac 840tttgaaccca ttttttggca gtgttcatat tattaaacta
gtcaaaaatg ctaaaataat 900ttgggagaaa atatttttta agtagtgtta tagtttcatg
tttatctttt attatgtttt 960gtgaagttgt gtcttttcac taattaccta tactatgcca
atatttcctt atatctatcc 1020ataacattta tactacattt gtaagagaat atgcacgtga
aacttaacac tttataaggt 1080aaaaatgagg tttccaagat ttaataatct gatcaagttc
ttgttatttc caaatagaat 1140ggactcggtc tgttaagggc taaggagaag aggaagataa
ggttaaaagt tgttaatgac 1200caaacattct aaaagaaatg caaaaaaaaa gtttattttc
aagccttcga actatttaag 1260gaaagcaaaa tcatttccta aatgcatatc atttgtgaga
atttctcatt aatatcctga 1320atcattcatt ttagctaagg cttcatgttg actcgatatg
tcatctagga aagtactatt 1380tcatggtcca aacctgttgc catagttggt aaggctttcc
tttaagtgtg aaatatttag 1440atgaaatttt ctcttttaaa gttctttata gggttagggt
gtgggaaaat gctatattaa 1500taaatctgta gtgttttgtg tttatatgtt cagaaccaga
gtagactgga ttgaaagatg 1560gactgggtct aatttatcat gactgataga tctggttaag
ttgtgtagta aagcattagg 1620agggtcattc ttgtcacaaa agtgccacta aaacagcctc
aggagaataa atgacttgct 1680tttctaaatc tcaggtttat ctgggctcta tcatatagac
aggcttctga tagtttgcaa 1740ctgtaagcag aaacctacat atagttaaaa tcctggtctt
tcttggtaaa cagattttaa 1800atgtctgata taaaacatgc cacaggagaa ttcggggatt
tgagtttctc tgaatagcat 1860atatatgatg catcggatag gtcattatga ttttttacca
tttcgactta cataatgaaa 1920accaattcat tttaaatatc agattattat tttgtaagtt
gtggaaaaag ctaattgtag 1980ttttcattat gaagttttcc caataaacca ggtattctaa
acttgtttcc agtttgtagt 2040ttttccattt ttcaaatctg gggaaaggaa ttaaaaaaaa
aatgggtaat aagaacatgg 2100gatataatga aaagtggttt ttgtttgttt ttttgtttga
agttttaagg gccttgctca 2160ttttaggtgt ccaaaaccaa tttttgagtg gagattaatg
aattctaata gtctattccc 2220tgaacttttc ctcaatgaac aataccctag acacacatta
aacaatttct ctgcagtgct 2280atcaaccaga ggaaaatgga ctaagagatt tctggcaggt
tcagacaccc gggggacatg 2340tgtgcagtgt agctgaagcc tcctccttgt gctggggtcc
ccttccattc aggtggtggg 2400gtagcagtct ctctattttc cccttgccct ccttcccatt
ttatcatttg ttattttttt 2460tcccaccata agtcatatgt tacttccact atggtgtatg
tcattgtgag gatgggtgca 2520gagaggctgg gtgggagaac ggaaatatat ctccctaggg
ctactgttgg ccagctagtc 2580cttggcagtg aatttttcta tgcttttcaa aatgcgaggt
gaatgtttct catagagaaa 2640tgtaatctgg gtgattatac caaaattgaa aagaaaaacc
cacacaacta tgccgtggct 2700ggtggagaat ttgaagtggt cattaaaaat gttaaaaatc
ccatctttta aagtgatacc 2760acagctcatt caagaagata ctggatatct agagattaag
aaacgtggtc tcctgttaaa 2820catgaaaatg actccgttta taagcttctc taccacatgc
acttgtcttt gcatgatttc 2880ccatccagcc ttcttcccct cctcaatcac acaatacctt
aacggcgcac atttaggaaa 2940aatgcaacct cctgggacca acgagcctga tataatagaa
ccatgtcaac ctaaagtatt 3000tatgacaaag ataaactctt attttgcaga aatggtctgc
ttccttcagc cttgttctag 3060tatagagatc tgccattcct tgttgatcca gattcaccaa
gacagatacc tttatgtcat 3120aacagaaggg aagttccaga ggattctgga gagtaatgaa
gaattgggct gagaaaccac 3180ctgaaggcta acagtgcatt gcatgagatt tcccacagta
aagctgaggt gctttttggt 3240tcagtaatta aatattgagt tcccaccctt taaataagca
gttctaggtt cctaagcaat 3300tatttcactc tgtaagtagc cagacatgct aagtggcact
tactgctgat tgtaacaaag 3360aagtaatata tcaaggtctt tccatgttca cacaaggtag
cttgtgtgta ataacttagc 3420ttcaaaacca tagactgcag aactcacaag ttcaacagcc
tttccttttt taaggaaatg 3480aaaacaatgg aaaatatagt catcataact taattcggtt
tatttttttt ttctgtaaac 3540tccccctgaa agacattcct attaatacag taaatgtgaa
cactgacttg tttttataag 3600cacatctgaa agggcatatt tgagtctcat cccaactttg
gtccttgcta tctgtgcagg 3660cttgggcagg tcatctccct gctggtctca atatcctcac
ctgtaaaatg attgtaaatg 3720atccccctac cttcaagatt ctctgattga tagaattttt
tctttaatta aaaaatttta 3780aatattcctt gagttggaag cactgatcaa taagtggatt
gcttagggag gttggaacga 3840atagattcag tcccaacttc ctcttttaaa ttccctcttc
ctcactcttc ctgcaacact 3900tatttttaca gttgagtttt aaaaataagt aatatataaa
ataatttctg tagtgtggtt 3960tcagatttaa aaattcctgc agacaggctg ggcttgcaac
cccatcagtc gatggtcaga 4020gccctttgct ttttgagacc atttttaggt gagcttggct
tgcctggata cagtgtgcag 4080tgcattcttc ctgaattttg caattctggt atctgggtgt
attttctagg tgtgtcaggg 4140tgagtgtaat ccacctaggg tgtggaaaaa gccaagaaag
ggaaattaaa agaggttcct 4200atccagtcat gttaatgatc ttccacttgt actatcctgt
gcttcgttgt taacctcgaa 4260aacatacttt gttggctgca aaaataaaca aagggaaact
caaaaaaaaa aaaaaaaaa 4319
SEQ ID NO: 92; length: 1980; type: DNA; organism: Homo sapiens
ggcaccgtgg gagtttgcag
ctctggttgc tccaagagca caaatattaa tgtagcacag 60atattaatat tattaattag
cacagacatt aatgtagtca cagaaagaaa aagagatgaa 120aaagagacag gttcttcact
gcatgagagg ctccgtttgg gatctctcag aaatgtggaa 180gcagaggcta cagcacaagc
ctgggttatt gctagtagca agacagaaaa taaggcttgg 240gtaagctgta gttatagtta
caatggaaat gactggccca agagagtgct acagattaca 300tagcagctac taagaaaaag
gacaggcaga aggggtaggc aagacatgtt ctctggctgt 360tgcagccacc aaaaagccag
gatacaaagg cagggagtta tctgaactgc cttcctggag 420ggtcatgcat ttaggatccg
actcattgac tcttttcctt aattttgctc tgtacatttc 480tctaagaggg ctaaccagtg
tcaaggtttg ataatatctg aaatggtatt ctggtgccaa 540agtatcatct cacaaattat
ttagaaattg caaagagaaa atatatttta taatccagat 600atctggcagt taaccacatg
accaaattta gcatcactaa cagtaggaca actagatatt 660atatacctct tgctgtgata
tactatgaag tacacatcat caactatgaa gtattatttt 720ttttttcttt gagatagggt
catgctctgt cgcccaattt agagtgcagc gatgcaatca 780tagctcactg cagctttgac
ctcccagtct caagtgatcc tcccacctca gcctccctag 840tagctgggac tacagatgtg
ttccaccaca cctggctaat ttttatatat tttttgtagt 900gatggggttt caccatgttg
cacaggctgg tcttgaactc ctgggcttaa gcaatctgcc 960tgaaagttct gggattatag
gcatgagcca ctgtgtccag actatgaagt attcttgcca 1020aaactgatca acctaaatct
aatcaagctt ctgggccaga actgtccaat agcaatgtaa 1080tgtcagctac atgtaattta
aaattttcta gttgccacca aaagcacaga aaagaaaaaa 1140tagataaatt gtgctacatc
aagattaaat acttctttgc atcaaaggac ataatcaaca 1200cagagaaaag gcaaaccact
gaatgggaga aaatatttgc aaattgatat tcataatatg 1260taaagaatct ttacaactca
acacccacaa aataaaaaaa aagattaaaa aatggggaaa 1320ggacttgaat agacatttct
ccaaagaaga tgtacaactt gccaataagc acaagaaaag 1380actaattatg agggaaatgc
aaattaaaac cacaatgaga tcaaacacat tatgttggct 1440atcataaaaa gaaagtgcca
ggcgcaatga tcacagctac tcaacaggct gggtggaaga 1500atcccttgag accaggagtt
agaggctgca gtgtgttatg atcatgcctg tgaatagcca 1560ctgcactcca acataggtaa
catagcaagc cccatccata aaataaaata aaataaaata 1620aaataaaggc aacaaaaaat
aacaagtatt ggtaaggatg tggagaaatt ggaaccctcg 1680tgcattgctg gtgggtgtgt
aaaaaggtat ggctgctgtg aaaaatggga tggctattct 1740tcaaaaaatt aaccacagaa
ttactatatg atccagcaat cccacttctg catacacatc 1800caaaagaagt ggactcaagg
actcagacag atatttgtac ccccctgttc atagcagcat 1860tatttacaat agccaaaaag
tagaagcaac cacagattca tcaatgtatg aatggataaa 1920caaaatgtgg catatacaca
tagtgggata tcattcagct ttaaaaaggg aggaaattct 1980
SEQ ID NO: 93; length: 784; type: DNA; organism: Homo sapiens
gcccacgcgc cagagtcgca gtgggcgggc ctacgtgctc cgcccgctgt gagcctgtcc
60ggcccccgcc cgctccggag caacccgcga gcttacaccg gcttctctct gtcctcagcc
120cgcgcgccgc catcgccgtc atgctgggcg ccgctctccg ccgctgcgct gtggccgcaa
180ccacccgggc cgaccctcga ggcctcctgc actccgcccg gacccccggc cccgccgtgg
240ctatccagtc agttcgctgc tattcccatg ggtcacagga gacagatgag gagtttgatg
300ctcgctgggt aacatacttc aacaagccag atatagatgc ctgggaattg cgtaaaggga
360taaacacact tgttacctat gatatggttc cagagcccaa aatcattgat gctgctttgc
420gggcatgcag acggttaaat gattttgcta gtacagttcg tatcctagag gttgttaagg
480acaaagcagg acctcataag gaaatctacc cctatgtcat ccaggaactt agaccaactt
540taaatgaact gggaatctcc actccggagg aactgggcct tgacaaagtg taaaccgcat
600ggatgggctt ccccaaggat ttattgacat tgctacttga gtgtgaacag ttacctggaa
660atactgatga taacatatta ccttatttga acaagttttc ctttattgag taccaagcca
720tgtaatggta acttggactt taataaaagg gaaatgagtt tgaactgaaa aaaaaaaaaa
780aaaa
784
SEQ ID NO: 94; length: 1980; type: DNA; organism: Homo sapiens
ttcacttctg agtcccagag gttacccaag gcacccctct
gacatccggc ctgcttcttc 60tcacatgaca aaaactagcc cccatctcaa tcatatacca
aatctctccc tcactaaacg 120taagccttct cctcactctc tcaatcttat ccatcatagc
aggcagttga ggtggattaa 180accaaaccca gctacgcaaa atcttagcat actcctcaat
tacccacata ggatgaataa 240tagcagttct accgtacaac cctaacataa ccattcttaa
tttaactatt tatattatcc 300taactactac cgcattccta ctactcaact taaactccag
caccacgacc ctactactat 360ctcgcacctg aaacaagcta acatgactaa cacccttaat
tccatccacc ctcctctccc 420taggaggcct gcccccgcta accggctttt tgcccaaatg
ggccattatc gaagaattca 480caaaaaacaa tagcctcatc atccccacca tcatagccac
catcaccctc cttaacctct 540acttctacct acgcctaatc tactccacct caatcacact
actccccata tctaacaacg 600taaaaataaa atgacagttt gaacatacaa aacccacccc
attcctcccc acactcatca 660cccttaccac gctactccta cctatctccc cttttatact
aataatctta tagaaattta 720ggttaaatac agaccaagag ccttcaaagc cctcagcaag
ttgcaatact taatttctgt 780aacagctaag gactgcaaaa ccccactctg catcaactga
acgcaaatca gccactttaa 840ttaagctaag cccttactag accaatggga cttaaaccca
caaacactta gttaacagct 900aagcacccta atcaactggc ttcaatctac ttctcccgcc
gccgggaaaa aaggcgggag 960aagccccggc aggtttgaag ctgcttcttc gaatttgcaa
ttcaatatga aaatcacctc 1020ggagctggta aaaagaggcc tagcccctgt ctttagattt
acagtccaat gcttcactca 1080gccattttac ctcaccccca ctgatgttcg ccgaccgttg
actattctct acaaaccaca 1140aagacattgg aacactatac ctattattcg gcgcatgagc
tggagtccta ggcacagctc 1200taagcctcct tattcgagcc gagctgggcc agccaggcaa
ccttctaggt aacgaccaca 1260tctacaacgt tatcgtcaca gcccatgcat ttgtaataat
cttcttcata gtaataccca 1320tcataatcgg aggctttggc aactgactag ttcccctaat
aatcggtgcc cccgatatgg 1380cgtttccccg cataaacaac ataagcttct gactcttacc
tccctctctc ctactcctgc 1440tcgcatctgc tatagtggag gccggagcag gaacaggttg
aacagtctac cctcccttag 1500cagggaacta ctcccaccct ggagcctccg tagacctaac
catcttctcc ttacacctag 1560caggtgtctc ctctatctta ggggccatca atttcatcac
aacaattatc aatataaaac 1620cccctgccat aacccaatac caaacgcccc tcttcgtctg
atccgtccta atcacagcag 1680tcctacttct cctatctctc ccagtcctag ctgctggcat
cactatacta ctaacagacc 1740gcaacctcaa caccaccttc ttcgaccccg ccggaggagg
agaccccatt ctataccaac 1800acctattctg atttttcggt caccctgaag tttatattct
tatcctacca ggcttcggaa 1860taatctccca tattgtaact tactactccg gaaaaaaaga
accatttgga tacataggta 1920tggtctgagc tatgatatca attggcttcc tagggtttat
cgtgtgagca caccatatat 1980
SEQ ID NO: 95; length: 7505; type: DNA; organism: Homo sapiens
gagggcgggg cgggaaggcg
gcgaggagcc gagctgggtg cggtgaggcg cgcagatcac 60cgcggttcct gggcagggca
cggaaggcta agcaaggctg acctgctgca gctcccgcct 120cgtgcgctcg ccccacccgg
ccgccgcccg agcgctcgag aaagtcctct cgggagaagc 180agcgcctgtt cccggggcag
atccaggttc aggtcctggc tataagtcac catggcacag 240caagctgccg ataagtatct
ctatgtggat aaaaacttca tcaacaatcc gctggcccag 300gccgactggg ctgccaagaa
gctggtatgg gtgccttccg acaagagtgg ctttgagcca 360gccagcctca aggaggaggt
gggcgaagag gccatcgtgg agctggtgga gaatgggaag 420aaggtgaagg tgaacaagga
tgacatccag aagatgaacc cgcccaagtt ctccaaggtg 480gaggacatgg cagagctcac
gtgcctcaac gaagcctcgg tgctgcacaa cctcaaggag 540cgttactact cagggctcat
ctacacctat tcaggcctgt tctgtgtggt catcaatcct 600tacaagaacc tgcccatcta
ctctgaagag attgtggaaa tgtacaaggg caagaagagg 660cacgagatgc cccctcacat
ctatgccatc acagacaccg cctacaggag tatgatgcaa 720gaccgagaag atcaatccat
cttgtgcact ggtgaatctg gagctggcaa gacggagaac 780accaagaagg tcatccagta
tctggcgtac gtggcgtcct cgcacaagag caagaaggac 840cagggcgagc tggagcggca
gctgctgcag gccaacccca tcctggaggc cttcgggaac 900gccaagaccg tgaagaatga
caactcctcc cgcttcggca aattcattcg catcaacttt 960gatgtcaatg gctacattgt
tggagccaac attgagactt atcttttgga gaaatctcgt 1020gctatccgcc aagccaagga
agaacggacc ttccacatct tctattatct cctgtctggg 1080gctggagagc acctgaagac
cgatctcctg ttggagccgt acaacaaata ccgcttcctg 1140tccaatggac acgtcaccat
ccccgggcag caggacaagg acatgttcca ggagaccatg 1200gaggccatga ggattatggg
catcccagaa gaggagcaaa tgggcctgct gcgggtcatc 1260tcaggggttc ttcagctcgg
caacatcgtc ttcaagaagg agcggaacac tgaccaggcg 1320tccatgcccg acaacacagc
tgcccaaaag gtgtcccatc tcttgggtat caatgtgacc 1380gatttcacca gaggaatcct
caccccgcgc atcaaggtgg gacgggatta cgtccagaag 1440gcgcagacta aagagcaggc
tgactttgcc atcgaggcct tggccaaggc gacctatgag 1500cggatgttcc gctggctggt
gctgcgcatc aacaaggctc tggacaagac caagaggcag 1560ggcgcctcct tcatcgggat
cctggacatt gccggcttcg agatctttga tctgaactcg 1620tttgagcagc tgtgcatcaa
ttacaccaat gagaagctgc agcagctctt caaccacacc 1680atgttcatcc tggagcagga
ggagtaccag cgcgagggca tcgagtggaa cttcatcgac 1740tttggcctcg acctgcagcc
ctgcatcgac ctcattgaga agccagcagg ccccccgggc 1800attctggccc tgctggacga
ggagtgctgg ttccccaaag ccaccgacaa gagcttcgtg 1860gagaaggtga tgcaggagca
gggcacccac cccaagttcc agaagcccaa gcagctgaag 1920gacaaagctg atttctgcat
tatccactat gccggcaagg tggattacaa agctgacgag 1980tggctgatga agaacatgga
tcccctgaat gacaacatcg ccacactgct ccaccagtcc 2040tctgacaagt ttgtctcgga
gctgtggaag gatgtggacc gcatcatcgg cctggaccag 2100gtggccggca tgtcggagac
cgcactgccc ggggccttca agacgcggaa gggcatgttc 2160cgcactgtgg ggcagcttta
caaggagcag ctggccaagc tgatggctac gctgaggaac 2220acgaacccca actttgtccg
ctgcatcatc cccaaccacg agaagaaggc cggcaagctg 2280gacccgcatc tcgtgctgga
ccagctgcgc tgcaacggtg ttctcgaggg catccgtatc 2340tgccgccagg gcttccccaa
cagggtggtc ttccaggagt ttcggcagag atatgagatc 2400ctgactccaa actccattcc
caagggtttc atggacggga agcaggcgtg cgtgctcatg 2460ataaaagccc tggagctcga
cagcaatctg taccgcattg gccagagcaa agtcttcttc 2520cgtgccggtg tgctggccca
cctggaggag gagcgagacc tgaagatcac cgacgtcatc 2580atagggttcc aggcctgctg
caggggctac ctggccagga aagcatttgc caagcggcag 2640cagcagctta ccgccatgaa
ggtcctccag cggaactgcg ctgcctacct gaagctgcgg 2700aactggcagt ggtggcggct
cttcaccaag gtcaagccgc tgctgcaggt gagccggcag 2760gaggaggaga tgatggccaa
ggaggaggag ctggtgaagg tcagagagaa gcagctggct 2820gcggagaaca ggctcacgga
gatggagacg ctgcagtctc agctcatggc agagaaattg 2880cagctgcagg agcagctcca
ggcagaaacc gagctgtgtg ccgaggctga ggagctccgg 2940gcccgcctga ccgccaagaa
gcaggaatta gaagagatct gccatgacct agaggccagg 3000gtggaggagg aggaggagcg
ctgccagcac ctgcaggcgg agaagaagaa gatgcagcag 3060aacatccagg agcttgagga
gcagctggag gaggaggaga gcgcccggca gaagctgcag 3120ctggagaagg tgaccaccga
ggcgaagctg aaaaagctgg aggaggagca gatcatcctg 3180gaggaccaga actgcaagct
ggccaaggaa aagaaactgc tggaagacag aatagctgag 3240ttcaccacca acctcacaga
agaggaggag aaatctaaga gcctcgccaa gctcaagaac 3300aagcatgagg caatgatcac
tgacttggaa gagcgcctcc gcagggagga gaagcagcga 3360caggagctgg agaagacccg
ccggaagctg gagggagact ccacagacct cagcgaccag 3420atcgccgagc tccaggccca
gatcgcggag ctcaagatgc agctggccaa gaaagaggag 3480gagctccagg ccgccctggc
cagagtggaa gaggaagctg cccagaagaa catggccctc 3540aagaagatcc gggagctgga
atctcagatc tctgaactcc aggaagacct ggagtctgag 3600cgtgcttcca ggaataaagc
tgagaagcag aaacgggacc ttggggaaga gctagaggct 3660ctgaaaacag agttggagga
cacgctggat tccacagctg cccagcagga gctcaggtca 3720aaacgtgagc aggaggtgaa
catcctgaag aagaccctgg aggaggaggc caagacccac 3780gaggcccaga tccaggagat
gaggcagaag cactcacagg ccgtggagga gctggcggag 3840cagctggagc agacgaagcg
ggtgaaagca aacctcgaga aggcaaagca gactctggag 3900aacgagcggg gggagctggc
caacgaggtg aaggtgctgc tgcagggcaa aggggactcg 3960gagcacaagc gcaagaaagt
ggaggcgcag ctgcaggagc tgcaggtcaa gttcaacgag 4020ggagagcgcg tgcgcacaga
gctggccgac aaggtcacca agctgcaggt ggagctggac 4080aacgtgaccg ggcttctcag
ccagtccgac agcaagtcca gcaagctcac caaggacttc 4140tccgcgctgg agtcccagct
gcaggacact caggagctgc tgcaggagga gaaccggcag 4200aagctgagcc tgagcaccaa
gctcaagcag gtggaggacg agaagaattc cttccgggag 4260cagctggagg aggaggagga
ggccaagcac aacctggaga agcagatcgc caccctccat 4320gcccaggtgg ccgacatgaa
aaagaagatg gaggacagtg tggggtgcct ggaaactgct 4380gaggaggtga agaggaagct
ccagaaggac ctggagggcc tgagccagcg gcacgaggag 4440aaggtggccg cctacgacaa
gctggagaag accaagacgc ggctgcagca ggagctggac 4500gacctgctgg tggacctgga
ccaccagcgc cagagcgcgt gcaacctgga gaagaagcag 4560aagaagtttg accagctcct
ggcggaggag aagaccatct ctgccaagta tgcagaggag 4620cgcgaccggg ctgaggcgga
ggcccgagag aaggagacca aggctctgtc gctggcccgg 4680gccctggagg aagccatgga
gcagaaggcg gagctggagc ggctcaacaa gcagttccgc 4740acggagatgg aggaccttat
gagctccaag gatgatgtgg gcaagagtgt ccacgagctg 4800gagaagtcca agcgggccct
agagcagcag gtggaggaga tgaagacgca gctggaagag 4860ctggaggacg agctgcaggc
caccgaagat gccaagctgc ggttggaggt caacctgcag 4920gccatgaagg cccagttcga
gcgggacctg cagggccggg acgagcagag cgaggagaag 4980aagaagcagc tggtcagaca
ggtgcgggag atggaggcag agctggagga cgagaggaag 5040cagcgctcga tggcagtggc
cgcccggaag aagctggaga tggacctgaa ggacctggag 5100gcgcacatcg actcggccaa
caagaaccgg gacgaagcca tcaaacagct gcggaagctg 5160caggcccaga tgaaggactg
catgcgcgag ctggatgaca cccgcgcctc tcgtgaggag 5220atcctggccc aggccaaaga
gaacgagaag aagctgaaga gcatggaggc cgagatgatc 5280cagttgcagg aggaactggc
agccgcggag cgtgccaagc gccaggccca gcaggagcgg 5340gatgagctgg ctgacgagat
cgccaacagc agcggcaaag gagccctggc gttagaggag 5400aagcggcgtc tggaggcccg
catcgcccag ctggaggagg agctggagga ggagcagggc 5460aacacggagc tgatcaacga
ccggctgaag aaggccaacc tgcagatcga ccagatcaac 5520accgacctga acctggagcg
cagccacgcc cagaagaacg agaatgctcg gcagcagctg 5580gaacgccaga acaaggagct
taaggtcaag ctgcaggaga tggagggcac tgtcaagtcc 5640aagtacaagg cctccatcac
cgccctcgag gccaagattg cacagctgga ggagcagctg 5700gacaacgaga ccaaggagcg
ccaggcagcc tgcaaacagg tgcgtcggac cgagaagaag 5760ctgaaggatg tgctgctgca
ggtggatgac gagcggagga acgccgagca gtacaaggac 5820caggccgaca aggcatctac
ccgcctgaag cagctcaagc ggcagctgga ggaggccgaa 5880gaggaggccc agcgggccaa
cgcctcccgc cggaaactgc agcgcgagct ggaggacgcc 5940actgagacgg ccgatgccat
gaaccgcgaa gtcagctccc taaagaacaa gctcaggcgc 6000ggggacctgc cgtttgtcgt
gccccgccga atggcccgga aaggcgccgg ggatggctcc 6060gacgaagagg tagatggcaa
agcggatggg gctgaggcca aacctgccga ataagcctct 6120tctcctgcag cctgagatgg
atggacagac agacaccaca gcctcccctt cccagacccc 6180gcagcacgcc tctccccacc
ttcttgggac tgctgtgaac atgcctcctc ctgccctccg 6240ccccgtcccc ccatcccgtt
tccctccagg tgttgttgag ggcatttggc ttcctctgct 6300gcatcccctt ccagctccct
cccctgctca gaatctgata ccaaagagac agggcccggg 6360cccaggcaga gagcgaccag
caggctcctc agccctctct tgccaaaaag cacaagatgt 6420tgaggcgagc agggcaggcc
cccggggagg ggccagagtt ttctatgaat ctatttttct 6480tcagactgag gccttttggt
agtcggagcc cccgcagtcg tcagcctccc tgacgtctgc 6540caccagcgcc cccactcctc
ctcctttctt tgctgtttgc aatcacacgt ggtgacctca 6600cacacctctg ccccttgggc
ctcccactcc catggctctg ggcggtccag aaggagcagg 6660ccctgggcct ccacctctgt
gcagggcaca gaaggctggg gtggggggag gagtggattc 6720ctccccaccc tgtcccaggc
agcgccactg tccgctgtct ccctcctgat tctaaaatgt 6780ctcaagtgca atgccccctc
ccctccttta ccgaggacag cctgcctctg ccacagcaag 6840gctgtcgggg tcaagctgga
aaggccagca gccttccagt ggcttctccc aacactcttg 6900gggaccaaat atatttaatg
gttaagggac ttgtcccaag tctgacagcc agagcgttag 6960aggggccagc ggccctccca
ggcgatcttg tgtctactct aggactgggc ccgagggtgg 7020tttacctgca ccgttgactc
agtatagttt aaaaatctgc cacctgcaca ggtatttttg 7080aaagcaaaat aaggttttct
tttttcccct ttcttgtaat aaatgataaa attccgagtc 7140tttctcactg cctttgttta
gaagagagta gctcgtcctc actggtctac actggttgcc 7200gaatttactt gtattcctaa
ctgttttgta tatgctgcat tgagacttac ggcaagaagg 7260catttttttt ttttaaagga
aacaaactct caaatcatga agtgatataa aagctgcata 7320tgcctacaaa gctctgaatt
caggtcccag ttgctgtcac aaaggagtga gtgaaactcc 7380caccctaccc ccttttttat
ataataaaag tgccttagca tgtgttgcag ctgtcaccac 7440tacagtaagc tggtttacag
atgttttcca ctgagcatca caataaagag aaccatgtgc 7500tacga
7505
SEQ ID NO: 96; length: 2487; type: DNA; organism: Homo sapiens
gctattggta agactcgcgg gaaaagaaag ggtgagcgcg gctggaagcg cgcatgcgct
60gtggctaatg ccgtaggctc cttcagggct gagccatccc gcgtgtcttg cgctcggtgg
120aaatgcccag ccgagggacg cgaccagagg acagctctgt gctgatcccc accgacaatt
180cgaccccaca caaggaggat ctaagcagca agattaaaga acaaaaaatt gtggtggatg
240aactttctaa ccttaagaag aataggaaag tatataggca acaacagaac agcaatatat
300tctttcttgc agaccgaaca gaaatgctgt ctgagagcaa gaatatattg gatgaactga
360aaaaagaata ccaagaaata gaaaacttag acaagaccaa aatcaagaaa tagtcaacct
420gatttcacat aacaatgtgt ggcatttgtt gttctgtaaa cttttctgct gagcatttca
480gtcaagattt aaaagaggac ttactatata atcttaaaca gcggggaccc aatagtagta
540aacaattgtt aaagtctgat gttaactacc agtgtttatt ttctgctcac gtcctacact
600tgaggggtgt tttgactacc cagcctgtgg aagatgaaag aggcaatgtg tttctatgga
660atggagaaat ttttagtgga ataaaggttg aagctgaaga gaatgacact caaattttgt
720ttaattatct ttcctcctgt aagaatgaat ctgagatttt gtcactcttc tcagaagtac
780aaggtccctg gtcatttata tattatcaag catctagtca ttatttatgg tttggtaggg
840atttttttgg tcgccgtagc ttgctttggc attttagtaa tttgggcaag agtttctgcc
900tctcttcagt tggcacccaa acatctggat tggcaaatca gtggcaagaa gttccagcat
960ctggactttt cagaattgat cttaagtcta ctgtcatttc cagatgcatt attttacaac
1020tgtatccttg gaaatatatt tctagggaga atattattga agaaaatgtt aatagcctga
1080gtcaaatttc agcagactta ccagcatttg tatcagtggt agcaaatgaa gccaaactgt
1140atcttgaaaa acctgttgtt cctttaaata tgatgttgcc acaagctgca ttggagactc
1200attgcagtaa tatttccaat gtgccaccta caagagagat acttcaagtc tttcttactg
1260atgtacacat gaaggaagta attcagcagt tcattgatgt cctgagtgta gcagtcaaga
1320aacgtgtctt gtgtttacct agggatgaaa acctgacagc aaatgaagtt ttgaaaacgt
1380gtgataggaa agcaaatgtt gcaatcctgt tttctggggg cattgattcc atggttattg
1440caacccttgc tgaccgtcat attcctttag atgaaccaat tgatcttctt aatgtagctt
1500tcatagctga agaaaagacc atgccaacta cctttaacag agaagggaat aaacagaaaa
1560ataaatgtga aataccttca gaagaattct ctaaagatgt tgctgctgct gctgctgaca
1620gtcctaataa acatgtcagt gtaccagatc gaatcacagg aagggcggga ctaaaggaac
1680tacaagctgt tagcccttcc cgaatttgga attttgttga aattaatgtt tctatggaag
1740aactgcagaa attaagaaga actcgaatat gtcacttaat tcggccattg gatacagttt
1800tggatgatag cattggctgt gcagtctggt ttgcttctag aggaattggt tggttagtgg
1860cccaggaagg agtgaaatcc tatcagagca atgcaaaggt agttctcact ggaattggtg
1920cagatgagca acttgcaggt tattctcgtc atcgtgtccg ctttcagtcg catgggctgg
1980aaggattgaa taaggaaata atgatggaac tgggtcgaat ttcttctaga aatcttggtc
2040gtgatgacag agttattggt gatcatggaa aagaagcaag atttcctttc ctggatgaaa
2100atgttgtctc ctttctaaat tctctgccga tttgggaaaa agcaaacttg actttacccc
2160gaggaattgg tgaaaaatta cttttacgcc ttgcagctgt ggaacttggt cttacagcct
2220ctgctcttct gcccaaacgg gccatgcagt ttggatcaag aattgcaaaa atggaaaaaa
2280ttaatgaaaa ggcatctgat aaatgtggac ggctccaaat catgtcctta gaaaatcttt
2340ctattgaaaa ggagactaaa ttgtaatgtg attcacaatg taacaatata aaaataagtt
2400tttatataat tatataaaag taagatactc tgctgcttta ctattgtata atatagtagt
2460tttaaagttc aaaaaaaaaa aaaaaaa
2487
SEQ ID NO: 97; length: 2032; type: DNA; organism: Homo sapiens
ggaggactca ggccccgctg gccgcgggct cggtacccgg
tgggtcggtg gagcgtctgt 60tgggtccggg ccgccggctt cgccctcgcc atggcgccct
ggctgcagct cctgtcgctg 120ctggggctgc tcccgggcgc agtggccgcc cccgcccagc
cccgagccgc cagctttcag 180gcctgggggc cgccgtcccc ggagctgctg gcgcccaccc
gcttcgcgct ggagatgttc 240aaccgcggcc gggctgcggg gacgcgggcc gtgctgggcc
ttgtgcgcgg ccgcgtccgc 300cgggcgggtc aggggtcgct gtactccctg gaggccaccc
tggaggagcc accctgcaac 360gaccccatgg tgtgccggct ccccgtgtcc aagaaaaccc
tgctctgcag cttccaagtc 420ctggatgagc tcggaagaca cgtgctgctg cggaaggact
gtggcccagt ggacaccaag 480gttccaggtg ctggggagcc caagtcagcc ttcactcagg
gctcagccat gatttcttct 540ctgtcccaaa accatccaga caacagaaac gagactttca
gctcagtcat ttccctgttg 600aatgaggatc ccctgtccca ggacttgcct gtgaagatgg
cttcaatctt caagaacttt 660gtcattacct ataaccggac atatgagtca aaggaagaag
cccggtggcg cctgtccgtc 720tttgtcaata acatggtgcg agcacagaag atccaggccc
tggaccgtgg cacagctcag 780tatggagtca ccaagttcag tgatctcaca gaggaggagt
tccgcactat ctacctgaat 840actctcctga ggaaagagcc tggcaacaag atgaagcaag
ccaagtctgt gggtgacctc 900gccccacctg aatgggactg gaggagtaag ggggctgtca
caaaagtcaa agaccagggc 960atgtgtggct cctgctgggc cttctcagtc acaggcaatg
tggagggcca gtggtttctc 1020aaccagggga ccctgctctc cctctctgaa caggagctct
tggactgtga caagatggac 1080aaggcctgca tgggcggctt gccctccaat gcctactcgg
ccataaagaa tttgggaggg 1140ctggagacag aggatgacta cagctaccag ggtcacatgc
agtcctgcaa cttctcagca 1200gagaaggcca aggtctacat caatgactcc gtggagctga
gccagaacga gcagaagctg 1260gcagcctggc tggccaagag aggcccaatc tccgtggcca
tcaatgcctt tggcatgcag 1320ttttaccgcc acgggatctc ccgccctctc cggcccctct
gcagcccttg gctcattgac 1380catgcggtgt tgcttgtggg ctacggcaac cgctctgacg
ttcccttttg ggccatcaag 1440aacagctggg gcactgactg gggtgagaag ggttactact
acttgcatcg tgggtccggg 1500gcctgtggcg tgaacaccat ggccagctcg gcggtggtgg
actgaagagg ggcccccagc 1560tcgggacctg gtgctgatca gagtggctgc tgccccagcc
tgacatgtgt ccaggcccct 1620ccccgggagg tacagctggc agagggaaag gcactgggta
cctcagggtg agcagagggc 1680actgggctgg ggcacagccc ctgcttccct gcaccccatt
cccaccctga agttctgcac 1740ctgcaccttt gttgaattgt ggtagcttag gaggatgtcg
gggtgaaggg tggtatcttg 1800gcagttgaag ctggggcaag aactctgggc ttgggtaatg
agcaggaaga aaattttctg 1860atcttaagcc cagctctgtt ctgcccccgc tttcctctgt
ttgatactat aaattttctg 1920gttcccttgg atttagggat agtgtccctc tccatgtcca
ggaaacttgt aaccaccctt 1980ttctaacagc aataaagagg tgtccttgtc ccgaaaaaaa
aaaaaaaaaa aa 2032
98 1980 DNA Homo sapiens
98 ttgagtaata gaaaataaat
ctgggtcact tttttgtagc tgtaaatcca gccttagtaa 60tcctgacctc cattaacata
gctagtattt caaattccac tgtaacagtt gctctgactc 120tttgggggct gggaggcaat
ccaagtagcc agagaagcaa ttgtttcaca tgcttcaatc 180ctgccactcc agaaaaaata
taagggggac tagggcaaaa gaaaatctct tatttgtttt 240ccatttctca tttctcgtat
ctttattgct tctctctcat ccttaacctg tatctccctt 300cagctgatgc ctgattacct
tctaccatgt tcaacattat gatcagtcac ctactatgtg 360ccaggaagtg tgcagtgtgt
gaggatacca gaccctacct actgggagct tacagtctag 420ctcaacaggc acatcattaa
ataagcaatt gcagcaatta tattaagtgc tgggccaagg 480gaggtaccag aagtcataag
aatccctcct ctgaggggat agaagtgaag acttcagagg 540ggaagtaatg attctggatg
tgtaggactc agccaggtga agtgtaaaag taaggatgga 600ggagagtgtt ctaaaagagg
gaacaacata atcaaagttc tggacaggag agagatttga 660catatttgag gaagtgaaaa
ttttatctag aaacttgcaa tgagtaagta aacaccaggt 720caagaggaac tgagagattg
gcagacaatg gaaaaccatt gaaaaggatt aaactgggaa 780gtgatatgtt ctcttttgca
tttaaaaaga tcaccaatgg ggatatggag aatggtctgg 840ataggtctta agactagagc
caggaagaca tgttagaagg ctatcaattg accctaaaga 900cactgcttca atccctttga
tgacagtgag tttgctttcc ccagagatag cttattggac 960ctcaggactg ctgtgagaaa
cagaaaatgc tcctttacgt gttgcctgaa gttaggctca 1020ccgatttggg gcatgttcta
attctaccag ctaggaacac acagaatcgc ttgtcaaaca 1080ttctgagtca gatatgtcct
ccctatgtct tttctgagaa aggcatacag aaattcccag 1140ctaaacatca ccagttccct
catttgttcc tcagatgata tggtccattc aagttttgta 1200atcatcatgg gggtagatgg
agggtcccag tcctcacaac cattctggta atttactctt 1260gaatttactg gttcacatgt
atctattttg tagtgtggct cctgaaactg aaaaacctac 1320cccaggtatt ctgtgaacag
acagagtaga gagtctgtca ctgcccacgg agagatgatt 1380aggcttccgg gaaaaggtga
gaacactggc aaagttccgg aaggaggaac aatatccctt 1440cttcccttct tcatgagtcg
taccatccct tacttttggc tggtcacata accacccaaa 1500ataagggcta cattttccag
ccactctagc agctaggggt gacagagtga ctaagattta 1560cctggaagta tcgtgtgtga
cttctgggaa gggtccttaa agagaggggt agtcctggct 1620gggtgcggtg gctcacgtct
gtaatcccag cactttggga ggccgaggca ggcggatcac 1680aaggtcagga gttcaagacc
agcctggcca agatgctgaa accccatctc taataaaaat 1740acaaaaaaat tagccgggca
tgctggcggg cgcctgtaat cccagctact taggaggctg 1800agatggagaa ttgcttgaac
ttgggaggca gagtttgcag tgggccaaaa tggcgccact 1860gcactccagc ctgggcaaca
gagcaagcct ccgtctcaaa aaaaaaaaaa aaaaaaaaaa 1920aagagagggg tagtccttgt
tgctgttgct gcaggtattt tctccttctt cccagctgga 1980
99 1674 DNA Homo sapiens
99 gcggccgccc gccgccgcgc tcctcctcct cctcctccag cgcccggcgg cccgctgcct
60cctccgcccg acgccccgcg tcccccgccg cgccgccgcc gccaccctct gcgccccgcg
120ccgccccccg gtcccgcccg ccatgcccgg cccggccgcg ggcagcaggg cccgggtcta
180cgccgaggtg aacagtctga ggagccgcga gtactgggac tacgaggctc acgtcccgag
240ctggggtaat caagatgatt accaactggt tcgaaaactt ggtcggggaa aatatagtga
300agtatttgag gccattaata tcaccaacaa tgagagagtg gttgtaaaaa tcctgaagcc
360agtgaagaaa aagaagataa aacgagaggt taagattctg gagaaccttc gtggtggaac
420aaatatcatt aagctgattg acactgtaaa ggaccccgtg tcaaagacac cagctttggt
480atttgaatat atcaataata cagattttaa gcaactctac cagatcctga cagactttga
540tatccggttt tatatgtatg aactacttaa agctctggat tactgccaca gcaagggaat
600catgcacagg gatgtgaaac ctcacaatgt catgatagat caccaacaga aaaagctgcg
660actgatagat tggggtctgg cagaattcta tcatcctgct caggagtaca atgttcgtgt
720agcctcaagg tacttcaagg gaccagagct cctcgtggac tatcagatgt atgattatag
780cttggacatg tggagtttgg gctgtatgtt agcaagcatg atctttcgaa gggaaccatt
840cttccatgga caggacaact atgaccagct tgttcgcatt gccaaggttc tgggtacaga
900agaactgtat gggtatctga agaagtatca catagaccta gatccacact tcaacgatat
960cctgggacaa cattcacgga aacgctggga aaactttatc catagtgaga acagacacct
1020tgtcagccct gaggccctag atcttctgga caaacttctg cgatacgacc atcaacagag
1080actgactgcc aaagaggcca tggagcaccc atacttctac cctgtggtga aggagcagtc
1140ccagccttgt gcagacaatg ctgtgctttc cagtggtctc acggcagcac gatgaagact
1200ggaaagcgac gggtctgttg cggttctccc acttttccat aagcagaaca agaaccaaat
1260caaacgtctt aacgcgtata gagagatcac gttccgtgag cagacacaaa acggtggcag
1320gtttggcgag cacgaactag accaagcgaa gggcagccca ccaccgtata tcaaacctca
1380cttccgaatg taaaaggctc acttgccttt ggcttcctgt tgacttcttc ccgacccaga
1440aagcatgggg aatgtgaagg gtatgcagaa tgttgttggt tactgttgct ccccgagccc
1500ctcaactcgt cccgtggccg cctgtttttc cagcaaacca cgctaactag ctgaccacag
1560actccacagt ggggggacgg gcgcagtatg tggcatggcg gcagttacat attattattt
1620taaaagtata tattattgaa taaaaggttt taaaagaaaa aaaaaaaaaa aaaa
1674
100 1032 DNA Homo sapiens
100 cccgcacccc ctgggattgt gggaaatgta gttttttgcc
tccgtaaggg accaggcgga 60gctgaggaac cgcgcgagga ctgggaccgt gattccacta
accggaaacc gtcgcctttc 120gggcccggcg gggcctgagc caatgcagaa tcgggggccg
cgaggacgcc agcgggcgct 180gtgcgtagga accgccgggt ggccgctgcc gatcggggcc
gacttgggga cggaccggaa 240gtgcccgagg gcggccgcag aacggtcaat ttgagccgcg
tcgagctccc ctgggacctg 300tggccgccgc ccacagacca tgctcctggg gcgcctgact
tcccagctgt tgagggccgt 360tccttgggca ggcggccgcc cgccttggcc cgtctctgga
gtgctgggca gccgggtctg 420cgggcccctt tacagcacat cgccggccgg cccaggtagg
gcggcctctc tccctcgcaa 480gggggcccag ctggagctgg aggagatgct ggtccccagg
aagatgtccg tcagccccct 540ggagagctgg ctcacggccc gctgcttcct gcccagactg
gataccggga ccgcagggac 600tgtggctcca ccgcaatcct accagtgtcc gcccagccag
ataggggaag gggccgagca 660gggggatgaa ggcgtcgcgg atgcgcctca aattcagtgc
aaaaacgtgc tgaagatccg 720ccggcggaag atgaaccacc acaagtaccg gaagctggtg
aagaagacgc ggttcctgcg 780gaggaaggtc caggagggac gcctgagacg caagcagatc
aagttcgaga aagacctgag 840gcgcatctgg ctgaaggcgg ggctaaagga agcccccgaa
ggctggcaga cccccaagat 900ctacctgcgg ggcaaatgag tctggcgccg cccttcccgc
ccgttgctgc tgtgatccgt 960agtaataaat tctcagagga ctcagccttt aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa 1020aaaaaaaaaa aa
1032
101 1980 DNA Homo sapiens
101 aagaaataag cttattcaag
acctgtagga ccaattttag caagaatcct gctaaatcaa 60tttatgattt cccccccgct
ccacaccctt gaaatctgat cacccttgat atatagctcc 120tcatctccca cctttgatct
gtaagtcctt ggcctgcctt tagcaagagt cctattaggt 180cgggttagca agaatccccc
tacacttgat gtctcctctt aataattttc ccctcttagt 240gaattttcct ctcccctcac
actctgccca ttggctataa atttccagct gtctttgctg 300tattcagaat agagccctat
ctctgcctcc tactgtaata ctctaatgca atatagtctt 360caataaagct ttacttacca
tctcaaccag catcagaata atttttcctt taacatatcc 420aagcttggtc agaattaggg
tgtacctaca cctacctgca ctattaatac tccgcacagc 480agggagaaag gaactaccta
ccaggtgtca tgggcatgga aggatgtgag gaacgctagc 540actggccaaa tacagtggcc
tcacaagcca tcttcacctt caggaaaatg aatattgagc 600tgccacagac actctgctgc
cctcttaatt taccattacc atgaatctac aggatgctct 660gttccaaaca cccagtacat
tcttatacat cttgtcctga tagatgcctg tgaggtaggc 720agggctgaga attatgaatt
gttgtctatc aggctgaagt gactttccaa aaattgaagt 780tgacagcaat aaggtcaaga
atcagctctg tgctgttttg acggagtgag tcattgcctc 840cttgaatctg gcacatacca
gccaactgtc aaggtttgtt cttccacatg gtctaactgc 900taaatacaaa gtatactagg
tttgtcagct tagggcatgt ttgcttccac tctgaaaaca 960tttcagctgc cctaatatat
tgctataaag aattctctta ttattactgt cttcctcctc 1020atatttagct ctgtcttcca
tcacttcaaa agaagcattt gtagcttccc catcctcttt 1080ctttctagtt gactttgaag
actatctata taagtatttc tggcataaaa ctgacaggta 1140aatgacttca aagctaattt
ccgccccccc ccaccccttg ccctttttca gtctcaagat 1200accatgtcag tcctctattc
actctcaaaa atgatggctt aactgcacag tgccgttctg 1260ggtcaattct taaatatact
agaatatact agacatatct ggctcattta agtcattctt 1320caccaatctt tcttcttatt
tacctccttc ctcaacttgg aaattttgcc ttttcacaat 1380atgtggatag ccatttctgc
caagattgtg ccgacaagac tggttataaa tctacctact 1440ttgtaaaagg ggaatatttt
tgtaaccatt gcatatctct attaaaacat gaaagaaaca 1500ctgaaggcca agtgttcaag
tgacacgcag gaaaaaaaaa agctgatatt cagaaagcca 1560agcatacaga gaaataatga
gaggttaatg aagtgagttc tgaatcacaa gtgctgttca 1620gaaaacaaaa aaagacatct
gtgaaggctg accttggaac tagtcactgt tattcagtcc 1680atatgtatgt atgtttttat
taaaataact gttcaaagtt aactttcatc caagttaact 1740tctgaagaaa taaaaaggca
tcacgttaag gtttcaaaaa tttaaccatt ctacctttag 1800caatggttag tccaccttat
tttcacacat ttccatctta atgaaagcaa gtacattaaa 1860ggatactcag aatagctgca
aggcatacca caagatgtac cacaagatta gaaatttctt 1920taaaagtaat taagatcggc
cgagtgcagt ggctgactcc agcaatccca gcattttggg 1980
102 3939 DNA Homo sapiens
102 ggtcggcctc tgctgcgcct gcgtggtcgg gaggggaagt gaggcggttt cctcggcgcc
60ttttccggca gcggcggcgg cagaactggg aggaggagtt ggaggccgga gggagcccgc
120gctcggggcg gcggctggag gcagcgcacc gagttcccgc gaggatccat gacctgacgg
180ggccccggag ccgcgctgcc tctcgggtgt cctgggtcgg tggggagccc agtgctcgca
240ggccggcggg cgggccggag ggctgcagtc tccctcgcgg tgagaggaag gcggaggagc
300gggaaccgcg gcggcgctcg cgcggcgcct gcggggggaa gggcagttcc gggccgggcc
360gcgcctcagc agggcggcgg ctcccagcgc agtctcaggg cccgggtggc ggcggcgact
420ggagaaatca agttgtgcgg tcggtgatgc ccgagtgagc ggggggcctg ggcctctgcc
480cttaggaggc aactcccacg caggccgcaa aggcgctctc gcggccgaga ggcttcgttt
540cggtttcgcg gcggcggcgg cgttgttggc tgaggggacc cgggacacct gaatgccccc
600ggccccggct cctccgacgc gatggggaag gtgctatcca aaatcttcgg gaacaaggaa
660atgcggatcc tcatgttggg cctggacgcg gccggcaaga caacaatcct gtacaagttg
720aagctgggcc agtcggtgac caccattccc actgtgggtt tcaacgtgga gacggtgact
780tacaaaaatg tcaagttcaa cgtatgggat gtgggcggcc aggacaagat ccggccgctc
840tggcggcatt actacactgg gacccaaggt ctcatcttcg tagtggactg cgccgaccgc
900gaccgcatcg atgaggctcg ccaggagctg caccgcatta tcaatgaccg ggagatgagg
960gacgccataa tcctcatctt cgccaacaag caggacctgc ccgatgccat gaaaccccac
1020gagatccagg agaaactggg cctgacccgg attcgggaca ggaactggta tgtgcagccc
1080tcctgtgcca cctcagggga cggactctat gaggggctca catggttaac ctctaactac
1140aaatcttaat gagcattctc cacccatccc ctggaaggag agaaatcaaa aacccattca
1200taggattatc gccaccatca cctctttcaa ttgccacttt ctcttctttt gaatttgaac
1260tctggagtta ctgttctaca gtttggcggg gacggggctt gggggttttc tcttttgttt
1320gtttcccttt ctttttcctt tttttttttt tttttttttt gttggctttg cgttaggatg
1380ctctgatctg acatttgaca tgaacacaaa gttgctagat gctcttgttg acttccagca
1440gatgggatgg gggaaacaca gcagttcttg gtaaagtcct ttgtaataat agtttgattt
1500ttttatttcg agagaatctt tcattttcct atgtatgctt ttttcctttt ttgcccagtt
1560tccttatcac ttgctgtaga tggcttattt tgcattcatg cagactatgt tgcaagtctg
1620tttcatctag taaactgaaa attattgctt aatcaaactg ccgtttgtct tttatattta
1680aggccttccc cccccttcct tatgagttct aacttagtaa tttcaaatgt gaccttttat
1740atctaagacc agtatagtaa acttagccca cagtggcaaa taatgagtaa tattgtaata
1800tgttccagtt gcacctcagt atgttaaaca ggtaatgtaa gaagttctct gaaatgtcag
1860caagtaagtt ctgaaacaca tcatgcatga gtaggaataa aacccaagtt ccccataacg
1920tagataactt aatgctgcat aaaaatatga aagtgtaacc catgaaggac actttttctt
1980tccactgcaa agttagccac tttgctgttt ttcctctttt ttaaactttg aaaatagact
2040ctttccagaa attggagcaa taatggtgtt accacacaca gattaaataa tttgtagata
2100ttttaagtga cttttgggca aaactggaat gtatactttt accttgtttc aaacacctaa
2160gaccagtaat ttaaaaatta ctaaaaggtt tactttgttc attaataaaa catttaacaa
2220ttcaaattat atgcaccttt tacctagttg aaaaaaatac acattcctgt tttcacatta
2280tagcaactga ttaagctgaa gctgtaagtc attttttata gatgagtgat ccgcatctcc
2340atcaattaga acactggaaa agatgtttta taaaagaggt atttaatttt gtttgtagga
2400ttaactcatg caaataataa aaaagatatc ctgttggttc aatagtacac tgtctccttt
2460aaggaaggaa gcgtgatgaa tgaatgatgt gtagacttga gggatgacta ttaaagggga
2520cgtaggatga agagaaagaa cctacagatg acaatgaatg taaacttatt tttcttcatg
2580tgtaagcagt gtgctcgctg gtgatatcca gatcctaaca agattacttg gttagctggt
2640taggaccagt aactggattg cgaccactat gataatattt tgaaccaaat gttaatgctt
2700gatgcagaat tgtaaagcag catctggttc ctatatagcc ttaaggatta attttagtga
2760tcctcaagga attaaatagg gaatttcaga aatgtagact gcaaaggcag tatacaggaa
2820aaggtggagt gggttttgtt tatgagggtg tctgaaaact aaaattgagc gggatatcat
2880ggtatagttg gacagtattg gtccttcaca ctttggccat attgtataat ggagctttta
2940ccaaagatgt atgagaagtg taagactata aaaaaatgaa ctattcaaag taaaactctt
3000aacaaacatt ttacttaaag cagatgcaaa agggtattct catgtaggct cctgttggtg
3060cagagggatt tttttgattt caggatacaa ctaaagtacg aagttctcag tttcacttta
3120gtagaaagag ctctagaaat gaggctgata aacacatcta agaacactgg ttgctttcta
3180aaatttccaa agctccacca taaatgtaat ttttagtgtt tcaaatgatt gcattttaaa
3240gtatataaat atgggttatc caatatcaat gctatagtaa catcctgaaa caaaacaagc
3300acaaaggtat aaatgcctaa actggaggaa acttgaaacc ctcatgttaa atcttaaatg
3360tagtatttct aacttgtgaa gacagattgg taggcagcca tttttttgtg tcttaaaata
3420actgggggca tagttaaaat tttatacatc aagtgattgc tattattgaa tgttgcaggt
3480gagatgtggt tatttttagt ttatttgaaa tgtttgactg gaaagggggg agggggaagc
3540aaatatttga aatttggaaa accctaaacc ttttggtaag aaattgtaat tttcacttaa
3600aattttcttt aaggatataa gaggtttata attgatgtag ttaaattgaa caataaccat
3660tggtgactgg agcaggtaat tatagcctgc agaaaaaatt atctaagaat tttaaaaata
3720agatcctgaa gttgtttaat tgcatccatt tctgtattta tgtgaattta taaactgcag
3780taagttttga atgaggttaa tcttgtttaa tataagtaaa tgagtctgta gactgtgatc
3840tccccaaact aaaaagtaca gtacttggaa ttgtgttctt tatggttgta gtgttggtaa
3900agcactaata tgcagaaaat aaaggaatta cacagtgca
3939
103 1980 DNA Homo sapiens
103 ttgtcactag aaggaggaag gaatgctgtg tggcaaggaa
aggaattaat gtccacttga 60gatggatttg agaaggatgt ctgcaagcag aaataaaggg
ttaaagggtg cttacattaa 120aaattttgat agctactgct ctcaaaattg tatcgcgatt
tatattccca gtgccaacag 180tgtttgaaga gtctggctct ccagagcctg tagacactgg
atattgtcag tgttttaaat 240ttcagcagat ctgatatcta aaaatggtat ctcactgttc
taaactgttc tagtatggtg 300ttctacgtga acaccctttc atgtgtttag ctggcctttg
catttctttt tttttttttt 360tgaactgcct atgcatatct tttggtcaaa attcaattga
attgcccttt tttattgtta 420gagctgctac aaacattact gaaaaggcaa ctcagttgtg
tgtgtgtgta tgcacacaca 480tatatattta taatacatat gttacttagg gtttgtacca
gctctatgag ctccttgagg 540gtggcacctt gctgtaatac agcctgacac ctaatacgaa
gtagaaatca gtgattattt 600atcacgcaaa taaagaaaca aataagtgaa cgaatgaatg
agtcaatagt gttgactgcc 660ttgtattgtc ctaggcccaa gagacagtga aatatccctg
ttcttgtata cttttctgta 720agtttctgga agtttctctg taaagcatct cagtaagctt
ttctataggc tgtgagaaac 780gcatgagtca ggctaatagg aggcatataa ttttgaattg
cttttcagaa atggccttca 840tattccttta cactcactca tcctgttgat aagagcagat
ggcctactgc atgtgactca 900gactcaaaca cacacctccg ctcccttgaa gtgccagccc
tggagctttg ttgaggctcg 960catctgccac gggagtcagc tagtacgttg cccagttcaa
catccatcca ggatttcata 1020ggaacttgag aatcattgtt tttggcttga atcctgggtt
tgaggtttct tcgtgtagga 1080atctgaaaaa aggatttgga aacgttgttg tctctaatcc
caaagtatgt atctgggagg 1140ctgccttcgc catcacccac ctaataactc aggctcccgg
ggccatttcg ctcaagtgca 1200ttcattcctt tggtagaatc aaaagaaact gatccaggtg
acagagtacc tgggttctaa 1260tcccagtttt gatgagcaag ttatttaccc cttacagccc
cattttccct attctaaaat 1320gatatggttg caactgacga tctccaagtc tccgtccaac
tcaacaattc agagtggaat 1380tctgaattct gctctgccac caacagcatg tcctcggagc
tttgcctatt actcatgaga 1440atgtcaacgt ctgggtaaat agatattttg gggtcagctc
taaaaaaccc agaagtacgt 1500attgtatgtt gattttggca cacggacaag cctgaacagg
gctgtgtcaa gccttttacc 1560atgatagctg ccggaagaaa ggccaggcga agcagtctgg
gtgagctgct tggaatgaag 1620aggaccagcc cacatcccat ggcacagatg accttcagga
gaagtggagg ggagcagcta 1680atgtaaagaa atcattagca tctgtgttgg aaatggctta
tgacactgtc tcaaagccac 1740gttctcagac aacagggaaa gctgtaaata gatgcacaca
gttatccaag catagcagag 1800taaaactaaa ggaaagccaa attaaacagg ctcaaccaaa
gttttgagtg aaagtgttga 1860atattgctca tgccttcaga acgggaagct ctgtttagaa
tactcacaat ggtgggtcct 1920cttgaggtga ctacaggctg gtaggtcggt tctatcctcc
ccctaggagc catctcagca 1980
104 1980 DNA Homo sapiens
104 gttcaacttt
aaatttgccc acagaaccct ctaaatcccc ttgtaaattt aactgttagt 60ccaaagagga
acagctcttt ggacactagg aaaaaacctt gtagagagag taaaaaattt 120aacacccata
gtaggcctaa aagcagccac caattaagaa agcgttcaag ctcaacaccc 180actacctaaa
aaatcccaaa catataactg aactcctcac acccaattgg accaatctat 240caccctatag
aagaactaat gttagtataa gtaacatgaa aacattctcc tccgcataag 300cctgcgtcag
attaaaacac tgaactgaca attaacagcc caatatctac aatcaaccaa 360caagtcatta
ttaccctcac tgtcaaccca acacaggcat gctcataagg aaaggttaaa 420aaaagtaaaa
ggaactcggc aaatcttacc ccgcctgttt accaaaaaca tcacctctag 480catcaccagt
attagaggca ccgcctgccc agtgacacat gtttaacggc cgcggtaccc 540taaccgtgca
aaggtagcat aatcacttgt tccttaaata gggacctgta tgaatggctc 600cacgagggtt
cagctgtctc ttacttttaa ccagtgaaat tgacctgccc gtgaagaggc 660gggcatgaca
cagcaagacg agaagaccct atggagcttt aatttattaa tgcaaacagt 720acctaacaaa
cccacaggtc ctaaactacc aaacctgcat taaaaatttc ggttggggcg 780acctcggagc
agaacccaac ctccgagcag tacatgctaa gacttcacca gtcaaagcga 840actactatac
tcaattgatc caataacttg accaacggaa caagttaccc tagggataac 900agcgcaatcc
tattctagag tccatatcaa caatagggtt tacgacctcg atgttggatc 960aggacatccc
gatggtgcag ccgctattaa aggttcgttt gttcaacgat taaagtccta 1020cgtgatctga
gttcagaccg gagtaatcca ggtcggtttc tatctacttc aaattcctcc 1080ctgtacgaaa
ggacaagaga aataaggcct acttcacaaa gcgccttccc ccgtaaatga 1140tatcatctca
acttagtatt atacccacac ccacccaaga acagggtttg ttaagatggc 1200agagcccggt
aatcgcataa aacttaaaac tttacagtca gaggttcaat tcctcttctt 1260aacaacatac
ccatggccaa cctcctactc ctcattgtac ccattctaat cgcaatggca 1320ttcctaatgc
ttaccgaacg aaaaattcta ggctatatac aactacgcaa aggccccaac 1380gttgtaggcc
cctacgggct actacaaccc ttcgctgacg ccataaaact cttcaccaaa 1440gagcccctaa
aacccgccac atctaccatc accctctaca tcaccgcccc gaccttagct 1500ctcaccatcg
ctcttctact atgaaccccc ctccccatac ccaaccccct ggtcaacctc 1560aacctaggcc
tcctatttat tctagccacc tctagcctag ccgtttactc aatcctctga 1620tcagggtgag
catcaaactc aaactacgcc ctgatcggcg cactgcgagc agtagcccaa 1680acaatctcat
atgaagtcac cctagccatc attctactat caacattact aataagtggc 1740tcctttaacc
tctccaccct tatcacaaca caagaacacc tctgattact cctgccatca 1800tgacccttgg
ccataatatg atttatctcc acactagcag agaccaaccg aacccccttc 1860gaccttgccg
aaggggagtc cgaactagtc tcaggcttca acatcgaata cgccgcaggc 1920cccttcgccc
tattcttcat agccgaatac acaaacatta ttataataaa caccctcacc
1980
105 1980 DNA Homo sapiens
105 ctggagaatc ccttgaaccc aggaggagga ggttgcagtg
agcgatcctg ccacggcact 60ccagcagggg tgacaagaat gaaactctat ttcaaaataa
agaaaaaaaa gaaaaaaaaa 120gaaaacccaa cctcaactag cttaagcaaa agcaaattta
tgtggtgaaa gggtggatct 180ggttttagaa tcagttacca agggctcaag aagtgtcaca
gggactgatc cccccacccc 240cccgtccccc atgtcgtgtc attctccagg tctttccttt
tcattgccag gagctccagg 300tttctaccct cagaactcca aggccattgg aaaacagagg
gcagttttct tagttgccca 360aggaaatgtt cccaaattgt atcaaaagcc cacctctagg
ttaattattg tggctggagg 420atgtaatcca ttcataggtc agggctggcc aggtgtagtg
gctcatgcct gtaatcccag 480cactttggga gactgagatg ggtgggtcac ttgaggtcag
aagttcgaga ccagcctggc 540caacaggatg aaaccccgtc tctactaaaa atacaaaaat
tagccaggca tggtggcggg 600cgcctgtaat cccagctgct cgggaggctg aggcaggaga
atggattgaa cccaggaggt 660ggaggttgca gtgagcagag atcacgccac tgcactcaag
cccaggcaac gaagcgagac 720tccttctcaa aaaaaaaaaa agagagaaac ataggctagg
actaggcata tgccatgcct 780tgtgacataa actggacatg gggaagggga gtgattcccc
agtgttagtt agccttgctc 840ttgtcactag aaggaggaag gaatgctgtg tggcaaggaa
aggaattaat gtccacttga 900gatggatttg agaaggatgt ctgcaagcag aaataaaggg
ttaaagggtg cttacattaa 960aaattttgat agctactgct ctcaaaattg tatcgcgatt
tatattccca gtgccaacag 1020tgtttgaaga gtctggctct ccagagcctg tagacactgg
atattgtcag tgttttaaat 1080ttcagcagat ctgatatcta aaaatggtat ctcactgttc
taaactgttc tagtatggtg 1140ttctacgtga acaccctttc atgtgtttag ctggcctttg
catttctttt tttttttttt 1200tgaactgcct atgcatatct tttggtcaaa attcaattga
attgcccttt tttattgtta 1260gagctgctac aaacattact gaaaaggcaa ctcagttgtg
tgtgtgtgta tgcacacaca 1320tatatattta taatacatat gttacttagg gtttgtacca
gctctatgag ctccttgagg 1380gtggcacctt gctgtaatac agcctgacac ctaatacgaa
gtagaaatca gtgattattt 1440atcacgcaaa taaagaaaca aataagtgaa cgaatgaatg
agtcaatagt gttgactgcc 1500ttgtattgtc ctaggcccaa gagacagtga aatatccctg
ttcttgtata cttttctgta 1560agtttctgga agtttctctg taaagcatct cagtaagctt
ttctataggc tgtgagaaac 1620gcatgagtca ggctaatagg aggcatataa ttttgaattg
cttttcagaa atggccttca 1680tattccttta cactcactca tcctgttgat aagagcagat
ggcctactgc atgtgactca 1740gactcaaaca cacacctccg ctcccttgaa gtgccagccc
tggagctttg ttgaggctcg 1800catctgccac gggagtcagc tagtacgttg cccagttcaa
catccatcca ggatttcata 1860ggaacttgag aatcattgtt tttggcttga atcctgggtt
tgaggtttct tcgtgtagga 1920atctgaaaaa aggatttgga aacgttgttg tctctaatcc
caaagtatgt atctgggagg 1980
106 1980 DNA Homo sapiens
106 attcatctgt
gttattggga aatgatgtga acttaatttc tctttccctt ctaaaacttt 60gcttactgaa
tggaaatgtt cctgagatct gtttatttgg ttctatattt atgtacctcc 120cttttaaaat
agagaataca tgttaatgtt tctttgatga ctcagtgtgt attatcggta 180acagtccatt
catgatgttg ccataccaca cagcataatt ttctatctgc ttctgattga 240ttcttcattc
tcccttgatc tcagtttgtc atttaataca tctaagtttt tcactcaaca 300aatcaaatac
tgatggagaa tctgctatac accaggcact gtgctgctag gagctgagga 360ttgaacgggg
agaaacagga agctccctgc tctcatagtg cttcttagtt ggggagaaaa 420gacattcatg
atataatcac ataaatacct atttttatat gtaaaaaatg ttgtcaaaga 480aaagaacggg
gtgatgggaa cagttggaag aggtgtaaaa actccagaga agctgtggct 540cctagaaaga
aggtaggttt taggactaga atggtgatag tgggccggaa gagagagagt 600gcattcgaaa
gacactgagg agattgcatc agtaggactt ggtgacacat tagatgcaga 660ggaagaggga
cagaaatgct tcaaggagga cttttaggca tctgtcttgg gtaactagat 720gatgccaatg
gctgagatgg ggaattcttg ggtagatgag gtttggtggg atggtgattg 780ttataacttt
gactttgaac gtgctgagtt caggtgacat tgtgataccc caaaggaggt 840gcagagtagg
tagctggaga cacaggcccg aagatgatga gaggtctggc ctagaaacat 900ggatgcagga
gtcatggatc catcaaggca ctgtgagttt ggatgagatc atctagcaga 960acacttaagt
ggagaagcaa agtggtctag agactaagcc atgaggaact ccaacactta 1020gaggcgtaga
aagcaggtag aaagggaaca cctgaagact taggaaggag gggccagaaa 1080gggatgatgg
cacccgaaga cagtggtgtt caggaagcca agggaggaag gtatttagac 1140aggaggggga
gagcagaatt ggcaaagctg tggagaaagt gagatgagaa ctcctattaa 1200aaacacacaa
ctggtccaat gacatgggat ggcatggaaa tcactgatga ccaagcagga 1260gacaggggtg
gaccgcaggg gaaaaagagc aagctgaagc cagctaagga atgtcctcgg 1320gccatctcct
agcggaggcg gtagagccgt ggttgaaggt acaggaagtg gactgttagg 1380gcccaggttc
cccttaacca tgagacctga agcaagttac tttatttctc tagggctcaa 1440ttttctcacc
tgtaaaacaa gagtaacagt gctcacctac taggttgctg tgaggttctt 1500tttctttttc
tttttttttt ggagacagag tctcactctg tcacccaggc tggagtgcag 1560tggtgcaatc
ttggctcact gcaatctcca cctcccgggt tccagctatt ctcctgcctt 1620agcctcctga
gtggctggga ctacaggcgc ctgccaccac aactggctaa tttttgtatt 1680tttagtagag
atggggtttc accacgttgg ccagcctgga ctcaaactcc tgacctcagg 1740tgatctgcct
gcttcagcct cccaaagtgc tgggattaca ggcgtgagcc accacacctg 1800gccttctgtg
aggttcttaa catgtaagcc acttagccca gtggctgact catagtagtt 1860gctgaataaa
tgctaatttt atattaacac cctcataacc cattaaatca atatttattg 1920agcatccatc
tgccaggcac tgtactagat gctgacaaag acacccccaa caacaaataa
1980
107 1980 DNA Homo sapiens
107 tgagcctagg agtttgagat caccccaggc agtgtggcaa
aaccgcatct ctacatgaaa 60aatacaaaaa taagtcaggc atggcagcat gtgcctgtgg
tcctggctac tagggaggct 120gaggtgagag gatcaattga gcccaggagg tcaaggccac
agtgagctga gattgcacca 180ctgcactctg gcctggggga cagagtgaga ccctgtctca
aaaaaaaaaa aaaaaaatag 240tattgtatca atgttaattt cctggttttg ataatagtgc
caaaggtata taaactgtta 300aggcaagagc aagtggctga aggctataca ggaactctct
gcactatttt tgcaacttct 360ctgttatcct aaaattattt caaaataaaa agttaaaaaa
aaagtgttta ggccgggcgc 420ggtggctcac gcctataatc ccagcacttt gggaggccga
ggcgggcgga tcacgaggtc 480aggagatcaa gaccatcctg gctaacacag tgaaacccca
tctctactaa agatacaaaa 540aattagccgg gcgaggtagc gggcgcctgt agtcccagct
acgtgggagg ctgaggcagg 600agaatggcat gaaccccagg gggtggagcc tgcagtgagc
cgagatcgtg ccactgcact 660ccagcctggg tgaaagagcg agactccttc tcaaaaaaaa
aaaaaaaaaa aaaaagtgtt 720taatcttttt tccaaaagga gcacacagaa cagagagtac
agtacaagtc ccttaagaat 780ttgttttttc tcagactatt ttctcacttg tcatcaagaa
tcagccttta gattattggc 840agcattagtc ctctagtaca gtctgcttgt gggtgaccag
atggagtaat gctgagcaca 900gagactatga tggccgtgct aaggtaagag tattgataat
gtaagcatac ttcctctatc 960aacaataatt gttaacagct gcttcaagca cttgatatta
ccactagttg ttaactgaat 1020caagcatgtg ctccaagttc acattaatgt gaattgaaca
gcattgtgta cgtacgagga 1080gcttcatgca agtgttatac actgcactca caagtattat
gatcttacta agcattagaa 1140atactctgtg ttaaagaagc ttggtctagg ccaagcgtgg
tggctcatgc ctataatctc 1200agcactttgg gaggccaagg caggcagatc acatgaggcc
aggaatttga gaccagcctg 1260gccaacatgg tgaaacccca tctctactaa aaatacaaat
attagccagg tatgatggcg 1320catgcctata atcctaacta ctcaggaggc cgaagcagaa
gaatcacttg aacctgggag 1380gcggaggttg cagtgagcca agatcatgcc actgcactcc
agcctgggtg acagagtgag 1440actctgtctc aaaaaaaaaa aaaagaaaga aaagaaaaag
aaacttggtc tagttatttt 1500ccttcctctg gggaagtaac catttgggtg ggaatagttt
tgttgttgat cccatcttgc 1560tggtttggaa acaatgcact ggctccactt ttccactcat
gggctttaag gcccccttga 1620gtcccagtct ttctcctgac acatggctgt ctcctgacag
tcccctctgc tttacattgt 1680tctcagaggg tcctgggcca tcgtttgagc ttcattcttt
caaatacact tccctctttc 1740tctatcaagc caaggctccc ctcccccaga actctgcata
ggcccttcag cctccatgaa 1800tcccttagtg agtgagtaaa ctaccactgg attcagtcac
tgcaaatgta ctttatttac 1860cccttagcac tcttactaca tgtatgtgtt agggttcttc
aaagaaacag aaccaatagg 1920atacatagag atatataaga gaagatttat aatgggaatt
ggctcatgtg attatggagg 1980
108 1063 DNA Homo sapiens
108 agaatacgag
gaggaagagg tggccatacc gttgaccgct cctccaacta accagtaagt 60taagactgct
gttcaggaat ttgggaagct ggccccagaa aagaagtgga aatgaagggg 120tggtatcacg
gaaaacttga cagaacgata gcagaagaac gcctcaggca ggcagggaag 180tctggcagtt
atcttataag agagagtgat cggaggccag ggtcctttgt actttcattt 240cttagccaga
tgaatgttgt caaccatttt aggattattg ctatgtgtgg agattactac 300attggtggaa
gacgtttttc ttcactgtca gacctaatag gttattacag tcatgtttct 360tgtttgctta
aaggagaaaa attactttac ccagttgcac caccagagcc agtagaagat 420agaaggcgtg
tacgagctat tctaccttac acaaaagtac cagacactga tgaaataagt 480ttcttaaaag
gagatatgtt cattgttcat aatgaattag aagatggatg gatgtgggtt 540acaaatttaa
gaacagatga acaaggcctt attgttgaag acctagtaga agaggtgggc 600cgggaagaag
atccacatga aggaaaaata tggttccatg ggaagatttc caaacaggaa 660gcttataatt
tactaatgac agttggtcaa gtctgcagtt ttcttgtgag gccctcagat 720aatactcctg
gcgattattc actttatttc cggaccaatg aaaatattca gcgatttaaa 780atatgtccaa
cgccaaacaa tcagtttatg atgggaggcc ggtattataa cagcattggg 840gacatcatag
atcactatcg aaaagaacag attgttgaag gatattatct taaggaacct 900gtaccaatgc
aggatcaaga acaagtactc aatgacacag tggatggcaa ggaaatctat 960aataccatcc
gtcgtaaaac aaaggatgcc ttttataaaa acattgttaa gaaaggttat 1020cttctgaaag
aggccaaaaa aaaaaaaaaa aaaaaaaaaa aaa
1063
109 2599 DNA Homo sapiens
109 agctctctcg agtcactccg gcgcagtgtt gggactgtct
gggtatcgga aagcaagcct 60acgttgctca ctattacgta taatcctttt cttttcaaga
tttttatttt agatgcctga 120ggaagtgcac catggagagg aggaggtgga gacttttgcc
tttcaggcag aaattgccca 180actcatgtcc ctcatcatca ataccttcta ttccaacaag
gagattttcc ttcgggagtt 240gatctctaat gcttctgatg ccttggacaa gattcgctat
gagagcctga cagacccttc 300gaagttggac agtggtaaag agctgaaaat tgacatcatc
cccaaccctc aggaacgtac 360cctgactttg gtagacacag gcattggcat gaccaaagct
gatctcataa ataatttggg 420aaccattgcc aagtctggta ctaaagcatt catggaggct
cttcaggctg gtgcagacat 480ctccatgatt gggcagtttg gtgttggctt ttattctgcc
tacttggtgg cagagaaagt 540ggttgtgatc acaaagcaca acgatgatga acagtatgct
tgggagtctt ctgctggagg 600ttccttcact gtgcgtgctg accatggtga gcccattggc
aggggtacca aagtgatcct 660ccatcttaaa gaagatcaga cagagtacct agaagagagg
cgggtcaaag aagtagtgaa 720gaagcattct cagttcatag gctatcccat caccctttat
ttggagaagg aacgagagaa 780ggaaattagt gatgatgagg cagaggaaga gaaaggtgag
aaagaagagg aagataaaga 840tgatgaagaa aaacccaaga tcgaagatgt gggttcagat
gaggaggatg acagcggtaa 900ggataagaag aagaaaacta agaagatcaa agagaaatac
attgatcagg aagaactaaa 960caagaccaag cctatttgga ccagaaaccc tgatgacatc
acccaagagg agtatggaga 1020attctacaag agcctcacta atgactggga agaccacttg
gcagtcaagc acttttctgt 1080agaaggtcag ttggaattca gggcattgct atttattcct
cgtcgggctc cctttgacct 1140ttttgagaac aagaagaaaa agaacaacat caaactctat
gtccgccgtg tgttcatcat 1200ggacagctgt gatgagttga taccagagta tctcaatttt
atccgtggtg tggttgactc 1260tgaggatctg cccctgaaca tctcccgaga aatgctccag
cagagcaaaa tcttgaaagt 1320cattcgcaaa aacattgtta agaagtgcct tgagctcttc
tctgagctgg cagaagacaa 1380ggagaattac aagaaattct atgaggcatt ctctaaaaat
ctcaagcttg gaatccacga 1440agactccact aaccgccgcc gcctgtctga gctgctgcgc
tatcatacct cccagtctgg 1500agatgagatg acatctctgt cagagtatgt ttctcgcatg
aaggagacac agaagtccat 1560ctattacatc actggtgaga gcaaagagca ggtggccaac
tcagcttttg tggagcgagt 1620gcggaaacgg ggcttcgagg tggtatatat gaccgagccc
attgacgagt actgtgtgca 1680gcagctcaag gaatttgatg ggaagagcct ggtctcagtt
accaaggagg gtctggagct 1740gcctgaggat gaggaggaga agaagaagat ggaagagagc
aaggcaaagt ttgagaacct 1800ctgcaagctc atgaaagaaa tcttagataa gaaggttgag
aaggtgacaa tctccaatag 1860acttgtgtct tcaccttgct gcattgtgac cagcacctac
ggctggacag ccaatatgga 1920gcggatcatg aaagcccagg cacttcggga caactccacc
atgggctata tgatggccaa 1980aaagcacctg gagatcaacc ctgaccaccc cattgtggag
acgctgcggc agaaggctga 2040ggccgacaag aatgataagg cagttaagga cctggtggtg
ctgctgtttg aaaccgccct 2100gctatcttct ggcttttccc ttgaggatcc ccagacccac
tccaaccgca tctatcgcat 2160gatcaagcta ggtctaggta ttgatgaaga tgaagtggca
gcagaggaac ccaatgctgc 2220agttcctgat gagatccccc ctctcgaggg cgatgaggat
gcgtctcgca tggaagaagt 2280cgattaggtt aggagttcat agttggaaaa cttgtgccct
tgtatagtgt ccccatgggc 2340tcccactgca gcctcgagtg cccctgtccc acctggctcc
ccctgctggt gtctagtgtt 2400tttttccctc tcctgtcctt gtgttgaagg cagtaaacta
agggtgtcaa gccccattcc 2460ctctctactc ttgacagcag gattggatgt tgtgtattgt
ggtttatttt attttcttca 2520ttttgttctg aaattaaagt atgcaaaata aagaatatgc
cgtttttata cgaaaaaaaa 2580aaaaaaaaaa aaaaaaaaa
2599
110 829 DNA Homo sapiens
110 cctcttttcc gtggcgcctc
ggaggcgttc agctgcttca agatgaagct gaacatctcc 60ttcccagcca ctggctgcca
gaaactcatt gaagtggacg atgaacgcaa acttcgtact 120ttctatgaga agcgtatggc
cacagaagtt gctgctgacg ctctgggtga agaatggaag 180ggttatgtgg tccgaatcag
tggtgggaac gacaaacaag gtttccccat gaagcagggt 240gtcttgaccc atggccgtgt
ccgcctgcta ctgagtaagg ggcattcctg ttacagacca 300aggagaactg gagaaagaaa
gagaaaatca gttcgtggtt gcattgtgga tgcaaatctg 360agcgttctca acttggttat
tgtaaaaaaa ggagagaagg atattcctgg actgactgat 420actacagtgc ctcgccgcct
gggccccaaa agagctagca gaatccgcaa acttttcaat 480ctctctaaag aagatgatgt
ccgccagtat gttgtaagaa agcccttaaa taaagaaggt 540aagaaaccta ggaccaaagc
acccaagatt cagcgtcttg ttactccacg tgtcctgcag 600cacaaacggc ggcgtattgc
tctgaagaag cagcgtacca agaaaaataa agaagaggct 660gcagaatatg ctaaactttt
ggccaagaga atgaaggagg ctaaggagaa gcgccaggaa 720caaattgcga agagacgcag
actttcctct ctgcgagctt ctacttctaa gtctgaatcc 780agtcagaaat aagatttttt
gagtaacaaa taaataagat cagactctg 829
111 1980 DNA Homo sapiens
111 ttgatcttcc tgcctcagcc ttccaagtag ctgggactta aaggcgtgag ccaccacacc
60tgactaattt tcgtattttt tgtagagatg gggtttcgcc atgttgcccg ggctgttctc
120gaactcctga gctcaagcaa tctgcccacc tcagcctccc aaagcgctgg gattacaggc
180atgagccacc atcccagcca aaactataaa acttttagaa aagaacatag aagaaaatct
240ttgggtcctg ggggcaaaga gctctgagac ttgacatcaa aagcatgccg cataatagga
300aaatactaga ctttatttag gggttaagag tttagactct ggactctctc agccttggtt
360tcactagtta gctctatcac taactacatt gggcattgaa aattcctctg ttgtcccacg
420tggtgcatgg atgattgtag acgaggacac tgagatcctg aaggcagaag taatttctct
480aagcaacgtt gttggttggt ggcagagtct gggttacaac ccctggtttc ctgattccga
540gtccaagtga aatacttttg cccctgcagt agaccctgct acagaggata aaaaggcacg
600tcataggcta ggagaaaaat tttgcctacc acatatgtaa ccaaggacta gcagctagga
660catctgaaga attctcaaca ttcaacgggg tagaagaatg aacgattcaa tagaatatgg
720gcaaaagaca tgaagaggca ttttaccaaa catagggtgc tatggtccga atgtttgcat
780tctcctcaaa ttcctgtgtt gaaatcctaa cccccaaggt attggtatta ggaggcaggg
840gccctgggaa gtgattaggt cataaaggtg gagtcctcat ggatgggatt agtgtcttta
900taaaagagac ctttgccatg tgaggttaca gtgagaagac atctgtctat gaagaaagtg
960ggccctcacc aaacacagtc tgctggcact ttgcacttca actccccagc ttccagaact
1020gtaaggaata taagtctgtt gttggtaagc cacccggtct atgatatttt gttatagcag
1080cccaaacaga ctaagacagg tgacaaataa acatgaaaag atgttcaaca tcattagcca
1140ttagggaaat gcagattaaa accacagcga aatatcatga tacagttttc agcatggcta
1200aactagaaaa tagtgacacc accaaatgcc gacaaggctg tggggaaact gggttgttca
1260gacactgcca ctggggctgt agcgtactat agccactttc ataaacagtt tgtcagtttc
1320ttaaaaaact aaacctgcaa ctaccatatg acccagcaat tacacccctg ggcacctacc
1380caagagaaat gaaaactcaa cgtttgcgca aaaacctgtg taggaatgtt caagcagctt
1440tattcataat atgcccaaac aggaaacaac tcagctgtcc ttcagtaggt aaatagttaa
1500gcaaattgtc atacccctgt gtcatggagc actacctagc aataacaagg agcaaattat
1560tgatacataa caatctggat gaatctccag agaattatgt tgaatgaaaa aagccagccc
1620ctgaaggata catactgtat gatgccattt acataacatt cttgaaattc taaaattaca
1680gagatgggga acagatttgt ggttaaagat ggagccgggt gggaagaaag taggtgtggc
1740tataaacggg taacatgaag gatccttgtg gtgatggaaa tttctgtatt tttattgtat
1800ccgtgtcagt atcctggttg tgatatggta atacagtttt gcaagatact acccttaggg
1860gaaatgaggt aagacctggc atctctctgt attatttctt aattgcatgt gaatctacaa
1920ttatttcaaa ataaaaagta tgattgaagt aactctcagg aagcttagcc tactgtggat
198011279PRTArtificial SequenceSynthetic 112Ser Cys Gly Pro Ser Met Arg
Thr Arg Trp Ser Ser Ile Arg Arg Ser1 5 10
15Trp Arg Arg Leu Ile Leu Pro Ser Trp Thr Met Pro Gly
Ser Leu Leu 20 25 30Arg Gly
Thr Ala Thr Trp Trp Gly Leu Pro Thr Arg Ser Cys Ser Ser 35
40 45Arg Ala Ser Ala Ser Thr Ala Ser Leu Pro
Ser Ser Ala Ser Ser Arg 50 55 60Ser
Ser Trp Gln Pro Arg Arg Arg Ser Leu Arg Pro His Ser Ser65
70 7511312PRTArtificial SequenceSynthetic 113Met Arg
Asn Asp Arg Ala Ala Ser Arg Gln Ile Thr1 5
1011476PRTArtificial SequenceSynthetic 114Leu Ala His Arg Pro Pro Cys
Ala Glu Pro Asp Pro Gly Gln Arg Met1 5 10
15Glu Leu Pro Ala Pro Val Pro Arg Pro Arg Gly Ala Ser
Lys Pro Arg 20 25 30Asp Gly
Thr Ser Ser His Cys Asp Met Pro Asn Cys Gln His Pro Gln 35
40 45Gly Pro Gly Pro Ala Gly Glu Ile Arg Ser
Arg Cys Arg Ser Cys Trp 50 55 60Leu
Arg Ala Val Arg Cys Asn Pro Trp Leu Gly Arg65 70
7511520PRTArtificial SequenceSynthetic 115Asn Ser Gly Ala Ser
Gly Ser Arg Asn Phe Ser Ser Cys Ser Ala Glu1 5
10 15Asp Phe Glu Lys
2011631PRTArtificial SequenceSynthetic 116Ser Ser Val Pro Pro Gln Asp Thr
Ala Pro Tyr Ser Cys His Val Gln1 5 10
15His Ser Ser Leu Ala Gln Pro Leu Val Val Pro Trp Glu Ala
Ser 20 25
3011737PRTArtificial SequenceSynthetic 117Val Ala Val Ala Gln Gly Ser Gly
Ala Leu Glu Ser Ser Lys Trp Pro1 5 10
15Leu Leu Asn Leu Asn Gly Cys Leu Gly Arg Ala Glu Gly Gln
Val Leu 20 25 30Met Ala Ser
His Pro 3511811PRTArtificial SequenceSynthetic 118Ser Ala Phe Arg
Gly Tyr Leu Ala Asn Asn Lys1 5
1011936PRTArtificial SequenceSynthetic 119Ser Leu Ala His Arg Pro Pro Cys
Ala Glu Pro Asp Pro Gly Gln Arg1 5 10
15Met Glu Leu Pro Ala Pro Val Pro Arg Pro Arg Gly Ala Ser
Lys Pro 20 25 30Pro Arg Arg
Asp 3512051PRTArtificial SequenceSynthetic 120Leu Phe Ile Phe Ile
Thr Gln Lys Ser Phe Ile Phe Leu Phe Ser Phe1 5
10 15Leu Thr Leu Cys Leu Cys Leu Gln His Phe His
Asn Asp Phe Leu Leu 20 25
30Leu Asp Lys Glu Ser Thr Leu Asp Pro Val Thr Asn Thr Phe Ser Thr
35 40 45His Gly Thr
5012110PRTArtificial SequenceSynthetic 121Pro Tyr Gln Ile Tyr Gln Val Met
Ile Asn1 5 1012219PRTArtificial
SequenceSynthetic 122Val Ser Thr Phe Leu Ser Arg Val Gly Arg Val Ser Leu
Leu Asn Phe1 5 10 15Leu
Pro Phe12376PRTArtificial SequenceSynthetic 123Asn Thr Leu Val Thr Tyr
Asp Met Val Pro Glu Pro Lys Ile Ile Asp1 5
10 15Ala Ala Leu Arg Ala Cys Arg Arg Leu Asn Asp Phe
Ala Ser Thr Val 20 25 30Arg
Ile Leu Glu Val Val Lys Asp Lys Ala Gly Pro His Lys Glu Ile 35
40 45Tyr Pro Tyr Val Ile Gln Glu Leu Arg
Pro Thr Leu Asn Glu Leu Gly 50 55
60Ile Ser Thr Pro Glu Glu Leu Gly Leu Asp Lys Val65 70
7512461PRTArtificial SequenceSynthetic 124Pro Pro Ser His
His Ile Pro Asn Leu Ser Leu Thr Lys Arg Lys Pro1 5
10 15Ser Pro His Ser Leu Asn Leu Ile His His
Ser Arg Gln Leu Arg Trp 20 25
30Ile Lys Pro Asn Pro Ala Thr Gln Asn Leu Ser Ile Leu Leu Asn Tyr
35 40 45Pro His Arg Met Asn Asn Ser Ser
Ser Thr Val Gln Pro 50 55
601258PRTArtificial SequenceSynthetic 125Ser Ala Gly Ser Cys Ser Ser Ala1
512651PRTArtificial SequenceSynthetic 126Lys Leu Leu Phe
Ala Leu Gln Leu Trp Asn Leu Val Leu Gln Pro Leu1 5
10 15Leu Phe Cys Pro Asn Gly Pro Cys Ser Leu
Asp Gln Glu Leu Gln Lys 20 25
30Trp Lys Lys Leu Met Lys Arg His Leu Ile Asn Val Asp Gly Ser Lys
35 40 45Ser Cys Pro
50127131PRTArtificial SequenceSynthetic 127Glu Asp Asp Tyr Ser Tyr Gln
Gly His Met Gln Ser Cys Asn Phe Ser1 5 10
15Ala Glu Lys Ala Lys Val Tyr Ile Asn Asp Ser Val Glu
Leu Ser Gln 20 25 30Asn Glu
Gln Lys Leu Ala Ala Trp Leu Ala Lys Arg Gly Pro Ile Ser 35
40 45Val Ala Ile Asn Ala Phe Gly Met Gln Phe
Tyr Arg His Gly Ile Ser 50 55 60Arg
Pro Leu Arg Pro Leu Cys Ser Pro Trp Leu Ile Asp His Ala Val65
70 75 80Leu Leu Val Gly Tyr Gly
Asn Arg Ser Asp Val Pro Phe Trp Ala Ile 85
90 95Lys Asn Ser Trp Gly Thr Asp Trp Gly Glu Lys Gly
Tyr Tyr Tyr Leu 100 105 110His
Arg Gly Ser Gly Ala Cys Gly Val Asn Thr Met Ala Ser Ser Ala 115
120 125Val Val Asp 13012811PRTArtificial
SequenceSynthetic 128Gly Thr Asn Gln Arg Gln Thr Met Glu Asn His1
5 10129115PRTArtificial SequenceSynthetic 129Ser
Ser Cys Ser Glu Tyr Asn Val Arg Val Ala Ser Arg Tyr Phe Lys1
5 10 15Gly Pro Glu Leu Leu Val Asp
Tyr Gln Met Tyr Asp Tyr Ser Leu Asp 20 25
30Met Trp Ser Leu Gly Cys Met Leu Ala Ser Met Ile Phe Arg
Arg Glu 35 40 45Pro Phe Phe His
Gly Gln Asp Asn Tyr Asp Gln Leu Val Arg Ile Ala 50 55
60Lys Val Leu Gly Thr Glu Glu Leu Tyr Gly Tyr Leu Lys
Lys Tyr His65 70 75
80Ile Asp Leu Asp Pro His Phe Asn Asp Ile Leu Gly Gln His Ser Arg
85 90 95Lys Arg Trp Glu Asn Leu
Ser Ile Val Arg Thr Asp Thr Leu Ser Ala 100
105 110Leu Arg Pro 11513089PRTArtificial
SequenceSynthetic 130Ala Ala Arg Leu Gly Pro Ser Leu Glu Cys Trp Ala Ala
Gly Ser Ala1 5 10 15Gly
Pro Phe Thr Ala His Arg Arg Pro Ala Gln Val Gly Arg Pro Leu 20
25 30Ser Leu Ala Arg Gly Pro Ser Trp
Ser Trp Arg Arg Cys Trp Ser Pro 35 40
45Gly Arg Cys Pro Ser Ala Pro Trp Arg Ala Gly Ser Arg Pro Ala Ala
50 55 60Ser Cys Pro Asp Trp Ile Pro Gly
Pro Gln Gly Leu Trp Leu His Arg65 70 75
80Asn Pro Thr Ser Val Arg Pro Ala Arg
8513111PRTArtificial SequenceSynthetic 131Gly Lys Glu Arg Glu Asn Ile Arg
Thr Asn Thr1 5 1013222PRTArtificial
SequenceSynthetic 132Pro Lys Cys Arg Leu Gln Arg Gln Tyr Thr Gly Lys Gly
Gly Val Gly1 5 10 15Phe
Val Tyr Glu Gly Val 2013340PRTArtificial SequenceSynthetic
133Gln Thr Gln Thr His Thr Ser Ala Pro Leu Lys Cys Gln Pro Trp Ser1
5 10 15Phe Val Glu Ala Arg Ile
Cys His Gly Ser Gln Leu Val Arg Cys Pro 20 25
30Val Gln His Pro Ser Arg Ile Ser 35
4013418PRTArtificial SequenceSynthetic 134Pro Arg Leu His Gln Xaa Lys
Ala Asn Tyr Ile Tyr Ser Ile Asp Pro1 5 10
15Ile Thr13514PRTArtificial SequenceSynthetic 135Pro Gln
Thr Thr Ala Pro Arg Arg Ala Arg Pro Arg Arg Ser1 5
1013647PRTArtificial SequenceSynthetic 136Gly Thr Ile Ser Ile
Val Cys Cys Trp Gly Cys Leu Cys Gln His Leu1 5
10 15Val Gln Cys Leu Ala Asp Gly Cys Ser Ile Asn
Ile Asp Leu Met Gly 20 25
30Tyr Glu Gly Val Asn Ile Lys Leu Ala Phe Ile Gln Gln Leu Leu 35
40 4513718PRTArtificial SequenceSynthetic
137Gly Met Ser His His Ala Trp Pro Arg Pro Ser Phe Phe Asn Thr Glu1
5 10 15Tyr
Phe138160PRTArtificial SequenceSynthetic 138Asp Arg Arg Pro Gly Ser Phe
Val Leu Ser Phe Leu Ser Gln Met Glu1 5 10
15Thr Asn Val Val Thr His Phe Arg Ile Ile Ala Met Glu
Thr Cys Gly 20 25 30Asp Tyr
Tyr Ile Gly Gly Arg Arg Phe Ser Ser Leu Ser Asp Leu Ile 35
40 45Gly Tyr Tyr Ser His Val Ser Cys Leu Leu
Lys Gly Glu Lys Leu Leu 50 55 60Tyr
Pro Val Ala Pro Pro Glu Pro Val Glu Asp Arg Arg Arg Val Arg65
70 75 80Ala Ile Leu Pro Tyr Thr
Lys Val Pro Asp Thr Asp Glu Ile Ser Phe 85
90 95Leu Lys Gly Asp Met Glu Thr Phe Ile Val His Asn
Glu Leu Glu Asp 100 105 110Gly
Trp Met Glu Thr Trp Val Thr Asn Leu Arg Thr Asp Glu Gln Gly 115
120 125Leu Ile Val Glu Asp Leu Val Glu Glu
Val Gly Arg Glu Glu Asp Pro 130 135
140His Glu Gly Lys Ile Trp Phe His Gly Lys Ile Ser Lys Gln Glu Ala145
150 155 16013988PRTArtificial
SequenceSynthetic 139Tyr Phe Ala Tyr Leu Ile Ser Glu Gln Asn Glu Glu Asn
Lys Ile Asn1 5 10 15His
Asn Thr Gln His Pro Ile Leu Leu Ser Arg Val Arg Glu Gly Met 20
25 30Gly Leu Asp Thr Leu Ser Leu Leu
Pro Ser Thr Gln Gly Gln Glu Arg 35 40
45Glu Lys Asn Thr Arg His Gln Gln Gly Glu Pro Gly Gly Thr Gly Ala
50 55 60Leu Glu Ala Ala Val Gly Ala His
Gly Asp Thr Ile Gln Gly His Lys65 70 75
80Phe Ser Asn Tyr Glu Leu Leu Thr
85140152PRTArtificial SequenceSynthetic 140Cys Ile Val Asp Ala Asn Leu
Ser Val Leu Asn Leu Val Ile Val Lys1 5 10
15Lys Gly Glu Lys Asp Ile Pro Gly Leu Thr Asp Thr Thr
Val Pro Arg 20 25 30Arg Leu
Gly Pro Lys Arg Ala Ser Arg Ile Arg Lys Leu Phe Asn Leu 35
40 45Ser Lys Glu Asp Asp Val Arg Gln Tyr Val
Val Arg Lys Pro Leu Asn 50 55 60Lys
Glu Gly Lys Lys Pro Arg Thr Lys Ala Pro Lys Ile Gln Arg Leu65
70 75 80Val Thr Pro Arg Val Leu
Gln His Lys Arg Arg Arg Ile Ala Leu Lys 85
90 95Lys Gln Arg Thr Lys Lys Asn Lys Glu Glu Ala Ala
Glu Tyr Ala Lys 100 105 110Leu
Leu Ala Lys Arg Met Glu Thr Lys Glu Ala Lys Glu Lys Arg Gln 115
120 125Glu Gln Ile Ala Lys Arg Arg Arg Leu
Ser Ser Leu Arg Ala Ser Thr 130 135
140Ser Lys Ser Glu Ser Ser Gln Lys145
15014122PRTArtificial SequenceSynthetic 141Leu Ile Cys Ile Ser Leu Met
Ala Asn Asp Val Glu His Leu Phe Met1 5 10
15Phe Ile Cys His Leu Ser
20142219DNAArtificial SequenceSynthetic 142aagcttcgcc tccttggctg
ccagctgctt ctggagctgg ctgagctggg cagagaggct 60gtcgatgcgg atgcgcgact
gctgcagctc ctcgtgggca gcccccacca ggttgctgtt 120cctctcagca gactgcctgg
cattgtccag cttggcagaa taagtcttct ccagctcctt 180cttatactgc tccacctggt
cctcatgctg ggcccgcag 219143438DNAArtificial
SequenceSynthetic 143aatgagaaat gaccgagcag cttcgaggca gattacatga
cttatgatct acatttaaat 60atgatcttgg gagatgtgga agaaactgtg actactatag
aaattgatga agaaacatat 120gaagagatat ataaatcaac gaaacggaat attccaatgc
tctttgtccg gggagatggc 180gttgtcctgg ttgcccctcc actgagagtt ggctgaaaca
aagaatttgt cctgtatgga 240aaacgggaga ctttgtacag tggcctctct aaaagtacaa
aacattcata agagaaacct 300gcatacattt tgatattaag aaataattcc ggggattctc
cactcctgaa atgagttgat 360ttgcagataa ctctacaact tcttaagcta aatggtattt
tcatttttct caagctctcc 420aataaatatg accaccaa
438144555DNAArtificial SequenceSynthetic
144ggagtttcac ttttgttgcc caggattgag tgcagtgccc cgatcttggc tcactacaac
60ctctgcctcc tgggttcaag cgactctcct gcctcagtgt cctgagtagc tgggattaca
120ggcgtctgcc accacgcccg gctaattttg tatttttagt agagaacagg tttcactatg
180ttggtcaggc tggtcttgaa ctcctgacct cagcgcatcc agaattttag acggggcccc
240cagggtgagg tcttggcacc ctccagtaga gaagaaggga catgggccat acgtggggtg
300tcctttctgg gagccttgcg tcccttacct gcctagccag ggattgcacc tcacagcacg
360cagccagcag gaacggcacc gtgatctgat ttcacctgcg ggccctgggc cctgggggtg
420ttgacaattg ggcatatcac agtgtgagct agtcccgtct cggggtttgg aggctccacg
480tggccgtggt acaggagcag gcagttccat cctctggcct ggatcaggct ctgcacacgg
540aggcctgtgg gccag
555145306DNAArtificial SequenceSynthetic 145tcggcataaa gtacctcctg
gaaggaaccg acagtcttta caacagtcac catatgcaca 60ctcagcaaat gatttaagct
tacaggtact tccttcgcag caagggtcca attcacattc 120ctttggagta ccacagtcac
actcttcccc agcgtccacc aacttattac cacaggaggg 180agcactatag gcttcatcag
gctttggaat attaagaagg cagtttcctc ctttatttaa 240agttacttct caaagtcctc
tgcactgcaa ctgctaaagt ttctggaacc cgatgctcct 300gaattc
306146463DNAArtificial
SequenceSynthetic 146tcaagcgtgc ccccgcagga cacagccccc tactcctgcc
acgtgcagca cagcagcctg 60gcccagcccc tcgtggtgcc ctgggaggcc agctaggaag
caagggttgg aggcaatgtg 120ggatctcaga cccagtagct gcccttcctg cctgatgtgg
gagctgaacc acagaaatca 180cagtcaatgg atccacaagg cctgaggagc agtgtggggg
gacagacagg aggtggattt 240ggagaccgaa gactgggatg cctgtcttga gtagacttgg
acccaaaaaa tcatctcacc 300ttgagcccac ccccacccca ttgtctaatc tgtagaagcc
ggaagcttgc ggccgcactc 360gagtaactag ttaacccctt ggggcctcta aacgggtctt
gaggggttan ctngttnctc 420gngtgcggcc gcnngcttcc ggcttctncn gnttngncnn
tgn 463147388DNAArtificial SequenceSynthetic
147gtggctgttg cgcagggatc aggtgcactt gagtcttcga agtggccatt gctcaacttg
60aatggctgcc tgggtcgggc agaaggccag gtcctcatgg cttcccatcc ctaatgaccg
120gaatacatgg gctgccaggt cagatgtggg ccacatggga agtcccagct ctattctaga
180aaatgcatgt accatcagct tactgataga catttactga acttgggtat gccagatcca
240cagggggccc cagagatgag ggggataaga aggtttctga aggcatggta cagaaggtgc
300cagcagaggt atgggctagg ggaggcaggg agagcacaga gcaggcatcc taaaggaggc
360agcatttgtg ttggagcttg aagaagtg
388148292DNAArtificial SequenceSynthetic 148taagctttca tcttccccaa
ccctgatgtc ttcctattct cactgatccc cctactgact 60cagcttcacg cttcttgatt
atacctctct cctgtagaaa agccttggct ggctctcctt 120taggatgaga ataaatccga
aatccttagt gtagcattta gaagtcctat ctcccacttg 180tttcttaata ttctcttctc
taacaccgaa cttgtttcaa gcctcttttc caacacatga 240tttcttctat tctaaatcaa
tttatttatt atttgctaaa tagcccctaa ac 292149419DNAArtificial
SequenceSynthetic 149ggctaatttt gtatttttag tagagaacag gtttcactat
gttggtcagg ctggtcttga 60actcctgacc tcagcgcatc cagaatttta gacggggccc
ccagggtgag gtcttggcac 120cctccagtag agaagaaggg acatgggcca tacgtggggt
gtcctttctg ggagccttgc 180gtcccttacc tgcctagcca gggattgcac ctcacagcac
gcagccagca ggaacggcac 240cgtgatctga tttcacctgc gggccctggg ccctgggggt
gtttgacaat tggggcatat 300cacagtgtga gctagtcccg tctcgggggt ttggaggctc
cacgtggccg tggtacagga 360gcaggcagtt ccatcctctg gcctggatca ggctctgcac
acggaggcct gtgggccag 419150157DNAArtificial SequenceSynthetic
150gtcttttcat ttttattact caaaaaagtt tcattttttt atttagcttt ctgactctgt
60gcttgtgcct tcaacacttt cacaacgatt ttctgctcct cgataaggaa agcacgcttg
120atcctgtcac gaacacattt agcacacatg gaaccaa
157151397DNAArtificial SequenceSynthetic 151cttaccagat ctatcaggtc
atgataaatt agacccagtc catctttcaa tccagtctac 60tctggttctg aacatataaa
cacaaaacac tacagattta ttaatatagc attttcccac 120accctaaccc tataaagaac
tttaaaagag aaaatttcat ctaaatattt cacacttaaa 180ggaaagcctt accaactatg
gcaacaggtt tggaccatga aatagtactt tcctagatga 240catatcgagt caacatgaag
ccttagctga aatgaatgat tcaggatatt aatgagaaat 300tctcacaaat gatatgcatt
taggaaatga ttttgctttc cttaaatagt tcgaaggctt 360gaaaataaac tttttttttg
catttctttt aaaagtt 397152644DNAArtificial
SequenceSynthetic 152gtttccacat tcttgtcaag ggttggtagg gtcagtcttt
taaatttctt gccattttag 60tgactgtgca ttggtatttc attgtggttt atttgcatga
tgactaatgc tcaacaccaa 120ctaatcatgt tgagtatttt taatgtgctt atttgccact
catatatctt ctttgatgaa 180gtgtctcttc aaatattttg cccatttaaa aactgtattg
attcttatta ttgaattgca 240ataattcttt ctatccggat atatatcctt tgccagatat
gtgtattaca aatgttttct 300cctagccttc cacctcagcc tcccaagtag ctgggaatgc
aggtgtgcac caccactcca 360gggttttttg ttgttgttgt tgttgttttt ctgtagagac
agggtcttgc catgctgccg 420aggctgctct caaactcctg ggatcaagaa atcctcctgc
ctcggcctcc caaagtgctg 480acattacaag catgagccac tgtgcctggc taacttttca
tcttttaaag tagtgtcttg 540caaagaacaa cattttaatg aagtccattt atcaactttt
tgattcattg tccatgcttt 600ttgcataata agaaatcttt gcctgcctca aaattgcaaa
gctt 644153263DNAArtificial SequenceSynthetic
153aacacacttg ttacctatga tatggttcca gagcccaaaa tcattgatgc tgctttgcgg
60gcatgcagac ggttaaatga ttttgctagt acagttcgta tcctagaggt tgttaaggac
120aaagcaggac ctcataagga aatctacccc tatgtcatcc aggaacttag accaacttta
180aatgaactgg gaatctccac tccggaggaa ctgggccttg acaaagtgta accgcataat
240aaaagggaaa tgagtttgaa ctg
263154236DNAArtificial SequenceSynthetic 154gcccccatct catcatatac
caaatctctc cctcactaaa cgtaagcctt ctcctcactc 60tctcaatctt atccatcata
gcaggcagtt gaggtggatt aaaccaaacc cagctacgca 120aaatcttagc atactcctca
attacccaca taggatgaat aatagcagtt ctaccgtaca 180accctaacat aaccattctt
aatttaacta tttatattat cctaactact accgca 236155332DNAArtificial
SequenceSynthetic 155gggttcgtgt tcctcagcgt agccatcagg cttggccagc
tgctccttgt aaagctgccc 60cacagtgcgg aacatgccct tccgcgtctt gaaggccccg
ggcagtgcgg tctccgacat 120gccggccacc tggtccaggc cgatgatgcg gtccacatcc
ttccacagct ccgagacaaa 180cttgtcagag gactggtgga gcagtgtggc gatgttgtca
ttcaggggat ccatgttctt 240catcagccac tcgtcagctt tgtaatccac cttgccggca
tagtggataa tgcagaaatc 300agctttgtcc ttcagctgct tgggcttctg ga
332156279DNAArtificial SequenceSynthetic
156aaattacttt tcgccttgca gctgtggaac ttggtcttac agcctctgct cttctgccca
60aacgggccat gcagtttgga tcaagaattg caaaaatgga aaaaattaat gaaaaggcat
120ctgataaatg tggacggctc caaatcatgt ccttagaaaa tctttctatt gaaaaggaga
180ctaaattgta atgtgattca caatgtaaca atataaaaat aagtttttat ataattatat
240aaaagtaaga tactctgctg ctttactatt gtataatat
279157877DNAArtificial SequenceSynthetic 157cagaggatga ctacagctac
cagggtcaca tgcagtcctg caacttctca gcagagaagg 60ccaaggtcta catcaatgac
tccgtggagc tgagccagaa cgagcagaag ctggcagcct 120ggctggccaa gagaggccca
atctccgtgg ccatcaatgc ctttggcatg cagttttacc 180gccacgggat ctcccgccct
ctccggcccc tctgcagccc ttggctcatt gaccatgcgg 240tgttgcttgt gggctacggc
aaccgctctg acgttccctt ttgggccatc aagaacagct 300ggggcactga ctggggtgag
aagggttact actacttgca tcgcgggtcc ggggcctgtg 360gcgtgaacac catggccagc
tcggcggtgg tggactgaag aggggccccc agctcgggac 420ctggtgctga tcagagtggc
tgctgcccca gcctgacatg tgtccaggcc cctccccggg 480aggtacagct ggcagaggga
aaggcactgg tacctcaggg tgagcagagg gcactgggct 540ggggcacagc ccctgcttcc
ctgcacccca ttcccaccct gaagttctgc acctgcacct 600ttgttgaatt gtggtagctt
aggaggatgt cagggtgaag ggtggtatct tggcagttga 660agctggggca agaactctgg
gcttgggtaa tgagcaggaa gaaaattttc tgatcttaag 720cccagctgtg ttctgccccc
gctttcctct gtttgatact ataaattttc tggttccctt 780ggatttaggg atagtgtccc
cctccatgtc caggaaactt gtaaccaccc ttttctaaca 840gcaataaaga gggtccttgt
cccgaaaaaa aaaaaaa 877158568DNAArtificial
SequenceSynthetic 158ggcagacaat ggaaaaccat tgaaaaggat taaactggga
agtgatatgt tctcttttgc 60atttaaaaag atcaccaatg gggatatgga gaatggtctg
gataggtctt aagactagag 120ccaggaagac atgttagaag gctatcaatt gaccctaaag
acactgcttc aatccctttg 180atgacagtga gtttgctttc cccagagata gcttattgga
cctcaggact gctgtgagaa 240acagaaaatg ctcctttacg tgttgcctga agttaggctc
accgatttgg ggcatgttct 300aattctacca gctaggaaca cacagaatcg cttgtcaaac
attctgagtc agatatgtcc 360tccctatgtc ttttctgaga aaggcataca gaaattccca
gctaaacatc accagttccc 420tcatttgttc ctcagatgat atggtccatt caagttttgt
aatcatcatg ggggtagatg 480gagggtccca gtcctcacaa ccattctggt aatttactct
tgaatttact ggttcacatg 540tatctatttt gtagtgtggc tccagaaa
568159522DNAArtificial SequenceSynthetic
159tcatcctgct cggagtacaa tgttcgtgta gcctcaaggt acttcaaggg accagagctc
60ctcgtggact atcagatgta tgattatagc ttggacatgt ggagtttggg ctgtatgtta
120gcaagcatga tctttcgaag ggaaccattc ttccatggac aggacaacta tgaccagctt
180gttcgcattg ccaaggttct gggtacagaa gaactgtatg ggtatctgaa gaagtatcac
240atagacctag atccacactt caacgatatc ctgggacaac attcacggaa acgctgggaa
300aacttatcca tagtgagaac agacaccttg tcagccctga ggccctagat cttctggaca
360aacttctgcg atacgaccat caacagagac tgactgccaa agaggccatg gagcacccat
420acttctaccc tgtggtgaag gagcagtccc agccttgtgc agacaatgct gtgctttcca
480gtggtctcac ggcagcacga tgaagactgg aaagcgacgg gt
522160363DNAArtificial SequenceSynthetic 160cggccgcccg ccttggcccg
tctctggagt gctgggcagc cgggtctgcg ggccccttta 60cagcacatcg ccggccggcc
caggtagggc ggcctctctc cctcgcaagg gggcccagct 120ggagctggag gagatgctgg
tccccaggaa gatgtccgtc agccccctgg agagctggct 180cacggcccgc tgcttcctgc
ccagactgga taccgggacc gcagggactg tggctccacc 240gcaatcctac cagtgtccgc
ccagccagat aggggaaggg gccgagcagg gggatgaagg 300cgtcgcggat gcgcctcaaa
ttcagtgcaa aaacgtgctg aagatccgcc ggcggaagat 360gaa
363161662DNAArtificial
SequenceSynthetic 161ggcagggaag ggagaacatt aggacaaata cctaatgcac
gccaggccct antaatcgta 60gatgatgggt tgatgggtgt agcaaaccac catggcacat
gtatatctat gtaacaaacc 120tgcacattct gtacatgtat cccagaactt caagtaaaat
tttaaaaaat tcaaaaaaag 180taataggaaa aggggaaaca tccacgtgag cagtccagtt
tcccaatctg gaacttggag 240ctgttcacct ggtgggtgtt tgtgactatt cagacacaga
caacaaaggc tactccagat 300tgaagtgcac tgcttacttt cagtgacctc atagaactac
tcaacattgt ttttggtgat 360tcctgtgcta tggtttgaat ggctccgctc caaaactcag
gtgttgccaa tgngatggta 420ttaagaagta gggcatttaa aaaacaacaa caggcctggc
gcggtggccc acgcctgtaa 480tcccagcact ttgggaggct aaggcgggcg gatcaccgga
ggtcaggaat tcaaaaccag 540cctggccaac atggcgaaac cctgtctcta ctaaaaatac
aaaaattagc caggcatggt 600tgcgggcgcc tgtaatcccg gctactcggg aggctgaggc
aggggaatcc ttgaacccgg 660ga
662162547DNAArtificial SequenceSynthetic
162gaaatgtaga ctgcaaaggc agtatacagg aaaaggtgga gtgggttttg tttatgaggg
60tgtctgaaaa ctaaaattga gcgggatatc atggtatagt tggacagtat tggtccttca
120cactttggcc atattgtata atggagcttt taccaaagat gtatgagaag tgtaagacta
180taaaaaaatg aactattcaa agtaaaactc ttaacaaaca ttttacttaa agcagatgca
240aaagggtatt ctcatgtagg ctcctgttgg tgcagaggga tttttttgat ttcaggatac
300aactaaagta cgaagttctc agtttcactt tagtagaaag agctctagaa atgaggctga
360taaacacatc taagaacact ggttgctttc taaaatttcc aaagctccac cataaatgta
420atttttagtg tttcaaatga ttgcatttta aagtatataa atatgggtta tccaatatca
480atgctatagt aacatcctga aacaaaacaa gcacaaaggt ataaatgcct aaactggagg
540aagcttg
547163277DNAArtificial SequenceSynthetic 163ctcagactca aacacacacc
tccgctccct tgaagtgcca gccctggagc tttgttgagg 60ctcgcatctg ccacgggagt
cagctagtac gttgcccagt tcaacatcca tccaggattt 120cataggaact tgagaatcat
tgtttttggc ttgaatcctg ggtttgaggt ttcttcgtgt 180aggaatctga aaaaaggatt
tggaaacgtt gttgtctcta atcccaaagt atgtatctgg 240gaggctgcct tcgccatcac
ccacctaata actcagg 277164337DNAArtificial
SequenceSynthetic 164agacttcacc agtcaaagcg aactacatat actcaattga
tccaataact tgaccaacgg 60aacaagttac cctagggata acagcgcaat cctattctag
agtccatatc aacaataggg 120tttacgacct cgatgttgga tcaggacatc ccgatggtgc
agccgctatt aaaggttcgt 180ttgttcaacg attaaagtcc tacgtgatct gagttcagac
cggagtaatc caggtcggtt 240tctatctact tcaaattcct ccctgtacga aaggacaaga
gaaataaggc ctacttcaca 300aagcgccttc ccccgtaaat gatatcatct caagctt
337165276DNAArtificial SequenceSynthetic
165ctcgctcaaa cacacacctc cgctcccttg aagtgccagc cctggagctt tgttgaggct
60cgcatctgcc acgggagtca gctagtacgt tgcccagttc aacatccatc caggatttca
120taggaacttg agaatcattg tttttggctt gaatcctggg tttgaggttt cttcgtgtag
180gaatctgaaa aaaggatttg gaaacgttgt tgtctctaat cccaaagtat gtatctggga
240ggctgccttc gccatcaccc acctaataac tcaggc
276166717DNAArtificial SequenceSynthetic 166attgtttgtt gttgggggtg
tctttgtcag catctagtac agtgcctggc agatggatgc 60tcaataaata ttgatttaat
gggttatgag ggtgttaata taaaattagc atttattcag 120caactactat gagtcagcca
ctgggctaag tggcttacat gttaagaacc tcacagaagc 180caggtgtggt ggctcacgcc
tgtaatccca gcactttggg aggctgaagc gggcagatca 240cctgaggtca ggagtttgag
tccaggctgg ccaacgtggt gaaaccccat ctctactaaa 300aatacaaaaa ttagccagtt
gtggtggcag gcgcctgtag tcccagccac tcaggaggct 360aaggcaggag aatagctgga
acccgggagg tggagattgc agtgagccaa gattgcacca 420ctgcactcca gcctgggtga
cagagtgaga ctctgtctcc aaaaaaaaaa gaaaaagaaa 480aagaacctcc agcaacctag
taggtgagcc cggttactct tgttttacag gtgagaaaat 540tgagccctag agaaataaag
taacttgctt caggtctcat ggttaagggg aacctgggcc 600ctaacagtcc acttcctgta
ccttcaacca cggttctacc gcctccgcta ggaaatggcc 660cgaggacatt ccttagctgg
cttcagcttg ctctttttcc cctgcggtcc acccctg 717167316DNAArtificial
SequenceSynthetic 167agagggagta tagggctgtg cacagagact atgatggccg
tgctaaggta agagtattga 60taatgtaagc atacttcctc tatcaacaat aattgttaac
agctgcttca agcacttgat 120attaccacta gttgttaact gaatcaagca tgtgctccaa
gttcacatta atgtgaattg 180aacagcattg tgtacgtacg aggagcttca tgcaagtgtt
atacactgca ctcacaagta 240ttatgatctt actaagcatt agaaatactc tgtgttaaag
aagcttggtc taggccaagc 300gtggtggctc atgcct
316168457DNAArtificial SequenceSynthetic
168gatcggaggc cagggtcctt tgtactttca tttcttagcc agatgaatgt tgtcacccat
60tttaggatta ttgctatgtg tggagattac tacattggtg gaagacgttt ttcttcactg
120tcagacctaa taggttatta cagtcatgtt tcttgtttgc ttaaaggaga aaaattactt
180tacccagttg caccaccaga gccagtagaa gatagaaggc gtgtacgagc tattctacct
240tacacaaaag taccagacac tgatgaaata agtttcttaa aaggagatat gttcattgtt
300cataatgaat tagaagatgg atggatgtgg gttacaaatt taagaacaga tgaacaaggc
360cttattgttg aagacctagt agaagaggtg ggccgggaag aagatccaca tgaaggaaaa
420atatggttcc atgggaagat ttccaaacag gaagctt
457169361DNAArtificial SequenceSynthetic 169tgaagtggca gcagaggaac
ccaatgctgc agttcctgat gagatccccc ctctcgaggg 60cgatgaggat gcgtctcgca
tggaagaagt cgattaggtt aggagttcat agttggaaaa 120cttgtgccct tgtatagtgt
ccccatgggc tcccactgca gcctcgagtg cccctgtccc 180acctggctcc ccctgctggt
gtctagtgtt tttttccctc tcctgtcctt gtgttgaagg 240cagtaaacta agggtgtcaa
gccccattcc ctctctcact cttgacagca ggattggatg 300ttgtgtattg tggtttattt
tattttcttc attttgttct gaaattaagt atgcaaaata 360a
361170487DNAArtificial
SequenceSynthetic 170gttgcattgt ggatgcaaat ctgagcgttc tcaacttggt
tattgtaaaa aaaggagaga 60aggatattcc tggactgact gatactacag tgcctcgccg
cctgggcccc aaaagagcta 120gcagaatccg caaacttttc aatctctcta aagaagatga
tgtccgccag tatgttgtaa 180gaaagccctt aaataaagaa ggtaagaaac ctaggaccaa
agcacccaag attcagcgtc 240ttgttactcc acgtgtcctg cagcacaaac ggcggcgtat
tgctctgaag aagcagcgta 300ccaagaaaaa taaagaagag gctgcagaat atgctaaact
tttggccaag agaatgaagg 360aggctaagga gaagcgccag gaacaaattg cgaagagacg
cagactttcc tctctgcgag 420cttctacttc taagtctgaa tccagtcaga aataagattt
tttgagtaac aaataaataa 480gatcaga
487171319DNAArtificial SequenceSynthetic
171cctgggcagt gattaggtca taaaggtgga gtcctcatgg atgggattag tgtctttata
60aaagagacct ttgccatgtg aggttacagt gagaagacat ctgtctatga agaaagtggg
120ccctcaccaa acacagtctg ctggcacttt gcacttcaac tccccagctt ccagaactgt
180aaggaatata agtctgttgt tggtaagcca cccggtctat gatattttgt tatagcagcc
240caaacagact aagacaggtg acaaataaac atgaaaagat gttcaacatc attagccatt
300agggaaatgc agattaaaa
31917230PRTArtificial SequenceSynthetic 172Gly Gly Arg Gly Gly Gly Gly
Gly Gly Gly Gly Arg Gly Ala Gly Gly1 5 10
15Gly Arg Gly Ala Gly Ala Gly Gly Gly Arg Pro Glu Ala
Ala 20 25 30