Patent application title: IDENTIFICATION OF SNARE YKT6 SEQUENCES
Inventors:
Ryogen Llc (Suffern, NY, US)
James Ryan (Augusta, GA, US)
James Ryan (Augusta, GA, US)
Assignees:
RYOGEN LLC
IPC8 Class: AC12Q168FI
USPC Class:
435 611
Class name: Measuring or testing process involving enzymes or micro-organisms; composition or test strip therefore; processes of forming such composition or test strip involving nucleic acid nucleic acid based assay involving a hybridization step with a nucleic acid probe, involving a single nucleotide polymorphism (snp), involving pharmacogenetics, involving genotyping, involving haplotyping, or involving detection of dna methylation gene expression
Publication date: 2013-05-23
Patent application number: 20130130251
Abstract:
Provided are isolated genomic polynucleotide fragments that encode human
SNARE YKT6, human glucokinase, human adipocyte enhancer binding protein
(AEBP1) and DNA directed 50 kD regulatory subunit (POLD2), vectors and
hosts containing these fragments and fragments hybridizing to noncoding
regions as well as antisense oligonucleotides to these fragments. The
invention is further directed to methods of using these fragments to
obtain SNARE YKT6, human glucokinase, AEBP1 protein and POLD2 and to
diagnose, treat, prevent and/or ameliorate a pathological disorder.Claims:
1. A method of identifying a nucleotide sequence variant of a
5'-noncoding region, 3'-noncoding region or intron region of SEQ ID NO:5
or its complementary sequence comprising (a) isolating genomic
polynucleotide from a sample and (b) determining the presence or absence
of a nucleotide sequence variation in said genomic polynucleotide by
comparing the nucleotide sequence of SEQ ID NO:5 with the nucleotide
sequence of the isolated genomic polynucleotide and establishing if and
where a difference occurs between the two nucleic acid sequences thereby
identifying a nucleotide sequence variant of SEQ ID NO:5 or its
complement.
2. The method according to claim 1, wherein said variant encodes a protein having human SNARE YKT6 activity.
3. A method for detecting the presence of: (a) a nucleic acid molecule 39,000 nucleotides in length which is at least 99% identical to SEQ ID NO:5 which encodes a polypeptide that has human glucokinase activity, wherein SEQ ID NO:5 consists of a 5'-noncoding region shown in sequence segment 39000-15464 of SEQ ID NO:5, a 3'-non coding region shown in sequence segment 4319-1 of SEQ ID NO:5, exons regions shown in sequence segments 15463-15362, 12033-11950, 10215-10114, 9211-9107, 8466-8401, 5576-5475, 4352-4320 of SEQ ID NO:5 and intron regions shown in sequence segments 15361-12034, 11949-10216, 10113-9212, 9106-8467, 8400-5577, 5474-4353, of SEQ ID NO:5; (b) a fragment of (a), comprising at least nucleotides 20485-33460 of SEQ ID NO:6 which encodes a polypeptide having human glucokinase activity and (c) a nucleic acid molecule which is a complement of the nucleic acid molecules specified in (a)-(b) in a sample, comprising contacting the sample with a polynucleotide probe comprising at least 20 contiguous nucleotides that hybridizes to said nucleic acid molecule under stringent conditions and determining whether the polynucleotide probe binds to said nucleic acid molecule in the sample.
Description:
PRIORITY CLAIM
[0001] This application is a divisional of application Ser. No. 12/533,105, filed Jul. 31, 2009, which is a divisional of Ser. No. 10/642,946, filed Aug. 18, 2003, now U.S. Pat. No. 7,588,915, issued Sep. 15, 2009, which is a continuation of application Ser. No. 09/957,956, filed Sep. 21, 2001, now abandoned, the contents of which all are incorporated herein by reference. application Ser. No. 09/957,956 is a non-provisional application of and claims priority under 35 U.S.C. §119(e) to provisional application Ser. No. 60/234,422, filed Sep. 21, 2000, also incorporated herein by reference.
FIELD OF THE INVENTION
[0002] The invention is directed to isolated genomic polynucleotide fragments that encode human SNARE YKT6, human glucokinase, human adipocyte enhancer binding protein 1 (AEBP1) and DNA directed 50 kD regulatory subunit (POLD2), vectors and hosts containing these fragments and fragments hybridizing to noncoding regions as well as antisense oligonucleotides to these fragments. The invention is further directed to methods of using these fragments to obtain SNARE YKT6, human glucokinase, AEBP1 protein and POLD2 and to diagnose, treat, prevent and/or ameliorate a pathological disorder.
BACKGROUND OF THE INVENTION
[0003] Chromosome 7 contains genes encoding, for example, epidermal growth factor receptor, collagen-1-Alpha-1-chain, SNARE YKT6, human glucokinase, human adipocyte enhancer binding protein 1 and DNA polymerase delta small subunit (POLD2). SNARE YKT6, human glucokinase, human adipocyte enhancer binding protein 1 and DNA polymerase delta small subunit (POLD2) are discussed in further detail below.
Snare YKT6
[0004] SNARE YKT6, a substrate for prenylation, is essential for vesicle-associated endoplasmic reticulum-Golgi transport (McNew, J. A. et al. J. Biol. Chem. 272, 17776-17783, 1997). It has been found that depletion of this function stops cell growth and manifests a transport block at the endoplasmic reticulum level.
Human Glucokinase
[0005] Human glucokinase (ATP:D-hexose 6-phosphotransferase) is thought to play a major role in glucose sensing in pancreatic islet beta cells (Tanizawa et al., 1992, Mol. Endocrinol. 6:1070-1081) and in the liver. Glucokinase defects have been observed in patients with noninsulin-dependent diabetes mellitus (NIDDM) patients. Mutations in the human glucokinase gene are thought to play a role in the early onset of NIDDM. The gene has been shown by Southern Blotting to exist as a single copy on chromosome 7. It was further found to contain 10 exons including one exon expressed in islet beta cells and the other expressed in liver.
Human Adipocyte Enhancer Binding Protein 1
[0006] The adipocyte-enhancer binding protein 1 (AEBP1) is a transcriptional repressor having carboxypeptidase B-like activity which binds to a regulatory sequence (adipocyte enhancer 1, AE-1) located in the proximal promoter region of the adipose P2 (aP2) gene, which encodes the adipocyte fatty acid binding protein (Muise et al., 1999, Biochem. J. 343:341-345). B-like carboxypeptidases remove C-terminal arginine and lysine residues and participate in the release of active peptides, such as insulin, alter receptor specificity for polypeptides and terminate polypeptide activity (Skidgel, 1988, Trends Pharmacol. Sci. 9:299-304). For example, they are thought to be involved in the onset of obesity (Naggert et al., 1995, Nat. Genet. 10:1335-1342). It has been reported that obese and hyperglycemic mice homozygous for the fat mutation contain a mutation in the CP-E gene.
[0007] Full length cDNA clones encoding AEBP1 have been isolated from human osteoblast and adipose tissue (Ohno et al., 1996, Biochem. Biophys Res. Commun. 228:411-414). Two forms have been found to exist due to alternative splicing. This gene appears to play a significant role in regulating adipogenesis. In addition to playing a role in obesity, adipogenesis may play a role in ostopenic disorders. It has been postulated that adipogenesis inhibitors may be used to treat osteopenic disorders (Nuttal et al., 2000, Bone 27:177-184).
DNA Polymerase Delta Small Subunit (POLD2)
[0008] DNA polymerase delta core is a heterodimeric enzyme with a catalytic subunit of 125 kD and a second subunit of 50 kD and is an essential enzyme for DNA replication and DNA repair (Zhang et al., 1995, Genomics 29:179-186). cDNAs encoding the small subunit have been cloned and sequenced. The gene for the small subunit has been localized to human chromosome 7 via PCR analysis of a panel of human-hamster hybrid cell lines. However, the genomic DNA has not been isolated and the exact location on chromosome 7 has not been determined.
OBJECTS OF THE INVENTION
[0009] Although cDNAs encoding the above-disclosed proteins have been isolated, their location on chromosome 7 has not been determined. Furthermore, genomic DNA encoding these polypeptides have not been isolated. Noncoding sequences can play a significant role in regulating the expression of polypeptides as well as the processing of RNA encoding these polypeptides.
[0010] There is clearly a need for obtaining genomic polynucleotide sequences encoding these polypeptides. Therefore, it is an object of the invention to isolate such genomic polynucleotide sequences.
SUMMARY OF THE INVENTION
[0011] The invention is directed to an isolated genomic polynucleotide, said polynucleotide obtainable from human chromosome 7 having a nucleotide sequence at least 95% identical to a sequence selected from the group consisting of:
[0012] (a) a polynucleotide encoding a polypeptide selected from the group consisting of human SNARE YKT6 depicted in SEQ ID NO:1, human glucokinase depicted in SEQ ID NO:2, human adipocyte enhancer binding protein 1 (AEBP1) depicted in SEQ ID NO:3 and DNA directed 50 kD regulatory subunit (POLD2) depicted in SEQ ID NO:4;
[0013] (b) a polynucleotide selected from the group consisting of SEQ ID NO:5 which encodes human SNARE YKT6 depicted in SEQ ID NO:1, SEQ ID NO:6 which encodes human glucokinase depicted in SEQ ID NO:2, SEQ ID NO:8 which encodes human adipocyte enhancer binding protein 1 depicted in SEQ ID NO:3 and SEQ ID NO:7 which encodes DNA directed 50 kD regulatory subunit (POLD2) depicted in SEQ ID NO:4;
[0014] (c) a polynucleotide which is a variant of SEQ ID NOS:5, 6, 7, or 8;
[0015] (d) a polynucleotide which is an allelic variant of SEQ ID NOS:5, 6, 7, or 8;
[0016] (e) a polynucleotide which encodes a variant of SEQ ID NOS:1, 2, 3, or 4;
[0017] (f) a polynucleotide which hybridizes to any one of the polynucleotides specified in (a)-(e);
[0018] (g) a polynucleotide that is a reverse complement to the polynucleotides specified in (a)-(f) and
[0019] (h) containing at least 10 transcription factor binding sites selected from the group consisting of AP1FJ-Q2, AP1-C, AP1-Q2, AP1-Q4, AP4-Q5, AP4-Q6, ARNT-01, CEBP-01, CETS1P54-01, CREL-01, DELTAEF1-01, FREAC7-01, GATA1-02, GATA1-03, GATA1-04, GATA1-06, GATA2-02, GATA3-02, GATA-C, GC-01, GFII-01, HFH2-01, HFH3-01, HFH8-01, IK2-01, LMO2COM-01, LMO2COM-02, LYF1-01, MAX-01, NKX25-01, NMYC-01, S8-01, SOX5-01, SP1-Q6, SAEBP1-01, SRV-02, STAT-01, TATA-01, TCF11-01, USF-01, USF-C and USF-Q6
as well as nucleic acid constructs, expression vectors and host cells containing these polynucleotide sequences.
[0020] The polynucleotides of the present invention may be used for the manufacture of a gene therapy for the prevention, treatment or amelioration of a medical condition by adding an amount of a composition comprising said polynucleotide effective to prevent, treat or ameliorate said medical condition.
[0021] The invention is further directed to obtaining these polypeptides by
[0022] (a) culturing host cells comprising these sequences under conditions that provide for the expression of said polypeptide and
[0023] (b) recovering said expressed polypeptide.
[0024] The polypeptides obtained may be used to produce antibodies by
[0025] (a) optionally conjugating said polypeptide to a carrier protein;
[0026] (b) immunizing a host animal with said polypeptide or peptide-carrier protein conjugate of step (b) with an adjuvant and
[0027] (c) obtaining antibody from said immunized host animal.
[0028] The invention is further directed to polynucleotides that hybridize to noncoding regions of said polynucleotide sequences as well as antisense oligonucleotides to these polynucleotides as well as antisense mimetics. The antisense oligonucleotides or mimetics may be used for the manufacture of a medicament for prevention, treatment or amelioration of a medical condition.
[0029] The invention is further directed to kits comprising these polynucleotides and kits comprising these antisense oligonucleotides or mimetics.
[0030] In a specific embodiment, the noncoding regions are transcription regulatory regions. The transcription regulatory regions may be used to produce a heterologous peptide by expressing in a host cell, said transcription regulatory region operably linked to a polynucleotide encoding the heterologous polypeptide and recovering the expressed heterologous polypeptide.
[0031] The polynucleotides of the present invention may be used to diagnose a pathological condition in a subject comprising
[0032] (a) determining the presence or absence of a mutation in the polynucleotides of the present invention and
[0033] (b) diagnosing a pathological condition or a susceptibility to a pathological condition based on the presence or absence of said mutation.
DETAILED DESCRIPTION OF THE INVENTION
[0034] The invention is directed to isolated genomic polynucleotide fragments that encode human SNARE YKT6, human glucokinase, human adipocyte enhancer binding protein 1 and DNA directed 50 kD regulatory subunit (POLD2), which in a specific embodiment are the SNARE YKT6, human glucokinase, human adipocyte enhancer binding protein 1 and DNA directed 50 kD regulatory subunit (POLD2) genes, as well as vectors and hosts containing these fragments and polynucleotide fragments hybridizing to noncoding regions, as well as antisense oligonucleotides to these fragments.
[0035] As defined herein, a "gene" is the segment of DNA involved in producing a polypeptide chain; it includes regions preceding and following the coding region, as well as intervening sequences (introns) between individual coding segments (exons).
[0036] As defined herein "isolated" refers to material removed from its original environment and is thus altered "by the hand of man" from its natural state. An isolated polynucleotide can be part of a vector, a composition of matter or could be contained within a cell as long as the cell is not the original environment of the polynucleotide.
[0037] The polynucleotides of the present invention may be in the form of RNA or in the form of DNA, which DNA includes genomic DNA and synthetic DNA. The DNA may be double-stranded or single-stranded and if single stranded may be the coding strand or non-coding strand.
[0038] The human SNARE YKT6 polypeptide has the amino acid sequence depicted in SEQ ID NO:1 and is encoded by the genomic DNA sequence shown in SEQ ID NO:5. The genomic DNA for SNARE YKT6 gene is 39,000 base pairs in length and contains seven exons (see Table 4 below for location of exons). As will be discussed in further detail below, the SNARE YKT6 gene is situated in genomic clone AC006454 at nucleotides 36,001-75,000.
[0039] The human glucokinase is depicted in SEQ ID NO:2 and is encoded by the genomic DNA sequence shown in SEQ ID NO:6. The human glucokinase genomic DNA is 46,000 base pairs in length and contains ten exons (see Table 3 below for location of exons).
[0040] The human adipocyte enhancer binding protein 1 has the amino acid sequence depicted in SEQ ID NO:3 and is encoded by the genomic DNA sequence shown in SEQ ID NO:8. The adipocyte enhancer binding protein 1 is 16,000 base pairs in length and contains 21 exons (see Table 2 below for location of exons). As will be discussed in further detail below, the human AEBP1 gene is situated in genomic clone AC006454 at nucleotides 137,041-end.
[0041] POLD2 has an amino acid sequence depicted in SEQ ID NO:4 and a genomic DNA sequence depicted in SEQ ID NO:7. The POLD2 gene is 19,000 base pairs in length and contains ten exons (see Table 1 below for location of exons). As will be discussed in further detail below, the POLD2 gene is situated in genomic clone AC006454 at nucleotides 119,001-138,000.
[0042] The polynucleotides of the invention have at least a 95% identity and may have a 96%, 97%, 98% or 99% identity to the polynucleotides depicted in SEQ ID NOS:5, 6, 7 or 8 as well as the polynucleotides in reverse sense orientation, or the polynucleotide sequences encoding the SNARE YKT6, human glucokinase, AEBP1, or POLD2 polypeptides depicted in SEQ ID NOS:1, 2, 3, or 4 respectively.
[0043] A polynucleotide having 95% "identity" to a reference nucleotide sequence of the present invention, is identical to the reference sequence except that the polynucleotide sequence may include on average up to five point mutations per each 100 nucleotides of the reference nucleotide sequence encoding the polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence may be deleted or substituted with another nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference sequence may be inserted into the reference sequence. The query sequence may be an entire sequence, the ORF (open reading frame), or any fragment specified as described herein.
[0044] As a practical matter, whether any particular nucleic acid molecule or polypeptide is at least 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the presence invention can be determined conventionally using known computer programs. A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. (1990) 6:237-245). In a sequence alignment the query and subject sequences are both DNA sequences. An RNA sequence can be compared by converting U's to T's. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=1, Joining Penalty=30, Randomization Group Length=0, Cutoff Score=1, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject nucleotide sequence, whichever is shorter.
[0045] If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, not because of internal deletions, a manual correction must be made to the results. This is because the FASTDB program does not account for 5' and 3' truncations of the subject sequence when calculating percent identity. For subject sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent identity is corrected by calculating the number of bases of the query sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent of the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identify, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This corrected score is what is used for the purposes of the present invention. Only bases outside the 5' and 3' bases of the subject sequence, as displayed by the FASTDB alignment, which are not matched/aligned with the query sequence are calculated for the purposes of manually adjusting the percent identity score.
[0046] For example, a 95 base subject sequence is aligned to a 100 base query sequence to determine percent identity. The deletions occur at the 5' end of the subject sequence and therefore, the FASTDB alignment does not show a matched/alignment of the first 10 bases at 5' end. The 10 unpaired bases represent 5% of the sequence (number of bases at the 5' and 3' ends not matched/total numbers of bases in the query sequence) so 5% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 95 bases were perfectly matched the final percent identity would be 95%. In another example, a 95 base subject sequence is compared with a 100 base query sequence. This time the deletions are internal deletions so that there are no bases on the 5' or 3' of the subject sequence which are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is not manually corrected. Once again, only bases 5' and 3' of the subject sequence which are not matched/aligned with the query sequence are manually corrected for. No other manual corrections are made for purposes of the present invention.
[0047] A polypeptide that has an amino acid sequence at least, for example, 95% "identical" to a query amino acid sequence is identical to the query sequence except that the subject polypeptide sequence may include on average, up to five amino acid alterations per each 100 amino acids of the query amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the subject sequence may be inserted, deleted, (indels) or substituted with another amino acid. These alterations of the reference sequence may occur at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among residues in the referenced sequence or in one or more contiguous groups within the reference sequence.
[0048] A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. (Com. App. Biosci. (1990) 6:237-245). In a sequence alignment, the query and subject sequence are either both nucleotide sequences or both amino acid sequences. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=1, Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, whichever is shorter.
[0049] If the subject sequence is shorter than the query sequence due to N- or C-terminal deletions, not because of internal deletions, a manual correction must be made to the results. This is because the FASTDB program does not account for N- and C-terminal truncations of the subject sequence when calculating global percent identity. For subject sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity is corrected by calculating the number of residues of the query sequence that are N- and C-terminal of the subject sequence, which are not matched/aligned with a corresponding subject residue, as a percent of the total bases of the query sequence. Whether a residue is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This final percent identity score is what is used for the purposes of the present invention. Only residues to the N- and C-termini of the subject sequence, which are not matched/aligned with the query sequence, are considered for the purposes of manually adjusting the percent identity score. That is, only query residue positions outside the farthest N- and C-terminal residues of the subject sequence.
[0050] The invention also encompasses polynucleotides that hybridize to the polynucleotides depicted in SEQ ID NOS: 5, 6, 7 or 8. A polynucleotide "hybridizes" to another polynucleotide, when a single-stranded form of the polynucleotide can anneal to the other polynucleotide under the appropriate conditions of temperature and solution ionic strength (see Sambrook et al., supra). The conditions of temperature and ionic strength determine the "stringency" of the hybridization. For preliminary screening for homologous nucleic acids, low stringency hybridization conditions, corresponding to a temperature of 42° C., can be used, e.g., 5×SSC, 0.1% SDS, 0.25% milk, and no formamide; or 40% formamide, 5×SSC, 0.5% SDS). Moderate stringency hybridization conditions correspond to a higher temperature of 55° C., e.g., 40% formamide, with 5× or 6×SCC. High stringency hybridization conditions correspond to the highest temperature of 65° C., e.g., 50% formamide, 5× or 6×SCC. Hybridization requires that the two nucleic acids contain complementary sequences, although depending on the stringency of the hybridization, mismatches between bases are possible. The appropriate stringency for hybridizing nucleic acids depends on the length of the nucleic acids and the degree of complementation, variables well known in the art. The greater the degree of similarity or homology between two nucleotide sequences, the greater the value of Tm for hybrids of nucleic acids having those sequences. The relative stability (corresponding to higher Tm) of nucleic acid hybridizations decreases in the following order: RNA:RNA, DNA:RNA, DNA:DNA.
Polynucleotide and Polypeptide Variants
[0051] The invention is directed to both polynucleotide and polypeptide variants. A "variant" refers to a polynucleotide or polypeptide differing from the polynucleotide or polypeptide of the present invention, but retaining essential properties thereof. Generally, variants are overall closely similar and in many regions, identical to the polynucleotide or polypeptide of the present invention.
[0052] The variants may contain alterations in the coding regions, non-coding regions, or both. Especially preferred are polynucleotide variants containing alterations which produce silent substitutions, additions, or deletions, but do not alter the properties or activities of the encoded polypeptide. Nucleotide variants produced by silent substitutions due to the degeneracy of the genetic code are preferred. Moreover, variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or added in any combination are also preferred.
[0053] The invention also encompasses allelic variants of said polynucleotides. An allelic variant denotes any of two or more alternative forms of a gene occupying the same chromosomal locus. Allelic variation arises naturally through mutation, and may result in polymorphism within populations. Gene mutations can be silent (no change in the encoded polypeptide) or may encode polypeptides having altered amino acid sequences. An allelic variant of a polypeptide is a polypeptide encoded by an allelic variant of a gene.
[0054] The amino acid sequences of the variant polypeptides may differ from the amino acid sequences depicted in SEQ ID NOS:1, 2, 3 or 4 by an insertion or deletion of one or more amino acid residues and/or the substitution of one or more amino acid residues by different amino acid residues. Preferably, amino acid changes are of a minor nature, that is conservative amino acid substitutions that do not significantly affect the folding and/or activity of the protein; small deletions, typically of one to about 30 amino acids; small amino- or carboxyl-terminal extensions, such as an amino-terminal methionine residue; a small linker peptide of up to about 20-25 residues; or a small extension that facilitates purification by changing net charge or another function, such as a poly-histidine tract, an antigenic epitope or a binding domain. Examples of conservative substitutions are within the group of basic amino acids (arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), polar amino acids (glutamine and asparagine), hydrophobic amino acids (leucine, isoleucine and valine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), and small amino acids (glycine, alanine, serine, threonine and methionine). Amino acid substitutions which do not generally alter the specific activity are known in the art and are described, for example, by H. Neurath and R. L. Hill, 1979, In, The Proteins, Academic Press, New York. The most commonly occurring exchanges are Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile, Leu/Val, as well as these in reverse.
Noncoding Regions
[0055] The invention is further directed to polynucleotide fragments containing or hybridizing to noncoding regions of the SNARE YKT6, AEBP1, human glucokinase and POLD2 genes. These include but are not limited to an intron, a 5' non-coding region, a 3' non-coding region and splice junctions (see Tables 1-4), as well as transcription factor binding sites (see Table 5). The polynucleotide fragments may be a short polynucleotide fragment which is between about 8 nucleotides to about 40 nucleotides in length. Such shorter fragments may be useful for diagnostic purposes. Such short polynucleotide fragments are also preferred with respect to polynucleotides containing or hybridizing to polynucleotides containing splice junctions. Alternatively larger fragments, e.g., of about 50, 150, 500, 600 or about 2000 nucleotides in length may be used.
TABLE-US-00001 TABLE 1 Exon/Intron Regions of Polymerase, DNA directed, 50 kD regulatory subunit (POLD2) Genomic DNA LOCATION (nucleotide no.) EXONS (Amino acid no.) 1. 11546 . . . 11764 1 73 2. 15534 . . . 15656 74 114 3. 15857 . . . 15979 115 155 4. 16351 . . . 16464 156 193 5. 16582 . . . 16782 194 260 6. 17089 . . . 17169 261 287 7. 17327 . . . 17484 288 339 8. 17704 . . . 17829 340 381 9. 18199 . . . 18303 382 416 10. 18653 . . . 18811 417 469 `tga` at 18812-14 Poly A at 18885-90
TABLE-US-00002 TABLE 2 AEBP1 (adipocyte enhancer binding protein 1), vascular smooth muscle-type. Reverse strand coding. LOCATION (nucleotide no.) EXONS (Amino acid no.) 21. 1301 . . . 1966 1158 937 20. 2209 . . . 2304 936 905 19. 2426 . . . 2569 904 857 18. 2651 . . . 3001 856 740 17. 3238 . . . 3417 739 680 16. 3509 . . . 3706 679 614 15. 3930 . . . 4052 613 573 14. 4320 . . . 4406 572 544 13. 4503 . . . 4646 543 496 12. 4750 . . . 4833 495 468 11. 5212 . . . 5352 467 421 10. 5435 . . . 5545 420 384 9. 6219 . . . 6272 383 366 8. 6376 . . . 6453 365 340 7. 6584 . . . 6661 339 314 6. 7476 . . . 7553 313 288 5. 7629 . . . 7753 287 247 4. 7860 . . . 7931 246 223 3. 8050 . . . 8121 222 199 2. 8673 . . . 9014 198 85 1. 10642 . . . 10893 84 1 Stop codon 1298-1300 Poly A-site 1013-18
TABLE-US-00003 TABLE 3 Glucokinase LOCATION (nucleotide no.) EXONS (Amino acid no.) 1. 20485 . . . 20523 1 13 2. 25133 . . . 25297 14 68 3. 26173 . . . 26328 69 120 4. 27524 . . . 27643 121 160 5. 28535 . . . 28630 161 192 6. 28740 . . . 28838 193 225 7. 30765 . . . 30950 226 287 8. 31982 . . . 32134 288 338 9. 32867 . . . 33097 339 415 10. 33314 . . . 33460 416 464 Stop codon 33461-3
TABLE-US-00004 TABLE 4 SNARE YKT6. Reverse strand coding. LOCATION (nucleotide no.) EXONS (Amino acid no.) 7. 4320 . . . 4352 198 188 6. 5475 . . . 5576 187 154 5. 8401 . . . 8466 153 132 4. 9107 . . . 9211 131 97 3. 10114 . . . 10215 96 63 2. 11950 . . . 12033 62 35 1. 15362 . . . 15463 34 1 Stop codon at 4817-19 Poly A-site: 4245-4250
TABLE-US-00005 TABLE 5 TRANSCRIPTION FACTOR BINDING SITES SNARE BINDING SITES YKT6 GLUCOKINASE POLD2 AEBP1 AP1FJ-Q2 11 11 AP1-C 15 15 7 6 AP1-Q2 9 5 AP1-Q4 7 4 AP4-Q5 36 5 43 AP4-Q6 17 23 ARNT-01 7 5 CEBP-01 7 CETS1P54-01 6 CREL-01 7 DELTAEF1-01 64 12 5 50 FREAC7-01 4 GATA1-02 19 GATA1-03 12 6 GATA1-04 25 6 GATA1-06 8 5 GATA2-02 10 GATA3-02 5 GATA-C 11 6 GC-01 4 GFII-01 6 HFH2-01 5 HFH3-01 10 HFH8-01 4 IK2-01 49 29 LMO2COM-01 41 6 27 LMO2COM-02 31 5 7 LYF1-01 10 13 6 MAX-01 4 MYOD-01 7 MYOD-Q6 32 19 7 12 MZF1-01 99 40 15 94 NF1-Q6 5 7 NFAT-Q6 43 8 7 8 NFKAPPAB50-01 4 NKX25-01 13 14 5 NMYC-01 12 8 S8-01 30 4 SOX5-01 21 20 4 4 SP1-Q6 8 SAEBP1-01 4 SRV-02 5 STAT-01 6 TATA-01 8 TCF11-01 47 28 5 19 USF-01 12 8 6 8 USF-C 16 12 12 8 USF-Q6 6
[0056] In a specific embodiment, such noncoding sequences are expression control sequences. These include but are not limited to DNA regulatory sequences, such as promoters, enhancers, repressors, terminators, and the like, that provide for the regulation of expression of a coding sequence in a host cell. In eukaryotic cells, polyadenylation signals are also control sequences.
[0057] In a more specific embodiment of the invention, the expression control sequences may be operatively linked to a polynucleotide encoding a heterologous polypeptide. Such expression control sequences may be about 50-200 nucleotides in length and specifically about 50, 100, 200, 500, 600, 1000 or 2000 nucleotides in length. A transcriptional control sequence is "operatively linked" to a polynucleotide encoding a heterologous polypeptide sequence when the expression control sequence controls and regulates the transcription and translation of that polynucleotide sequence. The term "operatively linked" includes having an appropriate start signal (e.g., ATG) in front of the polynucleotide sequence to be expressed and maintaining the correct reading frame to permit expression of the DNA sequence under the control of the expression control sequence and production of the desired product encoded by the polynucleotide sequence. If a gene that one desires to insert into a recombinant DNA molecule does not contain an appropriate start signal, such a start signal can be inserted upstream (5') of and in reading frame with the gene.
Expression of Polypeptides
Isolated Polynucleotide Sequences
[0058] The human chromosome 7 genomic clone of accession number AC006454 has been discovered to contain the SNARE YKT6 gene, the human glucokinase gene, the AEBP1 gene, and the POLD2 gene by Genscan analysis (Burge et al., 1997, J. Mol. Biol. 268:78-94), BLAST2 and TBLASTN analysis (Altschul et al., 1997, Nucl. Acids Res. 25:3389-3402), in which the sequence of AC006454 was compared to the SNARE YKT6 cDNA sequence, accession number NM--006555 (McNew et al., 1997, J. Biol. Chem. 272:17776-177783), the human glucokinase cDNA sequence (Tanizawa et al., 1992, Mol. Endocrinol. 6:1070-1081), accession number NM--000162 (major form) and M69051 (minor form), AEBP1 cDNA sequence, accession number NM--001129 (accession number D86479 for the osteoblast type) (Layne et al., 1998, J. Biol. Chem. 273:15654-15660) and the POLD2 cDNA sequence, accession number NM--006230 (Zhang et al., 1995, Genomics 29:179-186).
[0059] The cloning of the nucleic acid sequences of the present invention from such genomic DNA can be effected, e.g., by using the well known polymerase chain reaction (PCR) or antibody screening of expression libraries to detect cloned DNA fragments with shared structural features. See, e.g., Innis et al., 1990, PCR: A Guide to Methods and Application, Academic Press, New York. Other nucleic acid amplification procedures such as ligase chain reaction (LCR), ligated activated transcription (LAT) and nucleic acid sequence-based amplification (NASBA) or long chain PCR may be used. In a specific embodiment, 5' or 3' non-coding portions of each gene may be identified by methods including but are not limited to, filter probing, clone enrichment using specific probes and protocols similar or identical to 5' and 3' "RACE" protocols which are well known in the art. For instance, a method similar to 5' RACE is available for generating the missing 5' end of a desired full-length transcript. (Fromont-Racine et al., 1993, Nucl. Acids Res. 21:1683-1684).
[0060] Once the DNA fragments are generated, identification of the specific DNA fragment containing the desired SNARE YKT6 gene, the human glucokinase gene, the AEBP1 gene, or POLD2 gene may be accomplished in a number of ways. For example, if an amount of a portion of a SNARE YKT6 gene, the human glucokinasegene, the POLD2 gene or AEBP1 gene, or its specific RNA, or a fragment thereof, is available and can be purified and labeled, the generated DNA fragments may be screened by nucleic acid hybridization to the labeled probe (Benton and Davis, 1977, Science 196:180; Grunstein and Hogness, 1975, Proc. Natl. Acad. Sci. U.S.A. 72:3961). The present invention provides such nucleic acid probes, which can be conveniently prepared from the specific sequences disclosed herein, e.g., a hybridizable probe having a nucleotide sequence corresponding to at least a 10, and preferably a 15, nucleotide fragment of the sequences depicted in SEQ ID NOS:5, 6, 7 or 8. Preferably, a fragment is selected that is highly unique to the encoded polypeptides. Those DNA fragments with substantial homology to the probe will hybridize. As noted above, the greater the degree of homology, the more stringent hybridization conditions can be used. In one embodiment, low stringency hybridization conditions are used to identify a homologous SNARE YKT6, the human glucokinase, the AEBP1, or POLD2 polynucleotide. However, in a preferred aspect, and as demonstrated experimentally herein, a nucleic acid encoding a polypeptide of the invention will hybridize to a nucleic acid derived from the polynucleotide sequence depicted in SEQ ID NOS:5, 6, 7 or 8 or a hybridizable fragment thereof, under moderately stringent conditions; more preferably, it will hybridize under high stringency conditions.
[0061] Alternatively, the presence of the gene may be detected by assays based on the physical, chemical, or immunological properties of its expressed product. For example, cDNA clones, or DNA clones which hybrid-select the proper mRNAs, can be selected which produce a protein that, e.g., has similar or identical electrophoretic migration, isoelectric focusing behavior, proteolytic digestion maps, or antigenic properties as known for the SNARE YKT6, the human glucokinase, the AEBP1, or POLD2 polynucleotide.
[0062] A gene encoding SNARE YKT6, the human glucokinase, the AEBP1, or POLD2 polypeptide can also be identified by mRNA selection, i.e., by nucleic acid hybridization followed by in vitro translation. In this procedure, fragments are used to isolate complementary mRNAs by hybridization. Immunoprecipitation analysis or functional assays of the in vitro translation products of the products of the isolated mRNAs identifies the mRNA and, therefore, the complementary DNA fragments, that contain the desired sequences.
Nucleic Acid Constructs
[0063] The present invention also relates to nucleic acid constructs comprising a polynucleotide sequence containing the exon/intron segments of the SNARE YKT6 gene (nucleotides 4320-15463 of SEQ ID NO:5), human glucokinase gene (nucleotides 20485-33460 of SEQ ID NO:6), AEBP1 gene (nucleotides 1301-13893 of SEQ ID NO:8) or POLD2 gene (nucleotides 11546-18811 of SEQ ID NO:7) operably linked to one or more control sequences which direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences. Expression will be understood to include any step involved in the production of the polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
[0064] The invention is further directed to a nucleic acid construct comprising expression control sequences derived from SEQ ID NOS: 5, 6, 7 or 8 and a heterologous polynucleotide sequence.
[0065] "Nucleic acid construct" is defined herein as a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or which has been modified to contain segments of nucleic acid which are combined and juxtaposed in a manner which would not otherwise exist in nature. The term nucleic acid construct is synonymous with the term expression cassette when the nucleic acid construct contains all the control sequences required for expression of a coding sequence of the present invention. The term "coding sequence" is defined herein as a portion of a nucleic acid sequence which directly specifies the amino acid sequence of its protein product. The boundaries of the coding sequence are generally determined by a ribosome binding site (prokaryotes) or by the ATG start codon (eukaryotes) located just upstream of the open reading frame at the 5' end of the mRNA and a transcription terminator sequence located just downstream of the open reading frame at the 3' end of the mRNA. A coding sequence can include, but is not limited to, DNA, cDNA, and recombinant nucleic acid sequences.
[0066] The isolated polynucleotide of the present invention may be manipulated in a variety of ways to provide for expression of the polypeptide. Manipulation of the nucleic acid sequence prior to its insertion into a vector may be desirable or necessary depending on the expression vector. The techniques for modifying nucleic acid sequences utilizing recombinant DNA methods are well known in the art.
[0067] The control sequence may be an appropriate promoter sequence, a nucleic acid sequence which is recognized by a host cell for expression of the nucleic acid sequence. The promoter sequence contains transcriptional control sequences which regulate the expression of the polynucleotide. The promoter may be any nucleic acid sequence which shows transcriptional activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
[0068] Examples of suitable promoters for directing the transcription of the nucleic acid constructs of the present invention, especially in a bacterial host cell, are the promoters obtained from the E. coli lac operon, the prokaryotic beta-lactamase gene (Villa-Komaroff et al., 1978, Proc. Natl. Acad. Sci. USA 75: 3727-3731), as well as the tac promoter (DeBoer et al., 1983, Proc. Natl. Acad. of Sciences USA 80: 21-25). Further promoters are described in "Useful proteins from recombinant bacteria" in Scientific American, 1980, 242: 74-94; and in Sambrook et al., 1989, supra.
[0069] Examples of suitable promoters for directing the transcription of the nucleic acid constructs of the present invention in a filamentous fungal host cell are promoters obtained from the genes encoding Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Aspergillus niger or Aspergillus awamori glucoamylase (glaA), Rhizomucor miehei lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, Aspergillus nidulans acetamidase, Fusarium oxysporum trypsin-like protease (WO 96/00787), NA2-tpi (a hybrid of the promoters from the genes encoding Aspergillus niger neutral alpha-amylase and Aspergillus oryzae triose phosphate isomerase), and mutant, truncated, and hybrid promoters thereof.
[0070] In a yeast host, useful promoters are obtained from the Saccharomyces cerevisiae enolase (ENO-1) gene, the Saccharomyces cerevisiae galactokinase gene (GAL1), the Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase genes (ADH2/GAP), and the Saccharomyces cerevisiae 3-phosphoglycerate kinase gene. Other useful promoters for yeast host cells are described by Romanos et al., 1992, Yeast 8: 423-488.
[0071] Eukaryotic promoters may be obtained from the genomes of viruses such as polyoma virus, fowlpox virus, adenovirus, bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and SV40. Alternatively, heterologous mammalian promoters, such as the actin promoter or immunoglobulin promoter may be used.
[0072] The constructs of the invention may also include enhancers. Enhancers are cis-acting elements of DNA, usually from about 10 to about 300 bp that act on a promoter to increase its transcription. Enhancers from globin, elastase, albumin, alpha-fetoprotein, and insulin enhancers may be used. However, an enhancer from a virus may be used; examples include SV40 on the late side of the replication origin, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin and adenovirus enhancers.
[0073] The control sequence may also be a suitable transcription terminator sequence, a sequence recognized by a host cell to terminate transcription. The terminator sequence is operably linked to the 3' terminus of the nucleic acid sequence encoding the polypeptide. Any terminator which is functional in the host cell of choice may be used in the present invention.
[0074] The control sequence may also be a suitable leader sequence, a nontranslated region of an mRNA which is important for translation by the host cell. The leader sequence is operably linked to the 5' terminus of the nucleic acid sequence encoding the polypeptide. Any leader sequence that is functional in the host cell of choice may be used in the present invention.
[0075] The control sequence may also be a polyadenylation sequence, a sequence which is operably linked to the 3' terminus of the nucleic acid sequence and which, when transcribed, is recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation sequence which is functional in the host cell of choice may be used in the present invention.
[0076] The control sequence may also be a signal peptide coding region, which codes for an amino acid sequence linked to the amino terminus of the polypeptide which can direct the encoded polypeptide into the cell's secretory pathway. The 5' end of the coding sequence of the nucleic acid sequence may inherently contain a signal peptide coding region naturally linked in translation reading frame with the segment of the coding region which encodes the secreted polypeptide. Alternatively, the 5' end of the coding sequence may contain a signal peptide coding region which is foreign to the coding sequence. The foreign signal peptide coding region may be required where the coding sequence does not normally contain a signal peptide coding region. Alternatively, the foreign signal peptide coding region may simply replace the natural signal peptide coding region in order to obtain enhanced secretion of the polypeptide. However, any signal peptide coding region which directs the expressed polypeptide into the secretory pathway of a host cell of choice may be used in the present invention.
[0077] The control sequence may also be a propeptide coding region, which codes for an amino acid sequence positioned at the amino terminus of a polypeptide. The resultant polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases). A propolypeptide is generally inactive and can be converted to a mature active polypeptide by catalytic or autocatalytic cleavage of the propeptide from the propolypeptide. The propeptide coding region may be obtained from the Bacillus subtilis alkaline protease gene (aprE), the Bacillus subtilis neutral protease gene (nprT), the Saccharomyces cerevisiae alpha-factor gene, the Rhizomucor miehei aspartic proteinase gene, or the Myceliophthora thermophila laccase gene (WO 95/33836).
[0078] Where both signal peptide and propeptide regions are present at the amino terminus of a polypeptide, the propeptide region is positioned next to the amino terminus of a polypeptide and the signal peptide region is positioned next to the amino terminus of the propeptide region.
[0079] It may also be desirable to add regulatory sequences which allow the regulation of the expression of the polypeptide relative to the growth of the host cell. Examples of regulatory systems are those which cause the expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. Regulatory systems in prokaryotic systems would include the lac, tac, and trp operator systems. In yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the TAKA alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and the Aspergillus oryzae glucoamylase promoter may be used as regulatory sequences. Other examples of regulatory sequences are those which allow for gene amplification. In eukaryotic systems, these include the dihydrofolate reductase gene which is amplified in the presence of methotrexate, and the metallothionein genes which are amplified with heavy metals. In these cases, the nucleic acid sequence encoding the polypeptide would be operably linked with the regulatory sequence.
Expression Vectors
[0080] The present invention also relates to recombinant expression vectors comprising a nucleic acid sequence of the present invention, a promoter, and transcriptional and translational stop signals. The various nucleic acid and control sequences described above may be joined together to produce a recombinant expression vector which may include one or more convenient restriction sites to allow for insertion or substitution of the nucleic acid sequence encoding the polypeptide at such sites. Alternatively, the polynucleotide of the present invention may be expressed by inserting the nucleic acid sequence or a nucleic acid construct comprising the sequence into an appropriate vector for expression. In creating the expression vector, the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression.
[0081] The recombinant expression vector may be any vector (e.g., a plasmid or virus) which can be conveniently subjected to recombinant DNA procedures and can bring about the expression of the nucleic acid sequence. The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. The vectors may be linear or closed circular plasmids.
[0082] The vector may be an autonomously replicating vector, i.e., a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome. The vector may contain any means for assuring self-replication. Alternatively, the vector may be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. Furthermore, a single vector or plasmid or two or more vectors or plasmids which together contain the total DNA to be introduced into the genome of the host cell, or a transposon may be used.
[0083] The vectors of the present invention preferably contain one or more selectable markers which permit easy selection of transformed cells. A selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like. Examples of bacterial selectable markers are the dal genes from Bacillus subtilis or Bacillus licheniformis, or markers which confer antibiotic resistance such as ampicillin, kanamycin, chloramphenicol or tetracycline resistance. Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3. An example of suitable selectable markers for mammalian cells are those that enable the identification of cells competent to take of the nucleic acids of the present invention, such as DHFR or thymidine kinase. An appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, prepared and propagated as described by Urlaub et al., Proc. Natl. Acad. Sci. USA, 77:4216 (1980).
[0084] The vectors of the present invention preferably contain an element(s) that permits stable integration of the vector into the host cell genome or autonomous replication of the vector in the cell independent of the genome of the cell.
[0085] For integration into the host cell genome, the vector may rely on the polynucleotide sequence encoding the polypeptide or any other element of the vector for stable integration of the vector into the genome by homologous or nonhomologous recombination. Alternatively, the vector may contain additional nucleic acid sequences for directing integration by homologous recombination into the genome of the host cell. The additional polynucleotide sequences enable the vector to be integrated into the host cell genome at a precise location(s) in the chromosome(s). To increase the likelihood of integration at a precise location, the integrational elements should preferably contain a sufficient number of nucleic acids, such as 100 to 1,500 base pairs, preferably 400 to 1,500 base pairs, and most preferably 800 to 1,500 base pairs, which are highly homologous with the corresponding target sequence to enhance the probability of homologous recombination. The integrational elements may be any sequence that is homologous with the target sequence in the genome of the host cell. Furthermore, the integrational elements may be non-encoding or encoding nucleic acid sequences. On the other hand, the vector may be integrated into the genome of the host cell by non-homologous recombination.
[0086] For autonomous replication, the vector may further comprise an origin of replication enabling the vector to replicate autonomously in the host cell in question. Examples of bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, pACYC177, and pACYC184 permitting replication in E. coli, and pUB110, pE194, pTA1060, and pAM§1 permitting replication in Bacillus. Examples of origins of replication for use in a yeast host cell are the 2 micron origin of replication, ARS1, ARS4, the combination of ARS1 and CEN3, and the combination of ARS4 and CEN6. The origin of replication may be one having a mutation which makes its functioning temperature-sensitive in the host cell (see, e.g., Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75: 1433).
[0087] More than one copy of a polynucleotide sequence of the present invention may be inserted into the host cell to increase production of the gene product. An increase in the copy number of the polynucleotide sequence can be obtained by integrating at least one additional copy of the sequence into the host cell genome or by including an amplifiable selectable marker gene with the nucleic acid sequence where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the nucleic acid sequence, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.
[0088] The procedures used to ligate the elements described above to construct the recombinant expression vectors of the present invention are well known to one skilled in the art (see, e.g., Sambrook et al., 1989, supra).
Host Cells
[0089] The present invention also relates to recombinant host cells, comprising a nucleic acid sequence of the invention, which are advantageously used in the recombinant production of the polypeptides. A vector comprising a nucleic acid sequence of the present invention is introduced into a host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier. The term "host cell" encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication. The choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source.
[0090] The host cell may be a unicellular microorganism, e.g., a prokaryote, or a non-unicellular microorganism, e.g., a eukaryote. Useful unicellular cells are bacterial cells such as gram positive bacteria including, but not limited to, a Bacillus cell, or a Streptomyces cell, e.g., Streptomyces lividans or Streptomyces murinus, or gram negative bacteria such as E. coli and Pseudomonas sp.
[0091] The introduction of a vector into a bacterial host cell may, for instance, be effected by protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 168: 111-115), using competent cells (see, e.g., Young and Spizizin, 1961, Journal of Bacteriology 81: 823-829, or Dubnau and Davidoff-Abelson, 1971, Journal of Molecular Biology 56: 209-221), electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742-751), or conjugation (see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology 169: 5771-5278).
[0092] The host cell may be a eukaryote, such as a mammalian cell (e.g., human cell), an insect cell, a plant cell or a fungal cell. Mammalian host cells that could be used include but are not limited to human Hela, embryonic kidney cells (293), lung cells, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, Cos 7 and CV1, quail QC1-3 cells, mouse L cells and Chinese Hamster ovary (CHO) cells. These cells may be transfected with a vector containing a transcriptional regulatory sequence, a protein coding sequence and transcriptional termination sequences. Alternatively, the polypeptide can be expressed in stable cell lines containing the polynucleotide integrated into a chromosome. The co-transfection with a selectable marker such as dhfr, gpt, neomycin, hygromycin allows the identification and isolation of the transfected cells.
[0093] The host cell may be a fungal cell. "Fungi" as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (as defined by Hawksworth et al., In, Ainsworth and Bisby's Dictionary of The Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK) as well as the Oomycota (as cited in Hawksworth et al., 1995, supra, page 171) and all mitosporic fungi (Hawksworth et al., 1995, supra). The fungal host cell may also be a yeast cell. eastO as used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). Since the classification of yeast may change in the future, for the purposes of this invention, yeast shall be defined as described in Biology and Activities of Yeast (Skinner, F. A., Passmore, S. M., and Davenport, R. R., eds, Soc. App. Bacteriol. Symposium Series No. 9, 1980). The fungal host cell may also be a filamentous fungal cell. "Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra). The filamentous fungi are characterized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
[0094] Fungal cells may be transformed by a process involving protoplast formation, transformation of the protoplasts, and regeneration of the cell wall in a manner known per se. Suitable procedures for transformation of Aspergillus host cells are described in EP 238 023 and Yelton et al., 1984, Proceedings of the National Academy of Sciences USA 81: 1470-1474. Suitable methods for transforming Fusarium species are described by Malardier et al., 1989, Gene 78: 147-156 and WO 96/00787. Yeast may be transformed using the procedures described by Becker and Guarente, In Abelson, J. N. and Simon, M. I., editors, Guide to Yeast Genetics and Molecular Biology, Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; Ito et al., 1983, Journal of Bacteriology 153: 163; and Hinnen et al., 1978, Proc. e Natl Acad. f Sci.s USA 75: 1920.
Methods of Production
[0095] The present invention also relates to methods for producing a polypeptide of the present invention comprising (a) cultivating a host cell under conditions conducive for production of the polypeptide; and (b) recovering the polypeptide.
[0096] In the production methods of the present invention, the cells are cultivated in a nutrient medium suitable for production of the polypeptide using methods known in the art. For example, the cell may be cultivated by shake flask cultivation, small-scale or large-scale fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.
[0097] The polypeptides may be detected using methods known in the art that are specific for the polypeptides. These detection methods may include use of specific antibodies, formation of an enzyme product, or disappearance of an enzyme substrate. In a specific embodiment, an enzyme assay may be used to determine the activity of the polypeptide. For example, AEBP1 activity can be determined by measuring carboxypeptidase activity as described by Muise and Ro, 1999, Biochem. J. 343:341-345. Here, the conversion of hippuryl-L-arginine, hippuryl-L-lysine or hippuryl-L-phenylalanine to hippuric acid may be monitored spectrophotometrically. POLD2 activity may be detected by assaying for DNA polymerase activity (see, for example, Ng et al., 1991, J. Biol. Chem. 266:11699-11704).
[0098] The resulting polypeptide may be recovered by methods known in the art. For example, the polypeptide may be recovered from the nutrient medium by conventional procedures including, but not limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation.
[0099] The polypeptides of the present invention may be purified by a variety of procedures known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., preparative isoelectric focusing, differential solubility (e.g., ammonium sulfate precipitation), SDS-PAGE, or extraction (see, e.g., Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH Publishers, New York, 1989).
Antibodies
[0100] According to the invention, the SNARE YKT6, human glucokinase, AEBP1 or POLD2 polypeptides produced according to the method of the present invention may be used as an immunogen to generate any of these polypeptides. Such antibodies include but are not limited to polyclonal, monoclonal, chimeric, single chain, Fab fragments, and an Fab expression library.
[0101] Various procedures known in the art may be used for the production of antibodies. For the production of antibody, various host animals can be immunized by injection with the polypeptide thereof, including but not limited to rabbits, mice, rats, sheep, goats, etc. In one embodiment, the polypeptide or fragment thereof can optionally be conjugated to an immunogenic carrier, e.g., bovine serum albumin (BSA) or keyhole limpet hemocyanin (KLH). Various adjuvants may be used to increase the immunological response, depending on the host species, including but not limited to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum.
[0102] For preparation of monoclonal antibodies directed toward the SNARE YKT6, human glucokinase, AEBP1 or POLD2 polypeptide, any technique that provides for the production of antibody molecules by continuous cell lines in culture may be used. These include but are not limited to the hybridoma technique originally developed by Kohler and Milstein (1975, Nature 256:495-497), as well as the trioma technique, the human B-cell hybridoma technique (Kozbor et al., 1983, Immunology Today 4:72), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al., 1985, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). In an additional embodiment of the invention, monoclonal antibodies can be produced in germ-free animals utilizing recent technology (PCT/US90/02545). According to the invention, human antibodies may be used and can be obtained by using human hybridomas (Cote et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030) or by transforming human B cells with EBV virus in vitro (Cole et al., 1985, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, pp. 77-96). In fact, according to the invention, techniques developed for the production of "chimeric antibodies" (Morrison et al., 1984, J. Bacteriol. 159-870; Neuberger et al., 1984, Nature 312:604-608; Takeda et al., 1985, Nature 314:452-454) by splicing the genes from a mouse antibody molecule specific for the SNARE YKT6, human glucokinase, AEBP1 or POLD2 polypeptide together with genes from a human antibody molecule of appropriate biological activity can be used; such antibodies are within the scope of this invention.
[0103] According to the invention, techniques described for the production of single chain antibodies (U.S. Pat. No. 4,946,778) can be adapted to produce polypeptide-specific single chain antibodies. An additional embodiment of the invention utilizes the techniques described for the construction of Fab expression libraries (Huse et al., 1989, Science 246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity for the SNARE YKT6, AEBP1, human glucokinase or POLD2 polypeptides.
[0104] Antibody fragments which contain the idiotype of the antibody molecule can be generated by known techniques. For example, such fragments include but are not limited to: the F(ab')2 fragment which can be produced by pepsin digestion of the antibody molecule; the Fab' fragments which can be generated by reducing the disulfide bridges of the F(ab')2, fragment, and the Fab fragments which can be generated by treating the antibody molecule with papain and a reducing agent.
[0105] In the production of antibodies, screening for the desired antibody can be accomplished by techniques known in the art, e.g., radioimmunoassay, ELISA (enzyme-linked immunosorbent assay), "sandwich" immunoassays, immunoradiometric assays, gel diffusion precipitin reactions, immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope labels, for example), western blots, precipitation reactions, agglutination assays (e.g., gel agglutination assays, hemagglutination assays), complement fixation assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, etc. In one embodiment, antibody binding is detected by detecting a label on the primary antibody. In another embodiment, the primary antibody is detected by detecting binding of a secondary antibody or reagent to the primary antibody. In a further embodiment, the secondary antibody is labeled. Many means are known in the art for detecting binding in an immunoassay and are within the scope of the present invention. For example, to select antibodies which recognize a specific epitope of a particular polypeptide, one may assay generated hybridomas for a product which binds to a particular polypeptide fragment containing such epitope. For selection of an antibody specific to a particular polypeptide from a particular species of animal, one can select on the basis of positive binding with the polypeptide expressed by or isolated from cells of that species of animal.
[0106] Immortal, antibody-producing cell lines can also be created by techniques other than fusion, such as direct transformation of B lymphocytes with oncogenic DNA, or transfection with Epstein-Barr virus. See, e.g., M. Schreier et al., "Hybridoma Techniques" (1980); Hammerling et al., "Monoclonal Antibodies And T-cell Hybridomas" (1981); Kennett et al., "Monoclonal Antibodies" (1980); see also U.S. Pat. Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,451,570; 4,466,917; 4,472,500; 4,491,632; 4,493,890.
Uses of Polynucleotides
[0107] Diagnostics
[0108] Polynucleotides containing noncoding regions of SEQ ID NOS:5, 6, 7 or 8 may be used as probes for detecting mutations from samples from a patient. Genomic DNA may be isolated from the patient. A mutation(s) may be detected by Southern blot analysis, specifically by hybridizing restriction digested genomic DNA to various probes and subjecting to agarose electrophoresis.
[0109] Polynucleotides containing noncoding regions may be used as PCR primers and may be used to amplify the genomic DNA isolated from the patients. Additionally, primers may be obtained by routine or long range PCR, that can yield products containing more than one exon and intervening intron. The sequence of the amplified genomic DNA from the patient may be determined using methods known in the art. Such probes may be between 10-100 nucleotides in length and may preferably be between 20-50 nucleotides in length.
[0110] Thus the invention is thus directed to kits comprising these polynucleotide probes. In a specific embodiment, these probes are labeled with a detectable substance.
Antisense Oligonucleotides and Mimetics
[0111] The invention is further directed to antisense oligonucleotides and mimetics to these polynucleotide sequences. Antisense technology can be used to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods are based on binding of a polynucleotide to DNA or RNA. A DNA oligonucleotide is designed to be complementary to a region of the gene involved in transcription or RNA processing (triple helix (see Lee et al., Nucl. Acids Res., 6:3073 (1979); Cooney et al, Science, 241:456 (1988); and Dervan et al., Science, 251: 1360 (1991)), thereby preventing transcription and the production of said polypeptides.
[0112] The antisense oligonucleotides or mimetics of the present invention may be used to decrease levels of a polypeptide. For example, SNARE YKT6 has been found to be essential for vesicle-associated endoplasmic reticulum-Golgi transport and cell growth. Therefore, the SNARE YKT6 antisense oligonucleotides of the present invention could be used to inhibit cell growth and in particular, to treat or prevent tumor growth. POLD2 is necessary for DNA replication. POLD2 antisense sequences could also be used to inhibit cell growth. Glucokinase and AEBP1 antisense sequences may be used to treat hyperglycemia.
[0113] The antisense oligonucleotides of the present invention may be formulated into pharmaceutical compositions. These compositions may be administered in a number of ways depending upon whether local or systemic treatment is desired and upon the area to be treated. Administration may be topical (including ophthalmic and to mucous membranes including vaginal and rectal delivery), pulmonary, e.g., by inhalation or insufflation of powders or aerosols, including by nebulizer; intratracheal, intranasal, epidermal and transdermal), oral or parenteral. Parenteral administration includes intravenous, intraarterial, subcutaneous, intraperitoneal or intramuscular injection or infusion; or intracranial, e.g., intrathecal or intraventricular, administration.
[0114] Pharmaceutical compositions and formulations for topical administration may include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable.
[0115] Compositions and formulations for oral administration include powders or granules, suspensions or solutions in water or non-aqueous media, capsules, sachets or tablets. Thickeners, flavoring agents, diluents, emulsifiers, dispersing aids or binders may be desirable.
[0116] Compositions and formulations for parenteral, intrathecal or intraventricular administration may include sterile aqueous solutions which may also contain buffers, diluents and other suitable additives such as, but not limited to, penetration enhancers, carrier compounds and other pharmaceutically acceptable carriers or excipients.
[0117] Pharmaceutical compositions of the present invention include, but are not limited to, solutions, emulsions, and liposome-containing formulations. These compositions may be generated from a variety of components that include, but are not limited to, preformed liquids, self-emulsifying solids and self-emulsifying semisolids.
[0118] The pharmaceutical formulations of the present invention, which may conveniently be presented in unit dosage form, may be prepared according to conventional techniques well known in the pharmaceutical industry. Such techniques include the step of bringing into association the active ingredients with the pharmaceutical carrier(s) or excipient(s). In general, the formulations are prepared by uniformly and intimately bringing into association the active ingredients with liquid carriers or finely divided solid carriers or both, and then, if necessary, shaping the product.
[0119] The compositions of the present invention may be formulated into any of many possible dosage forms such as, but not limited to, tablets, capsules, liquid syrups, soft gels, suppositories, and enemas. The compositions of the present invention may also be formulated as suspensions in aqueous, non-aqueous or mixed media. Aqueous suspensions may further contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers.
[0120] In one embodiment of the present invention, the pharmaceutical compositions may be formulated and used as foams. Pharmaceutical foams include formulations such as, but not limited to, emulsions, microemulsions, creams, jellies and liposomes. While basically similar in nature these formulations vary in the components and the consistency of the final product. The preparation of such compositions and formulations is generally known to those skilled in the pharmaceutical and formulation arts and may be applied to the formulation of the compositions of the present invention.
[0121] The formulation of therapeutic compositions and their subsequent administration is believed to be within the skill of those in the art. Dosing is dependent on severity and responsiveness of the disease state to be treated, with the course of treatment lasting from several days to several months, or until a cure is effected or a diminution of the disease state is achieved. Optimal dosing schedules can be calculated from measurements of drug accumulation in the body of the patient. Persons of ordinary skill can easily determine optimum dosages, dosing methodologies and repetition rates. Optimum dosages may vary depending on the relative potency of individual oligonucleotides, and can generally be estimated based on EC50 as found to be effective in in vitro and in vivo animal models.
[0122] In general, dosage is from 0.01 ug to 10 g per kg of body weight, and may be given once or more daily, weekly, monthly or yearly, or even once every 2 to 20 years. Persons of ordinary skill in the art can easily estimate repetition rates for dosing based on measured residence times and concentrations of the drug in bodily fluids or tissues. Following successful treatment, it may be desirable to have the patient undergo maintenance therapy to prevent the recurrence of the disease state, wherein the oligonucleotide is administered in maintenance doses, ranging from 0.01 ug to 10 g per kg of body weight, once or more daily, to once every 20 years.
Gene Therapy
[0123] As noted above, SNARE YKT6 is necessary for cell growth, POLD2 is involved in DNA replication and repair, AEBP1 is involved in repressing adipogenesis and glucokinase is involved in glucose sensing in pancreatic islet beta cells and liver. Therefore, the SNARE YKT6 gene may be used to modulate or prevent cell apoptosis and treat such disorders as virus-induced lymphocyte depletion (AIDS); cell death in neurodegenerative disorders characterized by the gradual loss of specific sets of neurons (e.g., Alzheimer's Disease, Parkinson's disease, ALS, retinitis pigmentosa, spinal muscular atrophy and various forms of cerebellar degeneration), cell death in blood cell disorders resulting from deprivation of growth factors (anemia associated with chronic disease, aplastic anemia, chronic neutropenia and myelodysplastic syndromes) and disorders arising out of an acute loss of blood flow (e.g., myocardial infarctions and stroke). The glucokinase gene may be used to treat diabetes mellitus. The AEBP1 gene may be used to modulate or inhibit adipogenesis and treat obesity, diabetes mellitus and/or osteopenic disorders. POLD2 may be used to treat defects in DNA repair such as xeroderma pigmentosum, progeria and ataxia telangiectasia.
[0124] As described herein, the polynucleotide of the present invention may be introduced into a patient's cells for therapeutic uses. As will be discussed in further detail below, cells can be transfected using any appropriate means, including viral vectors, as shown by the example, chemical transfectants, or physico-mechanical methods such as electroporation and direct diffusion of DNA. See, for example, Wolff, Jon A, et al., "Direct gene transfer into mouse muscle in vivo," Science, 247, 1465-1468, 1990; and Wolff, Jon A, "Human dystrophin expression in mdx mice after intramuscular injection of DNA constructs," Nature, 352, 815-818, 1991. As used herein, vectors are agents that transport the gene into the cell without degradation and include a promoter yielding expression of the gene in the cells into which it is delivered. As will be discussed in further detail below, promoters can be general promoters, yielding expression in a variety of mammalian cells, or cell specific, or even nuclear versus cytoplasmic specific. These are known to those skilled in the art and can be constructed using standard molecular biology protocols. Vectors have been divided into two classes:
[0125] a) Biological agents derived from viral, bacterial or other sources.
[0126] b) Chemical physical methods that increase the potential for gene uptake, directly introduce the gene into the nucleus or target the gene to a cell receptor.
Biological Vectors
[0127] Viral vectors have higher transaction (ability to introduce genes) abilities than do most chemical or physical methods to introduce genes into cells. Vectors that may be used in the present invention include viruses, such as adenoviruses, adeno associated virus (AAV), vaccinia, herpesviruses, baculoviruses and retroviruses, bacteriophages, cosmids, plasmids, fungal vectors and other recombination vehicles typically used in the art which have been described for expression in a variety of eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple protein expression. Polynucleotides are inserted into vector genomes using methods well known in the art.
[0128] Retroviral vectors are the vectors most commonly used in clinical trials, since they carry a larger genetic payload than other viral vectors. However, they are not useful in non-proliferating cells. Adenovirus vectors are relatively stable and easy to work with, have high titers, and can be delivered in aerosol formulation. Pox viral vectors are large and have several sites for inserting genes, they are thermostable and can be stored at room temperature.
[0129] Examples of promoters are SP6, T4, T7, SV40 early promoter, cytomegalovirus (CMV) promoter, mouse mammary tumor virus (MMTV) steroid-inducible promoter, Moloney murine leukemia virus (MMLV) promoter, phosphoglycerate kinase (PGK) promoter, and the like. Alternatively, the promoter may be an endogenous adenovirus promoter, for example the E1 a promoter or the Ad2 major late promoter (MLP). Similarly, those of ordinary skill in the art can construct adenoviral vectors utilizing endogenous or heterologous poly A addition signals.
[0130] Plasmids are not integrated into the genome and the vast majority of them are present only from a few weeks to several months, so they are typically very safe. However, they have lower expression levels than retroviruses and since cells have the ability to identify and eventually shut down foreign gene expression, the continuous release of DNA from the polymer to the target cells substantially increases the duration of functional expression while maintaining the benefit of the safety associated with non-viral transfections.
Chemical/Physical Vectors
[0131] Other methods to directly introduce genes into cells or exploit receptors on the surface of cells include the use of liposomes and lipids, ligands for specific cell surface receptors, cell receptors, and calcium phosphate and other chemical mediators, microinjections directly to single cells, electroporation and homologous recombination. Liposomes are commercially available from Gibco BRL, for example, as LIPOFECTIN108 • and LIPOFECTACE.sup.••, which are formed of cationic lipids such as N-[1-(2,3 dioleyloxy)-propyl]-n,n,n-trimethylammonium chloride (DOTMA) and dimethyl dioctadecylammonium bromide (DDAB). Numerous methods are also published for making liposomes, known to those skilled in the art.
[0132] For example, Nucleic acid-Lipid Complexes--Lipid carriers can be associated with naked nucleic acids (e.g., plasmid DNA) to facilitate passage through cellular membranes. Cationic, anionic, or neutral lipids can be used for this purpose. However, cationic lipids are preferred because they have been shown to associate better with DNA which, generally, has a negative charge. Cationic lipids have also been shown to mediate intracellular delivery of plasmid DNA (Felgner and Ringold, Nature 337:387 (1989)). Intravenous injection of cationic lipid-plasmid complexes into mice has been shown to result in expression of the DNA in lung (Brigham et al., Am. J. Med. Sci. 298:278 (1989)). See also, Osaka et al., J. Pharm. Sci. 85(6):612-618 (1996); San et al., Human Gene Therapy 4:781-788 (1993); Senior et al., Biochemica et Biophysica Acta 1070:173-179 (1991); Kabanov and Kabanov, Bioconjugate Chem. 6:7-20 (1995); Remy et al., Bioconjugate Chem. 5:647-654 (1994); Behr, J-P., Bioconjugate Chem 5:382-389 (1994); Behr et al., Proc. Natl. Acad. Sci., USA 86:6982-6986 (1989); and Wyman et al., Biochem. 36:3008-3017 (1997).
[0133] Cationic lipids are known to those of ordinary skill in the art. Representative cationic lipids include those disclosed, for example, in U.S. Pat. No. 5,283,185; and e.g., U.S. Pat. No. 5,767,099. In a preferred embodiment, the cationic lipid is N4-spermine cholesteryl carbamate (GL-67) disclosed in U.S. Pat. No. 5,767,099. Additional preferred lipids include N4-spermidine cholestryl carbamate (GL-53) and 1-(N-4-spermind)-2,3-dilaurylglycerol carbamate (GL-89).
[0134] The vectors of the invention may be targeted to specific cells by linking a targeting molecule to the vector. A targeting molecule is any agent that is specific for a cell or tissue type of interest, including for example, a ligand, antibody, sugar, receptor, or other binding molecule.
[0135] Invention vectors may be delivered to the target cells in a suitable composition, either alone, or complexed, as provided above, comprising the vector and a suitably acceptable carrier. The vector may be delivered to target cells by methods known in the art, for example, intravenous, intramuscular, intranasal, subcutaneous, intubation, lavage, and the like. The vectors may be delivered via in vivo or ex vivo applications. In vivo applications involve the direct administration of an adenoviral vector of the invention formulated into a composition to the cells of an individual. Ex vivo applications involve the transfer of the adenoviral vector directly to harvested autologous cells which are maintained in vitro, followed by readministration of the transduced cells to a recipient.
[0136] In a specific embodiment, the vector is transfected into antigen-presenting cells. Suitable sources of antigen-presenting cells (APCs) include, but are not limited to, whole cells such as dendritic cells or macrophages; purified MHC class 1 molecule complexed to §2-microglobulin and foster antigen-presenting cells. In a specific embodiment, the vectors of the present invention may be introduced into T cells or B cells using methods known in the art (see, for example, Tsokos and Nepom, 2000, J. Clin. Invest. 106:181-183).
[0137] The invention described and claimed herein is not to be limited in scope by the specific embodiments herein disclosed, since these embodiments are intended as illustrations of several aspects of the invention. Any equivalent embodiments are intended to be within the scope of this invention. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims.
[0138] Various references are cited herein, the disclosure of which are incorporated by reference in their entireties.
Sequence CWU
1
1
81198PRTHomo sapiens 1Met Lys Leu Tyr Ser Leu Ser Val Leu Tyr Lys Gly Glu
Ala Lys Val 1 5 10 15
Val Leu Leu Lys Ala Ala Tyr Asp Val Ser Ser Phe Ser Phe Phe Gln
20 25 30 Arg Ser Ser Val
Gln Glu Phe Met Thr Phe Thr Ser Gln Leu Ile Val 35
40 45 Glu Arg Ser Ser Lys Gly Thr Arg Ala
Ser Val Lys Glu Gln Asp Tyr 50 55
60 Leu Cys His Val Tyr Val Arg Asn Asp Ser Leu Ala Gly
Val Val Ile 65 70 75
80 Ala Asp Asn Glu Tyr Pro Ser Arg Val Ala Phe Thr Leu Leu Glu Lys
85 90 95 Val Leu Asp Glu
Phe Ser Lys Gln Val Asp Arg Ile Asp Trp Pro Val 100
105 110 Gly Ser Pro Ala Thr Ile His Tyr Pro
Ala Leu Asp Gly His Leu Ser 115 120
125 Arg Tyr Gln Asn Pro Arg Glu Ala Asp Pro Met Thr Lys Val
Gln Ala 130 135 140
Glu Leu Asp Glu Thr Lys Ile Ile Leu His Asn Thr Met Glu Ser Leu 145
150 155 160 Leu Glu Arg Gly Glu
Lys Leu Asp Asp Leu Val Ser Lys Ser Glu Val 165
170 175 Leu Gly Thr Gln Ser Lys Ala Phe Tyr Lys
Thr Ala Arg Lys Gln Asn 180 185
190 Ser Cys Cys Ala Ile Met 195 2464PRTHomo
sapiens 2Met Pro Arg Pro Arg Ser Gln Leu Pro Gln Pro Asn Ser Gln Val Glu
1 5 10 15 Gln Ile
Leu Ala Glu Phe Gln Leu Gln Glu Glu Asp Leu Lys Lys Val 20
25 30 Met Arg Arg Met Gln Lys Glu
Met Asp Arg Gly Leu Arg Leu Glu Thr 35 40
45 His Glu Glu Ala Ser Val Lys Met Leu Pro Thr Tyr
Val Arg Ser Thr 50 55 60
Pro Glu Gly Ser Glu Val Gly Asp Phe Leu Ser Leu Asp Leu Gly Gly 65
70 75 80 Thr Asn Phe
Arg Val Met Leu Val Lys Val Gly Glu Gly Glu Glu Gly 85
90 95 Gln Trp Ser Val Lys Thr Lys His
Gln Thr Tyr Ser Ile Pro Glu Asp 100 105
110 Ala Met Thr Gly Thr Ala Glu Met Leu Phe Asp Tyr Ile
Ser Glu Cys 115 120 125
Ile Ser Asp Phe Leu Asp Lys His Gln Met Lys His Lys Lys Leu Pro 130
135 140 Leu Gly Phe Thr
Phe Ser Phe Pro Val Arg His Glu Asp Ile Asp Lys 145 150
155 160 Gly Ile Leu Leu Asn Trp Thr Lys Gly
Phe Lys Ala Ser Gly Ala Glu 165 170
175 Gly Asn Asn Val Val Gly Leu Leu Arg Asp Ala Ile Lys Arg
Arg Gly 180 185 190
Asp Phe Glu Met Asp Val Val Ala Met Val Asn Asp Thr Val Ala Thr
195 200 205 Met Ile Ser Cys
Tyr Tyr Glu Asp His Gln Cys Glu Val Gly Met Ile 210
215 220 Val Gly Thr Gly Cys Asn Ala Cys
Tyr Met Glu Glu Met Gln Asn Val 225 230
235 240 Glu Leu Val Glu Gly Asp Glu Gly Arg Met Cys Val
Asn Thr Glu Trp 245 250
255 Gly Ala Phe Gly Asp Ser Gly Glu Leu Asp Glu Phe Leu Leu Glu Tyr
260 265 270 Asp Arg Leu
Val Asp Glu Ser Ser Ala Asn Pro Gly Gln Gln Leu Tyr 275
280 285 Glu Lys Leu Ile Gly Gly Lys Tyr
Met Gly Glu Leu Val Arg Leu Val 290 295
300 Leu Leu Arg Leu Val Asp Glu Asn Leu Leu Phe His Gly
Glu Ala Ser 305 310 315
320 Glu Gln Leu Arg Thr Arg Gly Ala Phe Glu Thr Arg Phe Val Ser Gln
325 330 335 Val Glu Ser Asp
Thr Gly Asp Arg Lys Gln Ile Tyr Asn Ile Leu Ser 340
345 350 Thr Leu Gly Leu Arg Pro Ser Thr Thr
Asp Cys Asp Ile Val Arg Arg 355 360
365 Ala Cys Glu Ser Val Ser Thr Arg Ala Ala His Met Cys Ser
Ala Gly 370 375 380
Leu Ala Gly Val Ile Asn Arg Met Arg Glu Ser Arg Ser Glu Asp Val 385
390 395 400 Met Arg Ile Thr Val
Gly Val Asp Gly Ser Val Tyr Lys Leu His Pro 405
410 415 Ser Phe Lys Glu Arg Phe His Ala Ser Val
Arg Arg Leu Thr Pro Ser 420 425
430 Cys Glu Ile Thr Phe Ile Glu Ser Glu Glu Gly Ser Gly Arg Gly
Ala 435 440 445 Ala
Leu Val Ser Ala Val Ala Cys Lys Lys Ala Cys Met Leu Gly Gln 450
455 460 31158PRTHomo sapiens
3Met Ala Ala Val Arg Gly Ala Pro Leu Leu Ser Cys Leu Leu Ala Leu 1
5 10 15 Leu Ala Leu Cys
Pro Gly Gly Arg Pro Gln Thr Val Leu Thr Asp Asp 20
25 30 Glu Ile Glu Glu Phe Leu Glu Gly Phe
Leu Ser Glu Leu Glu Pro Glu 35 40
45 Pro Arg Glu Asp Asp Val Glu Ala Pro Pro Pro Pro Glu Pro
Thr Pro 50 55 60
Arg Val Arg Lys Ala Gln Ala Gly Gly Lys Pro Gly Lys Arg Pro Gly 65
70 75 80 Thr Ala Ala Glu Val
Pro Pro Glu Lys Thr Lys Asp Lys Gly Lys Lys 85
90 95 Gly Lys Lys Asp Lys Gly Pro Lys Val Pro
Lys Glu Ser Leu Glu Gly 100 105
110 Ser Pro Arg Pro Pro Lys Lys Gly Lys Glu Lys Pro Pro Lys Ala
Thr 115 120 125 Lys
Lys Pro Lys Glu Lys Pro Pro Lys Ala Thr Lys Lys Pro Lys Glu 130
135 140 Glu Pro Pro Lys Ala Thr
Lys Lys Pro Lys Glu Lys Pro Pro Lys Ala 145 150
155 160 Thr Lys Lys Pro Pro Ser Gly Lys Arg Pro Pro
Ile Leu Ala Pro Ser 165 170
175 Glu Thr Leu Glu Trp Pro Leu Pro Pro Pro Pro Ser Pro Gly Pro Glu
180 185 190 Glu Leu
Pro Gln Glu Gly Gly Ala Pro Leu Ser Asn Asn Trp Gln Asn 195
200 205 Pro Gly Glu Glu Thr His Val
Glu Ala Gln Glu His Gln Pro Glu Pro 210 215
220 Glu Glu Glu Thr Glu Gln Pro Thr Leu Asp Tyr Asn
Asp Gln Ile Glu 225 230 235
240 Arg Glu Asp Tyr Glu Asp Phe Glu Tyr Ile Arg Arg Gln Lys Gln Pro
245 250 255 Arg Pro Pro
Pro Ser Arg Arg Arg Arg Pro Glu Arg Val Trp Pro Glu 260
265 270 Pro Pro Glu Glu Lys Ala Pro Ala
Pro Ala Pro Glu Glu Arg Ile Glu 275 280
285 Pro Pro Val Lys Pro Leu Leu Pro Pro Leu Pro Pro Asp
Tyr Gly Asp 290 295 300
Gly Tyr Val Ile Pro Asn Tyr Asp Asp Met Asp Tyr Tyr Phe Gly Pro 305
310 315 320 Pro Pro Pro Gln
Lys Pro Asp Ala Glu Arg Gln Thr Asp Glu Glu Lys 325
330 335 Glu Glu Leu Lys Lys Pro Lys Lys Glu
Asp Ser Ser Pro Lys Glu Glu 340 345
350 Thr Asp Lys Trp Ala Val Glu Lys Gly Lys Asp His Lys Glu
Pro Arg 355 360 365
Lys Gly Glu Glu Leu Glu Glu Glu Trp Thr Pro Thr Glu Lys Val Lys 370
375 380 Cys Pro Pro Ile Gly
Met Glu Ser His Arg Ile Glu Asp Asn Gln Ile 385 390
395 400 Arg Ala Ser Ser Met Leu Arg His Gly Leu
Gly Ala Gln Arg Gly Arg 405 410
415 Leu Asn Met Gln Thr Gly Ala Thr Glu Asp Asp Tyr Tyr Asp Gly
Ala 420 425 430 Trp
Cys Ala Glu Asp Asp Ala Arg Thr Gln Trp Ile Glu Val Asp Thr 435
440 445 Arg Arg Thr Thr Arg Phe
Thr Gly Val Ile Thr Gln Gly Arg Asp Ser 450 455
460 Ser Ile His Asp Asp Phe Val Thr Thr Phe Phe
Val Gly Phe Ser Asn 465 470 475
480 Asp Ser Gln Thr Trp Val Met Tyr Thr Asn Gly Tyr Glu Glu Met Thr
485 490 495 Phe His
Gly Asn Val Asp Lys Asp Thr Pro Val Leu Ser Glu Leu Pro 500
505 510 Glu Pro Val Val Ala Arg Phe
Ile Arg Ile Tyr Pro Leu Thr Trp Asn 515 520
525 Gly Ser Leu Cys Met Arg Leu Glu Val Leu Gly Cys
Ser Val Ala Pro 530 535 540
Val Tyr Ser Tyr Tyr Ala Gln Asn Glu Val Val Ala Thr Asp Asp Leu 545
550 555 560 Asp Phe Arg
His His Ser Tyr Lys Asp Met Arg Gln Leu Met Lys Val 565
570 575 Val Asn Glu Glu Cys Pro Thr Ile
Thr Arg Thr Tyr Ser Leu Gly Lys 580 585
590 Ser Ser Arg Gly Leu Lys Ile Tyr Ala Met Glu Ile Ser
Asp Asn Pro 595 600 605
Gly Glu His Glu Leu Gly Glu Pro Glu Phe Arg Tyr Thr Ala Gly Ile 610
615 620 His Gly Asn Glu
Val Leu Gly Arg Glu Leu Leu Leu Leu Leu Met Gln 625 630
635 640 Tyr Leu Cys Arg Glu Tyr Arg Asp Gly
Asn Pro Arg Val Arg Ser Leu 645 650
655 Val Gln Asp Thr Arg Ile His Leu Val Pro Ser Leu Asn Pro
Asp Gly 660 665 670
Tyr Glu Val Ala Ala Gln Met Gly Ser Glu Phe Gly Asn Trp Ala Leu
675 680 685 Gly Leu Trp Thr
Glu Glu Gly Phe Asp Ile Phe Glu Asp Phe Pro Asp 690
695 700 Leu Asn Ser Val Leu Trp Gly Ala
Glu Glu Arg Lys Trp Val Pro Tyr 705 710
715 720 Arg Val Pro Asn Asn Asn Leu Pro Ile Pro Glu Arg
Tyr Leu Ser Pro 725 730
735 Asp Ala Thr Val Ser Thr Glu Val Arg Ala Ile Ile Ala Trp Met Glu
740 745 750 Lys Asn Pro
Phe Val Leu Gly Ala Asn Leu Asn Gly Gly Glu Arg Leu 755
760 765 Val Ser Tyr Pro Tyr Asp Met Ala
Arg Thr Pro Thr Gln Glu Gln Leu 770 775
780 Leu Ala Ala Ala Met Ala Ala Ala Arg Gly Glu Asp Glu
Asp Glu Val 785 790 795
800 Ser Glu Ala Gln Glu Thr Pro Asp His Ala Ile Phe Arg Trp Leu Ala
805 810 815 Ile Ser Phe Ala
Ser Ala His Leu Thr Leu Thr Glu Pro Tyr Arg Gly 820
825 830 Gly Cys Gln Ala Gln Asp Tyr Thr Gly
Gly Met Gly Ile Val Asn Gly 835 840
845 Ala Lys Trp Asn Pro Arg Thr Gly Thr Ile Asn Asp Phe Ser
Tyr Leu 850 855 860
His Thr Asn Cys Leu Glu Leu Ser Phe Tyr Leu Gly Cys Asp Lys Phe 865
870 875 880 Pro His Glu Ser Glu
Leu Pro Arg Glu Trp Glu Asn Asn Lys Glu Ala 885
890 895 Leu Leu Thr Phe Met Glu Gln Val His Arg
Gly Ile Lys Gly Val Val 900 905
910 Thr Asp Glu Gln Gly Ile Pro Ile Ala Asn Ala Thr Ile Ser Val
Ser 915 920 925 Gly
Ile Asn His Gly Val Lys Thr Ala Ser Gly Gly Asp Tyr Trp Arg 930
935 940 Ile Leu Asn Pro Gly Glu
Tyr Arg Val Thr Ala His Ala Glu Gly Tyr 945 950
955 960 Thr Pro Ser Ala Lys Thr Cys Asn Val Asp Tyr
Asp Ile Gly Ala Thr 965 970
975 Gln Cys Asn Phe Ile Leu Ala Arg Ser Asn Trp Lys Arg Ile Arg Glu
980 985 990 Ile Met
Ala Met Asn Gly Asn Arg Pro Ile Pro His Ile Asp Pro Ser 995
1000 1005 Arg Pro Met Thr Pro
Gln Gln Arg Arg Leu Gln Gln Arg Arg Leu 1010 1015
1020 Gln His Arg Leu Arg Leu Arg Ala Gln Met
Arg Leu Arg Arg Leu 1025 1030 1035
Asn Ala Thr Thr Thr Leu Gly Pro His Thr Val Pro Pro Thr Leu
1040 1045 1050 Pro Pro
Ala Pro Ala Thr Thr Leu Ser Thr Thr Ile Glu Pro Trp 1055
1060 1065 Gly Leu Ile Pro Pro Thr Thr
Ala Gly Trp Glu Glu Ser Glu Thr 1070 1075
1080 Glu Thr Tyr Thr Glu Val Val Thr Glu Phe Gly Thr
Glu Val Glu 1085 1090 1095
Pro Glu Phe Gly Thr Lys Val Glu Pro Glu Phe Glu Thr Gln Leu 1100
1105 1110 Glu Pro Glu Phe Glu
Thr Gln Leu Glu Pro Glu Phe Glu Glu Glu 1115 1120
1125 Glu Glu Glu Glu Lys Glu Glu Glu Ile Ala
Thr Gly Gln Ala Phe 1130 1135 1140
Pro Phe Thr Thr Val Glu Thr Tyr Thr Val Asn Phe Gly Asp Phe
1145 1150 1155 4469PRTHomo
sapiens 4Met Phe Ser Glu Gln Ala Ala Gln Arg Ala His Thr Leu Leu Ser Pro
1 5 10 15 Pro Ser
Ala Asn Asn Ala Thr Phe Ala Arg Val Pro Val Ala Thr Tyr 20
25 30 Thr Asn Ser Ser Gln Pro Phe
Arg Leu Gly Glu Arg Ser Phe Ser Arg 35 40
45 Gln Tyr Ala His Ile Tyr Ala Thr Arg Leu Ile Gln
Met Arg Pro Phe 50 55 60
Leu Glu Asn Arg Ala Gln Gln His Trp Gly Ser Gly Val Gly Val Lys 65
70 75 80 Lys Leu Cys
Glu Leu Gln Pro Glu Glu Lys Cys Cys Val Val Gly Thr 85
90 95 Leu Phe Lys Ala Met Pro Leu Gln
Pro Ser Ile Leu Arg Glu Val Ser 100 105
110 Glu Glu His Asn Leu Leu Pro Gln Pro Pro Arg Ser Lys
Tyr Ile His 115 120 125
Pro Asp Asp Glu Leu Val Leu Glu Asp Glu Leu Gln Arg Ile Lys Leu 130
135 140 Lys Gly Thr Ile
Asp Val Ser Lys Leu Val Thr Gly Thr Val Leu Ala 145 150
155 160 Val Phe Gly Ser Val Arg Asp Asp Gly
Lys Phe Leu Val Glu Asp Tyr 165 170
175 Cys Phe Ala Asp Leu Ala Pro Gln Lys Pro Ala Pro Pro Leu
Asp Thr 180 185 190
Asp Arg Phe Val Leu Leu Val Ser Gly Leu Gly Leu Gly Gly Gly Gly
195 200 205 Gly Glu Ser Leu
Leu Gly Thr Gln Leu Leu Val Asp Val Val Thr Gly 210
215 220 Gln Leu Gly Asp Glu Gly Glu Gln
Cys Ser Ala Ala His Val Ser Arg 225 230
235 240 Val Ile Leu Ala Gly Asn Leu Leu Ser His Ser Thr
Gln Ser Arg Asp 245 250
255 Ser Ile Asn Lys Ala Lys Tyr Leu Thr Lys Lys Thr Gln Ala Ala Ser
260 265 270 Val Glu Ala
Val Lys Met Leu Asp Glu Ile Leu Leu Gln Leu Ser Ala 275
280 285 Ser Val Pro Val Asp Val Met Pro
Gly Glu Phe Asp Pro Thr Asn Tyr 290 295
300 Thr Leu Pro Gln Gln Pro Leu His Pro Cys Met Phe Pro
Leu Ala Thr 305 310 315
320 Ala Tyr Ser Thr Leu Gln Leu Val Thr Asn Pro Tyr Gln Ala Thr Ile
325 330 335 Asp Gly Val Arg
Phe Leu Gly Thr Ser Gly Gln Asn Val Ser Asp Ile 340
345 350 Phe Arg Tyr Ser Ser Met Glu Asp His
Leu Glu Ile Leu Glu Trp Thr 355 360
365 Leu Arg Val Arg His Ile Ser Pro Thr Ala Pro Asp Thr Leu
Gly Cys 370 375 380
Tyr Pro Phe Tyr Lys Thr Asp Pro Phe Ile Phe Pro Glu Cys Pro His 385
390 395 400 Val Tyr Phe Cys Gly
Asn Thr Pro Ser Phe Gly Ser Lys Ile Ile Arg 405
410 415 Gly Pro Glu Asp Gln Thr Val Leu Leu Val
Thr Val Pro Asp Phe Ser 420 425
430 Ala Thr Gln Thr Ala Cys Leu Val Asn Leu Arg Ser Leu Ala Cys
Gln 435 440 445 Pro
Ile Ser Phe Ser Gly Phe Gly Ala Glu Asp Asp Asp Leu Gly Gly 450
455 460 Leu Gly Leu Gly Pro 465
539000DNAHomo sapiens 5ccagacatag gcaaggcgca aggtgataca
gtaggcagcc accatggggg ccaggaggct 60ccagcagagg ccacacaacc agcccagaat
ccaggacaga gagctggaat ggagacaggg 120aagccagata ccaggccaga ctggccaggt
gctacaggcc tgtgggccag gccaggcttg 180gggacttcgt cctgggtgtg aaggagacag
gcacccctga ggccttccct ctgcatctcc 240agcccaagct aagcgcaaac tcttaggttg
gagtaaggag taaccccctg ccaagtttct 300cctgtcctca ggctccaccc accacctatg
ctgcctggcc ccatggggca cacgctcagg 360cccagcctgg gaaagcaact gcacctgcct
gtgctatgct ggcccttctc agcctcaatg 420ccctcctccc tccccgacgc accctcgtgg
cccccgctgg gccccctgat gcaccctcat 480gtctccatgg caacctgctc agagtgtggc
cctgcccttg gctcccctcc acacctgtgt 540cccaggcagt gccacggcac tttcctaaac
agaaggatgg gcttcaaaac agtcccagac 600actaaacaca cctgcatttt gggtccaagt
aacttctgac aagacgagtg cccctacaca 660ctctcagtcc tatccactat gggcaaggag
cctgaaggat cccccagaac tggctaaagc 720cctcagtctc ctcctccacc ctgagcacct
tcacgcggca gagtggccct ggatgtcagc 780ttcttgctcc ccatggtctg cacctggaca
ggtgctctca ggtgtgtggg tgggcaggtg 840gcaggtccca agagccaggt gcaaagaatc
taggccagtg cccacgagtg ctgcagtgtc 900tgtccccagc atggtatcta gggctccact
tgcctatcag ctgtaatcgg aggaggcttt 960ccaggccagg cctcccccag gaaggctgca
ggcactgcgg atcgtgcgcc ctcacatgca 1020ttattcctga ggcccttctg cagatgccat
cagggcagca actctgatga ggtattaggg 1080cacagcacac agggctaagc caccctgtac
tgggccaagc gctacaggca aaaaggacac 1140caccgacggg catttcattc atcgctttta
tttttatata tttttgagag ggagcctcac 1200tctgtcgccc aggctggagt gcagtggcgc
gatcttggct cactgcaact tctccctcct 1260gggttcaagt gattctcctg cctcagcctc
ccgagtagct gagattacag gtgcccgcca 1320ccatgcccag ctaacttttg tattttagta
gacatggggt ttcaccatgt tggtcaggct 1380ggtctcgaac tcccgacctc aaatgatctg
cctacctcag cctcccaaag tgctgggatt 1440acaggcatga gccactgcac ccggcccatt
catcactttt aaatagcacc ctctgaacaa 1500agctccctgg gccacatgac cctaagggtt
accccatccc accccaaccc aggtctggca 1560ggtcctcaga acaggaaaag ctgagcactg
cccaaggctg cttgctgggc cagtcagaga 1620ggtctctgcc ttccaggatc agaagtacag
gctgaaagca gccttgggcc cgcctccctg 1680ggaggctaca gaggcttcag agggttccct
gaactcaaaa ccagatgtga gacttgaatt 1740tgacttaccc ctggttcacc tcccaaccaa
agcaggggtc agctttggct cctccaggaa 1800ccaggaagct tccaggtacc ctgtggagcc
ccctctgctc ctgaaaagtt gccacctgtg 1860cttggtggga tgccaggtgg tctcagattg
accctggggt cagcggtgag ggacaggaag 1920cctacagcgg gatcaggatg gggatggggc
ctcctgtccc atggctctgc agctatgagg 1980cagctttcct agggtgggtc tcctggctgc
agctaagacc aggcaacagg attcagcaat 2040gacagggctt cttctactcc agggctccct
cacctggtta acagcaaaaa agaaaataca 2100gttcctgcta gcaaggtcta tagaaaggag
gtgaaggagt caggcctgca gctacctctc 2160ctggacagga gctggtcagg ataacttgga
cccttgcatg cggcaggccc acaggcacac 2220agcatgaggc cactctctcc cccgggggaa
gggcttggtg aagaaaggat tcccctgaag 2280cacaaagaaa gcacaggacc actgtgaaat
ttcaagacaa ctttatccag acaggcgcct 2340ctcaaataga acacagggaa gttaggcagc
agttactaaa atacagtctc gccaaatgat 2400ttacaacaga acacaacagg agcaggggat
ctgtgggtgg ggctgggctg ggccctctat 2460ctcacagggc ctgagtcaag ccagcccgcc
ctgcaaggca ggggctgacc tgcaagcgga 2520gatctcactt cctcttaccc caaattcata
cctccatttt ccccgccccc atctctcccc 2580agggtcctca agtgggaaag ggagaggtag
catccctcgg atccaggccc actccactcc 2640gtctccggca ccagtgggca ggctgagtct
gggcctcaag gggccctggg cttagggtat 2700ctatggcagt aggaaaatga catggacagg
ctcttcaggg gtaggctaaa gtcctctggc 2760cagcagtacc cagagaaaat gggcagcagc
aggtaaacca gccaggaggt ggagtcctct 2820gaacccacag cagaccccac cctcctgccc
agcccctgcc cacattgggg gtcaggacca 2880ctgagactct ggtcaggaca gtgggtgctc
tcagcagtgt ggcaagctca gagcagagct 2940cccaaggacc ataccacact ggttcaaaac
ccataggtga caccatccca gcagaagctt 3000ccatgggtgc tggatcccag ggctgcatcc
tgagcacagg tgggcagact ggaacataac 3060actaggaccc aagggatcca gaacatttta
ggcccatctc ctgggctgct ccagcctgtt 3120gccatgactt gggcagtgag tgggcctcct
gccaggtggc agggcacagc ttagaccaaa 3180cccttggcct cccccctctg cagctacctc
tgaccaagaa ggaactagca agcctatgct 3240ggcaagacca taggtggggt gctgggaatc
ctcggggccg gctggcaccc actcctggtg 3300ctcaagggag agacccactt gttcagatgc
ataggcctca ggcggttcaa ggcagtctta 3360gagccacaga gtcaaataaa aatcaatttt
gagagaccac agcacctgct gctttgatcg 3420tgatgttcaa ggcaagttgc aagtcaaggc
aagtgtccca gaggccctgg gcagctgagt 3480gcacctgtgt ttgatcttcc cctgatgatg
gacactccca gctgaccatc caaacaccag 3540gaaaacatcc ccctttcctg ggctcagttc
ctagtctact tgctggtacg aacccaaccc 3600acacactccc cgcccacaat gcagctcctt
ccaaatcctc ccacaagcca cctttgtggg 3660acttggaagc tgcttaggat gggccctgcc
ctctgcggga agccaatcct agcagaaagg 3720taagctaaac aacagtctca gaatctgaga
cccagtgact gttccccccg ccccaggcct 3780tgggcctgaa gtgggggcct gcctgtggcc
tctgtggtgg gctcactccc acccccaaca 3840gtggccccag gagaggcttt cccaagagtc
ttcaaactcc acccacccca gccctagcat 3900cagggactcc ccacccccca ctggagtgtt
aatatcatta atgtacaaat aagatccaaa 3960gatataccaa agatcgagaa acagctggct
ccgacctccc tcccacagag ccttcccagg 4020gttagctgaa aaagagccct ttggcatcta
cagaagccag tcggagttta tggtttcatt 4080tgcccaaaaa tacacctttg gggacctcaa
attctttcca agaatcacta ccacacatat 4140gaatttgaac attcgccacc cttccaccat
ccatttctcg caggaacttc aaaataaaaa 4200tggccagtct gcccccactc tggctcctcg
tctatggctg tctcttcttt tccaggggct 4260gcagttctga tgtgaatgat ggtgccattc
cagcattggg cctctggcag gctgcatcac 4320atgatggcac agcatgagtt ttgtttccgg
gccttggaaa aaaacaaaga ggagctgaga 4380aggaggactg acgaagtaag ggaagcccca
atcctggcag gcgtggcaga gggagctcca 4440caggacacag ccaggcagag aaactagcac
tagaacaggg tgggggtgga ggccttgagg 4500gaagctgtcc acaagcaatt cccatcacca
agcacaaggc gggccccggc ttccaaaact 4560agtctgggat cctttttcct ttcttttctc
acaccccatt aatgctatca aaaagtgagt 4620aaaattccta cagttaggcc aggtacaaac
aaaggaccaa taatacaaat gggattggca 4680gaatatctta actttgcccc actcctgtct
tcacacaatg ctatctgacc accacggtgg 4740tgtttcttcc tagaagatgg tcctgaggac
aacagatgtg gttcccactt gggatgtggt 4800ttgtggggac cactgttgcc accttctctc
ttgctttctg gtcacagact atcttcctaa 4860tcccacctag ccatctccct ccaatgtgca
catgaaagca aatgtgtgtg gacagaccaa 4920gtaaatttgt ccctatgact atccaaccat
gggccaacag tgccatctcc acataggaag 4980acatgagcac tgacctgaga gaaagcggca
gtcagcagca cccatccttg tcaattaaat 5040attttctgtc aaagggaaat taaaagctta
agaacctctt caggaaggct gaattgcttg 5100catcttaaag acttatgtct actcagcaga
aagaggaata agattcaaca gtaaatctct 5160ggtgatcaga acttgaacca gccttcctgg
actgggagta ggagttcaga aatcagccag 5220agcagcagag ggcagagcag aggcaggagt
ggaacaaggc ctcggcccgc atcgactcca 5280acggcgccca agtgaactgc ctccaaccac
ctgggcctga ggcgctcacc ttaggctctt 5340gccgcacaag gaatcatcca ccatgattca
acagtctaag aaagacccgt tcatagtgga 5400gagtgccaga agcagcaagc tgcgactgct
ctctagagag aacacccagg aggcagcagg 5460tgctgggtac tcacagtttt atagaaggct
ttagactgtg ttcccagcac ctcggatttg 5520gacaccaagt catctagctt ctcacctcgc
tctaacagag actccatggt gttgtgctgg 5580acaaaaaaga aaagagaatc cagctctgtt
cagtacgtgc cctgacatga gcccctcata 5640tttcagtcat gggggaaagt gccttacctg
ggttcctctc caacacacac aaacttcacc 5700tctaggtgtc gagactcggt ccaagaatag
ttactgtcca agtggatgga acagaacctg 5760gtgacattcc cgtgaaatct agaagatcta
actgggatgt agcagacttc ccaaaaagct 5820gtccccagca caggcttaga taaccagcac
tccaggaaaa ctcatatata tatatacaca 5880cacatttata tatacatttg tgtgtgtgtg
tgtgtgtgca cgcacatgtg cgtgtgcatg 5940gagctttgga aaaaagagta gctgggcact
atatgattgt actgggttgg agagtgaccc 6000acaccgcacc ccccaacccc aaccgcatcc
cagaaattaa catccccaga atctctgaat 6060gtgaccatat ttagaaatag ggtcttggca
gatgtaacta gttaggaaga ggtaatactg 6120gattagggtg gcatctaatt ccatgactga
tgtcctggta agaaacggaa acacacacac 6180agaaggtcac gtgacggcag aggcagagcc
tgaagtgatg cacctctaat ccaaggaatg 6240ccaaggatgg ccagcagcca ccagaggctg
gagagaggcc tgggacagac actcagagcc 6300ccaaaagaca ccagccaggc ccacagagct
atctgttaaa agcaaatatt tgagggtttc 6360tgttgacagc agccacagga aacaaaaggc
ggtgggaaat ggctattgag cacttgatgt 6420gaggcaagtc caaactgagc agcgctctga
gtacagacac accagatttc agatgcaaac 6480tcacacatgc ttcattagta agttttatac
tgaaaaaaaa acaagtttta taccgattac 6540atgttggaaa aattgtattt ggatatactg
cgttaagtaa aatatataat taaattaaat 6600tctacctatt ttccttttat cattttaaaa
tatggctcct agaaaattct aagttacaca 6660catgccccaa atatatacca gacagcacta
tgacagaaca tgtcctgcct tctaaatggg 6720ctatgtccta aatgtcatca ctacaaactc
tgacttagga aatgaaaaca ctgaccccat 6780gggaaggggt ctagagatgg agacctcaca
agagccagca gctctgctgc cagggccctc 6840aggaagcagc agctcgcttc tctcctcaga
tggccactgc tgcagcagct agatgcacac 6900atgaagcgcc atagaacaag gagccagcaa
gaatgtcctt catccctaca cacagctgag 6960cgactcaaat ttttaacaca gaaagttaac
tgattcagat atgcacacca atcatctaga 7020ttttacaact gcagctagat gaggctgggt
gaataggact catccactcc ccaccgtggg 7080gagaggagaa acagcgggtg tcccaggtgt
catggtactc agactaggac ttgagcaaca 7140gaaagagatg gcttgaggag aaaacggaga
aatgccacct aggtggtaag aaagctcaca 7200aggtttcaaa agacacagat accatgagac
tttcacatct atcgttcatt ccaaagccac 7260gttatttgga gtgcagtcag cacacctgtg
tttgaagccc ctgggatgct ttttataaaa 7320tgcaggttcc caggctccat cgcaggccaa
caactccaac cccaggagac gctgatgtac 7380acactaaagc tatgcctgtg taaatggtaa
agctttgtat gtgggtttca atccactcca 7440ggtatctatc aactgctgag catggtataa
actaggcact gtatcatgag caggatggaa 7500agatgtccca gtgctcatac gctggtcagg
gagacatgta aacaagcagt gacaaaactg 7560tgacatctgg tcagaaaggc ccaaccttca
ggcgcctgtg tgtgagctgg gcaagaaagg 7620gtataagaga gaacagggcc cagtcaggag
actgtgagtt agtttgcact ttatcctggg 7680gcggatctga gagctgctga agggttctaa
gttgtgcaga tcaatgacta ctctctggtg 7740gacagactgg aggtgagcag gaggcaaggg
gaccacttag aggcaaaggc tgtaagagaa 7800aaacctgaga aaaacagata gctgcttaca
ttccacttgt atgcaaaaat ttaaaaaaaa 7860agagttgaag caacagttac aaatcaggag
atttcagctc aaaatgcagg gttctggctc 7920ttttcaaagg ggcctatgtg acaaccctgg
gcccatattc cagaagctgc cctgtggtca 7980gtgcacggtg cttcaatctg ttcaccttca
atgcaaacgc tgcaagggga ggcacctgtg 8040gggtgtggag gcacccgaaa ccctaacaaa
ggcaccaggg tgggaatcca ggtcttcaga 8100agccaaaccc taggaaccca gtaaatggtc
agacaggcag tagccatgag gaagggagac 8160ttgagggttc cactggttcc cagcttggtc
ccctagaaac aatgggtgcc attaaccaag 8220agaagggtat aggaaagaca gtctgatgcc
cggggtgggg gaaggggtgg gcaatcccac 8280ttgctggaga gtgccgtggt tactattata
ttaaaacgag gatggatctg tgcatgcctg 8340gccagtggaa atcgcacccc cgcctcagtt
cttgggcttg ctctccatct tcctgcttac 8400cagaatgatt ttggtctcat ctagttcggc
ctgcacttta gtcatgggat cagcttctcg 8460tgggttctag gaaagagtga aaaataataa
agtcaggact ggagtggcta cctgcaaaca 8520aaacctaaaa ctgaggaagc tggacaaact
ttcacaggtt aaaaaccaca gcctgggccg 8580ggcacagtgg ctcacgcctg taatcccagc
attttgggag gatgaggcgg gtggatcacc 8640agagatcaag agttcgagac cagcctgacc
aacatggtga aaccgtctct actaaaaata 8700caaaaattag ccaggcgtgg tggcacatgc
ctgtaatccc agctactcgg gaggctgagg 8760caggagaatc gcttgaaccc aggaggtgca
ggttgagtga gccgagattg cgccactgca 8820ctccagcctg ggaaacagag tgagactcca
actcaaaaaa caaaaaacaa aaaaaaaaac 8880ccacagcctg tttaacatgt aacagaaacc
caaagcctgc ctagagcttg ggttccccgg 8940tctgaacgta gattctctgt tttccaaaca
gtaaggcttg agagaggaca ccagcatcag 9000aagctgtcag aagtaattag accagaacta
tcagggcagt tggctttttc agtttcacat 9060ggattctggg ccacatggtg tctgctgaag
cttcctttaa ccctacctgg tatctactga 9120ggtgaccatc cagggctggg taatggattg
tagcagggga tcctactggc cagtctatcc 9180tgtcgacttg cttggagaat tcatctagta
cctgcaagac aaaggagact caacaagcct 9240cccactgtgc actcaccagt ggtctcaatg
acagggcttc acccctgagc acctcaccct 9300gaatgaggct ccttggcctt cacagcccag
gaaggaggaa tgagggggac atataatggc 9360aacagagaaa atctaggcta aagttctttc
caaattttta tcattaaaac atatcctaaa 9420tattctgaga atcaaaagta tgcccagccc
gagatgaacc tcacttgggg agtaataaag 9480gtatttgaat tttaaactac agatttccag
aaaaaagggg cactggtcct ctaattttcc 9540aaagcaattt tttaaaaaag agaattaggt
cccctagatt taagaaacca ccagattcca 9600tgtgtttgga ggtattttgg tgctctgggg
tataggatga agcctctgac ttcaaagagt 9660taatattagt aattagcacc gtacgcaaaa
aaatttaaag aatgcttagg tgctaagctc 9720tgtggtgcaa ctgactgaca tcaaggtaga
gggatgcagc aactgcagga ggcaatgggg 9780agagtgaagg cattcaagag ggagactcct
tgagcagaag cacagggggc gagaacacaa 9840ggcacagctg tctccgaggg tcccatccca
gagaatagat gctatgactc agtggcctag 9900acccagctca catgagggac agcaccgggg
aggaaaccca tacagggatg ccaaattgtc 9960tcttgggttg cagggaaggg ggctgaaaaa
tgtgttgact ttggacacat catttcatcc 10020cttatgtctc agggactgcc atcaacccct
gtcccagtcc ataaatgtgc ccattcatca 10080tccaagtcca ggagaggcaa ataaaaaact
caccttctcc agcaaggtaa aggccacccg 10140ggatgggtat tcattgtcag caatgaccac
acctgcaaga ctatcattcc ggacgtagac 10200gtggcacaga tagtctaagg agacaagaga
tcagacacat ggatgctgac atgagggctt 10260cagacttctt ttaatccccc caaatcaaag
catccaatgt taggccaaat gaagccactc 10320ggaagctcaa tagctctggg caagtcttgt
ggagaggctt agcagcacag cccaatgggc 10380cacacacagg agcttggccc aacgcctgct
ttaggaccag taaataccca gaggcccagt 10440atgcaaagcc agggcttaaa gaaacagcca
gtggtgcaga aaacacaccc ttgacaacat 10500ggccccagga gcatttccaa gtgtattcct
taagctcggg tcaggccaag ctatatctta 10560gggatctgga gcccttgggg ctctgtgctg
ctcccaaact tagggaaccc tggacaagcc 10620aagaggcctc tgctttctta aaaaatcttt
tcagagcagc caaaagacag gaaattaccc 10680cccagggcct cagtcttcca tattatagca
acctgctggg tttgctccac tctggtgggt 10740gactgggagt aggggggtta gtctagaaaa
agattagcta ctgccagcta aggcctccag 10800agcactgtgc taaaatcctc atatgattga
aaggtacagt tgtacaggtc ttccgcaaaa 10860tattcacaat ccacaggatt gttcatttcc
atcactttga aaggattcag agttgataca 10920gctaaccata tccccaagga aagagaaatg
taaggattac agcttacaaa taagaacctt 10980cttgtcctta aggatctgac ccagaagatt
ccaatgctaa acaacagaaa aacaaataaa 11040agaggaggga atgatggtga gcccctgaaa
tcagaaaaga gcagagataa atgagaacaa 11100gaatgaggag gaggaagagg acagggggtt
gtcaccaatg ctctccagat tttgtatacc 11160atccccaatt aagattcaaa catggggtca
aagtgcatac cctccaaaga aactgagaac 11220ctggtcagtg gaggaattgt ctttaagtaa
taaacgtggg aagggcaggc acagtttgaa 11280gaacagagca agaacactga aatatttgtg
atgcgatttc acttctatga tgttaatagc 11340acagagatcc cacataaagt gtatatagtc
aatcctgcct gtatcataac tgacatttat 11400atcatcaatt cagtaactct atgtcacgtg
acttgaggtt agcataagtg tgagatgatc 11460tttgtcccta cctgatgaaa ctcatgtaac
tctttcctga tctgtctgta taacatacac 11520atctaaataa atgcctaaac ctgaattatc
agaaagaaaa aatagttttt tcagattcct 11580gatcaaaaaa tctacgatgc acagaataca
tatagtacct caacagtgct agctggaaat 11640ccttttttga ggggtctgca actctgaaga
ggatagggaa gaatacgata tgaaggctgc 11700ttactgctcc aaaagagtca gaccctaatc
ttaaatgagt ctaagtttga gggcaatttt 11760atctgggaag ctcagacttc aacagtgggc
acagaattct gcataaatag gaaaaggaag 11820aggtgggaaa gagagaacaa gctagaggag
gagtagggtc ccagtagaaa ggagaaagct 11880gggtgctatg tgaggtgagg catggcagcc
aggccagcac acgcacagaa gttggagggt 11940cttcttacct tgttctttga cagaagctct
agtgcctttc gatgagcgct ccacaatcag 12000ttgactcgtg aaggtcatga attcctgaac
gctaagaaac acaaaatgta tttattgcct 12060acttcttatc accttgtccc caacacagtg
gaaagtgacc tctgggctta tacattaagt 12120agacattgct tcttggtttc attcctttcc
ctcccatccc tagtaacaaa cactctataa 12180atgagcacaa atactgataa ttatgaatta
tcatcaccat gaaagctcca tctgtttgct 12240acctggctca ccaaaacagg tgaattttct
ggggggtttt tccacaggat acagtcaatt 12300ttacattttg gtgaatgcat aatttggaat
gcaatggaaa aacaagaggc aggtcctgct 12360ctcaaggtcc caataacttc caagaagcag
gacatttata agaactgcac tagaagaata 12420gtgtgcaaaa actgtcaggc agaaatgcac
aaccatttat ggctgtgtcc acatgacaga 12480ccctcgcaat gccacataca cccatagtga
gtgctggctc aggtctgctg gggctcgtcc 12540acagaacgag cgcaagacac tctggatgga
acaaaaggaa aactgctcat ccaagacaaa 12600gaagtgggaa atggctcata caaagggtga
aagggagaag gtccatcatg ggctcaacag 12660agagatctat ccagaacaga acagtcacag
gagatggtac agccagagga agaggtgctg 12720acaaggagcc tccaactgag gatgtgatat
aaagggcaac cagggccatc aaagcagggt 12780gctcaaatgg gagtctgcag caggctccag
cagagccata taggtaactg aaggcctgac 12840tctgggcctg tgtgctgtgc ctccacatta
aaaaaatcaa gatttgtgca acagttaaac 12900gaggtaatac gtgtaaagca cttggaacaa
tgcctgcaca cacagtatta cttgttaata 12960tcttgaggga ctgaagtgat caaaataacc
cctcagaaaa gaagacctca aacaaggaag 13020gctttgcagt aaacctagag acagcatttg
agacacggct ataaagagac aaaggaagaa 13080ctgcattgtg acagcatgta tacaaagacc
aaaaaagctg ggaaactact ttttcaactt 13140tggaatcggg taattatagg gcacaaagga
cgtaagtaaa gcggtcttat aagaaaacaa 13200gctcaggccg gacgtggtgg ctcaagcctg
taatcctagc actttgggag gccaaggcag 13260gcggatcact tgagctcagg agttcgagac
cagcctggct aacatggtaa aaccccatct 13320ctactaaaaa tacaaaaatt agccgggtgt
ggtggtgcgc gcctgtaatc ccagctactt 13380gggaggctga ggcaggagaa tcacttgaac
ccaggaggcg gaggttgcag tgagctgaca 13440ctgtgccact gcactccagc ctgggtgaca
gagcaagact ccatctaaaa taaaataaat 13500aaataaataa atcagctggg acatgtgttg
ttttaagaca tattagtaga gatgtccctt 13560tagtgttgca gctgttagtc attggaaact
agtgtgggca tcccaagcag gtgaggtata 13620agtcctacaa gtgaaatctc tgagaatctt
aagtactaat gggaaggaaa aaggaaaaag 13680aatcagagcc aagttggcac caaaagttcc
atctgagaaa agcaacaaca cagagcagtg 13740aatgtaggcc atggtaaaga ctgcaaagac
caagaacccc aagaaggagc taaaagataa 13800tgcagcaatt ccgcttctgg gtaaatacca
aaaaaatgcg agcagggtct tgaagagata 13860tttgtacatc catgttcata gcagtatcat
tcacaatggc tgaaatgtgg aagcaaccca 13920ggtgtccact gacagatgaa cagataagca
aaatgtggtg aataatacaa tggattattc 13980agccttaaaa aggaaagaaa ttctgatata
tgcaacaaga tgcatgagcc ttgaggacat 14040tatgctacat gaaataagcc agacacacaa
aaactatatg attccattta tctaaggtcg 14100ccagaaaagt caaaatcaca gagacaaatt
agaatggcag ttgccatggg ctgggggaga 14160agggaatgtg tttaatagac acgaatttga
taaaaaggag ttctggagac gattgacagt 14220gatggctgca caacactatc aatctatttc
atatcaatgc actcactaca cgcttaaaga 14280tagtgaagat aaattttgtg taccatttta
ccacaattaa aaatattttt ttaaaagaac 14340tcaaagaagc agaaagtttc aacaaaataa
catttttttt tttttacatc cagcaagtcc 14400ttggcaaaga actctcatca agaaccagct
gcactgaagc agggaaaaca gaatccaaac 14460ggcagattcc atcagatttt gagacaagat
gaccatagat accgaccatg tagggtcctc 14520cttctttcgt gcctgagtca ccccaatccc
tcccacgaat ggtctggaag tgtctgtgtt 14580acttctaaca cgttccagca attaaagcgc
cccagaaaca agtaaaagcc tgtaagccct 14640acagatccca tgcttcattt gcatcttccg
tgtggaatcc ttttgtacca ctagtgtcca 14700actaaaaagc gttaaacctg gctttcagtt
ctagctggtt gtgatataac ctcttggtac 14760ctcagtgact tcacccatta aaaacaaaca
aaaaaaagta tatcactatc tctcatacag 14820aattgttggg aagccccgca agaaaatcaa
aatatggctc tcaagatgcg gcacccaagc 14880tcccagagtc agaatcactg ggtgggaagt
gttggtctaa aatataaata ccgaggcctc 14940aatctactaa ttcagaacat cttggcatga
agcttggaaa tctgcactac ttcacagtct 15000ccttaaaatt tttacacgac agaaatttga
aaaacactga gtagagaact atattctaga 15060atggtataag ctcttaaaga gctaatgttg
gttcctcaaa ggtagagtcc acggccagat 15120tccattatag gagaccaagc ccggacagca
gaccccgggc cctccccacc ccgccccgcc 15180tctgactcgg acaccagcct tctcagaccc
cgggcactcg gccaccccgc cctgccccta 15240cccttggcct cctccaccct cccctcatcc
ctccgccgac cccaggccca ctccgactcg 15300gacccccacc ccagtcctct ccgcccgacc
gccacggccc accagcctgt gccgctcacc 15360tggatctctg gaaaaagctg aaggaagaca
catcgtatgc ggctttgagc agcaccacct 15420tggcctcgcc tttgtagagg acgctgaggc
tgtacagctt catggctccg cgccctcagg 15480ccgcccgcct gcccagctgc gggacccgtt
ctcagggagc agcgcggccg ccgcccctcg 15540ggaccgccgc cgcctaccgg cctctcagca
gccggctgct gacggggcca ccgccggctt 15600cctcctcctg gctcgcaatc cacttccgga
tccggtcagc ctggttgagg gttctcatac 15660tccggatgca gaaatgtgag cccggaagta
caatgcagcg aggggcggga tgccacgcct 15720cgcgtaagct tggcccctcc ctgctcgcca
ggtggagtcg ggcgcgcggc gggataccgt 15780actgtcttgt gctgggtggt gctgggcctc
ccacagcggc ctgaaccctt cttttttttt 15840tttttctttt ctttcttttt ttaaagtaag
catttttttt attattatac tttaagtttt 15900agggtacatg tgcacaacgt gcaggtttgt
tacatatgta tacatgtgcc atgttggtgt 15960gctgcaccca ttaactcgtc atttagcatt
aagtatatct cctaatgcta tccctccccc 16020ctccccccac cccacaacag tccccggtgt
gtgatgttcg ccttcctgtg tccatgtgtt 16080cttattgttc aattcccacc tatgagtgag
aacatgcggt gtttggtttt ttgtccttgc 16140aatagtttgc tgagaatgat ggtttccagc
ttcatccatg tccctacaaa ggacatgaac 16200tcatcatttt ttatggctgc atagtattcc
atggtgtata tgtgccacat tttaggagga 16260gcttgtacca ttccttctga aactattcca
atcaaaagaa aaagagagaa tcctccctaa 16320ctcattttat gaggccagca tcatcctgat
accaaagggt ggcagagaga gacacaacaa 16380aaaaagaatt ttagaccaat atccttgatg
aacattgaag caaaaatcct cagtaaaata 16440ctggcaaacc gaatccagca acacatcaaa
aagcttatcc accatgatca agtgggcttc 16500atccctggga tgcaaggctg gttcaacata
cgaaaatcag taaacgtaat ccagcatata 16560aacagaacca aagacaaaaa ccacatgatt
atctcaatag atgcagaaaa ggcctttgac 16620aaaattcaac aaccctcatg ctaaaaactc
tcaataaatt aggtattgat gggacgtatc 16680tcaaaataat aagagctatc tatgacaaac
ccacagccaa tatcatactg aatggacaaa 16740aactggaagc attccctttg aaaactggca
caagactggg atgccctctc tcaccactcc 16800ttttcaacat agtgttggaa gttctggcca
gggcaatcag gtaggagaag gaaataaagg 16860gtattcaatt aagaaaagag gaagtcaaat
tgtccctgtt tgcagatgac atgattgtat 16920atctagaaaa ccccatcgtc tcagcccaaa
atctccttaa gctgataagc aacttcagca 16980aagtctcagg atacaaaatc aatgtgcaaa
aatcacaagc agtcttatac accaataaca 17040gacagagagc caaatcatga gtgaactccc
attcacaatt gcttcaaaga gaataaaata 17100cctaggaatc caacttacaa gggatgtgaa
ggacctcttc aaggagaact acaaacgact 17160gctcaatgaa ataaaagagg atacaaacaa
atggaagaac attccatgct catgggtagg 17220aagaatcagt atcgtgaaaa tggccatact
gcccaaggta atttatagat tcaatgccat 17280ccctatcaag ctaccaatga ctttcttcac
agaattggaa aaaactaaag ttcatatgga 17340accaaaaaag agcccgcatt gccaagtcaa
tcctaagcca aaagaacaaa gctggaggca 17400tcacactacc tgacttctaa ctatactaca
aggctacagt aaccaaaaca gcatgctact 17460ggtaccaaaa cagagatata gagcaatgga
acagaacaga gccctcagaa ataatgccgc 17520atatctacaa gcatctgatc tttgacaaac
ctgacaaaaa caagcaatgg ggaaaggatt 17580ccctatttaa taaatggtgc tgggaaaact
ggctagccat atgtagaaag ctgaaactgg 17640atcccttcct tacaccttat acaaaaatta
attcaagatg gattaaagac ttacatgtta 17700gacctaaaac cataaaaacc ctagaagaaa
acctaggcaa taccattcag gacataggca 17760tgggcaagga cttcatgtct aaaacaccaa
aagcaatggc aacaaaagcc aaaattgaca 17820aatgggatct aattaaacta aagagcttct
gcacagcaaa agaaactacc atcagagtga 17880acaggcaacc tacagaatgg gagaaaattt
ttgcaaccta ctcatctgac aaagggctaa 17940tatccagaat ctacaatgaa ctcaaacaaa
tttacaagaa aaaaacaaac aaccccatca 18000acaaatgggc gaaggatatg aacagacact
tctcaaaaga agacatttat gtagccaaaa 18060aacacatgaa aaaatgctca tcatcactgg
ccatcagaga aatgcaaatc aaaaccacaa 18120tgagatacca tctcacacca gttagaatgg
tgatcattaa aaagtcagga aacaacaggt 18180gctggagagg atgtggagaa ataggaacac
ttttacactg ttcgtgggac tgtaaactag 18240ttcaaccatt gtggaagtca gtgtggcgat
tcctcaggga tctagaactg gaaataccat 18300ttgacccagc catcccatta ctaggtatat
acccaaagga ttataaatca tgctgctata 18360aggacacatg cacacgtatg tttattgtgg
cactgttcac aatagcaaag acttggaacc 18420aacccaaatg aacccttctt tttgcttgcg
ttgttgaaag aaggcaagtc tatggatagg 18480aatgagtgag gcacagctcc ctgaggatgc
catatcttgc ccgtttcttg tgtattaagt 18540gacatcacgt gttaccaaac taaaccggct
gcatttgcct gcgcacaaca taaaaccaaa 18600cacccaagca ttggattttt gtagcaagaa
agatgtattg ccaagcagcc ttgcaagggg 18660acagaagacg ggctcaaatc tgtctcccaa
tacttgcttc gcagcagtag atttaaggga 18720gagattttgg aagtggagtt tcgggctgga
cggtgattgg ctgaaacgaa gaagtgttta 18780gaaaatctct tggtcatgag ctgttgcttc
ttcatgctgc ttcaagggtc acatgcagat 18840tcaggaggtg gtataaaaca agctgtggga
atttgggctg tgacatcaaa gggccgctcc 18900tcgggctagt aagtctattt tgcacaggct
ccagtcagcc atattggttc caacctgttc 18960cagcaagttg tataagcaga ggggattata
gcaaactgtt tccttatcgg ctgccctgca 19020agacaagctc aagatttctg ttagttacca
gtttctttaa ccctgtcggg cacagtttca 19080catgtaatca gaaaggaact tgcaagacac
atacaactga aagaaacttg gtctttggaa 19140gttgtcagta aggtcacaaa gttgtgatgc
tagaagcagc cgtatctgag attatgggaa 19200agagatgata tattggaaaa acaacagcat
cactttaaac attactctaa atcaaggttt 19260ctcaaccttg gcactattga cattttgggt
tagatagttc tttcttgttg ggagactgcc 19320ctgtacattg tgtaggcagc atctcaggcc
tttgtagaaa tgtcagtacc aacccacccc 19380ctccccactg cacaatcaaa aacgtcaaaa
tgtcctttgg gagcagtagt tttgagaaac 19440attgctttgc agatatatat gtttgtttgt
ttgttttgct ttgtgacagg gtcttactct 19500gttgcccagg cagaagtgca atggtgtgat
cccactcact gcaacctctg cctcccaggt 19560tcaagcgatt ctcatgcctc agcctcccga
gtagctggga ttacaggaat gcatccatac 19620acgcggctaa tttttgtatt tttaatagag
atgggatttc accatgttgg ccaggctggt 19680ctgaaactcc tggcctcatg tgatccaccc
acctcgacct cccaaattgc tgggattaca 19740agcttaagcc actgcgccca gctgagaaac
attgctttaa ataatctgtg gtgaaaggaa 19800gttcccacca cctgcccact cactcagtac
ctctgtcacc aaccctcttc cctgggtgtt 19860tccaagtaca gagggtggaa agggcttttc
cacatttccc ctgttttggt agtaaacatt 19920aggaacagcc attggccgtg gctaggctca
gccacccaca gatatggaca cagtagtctg 19980acaagctggg ttgctgggtg ctatcagtcc
aggctcaact gcttgcactg acaccatttc 20040cctataggag gcaggtgaga gccatttctg
aggaaagtct ctggagcccc tcttccttcc 20100actgaaagtt gtgcaaaaag atcaggaaga
cagcgcttgg atggaataaa tttcagtgta 20160tccacttgac acattatagt ggctgtccca
aagtttacct tatgccaagt actttccatg 20220tgccacatca tttaatcctc acaaaaacag
gggaaaatat tattgccacc ctacagacat 20280agagactgag attcaattta aggagatggt
tggtaaggga cagagttggg gttcagatgt 20340caacagtgaa atgcttaaca aactgtcatg
cagcccactc ctggcaactc ttcctgctcc 20400tctctggcct cactcagcct ctactgttcc
aggaagcctc attcatagtc atgtggttgc 20460agacttccca agctcactgt gttaccaaaa
agcaagacct gccttctgct gcatcgcccc 20520agctgtcacc caacttggat tcagtcccag
cactgacaca tcacaaaatc acaaaagtga 20580gcaaaccatt acctccctga gtctcctttt
gtttttatct ataaaactag aaaaatattc 20640tttccatagg aatgttgttg gaaataataa
aacattatat tacaagctct agtcattgtt 20700gatgtttaac aggtaacagt gataattatt
tgtcttctca ttaatgaaga aaaggattat 20760taatcataga gggtggaagg catctatggg
aagtagagat ttgaagatag gctaaaaccc 20820aagtaaggcc tctagattag ataatagtat
tgtatctatt ttaatttcct gctttccatc 20880actgtgccat ggttatataa gagaagtctt
tgtttatagg aaatatacac aagaatttag 20940aagtaaaggg acattgtgtc tgcaacttac
tcttacaggg tgtgtgtgtg tgtgtgtgtg 21000tgtgtgtgtg agagagagag agagacagag
agagagagag acagagagaa agagaatgat 21060aaagcaaata caggaatcag gatgaagcgt
atctgtttgt ttgttttgct ttgtgatagg 21120gtcttgctct gttgcccagg caggagtgca
atggtgtgat cccgctcact gcaacctctg 21180cctcccaggt tcaagcgatt ctcatgcttg
tattgttctt gcacctgttc tgcaagtaca 21240acattgtggg aatggaaaat gcaggaaatg
ggcagtaagg ctatgaacga agcccgcaca 21300ggagtgtggg tagcagagtt ctctagtcca
ggctcccacc tgaggtgctg ggacctagaa 21360gaaaagcctc tctgcagaca gaactggagt
taacgctgtc cacgataaat ggcccaggcc 21420ctgttaagtt tgccccattg agcaaaacaa
gtacccaccc gcctttgcag ccttgcctag 21480ctcacataag gtgccagccc ttgctgtaca
gcagaacctt tggggagctg gacaaaagcc 21540tatcaaggag cataccccca ggaagcccag
tccaggtggg gagcccagcc acacaatggc 21600ccttgccccc acacctcctc attcagtcag
ctaaggccat ggcagctgag ctgcctccac 21660agctcatata ggaaaagggt gtggaaaggg
gccaccaatg tggtcaggcc tccatggcct 21720gagtaggtca ccaagcctca ggtgcacaga
cttgatgtca tcaatcaggg tctgtcagca 21780cacctagccc tcaggaacac tgctccccac
tgcaacccca caccaaggca tcctgggctc 21840cctctgggtt ctccaggccc cagggaagac
agacagagtc tgccaccaaa ggtttgagct 21900ctgccactgg ctacgaagca ataggggatg
tcagagcaag ggaggaacag gacaggagta 21960tacgtgggca ggaagggatt acagccaagg
aagacaggag gcaggtgccc tgattttgag 22020gctgtgcccc agcaggggct tcccagaagc
tgtatttgtc ctaagacacc cctctgcagc 22080tgaggggcta gagatggata tgtagctgtg
ttaggccatt cttgcattgc tataaagaaa 22140tacctgagac caggtaattt ataaagaaaa
gaggtttcat tggttcacag ttctgctggc 22200tttgcaagag gcatggtgct ggcatctgct
cagcctttga ggaggcctca ggaaacttac 22260agtcatggcg gaaggcaaag gggaagcagg
cacatcacac agtggaagca ggagtgagag 22320agagagaggc actgggaggt gccacacttt
taaacaacca gatctcgtgt gaactcagag 22380caagagctga ctcatcacca aggggatggc
ccaagccatt catgagggat ccacccccat 22440gactcagaca cctcccacca ggccccacct
ccaatattgg ggattacaat tcagatgaga 22500tttggtgggg acacatatcc aaaccatatc
agttatcagt agccatactg gatgaatgcc 22560aggaacttag aattaggaca catggtcatt
taggcaagtg gcttgtcctg tcaatggtac 22620cctgatagtc gtggggttgc cccgtacaaa
aagcgagagg aagtctacag agctgtcaaa 22680gaggggcagg tggaaaggcc tgcagaggag
tcccctgctc cacaaccagg cgtgcacctc 22740ccacatcctc ggggctgtag gccccacatg
agagcagaaa gaaggatgca gaggaaggcc 22800aagaacacaa ggtgtgccct tggaaaggct
gggcacacca aacacaacct aataaacaac 22860agcaatgagc acacagggaa agtactcaca
gggaaaccat catgaactag aggctgatcc 22920cacaccctgc cacatggggc cccaggcccc
agcctatcaa ccagtggtcc ttattgccac 22980agcgattggt ctttggatag gcacctgatg
caagcttcag ccaatcaaca ggccactcag 23040ctggccatca gtaggccatc caatcagagc
aaagcccagg actttcttcg actcttaaga 23100aaagagaagc aaagtaactg gcacagattg
gagaggatca aggaaccccg agctggatac 23160atacaaactt tgggttaaca tggatgatta
aatacatatg tttatgtgaa ccacctccca 23220aatatgctcc actataatga cacaagacaa
agggcagggg gagaccaatt gcaaggtggc 23280gcaaatgaga gatgctacca agggtggcgg
gggagagagg ggagcagttg tcaagttagg 23340aggcaacagg ctgagggaca gggaccagca
gacggggagg gaggggctga agcagaagtg 23400tccagtgtct ggagggatgg ggccagaaag
gcaaggggca tcctgaagaa gctatacctg 23460gggagggcag ctctctcccc acctgctccc
caattcatca gccaggaatg ccccatccac 23520cccaccccag ggaggaggac agaggacttt
cgtttgggag cattgaatgg ttcagagatt 23580ctgcaactct gcggtcccca actaaactgc
tcattgtttc aagcagtccc tgttgggtaa 23640atgtccccca ttgtaaccgg actcggattc
caccgcttga aagccaaata caagaggaga 23700ggtttggtgg gaggaaaagt ggttttaact
agagccagca aaccaagaag atggtgaatt 23760gttgttttaa agcattcaat tatctcaaat
tttaaaattt atcataggat tctgaaagga 23820aaacttggta tgggacatac gtgggagcag
tgcagggtac agggtctatg tgtcttgatc 23880caatggctgt cttgagtatc acctatcctg
aggtctggtt ggtgttatct ttccttcggc 23940cagatggtgg tgggtgaatt gtttcgactc
cccctaagtt ggaggattcc gcaggggttc 24000cgtgtctggt ttttgtttca agattagccc
ctggaattcc caaataagca tagagttaga 24060taagcgggca tggtgcaaag gagtgtctag
tgggaaaggg agagaagcag agtttcaaag 24120tacatttcaa ggttacattt taagactaaa
gaaaaagcct taaaatgcat ttttaaagct 24180gatttaatgc ttggctacac taggctgtgg
ccagtgtgca gtgtggctgc tcttggatca 24240ggtgatgttt catcagctgt gtccagggag
ggcagggcca tgtggcagaa cctgggacct 24300ctgtgtgagg gactaccttg gcccctgtcc
ttagcaggaa gctatggtaa ggaaccctta 24360gggagacatt aaattgggga gaccgtccct
gccaatcctt taacctcccc agcctcagcg 24420acctcagttg gaaagtggtg gtaataatac
taccactgac caggtgtggt ggccagacat 24480tccacacttt ggcttcagcc gctccctccc
cactctactg taatcccagc actttgggag 24540gaagaggtag gcggaacctg aggtctggag
ttgagaccag cctggtcaac atggtgaaac 24600cccatatcta ctaaaaagaa agtacaaaaa
attagccagg tgcagtggca cacgtgtgtg 24660gtcccagcta ctcgtgggtc tgaggcatga
aaattgtttg agcctgggag gcagaggttc 24720cattgagtgg agatcgagcc actgcactcc
agcctgggtg atagaacgag attctgtctc 24780aaaaaaataa aaataaaata ataataataa
taccactgcc tgccacacta agattgtctg 24840attagatgac agaatgaatg caaaagtact
ttgtgaatca taaatgtttt catcaatatt 24900agttataatg acaattgctc cttctcctaa
taaatgtatt gcctttcttt aggaataaat 24960ataacaagaa atgtgtaaga tatatatgag
aaaaataata aaattcacct gaaggacata 25020aaagaagacc aaaataaatg aaacaacaca
tacttctaga tgagaaaact caatattata 25080aagaggttag ttctctaaaa tgaatcccta
aacccacaaa gtcaatgtat ttccaatgaa 25140attgtcaaca gcattatttt ccgaagtggg
atgagtagtg ctaagattta taagaaagcc 25200aacattccag agcagtgggg aagggattgc
ttcaccacca aatagccata ttagagattc 25260ccttgcacca tacccaaacc accatctccc
aggacccggg agagcagaaa agaggaatga 25320gaagaaaggc gaggatgtga ggtgtgccct
cataatggcg gtgcacgcag cacaagcaat 25380tgcagaaaga ctaaagtact gaacaaatag
aaaacttgga aaaatattag aaggaaatgt 25440gggagaacat ttttgcaatt tggggattgg
aaacggtttt cttaacaaga tataaaaacc 25500ccaaaacaag aaaacaaagg ttgaaattca
taaaaactag atacttctgt atgatgaaag 25560acacgattaa tcaagttgtt aagtttagca
atagactagg ggagatatca tagtatattt 25620aacagacaaa ggattaatag atactacaga
tgaaatataa aatagtttct ccaagtccat 25680aggcagaaga taatccaata gcaacatagt
taagtaatgt aaacaaatca tccttagaag 25740aagaaatgca atcaccaaga aacacatgaa
aaggtgtcca gcattttgca attcaagcaa 25800caatgaggtg acagatcggc aaaaaactca
taaagattta tcatctgaag gattggccaa 25860gataaagcca aacttctcgt gttggcagaa
gaaactggtg aagccatgtg aagaggccac 25920gtggtcctgc ctaccaagat gtaaaatgtg
tacagcattt gaactagcaa ttcagcctcc 25980aggagccatc cagaagaaac actgacacac
acttagactc cggtgaaatt caaggacttc 26040tgccacagcc tgcttcgtaa tagtgaaaat
ctgaaactgc ctcaatgacc gtcaatagga 26100agttgatttt aaagtgttac agcacatctg
tctggagaga tcgcactggc cactcctcct 26160caccccctct gctggacctc tgagcgtagg
tggcctggag ctgggtcctg agccctcttt 26220ggtctatacc gacactaccc aatatggtag
ccaccagtca cgctggacac ttgaaaagtg 26280gccgatcctg actgagaagg gccacgagtg
ggaaaaacac accagacctc agtgacttag 26340gcagaagtat gttttgttcc agactattga
ctgagcccgc agctgagttg gctccagcac 26400cctggccccc tgctccatcc actcactggg
actccccact gcacagggca acctctccag 26460gggcacttgg gctgcgaagg ggagagtggg
tggcatccca ggctgaagct tcctgagcag 26520ggccagagga ggagccagtc cctgtgggcc
tctgttctga cagtgtcaac ctcagccagg 26580cttgtgtggg ccaggtgtac tgttctggtt
cagatttcaa ggagatagtc agggcaggcc 26640gcgccaaagc cctccgatgg gctcccctac
tgcctggcag acctgtccag ctttggactc 26700tggccctgcg acctggaagt caggctgcca
agaggtccag gcagtggcct ccactgtgga 26760gggtctctgg agagtttaca gccctagata
ggggggttag ggatgtgaga tggtcccagg 26820ggcctgctcc tgagccacgc caagctgcct
gctccctttc ctctgcttcc agactcacgg 26880gatcctctgc tcatcagaac aggagtgtgg
gagaccctga gacactgccc caggatctga 26940acaggtggca aaggcttaac aggctagcgg
tcactgtagt gacaaggcga ttgagtggtc 27000accatggtga tggggatgga ggctctttgc
caccagtccc agttttatgc atggcagctc 27060taatgacagg atggtcagcc ctgctgaggc
cactcctggt caccatgaca accacaggcc 27120ctctcaggag cacagtaagc cctggcagga
gaatccccca ctccacacct ggctggagca 27180ggaaatgccg agcggcgcct gagccccagg
gaagcaggct aggatgtgag agacacagtc 27240acctgcagcc taattactca aaagctgtcc
ccaggtcaca gaagggagag gacatttccc 27300actgaatctg tctgaaggac actaagcccc
acagctcaac acaaccagga gagaaagcgc 27360tgaggacgcc acccaagcgc ccagcaatgg
ccctgcctgg agaacatcca ggctcagtga 27420ggaagggtcc agaagggaat gcttgccgac
tcgttggaga acaatgaaaa ggaggaaact 27480gtgactgaac ctcaaacccc aaaccagccc
gaggagaacc acattctccc agggacccag 27540ggcgggccgt gacccctgcg gcggagaagc
cttggatatt tccacttcag aagcctactg 27600gggaaggctg aggggtccca gctccccacg
ctggctgctg tgcagatgct ggacgacaga 27660gccaggatgg aggccgccaa gaaggagaag
gtatctcgcc ctccattggg cattctggga 27720gtgtttgctt gcctgtcccc aacattccat
ggtttgtttg agcctcagaa tctgatttta 27780tgcacaggct ctttgagaag ggtcttgcca
ggggtgcctt ctggggcagg aaggccccta 27840ctgcctggca gacccatcca gctttggact
ctggtcctgc gacccggaag tcaggctgcc 27900aagaggtcca ggcagtggcc tccactgggg
aggggctctg gagagtttag agccctagat 27960gtgggggtta gggacatgag gtcttgtgga
caaagcccac tacctgattt tgagacaaca 28020ctcactagac atggtgacaa gtcaaagatg
ccttgcctcc taccaggaat cacttcgcag 28080ggagcccgag ggctgctgtg gcctgctgag
gagtgcaggg cagttacttt ttccaaaaac 28140aaagagaaat ccaggcatgc tctgagccag
ccctgagccc agcagtgagc aaggagagag 28200ctggagacag gggactttgc tgtgaaacac
tggggggaat gtgcctgcat caccccagct 28260gggggcccag gcagagtggg ggagaagggg
taagtgggca gagccagtca ctttgggcat 28320gcttccctct cgcctctgtg tgaaatgacc
aggtcagcat aaaccccggg ctggctgtgc 28380ttctggcaga gctaatgatg ttaggaggaa
aacaaccaac ccaagtgaga gggtgcgcag 28440ccagacagct ggaccggccg aggccccaac
caagtcccag atctgcctgt cactggtgct 28500atggcagcaa tttggatgag aaatcctgcc
caaagggccc cttcaggcca cccggggaga 28560aggaagcggc tgtctttggc atgaccagaa
agatggctcg gagctaggga gaggtggaca 28620tgtgggctgt ggagatctgg cactttcccc
aaacaaggag agaaagcata gtgtgcctat 28680gtgtgaatgt gctatgtgtg catgtttgtg
cctgtgcata cctgcatgtg tacatgcatg 28740tgcacatatg tgtgcacagg gaatcacttt
aataaaggcc acagcagagc tgtccctgag 28800ccccttgcat tcacagtggc atgtgagtga
accaccttct taggctgggc atccagtctc 28860agactctggg gctgcccatg ccccatcctt
tatctgctcc acgtgtgagg ggttgctggt 28920cctgaccagg gccagctgtg aaccccagaa
tcctgggaag tcactgacat tcttgtcagg 28980gccaagagtg gagcaaggca atgcctcggg
cacaaacttt aaggggtcac cagaaacatc 29040aatcatcaag atatatgcta ttttaaataa
tcaaaatgaa tgcaaaaaaa atttatgatg 29100gacaacatac caaattctaa acaaaggcag
gatgagtatc actggcttct gcacttttct 29160ccacccagtc tacccctctt ctagtgcctg
gatcgcaggg tgccaaggcc tggatgaggg 29220aagcgtggag ctgcaatggc cactcctgtc
tgcctgttct ggctgcacag aggactcagt 29280ccttgtcttg ggggaaccta tcttggtttt
agggtcatcc taaggatctg atgttttcca 29340agtgagctgg ctgtccaggc ccacccaggt
tcagtccagt cctgtgtctc tgggaagtgc 29400tgcccctacc ccaagccagt gtttgacctt
ggagcaatga gcaatgccct ccttccactt 29460tcaaagttgt ccccaagacg tcagctgtgg
ttgtctctgt gcagacaccg aggaggaact 29520gtcttctttc tccttttggt tgctttggag
gaaagtaaag tgttgctggt ttccctcttt 29580ctacttcttt gattgagagc agccgtcttg
ccggtaccaa ccttccagat cttacctgtg 29640gttgcaggag cctgtggcct cagtcctgtg
cccagtgact tctccatgtg gatgtcagct 29700ccttaggggc aagcctgatt ccactgacac
tactcccacc cctcataagc cccttcttac 29760cagctgcagt tgcctggtac cccaccatcg
ctgactcatt cctttggcat caaggttcat 29820cccttactgg gccaccactt ctgggtggcc
tgaaataggg ccctgggcat ccctcttggg 29880gaccttttgg tctatatttt cactctcacc
tcactaagga cagatgagta aatctggtta 29940actttgcctg atagatttgg tgaccttttt
tcaggaagga gcctggaaag atgagattca 30000ggtgtattgg tcagcttaga ctgccataag
agaataccat ccactgatgg cttagaaaca 30060acagaaatct atttctcact attctagagg
ctggacgtcc aagatcagat gccagcatgg 30120tcaggttgca gggagggctc tcttcctgac
ttgcagaccg ccaccttctt gctgtgtcct 30180cacatcgtgg agagagagtg aaaacaagct
ctctggtgtc tcttcttata agaatgctaa 30240tcctatgatg ggggctcccc ctccttacct
catctaaacc taattatctc ccaaaggtct 30300catctccaga taccatcaca ctggggttag
ggctttgaca tatgaatctg gggggacaca 30360attcaatctg taacaccagg agggcatgcc
gggaggaact gaccttcctc cctccagctg 30420ccctggacac ctttgcccca ttgaaggagc
aggctcagaa gtggaatgag gatggaataa 30480ggtgcactcc atcatgctta cccacatccc
tggcaggaat tgtcctgggc cccagcagga 30540gagatgcccc cccatactgc catggcacct
gctctgagac aggtgtgcag agtgcaaagc 30600tccaggtggc ccccaagcag gtgtgctggg
aggaggggcc cgtgtgggag gagcaggcag 30660cgccaaggcc tagcggagca gtgacaggtc
cctgacttca gggaatgggc acgctgtggg 30720caggcagctg gtgtgggggt gagggctggg
gctgcatctg tgggaccagg gctgggccat 30780ccatcatatg ccgtgtcaca accccagtgc
ccctgctgta gccaggacag gaggctgggc 30840caggctggga ggtgacaaga gtgggggctg
tccccaggag aagcactctg ctgcctgtgc 30900ccaggcctct ggggatgagg acccctcaga
aggagtagct atgtctagga agccccaggg 30960caggagcaag ccaaagggga catcattagt
gagatccagg ggatcagtgg gccacagaag 31020ccccagcgtg agcccctctg actgatgcag
ctaggcccac acctgcacct gcccacagca 31080agacccccag gaggagaggg gacagatgga
gagaggcaca aagtgcccct ggcctctgcc 31140ttgaagccac cccaaggcaa gagagatttg
agcccctgtt tagtgacctc caggggaaca 31200ttctggccca tctgatgtgg gaagcccctt
gtggagtctg tcattcctca gctgagccag 31260gcctttggag gcagcccagg catgtcccct
gtgtgctcct atccctgtgt tgggacacct 31320ggcccagccc ctccttctgc ctttctcttc
ccttcccttc tcaggagtgg acacttcctc 31380ctttagcccc ctcacagctg tgtgaacttc
tctgtatctc tctctttctg tctctttctc 31440cccctctctc tctgtctcat tgtctctctg
tgtgtctgtc tgtagtattc tctctctgtc 31500tctgtcactc tgtctctctc tctctctgtg
tctacctttc tgtatttcgc tttgtttctt 31560tttctctgtg tgtgtgtgtg tgtatctgtt
tttctcactc tctctctgtg tctatctttc 31620tgtatttcgc tttgtttctt tttctgtgtg
tgtgtgtgtg tatatctgtt tttctcactc 31680tctcaatctc tctctctctt tctgtctctc
ttttgctggc ctgagcaaag agggagcccc 31740atcctgatgc tacataaccg tgaaccagca
cagacagaat tgtaggaaag tcctgcaagt 31800agaaggatag aaggatgagg gaagaaacgc
catgtgagtc atgacagatc cctttccagg 31860agccactgac tcaccctgcc tcctgccctc
ccactgtgac actattactc acagacaggc 31920ccggattaaa cctatgttcc aggtgccctg
tggttcccac agtgtggctc cctgggtctg 31980gcctcaggct ccacaggtgc ccagccctgc
caaagtctcc agagcagctg tccagctggg 32040gagctgcggg gccccttcac agagcgcatg
ggaagaagtt ccatcctaca cattacatcg 32100agagggacgt gcctgagaag gggagctgga
gcccgtgcag ccccctgctt gcgtgcagaa 32160catagtgtac cctgagcatg ccatgaaaaa
cacaaacgca caaagttgta aagaaaaaag 32220aaatgacagg tggctgtaaa atcagttata
gcccacgaga ggcccactaa tgagtggtga 32280tttcagctga ttacaaagaa atgatggtgt
ttctgtaatg aactaaacat gcactcgtgc 32340gtgcacacac gcgcacgtat agtcacataa
ctgaccagcc ctatgcatca cttgttaatt 32400acttagtaac tgtaacaata atagtttcca
ataagtgagc cttagtctct gcgcaagggt 32460cagtttattg agcacacggg ggccttgcag
tgggggcagg tgatctgctc ctgggagccg 32520ccagcctctc ctctcctgct cttcatcttc
ctccgtggtg ggaaattgtc tcactgcttc 32580tacacctgag gctgaacatc tccctttatt
tcagtctgaa acacatgtaa aaatatactg 32640gaatgaatta aggttgcaat tattgatatc
aggcagtgag tacatcaggg tttattatac 32700tatctccttt acttacttcg aagttctcta
ttaccaaaaa attaaaaact ataaaagaaa 32760gaaaaaggaa atgaggctag attcaacaca
gattactctt accaaaccct tcgtagtccc 32820aggagtcccc taacacaagc acttgtgacc
tggagtgata ttcacagcat tccttacctg 32880gcaatacctg agtattagcc cccccagtgg
gatctttgtt gtagacaacc agcaactatc 32940agcccagcca ataaacaagt aggaaagggg
agtgctggag aggccaagaa gtgggatttt 33000ccatgctcct gggctgtgat ccagagggca
cggctgtgag gctgatctca atgaacactc 33060tgtcttggaa gtacagggat cctctgctac
ctgaaaacgt tctgagtatt cactttcatg 33120gattgcaaag tcatttaccc aaaattcact
ctccaaatga aaagtgagta tgatgaatca 33180gtattcaagt tccacctggg tcctgggaga
gggcatggac atcatatccc agctgttccg 33240acaggaggac ccaatctgag tctcactgcc
tgcctgcatc gtttgtctgc tgccagcctg 33300cacagtagga agggaaaaca tgatttgtat
ctgttttagg tcaggttccc aagaagtaga 33360gcctgagatt ggaattcttg gaaaatggtg
tttgcgggag cgctgtcagc agaagctata 33420aggaagttgg ggggacagaa aacgagaggt
aagaagccag tcaaaaaggc aggtccagct 33480taagtccgcc tcagtctggt tccacaaggg
ctctgatgca tgaagaatat cacagggttg 33540tccctcctgg gagaggggcc agcctattgt
acctgtatca aagccaccag ctgagggcca 33600gtggggaggg aagatcttcc aggcatttcc
aggaaactct caggagaagg gtgtagctgt 33660gagcagtctg cagctgctgc tcactgcggc
taaaggctgg gtgtgcaggc cagtcagcca 33720gtgaggtgcc aacagcaggc actacagtcc
accccttgac tgctcagacc tactgctttc 33780cactttaagc tctctccatc caggcacagc
ttcagggaaa acttacaatt ggagaaacag 33840agggatgaac tacaatgccc acttctgcat
gtgattgtaa gactgtcact gatactcacc 33900atcatgcccc atccccacca tccattctag
tgtccccttc cccttggcta acactgctgg 33960tctaggtgac ttccctagag caggagccaa
acccttatcc ctgaggcatc tgaatcctgg 34020attcctttat caggctattg ttgttgtaag
ttgtccattc ccaattacaa ctggacatga 34080gactaccaag aaacaccctg gcaaatcatc
tgagtgcaag ccatattctt cctgctccat 34140tatgtagcgg tagtcctacc tcctaatgac
aagggtaaat tgccacattt tgctccttgt 34200gccaggatgg taataccttt ctctacctgc
ttggctactg gcacaaggaa gcacagcatg 34260accaggaggc aattgtagct gtacatttag
tgaatgtgtt aatgtatcac ctggtggaag 34320gaccccctct gagaaccagg acttctagac
ccacaaaacc taaagttgtg aatggcggaa 34380gcacaaattt cccaagtgga tcatggagag
tgatgaagag ttcttggttc ccaaacccac 34440atattttacc tttcaggaac atggcctcat
cccatagcca ttagagtgca tattgcattc 34500tggaggagac tgggccctcc tcatgggtgt
catcttcaag atgacagctc cactgtgcct 34560ccaagaggat gctccaccac cctatctgtg
attccttggt tagcaggaca ggctgctgca 34620ctgagggtag gaaaggcaag tccattgatg
gctggaatac atgtcaatcc aagtcaagag 34680aaaatgccgc cctttccagg ttggaagggg
cccgatttag ccaacttgtc acccagtagt 34740ggctggttgg tctcctccag gagcagtgtt
ataccaggaa ttcagcacca gtcgctattg 34800ctggcagttc ttacattcaa cagcagcaaa
actaggtcag ccttgatgag agggaatgta 34860tgcttctggg cacaggcatg gcttccttct
ctgactccat gactatctat ttctgagtgc 34920atggtggccg acattcagct gcctgcccat
cctatccact tggttattat tgcctcttcc 34980acaagaagtg gtttctggct gtcattaatg
tctcatactt tgtgcccact cacacaggtt 35040tagctctaca acttttcccc atgccaccac
ttttccacaa tcttctaatg ttgctccttc 35100caagctactg aagaacgagc taagctattc
accaatgtcc atgagtctat atttacctta 35160ggccacatct ctctccacac aaagtgaata
agcaggtgca ccctccaaaa ctctactaag 35220aggatttctt ctccccagtg tctttcaggg
ccaccttgag tggggctgaa gtacagcaga 35280agtccatttc cagcttgcat caacattcca
aactaaccta tccatgatca atgcatagat 35340gggtttttcc ctcctccagc agctagacaa
aagacacccc ccaccaggag gccatatttg 35400catgtgggtg aaagagaggc acaggggcca
atattcgtgc aacagtggta gatggcaggt 35460gggtctgggc cacctgtccc tgcagcttat
ctgtgccatc tggacctgct caagcctgat 35520tccagatata ccatttccat cttatgatgg
atggcttatg acctagtggg tctgacagca 35580ccaaactcat aatgggcagt tatggccaca
tggtcactta atgtcctatg gtcagacact 35640ctgctgagtg gcatgccagg aaatgcttta
caagtggtgt ttggttctct gctgcagatg 35700gcatgacctt ggtccggagc cctaggggtt
tggacagtga ctcctgttgg ggcctaatct 35760cacattccat gcagagtatc atcagatttg
ccaatcacat agcctaaggg tcaggactga 35820tccaaccagt ttttgcagag atcaaactgg
agaatgaaag gttgatatga tgtgaccatc 35880atatcacgtt tttctctctt gaaaagtatg
cagatgtctg aaagagacaa gtgccccagg 35940agaaaatgca tgccttcctc aggatcggcc
cccacctccc ctcctggcca caaggagggt 36000caaatctcag catggcccaa cttggacctg
tcaaggaaga agaaaaaaat tgtatgccaa 36060aggaactcag tctttggcta acaagtacta
gacatccttt aagtctttga gaatggtaat 36120aatttctgcc atccctccag atttgtgttt
ttctgttttg gctgggtggg aatgcagcat 36180tttcactttg cctttgttat tacaaatgtt
gcttattcta taaatcaagg aaccattgta 36240agggctcttc tgatggttaa gtatatccat
tccaatgatt tattcgggat ccaaggaaat 36300gatttctggg tgaatacaca gaactagtgg
atccaatttg agacatacct gggccagaac 36360tatatttgtc gtcttacccc aataagcctg
cactctacta ggacagccat gacagcactt 36420tgggacccta gatataagtg tgaattgctg
gctgggcatg gtggctcacg cctgtaatcc 36480cagcattttg ggaggctgag gcaggtagat
cacctgaggt caggagttga agaccagcct 36540ggccaacacg gtgaaacccc atctctacta
aaaaatacaa aaattagctg ggcgtggtgg 36600tgggtgcctg taatcccagc tactcgggag
gctgaggcag ggagaattgc ttgaacccag 36660gaggcggagg ttgcagtgag ccaaaatcac
accactgcac tccagcctgg gtgacagagc 36720gagattccat ctcaaaaaaa gaaaaaaaaa
agtgtgaatt gctatgaaat cactatcaaa 36780agatctgagt gttaccctta ctcagtgtgg
tcgaatataa atagccatag gttcctgtta 36840tacacacttg ctgtggtgct acagagtctt
tcctcatggg aacccagtcc ctctttcagt 36900caatgggttc tggttcgaga actggctgag
gtttggaaac tgtgcctttc catcataact 36960ttccactggg gtgactgacc ttggccttct
gttcatcctt tctagcccct aagaatccaa 37020cactctatta gccttctcct tagaccccta
taagctaatc ccttctagtt gttagtctga 37080ccttggtgcc caatatgata attattccca
ctttgcttct gatatgcttc taagtgctgc 37140ccctggtctc tgcccttaag tgatctatca
tccccactgc cattaggggg agaagctctg 37200aaaaagagtt gtctcccatc aactctggtc
tacaaaggac agccctactg agcctcagcc 37260atgtgcccga caccagcaga ttctttacag
cctgggaagc agagtgtctt ccctgccttt 37320ccagggaaca tagccagctt acaggctttt
tgatcttata gagtaggtca gttatatttt 37380gccccatttc ttttatcctt ttgatcactt
cctcttggcc caccatgtaa actcaagcat 37440ccctgcttca tttaatcgag ctgttgcttt
ttctaagcta ccaagagcaa ccccagcaat 37500atatcagagc cctctcttgg gacccttgct
agggtgttaa atcctgcatc ataggagaat 37560gcccccacat cagcaaagtc cccttatcct
cttgatatcc cacctgcccc agtccagcac 37620cttcaggatc tggtctcaat cacaggatcc
agcacctttg ggactgttgc aagcataaga 37680tccagcactt ttgggatcta gtctcccact
tcctgctagt acttgttagc caaagactga 37740gttcctttgg catacaattt ttttcttctt
ccttgacagg tccaagttgg gccatgctga 37800gatttgaccc tccttgtggc caggagggga
ggtaggggcc gatcctgagg aaggcactca 37860ttttctcctg gggcacttgt ctctttcaga
catctgcata cttttcaaga gagaaaaggc 37920ctccttctca cagcaagact acttctgtag
atgcaggtgg ctcgtgggaa tctggcaatt 37980caaaattctc aagtgtactc actagcacat
tagaaaacca gtagtacaca tctctttcca 38040aatcttcatt cagtgacact atgtcagtag
ctggaaatgg gccatggtgg gtgtatttaa 38100accatgaaaa tcagaaaatg ctacaaacca
gggcatcccg catctctaga cagcagattg 38160ttggccattt cccagcatac cattgtgtat
actccttccc atcagggccg tggcttgcct 38220tggtggagga ctcagccctt gctgaagttc
tgctactgct cttacaattg agtcctatgc 38280ctggtctcca gctctgcctg cctcactaca
ggagacaagc atctctttga acactgccga 38340gaagaccctc tggctctcag gcttggcttt
aaatcgatag acctgagcct gccattttct 38400cttttccatg catcactcca ctgatccaca
ggtctcagtg gcatagtcct tcgggttagc 38460atctccccca caccctcggt gccagagaca
ctgagtaaga aagtacctcc ctgtctaccc 38520ccatccccgc tccccacagg cagggccttg
gcgatccact gctgcaatgt gccagagact 38580gtcagtactc ctaccaccag tgaggtggca
accagctggg aagtgatcca actccagagt 38640cccgccctca taggctgatt tctaggacca
cccctggtat actgtgttag gttcttgaag 38700cagagcctga gataaggatt ctggcacctg
tgattgagtg ggagggtgct ctcaggatga 38760gatggggtag aaataggcaa aggtacagat
tcagcagcag ttgagcctca gtctgaccca 38820gcagggagct ctcaaatgtg aatgacatca
cagagttgtc cctctgaggc aggggccagc 38880ctttgtgctc ctacatgagt cagtcactgg
ctggaggccc ctggggaaag gctagggctg 38940ccagctttag caaataaaaa attagggcac
tcagttaaat tgaatttcag ataaacaaca 39000645980DNAHomo sapiens 6actagcacat
tagaaaacca gtagtacaca tctctttcca aatcttcatt cagtgacact 60atgtcagtag
ctggaaatgg gccatggtgg gtgtatttaa accatgaaaa tcagaaaatg 120ctacaaacca
gggcatcccg catctctaga cagcagattg ttggccattt cccagcatac 180cattgtgtat
actccttccc atcagggccg tggcttgcct tggtggagga ctcagccctt 240gctgaagttc
tgctactgct cttacaattg agtcctatgc ctggtctcca gctctgcctg 300cctcactaca
ggagacaagc atctctttga acactgccga gaagaccctc tggctctcag 360gcttggcttt
aaatcgatag acctgagcct gccattttct cttttccatg catcactcca 420ctgatccaca
ggtctcagtg gcatagtcct tcgggttagc atctccccca caccctcggt 480gccagagaca
ctgagtaaga aagtacctcc ctgtctaccc ccatccccgc tccccacagg 540cagggccttg
gcgatccact gctgcaatgt gccagagact gtcagtactc ctaccaccag 600tgaggtggca
accagctggg aagtgatcca actccagagt cccgccctca taggctgatt 660tctaggacca
cccctggtat actgtgttag gttcttgaag cagagcctga gataaggatt 720ctggcacctg
tgattgagtg ggagggtgct ctcaggatga gatggggtag aaataggcaa 780aggtacagat
tcagcagcag ttgagcctca gtctgaccca gcagggagct ctcaaatgtg 840aatgacatca
cagagttgtc cctctgaggc aggggccagc ctttgtgctc ctacatgagt 900cagtcactgg
ctggaggccc ctggggaaag gctagggctg ccagctttag caaataaaaa 960attagggcac
tcagttaaat tgaatttcag ataaacaaca aattattttt tagtatatgt 1020cccaaattgt
gcataacata atgtgttttc tccgccagcc ctgggaaggg cgtaacttcc 1080caggtatttc
taggtgaagt aactttgtag atcaggagta agtcccagga aagaagtcca 1140gctcttctct
tcagccctgg gcagctgggg gtaggcacag gggcccagca ggcacccata 1200gcatctccta
cagcatctga aatgaacagg gtcatcacgt actacataca aatgtaccca 1260ctgctgagtt
cttcagggat tatatcatta ggtacttggt attttaaata cattacatta 1320tgcagaagtc
ctttgtggat tgctatattt ggagagtttt gtgatattgg ggggattaga 1380tggagttttc
agatgggcat catacggttt ttcatttaaa accctagagt attgtaatcc 1440tagggagtga
tcctgcgatt agtaaattag ctctccaata gattttcaat gtggttgcaa 1500aggacatgca
tgtggttcac cctcccagga aatccagaag ggcagcattg gcctgagtgg 1560cctgagtttg
gctggttggg ctggtaatgc tggacaaaga caatgggtgg aatggtttgc 1620ttccctcagt
cctttcagac acagcccagc ccaccacgtc aagccagtgg gtgcatctgc 1680aaccaatccc
catgagaact gcagcctctc agaggtgggc aagttggccc gggtgggtca 1740ggaggatcag
atgttgagga aatctttgga ttggaggcag gcagagcagg gaagcatcgg 1800gtgattctat
gacagaccca gggctccaag ctgcagttca ggaggggcac tggcacggcc 1860tctgctcaac
tcccccttga gtgacatcag gtgaagtgcc gacaacacag aaggcagcaa 1920atgctgccag
tcaggtctgc ttcccaggac agccagttgc taacccttct ccagcacagc 1980actggatttt
ggtcacctgg ctgggagctc cacctcccca gctgctgcct cacctgcttt 2040tccaaacccc
accctgtaaa cggtaactac attttgtgcc cactacgcct cgtttccatc 2100tctttggagc
acctctcacg tggagctgaa cagaacgacc tgttaagccc accgtgtctg 2160ttagggttgt
ctaggctgta tcagataccc aactaaaact ggattcacca acaggtattg 2220tcaaagcaca
taagaaagag tccagaggca ggcagctctc agcctggtgt caggctctgg 2280gtcagctttc
cagattctct taaccttccc cacatctgcc agatgccgcc acaggcacag 2340gaggtacaaa
caaacccaaa aatgttctgg aaacaagaag ggaaggggat ccccaccata 2400tctccccaga
ggccttcctt ctcacatctc actgtactga agccagctct agcagaagac 2460agcagggtga
atttgtccag ggtattcagc ccccagtgct gggtccatta ctacttgacc 2520cctgaataaa
acagaggttc catgagcaag aaggaagggg aactggatgt tagagggcaa 2580gaatgtatcc
atcccacccc taggagcacg catggacaac tgccccattt ttgctcctat 2640tgcagcccag
gggctagccc agagaccttg ccagtgctga gtcacaagat gctgggaaag 2700tgagaccaga
gcctggtctt ggggaacagc tcaaggccgc attggtctgc aggtcataga 2760gcagctgctg
agcagtgaga gcccacgatg ggccaggccc tgggtcttgg agacctgaat 2820gagatagact
gggttcctgt tctcctgggc attgcctctt agagggcaaa gacaattaac 2880aataaacaaa
tagaacatga agtgttttcc gatagtgact gatatacttt ggatatttgt 2940cctctccaaa
tctcatgttg aaatgtaatt ccttatgttg gaggtggggc ctggaaggag 3000gtgtctgggt
catgggggca gatccctcat gaatggttta gtgccatccc cttggtgatg 3060agtgagttca
cgtgagagct ggttgtttga aagagcctgg ccccctctca ttctcctgct 3120cccactcttg
catgagacac ctgctccccc ttctccttct gccatgattt taagattcca 3180gggacttcac
aagaagcaaa tgctaacgcc atgcttcttg ttctgtctgc aaaactgtaa 3240gccaattaaa
cctcttttct ttgtaattta tccagtcttg ggtatttctt tataacagca 3300caagaacagc
ctaatacagt gatgctctcc aagtgacctt tgggctgaga cctgaagaag 3360aaggggaagc
agttaggtct gatagctcat gcctgtaatc ccagctcttt aggaggctga 3420agtgggagga
ctgcttgagc ctaggagttg aagaccagct tggaaaacat agcaagaccc 3480tggctctaca
aaaatatttt ttaattggcc aggtgtggtg gtgcacacct gtagtcccac 3540ctacttggaa
ggctgaggca ggagcatctc ttgagcccag gaggttgaga ctgcagtgag 3600tcatgttcac
accactgcac tccagcttgg gtgacagagc aagacctgtc tcgaaaaaga 3660agaaagaaga
aagtaggaag aagaagaaga agaagaagaa gaagaagaag aagaagaaga 3720agaagaagaa
gaagaagaag aagaggaaga ggaacaagaa caagaagaag aacaagaaga 3780acaagaagaa
gaacaaggag aacaagaaga agaataagaa gaagaaggag aagaagaaga 3840aggagaggaa
gaagaagaag aggaagagga ggaagaggag gaggaggaag atgaggagga 3900ggaagcagaa
gcagaagaaa aagaaagaaa agaaagaaag agaaagaaag aaaagggaag 3960gagggaagga
aggaaggaag gaaaaaggga aggaaaggga aggagaggga gagggagaag 4020gaagaacaaa
gaagaaagaa ggagaagcag aggcttgtgc tggatagcct tgcttttgcc 4080aatgaccttg
ctgattttca gggggtcctg gtgtcttagt ccatttgtgt tgctgtaaag 4140gcatacctga
ggctggataa tttacagaga aaagaggttt atttggctga gagttctgca 4200ggctctacaa
gaagcatggc accaatgcct acttctgatg agggcctcag tctgcttcca 4260ctcatggcag
aaggtgaagc agagcctgca tgtgcagata tcacatggtg agagaggaag 4320cacgaggggg
cagggaggtg ccagcctctt cctaatagta agctgtcttg agaactaata 4380gagtaagaaa
taactcacac cctgccccca aggaagggca ttaatctatt catgaagtat 4440ctgcccccat
gacccaaaca tctcccatta ggccccccac ctccaacatt gaggatcaaa 4500tttcaacatg
aggttccggt gggcaaacat ccagctataa tactgggcaa tgctgaccag 4560actcttcccc
tctcaggccc agagctcctt ggccctgtaa caacagaaaa ttgcgtttga 4620gtgtcaagat
ttttccttta gtccccatgc agctccttag aatgaggtgg catcttctcc 4680cttttcatag
gtgaagaaac agaagctctg gaggaacgaa tcattcatcc aaggtcaggt 4740agctagtaag
cgtcccacca gctccccaga tctcctgttt cctgtcccaa gtcccactga 4800gtgagctgga
acaatggctt cactggcacc tgccgggaat ggtggcaggt gcctataatc 4860ccagctactc
gggaggctga ggcatgagaa tcacttgaac ccgggaggca gaggttgcag 4920cgagccaaga
tcacaccact gcactccagc ctggataaca aacggagatt ccatttaaaa 4980aaattaacat
ataatataca tacagtaaca ttcacttttt aagtgtacag tttgatgagt 5040tttatcaaat
gtatatggtt atataaccac catcaccatt aaggcagaat cttcccatca 5100ctcaaataat
tccctcagcc ccacctcttg ctgtcaatca cttctcccac cctagccact 5160ggaaatcatt
catctgtttt ctgtcccctt ggttttgcct tttctagaat gttctataca 5220tgagaccact
gagaatatag tcttctgtgt ctggcttctt tcacttaaca taatgcctag 5280ctcagcagtg
tgtcaatcct ccctcccttg ccattgctga gcagtgagta ttccactgta 5340tggctgtgct
acggtgtgtt catccattta ttcattcacc agctaatggg catttggatt 5400gtttccaggc
tttggctatg atgagtgaag ctgctgtgaa tgttcaagta caagtctttg 5460tgtagacagg
ggttttcaat tggcgggata aatacctagg agtagtatcg tgtggttaag 5520cgtacgttta
aacttagaaa aactgtcaaa ctgttttcca atgtggcctg taccatgttg 5580catttccatc
agcagtgttt gagaattcca attgctccac atcctcctcc cgacacttgg 5640tttcacccat
cttttaaata ttagccactc tggtgactgt gtagtgatat gtcagtgtgg 5700ttgtaatttg
catttctatg attgactaat aataatgttg cagatatttc tgtatgctta 5760gtgggcattt
ttggtgagtt tttaaaaatt gggttgttgt caccgtctta ttgagttgga 5820agaattcttt
atatgttctg gatgtttatt catgtgtgtg tctgctaaga ggtgagactg 5880gttctaccct
ggtcctaaca agcaccctgg gcctgcatcc ctttttgtgt ctgtgagctg 5940ggtctgcagc
cctctcctcc cactacctac tgcccagcag tacccctcac ccatcactgt 6000ggctcctgca
atgacatctc agcctgtctc tccctccctc cagctagcca gaggcaggat 6060ggctcagtga
cacagggtgg gccctgaaga cagagtgcca gggtttggac cttgtattag 6120caagagtcac
aagggaaact tactttatct ctccatagct ctgttgtgag gatccaataa 6180attaatccat
agaagagctt aggacagcac ctggcacaaa gtatacatga gctattatga 6240tgttattctt
ccaacccatt gtttctgtgt tgtcataaac atgaatgcag gactcagtgt 6300cccagctctg
tgtccctcgc atacattccc taacagccca caggtcttgc ctgtcaccgc 6360ctcattcaat
aagtgatgac tctgcctctt ccttggctgg ggccttgcat tggacatttc 6420tgtatccata
tttgtttttt aaaaactagc tgttggccgg gcgcggtggc tcacatctct 6480aatcccagca
cttgggaggc agagacaggt ggatcatgag gtcaggagtt caaggccagc 6540ctggccaaca
tggtgaaacc ccatctgtac aaaaaatacg aaaattagct gggcgtggtg 6600gcatgcacct
gtaatcccag ctacttggga ggctgaagca ggagaatcgc ttgaacctgg 6660gaggcagagg
ttgtagtgag ccaatatagc gccactgcac tccagcctgg gcaacacagc 6720aaaactccat
ctcaaaaaaa aaaaaaacaa aaaacaacct agctggactt gacactcttg 6780ttagaggaag
atttttccac atctgttaac ttttcttcta ttgttatcca tctgtgcagg 6840tttttctgtc
ctcctgagtc attttgataa tttatattat attttgaaaa tcatccattt 6900cctatagttg
tttattagtg tcttctctgt tatatttgat cagattacca aatcttgctc 6960attgattgcc
catttatttt attgtgttta tttttttgag acagggtctc actcgacagc 7020ccaggctgaa
gtgcagtggt gcaatcatgg ctcactgcag ccttgacctc ctgggctcaa 7080gcaattctcc
cacctcagcc tcctgagtag ctgggacctc aggcacacgc caccacagct 7140ggctaatatt
ttatttattt atttatttat ttatttttgt agagatgggg tctcactatg 7200ttgcccaggc
tggtttcaaa ctccttggtt caagtgatcc tcctgcctca gcttcccaaa 7260gtactgggat
tacaggagtg agccaccatg cccagcccct atttacttta tagtaagtgc 7320cttcatgggc
ataaatgttc ctctgagaca gctttggcta ttagccatac ttttaatatt 7380ttgtacattc
atggttattc atttataaat ggtctgtaat gcaatgcaga tttccccttt 7440ggcccaaatg
ccatttacag cagcactttt ctctttctga gcagacagaa tattttggtt 7500tcccctctgt
tgtttatttc tcgtctgcct cgcctcattt gctaggtgtt cccttggtgt 7560gccttaagta
tgagccactc aaatatttgt gtttctctaa acacccctga cactgtcctg 7620ctggtttctc
tatctggaat atccttccct tcttggccag ttccccctag tgcatcaaag 7680aaatcctgct
cttttgcctt cagaaaacaa aacaaaacga aacctatcag tctccttatg 7740tccccaaaga
catagctttg ctggtatctg gttgtattga gctgttcatt tgtctcttct 7800gctagatggt
aagctccttg gaaactaaaa actaatcact tttctaactt cagactgagc 7860acaaattagg
ttctcaagaa acattgaata atgagtgatc cggtatcccc ttccaacata 7920tttttggtca
ttgataccat cattctgagt agttactagg gaacacttca ctgcagtaac 7980caatacagca
aaacgtgaaa tacagttaca tagtagaatt gtatttcttg cccatataat 8040agtcaagtgc
agttcttcat cagctgggag gttctcctcc acacagtcat ttaggaatcc 8100agggaacata
gcagaggttg ctagctctag acccaaaccc atgtcctctt tgtccacagt 8160gaggacaatg
ccagcaacag ctggccagct gttctgtagt tctcagcctc cctcgcagtg 8220agatgtctcc
atgcaatttc agtggagcaa catataccat ttccatttcc aggtgtaggc 8280tcctaagaag
agggtggctt cttcatgttc tttctcacct ttccgtaggc tagctgcaga 8340taatgatgag
gctttaggga gtgggtggag ccataaagta gaagcctgga ttcctaaatg 8400acggtgtgaa
gtgttcccta atttcacgta attgtttctt aatttcctgt ttgggttatt 8460tgttgctaag
gtataaaaaa accctgattt ttgtgtgttg atatttgtgt gctgcaactt 8520tgctgaatta
gcttattagc tcaatttgat ctcagatatt agctcaaata ttttgggaga 8580ttatttatgg
ttatctacat aagatcatgt catctgaaat aaagatagtt ctatttcctt 8640ctttctatct
tagtccattt gggctgctgt aacaaaatgc cataaattgg aggctgagaa 8700gtccaagatc
aaggcccaag ctaattcact gtctgatgaa ggcctgcttt ctggttcata 8760catggcacct
tctagctgtg tcctcacatg gtggaaaagg caaggtagct ctctgggatt 8820cctttttgtt
tgtttgtttg ttttgttgtt tttgtttgat tttttgagac agagtctcac 8880tctgtcacca
ggctggagtg cagtggcaca atctcggctc attgcaacct ctgactccct 8940ggttcaaacg
attctcctgc ctcagcctcc tgagtagctg ggattacagg tacccatcac 9000catgtccagc
tactttttgt atttttagta gagacagggt ttcaccatgt tggccaggat 9060ggtctcgatc
tcttgacctc gtgatctgcc caccttggcc tcccaaagtg ctgggattac 9120aggcatgagc
caccgtgcct gtcctccggt attcttttta taagggctct ttttcttttt 9180atgtgggctc
taccctcatg acctagcacc ttctaaggcc ccacctctta atatcatcac 9240acagcagatt
taatatatga attttgaggg gacacattct ttccatagca ctttccagta 9300tggatacctt
ttatttattt ttcttcccta attgctttgg ttagaaatgt cttccctaat 9360tgctccacta
ctatgttgaa aagaagtggc aaaagtgggt attcttgtct tgctcctctc 9420ttaggaagaa
agtttaagtc ttttgccatt aaatatgacg ttagctatgg ggttttcata 9480tatgacattt
atcatgttga ggaaattttc ttcttgtttc aatgatgaca gggtgttgag 9540ttttgtcaga
tgctttttct gcatcaatca atatgaccat gtagtttctt tgttttattc 9600cattattgta
gtacattaca ttaatttttg catgttgaac tattcttgtg ttcctgggat 9660aaatttcact
tggttatggt gtataatcca taaccataac ctgaagatat gctgaagagg 9720ctaagtgcca
tggctcatgc ctgtaattcc aacactttgg gaggctggtg tgggaggatc 9780acctgaaatc
aggagtttta gaagagcctg ggcaagtaaa caagatccca tctctacaaa 9840aaattgaaaa
ttaccgctgg gcatggtggc tcacgcctgt aatcccagca ctttgggtgg 9900ccgaggcagg
cagatcacct gaggtcggga gttctagacc agcctgacca acatagaaaa 9960accccgtctc
tactgaaaat acagaattag ccaggcgtgg tggcacatgc ctgtaatccc 10020agctactcag
gaggctgagg caggaaaatc acttgaacct gggagacgga ggttgcagcg 10080agccaagatc
atgccattgc actccagcct gggcaacaag agcaaatctc cgtctcaaaa 10140aaaaaaaaaa
gaaaagaaag aaagaaagaa aagaaaagaa agaaaattag cttgatgtgg 10200tggttgtgca
cctttagtcc tagctactca ggaggctgag gcaggaggat tgtttgagcc 10260caggaggttg
aggctgcagt gagccatgat tgcaccactg cactccagcc tgagcaacaa 10320agtaagacct
catcactaaa aacaaatttt ttaatactga agaattttat ttgctggtat 10380tttgttgagg
attttgcatc tatattcaca agaaatatta ctctgtagtt tttcttcttg 10440tagtatcttt
gtctggtttc agtatcaagg caatgctggc ctcatgagat caatcaggaa 10500gtgttacttc
ctcttttatt ttttggaaga atttgagaga attggtgtta attcttcttt 10560aaatggttgg
tagaattacc agtgtagaca tctggtcctg ggattttctt tgttgggagg 10620ttttttagta
ctaattccat ttccttactt gttattagtc taatgagatt ttctgtttct 10680tcttgagcta
gttgtagtag ctcatgtgtg gaatttttct atttcatcta agttatccaa 10740gtttacctaa
gttaaagttc cattttatct aacttgggta agccaacaaa caatactaaa 10800ttgttcatag
tattctctca tagtcctttt tttctctaaa gtcagtaata acgttcactc 10860tttcattttt
tcattcctga ttttaataat ctgagttctt tctctccccc tccctgcaat 10920tgagagtcat
ttaaaagtgt cttgattaaa ttttatatat ctgtgagttt tccagttttc 10980cctctgttat
tctcttctag ttttatttca tgtgatccaa aaagatactt tatatgattt 11040caattttttt
acatttacta agacttgttt tgtgactaaa atatccttga gaatttccat 11100gcacatttga
gaaaaatgca cattctgctg ttgttggaca gagtgttctg tatatgtctg 11160ttaggtctaa
ttggtttaga gtattgttct agtcctctct ttccttattg atcttctgtc 11220tagttgttta
atccattatt caaagtagtg gccgggcacg gtggctcaca cctgtaatcc 11280cagcactttg
ggaggccgag gagggtggat cacaatgtca ggaggttgag accagcctgg 11340ccaacatggt
gaaactccgt ctctactgaa aatacaaaaa atttgctgga catggtggca 11400cacgcctgta
atcccagcta ctcaggaggc caaggcagga gaatcacttg aacccaggag 11460gcagaagttg
cagtgagctg agatcgcacc attgcactgc agcctgggca acagagcaag 11520actctgtctc
gagaaacaac aaaaacaaaa acaaaaaaca aagtagtgta ctaaagtctc 11580caactactat
tgtagaactc tatttctccc ttcaatgttg caaaattttg tttcatgtat 11640tttggtgttc
tgttctttat aatttttata tcttcttaat ggatgaaaac ttttatcaac 11700atataatgtt
ctttgtctct tgagactttt ttttttaact taaaatctat ttgggctgat 11760aatacagcca
ccacaactct catattggtt gttattttca tagaatatct tcttccatcc 11820ttctacttta
aaattcttct atctttatat ctaaagtgag cctcttgtag atagcatata 11880ggtggataat
gttctcttta ttcactctgc caatatctgc cttttaactg gagtttaatc 11940tatttatata
taaaataatt actgattagg aaggacttac ttctaccact cagctatttt 12000ttttctgtgt
gtcttataca tttttaagtt tctcaattcc tccattactg gatttttttt 12060tttacttctt
gattttgtgt ctgtgttgtt acattttgat tattttctcc ttttgatagc 12120ggcaggaggc
agccaaatgc ctggcagata gaagcttgtc ccccatgaaa ccccaccttc 12180aagccaaaaa
atagcctgaa ggctgaaaga ccggactgct ggtcccagat gaaacccatg 12240atccagagtg
agaacttcca ttcctgtttg cctgccctct aaataatccc ttttaaccaa 12300tcgaatgttg
ccttttccaa tactacctat ggcctgcccc tcccccattc tgagcccata 12360aaagccctgg
aatcagccac attgggggca ctttgccaac ttcaggtagg gggaccacct 12420ctgtatccct
tctctgctga aagctgtttt catcactcaa tgaaactctc accttgctcc 12480ctctttgatt
gtcagcgtat cctcattttt cttgggtgtg gtacaagaac tcgggaacca 12540gtgcacaagc
cagacttggt ctgggcagca cgggttagtg ggccatctcc cacagcaggt 12600agcatggcca
agtgaggcct gggcagggca tcaccaaggt ccctggcttg caaagtgacc 12660aaggaaaaaa
tcctgtgtca ctttcctttt ctcatatttt ttagttattt tcctaatgat 12720tgccttgagg
atggcaatta acatcttaca cttataagaa gctagtttga ataatagttc 12780caatagtaca
tgaacactct actcctatat atctccatcc ttcttccttt atattgttat 12840tcccacaaat
tatgttttta tacattatat cctcactaac ataaacttat tattattttc 12900tgcatttgcc
ttttaaatca tacaggaaaa caagaatcac aaagaaaaac tacattaata 12960tttgctgtta
tatttaccta tatagtgaca tttaacagtg tatttttatg tcttcagatg 13020tctttgaatt
actacttagt gtcttttcat tttagcctca atgtttccct ttagcatttc 13080ctatagggca
ggcctgccgg taattaattc cctttggttt tctttatctg aaatgtctaa 13140tttctttttt
attcttgaag aatagttttg ctggctataa gattcttagt taatagtttt 13200tttcccagca
cttcaattat tattaaagtg ttattattat tattattatt attttgagat 13260ggagtctccc
tctgtcactc aggctggagt gcagtggcgc aatctctgct cactgcaacc 13320tccgcctccc
aggttcaagc aattctcctg cctcagcctc ccgagttagc tgggattaca 13380ggtgcccgcc
accatgccca gctaattttt gtatttttag tagagacggg gtttcaccat 13440gttggtcagg
ctgatcttga actcctgacc tcaagtgata cacccacctt ggcctcccaa 13500agtgctggga
ttagaggcat gagccaccat gcctggtcta aagtgtaatt attattacag 13560ctgccatttg
gcctccttgg tttctaatga gaaatcatct gttaaactta ttgcaaatcc 13620ttggtatgta
tgctatgtgt catttctctc ttgctgcttc caagattctc tctctgtctt 13680tgtcttttga
caatttgact ataatgtgtt tcagtgtgaa tttcttagag tttatcccac 13740ttggatttca
ttgagcttct tggatgtgta cgtttgtctt tcaccaaatc tgggaaatta 13800tttcaccatt
tctcaaatat ctttttcttc ccctttccat ctctcttctt ctggagctcc 13860cgtatactta
gttggcatga ctgatggtat cctactggtc cctcaggttc tgttcatttt 13920tcttctttct
ttttttctgc tctgcagact ggataacttc aatcgccttt tcttcaagtt 13980caatgattat
ttcttctgcc tgctcaaatt ggccatttaa cccctccagt gactttttca 14040tttcagtatt
gtacttttca gatccagaat ttctatttgg ttcctcttta ataaattctt 14100tttattgtca
ttccccatct gttcatacat tgctctccca atttcctgta gttctttgtc 14160catggttttc
tttagttaat taagcatatt taagacagtt gacttaatgt ctttgactag 14220taattccaat
gtctaaaatt ccttatggat agcttctttt aaattatttt tgtcctgtta 14280gagagtcata
tcttcctctt tatttgcttt gtaatacttt gttgaaaact taacattttg 14340agtagtaaaa
tgtggtaatt ctgaagccag attctccccc tcctttgaga ttggttttgt 14400tgtttgttga
gggctgcagt tgtccatttg tatagtgact tttccaaacg atttttgcaa 14460agtatgtatt
ctctcttgtg tctggtcact gacgtttctg ttctggtgcc tctgcagtca 14520gcctatgacc
tggaagagca ttccttaaat gcatagattt ttttaaaacc caagaaacaa 14580aaaacctagc
atgtatgtac ctttttaaaa atcttctgat agatgccacc tggaaggctg 14640ctgctgcctg
aaggggcaga aacaaaggca agctctactc tgagccctca gggaaccacc 14700agataaacaa
aagaaatttg attctccaaa tttctggaag acaaggtcct ttctgcccac 14760tcctgctcca
gccagctgct ctaggaacac aattactgtc cacatggcca caggaatgtt 14820gaagaatgca
ggatggtagc tggtttgccc acaccactca cttatgagcc atcagcatgc 14880ctctcccttc
atcgagcact cccatggttg ctgtaagtgt ccaatcaggt tccagaattc 14940tgaaagagtt
gactcttaca ggattttttt cttttctaac ttgctggttg tttagataga 15000ggaaccaatt
cctgaagttt cctacgttgc cagcttcatg aggatcattc cctagtaact 15060cttttcagac
aaaaagcttc attgatttac tgtaggacta gcatcaaaga gtctatgcca 15120cctagtctgt
ctccttaaaa cacagaaata atcagtatgc attggggtag gagtttggca 15180ttagatctgc
cgtaaatcaa gagctgggga cagcccatgt cttaaactct gacccaaggg 15240ctaaaatatc
ctttggtagc aacaacagct acaaactatt gaacaacttg tatgtgccaa 15300gagccttacc
tgcattatcc cattgaatcc tctcaacagc cctgtgaggt agtagaattg 15360ttgcctgccc
cttactgagg cctagaaaca ttaaggaatt tgcccgaggc cctagagcca 15420gtgagtggca
aagccagtct ccagactcag gctggagatc ctacagttct gtgttacccc 15480agtgttatcc
tgcctctcag cacagagtct tggatgattc tcctaacccc tccctaggca 15540atgcacaggg
ctgctccctg cacccttact catgctctgc tcttcaaccc caacagtgct 15600ggccttaggc
tttatccctg acacccagcc ccaggctcca ttccatctgt tgacagaggc 15660aaacactggg
gcaaaactga cctctgtgga taccactgtg tccacctcca ccagcttcag 15720ctgaagcctc
tgaacatctc cagcatggaa gaagccccaa aggatatttc ctgtccccca 15780gcatatgctt
gaccctgaag ccctccccat ctagtcaaga agaccaaact gttaacaatc 15840ctggagtcag
agtgacccat gggtgaatct tagccaagtc actcatagct gttgcatcct 15900agtaaatccc
ttaactccca taggcttcag tttccctgca tataaaatga cagccttcag 15960ctcatcggcc
agtttcaatc catctaaagg gtctagcaca tcccctggca tgtggaagcc 16020acagggcaca
cactagttgt ggtcatttga tcctggcatg ctctgctgtc tctcggctct 16080ccccttgcct
ctttccctga tgtcctggcc atcagccact gcctaacacc ctcccactca 16140ccaggccctt
agcctgcccc ttagcacaag agcacagccg gtctcaagtc taccctgctg 16200taagcaaaca
cttgcaacat catgctgacc tccaggccct gttgcatcag cgtgcccaca 16260cttggtgccc
agctggtact gagggtatca gggaacaggc cagtggtgga agggcggaca 16320ctttgggttc
cctggtttcc tggctcccaa tatctttccc aatggcatat ggggtctagc 16380agcttggctc
atttaactgt gaacctctac cctttagaat ctgggcctcc aggcttgctt 16440ctgtgcaaaa
tggcagataa ggctcaacct ttcttttttt aacttcattg ttaaatatta 16500ctccattaat
acccatttac tgcagaaaag gtaggaaata cagataagca aaaaggaaaa 16560taaattaaaa
tcctcatacc accatcatca agataattac tgtcaccatt ttggtatatt 16620tcctcccaat
acatatatta tctatatcgt atatacgaca aaaatggatc atactatgtt 16680tcctgttctt
cccctgtgtt agtcatctat tgctgtataa caaactgcct caaaacttag 16740tggcttcacc
tttccgtgta ttatgatgac aagaatgtgg tatgacactg tcttatatct 16800ggatcatatg
ctaaaagata gaaaatggtt tctaaactta tttgttctgt aataacaaaa 16860ttttatttca
taaagtgttt ttaaaaaaaa ccatagtagc ttgaaacaac aaacctttgt 16920tatctcacac
agtttctgta ggtcaagaat tcagaagcag cttagctggg tggtctggct 16980tggtgtctct
cctgaggtca gggttttggc tggggctgca tcacctgaag gcttgactgg 17040ggccagagga
gctgcttcca aagtggtcca ctcacatggc tggcaagttg gagttgcgta 17100ttggcaagag
acttcgcttc ttctcaatgg atcttcccag agttcttgta ggcaacctca 17160tagcatagca
gttggcttcc cccagaggga acagtccagg agagaacaag gcagaaacca 17220cagggtcttt
tctggcttag gctccaaagt catactccac catttctgca ttatcatatt 17280agttacacag
gctagaccta ttctgcatgg aagagactat accatggggt gaataccaga 17340agcagggcta
attgaaggcc agcttcaagg gcggctacac attccctttc aacagtatgt 17400catgaacatc
tttccatgcc aatagagcag atgaatctta ccatttttaa tgactacatg 17460taagtgtagc
ataatttatt taaccaacct cctgtagttg ggtatgtggg ttgtgtctcg 17520ttttttgata
gtagaattaa tcatcttgaa tatccatcac caaacttgtc atattatttt 17580cttttgatga
atgaaaaaga aaatcaagtc atgtctgtca atcagaaccc tgagcaacta 17640agaaatgggg
gtaccactgg gacatagagc aaggtccctt ctgattctgc tcttgtcttt 17700ctctccccat
gaaatgggga gttcactatc tactgagaca tcctagccca cagctgcaca 17760gttctgtctt
tttagaaagc tctaagcaga aacaatgttc atccatcctc ctcgggacag 17820cccttgagct
actgaagact ctaagcatgt cctggtcatc ctccatgagc catcatctct 17880gaggccctcc
ccttcttggc ccctcttctc tggacaggtt ctggacagtc ttgcccttcc 17940aaaattcctg
gaaagcagga actgttcctg ctacaatgac tctcaactcc agtgcagtac 18000agactgttgg
tgtcacccct tatcctgaag aagaggcact gagacaggac aagggtgggt 18060gcccaggagg
gctggcatga gtcatgagaa tctggtcccg gagaattaga cggtgtgggg 18120aagtaggggt
gttgggccgc tttctggcct catggatgcc aatgaatatc agcaggtggc 18180tcccagaaag
gaactctagg ggatgcctgt tgctctaaat agaggctaga gagggcactg 18240gcagttcagt
caaccaagaa agggggccca cttgcctcag cttcaggctt tgtacacatc 18300ctcagccttt
cttgagaact gaatttagat tctcctcccc tgtgctgtgt gcttggccca 18360gaagaagggc
aagtctcgct gggtggctgc ttcttggcct ggctgaacca gaaggcccca 18420gtgccactcc
aaacctgggt gtgagccctg cccccatgag caaacagtag ctcagagctg 18480ggggctgtgg
gggtcagtgg cctgtcacat gagatctgat gaggccatct ctgctctata 18540ttgggaaagg
gatcaattgt atcaagggct ttcttgggag tgatcactct ggccattggc 18600gagagacctg
gcattctgac aaggcaccct ccataccctg acccacttgc cagctccagc 18660taattttagc
aggctttggc aggtgccagc aagtacatag catgtggatg tcactcccag 18720gtgagcccaa
ggagaggcct gggccagagc ctggaagtca tggtctatgc ccatggaggc 18780acccaaagca
agcctgaggc ctggactttg cagtcacaaa attaagaatg atacccctgt 18840tttttgtttg
tttttgatca gttggccacc ttcctccacc accccttccc caagttccat 18900acagacccct
ggattgtatg aaatgcaaat cgaacctctc tgcagatgaa aatccactgg 18960ggatcccctt
gcctccaaga gcaagtccag acctgcacca gcgcgggcca ggccccctta 19020ggaccccctc
cctgtccaag ggcatttcag taagtgttct gtggccaagg cagcctggtg 19080actttctgcc
cgcacaaggc tgaggaatgg aagatgggta ggctggctct gcacaccccc 19140tcctgctggg
cagcaatccc taccccatgt tcacagagtg tggccggctg ccccatggct 19200ctgtccccgt
ggccctgtca actgttaccc acatggccta ccctcccttt ctgccctgcc 19260tctgacccca
tggcaggggg cagagtattt gagcagccgc caggctgagc cctttcagtg 19320cagaagccct
gggctgccag cctcaggcag ctctccatcc aagcagccgt tgctgccaca 19380ggcgggcctt
acgctccaag gctacagcat gtgctaggcc tcagcaggca ggagcatctc 19440tgcctcccaa
agcatctacc tcttagcccc tcggagagat ggcgatggat gtcacaagga 19500gccaggccca
gacagccttg actctggtaa gggtcacacc aaagttaggg actttgcact 19560gggagagcag
cacccagggc agggcccttg gttttgcaga ttaccaaaac taaggctggg 19620ggcagggaag
gcgagcaggc ttggggcacc ttggaaggag gcacatgggc cttgggggtc 19680ctggctaggg
cagctgtgcc tgccactggc cctctgccca ccacccctcc tcactgtggc 19740tatccagtgt
ccagcctctc gaggggttct agggtactta ttcctggagc taacggtgac 19800ccaggacacc
agtgtccggg gcctggcctg gggcttttat ggggggagct ggctggctgc 19860ccagggctgt
ctggctctct gggggctctg catggcattt ccaggggttg gtggatcagg 19920gattctgtcc
ctcaggagaa tgtgggcact agcccaaggc cactcacttc tgtgtacata 19980gccacctgag
ggcccaggaa tggagggggc caggctacag ctggacatct ggcactcgga 20040tgggctctgg
agcccccagg cctgcagcat ctgcccaggg actgccctgg cccttggcca 20100tttcctcagg
gacccacagc tccaccagcc ggcccctccc agtgctggaa tagacagttc 20160ctcagtccac
atctgccaaa ggcggcacta gaaggcatcc tgcctttttt actgcgttct 20220ggaggtgggg
tcacaaagca ctgctcactg cataaaaggg acagcatcct gcccctggca 20280gccctgcctg
accagctccg cctctcccac tgctatccaa cctgtacacc ctggtgacca 20340tgtccaggcc
agtggcctta aggactgtct ctgtactgat ggctccacat ctacctctcc 20400agccagactc
tcctctgaac tcgggcctca catggccaac tgctacttgg aacaaatcgc 20460cccttggctg
gcagatgtgt taacatgccc agaccaagat cccaactccc acaacccaac 20520tcccaggtca
gatggaacct cttcttccca ggcccttctg ttcctctcct cagcccctcc 20580cacctccctt
cagaataagt ctagactctt atcgctttca ccaagcctgc gcccagcatc 20640cctgcacagg
gattgttagg acagcctgac gccctgcttc caccctgccc caagatgccc 20700ctgctctgca
gcccggcgcc tccaggcttc tcacctcctg ctgctcacag ctcagcctca 20760ctccctccct
ccccgcctct gctccagcct cagtgcaggt cccctgctcc catcttctgg 20820cagcagctgc
ccgacctggt ccctcttcat ctgtccccat tccttcaccc cccagcctgt 20880ccccaacttg
actgaggttc tttcctgcag atccccgccc ttgagagggg ttggtcccac 20940tgtcaactct
gcttctgtgc cctgtgccgc acctggcatt cagtgagcat ctgctgaaga 21000gatgagggtc
agatgccctg cagggagtgt gggggcgtcc tcaggcaaga aaagttgtac 21060gtttggctgt
gggccctgat tatgtgtcct gtgacctctt gggtgaggtc agcaagagaa 21120acctctgcaa
gctggctggg gctgcctccc agaggctgcc agggggaggg acaggctctg 21180tctgtgctct
tcttccgagg ctacacctgg ggcgccaggc tctcagggct ccccaggtac 21240caccacattt
cctacactgc ttgggaaagc cctgtaagtt tgcacagaca cccagcatga 21300ggctcgccag
agagatactt gtagctgggg tctgggcacc aggaacagct tggtgctggg 21360cctgaagtcg
ggcaggatgc agcctggcca ggtgagagga aagcttggag ccagtgcctg 21420ggttcaaact
cctctgtggc ctatggttct gtgggcttgg ggaagggttt gtacctctgt 21480gtccagtttc
ctcacttata aaaaaaggag ataataaaag tacccatgtc ccagggtggc 21540tgtagcaata
atagggaggg gtgcccagag caggtctggc acacaggaag tgtgcatcag 21600cctcagtccc
tgccattggg cttgtcctgg gagtctgtga agccaacctc tgctccacaa 21660tgtgaccccc
aggcttgtga gaccaagctg ggtcagagct tcctcctctg gggttgcacc 21720aggaggggaa
cttctgcagg cccagatgca ccctgaggaa agggcttgtt cccaccaaga 21780acaaggctca
cctttggagg atgctcccca catgagaggt gaacccccag gtctactggt 21840gactgcagcc
tcggaagctg acagcatcta tcctccaacc catgcccact gggaagtgtg 21900tgaggggtcc
tcataggccc tgcggtgtgg acaatgcaga gaccctgtag catctggcta 21960gggcggggcc
cagataagag ccctgtgcca ggagagcctg gccggttctg ccactgtggg 22020gagacaggct
cccccacccc atgtcccctg cttccctgca gcccacagag aatacagacc 22080tacttttaca
gaaatccaga tttttgtgta aaagtgtctc tattttaagt agattttaag 22140tggtggcagc
aaatttaagc ttttgagaat attatacaga acaaatcaga ttcacaggcc 22200agatgcaact
ttatttacag aaatgggatc aggtcctacc tcaggtccca tctcacgttt 22260tcacttatgc
ctatacgtct ccttcacggg aaaggccaca agaggccctg cggtaagtgt 22320cccggtgttg
atttaaagtc cccaacagtg aatatgaggg tcctcactgt tgcagcaaga 22380ggataccccc
ctgtgtatct tggaaatgcc tgcagccctc ttgctgcaga acagattctt 22440aggagagaaa
ctgtcagatc aaagttaaac ttagagaaac tccaaattgc cctctgaaca 22500gacggtatca
gtttgacatc atccaatacc gggattcctc ggggagaact ttctggccta 22560gaaggcagta
gagccaggac ttcacccagt cagtggcagg gccacacgtg ggccttgata 22620cagaggggga
agacttgagc ctcctcgaca ccctacaggg cccagcctcc caacatgtga 22680taagagaaac
aacagccaac ttgtacctag ctctccttat tctccaaggg ctgggccagt 22740tctccccaca
gccctgcaag ggaggatcac tcaagggccc caactgtctg acaatacagc 22800cacactctga
tcagccacct gggcataggc tccatgccat tgtcctccgc caagacctca 22860gactgaaatg
ttggctcctc ccatgaagaa cctggggcca aaggaccaga gtccaggtcc 22920gtggctgcca
ggatgggcca cttggagaga ggcacaaggg tggtgccagg caggtgtgag 22980ggctggacct
ttgcaagagc agcatcactt ttgttgagag cccacaggta tcttataatt 23040gggtcctagg
acttcctgcc agtagccatt gtgtgcatgg atttgggtgc tggcctcacc 23100atggtgtgct
ggctgcccat gcctgcaata atgacttctg taagcctttc ttcatctgca 23160agatgggtgc
tgctggcacc tcctccccgg tgctgtggtg acagggcata gtgtgtgagg 23220ctgctatgtg
aagcacctaa tgcagggcct ggcatatgga ggaattcagc aaatgacaga 23280tgccttcaca
gttagttcct ggcatcctct acattggtgg gtgtaggaaa gaaagacaga 23340ggaggcaaaa
gttgtagctg tggggcattg aggacagcct ggattgttcc acagagccct 23400gaggacatct
ccaggggtgt gctctgcagg ggcagctgga ttggagggtt aggggtcggg 23460gagggcgtgc
actcccaccc atgctcacag cctcggaaca gtgcctgctc agccaacatg 23520ggtgtttgat
tctgtgtctt ttgtcacaga ctttatcagc cccatccctt tctgaccttg 23580cctcagttta
aattttacat gtggggcctc attaagagac atggttctta actaaagatc 23640tgtatccatt
aggaatgctt tgggctgcag gaagacaaac acctgactca ctgtggcata 23700agtggtttgc
gtctgctccc ataagctgca cgtggagggt ggatctggca ttactctctc 23760ttccctacat
ttgcagtatg ctaacagctt taacctccag ccttgtttct tcatggttgc 23820agggtggcta
tcacagcgct ggccatcaca tccttacaca gctgtgttta caaatttagg 23880gggacattga
agctcctccc ctgctaaaat caggcttccc ttcacctgtc attggccaga 23940actgggtgaa
atgcccaact ctagaccgat catcagtaag aggagtatag aattgctgtg 24000cccaccttag
attaatcatg gcgcaatgtg ctccccatac caacaaaatc tgagttctag 24060aaactgagga
agaagaggaa aatggccgtc ttgcctcctg gctgggattc agagcatctc 24120caaccctctg
agcttatgtg taagactgtg ggcaaaagtg tgtgagtttt tgtggaatgg 24180atccacggct
tttatcagag catctttcct ttttcttttt gattcaagat gaaaatattc 24240ttatgattat
ttttctcacc actgcccaga gataaccagc acattaacat ggccttttct 24300ccatgaatag
cactagggtg cccagtggac agacacatag ctgtccacac accagcttgc 24360tggggatgca
taggcagagt cacatctgca ctcacggcct gtcctcacac tgccatgtgg 24420agagccagca
gccacaccat gggccgtcca tgctcacggg agtggcagta tcagatctga 24480gcttcgtgtg
cccaggcgtc tctcacatca gtgcataggg accctctttg ttctgtggcc 24540cagtgtgccc
atgccacaga tggcttcagt cagcagacac ctccttctag acactcacac 24600tcactcctgg
ctggccctta gcacacctgt gcagacaggc ccatttattt tcttgtgtaa 24660atcccaagta
ggaggactgg gtctctctga cagcaatgcc agctgcctgg caccctccag 24720acaggtggct
caagccccac ctcgccagct ctcccagtta gcccctcctt tccctggctc 24780tgacctgagg
gacgaagcag ggtgctacag gacgctgtgc cacagggata tcgtcaggga 24840cagaagctac
tctgccctct gctgctcacc cctccaacac gctgtgggct gcatttgttg 24900agtggctggt
accagactct gctcttctga ctttccagct ggttttacct gtagtaaagt 24960ttgagaagat
gggtcatcct gaccccgggg tcagaagaca gaaggaggcc catggcgtgt 25020gggggagatg
ccccgtgagg ccctcggtgt gcagatgcct ggtgacagcc ccaccctgag 25080gtccccagcc
taccccctcc ccagcccgac tgctcccatc cccctccctg tgcaggtaga 25140gcagatcctg
gcagagttcc agctgcagga ggaggacctg aagaaggtga tgagacggat 25200gcagaaggag
atggaccgcg gcctgaggct ggagacccat gaagaggcca gtgtgaagat 25260gctgcccacc
tacgtgcgct ccaccccaga aggctcaggt accacatggt aaccggctcc 25320tcatccagaa
gcagctgtgg gctcagccct agctgggaga agcaccccag gcactcccag 25380actcacagcc
agcccgagac agaatctcct ggggagcaat gaagtcctcg acttgggcca 25440gttctcaccc
ttggctcctc tggtccggcc ctggggcact cgggctcacc ctggagctgg 25500caaacctcag
gaaaactggc gttttaaatc tcactcctgg ccaggtgcag tggctcaccc 25560ctgtaacttc
aacactttgg gaggccaaag caggcggatc tcttgaggcc aggagtttga 25620gaccagcctg
cccaacatgg tgaaaccccg tctctactaa aaatacaaaa attatccagg 25680catggtggca
cattcctgta gttccagcta ctcgggaggc tgaggcataa gaattgcttg 25740aacccgggag
gccgaggttg cagtgagcca aaatcgcgcc actgcactcc agcctggggt 25800gacagggtga
gacaccatct caaaaaaaaa aaaaaaaaaa gacctcactg ctccccatgg 25860gcacttaggg
aactctccca gcccagttct gcagctgggc cattgcacta gatcctcagt 25920tggtccctgg
gctctcggtg actgtccagg gcaggagttt cccattgact tttccctggt 25980tgacctttga
ccccttccac agttgacact ggtgtcccca ggtgtctggt ggccccttgt 26040ccagctccct
tagtcccttg tgccttccct cctcctcttt gtaatatccg ggctcagtca 26100cctggggccc
acccagccca aggccagcct gtgggtgtcc ctgaggctga cacacttctc 26160tctgtgcctt
tagaagtcgg ggacttcctc tccctggacc tgggtggcac taacttcagg 26220gtgatgctgg
tgaaggtggg agaaggtgag gaggggcagt ggagcgtgaa gaccaaacac 26280cagatgtact
ccatccccga ggacgccatg accggcactg ctgagatggt gagcagcgca 26340ggggccgggg
cagggggcca aggccatgca ggatctcagg gcccagctag tcctgacggg 26400aggtgccacc
tgtctaccag gggtggggag agcgggggct ggaggaccac ccagcctcag 26460aggcagctgg
aggcctgggt gaacaggact ggccaacatg tccccaagtc ccacagtcac 26520catctggcca
gcattgagag gggaacgggc tgaggaagag ttagtggcaa gaggaacccc 26580agccagtcac
accttgtcca gtttaccaga ggaaaaacca atgtgtaaga acagaaatgt 26640gacccggcag
ccagtgcact gcccccctct ccaaaggcca cccctcaccc tccaccagca 26700tgcacagaaa
gtggggtgac agcaatcaca atgtctaccc aggcagcaag gacccctgac 26760catggggagg
actggggtgc agggaacata gaagcagaat gaggcctagg gggagttggg 26820caaggccaga
gccctagctg cagccaagca catggccaag gccagctcct ggaagggcag 26880ggctccgagg
caggaggcag gaggctgccc gtggctaccc gtcctcacac ccctgcagct 26940tgctagtctg
tctgtgggct gggtgtgaat caaggcagtg ggatggtgtg gggacctccc 27000tggccccagc
agccagtgag gagcctggtc agtcagcaga gcattcagca gtatccagtt 27060ccatggagag
gcccgtgtga ggggagtcgg ggctggtctt cagtaaggat gggtggccag 27120ggcccctaga
agtagaaaag gagactccgg gtgctggaga cagaaatcaa ggatgtgcct 27180ccatgtggag
cctcaggaat agctggccag gcctgaggct gaacctcaca aggttcagct 27240gggagggcta
ggctgacaga gcacagccgg gccagggacc agcctgccct gtgttgcctt 27300gtcccgaggg
ccactgtcag caggtctctg gcatggggga ggcttagggc ctgagcccaa 27360caagcagcag
cggaagagga gagggaaact gtggacaggc ctggcattca gtggccaggt 27420gttgcagtgt
ccctgaggaa tagcttggct tgaggccgtg gggagggctg ccggccagcg 27480caccccccca
tgccagatgg tcaccatggc gtgcatcttc cagctcttcg actacatctc 27540tgagtgcatc
tccgacttcc tggacaagca tcagatgaaa cacaagaagc tgcccctggg 27600cttcaccttc
tcctttcctg tgaggcacga agacatcgat aaggtgggcc gggtggaggg 27660gcagaaggca
gatgagggga ggcacaggca ccccagagga actctgcctt caaatgtagc 27720ccccatacca
tgtgctcaga agggagatct ggattcaaat tgtggccatg tcacctgcca 27780cctctaatgc
tgtggaaaag aagcatcaca ttagctaatt ctggctgtgc gccttgtgag 27840gcaccagcta
tgatcacccc actccagtgg aaagagcagc tggcagtagg gtggggctca 27900aactcaggca
gccgggctct gggtcacctg caggccacgg tcatgtcaca ctgcctctag 27960ctgagtcaga
aatgtgaagg aactgagatt ctacccttcc tgcaagctag caaagtggcc 28020tgccagttac
atctgtgcat gcacacacac acacagttat atatgcacac acataaaaca 28080cgagaccttt
gggtcaggga gaaagccaga tcctcactca cggcagaagc agcagccaaa 28140gcaacatctc
atgtggtttt ccaagcccca gtccctacag agacagagag ggccaggtgg 28200cacctgtgca
tgcagcgggg taccttgcag gagggaaatc ctgattttac acaaagctgc 28260tccccccacg
ccctgccttg actctgggat gacgtctcag agctgtgcag tacaacattc 28320ttaaattggc
tgggactcag ccctgcagaa atatgatatc ttcaaggaga atcgttccca 28380aaacctctca
aagctatggg gctgctctga gcctgtttcc tcagctgtaa agtagggtgc 28440atacttttat
ggccctgtgc aggaggtagt gacaggccct agcaccctgc ctccagtata 28500tgttagcagc
cacgaggcct atctctcccc acagggcatc cttctcaact ggaccaaggg 28560cttcaaggcc
tcaggagcag aagggaacaa tgtcgtgggg cttctgcgag acgctatcaa 28620acggagaggg
gtgagggggc acctgtacct gccggggggg ctgccctggg ccacccaccc 28680cagcactgcc
tgcctttctc cttggcttcc agcactgcag cttctgtgct tcttggcagg 28740actttgaaat
ggatgtggtg gcaatggtga atgacacggt ggccacgatg atctcctgct 28800actacgaaga
ccatcagtgc gaggtcggca tgatcgtggg taagggctcc ttgcacccct 28860gccccttcca
gactgctgag gctccctgtg tacaacaggc ttcaagggcc ctgtggggtg 28920aggaccaaac
tacttaacaa ccggtgatgt cagagcagag cctggtgcta cagcctgggt 28980ggtcttgggg
tatcaagatg gaagcaccgt gtacagtagg aagcatttca acgccatgat 29040gccacattcc
tgcatcagat ggtatgccag ctgcatatcc acctcaccca tcaggattat 29100aattaaaaca
cttatctggt aaattgacca actggacaga ttggtccaag tggaagagga 29160taagcaaaag
tggtaccatc tccacccgaa tggtctttcc acgggcctgc ccctgcccct 29220gcccccaccc
aaagtgaagg caggtaccag gaaagggagc agcagtccgc ccctcccagc 29280agaggggtct
tccacaccaa ctcggacctt tctcagaagt tccggaggtc attataacca 29340gccttcactg
aggagcaatc caatcagatc agttatctgc tgtgcgcaca gccgtgtggt 29400tctatacttc
tcttacttcc attttcacct ttcagaagga acgttgtctt taaatccagc 29460atctaaacgt
gagccccagc catccctggc tgtgatcccc ccagcccttt ccaccctatc 29520ctctggaact
gcctggggct ccccaagaca cttccacatg aattcccacc aagccaagct 29580gcagctgctg
ggcccaggca taacccctcc tggggcagag gtggcaagga gtgacccacc 29640actcacatct
gccccacatc cactcttgac tctgctcagt gtttaaaaac atgtttataa 29700caattaccaa
gatctgaaaa ttaggagaat tcacatcaaa gtctggattt ctgtttgttc 29760ataaaaaact
agaaggcagc caggcaaggt ggctcacgcc agtaatccca acactttggg 29820aggctaaggc
aggcgggtca cttgaggtca ggatttgaag actagctggc caacaaggtg 29880taacctcgtc
tctactaaaa atacaaaaat tagctgggtg tgatggcgca tgcctgtaat 29940cccaggtact
caggagactg aggcaggaga attgcttaaa ccctggaggc agaggttgca 30000gtgagccaag
atcacgccac tgcactccag cctgggtgat ggagtgagtg agactctgtc 30060tccaaataaa
taaataaata aataaaaact ggaagtctaa gcatcactga gccctgattc 30120ctatgtggca
gctcgactga ccagcatttg agttgctgtc cctgacagct ttgggggtgt 30180gcagcccaca
cagtcatgct agcttgaggc tctgctgtca gcagtttgaa actcttaata 30240acttgtgaac
aaaagactcc atgttgtcac tctgcacagg ggccagcaaa ttacaaaatt 30300ccatatccgg
aattgtctac aggagcctct gggctgctcc caagggccca caccatgcct 30360tactcacttt
gggttgccat ccaaacatgt ctcatgacaa agaagctcaa acatgtgcat 30420ggacagtgcc
agaaaacaag ggtcgtacat agacaaaata aaatgataac gtcccacaac 30480catttctttg
atacacactg tttctctcag tcctcccaac cacctaggta acaggcaggg 30540aaggtgttac
tgttgcctgt taggaaagag gacagccctg aaagctgtcc ctggccactg 30600aagcaaccca
ggtcttccag ccccagggag agccgccttt ccattgttcc agacaaagca 30660gagacaggca
tgggggagcg ggagagggac tcctgtgggc aggaaccagg ccctactccg 30720gggcagtgca
gctctcgctg acagtccccc cgacctccac cccaggcacg ggctgcaatg 30780cctgctacat
ggaggagatg cagaatgtgg agctggtgga gggggacgag ggccgcatgt 30840gcgtcaatac
cgagtggggc gccttcgggg actccggcga gctggacgag ttcctgctgg 30900agtatgaccg
cctggtggac gagagctctg caaaccccgg tcagcagctg taaggatgcc 30960cccctccccc
acaacccagg ccctgggccg ctctggtgca gcggcagatg ggagccgggc 31020cattgcagat
aatgggcttg tttttaaaca actctgggga aaagcaaact gacaatccgt 31080tcgtaagctc
catcccttct gctcagtcat gacctgcccc tgtgagagat gaagggttag 31140tcccagttgt
gatgtgataa gcccagacct ctttccttcc gacaggtgat cgtgcatgca 31200gaggaggctc
tgagacgccc ccagcaaggt tcctgggttt aacccaacat tccccaaagt 31260atgtatttgg
ccacattcac agaaagaata ttagtctttt gtggaatgct gcgggttgac 31320agtcacagct
tggaaaccaa cccacagaga gctcatcatt aatcatggct atcacttgtt 31380taccacctac
tgtgccaggc ctatgctaat tactttatta gcgtcctctc tgccgctcgc 31440aggcctctat
tattataggt cagtagtatt cgatttattt aaattaaata cggaaggtca 31500tagattaagc
aagaaagtgc cagcaacatg gtgcgtgcct ctgactgggc actaaccctc 31560caagtcttag
ttttcccaac cataactggc caatgaacag cagctctgga tgcagctaaa 31620ggaagactga
agctgtaggt cccgtgctcg gcgcagggcc ccctgcaagg aaggtttcgg 31680agggactgga
tggggtcttt gaactatctg tctttccctt tactgcagtg ggcccagggg 31740caggccaaag
ttgctcccgt gattgacttg aacgtgcacg ttcctaatcc ctgacatttc 31800taaagctctg
gctcattaac gagggaaaga cgtgaaccag ctgggggagt ggggatcgca 31860gtgccccacg
tggccgcctc gtgacctcag tggggagcag tggggccggc tcccggcttc 31920cacctgcatg
aggggccctc cctcgtgcct gctgatgtaa tggacctgcc ctatgtccag 31980gtatgagaag
ctcataggtg gcaagtacat gggcgagctg gtgcggcttg tgctgctcag 32040gctcgtggac
gaaaacctgc tcttccacgg ggaggcctcc gagcagctgc gcacacgcgg 32100agccttcgag
acgcgcttcg tgtcgcaggt ggagaggtgt gcggaggagg agggtgggtg 32160caaagggcag
gggctgggga cgcccgggca ctgcagactt ggtctcaggg cgacgctgag 32220tcccaggccc
ggggcgcagg gatgggaaac tagggcctgg ggcgggattc cgggcgtggg 32280cggggcccgg
ggcggggcac agggggcggg ggagtgggcg gggcccgagg ccgggcgctg 32340gaggcgaggg
cggggcaggg acgggtccaa gggcaggagg ctgggacagg acggggatgc 32400aaagggaggg
gcggggcccg agacggggag gagggggagg gcccaagggg aggaggcggg 32460gtccggacgg
ggatgccaag agcagggatg ggagcgagcc tgcgtccggg cactggtccc 32520catccgtgag
tcccctcggt gctccctgcc cgccgtggcc atcctctcac atcactcaca 32580accccaaggc
gcggcatggt tgacaccccc acgttaggac ggagaccctg ggcttagtta 32640gagggggcag
tactaaccag tccctggcgg aaacgctttg gctgggtgag gtgagcggga 32700tcgcccccat
ttctccagag aggggtcccg gctcagcgag ggaaagaggc cgccgctggg 32760gggacggctg
gccggggccc ctccctggag aacgagaggc cgccgctgga gggggatgga 32820ctgtcggagc
gacactcagc gaccgcccta cctcctcccg ccccgcagcg acacgggcga 32880ccgcaagcag
atctacaaca tcctgagcac gctggggctg cgaccctcga ccaccgactg 32940cgacatcgtg
cgccgcgcct gcgagagcgt gtctacgcgc gctgcgcaca tgtgctcggc 33000ggggctggcg
ggcgtcatca accgcatgcg cgagagccgc agcgaggacg taatgcgcat 33060cactgtgggc
gtggatggct ccgtgtacaa gctgcacccc aggtgagccc gccccgctct 33120ctccctggta
aagtggggcc caaaaagcgc gcgctccaag gttccttgcg gttcccaagc 33180tccaagattt
cgtagtcctc ttctcgtccc ccttggccta gatttggggg aagggtcgac 33240tgcgtgcagg
gcgcccggta atgaatgtgg aggatgaggt gggaggaggg acggcagccc 33300tgcttctctt
ctgcccagct tcaaggagcg gttccatgcc agcgtgcgca ggctgacgcc 33360cagctgcgag
atcaccttca tcgagtcgga ggagggcagt ggccggggcg cggccctggt 33420ctcggcggtg
gcctgtaaga aggcctgtat gctgggccag tgagagcagt ggccgcaagc 33480gcagggagga
tgccacagcc ccacagcacc caggctccat ggggaagtgc tccccacacg 33540tgctcgcagc
ctggcggggc aggaggcctg gccttgtcag gacccaggcc gcctgccata 33600ccgctgggga
acagagcggg cctcttccct cagtttttcg gtgggacagc cccagggccc 33660taacgggggt
gcggcaggag caggaacaga gactctggaa gccccccacc tttctcgctg 33720gaatcaattt
cccagaaggg agttgctcac tcaggacttt gatgcatttc cacactgtca 33780gagctgttgg
cctcgcctgg gcccaggctc tgggaagggg tgccctctgg atcctgctgt 33840ggcctcactt
ccctgggaac tcatcctgtg tggggaggca gctccaacag cttgaccaga 33900cctagacctg
ggccaaaagg gcagccaggg gctgctcatc acccagtcct ggccattttc 33960ttgcctgagg
ctcaagaggc ccagggagca atgggagggg gctccatgga ggaggtgtcc 34020caagctttga
atacccccag agaccttttc tctcccatac catcactgag tggcttgtga 34080ttctgggatg
gaccctcgca gcaggtgcaa gagacagagc ccccaagcct ctgccccaag 34140gggcccacaa
aggggagaag ggccagccct acatcttcag ctcccatagc gctggctcag 34200gaagaaaccc
caagcagcat tcagcacacc ccaagggaca accccatcat atgacatgcc 34260accctctcca
tgcccaacct aagattgtgt gggtttttta attaaaaatg ttaaaagttt 34320taaacatggc
ctgtccactg ttctttgact tctgtgcatt aggactgtgg ggacaatcta 34380taaagagtct
gcgtcacatg catgaagaca cttcagtatc tcggcaatgc cctccagaca 34440gctcctccag
ccatctgtgc caaggggagt gtgaggagtg acagaccagg ctgtaggaac 34500aggaatgggg
tgtcatgggg gatggcagag cagtggacag tacactgcct ggcccgggcc 34560cctgcttgcc
tgcccatgga atgtgtgcag agggagtgcc aggccaggtg ctgctctgga 34620gaagtggggg
aatgaggctg gtcctgctgc aggtcagtct cagcaccgtc ctgtccagtc 34680agagtcactt
aggtttgcca gtgagtaggg gcccagatac atgttggatt tctaaggtcc 34740ctccagatgc
tcctgtcagt ggaacgccta tttagagtta gccaagcgta ggcataatgc 34800catctttctg
cagcataaaa tacagtgaca tagaaacata tttgtgtgat tttcatgcat 34860tccttttttg
atgagagata ttacccagct aattaggaac aactgttttg tttccttcag 34920atcataaccc
aaagttgtga ttttgaaaag tcatgtcccc cttcagattt cttgttttct 34980gctacttctc
atgtggaatt gctttggctc ttcttagttc tcttgagtct aaattattcc 35040ttataagttg
gtgcaagcat ctgattattt tgttatcatt actgttatgc tcaagcattc 35100acagagtgga
acacatttta atatcaattg ctttctattt ctcctttata ttacagttca 35160ggacattgta
ttaattatta aaattctatt cgtaggtagg ttatatgact gaattgaaat 35220agataaaatg
aatttctttt ctagataaca aaggaggtgt cataaaacac ttgttatggg 35280ccagtgtgat
ggctcatgcc tataatctca gtgctttgag aggctgaggt ggaggattgc 35340ttgaggccag
gaatttgaga ccagcctggg gcaacatagc aagaccccat ctcttaaaaa 35400aaaaagggtg
gggcgggggg gcactgctgg gcgcggtggc tcatgcctgt aatcccagca 35460ctttgggaag
ccaaagcagg tggatcaaaa ggtcaggagt tcgagatcag cctggccaac 35520atggtgaaac
cccaactcta ctaaaaatac aaaaattagc cgggcatgat ggcgggtgct 35580tataatccca
gctactcagg aggctgaggc agaagaattg cttgaaccca ggaggcggag 35640gttgcagtga
gcagagattg caccactgca ctccagcctg ggcaacagag cgaaactctg 35700tctcaaaaat
gaattaatta attaaaaaaa gaaaaaaaaa acactgggca gggtggtgtg 35760cacctgtagt
cccaactact ccagaggctg aggcaggaag gagcacttga gcccaggagg 35820ttgtctgcag
tgagctctac tcatgccact gcactccagc ctgggtgaca gagctcagtg 35880gcttacacct
gtaatcctag cactttggga ggctgaagca ggcagatcac ctaagatcag 35940gagttcgaga
ccggctggcc aacatgataa aaccccgtct ttactaaaaa taaaataaaa 36000taaaaaatat
atataaaaat tagctgggtg tggtggcaca tgcctataat cccagctgct 36060tgggaggctg
aggaacaaga atggcttgaa cccgggaggc agaggtggca gtgagctgag 36120atcgcgccac
tgcactccag cctgtgcgag agtgagactc tgtctcaaaa aaaaaaaagg 36180gaatttaaga
aatttaaaag aaaactcttg ttatataaaa agggtattgg gtctgacaga 36240taagagctcc
tgcactctac cagccagcta ctgacagaca taggtctggc tccagtggag 36300gggcagcagc
cagtgagccc agcctggggt ggcccactcc tgctgcctcc aggatgtccc 36360ctgtttcccc
agcccctctg ctgtgccctc ggccccagaa gctggcgaga ctgcttctct 36420ggaacagcat
cacgcaggcc tgcccatcgg cccactgtgc accaggcctt ctggggatac 36480agatgtcaac
caggtggggt gctcaggagg ggcacagaag ccaggaatga caaacacatc 36540agccaccagg
caaatgggaa atgtgcccca gaagctccct gctgaggatg ttagggagag 36600cattctgaag
tagtgtggtt gagatgaggc ttgaggaagg caaggctcca aacagcaggg 36660cagactggga
gcaaggtaga ctgcatggga gggcagctga tggagctcct taaccctctg 36720gaattgcccc
aaagccaagc aaagtgttct tcttggggtc acagctagct cagggatgcc 36780ttctgcccct
tggtcagagg ggcaaaaggt cagagcctag ggtcaccaaa acctctggga 36840agccccgggg
gtctcaggcc acagaccatc ctcagaacta cacactgccc tcccatgcct 36900ggcgggggcc
ctggactggc cctcaccagc tgtcttcttg cactggccag ggttctggct 36960ggactggcaa
ggaggggtgg tcagatacag gagtaactgg atcccttcat caggacctag 37020ggtggtgaga
gctttgagcc tgctctgctc caggcagaca ttgtgtctgg ccctgccagg 37080atggatagac
agcaggatgt tacacgttga ggacatgaag gtcatcagga atgtggctgg 37140aatctgttag
gcctccccca gcccaggcgg gggctgccaa gtttgggcct atcctctgtt 37200cctctcctta
tttggacctt caggtgataa ggctgagaca taaaggaggc tgggccctgc 37260caccacgaca
gcagccacac ctctgcagag agaatggtga gtgcctgctg gggaagaaag 37320gctagcggtc
tcccaggtgc tggcctttgg gctgggggag cagagttttc tgtgcttgtg 37380ttgggttgag
ggtggtcccc agggagagga agaggatcct ggccctggct ctcctgggaa 37440tgctctggga
ctgtgcatga tgggtggggt ggggagactc tgaggagttg gggagaggac 37500ccctccctac
tcacagtgtt gcaggccagc aggaaggcgg ggacccgggg caaggtggca 37560gccaccaagc
aggcccaacg tggttcttcc aacgtctttt ccatgtttga acaagcccag 37620atacaggagt
tcaaagaagt gagtgcccac tcccagtagc ctcagatccc atcctggccc 37680ccccacccca
ccccacatac atacccccct tctaccctga ccttgcctct cacaccaccc 37740aggtctctcc
cccacctccc accttcccta gagctggggg ctgctcccac ctgaaggccc 37800ccatcccaca
ggccttcagc tgtatcgacc agaatcgtga tggcatcatc tgcaaggcag 37860acctgaggga
gacctactcc cagctgggtg cgtgcaccca cctcccaccc tgcgcactgg 37920ggtccctact
ctgagctgct gggcgggtgg gagtggctgg ggggacagga ctctgctccc 37980ctgcttcccc
tcctccccgt ctcctcacac tgcccttccc cccttgtcac gccttgcttc 38040cacttcacct
tcccgaccca cagctgcctc tgcccctcca gcccctgtgg ccaggatgga 38100gggagggcgg
cctgggcctt ctgggggaca cccagggtcc ctgtgtgcac ctcatgcccc 38160acccccacca
gggaaggtga gtgtcccaga ggaggagctg gacgccatgc tgcaagaggg 38220caagggcccc
atcaacttca ccgtcttcct cacgctcttt ggggagaagc tcaatggtga 38280gcctgggaca
gagctgggca cccttggcca ggcagggagc ctgcaccctg cctgaacccc 38340acctgaaccc
tgcctgaacc ccacctgaac cttacatgaa ccccacctga accctaactg 38400aaccccacct
ggacccacct ggactcttcc tggccatgac ccattccaag cacatcctct 38460gccccagaat
cccatgtgca ctggtcaccc cagtgctgac ttggagccag gaaatgtgcc 38520ttcagccccc
acccccaaat tccagtctcc cagccaagct gcccgcctca ggaggatgac 38580cattcccagc
cccactgatc cccgagaaac attttatgtt agggaatacc cccacctctt 38640ctgggatgtg
ggaggctcct catgcagccc agttcctcct gcgggggacc tgggatgctg 38700gagacatgga
tgctcacctg gctgcctcgg ccttccaggg acagaccccg aggaagccat 38760cctgagtgcc
ttccgcatgt ttgaccccag cggcaaaggg gtggtgaaca aggatgagta 38820agtatgggcc
cagccagatg aggagcaccg tggtggaagc agagagcggg gtgaggcccc 38880tagtgagggg
ggctgcctgt gcttcggggc cttacactgc tctttggggt gcagccaacc 38940cttccctgcg
ccatgggagc ctccgtaccc accttccctg tgcagtcact cccccgcagt 39000ctcctgctca
gaccctcctc accccccagg ttcaagcagc ttctcctgac ccaggcagac 39060aagttctctc
cagctgaggt gaggctgccc agccccttca atactcatcc ccagcacctt 39120ctctgggcct
tcacccatga cccagagccc agtaccagtg aggcagttgc tggaagggtg 39180agccgagggc
ccttctggag gaggtgccat ctctgttgag acctagaggg taaagatgtg 39240gagtcagaaa
agagggcagg gtgcgccagg cagggagact gtgcacagac ctggggggaa 39300gtggataggg
agaggtttcg tacactcggg gtgggcctgt gcctgtggct ggaggggcgt 39360cctttgcctc
ttggcccaca tttgcactga ctcctcactc tgcccagagt cagccaagag 39420aaaaacatta
acccagagtc tggggtctag ggttgaaaag ctaaggcaaa aagcacagat 39480gcagggggca
gacagaaagg ccacaggact caggtgaggt ctctgccggg ctgggccagg 39540agccagggga
ctgccactca ccagtgtccc ctgcaggtgg agcagatgtt cgccctgaca 39600cccatggacc
tggcggggaa catcgactac aagtcactgt gctacatcat cacccatgga 39660gacgagaaag
aggaatgagg ggcagggcca ggcccacggg ggggcacctc aataaactct 39720gttgcaaaat
tggaattgct gtggtgtctt gtctgtgaca gatgggttgg ggaccagcca 39780agggggatcc
cagggtctca gtgcgcacat caccatgatc atggccacca tctacctcct 39840gggagctggc
ccctcgccag ctcaccttga ttcactccca tgatgccaag tgaagtgtga 39900actatgatca
tgcctagttt acagatgagg acactgaggc ccagaaagtg tgagcatctt 39960accaaggcca
gccctctaga agaggagatg gtgggattta caccacctcc accaagccca 40020ggaatgagcc
acaaagtggg cactgcccag ctacttgggg ctgtgcagag aagaggctgc 40080ttgctgggca
ctcagcaaac tctgcccaac agcccagcgg gtgggcagca gccctgggac 40140ccccacaccc
aaccacacag cctcccctgg cccactgctc gcaccccatc tcaatacact 40200ggcttgggtg
cctccctgca tgggcccttt gtgaaaggca gagaggtacc catttgaaac 40260acaaccagct
tctcattgca aatacaggca aggcactaag acatgaggaa catggacacc 40320aaagcagggg
ccaggtaaca tgcaaatttc tagaggaaat gcccagaacc tggcatcatg 40380cctcctgagc
ccctcatgcg ccgtgagggg taagagggtc agacagctgg agtgtaggga 40440gacgacttct
caggagagaa tagttagtgc tcccgtcacc cttcatctga gaacccaaga 40500gctagaggag
aaagtgatcc tcatgagtac cagaggagca gcaggggaca tccaaagcac 40560cagagagaga
aacagagaca gagagacagg cagtgacagc tcaaacctca gccagatcca 40620gagcatacaa
agtctcctgc ctacaggaca gcccagtaag agctctcagc ttgcctcctt 40680ccctccccac
aagccctgct gcaatccctg tacctggggg tcagtgggaa ggaggtgagc 40740gagaaaggag
gggcacccct tcctgaaggc cccaagagga aaggcgtttt cacccagaca 40800ggtgttcagt
tttgatttta tctggcgcct ggcaatttaa ttactaaatt gaaacttgag 40860actttctgga
attatggcat tttctgttgc ttagagagat tacaaaagtc acgaactgcc 40920tgagtttcca
tcctgaaagc aggccaccag cccactccac tgaccatgct ggaacagtgg 40980atgaacaaaa
tcaagtacca ttaggattct accacatgag tctgcttgtt caacaagctg 41040atttcataaa
gtaagggatc atgttataat ccaagctcta caggggtaaa ttgtgaaaga 41100ctaaaatgaa
ccaaaaagat cataggtgtc cagttatctg atttgatggg gtgtctgaac 41160cttttgttat
ctttgagctg tttcaaaact ctctaaatta ttattattat ttttgagaca 41220gagtctctct
ctgtcaccca ggctggagtg cagtggcatg atctcagctc actgcaacct 41280ccacctccca
ggttcaagtg attctcatgc ctcaccctcc caagtagcta gtattacaga 41340tgggcacacc
ttgcctggct aatttttgta tttttaatag agacgtggtt tcaccatgtt 41400agccaggctg
gtctcgaact cctgacctcc gttgatccac ctgcctctgc ctcccaaagt 41460gctgggatta
caggggtgag ccaccgtgcc ctgccacaac tctaaattat aactaatagc 41520aaggcaatgg
ttcttctcta ttaacgtgca aataaatgtt gtccagtgga agcacaactg 41580atttttccct
tctctgtgga agaagccaat tttgcatcta ttaagcaaat tcatctgggc 41640attcctaacc
gtctacacat gcaccggctc tttgaattct tctctgaacc aggcccagga 41700ataagccaca
agatgagcac tgcccagctc cttgggctgt cacatcttat tgattcccac 41760atgaattcac
aagtaaataa aatatttggc ggttgttcac ttagtatgca agtcaatatt 41820ttgctttaaa
aatattatcc tttcacactc ctgatatagt tgtctgataa ggttagtcct 41880tcccacacca
aaactgcctg tattagtgtt gtttggaata aactgagggt agaatgtata 41940tggtgtgtgt
atgtggtgtg tgtgtttgtg tgtgtgtgtg tgtgagagag agagagagac 42000aaaagagaga
gacagaagga tagagagaaa cagatgggca cagacccagg acatgagttc 42060agcctacact
gaccaatatg acagccactg gccacttgaa atgtggtgtg agttgggata 42120tgccaaaagt
gtaaaatgca cacaatattt tgaagatttc atacaaaaaa gaatgcaaac 42180atctcattaa
taacttttat atagatcaca tgttgaaatg ataatgtttt ggatattaga 42240ttattactaa
aattaatttc acctatttct tttcactttt taaatgtggc tactagaata 42300tttagaattc
cataagtggc ttgcatttct ggctttcact cctgttggaa agcactgagt 42360tagactgtgt
agtacgtcta tttaagactg cagtttccag gccgaacacc gtggctcacg 42420cctataatcc
cagcactttg ggaggccgag gcgggcagat cacctgaggt caggagtttg 42480agataagcct
ggctaacgtg gtgaaaccct gtctctacta aaaatacaga aattagccag 42540gtgtggtagt
gcatgcctgt agtcccagct actagggagg ctgaggcagg agaatctctt 42600gaacccagaa
ggggaggttg cagtgagcca agatcaagcc actgcactcc agcctagatg 42660acagagcaag
actccatctc aaaaaaaaaa aagtagaata aaaataaata aataaataaa 42720gactgcagtt
tctgggagac tctgaggcag gcattagcct tctctgcaga gagtacttgc 42780agcagggagc
agcagttttg atgtcctcaa aaggagccaa tttcatttgg gtagggttgc 42840ctctgagtat
tctagcagta cagacagaaa ggagagaagg ctgtttccag aaagcagaga 42900tcatacgaat
tacttgtgag accaaacttg ttcctcaggt gaagctcagg catcccttat 42960gtggagtgtc
taacagtcta cacctgagga tgttggacat aagggggtgt gaggtgggca 43020tggctgggga
gagctctggg agggggaaaa ccagctccat gttgtccacc cactgaaagg 43080aaagctccct
ctgggggagg tagatgcccc ctggccaggc ctgcagggcc ctgctcactg 43140tgagccctgt
gtggtcctgg cctgggtccc accagccatt gccaggcaac agctcccagt 43200tggaaaacag
agcaaggctc cctcttagaa aaaaaaaaaa gaaagaaaga aaagaaaaga 43260aatacaacag
gtaactaagc atgacggctc acgcctgaaa tcccagctac ttgggaggcc 43320aaggcagagg
attgcttgag actgggaggt tgaggcagca gtgagccagg attctgcaat 43380tgcactccag
cctgggtgac aaagtgagac cctagtaaaa aaaaaaaaaa tagagacaga 43440gaaagaaaga
catgcaacag ggccaggcgc agtgactcat acctgtgatc ccaacacttt 43500gggaggcaga
gaagggagga ttgcttaaga ccaggagtgc aagaccaacc tgggcaacat 43560ggcaaaaacc
catctcttca aaaaataaaa aaattagcct gttgtggtgg tgcgcaccta 43620tagtcccaga
tattcaggga gcttgaacca ggtccaggct gcagtaagcc atgatcgtgc 43680cactgcactc
cagcctgggt gacagagcga gaccttgtga gaaagaaaag aaagaaggga 43740aggaaggaag
gagggaagga gggaaggagg gaggaaggga ggaaggaaga atataggacc 43800caaaggccta
aatgccccta ctgtgcccca gttctgcgtg actcaggacc agcctcctcc 43860acactcccac
caccacaacc ctgcacccta cttgttcctg ggggccccaa ggggagcctc 43920accagaagcc
tcctcataaa cccactgccc cttacctttc ctgtctttct agaagcctca 43980gaagccttgc
cactctaagg acacctccat ctgagccaag gcgctcgctc cagatgtccc 44040agagctcctg
gtcctgggtg tccctgccac acaacccccc atggagccct gctctggctc 44100aagccccctg
actgtgcatg agcaggcctg ttgccctcac tgggactgtc cagagccttc 44160ccatctctct
ggagggactt ccatcagttt ctgccccttc tcctctgcca agaactcacg 44220ttcagtctga
tagcagaaga atcatctggc accctcctga atggaaccca gagtacctcc 44280tttgtggacc
ggtctctgga ttttccccac tctctccctt cagccatgct gatggcagag 44340aaggtaagaa
cttccagccc acttctctgg cgaggggaac ttgtcatctg ggtctgcaga 44400gaaggttcca
ccttatgctc atagtacatt atctttacta tgtactagga tatcacattt 44460aaaaggacaa
aaaaggccag gcagtggctc atgcttgtaa tcctagcact ttgggaggct 44520gaggcaggtg
gattacctga ggccaggagt tcaagaccag cctgaccaac atggcgaaac 44580cccatctcta
ttaaaaatac aaaaattagc tgggtgtcgt ggcatgtgcc tacaatccca 44640actacttggg
aggctgaagc aagagaatca cttgaaccca ggaggcagag gatgcagtga 44700gctgagatcg
tgccactgca caccagcctg ggcgacaaac cgagactcca tctcaaaaaa 44760taataataat
aaaatacaac aaaataaaag aacaaaaaaa aagaaatgta aaatacttga 44820aggggcttgt
ataacattaa taggattgac agtatctgct ttccaggctg aagtgattca 44880ttcattattc
tagacgtctt tagtcctttg caatttgtgg taattaggct tttcttttta 44940acattaaaaa
tatacaaaaa taaaaggcaa aaaaagcatc atcccattag tctgaccttc 45000ccctcctcca
tccctgcccc aacaccctga agaccctgga tgcaaacaaa ggcccgaggg 45060agcctcttcc
ctcgcagtgc aggcctcacc tggggctcag agtcagaatc tgcattttat 45120tccctaggac
aacctctagt cagggcagag gccggctgtg ctgcccaagt gccctaaccc 45180tagctttgag
gcaccagaag ggcaaatgca aattaaaaat gagaataagt ttattctcct 45240tggtgaaaaa
aaaaaaaaaa gactttcccc tctccttttt ctttagaaaa tctatcattg 45300caagttcctt
cctggacttt ttttatgtag atctgttcaa aagctaaata agcctctttc 45360aagtttcaca
tcccaggaat gtctccttaa ggacctagga gccaccattt gaagtgtaat 45420caccaaggga
gatacatcct tatctcccag tttccgtggg caaaggggag cctaacttta 45480gcccggtgcc
tagctcaagt tgcaaacaca cttccagtct taaaggaatg aatttatttt 45540ttttccttta
ggcaaaccca ggtagccacc acagttacct ggggattcac agagaactgt 45600gtgtgaccac
tggtgctgtc aagtcctctt acctgagcac ctgtgacgtt tcccttgaga 45660acgtgtacgg
gatgggttgc acctggttat atacaagcgt gagacttctt tctgcctttg 45720taatttatta
gcagattatc tgtgatgagc atcgcaatct gtttaatgcc tattcaataa 45780ttaaattttt
ctttctcttc ttttgtggaa aggttttctg cattggcagg agatttttgt 45840tttcgattat
gtccccaaca tgcctgatgt tccacccctc aagagcctca gccttgccca 45900gggagggcat
gggggtgagt ggcctctccc acagagagtg ctggccaagt tggcccaggt 45960gcgcagcaag
ggctgctgcc
45980718999DNAHomo sapiens 7ccctcctcca tccctgcccc aacaccctga agaccctgga
tgcaaacaaa ggcccgaggg 60agcctcttcc ctcgcagtgc aggcctcacc tggggctcag
agtcagaatc tgcattttat 120tccctaggac aacctctagt cagggcagag gccggctgtg
ctgcccaagt gccctaaccc 180tagctttgag gcaccagaag ggcaaatgca aattaaaaat
gagaataagt ttattctcct 240tggtgaaaaa aaaaaaaaaa gactttcccc tctccttttt
ctttagaaaa tctatcattg 300caagttcctt cctggacttt ttttatgtag atctgttcaa
aagctaaata agcctctttc 360aagtttcaca tcccaggaat gtctccttaa ggacctagga
gccaccattt gaagtgtaat 420caccaaggga gatacatcct tatctcccag tttccgtggg
caaaggggag cctaacttta 480gcccggtgcc tagctcaagt tgcaaacaca cttccagtct
taaaggaatg aatttatttt 540ttttccttta ggcaaaccca ggtagccacc acagttacct
ggggattcac agagaactgt 600gtgtgaccac tggtgctgtc aagtcctctt acctgagcac
ctgtgacgtt tcccttgaga 660acgtgtacgg gatgggttgc acctggttat atacaagcgt
gagacttctt tctgcctttg 720taatttatta gcagattatc tgtgatgagc atcgcaatct
gtttaatgcc tattcaataa 780ttaaattttt ctttctcttc ttttgtggaa aggttttctg
cattggcagg agatttttgt 840tttcgattat gtccccaaca tgcctgatgt tccacccctc
aagagcctca gccttgccca 900gggagggcat gggggtgagt ggcctctccc acagagagtg
ctggccaagt tggcccaggt 960gcgcagcaag ggctgctgcc caaaggctcc ctcctggttg
gcatgggtcg ggaccctgtt 1020gtgttgtgtt ttcgctcttt ttcgtagagt tcaagggggt
cctgctatgt tgtccagact 1080ggtcttgaac tgacctcaag ggatcctctc gtctcagcct
cccaaagtgc tgggattact 1140gtgcccagct ttgtgttgta ttttctgatc ttatcctgca
acctcttgag cccccaacct 1200gggccccagt tcctgctgtg ccccagcctg ccagccctct
ctctctgcat attctttctt 1260tagctgagtt aacaccactg ataaggttaa agacaggctc
ttaaatttct gccctggcat 1320gagaaatatg tgacccacat gcttctccag cttagctgtc
cagtgtaact gtcagggact 1380gatgggcgcg tgctggccca cagcccacct cagtcctgac
cctccctgac aggctgagag 1440aggccccagc ctgaacctgg actcccccat gttctgatat
tcctgcacaa gagtgcagag 1500gcctggttaa gctggagaaa cataaggaat aggtaggtct
gcacacactc acctcttcct 1560ttgcagtgaa ccttctagaa tcttctagat ggaaaagctg
ggggtgtgga ggtgtaggga 1620taggacagct gggggaggcc ttggccaagg tcaaggagta
ggcccagtct ccctctctgt 1680gtgcctgtct gggactcggt ttcctgtctg tgaagcaggg
ctggacggga tattgacagc 1740acctgatggt cattgagctc ctctgcccca ggcactcagc
tgctgggcac agtgcacacg 1800tggcagtccg gtgccctctc acgctccgtg atgactgagt
ctgtagttac acccctggcc 1860tcagaataaa gactacactt tctgcctccc tcactggcag
gtatgactag gtgtggtggc 1920agttttctcc ttaagagaca gatgtttgtg cctccctcca
acccgctggc taacacctag 1980ctggcacaca gcctcctggg gctatgaaga tgagggccac
agccacaggg tgggggagcc 2040gtgagctggg tctggctgcg tctctgacat atgggggcat
cacacatcac ctctacctcc 2100catcgaatgc tacacgaaga gaacaaactc cacctgatgg
aagctgctgt tgtttgaagt 2160ctttcatgct cacaacagaa cctaacccca accaatacag
tatgagtatt ggccccacgt 2220ggttaagcaa gctgtccaag gttacacaca gctgggaggt
ggtggagctg ggtttgagcc 2280tgttattgac ctttgtgcag acagacctca gagcagagca
caaggcagca aggctgtggg 2340tctggggctc cctctccagg agaatcaact ggctgcacac
agcctggaga gcccatgggc 2400aacctgagtc cttgcacctg gaagtttctg tgtcccacac
atatccagga gcttaaaatg 2460aagatgtctg aattacccaa cctcttgata gcaccaaccc
aaccttccca gcctcctctt 2520ctgaggtcag cccagagcaa gccccttgca aagctgattt
aactcagaac cactgggcat 2580acccacaggg cagtgaccct gcagccctcg atcaaatgtg
cagatggact tgggggtggg 2640ctggtacccc agatggcctc attctcccag ggttgcagag
cccctgaaag ccacagccct 2700gtgtgcacac cactggggag tcatcacagg atacttcaag
aattcagtgc caggcaaggt 2760ggctcatggc tgtaatccca gcacttcggg aggctgaagc
gggcagatca cctgaggtca 2820ggagctagag accaccctgg tcaacatagg gaaaccccat
ctctactaaa aatacaaaaa 2880ttatctgggc gtggtggcgg gtgcctgtaa tcccagctac
tcaggaggct gagaccggaa 2940aatcgcttga gcctgggagg cagaggttgc agtgagctga
gattgcactg ctgcactcca 3000gcttggggga cagagtaaga ctccatctca gaaaaaagag
ttctgtgtat catttaatgt 3060ggagatcctc ccatcacgag gatgaggctg tttctctact
ccccagatct gggctggcct 3120gtggtttgtt gacctcagcc ttgtagttct cactttcctg
gaacctgaat gccaccacgc 3180gacatccata agacaaagcc caggataaaa gatcacttgg
agagacaggc ctggcctggc 3240accaccccgg ctgaggctgg acccctggga aggagactct
gatggacctc cagacccagt 3300caaatgacca cttccaaggt caggcaagaa gggacaaaga
gccactggct cagcccacag 3360catctgagaa ataagaaacc gctgcatttt ttgagccagt
aagatttgac aggtttgttt 3420tgcagcaata gatgagtggt acctcatctt agcccatgtt
ctgatgaaga caaacagtag 3480cattgacaaa gttttaagaa aagttaacca aaaactggga
ttcctttctt cattttgacc 3540ctttgttaca agaaacagag gcccacccca ccagactcac
tgttcactgg tccctgagtg 3600cctgtgagtc tcagtgggag ttaccttgag accagccctt
ctgagtggag ggtgctgggt 3660gctgaggtca agtcgagctc agtccaggct aaaaggagag
cagctctggc caggctgtca 3720gggctgtggc ctccccaaga acctcctacc ctggcccctc
caggctttgc tgctatggtt 3780gtgtgagggg agttgctgtc ccagcattct ggcccccttg
cccccagccc ctccctgacc 3840tccacgggct tcaggcctca gtccagagtc acctcctcta
ggaagccatc ccccagtgca 3900agtctgggca acattcctcc ttgcctggcc cacctgctca
ctctcatgct atggctttct 3960gtaagcaaac acaaagatag gaacaactct gtccctggca
cagagcagat gctctggcaa 4020tatctcatga gtgaatgaag gcacatgaca aacctccaga
cctgtggaga ctgaaggctg 4080agagccttta tagatgctgt ggggccgagg agtttgccaa
ctacagcagg tcatgcccag 4140aggtttctct ctgggtagca aggtgtgtct cccaccaaag
gccattggca tggggcccgc 4200cctgctgacc cgaggcagtg cacagcagag gccagatgca
gtgagaagga gcctctcctt 4260ggcctgctgt ctgctgccat gcctgtgggg gcgtggacac
aagtgtgtgg catagaaggt 4320ggtgtggcag gtgagaggtt gggggtgtgt atgtagcagg
tgtctgtgtg tgtatgtgca 4380tgtgggggtg tgtgtgcatg catgtgtgtg tgtgcatatg
cacgtgtgtg catatgcatg 4440tgtgtgcatg gagagagaag acctcctctt tctggcccct
ctcctagctg cccccctccc 4500tcctgctgcc aacacactgt caacccttca ctgtcttttt
ccttgggact cgttgatctg 4560tctctaccat cccaggtgtc tggagcagcc tctaaccttc
catctgccaa ggtacttcag 4620ccccacccct cccagctgtg gaatgtcccc taggatgtgc
cactgacaca aagagccaca 4680cagctccaaa atagaatatt atctaaccca ctgctccctt
tgctgtcagc aacacctcca 4740ccatgcttct cccaggaccc cccttgaact ctctgcttcc
tccctgaggc caaaggaaag 4800acaggaaagg ggccaccttc ctgtccttgg gtcccacaga
gatgtatcct tgtaatgaaa 4860cctactttat gcttgagttg tatccagtta gtttctgtgg
cttgcaatca agacccacac 4920ccacctcaac ccaggctcta gagagtagac ccttgttttt
gcctggcttg ggtcgacctg 4980gcacctgcca gggtcccagc ctctgagtca gcccaccttg
ccctcatcgg tgccacctcc 5040aggcggctgt acatagactc tggcttctgc cctggcctgg
cctctgggaa ctgcagctgt 5100ctgcttccat cctatgtgga tggtgcctga aagtgaatag
ggatcagtta ccagcccagt 5160atctgtcccc ttctcaatag cactgattcc tatggggaac
tgcttttctt ggactatgta 5220tgggtttggt gggagggtag ttcctgtaac caaccctaca
gggtgtagga acctagactc 5280tcagcaacat aacaggcagc aggctcccaa gctaagtctg
gccagctggg ccacctctcc 5340cagattctgt ttcatgagag catcatccaa gagcagtggg
aacactgggg acggtccagc 5400ctaggactgg tatgcagatc agagaatccc agatagaagg
tgattgctgt tcttccagtt 5460tcttggccct ccagagcaac catacttccc atctgcccca
aaacctgatc ctccaaactc 5520ccaccatttc tgtgcatccc caatatctaa tagatcaact
gcctttcatt tacatttgtc 5580acaaccaaat gatacacctg cccttcaccc actactgaac
tgcagctggg ttagtccaaa 5640ttcagggccc acgtgtcatt tcaagcctgt cttgaataat
gtacaccttc ctgcaatgtg 5700aggatggcca ccaccttggt cttataccca cgggtgtcct
gagctacatt tctcataatc 5760aaaaataaac tcaacacatc actccagcct gagcaacaga
gcaagacact agctctaaaa 5820ataaaaaata aaaacaaaca aatgaaaaac ccagcaaact
tggggaaaga ggaagcacct 5880gatttccaga gtttccacat catgagatgc aaatgtccag
ttttcaacaa caacaacaac 5940aacaaaaaaa aaatcacaag gcatacaaag aaataggaga
ctaagaccca ctcaaaggaa 6000aagaataaat aagcagaagc cataccagag gaaaaccaga
tggctgactt actagacaaa 6060tactttaaaa caactgtctt aaagatgctt gaagagctaa
aggaaaatgt gaacaaagtc 6120aagaaagtga tggaacaaat ggaaattcca ataaagtgat
agaaaacttt ttggagtttt 6180ttttcttggt agcaaaaaat tatgaagctg aagaatacaa
taaattccct agagggcttc 6240aaaggcagat gtaagcaaac ttggccaggt gcagtggctc
atgctcataa tccagcactt 6300tggaaggctg aggcaggagg attgcttgag cccaggagtt
tgaaaccagc ctgggcaaca 6360tagaaaaacc ctatctttaa aaaaacttat ataaaattta
aaaattataa aatttattta 6420aaaaatcagc aatttgaaga ctggacaggg aaattatcaa
atttgaggaa cagaaaggaa 6480aaagatggaa gaaaaataaa cagagcctaa gagacctgcg
ggacaccatc aagcagacta 6540atacccattg tggaaattcc agaaagaaaa gagagtgaag
gaccagagag attattagga 6600gaaataatgg ctgaaaatgt ctcaaatttg atgaatgaca
tgaatatgaa cattcaaaaa 6660tctcgacaaa ctccaagtag gaaaaactca aagatactca
tactgagatt catcataatc 6720aaactgctga aagccaaaga caaggagaca atatcaaaag
ctgcaagaga gaagtgactc 6780atcacataca agggatcttc aaaaagatta tcagatatct
tggctgggca cggtggctca 6840cacctgtaat cttagcactt tgggaggccg aggcaggtgg
atcacttgag gtcaggagtt 6900tgagaccagc ctggccaaca tggcaaaaac ccatctccat
taaaaataca aagattggtg 6960aggcatggtg gtgcatgcct gtaatcccag ctactcggga
ggctgaagca ggagaatcac 7020ttgaacctgg gaggcggagg gtgcaccaag ccaagatcgt
gccaccactg cactccagcc 7080tgggtgacag agtgtgacct tgtttcaaaa aaaaaagaaa
aagaaaaaga aaaaaaagat 7140catcagctat ctcatcagaa acctcagagg ccaaaaggca
gtagattgat atattcaaag 7200tgctaaaaga aaaaaataaa tctgtcagct gagaatcctg
tatctgtatc tcacttaacc 7260attattttaa aataagggaa aatgaagaca ttcccagata
aacacaagct gagggagttc 7320attatcacta gatctgccct gcaaagaaag ccaaagaaag
cctttcagga tgaaatgaaa 7380ggatactaga cagtgactca aagctgaata aagaggccag
gcatagtggc tcacacctgt 7440aatctcagca ctttgggagg ctgagatggg cggatcacct
gaggagttgg agaccagcct 7500ggctaatatg gtggaacccc atctctacga aaaatacaaa
aattagccag gtgtggtggc 7560acatgcctgt aatcccagct acttgggagg ctgaggcaag
agaatcacct gaacccagga 7620ggcggaggtt gcagtgagcc gagattgtgc caccgcactc
cagcctgggt gacagagtga 7680taccctgtct caaaaaaaaa agccgaataa acgaataaag
atctcatcta tggccgtacc 7740accctgaatg tgtccaatct cagaagctaa gcagagttgg
gcctggttag tacttggagg 7800ggagaaataa cggtctatgc taaaggaaaa ttcaggtgca
attaaagtaa aattaattat 7860ataaaagaga atacattaaa agctagtatt attgtaactt
tggtttgtaa ttccaccaag 7920tggaatttgt tcctgaaatg ctagaatggt tcaacataaa
aatcaataaa tgtaatagac 7980cacattaaca gaaaaaaaac ccacacggtc atctcaattg
atgtcaaaaa agtatttgac 8040aaaattcaac actcttttga aagaagaaaa agctcaacaa
actaagaata ggaggaaact 8100acctcaaata ataaaatcca taggccaaat ccccaaactc
acagctagca acatatttaa 8160tgctaaagac tgaaagcttc ccctttaaga tccggaataa
gacaaagatg cccactttca 8220ccacttctac tcaacatagt atgggaagtt ctagccagag
taatcaggta agaaaaaaga 8280aataaaaagc atctgaattg gaaaggaaaa agtaaaatta
tttgtttgcc caatacatgt 8340acaatgtttc aggtgaaggc tcagaacagt acaaccttac
cagcaagagt cctgctgtct 8400ctgtgtgaat cccagctatt actcactagc tacatgatct
ctcttgccct ccctgcctca 8460atttcctcat gtgtaaagtg ggagaaaaat aatagttcat
gcttcaaagg ttttttgttt 8520gtttgcttgc tttgagacag cgtctggctc tgtcgctcag
gctgaagtgc agtggtgcaa 8580tcttaggtca ctgcaacctc agcctcctgg gcttaagcga
tcctcccacc tcggcctccc 8640aaagtgttgg gatacaggcg tgaaccactg tgtctgaccc
aaaggattat ttgaggagca 8700gatgaattaa tgtgtcataa cctcaaagca gttgcaaagg
cgtttaataa ttaaaatatc 8760acattttaaa ttaaaatata aggctgggcg tggtggctca
tgcctgtaat cccagcactt 8820tgggaggctg aggtgggagg atcacttgag cccaggagtt
ccacactagc ctgggcacca 8880ttgggagacc ctgtctctac acacacacgc acacacacac
acacacacac aaacttaaag 8940tagccaggcg tggtgctgcg cgcctgttgt cccagctact
cgggaggctg aggcgggaga 9000atcactggag cctgggagtt cgaggctgca gtgagccgag
atcgcaccac tgcactccag 9060cctgggccac agagcaagac gctgcctcaa acaaacaaac
aaaaacaaaa attaaaatat 9120taagtaataa ttaacgagtg ttaatatcca ctcgttgtgg
agacaagacc tggacttagg 9180aaacaggccc agggaagtag cagaacagta gcgctagagg
acgcctggga gaatcagcgc 9240gcggcgggaa gagcccggga agcttagtgg ggaagcgtct
cttgatgggg tgaggaattc 9300tataaattag tggagatgga aaaaaaaaaa aaaaagtatt
cccaaagtgg gagacagcac 9360tcagaaagac gtggtggtaa gaacgagtat gagtaacggg
gacaacgagg acactggaga 9420ttggggagtg ttgggctgga agctggtgtg cagctgtggg
caagctaggg aggaccccga 9480aaccgccaat gcgtttcccg gacgcagacg ctggcaggac
gggaggaacc ccgagacccc 9540gcgccatccc ttcaggaaga gttacttctc cccggccaag
ttagtgggcc ttgggccttc 9600tttctgttgg gatcctcctc gcgtgtcgcc atcgctacaa
gtgggcagct ctgcggggaa 9660agctgggacg ctgggggctt caccaaggag gctggcggcc
gaccactggg aggtctggcg 9720gggtgacgac cactgggagg tttgggcagg gcctgacggg
gtgacgcggt cagcccactg 9780gaggccgaca ccccccgtca gcccaacccc tgcacgcgcg
gccgccaacc aaagacccgc 9840ggcgccggcc tgcgagcccc cgccccgcgt tgcccaggaa
accgagggtg tggctccgcg 9900ttctctgggc gtcccaggga ctgggcgcac agtggtcggc
gggatgaggc gcctggtgac 9960ggacggggcg aggagggcag cgattggtga gattaggcga
tgggcgggga agccgcgcgg 10020ggattagcga gttgcggcga tgggcggggc aggcgcgcgg
ggattggcgg gatgcggcgc 10080gccgcgcgtt gagtggggtc cagggaaacg gggtcagctg
ggggtggcag ttccaggccg 10140cgaggccggg ctcctgggtc ggtgggctgg tgtcttggcg
gacgtcccgc agctgccgcg 10200tggatccgag ccggggcacc cgccgtgact gggacagccc
ccagggcgct ctcggcccca 10260tcccgagtag cgcggcctgg ctgctgccgc catcaagcac
gttcgagcca aaagctccta 10320acgagtcact cgttagacac gtgtgcggag cctgtgtccc
aggccagtgc tgtcccgtgg 10380agatagattg caagccgcta gggaattttt taactttcta
gtaggtgtac gaaaaaagta 10440aaacgaaaca aatcaattgg agtaaatcca taaatatatt
caaactatta tttcaattgt 10500atgtgaaaaa attattggga tattctttgt actattctta
gaaatccatt gtgtgtccaa 10560cccaaacatc acagttggac tcaccacatc tcctgtactt
cgtagcccta ggtggctagt 10620ggcataagac acaaaaatct cagctctcct ggagcttatg
gtctagttgg agcaggcaga 10680caatacattt aaaatataca gtttgttaga aggtaaatgt
tgtaaacaac aataacagtt 10740gaagtactgg ggagagttgc agttgtaaat cagatgggca
gggcacaagg taacatttga 10800gtaaagatgt aagaacttga aggagatggg caagtgagct
ctataagtat acgggagagg 10860ggcaagcaag agttcagagg ccccttgctg tggggaggga
tccaaggtgg aggagtggga 10920accaggaggg gagaggacca gtggagcaga tctcataggc
agttgtaagg acttggggcc 10980ttattcaatg aaatgaggac actttggaga gttttgaaca
gagcagtgac tgatttatgt 11040tttggttttg gtttagttct attattattt aataataggc
ttattatttc acagaagttt 11100tatttaataa ggcagacctc ttgtctggaa atgagacagg
tgccggagag ctggatggag 11160gcagatcggg aattccattt ggggcaaact gaacttgatt
gagaccctgg tagttgtcca 11220gatggaacag gacacctgag tctagggttc gggaagaact
ccagatggga caaacactcc 11280tagctttcct tttctctttt tggatgaccg ctacagggtg
agacatcggt atccaggcac 11340gataaatttc caagtggaca caatgtctgg tgtcaactac
agctgttctc cttcttttcc 11400cagtatcctt tgggtgcagt gagacaccag gagagctgct
gctttggggg atggacaggg 11460gcagcaggaa tgcctttgtg ttttcgcagt gaacctcctt
ggcctgggcg aagctgtgtg 11520gaccaagcaa gtcaggagtg tggccatgtt ttctgagcag
gctgcccaga gggcccacac 11580tctactgtcc ccaccatcag ccaacaatgc cacctttgcc
cgggtgccag tggcaaccta 11640caccaactcc tcacaaccct tccggctagg agagcgcagc
tttagccggc agtatgccca 11700catttatgcc acccgcctca tccaaatgag acccttcctg
gagaaccggg cccagcagca 11760ctggggtaag tgagagtttg ggaaggtgct tcccccacag
catccctgaa cttagaagtg 11820ttctgcaaga gaatgggaac agtttatcta attgatccca
cttcctgtta ccttgggaaa 11880attaacctct ttttccctca gtttcttctt aagatagtaa
caaggattaa attaagtaat 11940ttgtgggttt ggagttagtt ttagttcaga ggctggttgg
agatgaggac ttagttctgg 12000cggtgatggc gattacttca ctggcagagg aaaatggttt
tcctatcttc agtgcagatt 12060attcaggtat ttgcctgtgc tgtagccaga gagcccctca
gtgtggcaag cctggcgcca 12120ggcaccagga gccaagactg gtgaggatgc actctctggt
ctcgagggga ccccctctgt 12180tcactcatgt ctgtttgcct ctcctcctgg cccccatatt
tgctggccat gaattttcct 12240gtcccttggg ccctctgtct ttcctaataa agtggcctgc
ccaacacaac ccttgttctt 12300tgcccccatt tcttccctgg tgatctctcc tgcagttgga
ttactcttgg tggtgaagca 12360gggaccccca tctccccctt tgagtttatt tgagttttag
gtgctgctgc attcccccat 12420tcctaccact tacataagag tggctttcca ggtaattttc
aaatccatct cctattatat 12480ttttaaactg aggatttagt aggtgagacc aggtcttact
catttttact gtccttggca 12540ccaggcaaaa tggatctcag ccctagttgc acattggaat
cccctgggga gctttgagaa 12600gcccatctca tcccatgcca agccaagatc aattctcgtt
ataggcaggc aggagaaccc 12660tgggcctaga aatctagcta gaacctcaaa ttcattaggg
atatgtatta gtccattttc 12720acattgctat aaaaaactac ctgagatagg gtaatttata
aagaaaagag gtttaattga 12780ctcacagttc ctcatggctg gggaggcctc aggaaactta
acaatcatgg cagaaggtga 12840agggaaagca aggctctttt acatgatagc aggagagaga
gagcaagggg aactgccaac 12900catttttaaa ccatcagatc gcatgatggc ttgatctcac
tcaccatcac aagaacagca 12960tgggggaaat ccacccccac aatccagtca cctcccacca
ggtccctccg tcaacaccgt 13020gtggattata attccagatg agatgtgggt ggggacacag
agccaaatca tatcaggatg 13080ttttctgttt tgtttacctg agacaaagtg ctgttcacct
ctcctctccc acataatcag 13140gggctccctc ctgcggctcc ggtagctttt cctcactttc
ctttcagccc tcgggacacc 13200ttccttggct cctttcagag ctcagttact acttgggccc
aatgtcaatg ccaccttcta 13260gattctttcc ggcagcacct cctctggtcg cacatttctc
ttccagttat tggagctgtc 13320aaaaaagctc cccagtgatg gacgatagcg atttcactgt
gctcacagac tggtcaggaa 13380accaaacagc tgccacagtg aatgtgttga tagcagcggg
gcagcagtag cactcgctca 13440caggcctggt ggttggtgct ggcccccacc ctgaatacct
acatgtggct tctccatgtg 13500gcctgtgcat cctcactgaa gctcagcctg tctctccaaa
ttggtctttc cactcacctg 13560ttccccaaac ctgcccagac cttcctgctg taggcttttc
ccttcacttg gcacactctt 13620tcccttgtct tcccatggcc ccatctaagc cccactgtca
gctgaagtgt tatattcttt 13680gaggggccac ctgaagccac cttgcaatga gggcctccgt
tttctacctc agctcaccat 13740ttgttcacag cacttgtcac tgtggcgagt tacttgtcta
tggcctgttg tcgttctcct 13800gcctagaccc agtgggctga gtgggggcaa gtgttggctt
ttatgtccag ttttgatctt 13860ggtgccagca cattgcctgg gtggaagcat gtcctactat
cggttacagg gatgtcattc 13920tgcccagtgc tcaggggcat acacttggat cccagttgtg
tgcccttgga cacattgctt 13980aacctctctg tgcatcagtt gggtgataat atctactcct
ggcacatttt cagcgttggc 14040tgagttacat gtacagtgct taggccacct gggggagagt
aagagtggga tacgtgagga 14100tgtggagtct gttgcatttc tgtctgctgc tggcatcctt
cttgtcttgt tttgagttgc 14160tcgcctctgt ctgctcccta gggcgtagat ttgaggaata
ttcctggttc ttcccaggca 14220gcaggggctc aggctgtgct ggagtcagct aggctaaggg
gctggtctgg catccgcgtt 14280gtcctgtcac ctccttggtg ttttctccag gcctggatct
gtgctgtgtg ggcacctgta 14340ttcctccctc ctgccctcac tgattctcca tacctttctt
ctcgagagtg ccaagcccct 14400cccatgtgtt cttgttcata cctaggatcc cgggaagggg
ctggggaaga cggtgcccag 14460gtgccctggg taaacaaagc cacctgactc cacgggaatg
gaatgggtgg aggggatctg 14520aggtctgcat tttgagtatc tctggtctca gaggatgaag
catttggtgg gggttggggg 14580tggggggtag ggtggaagaa tctaaagtct taaaagaaaa
tggcagttat ttgtgggaca 14640gggctgtgtt gagacttggc atgcttcttt ttaagagtca
gtgttgtaat ttaggtataa 14700gtgaagcagt actttgtatt agtttcctgt aggcgctgta
acaaagcacc acaaactggt 14760tgacttaaaa caacagacat ggccgggcac ggtggctcac
gactgtaatc ccagcacttt 14820gggaggccga ggcgggcaga tcacaaggtc aagagattga
gaccatcctg gctaacacgg 14880tgaaaccctg tctctactaa aaatacaaaa aaaaaaaaat
tagctgggcg tggtggcaca 14940cgcctgtagt cccagctact cgggaggctg aggcaggaga
atggcgtgaa cccgggaggc 15000ggagcttgca gtgagctgag atcgcgccac tgcactccag
cctggatgac agcgagactc 15060cgcctcaaaa caaaaacaaa aacagaaaca acaataacag
aaaaacacag acatttactc 15120tctggcagtt ctggaggcca gaagttgaaa tccagatgtc
agcaggattg gctccttctg 15180aaggcccgag gggagggtcc ttcctggcct cctccctggt
gttcctgggc ttgtggccgc 15240atcactccgc tctgcccgtc ttcacactcc ctcttgtctg
tgtgtctgtc tctctgttct 15300catgaggaca cttggcatcc agggcccaac cacacccaga
gtccctggtc tcctgtggct 15360gactcacttt ttactgtcac cgtgaagtcc agggggtcct
tgtacttgat gttctctcct 15420ggcaaggcca gggccctgtg attggcctct catggagtgc
tgggcagggc ctccatggcc 15480tctgtcgggc gggggggcta cttcatctct gagtctgtac
ccctcgtgtc ccaggcagtg 15540gagtgggagt gaagaagctg tgtgaactgc agcctgagga
gaagtgctgt gtggtgggca 15600ctctgttcaa ggccatgccg ctgcagccct ccatcctgcg
ggaggtcagc gaggaggtga 15660ggcagggtgc tacacagtgg ggccgccagg cagacctggc
ctcccactag aacacctccc 15720tggaggtggg gttgtgggga agcaggttca gagacaatgg
actccagagg ggtgggggct 15780gcggtgccag ctcactaaca ccagagcttt ggtgggctct
ggccccaaga ttatacctcc 15840tgtctctgca ttccagcaca acctgctccc ccagcctcct
cggagtaaat acatacaccc 15900agatgacgag ctggtcttgg aagatgaact gcagcgtatc
aaactaaaag gcaccattga 15960cgtgtcaaag ctggttacgg gtagggagcc caatgagagg
atgtgggtga tgcaggtgaa 16020gagcccagcg gtggtgtgtt agggatggtg tgagtgggga
gcctgggggg agtggggggg 16080tgtggcctgg gcacacgtgt gttcttgagg aggtaggtga
ggctccaggc ggtcggaggc 16140catcagattg ggtgagacct ggctgggaga tgggtctccc
cacctccatc caagggcagt 16200gactccagga agcaggcatg catcctggag tcctaggtga
gaattcacca atgtggttgt 16260ggagaactgg cttgttttgc ccgttggggt gactggaagg
agtggtagca cctggggctc 16320cctgctcagg cctgatgcca ctgctcccca gggactgtcc
tggctgtgtt tggctccgtg 16380agagacgacg ggaagtttct ggtggaggac tattgctttg
ctgaccttgc tccccagaag 16440cccgcacccc cacttgacac agataggtga gcagcagttc
tcgggagctg gaaccagctc 16500atggtcagtg gaatctttga gttgcaccta ggaggggctg
cctcccttct cggcaccctg 16560gaggacccca ccttctcccg caggtttgtg ctactggtgt
ccggcctggg cctgggtggc 16620ggtggaggcg agagcctgct gggcacccag ctgctggtgg
atgtggtgac ggggcagctt 16680ggggacgaag gggagcagtg cagcgccgcc cacgtctccc
gggttatcct cgctggcaac 16740ctcctcagcc acagcaccca gagcagggat tctatcaata
aggtatggag cccacctggc 16800tgcattcagc cccagcccag gagcctgcaa gcctgtaaga
ccctccttcc ccagggcgag 16860tagggtaccc tgtgaggtct cgcaggtcgg tgggaagcgc
cctgcagtga ctctggggcc 16920tcctgcaatg gggctcctca tgcccaggcc ctcgctgagg
atggtgggag gcttgaaggg 16980agtgagggtc tatgggacaa caactgcatc ttccagctgg
tggggctcta ctctcctctg 17040agcctgggac tcgcctgggc ctgatggcct tctgggcttc
tattccaggc caaatacctc 17100accaagaaaa cccaggcagc cagcgtggag gctgttaaga
tgctggatga gatcctcctg 17160cagctgagcg tgagcgagct gggggctgga ggggtgatgg
ggattgcagt cttcaaagct 17220gccactgggc aacagaaggc aggcaggagg gcagggggag
tggccggagt tggtgtaggg 17280ggctccttcg gggccctgtg agctctccct gccctgtgcc
ttccaggcct cagtgcccgt 17340ggacgtgatg ccaggcgagt ttgatcccac caattacacg
ctcccccagc agcccctcca 17400cccctgcatg ttcccgctgg ccactgccta ctccacgctc
cagctggtca ccaaccccta 17460ccaggccacc attgatggag tcaggtagct ggcacagcca
cacttcagtc tgacccagcc 17520ttttgcctca ggaggcacaa agaagggagg ggagggaggg
cccaggaagg tggcagggct 17580gcagaggccc acctagcatc tgttccttct ctctggggca
tccccacaag agcgccagat 17640gagctctggg ctgaccacta tgggtggcac ccaaagccaa
gagtcagctg agctttgcct 17700tgcagatttt tggggacatc aggacagaac gtgagtgaca
ttttccgata cagcagcatg 17760gaggatcact tggagatcct ggagtggacc ctgcgggtcc
gtcacatcag ccccacagcc 17820ccggacactc taggtaacag gctcagccat acagggtggg
agcagagggc caggaggcct 17880ggcaggaccc tgaagtgcac agggtccccc tgtgggtttg
cacttgccag cattgctgag 17940aactgtctga ggagaagttc agaggcttgg cacctgctct
ggaagctact ctggaatctt 18000aattctaagg ccaatggctg cccaccccaa cgggcagcaa
cagcagggcc aaggtcttgt 18060gacaatgtct ggaggtgccc ctattgtcac actgggggtc
tcctactggc ctgcaatggg 18120aggaggggct gcagccccac atcctgtgca gagtgctagt
gctgaggcgg aaccctcctc 18180agagctgccc cttctcctct aggttgttac cccttctaca
aaactgaccc gttcatcttc 18240ccagagtgcc cgcatgtcta cttttgtggc aacaccccca
gctttggctc caaaatcatc 18300cgaggtaatt tttgtcttct gggggcccag gctgatttgc
tgatttgctc tcacctgggg 18360acaaggttca cagagaagaa aacctgcatt gtggagtccc
cctggccctt gtgggatgga 18420cagctgaggt cttctgcaca gctgccattt cactgtggga
gccaagctgc ctcgccagct 18480gggcagggac tggaacggct cccagcctgt gtgcctctca
aggctaatct ctggtctcct 18540attgtcactg ccccactgtg tgccaatggg gactcctgtt
tatttctggc agcttctctt 18600tgaggcagga cttacttgga acctacagtg ggtcctatgt
gacttctttg caggtcctga 18660ggaccagaca gtgctgttgg tgactgtccc tgacttcagt
gccacgcaga ccgcctgcct 18720tgtgaacctg cgcagcctgg cctgccagcc catcagcttc
tcgggcttcg gggcagagga 18780cgatgacctg ggaggcctgg ggctgggccc ctgactcaaa
aaagtggttt tgaccagaga 18840ggcccagatg gaggctgttc attccctgca gtgtcggcat
tgtaaataaa gcctggcact 18900tgctgatgcg agccttgagc cctgggcact ctggctatgg
gactcctgca ggggtgccca 18960cagtgaccat agcccatgca cccaccagcc ggtctccct
18999816161DNAHomo sapiens 8cagcagggcc aaggtcttgt
gacaatgtct ggaggtgccc ctattgtcac actgggggtc 60tcctactggc ctgcaatggg
aggaggggct gcagccccac atcctgtgca gagtgctagt 120gctgaggcgg aaccctcctc
agagctgccc cttctcctcc aggttgttac cccttctaca 180aaactgaccc gttcatcttc
ccagagtgcc cgcatgtcta cttttgtggc aacaccccca 240gctttggctc caaaatcatc
cgaggtaatt tttgtcttct gggggcccag gctgatttgc 300tgatttgctc tcacctgggg
acaaggttca cagagaagaa aacctgcatt gtggagtccc 360cctggccctt gtgggatgga
cagctgaggt cttctgcaca gctgccattt cactgtggga 420gccaagctgc ctcgccagct
gggcagggac tggaacggct cccagcctgt gtgcctctca 480aggctaatct ctggtctcct
attgtcactg ccccactgtg tgccaatggg gactcctgtt 540tatttctggc agcttctctt
tgaggcagga cttacttgga acctacagtg ggtcctatgt 600gacttctttg caggtcctga
ggaccagaca gtgctgttgg tgactgtccc tgacttcagt 660gccacgcaga ccgcctgcct
tgtgaacctg cgcagcctgg cctgccagcc catcagcttc 720tcgggcttcg gggcagagga
cgatgacctg ggaggcctgg ggctgggccc ctgactcaaa 780aaagtggttt tgaccagaga
ggcccagatg gaggctgttc attccctgca gtgtcggcat 840tgtaaataaa gcctgagcac
ttgctgatgc gagccttgag ccctgggcac tctggctatg 900ggactcctgc aggggtgccc
acagtgacca tagcccatgc acccaccagc cggtctccct 960cctccccatc cctgacacct
cagaatgtga gcagtccgtg ccatgagctt gttttattgg 1020agtgaccttg gctccctccc
tctgccccta ctccaacact gcagcaaccc catctcttac 1080gagactggca ggtggagcag
gagcctctac acagcctctg gctcttaggt cccagtcatg 1140tttgcacccc ctcaaagggg
caggaccagc ccttcctttc agtgtccata ccaggggcct 1200tccatgtgct gatgggtgat
gtgactgtgg tcagcaggct tgggaagtgc tgctgctgta 1260gcttgagttg ggctggggtc
ttggtaggac gctgatctca gaagtcccca aagttcactg 1320tgtaggtctc tactgttgtg
aaggggaatg cctggccagt ggctatctcc tcctctttct 1380cctcctcctc ctcttcctca
aactcgggtt ccagctgggt ctcgaactca ggctccaact 1440gggtctcaaa ctcgggctcc
accttggtcc caaactcggg ctccacctcg gtcccaaact 1500ctgtcaccac ctctgtgtag
gtctcagtct ccgactcctc ccagccagcg gtggttggcg 1560gtatgaggcc ccagggctct
atggtagtgc tcagggtggt ggcaggggca gggggcagcg 1620tgggaggcac agtgtggggg
cctagggtgg tggtggcgtt gaggcgccgc agccgcatct 1680gtgcccgaag ccgcaggcgg
tgttgtaggc gtcgctgctg caggcgtcgc tgttgggggg 1740tcatagggcg cgatgggtct
atgtgtggga taggccggtt cccgttcatg gccatgatct 1800cccggatgcg cttccagttg
gagcgagcca ggatgaagtt gcactgagtg gccccgatgt 1860catagtcaac attgcaggtc
ttggcgctcg gggtgtagcc ctccgcgtgg gctgtcacgc 1920ggtactcacc cgggttcaag
attcgccagt aatcaccacc actggctgcg gagggagaac 1980gatccggctg ccccagagcg
cccctcccag gcccccaccc tcccactcag tcctgccccc 2040agccccgccc tccccctctg
agttcccgcc cccagcaccg ccctccctct ctgaatttcg 2100cccccaggct ccccagactc
tacctgctcg ctgagttcct caagccccca ccctctctgg 2160cgggtcctcc ctcagaaaga
tggggtaaag gtgtgcacac taggtacctg tcttcacgcc 2220gtgattaatg ccactcacag
agatggtggc gttggcaatg gggatgcctt gctcgtccgt 2280caccaccccc ttaatgccgc
ggtgcaccta gggaagcagg tgagggctgc tggtcctcag 2340gaaggtccaa tgtggtccgc
tgctccctcc cgcccatcca ggagcctgtg cagcctcctc 2400tccccaggca ttgccctagc
caccccacct gctccatgaa ggtgagcagc gcctccttgt 2460tgttctccca ctcgcggggc
agctcactct catgagggaa cttgtcacag cccaggtaga 2520aggagagctc caggcagttg
gtatgcaggt aactgaagtc attgatagct ggccggggac 2580agatacagac ccaaagtcag
cccctctccg gaccaggccc cgcccacagc ccctcccagg 2640ctgactcact cccggtccgg
gggttccact tggccccgtt gacgatgccc atgccgccgg 2700tgtagtcctg ggcttggcag
cctccgcggt agggctcggt caaggtgagg tgtgcggagg 2760cgaaggagat ggcaagccac
cggaagatgg cgtggtctgg agtctcctgg gcctcggaga 2820cctcgtcctc atcctccccc
cgggctgctg ccatggctgc ggccagcagc tgctcctggg 2880taggcgtgcg ggccatatcg
taggggtagg atactagccg ctcgccgccg ttcagatttg 2940ctcccagcac gaaggggttc
ttctccatcc aggcaatgat ggcccggacc tccgtggata 3000cctggagtgg ccagcacgtg
tgaggccagg gctgcagctc cggccactat ccccaaccta 3060gcccgatcac cctccatgaa
gcttcacacc agtactcgca cgatcccctg tcccccaacc 3120cccagagcct cagcgtctgg
agttcaggca ccgtcagccc cacccccaag cccagaacac 3180caggacccca gggtccagct
gctccctcct gccctttcag ccaggctgta gcctcaccgt 3240ggcatctggc gaaaggtagc
gttcagggat gggcaagtta ttgttgggga cccggtaggg 3300gacccatttc ctctcctcag
ctccccagag cacagagttg agatccggga aatcttcaaa 3360gatgtcaaag ccctcctcag
tccacagtcc cagcgcccag ttcccaaact ctgagccctg 3420tggggagcca gcagggtagg
catcggctac ccacaccccc acaaccccca gctgcctgga 3480ccctggccag cctcaccctt
caacccacca tctgcgctgc cacctcgtag ccatcagggt 3540tcagtgaggg caccaggtgg
atgcgtgtgt cctgcaccag gctgcgcaca cgtgggttcc 3600catcgcggta ctctcggcac
aggtactgca tgagcagcag caacagctct cggcccagca 3660cctcgttgcc atggatccca
gcagtgtagc ggaactcggg ctcccctgca agggcgggag 3720cctcagtgag cactcagtct
cccgaggccc agggcagctg aggaaggacc cagacccacc 3780tcatacccga gggtctgggg
gacagctggg gctcctaggg ccctgtaaga caagccagaa 3840tccccagaga ggctccggaa
caggcgggag gcagtgagct ctgcacatca gcagcagagg 3900ccagctgctg gcccccacag
accctccccc agttcatgct ccccagggtt gtctgagatc 3960tccatggcat agatcttgag
gcctcgtgag ctcttgccca ggctgtaagt gcgggtgatg 4020gtggggcact cctcgttcac
caccttcatg agctggcgca gagggggagg acgtggaatc 4080aatcatgcaa tccgtccccc
gctgaccatg ccccttccac ttccagggcc tgctctatgg 4140cgagggacgg gcatgacccc
ttcacgcagc ccccaggtac tggcctcctt cctaaggtga 4200gggacagcca gcatccctgg
aaccagtagg gactgggccc agtgacagaa gcaccaggca 4260cacactcccg tcagccacag
acaggtccca cccccagccc caggatatat gctcccaacc 4320tggcgcatgt ccttgtagct
gtggtgccgg aaatccaggt catcggtggc caccacctca 4380ttctgtgcgt agtagctgta
gacagctgca agggaggcgg ggttgtcttt agctgggtgc 4440cggctggccc accctagcac
cccacctcca ctcagagccc ctgccagccc tccacactca 4500cgggccacag agcaccccag
cacctccagg cgcatgcaca ggctgccatt ccaggtgagt 4560gggtagatgc ggatgaaacg
agccaccacc ggctctggga gctcactcag cacgggtgtg 4620tccttgtcca cgttcccatg
aaaggtctgg ggagaggcag gcctcagagc agtactgcca 4680gcccctctga gagcccaccc
ctcgcccaga caatgggagc agagccaaga gcctgggcat 4740ggtgcccacc atttcctcat
agccgttggt gtacatcacc catgtctggc tgtcattgct 4800gaagcccacg aagaaggtgg
tcacaaaatc gtcactgtgg agtggacagt ggtcagagca 4860agggtcttcc ccctcccagg
ccctcaggtg gcctgagcct ccctcttccg agccccaaga 4920atttaagagc tagcagggtg
gtgctgcacg gcccaggtgt tgagcctggg tcctatgccc 4980gtcacatagc catgggcagg
tgatctgtcc ctaaactcat gtgctatcag gacacagggg 5040ctgactgacc aggctgagga
gtggggatgg gcagggtgag tccctcactg atctttttgg 5100ccttctttgg ctgggccaaa
gaagggccca ctggaatctc cttaatggga cacagagcca 5160tgcctatgta gccactcccc
tctgccaact atccatgagc ctggccacgc actggatgct 5220ggagtctctg ccctgggtga
tgacgcctgt gaaccgggta gtcctcctgg tgtccacctc 5280tatccactgg gtcctggcat
cgtcctcggc acaccacgca ccatcatagt agtcgtcctc 5340agtggcaccg gtctgtccag
ggggcagggg aggctgagca tgggcggagg agtcccttat 5400cccagttggg agatgggccc
atcccaatgc ccacctgcat gttgagccgg ccgcgctgtg 5460cccccaggcc gtggcgcagc
atggaggagg ctcggatctg gttgtcctca atacggtgtg 5520actccatccc aatgggggga
cactctgagg acgcgtaccc cagaatggtg gctcactagc 5580tccatccttc cctccaccaa
acccagaacc aaggagccca gagcccactc ccggcacatc 5640gggggcacag tcagagggca
gctctggtca gctggtggct ccctggtgcc ctgcaccagc 5700ccacctggaa tcgactcaaa
gccaggccag gagctgtttc caatcccagc ctgtgcttcc 5760cctccctggg cctcagctgc
cccatctgga gaacgggctg accatgccca gctctcaggg 5820gacacacgtg aaatcacagg
tagagctccc ccagggcgca gccacagatg tcatccagat 5880ggggaccgtc tgcacaatgg
ccctgcaggg atacctgtga aggtacctga ggtcctcact 5940ccccaccaag gccccaggtc
ctccccctac cacgcccagc cactaggggc cctggggagc 6000tgccaccctc ctgaagcagg
ccagcctggg gtccagggct ggggcagcca agcgaggcta 6060tcctgggctc ccggggcccc
tcccttctgg gtcccaagaa tctgagtagg aaagggttcc 6120ggggacctgg gtcctgtttg
tgacattggg ccagtcactt gtcccagcac ccccatcctg 6180tggcccccac cctcaccccc
ttgtgccccc cacttactga ctttctccgt aggcgtccac 6240tcctcctcca actcctcgcc
ctttcggggc tctagggaca atgaagggag gacatggcac 6300caagggcccg ggaggcaatc
aggagtccag atgctgcccc acagggaccc aggccccaag 6360ccccagccac acacctttgt
ggtccttgcc cttctccact gcccacttgt cggtctcctc 6420cttggggctg ctgtcctcct
ttttgggttt ctctggaagg tgcaaggtag gaggggccag 6480tcagcctggc tctgggcttt
gaggaccatg tggggtggat caggcaggcc ccaggtggcc 6540ttcagggcag gcctggtgtg
ggaagtcctt ggtcccactc actcagctcc tccttctctt 6600cgtccgtctg gcgctcagca
tcgggcttct ggggcggagg aggcccaaag taatagtcca 6660ctatggggag ggagagccag
ctgaggctgc cctgaccctg ctgcggggcc tcagctcctg 6720ggtccacagg agctcagcag
gacaggaccg cgccagaggg gaggaggacg ggagatgggg 6780gacagctgag ttgggagagg
gtcttgcagg agtcaggagc agcccgagct caggggcagc 6840tgagcaagac cctgctgaag
tcaccagccc ggccttccag gagcatctgg cctggggaaa 6900ggactcgagg cccagggcat
gggaaaggcc tggagggaca actggcacct gtgcctgggg 6960ttgcgggctg gggggtgaga
tggggagaca ttggaggcac tgatggggac ctgggggcag 7020ggaaatggcg atgcacgggc
tgccacccag gaggaaaggg aacctgaggg ctccagggac 7080gcaggggcat gagcaacagg
gaggcaaaag ccctcgggct ccctgaagag agtggggcag 7140tggccacgag ccagcgggaa
gccagttaga gcacaggact gggagggctg gaacccacat 7200gggtgacagg gcagagtgtg
tgcctaggga cacccctgtg ggggtcacag ccaagcagga 7260accagggaag cggccaagga
aagaccagcc tgagggcaga ggagacaggg cagtggctgg 7320ggtgggcacg cagggacagc
agggacagcg aggtaaccac gggcacaggt ggggttgcaa 7380ggtgggtgag ttgccccagc
tggctcctga ccacacccca gccccgaccc ccacctgcct 7440atgtccctca gactctgggg
tgctgggtac tcactgtcat cgtagttggg gatcacgtaa 7500ccatcaccat agtcaggggg
cagcgggggc agcagaggct tcacaggagg ctctggggag 7560gcggggaggt taggaggggg
ccagagcgcc gtggccatgg cacctcctct cctgcccccc 7620atcctaccaa tcctctcctc
cggggctggg gccggggcct tctcctcagg gggctctggc 7680cagacccgct cgggcctcct
ccttctgctt gggggtggcc tgggttgctt ctggcgccga 7740atgtactcaa ctgaggggga
ggctggctca gagtggggcc caaggctggg atgggcccat 7800tggcacatcc cccaggccag
gggtccgacc caggtggggc tggcaggacc ctactcaaag 7860tcctcatagt cctccctctc
gatctggtca ttgtagtcca gtgtgggttg ctcggtctcc 7920tcctccggct ctgaggggaa
agcgctggta gctgcctgac aaccccaccc aggcctactc 7980tggggaagcc ctcagtccaa
ccagccaggg cagctggccc caaggccagg cggatgacgg 8040ccactcacca ggctggtgct
cctgtgcctc cacatgggtc tcctctcctg gattctgcca 8100gttatttgag aggggcgccc
ctgcaacaca ggagttccag aagcaggtgg gcgggaggcc 8160tgctctgacc accttgggag
cctcaggcca ccagccaccc atagagccca cacagagcct 8220gtggacaccc tcctgaggcc
gagctcactc caaggaggcc tgagctcctc tggccttcag 8280catcctgctg gcatctcatg
gggccagaga gctgggccca ccttctgggg aacctactgt 8340gctgctggag gccctaccac
aaagctgtcc ccagcgggag aaggcaggag ggaactccat 8400gggctcagag cccagggaca
tctgggcagg ggcctgaggg acagaggtcc cacccaaaag 8460gctgccaagc cctctcccta
cccaaaagag gctacagcac tgagggagcc caccaatcaa 8520attgtgaaat ttatagcaaa
agtgaggttc ccatccagtg gggagctgaa ggtctatagg 8580aagcagggcc ccagaaacct
gcctcccact ccctgcctcc acccgagcag gcagtcagag 8640ccccatcacc ccagaggagc
ccggcacaaa cctccctcct ggggtagctc ctcggggcca 8700gggctggggg gtgggggcag
tggccactcc agggtttctg agggagccag aatggggggc 8760ctcttccctg acgggggctt
cttggtggcc ttgggtggct tctctttggg cttcttggtg 8820gccttgggtg gctcctcctt
gggcttcttg gtggccttag gtggcttctc cttgggcttc 8880ttggtggcct tgggtggctt
ctccttcccc ttcttgggcg gcctggggga cccctccaag 8940gactccttgg gcaccttggg
gcctttgtct ttcttgcctt tcttcccttt gtctttggtc 9000ttttccggag gcactgtcca
agatgcagac tcgtgtcaaa tgaacagagc cagctctgtg 9060cccccatgag gcccctctct
agatgcccag aacctgggca cagggactct tgtcagttcc 9120cagtgcggat cagcaaactg
agaggttaag tcatttgccc aagtggcaaa ctgggatccg 9180gacccagatt ttctgtctgc
aagtctgggg ctgtgaccac caatctcaac ctctctaaag 9240actgagcgta gggttcccag
ttcccagggg gaggccctca tccccccacc tgccaaaacc 9300tcaatagggg ttccttacta
tccactcctc cactattctg ttctgggcac agaaggggca 9360gagaggtgac tgagccatcc
aggcctggag gagcatctgg tcatccctgc caactgccat 9420acaaaggaag ggacatgggc
ccaagacctt cccctggtct cctacggggc aagaaaagct 9480tcaaagaaaa gggacacttg
gttgagtatt gaagcccaaa gaagaggaag tggtctcctt 9540tcgagaagta aggggtttgg
aattgattgg aaggataggg agtcctgggg ggttcaggga 9600tcacacagag gacagaaaag
acaggtaggg agcttgtggc tgcacactca tttcagagtc 9660tgggagaggg agcagggact
ggttgtgagg attccccatg ggaatcctcc caggacccta 9720agcaggagct gcaagtgctg
ttgagaacct gatgagaggt ggggagcatg agggaagttt 9780ggcagaaaca caggaaagct
accaaatgca gacagccagg ggacgcaggg ctgctagagc 9840ggtgccccag agccaggaga
gcaagcctgg aaggagagcc agaggcagga ggggcacagg 9900cagcccaggg tgtgggaagc
agccaggaaa gatctagagc tggggtggca ggggaggggc 9960tgctgacatc aggaatgttg
gatggtgcct tggaatctcc tgggagacag ggatcacaag 10020accctctgcc accttccaga
gggccacgat gaaaacagct aagatttact gacaactgat 10080tatgcaagag gccgtgggtt
aaatgcttca gtgatgcatc acctcatcta atttcctgta 10140ctaatgtagg accacccatt
gctcaccacc acctgaagcc ctgtgctcac caccacctga 10200aactctctca cctacgtgag
acctcctgga gtaggagggc aaaggcagga gggagggacg 10260acgtgaagct gtgccaccaa
cagggagagt ggtcccatta gtatggcagg gggtgacaca 10320gcacagtccc ctgtggctca
agcctagtac ctgtcgcgta ctggaggaat ggggataagc 10380gacccgtaca accacagcac
caaccctaga gccaccggcc cccaaaagcg gccctgccgc 10440ccgggtgctg gatgtgcctc
cacgccagcg ctgacctcgg cctagcacag ggtccctcca 10500ggcatctggg ctcgcgtgcg
cattagtaag ccagccattc ctcccctagc agactgggga 10560gtggccagac cctaccgaat
ccccctgttc ccacctgaga tgccagcccc ccacaccccc 10620gccctgccct gggctcttac
cttctgcggc cgtccctggc cgcttccctg gcttgccccc 10680cgcctgggct tttcggaccc
gcggggtggg ctcgggaggc ggcggggcct ccacgtcgtc 10740ctcccggggc tcaggttcta
gctctgacag gaagccctcg aggaactcct cgatctcgtc 10800gtcggtcagc accgtctgcg
ggcgccctcc agggcacagg gccagcaacg ccaggaggca 10860gctgagcagg ggcgccccgc
gcacggccgc catggccgcg gcacgcgcgg ggggctccgg 10920ggagggcgcg gggggtcagg
ggctctgggt ctctgggaaa gggcggagag gggatcgaga 10980cgggtgaggg aatccaggaa
ggggcgggag agaggatggg gtgagcgagg gaatccggga 11040aagggaggga gagtggatta
gggtgggcga ggggacccgg gaaggggtgc tggggggctc 11100cgaagccaga ggggctcagg
ggtggtcggg gcgctccgag gtctggcggc taataggcgc 11160tccggccccg cgtggcgcac
tcccgcgcgg atagccgtct ccaaagcgct ggcggggccc 11220ggggcggggg cgccggggct
tccggagccg gctccccacc cccggggagg aggaggagga 11280agagaaggag gagccgagag
tggacggagg ggctgcgggg gggcgggggg cggggggcgg 11340ggggctaggg gcggggcagg
cgggcgggcg ctggcggcga gcgtcccaag cccggagact 11400tgcgcctagg acagaggggc
agggggcggg gcgactggga agacagaggg cctgagggaa 11460ggaaaggtgg tggggagggc
ctggggtgcg ggtctgaggg ggccgacatc cctcctcctt 11520ctgccctagg cacccccctt
aaggcgggac cccgagtcca ccggggctct gagccctccg 11580cgggtgacca ggaaccctgg
acggaaagcc gtggtgtcag gcctctgaga cctctctcaa 11640ttcggagggc cacagaaagg
ccaccccatc cttcccaggc tctggagcct ctgcccatgg 11700gccctgctgc atcccagcgt
caattcattc agtcatccta ccaacctctt caggtcggtg 11760tggggccggg ccccgtgctg
ggccccaggg agggacagca cagtgggaac tcactttcca 11820gccaggaggc aggtgcaaaa
ctgccctcag agtggccagc tgccccgctg ggggtaggag 11880tcccatgtaa gggcatgcca
tccctcccct ccgggtccca acgtggacaa atagccattt 11940atcaccttct tcttaccaga
actcattttt taaaaagtgt ctaccatacc tccagctgcc 12000acatggaccc agagggccca
gaggacccag aaggcaggtg gattgagtgt caactgatcc 12060caggatccat cagggatgtg
caccttggtg cctggtgttt gccataaggc ttctccaggg 12120caaatgttgg ctgccctaca
acggccatca acaggcagag tggtcccatt agtatggcag 12180ggcgtgacac agcacagtcc
cccgtgactc aagcctagtc cctgtctcat actggaggaa 12240tggggagcta aggacagagc
tccgaggaca ttccccctta aaggaatgag gacacaagag 12300aaagctcaca ggtagtccat
gggccaagtg cagaggcaga cagccctaag ccacgattgt 12360ctgcggggtt tggccccagt
gaagtagtca ggtagggaag cctaggagcc cctgggatga 12420ttgacagggc agagtttgga
cctggggtca aaaggaaaga ggaaaagtgg gtcaggaagc 12480acctgggtcc ccagagcagc
cccgagtgag ttggagcagg cagcagccgg ggaggccaca 12540gtggaggctg ctgggcctgg
gatacatgcc accccctggg agcaggacca caaggaggcc 12600ttgcctcctc tcacacctgg
tcctgccaag accctgcctt tgctttctca ctgcatctcc 12660ttgaaaaagc agtgggactg
tgtcaggttc tggctctacc tcccaggcac cacatctcgg 12720caggtagcct cagtgccgtc
cacctgtgtc cctgttctcc ttgtcgttca tacaggatca 12780tgcatgtgct gtgcctagca
cacattcttg gcactcacac tgctgccttt tagctctcat 12840catttgccct cagagatcaa
cctgagctgt gcccactggg gcgctcagag cagaccctga 12900gccccaacac ccaggctccc
tgtgcacctg agcctgcctc tgcctgccac gtgcccccag 12960gccagtcctg gtggcagcaa
ggatccgcaa gctctcccct ttcctcatcc tctgcaaagc 13020tctgaatcat ctttctcaaa
acttgttctg ggaatttgct ccgttgcccc agttgagcat 13080gtcaagcccg gcggcccaag
gctggggtga agcagcgtgg cacgtcactt ccctgggaac 13140aactcacaca tggattggat
ttgggtccaa catcctctgc cagggaaaat agaagccata 13200agaaaacaaa aaaggaacag
aaggaggctt ttcttcagtc acagcgagtc accaacaaaa 13260acatgtgcaa aagctctcat
ggagagctgg gccacaagga gggccatgat gttgggggcc 13320ctctgacacc aagggtgtgg
gcaggtggat gggaggcagc tgccctccat gccaggctga 13380tgtgcctccc tttgggtggt
ggggctggga ctcccactcc acttgaagac ctgcaccaaa 13440aagtccttta gccctgtgcc
caggctctgc cacggggccg gtgaggggac ttctcccctc 13500tgctgccaga gtgaagccag
tcagggggat gggaggcttg tagccaagag cacctagtgg 13560ctttcagggt cccttacccc
tgccacttag cagggtctgc acctgcatcc aagtgttctc 13620ctgggctaca gtggggggct
ggtagacact ctggtgatcc actttcagct tcccacatgg 13680atgtggcagg gactgctttg
gcatttccct accccaaggg acagccactg cggcaggact 13740gggctgggga gggtggggcc
tgcgctgggg agggtgcccc ctgtcccttg ctgctgctgg 13800aatgggaagg agagttgttg
agagagccag aactgtccaa gggtggaagc tggcgaaact 13860gacctgcagg gaacagggag
acagggagca tggcccagtg agtaggtcct atgtagctct 13920gaggccatca accctgccat
gagggctgag accccaagag agaagttgag gttgggtcag 13980gggcctgtta gtgccagctg
aggaggggga caggccagcc tcctcccact gggacccaag 14040ctatagctcc tgagcctcca
gagctgcctg gtgcctcaac ctggtcagag gtggaaactc 14100acctgccagc aggcccagtg
tgcctgagtt ctgactgtgg ggatctgcag ggcacagaag 14160gataagaggt catcagggcc
tggggacagg caggagtggc agggtctggg aggctgggag 14220cagaccctcc caacctgccc
catggcctcc gtggccccca ggacccccat ggcagcagct 14280cagacacggg ttgtgcctca
gaaggaagtg aagctgtgtg taccgagatg gcccagcaaa 14340ccctttgtat gtaaacttcc
gccacagccc agctgtccag caccagcatg tgtatctggg 14400ggagggggat aaatagaagg
tctgggaggc ctgggatctg gccagcaggc tactgggatc 14460acagatgcca gcccctccat
atctccgctt gagtcctgga tctgcctcct gggaccaaag 14520gggaaaggac caggctaggc
tccttccttt ttgttcttcc ctcttggggg aggctcctag 14580aaactccccc ttctctgccg
cccaagtgcc tggatattac cagtggggtt agcctgtttg 14640ggcccacaag atgggatggc
tcccagagcc atgggacctg aggtctccca gacagtgtct 14700agccaccctc acaactggca
gaacaatttc cttggttttc aacaacttga aaaacatatg 14760tgattttcca cagtccggtg
cttctcaggc ctggctgctg agtgagcaga gttcatgctg 14820aattccttcc actcaccaca
gggcagacag caagcccagc tgtggggact cggttggggt 14880gggggtcacc acagcaaggc
gcggggagtg gggagggggg caggcttcca gcactgatga 14940gtaattctgc tgcccgaaga
tctgggaaga gggcatgtga caacttagtg caacaatctg 15000cccagtgtta ggtcagaagg
aaggagaggt cgttcaaaat ggagtctggt ggaaaaaata 15060atgtttggcc ccacctcata
cctccctcaa aattaactcc agattaatga ggtagatgtt 15120agaagaggaa ccagggaagg
actacaagaa aatatggagt ctttatttac attgtgaggt 15180tttctttagg ttttgtttgt
ttttgttttt gatatggagt ctcactctgt cacccaggct 15240ggagtgcagt ggtgcgatcc
cggctaactg caacctccgc ctcccaggtt caagagattc 15300tcctgcctca gcctcccaag
tatctgggga ttacaggcac atgccaccat gcccggcttt 15360tttttttttt tttttttttt
gtatttttag tagagatggg gtttcaccat gttgaccagg 15420cagatctcaa actcctgacc
tcaagtgatc cacccgcctc agcctcccaa agtgctgggc 15480gcccggcatg tgtgcccagc
ctatattgac attcttgatg gagaagtctc ttaaggaagg 15540acagagaagt ttggttgcat
aaaagttttt accttctgta catcaaaata tactgaaaat 15600gaaaataaag agcaaacaaa
atactgagaa agaatgcagt gcttagagag cgaacattcc 15660tggcctcctg tagttttagg
aagcagctgt ggcctcagac ccatctgctg tgaacctcta 15720ctccatattt attgcacttt
ctgtctgtga gcgtcggttt ctctcctcta taacaatagg 15780ataataatga cactaccatg
ccttgcaaaa atgctacaag ggttcactga gataaatctg 15840gagagtcatg cctgaaaaat
agtaagtcgt tgataaaggg aagctgctat taataaataa 15900agctttttct tttttttttt
tttgagatgg aatctcactc tggcgcctag gctggagtgc 15960agtgatgcaa tcttggctca
ctgcaacctc cgcctcctgt gttcaagcaa tcctcctact 16020tcagcatcct cagtagctgg
gactacaggt gcgcaccacc atgcccggct agttttttac 16080atttttaaag ctattaatag
gccagccaca gtggctcatg cctataatcc cagcactttg 16140ggaagctgag gcaggtggat c
16161
User Contributions:
Comment about this patent or add new information about this topic: