Patent application title: NOGO Receptor Homologs
Inventors:
Stephen M. Strittmatter (Guilford, CT, US)
Richard L. Cate (Cohasset, MA, US)
Dina W.y. Sah (Boston, MA, US)
IPC8 Class: AA61K3817FI
USPC Class:
4241301
Class name: Drug, bio-affecting and body treating compositions immunoglobulin, antiserum, antibody, or antibody fragment, except conjugate or complex of the same with nonimmunoglobulin material
Publication date: 2009-07-09
Patent application number: 20090175850
Claims:
1-3. (canceled)
4. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 2.
5-16. (canceled)
17. An isolated polypeptide comprising an amino sequence selected from the group consisting of: SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:14.
18-19. (canceled)
20. The polypeptide of claim 17, further comprising an amino acid sequence of a heterologous polypeptide.
21. The polypeptide according to claim 20, wherein said heterologous polypeptide is an Fc portion of an antibody.
22. (canceled)
23. An isolated antibody that binds to the polypeptide of claim 17.
24-25. (canceled)
26. A method of decreasing inhibition of axonal growth of a CNS neuron, comprising the step of contacting the neuron with an effective amount of the polypeptide of claim 17.
27. A method of treating a central nervous system disease, disorder or injury, comprising administering to a mammal an effective amount of the polypeptide of claim 17.
28. A method of decreasing inhibition of axonal growth of a CNS neuron comprising the step of contacting the neuron with an effective amount of the antibody according to claim 23.
29. A method of treating a central nervous system disease, disorder or injury, comprising administering to a mammal an effective amount of the antibody according to claim 23.
30. A method for identifying a molecule that binds the polypeptide of claim 17 comprising the steps of:(a) providing the polypeptide of claim 17;(b) contacting the polypeptide with a candidate molecule;(c) detecting binding of the candidate molecule to said polypeptide.
31. An isolated polypeptide comprising an amino acid sequence that is at least 80% identical to amino acids 1 to 310 of SEQ ID NO:2, wherein said polypeptide decreases inhibition of axonal elongation.
32. An isolated polypeptide comprising an amino acid sequence that is at least 80% identical to amino acids 31 to 310 of SEQ ID NO:2, wherein said polypeptide decreases inhibition of axonal elongation.
33. The polypeptide of claim 32, wherein said polypeptide comprises amino acids 31 to 310 of SEQ ID NO:2.
34. The polypeptide of claim 32, further comprising a heterologous polypeptide.
35. The polypeptide of claim 34, wherein said heterologous polypeptide is selected from the group consisting of Fc, Glutathione S-transferase (GST), a Histidine tag (His tag), and alkaline phosphatase (AP).
36. A recombinant host cell which produces the polypeptide fragment of claim 32.
37. A method of decreasing inhibition of axonal growth of a CNS neuron, comprising the step of contacting the neuron with an effective amount of the polypeptide of claim 32.
38-52. (canceled)
Description:
FIELD OF THE INVENTION
[0001]The invention relates to neurology and molecular biology. More particularly, the invention relates to CNS neurons and axonal growth
BACKGROUND
[0002]Among the mechanisms through which the cells of an organism communicate with each other and obtain information and stimuli from their environment is through cell membrane receptor molecules expressed on the cell surface. Many such receptors have been identified, characterized, and sometimes classified into major receptor superfamilies based on structural motifs and signal transduction features. The receptors are a first essential link for translating an extracellular signal into a cellular physiological response.
[0003]Receptors on neurons are particularly important in the development of the nervous system during embryogenesis. The neurons form connections with target cells during development through axonal extension of the neurons toward the target cells in a receptor-mediated process. Axons and dendrites have a specialized region of their distal tips known as the growth cone. Growth cones enable the neuron to sense the local environment through a receptor-mediated process and direct the movement of the axon or dendrite of the neuron toward the neuron's target cell. This process is known as elongation. Growth cones can be sensitive to several guidance cues, for example, surface adhesiveness growth factors, neurotransmitters and electric fields. The guidance of growth at the cone depends on various classes of adhesion molecules, intercellular signals, as well as factors that stimulate and inhibit growth cones.
[0004]Interestingly, damaged neurons do not elongate in the central nervous system (CNS) following injury due to trauma or disease, whereas axons in the peripheral nervous system (PNS) regenerate readily. The fact that damaged CNS neurons fail to elongate is not due to an intrinsic property of CNS axons, but rather due to the CNS environment that is not permissive for axonal elongation. Classical grafting experiments by Aguayo and colleagues (e.g., Richardson et al., (1980) Nature 284, 264-265) demonstrated that CNS axons can in fact elongate over substantial distances within peripheral nerve grafts, and that CNS myelin inhibits CNS axon elongation. Therefore, given the appropriate environment, CNS axons can regenerate, implying that CNS axonal injury can potentially be addressed by appropriate manipulation of the CNS environment.
[0005]The absence of axon regeneration following injury can be attributed to the presence of axon growth inhibitors. These inhibitors are predominantly associated with myelin and constitute an important barrier to regeneration. Axon growth inhibitors are present in CNS-derived myelin and the plasma membrane of oligodendrocytes that synthesize myelin in the CNS (Schwab et al., (1993) Annu. Rev. Neurosci. 16, 565-595). Myelin-associated inhibitors appear to be a primary contributor to the failure of CNS axon regeneration in vivo after an interruption of axonal continuity, whereas other non-myelin associated axon growth inhibitors in the CNS may play a lesser role. These inhibitors block axonal regeneration following neuronal injury due to trauma, stroke or viral infection.
[0006]Numerous myelin-derived axon growth inhibitors have been characterized (see, for review, David et al., (1999) WO995394547; Bandman et al., (1999) U.S. Pat. No. 5,858,708; Schwab, (1996) Neurochem. Res. 21, 755-761). Several components of CNS white matter, NI35, NI250 (Nogo) and Myelin-associated glycoprotein (MAG), which have inhibitory activity for axonal extension, have been described as well (Schwab et al., (1990) WO9005191; Schwab et al., (1997) U.S. Pat. No. 5,684,133). In particular, Nogo is a 250 kDa myelin-associated axon growth inhibitor that was originally characterized based on the effects of the purified protein in vitro and monoclonal antibodies that neutralize the protein's activity (Schwab (1990) Exp. Neurol. 109, 2-5). The Nogo cDNA was first identified through random analysis of brain cDNA and had no suggested function (Nagase et al., (1998) DNA Res. 5, 355-364). The identification of this Nogo cDNA as the cDNA encoding the 250 kDa myelin-associated axon growth inhibitor was discovered only recently (GrandPre et al., (2000) Nature 403, 439-444; Chen et al., (2000) Nature 403, 434-439; Prinjha at al., (2000) Nature 403, 383-384).
[0007]Importantly, Nogo has been shown to be the primary component of CNS myelin responsible for inhibiting axonal elongation and regeneration. Nogo's selective expression by oligodendrocytes and not by Schwann cells (the cells that myelinate P.S. axons) is consistent with the inhibitory effects of CNS myelin, in contrast to P.S. myelin (GrandPre et al., (2000) Nature 403, 434-439). In culture, Nogo inhibits axonal elongation and causes growth cone collapse (Spillmann et al., (1998) J. Biol. Chem. 272, 19283-19293). Antibodies (e.g., IN-1) against Nogo have been shown to block most of the inhibitory action of CNS myelin on neurite growth in vitro (Spillmann et al., (1998) J. Biol. Chem. 272:19283-19293). These experiments indicate that Nogo is the main component of CNS myelin responsible for inhibition of axonal elongation in culture. Furthermore, in vivo, the IN-1 antibody has been shown to enhance axonal regeneration after spinal cord injury, resulting in recovery of behaviors such as contact placing and stride length (Schnell and Schwab (1990) Nature 343, 269-272; Bregman et al., (1995) Nature 378, 498-501). Thus, there is substantial evidence that Nogo is a disease-relevant molecular target. Agents that interfere with the binding of Nogo to its receptor would be expected to improve axonal regeneration in clinical states in which axons have been damaged, and improve patient outcome.
[0008]Modulation of Nogo has been described as a means for treatment of regeneration for neurons damaged by trauma, infarction and degenerative disorders of the CNS (Schwab et al., (1994) WO9417831; Tatagiba et al., (1997) Neurosurgery 40, 541-546) as well as malignant tumors in the CNS such as glioblastoma (Schwab et al., (1993) U.S. Pat. No. 5,250,414); Schwab et al., (2000) U.S. Pat. No. 6,025,333).
[0009]Antibodies which recognize Nogo have been suggested to be useful in the diagnosis and treatment of nerve damage resulting from trauma, infarction and degenerative disorders of the CNS (Schnell & Schwab, (1990) Nature 343, 269-272, Schwab et al., (1997) U.S. Pat. No. 5,684,133). For CNS axons, there is a correlation between the presence of myelin and the inhibition of axon regeneration over long distances (Savio and Schwab (1990) Proc. Natl. Acad. Sci. 87, 4130-4133; Keirstead et al., (1992) Proc. Natl. Acad. Sci. 89, 11664-11668). After Nogo is blocked by antibodies, neurons can again extend across lesions caused by nerve damage (Schnell and Schwab (1990) Nature 343, 269-272).
SUMMARY OF THE INVENTION
[0010]Genes encoding homologs (NgR2 and NgR3) of a Nogo receptor (NgR1) in mice and humans have been discovered. Various domains in the polypeptides encoded by the NgR2 and NgR3 genes have been identified and compared to domains in mouse and human NGR1 polypeptides. This comparison has led to identification of a consensus sequence (NgR consensus sequence) that characterizes a family of proteins (NgR family). Based on these and other discoveries, the invention features molecules and methods for modulating axonal growth in CNS neurons.
[0011]The invention provides a polypeptide that contains a polypeptide containing a tryptophan rich LRRCT domain consisting of the amino acid sequence:
TABLE-US-00001 [SEQ ID NO: 19] N X1 W X2 C X3 C R A R X4 L W X5 W X6 X7 X8 X9 R X10 S S S X11 V X12 C X13 X14 P X15 X16 X17 X18 X19 X20 D L X21 X22 L X23 X24 X25 D X26 X27 X28 C
[0012]wherein X is any protein amino acid or a gap, and the polypeptide does not include amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 (human NGR1) or SEQ ID NO: 17 (mouse NgR1).
[0013]Preferably, X17 and X23 are (independently) arginine or lysine. In some embodiments, the amino acid sequence of the LRRCT domain is residues 261-310 of SEQ ID NO:2, or residues 261-310 of SEQ ID NO. 2 with up to 10 conservative amino acid substitutions. In some embodiments, the polypeptide contains the following NTLRRCT amino acid sequence:
TABLE-US-00002 [SEQ ID NO:18] C P X1 X2 C X3 C Y X4 X5 P X6 X7 T X8 S C X9 X10 X11 X12 X13 X14 X15 X16 P X17 X18 X19 P X20 X21 X22 X23 R X24 F L X25 X26 N X27 I X28 X29 X30 X31 X32 X33 X34 F X35 X36 X37 X38 X39 X40 X41 X42 L W X43 X44 S N X45 X46 X47 X48 I X49 X50 X51 X52 F X53 X54 X55 X56 X57 L E X58 L D L X59 D N X60 X61 L X62 X63 X64 X65 P X66 T F X67 G L X68 X69 L X70 X71 L X72 L X73 X74 C X75 L X76 X77 L X78 X79 X80 X81 F X82 G L X83 X84 L Q Y L Y L Q X85 N X86 X87 X88 X89 L X90 D X91 X92 F X93 D L X94 N L X95 H L F L H G N X96 X97 X98 X99 X100 X101 X102 X103 X104 F R G L X105 X106 L D R L L L H X107 N X108 X109 X110 X111 V H X112 X113 A F X114 X115 L X116 R L X117 X118 L X119 L F X120 N X121 L X122 X123 L X124 X125 X126 X127 L X128 X129 L X130 X131 L X132 X133 L R L N X134 N X135 W X136 C X137 C R X138 R X139 L W X140 W X141 X142 X143 X144 R X145 S S S X146 V X147 C X148 X149 P X150 X151 X152 X153 X154 H155 D L X156 X157 L X158 X159 X160 D X161 X162 X163 C
wherein X is any amino acid residue or a gap and wherein the polypeptide is not the polypeptide of SEQ ID NO: 5 (human NGR1) or SEQ ID NO: 17 (mouse NgR1). For example, X6, X37 and X38 may represent a gap. Specific examples of polypeptides of the invention are SEQ ID NO: 2 (human NgR2), SEQ ID NO: 4 (mouse NgR3), and SEQ ID NO: 14 (human NgR3). In some embodiments, the polypeptide contains: (a) a NTLRRCT domain, and (b) less than a complete CTS domain, provided that a partial CTS domain, if present, consists of no more than the first 39 amino acids of the CTS domain. While the polypeptide may contain a functional GPI domain, a functional GPI domain may be absent, e.g., when a soluble polypeptide is desired. A polypeptide of the invention optionally includes an amino acid sequence of a heterologous polypeptide, e.g., an Fc portion of an antibody.
[0014]The invention also provides a nucleic acid encoding an above-described polypeptide; a vector containing the nucleic acid, which nucleic acid may be operably linked to an expression control sequence; and a transformed host cell containing the vector. A method of producing a polypeptide of the invention is also provided. The method includes introducing a nucleic acid encoding the above-described polypeptide into a host cell, culturing the cell under conditions suitable for expression of the polypeptide, and recovering the polypeptide.
[0015]The invention also provides an antisense molecule whose nucleotide sequence is complementary to a nucleotide sequence encoding a polypeptide selected from the group consisting of: a polypeptide consisting of residues 311-395 of SEQ ID NO: 2, a polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a polypeptide consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid is from 8 to 100 nucleotides in length, e.g., about 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides. The invention also provides a nucleic acid encoding such an antisense molecule.
[0016]The invention also provides an antibody that binds to an above-described polypeptide. Polypeptides or antibodies of the invention can be formulated into pharmaceutical compositions containing the polypeptide or antibody and a pharmaceutically acceptable carrier.
[0017]The invention also provides a method for decreasing inhibition of axonal growth of a CNS neuron. The method includes the step of contacting the neuron with an effective amount of a polypeptide or antibody of the invention.
[0018]The invention also provides a method for treating a central nervous system disease, disorder or injury. The method includes administering to a mammal, e.g., a human, an effective amount of a polypeptide or antibody of the invention. Exemplary diseases, disorders and injuries that may be treated using molecules and methods of the invention include, but are not limited to, cerebral injury, spinal cord injury, stroke, demyelinating diseases, e.g., multiple sclerosis, monophasic demyelination, encephalomyelitis, multifocal leukoencephalopathy, panencephalitis, Marchiafava-Bignami disease, Spongy degeneration, Alexander's disease, Canavan's disease, metachromatic leukodystrophy and Krabbe's disease.
[0019]The invention also provides a method for identifying a molecule that binds a polypeptide of the invention. The method includes the steps of: (a) providing a polypeptide of the invention; (b) contacting the polypeptide with the candidate molecule; and (c) detecting binding of the candidate molecule to the polypeptide.
[0020]Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention belongs. In case of conflict, the present application, including definitions, will control. All publications, patent and other references mentioned herein are incorporated by reference.
[0021]The materials, methods and examples presented below are illustrative only, and not intended to be limiting. Other features and advantages of the invention will be apparent from the detail description and from the claims.
BRIEF DESCRIPTION OF THE FIGURES
[0022]FIG. 1A-1B shows an alignment of NgR2 (SEQ ID NO:2) and NgR3 (SEQ ID NO:4) with the known NgR, NgR1 (SEQ ID NO:5) and the Consensus Sequence (SEQ ID NO:6).
[0023]FIG. 2. mNgR3 does not bind hNogoA(1055-1120). COS-7 cells were transfected with vectors encoding myc-NgR1 or myc-NgR3, fixed, and stained with anti-myc antibodies or AP-hNogoA(1055-1120).
[0024]FIG. 3. An alignment of the amino acid sequences of human NgR1, murine NGR1, murine NgR3, human NgR3 and human NgR2. Numbering begins with amino acid #1 of murine NgR3. The consensus sequence is listed below. The LRR NT domain is indicated by a shaded box; domains LLR 1, LLR 3, LLR 5, and LLR 7 are indicated by open boxes; LLR 2, LLR 4, LLR 6 and LLR 8 are indicated by shaded boxes; and the LLR CT domain is indicated by a shaded box. Amino acids in bold in LLR 8 indicate a conserved glycosylation sites. A dot indicates conserved cystine residue in LRR4. Box at C terminus indicates putative GPI signals.
DETAILED DESCRIPTION OF THE INVENTION
[0025]The present invention provides purified and isolated polynucleotides (e.g., DNA sequences and RNA transcripts, both sense and complementary antisense strands, both single- and double-stranded, including splice variants thereof) encoding NgR homologs, referred to herein as NgR. Unless indicated otherwise, as used herein, the abbreviation in lower case (NgR) refers to a gene, cDNA, RNA or nucleic acid sequence, whereas the upper case version (NgR) refers to a protein, polypeptide, peptide, oligopeptide, or amino acid sequence. Specific proteins are designated by number, e.g., "NgR2" is a human NgR homolog, "NgR3" is a murine-derived NgR homolog, and "NgR1" is the known NgR identified by Dr. Stephen Strittmatter. Known NgRs are herein referred to as "NgRs." DNA polynucleotides of the invention include genomic DNA, cDNA and DNA that has been chemically synthesized in whole or in part.
[0026]Standard reference works setting forth the general principles of recombinant DNA technology known to those of skill in the art include Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York (1998); Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2d Ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y. (1989); Kaufman et al., Eds., HANDBOOK OF MOLECULAR AND CELLULAR METHODS IN BIOLOGY AND MEDICINE, CRC Press, Boca Raton (1995); McPherson, Ed., DIRECTED MUTAGENESIS: A PRACTICAL APPROACH, IRL Press, Oxford (1991).
[0027]As used herein, the term "axon" refers to a long cellular protrusion from a neuron, whereby action potentials are conducted, either to or from the cell body.
[0028]As used herein, the term "axonal growth" refers to an extension of the long process or axon, originating at the cell body and proceeded by the growth cone.
[0029]As used herein, the term "central nervous system disorder" refers to any pathological state associated with abnormal function of the central nervous system (CNS). The term includes, but is not limited to, altered CNS function resulting from physical trauma to cerebral tissue, viral infection, autoimmune mechanisms and genetic mutation.
[0030]As used herein, the term "demyelinating disease" refers to a pathological disorder characterized by the degradation of the myelin sheath of the oligodendrocyte cell membrane.
[0031]As used herein, the term "growth cone" refers to a specialized region at the tip of a growing neurite that is responsible for sensing the local environment and moving the axon toward its appropriate synaptic target cell.
[0032]As used herein, the term "growth cone movement" refers to the extension or collapse of the growth cone toward a neuron's target cell.
[0033]As used herein, the term "neurite" refers to a process growing out of a neuron. As it is sometimes difficult to distinguish a dendrite from in axon in culture, the term "neurite" is used for both.
[0034]As used herein, the term "oligodendrocyte" refers to a neuroglial cell of the CNS whose function is to myelinate CNS axons.
[0035]"Synthesized" as used herein and understood in the art, refers to polynucleotides produced by purely chemical, as opposed to enzymatic, methods. "Wholly" synthesized DNA sequences are therefore produced entirely by chemical means, and "partially" synthesized DNAs embrace those wherein only portions of the resulting DNA were produced by chemical means. By the term "region" is meant a physically contiguous portion of the primary structure of a biomolecule. In the case of proteins, a region is defined by a contiguous portion of the amino acid sequence of that protein. The term "domain" is herein defined as referring to a structural part of a biomolecule that contributes to a known or suspected function of the biomolecule. Domains may be co-extensive with regions or portions thereof; domains may also incorporate a portion of a biomolecule that is distinct from a particular region, in addition to all or part of that region. Examples of NgR protein domains include, but are not limited to, the signal peptide, extracellular (i.e., N-terminal) domain, and leucine-rich repeat domains.
[0036]As used herein, the term "activity" refers to a variety of measurable indicia suggesting or revealing binding, either direct or indirect; affecting a response, i.e., having a measurable affect in response to some exposure or stimulus, including, for example, the affinity of a compound for directly binding a polypeptide or polynucleotide of the invention, or, for example, measurement of amounts of upstream or downstream proteins or other similar functions after some stimulus or event. Such activities may be measured by assays such as competitive inhibition of NGR1 binding to Nogo assays wherein, for example, unlabeled, soluble NgR2 is added to an assay system in increasing concentrations to inhibit the binding of Nogo to NGR1 expressed on the surface of CHO cells. As another example, one may assess the ability of neurons to extend across lesions caused by nerve damage (as in Schnell and Schwab (1990) Nature 343, 269-272) following inhibition of Nogo by various forms of NgR2 and/or NgR3 as a biological indicator of NgR function.
[0037]As used herein, the term "antibody" is meant to refer to complete, intact antibodies, and Fab, Fab', F(ab)2, and other fragments thereof. Complete, intact antibodies include monoclonal antibodies such as murine monoclonal antibodies, chimeric antibodies, anti-idiotypic antibodies, anti-anti-idiotypic antibodies, and humanized antibodies.
[0038]As used herein, the term "binding" means the physical or chemical interaction between two proteins or compounds or associated proteins or compounds or combinations thereof. Binding includes ionic, non-ionic, hydrogen bonds, Van der Waals, hydrophobic interactions, etc. The physical interaction, the binding, can be either direct or indirect, indirect being through or due to the effects of another protein or compound. Direct binding refers to interactions that do not take place through or due to the effect of another protein or compound but instead are without other substantial chemical intermediates.
[0039]As used herein, the term "compound" means any identifiable chemical or molecule, including, but not limited to, small molecules, peptides, proteins, sugars, nucleotides or nucleic acids, and such compound can be natural or synthetic.
[0040]As used herein, the term "complementary" refers to Watson-Crick basepairing between nucleotide units of a nucleic acid molecule.
[0041]As used herein, the term "contacting" means bringing together, either directly or indirectly, a compound into physical proximity to a polypeptide or polynucleotide of the invention. The polypeptide or polynucleotide can be in any number of buffers, salts, solutions etc. Contacting includes, for example, placing the compound into a beaker, microtiter plate, cell culture flask, or a microarray, such as a gene chip, or the like, which contains the nucleic acid molecule, or polypeptide encoding the NgR or fragment thereof.
[0042]As used herein, the phrase "homologous nucleotide sequence," or "homologous amino acid sequence," or variations thereof, refers to sequences characterized by an identity at the nucleotide level, or a homology at the amino acid level, of at least the specified percentage. Homologous nucleotide sequences include those sequences coding for isoforms of proteins. Such isoforms can be expressed in different tissues of the same organism as a result of, for example, alternative splicing of RNA. Alternatively, isoforms can be encoded by different genes. Homologous nucleotide sequences include nucleotide sequences encoding for a protein of a species other than humans, including, but not limited to, mammals. Homologous nucleotide sequences also include, but are not limited to, naturally occurring allelic variations and mutations of the nucleotide sequences set forth herein. A homologous nucleotide sequence does not, however, include the nucleotide sequence encoding NgR1. Homologous amino acid sequences include those amino acid sequences which contain conservative amino acid substitutions and which polypeptides have the same binding and/or activity. A homologous amino acid sequence does not, however, include the amino acid sequence encoding other known NgRs. Percent homology can be determined by, for example, the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, Madison Wis.), using the default settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math, 1981, 2, 482-489, which is incorporated herein by reference in its entirety).
[0043]As used herein, the term "isolated" nucleic acid molecule refers to a nucleic acid molecule (DNA or RNA) that is substantially free of nucleic acids encoding other proteins with which it is associated in nature, i.e., a nucleic acid that has been removed from its native environment. Examples of isolated nucleic acid molecules include, but are not limited to, recombinant DNA molecules contained in a vector, recombinant DNA molecules maintained in a heterologous host cell, partially or substantially purified nucleic acid molecules, and synthetic DNA or RNA molecules. Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the nucleic acid) in the genomic. DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, the isolated NgR nucleic acid molecule can contain less than about 50 kb, 25 kb, 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular material or culture medium when produced by recombinant techniques, or of chemical precursors or other chemicals when chemically synthesized.
[0044]As used herein, the term "heterologous" refers to a nucleotide or amino acid sequence that is a different, or non-corresponding sequence, or a sequence derived from a different species. For example, a mouse NgR nucleotide or amino acid sequence is heterologous to a human NgR nucleotide or amino acid sequence, and a human NgR nucleic or amino acid sequence is heterologous to a human immunoglobulin nucleotide or amino acid sequence.
[0045]As used herein, a "soluble NgR polypeptide" is a NgR polypeptide that does not anchor itself in a membrane. Such soluble polypeptides include, for example, NgR2 and NgR3 polypeptides that lack a sufficient portion of their GPI anchor signal to anchor the polypeptide or are modified such that the GPI anchor signal is not adequate to result in replacement of the peptide with a GPI anchor. In preferred embodiments, up to 5, 10, 20 or 25 amino acids are removed from the C-terminus of NgR2 or NgR3 to make the respective proteins soluble. As used herein soluble NgR polypeptides include full-length or truncated (e.g., with internal deletions) NgR.
[0046]Soluble NgR polypeptides may include the entire NgR protein up to the putative GPI signal sequence (e.g., amino acid 1 to about amino acid 395 of NgR2, and from amino acid 1 to about amino acid 438 of NgR3). In other embodiments, the signal peptide of the proteins may be removed or truncated (e.g., all or part of the signal sequence of NgR2, which spans amino acid 1 to about amino acid 30 of SEQ ID NO:2, may be removed; all or part of the signal sequence of NgR3, which spans amino acid 1 to about amino acid 40 of SEQ ID NO:4, may be removed). In some embodiments, the mature NgR2 (SEQ ID NO:8) and the mature NgR3 (SEQ ID NO:9) are used.
[0047]Soluble NgR polypeptides include at least one of the putative ligand-binding portions of NgR, including the first cysteine-rich region (SEQ ID NO:10, the leucine repeat region (SEQ ID NO:12) and the second cysteine-rich region (SEQ ID NO:11). In some embodiments, soluble NgR polypeptides consist of amino acid 1 through about amino acid 395 of SEQ ID NO:2, or amino acid 1 through about amino acid 438 of SEQ ID NO:4.
[0048]In other embodiments, the soluble NgR polypeptides are fusion proteins that contain amino acids 30 through about amino acid 395 of mature NgR2 or amino acid 40 through about amino acid 438 of NgR3, the C-terminal 10 amino acids of a human IgG 1 hinge region containing the two cysteine residues thought to participate in interchain disulfide bonding, and the CH2 and CH3 regions of a human IgGI heavy chain constant domain. This type of recombinant protein is designed to modulate inhibition of axonal elongation through inhibition of the Nogo ligand binding to NGR1, or by inhibiting the ligand of the NgR from interacting with cell surface NgR The NgR portion of the fusion binds to the Nogo ligand and the IgG1 portion binds to the FcγRI (macrophage) and FcγIII (NK cells and neutrophils) receptors.
[0049]The production of the soluble polypeptides useful in this invention may be achieved by a variety of methods known in the art. For example, the polypeptides may be derived from intact transmembrane NgR molecules by proteolysis using specific endopeptidases in combination with exopeptidases, Edman degradation, or both. The intact NgR molecule, in turn, may be purified from its natural source using conventional methods. Alternatively, the intact NgR may be produced by known recombinant DNA techniques using cDNAs, expression vectors and well-known techniques for recombinant gene expression.
[0050]Preferably, the soluble polypeptides useful in the present invention are produced directly, thus eliminating the need for an entire NgR as a starting material. This may be achieved by conventional chemical synthesis techniques or by well-known recombinant DNA techniques wherein only those DNA sequences which encode the desired peptides are expressed in transformed hosts. For example, a gene which encodes the desired soluble NgR polypeptide may be synthesized by chemical means using an oligonucleotide synthesizer. Such oligonucleotides are designed based on the amino acid sequence of the desired soluble NgR polypeptide. Specific DNA sequences coding for the desired peptide also can be derived from the full-length DNA sequence by isolation of specific restriction endonuclease fragments or by PCR synthesis of the specified region from cDNA.
[0051]A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule having the nucleotide sequence of SEQ ID NOs:1, 3 or a complement of either of these nucleotide sequences, can be isolated using standard molecular biology techniques and the sequence information provided herein. Using all or a portion of the nucleic acid sequences of SEQ ID NOs:1 or 3 as a hybridization probe, NgR nucleic acid sequences can be isolated using standard hybridization and cloning techniques (e.g. as described in Sambrook et al., eds., MOLECULAR CLONING: A LABORATORY MANUAL 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989; and Ausubel, et al., eds., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, N.Y., 1993).
[0052]A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to NgR nucleotide sequences can be prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer.
[0053]As used herein, the terms "modulates" or "modifies" means an increase or decrease in the amount, quality, or effect of a particular activity or protein.
[0054]As used herein, the term "oligonucleotide" refers to a series of linked nucleotide residues which has a sufficient number of bases to be used in a polymerase chain reaction (PCR). This short sequence is based on (or designed from) a genomic or cDNA sequence and is used to amplify, confirm or reveal the presence of an identical, similar or complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions of a DNA sequence having at least about 10 nucleotides and as many as about 50 nucleotides, preferably about 15 to 30 nucleotides. They are chemically synthesized and may be used as probes.
[0055]As used herein, the term "probe" refers to nucleic acid sequences of variable length, preferably between at least about 10 and as many as about 6,000 nucleotides, depending on use. They are used in the detection of identical, similar or complementary nucleic acid sequences. Longer length probes are usually obtained from a natural or recombinant source, are highly specific and much slower to hybridize than oligomers. They may be single- or double-stranded and carefully designed to have specificity in PCR, hybridization membrane-based, of ELISA-like technologies.
[0056]The term "preventing" refers to decreasing the probability that an organism contracts or develops an abnormal condition.
[0057]The term "treating" refers to having a therapeutic effect and at least partially alleviating or abrogating an abnormal condition in the organism.
[0058]The term "therapeutic effect" refers to the inhibition or activation factors causing or contributing to the abnormal condition. A therapeutic effect relieves to some extent one or more of the symptoms of the abnormal condition. In reference to the treatment of abnormal conditions, a therapeutic effect can refer to one or more of the following: (a) an increase in the proliferation, growth, and/or differentiation of cells; (b) inhibition (i.e., slowing or stopping) of cell death; (c) inhibition of degeneration; (d) relieving to some extent one or more of the symptoms associated with the abnormal condition; and (e) enhancing the function of the affected population of cells. Compounds demonstrating efficacy against abnormal conditions can be identified as described herein.
[0059]The term "abnormal condition" refers to a function in the cells or tissues of an organism that deviates from their normal functions in that organism. An abnormal condition can relate to cell proliferation, cell differentiation, cell signaling, or cell survival. An abnormal condition may also include obesity, diabetic complications such as retinal degeneration, and irregularities in glucose uptake and metabolism, and fatty acid uptake and metabolism.
[0060]Abnormal cell proliferative conditions, for example, include cancers such as fibrotic and mesangial disorders, abnormal angiogenesis and vasculogenesis, wound healing, psoriasis, diabetes mellitus and inflammation.
[0061]Abnormal differentiation conditions include, for example, neurodegenerative disorders, slow wound healing rates and slow tissue grafting healing rates.
[0062]Abnormal cell signaling conditions include, for example, psychiatric disorders involving excess neurotransmitter activity.
[0063]Abnormal cell survival conditions may also relate to conditions in which programmed cell death (apoptosis) pathways are activated or abrogated. A number of protein kinases are associated with the apoptosis pathways Aberrations in the function of any one of the protein kinases could lead to cell immortality or premature cell death.
[0064]The term "administering" relates to a method of incorporating a compound into cells or tissues of an organism. The abnormal condition can be prevented or treated when the cells or tissues of the organism exist within the organism or outside of the organism. Cells existing outside the organism can be maintained or grown in cell culture dishes. For cells harbored within the organism, many techniques exist in the art to administer compounds, including (but not limited to) oral, parenteral, dermal, injection, and aerosol applications. For cells outside of the organism, multiple techniques exist in the art to administer the compounds, including (but not limited to) cell microinjection techniques, transformation techniques and carrier techniques.
[0065]The abnormal condition can also be prevented or treated by administering a compound to a group of cells having an aberration in a signal transduction pathway to an organism. The effect of administering a compound on organism function can then be monitored. The organism is preferably a mouse, rat, rabbit, guinea pig or goat, more preferably a monkey or ape, and most preferably a human.
[0066]By "amplification" it is meant increased numbers of DNA or RNA in a cell compared with normal cells. "Amplification" as it refers to RNA can be the detectable presence of RNA in cells, since in some normal cells there is no basal expression of RNA. In other normal cells, a basal level of expression exists, therefore in these cases amplification is the detection of at least 1-2-fold, and preferably more, compared to the basal level.
[0067]The amino acid sequences are presented in the amino to carboxy direction, from left to right. The amino and carboxy groups are not presented in the sequence. The nucleotide sequences are presented by single strand only, in the 5' to 3' direction, from left to right. Nucleotides and amino acids are represented in the manner recommended by the IUPAC-IUB Biochemical Nomenclature Commission or (for amino acids) by three letters code.
Nucleic Acids
[0068]Genomic DNA of the invention comprises the protein-coding region for a polypeptide of the invention and is also intended to include allelic variants thereof. It is widely understood that, for many genes, genomic DNA is transcribed into RNA transcripts that undergo one or more splicing events wherein intron (i.e., non-coding regions) of the transcripts are removed, or "spliced out." RNA transcripts that can be spliced by alternative mechanisms, and therefore be subject to removal of different RNA sequences but still encode a NgR polypeptide, are referred to in the art as splice variants which are embraced by the invention. Splice variants comprehended by the invention therefore are encoded by the same original genomic DNA sequences but arise from distinct mRNA transcripts. Allelic variants are modified forms of a wild-type gene sequence, the modification resulting from recombination during chromosomal segregation or exposure to conditions which give rise to genetic mutation. Allelic variants, like wild-type genes, are naturally occurring sequences (as opposed to non-naturally occurring variants arising from in vitro manipulation).
[0069]The invention also comprehends cDNA that is obtained through reverse transcription of an RNA polynucleotide encoding NgR (conventionally followed by second-strand synthesis of a complementary strand to provide a double-stranded DNA).
[0070]Preferred DNA sequences encoding a human NgR polypeptide is set out in SEQ ID NOs:1 and 13. A preferred DNA of the invention comprises a double stranded molecule comprising the coding molecule (i.e., the "coding strand") along with the complementary molecule (the "non-coding strand" or "complement") having a sequence unambiguously deducible from the coding strand according to Watson-Crick base-pairing rules for DNA. Also preferred are other polynucleotides encoding NgR polypeptides, as shown in SEQ ID NO:3, which comprises murine NgR homolog, NgR3.
[0071]Also preferred are nucleotide sequences that encode at least a portion of a NgR polypeptide that has at least one biological function of a NgR. More preferred are nucleotide sequences that encode a portion of NgR that encodes at least the mature NgR without the hydrophobic C-terminal GPI signal. Also preferred are nucleotide sequences that encode the portion of NgR that encodes at least the ligand-binding region of NgR.
[0072]The invention further embraces other species, preferably mammalian, homologs of the human NgR DNA. Species homologs, sometimes referred to as "orthologs," in general, share at least 35%, at least 40%, at least 45%, at least 50%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% homology with human DNA of the invention. Generally, percent sequence "homology" with respect to polynucleotides of the invention may be calculated as the percentage of nucleotide bases in the candidate sequence that are identical to nucleotides in the NgR sequences set forth in SEQ ID NOs:1, 3 or 13, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity.
[0073]The polynucleotide sequence information provided by the invention makes possible large-scale expression of the encoded polypeptide by techniques well known and routinely practiced in the art. Polynucleotides of the invention also permit identification and isolation of polynucleotides encoding related NgR polypeptides, such as human allelic variants and species homologs, by well-known techniques including Southern and/or Northern hybridization, and polymerase chain reaction (PCR). Examples of related polynucleotides include human and non-human genomic sequences, including allelic variants, as well as polynucleotides encoding polypeptides homologous to NgR and structurally related polypeptides sharing one or more biological, immunological, and/or physical properties of NgR. Non-human species genes encoding proteins homologous to NgR can also be identified by Southern and/or PCR analysis and are useful in animal models for NgR disorders. Knowledge of the sequence of a human NgR DNA also makes possible through use of Southern hybridization or polymerase chain reaction (PCR) the identification of genomic DNA sequences encoding NgR expression control regulatory sequences such as promoters, operators, enhancers, repressors, and the like. Polynucleotides of the invention are also useful in hybridization assays to detect the capacity of cells to express NgR Polynucleotides of the invention may also provide a basis for diagnostic methods useful for identifying a genetic alteration(s) in a NgR locus that underlies a disease state or states, which information is useful both for diagnosis and for selection of therapeutic strategies.
[0074]The disclosure herein of a full-length polynucleotide encoding a NgR polypeptide makes readily available to the worker of ordinary skill in the art every possible fragment of the full-length polynucleotide. The invention, therefore, provides fragments of NgR-encoding polynucleotides comprising at least 6, and preferably at least 14, 16, 18, 20, 25, 50, or 75 consecutive nucleotides of a polynucleotide encoding NgR. Preferably, fragments of polynucleotides of the invention comprise sequences unique to the NgR-encoding polynucleotide sequence, and therefore hybridize under highly stringent or moderately stringent conditions only (i.e., "specifically") to polynucleotides encoding NgR (or fragments thereof). Polynucleotide fragments of genomic sequences of the invention comprise not only sequences unique to the coding region, but also include fragments of the full-length sequence derived from introns, regulatory regions, and/or other non-translated sequences. Sequences unique to polynucleotides of the invention are recognizable through sequence comparison to other known polynucleotides, and can be identified through use of alignment programs routinely utilized in the art, e.g., those made available in public sequence databases. Such sequences also are recognizable from Southern hybridization analyses to determine the number of fragments of genomic DNA to which a polynucleotide will hybridize. Polynucleotides of the invention can be labeled in a manner that permits their detection, including radioactive, fluorescent and enzymatic labeling.
[0075]Fragments of polynucleotides are particularly useful as probes for detection of full-length or fragment of NgR polynucleotides. One or more polynucleotides can be included in kits that are used to detect the presence of a polynucleotide encoding NgR, or used to detect variations in a polynucleotide sequence encoding NgR.
[0076]The invention also embraces DNAs encoding NgR polypeptides that hybridize under moderately stringent or high stringency conditions to the noncoding strand, or complement, of the polynucleotide in any of SEQ ID NOs:1 or 3
[0077]Stringent conditions are known to those skilled in the art and can be found in CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, N.Y. (1989), 6.3.1?6.3.6. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 85%, 90%, 95%, 98% or 99% homologous to each other typically remain hybridized to each other. A non-limiting example of stringent hybridization conditions is hybridization in a high salt buffer comprising 6×SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA and 500 mg/ml denatured salmon sperm DNA at 65° C. This hybridization is followed by one or more washes in 0.2×SSC, 0.01% BSA at 50° C. An isolated nucleic acid molecule of the invention that hybridizes under stringent conditions to the sequence of SEQ ID NOs:1 or 3 corresponds to a naturally occurring nucleic acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural protein). As used herein, "stringent hybridization conditions" means: 42° C. in a hybridization solution comprising 50% formamide, 1% SDS, 1 M NaCl, 10% (wt/vol) dextran sulfate, and washing twice for 30 minutes at 60° C. in a wash solution comprising 0.1×SSC and 1% SDS.
Vectors
[0078]Another aspect of the present invention is directed to vectors, or recombinant expression vectors, comprising any of the nucleic acid molecules described above. Vectors are used herein either to amplify DNA or RNA encoding NgR and/or to express DNA which encodes NgR. As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA loop into which additional DNA segments can be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as "expression vectors". In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present specification, "plasmid" and "vector" can be used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), that serve equivalent functions.
[0079]Expression of proteins in prokaryotes is most often carried out in E. coli with vectors containing constitutive or inducible promoters directing the expression of either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, usually to the amino terminus of the recombinant protein. Such fusion vectors typically serve three purposes: (1) to increase expression of recombinant protein, (2) to increase the solubility of the recombinant protein; and (3) to aid in the purification of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant protein to enable separation of the recombinant protein from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 67, 3140), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.) that fuse glutathione-S-transferase (GST), maltose E binding protein, or protein A, respectively, to the target recombinant protein.
[0080]Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amrann et al., (1988) Gene 69, 301-315) and pET 11d (Studier et al., GENE EXRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990) 60-89).
[0081]One strategy to maximize recombinant protein expression in E. coli is to express the protein in host bacteria with an impaired capacity to proteolytically cleave the recombinant protein. See, Gottesman, GENE EXPRESSION TECHNOLOGY: METHODS ENZYMOLOGY 185, Acadermic Press, San Diego, Calif. (1990) 119-128. Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the individual codons for each amino acid are those preferentially utilized in E. coli (Wada et al., (1992) Nucleic Acids Res. 20, 2111-2118). Such alteration of nucleic acid sequences of the invention can be carried out by standard DNA synthesis techniques.
[0082]In another embodiment, the NgR expression vector is a yeast expression vector. Examples of vectors for expression in yeast S. cerevisiae include pYepSec1 (Baldari, et al., (1987) EMBO J. 6, 229-234), pMFa (Kurjan and Herskowitz (1982) Cell 30, 933-943), pJRY88 (Schultz et al., (1987) Gene 54, 113-123), pYES2 (Invitrogen Corporation, San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.).
[0083]Alternatively, NgR can be expressed in insect cells using baculovirus expression vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., SF9 cells) include the pAc series (Smith et al., (1983) Mol. Cell. Biol. 3, 2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170, 31-39).
[0084]In yet another embodiment, a nucleic acid of the invention is expressed in mammalian cells using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 (Seed (1987) Nature 329, 840) and pMT2PC (Kaufman et al. (1987) EMBO J. 6, 187-195). When used in mammalian cells, the expression vector's control functions are often provided by viral regulatory elements. For example, commonly used promoters are derived from polyoma, adenovirus 2, cytomegalovirus and Simian Virus 40. For other suitable expression systems for both prokaryotic and eukaryotic cells. See, e.g., Chapters 16 and 17 of Sambrook et al., (Eds.) MOLECULAR CLONING: A LABORATORY MANUAL. 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.
[0085]In another embodiment, the recombinant mammalian expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. (1987) Genes Dev. 1, 268-277), lymphoid-specific promoters (Calame and Eaton (1988) Adv. Immunol. 43, 235-275), in particular promoters of T cell receptors (Winoto and Baltimore (1989) EMBO J. 8, 729-733) and immunoglobulins (Banerji et al. (1983) Cell 33, 729-740; Queen and Baltimore (1983) Cell 33, 741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA 86, 5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230, 912-916), and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 264, 166). Developmentally-regulated promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss (1990) Science 249, 374-379) and the α-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3, 537-546).
[0086]The invention further provides a recombinant expression vector comprising a DNA molecule of the invention cloned into the expression vector in an antisense orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in a manner that allows for expression (by transcription of the DNA molecule) of an RNA molecule that is antisense NgR mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the antisense orientation can be chosen that direct the continuous expression of the antisense RNA molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be chosen that direct constitutive, tissue-specific or cell-type-specific expression of antisense RNA The antisense expression vector can be in the form of a recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the control of a high efficiency regulatory region, the activity of which can be determined by the cell type into which the vector is introduced. For a discussion of the regulation of gene expression using antisense genes see Weintraub et al., Antisense RNA as a molecular tool for genetic analysis, REVIEWS-TRENDS IN GENETICS, Vol. 1(1) 1986.
[0087]Preferred vectors include, but are not limited to, plasmids, phages, cosmids, episomes, viral particles or viruses and integratable DNA fragments (i.e., fragments integratable into the host genome by homologous recombination). Preferred viral particles include, but are not limited to, adenoviruses, baculoviruses, parvoviruses, herpesviruses, poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia viruses and retroviruses. Preferred expression vectors include, but are not limited to, pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). Other expression vectors include, but are not limited to, pSPORT® vectors, pGEM® vectors (Promega), pPROEXvectors® (LTI, Bethesda, Md.), Bluescript® vectors (Stratagene), pQE® vectors (Qiagen), pSE420® (Invitrogen) and pYES2® (Invitrogen).
[0088]Preferred expression vectors are replicable DNA constructs in which a DNA sequence encoding NgR is operably linked or connected to suitable control sequences capable of effecting the expression of the NgR in a suitable host. DNA regions are operably linked or connected when they are functionally related to each other. For example, a promoter is operably linked or connected to a coding sequence if it controls the transcription of the sequence. Amplification vectors do not require expression control domains, but rather need only the ability to replicate in a host, usually conferred by an origin of replication, and a selection gene to facilitate recognition of transformants. The need for control sequences in the expression vector will vary depending upon the host selected and the transformation method chosen. Generally, control sequences include, but are not limited to a transcriptional promoter, enhancers, an optional operator sequence to control transcription, polyadenylation signals, a sequence encoding suitable mRNA ribosomal binding and sequences which control the termination of transcription and translation. Such regulatory sequences are described, for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of protein desired, etc. The expression vectors of the invention can be introduced into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., NgR proteins, mutant forms of NgR, fusion proteins, etc.).
[0089]Preferred vectors preferably contain a promoter that is recognized by the host organism. The promoter sequences of the present invention may be prokaryotic, eukaryotic or viral. Examples of suitable prokaryotic sequences include the PR and PL promoters of bacteriophage lambda (THE BACTERIOPHAGE LAMBDA, Hershey, A. D. (Ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1973), which is incorporated herein by reference in its entirety; LAMBDA II, Hendrix, R. W. (Ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1980), which is incorporated herein by reference in its entirety); the trp, recA, heat shock, and lacZ promoters of E. coli and the SV40 early promoter (Benoist et al., (1981) Nature 290, 304-310, which is incorporated herein by reference in its entirety). Additional promoters include, but are not limited to, mouse mammary tumor virus, long terminal repeat of human immunodeficiency virus, maloney virus, cytomegalovirus immediate early promoter, Epstein Barr virus, Rous sarcoma virus, human actin, human myosin, human hemoglobin, human muscle creatine and human metallothionein.
[0090]Additional regulatory sequences can also be included in preferred vectors. Preferred examples of suitable regulatory sequences are represented by the Shine-Dalgarno sequence of the replicase gene of the phage MS-2 and of the gene cII of bacteriophage lambda. The Shine-Dalgarno sequence may be directly followed by DNA encoding NgR and result in the expression of the mature NgR protein.
[0091]Moreover, suitable expression vectors can include an appropriate marker that allows the screening of the transformed host cells. The transformation of the selected host is carried out using any one of the various techniques well known to the expert in the art and described in Sambrook et al., supra.
[0092]An origin of replication can also be provided either by construction of the vector to include an exogenous origin or may be provided by the host cell chromosomal replication mechanism. If the vector is integrated into the host cell chromosome, the latter may be sufficient. Alternatively, rather than using vectors which contain viral origins of replication, one skilled in the art can transform mammalian cells by the method of co-transformation with a selectable marker and NgR DNA. An example of a suitable marker is dihydrofolate reductase (DHFR) or thymidine kinase (see, U.S. Pat. No. 4,399,216).
[0093]Nucleotide sequences encoding NgR may be recombined with vector DNA in accordance with conventional techniques, including blunt-ended or staggered-ended termini for ligation, restriction enzyme digestion to provide appropriate termini, filling in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining and ligation with appropriate ligases. Techniques for such manipulation are disclosed by Sambrook et al., supra and are well known in the art. Methods for construction of mammalian expression vectors are disclosed in, for example, Okayama et al., (1983) Mol. Cell. Biol. 3:280, Cosman et al. (1986) Mol. Immunol. 23:935, Cosman et al., (1984) Nature 312:768, EP-A-0367566, and WO 91/18982, each of which is incorporated herein by reference in its entirety.
Host Cells and Transformed Host Cells
[0094]According to another aspect of the invention, host cells are provided, including prokaryotic and eukaryotic cells, comprising a polynucleotide of the invention (or vector of the invention) in a manner that permits expression of the encoded NgR polypeptide. Preferably, the cell produces little or no endogenous NgR polypeptide. Polynucleotides of the invention may be introduced into the host cell as part of a circular plasmid, or as linear DNA comprising an isolated protein coding region or a viral vector. Methods for introducing DNA into the host cell that are well known and routinely practiced in the art include transformation, transfection, electroporation, nuclear injection, or fusion with carriers such as liposomes, micelles, ghost cells and protoplasts. Expression systems of the invention include bacterial, yeast, fungal, plant, insect, invertebrate, vertebrate and mammalian cells systems.
[0095]Host cells of the invention are a valuable source of immunogen for development of antibodies specifically immunoreactive with NgR. Host cells of the invention are also useful in methods for the large-scale production of NgR polypeptides wherein the cells are grown in a suitable culture medium and the desired polypeptide products are isolated from the cells, or from the medium in which the cells are grown, by purification methods known in the art, e.g., conventional chromatographic methods including immunoaffinity chromatography, receptor affinity chromatography, hydrophobic interaction chromatography, lectin affinity chromatography, size exclusion filtration, cation or anion exchange chromatography, high pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still other methods of purification include those methods wherein the desired protein is expressed and purified as a fusion protein having a specific tag, label or chelating moiety that is recognized by a specific binding partner or agent. The purified protein can be cleaved to yield the desired protein, or can be left as an intact fusion protein. Cleavage of the fusion component may produce a form of the desired protein having additional amino acid residues as a result of the cleavage process.
[0096]Knowledge of NgR DNA sequences allows for modification of cells to permit, or increase, expression of endogenous NgR Cells can be modified (e.g., by homologous recombination) to provide increased expression by replacing, in whole or in part, the naturally occurring NgR promoter with all or part of a heterologous promoter so that the cells express NgR at higher levels. The heterologous promoter is inserted in such a manner that it is operatively linked to endogenous NgR encoding sequences. (See, for example, PCT International Publication No. WO 94/12650, PCT International Publication No. WO 92120808, and PCT International Publication No. WO 91/09955.) It is also contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which encodes carbamoyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with the heterologous promoter DNA If linked to the NgR coding sequence, amplification of the marker DNA by standard selection methods results in co-amplification of the NgR coding sequences in the cells.
[0097]The DNA sequence information provided by the present invention also makes possible the development (e.g., by homologous recombination or "knock-out" strategies; see Capecchi, Science 244:1288-1292 (1989)) of animals that fail to express functional NgR or that express a variant of NgR. Such animals (especially small laboratory animals such as rats, rabbits and mice) are useful as models for studying the in vivo activities of NgR and modulators of NgR.
[0098]Suitable host cells for expression of the polypeptides of the invention include, but are not limited to, prokaryotes, yeast, and eukaryotes. If a prokaryotic expression vector is employed, then the appropriate host cell would be any prokaryotic cell capable of expressing the cloned sequences. Suitable prokaryotic cells include, but are not limited to, bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas, Streptomyces and Staphylococcus.
[0099]If a eukaryotic expression vector is employed, then the appropriate host cell would be any eukaryotic cell capable of expressing the cloned sequence. Preferably, eukaryotic cells are cells of higher eukaryotes. Suitable eukaryotic cells include, but are not limited to, non-human mammalian tissue culture cells and human tissue culture cells. Preferred host cells include, but are not limited to, insect cells, HeLa cells, Chinese hamster ovary cells (CHO cells), African green monkey kidney cells (COS cells), human 293 cells, and murine 3T3 fibroblasts. Propagation of such cells in cell culture has become a routine procedure (see, Tissue Culture, Academic Press, Kruse and Patterson, Eds. (1973), which is incorporated herein by reference in its entirety).
[0100]In addition, a yeast cell may be employed as a host cell. Preferred yeast cells include, but are not limited to, the genera Saccharomyces, Pichia and Kluveromyces. Preferred yeast hosts are S. cerevisiae and P. pastoris. Preferred yeast vectors can contain an origin of replication sequence from a 2T yeast plasmid, an autonomously replication sequence (ARS), a promoter region, sequences for polyadenylation, sequences for transcription termination and a selectable marker gene Shuttle vectors for replication in both yeast and E. coli are also included herein.
[0101]Alternatively, insect cells may be used as host cells. In a preferred embodiment, the polypeptides of the invention are expressed using a baculovirus expression system (see, Luckow et al., Bio/Technology, 1988, 6, 47; BACULOVIRUS EXPRESSION VECTORS: A LABORATORY MANUAL, O'Rielly et al. (Eds.), W.H. Freeman and Company, New York, 1992; and U.S. Pat. No. 4,879,236, each of which is incorporated herein by reference in its entirety). In addition, the MAXBAC® complete baculovirus expression system (Invitrogen) can, for example, be used for production in insect cells.
[0102]Suitable host cells are discussed further in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
[0103]Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al. (MOLECULAR CLONING: A LABORATORY MANUAL. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals.
[0104]For stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene of interest. Various selectable markers include those that confer resistance to drugs, such as G418, hygromycin, dihydrofolate reductase (DHFR) and methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host cell on the same vector as that encoding NgR or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die).
[0105]In a preferred embodiment, the polypeptides of the invention, including forms of NgR2 and NgR3, soluble forms of NgR, chimeric NgR polypeptides, NgR/Ig fusions and fragments and variations of each of the above are expressed in Chinese Hamster Ovary (CHO) cells.
[0106]In order to introduce the DNA fragment coding for the NgR protein or polypeptide into the CHO cell to express the recombinant NgR protein or polypeptide, it is necessary to construct the expression vector.
[0107]The vectors for CHO expression include, but are not limited to, pA1-11, pXT1, pRc/CMV, pRc/RSV and pcDNAINeo. The promoter is not specifically limited provided it effectively promotes expression in CHO cells. Examples of suitable promoters are: SRα, SV40, LTR, CMV, and HSV-TK. Of these, CMV and Srα promoters are preferred.
[0108]In addition to the above-mentioned promoters, the expression vectors may contain enhancers, splicing signals, polyadenylation signals, selectable markers and an SV40 replication origin. Suitable selectable markers include, but are not limited to the dihydrofolate reductase (DHFR) gene which provides resistance to methotrexate (MTX), the ampicillin resistance gene, and the neomycin resistance gene.
[0109]Examples of the expression vectors each containing the DNA coding for NgR, portions, fragments and soluble constructs thereof, include the vector (such as one described above), into which the promoter is operably linked (preferably upstream) to the nucleotide sequence encoding the desired NgR construct; a polyadenylation signal downstream from the nucleotide sequence encoding the NgR construct; and, preferably, the vector includes an operable DHFR gene. Preferably, the ampicillin resistant gene is also operably contained in the vector.
[0110]CHO cell lacking the DHFR gene (Urlaub, G. et al., (1980) Proc. Natl. Acad. Sci. USA 77, 4216-4220) and CHO-K1 (Proc. Natl. Acad. Sci. USA 60, 1275 (1968)) are suitable for use.
[0111]The NgR expression vectors prepared as above are introduced into CHO cells by any known method, including, but not limited to the calcium phosphate method (Graham and van der Eb (1973) Virol. 52, 456-467) and electroporation (Nuemann et al., (1982) EMBO J. 1, 841-845).
[0112]Transformants carrying the expression vectors are selected based on the above-mentioned selectable markers. Repeated clonal selection of the transformants using the selectable markers allows selection of stable cell lines having high expression of the NgR constructs. Increased MTX concentrations in the selection medium allows gene amplification and greater expression of the desired protein. The CHO cell containing the recombinant NgR can be produced by cultivating the CHO cells containing the NR expression vectors constitutively expressing the NgR constructs.
[0113]Media used in cultivating CHO cells includes DMEM medium supplemented with about 0.5 to 20% fetal calf serum, DMEM medium and RPMI1640 medium. The pH of the medium is preferably about 6 to 8. Cultivation is preferably at about 30 to 40° C. for about 15 to 72 hours with aeration.
[0114]A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) NgR protein. Accordingly, the invention further provides methods for producing NgR protein using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding NgR has been introduced) in a suitable medium such that NgR protein is produced. In another embodiment, the method further comprises isolating NgR from the medium or the host cell.
[0115]In situations where the NgR polypeptide will be found primarily intracellularly, intracellular material (including inclusion bodies for Gram-negative bacteria) can be extracted from the host cell using any standard technique known to one of ordinary skill in the art. Such methods would encompass, by way of example and not by way of limitation, lysing the host cells to release the contents of the periplasm/cytoplasm by French press, homogenization and/or sonication followed by centrifugation.
[0116]If the NgR polypeptide has formed inclusion bodies in the cytosol, such inclusion bodies may frequently bind to the inner and/or outer cellular membranes. Upon centrifugation, the inclusion bodies will be found primarily in the pellet material. The pellet material can then be treated at pH extremes or with one or more chaotropic agents such as a detergent, guanidine, guanidine derivatives, urea, or urea derivatives in the presence of a reducing agent such as dithiothreitol at alkaline pH or tris-carboxyethyl phosphine at acid pH to release, break apart and solubilize the inclusion bodies. Once solubilized, NgR polypeptide can be analyzed using gel electrophoresis, immunoprecipitation or the like. Various methods of isolating the NgR polypeptide would be apparent to one of ordinary skill in the art, for example, isolation may be accomplished using standard methods such as those set forth below and in Marston et al (1990) Meth. Enzymol. 182, 264-275 (incorporated by reference herein in its entirety).
[0117]If isolated NgR polypeptide is not biologically active following the isolation procedure employed, various methods for "refolding" or converting the polypeptide to its tertiary structure and generating disulfide linkages, can be used to restore biological activity. Methods known to one of ordinary skill in the art include adjusting the pH of the solubilized polypeptide to a pH usually above 7 and in the presence of a particular concentration of a chaotrope. The selection of chaotrope is very similar to the choices used for inclusion body solubilization but usually at a lower concentration and is not necessarily the same chaotrope as used for the solubilization. It may be required to employ a reducing agent or the reducing agent plus its oxidized form in a specific ratio, to generate a particular redox potential allowing for disulfide shuffling to occur in the formation of the protein's cysteine bridge(s). Some of the commonly used redox couples include cysteine/cystamine, glutathione (GSH)/dithiobis GSH, cupric chloride, dithiothreitol (DTT)/dithiane DTT, 2-mercaptoethanol (bME)/dithio-b(ME). To increase the efficiency of the refolding, it may be necessary to employ a cosolvent, such as glycerol, polyethylene glycol of various molecular weights and arginine.
Transgenic Animals
[0118]The host cells of the invention can also be used to produce non-human transgenic animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or an embryonic stem cell into which NgR-coding sequences have been introduced. Such host cells can then be used to create non-human transgenic animals in which exogenous NgR sequences have been introduced into their genome or homologous recombinant animals in which endogenous NgR sequences have been altered. Such animals are useful for studying the function and/or activity of NgR and for identifying and/or evaluating modulators of NgR activity. As used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more of the cells of the animal includes a transgene. Other examples of transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that is integrated into the genome of a cell from which a transgenic animal develops and that remains in the genome of the mature animal, thereby directing the expression of an encoded gene product in one or more cell types or tissues of the transgenic animal. As used herein, a "homologous recombinant animal" is a non-human animal, preferably a mammal, more preferably a mouse, in which an endogenous NgR gene has been altered by homologous recombination between the endogenous gene and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to development of the animal.
[0119]A transgenic animal of the invention can be created by introducing NgR-encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the oocyte to develop in a pseudopregnant female foster animal. The human NgR DNA sequence of SEQ ID NOs:1 or 3 can be introduced as a transgene into the genome of a non-human animal. Alternatively, a nonhuman homolog of the human NgR gene, such as a mouse NgR gene, can be isolated based on hybridization to the human NgR cDNA (described further above) and used as a transgene. Intronic sequences and polyadenylation signals can also be included in the transgene to increase the efficiency of expression of the transgene. A tissue-specific regulatory sequence(s) can be operably linked to the NgR transgene to direct expression of NgR protein to particular cells. Methods for generating transgenic animals via embryo manipulation and microinjection, particularly animals such as mice, have become conventional in the art and are described, for example, in U.S. Pat. Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan 1986, in MANIPULATING THE MOUSE EMBRYO, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. Similar methods are used for production of other transgenic animals. A transgenic founder animal can be identified based upon the presence of the NgR transgene in its genome and/or expression of NgR mRNA in tissues or cells of the animals. A transgenic founder animal can then be used to breed additional animals carrying the transgene. Moreover, transgenic animals carrying a transgene encoding NgR can further be bred to other transgenic animals carrying other transgenes.
[0120]To create a homologous recombinant animal, a vector is prepared which contains at least a portion of a NgR gene into which a deletion, addition or substitution has been introduced to thereby alter, e.g., functionally disrupt, the NgR gene. The NgR gene can be a human gene (e.g., SEQ ID NOs:1 or 13), but more preferably, is a non-human homolog of a human NgR gene. For example, a mouse homolog of human NgR gene of SEQ ID NOs:1 or 13 can be used to construct a homologous recombination vector suitable for altering an endogenous NgR gene in the mouse genome. In one embodiment, the vector is designed such that, upon homologous recombination, the endogenous NgR gene is functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a "knock out" vector).
[0121]Alternatively, the vector can be designed such that, upon homologous recombination, the endogenous NgR gene is mutated or otherwise altered but still encodes functional protein (e.g., the upstream regulatory region can be altered to thereby alter the expression of the endogenous NgR protein). In the homologous recombination vector, the altered portion of the NgR gene is flanked at its 5' and 3' ends by additional nucleic acid of the NgR gene to allow for homologous recombination to occur between the exogenous NgR gene carried by the vector and an endogenous NgR gene in an embryonic stem cell. The additional flanking NgR nucleic acid is of sufficient length for successful homologous recombination with the endogenous gene. Typically, several kilobases of flanking DNA (both at the 5' and 3' ends) are included in the vector. See e.g., Thomas et al. (1987) Cell 51:503 for a description of homologous recombination vectors. The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which the introduced NgR gene has homologously recombined with the endogenous NgR gene are selected (see e.g., Li et al. (1992) Cell 69:915).
[0122]The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to form aggregation chimeras. See e.g., Bradley 1987, In: TERATOCARCINOMAS AND EMBRYONIC STEM CELLS: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 113-152. A chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term. Progeny harboring the homologously recombined DNA in their germ cells can be used to breed animals in which all cells of the animal contain the homologously recombined DNA by germline transmission of the transgene. Methods for constructing homologous recombination vectors and homologous recombinant animals are described further in Bradley (1991) Curr. Opin. Biotechnol. 2:823-829; PCT International Publication Nos.: WO 90/11354; WO 91/01140; WO 92/0968; and WO 93/04169.
[0123]In another embodiment, transgenic non-humans animals can be produced that contain selected systems that allow for regulated expression of the transgene. One example of such a system is the cre/loxP recombinase system of bacteriophage P1. For a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. Natl. Acad. Sci USA 89:6232-6236. Another example of a recombinase system is the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 251:1351-1355. If a cre/loxP recombinase system is used to regulate expression of the transgene, animals containing transgenes encoding both the Cre recombinase and a selected protein are required. Such animals can be provided through the construction of "double" transgenic animals, e.g., by mating two transgenic animals, one containing a transgene encoding a selected protein and the other containing a transgene encoding a recombinase.
[0124]Clones of the non-human transgenic animals described herein can also be produced according to the methods described in Wilmut et al. (1997) Nature 385:810-813. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and induced to exit the growth cycle and enter G0 phase. The quiescent cell can then be fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal of the same species from which the quiescent cell is isolated. The reconstructed oocyte is then cultured such that it develops to morula or blastocyte and then transferred to pseudopregnant female foster animal. The offspring borne of this female foster animal will be a clone of the animal from which the cell, e.g., the somatic cell, is isolated.
Antisense
[0125]Also provided by the invention are antisense polynucleotides that recognize and hybridize to NgR polynucleotides. Full-length and fragment antisense polynucleotides are provided. Fragment antisense molecules of the invention include (i) those that specifically recognize and hybridize to NgR RNA (as determined by sequence comparison of DNA encoding NgR to DNA encoding other known molecules). Identification of sequences unique to NgR encoding polynucleotides can be deduced through use of any publicly available sequence database, and/or through use of commercially available sequence comparison programs. After identification of the desired sequences, isolation through restriction digestion or amplification using any of the various polymerase chain reaction techniques well known in the art can be performed. Antisense polynucleotides are particularly relevant to regulating expression of NgR by those cells expressing NgR mRNA.
[0126]Antisense oligonucleotides, or fragments of a nucleotide sequence set forth in SEQ ID NO:1, 3, 13 or sequences complementary or homologous thereto, derived from the nucleotide sequences of the present invention encoding NgR are useful as diagnostic tools for probing gene expression in various tissues. For example, tissue can be probed in situ with oligonucleotide probes carrying detectable groups by conventional autoradiography techniques to investigate native expression of this enzyme or pathological conditions relating thereto. In specific aspects, antisense nucleic acid molecules are provided that comprise a sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire NgR coding strand, or to only a portion thereof. Nucleic acid molecules encoding fragments, homologs, derivatives and analogs of a NgR protein of SEQ ID NO:2, 4 or 14 or antisense nucleic acids complementary to a NgR nucleic acid sequence of SEQ ID NOs:1, 3 or 13 are additionally provided.
[0127]In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the coding strand of a nucleotide sequence encoding NgR. The term "coding region" refers to the region of the nucleotide sequence comprising codons which are translated into amino acid residues (e.g., the protein coding region of human NgR corresponds to the coding region SEQ ID NO:1, 3 or 13). In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence encoding NgR. The term "noncoding region" refers to 5' and 3' sequences which flank the coding region that are not translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions).
[0128]Antisense oligonucleotides are preferably directed to regulatory regions of a nucleotide sequence of SEQ ID NO:1, 3, 13 or mRNA corresponding thereto, including, but not limited to, the initiation codon, TATA box, enhancer sequences, and the like. Given the coding strand sequences encoding NgR disclosed herein (e.g., SEQ ID NO:1, 3 or 13), antisense nucleic acids of the invention can be designed according to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary to the entire coding region of NgR mRNA, but more preferably is an oligonucleotide that is antisense to only a portion of the coding or noncoding region of NgR mRNA. For example, the antisense oligonucleotide can be complementary to the region surrounding the translation start site of NgR mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be constructed using chemical synthesis or enzymatic ligation reactions using procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be used.
[0129]Examples of modified nucleotides that can be used to generate the antisense nucleic acid include: 5-fluorouracil, 5-bromouracil, S-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N2-carboxypropyl)uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically using an expression vector into which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described further in the following subsection).
[0130]The antisense nucleic acid molecules of the invention (preferably oligonucleotides of 10 to 20 nucleotides in length) are typically administered to a subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or genomic DNA encoding a NgR protein to thereby inhibit expression of the protein, e.g., by inhibiting transcription and/or translation. Suppression of NgR expression at either the transcriptional or translational level is useful to generate cellular or animal models for diseases/conditions characterized by aberrant NgR expression. The hybridization can be by conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in the major groove of the double helix.
[0131]Phosphorothioate and methylphosphonate antisense oligonucleotides are specifically contemplated for therapeutic use by the invention. The antisense oligonucleotides may be further modified by adding poly-L-lysine, transferrin polylysine or cholesterol moieties at their 5' end.
[0132]An example of a route of administration of antisense nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified to target selected cells and then administered systemically. For example, for systemic administration, antisense molecules can be modified such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using the vectors described herein. To achieve sufficient intracellular concentrations of antisense molecules, vector constructs in which the antisense nucleic acid molecule is placed under the control of a strong pol II or pol III promoter are preferred.
[0133]In yet another embodiment, the antisense nucleic acid molecule of the invention is an α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, the strands run parallel to each other (Gaultier et al., (1987) Nucleic Acids Res. 15, 6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al., (1987) Nucleic Acids Res. 15, 6131-6148) or a chimeric RNA-DNA analogue (Inoue et al., (1987) FEBS Lett. 215, 327-330).
[0134]The NgR sequences taught in the present invention facilitate the design of novel transcription factors for modulating NgR expression in native cells and animals, and cells transformed or transfected with NgR polynucleotides. For example, the Cys2-His2 zinc finger proteins, which bind DNA via their zinc finger domains, have been shown to be amenable to structural changes that lead to the recognition of different target sequences. These artificial zinc finger proteins recognize specific target sites with high affinity and low dissociation constants, and are able to act as gene switches to modulate gene expression. Knowledge of the particular NgR target sequence of the present invention facilitates the engineering of zinc finger proteins specific for the target sequence using known methods such as a combination of structure-based modeling and screening of phage display libraries (Segal et al., (1999) Proc. Natl. Acad. Sci. USA 96, 2758-2763; Liu et al., (1997) Proc. Nail. Acad. Sci. USA 94, 5525-5530; Greisman et al. (1997) Science 275, 657-661; Choo et al., (1997) J. Mol. Biol. 273, 525-532). Each zinc finger domain usually recognizes three or more base pairs. Since a recognition sequence of 18 base pairs is generally sufficient in length to render it unique in any known genome, a zinc finger protein consisting of 6 tandem repeats of zinc fingers would be expected to ensure specificity for a particular sequence (Segal et al., (1999), above). The artificial zinc finger repeats, designed based on the promoter of NgR sequences, are fused to activation or repression domains to promote or suppress NgR expression (Liu et al., (1997), above). The promoter of NgR may be obtained by standard methods known to one of ordinary skill in the art with the disclosure contained herein and knowledge of the NgR sequence. Alternatively, the zinc finger domains can be fused to the TATA box-binding factor (TBP) with varying lengths of linker region between the zinc finger peptide and the TBP to create either transcriptional activators or repressors (Kim et al., (1997) Proc. Natl. Acad. Sci. USA 94, 3616-3620. Such proteins and polynucleotides that encode them, have utility for modulating NgR expression in vivo in both native cells, animals and humans; and/or cells transfected with NgR-encoding sequences. The novel transcription factor can be delivered to the target cells by transfecting constructs that express the transcription factor (gene therapy), or by introducing the protein. Engineered zinc finger proteins can also be designed to bind RNA sequences for use in therapeutics as alternatives to antisense or catalytic RNA methods (McColl et al., (1997) Proc. Natl. Acad. Sci. USA 96, 9521-9526); Wu et al., (1995) Proc. Natl. Acad. Sci. USA 92, 344-348). The present invention contemplates methods of designing such transcription factors based on the gene sequence of the invention, as well as customized zinc finger proteins, that are useful to modulate NgR expression in cells (native or transformed) whose genetic complement includes these sequences.
Ribozymes and PNA Moieties
[0135]In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes, described in Haselhoff and Gerlach (1988) Nature 334, 585-591) can be used to catalytically cleave NgR mRNA transcripts to thereby inhibit translation of NgR RNA. A ribozyme having specificity for a NgR-encoding nucleic acid can be designed based upon the nucleotide sequence of a NgR DNA disclosed herein (i.e., SEQ ID NOs:1, 3 or 13). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the active site is complementary to the nucleotide sequence to be cleaved in a NgR-encoding mRNA. See, e.g., Cech et al. U.S. Pat. No. 4,987,071; and Cech et al. U.S. Pat. No. 5,116,742. Alternatively, Ng mRNA can be used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993) Science 261, 1411-1418.
[0136]Alternatively, NgR gene expression can be inhibited by targeting nucleotide sequences complementary to the regulatory region of the NgR (e.g., the NgR promoter and/or enhancers) to form triple helical structures that prevent transcription of the NgR gene in target cells. See generally, Helene (1991) Anticancer Drug Des. 6: 569-584; Helene. et al., (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher (1992) BioEssays 14, 807-815.
[0137]In various embodiments, the nucleic acids of NgR can be modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can be modified to generate peptide nucleic acids (see Hyrup et al., (1996) Bioorg. Med. Chem. Lett. 4, 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under conditions of low ionic strength. The synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis protocols as described in Hyrup et al., (1996) above; Perry-O'Keefe et al., (1996) Proc. Nail. Acad. Sci. USA 93, 14670-14675.
[0138]PNAs of NgR can be used in therapeutic and diagnostic applications. For example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs of NgR can also be used, e.g., in the analysis of single base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination with other enzymes, e.g., S1 nucleases (Hyrup (1996), above); or as probes or primers for DNA sequence and hybridization (Hyrup et al., (1996), above; Perry-O'Keefe (1996), above).
[0139]In another embodiment, PNAs of NgR can be modified, e.g., to enhance their stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery known in the art. For example, PNA-DNA chimeras of NgR can be generated that may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, number of bonds between the nucleobases, and orientation (Hyrup (1996), above). The synthesis of PNA-DNA chimeras can be performed as described in Hyrup (1996), above and Finn et al. (1996) Nucleic Acids Res. 24, 3357-3363. For example, a DNA chain can be synthesized on a solid support using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 5'-(4-methoxytrityl) amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA and the 5' end of DNA (Mag et al. (1989) Nucleic Acids Res. 17, 973-988). PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al. (1996), above). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA segment. See, Petersen et al. (1975) Bioorg. Med. Chem. Lett. 5:1119-1124.
[0140]In other embodiments, the oligonucleotide may include other appended groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the cell membrane (see Letsinger et al., (1989) Proc. Natl. Acad. Sci. USA 86, 6553-6556; Lemaitre et al., (1987) Proc. Natl. Acad. Sci. USA 84, 648-652; PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134). In addition, oligonucleotides can be modified with hybridization triggered cleavage agents (see, e.g., Krol et al., (1988) Biotechniques 6, 958-976) or intercalating agents (see, e.g., Zon (1988) Pharm. Res. 5, 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, etc.
[0141]Automated sequencing methods can be used to obtain or verify the nucleotide sequence of NgR. The NgR nucleotide sequences of the present invention are believed to be 100% accurate. However, as is known in the art, nucleotide sequence obtained by automated methods may contain some errors. Nucleotide sequences determined by automation are typically at least about 90%, more typically at least about 95% to at least about 99.9% identical to the actual nucleotide sequence of a given nucleic acid molecule. The actual sequence may be more precisely determined using manual sequencing methods, which are well known in the art. An error in a sequence which results in an insertion or deletion of one or more nucleotides may result in a frame shift in translation such that the predicted amino acid sequence will differ from that which would be predicted from the actual nucleotide sequence of the nucleic acid molecule, starting at the point of the mutation.
Polypeptides
[0142]The invention also provides purified and isolated mammalian NgR polypeptides encoded by a polynucleotide of the invention. Presently preferred is a human NgR polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID NO:14. Another preferred embodiment is a mouse NgR polypeptide comprising the amino acid sequence of NgR3, as set forth in SEQ ID NO:4.
[0143]One aspect of the invention pertains to isolated NgR proteins, and biologically active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided are polypeptide fragments suitable for use as immunogens to raise anti-NgR antibodies. Preferably, fragments of NgR proteins comprise at least one biological activity of NgR. In one embodiment, native NgR proteins can be isolated from cells or tissue sources by an appropriate purification scheme using standard protein purification techniques. In another embodiment, NgR proteins are produced by recombinant DNA techniques. Alternative to recombinant expression, a NgR protein or polypeptide can be synthesized chemically using standard peptide synthesis techniques.
[0144]The invention also embraces polypeptides that have at least 99%, at least 95%, at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, at least 60%, at least 55%, at least 50% or at least 45% identity and/or homology to the preferred polypeptide of the invention. In addition, the invention embraces polypeptides having the consensus sequence shown in SEQ ID NO:6, shown in Table 5) excluding the previously characterized NgR ("NgR1"), and polypeptides comprising at least about 90% of the consensus sequence.
[0145]The term "percentage of sequence identity" is calculated by comparing two optimally aligned sequences over that region of comparison, determining the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of nucleic acids) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the region of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity. The term "substantial identity" as used herein denotes a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 80 percent sequence identity, preferably at least 85 percent identity and often 90 to 95 percent sequence identity, more usually at least 99 percent sequence identity as compared to a reference sequence over a comparison region.
[0146]In one aspect, percent homology is calculated as the percentage of amino acid residues in the smaller of two sequences which align with identical amino acid residue in the sequence being compared, when four gaps in a length of 100 amino acids may be introduced to maximize alignment (Dayhoff, in ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, Vol. 5, p. 124, National Biochemical Research Foundation, Washington, D.C. (1972), incorporated herein by reference).
[0147]A determination of homology or identity is typically made by a computer homology program known in the art. An exemplary program is the Gap program (Wisconsin Sequence Analysis Package, Version 8 for UNIX Genetics Computer Group, University Research Park, Madison, Wis.) using the default settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math, 1981, 2, 482-489, which in incorporated herein by reference in its entirety). Employing the GAP software provided in the GCG program package, (see Needleman and Wunsch (1970) J. Mol. Biol. 48, 443-453) the following settings for nucleic acid sequence comparison may be used: GAP creation penalty of 5.0 and GAP extension penalty of 0.3, the coding region of the analogous nucleic acid sequences referred to above exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with the CDS (encoding) part of the DNA sequence shown in SEQ ID NOs:1, 3 or 13. BestFit was originally written for Version 1.0 by Paul Haeberli from a careful reading of the papers by Needleman and Wunsch (1970), above, and Smith and Waterman (1981), above. The following Bestfit settings for nucleic acid sequence comparison may be used: GAP creation penalty of 8.0 and GAP extension penalty of 2, the coding region of the analogous nucleic acid sequences referred to above exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%, with the CDS (encoding) part of the amino acid sequence shown in SEQ ID NOs:2, 4 or 14.
[0148]Alternatively, homology may be determined by hybridization analysis wherein a nucleic acid sequence is hybridized to the complement of a sequence encoding the aforementioned proteins under stringent, moderately stringent, or low stringent conditions. See e.g. Ausubel, et al., Eds.) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, N.Y., 1993, and below.
[0149]Polypeptides of the invention may be isolated from natural cell sources or may be chemically synthesized, but are preferably produced by recombinant procedures involving host cells of the invention.
[0150]An "isolated" or "purified" protein or biologically active portion thereof is substantially free of cellular material or other contaminating proteins from the cell or tissue source from which the NgR protein is derived, or substantially free from chemical precursors or other chemicals when chemically synthesized. The language "substantially free of cellular material" includes preparations of NgR protein in which the protein is separated from cellular components of the cells from which it is isolated or recombinantly produced. In one embodiment, the language "substantially free of cellular material" includes preparations of NgR protein having less than about 30% (by dry weight) of non-NgR protein (also referred to herein as a "contaminating protein"), more preferably less than about 20% of non-NgR protein, still more preferably less than about 10% of non-NgR protein, and most preferably less than about 5% non-NgR protein. When the NgR protein or biologically active portion thereof is recombinantly produced, it is also preferably substantially free of culture medium, i.e., culture medium represents less than about 20%, more preferably less than about 10%, and most preferably less than about 5% of the volume of the protein preparation.
[0151]The language "substantially free of chemical precursors or other chemicals" includes preparations of NgR protein in which the protein is separated from chemical precursors or other chemicals that are involved in the synthesis of the protein. In one embodiment, the language "substantially free of chemical precursors or other chemicals" includes preparations of NgR protein having less than about 30% (by dry weight) of chemical precursors or non-NgR chemicals, more preferably less than about 20% chemical precursors or non-NgR chemicals, still more preferably less than about 10% chemical precursors or non-NgR chemicals, and most preferably less than about 5% chemical precursors or non-NgR chemicals.
[0152]Biologically active portions of a NgR protein include peptides comprising amino acid sequences sufficiently homologous to or derived from the amino acid sequence of the NgR protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 4 or 14 that include fewer amino acids than the full length NgR proteins, and exhibit at least one activity of a NgR protein. Typically, biologically active portions comprise a domain or motif with at least one activity of the NgR protein. A biologically active portion of a NgR protein can be a polypeptide which is, for example, 10, 25, 50, 100 or more amino acids in length.
[0153]A biologically active portion of a NgR protein of the present invention may contain at least one of the features that is conserved between the NgR proteins (e.g., a conserved cysteine as the N-terminus of the mature protein, four conserved cysteines in the N-terminus before a leucine-rich region, four conserved cysteines C-terminal with respect to a leucine repeat region, eight leucine-rich repeats, and a hydrophobic C-terminus). An alternative biologically active portion of a NgR protein may contain at least two of the above-identified domains. Another biologically active portion of a NgR protein may contain at least three of the above-identified domains. Yet another biologically active portion of a NgR protein of the present invention may contain at least four of the above-identified domains.
[0154]Moreover, other biologically active portions, in which other regions of the protein are deleted, can be prepared by recombinant techniques and evaluated for one or more of the functional activities of a native NgR protein.
[0155]In an embodiment, the NgR protein has an amino acid sequence shown in SEQ ID NO:2, 4 or 14. In other embodiments, the NgR protein is substantially homologous to SEQ ID NO:2, 4 or 14 and retains the functional activity of the protein of SEQ ID NO:2, 4 or 14, yet differs in amino acid sequence due to natural allelic variation or mutagenesis, as described in detail below.
[0156]Accordingly, in another embodiment, the NgR protein is a protein that comprises an amino acid sequence at least about 45% homologous to the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:4 or SEQ ID NO:14 and retains the functional activity of the NgR proteins of SEQ ID NO:2, 4 or 14.
[0157]Use of mammalian host cells is expected to provide for such post-translational modifications (e.g., glycosylation, truncation, lipidation and phosphorylation) as may be needed to confer optimal biological activity on recombinant expression products of the invention. Glycosylated and non-glycosylated forms of NgR polypeptides are embraced by the invention.
[0158]The invention also embraces variant (or analog) NgR polypeptides. In one example, insertion variants are provided wherein one or more amino acid residues supplement a NgR amino acid sequence. Insertions may be located at either or both termini of the protein, or may be positioned within internal regions of the NgR amino acid sequence. Insertional variants with additional residues at either or both termini can include, for example, fusion proteins and proteins including amino acid tags or labels.
[0159]Insertion variants include NgR polypeptides wherein one or more amino acid residues are added to a NgR acid sequence or to a biologically active fragment thereof.
[0160]Variant products of the invention also include mature NgR products, i.e., NgR products wherein leader or signal sequences are removed, with additional amino terminal residues. The additional amino terminal residues may be derived from another protein, or may include one or more residues that are not identifiable as being derived from specific proteins. NgR products with an additional methionine residue at position -1 (Met-1-NgR) are contemplated, as are variants with additional methionine and lysine residues at positions -2 and -1 (Met-2-Lys-1-NgR). Variants of NgR with additional Met, Met-Lys, Lys residues (or one or more basic residues in general) are particularly useful for enhanced recombinant protein production in bacterial host cells.
Polypeptide Variants
[0161]The invention also embraces NgR variants having additional amino acid residues which result from use of specific expression systems.
[0162]As used herein, a NgR "chimeric protein" or "fusion protein" comprises a NgR polypeptide operatively linked to a non-NgR polypeptide. A "NgR polypeptide" refers to a polypeptide having an amino acid sequence corresponding to NgR, whereas a "non-NgR polypeptide" refers to a polypeptide having an amino acid sequence corresponding to a protein that is not homologous to the NgR protein, e.g., a protein that is different from the NgR protein and that is derived from the same or a different organism. Within a NgR fusion protein the NgR polypeptide can correspond to all or a portion of a NgR protein. In one embodiment, a NgR fusion protein comprises at least one biologically active portion of a NgR protein. In another embodiment, a NgR fusion protein comprises at least two biologically active portions of a NgR protein. In yet another embodiment, a NgR fusion protein comprises at least three biologically active portions of a NgR protein. Within the fusion protein, the term "operatively linked" is intended to indicate that the NgR polypeptide and the non-NgR polypeptide are fused in-frame to each other. The non-NgR polypeptide can be fused to the N-terminus or C-terminus of the NgR polypeptide.
[0163]For example, in one embodiment a NgR fusion protein comprises a NgR domain operably linked to the extracellular domain of a second protein. Such fusion proteins can be further utilized in screening assays for compounds which modulate NgR activity (such assays are described in detail below).
[0164]For example, use of commercially available vectors that express a desired polypeptide as part of a glutathione-S-transferase (GST) fusion product provides the desired polypeptide having an additional glycine residue at position -1 after cleavage of the GST component from the desired polypeptide.
[0165]In another embodiment, the fusion protein is a NgR protein containing a heterologous signal sequence at its N-terminus. For example, the native NgR signal sequence (i.e., amino acids 1-30 of SEQ ID NO:2 and amino acids 1-40 of SEQ ID NO:4) can be removed and replaced with a signal sequence from another protein. In certain host cells (e.g., mammalian host cells), expression and/or secretion NgR can be increased through use of a heterologous signal sequence.
[0166]In yet another embodiment, the fusion protein is a NgR-immunoglobulin fusion protein in which the NgR sequences comprising one or more domains are fused to sequences derived from a member of the immunoglobulin protein family. The NgR-immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an interaction between NgR ligand and a NgR protein on the surface of a cell, to thereby suppress NgR-mediated signal transduction in vivo. NgR-immunoglobulin fusion proteins can be used to affect the bioavailability of a NgR cognate ligand. Inhibition of the NgR ligand/NgR interaction may be useful therapeutically for both the treatment of proliferative and differentiative disorders, as well as modulating (e.g., promoting or inhibiting) cell survival. Moreover, the NgR-immunoglobulin fusion proteins of the invention can be used as immunogens to produce anti-NgR antibodies in a subject, to purify NgR ligands, and in screening assays to identify molecules that inhibit the interaction of NgR with NgR ligand.
[0167]A NgR chimeric or fusion protein of the invention can be produced by standard recombinant DNA techniques. For example, DNA fragments coding for the different polypeptide sequences are ligated together in-frame in accordance with conventional techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using anchor primers that give rise to complementary overhangs between two consecutive gene fragments that can subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for example, Ausubel et al. (Eds.) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST polypeptide). A NgR-encoding nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the NgR protein.
[0168]Variants resulting from expression in other vector systems are also contemplated.
[0169]Insertional variants also include fusion proteins wherein the amino terminus and/or the carboxy terminus of NgR is/are fused to another polypeptide.
[0170]In another aspect, the invention provides deletion variants wherein one or more amino acid residues in a NgR polypeptide are removed. Deletions can be effected at one or both termini of the NgR polypeptide, or with removal of one or more non-terminal amino acid residues of NgR. Deletion variants, therefore, include all fragments of a NgR polypeptide.
[0171]The invention also embraces polypeptide fragments of the sequence set forth in SEQ ID NO:2, 4 or 14 wherein the fragments maintain biological (e.g., ligand binding and/or intracellular signaling) immunological properties of a NgR polypeptide. Fragments comprising at least 4, 5, 10, 15, 20, 25, 30, 35, or 40 consecutive amino acids of SEQ ID NO:2, 4 or 14 are contemplated by the invention. Preferred polypeptide fragments display antigenic properties unique to, or specific for, human NgR and its allelic and species homologs. Fragments of the invention having the desired biological and immunological properties can be prepared by any of the methods well known and routinely practiced in the art.
[0172]In still another aspect, the invention provides substitution variants of NgR polypeptides. Substitution variants include those polypeptides wherein one or more amino acid residues of a NgR polypeptide are removed and replaced with alternative residues. In one aspect, the substitutions are conservative in nature; however, the invention embraces substitutions that are also non-conservative. Conservative substitutions for this purpose may be defined as set out in Tables 2, 3, or 4 below.
TABLE-US-00003 TABLE 1 X.sub.aa# (based on a NTLRRCT Column I Column II domain) (R1, R2, R3) (R2 + R3 only) X1 G, R, M X2 A, D, C X3 V, T X4 N, P, S X5 E, A, S X6 nothing, K nothing X7 V, M, P X8 T, V V X9 Q, P Q X10 Q, A Q X11 Q, H, N X12 G, N N X13 L, F F X14 Q, A, S X15 A, S X16 V, I X17 V, T, E, L X18 S, G X19 L, I X20 A, E, V, P X21 A, S, D X22 S, T X23 Q, E X24 IVL X25 Q, H Q X26 N, G N X27 R, L X28 T, G, R, S X29 F, L, T, H X30 L, V L X31 Q, R, P X32 Q, P, A P X33 G, A G X34 H, T, S X35 S, G, R X36 P, S, A X37 C, nothing nothing X38 R, nothing nothing X39 A, N X40 M, L X41 V, L, T X42 T, I T X43 L, I X44 Y, F, H X45 N, V N X46 I, L X47 T, S, A X48 F, Y, T, R X49 A, H, Y, D X50 P, A P X51 N, S, G, A X52 T, A T X53 E, R, T X54 G, H X55 F, L X56 V, Q, H X57 H, A, L X58 E, Q E X59 G, S G X60 R, A R X61 Q, H Q X62 R, H H X63 T, S X64 L, V L X65 A, E, D X66 E, D, A X67 Q, H Q X68 V, E, G X69 K, R X70 H, Q X71 A, S, T X72 Y, H X73 Y, D Y X74 K, R X75 G, Q X76 S, Q S X77 A, S, E X78 P, G P X79 A, G, P X80 G, N X81 I, V, L X82 G, R X83 H, V, A X84 S, A S X85 D, E X86 H, S, A X87 I, L X88 E, L, Q X89 Y, H, A X90 Q, P Q X91 D, N X92 I, L, T X93 V, A, R X94 V, A, G X95 S, T S X96 K, R X97 L, I L X98 W, R, S X99 S, L X100 L, V L X101 G, T, P X102 Q, P, E X103 G, H, R X104 I, T, V, A X105 V, G, H X106 N, S X107 E, G, Q X108 Q, R X109 L, V X110 Q, A X111 W, G, H X112 H, R, P X113 K, A, H X114 H, R X115 D, G X116 H, R, S, G X117 T, M X118 T, I X119 F, Y X120 N, A X121 S, N X122 T, A, S X123 E, S, A X124 Q, P X125 G, T X126 D, E D X127 C, A X128 P, D X129 V, G, P, R X130 A, S X131 E, Q Q X132 F, Y F X133 G, A, D X134 A, P X135 D, A, V X136 G, D X137 A, E X138 S, P X139 E, A X140 L, F X141 R, Q X142 R, K R X143 R, K R X144 F, A X145 G, V X146 A, D, E X147 T, P X148 A, V, S X149 T, S, L X150 E, G, P, Q X151 L, E, R X152 R, L R X153 G, D X154 Q, H, A X155 Q, R X156 K, R X157 L, A, R X158 R, A R X159 V, A, E X160 E, A, N X161 F, L F X162 R, Q X163 N, A, G
[0173]Variant polypeptides include those wherein conservative substitutions have been introduced by modification of polynucleotides encoding polypeptides of the invention. Amino acids can be classified according to physical properties and contribution to secondary and tertiary protein structure. A conservative substitution is recognized in the art as a substitution of one amino acid for another amino acid that has similar properties. Exemplary conservative substitutions are set out in Table 2 (from WO 97/09433, page 10, published Mar. 13, 1997 (PCT/GB96/02197, filed 916/96), immediately below.
TABLE-US-00004 TABLE 2 Conservative Substitutions I SIDE CHAIN CHARACTERISTIC AMINO ACID Aliphatic Non-polar G A P I L V Polar - uncharged C S T M N Q Polar - charged D E K R Aromatic H F W Y Other N Q D E
[0174]Alternatively, conservative amino acids can be grouped as described in Lehninger, [BIOCHEMISTRY, Second Edition; Worth Publishers, Inc. NY, N.Y. (1975), pp. 71-77] as set out in Table 3, immediately below.
TABLE-US-00005 TABLE 3 Conservative Substitutions II SIDE CHAIN CHARACTERISTIC AMINO ACID Non-polar (hydrophobic) A. Aliphatic: A L I V P B. Aromatic: F W C. Sulfur-containing: M D. Boderline: G Uncharged-polar A. Hydroxyl: S T Y B. Amides: N Q C. Sylfhydryl: C D. Boderline: G Positively Charged (Basic): K R H Negatively Charged (Acidic): D E
[0175]As still another alternative, exemplary conservative substitutions are set out in Table 4, below.
TABLE-US-00006 TABLE 4 Conservative Substitutions III Original Residue Exemplary Substitution Ala (A) Val, Leu, Ile Arg (R) Lys, Gln, Asn Asn (N) Gln, His, Lys, Arg Asp (D) Glu Cys (C) Ser Gln (Q) Asn Glu (E) Asp His (H) Asn, Gln, Lys, Arg Ile (I) Leu, Val, Met, Ala, Phe, Leu (L) Ile, Val, Met, Ala, Phe Lys (K) Arg, Gln, Asn Met (M) Leu, Phe, Ile Phe (F) Leu, Val, Ile, Ala Pro (P) Gly Ser (S) Thr Thr (T) Ser Trp (W) Tyr Tyr (Y) Trp, Phe, Thr, Ser Val (V) Ile, Leu, Met, Phe, Ala
[0176]In addition, amino acid residues that are conserved among family members of the NgR proteins of the present invention, as indicated by the alignment presented herein, are also predicted to be particularly unamenable to alteration. For example, NgR proteins of the present invention can contain at least one domain that is a typically conserved region in NgRs. Examples of these conserved domains include, e.g., leucine-rich repeat domain. Amino acid residues that are not conserved or are only semi-conserved among members of the NgR proteins may be readily amenable to alteration.
[0177]Full-length NgRs have an LRR region characterized by the amino acid consensus sequence shown in SEQ ID NO:19. At least some full-length NgRs also include a CT signaling (CTS) domain and a GPI domain.
[0178]The NgR domain designations used herein are defined as follows:
TABLE-US-00007 mNgR1 hNgR1 SEQ ID hNgR2 hNgR3 mNgR3 Domain SEQ ID: 5 NO: 17 SEQ ID: 2 SEQ ID: 14 SEQ ID: 4 Signal Seq. 1-26 1-26 1-30 -- 1-40 LRRNT 27-56 27-56 31-59 -- 41-69 LRR1 57-81 57-81 60-82 5-27 70-92 LRR2 82-105 82-105 83-106 28-51 93-106 LRR3 106-130 106-130 107-131 52-76 106-141 LRR4 131-154 131-154 132-155 77-100 142-165 LRR5 155-178 155-178 156-179 101-124 166-189 LRR6 179-202 179-202 180-203 125-148 190-213 LRR7 203-226 203-226 204-227 149-172 214-237 LRR8 227-250 227-250 228-251 173-196 238-261 LRRCT 260-309 260-309 261-310 206-255 271-320 CTS 310-445 310-445 311-395 256-396 321-438 (CT Signaling) GPI 446-473 456-473 396-420 370-392 439-462
[0179]In some embodiments of the invention, the above domains are modified. Modification can be in a manner that preserves domain functionality. Modification can include addition, deletion or substitution of certain amino acids. Exemplary modifications include conservative amino acid substitutions. Preferably such substitutions number 20 or fewer per 100 residues. More preferably, such substitutions number 10 or fewer per 100 residues. Further exemplary modifications include addition of flanking sequences of up to five amino acids at the N terminus and/or C terminus of one or more of the domains.
[0180]In some embodiments, the isolated nucleic acid molecule encodes a polypeptide at least about 70%, 80%, 90%, 95%, 98%, and most preferably at least about 99% homologous to SEQ ID NO:2, 4 or 14.
[0181]Mutations can be introduced into SEQ ID NOS:1, 3 or 13 by standard techniques, e.g., site-directed mutagenesis and PCR-mediated mutagenesis. Conservative amino acid substitutions can be made at one or more amino acid residues predicted to be non-essential. Alternatively, mutations can be introduced randomly along a NgR coding sequence. This can be accomplished, e.g., by saturation mutagenesis. The resulting mutants can be screened for NgR biological activity. Biological activities of Ng may include but are not limited to: (1) protein:protein interactions, e.g., with other NgRs or other cell-surface proteins involved in Nogo-related signaling; (2) complex formation with a NgR ligand; (3) binding to an anti-NgR antibody.
[0182]It should be understood that the definition of polypeptides of the invention is intended to include polypeptides bearing modifications other than insertion, deletion, or substitution of amino acid residues. By way of example, the modifications may be covalent in nature, and include for example, chemical bonding with polymers, lipids, other organic and inorganic moieties. Such derivatives may be prepared to increase circulating half-life of a polypeptide, or may be designed to improve the targeting capacity of the polypeptide for desired cells, tissues or organs. Similarly, the invention further embraces NgR polypeptides that have been covalently modified to include one or more water-soluble polymer attachments such as polyethylene glycol, polyoxyethylene glycol or polypropylene glycol. Variants that display ligand binding properties of native NgR and are expressed at higher levels, as well as variants that provide for constitutively active receptors, are particularly useful in assays of the invention; the variants are also useful in providing cellular, tissue and animal models of diseases/conditions characterized by aberrant NgR activity.
[0183]Chemically modified NgR polypeptide compositions in which the NgR polypeptide is linked to a polymer are included within the scope of the present invention. The polymer may be water soluble to prevent precipitation of the protein in an aqueous environment, such as a physiological environment. Suitable water-soluble polymers may be selected from the group consisting of, for example, polyethylene glycol (PEG), monomethoxypolyethylene glycol, dextran, cellulose, or other carbohydrate based polymers, poly-(N-vinyl pyrrolidone) polyethylene glycol, polypropylene glycol homopolymers, a polypropylene oxide/ethylene oxide copolymer polyoxyethylated polyols (e.g. glycerol) and polyvinyl alcohol. The selected polymer is usually modified to have a single reactive group, such as an active ester for acylation or an aldehyde for alkylation, so that the degree of polymerization may be controlled. Polymers may be of any molecular weight, and may be branched or unbranched, and mixtures of such polymers may also be used. When the chemically modified NgR polymer is destined for therapeutic use, pharmaceutically acceptable polymers will be selected for use.
[0184]When the polymer is to be modified by an acylation reaction, the polymer should have a single reactive ester group. Alternatively, if the polymer is to be modified by reductive alkylation, the polymer should have a single reactive aldehyde group. A preferred reactive aldehyde is polyethylene glycol propionaldehyde, which is water stable, or mono Cl-ClO alkoxy or aryloxy derivatives thereof (see U.S. Pat. No. 5,252,714, incorporated by reference herein in its entirety).
[0185]Pegylation of NgR polypeptides may be carried out by any of the pegylation reactions known in the art, as described, for example, in the following references: Focus on Growth Factors 3, 4-10 (1992); EP 0 154 316; and EP 0 401 384 (each of which is incorporated by reference herein in its entirety). Preferably, the pegylation is carried out via an acylation reaction or an alkylation reaction with a reactive polyethylene glycol molecule (or an analogous reactive water-soluble polymer). A preferred water-soluble polymer for pegylation of polypeptides such as NgR is polyethylene glycol (PEG). As used herein, "polyethylene glycol" is meant to encompass any of the forms of PEG that have been used to derivatize other proteins, such as mono (Cl-ClO) alkoxy- or aryloxy-polyethylene glycol.
[0186]Chemical derivatization of NgR polypeptides may be performed under any suitable conditions used to react a biologically active substance with an activated polymer molecule. Methods for preparing pegylated NgR polypeptides will generally comprise the steps of (a) reacting the polypeptide with polyethylene glycol, such as a reactive ester or aldehyde derivative of PEG, under conditions whereby NgR polypeptide becomes attached to one or more PEG groups, and (b) obtaining the reaction products. It will be apparent to one of ordinary skill in the art to select the optimal reaction conditions or the acylation reactions based on known parameters and the desired result.
[0187]Pegylated and other polymer:NgR polypeptides may generally be used to treat conditions that may be alleviated or modulated by administration of the NgR polypeptides described herein. However, the chemically-derivatized polymer:NgR polypeptide molecules disclosed herein may have additional activities, enhanced or reduced biological activity, or other characteristics, such as increased or decreased half-life, as compared to the nonderivatized molecules. The NgR polypeptides, fragments thereof, variants and derivatives, may be employed alone, together, or in combination with other pharmaceutical compositions. The cytokines, growth factors, antibiotics, antiinflammatories and/or chemotherapeutic agents as is appropriate for the indication being treated.
[0188]The present invention provides compositions comprising purified polypeptides of the invention. Preferred compositions comprise, in addition to the polypeptide of the invention, a pharmaceutically acceptable (i.e., sterile and non-toxic) liquid, semisolid, or solid diluent that serves as a pharmaceutical vehicle, excipient or medium. Any diluent known in the art may be used. Exemplary diluents include, but are not limited to, water, saline solutions, polyoxyethylene sorbitan monolaurate, magnesium stearate, methyl- and propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose, dextrose, sorbitol, mannitol, glycerol, calcium phosphate, mineral oil and cocoa butter.
[0189]Variants that display ligand binding properties of native NgR and are expressed at higher levels, as well as variants that provide for constitutively active receptors, are particularly useful in assays of the invention; the variants are also useful in assays of the invention and in providing cellular, tissue and animal models of diseases/conditions characterized by aberrant NgR activity.
[0190]With the knowledge of the nucleotide sequence information disclosed in the present invention, one skilled in the art can identify and obtain nucleotide sequences which encode NgR from different sources (i.e., different tissues or different organisms) through a variety of means well known to the skilled artisan and as disclosed by, for example, Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, Second Edition, Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (1989), which is incorporated herein by reference in its entirety.
[0191]For example, DNA that encodes NgR may be obtained by screening of mRNA, cDNA, or genomic DNA with oligonucleotide probes generated from the NgR gene sequence information provided herein. Probes may be labeled with a detectable group, such as a fluorescent group, a radioactive atom or a chemiluminescent group in accordance with procedures known to the skilled artisan and used in conventional hybridization assays, as described by, for example, Sambrook et al. (1989) above.
[0192]A nucleic acid molecule comprising any of the NgR nucleotide sequences described above can alternatively be synthesized by use of the polymerase chain reaction (PCR) procedure, with the PCR oligonucleotide primers produced from the nucleotide sequences provided herein. See U.S. Pat. Nos. 4,683,195 to Mullis et al. and 4,683,202 to Mullis. The PCR reaction provides a method for selectively increasing the concentration of a particular nucleic acid sequence even when that sequence has not been previously purified and is present only in a single copy in a particular sample. The method can be used to amplify either single- or double-stranded DNA. The essence of the method involves the use of two oligonucleotide probes to serve as primers for the template-dependent, polymerase-mediated replication of a desired nucleic acid molecule.
[0193]A wide variety of alternative cloning and in vitro amplification methodologies are well known to those skilled in the art. Examples of these techniques are found in for example, Berger et al., Guide to Molecular Cloning Techniques, METHODS IN ENZYMOLOGY 152 Academic Press, San Diego, Calif., which is incorporated herein by reference in its entirety.
[0194]The nucleic acid molecules of the present invention, and fragments derived therefrom, are useful for screening for restriction fragment length polymorphism (RFLP) associated with certain disorders, as well as for genetic mapping.
Antibodies
[0195]Also comprehended by the present invention are antibodies (e.g., monoclonal and polyclonal antibodies, single chain antibodies, chimeric antibodies, bifunctional/bispecific antibodies, humanized antibodies, human antibodies, and complementary determining region (CDR)-grafted antibodies, including compounds which include CDR sequences which specifically recognize a polypeptide of the invention) specific for NgR or fragments thereof. Preferred antibodies of the invention are human antibodies which are produced and identified according to methods described in WO93/11236, published Jun. 20, 1993, which is incorporated herein by reference in its entirety. Antibody fragments, including Fab, Fab', F(ab')2, and Fv, are also provided by the invention. The term "specific for," when used to describe antibodies of the invention, indicates that the variable regions of the antibodies of the invention recognize and bind NgR polypeptides exclusively (i.e., are able to distinguish NgR polypeptides from other known NgR polypeptides by virtue of measurable differences in binding affinity, despite the possible existence of localized sequence identity, homology, or similarity between NgR and such polypeptides).
[0196]The antigenic peptide of NgR comprises at least 8 amino acid residues of the amino acid sequence shown in SEQ ID NO:2, 4 or 14 and encompasses an epitope of NgR such that an antibody raised against the peptide forms a specific immune complex with NgR. Preferably, the antigenic peptide comprises at least 10 amino acid residues, more preferably at least 15 amino acid residues, even more preferably at least 20 amino acid residues, and most preferably at least 30 amino acid residues. Preferred epitopes encompassed by the antigenic peptide are regions of NgR that are located on the surface of the protein, e.g., hydrophilic regions.
[0197]It will be understood that specific antibodies may also interact with other proteins (for example, S. aureus protein A or other antibodies in ELISA techniques) through interactions with sequences outside the variable region of the antibodies, and, in particular, in the constant region of the molecule. Screening assays to determine binding specificity of an antibody of the invention are well known and routinely practiced in the art. For a comprehensive discussion of such assays, see Harlow et al. in ANTIBODIES: A LABORATORY MANUAL, Cold Spring Harbor Laboratory Press; Cold Spring Harbor, N.Y. (1988), Chapter 6. Antibodies that recognize and bind fragments of the NgR polypeptides of the invention are also contemplated, provided that the antibodies are specific for NgR polypeptides. Antibodies of the invention can be produced using any method well known and routinely practiced in the art.
[0198]For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, goat, mouse or other mammal) may be immunized by injection with the native protein, or a synthetic variant thereof, or a derivative of the foregoing. An appropriate immunogenic preparation can contain, for example, recombinantly expressed NgR protein or a chemically synthesized NgR polypeptide. The preparation can further include an adjuvant. Various adjuvants used to increase the immunological response include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), human adjuvants such as Bacille Calmette-Guerin and Corynebacterium parvum or similar immunostimulatory agents. If desired, the antibody molecules directed against NgR can be isolated from the mammal (e.g., from the blood) and further purified by well known techniques, such as protein A chromatography to obtain the IgG fraction.
[0199]The term "monoclonal antibody" or "monoclonal antibody composition," as used herein, refers to a population of antibody molecules that contain only one species of an antigen binding site capable of immunoreacting with a particular epitope of NgR. A monoclonal antibody composition thus typically displays a single binding affinity for a particular NgR protein with which it immunoreacts. For preparation of monoclonal antibodies directed towards a particular NgR protein, or derivatives, fragments, analogs or homologs thereof, any technique that provides for the production of antibody molecules by continuous cell line culture may be utilized. Such techniques include, but are not limited to, the hybridoma technique (see Kohler and Milstein (1975) Nature 256, 495-497); the trioma technique; the human B-cell hybridoma technique (see Kozbor et al., (1983) Immunol. Today 4, 72) and the EBV hybridoma technique to produce human monoclonal antibodies (see Cole et al., (1985) in MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96). Human monoclonal antibodies may be utilized in the practice of the present invention and may be produced by using human hybridomas (see Cote et al., (1983) Proc Natl. Acad. Sci. USA 80, 2026-2030) or by transforming human B-cells with Epstein Barr Virus in vitro (see Cole et al., (1985), above).
[0200]According to the invention, techniques can be adapted for the production of single-chain antibodies specific to a NgR protein (see e.g., U.S. Pat. No. 4,946,778). In addition, methods can be adapted for the construction of Fab expression libraries (see e.g., Huse et al., (1989) Science 246, 1275-1281) to allow rapid and effective identification of monoclonal Fab fragments with the desired specificity for a NgR protein or derivatives, fragments, analogs or homologs thereof. Non-human antibodies can be "humanized" by techniques well known in the art. See e.g., U.S. Pat. No. 5,225,539. In one method, the non-human CDRs are inserted into a human antibody or consensus antibody framework sequence. Further changes can then be introduced into the antibody framework to modulate affinity or immunogenicity. Antibody fragments that contain the idiotypes to a NgR protein may be produced by techniques known in the art including, but not limited to: (i) an F(ab)2 fragment produced by pepsin digestion of an antibody molecule; (ii) an Fab fragment generated by reducing the disulfide bridges of an F(ab')2 fragment; (iii) an Fab fragment generated by the treatment of the antibody molecule with papain and a reducing agent and (iv) Fv fragments.
[0201]Additionally, recombinant anti-NgR antibodies, such as chimeric and humanized monoclonal antibodies, comprising both human and non-human portions, which can be made using standard recombinant DNA techniques, are within the scope of the invention. Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA techniques known in the art, for example using methods described in PCT International Application No. PCT/US86/02269; European Patent Application No. 184,187; European Patent Application No. 171,496; European Patent Application No. 173,494; PCT International Publication No. WO 86/01533; U.S. Pat. No. 4,816,567; European Patent Application No. 125,023; Better et al., (1988) Science 240, 1041-1043; Liu et al., (1987) Proc. Natl. Acad. Sci. USA 84, 3439-3443; Liu et al., (1987) J. Immunol. 139, 3521-3526; Sun et al., (1987) Proc. Natl. Acad. Sci. USA 84, 214-218; Nishimura et al., (1987) Cancer Res. 47, 999-1005; Wood et al., (1985) Nature 314, 446-449; Shaw et al., (1988) J. Natl. Cancer Inst. 80, 1553-1559); Morrison (1985) Science 229, 1202-1207; Oi et al., (1986) Biolechniques 4, 214; U.S. Pat. No. 5,225,539, Jones et al., (1986) Nature 321, 552-525; Verhoeyan et al., (1988) Science 239, 1534; and Beidler et al., (1988) J. Immunol. 141, 4053-4060.
[0202]In a preferred embodiment of the invention a portion of a NgR is joined to an Fc portion of an antibody to form a NgR/Fc fusion protein. Preferably, the Ig fusion protein is soluble. The NgR/Fc fusion protein may be formed by recombinant techniques as described above. In one embodiment, a portion of a NgR including the entire amino acid sequence of NgR except the C-terminal hydrophobic region is fused to an Fc portion of an antibody. In preferred embodiments, the NgR is a human NgR and the Fc is also human. More preferably, the human Fc portion is derived from an IgG antibody. In other embodiments, the N-terminal signal sequence is omitted. Such antibodies are useful in binding Nogo to prevent Nogo signaling through the NgR.
[0203]In one embodiment, methods for the screening of antibodies that possess the desired specificity include, but are not limited to, enzyme-linked immunosorbent assay (ELISA) and other immunologically-mediated techniques known within the art. In a specific embodiment, selection of antibodies that are specific to a particular domain of a NgR protein is facilitated by generation of hybridomas that bind to the fragment of a NgR protein possessing such a domain. Antibodies that are specific for one or more domains within a NgR protein, e.g., domains spanning the above-identified conserved regions of NgRs, or derivatives, fragments analogs or homologs thereof, are also provided herein.
[0204]Anti-NgR antibodies may be used in methods known within the art relating to the localization and/or quantitation of a NgR protein (e.g., for use in measuring levels of the NgR protein within appropriate physiological samples, for use in diagnostic methods, for use in imaging the protein, and the like). In a given embodiment, antibodies for NgR proteins, or derivatives, fragments analogs or homologs thereof, that contain the antibody derived binding domain, are utilized as pharmacologically-active compounds [hereinafter "Therapeutics"].
[0205]An anti-NgR antibody (e.g. monoclonal antibody) can be used to isolate NgR by standard techniques, such as affinity chromatography or immunoprecipitation. An anti-NgR antibody can facilitate the purification of natural NgR from cells and of recombinantly produced NgR expressed in host cells. Moreover, an anti-NgR antibody can be used to detect NgR protein (e.g., in a cellular lysate or cell supernatant) in order to evaluate the abundance and pattern of expression of the NgR protein. Anti-NgR antibodies can be used diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, e.g., to, for example, determine the efficacy of a given treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) the antibody to a detectable substance. Examples of detectable substances include various enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent materials and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, β-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; examples of bioluminescent materials include luciferase, luciferin and aequorin, and examples of suitable radioactive material include 125I, 131I, 35S or 3H.
[0206]Another aspect of the present invention is directed to methods of inducing an immune response in a mammal against a polypeptide of the invention by administering to the mammal an amount of the polypeptide sufficient to induce an immune response. The amount will be dependent on the animal species, size of the animal, and the like but can be determined by those skilled in the art.
[0207]Another aspect of the invention is directed to anti-idiotypic antibodies and anti-anti-idiotypic antibodies. An anti-idiotypic antibody is an antibody that recognizes determinants of another antibody (a target antibody). Generally, the anti-idiotypic antibody recognizes determinants of the antigen-binding site of the target antibody. Typically, the target antibody is a monoclonal antibody. An anti-idiotypic antibody is generally prepared by immunizing an animal (particularly, mice) of the same species and genetic type as the source of the target monoclonal antibody, with the target monoclonal antibody. The immunized animal mounts an immune response to the idiotypic determinants of the target monoclonal antibody and produces antibodies against the idiotypic determinants of the target monoclonal antibody. Antibody-producing cells, such as splenic cells, of the immunized animal may be used to generate anti-idiotypic monoclonal antibodies. Furthermore, an anti-idiotypic antibody may also be used to immunize animals to produce anti-anti-idiotypic antibodies. These immunized animals may be used to generate anti-anti-idiotypic monoclonal antibodies using standard techniques. The anti-anti-idiotypic antibodies may bind to the same epitope as the original, target monoclonal antibody used to prepare the anti-idiotypic antibody. The anti-anti-idiotypic antibodies represent other monoclonal antibodies with the same antigen specificity as the original target monoclonal antibody.
[0208]If the binding of the anti-idiotypic antibody with the target antibody is inhibited by the relevant antigen of the target antibody, and if the anti-idiotypic antibody induces an antibody response with the same specificity as the target antibody, it mimics the antigen of the target antibody. Such an anti-idiotypic antibody is an "internal image anti-idiotype" and is capable of inducing an antibody response as if it were the original antigen. (Bona and Kohler (1984) ANTI-IDIOTYPIC ANTIBODIES AND INTERNAL IMAGE, IN MONOCLONAL AND ANTI-IDIOTYPIC ANTIBODIES: PROBES FOR RECEPTOR STRUCTURE AND FUNCTION, Venter J. C. et al. (Eds), Alan R Liss, New York, N.Y., pp 141-149, 1984) Vaccines incorporating internal image anti-idiotype antibodies have been shown to induce protective responses against viruses, bacteria, and parasites (Kennedy et al., (1986) 232, 220-223; 1047; McNamara et al., (1985) Science 226, 1325-1326). Internal image anti-idiotypic antibodies have also been shown to induce immunity to tumor related antigens (Raychauhuri et al., (1986) J. Immunol. 137, 1743-1749; Raychauhuri et al., (1987) J. Immunol. 139, 3902-3910; Bhattacharya-Chatterjee et al., (1987) J. Immunol. 139, 1354-1360; Bhattacharya-Chatterjee et al., (1988) J. Immunol. 141, 1398-1403; Herlyn. et al. (1989) Intern. Rev. Immunol. 4, 347-357; Chen et al. (1990) Cell Imm. Immunother. Cancer 351-359; Herlyn et al., (1991) in vivo 5, 615-624; Furuya et al. (1992) AntiCancer Res. 12, 27-32; Mittelman, A. et al. (1992) Proc. Natl. Acad. Sci., USA 89, 466-470; Durrant. et al., (1994) Cancer Res. 54, 4837-4840; Mittelman. et al. (1994) Cancer Res. 54, 415-421; Schmitt et al. (1994) Hybridoma 13, 389-396; Chakrobarty. et al. (1995) J. Immunother. 18, 95-103; Chakrobarty. et al. (1995) Cancer Res. 55, 1525-1530; Foon, K. A. et al. (1995) Clin. Cancer Res. 1, 1205-1294; Herlyn et al. (1995) Hybridoma 14, 159-166; Sclebusch et al. (1995) Hybridoma 14, 167-174; Herlyn. et al. (1996) Cancer Immunol Immunother. 43, 65-76).
[0209]Anti-idiotypic antibodies for NgR may be prepared, for example, by immunizing an animal, such as a mouse, with a immunogenic amount of a composition comprising NgR2 (SEQ ID NO:2), NgR3 (SEQ ID NOs:4 or 14), or immunogenic portion thereof, containing at least one antigenic epitope of NgR. The composition may also contain a suitable adjuvant, and any carrier necessary to provide immunogenicity. Monoclonal antibodies recognizing NgR may be prepared from the cells of the immunized animal as described above. A monoclonal antibody recognizing an epitope of NgR is then selected and used to prepare a composition comprising an immunogenic amount of the anti-NgR monoclonal antibody. Typically, a 25 to 200 μg dose of purified anti-NgR monoclonal would be sufficient in a suitable adjuvant.
[0210]Animals may be immunized 2-6 times at 14 to 30 day intervals between doses. Typically, animals are immunized by any suitable route of administration, such as intraperitoneal, subcutaneous, intravenous or a combination of these. Anti-idiotypic antibody production may be monitored during the immunization period using standard immunoassay methods. Animals with suitable titers of antibodies reactive with the target monoclonal antibodies may be reimmunized with the monoclonal antibody used as the immunogen three days before harvesting the antibody producing cells. Preferably, spleen cells are used, although other antibody producing cells may be selected. Antibody-producing cells are harvested and fused with myeloma cells to produce Hybridomas, as described above, and suitable anti-idiotypic antibody-producing cells are selected.
[0211]Anti-anti-idiotypic antibodies are produced by another round of immunization and Hybridoma production by using the anti-idiotypic monoclonal antibody as the immunogen.
[0212]Antibodies of the invention are useful for, e.g., therapeutic purposes (by modulating activity of NgR), diagnostic purposes to detect or quantitate NgR, and purification of NgR. Therefore, kits comprising an antibody of the invention for any of the purposes described herein are also comprehended.
Kits
[0213]The present invention is also directed to kits, including pharmaceutical kits. The kits can comprise any of the nucleic acid molecules described above, any of the polypeptides described above, or any antibody which binds to a polypeptide of the invention as described above, as well appropriate controls, such as positive and/or negative controls. The kit preferably comprises additional components, such as, for example, instructions, solid support, reagents helpful for quantification, and the like. For example, the kit can comprise: a labeled compound or agent capable of detecting NgR protein or mRNA in a biological sample; means for determining the amount of NgR in the sample; and means for comparing the amount of NgR in the sample with a standard. The compound or agent can be packaged in a suitable container.
Screening Assays
[0214]The DNA and amino acid sequence information provided by the present invention also makes possible identification of binding partner compounds with which a NgR polypeptide or polynucleotide will interact. Methods to identify binding partner compounds include solution assays, in vitro assays wherein NgR polypeptides are immobilized and cell-based assays. Identification of binding partner compounds of NgR polypeptides provides candidates for therapeutic or prophylactic intervention in pathologies associated with NgR normal and aberrant biological activity.
[0215]The invention also provides a method (also referred to herein as a "screening assay") for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, peptidomimetics, small molecules (e.g., molecules of less than 1,000 Daltons) or other drugs) that bind to NgR proteins or have a stimulatory or inhibitory effect on, for example, NgR expression or NgR activity.
[0216]In one embodiment, the invention provides assays for screening candidate or test compounds which bind to or modulate the activity of a NgR protein or polypeptide or biologically active portion thereof. The test compounds of the present invention can be obtained using any of the numerous approaches in combinatorial library methods known in the art, including: biological libraries; spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the "one-bead one-compound" library method; and synthetic library methods using affinity chromatography selection. The biological library approach is limited to peptide libraries, while the other four approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of compounds (Lam (1997) Anticancer Drug Des. 12, 145).
[0217]Examples of methods for the synthesis of molecular libraries can be found in the art, for example in: DeWitt et al., (1993) Proc. Natl. Acad. Sci. USA 90, 6909; Erb et al., (1994) Proc. Natl. Acad. Sci. USA 91, 11422; Zuckermann et al. (1994) J. Med. Chem. 37, 2678; Cho et al., (1993) Science 261, 1303; Carrell et al., (1994) Angew Chem. Int. Ed Engl. 33, 2059; Carell et al., (1994) Angew Chem. Int Ed Engl. 33, 2061; and Gallop et al., (1994) J. Med. Chem 37, 1233
[0218]Libraries of compounds may be presented in solution (e.g., Houghten (1992) BioTechniques. 13, 412-421), or on beads (Lam (1991) Nature 354, 82-84), on chips (Fodor (1993) Nature 364, 555-556), bacteria (Ladner, U.S. Pat. No. 5,223,409), spores (Ladner, above), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA 89, 1865-1869) or on phage (Scott and Smith (1990) Science 249, 386-390; Devlin (1990) Science 249, 404-406; Cwirla et al. (1990) Proc. Natl. Acad. Sci. USA 87, 6378-6382; Felici (1991) J. Mol. Biol. 222, 301-310; Ladner, above).
[0219]1. Cell-Based Assays
[0220]The invention also provides cell-based assays to identify binding partner compounds of a NgR polypeptide. In one embodiment, the invention provides a method comprising the steps of contacting a NgR polypeptide expressed on the surface of a cell with a candidate binding partner compound and detecting binding of the candidate binding partner compound to the NgR polypeptide. In another embodiment, an assay is a cell-based assay comprising contacting a cell expressing a membrane-bound form of NgR protein, or a biologically active portion thereof, on the cell surface with a test compound and determining the ability of the test compound to modulate (e.g., stimulate or inhibit) the activity of the NgR protein or biologically active portion thereof.
[0221]In one embodiment, an assay is a cell-based assay in which a cell which expresses a membrane-bound form of NgR protein, or a biologically active portion thereof, on the cell surface is contacted with a test compound and the ability of the test compound to bind to a NgR protein determined. The cell, for example, can be of mammalian origin or a yeast cell. Determining the ability of the test compound to bind to the NgR protein can be accomplished, for example, by coupling the test compound with a radioisotope or enzymatic label such that binding of the test compound to the NgR protein or biologically active portion thereof can be determined by detecting the labeled compound in a complex. For example, test compounds can be labeled with 125I, 35S, 14C, or 3H, either directly or indirectly, and the radioisotope detected by direct counting of radioemission or by scintillation counting. Alternatively, test compounds can be enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase or luciferase, and the enzymatic label detected by determination of conversion of an appropriate substrate to product. In one embodiment, the assay comprises contacting a cell which expresses a membrane-bound form of NgR protein or a biologically active portion thereof, on the cell surface with a known compound which binds NgR to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the test compound to interact with a NgR protein, wherein determining the ability of the test compound to interact with a NgR protein comprises determining the ability of the test compound to preferentially bind to NgR or a biologically active portion thereof as compared to the known compound.
[0222]Determining the ability of the test compound to modulate the activity of NgR or a biologically active portion thereof can be accomplished, for example, by determining the ability of the NgR protein to bind to or interact with a NgR target molecule. As used herein, a "target molecule" is a molecule with which a NgR protein binds or interacts in nature, for example, a molecule on the surface of a cell which expresses a NgR protein, a molecule on the surface of a second cell, a molecule in the extracellular milieu, a molecule associated with the internal surface of a cell membrane or a cytoplasmic molecule. A NgR target molecule can be a non-NgR molecule or a NgR protein or polypeptide of the present invention. In one embodiment, a NgR target molecule is a component of a signal transduction pathway that facilitates transduction of an extracellular signal (e.g., a signal generated by binding of a compound to a membrane-bound NgR molecule) through the cell membrane and into the cell. The target, for example, can be a second intercellular protein that has catalytic activity or a protein that facilitates the association of downstream signaling molecules with NgR. In a preferred embodiment, the detection comprises detecting a calcium flux or other physiological event in the cell caused by the binding of the molecule.
[0223]Specific binding molecules, including natural ligands and synthetic compounds, can be identified or developed using isolated or recombinant NgR products, NgR variants, or preferably, cells expressing such products. Binding partners are useful for purifying NgR products and detection or quantification of NgR products in fluid and tissue samples using known immunological procedures. Binding molecules are also manifestly useful in modulating (i.e., blocking, inhibiting or stimulating) biological activities of NgR, especially those activities involved in signal transduction.
[0224]2. Cell-Free Assays
[0225](a) Direct Binding:
[0226]The invention includes several assay systems for identifying NgR binding partners. In solution assays, methods of the invention comprise the steps of (a) contacting a NgR polypeptide with one or more candidate binding partner compounds and (b) identifying the compounds that bind to the NgR polypeptide. Identification of the compounds that bind the NgR polypeptide can be achieved by isolating the NgR polypeptide/binding partner complex and separating the binding partner compound from the NgR polypeptide. An additional step of characterizing the physical, biological and/or biochemical properties of the binding partner compound is also comprehended in another embodiment of the invention. In one aspect, the NgR polypeptide/binding partner complex is isolated using an antibody immunospecific for either the NgR polypeptide or the candidate binding partner compound.
[0227]In still other embodiments, either the NgR polypeptide or the candidate binding partner compound comprises a label or tag that facilitates its isolation, and methods of the invention to identify binding partner compounds include a step of isolating the NgR polypeptide/binding partner complex through interaction with the label or tag. An exemplary tag of this type is a poly-histidine sequence, generally around six histidine residues, that permits isolation of a compound so labeled using nickel chelation. Other labels and tags, such as the FLAG® tag (Eastman Kodak, Rochester, N.Y.), well known and routinely used in the art, are embraced by the invention.
[0228](b) Immobilized NgR
[0229]In one variation of an in vitro assay, the invention provides a method comprising the steps of (a) contacting an immobilized NgR polypeptide, or a biologically active fragment thereof with a candidate binding partner compound and (b) detecting binding of the candidate compound to the NgR polypeptide. In an alternative embodiment, the candidate binding partner compound is immobilized and binding of NgR is detected. Immobilization is accomplished using any of the methods well known in the art, including covalent bonding to a support, a bead or a chromatographic resin, as well as non-covalent, high affinity interactions such as antibody binding, or use of streptavidin/biotin binding wherein the immobilized compound includes a biotin moiety. Binding of a test compound to NgR, or interaction of NgR with a target molecule in the presence and absence of a candidate compound, can be accomplished in any vessel suitable for containing the reactants. Examples of such vessels include microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be provided that adds a domain that allows one or both of the proteins to be bound to a matrix. For example, and not by way of limitation, GST-NgR fusion proteins or GST-target fusion proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St Louis, Mo.) or glutathione derivatized microtiter plates, that are then combined with the test compound or the test compound and either the non-adsorbed target protein or NgR protein, and the mixture is incubated under conditions conducive to complex formation (e.g., at physiological conditions for salt and pH). Following incubation, the beads or microtiter plate wells are washed to remove any unbound components, the matrix immobilized in the case of beads, and the complexes determined either directly or indirectly, for example, as described above. Alternatively, the complexes can be dissociated from the matrix, and the level of NgR binding or activity determined using standard techniques.
[0230]Other techniques for immobilizing proteins on matrices can also be used in the screening assays of the invention. For example, either NgR or its target molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated NgR or target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, Ill.), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive with NgR or target molecules, but which do not interfere with binding of the NgR protein to its target molecule, can be derivatized to the wells of the plate, and unbound target or NgR trapped in the wells by antibody conjugation. Methods for detecting such complexes, in addition to those described above for the GST-immobilized complexes, include immunodetection of complexes using antibodies reactive with the NgR or target molecule, as well as enzyme-linked assays that rely on detecting an enzymatic activity associated with the NgR or target molecule.
[0231]Detection of binding can be accomplished (i) using a radioactive label on the compound that is not immobilized, (ii) using of a fluorescent label on the non-immobilized compound, (iii) using an antibody immunospecific for the non-immobilized compound, (iv) using a label on the non-immobilized compound that excites a fluorescent support to which the immobilized compound is attached, (v) determining the activity of the NgR, as well as other techniques well known and routinely practiced in the art.
[0232]Determining the activity of the target molecule, for example, may be accomplished by detecting induction of a cellular second messenger of the target (i.e. intracellular Ca2+, diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target an appropriate substrate, detecting the induction of a reporter gene (comprising a NgR-responsive regulatory element operatively linked to a nucleic acid encoding a detectable marker, e.g. luciferase), or detecting a cellular response, for example, cell survival, cellular differentiation, or cell proliferation.
[0233](c) Competition Experiments
[0234]In yet another embodiment, the assay comprises contacting the NgR protein or biologically active portion thereof with a known compound which binds NgR to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the test compound to interact with a NgR protein, wherein determining the ability of the test compound to interact with a NgR protein comprises determining the ability of the test compound to preferentially bind to NgR or biologically active portion thereof as compared to the known compound.
[0235]In yet another embodiment, the cell-free assay comprises contacting the NgR protein or biologically active portion thereof with a known compound which binds NgR to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the test compound to interact with a NgR protein, wherein determining the ability of the test compound to interact with a NgR protein comprises determining the ability of the NgR protein to modulate the activity of a NgR target molecule.
[0236]The cell-free assays of the present invention are amenable to use of both the soluble form or the membrane-bound form of NgR. In the case of cell-free assays comprising the membrane-bound form of NgR, it may be desirable to utilize a solubilizing agent such that the membrane-bound form of NgR is maintained in solution. Examples of such solubilizing agents include non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®, Isotridecypoly(ethylene glycol ether)n, 3-(3-cholamidopropyl)dimethylamminiol-1-propane sulfonate (CHAPS), 3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-1-propane sulfonate (CHAPSO), or N-dodecyl-N,N-dimethyl-3-ammonio-1-propane sulfonate.
Modulators
[0237]Agents that modulate (i.e., increase, decrease, or block) NgR activity or expression may be identified by incubating a putative modulator with a cell containing a NgR polypeptide or polynucleotide and determining the effect of the putative modulator on NgR activity or expression. The selectivity of a compound that modulates the activity of NgR can be evaluated by comparing its effects on NgR to its effect on other NgR compounds. Selective modulators may include, for example, antibodies and other proteins, peptides or organic molecules which specifically bind to a NgR polypeptide or a NgR-encoding nucleic acid. Modulators of NgR activity will be therapeutically useful in treatment of diseases and physiological conditions in which normal or aberrant NgR activity is involved. NgR polynucleotides, polypeptides and modulators may be used in the treatment of such diseases and conditions associated with demyelination. NgR polynucleotides and polypeptides, as well as NgR modulators, may also be used in diagnostic assays for such diseases or conditions.
[0238]Methods of the invention to identify modulators include variations on any of the methods described above to identify binding partner compounds, the variations including techniques wherein a binding partner compound has been identified and the binding assay is carried out in the presence and absence of a candidate modulator. A modulator is identified in those instances where binding between the NgR polypeptide and the binding partner compound changes in the presence of the candidate modulator compared to binding in the absence of the candidate modulator compound. A modulator that increases binding between the NgR polypeptide and the binding partner compound is described as an enhancer or activator, and a modulator that decreases binding between the NgR polypeptide and the binding partner compound is described as an inhibitor.
[0239]In another embodiment, modulators of NgR expression may be identified in a method wherein a cell is contacted with a candidate compound and the expression of NgR mRNA or protein in the cell is determined. The level of expression of NgR mRNA or protein in the presence of the candidate compound is compared to the level of expression of NgR mRNA or protein in the absence of the candidate compound. The candidate compound can then be identified as a modulator of NgR expression based on this comparison. For example, when expression of NgR mRNA or protein is greater (statistically significantly greater) in the presence of the candidate compound than in its absence, the candidate compound is identified as a stimulator of NgR mRNA or protein expression. Alternatively, when expression of NgR mRNA or protein is less (statistically significantly less) in the presence of the candidate compound than in its absence, the candidate compound is identified as an inhibitor of NgR mRNA or protein expression. The level of NgR mRNA or protein expression in the cells can be determined by methods described herein for detecting NgR mRNA or protein.
High Throughput Screening
[0240]The invention also comprehends high-throughput screening (HTS) assays to identify compounds that interact with or inhibit biological activity (i.e., affect enzymatic activity, binding activity, etc.) of a NgR polypeptide. HTS assays permit screening of large numbers of compounds in an efficient manner. Cell-based HTS systems are contemplated to investigate NgR receptor-ligand interaction. HTS assays are designed to identify "hits" or "lead compounds" having the desired property, from which modifications can be designed to improve the desired property. Chemical modification of the "hit" or "lead compound" is often based on an identifiable structure/activity relationship between the "hit" and the NgR polypeptide.
[0241]Another aspect of the present invention is directed to methods of identifying compounds that bind to either NgR or nucleic acid molecules encoding NgR, comprising contacting NgR, or a nucleic acid molecule encoding the same, with a compound, and determining whether the compound binds NgR or a nucleic acid molecule encoding the same. Binding can be determined by binding assays which are well known to the skilled artisan, including, but not limited to, gel-shift assays, Western blots, radiolabeled competition assay, phage-based expression cloning, co-fractionation by chromatography, co-precipitation, cross linking, interaction trap/two-hybrid analysis, southwestern analysis, ELISA, and the like, which are described in, for example, Ausubel et al. (Eds.), CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, 1999, John Wiley & Sons, NY, which is incorporated herein by reference in its entirety. The NgR proteins, for example, can be used as "bait proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Pat. No. 5,283,317; Zervos et al., (1993) Cell 72, 223-232; Madura et al., (1993) J. Biol. Chem. 268, 12046-12054, Bartel et al., (1993) BioTechniques 14, 920-924; Iwabuchi et al., (1993) Oncogene 8, 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact with NgR ("NgR-binding proteins" or "NgR-bp") and modulate NgR activity. Such NgR-binding proteins are also likely to be involved in the propagation of signals by the NgR proteins as, for example, upstream or downstream elements of the NgR pathway.
[0242]Other assays may be used to identify specific ligands of a NgR receptor, including assays that identify ligands of the target protein through measuring direct binding of test ligands to the target protein, as well as assays that identify ligands of target proteins through affinity ultrafiltration with ion spray mass spectroscopy/HPLC methods or other physical and analytical methods. Alternatively, such binding interactions are evaluated indirectly using the yeast two-hybrid system described in Fields et al., (1989) Nature 340, 245-246, and Fields et al., (1994) Trends Genet. 10, 286-292, both of which are incorporated herein by reference. The two-hybrid system is a genetic assay based on the modular nature of most transcription factors used for detecting interactions between two proteins or polypeptides. It can be used to identify proteins that bind to a known protein of interest, or to delineate domains or residues critical for an interaction. Variations on this methodology have been developed to clone genes that encode DNA binding proteins, to identify peptides that bind to a protein, and to screen for drugs. The two-hybrid system exploits the ability of a pair of interacting proteins to bring a transcription activation domain into close proximity with a DNA binding domain that binds to an upstream activation sequence (UAS) of a reporter gene, and is generally performed in yeast. The assay requires the construction of two hybrid genes encoding (1) a DNA-binding domain that is fused to a first protein and (2) an activation domain fused to a second protein. The DNA-binding domain targets the first hybrid protein to the UAS of the reporter gene; however, because most proteins lack an activation domain, this DNA-binding hybrid protein does not activate transcription of the reporter gene. The second hybrid protein, which contains the activation domain, cannot by itself activate expression of the reporter gene because it does not bind the UAS. However, when both hybrid proteins are present, the noncovalent interaction of the first and second proteins tethers the activation domain to the UAS, activating transcription of the reporter gene. For example, when the first protein is a NgR gene product, or fragment thereof, that is known to interact with another protein or nucleic acid, this assay can be used to detect agents that interfere with the binding interaction. Expression of the reporter gene is monitored as different test agents are added to the system. The presence of an inhibitory agent results in lack of a reporter signal. The compounds to be screened include (which may include compounds that are suspected to bind NgR, or a nucleic acid molecule encoding the same), but are not limited to, extracellular, intracellular, biological or chemical origin.
[0243]The function of the NgR gene product is unclear and no ligands have yet been found which bind the gene product. The yeast two-hybrid assay is useful to identify proteins that bind to the gene product. In an assay to identify proteins that bind to a NgR receptor, or fragment thereof, a fusion polynucleotide encoding both a NgR receptor (or fragment) and a UAS binding domain (i.e., a first protein) may be used in addition, a large number of hybrid genes each encoding a different second protein fused to an activation domain are produced and screened in the assay. Typically, the second protein is encoded by one or more members of a total cDNA or genomic DNA fusion library, with each second protein-coding region being fused to the activation domain. This system is applicable to a wide variety of proteins, and it is not even necessary to know the identity or function of the second binding protein. The system is highly sensitive and can detect interactions not revealed by other methods; even transient interactions may trigger transcription to produce a stable mRNA that can be repeatedly translated to yield the reporter protein.
[0244]Other assays may be used to search for agents that bind to the target protein. One such screening method to identify direct binding of test ligands to a target protein is described in U.S. Pat. No. 5,585,277, incorporated herein by reference. This method relies on the principle that proteins generally exist as a mixture of folded and unfolded states, and continually alternate between the two states. When a test ligand binds to the folded form of a target protein (i.e., when the test ligand is a ligand of the target protein), the target protein molecule bound by the ligand remains in its folded state. Thus, the folded target protein is present to a greater extent in the presence of a test ligand which binds the target protein, than in the absence of a ligand Binding of the ligand to the target protein can be determined by any method which distinguishes between the folded and unfolded states of the target protein. The function of the target protein need not be known in order for this assay to be performed. Virtually any agent can be assessed by this method as a test ligand, including, but not limited to, metals, polypeptides, proteins, lipids, polysaccharides, polynucleotides and small organic molecules.
[0245]Another method for identifying ligands of a target protein is described in Vieboldt et al. (1997) Anal. Chem. 69:1683-1691, incorporated herein by reference. This technique screens combinatorial libraries of 20-30 agents at a time in solution phase for binding to the target protein. Agents that bind to the target protein are separated from other library components by simple membrane washing. The specifically selected molecules that are retained on the filter are subsequently liberated from the target protein and analyzed by HPLC and pneumatically assisted electrospray (ion spray) ionization mass spectroscopy. This procedure selects library components with the greatest affinity for the target protein, and is particularly useful for small molecule libraries.
[0246]The methods of the invention also embrace ligands, especially neuropeptides, that are attached to a label, such as a radiolabel (e.g., 125I, 35S, 32P, 33P, 3H), a fluorescence label, a chemiluminescent label, an enzymic label and an immunogenic label. Modulators falling within the scope of the invention include, but are not limited to, non-peptide molecules such as non-peptide mimetics, non-peptide allosteric effectors, and peptides. The NgR polypeptide or polynucleotide employed in such a test may either be free in solution, attached to a solid support, borne on a cell surface or located intracellularly or associated with a portion of a cell. One skilled in the art can, for example, measure the formation of complexes between NgR and the compound being tested. Alternatively, one skilled in the art can examine the diminution in complex formation between NgR and its substrate caused by the compound being tested.
[0247]Another aspect of the present invention is directed to methods of identifying compounds which modulate (i.e., increase or decrease) activity of NgR comprising contacting NgR with a compound, and determining whether the compound modifies activity of NgR The activity in the presence of the test compared is measured to the activity in the absence of the test compound. Where the activity of the sample containing the test compound is higher than the activity in the sample lacking the test compound, the compound will have increased activity. Similarly, where the activity of the sample containing the test compound is lower than the activity in the sample lacking the test compound, the compound will have inhibited activity.
[0248]The present invention is particularly useful for screening compounds by using NgR in any of a variety of drug screening techniques. The compounds to be screened include (which may include compounds which are suspected to modulate NgR activity), but are not limited to, extracellular, intracellular, biologic or chemical origin. The NgR polypeptide employed in such a test may be in any form, preferably, free in solution, attached to a solid support, borne on a cell surface or located intracellularly One skilled in the art can, for example, measure the formation of complexes between NgR and the compound being tested. Alternatively, one skilled in the art can examine the diminution in complex formation between Nogo-R and its substrate caused by the compound being tested.
[0249]The activity of NgR polypeptides of the invention can be determined by, for example, examining the ability to bind or be activated by chemically synthesized peptide ligands. Alternatively, the activity of the NgR can be assayed by examining their ability to bind calcium ions, hormones, chemokines, neuropeptides, neurotransmitters, nucleotides, lipids, odorants and photons. Alternatively, the activity of the NgR can be determined by examining the activity of effector molecules including, but not limited to, adenylate cyclase, phospholipases and ion channels. Thus, modulators of NgR activity may alter a NgR receptor function, such as a binding property of a receptor or an activity. In various embodiments of the method, the assay may take the form of an ion flux assay, a yeast growth assay, a non-hydrolyzable GTP assay such as a [35S]-GTP S assay, a cAMP assay, an inositol triphosphate assay, a diacylglycerol assay, an Aequorin assay, a Luciferase assay, a FLIPR assay for intracellular Ca2+ concentration, a mitogenesis assay, a MAP Kinase activity assay, an arachidonic acid release assay (e.g., using [3H]-arachidonic acid) and an assay for extracellular acidification rates, as well as other binding or function-based assays of NgR activity that are generally known in the art. NgR activity can be determined by methodologies that are used to assay for FaRP activity, which is well known to those skilled in the art. Biological activities of NgR receptors according to the invention include, but are not limited to, the binding of a natural or an unnatural ligand, as well as any one of the functional activities of NgRs known in the art. Non-limiting examples of NgR activities include transmembrane signaling of various forms, which may involve phosphatidylinositol (PI) association and/or the exertion of an influence over PI; another exemplary activity of NgRs is the binding of accessory proteins or polypeptides that differ from known GPI proteins.
[0250]The modulators of the invention exhibit a variety of chemical structures, which can be generally grouped into non-peptide mimetics of natural NgR receptor ligands, peptide and non-peptide allosteric effectors of NgR receptors, and peptides that may function as activators or inhibitors (competitive, uncompetitive and non-competitive) (e.g., antibody products) of NgR receptors. The invention does not restrict the sources for suitable modulators, which may be obtained from natural sources such as plant, animal or mineral extracts, or non-natural sources such as small molecule libraries, including the products of combinatorial chemical approaches to library construction, and peptide libraries.
[0251]Other assays can be used to examine enzymatic activity including, but not limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are described in, for example, ENZYM ASSAYS: A PRACTICAL APPROACH, Eisenthal and Danson (Eds.), 1992, Oxford University Press, which is incorporated herein by reference in its entirety.
[0252]The use of cDNAs in drug discovery programs is well-known; assays capable of testing thousands of unknown compounds per day in high-throughput screens (HTSs) are thoroughly documented. The literature is replete with examples of the use of radiolabelled ligands in HTS binding assays for drug discovery (see Williams (1991) Med. Res. Rev., 11, 147-184; Sweetnam et al., (1993) J. Nat. Prod 56, 441-455 for review). Recombinant receptors are preferred for binding assay HTS because they allow for better specificity (higher relative purity), provide the ability to generate large amounts of receptor material, and can be used in a broad variety of formats (see Hodgson (1992) Bio/Technology 10, 973-980; each of which is incorporated herein by reference in its entirety).
[0253]A variety of heterologous systems is available for functional expression of recombinant receptors that are well known to those skilled in the art. Such systems include bacteria (Strosberg et al. (1992) Trends Pharmacol. Sci. 13, 95-98), yeast (Pausch (1997) Trends Biotechnol. 15, 487-494), several kinds of insect cells (Vanden Broeck (1996) Int. Rev. Cytol. 164, 189-268), amphibian cells (Jayawickreme et al. (1997) Curr. Opin. Biotechnol. 8, 629-634) and several mammalian cell lines (CHO, HEK293, COS, etc.; see Gerhardt et al. (1997) Eur. J. Pharmacol. 334, 1-23). These examples do not preclude the use of other possible cell expression systems, including cell lines obtained from nematodes (PCT application WO 98137177).
[0254]In preferred embodiments of the invention, methods of screening for compounds which modulate NgR activity comprise contacting test compounds with NgR and assaying for the presence of a complex between the compound and NgR. In such assays, the ligand is typically labeled. After suitable incubation, free ligand is separated from that present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular compound to bind to NgR.
[0255]In another embodiment of the invention, high throughput screening for compounds having suitable binding affinity to NgR is employed. Briefly, large numbers of different small peptide test compounds are synthesized on a solid substrate. The peptide test compounds are contacted with NgR and washed. Bound NgR is then detected by methods well known in the art. Purified polypeptides of the invention can also be coated directly onto plates for use in the aforementioned drug screening techniques. In addition, non-neutralizing antibodies can be used to capture the protein and immobilize it on the solid support.
[0256]Generally, an expressed NgR can be used for HTS binding assays in conjunction with its defined ligand. The identified peptide is labeled with a suitable radioisotope, including, but not limited to, 125I, 3H, 35S or 32P, by methods that are well known to those skilled in the art. Alternatively, the peptides may be labeled by well-known methods with a suitable fluorescent derivative (Baindur et al. (1994) Drug Dev. Res. 33, 373-398; Rogers (1997) Drug Discov. Today 2, 156-160). Radioactive ligand specifically bound to the receptor in membrane preparations made from the cell line expressing the recombinant protein can be detected in HTS assays in one of several standard ways, including filtration of the receptor-ligand complex to separate bound ligand from unbound ligand (Williams (1991) Med. Res. Rev. II, 147-184; Sweetnam et al. (1993) J. Nat. Prod. 56, 441-455). Alternative methods include a scintillation proximity assay (SPA) or a FlashPlate format in which such separation is unnecessary (Nakayama (1998) Curr. Opin. Drug Disc. Dev. 1, 85-91 Bosse et al. (1998) J. Biomol. Screening 3, 285-292). Binding of fluorescent ligands can be detected in various ways, including fluorescence energy transfer (FRET), direct spectrophotofluorometric analysis of bound ligand, or fluorescence polarization (Rogers (1997) Drug Discov. Today 2, 156-160; Hill (1998) Curr. Opin. Drug Disc. Dev. 1, 92-97).
[0257]Examples of such biological responses include, but are not limited to, the following: the ability to survive in the absence of a limiting nutrient in specifically engineered yeast cells (Pausch (1997) Trends in Biotechnol. 15, 487-494); changes in intracellular Ca2+ concentration as measured by fluorescent dyes (Murphy et al. (1998) Cur. Opin Drug Disc. Dev. 1, 192-199). Fluorescence changes can also be used to monitor ligand-induced changes in membrane potential or intracellular pH; an automated system suitable for HTS has been described for these purposes (Schroeder et al. (1996) J. Biomol. Screening 1, 75-80). Melanophores prepared from Xenopus laevis show a ligand-dependent change in pigment organization in response to heterologous NgR activation; this response is adaptable to HTS formats (Jayawickreme et al. (1997) Curr. Opin. Biotechnol. 8, 629-634). Assays are also available for the measurement of common second messengers, including cAMP, phosphoinositides and arachidonic acid, but these are not generally preferred for HTS.
[0258]Preferred methods of HTS employing these receptors include permanently transfected CHO cells, in which agonists and antagonists can be identified by the ability to transduce the signal for the binding of Nogo in membranes prepared from these cells through the putative GPI anchor. In another embodiment of the invention, permanently transfected CHO cells could be used for the preparation of membranes which contain significant amounts of the recombinant receptor proteins; these membrane preparations would then be used in receptor binding assays, employing the radiolabelled ligand specific for the particular receptor. Alternatively, a functional assay, such as fluorescent monitoring of ligand-induced changes in internal Ca2+ concentration or membrane potential in permanently transfected CHO cells containing each of these receptors individually or in combination would be preferred for HTS. Equally preferred would be an alternative type of mammalian cell, such as HEK293 or COS cells, in similar formats. More preferred would be permanently transfected insect cell lines, such as Drosophila S2 cells. Even more preferred would be recombinant yeast cells expressing the Drosophila melanogaster receptors in HTS formats well known to those skilled in the art (e.g., Pausch (1997), above).
[0259]The invention contemplates a multitude of assays to screen and identify inhibitors of ligand binding to NgR receptors. In one example, the NgR receptor is immobilized and interaction with a binding partner is assessed in the presence and absence of a candidate modulator such as an inhibitor compound. In another example, interaction between the NgR receptor and its binding partner is assessed in a solution assay, both in the presence and absence of a candidate inhibitor compound. In either assay, an inhibitor is identified as a compound that decreases binding between the NgR receptor and its binding partner. Another contemplated assay involves a variation of the di-hybrid assay wherein an inhibitor of protein/protein interactions is identified by detection of a positive signal in a transformed or transfected host cell, as described in PCT publication number WO 95/20652, published Aug. 3, 1995.
[0260]Candidate modulators contemplated by the invention include compounds selected from libraries of either potential activators or potential inhibitors. There are a number of different libraries used for the identification of small molecule modulators, including: (1) chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of random peptides, oligonucleotides or organic molecules. Chemical libraries consist of random chemical structures, some of which are analogs of known compounds or analogs of compounds that have been identified as "hits" or "leads" in other drug discovery screens, some of which are derived from natural products, and some of which arise from non-directed synthetic organic chemistry. Natural product libraries are collections of microorganisms, animals, plants, or marine organisms that are used to create mixtures for screening by: (1) fermentation and extraction of broths from soil, plant or marine microorganisms or (2) extraction of plants or marine organisms. Natural product libraries include polyketides, non-ribosomal peptides, and variants (non-naturally occurring) thereof. For a review, see Cane et al., Science (1998) 282, 63-68. Combinatorial libraries are composed of large numbers of peptides, oligonucleotides, or organic compounds as a mixture. These libraries are relatively easy to prepare by traditional automated synthesis methods, PCR, cloning, or proprietary synthetic methods. Of particular interest are non-peptide combinatorial libraries. Still other libraries of interest include peptide, protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. For a review of combinatorial chemistry and libraries created therefrom, see Myers (1997) Curr. Opin. Biotechnol. 8, 701-707. Identification of modulators through use of the various libraries described herein permits modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to modulate activity.
[0261]Still other candidate inhibitors contemplated by the invention can be designed and include soluble forms of binding partners, as well as such binding partners as chimeric, or fusion, proteins. A "binding partner" as used herein broadly encompasses non-peptide modulators, as well as such peptide modulators as neuropeptides other than natural ligands, antibodies, antibody fragments, and modified compounds comprising antibody domains that are immunospecific for the expression product of the identified NgR gene.
[0262]Other embodiments of the invention comprise using competitive screening assays in which neutralizing antibodies capable of binding a polypeptide of the invention specifically compete with a test compound for binding to the polypeptide. In this manner, the antibodies can be used to detect the presence of any peptide that shares one or more antigenic determinants with NgR. Radiolabeled competitive binding studies are described in Lin et al., (1997) Antimicrob. Agents Chemother. 41, 2127-2131, the disclosure of which is incorporated herein by reference in its entirety.
[0263]In other embodiments of the invention, the polypeptides of the invention are employed as a research tool for identification, characterization and purification of interacting, regulatory proteins. Appropriate labels are incorporated into the polypeptides of the invention by various methods known in the art and the polypeptides are used to capture interacting molecules. For example, molecules are incubated with the labeled polypeptides, washed to remove unbound polypeptides, and the polypeptide complex is quantified. Data obtained using different concentrations of polypeptide are used to calculate values for the number, affinity, and association of polypeptide with the protein complex.
[0264]Labeled polypeptides are also useful as reagents for the purification of molecules with which the polypeptide interacts including, but not limited to, inhibitors. In one embodiment of affinity purification, a polypeptide is covalently coupled to a chromatography column. Cells and their membranes are extracted, and various cellular subcomponents are passed over the column. Molecules bind to the column by virtue of their affinity to the polypeptide. The polypeptide-complex is recovered from the column, dissociated and the recovered molecule is subjected to protein sequencing. This amino acid sequence is then used to identify the captured molecule or to design degenerate oligonucleotides for cloning the corresponding gene from an appropriate cDNA library.
[0265]Alternatively, compounds may be identified which exhibit similar properties to the ligand for the NgR of the invention, but which are smaller and exhibit a longer half time than the endogenous ligand in a human or animal body. When an organic compound is designed, a molecule according to the invention is used as a "lead" compound. The design of mimetics to known pharmaceutically active compounds is a well-known approach in the development of pharmaceuticals based on such "lead" compounds. Mimetic design, synthesis and testing are generally used to avoid randomly screening a large number of molecules for a target property. Furthermore, structural data deriving from the analysis of the deduced amino acid sequences encoded by the DNAs of the present invention are useful to design new drugs, more specific and therefore with a higher pharmacological potency.
[0266]Comparison of the protein sequence of the present invention with the sequences present in all the available databases showed a significant homology with the transmembrane portion of G protein coupled receptors. Accordingly, computer modeling can be used to develop a putative tertiary structure of the proteins of the invention based on the available information of the transmembrane domain of other proteins. Thus, novel ligands based on the predicted structure of NgR can be designed.
[0267]This invention further pertains to novel agents identified by the above-described screening assays and uses thereof for treatments as described herein.
Compositions and Pharmaceutical Compositions
[0268]In a particular embodiment, the novel molecules identified by the screening methods according to the invention are low molecular weight organic molecules, in which case a composition or pharmaceutical composition can be prepared thereof for oral or parenteral administration. The compositions, or pharmaceutical compositions, comprising the nucleic acid molecules, vectors, polypeptides, antibodies and compounds identified by the screening methods described herein, typically comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. As used herein, "pharmaceutically acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, compatible with pharmaceutical administration. The nature of the carrier or other ingredients will depend on the specific route of administration and particular embodiment of the invention to be administered. Examples of techniques and protocols that are useful in this context are, inter alia, found in Remington's PHARMACEUTICAL SCIENCES, 16th ed., (1980) Osol, A (Ed.), which is incorporated herein by reference in its entirety. Preferred examples of such carriers or diluents include, but are not limited to, water, saline, Ringer's solution, dextrose solution and 5% human serum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be used. The use of such media and agents for pharmaceutically active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active compound, use thereof in the compositions is contemplated. Supplementary active compounds can also be incorporated into the compositions.
[0269]A pharmaceutical composition of the invention is formulated to be compatible with its intended route of administration. Examples of routes of administration include oral and parenteral (e.g., intravenous, intradermal, subcutaneous, inhalation, transdermal (topical), transmucosal and rectal administration). Solutions or suspensions used for parenteral, intradermal or subcutaneous application can include the following components: a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates, and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic.
[0270]Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions (where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological saline, bacteriostatic water, Cremophor EL® (BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be fluid to the extent that easy syringeability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action of microorganisms can be achieved by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable compositions can be brought about by including in the composition an agent which delays absorption, for example, aluminum monostearate and gelatin.
[0271]Sterile injectable solutions can be prepared by incorporating the active compound (e.g., a NgR protein or anti-NgR antibody) in the required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the active compound into a sterile vehicle that contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.
[0272]Oral compositions generally include an inert diluent or an edible carrier. They can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic administration, the active compound can be incorporated with excipients and used in the form of tablets, troches or capsules. Oral compositions can also be prepared using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and expectorated or swallowed. Pharmaceutically compatible binding agents, and/or adjuvant materials can be included as part of the composition. The tablets, pills, capsules, troches and the like can contain any of the following ingredients, or compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic acid, Primogel or corn starch; a lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl salicylate or orange flavoring.
[0273]For administration by inhalation, the compounds are delivered in the form of an aerosol spray from pressured container or dispenser which contains a suitable propellant, e.g., a gas such as carbon dioxide or a nebulizer.
[0274]Systemic administration can also be by transmucosal or transdermal means. For transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art, and include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal administration can be accomplished through the use of nasal sprays or suppositories. For transdermal administration, the active compounds are formulated into ointments, salves, gels or creams as generally known in the art.
[0275]The compounds can also be prepared in the form of suppositories (e.g., with conventional suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal delivery.
[0276]In one embodiment, the active compounds are prepared with carriers that will protect the compound against rapid elimination from the body, such as a controlled release formulation, including implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be apparent to those skilled in the art. The materials can also be obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically acceptable carriers. These can be prepared according to methods known to those skilled in the art, for example, as described in U.S. Pat. No. 4,522,811. It is especially advantageous to formulate oral or parenteral compositions in dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity of active compound calculated to produce the desired therapeutic effect in association with the required pharmaceutical carrier. The specification for the dosage unit forms of the invention are dictated by and directly dependent on the unique characteristics of the active compound and the particular therapeutic effect to be achieved.
[0277]The nucleic acid molecules of the invention can be inserted into vectors and used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by any of a number of routes, e.g., as described in U.S. Pat. No. 5,703,055. Delivery can thus also include, e.g., intravenous injection, local administration (see U.S. Pat. No. 5,328,470) or stereotactic injection (see e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA 91, 3054-3057). The pharmaceutical preparation of the gene therapy vector can include the gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells that produce the gene delivery system.
[0278]The pharmaceutical compositions can be included in a container, pack or dispenser together with instructions for administration.
[0279]The dosage of these low molecular weight compounds will depend on the disease state or condition to be treated and other clinical factors such as weight and condition of the human or animal and the route of administration of the compound. For treating human or animals, between approximately 0.5 mg/kg of body weight to 500 mg/kg of body weight of the compound can be administered. Therapy is typically administered at lower dosages and is continued until the desired therapeutic outcome is observed.
[0280]Another aspect of the present invention is the use of the NgR nucleotide sequences disclosed herein for identifying homologs of the Nogo-R, in other animals, including but not limited to humans and other mammals and invertebrates. Any of the nucleotide sequences disclosed herein, or any portion thereof, can be used, for example, as probes to screen databases or nucleic acid libraries, such as, for example, genomic or cDNA libraries, to identify homologs using screening procedures well known to those skilled in the art. Accordingly, homologs having at least 50%, more preferably at least 60%, more preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, and most preferably at least 100% homology with NgR sequences can be identified.
[0281]The present compounds and methods, including nucleic acid molecules, polypeptides, antibodies, compounds identified by the screening methods described herein, have a variety of pharmaceutical applications and may be used, for example, to treat or prevent unregulated cellular growth, such as cancer cell and tumor growth. In a particular embodiment, the present molecules are used in gene therapy. For a review of gene therapy procedures, see e.g. Anderson Science (1992) 256, 808-813, which is incorporated herein by reference in its entirety.
[0282]The present invention also encompasses a method of agonizing (stimulating) or antagonizing a NgR natural binding partner associated activity in a mammal comprising administering to said mammal an agonist or antagonist to one of the above disclosed polypeptides in an amount sufficient to effect said agonism or antagonism. One embodiment of the present invention, then, is a method of treating diseases in a mammal with an agonist or antagonist of the protein of the present invention comprising administering the agonist or antagonist to a mammal in an amount sufficient to agonize or antagonize-NgR-associated functions.
[0283]Methods of determining the dosages of compounds to be administered to a patient and modes of administering compounds to an organism are disclosed in U.S. application Ser. No. 08/702,282, filed Aug. 23, 1996, and International patent publication number WO 96/22976, published Aug. 1, 1996, both of which are incorporated herein by reference in their entirety, including any drawings, figures or tables. Those skilled in the art will appreciate that such descriptions are applicable to the present invention and can be easily adapted to it.
[0284]The proper dosage depends on various factors such as the type of disease being treated, the particular composition being used and the size and physiological condition of the patient. Therapeutically effective doses for the compounds described herein can be estimated initially from cell culture and animal models. For example, a dose can be formulated in animal models to achieve a circulating concentration range that initially takes into account the IC50 as determined in cell culture assays. The animal model data can be used to more accurately determine useful doses in humans.
[0285]Plasma half-life and biodistribution of the drug and metabolites in the plasma, tumors and major organs can also be determined to facilitate the selection of drugs most appropriate to inhibit a disorder. Such measurements can be carried out. For example, HPLC analysis can be performed on the plasma of animals treated with the drug and the location of radiolabeled compounds can be determined using detection methods such as X-ray, CAT scan and MRI. Compounds that show potent inhibitory activity in the screening assays, but have poor pharmacokinetic characteristics, can be optimized by altering the chemical structure and retesting. In this regard, compounds displaying good pharmacokinetic characteristics can be used as a model.
[0286]Toxicity studies can also be carried out by measuring the blood cell composition. For example, toxicity studies can be carried out in a suitable animal model as follows: (1) the compound is administered to mice (an untreated control mouse should also be used); (2) blood samples are periodically obtained via the tail vein from one mouse in each treatment group; and (3) the samples are analyzed for red and white blood cell counts, blood cell composition and the percent of lymphocytes versus polymorphonuclear cells. A comparison of results for each dosing regime with the controls indicates if toxicity is present.
[0287]At the termination of each toxicity study, further studies can be carried out by sacrificing the animals (preferably, in accordance with the American Veterinary Medical Association guidelines Report of the American Veterinary Medical Assoc. Panel on Euthanasia, (1993) J. Am. Vet Med. Assoc. 202; 229-249). Representative animals from each treatment group can then be examined by gross necropsy for immediate evidence of metastasis, unusual illness or toxicity. Gross abnormalities in tissue are noted and tissues are examined histologically. Compounds causing a reduction in body weight or blood components are less preferred, as are compounds having an adverse effect on major organs. In general, the greater the adverse effect the less preferred the compound.
[0288]For the treatment of cancers the expected daily dose of a hydrophobic pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and most preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels of the active moiety are sufficient to maintain therapeutic effectiveness. Plasma levels should reflect the potency of the drug. Generally, the more potent the compound the lower the plasma levels necessary to achieve efficacy.
[0289]NgR mRNA transcripts have been found in the brain and heart. SEQ ID NOs: 1 and/or, 3 will, as detailed above, enable screening the endogenous neurotransmitters/hormones/ligands which activate, agonize, or antagonize NgR and for compounds with potential utility in treating disorders including CNS disorders (e.g., stroke) and degenerative disorders such as those associated with demyelination.
[0290]For example, NgR receptor activation may mediate the prevention of neurite outgrowth. Inhibition would be beneficial in both chronic and acute brain injury. See, e.g., Donovan et al., (1997) J. Neurosci. 17, 5316-5326; Turgeon et al., (1998) J. Neurosci. 18, 6882-6891; Smith-Swintosky et al., (1997) J. Neurochem. 69, 1890-1896; Gill et al., (1998) Brain Res. 797, 321-327; Suidan et al., (1996) Semin. Thromb. Hemost. 22, 125-133.
Pharmacogenomics
[0291]Agents, or modulators that have a stimulatory or inhibitory effect on NgR activity (e.g., NgR gene expression), as identified by a screening assay described herein can be administered to individuals to treat (prophylactically or therapeutically) disorders (e.g., a disease condition such as a demyelination disorder) associated with aberrant NgR activity. In conjunction with such treatment, the pharmacogenomics (i.e., the study of the relationship between an individual's genotype and that individual's response to a foreign compound or drug) of the individual may be considered. Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic failure by altering the relation between dose and blood concentration of the pharmacologically active drug. Thus, the pharmacogenomics of the individual permits the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a consideration of the individual's genotype. Such pharmacogenomics can further be used to determine appropriate dosages and therapeutic regimens. Accordingly, the activity of NgR protein, expression of NgR nucleic acid or mutation content of NgR genes in an individual can be determined to thereby select appropriate agent(s) for therapeutic or prophylactic treatment of the individual.
[0292]Pharmacogenomics deals with clinically significant hereditary variations in the response to drugs due to altered drug disposition and abnormal action in affected persons. See e.g., Eichelbaum (1996) Clin. Exp. Pharmacol. Physiol. 23, 983-985 and Linder (1997) Clin. Chem. 43, 254-266. In general, two types of pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a single factor altering the way drugs act on the body (altered drug action) or genetic conditions transmitted as single factors altering the way the body acts on drugs (altered drug metabolism). These pharmacogenetic conditions can occur either as rare defects or as polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main clinical complication is haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of fava beans.
[0293]As an illustrative embodiment, the activity of drug metabolizing enzymes is a major determinant of both the intensity and duration of drug action. The discovery of genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why some patients do not obtain the expected drug effects or show exaggerated drug response and serious toxicity after taking the standard and safe dose of a drug. These polymorphisms are expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among different populations. For example, the gene coding for CYP2D6 is highly polymorphic and several mutations have been identified in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently experience exaggerated drug response and side effects when they receive standard doses. If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra-rapid metabolizers who do not respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification.
[0294]Thus, the activity of NgR protein, expression of NgR nucleic acid, or mutation content of NgR genes in an individual can be determined to thereby select appropriate agent(s) for therapeutic or prophylactic treatment of the individual. In addition, pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the identification of an individual's drug responsiveness phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when treating a subject with a NgR modulator, such as a modulator identified by one of the exemplary screening assays described herein.
Monitoring Clinical Efficacy
[0295]Monitoring the influence of agents (e.g., drugs, compounds) on the expression or activity of NgR (e.g., the ability to modulate aberrant cell proliferation and/or differentiation) can be applied not only in basic drug screening, but also in clinical trials. For example, the effectiveness of an agent determined by a screening assay as described herein to increase NgR gene expression, protein levels or upregulate NgR activity, can be monitored in clinical trials of subjects exhibiting decreased NgR gene expression, protein levels, or downregulated NgR activity. Alternatively, the effectiveness of an agent determined by a screening assay to decrease NgR gene expression, protein levels, or downregulate NgR activity, can be monitored in clinical trials of subjects exhibiting increased NgR gene expression, protein levels, or upregulated NgR activity. In such clinical trials, the expression or activity of NgR and, preferably, other genes that have been implicated in, for example, a disease or disorder, can be used as a "read out" or markers of the immune responsiveness of a particular cell.
[0296]For example, genes, including NgR, that are modulated in cells by treatment with an agent (e.g., compound, drug or small molecule) that modulates NgR activity (e.g., identified in a screening assay as described herein) can be identified. Thus, to study the effect of agents on demyelination disorders, for example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of NgR and other genes implicated in the disorder. The levels of gene expression (i.e., a gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as described herein, or alternatively by measuring the amount of protein produced by one of the methods as described herein or by measuring the levels of activity of NgR or other genes. In this way, the gene expression pattern can serve as a marker, indicative of the physiological response of the cells to the agent. Accordingly, this response state may be determined before, and at various points during, treatment of the individual with the agent.
[0297]In one embodiment, the invention provides a method for monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, protein, peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the screening assays described herein) comprising the steps of (i) obtaining a pre-administration sample from a subject prior to administration of the agent; (ii) detecting the level of expression of a NgR protein, mRNA, or genomic DNA in the preadministration sample; (iii) obtaining one or more post-administration samples from the subject; (iv) detecting the level of expression or activity of the NgR protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing the level of expression or activity of the NgR protein, mRNA or genomic DNA in the pre-administration sample with the NgR protein, mRNA or genomic DNA in the post administration sample or samples; and (vi) altering the administration of the agent to the subject accordingly. For example, increased administration of the agent may be desirable to increase the expression or activity of NgR to higher levels than detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased administration of the agent may be desirable to decrease expression or activity of NgR to lower levels than detected, i.e., to decrease the effectiveness of the agent.
Methods of Treatment
[0298]The present invention provides for both prophylactic and therapeutic methods of treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant NgR expression or activity.
[0299]Diseases and disorders that are characterized by increased (relative to a subject not suffering from the disease or disorder) levels or biological activity may be treated with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antagonize activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may be utilized include, but are not limited to, (i) a NgR polypeptide, or analogs, derivatives, fragments or homologs thereof; (ii) antibodies to a NgR peptide; (iii) nucleic acids encoding a NgR peptide; (iv) administration of antisense nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion within the coding sequences of coding sequences to a NgR peptide) are utilized to "knockout" endogenous function of a NgR peptide by homologous recombination (see, e.g., Capecchi (1989) Science 244, 1288-1292); or (v) modulators (i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the invention or antibodies specific to a peptide of the invention) that alter the interaction between a NgR peptide and its binding partner.
[0300]Diseases and disorders that are characterized by decreased (relative to a subject not suffering from the disease or disorder) levels or biological activity may be treated with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or analogs, derivatives, fragments or homologs thereof or an agonist that increases bioavailability.
[0301]Increased or decreased levels can be readily detected by quantifying peptide and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art include, but are not limited to, immunoassays (e.g., by Western blot analysis, immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.).
[0302]In one aspect, the invention provides a method for preventing, in a subject, a disease or condition associated with an aberrant NgR expression or activity, by administering to the subject an agent that modulates NgR expression or at least one NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant NgR expression or activity can be identified by, for example, any or a combination of diagnostic or prognostic assays as described herein. Administration of a prophylactic agent can occur prior to the manifestation of symptoms characteristic of the NgR aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or NgR antagonist agent can be used for treating the subject. The appropriate agent can be determined based on screening assays described herein.
[0303]Another aspect of the invention pertains to methods of modulating NgR expression or activity for therapeutic purposes. The modulatory method of the invention involves contacting a cell with an agent that modulates one or more of the activities of NgR protein activity associated with the cell. An agent that modulates NgR protein activity can be an agent as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one or more NgR protein activity. Examples of such stimulatory agents include active NgR protein and a nucleic acid molecule encoding NgR that has been introduced into the cell. In another embodiment, the agent inhibits one or more NgR protein activity. Examples of such inhibitory agents include antisense NgR nucleic acid molecules and anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present invention provides methods of treating an individual afflicted with a disease or disorder characterized by aberrant expression or activity of a NgR protein or nucleic acid molecule. In one embodiment, the method involves administering an agent (e.g., an agent identified by a screening assay described herein), or combination of agents that modulates (e.g., upregulates or down-regulates) NgR expression or activity. In another embodiment, the method involves administering a NgR protein or nucleic acid molecule as therapy to compensate for reduced or aberrant NgR expression or activity.
Gene Therapy
[0304]Mutations in the NgR gene that result in loss of normal function of the NgR gene product underlie NgR human disease states. The invention comprehends gene therapy to restore NgR activity to treat those disease states. Delivery of a functional NgR gene to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, Anderson (1998) Nature, supplement to 392(6679):25-20. For additional reviews of gene therapy technology see Friedmann (1989) Science 244, 1275-1281; Verma (1990) Sci. Am. 68-84; and Miller (1992) Nature 357, 455-460. Alternatively, it is contemplated that in other human disease states, preventing the expression of, or inhibiting the activity of, NgR will be useful in treating disease states. It is contemplated that antisense therapy or gene therapy could be applied to negatively regulate the expression of NgR.
[0305]The present invention provides for both prophylactic and therapeutic methods of treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant NgR expression or activity.
[0306]Diseases and disorders that are characterized by increased (relative to a subject not suffering from the disease or disorder) levels or biological activity may be treated with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antagonize activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may be utilized include, but are not limited to, (i) a NgR polypeptide, or analogs, derivatives, fragments or homologs thereof; (ii) antibodies to a NgR peptide; (iii) nucleic acids encoding a NgR peptide; (iv) administration of antisense nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion within the coding sequences of coding sequences to a NgR peptide) are utilized to "knockout" endogenous function of a NgR peptide by homologous recombination (see, e.g., Capecchi (1989), above); or (v) modulators (i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the invention or antibodies specific to a peptide of the invention) that alter the interaction between a NgR peptide and its binding partner.
[0307]Diseases and disorders that are characterized by decreased (relative to a subject not suffering from the disease or disorder) levels or biological activity may be treated with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or analogs, derivatives, fragments or homologs thereof, or an agonist that increases bioavailability.
[0308]Increased or decreased levels can be readily detected by quantifying peptide and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed peptides (or mRNAs of a NgR peptide) Methods that are well-known within the art include, but are not limited to, immunoassays (e.g., by Western blot analysis, immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.).
[0309]In one aspect, the invention provides a method for preventing, in a subject, a disease or condition associated with an aberrant NgR expression or activity, by administering to the subject an agent that modulates NgR expression or at least one NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant NgR expression or activity can be identified by, for example, any or a combination of diagnostic or prognostic assays as described herein. Administration of a prophylactic agent can occur prior to the manifestation of symptoms characteristic of the NgR aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or NgR antagonist agent can be used for treating the subject. The appropriate agent can be determined based on screening assays described herein.
[0310]Another aspect of the invention pertains to methods of modulating NgR expression or activity for therapeutic purposes. The modulatory method of the invention involves contacting a cell with an agent that modulates one or more of the activities of NgR protein activity associated with the cell. An agent that modulates NgR protein activity can be an agent as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one or more NgR protein activity. Examples of such stimulatory agents include active NgR protein and a nucleic acid molecule encoding NgR that has been introduced into the cell. In another embodiment, the agent inhibits one or more NgR protein activity. Examples of such inhibitory agents include antisense NgR nucleic acid molecules and anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present invention provides methods of treating an individual afflicted with a disease or disorder characterized by aberrant expression or activity of a NgR protein or nucleic acid molecule. In one embodiment, the method involves administering an agent (e.g., an agent identified by a screening assay described herein), or combination of agents that modulates (e.g., upregulates or down-regulates) NgR expression or activity. In another embodiment, the method involves administering a NgR protein or nucleic acid molecule as therapy to compensate for reduced or aberrant NgR expression or activity.
[0311]The present invention is not to be limited in scope by the specific embodiments described herein. Indeed, various modifications of the invention in addition to those described herein will become apparent to those skilled in the art from the foregoing description and accompanying figure. Such modifications are intended to fall within the scope of the appended claims.
[0312]The following Table 5 contains the sequences of exemplary polynucleotides and polypeptides of the invention.
TABLE-US-00008 TABLE 5 The following DNA sequence NgR2 <SEQ ID NO. 1> was identified in humans: ATGCTGCCCGGGCTCAGGCGCCTGCTGCAAGCTCCCGCCTCGGCCTGCCT CCTGCTGATGCTCCTGGCCCTGCCCCTGGCGGCCCCCAGCTGCCCCATGC TCTGCACCTGCTACTCATCCCCGCCCACCGTGAGCTGCCAGGCCAACAAC TTCTCCTCTGTGCCGCTGTCCCTGCCACCCAGCACTCAGCGACTCTTCCT GCAGAACAACCTCATCCGCACGCTGCGGCCAGGCACCTTTGGGTCCAACC TGCTCACCCTGTGGCTCTTCTCCAACAACCTCTCCACCATCTACCCGGGC ACTTTCCGCCACTTGCAAGCCCTGGAGGAGCTGGACCTCGGTGACAACCG GCACCTGCGCTCGCTGGAGCCCGACACCTTCCAGGGCCTGGAGCGGCTGC AGTCGCTGCATTTGTACCGCTGCCAGCTCAGCAGCCTGCCCGGCAACATC TTCCGAGGCCTGGTCAGCCTGCAGTACCTCTACCTCCAGGAGAACAGCCT GCTCCACCTACAGGATGACTTGTTCGCGGACCTGGCCAACCTGAGCCACC TCTTCCTCCACGGGAACCGCCTGCGGCTGCTCACAGAGCACGTGTTTCGC GGCCTGGGCAGCCTGGACCGGCTGCTGCTGCACGGGAACCGGCTGCAGGG CGTGCACCGCGCGGCCTTCCGCGGCCTCAGCCGCCTCACCATCCTCTACC TGTTCAACAACAGCCTGGCCTCGCTGCCCGGCGAGGCGCTCGCCGACCTG CCCTCGCTCGAGTTCCTGCGGCTCAACGCTAACCCCTGGGCGTGCGACTG CCGCGCGCGGCCGCTCTGGGCCTGGTTCCAGCGCGCGCGCGTGTCCAGCT CCGACGTGACCTGCGCCACCCCCCCGGAGCGCCAGGGCCGAGACCTGCGC GCGCTCCGCGAGGCCGACTTCCAGGCGTGTCCGCCCGCGGCACCCACGCG GCCGGGCAGCCGCGCCCGCGGCAACAGCTCCTCCAACCACCTGTACGGGG TGGCCGAGGCCGGGGCGCCCCCAGCCGATCCCTCCACCCTCTACCGAGAT CTGCCTGCCGAAGACTCGCGGGGGCGCCAGGGCGGGGACGCGCCTACTGA GGACGACTACTGGGGGGGCTACGGGGGTGAGGACCAGCGAGGGGAGCAGA TGTGCCCCGGCGCTGCCTGCCAGGCGCCCCCGGACTCCCGAGGCCCTGCG CTCTCGGCCGGGCTCCCCAGCCCTCTGCTTTGCCTCCTGCTCCTGGTGCC CCACCACCTC The following amino acid sequence <SEQ ID NO. 2> is the predicted amino acid sequence derived from the DNA sequence of SEQ ID NO. 1: M L P G L R R L L Q A P A S A C L L L M L L A L P L A A P S C P M L C T C Y S S P P T V S C Q A N N F S S V P L S L P P S T Q R L F L Q N N L I R T L R P G T F G S N L L T L W L F S N N L S T I Y P G T F R H L Q A L E E L D L G D N R H L R S L E P D T F Q G L E R L Q S L H L Y R C Q L S S L P G N I F R G L V S L Q Y L Y L Q E N S L L H L Q D D L F A D L A N L S H L F L H G N R L R L L T E H V F R G L G S L D R L L L H G N R L Q G V H R A A F R G L S R L T I L Y L F N N S L A S L P G E A L A D L P S L E F L R L N A N P W A C D C R A R P L W A W F Q R A R V S S S D V T C A T P P E R Q G R D L R A L R E A D F Q A C P P A A P T R P G S R A R G N S S S N H L Y G V A E A G A P P A D P S T L Y R D L P A E D S R G R Q G G D A P T E D D Y W G G Y G G E D Q R G E Q M C P G A A C Q A P P D S R G P A L S A G L P S P L L C L L L L V P H H L The following DNA sequence NgR3 <SEQ ID NO. 3> was identified in mouse: ATGTCTTGGCAGTCTGGAACCACAGTGACACAATCTCCCGTGCAGGCTGC TCAGGTCTCAGGGTGCTGTGTGGAATTGCTGCTGTTGCTGCTCGCTGGAG AGCTACCTCTGGGTGGTGGTTGTCCTCGAGACTGTGTGTGCTACCCTGCG CCCATGACTGTCAGCTGCCAGGCACACAACTTTGCTGCCATCCCGGAGGG CATCCCAGAGGACAGTGAGCGCATCTTCCTGCAGAACAATCGCATCACCT TCCTCCAGCAGGGCCACTTCAGCCCCGCCATGGTCACCCTCTGGATCTAC TCCAACAACATCACTTTCATTGCTCCCAACACCTTCGAGGGCTTTGTGCA TCTGGAGGAGCTAGACCTTGGAGACAACCGACAGCTGCGAACGCTGGCAC CCGAGACCTTCCAAGGCCTGGTGAAGCTTCACGCCCTGTACCTCTATAAG TGTGGACTGAGCGCCCTGCCCGCAGGCATCTTTGGTGGCCTGCACAGCCT GCAGTATCTCTACTTGCAGGACAACCATATGGAGTACCTCCAAGATGACA TCTTTGTGGACCTGGTCAATCTCAGTCACTTGTTTCTCCATGGTAACAAG CTATGGAGCCTGGGCCAAGGCATCTTCCGGGGCCTGGTGAACCTGGACCG GTTGCTGCTGCATGAGAACCAGCTACAGTGGGTTCACCACAAGGCTTTCC ATGACCTCCACAGGCTAACCACCCTCTTTCTCTTCAACAACAGCCTCACT GAGCTGCAGGGTGACTGTCTGGCCCCCCTGGTGGCCTTGGAGTTCCTTCG CCTCAATGGGAATGCTTGGGACTGTGGCTGCCGGGCACGTTCCCTGTGGG AATGGCTGCGAAGGTTCCGTGGCTCTAGCTCTGCTGTCCCCTGCGCGACC CCCGAGCTGCGGCAAGGCCAGGATCTGAAGCTGCTGAGGGTGGAGGACTT CCGGAACTGCACAGGACCAGTGTCTCCTCACCAGATCAAGTCTCACACGC TTACCACCTCTGACAGGGCTGCCCGCAAGGAGCACCATCCGTCCCATGGG GCCTCCAGGGACAAAGGCCACCCACATGGCCATCCGCCTGGCTCCAGGTC AGGTTACAAGAAGGCAGGCAAGAACTGCACCAGCCACAGGAACCGGAACC AGATCTCTAAGGTGAGCTCTGGGAAAGAGCTTACCGAACTGCAGGACTAT GCCCCCGACTATCAGCACAAGTTCAGCTTTGACATCATGCCCAGCGCACG ACCGAAGAGGAAGGGCAAGTGTGCTCGCAGGACCCCCATCCGTGCCCCCA GTGGGGTGCAGCAGGCATCCTCAGGCACGGCCCTTGGGGCCCCACTCCTG GCCTGGATACTGGGGCTGGCAGTCACTCTCCGGC The following protein sequence <SEQ ID NO. 4> is deduced protein of SEQ ID NO:3: M S W Q S G T T V T Q S P V Q A A Q V S G C C V E L L L L L L A G E L P L G G G C P R D C V C Y P A P M T V S C Q A H N F A A I P E G I P E D S E R I F L Q N N R I T F L Q Q G H F S P A M V T L W I Y S N N I T F I A P N T F E G F V H L E E L D L G D N R Q L R T L A P E T F Q G L V K L H A L Y L Y K C G L S A L P A G I F G G L H S L Q Y L Y L Q D N H I E Y L Q D D I F V D L V N L S H L F L H G N K L W S L G Q G I F R G L V N L D R L L L H E N Q L Q W V H H K A F H D L H R L T T L F L F N N S L T E L Q G D C L A P L V A L E F L R L N G N A W D C G C R A R S L W E W L R R F R G S S S A V P C A T P E L R Q G Q D L K L L R V E D F R N C T G P V S P H Q I K S H T L T T S D R A A R K E H H P S H G A S R D K G H P H G H P P G S R S G Y K K A G K N C T S H R N R N Q I S K V S S G K E L T E L Q D Y A P D Y Q H K F S F D I M P T A R P K R K G K C A R R T P I R A P S G V Q Q A S S G T A L G A P L L A W I L G L A V T L R The following protein sequence <SEQ ID NO. 5> is NgR1 from humans: M K R A S A G G S R L L A W V L W L Q A W Q V A A P C P G A C C Y N E P K V T T S C P Q Q G L Q A V P V G I P A A S Q R I F L H G N R I S H V P A A S F R A C R N L T I L W L H S N V L A R I D A A A F T G L A L L E Q L D L S D N A Q L R S V D P A T F H G L G R L H T L H L D R C G L Q E L G P G L F R G L A A L Q Y L Y L Q D N A L Q A L P D D T F R D L G N L T H L F L H G N R I S S V P E R A F R G L H S L D R L L L H Q N R V A H V H P H A F R D L G R L M T L Y L F A N N L S A L P T E A L A P L R A L Q Y L R L N D N P W V C D C R A R P L W A W L Q K F R G S S S E V P C S L P Q R L A G R D L K R L A A N D L Q G C A V A T G P Y H P I W T G R A T D E E P L G L P K C C Q P D A A D K A S V L E P G R P A S A G N A L K G R V P P G D S P P G N G S G P R H I N D S P F G T L P G S A E P P L T A V R P E G S E P P G F P T S G P R R R P G C S R K N R T R S H C R L G Q A G S G G G G T G D S E G S G A L P S L T C S L T P L G L A L V L W T V L G P C The following amino acid sequence <SEQ ID NO:6> is a Consensus Sequence of NgR based on homology with NgR1 C P X X C X C Y X X P X X T X S C X X X X X X X X P X X X P X X X X R X F L X X N X I X X X X X X X F X X X X X X X X L W X X S N X X X X I X X X X F X X X X X LE X L D L X D N X X L R X X X P X T F X G L X X L X L X L X X C X L X X L X X X X F X G L X X L Q Y L Y L Q X N X X X X L X D D X F X D L X N L X H L F L H G N X X X X X X X X X F R G L X X L D R L L L H X N X X X X V H X X A F X X L X R L X X L X L F X N X L X X L X X X X L A X L X X L X X L R L N X N X W X C X C R A R X L W X W X X X X R X S S S X V X C X X P X X X X G X D L X X L X X X D X X X C X X X X X P X X P X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X G X X X X X X X X X X X X P P X X X S X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X R X X X X X X X X X X X X X X X X X X X X X X X X X X X L X X X X X X X X X X L The following protein sequence <SEQ ID NO:7> is the 66 amino acid active domain of Nogo: R I Y K G V I Q A I Q K S D E G H P F R A Y L E S E V A I S E E L V Q K Y S N S A L G H V N C T I K E L R R L F L V D D L V D S L K The following protein sequence <SEQ ID NO:8> is the amino acid sequence of the mature NgR2: C P M L C T C Y S S P P T V S C Q A N N F S S V P L S L P P S T Q R L F L Q N N L I R T L R P G T F G S N L L T I W L F S N N L S T I Y P G T F R H L Q A L E E L D L G D N R H L R S L E P D T F Q G L E R L Q S L H L Y R C Q L S S L P G N I F R G L V S L Q Y L Y L Q E N S L L H L Q D D L F A D L A N L S H L F L H G N R L R L L T E H V F R G L G S L D R L L L H G N R L Q G V H R A A F R G L S R L T I L Y L F N N S L A S L P G E A L A D L P S L E F L R L N A N P W A C D C R A R P L W A W F Q R A R V S S S D V T C A T P P E R Q G R D L R A L R E A D F Q A C P P A A P T R P G S R A R G N S S S N H L Y G V A E A G A P P A D P S T L Y R D L P A E D S R G R Q G G D A P T E D D Y W G G Y G G E D Q R G E Q M C P G A A C Q A P P D S R G P A L S A G L P S P L L C L L L L V P H H L The following protein sequence <SEQ ID NO:9> is the amino acid sequence of the mature NgR3: C P R D C V C Y P A P M T V S C Q A H N F A A I P E G I P E D S E R I F L Q N N R I T F L Q Q G H F S P A M V T L W T Y S N N I T F I A P N T F E G F V H L E E L D L G D N R Q L R T L A P E T F Q G L V K L H A L Y L Y K C G L S A L P A G I F G G L H S L Q Y L Y L Q D N H I E Y L Q D D I F V D L V N L S H L F L H G N K L W S L G Q G I F R G L V N L D R L L L H E N Q L Q W V H H K A F H D L H R L T T L F L F N N S L T E L Q G D C L A P L V A L E F L R L N G N A W D C G C R A R S L W E W L R R F R G S S S A V P C A T P E L R Q G Q D L K L L R V E D E R N C T G P V S P H Q I K S H T L T T S D R A A R K E H H P S H G A S R D K G H P H G H P P G S R S G Y K K A G K N C T S H R N R N Q I S K V S S G K E L T E L Q D Y A P D Y Q H K F S F D I M P T A R P K R K G K C A R R T P I R A P S G V Q Q A S S G T A L G A P L L A W I L G L A V T L R The following amino acid sequence <SEQ ID NO: 10> is a conserved cysteine motif (Cysteine domain 1) of the NgR and homologs based on the Consensus Sequence: C P X X C X C Y X X P X X T X S C The following amino acid sequence <SEQ ID NO: 11> is a conserved cysteine motif (Cysteine domain 2) of the NgR and homologs based on the Consensus Sequence: N X W X C X C R A R X L W X W X X X X R X S S S X V X C X X P X X X X G X D L X X L X X X D X X X C The following amino acid sequence <SEQ ID NO: 12> is a conserved Leucine-rich domain of the NgR and homologs based on the Consensus Sequence: R X F L X X N X I X X X X X X X F X X X X X X X X L W X X S N X X X X I X X X X F X X X X X L E X L D L X D N X X L R X X X P X T F X G L X X L X L X L X X C X L X X L X X X X F X G L X X L Q Y L Y L Q X N X X X X L X D D X F X D L X N L X H L F L H G N X X X X X X X X X F R G L X X L D R L L L H X N X X X X V H X X A F X X L X R L X X L X L F X N X L X X L X X X X L A X L X X L X X L R L
[0313]Unless otherwise indicated, X is any amino acid. For example, X where indicated may be no amino acid. Additional features of the invention will be apparent from the following Examples. Examples 1-5 are actual, while the remaining Examples are prophetic.
[0314]As shown by the following Examples, a gene encoding novel NgRs have been identified by computational analysis of DNA sequence data. The proteins encoded by NgR2 and NgR3 have a putative signal sequence, eight leucine-rich repeat domains in a conserved leucine-rich region (SEQ ID NO:12), a conserved cysteine-rich region (SEQ ID NO:10) N-terminal to the leucine-rich region, a second cysteine-rich domain (SEQ ID NO:11) C-terminal to the leucine-rich region, and a putative glycophosphatidylinositol-linkage (GPI-linkage) site. NgR2 and NgR3 differ from the previously identified NgR sequence. The NgR homologs, when compared to known NgRs, show a consensus sequence (SEQ ID NOs:6). The putative mature NgR2 and NgR3 are shown in Table 5 as SEQ ID NOs: 8 and 9, respectively.
Example 1
Tblastn Query of the HTG Database
[0315]The protein sequence for the human NgR (NgR1) (SEQ ID NO:5) was used to query the high throughput genomic (HTG) database the use of which is familiar to those skilled in the art. The HTG database is a part of GenBank, a comprehensive NIH genetic sequence database, which includes an annotated collection of all publicly available DNA sequences (Nucleic Acids Res. (2000) 28, 15-8). The HTG database includes sequences obtained from genomic DNA. Within genomic DNA, genes are typically encoded by multiple segments of DNA called exons. Thus when one aligns a cDNA sequence (or a protein sequence encoded by a cDNA sequence) to a genomic sequence, the sequence will be broken up into segments depending on the number of exons in the gene.
[0316]The BLAST algorithm, which stands for Basic Local Alignment Search Tool is suitable for determining sequence similarity (Altschul et al., (1990) J. Mol. Biol. 215, 403-410, which is incorporated herein by reference in its entirety). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). The basic BLAST algorithm involves first identifying high scoring sequence pair (HSPs) by identifying short words of length W in the query sequence that either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating searches to find HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Extension for the word hits in each direction are halted when: 1) the cumulative alignment score falls off by the quantity X from its maximum achieved value; 2) the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or 3) the end of either sequence is reached. The Blast algorithm parameters W, T and X determine the sensitivity and speed of the alignment. The Blast program uses as defaults a word length (W) of 11, the BLOSUM62 scoring matrix (see Henikoff et al., (1992) Proc. Natl. Acad. Sci. USA 89, 10915-10919, which is incorporated herein by reference in its entirety) alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands.
[0317]The BLAST algorithm (Karlin et al., (1993) Proc. Natl. Acad. Sci. USA 90, 5873-5787, which is incorporated herein by reference) and Gapped BLAST perform a statistical analysis of the similarity between two sequences. One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a NgR gene or cDNA if the smallest sum probability in comparison of the test nucleic acid to a NgR nucleic acid is less than about 1, preferably less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
[0318]To query the HTG database with the NgR protein sequence, we used a variation of the BLAST algorithm known as the tblastn program, which compares a protein query sequence against a nucleotide sequence database dynamically translated in all reading frames (J. Mol. Biol. (1990) 215, 403-410: Nucleic Acids Res. (1997) 25, 3389-3402). The results of the tblastn search indicated the presence of genes in the database with a significant identity to the NgR. In addition to finding hits to genomic clones which contain the human and mouse NgR genes, we found hits to clones where the identity was not as high, but still very significant. Three human clones were found (Accession numbers: AC068514, AC016869, AC013606) with an e-value of 4e-43 and one mouse clone was found (Accession No. AC021768) with an e-value of 1e-78. The three human clones all appeared to encode the same gene, so further analysis was confined to AC013606.
Example 2
Prediction of the Human NgR2 Protein Sequence (AC013606)
[0319]The human NgR protein sequence aligned with two regions of translated sequence from nucleotide sequence AC013606, indicating that the new gene was encoded by at least two exons. In order to define the complete gene, we used the computer program GENSCAN® (J. Mol. Biol. (1997) 268, 78-94) which can identify complete exon/intron structures of genes in genomic DNA. The gene prediction by GENESCAN® contained seven exons. By comparing these predicted exons to the NgR, it was concluded that the new human gene contains two of these exons and a part of another (containing the initiating methionine). The predicted cDNA (mRNA) encoded by these three exons was assembled from AC013606 (HTG11; deposited March 2000; length=143899; GenBank release 118.0; SEQ ID NO:15) by combining nucleotides from the three exons whose coordinates are: 123292-123322 (exon 1); 130035-130516 (exon 2); and 138589-139335 (exon 3). The sequence for this cDNA sequence is SEQ ID NO:1 (nucleotide sequence of human NgR2; AC013606). The translation of this cDNA provides the protein sequence of human NgR2 (SEQ ID NO:2).
[0320]We used the protein sequence of human NgR2 as a query sequence against the human EST database. A number of hits of high significance were found indicating that the NgR2 mRNA is expressed in a number of tissues including fetal brain. Furthermore, two of these ESTs provided support for the exon structure that we deduced. One EST (Accession No: GB_EST19:AI346757) contains 565 nucleotides corresponding to amino acids 84-271 of the human NgR2 (SEQ ID No:4). This spans the second intron located between amino acids 171 and 172, and provides positive evidence for the splicing of exons 2 and 3 at the mRNA level. Another EST (GB_EST26:AI929019) contains 545 nucleotides, part of which corresponds to amino acids 1-75 of the human NgR2 (SEQ ID NO:2). This spans the first intron located between amino acids 10 and 11, and provides positive evidence for the splicing of exons 1 and 2 at the mRNA level.
Example 3
Prediction of the Mouse NgR3 Protein Sequence (AC021768)
[0321]The human NgR protein sequence aligned with only one region of translated sequence from nucleotide sequence AC021768, indicating that most of the new mouse gene was encoded by one large exon. However, upon inspection, the protein encoded by this exon was missing an initiating methionine. In order to define the complete gene, we used the computer program GENSCAN as described above. The gene prediction by GENSCAN contained two exons; the large one found by visual inspection and a short one at the 5' end which provided an initiating methionine. The predicted cDNA (mRNA) encoded by these two exons was assembled from AC021768 (HTG14; deposited March 2000; length=215980; GenBank release 118.0; SEQ ID NO: 16) by combining nucleotides from the two exons whose coordinates are: the complement of 164265-164325 (exon 1); and the complement of 155671-156992 (exon 2). The sequence for this cDNA sequence is SEQ ID NO:3 (nucleotide sequence of mouse NgR3; AC021768). The translation of this cDNA provides the protein sequence of mouse NgR3 (SEQ ID NO:4).
[0322]We used the protein sequence of mouse NgR3 as a query sequence against the mouse EST database. One hit of high significance was found indicating that the NgR2 mRNA is expressed in the heart. This EST (GB_EST20:AI428334) contains 463 nucleotides, part of which correspond to amino acids 45-193 of mouse NgR3 (SEQ ID NO:4).
Example 4
Similarity Between the NgRs
[0323]An alignment between NgR1 and the two new receptors is shown in FIG. 1A-1B. The similarities between these proteins include:
[0324](1) The SignalP program, which locates the signal sequence cleavage position, predicts a cleavage before the first conserved cysteine in all the proteins. Thus the mature protein in all cases will have a cysteine at the N-terminus.
[0325](2) All proteins contain eight Leucine Rich Repeats (LRR). LRRs are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each LRR is composed of a beta-alpha unit.
[0326](3) All three proteins contain a leucine rich repeat N-terminal domain (LRRNT), in which four cysteines are conserved. LRRs are often flanked by cysteine rich domains at both their N and C termini.
[0327](4) All three proteins contain a LRR C-terminal domain (LRRCT). The LRRCTs of the three NgR proteins can be distinguished from those of other LRR containing proteins, by the pattern of typtophans and cysteines which are completely conserved in this domain.
[0328](5) All three proteins contain a conserved cysteine in the fourth LRR domain.
[0329](6) All three proteins contain a conserved potential glycosylation site in the eighth LRR domain.
[0330](7) NgR2 and NgR3 have a hydrophobic C-terminus, as does NGR1, an indication that they probably also undergo a modification similar to NgR1, where a GPI moiety is covalently linked to a C-terminal amino acid. This allows the protein to remain tethered to the cell.
Example 5
Preparation of Nogo Proteins
[0331]A Nogo binding assay was developed which utilizes a method widely used in examining semaphorin and ephrin axonal guidance function (Flanagan & Vanderhaeghen (1998) Annu. Rev. Neurosci. 21, 3 09-345, Takahashi et al., (1999) Cell 99, 59-69). It involves fusing a secreted placental alkaline phosphatase (AP) moiety to the ligand in question to provide a biologically active receptor binding agent which can be detected with an extremely sensitive calorimetric assay. For Nogo, an expression vector is created encoding a signal peptide, a His6 tag for purification, AP, and the 66 amino acid active domain of Nogo. The fusion protein can be purified from the conditioned medium of transfected cells in milligram amounts. This protein is biologically active as a growth cone collapsing agent with an EC50 of 1 nM.
[0332]Alternatively, a glutathione-S-transferase Nogo (GST-Nogo) fusion protein may be prepared. For GST-Nogo, an expression vector (e.g., a pGEX vector) is created encoding a signal peptide, GST, and the 66 amino acid active domain of Nogo. GST-Nogo may be purified from the culture medium and used as a GST fusion protein, or GST may be cleaved from the Nogo portion of the fusion protein with an enzyme that recognizes the specific amino acid cleavage sit engineered between the GST portion and the Nogo portion of the fusion protein. Such sites are part of the commercially available GST vectors. The specific cleavage sites and enzymes may be used in accordance with the Manufacturers specifications.
[0333]It has been found that AP-Nogo is actually slightly more potent than GST-Nogo, perhaps because the protein is synthesized in a eukaryotic rather than a prokaryotic cell.
[0334]Binding of Nogo to immobilized NgR homologs may be performed in an ELISA-type assay in which AP-Nogo is allowed to react with an immobilized receptor homolog. Specificity of binding may be demonstrated in a competitive binding assay using increasing amounts of GST-Nogo in the type of assay to show a decreasing amount of binding of AP-Nogo (as judged in the calorimetric assay).
Example 6
Transfected COS Cell Binding Assays
[0335]The homologs of the present invention may be used in transfection studies in COS cells to demonstrate binding of Nogo. Specifically, nucleotide sequences encoding NgR2 and NgR3 may be transfected into COS cells using a suitable vector. Non-transfected COS-7 cells do not bind AP-Nogo. However, transfection of COS cells with nucleic acid sequences encoding NgRs will make them capable of binding Nogo. AP alone does not bind with any stable affinity to these transfected cells, indicating that any affinity of Nogo for NgR2 or NgR3 would be due to the 66 amino acids derived from Nogo. Furthermore, specific affinity of Nogo for the NgR2 or NgR3 proteins may be tested in displacement of AP-Nogo assays using GST-Nogo. NgR2 and/or NgR3 may also bind homologs of Nogo, which may also be tested using this assay.
Example 7
Expression of NgR in Human Cell Lines using Northern Blot and a Random-Primed Probe
[0336]A Northern blot is purchased from a commercial source, or RNA samples from cells of interest are run on an agarose gel and blotted to a membrane using any of the well known techniques for Northern blotting. The blot is probed with a fragment of NgR2 (SEQ ID NO:1) or NgR3 (SEQ ID NO:3). The probe is prepared from 50 ng of cDNA labeled by a random-primed method (Feinberg and Vogelstein (1983) Anal. Biochem. 132, 6-13). Hybridization is carried out at 68° C. for 1 hour in ExpressHyb® solution (Clontech, Cat. No. 8015-1) followed by washing with 2×SSC/0.05% SDS at room temperature and two washes with 0.1×SSC/0.1% SDS at 50° C. Expression of NgR2 and/or NgR3 can be assessed by the presence of an appropriately sized band on the blot.
Example 8
Cloning of cDNA Corresponding to NgRs
[0337]To obtain the full-length clone corresponding to NgR2 from a cDNA library, the following method may be used. A cDNA library is generated using standard methods from a tissue known to contain NgR2. Such a tissue was identified in Example 2. 1×106 plaque forming units from the cDNA library may be screened in duplicate on OPTITRAN® filters. The filters are hybridized with 32P-labeled oligonucleotides that are generated from the ESTs corresponding to portions of NgR2. The hybridization reaction may consist of 400 mls plaque screen buffer (50 mM Tris pH 7.5, 1M NaCl, 0.1% Sodium pyrophosphate, 0.2% Polyvinylpryolidine and 0.2% Ficoll) containing 10% Dextran sulfate and 100 μg/ml tRNA and 80 μmol each 32P-labeled oligonucleotide at 65° C. overnight. The filters are washed twice with 2×SSC/1% SDS and twice with 1×SSC/1% SDS and exposed to film. Duplicate positives are purified. DNA from each of these clones is analyzed by restriction enzyme digest followed by agarose gel electrophoresis and Southern blotting. The filters are hybridized to the 32P-labeled oligonucleotides used for the original hybridization to confirm that inserts hybridize to the probe. The insert is then sequenced to confirm that it represents the cDNA for NgR2. Similar methods may be used to generate a full-length clone corresponding to NgR3.
[0338]Alternatively, a full-length clone of NgR2 or NgR3 can be obtained by a person of ordinary skill in the art employing conventional PCR techniques.
Example 9
Hybridization Analysis to Demonstrate NgR Expression in the Brain
[0339]The expression of NgR in mammals, such as the rat, may be investigated by in situ hybridization histochemistry. To investigate expression in the brain, for example, coronal and sagittal rat brain cryosections (20 μm thick) are prepared using a Reichert-Jung cryostat. Individual sections are thaw-mounted onto silanized, nuclease-free slides (CEL Associates, Inc., Houston, Tex.), and stored at -80° C. Sections are processed starting with post-fixation in cold 4% paraformaldehyde, rinsed in cold phosphate-buffered saline (PBS), acetylated using acetic anhydride in triethanolamine buffer, and dehydrated through a series of alcohol washes in 70%, 95%, and 100% alcohol at room temperature. Subsequently, sections are delipidated in chloroform, followed by rehydration through successive exposure to 100% and 95% alcohol at room temperature. Microscope slides containing processed cryosections are allowed to air dry prior to hybridization. Other tissues may be assayed in a similar fashion.
[0340]A NgR-specific probe may be generated using PCR. Following PCR amplification, the fragment is digested with restriction enzymes and cloned into pBluescript II cleaved with the same enzymes. For production of a probe specific for the sense strand of NgR, a cloned NgR fragment cloned in pBluescript II may be linearized with a suitable restriction enzyme, which provides a substrate for labeled run-off transcripts (i.e., cRNA riboprobes) using the vector-borne T7 promoter and commercially available T7 RNA polymerase. A probe specific for the antisense strand of NgR may also be readily prepared using the NgR clone in pBluescript II by cleaving the recombinant plasmid with a suitable restriction enzyme to generate a linearized substrate for the production of labeled run-off cRNA transcripts using the T3 promoter and cognate polymerase. The riboprobes may be labeled with [35S]-UTP to yield a specific activity of about 0.40×106 cpm/pmol for antisense riboprobes and about 0.65×106 cpm/pmol for sense-strand riboprobes. Each riboprobe may be subsequently denatured and added (2 pmol/ml) to hybridization buffer which contains 50% formamide, 10% dextran, 0.3 M NaCl, 10 mM Tris (pH 8.0), 1 mM EDTA, 1×Denhardt's Solution, and 10 mM dithiothreitol. Microscope slides containing sequential brain cryosections may be independently exposed to 45 μl of hybridization solution per slide and silanized cover slips may be placed over the sections being exposed to hybridization solution. Sections are incubated overnight (15-18 hours) at 52° C. to allow hybridization to occur. Equivalent series of cryosections are then exposed to sense or antisense NgR-specific cRNA riboprobes.
[0341]Following the hybridization period, coverslips are washed off the slides in 1×SSC, followed by RNase A treatment involving the exposure of slides to 20 μg/ml RNase A in a buffer containing 10 mM Tris-HCl (pH 7.4), 0.5 M EDTA, and 0.5 M NaCl for 45 minutes at 37° C. The cryosections are then subjected to three high-stringency washes in 0.1×SSC at 52° C. for 20 minutes each. Following the series of washes, cryosections are dehydrated by consecutive exposure to 70%, 95%, and 100% ammonium acetate in alcohol, followed by air drying and exposure to Kodak BioMax® MR-1 film. After 13 days of exposure, the film is developed, and any significant hybridization signal is detected. Based on these results, slides containing tissue that hybridized, as shown by film autoradiograms, are coated with Kodak NTB-2 nuclear track emulsion and the slides are stored in the dark for 32 days. The slides are then developed and counterstained with hematoxylin. Emulsion-coated sections are analyzed microscopically to determine the specificity of labeling. The signal is determined to be specific if autoradiographic grains (generated by antisense probe hybridization) are clearly associated with cresyl violate-stained cell bodies. Autoradiographic grains found between cell bodies indicate non-specific binding of the probe.
[0342]In some cases, such as using a probe to detect a NgR homolog in a heterologous species, in order to achieve optimal hybridization, it may be necessary to decrease the stringency conditions. Such conditions are well known to those of ordinary skill in the art and examples are provided above.
[0343]Expression of NgR in the brain provides an indication that modulators of NgR activity have utility for treating neurological disorders. Some other diseases for which modulators of NgR may have utility include depression, anxiety, bipolar disease, epilepsy, neuritis, neurasthenia, neuropathy, neuroses, and the like. Use of NgR modulators, including NgR ligands and anti-NgR antibodies, to treat individuals having such disease states is intended as an aspect of the invention.
Example 10
Northern Blot Analysis of NgR-RNA with a PCR-Generated Probe
[0344]Northern blot hybridizations may be performed to examine the expression of NgR mRNA A clone containing at least a portion of the sequence of SEQ ID NO:1 may be used as a probe. Vector-specific primers are used in PCR to generate a hybridization probe fragment for 32P-labeling. The PCR is performed as follows:
TABLE-US-00009 Mix: 1 μl NgR-containing plasmid 2 μl fwd primer (10-50 pM) 2 μl rev primer (10-50 pM) 10 μl 10 × PCR buffer (such as that provided with the enzyme, Amersham Pharmacia Biotech) 1 μl 10 mM dNTP (such as #1 969 064 from Boehringer Mannheim) 0.5 μl Taq polymerase (such as #27-0799-62, Amersham Pharmacia Biotech) 83.5 μl water
[0345]PCR is performed in a Thermocycler using the following program:
TABLE-US-00010 94° C. 5 min 94° C. 1 min 55° C. 1 min {close oversize brace} 30 cycles 72° C. 1 min 72° C. 10 min
[0346]The PCR product may be purified using QIAquick PCR Purification Kit (#28104) from Qiagen, and radioactively labeled with 32P-dCTP (#AA0005/250, Amersham Pharmacia Biotech)) may be done by random priming using "Ready-to-go DNA Labeling Beads" (#27-9240-01) from Amersham Pharmacia Biotech. Hybridization is carried out on Human Multiple Tissue Northern Blot from Clontech as described in manufacturer's protocol, or on a Northern Blot prepared by running RNA samples from cells of interest on an agarose gel and blotting to a membrane using any of the known Northern blotting protocols. After exposure overnight on Molecular Dynamics Phosphor Imager screen (#MD146-814) bands of an appropriate size are visualized.
Example 11
Recombinant Expression of NgR in Eukaryotic Host Cells
[0347]A. Expression of NgR in Mammalian Cells
[0348]To produce NgR protein, a NgR-encoding polynucleotide is expressed in a suitable host cell using a suitable expression vector and standard genetic engineering techniques. For example, a NgR-encoding sequence described in Table 4 is subcloned into the commercial expression vector pzeoSV2 (Invitrogen, San Diego, Calif.) and transfected into Chinese Hamster Ovary (CHO) cells using the transfection reagent FuGENE6® (Boehringer-Mannheim) and the transfection protocol provided in the product insert. Other eukaryotic cell lines, including human embryonic kidney (HEK 293) and COS cells, are suitable as well. Cells stably expressing NgR are selected by growth in the presence of 100 μg/ml zeocin (Stratagene, LaJolla, Calif.). As an alternative to FuGENE6®, the expression vector may carry the gene for dihydrofolate reductase (dhfr) and selection of clones with methotrexate (MTX) drug pressure allows for stable transformation of CHO cells. Optionally, NgR may be purified from the cells using standard chromatographic techniques. To facilitate purification, antisera is raised against one or more synthetic peptide sequences that correspond to portions of the NgR amino acid sequence, and the antisera is used to affinity purify Nogo-R. The NgR also may be expressed in-frame with a tag sequence (e.g. polyhistidine, hemaglutinin, FLAG) to facilitate purification. Moreover, it will be appreciated that many of the uses for NgR polypeptides, such as assays described below, do not require purification of NgR from the host cell:
[0349]B. Expression of NgR in CHO Cells
[0350]For expression of N in Chinese hamster ovary (CHO) cells, a plasmid bearing the relevant NgR coding sequence is prepared, using a vector which also bears the selectable marker dihydrofolate reductase (DHFR). The plasmid is transfected into CHO cells. Selection under MTX drug pressure allows for preparation of stable transformants of a NgR (NgR2 or NgR3) in an expression plasmid carrying a selectable marker such as DHFR.
[0351]C. Expression of NgR in 293 Cells
[0352]For expression of NgR in mammalian cells 293 (transformed human, primary embryonic kidney cells), a plasmid bearing the relevant NgR coding sequence is prepared, using vector pSecTag2A (Invitrogen). Vector pSecTag2A contains the murine IgK chain leader sequence for secretion, the c-myc epitope for detection of the recombinant protein with the anti-myc antibody, a C-terminal polyhistidine for purification with nickel chelate chromatography, and a Zeocin resistant gene for selection of stable transfectants. The forward primer for amplification of this NgR cDNA is determined by routine procedures and preferably contains a 5' extension of nucleotides to introduce the HindIII cloning site and nucleotides matching the NgR sequence. The reverse primer is also determined by routine procedures and preferably contains a 5' extension of nucleotides to introduce an XhoI restriction site for cloning and nucleotides corresponding to the reverse complement of the NgR sequence. The PCR conditions are 55° C. as the annealing temperature. The PCR product is gel purified and cloned into the HindIII-XhoI sites of the vector.
[0353]The DNA is purified using Qiagen chromatography columns and transfected into 293 cells using DOTAP® transfection media (Boehringer Mannheim, Indianapolis, Ind.). Transiently transfected cells are tested for expression after 24 hours of transfection, using western blots probed with anti-His and anti-NgR peptide antibodies. Permanently transfected cells are selected with Zeocin and propagated Production of the recombinant protein is detected from both cells and media by Western blots probed with anti-His, anti-Myc or anti-NgR peptide antibodies.
[0354]D. Transient Expression of Nogo-R in COS Cells
[0355]For expression of the NgR in COS7 cells, a polynucleotide molecule having a nucleotide sequence of SEQ ID NO:1, for example, can be cloned into vector p3-CI. This vector is a pUC18-derived plasmid that contains the HCMV (human cytomegalovirus) promoter-intron located upstream from the bGH (bovine growth hormone) polyadenylation sequence and a multiple cloning site.
[0356]The forward primer is determined by routine procedures and preferably contains a 5' extension which introduces an XbaI restriction site for cloning, followed by nucleotides which correspond to a nucleotide sequence of SEQ ID NO:1. The reverse primer is also determined by routine procedures and preferably contains 5'-extension of nucleotides which introduces a SalI cloning site followed by nucleotides which correspond to the reverse complement of a nucleotide sequence of SEQ ID NO:1
[0357]The PCR consists of an initial denaturation step of 5 min at 95° C., 30 cycles of 30 sec denaturation at 95° C., 30 sec annealing at 58° C. and 30 sec extension at 72° C., followed by 5 min extension at 72° C. The PCR product is gel purified and ligated into the XbaI and SalI sites of vector p3-CI. This construct is transformed into E. coli cells for amplification and DNA purification. The DNA is purified with Qiagen chromatography columns and transfected into COS 7 cells using Lipofectamine® reagent from BRL, following the manufacturer's protocols. Forty-eight and 72 hours after transfection, the media and the cells are tested for recombinant protein expression.
[0358]NgR expressed from a COS cell culture can be purified by concentrating the cell-growth media to about 10 mg of protein/ml, and purifying the protein by, for example, chromatography. Purified NgR is concentrated to 0.5 mg/ml in an Amicon concentrator fitted with a YM-10 membrane and stored at -80° C. NgR3 may also be expressed using this method and the nucleotide sequence of SEQ ID NO:3 or SEQ ID NO:13.
[0359]E. Expression of NgR in Insect Cells
[0360]For expression of NgR in a baculovirus system, a polynucleotide molecule having a nucleotide sequence of SEQ ID NO:1, 3 or 13 can be amplified by PCR. The forward primer is determined by routine procedures and preferably contains a 5' extension which adds the NdeI cloning site, followed by nucleotides which correspond to a nucleotide sequence of SEQ ID NO:1 (or SEQ. ID NO:3 or SEQ ID NO:13, respectively). The reverse primer is also determined by routine procedures and preferably contains a 5' extension which introduces the KpnI cloning site, followed by nucleotides which correspond to the reverse complement of a nucleotide sequence of SEQ ID NO:1 (or SEQ ID NO:3 or SEQ ID NO:13, respectively).
[0361]The PCR product is gel purified, digested with NdeI and KpnI, and cloned into the corresponding sites of vector pACHTL-A (Pharmingen, San Diego, Calif.). The pAcHTL expression vector contains the strong polyhedrin promoter of the Autographa californica nuclear polyhedrosis virus (AcMNPV), and a 6×His tag upstream from the multiple cloning site. A protein kinase site for phosphorylation and a thrombin site for excision of the recombinant protein precede the multiple cloning site is also present. Of course, many other baculovirus vectors could be used in place of pAcHTL-A, such as pAc373, pVL941 and pAcIM1. Other suitable vectors for the expression of NgR polypeptides can be used, provided that the vector construct includes appropriately located signals for transcription, translation, and trafficking, such as an in-frame AUG and a signal peptide, as required. Such vectors are described in Luckow et al., Virology 170: 31-39, among others.
[0362]The virus is grown and isolated using standard baculovirus expression methods, such as those described in Summers et al. (1987) A MANUAL OF METHODS FOR BACULOVIRUS VECTORS AND INSECT CELL CULTURE PROCEDURES, Texas Agricultural Experimental Station Bulletin No. 1555.
[0363]In a preferred embodiment, pAcHLT-A containing NgR gene is introduced into baculovirus using the "BaculoGold®" transfection kit (Pharmingen, San Diego, Calif.) using methods established by the manufacturer. Individual virus isolates are analyzed for protein production by radiolabeling infected cells with 35S-methionine at 24 hours post infection Infected cells are harvested at 48 hours post infection, and the labeled proteins are visualized by SDS-PAGE. Viruses exhibiting high expression levels can be isolated and used for scaled up expression.
[0364]For expression of a NgR polypeptide in a Sf9 cells, a polynucleotide molecule having the nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID NO:13) can be amplified by PCR using the primers and methods described above for baculovirus expression. The NgR cDNA is cloned into vector pAcHLT-A (Pharmingen) for expression in Sf9 insect. The insert is cloned into the NdeI and KpnI sites, after elimination of an internal NdeI site (using the same primers described above for expression in baculovirus). DNA is purified with Qiagen chromatography columns and expressed in Sf9 cells. Preliminary Western blot experiments from non-purified plaques are tested for the presence of the recombinant protein of the expected size which reacted with the NgR-specific antibody. These results are confirmed after further purification and expression optimization in HiG5 cells.
[0365]F. Expression of Soluble Forms of NgR2 and NgR3 as NgR-Ig Fusion Proteins.
[0366]To generate a NgR2-Ig fusion protein, standard methods may be used as described in the literature (e.g. Sanicola et al. (1997) Proc. Natl. Acad. Sci. USA. 94, 6238-6243). For example, a DNA fragment encoding NgR2 without the sequence encoding the hydrophobic C-terminus (GPI anchor signal) may be ligated to a DNA fragment encoding the Fc domain of IgG1 (which may be human IgG1), and the chimeric fragment may be cloned into an expression vector to generate a plasmid. The plasmid may then be transfected into Chinese hamster ovary cells to generate a stable cell line producing the fusion protein. The fusion protein is then purified from conditioned media using standard methods. For example, clarified conditioned media from the cell line may be loaded by gravity directly onto Protein A Sepharose. The column may then be washed with five column volumes each of PBS, PBS containing 0.5 M NaCl, and 25 mM sodium phosphate, 100 mM NaCl (pH 5.0). The bound protein may then be eluted with 25 mM NaH2PO4, 100 mM NaCl (pH 2.8) and immediately neutralized with 1/10 fraction volume of 0.5 M Na2HPO4 (pH 8.6).
[0367]Similar methods may be used to generate a NgR3-Ig fusion protein.
Example 12
Interaction Trap/Two-Hybrid System
[0368]In order to assay for NgR-interacting proteins, the interaction trap/two-hybrid library screening method can be used. This assay was first described in Fields et al. (1989) Nature 340, 245, which is incorporated herein by reference in its entirety. A protocol is published in CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 1999, John Wiley & Sons, NY and Ausubel, F. M. et al. 1992, SHORT PROTOCOLS IN MOLECULAR BIOLOGY, fourth edition, Greene and Wiley-interscience, NY, which is incorporated herein by reference in its entirety. Kits are available from Clontech, Palo Alto, Calif. (Matchmaker Two-Hybrid System 3).
[0369]A fusion of the nucleotide sequences encoding all or partial NgR and the yeast transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an appropriate plasmid (i.e., pGBKT7) using standard subcloning techniques. Similarly, a GAL4 active domain (AD) fusion library is constructed in a second plasmid (i.e. pGADT7) from cDNA of potential NgR-binding proteins (for protocols on forming cDNA libraries, see Sambrook et al. 1989, MOLECULAR CLONING: A LABORATORY MANUAL, second edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.), which is incorporated herein by reference in its entirety. The DNA-BD/NgR fusion construct is verified by sequencing, and tested for autonomous reporter gene activation and cell toxicity, both of which would prevent a successful two-hybrid analysis. Similar controls are performed with the AD/library fusion construct to ensure expression in host cells and lack of transcriptional activity. Yeast cells are transformed (ca. 105 transformants/mg DNA) with both the NgR and library fusion plasmids according to standard procedure (Ausubel, et al., 1992, SHORT PROTOCOLS IN MOLECULAR BIOLOGY, fourth edition, Greene and Wiley-interscience, NY, which is incorporated herein by reference in its entirety). In vivo binding of DNA-BD/NgR with AD/library proteins results in transcription of specific yeast plasmid reporter genes (i.e., lacZ, HIS3, ADE2, LEU2). Yeast cells are plated on nutrient-deficient media to screen for expression of reporter genes. Colonies are dually assayed for β-galactosidase activity upon growth in Xgal (5-bromo-4-chloro-3-indolyl-b-D-galactoside) supplemented media (filter assay for b-galactosidase activity is described in Breeden et al., (1985) Cold Spring Harb. Symp. Quant. Biol., 50, 643, which is incorporated herein by reference in its entirety). Positive AD-library plasmids are rescued from transformants and reintroduced into the original yeast strain as well as other strains containing unrelated DNA-BD fusion proteins to confirm specific NgR/library protein interactions. Insert DNA is sequenced to verify the presence of an open reading frame fused to GAL4 AD and to determine the identity of the NgR-binding protein.
Example 13
Antibodies to Nogo-R
[0370]Standard techniques are employed to generate polyclonal or monoclonal antibodies to the NgR receptor, and to generate useful antigen-binding fragments thereof or variants thereof, including "humanized" variants. Such protocols can be found, for example, in Sambrook et al. (1989), above, and Harlow et al. (Eds.), ANTIBODIES A LABORATORY MANUAL; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1988). In one embodiment, recombinant NgR polypeptides (or cells or cell membranes containing such polypeptides) are used as antigen to generate the antibodies. In another embodiment, one or more peptides having amino acid sequences corresponding to an immunogenic portion of NgR (e.g., 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more amino acids) are used as antigen. Peptides corresponding to extracellular portions of Nogo-R, especially hydrophilic extracellular portions, are preferred. The antigen may be mixed with an adjuvant or linked to a hapten to increase antibody production.
[0371]A. Polyclonal or Monoclonal Antibodies
[0372]As one exemplary protocol, recombinant NgR or a synthetic fragment thereof is used to immunize a mouse for generation of monoclonal antibodies (or larger mammal, such as a rabbit, for polyclonal antibodies). To increase antigenicity, peptides are conjugated to Keyhole Limpet Hemocyanin (Pierce), according to the manufacturer's recommendations. For an initial injection, the antigen is emulsified with Freund's Complete Adjuvant and injected subcutaneously. At intervals of two to three weeks, additional aliquots of NgR antigen are emulsified with Freund's Incomplete Adjuvant and injected subcutaneously. Prior to the final booster injection, a serum sample is taken from the immunized mice and assayed by western blot to confirm the presence of antibodies that immunoreact with NgR. Serum from the immunized animals may be used as polyclonal antisera or used to isolate polyclonal antibodies that recognize NgR. Alternatively, the mice are sacrificed and their spleen removed for generation of monoclonal antibodies.
[0373]To generate monoclonal antibodies, the spleens are placed in 10 ml serum-free RPMI 1640, and single cell suspensions are formed by grinding the spleens in serum-free RPMI 1640, supplemented with 2 mM L-glutamine, 1 mM sodium pyruvate, 100 units/ml penicillin, and 100 μg/ml streptomycin (RPMI) (Gibco, Canada). The cell suspensions are filtered and washed by centrifugation and resuspended in serum-free RPMI. Thymocytes taken from three naive Balb/c mice are prepared in a similar manner and used as a Feeder Layer NS-1 myeloma cells, kept in log phase in RPMI with 10% fetal bovine serum (FBS) (Hyclone Laboratories, Inc., Logan, Utah) for three days prior to fusion, are centrifuged and washed as well.
[0374]To produce hybridoma fusions, spleen cells from the immunized mice are combined with NS-1 cells and centrifuged, and the supernatant is aspirated. The cell pellet is dislodged by tapping the tube, and 2 ml of 37° C. PEG 1500 (50% in 75 mM HEPES, pH 8.0) (Boehringer-Mannheim) is stirred into the pellet, followed by the addition of serum-free RPMI. Thereafter, the cells are centrifuged, resuspended in RPMI containing 15% FBS, 100 μM sodium hypoxanthine, 0.4 μM aminopterin, 16 μM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Boehringer-Mannheim) and 1.5×106 thymocytes/ml, and plated into 10 Corning flat-bottom 96-well tissue culture plates (Corning, Corning, N.Y.).
[0375]On days 2, 4, and 6 after the fusion, 100 μl of medium is removed from the wells of the fusion plates and replaced with fresh medium. On day 8, the fusions are screened by ELISA, testing for the presence of mouse IgG that binds to NgR. Selected fusion wells are further cloned by dilution until monoclonal cultures producing anti-NgR antibodies are obtained.
[0376]B. Humanization of Anti-NgR Monoclonal Antibodies
[0377]The expression pattern of NgR as reported herein and the potential of NgRs as targets for therapeutic intervention suggest therapeutic indications for NgR inhibitors (antagonists). NgR-neutralizing antibodies comprise one class of therapeutics useful as NgR antagonists. Following are protocols to improve the utility of anti-NgR monoclonal antibodies as therapeutics in humans by "humanizing" the monoclonal antibodies to improve their serum half-life and render them less immunogenic in human hosts (i.e., to prevent human antibody response to non-human anti-NgR antibodies).
[0378]The principles of humanization have been described in the literature and are facilitated by the modular arrangement of antibody proteins. To minimize the possibility of binding complement, a humanized antibody of the IgG4 isotype is preferred.
[0379]For example, a level of humanization is achieved by generating chimeric antibodies comprising the variable domains of non-human antibody proteins of interest with the constant domains of human antibody molecules. (See, e.g., Morrison et al., (1989) Adv. Immunol., 44, 65-92). The variable domains of NgR-neutralizing anti-NgR antibodies are cloned from the genomic DNA of a B-cell hybridoma or from cDNA generated from mRNA isolated from the hybridoma of interest. The V region gene fragments are linked to exons encoding human antibody constant domains, and the resultant construct is expressed in suitable mammalian host cells (e.g., myeloma or CHO cells).
[0380]To achieve an even greater level of humanization, only those portions of the variable region gene fragments that encode antigen-binding complementarity determining regions ("CDR") of the non-human monoclonal antibody genes are cloned into human antibody sequences. (See, e.g., Jones et al., (1986) Nature 321, 522-525; Riechmann et al., (1988) Nature 332, 323-327; Verhoeyen et al., (1988) Science 239, 1534-1536; and Tempest et al., (1991) Bio/Technology 9, 266-271). If necessary, the β-sheet framework of the human antibody surrounding the CDR3 regions also is modified to more closely mirror the three dimensional structure of the antigen-binding domain of the original monoclonal antibody. (See Kettleborough et al., (1991) Protein Engin. 4, 773-783; and Foote et al., (1992) J. Mol. Biol. 224, 487-499).
[0381]In an alternative approach, the surface of a non-human monoclonal antibody of interest is humanized by altering selected surface residues of the non-human antibody, e.g., by site-directed mutagenesis, while retaining all of the interior and contacting residues of the non-human antibody. See Padlan (1991) Mol. Immunol. 28, 489-498.
[0382]The foregoing approaches are employed using NgR-neutralizing anti-NgR monoclonal antibodies and the hybridomas that produce them to generate humanized NgR-neutralizing antibodies useful as therapeutics to treat or palliate conditions wherein NgR expression or ligand-mediated NgR signaling is detrimental.
[0383]C. Human NgR-Neutralizing Antibodies from Phage Display
[0384]Human NgR-neutralizing antibodies are generated by phage display techniques such as those described in Aujame et al. (1997) Human Antibodies 8, 155-168; Hoogenboom (1997) TIBTECH 15, 62-70; and Rader et al. (1997), Curr. Opin. Biotechnol. 8, 503-508, all of which are incorporated by reference. For example, antibody variable regions in the form of Fab fragments or linked single chain Fv fragments are fused to the amino terminus of filamentous phage minor coat protein pIII. Expression of the fusion protein and incorporation thereof into the mature phage coat results in phage particles that present an antibody on their surface and contain the genetic material encoding the antibody. A phage library comprising such constructs is expressed in bacteria, and the library is screened for NgR-specific phage-antibodies using labeled or immobilized NgR as antigen-probe.
[0385]D. Human NgR-Neutralizing Antibodies from Transgenic Mice
[0386]Human NgR-neutralizing antibodies are generated in transgenic mice essentially as described in Bruggemann et al. (1996) Immunol. Today 17, 391-397 and Bruggemann et al. (1997) Curr. Opin. Biotechnol. 8, 455-458. Transgenic mice carrying human V-gene segments in germline configuration and that express these transgenes in their lymphoid tissue are immunized with a NgR composition using conventional immunization protocols hybridomas are generated using B cells from the immunized mice using conventional protocols and screened to identify hybridomas secreting anti-NgR human antibodies (e.g., as described above).
Example 14
Assays to Identify Modulators of NgR Activity
[0387]Set forth below are several nonlimiting assays for identifying modulators (agonists and antagonists) of NgR activity. Among the modulators that can be identified by these assays are natural ligand compounds of the receptor; synthetic analogs and derivatives of natural ligands; antibodies, antibody fragments, and/or antibody-like compounds derived from natural antibodies or from antibody-like combinatorial libraries; and/or synthetic compounds identified by high-throughput screening of libraries; and the like. All modulators that bind NgR are useful for identifying NgR in tissue samples (e.g., for diagnostic purposes, pathological purposes, and the like). Agonist and antagonist modulators are useful for up-regulating and down-regulating NgR activity, respectively, to treat disease states characterized by abnormal levels of NgR activity. The assays may be performed using single putative modulators, and/or may be performed using a known agonist in combination with candidate antagonists (or visa versa).
[0388]A. cAMP Assays
[0389]In one type of assay, levels of cyclic adenosine monophosphate (cAMP) are measured in NgR-transfected cells that have been exposed to candidate modulator compounds. Protocols for cAMP assays have been described in the literature. (See, e.g., Sutherland et al., (1968) Circulation 37, 279; Frandsen et al., (1976) Life Sciences 18, 529-541; Dooley et al., (1997) J. Pharmacol. Exp. Therap. 283, 735-41; and George et al., (1997) J. Biomol. Screening 2, 235-40). An exemplary protocol for such an assay, using an Adenylyl Cyclase Activation FlashPlate® Assay from NEN® Life Science Products, is set forth below.
[0390]Briefly, the NgR coding sequence (e.g., a cDNA or intronless genomic DNA) is subcloned into a commercial expression vector, such as pzeoSV2 (Invitrogen), and transiently transfected into Chinese Hamster Ovary (CHO) cells using known methods, such as the transfection protocol provided by Boehringer-Mannheim when supplying the FuGENE 6 transfection reagent. Transfected CHO cells are seeded into 96-well microplates from the FlashPlate® assay kit, which are coated with solid scintillant to which antisera to cAMP has been bound. For a control, some wells are seeded with wild type (untransfected) CHO cells. Other wells in the plate receive various amounts of a cAMP standard solution for use in creating a standard curve.
[0391]One or more test compounds (i.e., candidate modulators) are added to the cells in each well, with water and/or compound-free medium/diluent serving as a control or controls. After treatment, cAMP is allowed to accumulate in the cells for exactly 15 minutes at room temperature. The assay is terminated by the addition of lysis buffer containing [125I]-labeled cAMP, and the plate is counted using a Packard Topcount® 96-well microplate scintillation counter. Unlabeled cAMP from the lysed cells (or from standards) and fixed amounts of [125I]-cAMP compete for antibody bound to the plate. A standard curve is constructed, and cAMP values for the unknowns are obtained by interpolation. Changes in intracellular cAMP levels of cells in response to exposure to a test compound are indicative of NgR modulating activity. Modulators that act as agonists of receptors which couple to the Gs subtype of G proteins will stimulate production of cAMP, leading to a measurable 3-10 fold increase in cAMP levels. Agonists of receptors which couple to the Gi/o subtype of G proteins will inhibit forskolin-stimulated cAMP production, leading to a measurable decrease in cAMP levels of 50-100%. Modulators that act as inverse agonists will reverse these effects at receptors that are either constitutively active or activated by known agonists.
[0392]B. Aequorin Assays
[0393]In another assay, cells (e.g., CHO cells) are transiently co-transfected with both a NgR expression construct and a construct that encodes the photoprotein apoaquorin. In the presence of the cofactor coelenterazine, apoaquorin will emit a measurable luminescence that is proportional to the amount of intracellular (cytoplasmic) free calcium. (See generally, Cobbold, et al. "Aequorin measurements of cytoplasmic free calcium," In: McCormack J. G. and Cobbold P. H., eds., CELLULAR CALCIUM: A PRACTICAL APPROACH. Oxford:IRL Press (1991); Stables et al., (1997) Anal. Biochem. 252, 115-26; and Haugland, HANDBOOK OF FLUORESCENT PROBES AND RESEARCH CHEMICALS. Sixth edition. Molecular Probes, Eugene, Oreg. (1996)).
[0394]In one exemplary assay, NgR is subcloned into the commercial expression vector pzeoSV2 (Invitrogen) and transiently co-transfected along with a construct that encodes the photoprotein apoaquorin (Molecular Probes, Eugene, Oreg.) into CHO cells using the transfection reagent FuGENE 6 (Boehringer-Mannheim) and the transfection protocol provided in the product insert.
[0395]The cells are cultured for 24 hours at 37° C. in MEM (Gibco/BRL, Gaithersburg, Md.) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 μg/ml streptomycin, at which time the medium is changed to serum-free MEM containing 5 μM coelenterazine (Molecular Probes, Eugene, Oreg.). Culturing is then continued for two additional hours at 37° C. Subsequently, cells are detached from the plate using VERSEN (Gibco/BRL), washed, and resuspended at 200,000 cells/ml in serum-free MEM.
[0396]Dilutions of candidate NgR modulator compounds are prepared in serum-free MEM and dispensed into wells of an opaque 96-well assay plate at 50 μl/well. Plates are then loaded onto an MLX microtiter plate luminometer (Dynex Technologies, Inc., Chantilly, Va.). The instrument is programmed to dispense 50 μl cell suspensions into each well, one well at a time, and immediately read luminescence for 15 seconds. Dose-response curves for the candidate modulators are constructed using the area under the curve for each light signal peak. Data are analyzed with SlideWrite, using the equation for a one-site ligand, and EC50 values are obtained. Changes in luminescence caused by the compounds are considered indicative of modulatory activity. Modulators that act as agonists at receptors which couple to the Gq subtype of G proteins give an increase in luminescence of up to 100 fold. Modulators that act as inverse agonists will reverse this effect at receptors that are either constitutively active or activated by known agonists.
[0397]C. Luciferase Reporter Gene Assay
[0398]The photoprotein luciferase provides another useful tool for assaying for modulators of NgR activity. Cells (e.g., CHO cells or COS 7 cells) are transiently co-transfected with both a NgR expression construct (e.g., NgR in pzeoSV2) and a reporter construct which includes a gene for the luciferase protein downstream from a transcription factor binding site, such as the cAMP-response element (CRE), AP-1, or NF-kappa B. Expression levels of luciferase reflect the activation status of the signaling events. (See generally, George et al. (1997) J. Biomol. Screening 2, 235-240; and Stratowa et al. (1995) Curr. Opin. Biotechnol. 6, 574-581). Luciferase activity may be quantitatively measured using, e.g., luciferase assay reagents that are commercially available from Promega (Madison, Wis.).
[0399]In one exemplary assay, CHO cells are plated in 24-well culture dishes at a density of 100,000 cells/well one day prior to transfection and cultured at 37° C. in MEM (Gibco/BRL) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 μg/ml streptomycin. Cells are transiently co-transfected with both a NgR expression construct and a reporter construct containing the luciferase gene. The reporter plasmids CRE-luciferase, AP-1-luciferase and NF-kappaB-luciferase may be purchased from Stratagene (Legally, Calif.). Transfections are performed using the FuGENE 6 transfection reagent (Boehringer-Mannheim) according to the supplier's instructions. Cells transfected with the reporter construct alone are used as a control. Twenty-four hours after transfection, cells are washed once with PBS pre-warned to 37° C. Serum-free MEM is then added to the cells either alone (control) or with one or more candidate modulators and the cells are incubated at 37° C. for five hours. Thereafter, cells are washed once with ice-cold PBS and lysed by the addition of 100 μl of lysis buffer per well from the luciferase assay kit supplied by Promega. After incubation for 15 minutes at room temperature, 15 μl of the lysate is mixed with 50 μl of substrate solution (Promega) in an opaque-white, 96-well plate, and the luminescence is read immediately on a Wallace model 1450 MicroBeta scintillation and luminescence counter (Wallace Instruments, Gaithersburg, Md.).
[0400]Differences in luminescence in the presence versus the absence of a candidate modulator compound are indicative of modulatory activity. Receptors that are either constitutively active or activated by agonists typically give a 3-20-fold stimulation of luminescence compared to cells transfected with the reporter gene alone. Modulators that act as inverse agonists will reverse this effect.
[0401]D. Intracellular Calcium Measurement using FLIPR
[0402]Changes in intracellular calcium levels are another recognized indicator of receptor activity, and such assays can be employed to screen for modulators of NgR activity. For example, CHO cells stably transfected with a NgR expression vector are plated at a density of 4×104 cells/well in Packard black-walled, 96-well plates specially designed to discriminate fluorescence signals emanating from the various wells on the plate. The cells are incubated for 60 minutes at 37° C. in modified Dulbecco's PBS (D-PBS) containing 36 mg/L pyruvate and 1 g/L glucose with the addition of 1% fetal bovine serum and one of four calcium indicator dyes (Fluo-3® AM, Fluo-4® AM, Calcium Green®-1 AM, or Oregon Green® 488 BAPTA-1 AM), each at a concentration of 4 μM. Plates are washed once with modified D-PBS without 1% fetal bovine serum and incubated for 10 minutes at 37° C. to remove residual dye from the cellular membrane. In addition, a series of washes with modified D-PBS without 1% fetal bovine serum is performed immediately prior to activation of the calcium response.
[0403]A calcium response is initiated by the addition of one or more candidate receptor agonist compounds, calcium ionophore A23187 (10 μM; positive control), or ATP (4 μM; positive control). Fluorescence is measured by Molecular Device's FLIPR with an argon laser (excitation at 488 nm). (See, e.g., Kuntzweiler et al. (1998) Drug Dev. Res. 44, 14-20). The F-stop for the detector camera is set at 2.5 and the length of exposure is 0.4 milliseconds. Basal fluorescence of cells is measured for 20 seconds prior to addition of candidate agonist, ATP, or A23187, and the basal fluorescence level is subtracted from the response signal. The calcium signal is measured for approximately 200 seconds, taking readings every two seconds. Calcium ionophore A23187 and ATP increase the calcium signal 200% above baseline levels. In general, activated NgRs increase the calcium signal at least about 10-15% above baseline signal.
[0404]E. [35S]GTPγS Binding Assay
[0405]It is also possible to evaluate whether NgR signals through a G protein-mediated pathway. Because G protein-coupled receptors signal through intracellular G proteins whose activity involves GTP binding and hydrolysis to yield bound GDP, measurement of binding of the non-hydrolyzable GTP analog [35S]-GTPγS in the presence and absence of candidate modulators provides another assay for modulator activity. (See, e.g., Kowal et al., (1998) Neuropharmacology 37, 179-187.).
[0406]In one exemplary assay, cells stably transfected with a NgR expression vector are grown in 10 cm tissue culture dishes to subconfluence, rinsed once with 5 ml of ice-cold Ca2+/Mg2+-free phosphate-buffered saline, and scraped into 5 ml of the same buffer. Cells are pelleted by centrifugation (500×g, 5 minutes), resuspended in TEE buffer (25 mM Tris, pH 7.5, 5 mM EDTA, 5 mM EGTA), and frozen in liquid nitrogen. After thawing, the cells are homogenized using a Dounce homogenizer (1 ml TEE per plate of cells), and centrifuged at 1,000×g for 5 minutes to remove nuclei and unbroken cells.
[0407]The homogenate supernatant is centrifuged at 20,000×g for 20 minutes to isolate the membrane fraction, and the membrane pellet is washed once with TEE and resuspended in binding buffer (20 mM BEPES, pH 7.5, 150 mM NaCl, 10 mM MgCl2, 1 mM EDTA). The resuspended membranes can be frozen in liquid nitrogen and stored at -70° C. until use.
[0408]Aliquots of cell membranes prepared as described above and stored at -70° C. are thawed, homogenized, and diluted into buffer containing 20 mM HEPES, 10 mM MgCl2, 1 mM EDTA, 120 mM NaCl, 10 μM GDP, and 0.2 mM ascorbate, at a concentration of 10-50 μg/ml. In a final volume of 90 μl, homogenates are incubated with varying concentrations of candidate modulator compounds or 100 μM GTP for 30 minutes at 30° C. and then placed on ice. To each sample, 10 μl guanosine 5'-O-(3-[35S]thio) triphosphate (NEN, 1200 Cimmol; [35S]-GTPγS), was added to a final concentration of 100-200 μM. Samples are incubated at 30° C. for an additional 30 minutes, 1 ml of 10 mM HEPES, pH 7.4, 10 mM MgCl2, at 4° C. is added and the reaction is stopped by filtration.
[0409]Samples are filtered over Whatman GF/B filters and the filters are washed with 20 ml ice-cold 10 mM HEPES, pH 7.4, 10 mM MgCl2. Filters are counted by liquid scintillation spectroscopy. Nonspecific binding of [35]-GTPγS is measured in the presence of 100 μM GTP and subtracted from the total. Compounds are selected that modulate the amount of [35S]-GTPγS binding in the cells, compared to untransfected control cells. Activation of receptors by agonists gives up to a five-fold increase in [35S]-GTPγS binding This response is blocked by antagonists.
[0410]F. [3H]Arachidonic Acid Release
[0411]The activation of NgRs may also potentiate arachidonic acid release in cells, providing yet another useful assay for modulators of NgR activity. (See, e.g., Kanterman et al., (1991) Mol. Pharmacol. 39, 364-369.) For example, CHO cells that are stably transfected with a NgR expression vector are plated in 24-well plates at a density of 15,000 cells/well and grown in MEM medium supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 μg/ml streptomycin for 48 hours at 37° C. before use. Cells of each well are labeled by incubation with [3H]-arachidonic acid. (Amersham Corp., 210 Ci/mmol) at 0.5 μCi/ml in 1 ml MEM supplemented with 10 mM HEPES, pH 7.5, and 0.5% fatty-acid-free bovine serum albumin for 2 hours at 37° C. The cells are then washed twice with 1 ml of the same buffer.
[0412]Candidate modulator compounds are added in 1 ml of the same buffer, either alone or with 10 μM ATP and the cells are incubated at 37° C. for 30 minutes. Buffer alone and mock-transfected cells are used as controls. Samples (0.5 ml) from each well are counted by liquid scintillation spectroscopy. Agonists which activate the receptor will lead to potentiation of the ATP-stimulated release of [3H]-arachidonic acid. This potentiation is blocked by antagonists.
[0413]G. Extracellular Acidification Rate
[0414]In yet another assay, the effects of candidate modulators of NgR activity are assayed by monitoring extracellular changes in pH induced by the test compounds (see, e.g., Dunlop et al. (1998) J. Pharmacol. Toxicol. Meth 40, 47-55). In one embodiment, CHO cells transfected with a NgR expression vector are seeded into 12 mm capsule cups (Molecular Devices Corp.) at 4×105 cells/cup in MEM supplemented with 10% fetal bovine serum, 2 mM L-glutamine, 10 U/ml penicillin, and 10 μg/ml streptomycin. The cells are incubated in this medium at 37° C. in 5% CO2 for 24 hours.
[0415]Extracellular acidification rates are measured using a Cytosensor microphysiometer (Molecular Devices Corp.). The capsule cups are loaded into the sensor chambers of the microphysiometer and the chambers are perfused with running buffer (bicarbonate-free MEM supplemented with 4 mM L-glutamine, 10 units/ml penicillin, 10 μg/ml streptomycin, 26 mM NaCl) at a flow rate of 100 μl/minute. Candidate agonists or other agents are diluted into the running buffer and perfused through a second fluid path. During each 60-second pump cycle, the pump is run for 38 seconds and is off for the remaining 22 seconds. The pH of the running buffer in the sensor chamber is recorded during the cycle from 43-58 seconds, and the pump is re-started at 60 seconds to start the next cycle. The rate of acidification of the running buffer during the recording time is calculated by the Cytosoft program. Changes in the rate of acidification are calculated by subtracting the baseline value (the average of 4 rate measurements immediately before addition of a modulator candidate) from the highest rate measurement obtained after addition of a modulator candidate. The selected instrument detects 61 mV/pH unit. Modulators that act as agonists of the receptor result in an increase in the rate of extracellular acidification compared to the rate in the absence of agonist. This response is blocked by modulators which act as antagonists of the receptor.
Example 15
mNgR3 does not Bind hNogo-A(1055-1120)
[0416]To functionally test the mouse NgR3 (hereinafter, mNgR3) for its ability to bind hNogo-A(1055-1120), a cDNA expression vector for a myc epitope-tagged mNgR3 protein was created. The mouse NgR3 cDNA was amplified by PCR from mouse adult brain cDNA, from the signal sequence to the stop codon, and ligated into the pSecTag2 vector such that the vector encodes a signal sequence followed by a myc tag followed by the mature mNgR3 sequence. This plasmid was transfected into COS07 cells, and expression of a myc-tagged protein of the predicted size was verified by immunoblot analysis. Alkaline phosphatase-hNogo-A(1055-1120) binding studies and myc immunohistology were conducted as described (Fournier et al., supra).
[0417]The cells expressing mNgR3 express the myc-tagged protein but binding to AP-hNogo-A(1055-1120) was not observed under the conditions employed (FIG. 8).
Example 16
Identification of Partial Human NgR3 cDNA and Protein Sequences
[0418]The tblastn program was used to search for the human homolog of mouse NgR3. The mouse NgR3 protein sequence (SEQ ID NO:4) was used to query a proprietary human expressed sequence tag (EST) database from Incyte yielding one highly significant hit: Incyte Template ID 190989.1. This sequence (937 nucleotides) contains an open reading frame of 312 amino acids in the second reverse frame that exhibits 88% identity with residues 66 to 381 of mouse NgR3 (SEQ ID NO:4), strongly indicating that it is part of the human NgR3 homolog.
[0419]A query of SEQ ID NO:4 against the public human EST database in Genbank also produced a hit with a 465-bp EST (Accession number: R35699; Version number: R35699.1; GI: 792600). There are a number of single nucleotide deletions and insertions within this sequence which cause frame shift errors. All of the reliable sequence contained in this public EST is present in the Incyte EST (Template ID 190989.1).
[0420]To obtain more nucleotide sequence that would extend the amino acid sequence at that carboxy terminal end, the I.M.A.G.E. Consortium clone No. 38319, which corresponds to Genbank accession No. R35699, was purchased from Incyte Genomics Inc. and subjected to further DNA sequence analysis. This clone consists of a NotI/HinD III fragment containing the sequence of interest, cloned into the Not/HinD III sites of the vector Lafinid BA (http://image.llnl.gov/image/html/libs/lafmidBA.shtml). The clone was received as an agar stab, which was streaked out on LB agar plates containing 50 ug/ml ampicillin to isolate individual colonies. Six colonies were grown in LB medium with antibiotic, and plasmid DNA was prepared using the Promega Wizard Plus Miniprep DNA Purification System (Promega #A7500). These DNAs were subsequently digested with NotI and HinD III restriction enzymes to confirm that the clones contained an insert. The insert of one isolate was sequenced using a combination of vector specific and gene specific primers yielding a partial nucleotide sequence of human NgR3 of 1176 nucleotides (SEQ ID NO:13). A translation of this sequence provides a partial sequence for human NgR3 of 392 amino acids (SEQ ID NO:14).
[0421]The nucleotide sequence of SEQ ID NO:13 differs from the Incyte EST sequence at three positions. Nucleotide positions 12-13 in SEQ ID NO:13 are CG, whereas the corresponding nucleotides in the Incyte Template ID 190989.1 are GT (i.e., positions 12-13 of the complement of Incyte Template ID 190989.1). In addition, position 641 in SEQ ID NO:13 is a C, whereas the corresponding nucleotide in the Incyte Template ID 190989.1 sequence is an A (i.e., position 641 of the complement of Incyte Template ID 190989.1). This results in two changes in amino acids when comparing SEQ ID NO:14 to the ORF encoded by Incyte Template 190989.1: SEQ ID NO:14 contains a valine at position 5, whereas the ORF encoded by Incyte Template ID 190989.1 contains a leucine; SEQ ID NO:14 contains an alanine at position 214, whereas the ORF encoded by Incyte Template ID 190989.1 contains a glutamic acid.
[0422]The nucleotide sequence of SEQ ID NO:13 differs from the public EST (Accession number: R35699; Version number: R35699.1; GI: 792600) sequence at two positions (within the first 200 nucleotides of reliable sequence). Nucleotide positions 12-13 in SEQ ID NO:13 are CG, whereas the corresponding nucleotides in the public EST are GT (i.e., positions 12-13 of the public EST; Accession no: R135699; Version no: R35699.1; GI: 792600) This leads to a single amino acid change when comparing SEQ ID NO:14 to the ORF encoded by the public EST: SEQ ID NO:14 contains a valine at position 5, while the ORF encoded by the public EST contains a leucine.
[0423]A Bestfit analysis of the partial human amino acid sequence with the full-length mouse amino acid sequence indicates that the human NgR3 amino acid sequence is complete at the carboxy terminal end and that they share 89.54% identity. An alignment of all the NgR proteins is shown in FIG. 9. Although the human NgR3 amino acid sequence is missing the first 25 amino acids, it can be determined that the human NgR3 protein contains the following features in common with the other NgR sequences: (1) eight Leucine Rich Repeat (LRR) domains; (2) an LRR carboxy-terminal (LRR-CT) domain; (3) a conserved cysteine in the fourth LRR domain; (4) a conserved potential glycosylation site in the eighth LRR domain; and (5) a hydrophobic carboxyl terminus.
[0424]As those skilled in the art will appreciate, numerous changes and modifications may be made to the preferred embodiments of the invention without departing from the spirit of the invention. It is intended that all such variations fall within the scope of the invention.
[0425]The entire disclosure of each publication cited herein is hereby incorporated by reference. This application claims benefit from U.S. provisional application 60/238,361, filed Oct. 6, 2000, which is incorporated by reference herein in its entirety.
KEY FOR SEQUENCE LISTING
[0426]SEQ ID NO:1 human NgR2 cDNA sequence derived from genomic sequence AC013606 [0427]SEQ ID NO:2 human NgR2 amino acid sequence [0428]SEQ ID NO:3 mouse NgR3 cDNA sequence derived from AC021768 [0429]SEQ ID NO:4 a mouse NgR3 amino acid sequence [0430]SEQ ID NO:5 a human NgR1 amino acid sequence [0431]SEQ ID NO:6 a consensus amino acid sequence for NgRs [0432]SEQ ID NO:7 #1055-1120 amino acid residues of hNogoA (Nogo-66) [0433]SEQ ID NO:8 a mature human NgR2 amino acid sequence [0434]SEQ ID NO:9 a mature mouse NgR3 amino acid sequence [0435]SEQ ID NO:10 a consensus NgR LLRNT amino acid sequence [0436]SEQ ID NO:11 a consensus NgR LRRCT domain amino acid sequence [0437]SEQ ID NO:12 a consensus NgR LRR domain amino acid sequence [0438]SEQ ID NO:13 a partial human NgR3 nucleotide sequence [0439]SEQ ID NO:14 a partial human NgR3 amino acid sequence [0440]SEQ ID NO:15 a genomic sequence encoding a human NgR2 sequence. [0441]SEQ ID NO:16 a genomic sequence (complementary strand) encoding a mouse NgR3 [0442]SEQ ID NO:17 a mouse NgR1 amino acid sequence [0443]SEQ ID NO:18 a consensus sequence for the NTLRRCT domain of NgR [0444]SEQ ID NO:19 an consensus NgR LRRCT domain amino acid sequence
Sequence CWU
1
1911260DNAHomo sapiens 1atgctgcccg ggctcaggcg cctgctgcaa gctcccgcct
cggcctgcct cctgctgatg 60ctcctggccc tgcccctggc ggcccccagc tgccccatgc
tctgcacctg ctactcatcc 120ccgcccaccg tgagctgcca ggccaacaac ttctcctctg
tgccgctgtc cctgccaccc 180agcactcagc gactcttcct gcagaacaac ctcatccgca
cgctgcggcc aggcaccttt 240gggtccaacc tgctcaccct gtggctcttc tccaacaacc
tctccaccat ctacccgggc 300actttccgcc acttgcaagc cctggaggag ctggacctcg
gtgacaaccg gcacctgcgc 360tcgctggagc ccgacacctt ccagggcctg gagcggctgc
agtcgctgca tttgtaccgc 420tgccagctca gcagcctgcc cggcaacatc ttccgaggcc
tggtcagcct gcagtacctc 480tacctccagg agaacagcct gctccaccta caggatgact
tgttcgcgga cctggccaac 540ctgagccacc tcttcctcca cgggaaccgc ctgcggctgc
tcacagagca cgtgtttcgc 600ggcctgggca gcctggaccg gctgctgctg cacgggaacc
ggctgcaggg cgtgcaccgc 660gcggccttcc gcggcctcag ccgcctcacc atcctctacc
tgttcaacaa cagcctggcc 720tcgctgcccg gcgaggcgct cgccgacctg ccctcgctcg
agttcctgcg gctcaacgct 780aacccctggg cgtgcgactg ccgcgcgcgg ccgctctggg
cctggttcca gcgcgcgcgc 840gtgtccagct ccgacgtgac ctgcgccacc cccccggagc
gccagggccg agacctgcgc 900gcgctccgcg aggccgactt ccaggcgtgt ccgcccgcgg
cacccacgcg gccgggcagc 960cgcgcccgcg gcaacagctc ctccaaccac ctgtacgggg
tggccgaggc cggggcgccc 1020ccagccgatc cctccaccct ctaccgagat ctgcctgccg
aagactcgcg ggggcgccag 1080ggcggggacg cgcctactga ggacgactac tgggggggct
acgggggtga ggaccagcga 1140ggggagcaga tgtgccccgg cgctgcctgc caggcgcccc
cggactcccg aggccctgcg 1200ctctcggccg ggctccccag ccctctgctt tgcctcctgc
tcctggtgcc ccaccacctc 12602420PRTHomo sapiens 2Met Leu Pro Gly Leu Arg
Arg Leu Leu Gln Ala Pro Ala Ser Ala Cys 1 5
10 15Leu Leu Leu Met Leu Leu Ala Leu Pro Leu Ala Ala
Pro Ser Cys Pro 20 25 30Met
Leu Cys Thr Cys Tyr Ser Ser Pro Pro Thr Val Ser Cys Gln Ala 35
40 45Asn Asn Phe Ser Ser Val Pro Leu Ser
Leu Pro Pro Ser Thr Gln Arg 50 55
60Leu Phe Leu Gln Asn Asn Leu Ile Arg Thr Leu Arg Pro Gly Thr Phe 65
70 75 80Gly Ser Asn Leu Leu
Thr Leu Trp Leu Phe Ser Asn Asn Leu Ser Thr 85
90 95Ile Tyr Pro Gly Thr Phe Arg His Leu Gln Ala
Leu Glu Glu Leu Asp 100 105
110Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr Phe Gln
115 120 125Gly Leu Glu Arg Leu Gln Ser
Leu His Leu Tyr Arg Cys Gln Leu Ser 130 135
140Ser Leu Pro Gly Asn Ile Phe Arg Gly Leu Val Ser Leu Gln Tyr
Leu145 150 155 160Tyr Leu
Gln Glu Asn Ser Leu Leu His Leu Gln Asp Asp Leu Phe Ala
165 170 175Asp Leu Ala Asn Leu Ser His
Leu Phe Leu His Gly Asn Arg Leu Arg 180 185
190Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp
Arg Leu 195 200 205Leu Leu His Gly
Asn Arg Leu Gln Gly Val His Arg Ala Ala Phe Arg 210
215 220Gly Leu Ser Arg Leu Thr Ile Leu Tyr Leu Phe Asn
Asn Ser Leu Ala225 230 235
240Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu Phe Leu
245 250 255Arg Leu Asn Ala Asn
Pro Trp Ala Cys Asp Cys Arg Ala Arg Pro Leu 260
265 270Trp Ala Trp Phe Gln Arg Ala Arg Val Ser Ser Ser
Asp Val Thr Cys 275 280 285Ala Thr
Pro Pro Glu Arg Gln Gly Arg Asp Leu Arg Ala Leu Arg Glu 290
295 300Ala Asp Phe Gln Ala Cys Pro Pro Ala Ala Pro
Thr Arg Pro Gly Ser305 310 315
320Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val Ala Glu
325 330 335Ala Gly Ala Pro
Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp Leu Pro 340
345 350Ala Glu Asp Ser Arg Gly Arg Gln Gly Gly Asp
Ala Pro Thr Glu Asp 355 360 365Asp
Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gln Arg Gly Glu Gln Met 370
375 380Cys Pro Gly Ala Ala Cys Gln Ala Pro Pro
Asp Ser Arg Gly Pro Ala385 390 395
400Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu Leu
Val 405 410 415Pro His His
Leu 42031383DNAMus sp. 3atgtcttggc agtctggaac cacagtgaca
caatctcccg tgcaggctgc tcaggtctca 60gggtgctgtg tggaattgct gctgttgctg
ctcgctggag agctacctct gggtggtggt 120tgtcctcgag actgtgtgtg ctaccctgcg
cccatgactg tcagctgcca ggcacacaac 180tttgctgcca tcccggaggg catcccagag
gacagtgagc gcatcttcct gcagaacaat 240cgcatcacct tcctccagca gggccacttc
agccccgcca tggtcaccct ctggatctac 300tccaacaaca tcactttcat tgctcccaac
accttcgagg gctttgtgca tctggaggag 360ctagaccttg gagacaaccg acagctgcga
acgctggcac ccgagacctt ccaaggcctg 420gtgaagcttc acgccctcta cctctataag
tgtggactga gcgccctgcc cgcaggcatc 480tttggtggcc tgcacagcct gcagtatctc
tacttgcagg acaaccatat cgagtacctc 540caagatgaca tctttgtgga cctggtcaat
ctcagtcact tgtttctcca tggtaacaag 600ctatggagcc tgggccaagg catcttccgg
ggcctggtga acctggaccg gttgctgctg 660catgagaacc agctacagtg ggttcaccac
aaggctttcc atgacctcca caggctaacc 720accctctttc tcttcaacaa cagcctcact
gagctgcagg gtgactgtct ggcccccctg 780gtggccttgg agttccttcg cctcaatggg
aatgcttggg actgtggctg ccgggcacgt 840tccctgtggg aatggctgcg aaggttccgt
ggctctagct ctgctgtccc ctgcgcgacc 900cccgagctgc ggcaaggcca ggatctgaag
ctgctgaggg tggaggactt ccggaactgc 960acaggaccag tgtctcctca ccagatcaag
tctcacacgc ttaccacctc tgacagggct 1020gcccgcaagg agcaccatcc gtcccatggg
gcctccaggg acaaaggcca cccacatggc 1080catccgcctg gctccaggtc aggttacaag
aaggcaggca agaactgcac cagccacagg 1140aaccggaacc agatctctaa ggtgagctct
gggaaagagc ttaccgaact gcaggactat 1200gcccccgact atcagcacaa gttcagcttt
gacatcatgc ccaccgcacg acccaagagg 1260aagggcaagt gtgctcgcag gacccccatc
cgtgccccca gtggggtgca gcaggcatcc 1320tcaggcacgg cccttggggc cccactcctg
gcctggatac tggggctggc agtcactctc 1380cgc
13834461PRTMus sp. 4Met Ser Trp Gln Ser
Gly Thr Thr Val Thr Gln Ser Pro Val Gln Ala 1 5
10 15Ala Gln Val Ser Gly Cys Cys Val Glu Leu Leu
Leu Leu Leu Leu Ala 20 25
30Gly Glu Leu Pro Leu Gly Gly Gly Cys Pro Arg Asp Cys Val Cys Tyr
35 40 45Pro Ala Pro Met Thr Val Ser Cys
Gln Ala His Asn Phe Ala Ala Ile 50 55
60Pro Glu Gly Ile Pro Glu Asp Ser Glu Arg Ile Phe Leu Gln Asn Asn 65
70 75 80Arg Ile Thr Phe
Leu Gln Gln Gly His Phe Ser Pro Ala Met Val Thr 85
90 95Leu Trp Ile Tyr Ser Asn Asn Ile Thr Phe
Ile Ala Pro Asn Thr Phe 100 105
110Glu Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gln
115 120 125Leu Arg Thr Leu Ala Pro Glu
Thr Phe Gln Gly Leu Val Lys Leu His 130 135
140Ala Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly
Ile145 150 155 160Phe Gly
Gly Leu His Ser Leu Gln Tyr Leu Tyr Leu Gln Asp Asn His
165 170 175Ile Glu Tyr Leu Gln Asp Asp
Ile Phe Val Asp Leu Val Asn Leu Ser 180 185
190His Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Gln
Gly Ile 195 200 205Phe Arg Gly Leu
Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gln 210
215 220Leu Gln Trp Val His His Lys Ala Phe His Asp Leu
His Arg Leu Thr225 230 235
240Thr Leu Phe Leu Phe Asn Asn Ser Leu Thr Glu Leu Gln Gly Asp Cys
245 250 255Leu Ala Pro Leu Val
Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Ala 260
265 270Trp Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu
Trp Leu Arg Arg 275 280 285Phe Arg
Gly Ser Ser Ser Ala Val Pro Cys Ala Thr Pro Glu Leu Arg 290
295 300Gln Gly Gln Asp Leu Lys Leu Leu Arg Val Glu
Asp Phe Arg Asn Cys305 310 315
320Thr Gly Pro Val Ser Pro His Gln Ile Lys Ser His Thr Leu Thr Thr
325 330 335Ser Asp Arg Ala
Ala Arg Lys Glu His His Pro Ser His Gly Ala Ser 340
345 350Arg Asp Lys Gly His Pro His Gly His Pro Pro
Gly Ser Arg Ser Gly 355 360 365Tyr
Lys Lys Ala Gly Lys Asn Cys Thr Ser His Arg Asn Arg Asn Gln 370
375 380Ile Ser Lys Val Ser Ser Gly Lys Glu Leu
Thr Glu Leu Gln Asp Tyr385 390 395
400Ala Pro Asp Tyr Gln His Lys Phe Ser Phe Asp Ile Met Pro Thr
Ala 405 410 415Arg Pro Lys
Arg Lys Gly Lys Cys Ala Arg Arg Thr Pro Ile Arg Ala 420
425 430Pro Ser Gly Val Gln Gln Ala Ser Ser Gly
Thr Ala Leu Gly Ala Pro 435 440
445Leu Leu Ala Trp Ile Leu Gly Leu Ala Val Thr Leu Arg 450
455 4605473PRTHomo sapiens 5Met Lys Arg Ala Ser Ala
Gly Gly Ser Arg Leu Leu Ala Trp Val Leu 1 5
10 15Trp Leu Gln Ala Trp Gln Val Ala Ala Pro Cys Pro
Gly Ala Cys Val 20 25 30Cys
Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gln Gln Gly Leu 35
40 45Gln Ala Val Pro Val Gly Ile Pro Ala
Ala Ser Gln Arg Ile Phe Leu 50 55
60His Gly Asn Arg Ile Ser His Val Pro Ala Ala Ser Phe Arg Ala Cys 65
70 75 80Arg Asn Leu Thr Ile
Leu Trp Leu His Ser Asn Val Leu Ala Arg Ile 85
90 95Asp Ala Ala Ala Phe Thr Gly Leu Ala Leu Leu
Glu Gln Leu Asp Leu 100 105
110Ser Asp Asn Ala Gln Leu Arg Ser Val Asp Pro Ala Thr Phe His Gly
115 120 125Leu Gly Arg Leu His Thr Leu
His Leu Asp Arg Cys Gly Leu Gln Glu 130 135
140Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gln Tyr Leu
Tyr145 150 155 160Leu Gln
Asp Asn Ala Leu Gln Ala Leu Pro Asp Asp Thr Phe Arg Asp
165 170 175Leu Gly Asn Leu Thr His Leu
Phe Leu His Gly Asn Arg Ile Ser Ser 180 185
190Val Pro Glu Arg Ala Phe Arg Gly Leu His Ser Leu Asp Arg
Leu Leu 195 200 205Leu His Gln Asn
Arg Val Ala His Val His Pro His Ala Phe Arg Asp 210
215 220Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn
Asn Leu Ser Ala225 230 235
240Leu Pro Thr Glu Ala Leu Ala Pro Leu Arg Ala Leu Gln Tyr Leu Arg
245 250 255Leu Asn Asp Asn Pro
Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp 260
265 270Ala Trp Leu Gln Lys Phe Arg Gly Ser Ser Ser Glu
Val Pro Cys Ser 275 280 285Leu Pro
Gln Arg Leu Ala Gly Arg Asp Leu Lys Arg Leu Ala Ala Asn 290
295 300Asp Leu Gln Gly Cys Ala Val Ala Thr Gly Pro
Tyr His Pro Ile Trp305 310 315
320Thr Gly Arg Ala Thr Asp Glu Glu Pro Leu Gly Leu Pro Lys Cys Cys
325 330 335Gln Pro Asp Ala
Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro 340
345 350Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val
Pro Pro Gly Asp Ser 355 360 365Pro
Pro Gly Asn Gly Ser Gly Pro Arg His Ile Asn Asp Ser Pro Phe 370
375 380Gly Thr Leu Pro Gly Ser Ala Glu Pro Pro
Leu Thr Ala Val Arg Pro385 390 395
400Glu Gly Ser Glu Pro Pro Gly Phe Pro Thr Ser Gly Pro Arg Arg
Arg 405 410 415Pro Gly Cys
Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 420
425 430Gln Ala Gly Ser Gly Gly Gly Gly Thr Gly
Asp Ser Glu Gly Ser Gly 435 440
445Ala Leu Pro Ser Leu Thr Cys Ser Leu Thr Pro Leu Gly Leu Ala Leu 450
455 460Val Leu Trp Thr Val Leu Gly Pro
Cys465 4706440PRTArtificial SequenceDescription of
Artificial Sequence Consensus sequence 6Cys Pro Xaa Xaa Cys Xaa Cys
Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 1 5
10 15Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa
Pro Xaa Xaa 20 25 30Xaa Xaa
Arg Xaa Phe Leu Xaa Xaa Asn Xaa Ile Xaa Xaa Xaa Xaa Xaa 35
40 45Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Leu Trp Xaa Xaa Ser 50 55 60Asn
Xaa Xaa Xaa Xaa Ile Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa 65
70 75 80Leu Glu Xaa Leu Asp Leu
Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa 85
90 95Pro Xaa Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu
Xaa Leu Xaa Xaa 100 105 110Cys
Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa 115
120 125Leu Gln Tyr Leu Tyr Leu Gln Xaa Asn
Xaa Xaa Xaa Xaa Leu Xaa Asp 130 135
140Asp Xaa Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly145
150 155 160Asn Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa 165
170 175Leu Asp Arg Leu Leu Leu His Xaa Asn Xaa
Xaa Xaa Xaa Val His Xaa 180 185
190Xaa Ala Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa
195 200 205Asn Xaa Leu Xaa Xaa Leu Xaa
Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa 210 215
220Leu Xaa Xaa Leu Arg Leu Asn Xaa Asn Xaa Trp Xaa Cys Xaa Cys
Arg225 230 235 240Ala Arg
Xaa Leu Trp Xaa Trp Xaa Xaa Xaa Xaa Arg Xaa Ser Ser Ser
245 250 255Xaa Val Xaa Cys Xaa Xaa Pro
Xaa Xaa Xaa Xaa Gly Xaa Asp Leu Xaa 260 265
270Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa
Xaa Pro 275 280 285Xaa Xaa Pro Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 290
295 300Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa305 310 315
320Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa
325 330 335Xaa Xaa Xaa Xaa Xaa
Pro Pro Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa 340
345 350Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa 355 360 365Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 370
375 380Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Arg385 390 395
400Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
405 410 415Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa 420
425 430Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu 435
440766PRTHomo sapiens 7Arg Ile Tyr Lys Gly Val Ile Gln Ala
Ile Gln Lys Ser Asp Glu Gly 1 5 10
15His Pro Phe Arg Ala Tyr Leu Glu Ser Glu Val Ala Ile Ser Glu
Glu 20 25 30Leu Val Gln Lys
Tyr Ser Asn Ser Ala Leu Gly His Val Asn Cys Thr 35
40 45Ile Lys Glu Leu Arg Arg Leu Phe Leu Val Asp Asp
Leu Val Asp Ser 50 55 60Leu Lys
658390PRTHomo sapeins 8Cys Pro Met Leu Cys Thr Cys Tyr Ser Ser Pro Pro
Thr Val Ser Cys 1 5 10
15Gln Ala Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr
20 25 30Gln Arg Leu Phe Leu Gln Asn
Asn Leu Ile Arg Thr Leu Arg Pro Gly 35 40
45Thr Phe Gly Ser Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn
Leu 50 55 60Ser Thr Ile Tyr Pro Gly
Thr Phe Arg His Leu Gln Ala Leu Glu Glu 65 70
75 80Leu Asp Leu Gly Asp Asn Arg His Leu Arg Ser
Leu Glu Pro Asp Thr 85 90
95Phe Gln Gly Leu Glu Arg Leu Gln Ser Leu His Leu Tyr Arg Cys Gln
100 105 110Leu Ser Ser Leu Pro Gly
Asn Ile Phe Arg Gly Leu Val Ser Leu Gln 115 120
125Tyr Leu Tyr Leu Gln Glu Asn Ser Leu Leu His Leu Gln Asp
Asp Leu 130 135 140Phe Ala Asp Leu Ala
Asn Leu Ser His Leu Phe Leu His Gly Asn Arg145 150
155 160Leu Arg Leu Leu Thr Glu His Val Phe Arg
Gly Leu Gly Ser Leu Asp 165 170
175Arg Leu Leu Leu His Gly Asn Arg Leu Gln Gly Val His Arg Ala Ala
180 185 190Phe Arg Gly Leu Ser
Arg Leu Thr Ile Leu Tyr Leu Phe Asn Asn Ser 195
200 205Leu Ala Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu
Pro Ser Leu Glu 210 215 220Phe Leu Arg
Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg225
230 235 240Pro Leu Trp Ala Trp Phe Gln
Arg Ala Arg Val Ser Ser Ser Asp Val 245
250 255Thr Cys Ala Thr Pro Pro Glu Arg Gln Gly Arg Asp
Leu Arg Ala Leu 260 265 270Arg
Glu Ala Asp Phe Gln Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro 275
280 285Gly Ser Arg Ala Arg Gly Asn Ser Ser
Ser Asn His Leu Tyr Gly Val 290 295
300Ala Glu Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp305
310 315 320Leu Pro Ala Glu
Asp Ser Arg Gly Arg Gln Gly Gly Asp Ala Pro Thr 325
330 335Glu Asp Asp Tyr Trp Gly Gly Tyr Gly Gly
Glu Asp Gln Arg Gly Glu 340 345
350Gln Met Cys Pro Gly Ala Ala Cys Gln Ala Pro Pro Asp Ser Arg Gly
355 360 365Pro Ala Leu Ser Ala Gly Leu
Pro Ser Pro Leu Leu Cys Leu Leu Leu 370 375
380Leu Val Pro His His Leu385 3909421PRTMus sp. 9Cys
Pro Arg Asp Cys Val Cys Tyr Pro Ala Pro Met Thr Val Ser Cys 1
5 10 15Gln Ala His Asn Phe Ala Ala
Ile Pro Glu Gly Ile Pro Glu Asp Ser 20 25
30Glu Arg Ile Phe Leu Gln Asn Asn Arg Ile Thr Phe Leu Gln
Gln Gly 35 40 45His Phe Ser Pro
Ala Met Val Thr Leu Trp Ile Tyr Ser Asn Asn Ile 50
55 60Thr Phe Ile Ala Pro Asn Thr Phe Glu Gly Phe Val His
Leu Glu Glu 65 70 75
80Leu Asp Leu Gly Asp Asn Arg Gln Leu Arg Thr Leu Ala Pro Glu Thr
85 90 95Phe Gln Gly Leu Val Lys
Leu His Ala Leu Tyr Leu Tyr Lys Cys Gly 100
105 110Leu Ser Ala Leu Pro Ala Gly Ile Phe Gly Gly Leu
His Ser Leu Gln 115 120 125Tyr Leu
Tyr Leu Gln Asp Asn His Ile Glu Tyr Leu Gln Asp Asp Ile 130
135 140Phe Val Asp Leu Val Asn Leu Ser His Leu Phe
Leu His Gly Asn Lys145 150 155
160Leu Trp Ser Leu Gly Gln Gly Ile Phe Arg Gly Leu Val Asn Leu Asp
165 170 175Arg Leu Leu Leu
His Glu Asn Gln Leu Gln Trp Val His His Lys Ala 180
185 190Phe His Asp Leu His Arg Leu Thr Thr Leu Phe
Leu Phe Asn Asn Ser 195 200 205Leu
Thr Glu Leu Gln Gly Asp Cys Leu Ala Pro Leu Val Ala Leu Glu 210
215 220Phe Leu Arg Leu Asn Gly Asn Ala Trp Asp
Cys Gly Cys Arg Ala Arg225 230 235
240Ser Leu Trp Glu Trp Leu Arg Arg Phe Arg Gly Ser Ser Ser Ala
Val 245 250 255Pro Cys Ala
Thr Pro Glu Leu Arg Gln Gly Gln Asp Leu Lys Leu Leu 260
265 270Arg Val Glu Asp Phe Arg Asn Cys Thr Gly
Pro Val Ser Pro His Gln 275 280
285Ile Lys Ser His Thr Leu Thr Thr Ser Asp Arg Ala Ala Arg Lys Glu 290
295 300His His Pro Ser His Gly Ala Ser
Arg Asp Lys Gly His Pro His Gly305 310
315 320His Pro Pro Gly Ser Arg Ser Gly Tyr Lys Lys Ala
Gly Lys Asn Cys 325 330
335Thr Ser His Arg Asn Arg Asn Gln Ile Ser Lys Val Ser Ser Gly Lys
340 345 350Glu Leu Thr Glu Leu Gln
Asp Tyr Ala Pro Asp Tyr Gln His Lys Phe 355 360
365Ser Phe Asp Ile Met Pro Thr Ala Arg Pro Lys Arg Lys Gly
Lys Cys 370 375 380Ala Arg Arg Thr Pro
Ile Arg Ala Pro Ser Gly Val Gln Gln Ala Ser385 390
395 400Ser Gly Thr Ala Leu Gly Ala Pro Leu Leu
Ala Trp Ile Leu Gly Leu 405 410
415Ala Val Thr Leu Arg 4201017PRTArtificial
SequenceDescription of Artificial Sequence Consensus sequence 10Cys
Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 1
5 10 15Cys1150PRTArtificial
SequenceDescription of Artificial Sequence Consensus sequence 11Asn
Xaa Trp Xaa Cys Xaa Cys Arg Ala Arg Xaa Leu Trp Xaa Trp Xaa 1
5 10 15Xaa Xaa Xaa Arg Xaa Ser Ser
Ser Xaa Val Xaa Cys Xaa Xaa Pro Xaa 20 25
30Xaa Xaa Xaa Gly Xaa Asp Leu Xaa Xaa Leu Xaa Xaa Xaa Asp
Xaa Xaa 35 40 45Xaa Cys
5012196PRTArtificial SequenceDescription of Artificial Sequence Consensus
sequence 12Arg Xaa Phe Leu Xaa Xaa Asn Xaa Ile Xaa Xaa Xaa Xaa Xaa
Xaa Xaa 1 5 10 15Phe Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser Asn Xaa 20
25 30Xaa Xaa Xaa Ile Xaa Xaa Xaa Xaa Phe
Xaa Xaa Xaa Xaa Xaa Leu Glu 35 40
45Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa Pro Xaa
50 55 60Thr Phe Xaa Gly Leu Xaa Xaa Leu
Xaa Leu Xaa Leu Xaa Xaa Cys Xaa 65 70
75 80Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa
Xaa Leu Gln 85 90 95Tyr
Leu Tyr Leu Gln Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp Asp Xaa
100 105 110Phe Xaa Asp Leu Xaa Asn Leu
Xaa His Leu Phe Leu His Gly Asn Xaa 115 120
125Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa Leu
Asp 130 135 140Arg Leu Leu Leu His Xaa
Asn Xaa Xaa Xaa Xaa Val His Xaa Xaa Ala145 150
155 160Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa
Leu Phe Xaa Asn Xaa 165 170
175Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa Leu Xaa
180 185 190Xaa Leu Arg Leu
195131176DNAHomo sapiens 13gagggcatcc ccgtggacag cgagcgcgtc ttcctgcaga
acaaccgcat cggcctcctc 60cagcccggcc acttcagccc cgccatggtc accctgtgga
tctactcgaa caacatcacc 120tacatccacc ccagcacctt cgagggcttc gtgcacctgg
aggagctgga cctcggcgac 180aaccggcagc tgcggacgct ggcacccgag accttccagg
gcctggtgaa gcttcacgcc 240ctctacctct acaagtgtgg gctcagcgcc ttgccggccg
gcgtctttgg cggcctgcac 300agcctgcagt acctctacct gcaggacaac cacatcgagt
acctccagga cgacatcttc 360gtggacctgg tcaacctcag ccacctgttt ctccacggca
acaagctgtg gagtctgggc 420ccgggcacct tccggggcct ggtgaacctg gaccgtcttt
tgctgcacga gaaccagctg 480cagtgggtcc accacaaggc attccacgac ctccgcaggc
tgaccaccct cttcctcttc 540aacaacagcc tctcggagct gcagggtgag tgcctggccc
cgctgggggc cctggagttc 600ctccgcctca acggcaaccc ctgggactgt ggttgtcgcg
cgcgctccct gtgggaatgg 660ctgcagaggt tccggggctc cagctccgct gtcccctgtg
tgtcccctgg gctgcggcac 720ggccaggacc tgaagctgct gagggccgag gacttccgga
actgcacggg accagcgtcc 780ccgcaccaga tcaagtcaca cacgctcacc accaccgaca
gggccgcccg caaggaacac 840cactcacccc acggccccac caggagcaag ggccacccgc
acggcccccg gcccggccac 900aggaagccgg ggaagaactg caccaacccc aggaaccgca
atcagatctc taaggcgggc 960gccgggaaac aggcccccga gctgccagac tatgccccag
actaccagca caagttcagt 1020tttgacatca tgcctacggc ccggcccaag aggaagggca
agtgtgcccg caggaccccc 1080atccgtgccc ccagcggggt gcagcaggcc tcctcggcca
gttccctggg ggcctccctc 1140ctggcctgga cactggggct ggcggtcact ctccgc
117614392PRTHomo sapiens 14Glu Gly Ile Pro Val Asp
Ser Glu Arg Val Phe Leu Gln Asn Asn Arg 1 5
10 15Ile Gly Leu Leu Gln Pro Gly His Phe Ser Pro Ala
Met Val Thr Leu 20 25 30Trp
Ile Tyr Ser Asn Asn Ile Thr Tyr Ile His Pro Ser Thr Phe Glu 35
40 45Gly Phe Val His Leu Glu Glu Leu Asp
Leu Gly Asp Asn Arg Gln Leu 50 55
60Arg Thr Leu Ala Pro Glu Thr Phe Gln Gly Leu Val Lys Leu His Ala 65
70 75 80Leu Tyr Leu Tyr Lys
Cys Gly Leu Ser Ala Leu Pro Ala Gly Val Phe 85
90 95Gly Gly Leu His Ser Leu Gln Tyr Leu Tyr Leu
Gln Asp Asn His Ile 100 105
110Glu Tyr Leu Gln Asp Asp Ile Phe Val Asp Leu Val Asn Leu Ser His
115 120 125Leu Phe Leu His Gly Asn Lys
Leu Trp Ser Leu Gly Pro Gly Thr Phe 130 135
140Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gln
Leu145 150 155 160Gln Trp
Val His His Lys Ala Phe His Asp Leu Arg Arg Leu Thr Thr
165 170 175Leu Phe Leu Phe Asn Asn Ser
Leu Ser Glu Leu Gln Gly Glu Cys Leu 180 185
190Ala Pro Leu Gly Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn
Pro Trp 195 200 205Asp Cys Gly Cys
Arg Ala Arg Ser Leu Trp Glu Trp Leu Gln Arg Phe 210
215 220Arg Gly Ser Ser Ser Ala Val Pro Cys Val Ser Pro
Gly Leu Arg His225 230 235
240Gly Gln Asp Leu Lys Leu Leu Arg Ala Glu Asp Phe Arg Asn Cys Thr
245 250 255Gly Pro Ala Ser Pro
His Gln Ile Lys Ser His Thr Leu Thr Thr Thr 260
265 270Asp Arg Ala Ala Arg Lys Glu His His Ser Pro His
Gly Pro Thr Arg 275 280 285Ser Lys
Gly His Pro His Gly Pro Arg Pro Gly His Arg Lys Pro Gly 290
295 300Lys Asn Cys Thr Asn Pro Arg Asn Arg Asn Gln
Ile Ser Lys Ala Gly305 310 315
320Ala Gly Lys Gln Ala Pro Glu Leu Pro Asp Tyr Ala Pro Asp Tyr Gln
325 330 335His Lys Phe Ser
Phe Asp Ile Met Pro Thr Ala Arg Pro Lys Arg Lys 340
345 350Gly Lys Cys Ala Arg Arg Thr Pro Ile Arg Ala
Pro Ser Gly Val Gln 355 360 365Gln
Ala Ser Ser Ala Ser Ser Leu Gly Ala Ser Leu Leu Ala Trp Thr 370
375 380Leu Gly Leu Ala Val Thr Leu Arg385
39015143899DNAHomo sapiensmodified_base(2044)..(2144)a, t, c, g,
other or unknown 15aagcacatac aggtgacatt acagaactga cagttatgcc aggcactgta
cttagcccct 60ataccatcct caaacagctg tatgatgtag attgggtatt aaccccatta
ataacaaaag 120tacagggaac aaagtgactt tccaaaggtc atgccattca aaggagggtg
aatcttaggt 180tggacgcagg ctgtctgact ctggagtctg aggtgttaat gctgcctcct
ccatgggaac 240agcccaagtg aaaaacagct gatccactct tcatttactt ggcatctgtg
ctaagctggt 300ccctgagcca agctctgagc aacagaaaca gaagctctgc attaggagct
tgtgagcatg 360tcaatgccgg gtaaaggagt gctggaaacc gctgggatgg ccgccgagca
ctaggccgtt 420gaaggtgggc tctgtgtgac tggttcctct acactctggc ctggctgcct
gcaggaagaa 480gatcaagctg agtgggctgg ccctggacca caaggtgaca ggtgacctct
tctacaccca 540tgtgaccacc atgggccaga ggctcagcca gaaggccccc agcctggagg
acggttcgga 600tgccttcatg tcaccccagg atgttcgggg cacctcagaa aaccttcctg
agagtgagtg 660tctggtcaag gtgccggcct tgggggatag tgatggtggg tcctcatatt
cagtgagcac 720tcatggttga gtatttattc gcacccctct tcagtcctta caacacccca
tgatgtaggt 780ggggcatgct cctcatttac agatgggcac atcaaagctc agctaacgct
gggaagttca 840gattcagggt taccctgctg gattcctggg attggggagg gaggagcttc
caaaatgggg 900acaaggtctc tgggcctgtc gggtagctgg tttcctcagg gccccttgca
acctctgagc 960ttattgcatc aggtgcagcc aggcccgtga gcctcctggc aggggtcctc
cacacctggc 1020tgtcttttgc cccctgctgg tcacaggagg agctgcagca cctgcctggg
ctgcttctca 1080ggagggtaca tgaagatccc aggaccgcca gctccatgat aagtggaagg
agctccttgg 1140agtcaggagc gggagttgag gagtttgagt cctgctctcc agttataggc
tatgtgactt 1200gtgtagatca cctaaccttg ctcttgattt ccttacctct taaactagca
ctaaaagcac 1260cccacaaact gtaaagttag ttgtgatgat tgaatgacac catgggtgtg
gaagctcttt 1320gtaaagtgca aaacggtgtg cagtttgagg gtggttaccc ccagtgccga
ttctcagagg 1380gcaacatggc taagggcacg agctggagtt aggctgacct gctgcttcca
gccctgtgag 1440cttgagcaag tcatttaact tcctgagctg cagtttcctc atcagtaaaa
tgtgataagg 1500atagggttgt tgtaagattt tattaaatgg ggtaataaat gtcaagtatg
tagcccatag 1560tgagtgcttc agagtttttt tcttttgttt ctttcccccc cgccccgaga
tggagcctta 1620ctctgttgcc caggctggag tgcagtggca tgatcttggc tcactgcaac
ctccgcctcc 1680cgggttcaag caattctcct gcctcagcct cccaaatagc tgggactaca
ggcgtgcacc 1740accatgctcg gctaattttt gtatctttag tagagacggg gtttcaccat
gttggccagg 1800ctggtctcga actcctgacc tcatgatgct cctgcctcag cccccgaaag
ttttgggatt 1860acaagtgtga gcccccgtgc cctgccaggt tttttttttt tttttttttt
tgtaaaacac 1920ccacagggta ttgctgttgc ctgggctgga gtgcggtagt gcaatcatag
ttcactgcag 1980ccttgacctc ctgggctcaa gtgatcctcc tgcctcagcc tcctgagtag
ctgggaatac 2040aggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 2100nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttttgta
ttttaagtag 2160agacagggtt ttcccaatgt tggccaaggc tggtctaaaa ctcccaacct
caggtgatcc 2220acccacctca gcctcccaaa gtactgggat tacaggcgtg agacaccgtg
cccagccagg 2280aggcttattt tcttgataaa ttacccagtc tcaggtattt ctctacagcg
atgcaagaac 2340agcctaatac atccaggctc agcatcagtg gacccaggtg ggagagctta
agatgtcaag 2400gtctgaatgc cgcttccaca cacctttggg acctagggac tccctctctt
tttctttttt 2460cagtagaaga tgttatcttc tcctttctct gaccagtagt tggtgatggt
ttcagagata 2520gtttttcagt caagatatat ttcagtggct tcactgagcc caagttccct
cgcctctcta 2580ggactttatt tccttgtttc tagaagaggg ataacacata ttttctaagg
tggttgtgag 2640attaagggag ctggtaccgg gtggtgcata aggacaggat agagcaatgg
tgagaccact 2700caaaaagcga aaagttgacc tgcgagggtg acacttatca aatcagcaca
cagtgggagt 2760ggaaggaatg tccctcatca gttacaatat ttggagagtg caagttatag
aaaacccagc 2820cctggccggg cgcggtgggt catgcctata atcccagcac tttgggaggc
tgaggcaggt 2880ggatcacgag gtcaggagtt caagaccagc ctgaccaacg tggtgaaacc
ccacctctac 2940taaaaataca aaattagctg ggcgtggtgg tgtgtgcctg taatctgagc
tactcaggag 3000gctgaggcac gagaatcact tgaaaccggg aggtggagtt tgcagtgagc
cgagatcgca 3060ccactgcact ccagcctggg caacagagcg agactccatc tcaaacgaaa
aaaaaaaaag 3120aaagaaaacc cagctctaac tggcttaaac agtaagaaga tctattatat
tatccatctc 3180aggcagcagc aagcccagag gtaggggact ccaaggttgg ttgatccagg
gcttaacgat 3240gtcatcaaag acccaggttc tttctgtctc ggcacctctg tctgcagggc
cagcttcatc 3300ctaagccaga ttgttcttgt cttgattaca agttggctgc tgggccagca
gacgctgcct 3360gcctccctgt tcatcttcag aagtagaaag tggcccttcc ccagtcatgg
aatgaaagag 3420tttcctttct gtctgggatt gcttaggtcc acccacctga agccaatgac
tgtcaccagg 3480aaggtaatat acactgattg tcttaagtca gggttcctga gccagtcttg
ggcaaggagt 3540gtgatactgt catgattgtc ttgggctcat cagggcagct ctgcagatga
gatcaaactc 3600caagctacat tattctgaac agtgggaagt aggaaagaga cattttggga
gatacaaaac 3660acaatgtcta tcccatatcc ctaggtccag gtcacagtgt cttggttgga
catcaaatgt 3720agaaaaagaa agactgtcca tccatttatc tacctattca tctggttttt
gatttttttt 3780aaattttatt ttaagacatt ctcactctgt cacccagact ggagtgcagt
ggtttgatca 3840tggctcatgg cagcctcaac ctcccaggct caagtgaccc tcccatgctc
aagtgatcct 3900cctacctcag cctcccaagt agctagaact aaaggtgcat gccaccacgc
tcagttaatt 3960tttgcatttt ttgtagagat ggggtttcgt catgatgccc atgctagtct
ggaattcctg 4020aactcaagca atatgcctgc ctttgcctcc caaaatgctg ggattgtagg
catgagccac 4080tgctcctggc tcatctgttt aataatttat gaaacaacta ctgggtgctg
agcacggggc 4140caggggctgg agatctagca gggaccaggc agatctctgc caagtcgttg
gtttcttaaa 4200ggttttgctc ataattcccc ttttcttttc tctttcgttt tttttctttt
ctttctttct 4260ttctttcttt tttttttttt gagacagagt ctcactctgt tacccaggct
ggagtgcagt 4320ggtgcgatct cagctcactg caacctctgc ctcctgggtt caagcgattc
tcctgcctca 4380gcctcccgag tagctgggac tacaggcgcc tgccaccatg cccggctaat
ttttgtgttt 4440ttagtagaga ctgggtttca ccatattggc caggctggtc ttgaactcct
gaccttgtga 4500tccgcccgct tcggcctccc acagtgctgg gattacaggc gtgagccacg
gcgcccagcc 4560agtttccctt ttcaatgagg cctccctgac ctccatactc tactcctcca
cctggcccac 4620tcagctctac tttttcttcc ccatagcact caagacctcc taacatacta
cgtaagttat 4680ttatttacta ggcttactgt gtattgtctg tcttcctcta ctagaatgta
aactccatga 4740gaatagaaat ttttgccttt ttatttagtg tggtgtctgc agcccctggc
ttagtccctg 4800gcatacaaca gtcactccac ccacagttgc tgaataagtg actaaaggtc
cctgccctca 4860tattgttatg agggagtgtg catgttgtta gagaaaaatc tgaggcacaa
taaaatttta 4920tagagtttaa gttttctttt ttaagcaatc cacgaattgg ggtagtttca
gaggtagttt 4980ttcagtcatg acgtatttca atggcttcac tgagcccaag ttctttcacc
tctctaggac 5040tttatttcct tatttctaga acggggataa cacatagttc ataaggcagt
tatgagagta 5100agggagctgg tatggggtga tgcataagga caggatagag cagtggtgag
accgctcaga 5160tgacaaagcg tcagagacca gtatttacga cggaaatgtg gaagcatgat
aaagaaatta 5220tttgggctgg gcacaatgac tcacaactaa taaaactttg ggaggccaag
gtgggaggat 5280cacttgactt gcagaaggtc aaggctgcag tgagctgtga ttttgccact
gcactccagc 5340ctggtcaaca gagtgagacc ctggctcgaa acgttatttg attggttaca
gttatacagt 5400tgccttattt ggtctattcc atttgaaagt tcctagttct ataattttaa
gtttgttggc 5460tgtttctgat tggttaagct taagttttgt tttcctttaa tacagttaag
tgccccataa 5520tgacattttg gtcaaggaca gaccacatat acagtggtgg tcccataaga
ttataatgga 5580gctgaaacat tcctattgtc tatggcgtag tggtcctgat gttgtagcgc
aatgcattag 5640ttatatgttt gtggcaatgc tggtgtaaac acacctactg cactgccagt
gatataaaag 5700aatagcacat acagttatat atagtacata atatctgata atgataatac
ataactatat 5760tactggttta tatatttact atattattta tctttatttt atttttgaga
cagagtctca 5820ttctgtcacc caggctggag tgcagtggcg cgatcttggc tcaccgcaac
ctccgcttcc 5880tgggttcaag tgattctcct gcctcagtct cctgagtagc tgggattaca
ggtgtgcacc 5940atgacaccct gctaatatgt tttgtatttt tagtagagat ggggtttcac
catgttggcc 6000aggctggtct tgaactactg acctcaagtg atcaccccgc ctcggcttcc
caaagtgctg 6060ggattacagg cgtgagccac cacgcatggc ctatttataa ttattttaga
gtgtacgcct 6120tatacttata aaaaaaagct aactgtcaaa cagcctcggg caggtccttc
aacagatatt 6180ccagaagaca ttgttatcat aggagatgac agctccgtgc atattattgt
ccctgaaaac 6240cttctagtgt ggaagtggaa gacagtgata ttgatgatag gacccagtgt
aggcctaggc 6300taatgtgtgt gtttgtgtct ttgcttttaa caagaaagtt taaaaagtta
aaataaaata 6360caaaaatttt taaatagaaa aaagctgccc aggaacaatg gctcacacct
gtaatcccac 6420cattcgggga ggccaaggtg ggtggattgc ttgagctcag gagttcaaga
ccagcctggg 6480caacatggtg aaaccccatc tctacaaaaa atacaaaaat tagccgggtg
tggtggcatg 6540cggctatagt tccagctaat cgaggggctg aggtgggagg atcactgggg
gggaggtggt 6600tgaggctgna gtgagctgtg attgnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 6660nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 6720nnnnatattc ttaaaaaaat ttttttttat ttttgagaca gaatttctct
cttgttgccc 6780aggctggagt gcaatggcgc tatctcagct cagggcaacc tccacctcct
gggttcaagc 6840gattctcctg ccttagcctc ccaggtacag gcgcccgcca ccatgctcgg
ctaatttttg 6900tatttttagt agagatgggg tttcaccatg ttgtccaggc tggtcttgaa
atcctgcctc 6960aggtgatcca cccccctcgg cctcccaaag tgctggaatt tacaggcgtg
agccactgtg 7020cctggcctcc tttacatttt tttaaattta attttaattt tttaattttt
aatttctcat 7080atatatatat ttttaagact agccaagtga agcagtggga gtggaaaagg
aactggtttt 7140gatcaatagg tgtaaacacc actgcactgg gaccagccta ttttacattc
ctgttagcag 7200tgatgagggt tcactttctt tgtagcctca acaatatgtg tcgttgccca
tctttttttt 7260tttttttttt tttttttttg agatggagtc tcactctgtt gcctaggctg
gaatgcaatg 7320gcatgatctc agctcactgc aacctccgcc tcccaggttc aagtgattct
tgtgtctcag 7380cctcctgagt agatgggatt acaggcgtcc accaccacgc ccggctaatt
ttttgtattt 7440tcagtagaga tggggtttca ccatgttggc caggttggtt tcgaactcct
gacctcaagt 7500gatccgccca cctcggcctc ccaaagtgct gggattacag gcatgagcca
ccgcgcccgg 7560cctgcccatc ttttttttgt tatagccatc ctagtggatg taaagttttt
ttgtgatttt 7620gatttgtgtt tccctactga tcaatgatgt tgagcatctt ttcctgtgct
tattggcttt 7680tggtatatct ttggagaaag gtctattcag gtcctttgcc cactttaaaa
ttaggttatc 7740tttctattac tgagatgtaa gagttcttta tgttctagat ataagtctcc
tacatatgat 7800ttgtaaaaat tttccttcca ttattgggtt gtctttcact ttcttttggt
gtcctttagt 7860gcacaacagt ttttaatatt gaagtccaat tttctatttt tctcttttgc
cacttgtatc 7920ttggtgtcat gtttaaggaa ctattgccta atctcaggtc acaaagattt
acacctgtgt 7980ttccttcttt ccttccttcc ttccttcctt ccttctttcc ctccctccct
ctctccctcc 8040ctccctctct ccctccctcc ctccttccct tcctccctcc ctccctcctt
ccttccttcc 8100ttccttcctt ccttccttcc ttccttcctt ccttcctttg tccttctgac
ggaatcttgc 8160tctgtcaccc aggctggagt gtagtggcac gatcttggct cactgcaacc
tctgcctcct 8220gggttcaagc aattctcctg cctcagcctc ctgagtagct gggactacag
gcacacacca 8280ccatgcccag ctaatttttg tatttttagt agagacgggg tttcaccaca
ttggccagga 8340tggtttcgat ctcctgacct cgtgatccac ccgccttggc ctcccaaagt
gctgggattg 8400caggtgtgag ccaccatgcc cggcctgtgt tttcttagag ttttgtagtt
ttagctctta 8460tagttagatc cttgatccat tttgagttga ttttgtatat agtgtgagat
atccacctgg 8520tgttgtaaat tgcccagaag tgggtatgct tctaaatctg gctgttaggg
attactagag 8580gtgaccaaag tgaatttttt ctttgtttct tttttttttt ggagacagag
tctccgtcac 8640ccaggctgga gtgcaatggc ttcatcttgg ctcagtgcaa cctctgcctt
ctggtttcaa 8700gcagttctcc tgcctcagac tcctgagtag ctggtattac aggcgtgtac
caccatgctt 8760ggctaatttt tgtattttta gtaaagatgc agtttcacct gttggccagg
cttttctgga 8820actcccggcc tcaagtgatc catctgcctc tacctcccaa agtgctggga
ttacaggtgg 8880gagccaccgt gcccagtcct tttctcagaa tttatttgtt tttttttgtt
ttgtttcatt 8940tttgagatag ggtctcactc tgtcagctag gcaggagttc agtggtgtga
tcattgctgc 9000agccttgaac ttctggactc acgtgatctt cccacctcag cctcctgagt
agctaggatt 9060acaggcatgt gcttccacac ctggctaatt ttttaatttt ctaggactta
tttgtccatt 9120cttgcaaagc agggtacaac atgcctatct ctacctacct ctcttccctt
caagggactc 9180cagccaaaat ccttgaggct ctcgggctga ctgtgggtgc tgttgcctga
tctgcctcag 9240tcatgctgca tgatcaaaag tgtccgtttt ctgcttcttg gaactttatt
cactttgggt 9300gtcagtcttc ctctgcagtg tcccaagaac acagaattag accaggaatc
tgtgttgcca 9360tagtgtgtgg aaagaggcag acttccaact ccgctatgtg ctgttgggtg
attgaagctt 9420aattttcttt ctatctttct ttcttttctt ttcttttttt ttttttggag
atggaatctc 9480gctctgttgc ccaggctgga gtgcagtggt gcgatctcac ctcactgcaa
cctccgcctc 9540ccaggttcaa gcgattctcc tgcctcagcc tcctgagtag ctgggattac
aggtgcatgc 9600caccatgccc ggctaatttg tgtaatttta gtagaaacag tgtttcacca
tattggtcag 9660gctggtctcg acctcctcac ctcaggtgat ccacccgcct tggcctccca
aagtgtcggg 9720attacaggcg tgagccaccg tgcctggcac ttaattttct taatacctca
attaccccat 9780atggtaaaat gggactagta atccatacct tatagcgctg ttgtgaaaat
gaaatgaggg 9840taagcagata aaatttcaga ctacggatgg gattgttact acattctgaa
cctggctttg 9900ctgttatttg ctatgtgacc ttatcttctc tggatctcca ttctttccaa
gtctataaaa 9960caaagtggac aattgtcaac ctttcttcca aagagcaatg atttaaggat
caaatgatgt 10020catttaacaa aaatatgaag agctcaacaa atgaggaact cattattatt
attacaatta 10080ttattatttt agaaataggg tcttgttctc ttgcctaggc tggagtccag
tggtataaac 10140acagctcaat gcatcttcag cctcctggat acaagtgatc ctcatgtctc
atccccctaa 10200gtagctggga ccacaggcat gtaccaccac gcacggctaa ttttttattt
tttattttta 10260ttttttgaga cagtcttgct ttgtcgccca gactggagtg cagcagcgca
atcaccgctc 10320actgcaacct ccgcctcctg ggttcaagtg attctgctgc ctcaacctcc
caagtagctg 10380ggattacagg cctgtgccac catgcccggc taattttttt gtatttttgg
taaagacggg 10440gtttcaccat gttgcccagg ctgatctaga acccctggcc tcaagtgatc
cccctttctt 10500ggcctcctaa agtgctagga ttacaggcgt gagcctctgc acctggcctc
ggctaatttt 10560ttattttttg tagagacagg ttctcactat gttgccaggg ctggtcttga
actcctgggc 10620tcaagtgatc ttcccacctc agcctcccaa agtgctgaga ttacagatgt
gagccactgt 10680gcctggcctg gaactcatta ttgaagcatt cactagtatc aactttgggg
ttacctggcc 10740acatcctctg acctacctat aagggtatca cagctaacgg agcctctgtt
tctcagaatt 10800taggcagaag cagttcaatt tatcacaaac tactctatat ccagcataag
tgcccaaata 10860aaacaattgc taaagttctt taggcattta ctgtttgtta gttagatatt
tagtcctcac 10920tacaaatctg tgatacaggt attattttta ttaaccccat tttatagaag
agaaacctga 10980agctcagaga tgctaagtaa cttgtgcaag gtcacacagc tagtaaataa
agggcagagt 11040aaagatttag tttcacattg gactccagaa cctttctact gggactcatg
ggaatagtgt 11100ggatgtccct gaccttcagt ggcccagggc tctcctgggg gaatccagcc
atagacaaga 11160caccagcgag agcccaatcc taagattttg tttgtttgtt tttgagacaa
ggtctcactc 11220tgtcaccaga ctggagtgca gtggcatgat caatgctcac tgcaaccttg
atctcccagg 11280ctcaagcaat cctcccacct cagcctcctg agtagcttgg actacaggtg
cacaccacca 11340cacctgacta attttaaaat tttatttaat taattactta ctattatttt
ttgagacagg 11400gtatcacttt gtcacccaag ctggactgca atggtgtggt ctcagctcat
tgcgtcctcc 11460acctcccagg ttcaagtgat cctcccacct cagcctctgg agttgcaggg
actgcaggtg 11520tgcgccacta tgctcagcta atgtttttat tttttgtata gatggggtct
cactatgttg 11580ccagggctag tctcaaactc ttggactcaa gcgatcctcc tgtcttggcc
tcccaaagtg 11640ccgggattac aggcataaac caccacaccc aacccctaag gtgtttttgc
tgaatgtgac 11700catgtcagag gcaggaaagg gaagcatcat ggggttagga aaggaacact
gagcagggag 11760acaaagaaaa tgggatcatt ttgtgagtgt tcgctgtgtg tgtatgtgtg
acaattctca 11820gagccagcct ctcaggtggt tgagaccaca gtccccattt cccagatgag
ataatggagc 11880ctcagagagt ttctgcagca cagctagtgg aattagaatt tgaacccggc
tcttccagac 11940tccaggtgct tcacaaccat cccaaaccta gtcatttgca gtttaccttc
atgattttac 12000catttccctt tgccatagct agtgttattt acttaataat tccttttgaa
tcagtctgct 12060taaaaaaaaa tagcttcatt ctaaagtgta atattcttgg aatatcgggt
ttgctgttac 12120ccacccccac acgttataca tatacatgta tgtttctaat acatatatat
gtacgtatat 12180acgtgtatcg ttttttgtta ttttttttgt tgttgttagt tttttttaga
tggagtctct 12240ctctgtagcc caggctggag tgcagtggtg tgatttcggc tcactggaac
ctctgcctcc 12300tgggttcaag cgattctcct gcctcagcct ctggagtagc tgggattaca
ggcacccacc 12360actacacccg gctaatgttt gtatttttag tagagacagg gtttcaccat
gttggccagg 12420tgggtcttga actcctgatc tcaagtgatc cacctgcttt ggcttcccaa
agtgctggga 12480ttataggtgc gagctactgc ggctggccaa tgtatgtttt taatacacat
tcaaataacg 12540aataactatg aaacctgaaa aactgctcca tgttacttcc tgaacccatc
ttgagtgctc 12600acatgctgtg cataccacat attgggaaac actgctttcc ctggcttcca
agcccagctt 12660aatcactgtc ccatcctatg cttcgcttta tttgtctata aatgttgggg
ttgggggttg 12720atgccaaaga ccttttctgt tgtcattaac atggacacag ctctaagagg
tcttggcatc 12780ttgggctggc tctcctttta gttcagaatt tggattttta tccaactact
cagagtgatc 12840aagccttcct tatgaatgaa ctcgttggtc aaactcataa aaggctgatc
gataaaacag 12900gaatgaatgt atgaattgac actaagtcat tagcatttca cgggaatgga
ttctccgtta 12960gtggaagagc acatgtcctt tctggcactg atgtgtgctt gggaaactta
ctgagctaac 13020tggcccatgt aacacagagg ccctttggtg cagtggaaaa ctgttgactt
tggagattat 13080cttgagtttg aatctgagcc tgcctgtaag aagctggcta actgaattgc
tttgcttctt 13140ggacccttac catttataaa atggggacca ttgtactcac cctttagggt
tattgcatgg 13200attaaatggg attctctata gaaaatattg gcacaaagta ggtgtaaatt
tgcacgctag 13260tgggattgtt tgtgagggaa attgtcattt gattatcaaa gacttaggag
caggaacagt 13320gtctaattca gggactgcaa atggaaatgc cagctgaggc caggcatttg
ctaataattg 13380ggtaaagcag ggcaggtgta gaatagcaat gtctgggaat taaaagagag
gtgaggacgt 13440gtatgacctt gagaaggcaa gccctggcaa aaggggatgg cctccactca
gctacagtca 13500tgcctagatc ttctaacttt ttatttttat ttttattttt tgagacggag
tcttgctctg 13560tcacccaggc tggagtgcag tggcgcgatc tcggctcact gcaagctccg
cctcccgggt 13620tcacgccatt ctcctgcctc agcctcccaa gtagctggga ctacgggcgc
ccaccaccat 13680gcccggctaa tttttttttt gtatttttag tagagatggg gtttcaccgt
gttagccagg 13740atggtctcga tctcctgact ttgtgattta ccctccttgg cctcccaaag
tgctgggatt 13800acaggcttga gccaccgcac ctggccgatc ttctaacttt ttaaagagaa
gcaagacatc 13860tggattttta tgtgataact cctgatttta aactggcacc caattataat
ttacaacact 13920ataagggtca acattgccag cagagcaaaa catgggtggg ggcaactgct
ggtcaccggt 13980gtgcagcctc tggtctaaaa tcatctttgt atttcttctt gctttacgca
ttgtcccagc 14040acagtgctgt tgtatagtaa atatccagta agtgggtgta gaatgaataa
accaatgcag 14100ataaacctgt agagaggccg ggcacagttg ctcatgtctg taatctcagc
acnnnnnnnn 14160nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 14220nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nntagtccca gcactttggg
aggccaaggt 14280gggtagatca cctgaggtca ggagttcaag accagcctgg ccaatatagt
gaaaccccgt 14340ctctacaaaa ataaaaaaat tatctgggca tgattgcagg tgcctctaat
cccagctact 14400cgggaggctg aggccggaga attgcttgaa cctgggaggc ggaggttgta
gtgagccgag 14460atcatgccat tgcactccag cctaggtgac ggagcaagat tctgtctcaa
aaaaaaaaaa 14520aaaaaaaaag aaaaaagaaa agaaaaagaa acaatgaatg agtgtgaggc
tcatggtagt 14580attggttcct gagagtagcc aaccttattg gtcatcccag ccacgaagtg
aaatggtacc 14640cctggcttgg gccaatgaat gaggaagaat aatggcaaat gggggtctat
gcctccaccc 14700tccaccacta gggaggtctc aagcttgaaa tccagtgacc aggtttttag
gtcctggacc 14760tggccagtcc tcctacagtc aagtagataa gtggagggtt tggtccgttg
ggctacggag 14820atagtgatca aggccgttac tctgcaatca gactcagaaa tggcctctca
gttacttctc 14880catttgtggg tcttttggaa gagcagagaa gaggaaggaa tttaggtctt
ctcaccctct 14940gggctgcctg tccctgctcc ctgagccatg gagggctggg gtggaatatg
gggaataaat 15000ctgtactttt tttttttttt ttttttgaga cagagtctcg ctccgtcgcc
caggctggag 15060tgccgtggcg tgatctctgc tcacagcagc atctgcctcc cgggttcaag
ttattcttcc 15120acctcagtct cctgagtagc tgggattaca ggtgcccacc accacgcccg
gctaattttt 15180gtatttttag tagagacagg gtttcactgt gttgggcagg ctggtctcaa
atacctgacc 15240tcaggtgatc cacccgcaca tgcctcccaa agtgctggaa ttacaggcat
gagccaccgt 15300gcccggtcct accaatctgc acattttaat tgacaagggt caccctccac
tcatgtgcca 15360ggcatagttc tgagaagcat cccacaagga tgcctctgag ttcaccctga
caagtccact 15420agctcttggc agagacatct ggcaaattca aggcttgaga catgctggcc
tctctttaaa 15480gtgcagcaaa ttttgtctag agcttggtca gttaaaattt tgatgttttg
ttttgcatta 15540atttcaattt ttaagaaatg ttgcattaaa atgttattta tcttgaatag
taaatttctt 15600agtgtcccct taatttctta gtgtgtctga gttgagagcc tcccctgcct
gattctagtc 15660cagaccctgg ggtgacagaa gactggtggg agatgggagg tgaggagggg
agtgttggtt 15720ggagaggatg atctacagag tgctggagag actctgtatg gagcttttca
tgctgcctgt 15780ttgccagccc tgaagctatg ccttgaggtt gggcaaggtg gcatatccta
gatcagagat 15840cctcaactgg ggccattttt ctccccagag gacatttgga aacatgtgga
gacatttttg 15900atcatctgcg ggggtgggga gaggggctac tgacatctgg tgagtagaga
ccagagggac 15960cattaaactt tctacaacgc ccaggacagc ccctccacaa taaagagtta
tttgacctca 16020catattaata gcacaaagtt gaggaacctt gatctagatc cacagcacag
aagaaaggat 16080gtagattttt cacacattaa agatgagaaa gcttgtgcct gtaatccctg
tgactcagga 16140ggctgtggca ggaggattac ttgagcccag gaattcaggg ttacagtgaa
ctatcatcgc 16200agcactgcac tccagcctgg gtgacagagc aagattttgt ctcttaaaaa
aaaaaaagat 16260gaggacaggc acagtggctc atgcctgtaa tcccagcatt ttgggaggcc
gaagtgggtg 16320gatcacgagg tcaggagttc aagaccagcc tggccagcat agtgaaaccc
catctctact 16380aaaaatacaa aaaattagcc agctacttgg gaggctgagg caggagaagc
gcttgaaccc 16440gggaggtgga gcttgcagtg agccaaaatc ttgccattgc actccagcct
gggcgacaga 16500gcaagactcc gtctcaaaaa gaaaaaaaaa aaagatgaga aagaggaagg
gagagaaaaa 16560agagagagag gaaagaaaga gagaaggttt tggagtcaaa aagacttaga
aattccagtt 16620cttccacttc ccatggaacc ttggcaagtt gccttctctc tttctctgaa
tctcacattt 16680tgcctctgtg aagtaggggt ggtacctggt ggagatgatg cggagatgag
ggtgaggggt 16740gtgttgcaca ctatgcccct aggatgggtg agagcttggg agcactgaac
ctccctttcc 16800cctcttgttt cttcccccca ttgtctccca ccagctccct gggatctcca
cttcactctc 16860tgggattcca ccagcaggag gctactcctg gagttaaggc gtgttgttca
gactggggca 16920ttttaggggg cataaataat aattatgcct ggacaatgga cataacatct
agggccttct 16980gaagcaaacc agggtgtggg gtacccaaac aaggcagtag gccccaggag
gcaggtccct 17040gcagtcccag cagagagcag ggcacagggt tgagaagact gagcaaactt
cattatcagc 17100tcctttgtcc cccactctgt cctggagcaa tcattctggc ctcttcccac
ttccccaaaa 17160acccagtata aaggctgctt ctggcccctg aagccagagg cactgagagt
ggaggtctca 17220gactcttgga aggtgagttc ttttctggct gcccaggcag gaccagtgta
ggccctggga 17280agaagcagca cctcataggg caaacacgta ggaggcctgt ccttaggaac
atcatagcta 17340agcagacctg tccccgcagg ggcaggagtc tgggctaagg gtgatactgg
agagcagcaa 17400cggagactgg aagacaaatg aaatttggta cctgagttat ccctcccacc
attccttttc 17460tagactctcc agctcagggt ctgttcatgg caagaggaga aagcaatctt
gtttgctctt 17520taatcaaaca attaaacaaa tattccctct atactatgtg ccaggggcta
tactagacac 17580acaaagacag ccccaagaag gacggtggag tagtgtcctc gctaaaagac
agtagatatg 17640caatgcctct tgctcctgcc ctttctcctg ctgggaacag tttctgctct
tcatctgggt 17700aagtctctcc cttccctcct catgcgtctt tccctttttt cctttttcct
acactcccct 17760ccccccgctt ttatttgcac tcatgaggcc aggaccacag ccttccctct
ttagctgata 17820cagctcatct ccggtaagat atcacttgga ctcagaactg taacctggaa
ctttctcttt 17880tttgtttgat ttttttttgt tgttgttgtt tttgtttttt tttttgtttg
ttttttgttt 17940tgttttgaga cggagtctcg ctctgttgcc caggctggag tgcagtggcg
cgatctcggc 18000tcaccacaaa ctccgcctcc cgggttcaag caattcttct gcctcagcct
cctgagtagc 18060tgggactaca ggcacatgcc accacgcctg gctaatcttt gtatttttag
tagagatggg 18120gtttcaccat atttgccagg ctggtctcaa actcctaacc ttgtgattcg
cccgccccgg 18180cctcccaaag tgctgggatt acaggcgtga gccaccgcac ccggcaaact
gtaacctgaa 18240ctttcagaag gaaaaaccac ccacctgtta agatgaaggg ctggtgactg
ccccaggctt 18300ctcacacgtg ctttctccca ccttcaaaac acacactcgt ggtgtcggcc
agaagtcagg 18360ttcttgtcca tttgtgggtg tgacccgaga gatctctcct tacctaacac
caaggaaatc 18420ctccagtctt gtcttcaggt ggaattccta ggaaagctcg agcgacgttg
ctggagctgt 18480ccacggtgct ggaactagga agctcttgac ctgatggcag gttacctctt
cttcccagag 18540aatgatgccc cccatctgga gagcctagag acacaggcag acctaggcca
ggatctggat 18600agttcaaagg agcaggagag agacttggct ctgacggagg aggtgattca
ggcagaggga 18660gaggaggtca aggcttctgc ctgtcaagac aactttgagg atgaggaagc
catggagtcg 18720gacccagctg ccttagacaa ggacttccag tgccccaggg aagaagacat
tgttgaagtg 18780cagggaagtc caaggtgcaa gatctgccgc tacctattgg tgcggactcc
taaaactttt 18840gcagaagctc aggtaagtag tagggaggct actgcggagg acctggggga
aaagagagta 18900cattcagtct tctgttccct attcatttag gctagtggtt ctcaaagcct
cgcatgcatc 18960agaatcacct ggagttgttg ttaaaacaca gctttctggg cctcacctgc
acgacttctg 19020atttaggagg gctgaggtga agcctgagaa tttgcattta caacaaatcc
ccaggtgatg 19080atgatattgt tggtctgggg agaaccaccg atttaaacaa aaggctttgg
tgttagaaac 19140gcctgtgtta aattctggtt ctgcctttta ttagctgtgt tacctgggca
agttgctttg 19200cctttcaaag ctttagcacc ttcatttgta aaacgaagat atatagcacc
aacttcttag 19260agttgtggtg agcattaaat gagataatac atgaaaagtg tttggaatag
tcactgggct 19320gtaataaact ctcaataagc ggtggttata attattatga gtattatcat
ttcctgtagg 19380attgtcctga cagctaatta agaagcaaaa gataggatta agggaggcaa
gtaggtttat 19440ttttaacctg aaaagggatg ccgggctctt gcctggagac tcagaaactt
gaaataaatg 19500agagggaatt cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 19560nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
ngaattctct 19620gttagcacat agccagaaca tctagaaggg gtggtaggag tggggattag
aggttccagc 19680tggaggcaat ggcacttgca aaggctttgt tgaagtggcg taagtgtgga
ggtggagcat 19740tcaggaaagg agagcttcag cttcagtgtg gctggagtgc tgggtgtgaa
gagaggtgaa 19800gatgaggctt ggaggctggg cagattttgc tccaaaagag cttggtgaac
tgtgataagg 19860agtttggatt ttctcctact aaggacaaca gcaaactatt gaagagttta
aatcgttcag 19920tgacaatgac acgtttgcgt tttggtggct cactcgagct gccagccagg
tagacagtgg 19980cagaagatgg aagataaagc actaaagggt gatgaggcag gaagccagtg
aggagagaaa 20040ggggacgatg tgagtgacag taaatcattt gttgggttgc tattgtgtgc
taagctctgt 20100gctaaattct tcacgtgtat tatttcagct aatccatcta acaactctgt
aaggcaggta 20160caatcgttcc cagctgaaga agctgaggct ctcaaaagct agtaacttgc
ctaagttcat 20220gcagcatgca agttgtccag ccaggattct aacttagaca ccagaggcca
cttttaacca 20280ctgctctagg actgggggaa atggtcccta gtgagatatg tgtcgagttt
catatttcat 20340tcaacaatat tgttggcctg ctacatgtga agagctgtgg aaagcgccca
aagtgagtta 20400gatccctatg agcaagtggg atgggggtgg agtggacagt aggagggctg
gaacacacat 20460aaaagggtat aagaaataac aattaggccg gccaggggtg gtggctcacg
cttttaatcc 20520cagcactttg ggaggccgag gagggtggat cacttgaggc caggagtttg
agaccagccc 20580ggccaacatg gtaaaacccc atctctacta aaaatgcaaa aattagctgg
gctggtggtg 20640cacgcctgta atcccagcta cttgggaggc tgaggcacga gaatcacttg
aacccaggag 20700gcagaggtta cagtgaactg agattgcacc actctactcc agcctgggag
acagagtgtg 20760accctgtctc aaaaaaagaa aacaaaacaa gtaggtactt tctgccatag
ggaggattca 20820taaactgcta gtcctcaggt gcatttttgc ttatcagttt taaaaatcag
agaatgtctc 20880aaagaattag gatgtcagct tcttttgaaa atttgggcca gaagcggtgg
ctcacgcctg 20940taatcccagc actttgggag gctgaggtgg gtagatcacc cgaggtcagg
agttggagac 21000cagcctgacc aacatggcga aaccccgtat ctactaaaaa tacaaaaatt
agctgggctg 21060gtggtgcatg cctttagttc cagctactca ggatgctgag gcatgagaat
cacttgaacc 21120cgggaggcag gggttacagt gaaatgagat tgcaccactg cactctagcc
tgggagacag 21180agcaagaccc tgcctcgaaa aaaagaaaaa gaaaatttgg aagatctgac
aacagttgac 21240ctgcattcct gctcggcaac agcctgatgg tggatgggca gaggctcagt
tgtctgccaa 21300acctcccatc actgatgtct tccctcgctg tcatcatctg cttgacatgt
aggcatttgg 21360tgtgtgcctt ctgctctggg tgcccagatg aattggatgc tatatgagaa
aacattctgt 21420aaatgtcttg tggtaggcaa cctcaaagat cactggggcc tccaatgatc
cctccttcct 21480ggtattcatg cctgtgtata atcctctccc ttgagtgtgt actacacctg
gatacttgct 21540tctaataaac agaacacagc aagggtaatg ggatgctact tctaaggtta
aattacaaga 21600gtgtaaagtc tgtcttgttt gtttccctct cttgatcttc ctctcattct
ctctctctcc 21660ctctctctca ctttcttact gtcttgtcct tccctttgtt tactctgatg
aagcaagcta 21720gcaagcatcc atgttgtgag ctgacctatg aagaggccca tgtggtggta
aggaactgag 21780ggcagcctct acccagcaag gaactgagtc actcatcata tgggtgagct
tggagacaaa 21840tccttcccca cttgagcttt cagatgacgg cagccctggc tgatgctttg
caggcttgtg 21900agagaccctg agacagaaca ctcagctaag ctatacccta tctcctgaga
tagagtataa 21960tacatgtagt tttaagctac tatgttttgg gataatttgt tactcagcaa
tagataacca 22020atacatatac catgtacata actgtttcag ttgtctgaga ctatatttag
tcattttaca 22080cctacatcaa gaatgtgtca ggcaccattc caggtacttg gaatacatca
attaacagaa 22140taggtaaaga ggccaggcat agggctcaca tctataatcc cagcactttg
ggaggcccag 22200gtgggaggac tgcttgagcc caggagttga gaccagcctg ggtaaaatag
tgagacactg 22260tctcaactaa aaaaaaaaaa aattagttgg gcacagtggc acatgcctgt
ggtgccagct 22320gctcaggagg ctgaggtggg aagatcgctt gagcccagga gtttgaagct
ccagtgagcc 22380acggtcacaa aactgcactc tagcctgagc aacagaaaaa gaccctgtct
caattaaaaa 22440aaaaaaaaaa aaaaggaaag aaagaaaaaa ataggtaaag atccttgatt
cttgccctct 22500tggaacttct attctagagg gggatggttt ttcacagtag aagtctgtgt
tgacagcgct 22560gtttaaagct ccttcagcat ctggggaaaa ggttnnnnnn nnnnnnnnnn
nnnnnnnnnn 22620nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 22680nnnnnnnnnn nnnnattttt tagagatagg gtcttgctat gttgcccacc
aggctggtct 22740tgaactcctg ggctcaagca atcctcctgc ctcagcctcc tgagtagctg
ggaatacagg 22800tgtgcaccac catgcctggc ttatttcata tatatatatt tttatatata
tgtatattta 22860tatatataaa tatatatata atttctgtat ataaataaat aaatatatat
atatatatat 22920ttttagagat agggtcttgc tatgttgacc accaggtctt gaactcctgg
gctcaagtga 22980tcctcctacc tctgcctttc aaagtgttgg gattacaggc gtgagccatg
gcacctaact 23040gagttatttt taccacacga agcataggac atacatccaa aaatgttctg
agctgagcaa 23100gagcctggag gcaagtgaat ctgaactttc ccgtctttga agaaaccagt
ctctctccaa 23160agtcacatag ttagtgtcac tccccccaag aactgcatga gctgggacaa
tcagagggca 23220gtggaaggtc tggggctcag gggcgccccc tgctgtctcc ccagggtctg
tccccttacg 23280caagagcctc tgctccccca ctttcctgtg gagcctcctc accatgggca
tgacccagct 23340gcggatcatc ttctacatgg ctgctgtgaa caagatgctg gagtaccttg
tgactggtgg 23400ccaggagcat ggtgaggcac cgctgaggcc cctgggggtt gggggcacag
gcgggtcacc 23460ctggctgagc tcccctcacc atacgtttcc ctacccacag agacaaatga
acagcaacaa 23520aaggtggcag agacaggtag ggctatgaaa gcagggccct ggctcacgcc
caccccactg 23580caacccgctt ctcagggggc gggactcctc taggcctggg cccacccagg
taaccctttt 23640gtgggatgta agagtctggg ttcagaggaa ggctattttg gtgctctctg
gcctccgctg 23700gaaggggtga tagtgtccac tgagtgccag ttcctgaccc cactgccctt
cccatcctgc 23760ccagttgggt tctactcctc cgtcttcggg gccatgcagc tgttgtgcct
tctcacctgc 23820cccctcattg gctacatcat ggactggcgg atcaaggact gcgtggacgc
cccaactcag 23880ggcactgtcc tcggagatgc caggtgacct gcctgtacag ggatggtgac
agcaagtggt 23940caggcagtgc ttttcatttt ctctgtgcgt ttacatccag cagcttgttg
ctttctccca 24000agaaccctag gagatcaggg gtacctcccc attttacaga tgaggaaact
gaggctagga 24060agggacctgg cttgcttaat aataagaata gctaatgcag agtgctgact
gtgcacttgg 24120caccttgcct tgtttagtcc tacaacacct ctttgaggta gatgcgttaa
tatcttcatt 24180ttgcagttga ggaaaccgag gtacagggtt gcacagttag gtcattcacc
caagatcaca 24240cagctttcag tggcagcctc cagaacctgt gttataaggg tacacgctaa
agtcttgtta 24300gggctagaat aggtagagtt ggtatattag atatttattg ctgtataaca
aatcacccca 24360aggcttggca ttttaaaaca acaaacactt ctcatctcat acagtttctg
acagtcagaa 24420atcagggaga gactcagccg gctgattctg agtcacagtc tctcatgaag
acatagtcag 24480gctgtcagcc agggctgcag tcatctgaag ggctgactgg ggttggagaa
tctatgtcag 24540ttcaattacc cccatggcct ctccataggg ctgctcagga cacagcacct
gctttccctt 24600gagcaagagg gctaagcgac agagaccccg tatcttctct cacataatct
cagacgtagc 24660ataccatcac ttctgttacg ttctattata ggcacagagc aaccctgata
tactgtggaa 24720ggagactgga caaagcaggg gaataccagg aggcaggatc cttgagggct
gtcttgttgg 24780ctggagacca ccattgaggg tttttttttt tttttttatt gagacagtct
tgctctgtcg 24840cccaggctgg agtgcagtgg cacgatctca gctcactgca acctctgcct
cccaggttca 24900agcgattctc ctgcctcagc ctcccgagta gctgggattc accatggagt
cttgaaccca 24960gattctgtga ctgcttttgc tctttttgtg ttcatccaaa cagtccctgt
ttatcctaag 25020aggatgggag aaagagactg ggagagaagg aaatccagtg gcctccctcc
ctgctagcag 25080agcctggccc tggcactgag ccttcctcct ctaccctctg ctcctaatgg
tgagggtccc 25140ctagcagggc ccttctgtcc aggacacatg ggccgcctgt cctcacccca
gcctactgac 25200ctctctcctg ggctggcctc agtgcccttg attgtgccgg agagaggaag
cgctggacag 25260tcaggccaag ctgctgtccc caggagggca tctgcttatg tctagggcag
ggacaccttc 25320ctgaggactt ctgatgagag acggtgtgag agcttcccac ttcccacctt
ccttcccatc 25380cttggttctc aaaccttcaa gtgtgcatga gaatcactta gtgggggata
tttgtccaaa 25440tgcagatttg cagatatccc cgctgagatt ctgagggccg agatgaggcc
tgtgaatctg 25500catgttaaga aagcacccgc tttgatgcgt gtgtcattgg gtaggggagc
aacactttga 25560gaaacatgga gctagagaac gtgggtttct atgggtttcc catagaaaca
tggatttctg 25620tgttttctgc tgccctgaca tcgaaggcac atctgaaggg ggaggggcca
ggccaagaac 25680cagggagtcc tgggaacgta gaggcagcag ccagtgactt cccgtactcc
tcagggacgg 25740ggttgctacc aaatccatca gaccacgcta ctgcaagatc caaaagctca
ccaatgccat 25800cagtgccttc accctgacca acctgctgct tgtgggtttt ggcatcacct
gtctcatcaa 25860caacttacac ctccaggtac ccaccttcat ccttcccctc tccctgcctc
ccgaggctcc 25920tccaaaggga tggtccatcc agcacctgcc ttccaggaag cgcagttctg
gtcttctgat 25980ctggatctat tttccgggtt ctccaggaag tgtttctagt agattgggtt
ggcgaggggg 26040tgggaattga ggcccagttg gcctcttcgc cctacccctc cttcctccag
cctccacaca 26100ctctcctaac ctcttcactc tctctttttg gttttagttt gtgacctttg
tcctgcacac 26160cattgttcga ggtttcttcc actcagcctg tgggagtctc tatgctgcag
tgtgagtctg 26220ttgggctgaa atgccttcct gagctttgca accgtgatca gagaacccca
gggaagggtt 26280gggagggccc caggcatccc ctaatgcacc tctctctgag accctctgat
ggcagggagc 26340tcacttcctt aaaggcagcc tatcctgctg taattgactc cccctgttgg
agtcttccct 26400tagaggaagc tgaaatacct ggcttgatga cactttggtt ctatgtctgc
tgtttgaaac 26460ggcccccaga atggcctccc ctccatgccc accctgaaga aatttcccaa
gggcagccat 26520ttgccttata attttcctct tcatgttgga cagtccccac ttgcatctct
ctcctggttt 26580cccctgctgg gcgctgctga gggactctcc cctgtgtatg tgatggagta
acaggacatt 26640acaataatga tgacaaaatg acaaccatta tcaagtgctc cgttggtgca
ggcagcaggc 26700aggatccttg accatcactc cctgagttca gcctcactgc agcggtctcg
gcagagggca 26760gctctctttc cttcatctgc tcaagccaga accctggagt ttccttgatg
tttctctccc 26820tcacactcca tgttcactcc atcctcagta cagccagcag cagcttctac
acaccccaaa 26880tctgaccctt cttgtcacct ccactgctgc ctctccagtc ctagccacca
acatctctag 26940cctggattat tgtggcagcc tttagtctcc cacatctgcc ctggccccgc
tgtctcagtc 27000tatttttaac acaggggctg cagtcacctg tcaggacata agtctcttca
catcactctg 27060tggtgtcctg tctcatctgt ctcagagtaa aagccaaagg ctttactatg
gcctaaaaag 27120ccctgcaagc tctggcccca gcacttcact cccctctagc tccccctcct
ccattgttca 27180ctctgccaca gccacagtgc ttcctagtgc tccggaagtc tcaagtgtgt
tccctgcttg 27240gcatctttgc atgtactagt ccctgtttct agaacattct tctccagata
tctgcaaggt 27300gcccaatctt accttctctc cttcttcagg tctttccctg actgtcctct
tctcagtgag 27360gcctcccttg gctgtcccat gtacaattgc aacctcccta ctgcccgctt
ctctgcttgg 27420tttttctcag cgtttatcac taacactctg cctatctctt gcttattgtc
tgaccgccac 27480ctgctccatg ggaatgccac ctcctcgatg gcaggaatct gttgacttgc
ttgatcgtgg 27540tatctccagc acctagagca gtgcctggca catagtaggt tctcagctaa
atgtttgttg 27600acagaataca gtggacagtc ctgcgaggtc aatgccatcc ctgttattag
tggaggaagt 27660ggggctcagg gagtttgagc cacttgccaa tatcacacat acaggaggtg
tgagaaccca 27720gctcagtggc cctgaagttg gagcatttgc cctcaaggct ggggaccaaa
gagcccatgc 27780aaagagcccg aacgcttaag caccaccctg cctggccagc ggggnnnnnn
nnnnnnnnnn 27840nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 27900nnnnnnnnnn nnnnnnnnnn nnnncccact gcgcctggcc cattactttt
aatggcaaaa 27960accacaatta cttttgcacc cacataaata gttaccatgg gctgagcatg
gtggctcagg 28020cctgcaatcc cagcactttg ggaggctgag ccaggcggat cacttgaggc
caggagttca 28080agaccagcct ggccaacatg gtgaaacccc gtctccacta aaaaatacaa
aaattagctg 28140ggtgtggtgg cgcgtgcctg taatcccagc tattcaggag gcagaggttg
cagttcactg 28200aaatcatgcc actgcactcc agcctgggcg acagaatgag actctgtctc
aaaaataaat 28260aaataaataa ataaatattt accatgtttt gaccacctgt tatgtgccaa
ctgtattact 28320taaaaacacc catgggaggc tgggcacagt ggctcacgcc tgtaatcgga
cactttggaa 28380gggcaagcgg ggaggatccc ttaaggccag gagttcaaaa ccagcctagg
taacacagta 28440agccctgtct ctacaaaaaa taaaaaaatt aactgggcat ggtggtgtgt
gcctgtaacc 28500ccagctcctc gggaggcaga gggagaggtt cgcttgagcc cagcagtttt
aggttgcagt 28560gagccaggac caagacacta cactccagcc tgagtgacag agcaagacac
tgcctctaaa 28620caaacaaaca aacaaaagcg acctgtgggt aggtaggaac aggctcatag
tacagatgag 28680aaagcagagc ttggagggct caagcgattt gccaagcaga ggtccaagcc
gaggtctctc 28740tgaatccaaa gttaattccg tctatcatat caccacagcc ctctctgccc
cagggagagt 28800ctctgcccac tccagccact cacgtgtaat tgacttcctc aggggcagga
aaggcttcga 28860tgggccagtt gagggtgcag ttcagaaaga taaggcaggc caggccagac
caggtgaaca 28920tgatgaccac gaaggccaca ccggcatcgt agatcagctg tgagaggagg
gggcaggccc 28980gtgggggaga ctgcctggcc ccagacccca ccaaggtaga tcccaggcct
cagaggcctt 29040aaagaagttc tcttctcccc ttgtccttgt gcccaatttg cagatgagga
aaccaagacc 29100agaagtttag agtcagactc agaagaccca tcattccttt ttctttttca
cttgaggccc 29160cctagagagc tatgaaatag tctccacaaa gcctgaagtt gctggccact
ggctcaaaat 29220atctctgaaa tttccattat cttaaaaaaa tacatacatt tttgcctatg
actccacaaa 29280cattcatgtt catgttcgca caaaaatgtc catttcatag tacgtacaaa
ggaaacttag 29340tgctctaggt ttaccgggcc taatcgtgtt tatcctgccc cttcctggca
cattccccag 29400gggaaaaggc aaacccagac tgctcatgct cagccttttc tcacctttcc
caggtcctcc 29460cacgtgcaac aactgggggg gttggggaga gggaggtgca agtgctctgc
ccaagggctc 29520tcaaccccag ggcaggtaag ttctcaattg aatgagattc tgtgcaaatg
tgtcagccct 29580tcttatggaa gaagctgatg caccatctgt cctcttgtcc tccccatacc
atctgaccag 29640gataattaat gtctgctctc ccctcaggct cctgctcaaa cctttttctc
tgcagtcttg 29700gaccttggtg ccttttcctc cctaggggca ggacagagct tcaaagggcc
acacccccaa 29760atgtgtggag gtaagatctg gctcttcaaa cactacttca gttgaaaaga
agggagaact 29820gcccaccctc catgcctgcc caccagaaca actgatggcc cccccaccca
tgcgctctct 29880caaactcctt tggagacact gagcaaaagt accttcttta gtactctttg
taaagtgcaa 29940aacggtatgc agtttggtac tgcccaccgt ggaggttgag gagcatggca
tggctcaaag 30000ggtcctttga tatttgacag aggaaattga ggcccccatc ttgcactgag
ctaaaacttt 30060ggtcccctgg cttcgaggta caccaggttg acctgtccag gatccagcct
ggcataaact 30120cactttgtga ccttggacca aaccacccat cctctctgga aggtgtggaa
aaatgtggcc 30180ccaaaggctg aataaagcca gagagtcagg gaccttgaac gcatgtgaag
gggctggact 30240tgattctgta ggtgaagcta aaccactgaa ggtttttcag cagtgtgtga
gccagttccc 30300catctgagat ctttctggaa gtcacgtgag tgacagagta cagagaaaaa
gaatcagagg 30360cagggagacc agctgagaaa gcttgctgtg gcccaggaga gagggggaag
gcctgcattg 30420ggatgatgac agagaaagga gagcggagaa gtcagacccg tgggtcagca
ctagctgctg 30480ctcactcggc cccacccggt tcttgtgtca agacaaaaag aaaacccagg
tggcctcata 30540ccttgattcc tgggaacgta atggcagaag aggcgtaaga gccaatcatg
agggccatta 30600acgtggagcg caggttccca aacatgttgg gcagctgagg agggaaagca
gcacccatga 30660ggtggggaca ccgtgaccct tgcccagcat tcccagccct gctccataca
atagctccag 30720gagacgcagc agaaaagccc caaggtaaaa caaacagaaa aatcaatgtg
ggaaactgta 30780ctctgccccc tgcctacaca gtcacagtgc cctttagctt caaaaaggct
cccagacacc 30840cctcagagag acattttgtt aattttgttt aattccaggt ttcccaagtt
tgttacgtaa 30900cacctctgaa aaacacatgg aataggtgct taagaaacac tgatcttggc
tgggcgcagt 30960ggctcatgcc tgtaatccca gcactttggg aagccgaagc tggtgggaag
cttgaggtca 31020ggagttcaag accagcctgg acaacatggt gaaaccccat ctccaccaaa
aatacaaaaa 31080ttagctaggc atggtggcat gcgcctgtaa tcccacctac tccagaggct
gaggcaggag 31140aatcgcttga acctgggagg tggaggttgc agtgagccga gatcgcacca
ctgcacttta 31200gcctgggtga cagagcgaga ctatgtcccc accccccaaa aaaaaagaaa
agaaaagaaa 31260gaaacagtga tcttgtccaa cccatttgag atgagacaat tgagacccag
ggaggaaaag 31320tgtactcaag ttcacagagc acattaatgg ctttctcccc attgtcgttg
tcccagccct 31380aacccaaggc tgtgaccatg gctgtgtccc ggtaataggc agtgcctctt
aaccctctcg 31440gttgacgtcc cagcccagtt tctgcctaat caggacaaat cacatcctgg
gaggtgaggg 31500tggaaataag ggagggaact gagccagggc agacagtctc cagaggaggt
ggctctgacg 31560cagagcaggg tcagaaccca caccaggaga gaatttaatt gatcatgtgt
tccactcacc 31620tgcctcagcc aagccctcag ggcaggggaa ggcaaagtca ggatgccctt
cgcacacacc 31680ctcctctggc cccaccatcc tccccaagtc actagatccc acagctgaga
aggaccttag 31740gatccgtaca aagcctaaac acactccaca gagggggaaa ctgagactct
gaagggaggc 31800ctcaacagct ctggtaaaaa aggcgtttag gccgggcgca gtggctcaca
cctgtaatcc 31860cagcactttg ggaggccgag gcgggtggat tgcctgagct caggagttcg
cgaccagcct 31920gagcaacacg gtgaaacccc gtctccacta aaatacgaaa aaattagccg
ggcgtggagg 31980cgtgcacctg tagtcccagc tactcgggag gctgaggcag gagaattgct
tgaacctagg 32040aggcagaggt tgcagtgagc cgagatcgcg ccactgcact ccagcctggg
cgacactgcg 32100agactccgtc tcaacaacaa aaaaaaaaaa atggtgttta aacacatata
actaaattat 32160ccttccccct tcccctgaag tggctggctc aggaaaaacc tctacccact
caggcagagg 32220ttttcctgca ccctgcatcc gtgaggcacc actgccaagg acgccaggga
aggctgccag 32280gcctggagag gggcagggcc ccctcccctc caaggggcca caaacgctgt
ctgcgcccag 32340taccgtgggt aaggcgaggc cggccggcta accccgggct ggcggccttg
cagcgtgcgt 32400ggcaacagca gctgggcccg caagactcag cacgggacgt cctcgtccaa
gtctgggcca 32460agagcagcgg cccagggggc ggggccggcc agagggagcg gggagaggct
gaggggcggt 32520gccagcgccg gaccctgcca ttggctggag attacaggag gcggggacat
agcagggagg 32580agccgctgga caagccccac ccggccgcca gggagggtct gaggtcaaga
gccggagaga 32640agggatttag ggccctgggc caagttgcac agcagggaga aggggctgcg
cagaggggcg 32700gggagaaagg gatccgcttc cttcctttag agctgtgaaa tgtccccggt
tggaattaaa 32760ggcggctgct ggggagaggt gaaattcagc caaaaccacc cagtcaggca
gcccttctca 32820gagataaaca gtccgagcca gcccggccag gaaccttccc ctccaacctc
cctaagcctt 32880taacactcct aagcctttaa cgcgtttaca cactcacata aataaacaca
ctttgagcaa 32940cacacataca ccactcacca catgtaatag gtcaagccat gtgcacgacg
aggtgtcgac 33000aatttcatat ggttcaacct agtacactca caaacacacc taccaactca
tggctttcac 33060agggacgggg tcacacaccc actctcccac gacatggcaa gcgtgcacac
gctatctcaa 33120gctgctccct ccccctcaag atcatgttac ccagttttat tttcttccca
gcacctatga 33180cgactgacat aatttattag tttacttgtt tattgggtta tctgtgcccc
tcacccccaa 33240aatgtaacct ccagcaggga ggatgactcg gtcagtcctg attgtgctgt
agtccaggac 33300ctagaacaga gctccatgga cattcatggg ctctgtacac acaaacacac
acattaacat 33360acaccccgac acacagcctc atccacacac acacagcctc acacctgctc
tttgcagcca 33420cctgcacagt ttctcacaca ctcacttgat ctagtgatct gcgtccacag
gcccctcccc 33480cagcccactc atactgccct caccccactc actctgccct caccccactc
gggggaactc 33540tgctgccagg ccaggcctgt gacactcacc gtgagtgaag tgaacgttag
gcagatgcca 33600ccaaagccat tcagggacag cgccaggaat atcaacggag acagagctgg
aaaggggaaa 33660gcagcagatg agggcatttg gggagctgtg ggaagccaag ggcgggagct
ggggtaaaca 33720tccgccttca tcccacctat tcttttcttg tggggccaca agaggacaga
caactcacct 33780tccacgtccc gggaggccag ggccatgagg gtgcaggacg cagtgaagca
ggcactgtgg 33840agacacaggg aagggcgagg ggttggcctg tgagcacccc ccctcccctc
cccctgcagc 33900acggtccctg tcctcccgtt ccccatagcc cagccacctc acctgccaac
cagccgcacg 33960ggtcgggggc caaagcggtc catgaggatc cccagtggca gggtggtggc
gctgagcacg 34020aaggaaccaa tggtgaagcc caggttgagc atctcgtcct gctggtcaca
gcctggccac 34080ctgcgctgct catcctgggt ggtgttggtg ctgctctcag ctgaggaggg
ggaagggagg 34140gctcagcaca tgacaccagg aacagctggg cacaggagac agcagcccac
agtcaggcgg 34200cctgctttca aatcccatgc caagtgcctt tgggggtacc ctagagtcac
atctcctctg 34260atggggctgc tccagaaatg gcagccatta gtacctgacc ctgggagagt
cttgtgcaca 34320cacagcctga ggcttcaact agctcaaatg aaatactgga cataaaagta
tttactaagt 34380tgtaatatgc actcagtgtc caagcttagg gggttgtgga cccccaacaa
gaagtgcccc 34440catatctaga ggcaaaggca aaggcagtga gtggtactct aatggctata
acaagaattc 34500attaaaatgg cccggcgtgg tagttcatgc ctgtaatccc accactgtgg
gaggctgaga 34560caggcagatc gcttaagcct acaagtttga gaccagcctg ggcaacatgg
taaaacccca 34620tctctaaata aaaaaaagaa atttagaaag aacactaaaa cttagaggaa
gctttcccga 34680taaatgatag tctgataaaa taatagctaa tacttattga gcacttaact
atgctccagg 34740cactgttata agtcagttaa taaagtatcc cgttccctag gtgatgaagc
tgaggcacag 34800aatgagaacc aggcactgcc ctccagtccc ctctagaagt ccacttggag
gacttgtcct 34860taacggtaaa ctgccaactt ggagttgtga caagttaagg agaaaagcta
gtgataggag 34920acaaagggct gcttcgcttt actcaatgct cannnnnnnn nnnnnnnnnn
nnnnnnnnnn 34980nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 35040nnnnnnnnnn nntccctccc tccctccctc cctcccttcc ttccttcctt
ccttccttcc 35100ttccttcctt ccctccatct tctccacacc tggtatcatc atacagaagc
agagaggact 35160gcacttggtg aaagtttcaa ttctcctgtg tggagaggtg agcactgagg
aaggggtggg 35220ggctgtcaaa ggagacttac ccaatctttc cagcccacca atcccttgcc
cagtgtttct 35280ataaaataag ggccttttgc atctgattta agtaggaagc tgattcctga
gcccctcaga 35340tctgctgaat cagatagcta agggggccct ggaatctgca ttttagcaag
cggaggtggg 35400ttatgaagca ctccaaagtt tgagacacgc ctcaaaggtg gagtggttct
gtggggggca 35460gaaaggaaaa tgcaaagggg gaaggggtca cacttgggga aggtttcaga
caataccgag 35520tggaaagggt gatgccaggt gtggggagta acagatagag gaggcaaagt
gagtggagac 35580caagccagac cggggaggag ggggccacag ccaaggtgag acaggtcagc
agccagaaac 35640cgaagcagac acttgcaggg tgcaccccgc cctctcttcg tggcaatctg
agaccgagga 35700cgtggagacc ctggagagcc cccaaccttg tttctggggg gtgggtcaga
gaggaagcct 35760ctcatccccg ggcaccagcg gccttcccgg gaggctcaac acgcagatac
ctggataggc 35820gtccatcact cccccggcca gagcccacca acgctcctcg aggtccgacc
ttgtccctcc 35880ttctacccca cagtccccag tcctagctca tctgcataaa gctccaatta
acatgttttt 35940cctttgctat ttgcgatccc agaactcgtt ccccaccccg agcccgtttc
ccgccgcttc 36000ctcgcccctg ggagggcggc cccattaacc ctcgcgaccc gggccgctcc
tggcggtcct 36060gaccccgcca ccccgtcccg cggcgggggt ctgggggtga ggggcgcgcc
ctggggcaga 36120ggattgcgcg gcagggtctg ccacagggca gaggccaggg ctctccggga
aaaaggcagg 36180cgcatatatg cccccctttc tgggaaaaga cggggagggg ggcttctcct
gggagactcc 36240aggcttcgaa attcctcgtt ccctatcctc cggcccccgc acccctcctc
ctccccgcca 36300cgcaccctct cccctcccca gccatctgtt ccactccgca gcgccgcgac
aaacacggct 36360ccagctcgct tccgcccctg cccagccccc tccccaagcc ccggggagtg
ggggagtgag 36420cagacgccct tctcctagga ggccggaatt tctgcctcca tctcccaccg
gggtccggct 36480ggccagaggc aagcttcgag accccccacc aaccaccacc accgttgcga
gggccggtga 36540ggctgcagat aacgcttgca aggacgggag tcggggaggg tgtagggcga
gtttaaagga 36600cgggcagagc aagccccggg aagaggcagg ggttttccct cccgggtcgc
cgccccccgc 36660accctcggag ccagccgcag ccacgcagcg ccgcctgccg ggcacaccaa
ggacctggcg 36720cgcacgtggc gcttaccccc acccccgggt ccgctcctgg ctcgcgctca
gcctccccag 36780actattcgca aattgaggat cccggacaca gagtgcagag accccggcaa
gcctactgaa 36840agccagccga acccgctggt gggtgctagc caattctgat tttgtacttt
acaaaaacaa 36900aaaaagtcag tgttggaagt cgggagtctg ggctcagagc agcagggatc
tgcgatgtga 36960ctttgccaag tctccagacc cctgaggaca ggttttccta tctgaaaacg
gaggggacag 37020tctctcttat taacttctca agagaaacaa agacaaaggg agggaaaatg
gcttagctgg 37080aatgctgtct tacagagcca acctttggag gtgggggaga tggccaaggc
ctctgaggtc 37140actcttggcc ccaggagcag ctgagaaccg gaaagaagct tgggacctcc
tttctgcaga 37200gctatccttt ccacagactg ccgaggttcc aaattgagct ccaccaccta
acactgtgtg 37260cccttgggtg tgtgccttaa cctctctggg cttgtttcct acagcgacaa
gaaagaatga 37320caacaccaac ctcttaggct atagtttgga taaaatgaga tagctgtgta
gaacagacag 37380atcctaaacc aatgttagtt ttcccttcat ttggggactt gctctaacct
ccagggctta 37440tgtcccagag gcacaagcag gtgcagggct ggataaataa ggtatgtctt
tctgcaggat 37500ctcttgtcct cactgatggt gtcttctctt gatatagata attttaaagc
ttcacgttat 37560ttatttattt actttaaagc ctcactttaa tgttaaaggt aaatgtaaat
atagtataac 37620aaggaagctc aaaatttgca taaagtttta agataaaata ggagactcca
aaaaagtgtt 37680actttcggca ggccctaggg atgctatggt gggaagtttg agtcatacct
tagcattctt 37740tctaaagcat tctgtcctaa tcctctgtat ggagaaaagc cagcttcctg
gatgtacccc 37800aaatcctggg aagtaggggg caggagctgg actccctcca agcactaagg
gcagggcatg 37860gttgggaaca gggaggtgag ccagacagcc agaggcgaac gggctggcat
gccaagcgtc 37920ctagttaatg cccagctgag cctgggtgaa gaaggatggg ggtgtgggga
agacaccccc 37980caccaaccgc caaagacagg cgcacaccag ccagtctctc acttcccttt
ttatttcctc 38040taagacttgc aagcagcagc accagagagg gaacctgccc tcctggccct
ggaaggggcc 38100gacccccaac ccctaaccca ggacacagct ggcacctcag gcccctttcc
ttctgaaagg 38160agggctgtgt ctctctcaca ttcacacata cacagacaca tgcatgtgtg
cacactcatg 38220gcacatggga cctcaggggt agcctgtttg ccgatccccc caagaggtac
caggaggcag 38280accgctagaa ggagataaga ggcaccctgg tctcctccaa cccaaggagg
aagaaagctc 38340aacccctcta ggatagggac tgtcttcagt caatggagcg ttgacttagg
gggcgttttt 38400gaaggttttt tttcctcctt tttgcagtct ttacaaaaat agaacttctc
ttggtattta 38460taaatctacg gccatggctc tatgtgcatg ttacaggtag aaaagccata
tggggcactc 38520cttttggttg ctcaggcctt gattgcctgt catccaggtc ccttggtctg
agaagtctat 38580gcggtcacct cagagccgct aagcaccttc agtgggccca tcccattggc
ggcgtactcc 38640tgctggagcc gggcacggta atagaagagg taggaaggca acaggaatcc
caggagtgag 38700aatagcagga ggcccagatt cacctttagg gcaaggagag agaaacagag
tcaagtaggt 38760agtcatctgc ccttagcctc ccacagggag gagaaggcgg ccatttttct
ccaggtcctg 38820agccagaata aatacagcta gtacttatta tgtgtagtca ttgttccacc
agtatctcac 38880ttaatgttca gcaattctgc aaagtggctg agatgagact tctcaggtat
aacaagtggc 38940agggcctggt gggtgcccac accatatggc actcactagg taggtatgag
gaaggcacag 39000cactgtagga gtctgggctg gtcaggctgc tcccgaaatg gggccttctg
ggctcacccc 39060tctgaccttt ggagatgtta accaatggga tcccgttcag ggtggcgaga
ggaggctctc 39120agacacagtt caaggaactg ggatgcacag cctggtggac agaaggcttg
gaaggcccag 39180gacacgcggg ctctgactcg gttcacatcc cactctgcat tactcactgt
gtgactttgg 39240gcaaataatg gcaattctta ctgagtgcct ccttctcagg gctgttgtgg
cgaagatgta 39300agttaaaaaa aagtatgcat catgcttagc acatagtgag tgcttggtaa
atagaagcag 39360ttatttcatc acaattcttt gggaggaggg tttacgtgtg ggtggcccca
cagggcagat 39420gaaagatcag cgtcagggag gcagatgagt tcaatgtaag gaaaagactt
actaacagca 39480gcagggctgc ctcgtgcagg agtgggtgcc ctaccactga gggtatctaa
gctaagaggg 39540aagggtcccc tttcaggggt gctggagaca ggatcccaca ctaggtagaa
ctggattgga 39600ccaatggtgc ctgaacacag gcccaagagt caggactggc cacttcacaa
agcacctgga 39660gtttactaaa aacagactcc taggaggtca ggcactgtgg ctcacgcctg
taaccccagc 39720actctgggag gccaaggtga gaagatcatt tgaggccagg agtttaagac
tagcctgtgc 39780aacatggcaa gaccctgttt atctgtacaa aatttttttt taaaaaatta
gccaggtatg 39840gtagccatca cctgtggttg cagctactca gaaggctggg gccggaggat
cgcttgagcc 39900caggaatcag aggctgcagt gagctgtgat tttaccaccg cactccagac
tgggcaacag 39960aacaagacac cttctctaca aaaaaaaaaa aacaataggg ccgggcgcgg
tggctaaggc 40020atgtaatccc agcactttgg gaggctgagg agggcagatc acgaggtcgg
gagatcgagg 40080ccatcctggc tagcacggtg aaaccccgtc tctactaaaa atccaaaaaa
aaaaaaaaaa 40140ttagctgggc gtggtggtgg gcgcctgtgg tcccagctac ttgagaggct
gaggcaggag 40200aatggcatga acccgggagg cggagcttgc agtgagccga gatcgcacca
ctgcactcca 40260gcctgggcaa cagaatgaga ctccgtctca aaaaataaaa ataaaaataa
ataaataaat 40320aaaataacaa taaattaaaa acaaaaacag actcctacgg tcaggctgag
atatcctgat 40380tcaggggact ggggaatctg tatttttaac actccgtgag gggttctaaa
aggcagacaa 40440cttggaaacc tgcagattag agacctctga ggtgcctctg gctgagatga
gtgagggatg 40500gcaccacata caaggcccta cccctgcccc caggagagtg gctcctgctc
cccccacacc 40560aaccctcgct ctcacccaga agggctctcc tttcaggggt cccaccatcc
ccatgaaaag 40620tggctgctga agcaaggcga acacagcact ggtgagggac tgcaggcctg
tcagcgtccc 40680aaaaggggtt ggatgggaac ctgtccccaa aacgggagat caaagggtgg
tgggggcctt 40740tcagcccagg caagaacttt ttcttttcct tcccaacatg ggnnnnnnnn
nnnnnnnnnn 40800nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 40860nnnnnnnnnn nnnnnnnnnn nncactccag cttgggtgac agagtgaaac
cctgtctcaa 40920aagaaaaaaa aatcttaaag aataaggata taaagaaaga aaatattttt
gtgtagctgt 40980tcaatgtttg tatttcaagc caagtgttat tacaaaacag tcaaaagttt
ttaaaaattt 41040aaaagtttat aaagtaaaaa agctaagtaa gctagggtta atttttttat
cgaacaaaga 41100aaaatatctt tgtataaact tagtgtagtc taagtgtaca ttgtttttat
tttatttatt 41160ttttattttt ttgaaatgga gtttcactct tgttgcccag gctggagtgc
aatggcatga 41220tcttggctca cggcaagctc tgtctcctgg gttcaagcga ttctcctgcc
tcagcctccc 41280aagtagctgg gattataggc acccgccacc atgcatggct agtttctttg
catttttttt 41340ttgaaatgga attttgctct ttgacccagg ctggagtgca atggtgcaat
ctgggctaaa 41400tgcaacctcc acctcccagg ttcaagagat tctcctgcct cagcctcctg
agtagctggg 41460attacaggca tgcaccacca cactcggcta atttttgtat ttttagtaga
gacagggttc 41520tcaactaaag agaaccatgt tggccaggct ggtctagaat tcctgacctc
aggtgatcca 41580cccacctcgg cctcccaaag tgctgggatt gcaggcatga gccaccatgc
ccagccagta 41640tacagtgttt ataaagcctc cagtagtgta cagcaatgtc ctagaccttc
acattcactt 41700actactcact cactcactca cccagagcaa ctgccagtcc tgcaagctgc
atgcatgata 41760agtgccctat ataggtgaac cattttttaa tattttatac tatattttta
ctgcaccttt 41820tctatgatta gctacacaaa tgcttaccat tgtgttacaa ctgcctacag
taatcagtac 41880agtactatgt atgggtttgt agcctaggct ataccatgtt gcctacgtgt
gtagtcgtct 41940atactgtcta gtttgtacac tctatcatgt ttgcataaag ataaaatcac
ctaatgacac 42000atttctctga gtgtattcct gttgttaagc aacacatgta taaacattta
caagaaatag 42060ctcaaatttt tttttctttt gatacagggt cttgctttgt cacccaggct
ggagtgcagt 42120ggcgcaatct cggcgcactg cgacatctac ctccccggtt caatcgattc
tccggcctta 42180gcctcctgag tagttaggac tacaggcacg caccaccacg cctggctaat
ttttttgtat 42240ttttattaag agatggggtt ttgccatgtt ggctaggctg gtctcgaact
cctgacctca 42300ggtgatctgc ccgccttggc ctcccaacat gctgggatta caggcatgag
ccaccatgcc 42360cagccattac gtttttttgg ttgtttaatt tttttttttt taagagacag
attctcactc 42420tgtcatcaag gctggagtgc aatggcacaa ccatagctca ctgcagcctc
caactcctgg 42480gctcaaggga ccctcctgcc tcagccttcc cagtaactga gactacaggt
gtgagccacc 42540atgctcagct aattattttt tatcttttat tttttgtaga gggggggtct
ttctatgttg 42600ctcaggtttg tctcaaactc ctgggctcaa tcaattctcc tgctttggcc
tcccaaaggg 42660ctgggattac aggtgtgagc ctgaaaacct tctagtgtgg aagtggaaga
taggcccagg 42720ccacttatgt tttcaagtta agcaaggttt aggtcactta tgaagcctga
ctagttttgt 42780ttgcttaagg gatctgcagg cctgacctcg gttttcattt gttttaacag
tgtctatgtg 42840tatgtgtgtg tttatgtacg tgcatgatgg ggggaaagct cagaaatcaa
gtaagccaaa 42900cacaaacatg taattataag cagggataaa ttctatgatg aagaagtatg
ggccacggga 42960gagtacttgt gccagtctgg tgatcaggaa caatgtcctt tgggaagtga
catttgagcc 43020atgccctgaa gtacggtagg agttggttag gggtgaggca gtaagaccca
gagctggggc 43080ttcctgcaca agctcagctg ggcactgagg acccagtgga ctctgctaca
gggcagtgag 43140gagcagaaag gctgaggaag gctgggtgtg gtggctcaca cttgtaatcc
cagggctttg 43200agaggctgat gggggaaaat cggtagagct caggagtttg agaccagcct
gagcaacata 43260gcaagactcc atccctgtaa aaagctttta aaaattagct gggtgtggtg
gtatgcatct 43320gcagtctcag ctactcaaga ggctggggta aggattgctt gagcctagga
ggtggacgct 43380gcagtgcgcc acgattgtgc cactgtactc caacctagga gacaaagcga
gatcctgtct 43440caaaactgaa tgaataggct gtgtgcggtg gctcactcct gtaatcccag
cacttttgga 43500ggctgaggtg ggtggatcac ctgtgattgg gagtttgaga ccagcctggc
caatatggtg 43560aaacccgata caaaaattaa ctgggcatgg tggctcacat ctgtaattcc
agctactcgg 43620gaggctgagg catgagaatg tcttgaaccc ggggggcaga gggtgcagtg
agctgagatc 43680gcaccactgc actccagcct gggagacagc gagactccat ctcaaaaaaa
aaataataat 43740aataacaatt aaaaaaaaat taaaaggcca gggagcactg gcagcctgtc
caaggtttca 43800ggtcacttta gtaaagggag aacaatggct cctcccagga cctctgggat
ctcagcattg 43860atacgacagt catggaaatg ctagggccca ggcagaccat ctcagggaaa
acaagtggct 43920ctgccctgcc ttggccactt cctggccctc tgcatgcccc agggtctcag
caccaagctg 43980ttctcagtga gtagctctca tttagtgcca gggctctcgg gcttacatcc
tacgatgacg 44040atggaatgca taaaagatgg ggctgtgata gcccagagct aggggtttga
atctcatgag 44100atgttcatgg agccctggga gggagctcag tgcaagttca tttctctttt
ttggttgaga 44160tggggctcag aggaggaagg acttgttcaa agacacacag ggagtgtttc
agtgtgggac 44220ggaggtttat ggagaaaggg tgaccatcca aggcttggac aaagatcatg
acttcgacca 44280gcaagcctca actctgtaga cttggtgggg gccaggccct cccaaacaca
cctgacaggt 44340gtctgtggtc ttggggacat tgtcgctccc cttcctgctg atgctctgct
gtccctctcc 44400catgaagcgt atctcttcgc cgtcccccat ccttgctgag agaggatggg
ttctcttctg 44460accaatactg aagatcttta gtaaagttct cttttttttc attttctgaa
agtccctctc 44520ttgagaaatc aggacaagtg agtcagggcc aggacaaaaa acagtgtggg
acgagtgtgg 44580tggctcacgc ctgtaatccc agcactttgg gaggccaagg tggcggatca
cttgaggtca 44640tgagtttgag actagcctgg ccaacatggt gaaacctcgt ctctacaaaa
tacaaaaatt 44700agccaggcgt ggtggtgcat gcctgtaatc ccagctattc gggaggctga
ggcaggagaa 44760tcacatgaac ccaggaggcg gaggttgcag cgagctgaaa ttgggccact
gcactctggc 44820ctcttggcaa cagagccaga ctacctctca aaacaaaaac aaaaacaaac
gacaaacagt 44880gtagactttg tgtttttctc aaaagcactg tcaagccagt gcccgcagca
gtgggcctag 44940acacctccag tcttgcctca gggtcagttt ccagcctccc tggacacttc
ccccaggtat 45000gtgtactttt tgattgtcct aaatccagag tctgtggcct gacctggttt
gtcacagctc 45060tcagtccctc cccatcccga atcccaggga gccgcaggtg tgtgcagaag
aggcacacca 45120cactcaatac atcttgcatc ctcgctggac ccaatccatt ggcttggtga
tgtacagact 45180gagcctcatt atagccgttc gttcctgttg acctttccag atcaatctgc
cagcttggct 45240tctccgagtt tcgcttgtca gcatttctcc aatcccatca tgtactttgg
acctctttgt 45300tgggtggctt gctttatctg aaattttcag atttgacttc aggtctctcc
tttgtccctt 45360aatatggctt aatggtggac cctgtcaggg gtagagaaaa tattgaggag
ccctgacttt 45420gaggtgcaca agttagaggg ttagacaagt ccagccacaa ccagcccaag
ctgcagtgta 45480gggaggcctg tccagctgct ccacggttga gggtggagca tacaggaagg
cttccttctt 45540gctgcagccc aggtgttctg gctgccctag ctgcctggct ttggtagaag
aaagaaaggc 45600tctgtctctg acttgtcaac taatggcact atgagattgc acataattaa
cctgggtctg 45660ctcttccaaa agccttgggc ctctgactgc aacatggagt ctgggtatca
ctccccatcc 45720ctgcgccact cacctgctct ggcgctaggc gtgtgcctaa tcacttaatt
tctctgtgct 45780gcctcttagg tatcacttcc cctgatccca aatacttacc aggtgtggga
tgacacctga 45840ctagttactc cttggaggta tctgcttctc accggggact ccgaaaccaa
acgaaaagca 45900aggccaagcc cagcctaaag gacgcttcct acatgacttc aggcttgcgg
gggctggagc 45960gtgggggtgg caatggagtt ggggggggct cagggagggg atgtggaagt
gctttgcttt 46020gcaaactcta gagaaccgtg taaataggag tgattattct gtcccttccc
tttctttcca 46080acaggaatca gcatcccaca gcccatgttc agctatgaag aatggaaact
gaggctccgg 46140gaggggtata gggaggagcc agcagggtct tgagttcata ttagtgccct
ttcctccata 46200ggcacatctg tgttttcttt tattttattt tgaatttaat tttttttttt
tttggcagag 46260tcttgctctg tcgcccaggc tggagtgcag tggcgcggtc tcagttcact
gcaatctccg 46320cctcctgggt tcaagtgatt ctcctgcctc agcctcccga gtagctggga
ttacaggtgt 46380acaccaccac acccagctga tttttgcaat tttagtagag acagggtttc
acagtgttgg 46440ccaggcttgt cttgaaatcc tgacctcaag tgatctgcta gcctcggcct
cccaaagtgc 46500tggtattata ggtgtgagcc actgcgctcg gccacatctg tgttttaaat
gagaggaaag 46560gggataatgt gcattttgtg gaagcttggg ccgtttgtgt ctaggactct
tatgatcttc 46620ataagttttc ccccagggag gacactgttc cacttaggga gtcaggaccc
ccagtcctta 46680caagattcag cctctcaaaa tggagacagc agttccaggc ctgggctggg
ttctgttcac 46740actaggagag ggcaagtgag tggtgtttgg gatgtgggga agtattatga
aaacagagat 46800gctccaattc ctagtgatag gaaaccatta agctacttgg catcttaaaa
ccaagagcgg 46860ttcaagttct gagattgtta acacacctta caacaccgcc gccgttatta
ggaagaagct 46920ctgtttgatg acgtcccaca ctgtgggtac ctttatgaac aggaatttgc
tttttcaaat 46980cccagagaag taagattaaa gttggctgtt ctccatcctt gaaaaatttg
gttttagggt 47040gaattcaaga atgactgacc atacagaatg gggagcaaac ttgggaagaa
agaaggcaca 47100gttcagagct ctcccaatag tcacccctga actgcacccg gaccatcagt
tatctctgtg 47160ggtagagctc aggaatctaa aatccatttt aaaattaaag tatatcgggg
ctgggcgcgg 47220tggctcatgc ctgtaatccc agcactttgg gaggccgagg tgggaggatc
acgaggtcag 47280gagtttgaga ccagcctggc cacatggtga aaccccgtct ctactaacaa
tacaaaaatt 47340agccaggcat ggtggcagac acctgtagtc ccagctattc ggaaggctga
gtcagaagaa 47400ttgcttgaac ctgggaggca gaggttgcag taagccaaga ttgtgccact
gcactccagc 47460ctgggcaaca gagggagact ctgtctcaaa aaaaaaaaaa aaaaaattaa
agtatgtcat 47520acatactgtt acaggcacag accttaagtg tacagcccaa tgaaatttta
cacatctata 47580cagctatata actaccacct atatcaagac acattccagg aactcagact
ccatcatacc 47640cctcctcagc agaggtaaca gacccacacc tctcctgctc cggtggtaat
taaccactat 47700tctaactttt ctatcaatta gttttgccca ttcttgagct tcacacagat
atacattgtc 47760aggcatgatg actcatgcct gtaatctcag cactttggga ggccgagacg
ggagtatcac 47820ttgagcccag gagttggaga ctactctgga caacatagtg agacccccga
ctctacaaaa 47880aaaataaatt agctggtcat ggtggtgcgt gcctgtagtc ttagctattt
gagacgctga 47940gagaggagaa tctcttgagc ctgggaggtt gaggctgaag tgagccgtga
ttgcaccact 48000gcactgcagc ctaggtgaca gagtgagatt ctgcctcaaa aaagaaaaaa
tatggccggg 48060cgcggtggct caagcctgta atcccagcac tttgggaggc caaggcgggc
ggatcacgag 48120gtcaggagat ggagaccatc ctggctaaca cggtgaaacc ctgtctctac
taaaaataca 48180aaaaaagaaa gaaaaaaaaa ttagccaggc atggtggcgg gctcttgtag
tcccagttac 48240ttgggaggct gaggcaagag aatggtgtga acccgggagg cagagcttgc
agtgagccga 48300gatcgcacca ttgcactcca gcctgggcga cagagtaaga ctctgtctca
aaaaaaaaaa 48360ggaaaaagaa aaaatatata tacattgtgt actttttggc atctggttta
ttttgctcaa 48420tatcacatct gcgaaattaa tctacactgt gtgtatgaaa ggttggttct
ttttgttgtg 48480atgcagtatt ccgtcgtgtg actacgggac aatttgctta tccgtattcc
tatcggtggg 48540catttgggct gttaccaggt tctggctgtt atgaataaag ttgctatgga
tattcttgta 48600cactacttct ggtgagcgta tgcactcatt tcgcttatgt aaatatcttg
ggtggaatta 48660cctgatcata aggtaggtgt gttggctttg taatgtgctg acttggttat
gctgaattcc 48720cttttttgtg tatttctggt tagagcggaa catgagggtg tctcttcagg
gaatctggag 48780ggtggaaggg aagcaggagt cggtttctgg ctcacacatg ttgtgactga
actgctggta 48840cacctggttg gcatggagct ggcttctcct ttggcgttgc ctactgttgg
ggcaggtgtg 48900tatgtggtta gctccatgca atgaacccgg gcttctgcaa aatacattaa
caacgacaga 48960gacaacaaaa gctgatgtgg atttaaaggc ttcagttcan nnnnnnnnnn
nnnnnnnnnn 49020nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 49080nnnnnnnnnn nnnnnnnnnc cagcagtggt tctcaactga gcatagtttt
gcctcagagg 49140ggacatttgg taatgtctgc agacattttt tgattgtcac agcccagccg
agaaggtact 49200actagtatct ttttggtaga ggctagagag gctgctaaac atctaacaat
gcacaggaca 49260ggcctctgta acaaaaaagt atccagtcaa aaatgtccac agtgttgaga
ggtttaggta 49320agtaggcgct aaaacataag gagactgtgc ctgagagcaa gaaggagtaa
ttggaaagtg 49380ctggtgtgat tagctctggg ttttagaaag ctcattttgg ctgcttgtag
acagtgcatc 49440agaggtggag gagggtggta agactggagg cagggaaagt aatttgggag
ccactgaaat 49500gatccaggtg aaaaacggtc agcaggtgac taggaaagtg gcagaggcaa
tggggatggg 49560tggctggatg agatggtgaa gaaagcacta taactaacta atgtgtggat
gatgggcagg 49620aggggtgaag gatgaccaga gtcctgcctt gcaggtctag ttggaaggtg
atggtttctc 49680ctgagaaagt gaccacaaaa agtgaagcag gtttgtgcgt gtgtgtgtgt
gtgtgtgtgt 49740gtgtgtgttg agttcagtct gagatgtgtt ggactcacaa tgtccatggg
acatccaagt 49800ggagaagcat cttgggtgac catatgtgtg agtctgcagc tcagaaacag
gcctggggct 49860ggagatgaag acttgggaat gatctgcgta tatatttggt agcttgagcc
acaagagtag 49920atgacataac ccgtggtggg tgtgcagaat taggagagac gtgcaccaag
aagccaggtg 49980atccccaata tttaaccatc tggaagaata agaggagcct gccaacagaa
attgggaggg 50040aatggccaca aaggctactg agaagggaag cagttcttaa gaagggggaa
gtgaagaggt 50100atcactactg cagaggtcaa gtaggataag aactgaagaa tgtctgttgg
gtttggcaat 50160ggggtagtca gtgggcacct gggcaaaagc agttttggtg gagcaatagg
gataacagaa 50220acaagactgc tatggtaaga ggaggaagag ggtgttgagg aagtggccag
cgagtctaca 50280ccacttgctg gaggagcttg gctttggtgc aaagcagaga agccagctca
ctcattgact 50340taacctccaa gaaacacaaa atcatccata tcctggctca aattccagca
ctaccaggag 50400atggttggcc cctagaaatg ccatcccact tctcctctgc ttatcctatc
ctatctgtca 50460gtctgttgag cccaggctaa gcgctacctc ctcaagcaag ccttctctgc
ctgccgtcac 50520actttaagtg atcctgacaa cactgaaaat gtgtgtctct tccattcatg
ttagttctac 50580acttctgagt atctcctcaa tatattgcct tgttttacta atatgctcgt
tctgtttgcc 50640ttatttatca gctaccttaa acctccctgc aactagagat tctctttaag
tatttgttga 50700ataaatgaat gaatcaatcg atgatccaga gcctggtaga ggcttgtgtc
catggtggat 50760gaggctcaga aaatacctgt agaatcgaaa taaatgcatg tgtgctctga
tctaaactca 50820gctaaacttt ctccaggggg taaagttcaa gttgattagt caattgatta
attaattcat 50880tatgtaatgg aaaaactcct tctatgacct gggcagagtt ataggcagtg
aacaagacag 50940acaaggtcct tgttgtcatg aagtttgctt tctgaaggag agagataata
aacaagaaac 51000cagtaagaaa gcaagattat atcattttgg taaatgttct tgtggaaata
aatgtgatga 51060tgtgtaacaa aagtaccaaa taggagagtg gggtgggtgg gcttctttta
gaaagagttc 51120tcggagaagg cttatctgag gaggtggcct tttaaccagt acaaatgctt
tagcttggcc 51180agtggagctg ggaccaggat gacaagggtc acttgtcatg ccagtgagtt
tgagcttgta 51240gacaagagcc tgatcatgaa agactttgca gatggtggta atgggtttgg
gttaattgct 51300actatgtggg aagactttga atgggaagca tggggacaat ggcctgtgat
acatgttatc 51360aaatatggtc gcaggggcta gtgaggtggc agcagagata gggagaagta
gacggactgg 51420ggaaggtaga agatggggca ggggaggcaa ttactgcaaa gacatattcc
ttctaagctc 51480actgagtgtt catggtctct gggagcagag gttcctggag gggaaagagg
ataatgtcac 51540ttcctgagga agcgggaaga acccatctga gacgtgggga ctgtgctggt
tcgtttctaa 51600ggggccttcc agatctcaca tgccaatcgt cttggtctat gtcaattgtt
ggggcatcca 51660aatggggaac tgttgtccag gccgatttca cagaacaacc gcccagtcca
tatctcccga 51720gccattcacc cttgcagtgg cgttagctct ttcaccagct tttatctgcc
ccgtggggat 51780gttggccaag cccagttaac aagcagttga tcagccccag agatcaggtc
cctggagtct 51840gtcacttttc tgagggtggg gagagaatcc tggagcagaa catgtaacta
gaagggccac 51900ctggcttcct atggtctgag ggagagaatg gtgggatctc tggcctgaat
caaacctccc 51960tttctcagtg tccatcttac ctctctgctg taccttcgtt attttccagc
agctcctcag 52020cccgttcctg tgggaccctt ctctgccaat ccctacaccc actgtaaatt
tcaccgtggg 52080agggagatgg gccttgaggg ctgtattagt cttctattct gcataacaaa
ttgcctcaaa 52140tttagcagct tcaaacaact catgtttatt agctcatcgt gagttcatca
gcagtgtggg 52200cccagcatgg ctaggttttc tgctcagggt ctcacaaggc taaaatcaag
atgttgtctg 52260ggctgtgtgc tcatctggag tttagggttc tcttccaggc tcacgtggtt
gtggcagaat 52320tctgttccct ggagttgcag ggctgaggtc ctgttttctt gctgactgtc
agatgagggc 52380tgctctcagg tcctcgaggc tgcccacatt gcttgccacg tgcgtggtct
tttccatcct 52440tgaagccagt gatggagaat ttcccttgga ttgaatcacc cacatggttg
gactctctga 52500cttcaggaag agagccctgt ctcttttatg ggatcacctg attagatcat
acccatagag 52560ggcagttcct tttccttaaa gtcaactgtg gcatgtaaca tcacacaacc
acaggagtaa 52620aatccatcat atttacagtc ccagggatta tgcacagtgc accaggggac
aactgaattc 52680tgcctgtcaa aagggccaag caggacttta ttggtgaaga acagtggaat
gtcattcttg 52740gttcttccag aaaaaaatca ctcagtaaag ttagaggttc tcttgccttt
tgggaagtca 52800tcaaagaatc tcatggaggg tttggacctt caccctagaa acatcacacc
atgttttcta 52860taattgcagg gttcatggtc ccttgaagcc tattcatagt ttccaggttg
aaaagctctg 52920ctgcagggtg tggggaggga tgcaggtgga ggtgagggct gaatagtgtg
agctgcatat 52980ctggagctgt ggtggttttt ttagtcttta agctgtcatg tgttgggggt
tgggcatggg 53040aggggcatcc caagagctcc ttggtattga caccatctcc aaggtgatct
ctgctctgcc 53100tggtgcacac atgtttttct cctgttgcaa cagcccactc ttgtagaaga
gcagacccct 53160cagtaccagg tctgaccctg gacagcttgt accaggagct acagcacact
cccccacaag 53220cctaaagttg ggatgagccc cccgagaatt agatcagaaa agattaaatg
cagaggtgat 53280ctgtcaggtc ccctttggaa gtgctggtat ggagaggatt gactgagtct
gtttaggaac 53340ctccaagctc tgtagtaact ttagggctag aaaggaggat gcctaagatt
caggatcctg 53400cagtgatgag tcaacatttc ttggggaagg aggcagggct gaggattaaa
cggagatgat 53460gggtatcgtt ctcttgctca aaggcactgg accccaaggc ctccagctct
tcgctcccat 53520ttgaaattca agtcctgagc acaccacagt tgtgatgcag ggaaagaatg
tgcttatcag 53580agagcctggg caagtgggcc ccttgtgagt accgttcaac ctcatttatg
tcattggcac 53640caaaagtaga catcagtctc ttgaaagttt gattaatgct ggtcacactc
aaagaccctg 53700ggtagcattc atttactaag caattactaa ataccagttt ctgtgctaaa
tgctgcatca 53760gtcagggctc ttaatggcag gcagcagaaa ctctccttgg ctgatctaag
tagaaaaatc 53820caggactgaa aggaaacgga gtagctcatg aaattgcagg aagggccgga
aaaccagaca 53880tggagccaaa gtcaggctgc agaacaggtc tagggaggat cccactgctg
ctgagaccta 53940gaccttgtgt ctggcaccca ggatgttgta gggctcagac cctggatcaa
tgtatcctgc 54000agtgcctctg tgggtactgc aactccagga actcaatctt gtcaacgcca
ccgccagaga 54060gaggccttct tggcctccat ctttttggtc actagctcca gattcaaaat
cttgaataga 54120tgcttcttct ctttgataga gcccagtcat atgcgttagc tgcaaaggaa
gctgaaaatc 54180tattaggaac ttttgtcttc aaaaatgaga ggcctgtcct ccaccaagat
ccataggaaa 54240tggaatccaa gaaaccacag gaaggggtga ggtgactggg cagctcacag
catgcatgct 54300acatgtgaat tatctcattc atttctcaca ctacccagtg aggtaggtat
tgtcatccct 54360acttcataaa tgatgatatg aggtacagaa agtttaagga acttgcccag
gacacgacac 54420gcagctatta agtgctagac ccagtcaatt tgagtctgac ttggactgtc
tgactccaga 54480agccaccctc tcagacactg ctgtatactt ccagtgaatg ttgatgaaat
tttcagggtt 54540gctaagctgt ggatttcaga tcctggattg tatgacctaa aagagagact
tccctaggag 54600tgagggtccc tgaacagtca actggtttcc aagaatgggc tccctctcat
caccttatga 54660cagtaatcct ctgtccaaca gccaaagagg tcctgtgggg agggcttgca
gatgggagtg 54720cgcagagccc agctcaaagc tcctgactag gctcttgttg agtattcctt
tgattcctgc 54780ttctgtcttt ttaaatcaat ggagacaggg gagggttatc tccatcctcg
gctcaagatg 54840aaatgcatcg ttcctcgttt ttctcattcc ttcccaatgt gtgtactgtt
aactttagtt 54900atgaaggaaa ttacagtgtc ctgtgcatat accaaggctg tccaacctcc
acacctttgc 54960tcaagctgtt ccttctactt gaaatgcctg tttccttccc ttctaattgc
atctttccat 55020ccaggtagga atcagctcct tggttcatgg agccttttct gctctgtttt
actatgcatg 55080gacttccttc tgaattagca gaggatgttt cctagcttgg tcttaaccct
tctccttttg 55140tttgacctca atttactcat cttacaaatt aggttgtaag ctaattgaat
acaggatcta 55200tgcttcactc tgattttatc tccacctgga tagcatcatt tttgacacac
aagcaggcat 55260atgggagggg agagaagttt ggtgccagaa agaactggat ttgaattcta
accctgttgt 55320ttacgtgagt acgttactta accattaatt acttcaatgt atatttatta
agtacctact 55380atgtgccggg cactgtacta agcaccaagg atacaatggt gagtaaagag
atgcagcctt 55440caccatcacg aaggaagaca gatgttaatc cattaaccaa gtaatctcac
aagaaaagta 55500aaatgactaa ctgataagga caagcccctg gagctacaag agggtgtata
cagggcatcg 55560atccaataag ggcagtgttg cggggagatc aggagccaca cagagcctgg
gttgtctcac 55620ttggaaaatg gggtatcaac cacctacctc actaggtttt taaaatcagg
ttaaatgagg 55680taatacttgc catgaacagt attttgttga ttgatgattg attgaaacgg
agtctcactc 55740tctcgcccaa gctggagtgc agtggtgcaa tctcagctca ctgcaacctc
tacttcctgg 55800gttcaagtga ttctcctgcc tcagactccc aagtagctgg gattacaggc
agccacccct 55860atgcctgact aatttttgta tttttagtag agacaaggct ttgcaatgtt
gaccaggctg 55920gtctcaacct cctgacctca aaagatccac ccacctcagc ctcccaaagt
gctgggatca 55980caggcatgag ccactgcatc cagccacttg ccatgcatgg catttaaaaa
tgttcagtaa 56040atgttaccat aatgaaggct ggtaggttgg ccaactgagt ggtctgattc
agaaggaaag 56100aagttagaca tacgtgaaca tttcctgtac ttgaagatcc tcaggacagt
gactcctaga 56160cccatcttcc atcacagtca gctgggaagc ttttaaaaaa atgcagacat
ctgaccttca 56220cgctagacct attagccaag cagaagtttc tgggcagggc atctgcatat
ttttaaaaat 56280ctttaataag gcagcctcaa aattacagat tcagcacgca tttaccataa
ccactgaaga 56340aatgcaaagt tataaaaaga agataaacaa caatctgtct cctgctttct
tccctctcct 56400cccctgcttc tggaggcaac aaggtcaact atttggtgtg attcctttta
gcattccctc 56460catcaatggt cacataagga tgctcacaga taagcaccta tgcgggggtt
ttttttttcc 56520ttgtaaaact attcacatac taaatacttt cctcagtatc ttgccttttt
tcacttcatg 56580tcacagaaac atctcttcag gtttatagat acaggtccag ctcttctttt
catagccata 56640taacattctg tagaatagag aggacacatt ttactcagtg tccgattgat
ggatatcaat 56700attgttttca tttctacaaa tagtcaagga ataacataac tctgtaaaag
ttttattact 56760tataggcgca tttatgccta aaggatagtc tcaaaagagt gaaactgatc
aaatgtgcat 56820ttttttattt taataggtat ggacagattt gttctcaaaa tgtttgtggc
agttcaaaac 56880accagtaaaa caggggagat atgtattttg gaaaagcacc caaggcgatt
ctgaagtgta 56940gcccaggata agaaccattg cccagagctg ttccagatgg cccctgggtt
cctgaagtgg 57000gtatcgggag agaaatcttc actgaatgaa tgagtgggct ccccagggaa
gtgatgaaat 57060ggtccttatc agccttgcta tctccctctg acagaggcaa actctctctc
cctgggggaa 57120gttcctccaa ggcctctata taagaagtct ttgtgagagg aagcaaagaa
ggacctgggc 57180tttgggaaga tctaaagacc caggaaggtc tctgggtggg tgagtgcttt
ctctgctgtg 57240gtggagctgg tgacagttta ttctcccagg aggtccctgg ctgtggctga
cagtttctgg 57300agggctggca ggcgtctacc tgtggctttc aggttatgag gatgtcagca
ggggcagcct 57360tcatcctctg ccttgcacat tccttctgcg ggatgtgaaa gtgctccttg
gctggggaaa 57420ggagatggtg gagacatgga ggagggtgtg ggtggcttct tgaactctga
ggaggggaca 57480taccttctaa gtcctatgtg ttcctaggaa agccaataat cattgcttct
cccgcctttt 57540ttatgtcata gactctgagg gacccattaa gtacaaacaa ataagcgtaa
tagtcccttc 57600tttacttccg ggcctgaagg aaagccagcc tcagccaccc ctcagggttt
gctgcgttct 57660gtttagaaag aggtccttgc gtcctggatc ctggagcatc aggagctggg
cttggcatga 57720gcttttctgg cccatcctga tttctattca ggccttcttt ttctccacct
cactcccacg 57780gtcccctaat ggtgtgattg tgatgtgtgt gcatgtgtgt ctgtgtgtgt
caatgacaaa 57840ctgtgttctc cgttgcagga taaagccaag atgaaactcc ccttacttct
ggctcttcta 57900tttggggcag tttctgctct tcatctaagt aagtgttttt tgccttcagt
ctttctttct 57960ctgttttttc cctttctatg gtagatgggg tcagagttac acacccaccc
ccttctttga 58020tcgtcttcta tttctgaatt tctgtgtgct taaagggatg gggactctat
ggccaggagt 58080tgaaaggatt tctcaaggcg tctgttatgt ctgtggtctt ggttctactg
tgacattccc 58140aattttgtcc tttctccatt atgcttactt tgagcttact gagtgccttc
tctcctttaa 58200ctctcttagc atcgccatga agtaggtggt attgtatacc catttcacag
aaatacagct 58260ggtggatgat ggaaccagta cccaagccca tgactgcccg actctaagtc
catgctctta 58320accaccttga ccttgtcagg cagcttgggt tcccctcata gagactgggt
tccaggttcc 58380ccttcccagg cagagttgag cactctgatg cccagggcaa ggtgtgagct
gtctgtggtt 58440ctggggagga acaaggggag atgtgaagga aggacactta gctatcctcc
ctgccagggt 58500ctgagacttc cacctttgag acccctttgg gtgctaagac gctgcctgag
gatgaggaga 58560caccagagca ggagatggag gagacccctt gcagggagct ggaggaagag
gaggagtggg 58620gctctggaag tgaagatgcc tccaagaaag atggggctgt tgagtctatc
tcagtgccag 58680atatggtgga caaaaacctt acgtgtcctg aggaagagga cacagtaaaa
gtggtgggca 58740tccctgggtg ccagacctgc cgctacctcc tggtgagaag tcttcagacg
tttagtcaag 58800cttgggtgag tggcctatgg ctgaggctga ggtgggagca tggaacgggt
gtgggatatg 58860cccccagcat tgctatcact ggctcttttt cccattgagg gccctggggg
tgtcagtaga 58920acctgagcct cagagaggtg ttggggtaag aggggagggc cacctacaaa
cagaagttgc 58980attttggtct ccaaccttca aatggttgtg gcaggggagg gagggaatga
attgtgggga 59040ctcaagaccc atgtgaattc atgtaggaag gatgctccat tctttgtctt
ttatcctgcc 59100ctgtagttta cttgccggag gtgctacagg ggcaacctgg tttccatcca
caacttcaat 59160attaattatc gaatccagtg ttctgtcagc gcgctcaacc agggtcaagt
ctggattgga 59220ggcaggatca caggctcggt aagagaagtg tgaacactaa atggggtgca
cctgctgatc 59280tcagccagca ctcagcttgc atcagatttg tctgtttttc tcctgtataa
tctccagaag 59340aaccagggat agatggacac ccacagacaa cactgagggg gctgcctggg
cattcaggga 59400agagctaagg atttagaatc aggaggtttg ggtccaagtt cctttccatc
tctcactatc 59460tatgtaactt aagttagctg ggcatggtgg tgcatgtctg taatcctagc
tacttgggag 59520gctgaggcag gagagtcact ggaacctggg agacagaggt tgcggtgagc
cgagatggag 59580ccattgcact ccagcctggg caacaagagc gaaactccgc ctcaaaaata
aataaataaa 59640taaataaaat aaaaaaaaaa ttaaaacaag accatgagtt tgtttcctca
tctctaggat 59700gagttggcaa cccttgttct accttttgtt agggctggaa ggacaagcct
gtcactggga 59760tgcatagaat ctgatggtga taattgccgt ggatcagcat ttcagatgac
taggacagtt 59820cccatcatgg tccagcaggg aagggcccat tgcccggtgg gcagcagaaa
gagctggcag 59880atacggggcc aggtctgctt ctctgccttc cctctgcccc atcccttctt
cccctcttgc 59940tttctccagg gtcgctgcag acgctttcag tgggttgacg gcagccgctg
gaactttgca 60000tactgggctg ctcaccagcc ctggtcccgc ggtggtcact gcgtggccct
gtgtacccga 60060ggtgaggtgg ggctggggat gaacgatgga aaggtctggg agatgggaag
tgccccaagg 60120aggagatgct acaaagagcc tgaccctttg tgggagaggc ttcctgggtc
ttttatatac 60180tctgactcca cagcagtgtg tgggtgggaa aagaggccct cctgtgggtt
gagttgggat 60240ggacaagagg ctgaaagtcc ctttctgttc tgccttcaca ggaggccact
ggcgtcgagc 60300ccactgcctc agaagacttc ctttcatctg ttcctactga gctggtccca
gccagcagtt 60360cagagctgcc ctctcctggg cagctgcctc ccctcctctg cttgccatcc
ctccctccac 60420ctccctgcaa taaaatgggt tttactgaaa tggatttatt ttctcctctg
atcgcggatc 60480cactctgctt agccctcatt gaaacttctt ccttatcatc tctccccaca
ccacaacttt 60540catagaagtg tcagaagcta ctactccttg aggaggagga tggagggtgg
agttgggtct 60600atggagcctt ttggagatgg aggaatgggc tcagctagtt ctcttcatag
aacacctgat 60660tactgggcac ctgcatagtg ctgccaggac ctttcaaggt tgtaggtaga
ctcccaatgg 60720cccagtttgc atctctgtaa ccaaaggcct tttctctctc tctctccaac
cccagaactg 60780tggttggttt tatatgtaag gaagttaaca tgtccctggg aacagtccac
aacattcagg 60840aatgaatgta taagtaccgc aatccccggc ccctcaagtg gaataaatct
aacatgtatt 60900gggcaccatt tcccagtggc ctgctgtggt agttggcctt attccatgca
tttttatggg 60960ctgccttccc ttcctcaact gcattctctg ctccttccta ctctctgcaa
ctcccaaata 61020aacacttgta cgcaactccc tctctcagga tctccttctg gggaaacctg
atataagaca 61080gcttgccatg cgtcagactc tgaatgaggc ctgggaatac aagacatagt
cctctggcac 61140ttgggatata tggttatttg taacataggc acaaaaacat ctactagttg
ttatcgctta 61200ttgagcaccc acaacatacc ccctgctgtg gcaggcacct tgcctagatg
acctcatgtg 61260atcaataatt atgagcccta ttttacagaa ccaggctcag agaagttagg
atctgtcaaa 61320agacttgccc aagactgaac ctctaaatgc aactcatatt gaaattcaac
tctgctccaa 61380agcatgttac tttaaccctt gtgcttttac agctggctac tctcccctta
tggtcacacg 61440gggatgaagc acggggggag gaaagccaga ctgtctcact cttgggttca
tcttgggaca 61500caggacacca gcccagctgg aggtgaggga gctttaatca gaggggaggg
aggaaggcat 61560tctcaacccc ttctgtacta gggaggtcag cagaagaaaa taattcaatg
ttctaaagcc 61620atttttttct ccagcattcc tccaattcat agatcttcat atgggattag
gggctcagag 61680aggggtgaaa caagaactct atttttttgg agtgtggtat agagaaggga
tgctacttct 61740ctaaggtcac atagtaagtt gagaaagaga gagaaatcaa actcaggttc
atttcaacta 61800ttgttccaca agaatctgtt gatttcaaag atggtggact atgggttcat
ccctgtggtg 61860agtgctgtga ggatgcagct gaggtggaac tttcactcct tgccctcttg
gactttatat 61920tctggtgtgg aaaggcattg cttcccttat ttcaatatta acaacaaagg
gtaataatat 61980ttcccattta ttaagcattt actaggtgtc aggtactgtg ctaaatgtta
ggtgaacttt 62040gtcttgttcc tcataaatct ctgccgctgt gggtgtgtac tttgacagaa
gtttgacttc 62100cagtccacag agatcttctt tgggggagta atatcaagaa ggggcacgaa
ggaagctgca 62160gggctcctag tcccatcctg tatctcgacc taggcatgtt tacattggtg
cattcactgt 62220gaagtttccc tgagcagtcc actctatagt gtgctttata ggagcacatt
gtacatccat 62280tgaaaaattt ttcttggccg ggcacggtgg ctcatgtctg taatcccagc
actttgggag 62340gccgagacag gcggatcacc tgaggtcggg agtttgagac ctgcctgacc
aacatggaga 62400aaccccgtct ctactaaaaa tacaaaaaaa ttagccgggt gtggtggcac
atgcctgtaa 62460tcccagctac tcaggaggtt gaggctggag aatcgcttga acctgggagg
cgaaggttgc 62520agtgagccga gatcgtgcca ttgcactcca gcctgggcaa caagagcgaa
actccgtctc 62580aaaagaaaga aagagatttt ttctttttct taaaaagtaa aaatcatgaa
ataaggggac 62640tgggctaata ttccaaaata tgggtttgtg tgtgaatttt cctctccagt
aagatactaa 62700ctaagctctg tgaaactgtt tatctatggt tctttatcat tgaatccttg
gagttcctta 62760cactgtgcag agcacagagt aggggctcaa tcaacagtgc actcattgct
ttttcataga 62820caagggccac cctcactcaa ctcatgtgcc aggcatagtt ctgagagctt
tgcttaagct 62880gatnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 62940nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnggtcact
aatttggtat 63000agagttatgt tcattgattt cattttattt tgtttctgat ttttaaagat
tgttttactt 63060gttttcttcc tattattatt ttattttatt tgtaaaacat ttacatatca
gacatttaca 63120ttttcccaaa ggtaaaactg tgaaacaaga tatattcaaa gaagtttact
ttccctctct 63180gtttcttgta ccccttttcc tcttctttag gtaaccattt ttattttttt
aaatataaac 63240attgtgtagg tgtatataca tgtattagtc tgttttcatg ctgctgataa
agacctatct 63300gagactggga agaaaaagag gtttaattgg acttacagtt ccacatggct
ggcaaggcct 63360cagaatcatg gcaggaggtg aaaggcactt cttacacggt ggtggcaaga
gaaaaatgag 63420gaagaatcaa aagtggaaac ccctgataaa cccatcagat ctcgtgagat
ttattcacta 63480tcacaagaat agcgtgggaa agactggccc ccatgattca gttaccctcc
cccactgggt 63540cccacccaca atacgtggga attctgggag atataattca acgtgagatt
tgggtgggga 63600cacagccaaa ccatatcaat acatttccct ctcttttaga taaaaggtag
tatactgtat 63660acactattct gcagagtttt tttttttttt gatgtaactc tatcctgagg
gtgctctgta 63720gcagggacct ctcatgcctt ttaaccactg cctgggtctc cattacatgg
ctgcagcata 63780gttgccacag cattcctgta ctgatgacta tttggattgt ttccagtctt
ttgctattac 63840cagtagtgtt acaaagagga tctggctaca tgttcagggt ggggaggggc
agatgtgtag 63900cctgtcagga gggtattgca gtaatccatg actgagttaa tggtagttta
aagctaggat 63960gagtcagtgg ggttggagag aagtgggcac atttgaatga tatgtaggag
gtgaatgatc 64020agcattattg atgagtttga ggtggggcat gtggggaaag gattcgagga
tgactcccag 64080gtttctgttg ggacagtgga tggatagtgg ctcctcccct ttttccaatc
ttccttggcc 64140cttcgctgac ttctgttggg ttggcctaca gagagcttct ttttcctctc
tgttcgccca 64200ggttcctcca ctttggcggt ggccctctgc tcgacggtgc cttcgctggc
cctgacatcc 64260ctgctgtgcc tgggcttcgc cctctgtgcc tcagtcccca tcctccctct
ccagtacctc 64320accttcatcc tgcaagtgat cagccgctcc ttcctctatg ggagcaacgc
ggccttcctc 64380acccttgcgt aagtggcctt ggggcgggct ctgtggagac ggacacactg
gggcaaagag 64440aagctggagg taaagaaatt gggaggcaag gcggggcctg gaggcagtca
ggtgcgggag 64500actgggtttg ggggcaggtg tggagggggt gagaccagag gtggtgggaa
ggatagaaca 64560ttcatgcact tgagccttta catctgcggt gccctctccc tctgttttct
acctggtgaa 64620ctcgtattca tcctctgagg cccacttctg tttcagttct ccagggaaga
aatggaaaag 64680tgtcttccct tctttgtgcc cttagtactc tagtcttact tcctttgcta
gtgcgtgcat 64740tgtctggcat gccatccatt tacatgcctg tcttttcttt cctggtgcag
cctgcatgag 64800ggtcctgtct gtttttccag ggccccgcat gtgccttctt ctgggttctg
tgggtcaaat 64860gtctgagcag agctgaagag ggaaaggcca gacaggtgtg gttggagggc
aggcctagga 64920caggggagct ggggacaagc ggccgacagc ccccagaggc caggcttctg
cttggaggga 64980gggtccctga agctcactgg aacccctctg gtttctctcc ccagtttccc
ttcagagcac 65040tttggcaagc tctttgggct ggtgatggcc ttgtcggctg tggtgtctct
gctccagttc 65100cccatcttca ccctcatcaa aggctccctt cagaatgacc cattttacgt
gagtactggg 65160aggatgggga tccctggcag gaggcctggg ccttaggcct tggctgcccc
aaatctggct 65220gtgatggcct gggtatgtag catggtgcag cttcccaaag ggtctgtgtt
attcaagtat 65280ttggggcaaa agtatttgtg tgtgtgggga aacagacatt ctggagtagg
gtggggaatt 65340ctcacgaaac ttcaagcaaa atcctgagac ctcaaaggtg tttcctgctt
gtggtgagtg 65400caggcccacc ctggcctctc ccctaggccc acacagggtt tccacagttg
gccccaggga 65460caggacctct gtgctttcac ctctgtgtcc ttacacctgg agggatgctc
tgaggtcctg 65520ctctaggagg tggtcgtgag tctcctgctc tttgcagaaa ctgaggctca
aagaggttac 65580ttacgtgttc agaggcacca gctaaggagc aaaagtcaac tttgaattct
gtgttttgac 65640tactgcacag ctctatttgc ctcatttttt atttttaaag cagcaaatct
tagaatagga 65700gtttaaatcc atcacttgga gaaaagaaag actaaatgtt ttttgttttt
gttttggaga 65760cacgatcttg ctttgtcacc caggctggag tgcagtggca caatctcggc
tcactgcagc 65820ctcgatctcc tggactcaag cgatcctctc atctcagcct cctgagtagc
tgacactaca 65880ggcatgtgcc accatgccaa gcttatttta ttttattttt ttgatagaca
ctggggtttc 65940gctatgttgc ctgggctggt tttgaattcc tggcctcaag cgatccaccc
gtctctgcct 66000tccaaaatgc tgtgattaca ggcgtgaacc actgtgcatg gccaaaagag
taaacttgaa 66060atctgaggcg aatgacttga ttgtgacatc aggtgaccta gtaatcagct
gtgtattcta 66120gctggtgcct ctaccagctt cccatgtgac cttgaacatg tcattgaatg
ctcgctaggc 66180ctctgtttct ttatctgtga aatgggcttg atattcctcc tctaccccaa
ccgatagtgc 66240agaatgaaaa gtaactgaaa gtccttcctc cagggcacca tagtgtctgg
gtgaaaagta 66300gaatataaac tcggtagact tctggtccct tcattggtca tggaatggac
cagtgcttgc 66360ttcattgagc aacagttctg ttgttcagaa ttcctggatt tcacctcact
tctgctctcc 66420ctgcaggtga atgtgatgtt catgcttgcc attcttctga cattcttcca
cccctttctg 66480gtatatcggg aatgccgtac ttggaaagaa agtccctctg caattgcata
gttcagaagc 66540cctcactttt cagccccgag gatggttttg ttcatcttcc accacctttg
aggacctcgt 66600gtcccaaaag actttgccta tcccagcaaa acacacacac acacacacac
acacacaaaa 66660taaagacaca caaggacgtc tgcgcagcaa gaaaagaatc tcagttgcca
agcagattga 66720tatcacacag actcaaagca aaggcatgtg gaacttcttt atttcaaaac
agaagtgtct 66780ccttgcactt agccttggca gacccttgac tccaggggag atgacctggg
ggaggaagtg 66840tgtcaactat ttctttaggc ctgtttggct ccgaagccta tatgtgcctg
gatcctctgc 66900cacgggttaa attttcaggt gaagagtgag gttgtcatgg cctcagctat
gcttcctggc 66960tctccctcaa gagtgcagcc ttggctagag aactcacagc tctgggaaaa
agaggagcag 67020acagggttcc ctgggcccag tctcagccca gccactgatg ctggatgacc
ttggcctgac 67080cctggtctgg tctcagaatc acttttccca tctgtaaaat tgagatgaat
tttggtgttg 67140aaagttcttc ctggagcaga tgtcctagaa ggttttagga atagtgacag
agtcaggcca 67200ccccaagggc catgggagcc agctgacctg cttgaccgaa ggatttctga
cagactatct 67260ttggggatgt tttcaagaag ggatataagt tatttacttt gggcatttaa
aagaaaattt 67320ctctcgggaa taattttata gaaaaataaa gcttctgtgt ctaaggcaac
tactgtttcc 67380atctctctag gctttgggcc ggggctgtgt gtgtgtgtgt gtgtgtgttt
gtgtgtatgt 67440gtatgtttct gaggaggccc taccctggca tgagagggta gggaatctgg
ctacacatct 67500agtgtggcag ctggacccag aggtggggca ggaaccctga ctatgattca
ccccgctggt 67560cctgggatgt gggcccagag acttcctccc ccaggaaccc ctctgcttcc
tcttcctctc 67620cacatcctta actaacttta gcagaaccct actcctcact acacaccccc
agctagaagc 67680gctggatgga atcagaaatt cctagtttga gtttcaattc tgcccctcag
cagctgggca 67740agccccttaa ccactctgag tcactagttc cccacctgca aagtgcagtt
aatcatttct 67800atctctgatg gcgattgtga gaatgtaaag tcattgcaac tgcctagcac
atggtaggag 67860cacatgaggg tttgctcctg tgtttactca tgacccttgg ggaggacggg
ggcaaagagg 67920gagaagttga gggtgcagga ggagagatgg caggtgggtg ggatgggaga
atctggggca 67980cacctgctgt ctcattccca ccttgctagg agagggacta ggaaagaaca
gtgggaggca 68040gggggatggg ggtggaaggc agggggtggc aggcaggttc atccatccat
tcattcaaca 68100aatgtttatt gagcacctgc cacgtgtcag gccctgtcct gggtgctggg
gctataaaga 68160tgcagaaggg tctgaaaccc agctcttcct tcttcctgtg gatgtcgggg
tgtaatttcc 68220aggggccagg agcctgggtc tgagggcgga caccaaagtt ctagtggtgt
ctattagcag 68280cgtttaaatc taatggatgg atttggtctt gttaccctgc tcaaaagctt
tcagcagctc 68340cccactgtcc acaggacaaa aatccagatg ctagcctggc attcaaggct
gtcactagtg 68400tgatctcaac ctctcccctt ccctctttac ctcctaccaa cagcggggca
gagcccaccc 68460ctgtggacca agattcccag tctctgggtc tgtgtgtgca ccagttcctc
tgcgtgggtg 68520gctcaccctg cctcagcttg tgaaatccat ctggtctgct gggatcctgc
tcaaaatgtc 68580atcttctcca aaaatcatta ctcaggcttt ccagcatgtc tgagtccctg
gcacttggtc 68640acacccttcc tggtgactgg catttgcctc cacatcatga ccctcccacc
ccttgcctgg 68700gcagcatact ccaggaggca aggtctgttc tcgcctggct ctaattaatc
tgtgcttacc 68760atccacatgg taccagctaa ttcttgttga atgaatgatc gttgaatgag
tggattcttg 68820ttttggcctc agaaccaatt agaaggagcc agaaaaacac atgggggtgg
gggaggtgca 68880gtgtggtgca gtggaaaaaa acccttctgg aaatctcagc tctgtcactt
actttgtcag 68940ctctgtgact ttggatggac cacttctttg tcagtatggt gggagaaata
gacatgcctc 69000tctgggctgt tgtaaggatt acaaattagg tcgagtgctt ggcatgtggt
gggttgaaca 69060gatcacagct agcattacag atgatatatt aaagccaaaa aaagatgcct
aatgtccacc 69120agttggtgaa cggacaaagg aaatgtacca tatttgggat attatttggc
aatcaaaaaa 69180agtactgaca cctgctacaa cacggatgaa tcttgaaaac attagactaa
gtgaaagaag 69240ccagacacaa gaaactgcta atgattccat ttaaatatga aatatcgggc
cagggtgcag 69300tggctcatgc ctgtaatccc agcactttgg gatgccaagg tgggcagatc
acttgaggcc 69360aggagttcgt gaccagcctg gccaacatgg cgaaaccccg tctctactaa
aaattagccg 69420agtgtagtgg catgcacctg taatcccagc tacttggttg gctgaggcac
aagaattggt 69480tgagcctggc aggtggaggt tgcagtgagc caagatcgtg ccactgcact
ccagcctgga 69540tgacacagtg aggttccgtc tcaaaaaaaa aaaaaaaaaa ggaaaaagaa
aaaaagaaat 69600ttccagaata ggccaatctg tagaggcaga aagtagattc atgattgggt
aggcctgggt 69660gtggaggcca tgggtagtga tggctaatgg ggaaggggtt tcttttgggg
tgatgaaaat 69720gggtggactt atggtatgtt aattatacct caataaaact gttatttaaa
ggaagaaaag 69780atgcctggat tccccaggaa gtgtacagta gacttctgtg agaatcagaa
atgatttctg 69840gggaagatgg gcgagaggag agtaagtggg agaagtgacc acgtgcgcaa
ctctcatcgt 69900tctgccctga gagccttcct cctgcaactt tatttattta tttattttga
aacaggttct 69960cactctgtta ccctggctgg agtgcagtgg tgtgatctca gctcactgca
gcctcgacct 70020gccaggctca agcaatcctc ctgtttgagc tcctgagtag ctgggactac
aggcgcatgc 70080caccacatct ggctaatctt ttatttattt atttatttat ttatagagat
tggggagtct 70140cactctgttg ctcaggctgg tgtcaaacgc ctggactcaa gtgatcctcc
caccttggcc 70200tcccaaagtg ttgggattat gggtgtgagc cactgtacct ggcacctcct
gcaacttctt 70260cctcaagtgg aaccaatgag gaagcaagca actcagagct ttcacaagtt
ttgatttcaa 70320tcagcaacgg gcttccaatg caacccttct ctcctgtaac cagcctcagt
agagaggaac 70380tggaggtgaa ttggccccca tcacaccccc acagtgccaa gctgggccct
tccatcaggg 70440ggagaacaca tgccgtgtaa gggacagcca acagcataaa ataggaattg
tgtgatgatc 70500ccttttaagc ctattcagcc cagggaagtg catatgatca gccccatttc
atagatgaag 70560aaagtcaggt tcacccatta gcacattgtg gggctggtat ttaaaccagg
tctgtctggc 70620tcccaaggtc acattcattt agacattacc tttactttac atttcttctt
cttttcttct 70680tcttcttctt cttcttcttc ttcttcttct tcttcttctt cttcttcttc
ttcttcttct 70740tcttcttctt cttcttcttc ttcttcttct tcctcttctt cctcttcttc
ctcttcttcc 70800tcttcttcct cttcttttct tcttcctctt cttcctcttc ttcctcttct
tcttcttctt 70860cttcttcttc ttcttcctct tcttctttct tcttcttctt cttttttttt
tgaggtgggg 70920tcttgctcta ttgcccaggt tgaatgcagc atcatcatac ctaaatgcag
ccttgaactc 70980ctggccttaa gcaatccccc tgcctcggcc tccaaaagtg ccaagatttc
aggcatgagc 71040caccatgccc agcctgcatt tattctcttg taagaaagat atcatttaaa
acagacgaga 71100aaataaagag ggacatgaaa aagacgcatc accattaatt ggaccactca
gagataatca 71160tggttaacat gttggtatgt tccctcccgt catttgactg gatgtatgtg
ataatttaaa 71220tgatctcata agcttttcct tatgtaatca aatagtagcc aaaaacatga
ttttaaatgg 71280ctgctcacaa ccccatctcg tggttctgcc acgccttgtt tatccccatc
caccccctac 71340tccctttccc cttccctgcc tgtgtggggg tcctagatga cggtgagcca
gagggcagcc 71400ttggtcagca gattggagag tgcaaataat aaaaacactc agaaggcgag
ctgttgtcaa 71460gtgggcttat cacaaaagag caccttggga tattccagag aatgacctca
tacccgctaa 71520tcactatcca taatctggtg ctaactgtac tttagctgaa ggtgctggca
ggtcctgccc 71580aggtgctgct aagaacactt ctattctgtg agaatcagag atgatttcta
gggaaaatgg 71640gcgagaggga gtaagcagga gaaacaaccc acaggcacag ctctcatctt
tctgccctga 71700gagccttcct cctgccacgt ggttttgttt gtttgtttgt ttgtttgttt
cagatagggt 71760ctcactctgt cacccaggct ggagtgtagt ggcaagatca tggctcactg
aagcctcgac 71820ctcccaggct caagcagtcc tccccaaatt caaagcttgg agtgatggtc
ccagtggtta 71880tgtctaggag ccctttttcc tgccagcccc tcaggggatt gatgactctc
aaatgcttca 71940ggtgtgacat gggcacagca gtgagtcatt cctctgacat tctttgggaa
gaacattttc 72000catccaggct tccaggcata agatccagtc ctctggtgat aaggagttca
cagacaggac 72060aatgtctgag tgtatcttaa acccaggacc atggcttgtg ttcacaccag
accctccagg 72120gattttgagg tgttttgttt gtttgtttgt ttgtttgttt gttttttgag
acagagtctc 72180tctctgtcgc caggctggag tgcagtggca cgatctcagc tcactgcaac
cttcgcctcc 72240cggttcaagc gattctcctg tctcagcctc ctgagtagct gggactacag
gtgtgcacca 72300ccacacccgg ctaatttttg tatttttaat agagactgtg tttcaccatg
ttggacagga 72360tggtcttgat ctcttgacct cgtgatcctc ccgcctcggc ctcccaaaat
actgggatta 72420caggcatgag ccaccgtggc ccgcccaatt ttgagttttt atgttctaat
cccaaacatc 72480tgctcacagg cccctcagca tattctttcc tgggtccagt gtcacctccc
aggcctgcag 72540gctggctaga gcagtagggt gtgtgggaaa gctctgggct ttgcaggcac
tgatcagctg 72600tgtgacctta accaccctga acctcagttt cctcacctgt aatggaaata
ggtaccacgg 72660cagtttgttg caaggactag agagtaacct tgggaataaa aggtagcagc
agcttgggct 72720ctggagatgg actgtccaag accaacttcc agttcctccc cacacaagct
ctggcactta 72780gattcctggt acctccgctg cttcatctgt aaaatggagt aacaatagga
atactttata 72840gagttgtaag gattgagtgg ctggatgaac gtcaagcact tcaaagggga
cctggcatgt 72900agtgagtgat caatataaac cacctggctt gtagcaggtg tgctgtgtgt
ggctgcaggt 72960gttattagta acatctgtgt gcccttcaga gcgtgcacca cacttcacac
cttgtggagt 73020ctggaatgcc actattatag ttcaggatag aaaacctccc tgcaagcact
cgctttagct 73080tgtctccacc gaacaaaaca acacaagttc tttattactt ggaatgggaa
aacttcaaag 73140gcaaaaaaaa aaaaagactt tcgagttacc ccaaatctta agccaaagtc
aatgaaaaat 73200atcaatcttc atattcaatt tttgcgatac ttttgtctcc ccagcagtca
atggagagaa 73260tccaagcaca cagaaatgtc aattaccagg ggcagggcta tgaattcctt
tcagagccct 73320gggctgggga agagtgcagg cagacagatc tgggtcctgt tatcacgttc
ttagattggg 73380tgtccttgta ggagtcatga agcatcttag tgcctttgtt tgctacctat
aatgcctacc 73440tcagagagta ataaggataa gtaaggctct acgtgaaaag tgctcggccc
tggcacatag 73500taggtccttc attaatggca gctactaatt tttattacat acgcaaaatc
acattacagg 73560tcaagtacgc tacatgacag tgaaacagtt tttttgtttg tttgttttga
gacagagtct 73620cgctctgtca cccaggctgg agtgcagtgg cacgatcttg gctcaccgca
acttctgcct 73680tcaagcaatt ctcctgtctc agcctcccga gtagctggga ttacaggcat
gtgccaccac 73740gccagctaat tttttttggt atttttagta gagacggggt ttcaccatat
tggccagact 73800ggtctcaaac tcctgacctt gtgatctgcc caactcagac tcccaaagtg
ctgggattac 73860tggcatgagc caccgcacct ggctgtgaaa cagttttatt gtgtttctgt
ggaatgtgtc 73920ctacccaacc tatagctaac tcctatagtt ccctcagttc tcagctcaga
tatcccttcc 73980tttctgtact gttacctagt actggttttc atagcaccag gtacctctct
ggcatagagc 74040ttgtcacagt tgcagtttaa tgtaccatca taggatttta aaaatattca
gttgtgtctt 74100ccattaggct ttcatttggg aactccacgc aggcagcagc tgtatatttt
gtattgccta 74160ctgtatcctg agaactttgt accctactta gcacagaatg gaggctcagt
aaatactgga 74220catgagagag agagagagag agagaggaga gggagagaga gagagagaga
ttcaacctac 74280aatcccagct ctgagcttct agttccctga tggtgaggac tgtgatgtgt
ctcacacggt 74340aatgagcact tatgcagaag aggctcagaa aatttctcct catggccaac
ggaagactta 74400gagttctttt ccaagctcca ccgtttgctg gcatgcaaaa tttggactat
cacttaagtt 74460ttccaagcct tgctttttct atccctaaca taggacaata ttcagcattg
ttgtttgttt 74520gttgggggca ccatgtttca ggcacttagt agattattgt accaccacat
ttcaattggt 74580cctcctcaag ccctgcaaca tctgtgaggt ggtcatcctt aacaactcac
agatgagcaa 74640caggagactg gggggatgag ggaactgcca aggaggtcca gcttatgggc
agcagagcca 74700agaatggaac cagggtcttt tattttttta tttttttatt tttatttttt
aaccagggtc 74760ttttaacatc cgaggaccac attctttgtg ctttccaaat catcacctgc
cccatgcaac 74820ttacagggta agttacatta aacaacgtat gtaaatggct ttgtgctagt
tattcaccac 74880cacaggggaa gtgagtcacg gacaagagtg cagccgctcc attcggatcc
tggctctgac 74940acttacctgg aaaatgactt aaccattccc aggatcagct gtttgtctgt
aatttaggta 75000gtttaatggc acttgtgtcc tagagttgtt tagaaggttg aataatatgg
agcacttaac 75060atacttagca cctagaaaca cttcctaaat attagttgct gctgttgtta
tcgttattaa 75120aatttctgcc taagatctca tttcagggag cccaactcaa tctttgacaa
gcttaaacaa 75180aaattgcttt tcttcattta ttcacttaca cagcaaacat gaattgagcc
tgtactgtgt 75240ttccagaact gtgcaggacc agagaggcac aggtgaagga agcaaggctc
tggctctact 75300ggggaaacag caagaagatt gctacaatga ggtgggaaga gggctggact
agagagaagc 75360cctgattagt gtccttgcta cctttctctg ggagagccaa ggcaggcttc
ctggaagagg 75420tgatccttgg ctgaaacttc gatgaagaaa aggaaagagc gcagtggtta
gggaggaaag 75480ggcattctgg gcagatgaaa tgacatgtga caaaatatgg gtgatcannn
nnnnnnnnnn 75540nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 75600nnnnnnnnnn nnnnnnnnnn nnnnnnngca ctccagcctg ggtcactgag
tgagagaccc 75660tgtctcaaaa aaattaaaaa aaaaaagtcc agaagaacat ttgggtctca
ctctgtggcc 75720caggctggag tatagtggca caatcatagc tcactgcaca ttcaaactcc
tggcctcaag 75780tgatcctcct gccttagcct tgaaataagc ttggattaca gatgagccac
cacacccagc 75840cagaccatta ttcataatag ccaaaatgtg aaaacaaccc aaatctccat
caactgacaa 75900atggataaat agaatggtgg ttgatccata caatggagta tttactcagc
aataaaaaga 75960agtcctgata catgctacaa ggatgaacct cgaaaacatt atgctaagtg
aaagcagcca 76020atcacaaaag gctacatatt acaagattcc atttaaatga aatgttcaga
ataggtaaat 76080ctaactttta tcacaggcaa agctatgaca ggaaatagat gagtggttgc
ctagtgcttg 76140ggggcagagg tgggggtgag gcgagtgagt actgctaatg gtacagagtt
acttttgggg 76200ataaagaaac tgttctgaaa tggactctgg tgatggttgc actactctga
acatactaaa 76260actgttaaat tatatacttg aaatgggtga cttgtgaggc atggaaatta
tatcttaata 76320aagctgtttt acatatttta catatttaaa aatgcaggtg gagggatgag
ccctctaaag 76380agaagcagga gtttgaggag gttctaaata ttgtgtggtg ggtactgagg
catataaatt 76440tgtcagacct catcaaaatg tatgatgtaa tcttcaagaa agttgatttt
aaagaaacac 76500caccagcacc aggtggagaa ggcaggaaga agttacacaa ggggtaggcc
aagagtggtg 76560gctcatgtct ataatcccag cactgcggga ggccgagctg ggtgggtgac
ttgaggtcag 76620gtgttcgaga ccagcctggc caacatggtg aaagcccgtc tctactaaaa
atacaaaaat 76680tagccagacg tgctcgcgtg aacccagggg gagaaggttg cagtgagcga
agatcatgcc 76740aatgcactcc agcctgggtg acagagtgag actctgtctc aaaaaaaaaa
aaaaaagtta 76800cataggggac agtggcaggt gtcaagggca ggcagggtct ctcctatctc
caggataaac 76860tcatagggga cttagatgcc atgtgggtcc ctaatagccc tccacttggt
tcttgcagcc 76920actcttatgt gtatcatttc atgtcaggcc tcttcttccc aacccaccca
gccatcccag 76980cctggctgcc aaccccacct cctccagccc ctgtcacccc ataattgggg
ccaggaggca 77040tgggagagtc gccatctctc ggtgccatct gttgcatctt tacagataac
catggctgga 77100tgcggcagat cctggggtgg agcagccgct gttcagagca gtgatcaaga
cctccccatc 77160tccacccctc aaggaatcgg ttttcttcca tagccacatc aggtgctgtg
caggaaggag 77220ttgaaacgag aagccaggag caacgagaag gacactaaca tttattaagc
actgcagact 77280ctcacagcac tcccacggaa tcgatattat tatccccatt ctgaagacca
ggcaactgaa 77340gctcaatgtt taaggaactc accgaagtca ccaactgata aaagtgatgg
aagctgggat 77400tcaaatccaa gctaaacttc cttccaagct tactccacaa cacagaggtt
ggggaaaggg 77460gataaaaaga gaggggagcc caattccatt tccacccagc tcctgaggcg
gagcttgtca 77520gcacagctct ctccttccca gaataggaag atacccatca gaggcaagtc
ctagacacca 77580gcagtggtaa ctccctgccc caaggcagct gcagacagcc tatggctgta
gttactgctc 77640ccaaagagtg ttagaattcc cactcccagc ttcggggcca ctcacacaag
gtgattgaag 77700tggaaaccag agactctcca caatgccctc ctagagtaaa tgaggctatg
taactttgtc 77760caaatgagta atttgaaaac ctgggggctc ccagctcctg aaaagggaag
gatgtggggc 77820cctttatatt catactccac tttgtgcagc tctcccttgt cttatgatag
ccctattaag 77880aaattcctct cccagcacgt ctccttcaaa gagctctaga cctgaggctg
tcagaggctt 77940aggactctgc ctattagtcc cagggtctgg atgaccagca ggacacctgg
cattcagtga 78000ccactggatt agataaatga aacagtgggc agagtgccac ccaatctccc
cctgaagttt 78060gaagaggtcg agaagtgagg ctgtccaact gctgaccctg ctttctgtcc
acctggccac 78120ctaacctttt ctggcttcca cctgcccctt tgccatccct ccccccagcc
cacccagccc 78180attttcaggc atacctgggc acgtgctgga atagaagccc tcgttcttca
gaatgatcaa 78240cagggagccc cagcccagga gtacagcaga gaagaagagg ttctccagca
cagccgtgca 78300ggccatccac cagcgcctcc ggtacgcctg ttgcagcgtg ggggccatgc
tggccccgag 78360cctgcacaga aacagagcgc tgggtgaagg gccccccagt ggccccaggg
aagggtcctg 78420catcatggtg gcacccgaga cctctcgggc cagcccgcga ggagcccctc
atggaggccc 78480catagagccc tgggcttccc agccggtgcc aaggagctgg ctccgcgcgc
actagcagtg 78540ccagaggtgc acgcggcacg gggctcccgc tgagccacta tcggaaacaa
ggaaggtcct 78600gtctgcgcgc tgcagcttcc tagcaggctg ccgggttctc tcacccaggc
cagggcgctc 78660agggccgggc tgctggggag aaagtccgca tctgcccagg tccccagagg
acagcaaggg 78720gcagagcgcg ctctgaagca ccgcgggccc atgtccggac tctcgcgcca
ggaaagaccc 78780ctagaagctg gcaggaagaa gggcaagttc aaggctaccc tacgacccca
tcttccagtt 78840gcccctccaa gacctctcct tccctctggg gccgggcgac agcaagccct
ccccctttcc 78900gtatcaggtg acccacgacc ctacagtctc tcgggccaag ccaacagctg
ccacgtggag 78960ggagacccag gacgggctct cctcggttcc ctcctccccc gcgcgcccct
cactcactcc 79020gcagggctcg gggcaccagg ctttgcacct cggaacccgc ttgcccccct
ccagccccgg 79080gagggggctc ggacttcggc aggaagtctg gcggctgctg actttataag
ggcagcggtg 79140gcggatgggc tggcgggcgg gtgtgtttac caaagggagg gaaagagccc
cagctccccc 79200cgccgcggcc gctgcagcct cggcgggagg agagggaacg cgggcagcgc
gggggcgggg 79260agcgacaact gggatgagac cgaggaaagc ggagaggaga agggcaagaa
agacccagag 79320agaggggagg aagtaccagt cacttcttcc agggggactc ggtattctca
tctgtgaaac 79380ggggctttgg gttcaagcgc tccaggaggt ccgctggaac tctggcaaac
gcgcagctct 79440aagcagagga agtgcagcga gcggggaccc gggaggaaga gaagagtcgg
aggggtcaga 79500gaaaagaaaa gggaaggacg cgcttggcga gatgggacac tgtgccgcgg
gaccgcgggc 79560gcaagtaacg gtctttcctt gggaagcctg gcagtgtcgg cgggagccgg
cctcggtgtc 79620tctcagccga cgcatagccg gagaccctac gcgcgccccc tccccgccca
cgctgctcac 79680ctccggtcac cggcaaatga gcagccagca gctgcggacg cctccgggag
cgcaacgctt 79740tcgcggcgcg tccggagtcc cgtgggccca gccctgagcc gcgccggcgc
tggggtcttc 79800tctgcgtgca ggacccggcc gccacggagc ttcagcctga cagcccggtg
gcctcgcctc 79860cgctgtctcc tcggaagaag cgggggaact gggaacccgc cgggcgccag
aggtctgcga 79920agctgggctt ggatgaagtg gatctgcgga gttgatagtt gtatttacac
gcgtccggag 79980ctgcgccccg aggtgggggc gggggctccc ttcttttccc ctccccttag
gtcgagtttc 80040acgcgcacgt gactcgcccg ctggtcccgg acactctccc tctggcacag
ccccagcacc 80100tacatttcca ccctggaccc ccatcttctc ccccaagccc ccagactaac
atcaggcagc 80160gccctctgta tccttgttca aaacaaagtg cgattcggct gaagccgact
gaccgcgatt 80220cagggccgcc ttgggtgggg ttttgaactg tgcagctgga agcagtgttt
tccgagaggc 80280agagtggcac gggtttcttt ggagttagtc agatcgaggt ctgagtcttg
actttttaac 80340tgactaccct gggttaccta gggcaagtta cctctctgag cctcagcttc
ctcctcttta 80400aattcggtta aaatggaacc tacctaactg cccaaaggaa tcgcgattgt
gatgcaggta 80460aaatgctaag catagcattt ggcatagtaa gcataatgtt aattgttgct
gctgtcatta 80520tttcagaaga cctggtgatc ggatgcttcc agatcaacaa ttgattgact
ccaggtaaat 80580ctctcagcct ccctgagcct cagtatcctc atctgtaaaa tagactacta
tggtgtggag 80640taatgagaag taatctcatt acatgtgagt ttaattgtgt gttaagagtg
ctgctaatgc 80700atgctgagct taatacctag gtgatgggtt gataggtgca ataaaccacc
atggcataca 80760tttacctacg taacaaacct gcacattctg cacatgtacc ccagaactta
aaataaaaat 80820aaaatttttt taaaaaaaga gtgatactgg tggccaggtg tggtggttca
tgcctgtaat 80880cccagaactt tgggaggcca aggcaggagg atcgcttgag ctcaggagtt
cgagaccaac 80940ctggacaaca tggtgaaacc ccgtctctac aaaaaagaaa aaaaaatagc
caggcatggt 81000ggtgtgcacc tgcagtctca gctacccagc aggctgaagt gggaggatca
ctgagctgga 81060gagatggagg ctgcagtgag ccaagatcat gccactacac tccagcctgg
gtgacagagt 81120aagactctgt ctcaaaaaca aaacaagaat gactacagaa agctccaaga
aggcctcaga 81180taaaagggaa cccctgaaca gatgagccac caagccaaga gaggaactaa
tggctaccat 81240agacagggca ctttccaaaa taaaaatact gttattaatt cctcaagaca
tcatggtccc 81300atttaaacct catagctttt cacagaggga gaaactgcag gcttgaagct
ggagcaaggt 81360tagaggtagg atgcagagtc aggtcggcct ggcatttaag tacggctcct
tccattcctc 81420ccagaaggag aatggcaaga gcaaaggctt agctgtggga atggcacaag
gagttctcgg 81480tggccaaagc acatgtcagg ctctgatggt ttaacttctt aaaatgcaat
actgcctccc 81540agaacttcca gatcaaggtc aaactcctca gctctacaca gggggaccta
gagtcaactt 81600tctaagctag gagagtcatg gatccctttg agaatacaaa agacagtggg
cgcggtggca 81660gtggctcatg cctgtaatcc caacattttg ggaggctgag gcaggaggat
cacttgagcc 81720caggagttca agacctgctt ggtcaacata gtgagacccc tatttctaca
aaaaattcag 81780ctgagcatgg tggcatgtgc ctgtagtctc agttactggg gaggctgaag
taggatgatc 81840cctgagcctg ggaggtccag gaagctggag tgagccgaca tctcgccact
gcactccagc 81900ctgggtgaca gagaccctgt ctcaaaaaaa aaaaaaaaaa gaagaaatat
gttattgatc 81960tactcttgac aaaaatgctt gtgtgaacat ggacacacac actcatcaac
attcacattt 82020caaggttttc atggaccctt tccatgaggc tctagtggtc catggacccc
catggctgga 82080acacttgctc ttcctcatct caacccacat ttccatggag ttggactgtc
tgctgcatga 82140ggacacaggc ctcatttggt gtgttcattc actgctgtgt atcccagcac
ccagaacagc 82200acctcaccta aggggcactc agcacatgtg cagtgaagag tcagtcagct
ggtttcacac 82260ctcccagtct ttgcacctgc tattccttct tgtgggaatg acagatttcc
ttcatttctt 82320tttttttttt ttttgacaga ttccagctct gttgcccgag ttggagtaca
gtggcacgat 82380ctcagctcac tgcaacctct gcctcccagg ttcaagcaat tctcatgcct
cagcctccca 82440agtagctggg attacaggtg cacaccacca cctgtgagct gatatttttt
tcttttcttt 82500tcttttttcc tgagacagag tctcactctg ttgcccaggc tggagtgcag
tggcgtgatc 82560tcggctcact gcaagctcca cctcccgggt tcaagtgatt ctcctgcctc
agcctcccaa 82620gtagctgaga ctacaggcgc gcaccaccat gcctggctaa tttttgtatt
ttttagtaga 82680ggcggggttt caccatattg gacaggctgg tctcgaactc ctgacctcgt
gatccgccca 82740cgttggcctc ccaaggtgct gagattacag gtgtgagcca ctgcactcgg
ccattttttg 82800tattttttta gtagagatgg ggtttcacca tgttggccag gctggtcttg
aactcttggc 82860ctcacgtgat ccacccacct tggccaccca aagtgttggg attacaggca
tgaaccactg 82920cgctcagcct ccttcttcat ttctaatgta ctcatccttc acaactcagc
tcaagtttca 82980cttctctctg gaagctctac tctaggctgg attcagggcc ttgtccacat
acccaccaaa 83040tactctgctt acctctatgg aagtccccac actgatctag aataatcagc
ttagttttct 83100gcccccatcc cgccccatga gatgtacatc ttgtgggggc aggaaccacc
acgtggtagg 83160tgatttgtgt gcctgctgcc tatcacaggg cctggcgcct aataagcttg
cggccaacat 83220ttgttgaata aatgaaaagg gaatggtggg aaaggaagct gaaaaggtag
gctaaaatca 83280gtttggaatt acctctggga ggccaaggac tttcagtctt gcagggtagg
taacaggaaa 83340ctcctggatt ttgttttctt ttggttttgt ttgtttttaa tgaagggtag
cgttatcgtc 83400aggtttttgt gtttaattaa tggagcatat attggaaagg acagagacct
taaagcagtt 83460aggagaccac cataatagtt cacattttgc agccataaaa aggaatgagg
ccaggcatgg 83520tggctcactc ctgtaatctt atcacttcgg gaggttgagg caggcggatc
acctgaggtc 83580aggagtttga gaccagcctc accaacatgg agaaacccca tctctactaa
aaatacaaaa 83640ttatccaggc gtggtggtac atgcctgtaa tcccagctac tcaggaggct
gaggcaggag 83700aatagcttga atctgggagg cagaggttgc ggtgagccga gatcgtgcca
ttgcattgca 83760ggtacatgga tgaagctgga agccatcatc ctcagcaaac taacacagga
acagaaaacc 83820aaacaccgca tgttctcact cataagtagg agctgaacat tgaaaacaca
tggacacaga 83880ggggaacatc acacactagg gcccgttggg gagtgggggt tggggggtaa
ggggagggaa 83940cttagaggac gggacaatag gtgcagcaaa ccaccatgac acacgtatac
atatgtgaca 84000aacctgcaca ttctgcacat ggatcctgtt ttgttttaag aagaaataaa
gaaaaaacca 84060agaagaaaca aacaaacaaa aataattccc atttaaaaca ataaaaaata
ggccaggcat 84120ggtgactcag gtctataatc ccaacacttt gggaggccaa cgcgggcaga
tctcttgagc 84180ccaggagttc aaggccagcc tgggcaacat ggcaaaaccc tgtctctaca
aaaaatataa 84240aacaaacaaa caaaatagcc aggagtggtg gtgcatgcct gtcatcccag
ctactcaggt 84300ggctgaggtg ggagaatcac ttaagcctgg gaggcggagg tagcagtgag
ctgagatcgt 84360gccactgcac tccacctgga gcaacagagc aagattttgt ctctaaataa
ataaataaaa 84420taataaaaaa cagagaagag gaaagacacc tgagatatat ttccatatct
gaatcaatag 84480gatttatcaa cgttctcctc tacccccaaa actaattcct tcctaaactc
tgttctcctg 84540acactactca taggttaagt ataacagcat tatcacattg gctgtcatgt
gggctcctgg 84600ctagaggctg cttcacagct taatggacaa gagcactgag acagggtggg
tctaaatcct 84660ggctctgcag ctgattattt gtgtgatttt gtccaaatca ctccatctca
tgagcctcac 84720tcttctagtc tgttaagtgc tgaaaataaa agtatccaat tcaattcatt
atttaatgaa 84780ttatttagcc taacaaatag ctattataaa tatttaggct gggcacagtg
gctcacgcct 84840gtaatcccag cactttggga ggccaaggtg ggcagatcac ctgagtcagg
agtttgagac 84900cagcctgacc aacatggtga aaccccgtct ctactaaaaa tacaaaaatt
agctgggtgt 84960ggtggcatgt gcctgtaatc ccagctactc aggaggctga ggcaggagaa
cgcttgaacc 85020caggagacag aggctgcagt gagccaagat cgtgccactg cactctagcc
tgagcaacag 85080agcaagactc tgtctcaaaa aaaaaaaaaa aatctctgca tgaagaatgt
acataaaatg 85140gtgcagccat ttcggaaaac agtttggcag gtcctcaaat agttaaacat
agagttacca 85200ctatagccca gcaattccac tcctaaatat actacaccca agagaattga
gaatatttgt 85260taacacaaaa atgtgtatac aagtatttat agctgtatta ttcattacag
ctaaaaagtg 85320caaacatccc agcagtccat cagctgatga acggagaaac aaaatgtggt
atacccatac 85380aatgtcatat tatttggcca taaaaaggaa gtactgatac atgctacaac
atggatgaac 85440cttgataatg ttattctaag tgaaagaaac cagacacaaa agaccacata
ttgtatgact 85500gcatttatat gaagtgccca gaataggcaa atccacagag acagaaagta
gattagtggt 85560tgccagagac tggagggagg agataatggg aaatgtggaa tgactgctaa
tggtatgggg 85620tttcttcttg gggtaatgaa aatgttgtac aattagataa tggtgatcat
tgtaaaactt 85680tgtgaatata caacatgctg aattttatac tttattatat tttatttttt
ttgagacaag 85740gtctcgctct gtcacccagg ctggagtgca gtggcacgat ctcagctcac
tgcaatctct 85800ctgcctccca ggctcaagca atcctcctgc ctcagcctcc tgagtacctg
acactacagc 85860atgtgctacc atgcctggac aatttttgca tttttagtag agacagggtt
tcgctatgtt 85920gcccaggctg attttgaact cctggactca agtgctccgc ccacctcagc
ctcccaaagt 85980gctaggatta caggtgtaag ccaccactcc cggcctaaat tgtattcttt
aaaagactga 86040attgtatggt gtgcgaatta tatctcaatt taaaaaaaac aaaacaaaac
aaaaaaaaaa 86100cctttgcgtg tgtcaggcac tagggattcg atgctgaata agacacagac
cctaccctca 86160gagaacacag agcccagcag gagagagtca cagatgaatc aagtgttaca
tcatctatag 86220gaagcgccat ggaagaaaga catggtgcca tgagaacata cgcttagaga
agggaatttc 86280atctagactg gggctcaggg aggaatcttt cagggtgatg cttgtgctca
gagttttcca 86340tgtcagaatc agtagaattt atcaatcctc cagaggagga aacagcaaat
gaaaaatctt 86400acaacaggag gatgcggaga cattccgaga gctgatcaag ggctggtgtg
aacaaagcac 86460ataggatgca gagcctgtgg tgtgaggttg cagctggaaa ggtaaaacac
taattacatt 86520ggatcttctg agacaataaa gagtatgcaa taatctcaaa cgaccgaaac
tgaccttcct 86580cctccctaac ttgcttgctt ccactgttgc ccgtatcata aaagcaccac
cctcttctac 86640ccagtggctt aagacacgaa actcaagtca tcccaggctt tctccccacc
tcactctcca 86700catccagcct atcagcgagc ttgtgggtct taccacgtaa agacttctca
tctccagcta 86760ctaccatccc ccaagcccag atcaccatca gctcaggcct ggactcctgc
aacctttcta 86820accgggtctt cccaatccta cccccgcaac atgaccccaa tagcccatca
gaatggacta 86880atcgagatgt agatttgatc aggccacatc ccttgaaagg cttcctgtga
ccctcgggga 86940aatgcacaaa ctcccaatga tggcccctga gtcctgtgcc atctgggtct
gccctctgcc 87000ctctgtgtct ttgccatggt aacctccttc acacccatta atactccatg
ctctctccta 87060cctcaagttc ttcctgggct ggaacattct ctgcactagc ctagccaact
aaccctttag 87120atcttttgtt tgtttgtttg tttgtttgtt tgtttttgag acagtcttgc
tctgttgcca 87180ggctggagtg caatggtgca atctatctcg gctcactgca acctctgcct
gccgggttca 87240agcaattctt ctgccttagc gtcctgagta gctgagacta taggcaccta
ccatcacgcc 87300cggctaattt ttgtattttc agtggaggtg ggttttcacc atgttggcca
ggctggtctc 87360gaactcctgg cctcaaatga ccaccctcct cggcctccta aagtgctggg
attacaagca 87420tgagccactg tgcccaggca acacttcaga tcttaatgat catttccttt
aagtgcctga 87480cctcttgtag taactagcct gactccagca atgaatcctt ttgcaatgta
acctatataa 87540catctgagtt tccctttgat aaaactcatc atatatttgt tcctctgaca
gttcagaggg 87600caagggcctt tgcccacctt cctcaccact atcctctcac cacttaacac
agaactcacc 87660acccaccatg cctcctgcct gacaaattcc taaccatcct tcaaatctca
ctcacctatt 87720accttctggg aggcagtctt ccctgagcac caagacaatg ggacacattc
ctttatacac 87780cctgctgaac atctcttttt tgaggggcgg gtagagatga gtgtctcact
atgctgccca 87840ggctgacctc aaactcctgg cctcaagcga tcctcctgcc ttggcctccc
aaaatgctgg 87900gattacaggc atgagccact gtacctgacc gcaactgggt tagnnnnnnn
nnnnnnnnnn 87960nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 88020nnnnnnnnnn nnnnnnnnnn nnncctgggc aacaaagacc cttcctctac
aaaaaaaaaa 88080aaaaaaaaaa aaaaaaaaaa aaattatttt aaattagcca gacatggtag
tgcatgcctg 88140tagtccaagc tacttgggag gctgaaggga gaggatcact tgagcccagg
aggttgaggc 88200tgcagtgagc cgtgatcgta ccactgtact ccagcttggg caacagagtg
agacctcatc 88260cctaaaaata aagaagaaaa tatggcaatt tgactgtaca tctctaatgg
gatatatcct 88320aaggatgaga aaggaataag gaaggacaga aaaaaggaaa caaagaagta
gcaacagtat 88380ttagcaattg tattgttatc aagtaacatc aatattggta aaaccagtaa
ttatatttaa 88440aatactatat atgtgtatgt acatttacat atgcatatgt taggaaccaa
gtttatcaga 88500ggaagagaaa gggctacaaa tgtaaaatca aggaaataaa aatttgaata
aaaatatcag 88560tattaagtat ttatgatatt tttcttataa aaaaattata tatatgttaa
ctctatccaa 88620aacccaaaag cagtgacaac ccaggagcaa taaaaaacct cagcatccag
actgtagtct 88680ctaccatttc caattaaaga aacccagggc tagttgggaa aaatgacaat
ttcatgtcta 88740gggcaagaaa cacacctagt gaaatggacc tgaacattta attgtgttag
aaagtaagga 88800aactctctag aaataatgtg atttcatcta aaagacacag attctgggct
ggtaaagttt 88860tcaatggcca aaggtgagac aatttgagca tcaagaagaa tcatgacaga
acagattaaa 88920acatgtcaaa tatattttaa aatgaaatat tataaaagaa acaattagta
gccatccctg 88980aaggtcacta gggcaccaac tcatatttca aactggtaaa taaatgtgta
agccaagcat 89040ttatttctgg gtaacaaaat agtaaggaat gtttttcttt ctagaagaat
tctagtgatt 89100aaaagtagaa gatagaaata gaaaatcatc cttttggcca ggggcagtgg
ctgtaatccc 89160agaactttgg gaggccaagg caggtggatc actggagatc aggagtttga
gaccagcctg 89220gccaacatgg tgaaacccca tctctactaa aaatacaaaa attagccagg
tagtgggtgc 89280ctgtaatgcc agctactcgg gatgctgagg caggagaatc gcttgaacct
gggaggcgga 89340ggtcgcagta ggctgagatt acgccactgc actccagcct aggcaacaca
gtgagactct 89400gtctcaaaaa aaaaaaaaga aagaaaagaa aatcatcctt ttgcgatcct
aatgaaataa 89460tgggcctagg cattgatcat taatggctcc taaaatcact aagtatatgg
ttgatgggaa 89520actttatagt ggatggatca gactcgcaat gtctaaacca gttgatcaat
cttaacatcg 89580taacaagaca acagacacca ggggctgctg acaggagaac agaggaaacc
catagctcta 89640ccactgagtt attcacggca aaaaaaaaaa aaaaaaaatt aaactgcgtt
tcctccaagc 89700ttctaatcct gttgtttaca ggaaataccc aaggaaagga atacttttaa
atgacacatt 89760aaaacaacgc caaatccaaa atatggggaa atgacccagt ttcttcaaca
aataaacaag 89820aaaaggtagg ggggaggact gttctagatt ttaaaagcta tagaagacac
agcaaccaaa 89880tacactgcat ggaccaggca tggatcctaa ttggaacaaa ccaactgtaa
aaggatgtat 89940ttgaaatgat tggaggaatt tgaacagtga ctgcacagta gatgatatga
agaaattatt 90000gttatttttt aggtgtgatc atgattttat ggtgatgttt aagtaaaaga
ggccttattt 90060gttagagata catgtacggg tatagagaaa tatttacgga tgaaatgata
cgatgtctga 90120gatttgcttt gaaaactcta gcaggtgtgg gagaagcagg tgcatgggtg
ggggaaggga 90180tagatgaaat aagtatgcaa aatgttagtc tacttttgtc cctcctgacc
cagcaggtta 90240aaatacctca gcatacctct actcctccaa ccaggtccaa ggatcaggcc
aaaactccct 90300gatgtggtaa acagcctgac cccttcttac ctctctctct ccagccactt
ccctaagatt 90360ccccagtgct ctgtgcccta gccagcccga ctcatctgcc cagattcctc
aatgtttcac 90420tctctcattc accattttga ccccactgtg ctcctgggcc actctccaag
gccccgcctc 90480ttcatctcct ccctccttac tcatccttca ggtcttggct taggtgccat
tccctccagg 90540aagccttccc tgacaccaat cccatcctca cctagaacag attatgtgcg
cttctttgtg 90600ccccccatgg ccccctgtgg gtttgcttca cggattataa ctgcctgact
acctgccttt 90660ctccaccctc tagactgaga acaccttgag aaaaagaaca catctatctt
gtctgtcatt 90720gaatccctgg tgtctggcac catggctgac acataactat cactcagtga
ctagtgtttt 90780aatgaatgaa tgagtgcaac tagacagggt taagaacaaa agagaagacc
aggcatggtg 90840cttcacgcct gttatcccag cattttggga ggctgaggcg ggcagatcac
ccgaggtcag 90900gagttcaaga ccagcctgac atggtgaaac cccgtctcta ctaaaaagac
aaaaattagt 90960gggggatggt ggcacacgcc tgtaatccca gctacttggg aggctgaggc
aggagaatct 91020cttgaaccca ggaggtgcag gctgcaatga tctgagatca caccactgca
ctccagcctg 91080ggcaacagag taagactcta tcacaaaaaa aaaaaaaaaa aaaaaaaaaa
gagcgagaga 91140agatgtcatg gggtaaatga agacctccct tcctggttcc ctgaccagcc
cctgccctcc 91200cccgcatctc acctgtcttt cttgcttcct tctggtactt ctgtttaagc
cggtccatga 91260gcaggccatt ccagggggca cacagcactc cgaactgagt gaaggcaaag
gcatttgtgt 91320aggtgctgac tgcagaagga agagagaggt tggttgatga gaagtttcca
aaactccctt 91380ccaggcaggg actctcccac cttacccttg tctgcatgtc cctcctcccc
acaccatcag 91440accctcctct ggtgtgtaca gccctgctgg gaggctctgt gttcccagct
gggacatgca 91500gatgggctac ctcccagccc taccacatac ctcgtgccat gtccccaccg
gccatgttgg 91560tcagcaagga gttgagagtg ccaatgaaga ggtagtgcca caactgtatc
acagacagcc 91620acaccaggtg ccaggcaaag cgccgagaga aagcgtagct ccagaaggag
cggagttcct 91680gcttctgccc tgcccctggg gtctctaatg gggagaggag gatctgggcg
tgaattacga 91740ggaaagtgga caggtaggat ggggagtgtg gaggcttcaa tggaacattt
cagatccggg 91800cccaccttct acccttggct cccagaactc acccggcgtg ctctgcacct
gcctcggact 91860cacacatccc cacctccctg ctttgtcatg ctggccctac caccttggat
gaccctctgt 91920gttcttctct attgaaatcc gatctgtctc tcacagcctg gtcaatgaca
cttcttgcag 91980taataccttc ctgatctttc tcagcgagaa aggtgaaagg aacgacaagg
agaggagaaa 92040gtcagaaggg agaggagaat gagtgtggat actctgttct aacctgctcc
tcagcacctc 92100cctttctttt gataccagta tcctgagttt ctttgggaaa tcttcctcta
ccctaatctt 92160catggtccag atgggaccat gaattcagtg ttctgttcct ctctcaaggt
taaccaatga 92220gatggttcct cctaacaagg cagaccggcc atgagtttag attaggatgg
acttaatcta 92280aaataggtcc tacaccctgg caagttcaat gtcctccctt gattttggaa
gcttcccaga 92340accctattct tctttcttta aaataaataa ataaatacat gttttggatc
caattgtcag 92400atggtaaaaa taaaaacaaa aaaatcaatt ttattctgta tatttaagat
atacaatatg 92460aggtcatagg atacatatag ctactaagat ggttactaca gttaagcaaa
ttaacatatc 92520catcatctca catttctacc tgttttgtga caagagcagt taaaatctac
ttgtttagga 92580aagtcccaaa cacaatgcag tttgatgacc tacagtcttc gtgctgtgca
ttagatctct 92640aggcctgttc atcctgctca tctgctcctt tttgtccttc gacctgcatc
tcccatctcc 92700tctcccaccc cgttcttatt tctactgtag ctagctgcgg tttgtgatgt
gtgtaaccaa 92760agacgcagaa cagagaggaa ggaaaggaag cagtgataga gttgggacaa
taagagaggg 92820cggacccagg agacctggag aaatgggggc actgtaccag acttagtgca
atggcatcac 92880agaagagggc agaaccgagg agtgggggga agggaaggca acccatggca
ggcgggcttc 92940aaggggtggg gaagtgatag gatgcgaaat agagaaaaga gggacagaaa
agagacgaaa 93000gccctggacc ctccattaag tgagagggtt gggaagatgc ctaaggccct
ttttctgtcc 93060tgcctttcct gattctgggt ccctggggga gctctggagg tgaggggcca
ggaaaggcac 93120aaggagaggc ttgggtctgg aggagagatg ggttagccag cagggctcac
cttccttcgc 93180tgaaaggaac tcctttgact gtagctccct gttttcatgc tcagctgttt
ccttctcttc 93240ctttgtggtg ccattcccag ggcacaggct atggaaacaa aagccccacc
agcaaggcca 93300aggactgtga gccgaacctg agactcagac tggagggaat agcatggtga
atcccacatt 93360ccaccgcact ttggaatcac cttttagcca ctctgatgcc caggttgcag
accagaccag 93420ttaaatcaga atgtctggag gtgagagcca ggcttccttt tctaagatct
ctatgtgaat 93480ctagtgattc taataagcag caaagtttag gaagcatgaa aagagtaggg
caggcccagg 93540ttcaaatccc agctctgcct cttcctagca acagaaagat ggctcagact
taacccttct 93600gagcctcatt ttttgcattt agaaaatgga gataaggata tctcagagga
ttattgtgag 93660gatgaaatca gagagcacat ggggtctgac aattagtaag tgagcagcaa
aggaatgccc 93720ttcctctact ccttgtggca aatgactgca aaaatgatca catttcttca
cctcctctgt 93780atttccccca atttgaatga gactgcagct ctatttcccc atgccctgaa
tctgggccag 93840ccttgtgaac tgcttcagcc aaaagaatgc agcagaagtg gctgtgccaa
ttccaagctt 93900aaatctcaag aacgcttgtg catttctgca ctctttcaga accctgaaat
cacggtgtga 93960atgagcccac gctggcttgc tggaggatga cagccacgtg acccaggcat
ccctgtcact 94020ccaaacctat gtgagtgagg ccatcctagc atagccagcc cccatgtaat
cctccaaatg 94080atcagatgta tgaatgagcc ctgtcaaaat catctacatc tggccctgat
cagcggaact 94140agccagctac ccacagactt gtgaaaaata ataaatgctt aacattttag
gctgctgagt 94200tttgagatag tttgttatgc agcaatagct aacagatgca ctgctccagt
cctcctcctc 94260tcctgtgata ggtttgcttt accctgtcca tcccacccta gggccaatga
ggggctctgg 94320cccacaatca ccagatagtc cttacccata gctgtagttg gggggcagtg
ggtatgggat 94380gtgcccccgg ggcatcagga ggaaagtgcg tgctacatgc caggtactgc
agacagagat 94440gaagatgaag gaggccctga ggctgatgcc tttttcataa agaagctgca
gaaggagaag 94500gaaaaagtca gtgtcacacc cacgttcata gcagcactat tcacaatagc
caaaggatgg 94560aagcaaacta agggtccatc agcagatgaa cagctaaaca taatgtgatc
tatacacaca 94620atggaatatt attcagcctt aaaaaaggaa agaggcaacc atgctggctc
acacctacaa 94680tcccagcact ttgggacgca cgaggatcac ctgagcccag ttcaagacca
gccttgacaa 94740catagtgaga ccctcacccc ttctctagaa aatttttatt taattagctg
ggtgtggtgg 94800catacacctg tagtgccagc tactcaggag gctgagtggg aggatttctt
gagcccagaa 94860gtttgaggct gcagtgagtc atgactgggc cactgcaccc cagcctggac
aatgaaacat 94920gaccttgcct ccaaataaaa aaaaaaaagg aaaggaaaga aattctgaca
catgctgcaa 94980catggatgaa ctttaagagc actatgcagg gccaagctca gtggttcctg
cctgtaattc 95040tagtgcttta gaagaccaag acaggaggat tgcttgagtc caggagcttg
agaccagcct 95100gggaaacagc aagacctcat ctctactaaa aataaataaa taaatcagct
gggcgtgatg 95160gtgcacgcct gtaattccag ctacttggga ggctgaggtg agaggatatg
attacatgat 95220tacatgcctg taatcccagt actttgggag gctgaggcaa gcagatcacc
tgaggccagg 95280agttccagac cagcctggcc aacatggtga aaccccgtct ctactgaaaa
tacaaaaatt 95340agtggggcat ggtggcacgc acctgtaatc ccagctactc gggtggttaa
ggcaggagaa 95400tcgcttgaac ccgggaggcg gaggttgcag tgagccaaga tcctgccacc
gcactccagc 95460ctgggcaaca gagcgagact ctgtctcaaa aaaaaaaaaa ggttaagata
gtaaatttta 95520tgttatgtat attttattgc atacaaaaac atcagcagaa gaggcagggg
ctggaaccct 95580gttttctaag gagtcctagt acaagccatc acctactatc ctgtaagctg
attagggaca 95640cctggtacac acatgccccc acccacccca agacacaccc ggcagtagag
gagtcctcat 95700acgacccatc cccacagccg gtggagcctc ctcgtgtggc tccccagaga
tcttctagcc 95760cagtgccttt tttcccccaa cgacagcaaa ggccttttgt tcaaagaaaa
ttttacacaa 95820aaattcatct tacaaaacac accaatgggg agcttgccag tcatctccct
ctttattctc 95880cttggtgact ggtatgacat caaagagaat ccctaagttc ctcaacagct
cagtttgaaa 95940accaccgacc tagcccaacc tcctcccatt ttacagagag tgacgttgag
gtccagagag 96000gtgcagtgaa ttgctcaata aattgacaga gtaagcagca gcaaagtcag
attaaactaa 96060gaattcctgt tcctgctccc tttccccttc caactctaga gagacaggag
agaggctggg 96120catggtggct catgcctgta atttcagcac tttgggaggt caaggaaggc
ggattacttg 96180aggtcaagag ttcaagacca gcctggccaa catggcgaaa ccccatctct
actaaaaata 96240caaaaataag ctgactgtgg tggcacgcct atagtcccag ctactcagga
ggctgaggca 96300ggagaattgc ttaaacccac taggcagaga ttgcagtgag ccaagatccc
accaatgcac 96360tccagcctgg gagacaaagt gagactccaa ctcaataata aaaaaaaaaa
aaaaagagag 96420aggaaagaaa gatgaggcag ccatctgggt tctccagggg aaggagggag
aacccagaaa 96480gtgactctta tgccaggagt agaaaggctt gagtgcctca ggggctcagt
ctctgcataa 96540ccctccaaac ctccaaagct tatgggacta agctagactc atgtctgggt
ggtgactgcc 96600agagatcctc ttctctgccc ccataacctg caggcagtgc caactgcctg
tgacctaaca 96660ctaagcccag agagaagtcc caggttggat ggcttgagat ccacactctt
cccttccttt 96720cactcagcca tctgtggtgt gctggcttta gtcctccagc ttgctgcctc
ataattgaag 96780catggttgcc acaactccag ctatcacatc ctcacaccac aacattcaat
gaggaagact 96840ttgtttttac tctgctttca ccttgcgtca gggaagaaaa gtccccttga
atcttccact 96900atatacactc cctttatctc attaaaaagg actggatcat atgctgacct
ccacctatca 96960ctagcaacgg gtaaatggat tgccatggtt ggctttaatc aatcaggatt
catcccctgg 97020gctaagcggg tcactgccca gataaaactg ttcgcaatga ataagacaga
atggttgttg 97080attgacctct aatagccttg gcaacagttc atcccctgat accccaacat
cagccactgg 97140gacagctgga caagcctctg tgtctgcccc tgctgtaccc actagccact
tgccaccttc 97200ttgtccaaac tagaagctca cagcagcaaa cgccccactc taaaggtccc
ccagcctcta 97260cccaacactg gcccaagcac attatgacca ctgccacaaa agcttgggca
agtctgaaga 97320aggggcttag cggttacaag ctcaggctct agaaccgaca agcctgggtt
caagttccag 97380tatcatggct actagctgca gaaccttcaa caagcttttt aacctcagag
actcaaatgc 97440ttcatctgta aaatgggggt aacacagtac ctacctcacc gagttgatgg
agacaaataa 97500tgcaggttca caagacaagt gtctggcata tacaagtgcc cagtgaatgt
aggctgttgc 97560tatatttacc ttaataataa ggaagactgc cgaggaagag tcaaatgctc
cattgtacag 97620agtgatgatg gtcgaacggt gttggccaaa taggttccca atctggggat
gataggacta 97680gcctggatca cttatttatt catgaaacag atacttcctg agcacccagc
atgtggcaga 97740ccctccttat acccaaactc accctccacc gctagagctc ccacctcagc
ttgggccaac 97800cccatctgag gcagccaatt atagaaaagg gtctctcctt ccctccacct
tcccgccacc 97860ctgccgagtg cctgggatta gggaaggctc ccacctgcag gttggtgatg
agaaacagga 97920ttcccccaat ggtgagcatt ggcatggcca ggaagagcag cacggctgag
cctggaccat 97980caaagtcaga ggtagggtgg tgtcatagtg caacccaaac atggagcccc
aaactctgct 98040cccacctgct ccaaattccc aacaatcctg gtatccaggc cccaattcta
gccagcgttc 98100cagcgtcctt caagggtttt taggataccg gccaaggctt ccccagatat
ctctgtggaa 98160gtcttctgag ccaccttctt cccaaccaaa gttggtcctc agtctgtggc
aggccaggaa 98220gtctacagac agaggcagag ctctaagtga agccacctct ctcttccctc
agtaaaccac 98280aagctgcctc tccctttcat ccttgacact cctggaaaag aagaccctgg
actcaggtcc 98340ctggctcaac cctctagccc attccctaat tcatggtatt ggccttgagc
ttcaatcatc 98400tgtttaatgg gaacaacagt tcctgctctt cctgtctcag gtgctatgag
aactgagtga 98460gaaaaggacc atggtctttt ctttgttcac taaactctga gcacttcttt
ggtgccaggc 98520attgtgcttg gcactggaaa tgcaagatga atcagatagt ccttgccctt
aaatagactg 98580acatgcaaac aaatggttat aacaggtctg gtaagtgtga gaccacagca
aaaaagctca 98640agagctgggc taggggaacc cttgacaaat tcttcctccc caaaccagac
ttctgcccac 98700cattattctg gccacaacct atgcctgtcc tattatttgc taaaatgttt
taagttgact 98760cacttttatc caaaaagtat ctatttttaa aggacacttt atatcactac
tgtagatgaa 98820aacactggca ttacttgtca tgaatagaaa gtaactgtca aaataaatac
aatgaaagga 98880aaacaatgtt attcaattgt agctggatgc atttgacctt agaatgttca
aagcctaaga 98940cctgctcttc ccatcagtgt taaaatcaca ctggccccac atgaagacat
tctttcatga 99000aatcagaagg actgaaagag aaataaaaag ggaatagctg ttctaccagg
tgatttgatg 99060tttgttagtg tagttcacgt agtatgcgtg tgcccctaac atcctcttaa
ctaccgtgct 99120ataccttaag aagcactgcc aagagctaat tttagagtat tcacacagtt
taccattcaa 99180tttctgtctt tataaaatgt acatctctcc tactactaaa ggttggagac
tcctttcaca 99240atagagtcct tatgggctca atgctttttt caaaactgaa aagccctata
ttatggagga 99300agaggaggat tgttgctcag acgatttgca ggcacgagtc aaacattacc
cagccaccac 99360ctccacattc agttgcttaa aaatcattta caggctttta gagtagatga
tgctggtttg 99420ataaggagag tggtttgaaa taattggttt gaggtgctgg gccatctcat
gagatctgtg 99480tgaacaaaga cactcagcct ctgtgtttgc ccagcatgag tgcagacaat
ctcatgatgc 99540tgtcagcttt agcatagctt acacacacaa gagtaatgta ctttctttcc
taaaccaaaa 99600attgagccac gggtctaaca ctaggaagga atattgggag gcatctcgtg
gccaccataa 99660ccaaggcaat gacagaaaga agagtgaggg atcaggaggc ctgcacatca
ggcccacctc 99720ccacttgctt tctctgtggc catggacatg tctttgcaag gggtcctgct
gtggcttcag 99780tttctcctct gtgtaatggg tggaagggtg gtggaaaata aaccagattg
gagttccaga 99840cttaacagac tggtgaaatt ttaaaacaaa gattttgagt acaatagggt
tgtcaacttt 99900taccctgcta agtaaggata tttgcaaaat ggtcattcat ataatcattt
cattaaaaag 99960agaaagagaa cattttaaca cataggagaa ggatgtaaag gttttttgtt
gttgttttgt 100020tttttagggt tttttgtttg ttttttggta gagtctcact ttgtcaccca
ggctggagtg 100080caatggtgtg atcttggctc actgcaacct ctgcctcctg ggttcaggca
attctcctgc 100140ctcagcctcc tgagtggctg ggattacagg cgcgcaccac catccagcta
atttttgtat 100200tttttttagt agagaaggga tttcaccatg ttggccaggc tggtcttgaa
ctcctgacct 100260cagatgatcc acccgcctca gcctcccaaa gtgctgggat tacaggagtg
agccactgca 100320cctggccaga tgtaaagttt tgaataaatt ctactctctg aagtaatccc
tctccatcat 100380ccttgctttt cacattttct caataaactg ttttcacaga ccagcaatag
ctcaagatcc 100440ttccaggatt ctttcaagct gcagatctct gaataaccat gtggtctgta
tatcttgcct 100500atagccctct gctcacacct gccccagcca ccaggtgcct ctgagcttgc
atccctccca 100560cccacctgac agcactcacc tgcagaggtg aaggctatga tgagtgtggc
ggtggtgtag 100620aaaaatctga aacacataac aggaaaagca gaatattgtc aaggagggag
aaacctggga 100680gaaaaaacat gattctgctc agccagccca caagtgtagg acttgaccgc
accctcagcc 100740tgggatgcaa cgggcactga tgcctctgag ccccaggctc aaaaccaggc
gcaagaagcc 100800gcgatgagat tgagatgtgg tcctgacctc atggacagtg cattttgctc
attctgaggc 100860ccaaggctag catggaaagt cttggacaat gagctcagct gacgatgtga
ttggcttggg 100920acttagccag gacagaatgg gcaaagcgaa ggtcctccca cctggaagcc
ccaacagccc 100980aaccccttgg agaaaggggt tagtgcctgg tctgcaaatc aaggccttga
gttctaactc 101040ctcctcactc tgtgaccttg ggcaaggcgc tgtccttctc tgggcctcag
gagccttttc 101100tataaaaaga aatgatcgga ctgatctagc tcagagtgct atgatttcag
gactacagtc 101160ccaaggttat caggctccct tagcatttgg gggtcttgta aggcatggag
taaaaaaaaa 101220aaagcaatat cctaaggctg gagaagaggg aggggacaaa ggaaggggag
gaaaggggag 101280gtagcaggga gccaaggacc aagaaggact gaggtacagt cattctgcat
ccaaaggctt 101340aaattgtaag ggactggctt tactctggct gtttccggaa aggcaggccc
agccagccct 101400cccgtctctc tctctgacag ccaatctcac atgtgcctcc ctgggagcac
ctgctctgag 101460ctgtatcagc ccccagcagg ccgctgatta ccactgagcc tggccacaga
gcacgagatt 101520aggatgcagc aacacactgt gtgtgagatc acgtcccgaa ccttctgact
catctgcaca 101580ggaaaccccc ccagtctccc ctccagtcag aagggacctg aaattccacc
agtggcaata 101640ccaaagaaac ttcctattag ctaagcccct agggagtgat tggctgttgg
ggcggggagg 101700gggggcggtg aggaggatga ggatgaagcc tgggcaacct ggatgtgagg
ctgtgcaggg 101760gatgaggaca aggatccttg gggtgaagga agagaagagc aattttaggt
tttgctaatt 101820ttgtaaccct ggctccaagc cagcccttac aggaagtcac cctggcctcc
ggctcaattc 101880agcacgtgat agggaagcca catttatgca gagcagggaa cgaggtaagg
aaatggaagt 101940ggggctgtgg tgaagtgggc aagtctagag agagtcccgc tgcctggggc
tgttcctaac 102000agctgctggg agcgagctgc aggtgtggtg cctggcaggg tggccgggct
gtctgactct 102060ggatttcact ccaagctagg ctgctgcctg aaggattcct cttacccacc
tttgcctggg 102120ctggcctttg ggacttacat ggctatgagg cgtgccacgg tggtcttgaa
ccggtcaaag 102180atgtagccag tggggaatgt catgaagttg ttcatgaagg accccagggt
gaagatgagt 102240gagaacctct catcctgggc tttgcagtct ggagtagaaa aaaggtctcc
catgcatccc 102300agccttcctg ccaaatgagc acacaggctg ggctcccctc cacctcagac
agcttgtcgg 102360tcgcaaactt gtcccttaag ctgagttgaa atgtggctgc ccctaattac
ccctcaggag 102420ctggtgcctc cctcccaggc acttcccaga tcaagtgggg tgagagctgc
tgacccttcc 102480tctcatcata gaaagagggg tgggcagggg gcagagtcct tcctgctcct
tgccaccacg 102540tgggagccag acttaacttc cttagaaaag tcatccctgc ccttaccagc
ctgccctgtg 102600gcattgccaa tcggcccagc atctggtcca cacagatcct taaagtaatc
ttcattcttg 102660aagacaaaca ctagtgaagg ccagccaaag aggacgccag caaagcccag
gcattccagc 102720agcccagtca gcagtgtggc cacgtgcagg ggcaggccct ggcccgccat
gagcagaagt 102780ggagtggatc ttcaaatccc actttgtcct cctggacgga tcacaggcgc
cgtaagcctg 102840gcgtttgagc acttggaaaa ttcctctggc aagccaagcc cttcctttcc
cgtagctctc 102900tggttgtttc aggcctgggc aaaaaccatc agcgggtgat tctctggatc
ctgtagaata 102960aagatagagg ctgctggaag aggaggcctg cgggaaaggg aaaggtagac
tagagttatt 103020tgtgaggtgc attaagaggc aggatgatca tggccgctgg cagcaaatgt
ggggaataaa 103080tactccaata catcatctta ggcactgcat ttgagtaacc acgtggcaag
tagagaggca 103140ggtcttgatg gccacctgga gtcacaggtg agataaacga cttacccaat
agctccccgg 103200gagcaggtgg agaagcggag ctcctgcgct cgaattctga ataccgtccc
ctaaaataat 103260gacagcaact caaccaggtg cagaggcagg gagatttcat acagaagaca
caaactcccg 103320ctgccaagtt ggtgttatct tcagtttact gacaatgaaa caaaagctcc
catggatttc 103380aggaacttgc ccaaggtcac agggctagtt tagtcacgac gcaggccatt
ctactgccag 103440aaataccccc aactcccatg accctcgcct aggactcgca aacctggtcc
ccgccgccct 103500tcctcgcatc aacttctacc aggaaagcct ccgggggccg ctccccgcca
gcctccgcac 103560cccgctccag cctgcggcct gccctccccg cagaggagcc cgaggggcca
ggccgcgctc 103620ggcgccccat ggcgcccgaa aggggaccct tcgccctacc cgcctgctcc
gcgccggggc 103680tctccgcgcc ctttccgcac gggccaggtt cgcattcgcg cctctcgcag
cccctcccag 103740tcccctgctc gcctccgccc cctcctgccc gcccggaagg ggctggggca
gacctcccac 103800tctccatcac ttccttcttc ttttcccttg ctcacagcct cccgcgccct
ttttacctct 103860ccctcttgaa acttctccct ctagaacccc ctagaacccc agcggtgtct
ttccctccct 103920cctcgctgcc tttcagcctc ccagccccct tgcctctgcc tcccctaacc
aagttagttg 103980aatgctgtta ctcgctcagg cccacctagg gaaaatgtca cacccagcac
ccagaggaca 104040cacagacagc acatgagggc atagggacac acacactcta tttgtgcatt
ttgccttgac 104100cgctgggttg gcagggaaca tatttttcct atttgctcac cagcttaacc
gtctctccca 104160gtttcacact cccagagctg ccaaaaaaat cccaaccaca gaatcaggaa
gccaagaacc 104220aggactgagg gcttttcaga aaccatcccc tggaggactg ccccatattt
tcactcccaa 104280aaacccctta gatgactccc tgcctcaccc ccgcccccca ggttctgaaa
gagccttccc 104340gccagactgc attgattaac cattcattgc cccatttttt attaatcaaa
gacatatata 104400attgctcatc ggagcttgtg atcagcgtga ggccttacta agcagctgcc
ttactatcct 104460tccagcccag agcacgtgag ctgacgtctt ctttggcctg tgtggccgtt
tccttgccaa 104520aagctcagtt tggggagagc ttcttgcgta ttagatgcag tctgcagact
cccaacccca 104580gctacctgga tcccctgagg gcccaggaac tccagctatt ccaagcccac
tcctcttttt 104640tttaagagga agaaatagag gttacgatag gggacagcca gaactgagga
ttttccagct 104700caccaccaaa gcacaaaaga taaaagtctg caaccaccct agtgacttga
ctgaatggag 104760gaagggtggc tggggtcctg taccccaagc tactcactag ttatacaacc
tgaggcaagc 104820tctttggctg ccccacctgt aagacgagga caatagtacc ttaattatag
gaattgtcat 104880aaaagaagta taagatgggt gtatgaggtc cctgcatggc gcaggtgcta
taggcagatt 104940gtagggtagt agattttcta gtctgcagtt atgtagacag agccagagaa
gcagctctgg 105000ggaggaattt caaaggaact tgcccacggt cattctacaa agctgcagta
ccttcccaac 105060tctgaaacgt atgctctcat caccccgtct taacaaacat ttggacatta
gagaaaacaa 105120gtcttttctt aaaataacat tatttatggg agaaaatcca caaaaatata
gcatcccagg 105180acaaacaggg cttaagatgc aagattttct attttactgc aagacacaaa
gactctgaaa 105240ttaatgcatg ccctatcttc tgctctggca tacattttag tctcctgggg
ggatcagtaa 105300gtgtggaagt agcaagggag aaacagaaaa aagtcaaagt aaagagacag
attttagaat 105360gttaatctgc aggagcctgc cagaaagatc tagctcatgg gctatctgta
catccaggac 105420tgaagcacgg gacacggggc aggtcgtcca gggttctgtc caccttatct
tgttacctct 105480cttgactctt agagcctcca ctccacatct cccatcaatg tctgcagaag
acgtggcctc 105540cactaacaca agtcttactg aactgatggg acaggaaatt agaatatcct
ctgaaccatt 105600cccatgttct ttggttcgaa ttccagcagc tagaaaaggc agatgctatt
ctgatcactc 105660tcctgcgtgg ctccaatgag gattaatgag taacatcaga gagagaagtg
attataataa 105720ggtctgacgg tgcacccgat gtcttcatcc ttttctcttc gcctccttcc
tcatcatctc 105780acaccttttt ttttttaatt gactgattgg ttcaacaaat acatgtggta
cctcaggctc 105840tgtgccaagt gccgggattc gtagagaaga gattcagtgc ctgctctcaa
ggggctcatt 105900ctcttgtggg agagacagac aaagaaaccc aagatttctg gagtgtggga
atggtcttcc 105960aggcagatgc tagcacagca cattgaaagg cacggaacct caacaaaaca
ataacattta 106020ggaaccagct agagcacagg gtggtgaaga aagtggaaag atttgaggcc
agcgtcgcca 106080tctaagtgag ggcattaaga attcagccca catcaatcaa tcatgtccta
ttgatttcac 106140cccttaatat ctctcctatc tatccgtggc cactgctcta tgcagacact
catcatctct 106200cacagaggca tcatctgctt ccaagccatc gccattctcc tgcaagagtt
tatttccatg 106260gttcccactg gatggcttca cttaactgct caaaaccctt ctgaggtcca
gtcaactggc 106320tggtaaggac cagtccaggg tctggggatg ccagccatga gacattgctt
tgaggggaag 106380agggagcata gaactggatc tcctgcatcc tactgcccaa gtaccaatgc
tggaggtggt 106440tttccttccc atcatcagca agtctggata tccaggatcc accctatgga
tgtttttatg 106500gacagagtgg gaagatggat atgtttaggt tagggaaaga gggtttgcca
aagagggcag 106560tataagtgag ctgcactcca tcattcccct ggcacaaaca atggctagta
tcctctagtc 106620ctcaagagca ccaccttcca atgcagtccc tgcctgtcca cagacctctc
tcctcaaact 106680tcctctgaac aacctcagcg agggcaattg ccactctctg ggcagagtcc
agatattctc 106740tcctaccctc tgacatcact ttctaaattt gtatatgtag ataaactctg
agccattcac 106800ataaagggct ttgatttcgg atacgccaaa cacataaaca aacaaacata
agctttcctt 106860tcacaatggg ctcatgtaaa ttaaaatgtt tggttttcca cctacggtct
tgaaaggggt 106920ttctacagcc tgttttggaa gtcagaaagc aaaaggtaaa tgcaaacatc
atttcacctg 106980cagagaaaat tctaatcctc ttgaggcagt gccaaaaata atacaagcac
actgctatcg 107040agccaattac tggtatctct gagcttccgt ctcctcatct ataaaattgg
aatcgagctg 107100tatggattaa agataatgta tgaaaactgc ccaattagta cactattaat
aaatagcagc 107160tactgttgtt aacaaatatt attgacttac tggaaaacaa agaggaaata
aagtcacatt 107220tagggagaat ttcaaagtgt tcctaaccta aaaaagaaat aaattagggg
gaaaaccact 107280aagtaatggg tgagctcagt ttaccttgct taagaagtcc caccctagag
aactgatctc 107340tagatgacac ccaaatgcac tcagtacaac cccccaagac tgtctgggct
taaggcaggg 107400gcttggattg tcctgtaagc tgtgggaagt ctgttcatga gccacagtag
acaggaaggg 107460gatggagtct tagagctggc tcttcagggt atctcctagt gtgttcaaag
cagttctcag 107520gagggtgggg aactactaca tagccaagta aatatgaggc ctccttgctc
tggggagacc 107580tttctcttta acagaggtga atctgaaagg atacccaaag aggcactgga
gggtgggggc 107640cactctggcc cctcagagca gccagctcag cttcagtgga tgctagaggc
agcagaggat 107700cagcctggat cagcctccct ttcaccatgc agaaaacaga gctcccccac
cagaccagat 107760actggaagca ctgggccagg cctaagagaa agcagagccc caagccccac
cacaccaggg 107820cttatgaggc tactgctacc caccctccca agccccagct ccacttctat
gttcatcaag 107880caactgttta ctggtaactg cacttcccga tagactttgc tagaaaggaa
tgcctcagtg 107940cactgacaat atctaaacct gcaactctaa ggactaggct ggggaacact
gtgtcaacat 108000ggaggcacgt cctacccctg agaagaaaaa taaggaatat caataatacc
tgctgggcac 108060tgagcactga ctatgtatct gattcgaagc gctttttgct taatctgtca
ctgaatctca 108120cgataggtgt tgttattagc atctattatc tgggccaact gaggcctaga
gggacgaagc 108180aactccccca acatcaccag gtagcagtgt caggactggg atagaaacct
gatgctctga 108240ctgaaactaa tgcttttttt tttttttttt tttttttttt tgagacagca
tctcactctg 108300tcaccaaggc tggagtgaaa tggtgtgatc tcagttcact gcagcctcca
cttcccaggt 108360tcaagtgatt ctcctgcctc agcctcccga gtagctggga ttacaggcgt
gcaccaccat 108420gcccagctaa tttttttgca ttttttgtag agatggggtt tcgccatgtt
ggccagactg 108480gtctcaaact cctgggctca agtgttctgc ctgccttggc cccacaaagt
gctaggatta 108540caggcgtgag ccaccatgcc cagccagctg atgctcttaa tctgtgccct
acccagcctt 108600cctgggaggc ttcccaagag ctacacagag catgagttct ggaatcgggt
tgatgggggt 108660accagttatt actaatagga atgaagatgg gtaattcttt cagacagcac
ccttgattaa 108720aacaagagag tagtgctgcc tctctgtgat tctgtgtctc cctgccctgc
tcacacagac 108780accacaccca cccacacgca tgatcatgaa aagaggaaat ggatccagga
gaaggagacg 108840actcctgagt gaaaacaacg gggtttttca cattgagagc tttgcccaac
accccaaaga 108900tgaaaagagc aggaaactgc tggggccgat tgaacactgg acttttgttg
tggaaaaagg 108960caaagggaag ccggaagaga ctggaacagt ttccatggtg ctggaggatg
gggaagtggg 109020tagggattag ctggagggag aaggagaagc tggggtggga ggggaacctc
cacttgccag 109080gagagcacat gtaggatggg aaccccagat gatactcaag gcatggcatt
agaccagaag 109140caagtctgtg gtgaaattag ggaaggctcc actgcggact gtagacagag
cactggacaa 109200ggaagtggga gacccagggt ccagtcctgg ctctggagcc ccctgggtgg
gctgccccgg 109260gcacctttct ctcttggggc ctccattcct acctctgtga agcgagtgct
gaacctctct 109320tagccctgac ttgctgaaat gctgggactc tgtacagagg ctgacattaa
gcagggatct 109380gtcgtggggt gctgcaatgt tcctccagat gctgcacggg agagggcaga
aaaggcctat 109440atggtgagtc cgccctggga gcctctgctt ggaagctgaa gtggcctgag
agtgactcag 109500aaaccacgga agttcccggg gctgatgggt tcttatagat tgtacatgca
gctctcctcg 109560tgggctgcaa aaccgcaaga tgggctgtga ccactctcaa ggaaagagcc
ctatctgcaa 109620aaagcattct gccctccagg tcttaaagca aacacagact caatccttat
tccttttaag 109680acaaaattgc ctcaggggca tcagggaggc agcaggcctc aaatgtgtgc
ctttctagaa 109740ttctcaatga aagcaccctt ttgggtatta ataatgacaa cagtaatgac
agtcatttac 109800tgagtgctgc ttttgggaca ggcattgggc taagagctat atgtaatata
tattattatt 109860tgatgcccac agccacccca taaggaggcg gaggtactat cattatgcca
actttaaaga 109920tgaagaaact gaggcctcaa gagatgaagt aacttggcca agtcactcag
ccagtaaatg 109980ggaagagata gacttcccag tatccagagc ccatgttttc accattatgc
tgaagtacct 110040cttttcctgt gccaatgtga tctgcctcca ggaatcctgt cttgatgttc
ccttccccat 110100acagaagtcc tctctgtgtc ctcttcagcc tgatagtata tcttttcata
ccattctttg 110160gacatctctg ttatactact ccaatggtgt tccctcccct acccctccct
gggagcttag 110220ttgttgtgat taagtatagg ggaaatgacc cacactaaac aaactcataa
gagactgatt 110280gataaacctg aaatgcaatt tattaattaa cactgagaaa tgaaaccacc
cagcagatgg 110340gaatcctaag gctgactggt cagcacaatc tctttcagga aggacaggct
tttgggaaag 110400gaaatcaata ccagaaggtt ctttgttgag tacaaagtca gagggaaggg
agttgatgga 110460ttgacacata ggtgaagctt gacatacctc tataaagcct ccatcctgcc
aaggatcaga 110520atatccaagg cagggagcca tctgggtgtc ctctcctttg gacagtgctg
gatttttctg 110580gatcctatga agatcttacc tttctggctg catttatcat gattgtggaa
ggctttttgt 110640ttccttgttt gcttagatta atttctgcgt atttaataga actgaaaggc
aatttcccat 110700tgagacccac tgaagaggaa taatcaatac atactagttg tgttgccctt
tgcagagaat 110760tcacttctgt gttgtcactg tatcctcatg cttccttata atggagggac
agagatggta 110820aaaacatgga cttggaagcc agaccgtctg ggtttgaatc ctggctctgt
tacttataag 110880ctctgcaacc tcgggcagat tacctaagtc agtttcccct tctctgaatt
ggggatataa 110940tagcacccac ctcaacatct gtcaagagga ttcaatgagg gaatacacat
aaagtgctca 111000gaacagtgtc tgccatctgg taagcagtcn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 111060nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 111120nnnnnnnnnc taatttttgt atttttggta gagacagggt tttgccatgt
tggccaagct 111180ggtctcaaac tcccgacctc aggtgatcca cccgcctcgg cctcccaaag
tgctgggatt 111240acaggcatga gccaccacgc caggccccac acacctttta aacaaccaga
tttcattcat 111300cgggaagtgc ctgtggggct ggtgtggaca tgtgggtgaa ggtggcactg
ggagaagtta 111360ggattctcca tgacctctgt tactcatatt cccacactcc tcaaattagc
ctgagtctcg 111420aggacagtct gatggctggg caaaccctgc ggcaaaccat tccccagccc
tgccctctca 111480accagagtcc ttccgataca tgattctggg cagctgttgt tacccgtgtc
ctccatgttc 111540ttccagagat atccatgcat gcatcggcat atgtgtataa ttattatatc
tatatttcat 111600cccacaagct tttgacatca atagtagcat attattaaat tgttctgtac
attattttat 111660taacttggta tctctggtac tgctcaatat aagaacatat agacctggcc
aggcacagtg 111720gctcacatct gtaatcccag cattttggga ggctgagatg ggtggatcac
ttgaggtcag 111780gagttcgaaa ccagcctggc catcatggtg aaacccccat ttctactaaa
aatacaaaaa 111840ttagccaggt gtggtggcag gcgcctgtaa tcccagctac ttgggaggct
gaggcaggag 111900aattgcttga acctgggagg cagatgttgc agtaagccga gatcacgcca
ctgcactcca 111960gggtaggcaa caaagcgaga ctctgtctca ataaaaaaaa aaaaggaatg
tatagacctt 112020ctttattctt tttgatggct gtaggtggat gttctaaaat ttgtgtaacc
aatctcctat 112080tgataatatt taagttatgt cttcagcatc atatgaaact tacaaacaag
gttgcattga 112140ctatccatct gtaaatgtct ttttgaacat ttctagaata attgcaggat
aaactcctaa 112200aatgagaatt tctgggtcaa agaggatatg cattttacat ttaatagata
tttgtcaaat 112260tgtcttccaa agtggtcgta ccaattaaca ccccgacctg taatgaatga
gagtgccttt 112320tttccccaca ccctggagag atgaaaaatt tatgggccca ctttggagtg
catggtggag 112380gaagctgttg gccgttatat aaccctcgtc attaataagc ctgggggtgg
ggggggagaa 112440agagaggtta gttagtgggt gcaaacatac aattagatag aagtaataag
ttctaatgtt 112500cgatagcaga ggagggtgac tatagttaac aacaatgtat tgtatatttc
aaaatagcta 112560gaatggagga cttaaaatat tccaacacat agaaataata aatgcttgtc
tgcggccatg 112620ccaccctgaa tgtgccagat cttgtttgtt cttggaagct aagcagggtt
gaacctggtt 112680agtatttgga tgggagaaat gataaatgct tgaggtgata gatatcctaa
ataccctgtc 112740gaacattata cattctatat atgtaacaaa atatcacacg tatcccataa
atatgtacaa 112800atataatgta tcagtaaaga gagggctggg cacggtggct cacatctgta
atcccagcaa 112860tttggcaggc cagggtggga ggatagcttg aggccaggag ttcaagatca
gcctgggcaa 112920catagcgaga ctctgtctcc acaaaaaata aaaataaaaa cgaattagcc
aggcgtggtg 112980atgcatgctt atagtcccag ctacttggga ggctgatgca ggaggattgc
ttgagcccag 113040gagtttgagg ctgcagtgag cctacgactg caccactgca ctctccagcc
taggcaacag 113100aggaagacca tgtttctaaa agaaataaat taaataaaat aaataaaaat
aaaaagactg 113160aaaagcagag tggtaagaga aaggactttg gggctcaaca gtactagcct
tgaaccctgg 113220ctgttactta cccatcgtgt gataagcaaa tgccttaacc cctgtgtgcc
tcactttctt 113280aacatataaa atagaagtaa aaatcatacc cacttcaagg gtcattataa
aaagccaata 113340gagataatgt atataaagct tctggaataa tgcctggcac acagtaggag
tttaataact 113400ggaaattcat tgttgtagtg ggcagccttc tgaatctgtg tcctctttgt
ccactaatgg 113460ctttgatctg gatttggctc aggcaagacc tggggaaggg cagagactga
gggcaactgg 113520aggtataggg tggtctgagc ttccccagca gagtgaggct gggaaaggtc
tgggagacag 113580accaggcagg tgctgataag accggaatgg gaggctggag cataaggcag
ttcagttttt 113640cccaaagggg ggtgtacaaa acgatctcgt atgactcctt tatactgtta
atgttttcat 113700tttatcgcgc actgaaaaac aaaaccaaca tatttaatga atgattccaa
ggggattctt 113760gcttttacaa aaaatgctaa agtaggcatt cacatgttta aaaattgagt
tgatttaaat 113820tttaaaatta ctaagtcata gtacataatg tgtgagccac agctatcccc
aaaatcatga 113880tagcgataca ttaatgactg aagttcttta aacatcaaca tacaatgcca
attccagaat 113940tcagctcaaa ttctgcaatt acacaggctg gggttgaaac ccagcttttt
tgctaactgt 114000gtaaaattag gcaggagggc taacctcgct gaatctcaga tgtctagtct
gtaaattgaa 114060gataatgttt gtttttatct cacagagttg ttgtgaagat tcaataaaat
cacaacatgt 114120gaggatgatc tggctgtgac acctgtcacc ccactgatct ccagagttga
ttcggctgat 114180caggctggct gggcaggtgt cccctttctc cctcaccact ccgcatgcat
tcctcccgaa 114240actgcacact tggtcaaaga ggaagacctt tcctgataga ggaggaccat
tcttcagtca 114300agggtatatg agcacctgtt ctgtcctgcc agaatctccg aaggagctct
cagtaaaatc 114360acaagatttt attgtgcatg gtagcatgag cctgtaatcc cagctactca
ggaggccgag 114420ctaggaggac tgcttgagcc caggagtttg agaccagcct gcgcaacata
gtgagaccct 114480gtctcaaaaa aaagaaagaa agaaagaaaa gaataataat agtaataaat
cacctgtgca 114540acgtgctcac ttctctcttt ggaatgtagt aagtgtacct aataaatgtg
atcattgtaa 114600tcatcacagt gagcacaggc taaagcatct tgactttatt ctataagcaa
taaaagagga 114660tttgttttta cagaactcat tatgttgtga aaataatttt ccaacattaa
caaagaacat 114720tcttcaagta aaaggaaaac cacccatcat tctcccaacc ttcaataatt
ttcaattttg 114780catattctcc agactttgtc aacatgaata cttactttac atggtcgcaa
tcagtgttca 114840tgcaaattct tttatcctga cttttataaa caaatatgat gttataaacc
ggtctccatg 114900tttctgcata ttctttataa ttatcatttt gtggctgcat aatattgcat
tgactatgtt 114960aactgcagtt ttcttaacca tttcactgtc tggggaaatg gaggataatg
ccagggtcat 115020gcctggagct tttttttgtc tattgcatta tattcttaag atcaaatccc
agcagtgaga 115080ttagtcagtc aaaaagtaat aatattttca aggctcttgt tatattttac
tagattgttt 115140tccagagttt tgcacactgc tcccagagat gtaggaacac agacgtcatc
caaccttgcc 115200agtgctgggt gatggtgttt ataaacttct gctaatttaa taagtatgaa
atgctatcct 115260cacacggctt tcatttctat ctctttgatc attaacaggt tgaactattt
tccaagtatt 115320tgtttactct ctgcataccc tcttgggtga agtagtcatc cacttccttc
acctgtttat 115380ctgttgaagt cttgaggctt gttttataaa tgtgagcgag cacttcagag
tcaatagaca 115440ttaattgctt ccagccagat ttggccactg aggctcctga gcaggggaat
gcatgatcaa 115500aactacaccc tggacagatt aaattaattg gagaaaatgg gctgagaggc
agagatatgt 115560gtcactggcc tactgtgttt gatcctatag tgggggcctg aactggggca
acggcctgag 115620tcccccacta ccagtagcag gaggctccat gtgtccccca tattagagct
tgcggcactt 115680ccatttgccc cacctcctac aataccccac atacatgtac tcactctccc
ttgcaaatct 115740agtggcttca acccacagaa tttaagggga aaggaattgt tctgtcctgt
tcacttactg 115800cagaaatgag aaaagcgttg ttcacatggg atcacctaat gaagggatgc
catccccaac 115860ggtgcctata aggaaatggg ggagggttgg agagttgtgc aaaatgcaac
agggaatcat 115920cagagtctct tgccccatga tagagggttc tcaaattaag agagtctaca
gcaactaatc 115980tcacggccac tctaggcagg gcttcccaat gcttccccaa ccccacctcc
atcctagact 116040ttacccactc tgctgaacac agatgttacc catagcacct tgcaccatga
ttgtttgatt 116100agcacctccc acagtagact gtgtttctga taggtcagca acatttgctg
agcacctact 116160ctgcagggct gtgccaggtg cacaaaataa acaaagccaa agacaacatg
gaccctgaac 116220tcagcaagtt cagagtcaag tgggataggg aggctctctt cactggaagg
taactccaag 116280aaacaatggg actcaacttt ctaaccaaga gaactccagg gagctaaaat
tctgacttct 116340ggttaagact ggtgtggagc ttcattaaag aagaaaagat tcacccagac
ttgagttcat 116400agcctggctt tgcagctttt aagtcatgta acctttgatg aagttatgtg
acctctccac 116460ccagctgccc ctaacacctt gcaggggcag ggctggagtg caaagggagg
cactggtacc 116520acagcctggg aggcaccacc ccactagtgc aagccgggca acctctgccc
ccaaggcatc 116580cctagcctcc caactgcaag catcaatctt gcacttggaa aggaacctca
cctttgaaat 116640ctaggttcaa atttagaatg atccagctcc ttgaagttct atacagaaat
acagccagca 116700gccaggcccg gtggctcacg cctgtaatcc cagcactttg ggaggctgag
gtgggtggat 116760cacttgagga cagaagttcg agaccagcct gaccaacatg gtgaaacccc
gtttctacta 116820aaaatacaaa attagccagg catggtggta cacgcctgta atcccagcta
cttgggaggc 116880tgaggcagga gaattgcttg aacccaggag gcagaggttg aagtgagcca
agatcgtgcc 116940attgcactcc agcctgggca acaagagcga aactccatct caaaaaaaaa
caaaacacac 117000gcagcttccc tccacttccc aaccacagct ccatctcaga caacaagggg
cctcatgtcc 117060atgcacatga atatccaacc aacatgtcta aggcccaacc acaccctctc
caaacatctg 117120ccccttggcc accctttggc catgggttca tgcactggca gaaaggtagt
tcagagaaga 117180agcccacaaa gggccgggaa gtccacttgg gctttttgag attccagggt
ccaggataac 117240ctaagtgtgg tctagaagag agatgcagct tctgggaggc acattccttg
gtcttaggga 117300cttcttgccc ccatggaggg aaactggcta gatgagggcc aaagcagagc
cctctaaagc 117360acagggctca gggaaggact ctttttgacc agatctaaga gcagcactac
ctctctgagc 117420ctgtttctcc atctgtaaga aggggacatt aatagactct ccccgctaga
gttactctac 117480atcagccagc acacgtaagt tcatgacatg aagcaagggc ttaatatata
cccgttgtac 117540tataaataat aggccaggcg tggtggctca cacttgtaac cccagcactt
tgggaggccg 117600aggaggatgg atcacaaggt caggagtttg agatcagcct ggtgaaacct
catctctacc 117660aaaaatacaa aaattggccg ggtgtggtgg cgtgcacctg tagtcccagc
tacttgggag 117720gctgaggcag gagaattgct tgaacccggg aggcagagtt tgcagtgagc
caagatctca 117780ccattgcatt ccagcctggg tgacagagca agactgcatc tcaaaataaa
taaatacaca 117840catacataca tacatacata catacataca tacatacata catacaatac
atggacaggg 117900accctaaaaa tgagacaggg aaagagaaaa acatgttctg acaaccttgc
cctttatact 117960aatttaggtt ttcttgcctg ttttagaaag ggcctggaca ggagccctgt
tcccctcagg 118020ccaggcagaa caaggtgtgg aactcactgt ggaagggttc tgggtgacaa
gtgcagcccc 118080gtccctccac ctcccagcac agtaggcagc acgtgtctcc attgactggc
tcaggagcag 118140gcctggtgac cagtgggaga gctgaggagc ccagggtggg gtctgaagga
atccctagaa 118200aatctgattt tcccccaggg cccacatcac gtgcccagag ctgggaaagt
ggaggcagca 118260tgggatctag ctgagaggct ccatttttgg tagcttctag tttgggagtc
acagagacac 118320ctggatgata cgaagatgta gctttgcagg actctctaga acatggagtc
caagatattc 118380ccttcaatga tgggacactg aagcccacag aggagaggtc tgtcccagtt
actcagccat 118440tcggaggcag agaccaggct agaactcagg acttttaatt tggaccagga
ttccttttac 118500cacagtgggc agccctagca agtgccaggt agggtggaac tgtgaaggtc
atccgagggg 118560tagtacacgt gggtaggaag tcatatctaa gaactgaccc ccagacctgg
ctctgccact 118620cactccttat gagaccacag gtgctgggtg cagtggttca cacctgcaat
cccagcactt 118680tgggaggcca aggcaggcag attgcttgat tccaggagtt cgagaccagc
ctgggaaaca 118740tagtgagacc cccacctcta ccaaaattag ccaggcgtgg tggtgtctgc
ctgtagtccc 118800agctacttgg gaggctgagg tgggaggact gcttgagcct gggaggcgga
ggttgcggtg 118860agccaggatc atgccactgc acaccagcct ggatgacaga gtgagacaga
atgacacact 118920gtctcaaaat aaataaataa atgacagcag atcatcattt ttctttctgc
ctctagactg 118980caatgcctat ttctccaggt agtcactagg ataaaagtaa aaataatatt
atcagcattt 119040accaaataca gggtcagcta ctctgttatg ttctttcatg ctttgtttct
tttaagcctc 119100aaacaactct atgagctggg aacaagtatc gtccttcttc ctcccatctt
atttatttat 119160ttatctatgt atttatctat ctattcattt atttatttat tttgagacaa
ggtccttcta 119220agtcaccagg gatggcctca aacttgagct aggaactagt gtcaccaccc
cccaatttct 119280tttattgatt gattgattga ttgattgact ggttaatttt gaggcagggg
tctcgctaag 119340tcgacagggc tggtcttgaa ctcctagtct taagcaatcc gcccgcctca
gcctcccaaa 119400ttgctgggat taccaacacg agccaccatg cccagcccct cccatcttct
gaataggaaa 119460actggggttt gaaaaggtaa gcgacttgcc caaggtcccc tccctagcta
gagagcttca 119520gagccagggc acaaacccat caaagcctgt gctctcgccc attgagccac
cggacctcgt 119580acactaaccg ccaagtgttc tacacagtga aggtgacaaa gaggtgaagg
gaagagccag 119640ggaggttctg tggactcact cggtgggtat gcccagaggg aagggggatc
ttgggtggca 119700cattgagagt agctgcgctg ttagtaagtg agaactcgga agtccagact
catccagtct 119760gtgccaataa acccctcctt ctacctggtc tcctttccaa agccagctgt
tctccagaca 119820atggggtggg cgggggcggg tgtcctcctc cttctcaggg aaaatccgac
gctgagccca 119880tctccagaga tcttggcttc ccgtggggct gcagatccac ctagagccac
cagagggcgg 119940gccagcactg cggccaaggc ttgaagaccc agcacaccaa agcccggcca
agcctccagc 120000ccagtgtcca agagtccagc cagaggccga gtcctcgatc tcaaaatgtc
taactgcaga 120060agcccaactc atgttcaggc atgatgtgtc tgattctact gggacaatca
ttgccaccaa 120120agaattactg ccaaaatagt aacgacatta gctacctacc acccctccac
acaccaacac 120180acctcatttt accaagcact ttctcatgcc tggggtgcct ggagacttaa
tgcagcctcg 120240cgatgaccgg gtagcctcac cgtacagatg aggaaactga ggcacaagga
aggggagtac 120300attgtctagg gtcacctgga gaactctgat ctccagactc aaatttccaa
ttcgtccccc 120360ctccccccaa ccctaaccgg agctaggtgg ggtggggaca gcaaatgtgg
atggggggag 120420gtaagagggg tcagagtgct ctacagagaa gaccaaatgc attgtggcac
ctactgtaaa 120480atgagaccag ccagccaccc acccaccagc cagccaccta aaagtcttca
gtgggcacct 120540gctggaaaca cgacaatgga tgacacgagt tccctgccct caaaaagctg
gtagtctagt 120600tgggggtgga gggggtgagt cagcagataa ttatgggaaa ccgtgacacc
tgtataaggg 120660gcggggatga gcagaggggc tgcgactgcc tggagccagg gattcccgga
cggggcttcc 120720ctttcctcgc agctcgtccc aggaggagga gctccccccc agcttcgggg
ttccgcctgc 120780cttgggggcc cggggtcccc tcccacccct ccccgaagag cgcgggcccc
gggaaccgat 120840gacagcacac ctgagtcagc ccgccgccca cccgcccctc agcgtctgtc
tccgcatctt 120900gtgatatttc gctccccggg agccagcccc actgcgctcc ggaggcagct
cggcaaacaa 120960acccagcgac agattgtgcc gcggctcatt ccggggaagg acgccaaacc
ccaccctgct 121020acccccaaca ctccctcccc gccgccgcct ccaggccctc cccccaggcg
caggccctag 121080tcggggtggg tcctggggaa acgcagggtc ctgtcctgcc tcctggaaat
agggggagcc 121140ctgggtgagg gaagacggga gccccagaga cttttctttc tgtttctacc
tgatccgaaa 121200acgagagggg cgggaaagga aatttagggg cacagagagg agctgggggg
ccgagaaggt 121260ccgaaaatgg aaccagcagg gggcacccga gagccgaggt gcccacgggc
cgggagcctg 121320ggaatgaaac tggggaagag ggggagagaa agggaggcag agacaccgag
acacacagag 121380acgagagaca gagacgcagg gagccccgcg gggaggagga gagagacgaa
gacacagaga 121440gacagtgaga aagacagaag accgggcagg gaaacagacg agtagagaca
gaaaaggtcc 121500gagagagagt gagggaggga gggaacagag agacagagac cacgaaatat
gagtaagagt 121560cgggggagaa aaccagagaa atcgaatgag aacgcgagaa gaacgagaga
ccgtggaggg 121620agcagagaat gaatgggaag aataagacca acatttatca agagccgact
gtatgccagg 121680cactgcattg gaccctggca cggataggaa aggaggagcc gcggcgcggg
cagcggggcg 121740aggggcttct gtgctcgcgg gagcggcagc ccagggggct cagcagcccc
ggcaccgccg 121800cacctgcggc tccagcagcc ccaaccccgc cagcgctgcc tggccaccgt
acccgaagcg 121860gctcccccga gggccccgag cctatcctac gccggggcgg ctccgcggac
gcgccgggcc 121920gagtcaagac ctgggtcaac cgccctgcag cctttgtagg gaagtgccta
ggtgatgggt 121980ctgctgatac cgcctgtgac caggccatga agggccagag gggctccagt
gagaccataa 122040tccgcccctc tttaaaaggg ggtagaggaa gttcacgcga agccaacagt
cttctcccca 122100gctttgggtc ctctcctgca cccccgcggg agataaggtc tcccctcccg
gacacatcat 122160acatacacaa aaaaacgcac acactcgcac gcgcgcccat ctcgcacccg
cttgtaaatg 122220cactaagggg catacacaca ccgggcacat atttctttcc acccatcccc
aagatcgcaa 122280gcgcaaaacc tcgcacagcc tcacgtttcc caccagctca gacatgcacg
ctggcggact 122340ttcagcggct cacccgtgtg cacactcacg tgcccccccc cccgcttccc
caagcccgta 122400caaagggtaa cgggcaagca tcctgagtca cacctgcaca agcatccttg
cgcgcacgtg 122460cacgctcata tgcactcgat cttgcacgca caaactcttg catatactat
tcttatagtc 122520gcacactggg cttgaggtct gggagtggaa ggaaaagtgg aatcttggag
ctgtcccagg 122580ggacagaaat gctggaggct gggacactgg cgcgagggac gcggctgggg
gcgggggagg 122640gggtgaccca gaagctcatc ttctcctgga aagttgggag gggggaacag
gacaagtcca 122700cggcgttcct ctaaactacc gcattccccc aagaagggat ttctctagaa
gagtggcgcc 122760gcgaggacga tcgaacacag tcctccgggt cgcttaagcg ggggggaggg
gggcggggtg 122820gagggggtta gaaagccgct cccgcctcct agtggtcgag aaagggttaa
gtcggcaagc 122880cagcaaacga gggaggagcc agcgagtgcg ggaaggagtg ggggtggttg
ggaagagctt 122940cctcgctgtc cccactctcc ctcggctagc agcctgggca cacggacaga
cggactgacg 123000gactctcgag cggacagcgc agctagcggg gcgcgggcgc tgggcgtcga
cggccagccc 123060cagccttccc cgccccgtcg cgccccgccc cgtcccgtcg gggccgatgg
ctcctcccga 123120ggcccgcagc ccgggcggcg cagggtagag cgccgcggcc cggccacgca
gcccggggac 123180tcccgggccc tcccggagcc ccgcggggtc cccgccgtgc atccggcggg
ctcagggagc 123240gagtgggagc gccctccccc cgctgccccc tcccccgagc atcgagacaa
gatgctgccc 123300gggctcaggc gcctgctgca aggtaagaac gccagcggcg ggagagcgga
gggcatcctg 123360gggagagaag cagggcgtcc cctctttcag ggattgaggg tggggcagtt
ggggaggtgg 123420ggtaacctgg ggaaggggaa aagctcagcg ctggggccgc gcccccgccg
ccagggctgt 123480tctcagcagg agggcacttg gctgggagcc cgcgggcgcg tgcgaggagc
tcgtgaccga 123540ggtgggacgc agggggcagg tggacccggc ccggagcggg gagggaggct
caggttccgc 123600tgtccccgct ccacctgctc cgggggacgc tgaggactcg ggccggctgg
ggaagcgccg 123660actcagcaac tcctcctgcc cggtgcctca gcactttctg gccacctggg
aagacaggag 123720atgtgggtag ggggctgtct ggggaggtag gaggcgcaga gggaaatcca
agtggccctc 123780tctggtagga gagatggagg gcgctagaaa gaggatagtt ctactgattg
agtgacagat 123840aagggtgtgg gccagagact gggggtgggg tggggagggg tcagggggag
agggatagga 123900aggagaactc aaagatggag aaagtggtga gggaagctca aaggaggagg
gagatggagc 123960gggggagggg gagaaggaat aaaggttaga tgggaaaagc gtggagggaa
gtgggaccca 124020ggtgaagacc aaggaagagg gaaggagagg aaagaccaga tcaggggagg
gatgggaaga 124080agactatgga cagggaccca gaatcctggg atggaggtag cgggaaagag
aatcaggact 124140gggaccctgg ggactggaat ggaaaaggag aatggaaaga tcagaaacca
gagaaggatg 124200gggatggtga ctagagaagg ggtatcagga accggcgaag agggttggag
acagggaacc 124260atggatggga gaggggctgg agaggaggga agaggaggag gaagagaaag
gctgagagag 124320agggactggg gattgggggt gctgcccagg gatgagacaa agaggcttct
ggtaaccact 124380tccacgtggg aagccctcca ttcccaaagc gcctgcctgc cacatttctt
ctctcaggga 124440gtggctggtg ggccagatgg ggggtgcttt gagctcaggg ccctgggggt
ggctgtgagg 124500gacagagggt gaggactttg gaaggggagt gacagcctcc gagggtgggc
aaacaggctg 124560gctcctgtgc tgccatttat ttatccggcc cggacgttgg attctgcagc
cgctgccgcc 124620accacggtgg ctgcttattt tggggtgtta cattctggca gagtgagaag
ctgtttgcag 124680cagctctaaa cctccgtcac ccgcgtcagt gcctccccag gcccctgcgt
cactggcatc 124740accaccacct ccatcccact cctcagctcc cacctcctca gcccctgccc
cctcagcatc 124800tgcccgcagg ccccagccct tccctgaagc agcccgttgg gtgtggagcc
cttgcttctc 124860gtctgggacc ctgtgcccct ccttccagag cgagaggcct ctgctgcctt
tccagggagc 124920atcctttcct gggaccactc tgcaccagcg actctgccct gtgggtgggt
agcctggatc 124980ctgcccccta ctttgggtcc agttttcttc tcctcaagtt ccttcttcta
caggggcctc 125040cggcccaaag agtggcctgt gggctgagaa ctttgtttct gagccttggt
actccaaggt 125100ttgatagcca gagtcctgga cagtggtccc tcagtgaaca gatacttttg
gctctggaca 125160cttcagcctt ccgggatcaa taccatgttc tggcctctct tggctccctc
ccctggtcag 125220ttctggccat atattctgga caggggtcat ctcttcttga ctcccacatg
taatcactac 125280tctagaacaa ccgcaactgg aagcctagga ggtgaaagtt gcagagagag
ctggagtccc 125340ttccttgcct tgaccctgaa tagccaaaca gactcagcat tgtggctggc
ccagccctag 125400gcacctgggt gcaatttctc tcctgtcttt acctcaaggg cagtgtctca
cacattcagg 125460cgtggtttct gcggaggatg tggccacctc ttaaagaaag atcagagtgt
ctctctgaca 125520tgggcttgat gtccctcttt tccaatctgg gttccacctt gtactagctg
catgacctga 125580ggccactgtg tcatgtttct ggggctccct tccttcatct gcaaattggg
ggccacaata 125640ttgacctcca ggggattatg tgtgttgtgt tcaatgtata aagaagttaa
cctgtacaaa 125700tgcagtgcct aggacaaaat aggtgcttct tggtttcctc ctaccctgct
gtactctccc 125760ctgcagctct agccatcccc tgctgacttt agaggagggg gtgagcagag
agggtggggg 125820aggctgctac aaagggcttt cctctgtcca tgaagtagtg gagggatgaa
atgaaggctt 125880ctgagaaaga caatgaaggc gagctgtaga gacctggtca ggaggcctgg
ggtgctcaga 125940aactcacact tcccctcccc agccctcaat ggtgttacct atgatgtgag
gggtcggctc 126000taggtggcca ccgaggtatc cccctttcca gctctgatac tctgtgcatc
ttgccccagt 126060ctccaccggg aattcacaaa atgaaggcca ggagtggagc cgtggtcctc
gggagagaca 126120ggaggcctgg gcctggaggg aaggagtggt ggtgctgagg aggagtgaga
acagggggtg 126180gggaagggac gtggcaagaa agaaaagggc acacactggg cagggcaggg
actgagggcg 126240ggggagagag ggaaaggcac agctctctag tcccccaacc ccccagtccc
accacctctg 126300ccctggagtg ctcgctccag ccccagcagg cctggggcag tgaagcccag
agccccctcc 126360cctcccctcc tccttgcctc cagtgagagc cgctgcgtga attatggatg
agctccttgg 126420gttacagctg ctttgcacgg cagtggcaag ggccagaaat ggcaacagag
tcactgttat 126480gcagcagctg ttatggagga gcccccagca ccgggtcgct cttcagagag
cctgcaggga 126540ccactatcat gggctggggg aggtgagccc tggttggggg agacatggga
acaagatgga 126600aggagagtgg ggaaagagaa gagaagtagt ctaatgtggg caggtgggga
gcaggagagt 126660ctagggagag aaagaggagt aggcaccctt gccagctcct gcagagttta
ccctcaaggc 126720cggaaggaac cctgatgcca ggggaatggg ccttgcctct gagattgcac
atccttccct 126780ctgtctctcc tggggcagcg gtcagtccgg aggctggggg aaagctctgt
aatcctccag 126840gggctagcgg ccatcagggc tcacactctg gtgagcttgt ggataagggg
taggattaag 126900ggatcagaga aggatttggc ttcttttggt gtcaagtcct tagggaagtg
gagatcagag 126960ggtgactctg acaggaaggg aagtgccctg gctgggcatc aagagacttt
tctggccctt 127020tccctgccaa cactttgctg tgtgaccttg ggtaagtcgc ttgctctctc
tgagctccag 127080tcatcacctc agtagaactg atgcttgaac cagaggaatc gaggggacct
ttgcggcttt 127140gaaatctcca gttctaagcc ccaaacctca accctcatga aacccactca
gggtccccac 127200tgtgcttcca cactccacct ctgcctggtt cagatgaggg gtaagagaca
ttgctcctcc 127260accccacgtg ggtctaagaa actcgggagg agaaagtaat cgtgaaacgc
cgcacggggg 127320aggggtgaga agggccgaga aacgcggagg tggtgtgaac gaatggaaca
gcagccgctg 127380tgtcactgag tattacatca cacccagcct acacacgcac ggggcccggc
gctcacacac 127440acgcggagga cagccagcac gcaccgacgc agcaccgacg cagcgccagg
aggggccggg 127500gacactcacg gtggggccca aaagcgagga gcagcacact gggagtgtgg
atcttccacc 127560ccgcacctgt gtgctccccc ctctggagga ggaacaccag ggcagctggg
atgccagcgc 127620cacactcggg gcctgtcagt cccatgcgtg cacacctggc tgagcagcac
tgcatttggt 127680gagcacctgg ctcacgccac tacccaaaat cacagataca tacacacatt
cacgcacacg 127740gcaacctcag gagcgtgaca caacacacac aaaaccacca ctaagcaagt
gcaatttgca 127800gccttggaga ccccacactc aaaatcacca acccctcagt ctctcccagg
gtctctgaac 127860cccaaggagc cccaggatgt cagagtgcag aaacaagtct tcctcccctc
tgccttcaaa 127920agcctaggac gttgcttgaa gcagaaggtg ttcagtcact gtgtgcccag
ggaatgactg 127980cctggctttg ggggtgcagg ctcccttttt ccccaggcaa aactgccaga
agaaaatccc 128040aggagtcacc tggaaatcat aagaaagtgt agaggtcaag ctagttccgg
cctagaactt 128100tatcagctat agtgacggca aaggccaggg atgatgggag gccctgcacc
cctattaaaa 128160tatgagtaca gacacctgca ctccactctc tagcccccag gctctctggg
cctgcttttc 128220catcagtatc ataataagga tggatcatat ccaaccttca aaagttactt
tgggggaaaa 128280aaaaaaaaaa gctttggctg gatgcggtag cttatgcctg aaatcccaac
actttgggag 128340gccaaggtgg gaggattgtt tgaggccagg agtttgagac cagactgagc
aacatagcaa 128400gaccccatgc ctacaatttt tttttttttt tttttttttt tttgatacag
agtctcgctg 128460tgtcacccag gctggagtgc agtggtgcga tctcggctca ctgcaagctc
cgcctcccgg 128520gttcacacca ttatcgtgtc tcagcctccc aagtagctgg gactacaggc
gcccgccacc 128580atgcccggct aaattttttt tttgtatttt tagtagagac ggggtttcac
cgtgttagcc 128640aggatggtct cgatctcctg acctcgtgat ccacccgcct cagactccca
aagtgctggg 128700attacaggcg tgagccaccg cgcccggcca aaaattttta aaaaattagc
tgggtgcagt 128760ggcacgggcc tgtggtccca gctcctcagg aagctgaggc aggaggattg
cttgagccca 128820agtgatccaa gctgcaataa gctgtgatcg taccactgca ctccagcctg
ggcgatggag 128880caagaccctg tctccaaaag aaaaaaagaa agaagttttt aagtaactgc
gaatgaggag 128940agcctggggt gtaaaatgca gattcccagg ctgtcccccc aggaattctg
catagttcct 129000aggactggct ggtggcctca cttagagacc cgacccttaa ggcccctccc
ggcacaaaga 129060ggctctgact ctgcaagggc gaaaagtaca ggaaagtaag ggcactgggc
accagtgggc 129120tggcaagacc agaccccaga gtgagtccat ttcacacggg cctcagatct
ccaaagggtc 129180ccaagttact tccagtcatt ctccaatggg gtgactttgc cccccagggg
acatttggca 129240atgtctggag acattttggt tgtcacaact ggaggcaggg tgctgctggc
atctagtggg 129300tagaagacag agatgctgct aaatgcctta tatagggctg cccccacaac
gaggaactat 129360ccggcccaac tgtcaatact gaggcagaga aaccctgacg ttagtctttt
gacattaatc 129420tctagacaag gtcaaacatg caatagtgaa aacaggaatg aagagatgat
cattcttcaa 129480ccaatttgca gtgctttcta caatggcctt ttggcattat tttttaatat
atgagaagcc 129540tcagaaagtg gaagtggcca ggccacttga ggctataacg ttgtcccctg
agcccccaga 129600catgggagca ccagggctct aggcctttat ttttattttc tattttttcc
cctgaaacag 129660ggtcttgttg tgttgcccag gctggagtgc aagggtgtga tcgttgctca
ctacagcctc 129720aaactcctgg gttcaagcga tcctcctgcc tcagcctccc aagtagttgg
gactacaggc 129780acatgccacc atgcctggct aatttttttt tttttcttgt aaagacaggg
atctccctta 129840tgttgcccag gatagtctca aactcctggc ctcaagcaat cctcctgcct
tggcctccca 129900aagtgctggg attacaggtg tgagccacca tattcagccg ggtctaggcc
ttttaccaag 129960ttggggggct ggcccccagc tggcactcct gccctggaag cccacctagt
aagttctgct 130020tcccctcccc acagctcccg cctcggcctg cctcctgctg atgctcctgg
ccctgcccct 130080ggcggccccc agctgcccca tgctctgcac ctgctactca tccccgccca
ccgtgagctg 130140ccaggccaac aacttctcct ctgtgccgct gtccctgcca cccagcactc
agcgactctt 130200cctgcagaac aacctcatcc gcacgctgcg gccaggcacc tttgggtcca
acctgctcac 130260cctgtggctc ttctccaaca acctctccac catctacccg ggcactttcc
gccacttgca 130320agccctggag gagctggacc tcggtgacaa ccggcacctg cgctcgctgg
agcccgacac 130380cttccagggc ctggagcggc tgcagtcgct gcatttgtac cgctgccagc
tcagcagcct 130440gcccggcaac atcttccgag gcctggtcag cctgcagtac ctctacctcc
aggagaacag 130500cctgctccac ctacaggtga gcctgccctg cccccaccct cagccccttt
ctggtttcct 130560ctctctgtgg gcccctctgc tccccgaccc tggcgtgcgt ccctcctctc
tccccaggcc 130620acccttcctg cctcagcatc tccatttctc tctgtctatg tctcttttct
ctcttacatt 130680ctccaggggc tttacttttt cccttctgcc tctctacctg tttaggtccc
ttgctgttcc 130740tctctctctc tctccctcta actccacaac cttcacctct ctgcctctgc
ctgtctgtct 130800gtctatccct ttccatccat cactgcctct ctcactaact tgcctccccc
atctgtcttc 130860tgcctcttct gtctgtctcc cttcacacac ccactccgca tacaccccca
tgtctgtctg 130920cgtgtgtgta tctgtctctt tctgtgatct cacgtgtttg ccttcagggc
actctgcctt 130980cccccagggt cccctgccca aaggcctttg cagctgtttt tctcacccac
cctcaagtct 131040gcccacatca cggtgaagta gagagagaag gcagagccac agccactggc
atcccacaga 131100aagttgcgct tctctccaat tcactgggca atgggacggg agaagcccac
accccttcta 131160gattcccatt ttccaaacct gtcatctcaa tgcaggggaa gaaagaaaag
ggtaaatctc 131220tgttatgcag ctggagaatg gatgctctga aaatggaagg aataccagta
attgttattc 131280attgttatta ttattgatct aattattgtt tattgttgtt atgctgactg
tttgacacgc 131340aaatcatccc actccatttc cccaggaagc aataacacac cctccaaacc
accctgagag 131400aaaatcttcc cttggctaca gagcctccgg ctggaagggg gtgaaaatat
ccaaattctg 131460ccctctccct acttgaacct ggaacgtgct tcctctgcct catccagggc
tagtgcctaa 131520ctagttatca atctgctagt tggaaaatca ggtcagtgct gatgatgcta
atgataataa 131580caatagccat aacaacctaa caaacatact gagcacccac tacgagctag
atgctaagaa 131640tacagtagtg aacagaacag accaaacccc ctgccttcac agagatacca
ttcccatgag 131700gagggaaaga agtaaaatgc acggtatatt ggaaaaatat gtcttatatt
attcttattg 131760ttgcctaaat agtgacagta atagcagtag ccgccaccac ttagtgggta
cacagggtca 131820gccacagtgc caagcacttt ataggtatcc actctgccat ttacaagcgt
gtgacatttt 131880ttttttttac ctcctcagac ctcagttttc tcatctgtac aatggggtag
caagagcacc 131940catctcctag ggattttgaa agcattaaat gcatgaataa tttgtaaagc
acttagaata 132000gtgcttggca tacggtaagt gctatataaa tgcttgttaa aatactattt
taaaaaaaga 132060aacgagcctt atttaacatt ggtttcagtg aagtggccca acttggactc
catcctgaag 132120atgtgggtca acttcaagga ttatactaag gtcatgagtg agtcccagaa
attgcacctc 132180acagtttatg aagtgcactc agccacctca tctcatttct acagcccagt
tgggagatta 132240ttttcacctc cttgttaaca atggagaagc tgaggctggg ggccctgaag
accctataga 132300gatatagtca cctccaatca taaatctttt caaccattgt cggtgtgacc
ggaggcttat 132360gtcttctcac catcatgttg agcctcacaa caacctggtg atagggacag
ttaggggcac 132420tagggacatg gaatgaatgt tcctgaggcc acacacccag gaagagctgg
cgcttgaacc 132480tcatggtctg gctacaaggg gacagtactc tggagtacaa ttgagcaggc
tcatttttga 132540aagcacacag tttggactca gcaagaccta ggttcaaatc ctggctccta
tatatatgac 132600tttggacaaa ttacttaacc tctctcagtc tccatttcct catctctaaa
atggcaatca 132660ggatagtact taataataat cttttttttt tgagacgacg tcccactcta
tcgcccaggc 132720tggagtgcag tagtgcgatc tcggctcact gcaacctctg cctcccaggc
tcaagtgatt 132780ttcctgcctc agcctcctga gtaactaaga ttacaggcat gtgtcactac
acccagctat 132840tttttgtatt tttagtagag aagggtttca ccatgttggc caggctggtc
ttgaactcct 132900gacctcaggt gatccactca cctcggcctc ccaaagtgct gggattacag
gtgtgagcca 132960ccatgcccag ccaataataa tccttattta agaagttttg taaggattaa
aatgtaaggc 133020atttagcaca aggattaaaa tgtaaggcat ttagcacata tgggcactat
aataataatt 133080actactacta ctactactaa tactgagatc aaatactact acaaattgat
catgcattta 133140atgctttcaa aatctcctta tcaatatata ttagttattt aggaggaatt
tggagtcaga 133200gggcctgagc ttgaatcccc gatctactat tttctgactt atttaacttt
aagcaggttg 133260ctaaccctct ctgaacctca cttactttat ctgcaaactg ggaataatga
aaataatacc 133320ttccaccaag aatggctgta aataggaaac gagttagtgt atagaaagcc
catagttcag 133380gctggtgtgg tggcccatgt ctgcaatccc agcacttcgg gaggccaagg
tgggtggatc 133440acttgaagtc aggagttcga gaccagcctg gccaatatgg tgaaaccctg
tctctactaa 133500aaatacaaaa attaggcagg cggggtggca ggtgtctgta atcccagcca
ctagggaggc 133560taaggcagga gaatcacttg aacctgggag gtggaggttg cagtgagctg
agatcgtgct 133620actatactcc agcctgggtg acagagcaag actctgtctc aaaaaaggaa
aaaaaaaaaa 133680aaaagcccat agttcagtgc tgaagaaatc atgttattat gaccccatcc
tccattgact 133740ctcaggccaa caacagcaat caggacctga ggtcagcaaa ggcttgggca
gaggggacct 133800caggtggaca ttggggtctt ctgaaatggg aagtgtttgt tctctacgcc
cctggcatga 133860atggtaccag gcatcatggg aaggaagcaa cttcacacct ggccttttat
agaggagatg 133920gaaaacacag cctctgcctg tgaactgcct ggtagggctg ggctgggaga
tgccacaggc 133980aggtgaggaa acatgggctg gggtgagatc cgcagggtgc aggtgtgacc
caagatggag 134040ccaggcctgc cccaaagggg agctttggag gaaactccac cagaggacca
cagcttttca 134100gaatggggaa gggccaggca ctgtgccagg tgagttcatt catcaacaga
tatttactga 134160gtatctacca catgccaggc aatgttccag gtgccaggga ttcaggagag
aacagaaaca 134220gtggccctgt tctcccagag catattccct actcaagtgt agccagatga
taaagacact 134280tgttttcttt cttttttttt tttgagacga agtctcgctc tcttgctcag
gctggagtgc 134340agtggcacga tctcggctca ctgcaacctc tgcctcccag gttcaagcga
ttctcctgcc 134400tcagcctccc aagtagctgg gattacaggc atgtgctacc atgcctggct
aatttttgta 134460tttttagtag agacggggtt tcaccatgtc ggccaggctg gtcttgaact
cctgaccaca 134520ggtgatctgc ccaccttggc ctcccaaagt gttggattac aggtgtgagc
caccgcaccc 134580gccgacactt gttttctctt tcagtcatta cagtggcctg catggttttt
gtttgttttg 134640ttttgttttg tttttgtttt tgagacagtc tcactatgtc acccagctgg
agtgcagtgg 134700cgcaatcttg gctcactgca gcctcacctc ctggggtcaa acaattcccc
catcttagcc 134760tccccagtag ctggaactac agacatgtgc caccatgtcc agctaatttt
tctattttat 134820agagacgggg tttcaccatg ttgcccaggc tggtctcaaa ctcctgaact
taagcaatcc 134880acccgcctcg gcctcccaaa gtgctgggat tacaggcatg agccaccgta
cacagctggc 134940ctgaatggtt taaaaatagt ctttatgctc aagcagatca gatctcagtt
tgaattccag 135000ccacacctct aatttgctct atggctttgt gcaagttatt taaccactct
gagcctcgat 135060ggacccatct gtgaaatggg gataacctgt accttggcga gcaggggttg
tgaggattaa 135120aggagatact actgagctca cagcccaatg tctggtacaa agtgagtatc
caatgaatgg 135180tagctatcca ttaacaccag ggaggacacc aactgaagct cagcaaaata
aaagcacagt 135240ccaaggtcac ccagctagta aggaacatga cctagaattg gcccaggtct
gtctgactcc 135300agagtgcagt tgttcagagg tctctggagt tggaagccac gttccactgc
atattagctg 135360ttggacccta ggcgagtcac ttcacttctc tgaggctcca tctcgtaatc
tctgaaatgg 135420agataataat agtatccacc tcatagggtt gtgacaatta agttactata
taggatctgt 135480gtagcacaga gcttggcaca tggtaagagc tcaatcagtt acctgcttga
caatgctgac 135540gccgatgatg acgatgatac ccatcctaga ctgatgagct ctgtaagcgg
gggtgcctgg 135600cacagagtag acactcggta cagctctgtg gaatgaatga ggcacatccc
agaactcacc 135660aattcataaa aatcagatgc agatgggatc ttaaagatca cctatcctaa
gtcccttgtt 135720tcacagatga aaagacccag gcccagagag gtgcttggag ctgcgcaagg
tcacacagcc 135780aagcagctca tttgattagt gtcagagcca agagctggga gtttggaggg
aggcaaggtt 135840aagaacagga tgctgtcagg gaagcaggca gggatgctgt gttaagattc
caaatggatg 135900cagagagctg tgaaccggcc agtggggagg caagggaaat gtggtttttg
aaatggaaga 135960ggatgacttt agcagaggct ctcagcccag agggagggga gatagggagg
ggagataggg 136020aggggcgggg ggagggctag ggctgtgaaa gtcaagagct tattaatgca
tagagaacgg 136080ttttaacagt ggagagagga aggaccggat ttgaaagcta cattcaagga
agtggcaacg 136140ggatttggca acagcttgga tggggggagg aggcaatgga ccccaaggca
gaggctcaga 136200gaagggaggg gcaggacttt ttgcagagaa acaaaaggag aggagaggag
gttagaatca 136260agaaattctg tgggccaaaa cctggggctg tgggtcaaag gcacctgaat
tccctaggat 136320ctctggaact ttggtctact cttctgacct cccgaggtcc cccaaaatgt
ggattacccc 136380tgctcactct cccccaaccc ccggcccctt atcgatcctc tgaccataca
tctctgggtg 136440tgtcctactc ttgctgacac ttcataaaaa gaggaacccc atttaggtgt
tttgagtggc 136500agggattcca agcctacccc ctggatgggc ctggaagaga acaagagcac
caggccatgg 136560tgagtcaggc tgaggccagg gaggtgcaag gagccagctg gaggcctgag
ccaggatttg 136620gggtggtggc agcagggggc ggagaatggt ggtgtcagag gcagccgaga
aggttgaggg 136680ggacggatct caatgtggcc aagaggaggg ctcttggcac gctcagttcc
tgtagcgaag 136740agggcggaag ccagatggga gggggcgaga acaggcagga gcacaggaag
gtggaggctg 136800tgggtgtagg ctgggagtca atgccctccc ccaacctgag gcctccgacc
aggctcctgg 136860gtggcaggca tggggaggaa agcgtctccc caggcagtga gggagggaga
gccacagtca 136920gggaacaggc cccctgggtg aactggcctg agcagagtgg atgctcctgt
tctgagaccc 136980agacctcctg gaacctgctg accacagtga tgccctgcac aagaggggag
gacctcaagg 137040cagtgaggtc agggagctga agtcctgctt ccctctctgg caagccctta
tctctttgag 137100ccccagtgct ctcctctaaa aaagtgagct gggctgatgg gtgccaaggc
attagctccc 137160aagtcagctg atcatcagaa tcccctggtg agctggttat aatgcagagt
ccaggaatcc 137220ccactggccg tgggccacac acacccgccg ccccccgctg ttaattctga
accatagttc 137280caaggtcctt tctgcactaa tgtggcctga ttaggtgact ccctagcacc
aggcaggtgg 137340gacagcgcct ctaaggggag tagtaatgca atgtggcttc cttcctctcc
tcccctgccg 137400cctctggggg tggagctgat gcccctcacc ccaataccca gcctagtagc
agtactttgg 137460ttcccccagg gagctcctct tttaaagaaa agggacagga cccaattgtt
actgagcccc 137520tattgtcata gtagccacca tttattgatg gttgactatg cacctgccag
atactgtacc 137580cttaacagca tttatcatcc aaccctcctt tagcctgctg agggggttat
acataataag 137640gaatattgta catactgagg aacctgagac tccatgaggt taaaacttgc
ctaaaataac 137700acagctaggg aaaaggcaag ctggattttg aactagggct ctaagtgctg
agcctgtggg 137760cttcataatt ggaccaaatc cctgtgtgct gggcacgtgt ccagcacttc
cctcatatga 137820tctttatgtg aaccatcctc tggaatcctc agaacaaacc caggaagtag
gtatactcat 137880ccccatttta cagatgagga aacaggcaca gagagatgac tggcttggcc
aagttaagaa 137940taatggctaa caaacaaaaa caaaaacaaa aattaaaaaa aaaaaaagaa
taatggctaa 138000ctcatggaac tcatagaact ccacaaggaa aggtgttcta agcaccttca
tacatgctgc 138060ttcatttaat ctctacatta tacagatgag gaaactgagt cacagatatc
ctgagtgact 138120tgcccacggt ggcatcagtt aatgacagat ccaagatttg aaatcagaaa
ggctggctcc 138180ccagtctcca tacttcacca aaccagaagt tctgaaactc aaactgtggt
cctgccaatg 138240gccacactgg cttccctggg gaacctgtag acatggggat tcccaggctc
caccccaaac 138300ctcctgaatt agaaactctg ccccccgccc caccccgctc agagatccgc
aggggatcct 138360aatacacccg aaagtttagg aaccactgac ctcaccaata ccactttttc
cacagcaaat 138420aggttagagg aggcagaatc caaatccagg atgctatgaa tcaaaaggtc
aaccctttct 138480cttctgccac ggtgcacccc cttccctccc ccggccaagg ccccagcggg
gtctgcaccc 138540tgcctcaggc ccattctctt cttctgtgcc ccactccacc ccacccagga
tgacttgttc 138600gcggacctgg ccaacctgag ccacctcttc ctccacggga accgcctgcg
gctgctcaca 138660gagcacgtgt ttcgcggcct gggcagcctg gaccggctgc tgctgcacgg
gaaccggctg 138720cagggcgtgc accgcgcggc cttccgcggc ctcagccgcc tcaccatcct
ctacctgttc 138780aacaacagcc tggcctcgct gcccggcgag gcgctcgccg acctgccctc
gctcgagttc 138840ctgcggctca acgctaaccc ctgggcgtgc gactgccgcg cgcggccgct
ctgggcctgg 138900ttccagcgcg cgcgcgtgtc cagctccgac gtgacctgcg ccaccccccc
ggagcgccag 138960ggccgagacc tgcgcgcgct ccgcgaggcc gacttccagg cgtgtccgcc
cgcggcaccc 139020acgcggccgg gcagccgcgc ccgcggcaac agctcctcca accacctgta
cggggtggcc 139080gaggccgggg cgcccccagc cgatccctcc accctctacc gagatctgcc
tgccgaagac 139140tcgcgggggc gccagggcgg ggacgcgcct actgaggacg actactgggg
gggctacggg 139200ggtgaggacc agcgagggga gcagatgtgc cccggcgctg cctgccaggc
gcccccggac 139260tcccgaggcc ctgcgctctc ggccgggctc cccagccctc tgctttgcct
cctgctcctg 139320gtgccccacc acctctgact gcggtgctga gatcgaagag gccagtgtcc
gatccccgct 139380tcccgtccac ccggggctgc ggctccggcc ccagtcgccc caccttccct
ggccttgctg 139440cctccctttc ccctcccagc tcctctcctc cccggggagc aggccgcctc
tccttgcctg 139500ccccctgggc tgtcctgact tgtggcagcc ccaagagggc gtgtgtggtg
gctcagccct 139560gccctcccca gttctggcca ttaactcttc cccatcccaa ggctggggtg
gggcccccca 139620ggcagccgct gacccgcact cctaagggcc cacagcggac accagagggg
cttttgtctg 139680cagagcgtct tccaccagca gagcctttgg aagctccccc agggagcccc
acccaggacc 139740ctttggggga tgcctcagtc agggccaggc tgaccctgac ccctgcttac
cctagtcccc 139800tcaacctcct gacactggag gaatactttt ctcctaagtc taccctggac
actttttagg 139860gcacctggag agaactttcc tctccactgt ggcccctgcg tggtgaagat
caaaagaagt 139920tgtttgggaa aaaaaattta ttaaaaaatt ctattatttt atctactgta
agatttgttg 139980acttgggacc ccgaaagcgg gatgaggtct cagaatgtaa ggattgcagg
gccaggaggg 140040ttggagaagg ggagccgtcc cccgccatca aagagcttcc tggtggctgg
aggtggtgtg 140100cgctcccccg ccatgaggag gagctgaagc cctgcattct aggtgaggcg
cagtgtggca 140160gccaagagtg ggtgctggtg gcacctcttc tcttcatttg tccaggggaa
gagctgcagc 140220caaccctgag tggtctggcg cctgaggaac taagcctggg gaagacctgc
tgtctggtta 140280acagccctct tccagaccct gttccttcag gaaacaagag cagttctcct
gcaaggagga 140340gtcacataca cactcctggt cacagacagc cccaacatgg ctttgggtaa
atgtgaacaa 140400ggcactgctc cctcagggaa acacagcccc atgccagagc aaacacctta
gcaaacagag 140460accaaggctg ggtttccgcg tacacttgcc tccttggcta agtgcccttg
tgcagtgcac 140520agcgtacaca cctgcacaca gcaaccctgt gggtatgtgg tctctctctc
agctcctgtg 140580aggtagaagc catcagggat gaaccaggtc agagaagcag gtttccaaac
aggctagaag 140640agggaccgag gaactcgggt gatcagaggg acaggaatcc caaattggga
tgcattactg 140700gcttgaggta caatcagaac cttcatcttt ctggtgtgtg gaagagaggc
tggggactgg 140760gaagagctca ggctaagaag gacttgggtt gggatttagg ggtgagtctc
atcagactga 140820gcacttggag agaagtttgg tagtttgaat ttggagctaa gaatctagct
tgggcagggt 140880gtggtcgctt gcacctgtaa tcccagctaa ttgggaggct gacgtgggag
gatcacttga 140940ggccaagaat ttgagactag cctggacaac atatcgagac tgagtctctt
aaaaatgttt 141000ttttaagaat ctagtttgga gtggggtgtg atgtctcaac gtctgtaatc
ccagcactct 141060gggaggctga ggtggacaga tcacttgagg tcaggagttc aagaccagcc
tggccaacat 141120ggcagaaacc ccgtctctac taaaaattca aaaaaattag ccaggcgtga
cggcgggtgc 141180ctatagtccc aggtactcag gaggctgagg cacaagaatc actccagcct
gggtgacaga 141240gactctgtct aaaaaaaaaa aaaatctagc ttgggaggtg ggaatagaaa
gatagagggg 141300gcctagatgc tagggcttga ggaagcaggc tgaggttctg tgattctggc
tagggaggtc 141360aaatgatctt gagaagaaga gaagaaagga gaagaaatca gcatctaagc
ctgaggcagg 141420tagactccgg ttaagggtgt ggggtgggct gggggagagt gagagcagct
ggtcagaaac 141480ccagggagct cggagtctgg ggtcttgcag gggcttgtgt caggctggct
gtgaggaggt 141540taatgggttg gattggaggg acagccagac aagagctctg gtggaggagg
ggctgctggg 141600gcctgggcag ggggagggga gctgctggta aattagaggc aggctgtcca
ggtcatagaa 141660ttatcattgt gaaatattca tgggccatcg gtccagatgc tatttcagaa
cagtgaaagc 141720aagaggagtg tgtgagcctc aggaagaagc ctgaagcaaa gccactctcc
accaaccccc 141780acccctccca ccaccagccc agacagaccc acggacgccc atcacgtgca
cacccacact 141840cccgagctct cacacacact cgcaccaagc agagccatgt agcacgtgca
agcacaccaa 141900ccacccacgg gtcccacaaa caggcaggtg tcccctaaat tctgacatgc
acactgacat 141960gcacacccac tcaatcagga cccagcagag atcacctcca gcgatctcac
atgcgcagac 142020ccccaaactc tccaaacaac ccagattcac caccttgacc cacacaccct
gagataggag 142080ggatgttcaa ggccatccag cccaaccccc accaatgctc tgatggggaa
actgaggcca 142140tagaaaggaa gggatttgtc tgagattcct ctatcccctg aaaaaagcaa
aattcattca 142200cctcccacat tctgagtgta cccccattct gcattttcgt ctgccagaca
cccagcctag 142260ttgtaattaa ctcctccctt tctctaattt cctgcatcta ttcagttacc
cagtccccca 142320cccagccaca gtctatccct tccttcccat tctccccacc acctccctgc
tccagctact 142380cattacctca tgcctggaat ataaaagaaa actgcgataa cctcctcgct
ggtttcctac 142440atggaatctc tccctccctc ccacccagcc ataccgtggt gaccagattc
atctgatcaa 142500aatttgcata tgttatgatg tcactcagga gcctgtaatg gcttcctaat
gcctataggg 142560taaaggtaaa acaccttagc agagcatcaa agatccctca gagtctggta
ccaactgctt 142620ttctagcctt ttctctcaca atctcatccc aaaccttcac tccagctaga
acgtttgtat 142680catactggcc accagttatc atgtatgtga aacccaccaa ccgactttga
gtgcccccct 142740aaaatttctc agtctctcct gaagtaggaa acctcttccc cctcctcaga
tctcagactc 142800cagagccctt tcccaaggcc aagactgcac ctctctgacc atatacaggg
gttcttcaaa 142860gcagcagaca gaggctcagg ctctggctcc ctccaagcag acggctgccc
ccgactggcc 142920accttgggaa gcacagccag gtttcagtcg tctagaacag agaatgagca
tctaaccgcc 142980tggggagagg actaggacac cagatgataa ggtttataag cccttaagcc
tctaaggttc 143040ttacacccag agtagggggg ggacggttct cagccctgtt tccctagctg
cgggctccca 143100attttcgatc cctaatccga gaggaactcc tctccaatga aatacagact
tgggactctc 143160aggacactgt ggaagggaaa tttcccaaca gactctgaga gtccaggagg
ccagggatag 143220accaggtggc aggcccaagg tccagctggg gtcaggtttc tatatgaatt
tttaatgctt 143280ccagatagac ttgtcagatg ttctgaaaac tgagcatctc ctttcacctc
tgtacatgat 143340gcccttctcc aaccccattg cccctgcagg agggcaggcc tgggacagat
attcagtggc 143400ctctggagaa acggttttgg gacagtagaa gggtaaatga cctagttatg
ttcccactag 143460taagctgtgt gaccttgggc aagttactta acctctctga acattagagt
tctgtgggtt 143520tgtttttgtt ttgtaagctg gggacaatag tgccagccta aatcaatttg
ttgtggggac 143580tcagtgcaat agcccatggc aaagtgacct acatgcttgc tgttattatt
ctctttcctc 143640aagttctgcc tccctcttcc agcttttctt ccaaccccaa agatgtctct
ggctattgct 143700tcgaaggtag gaactttggt tggttctccc ctttctcttc aggcccaaac
tccccacctc 143760aagatccttt ggcctttgta gaaacttcag gtgaggaggt ggcagagaaa
taagaaagtg 143820tgcaaggctg gtggagtgag agaggaggat agatggcgaa gccctagcag
aggggaggga 143880agtgggcagt ggagagagg
14389916215980DNAMus sp.modified_base(1001)..(1100)a, t, c, g,
other or unknown 16ttgggggtat aaacccagaa gtgggattac tgcaccatac aataatcctc
taacttcaag 60caatttttcc acaatggttg tatcatttta cattcccact ggctacgaga
agggttccca 120cttctacaca tcttcaccac catttctgtt tttgtttttg agtaacagct
gcctaatgac 180tgtgaagtgg tatcttatct cagtgttgat ttgcatttct ctgatcatta
atgtgggaag 240gcatcgtttc atatgtttat tggctgtttg tgtatcatct tctttggcga
tgttgattca 300agttatttgc ttgttttttt aattggagtt ttaaaaaatt gttgttgagt
tgtgggagtt 360cttcattagc tctgcatatt aataccctga tgaaaatgat taacaagtat
ttgcttccat 420tttgggggct tccattctgg gctgttttta ttcttttgat actcttttga
ttctcaacag 480tttaatctga ctaaaattca gtttatttct tcttttaatg gccatgctat
tgacacatcc 540cgtaatcact gccaaatcca gtcatgaaga gtttctttca agagatttat
agttttagct 600ctttaagttt gtcatgtctg tttcacttaa ttttgtatag tgtacaaaag
tctaacttca 660ttcttttcta tatggcttgc tactagtata cgaagagcta aatttctctt
tccttgagtc 720tcaacctctg atgtgtagca atttcttcag aggaaaacat ggtgggaagt
tccttaaaca 780taggatgctc catggaggtg aaatagttca tcctacaggg aagcttgtta
aacacaggaa 840gtacatactc agcagctcta gtaagtgagt gaaactgact ggaggcacta
ggtccctcct 900tccctacgca tatagaagct gtaaggattg ggaagagata ctgtcaggtc
agctcagctg 960ctgcccggaa gaagctcaga cccactggcc tggctccaag nnnnnnnnnn
nnnnnnnnnn 1020nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 1080nnnnnnnnnn nnnnnnnnnn atcactcttt actcaggcca cctacacgct
gtttatagcc 1140tgcctttgtc tctttggcta tacttcctgt ttatgtctat gcctcccctc
tttctttttc 1200tttctcttct cttctcatct catctcatct ttcttcaggg gggagcctgg
tctagaactc 1260acaaagattt gactgtctct gtctccttgc actaattaaa aaatctttta
caagcatctt 1320ttagcaattc ttacagggaa attttggaat gttaaactct gattgttagc
gggctgaaga 1380taacaatagc tctgatgata aattgcttgc caggcaagtg tgaaaatctg
agtttgatcc 1440aaaaagccgg gtacagaggc caaagagtcc ataatcctag taggggcagg
aatcagggat 1500gggtgggtcc ctggggtttc ctggtttgtc agcgtagccc aattgggaat
agccaggttt 1560cagtgaacga tgctttctgc aagctgagag aggtccttgt tcaatctctg
tgacccaact 1620ggagggagaa gagagccagc tctccagaag tggtcctctc aactttgtgc
atgcatgtcc 1680atgttcacac agggaatgga taatgcttaa aaggaagacc ggcagggggt
tggtaatgca 1740cctcctttgg tgacatgctt tcctcttgtt catgctgctc caggtgtggt
cggcagcacc 1800aaaaaccagg tgtatgtttg taatcccagt attctctggt cgtcagtagg
aaatgaaaag 1860cgaggtcatc ttcgtataga gttagcaaac tctaagccag cctcggctac
atgagacttt 1920gtctcaaaac aaaggaaaaa tcaaggagga cggctcccga gcactgtcac
ctgaagctga 1980cctctggcct ccacatgcat gtgcgcaaac acatgtcctg cacaaacaca
cagacacccg 2040catctgctcc ccgacaaaag aacctgaaac cagtatactt tgagaatttc
ccattcatag 2100ttaccattgt gtgttccttg tgnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 2160nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 2220nnggtggtgc ctttctctta cccagtctag aagggctgga ggcagggtgg
atggggcact 2280ttgaactccc acctaggcaa aaaacccagt gatctctggg ccagtgtgtt
gtttgcaagg 2340gaataaggta gagagccgcg gaggaagaca ttgggggttc tatgagtatg
tgaaggggtg 2400cacacaccac acacacacat tttttttgtt ttaaatttac aaacattaaa
ataggctgta 2460atgtggctca gtgggtagaa aaacctgctg tctaagcctg gtacgagttc
aatccctgac 2520aagctggaag gagacaacca accacaactg ctagcagcca gaagcactgc
ttgctaacac 2580tcaagagagc ctggagtgga agacactgga tccccagcag gcaagcctgc
aagaagatgt 2640gccttgccta gacaacggca gaacaaacat caaggctggc agagctgtcc
aggactgttc 2700atattaatca tgtatagata agagggaatg gcacagacag aacaattcaa
cacacggggt 2760atgaaaggaa aggaacaagg cacacaaagg acaaagaacc tagcatacaa
gaaagcctaa 2820gcagagagtg gcacttccca gaagggagtc ataaaataga ctgaattcat
taaaacaaga 2880gccaaagata aacggctcaa aaaactcacg gaaaacaggt caaaataacg
tcacccatct 2940gacagttgat actgtcaact taaccgtatc tagaactcca gcaggcacat
ctccaggcat 3000gcccctgaag gggtctttgg actaggttaa ctgacgtggg agtgacacca
tctatggacc 3060gaagcctcag acagaataaa aaggagccag tgagctgagc gtcagtgctc
attgcttctg 3120gcttcctgtc tgtggctgca gcgagacacg gtgcttcctg ctttagctgc
catgacagac 3180cacaccctca aaccgtgaac caaaataacc tcctctctac attgctttta
ccaggcattt 3240ggtcacacca atgagaaagg ttaactaata cagcactcaa tacttaaaaa
cataaacacc 3300aaccttgttt gcatgtgtga gactttgaag ctcacgggcc agttatgccc
aatgccaggt 3360ctgctggcta agggtgagag tgcacaccta taatcccagc tgctgtggaa
tcagcaaaag 3420cgctacagat ggaaggcagc cagggcagct gagactgact caaactgata
gaggtgggag 3480gcatagagaa aaccagatta atagagtgtt ccccactatg caagaagccc
tgggtttcag 3540gacgagagaa ctaagaatac agaagtctac tgtgtagaag cactgctagg
tcacacagaa 3600acatcactca agtgtctctg gatgctacac ggagggcgtg tgaagtattg
cttcctgatg 3660atctgtatct actacagcac tgctgtttta gtatgcgctc ctccactaca
gctcctcacc 3720acaccaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 3780nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaat
taatcaaaga 3840aaacacacac caccagttag agaaagttaa tcaggccgaa tggcggcttt
cccctgtatc 3900caggctaccg tcaggacggc tcactgccac tggcaactct gcctgaacaa
agcccgcagc 3960caacgtgggc ttcaggggct ctaaacactg caatcaaagg ttgtgtgtgg
gggtgggggt 4020gctgctgcta ttcaaggatt cccaaagctt agatgtattc atcatactca
caggaaagcg 4080tgttcaaccc atcactcatg agcagtcggt accggggtga cctattccct
gtagaaatgg 4140gacggatgtt ctggaaaagt tgacagaaaa gttgattcat taggcaggct
ctttgcccaa 4200gccctgaggg taagcaaagc taactggcag gagactaggt ttgccattaa
tctgagacaa 4260gatgaaccac ttgcccatcc tcctgacacc taaatactaa tgaaagaaca
atggattgag 4320ctggcattat taaaaacgat agaaacagaa gtatcaatag tcatgtgttc
tttctcccat 4380atgtcaaaac aatgtgtaag atggcatcga acacatgcag aaactgttta
gggaacatgc 4440tgaaaatatg aagtaaaatt aaaattggaa agaaagacaa tttgcctaaa
gcagctcaga 4500gctggagaag ggaccgaggc agagataaca gcaacgtgtg gacatacgga
tctggggcag 4560agcagtcacg gactcagccg gaaagggtgg ggcagcctct gaaggaagtt
aaggtaaata 4620gagccacaag gtgattggcc caggagtggt gccaccttca cctcctgcct
caaagtctga 4680aggaatgatc ctggagtctc ccatctattg atatatgaaa ttcacagtat
gttttagaac 4740ccactgaatg atgggtagat taactaaaag aaatttaagc ggggtggtgc
aggtctttta 4800atcccagcac ttgggaggca gaggcaggtg gatctctgtg agttcgaggc
cagcctggtt 4860ccaggacagc cagagataca tagagaaacc ctgactcgaa aaaacaaaat
taaaagctca 4920tcaaaacaac aacaacaaca aaaaaaacaa aaaaacaaaa caaaacaaca
aaacacccta 4980tagtacctgt tggtgagttt gagtgagtga gtgagtgtgt gttagagaga
ggggcgggga 5040aagtgtgttc tggaaatggg agaaagagaa tgtgcatgtg tgtttctggg
atgtagacaa 5100aactacatgt cttccatcaa atgcaatgtt taattatcta tgagttgaac
catcttcatt 5160ctgctaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 5220nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaac
aaaaataaac 5280caaaccagta aacaaaatcc tgtaagataa agcctaagac aagacacttc
ctggggctgg 5340ggagttgctt agaccataag gagttcataa tccaggcgtg agagcccgag
ttcaggtccc 5400tgggcttcca agtcaggagg agaccaagga atcaacaagt ctcgactttg
gtctctagtc 5460ccatgcacac acacgcgtgt aaatacgtag atgttcactc acacacagaa
gactgcacct 5520ggctctctca catctcagcc aacatataaa gcctgcatta tcagaacatt
ctaggttcta 5580gtttcagtca actcttacac agaatggcca tcatactccg tctacaactt
ctcctgatct 5640acccacgtgt cattgcttca gtattaacaa aacccagaat aaccagctgc
gtagatcctc 5700cctgatgccc cagtcattgt cttactgaga ctactaagtc acaaggtagc
actctggatc 5760caaaaagcaa tatccaattg agagttacaa cctataagga ggagtttacc
ttcattatag 5820ggcactggat tcccaatctt taatccaacg tcttcagcag atttcataac
ttccaagtcc 5880atcaaaacaa ctactttcct acaaagacag acacaagtta gaattaagaa
ctctgcagcc 5940tttcagatga gttactaaga agcttacttt agtagttgtc tggctaaaac
tgtatccttt 6000accaaccttt tctcattctg gactaacttg agaagtatta attcctaagt
aaatacttca 6060cttattcttt ccccacatct ccaatgtttt tgtctttaat ttattatagg
gcaattcatt 6120tcctatctag ttccctgatt aaaacagtag accttgctgc atgccattat
cctcatggag 6180gcactgatac aatttagatt attaaataca aaaccctaaa acacaaaaag
atgatttttt 6240tttaaaacaa gattttaaaa aaagcatgtg ctacgcttcc ttctgccact
aagcctacac 6300atggtcctct gactgaattt ttcccctcat tctgcttcat ctaatatgtg
cttttcaaac 6360ctggaattga accagggact tattcatgct aggcaaatgc tctaccatag
agctataccc 6420ctccaactcc catctcaaat atcatttcca aagacatttt cttggtctct
tatttagatc 6480aggtttcttt gtcctcctgc agctatgact tcattccttc agaacactcg
tcttagcttt 6540aagttctgta ttaattagtg attgttttca ttctctctgc tagaatgcac
tttcaataaa 6600ggcaggtagc cagccacagt gcttaattaa gcaacagccc aacgatgtca
ttcactacat 6660actgggacaa gatgcctaac atcatctgca gataaagacg aactactggt
gtcaggagac 6720agctaagggg tccagggctt gggcacgctg agtgtgagca ctggagtccg
ggtgcccaga 6780aacgcacata aatgcaatat ggatgtggca atctacctct aattccttct
ttaagacagt 6840ggctctccag agcaagctgg ctagcaagac aagccatatc agtgagctct
gggcttgacc 6900aagaccctgc ctccaggtgt aactcccaag caaaaggatg atggctcaca
aatctcaggc 6960tatcatgttc atgtacaaaa tgtcaaccgg catacacaca tgcacacaca
tgaaaactgg 7020gagaaaataa gaagaattgc aaccaaaaaa tgtaatttga ggacacataa
ttgcaggcgg 7080ggagtggggg gatgacagaa ggtgaactga gtggaccgag ggaaagctgt
gctagcggca 7140atgagaagaa gggtggggca gtctgagcaa gggttcagca atcaccacgc
tttactgtct 7200gcacagcctg gctgtagaat gctgggcttt atcacacaga attattcagt
atgtgctatc 7260tttacagtaa agttattcta tcaggctatg ctacttcaat agaacaagcc
tgaaaaagtg 7320gtctgctgct gagaacctga caaagatgac ctgttagaac tgtctgccaa
gtgtggaatt 7380ccagcactgg ggaccaggag ctcgagggtc accccagatg cagggagtta
gaggccagtc 7440ttggcaacat aacatcatgc ttcagaaatt aaaaacaaaa nnnnnnnnnn
nnnnnnnnnn 7500nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 7560nnnnnnnnnn nnnnnnnnnn catgagatag ttaataaact gaagaaagcc
atacaaggag 7620taaagtagat agttgcaagc atgaagaaag acaaaccact tgagcttttc
ttttgtcgta 7680aggaggaaac cagacaggtc cagagagatg gctcagagat taagagcact
gactgctctt 7740ccgaaggtcc tgagttcaaa tcccagtaac cacatggtgg ctcacaacca
tctgtacagc 7800tacggtgtac tcatatacat taaataaata aataaataaa taaataaata
aatcttaaaa 7860gaaaaaaaaa aaaaacctaa ccaatcagcc aggcgatggt gacacatgtc
tttaatccca 7920gcacttggga ggcagagaca ggtggatttc tgagttcgag gccagcctgt
tcttcagagt 7980gagttccagg acagccaggg tgatacagag aaaccctgtc tcaaaaaaca
aacaaacaaa 8040caaacaaaca aacaaaaaag gaggaagcca gacaggatgc actttatacg
tgaatggaat 8100tgacaaaaga caagttctat aagtgttagg gaaaggggga ggacaacggg
ggttcatgtc 8160tgtggtggaa cacgtattag aaggctctgg gtatcctgtt tccgacaaac
aggcactccc 8220aatcacacag gccactggat gtctcaggca gagaaagatg tgatagattg
actttttaac 8280aatcacagac tgtgtggaaa atatttgtaa ggttgtcatt gtcacccagg
atagagctga 8340tggttattca aacgaggatg ggacaacaga aatgggagag agggatgtga
gaaccatttt 8400gaaccagggt gatttactgc gcacgtgtat agggtctaca gggagtggga
tatgtagagg 8460aggcctatgt tcctaacttt ggtaatgagc ttattacagt tactatgcac
agcctggaag 8520atactggaaa aggtgcaggc taggctagaa aggtactaac tgagggtttg
acagcccctt 8580ggatgtcagg atgcagcaag cctacctctg tatgtagtca atcccttctc
aggctatggg 8640tcctgcagat catccgtctc tgtatccatt attcccagtc catcctctga
gtggctccct 8700cttatccagt ttaacaaaat gctgactgca agctcccaag cccagggctc
tggctccttt 8760actccttgtt attgtacttt accctgtttg cttgggatag agtgtgccct
ttataaacat 8820ttgtgaaagg gggaatgaag aagaataann nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 8880nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 8940nnnnnnnnag agagctcaat ggttaggagc actggatgct cttccaaagg
tctccagttc 9000aattcccagc atcaccatgg cagctcacaa ctgtctgcaa ttccagttcc
aggggattca 9060acactcagaa acataagtgt aggcaatcta cgtaacataa aaataaataa
atgagctgga 9120aaagaaaaca tgtttcaaaa tatacaagta atggggctgg aggagatgtc
tcaatgggta 9180agatcattgg ctgctctttt ggaggttctg ggttcaattc ccaccaccca
catgacagct 9240cacaactgtc tgtaactttg gtcctgtggg agctgatgcc ctcttctggt
gtgcagacat 9300acatgtagac aaaacacctg catacataaa ataagttttt aaaaaagtta
cacatacacc 9360cgtgtgtaat ataacacaca ctggcttaac ttcctcagca ctgactgttc
accatacgga 9420ttcccatgag gttttggttg cattctatca ccgaaaaaaa aaaaaaaaaa
ttagaagaaa 9480gtatatacat ataaacctct ccctaaaata aagttttctt ttctaaaagt
acatccttat 9540ttttttattt tttttttttt ttaagaaatg ggaacaacag ttctgctcac
actgtatttc 9600tagcatgtaa catcttgcaa gtacttaacc gtattctata tcagctcaac
acacttacta 9660ccgaagactc aagatcacaa aaaaaaaaaa aggacccaga ctggataatt
aaacgtttct 9720tttgttgtag taagcgacct cttccttaga agatactaca gtaatgctga
agaaatgaca 9780catctactgt aatctgttct ctgggattcc aacttgtttc ctctgctact
cctcccttgg 9840cggcaatgtt cgtctgcatc cggctgagct cctcgctgcc ttgttaaacc
tccttcctga 9900acttccgacc tgtagttccc gctctacagt gcaagcgagt ggataaggaa
gcgcatacct 9960gccgtctttc agggtgttga cgatgaactt gtggacctgg cagacacagt
tgctggccag 10020ctgccctccc tcgaccaggg tgttcagctg cgtggccagc atgaacgctg
caaaagcaga 10080gagagagggg ctcagtctcc aagcctttcc ttaacccgaa agctcatcac
aaggagaacc 10140attaaataca gctgtttaaa actcctccgc cctgcagaga ggaaagcagc
atcaatccgc 10200cccatgtaaa agtctgaggc tcttcctaaa tggtatctgt ttctcacagt
ctccaaatca 10260tttttactgt aattctagtt tctggggaaa gacctttctc ggtctttagc
cccgtgacta 10320gagacaacag gcaaatattc cagaaaggcc cccattttct ttttaaagct
tctannnnnn 10380nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 10440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngcacat cttgtgaagt
gtccacatct 10500ttcggtccct cgaatttggg tttcttctgg gacgtggtag catgtgactg
tcactccagt 10560gcttggagca gcagaggggt caggaactcc aggctggcat tagctgcaga
gctggagcag 10620gtcctggaga acagaaactt tggttgcagc attaatgaac tagaagaatt
tttttgtctt 10680ctgttaaata taaatacctc cattatcttc tcataaacag tgttgccttt
ttatttaagt 10740ttttaaggat caggcacaga gactccatgc cagactacca ctcaaccact
gagctacacc 10800cccaacttgc ctttctgcta ttttttaaat tgtatcagtg gccaccaaac
atggggagag 10860gtcagggggc tacgtggagg aattgtttct ctcctaccaa gtgggcccca
ggtttcaaat 10920tcaggtgacc tggcttggca gcaagcacct ttacccctaa gccatctcat
tggcttcatc 10980ttttaatggc cccttcccct gctctgaggc aggctctccc tatatagccc
tggctggcct 11040caggctcgca ggtccaccag tgagcaccag gtttctgctt gtccttacct
ccccagcact 11100gtggttataa gcatgtgcca ctgtgtcaaa ctcagtcact aagctttgcc
aagccatagc 11160ccagcccttg agtttactgt ttgtctgtgt ggtgatttgt caaaccactt
ttgttccact 11220gaggtatttt gtcaagtttg acaaaattag ttgagtatgt aggtcttttt
ttctggaatc 11280ttctgttata gcttagtctg gtcttgaact catgatcttg cctcaacctc
acgattattg 11340aggattattg agatggacag gctgtgtgac catgctcggc tgtgtgtttt
agcatgcatt 11400agtcatttga aaaacgttgg ctcatgacac tttacaggtc ttccatgttt
gatatgtttt 11460atttaatcca aagtaattcc agcaccagag gctgagacag gaggatctca
aggtcaacct 11520agagatgcat agcaggcggg gccccactcg gttaggttaa tatcatcact
gacttcagga 11580gaaaagtctt aagtattggg gactaaaagc aggaggatct gaagttcaag
gtcatcttta 11640ggaacttagc agacttgagg ccagcttggg cgctgtggga ccctgttttt
aaaccagaaa 11700acaaattgaa aggaaaaaaa aaaaaagctg gaggaagtga atgtgagtgt
tcacatagtc 11760ctgtttccac aagaaaacag ggttactttt ggcaacaaat aggtgctttc
tttgaaggct 11820ggcatttttg tgacttgtca ttggagaaat gatttaatta agacttttct
actgagtgcc 11880tctgaagagg ctcttttaaa tttagtttaa ttttatctca ttgttagtgt
ggtgtgcttg 11940tgcacacaga aggcagcttt ctagagtctt ttcactctct cctccacagc
tcctggagtc 12000aaactcaggc cctggctagg caagctctta ggacagtgtt agctgtagct
tattaagttt 12060ttaagaattt ttataagact ctgtttttct ttctcaggtc atgatacagc
aggaaaatac 12120atccataaag cccatcctgc aggtcattgt aagtaccggc atgtgtgttt
agcataatga 12180agatggttca cttatagtta attaaacatt ggattggatg gaagacatgt
agttttggtt 12240acttcccaga aacacaaatg cacattcttn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 12300nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 12360nnnnnnnnng aattcagagc tgatatgtag tactaactcc tactcaatga
atcctttgtt 12420cttctattcc ttcattacat tactgttaat agtggtaact atgtaccaaa
gagtcaaata 12480actcttggac catccaaggc agaaggaagg ctggcaaaaa tgtatgatga
tctgggatgg 12540gaatgtactt cagtttgtac aggaggccct tggttcattc catttctggc
aatgcataga 12600cctgtaggat ctcagcactg gtggggggtg gggggtgagg gtgaaggggc
gggaggttaa 12660aggcagaata gtcataaatt caaagtctgg gtcctggaaa gaggactaaa
cgattaagag 12720ctttagctgt tcttctagag aacctggtgt gatccccagc acatggtgcc
tcacgactgt 12780ccgaaactct gattctaggg ggatctgaaa accctcttct gccctctgta
gatacagaac 12840acacatggtg cacatacata catgcaaccc aaacaaccca tatacataaa
atattttttt 12900ttcaaaaaga cattcaaatt cttcctcggc tatatagtgt ttaccaaacc
tcaaaaacaa 12960aacaaaacaa aacaaaacaa agaatcatta atgttttgcc ttcatgtatg
tctgcccacc 13020acggacatgc ctggtaccca gggagattaa aagaagacat tagctcccct
ggaatggaga 13080taggtatgat ctaccacttg ggtgctggga acctgggtcc cctgcaaaag
cagtaaatct 13140ttttaacccc taagctgtct ctcccaacgc ctaaagattc ttgtaacaca
gcatgatgag 13200cactggcaag catagcatgg taatctgact tcagggcgcc agattttgag
cttaatgctt 13260gattattaga agtaacgtac tagatttaat gcctggagct tcaagcaaca
aaattaactg 13320aagaataaaa ataaaaaccc tgccagccat gatggtaatc ccagaacttg
agaggcagag 13380gcaggtgatc tctgtgtttt gcaaggccag ccacaatcta catagcacgt
tgcagtannn 13440nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 13500nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnncga ataaatctac
acatgtaaaa 13560agaaattcaa agaaacaaat gccaaataaa tacacatatt gtaataaaga
gataattgtc 13620taaaaaactc aaggctttaa atggtaagat atcatattct tggatgaaaa
gatctaatgt 13680caaaatatat caatttaatg caattatgta tattcaggag atctctggtt
ggcttttgaa 13740cttgatagca ctcttataat tcacatagaa gaaaaaaaac catgaaaact
gccaaacatt 13800attagaatac tccacagatg gtattttggc agcacataca tcgaagggct
gtgaaagatg 13860tgtagatcat ccacgccttg ctagggagag ggcgggtgtg tgtggggggt
atagctgttt 13920gggaaaataa cctggtaatt cctcattagt taaatcatag tcagaacctg
gactagcaac 13980ttctctctaa aatacattca ccctcagcat ctgcattgcc aggaaaccac
tcctagcagg 14040atctgtacgt ggatcaaggt agtagcatct gcatttaatt gacattctcc
taaatgcttt 14100aaattatctc tagattactt atagtagcca agatgatgca aattatgtta
cactgtatta 14160tctggggcgt aacaagaaaa tgtctctact caggttcatt caggtgcagt
acttcccctg 14220aatacttctg aatacacgga tcaagaagcc acagaaagag ggctaaccat
atacaagcat 14280atagtacact aataaccatg tacaaccata tagtacacta atattcagtg
cattactcaa 14340aatgcaaaca gatggaaaca atccaacagc ctgtaagctg aaaaacaaga
taagcaaaat 14400gtgctgggcc tagaggccca ggtctataat tccaactaag gtcgaggcag
gaggatctca 14460agttcaaggc cagcctagac aacttagcaa gaccttgtct caaaacaaaa
agtaaagagg 14520ctgaggatat agctcagtat agagcatctg cttagcatgt gcactgacag
ccgtatcaca 14580gaggaaaaaa aaaaataagc aaaatgtgat ctgtctgcac aacaggatat
cacagccccc 14640taccacaggg gaacgacaca gtaacacaac aaaaacttag ccctgaaaat
actatggtaa 14700ataaagaagt gtcactgagg atcaggaaat gcatgactcc atttacatta
tatagaaatg 14760agaagatcag tgagcctcta ggactcaaga gatttgggat tggcagctaa
agggtactgg 14820gtttctttat gggggtaaga aaacattcta aacttaactg tgagaatgac
tactcaacaa 14880tgtcaagtgt tcaaaaatca tacttttttt tttttttggt ttttcaagac
agggtttctc 14940tgtgcagtcc tggaactcac tctgtagacc aggctggcct cgaattcaga
gattcacctg 15000cctctgcctc ccaagtgctg ggattacagg catgcgccac cattgtccgg
ctcaaaatca 15060tacttttaaa aattgcccag tgactcatga atacaatcag aggcgggaga
ggacagtggc 15120aaactcagga taccagtgtc ttttatgtct gctgcccaac tatcaatttc
ccatagttac 15180cagagaactt tttggtttgt ttcatcttat ttgttgcttt tggtagaatc
tcaatatagt 15240aagatacaag gctggcctca tactatatag ctgaggacga ctttgaactt
ctaatcctcc 15300tgcttccatc tcccaagtgg tgggattaca ggggtgtacc gctatgccca
gcaagcacaa 15360agccatttga accacacccc agccttttca gagaaacctg tacaagcctt
agtgccttag 15420catattaagg caacaaaaga cataatgcgt ggctaccata gagtgtttgc
ctaccatgtg 15480tgaggctcta ggctaaatgt ccagcactta taaaaaagag ttaaaaacac
tcatgactca 15540aggatgacta tgcagtcttg tgtacaaagc cccgcattca atccccagca
ccgtgcacat 15600caggcaggct ctgtagagga cccagcttaa ggtcatcctt aggtaagtta
gaggccttag 15660atggctacat tagatgagac cctttctcat aaacagaata aataatttaa
agctcctgat 15720caaacactat gccttcccat cacactcaga ataaagcact ctactggccc
tttaaggact 15780gcccatctgg aagagaaacc taagttacat tccttgcttg tgtcatatgt
gataacaaac 15840tcactggaaa tacgaaaata cagtcttaag cttggtcaga aagcttcccc
agcaacatga 15900tntcagagga cataatgcag aaagtggaca aatgcaaann nnnnnnnnnn
nnnnnnnnnn 15960nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 16020nnnnnnnnnn nnnnnnnnaa tcagaggaca tctttcagga gttagttctt
tcctccctct 16080atagctttca gggatcaaac tcaagtgtgt actgagcgct tatgcccagt
gcgccatcgc 16140accaggcctg cttctttgtt ttttatgggt ctgaatcaat tagcaccatt
acaacaatgt 16200tgacaatcag caagtacctt tctctacctg gctagtaaga gaagtaagtg
cctttggtgt 16260gtgaacgcag tttctcttgt gaagtgcatg gacttgatct ttgctcacaa
cgttttttag 16320gtccttaagt tgcttgggtt ttatgggaaa ggctcttggg ttttttgaaa
agattttact 16380acaacttgat ataatcatta tttttaatcc tttaaatagt atgacttatt
ttaacagatt 16440aatattgaac tgttctttca ttcttacata ataaatcctg ccttaaaaaa
taatcctctt 16500agcttccttt ctctattttc aaatttgttt tatatttttg catgattttg
aacatttata 16560aaagtaggca gacaacacag tagaaccaag tccccatata gctgtgcaca
tagcttcaga 16620ttattgcctg ataataggtc ctgttttgtc tctgttttct cacagggatg
tgttattgtg 16680tgtgtgtaca catacataca tatatgtatg tatgtatgta tgtatgtatg
taatgaactc 16740cctttaacaa aacaagtact gggctgggaa gacagcacag ttagttatgt
gtttaaccgc 16800acaagcatga caaccagagt tgagatcccc accaaccgca taaaaagctg
ggcatagtgg 16860cattgacctg tagccctggt gctggatgaa agctggggag gcaggtagat
cggcagagct 16920tactggcaac aaatctgccc agtaggtaag ctctgggctc agacatccta
tataggaaaa 16980agatgaaggg cgaggcgcag cggcacacac ctttcgtggt agtgcttgag
aggcaggggc 17040aggccagtct ctgtgaccag cagcctggcc tacatgtcaa gttgcaggac
agccagagcc 17100accacctact gagactgtct cagaaataag ttttttaaaa aattgagatg
aaggagctgg 17160taagatggct tagaaggtaa aggcacttat cactaagcct gaagccccga
gtttgaccct 17220ggaccccaca ctgtagaacc aactcctcca agttcttctc agacctccag
cagagcacaa 17280gtgtatgcag acacacacac taagtaagtg aatgtaaaaa acatgacgta
gtggcactgg 17340cctttaaacc cagcattggg aggaagaagc gggtggatct cttgagtttg
agaccagctt 17400agcctacata aggaatttca aggcagccag ggctacctag aaagtagctg
tttatgaatg 17460aatgaataga aggaaggaag aaagagagac agacttaaaa aatatatgct
ggagagtaac 17520agaagaggac accggcttgc tggtgtcttg acctctggct tgtacacata
cacatgtgta 17580gtgcatacac ccacatacaa ttgtactcag acacacacaa acatgtactc
attcatatac 17640tgcacacctc aacactcaga aaatgaaaaa acaggtacca tttacacctc
cgtgttcggt 17700ttccaaccac tcatatgtat gggttgtaaa tgcttatatc tgtatgtgtc
tgtatatttg 17760tgtatacatt caaagttgag tcaggatcca acgtaaactt ggatagtagt
gggttgatgg 17820tctggaagcc tgctcgcagc tgtctttttc tcctcgtacc ttttcccctg
tttgtttcta 17880cgacagcagg tcatttgtct ctaagtgtta gtttcccatc ctctctcttt
tgctgatggt 17940agccttgtag tagtcacctg tgttctctgt aaaatggctt tgccgtgtta
tttcaatatg 18000ctatcatcct catcttgcta tatttcattc aatatatgta tatattacaa
gatagattaa 18060aattatttta attttatgct tatgaatgtt ttgcctaagt atattgcacc
ttgtgtgtct 18120agtgtccaca gaactcagaa gaaagtgtca catattctgg aactggaatt
gcaggtggtt 18180gtaagccacc atgtgggacc tggaaaccaa atccaggcgc cannnnnnnn
nnnnnnnnnn 18240nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 18300nnnnnnnnnn nnnnnnnnnn nncctttggt caaagatcct cagtttcgac
tttgattacc 18360cagacttcct gtttctctca tggaacagtt tccccctgag atttactagt
ggaagaaagg 18420cactcaaaaa gcagggagcc ctcgtacaaa tgagacttcc tagctatata
attaggccga 18480gatgcacaca tccacagtca ttacccttct tcagagcctt tgtcatgtca
agtgtatttc 18540gcccatgtga actttagaac tggcttgttg tgtttcataa aaagtgtagt
tgggctcttg 18600attgggattg tgttaaattt atagattcat ttagggagag ctgacaactt
tacagtatta 18660aatgttttca tccgcggaaa aggttgtctt gccacttact tgggctttct
tttatgccct 18720taagtaaagg tttatagttt tctttatatc agtcttgcac atttcctgtt
agatttattt 18780ttgcttgtaa ggtttccttg gttggtattg tgtataaaca ttttctccca
attacatacc 18840ataattgatg gttaggagta taaataaaag gtagaatttt aaaattgctg
tatgaatgac 18900tgctttgcta taattcaaca tcttttcatt tttatgccaa tctacttgac
atgtttaggt 18960taaatgatga tatctgtaca gagtaatcct ctgatccagt atttgcacat
cttactttct 19020aacgtccata gcatagatac acatcttata ctgttgagta catatatatt
taaggtattt 19080accatagtct tataatatgc agcgtgcttt ggttcaagac agttgccctg
tgttcctcaa 19140cattaacatt tttttcatca caaatacaca ttgaccttta tcaaattttt
aaaactatct 19200tgagagaaat gaccattttt cttaatctgt taatgtaaaa tttttataaa
aatagttata 19260aataatatta gcctacatat ttcttctgtt ctctttttca actcttagaa
tcagagtatg 19320gtagtctcag actaaaccag gagcttccta tctgtttctc tgttcttaag
tcacttatat 19380aatgtaagga tgctgtgtat atctgccagc taggccttat atacaaaagg
cacccatcac 19440aaccttctaa aacagtctta ccacttagag accatgttca aacatatggg
cctttgaggt 19500aattgccaca ttcaagctat aatattgtta tctaagggaa tatcttcact
tctagcagat 19560gcctaaaaat atctaaaggt aaacactggt aattgctgtg tttgttgatg
ctgctcttcc 19620tcctcctcct cctcctcttc ctcctcttct tcctcctcct cttcttcctc
ctcctgcttc 19680tccttttctt catcctcctt tcttttctta tttttgaggc atgatttcac
catgtagccc 19740taggtaaccc gtaacttact atgtatgtag accaggctag cctctgtctc
ctgagtgctc 19800atattaaagg tgtgtatcac catatccagc aacacttgct ttgagatggt
tagaggaaaa 19860aaaaatatac gtaaataaag atggatgcca attactaaat tgttacttcc
agtcaaactt 19920tgtacctagt ctaaggccaa aatagggatt ttttttctac tttgcaagtt
ggctccatta 19980agaggctttt cttctcttgg tctcactaga taggaaggag agagaggagg
gaaggagaga 20040aagcggttga ggagtgggag gtagtgtgac cgagaatacc cagtaggctc
atatatttaa 20100atatttggtc cctagttgat agaactgttt agaaagatta ggaagcatgt
cttaggggct 20160ttgaggtttc aaaatttaat gctagaccca gtctttcaag ggagggggcg
gtctgtctct 20220ctctgcctgc tgcatgcaga gctctcagct actactctag tgtcaagcct
gtgtgcttcc 20280tgcctcaatg atcataaatt aactgtaagc aagcctccaa ttaaatgctt
tcttttatag 20340ttaccgtgat catggtgtct cttcacagaa atagtaacct gtggtgattt
taatatgcct 20400ggaccaggga gtggcacttt taggaggaat ggccttgtta agaggaagtg
tgtctctgtg 20460ggggtgggca atgagaccct cgtcctaacc atgtgagaac cactcttctc
ctattggcct 20520tcagatgaag atgtagaact ctcagatcca cctgcaccat gtctgcctgg
aagctgcctt 20580tgttcccacc ttgctgcccc aattaaatgt tgtacttata agaattgttt
ttggggggct 20640ggagagatgg ctcagcagtt aagagtactg actgctcttc cagaggtcct
gagttcaatt 20700cccagcaacc acatggtggc tcacaaccat ctgtaatggg atctgatgcc
accttctggt 20760gtgtcagaag acaggacagt atacccacat acattaaata aataaataaa
taaataaata 20820aattcttttt aaaaaagaat tgctttggtc atggtgtctg ttcacagcag
taaaacccta 20880acataaccct gactaagaca acaagtgagg aaaggtgttg tgtgacactc
tggatctctg 20940gaagctcacc tcagcatgaa gcttgtcgaa gcgnnnnnnn nnnnnnnnnn
nnnnnnnnnn 21000nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 21060nnnnnnnnnn nnnatctaag tacactgtac tgtcttcaga cacaccagaa
gagggtgtca 21120gatctcatga cagaggttgt gaactcagac ctttggaaga gcaatcagtg
ctcttaactg 21180ctgagcatct ctccagccca aaataattct tactagtaac atggaacaat
caagttttat 21240tatatgatac atattaatca acttataagt acatgattat gcacatttat
catatcgtgc 21300aaccatcact gctgtcgttt tgttttgttt tgttcttttg aggcccggtt
tctgtgttgt 21360tctggaactc actctgtaga ccaggctggt cttgaactca atgatctgcc
tgcctctgcc 21420tcccaagtgc tgaaaacaaa tgtgtgcacc accacctctg gctatcactg
ctgtcttttt 21480ttttttttta acagttattt atttcgtgca tgcatgtgtg tataagcatg
taacgtatgc 21540catggtatgc atgtggaggt cagaggacaa ctttcaggag ttagttcttt
cctcccactg 21600tgggttctag gaaccaagct caggttgtta gacttgcatg gcaagtgcct
ttaccacaga 21660gccatcctgc tggccctact ataggtcctt atataaaaag atcatatgcc
gggcaaaaac 21720caaacaaaaa ataaacctca aaaaacaaaa ggaccatata atattgtggg
ggagtggatg 21780aagtcctgaa cgaatgtgtt ctgttgacat gtctgtactt cagacccatg
ggaattggca 21840aagccttcct ctggtcctgt gaggatgctg atagtctgtc taaaaactag
agatcacagc 21900tttctcctct ggatgactgt aaccccagat tgttcctctt cagagactgt
ccaccaagct 21960accctgccta cttaagctgt acacaatgaa tgagctgagt ttccaggtta
cagcacagta 22020gacactgtcc atcagtgaga gcacagccta gcctaacagt acacatgtct
gctttcttca 22080cgtttccaga accaagcctt gctggataga gcatatttgt ctgtttggct
tatttcactt 22140gataaaaagt tttcaaggag ggccaggtgt ggtggcacac gcctttagtc
ccagcactcg 22200ggaggcagag gcaggcaaat ttctgagttc gatgccagcc tggtctacaa
agtgagttcc 22260aggacagcca gggctataca gagaaaccct gtctcaaaaa accaaaaaaa
accaaaacaa 22320aacaaacaaa caaacaaaca aaaagccaaa aatccaaccc cccccaaaaa
aaaaaccaaa 22380ccaaaaacca aaaaacaaca acaacaaaaa gtttttgagg tttaatttat
tgcatgtcac 22440agaatttcac tgtttaaaaa aatggctgaa taatatttca ctatccattc
acgtatttgt 22500aggcattcat gtgtgtagtg gtttaaataa aaatagcccc cataggcttc
tacagttgaa 22560tgcttagtca ttgagtagca gtactagaga gggaattgaa ggtgtggcct
tattggagta 22620ggagtggcct tgttgcagga attgtgtcac tttgaggtcc cagcaacaag
gttgctctga 22680tcacatccaa agacattcta ggtctatgtg atctggctgg aattcagaca
tgcccttaat 22740acacaccttt aatcccaaac aatgaaggta aagttagttt ataaaaagaa
gcacccatgt 22800ttgaaagtga cgtttaatta agagtgatga attagagaaa gatctgctgt
cacagagcag 22860agaggaaaga gaggcagcat aagagggagc atggcagagg gagagggagg
aggggttttc 22920accagggcat ttgtacagag acaggttgca gagctagaac aggtgaagac
agaacaagcc 22980agagaatgag aaggagccag gagattagga cagattgcca atgttaatag
gctaagcaga 23040gcattttagt cagaaactga gagaagtcaa attgaatcag ttagcttgga
aaggagtttg 23100agcagcaaca gctgagttaa actagccaac agaatccaga aagaactaga
aaagatgagc 23160ttactcagca gcaaatctca gaggctaaaa acatcttaga cctagattag
actgcatgga 23220ggctagacgc ttccagggct aggcctaggt tagcagacgg agagagtaat
aagccttgga 23280gacaacagtt aatacagaag actatgtaca gacatggata tgaacctctc
agccacttct 23340ccagcgtcat gcctgtctgc attgttagga gtcatctagg aaaggctaag
ggcaggcaag 23400caacttttcc agagatggtc cactgttttt tgcatggctt ttgagaggcg
agctctgaga 23460gggaaggttc caagagactt catcccagga ttgctgctta attacgacat
gccttttctt 23520gtcactgtta tttagtataa tgactcctga gctttagccc atcctattgg
gcatatttcc 23580tgcagatcaa cataaagatg aactttcaca aattaatgct gtttagatga
ataaatgatt 23640ttataaaatt cctgatttga tttaaataat tttaggaaga aagctttagg
agatagttta 23700gttggtttgc cagaaagatg taataacgtc agaatcaaga atagaatgtg
gctgggcagt 23760ggtggcagat gcctttaatc ctagcacttc ggaggcagag ataggcggat
ttctgagttc 23820gaggacagcc tggtctacag agtgagttcc aggacagcca gggctacaca
gagaaaccct 23880gtcttgaaaa acaaaacaaa aagaaaagta agtaaaggct gcataataaa
gaatacaatg 23940agctttcaca actacaccaa aaagagacat gcttgggaca aatttgtgat
caaggaaaaa 24000tattcattct agatcaggtc caaggatgaa gccacaagtg tgtgatatga
tgaacaagac 24060catggataaa ctgttgtttt gagcttaaag aataaaacac tgctttgaaa
ttaactatca 24120acattctact gtaactttcc tttttataaa ttttatctat gagataattt
tctaaagaac 24180ttgtgtctat aaaggtatag aaggacagag agaaagaaat aaggtgtggc
atctgggctc 24240tgctccatcc acccaaataa atatgtgtgt gtgtgtatgt atgtatgtat
gtttatctat 24300atgtatgtat atacatacat gtgtaggtag gtatatgtgt atgtatataa
gtatgcatga 24360acacttggga agttgatgag acaagtgaga ggttgggccc ccnnnnnnnn
nnnnnnnnnn 24420nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 24480nnnnnnnnnn nnnnnnnnnn nngaattcac tctgtaaacc atgctggcct
tgaactcaga 24540gaaccgtgtg cctctgcctc taaagtgctg ggattaaagc atgtaccacc
acaacccagc 24600tagtttaaat gtttcttatt tttttgttta tgggtctttt acctgtatgt
atgtgtgtgc 24660accatgtgga tgcatggtgc ccttagagtc cagaagaggg tatcagatcc
cctggaactg 24720gagtgacaga gggttgtgag ctgggacttg aacctaggac ttctaaaaga
gcagcaggtg 24780ctcttaatag ctgagcctta tctccaggcc gtcccatgga tttggggggc
tttgtttcat 24840tttattttgt tttgagacag ggtgtgtagc tcatgcttga atttactatg
aagccctgac 24900tcccctcaaa gtaaagatcc tcctgcctct gtctacagct gctaggattc
gaggtcttgt 24960accacatgct cagcacagcc atgattcata acaataaaaa aagaaagaga
gacctaaatg 25020gccttagaga taaataaatt attttttttt taaagattta tttatttatt
tattacatgt 25080aatgcacact gtagctgtct tcagaccccc cagaagaggg agtcagatct
cattacagat 25140ggttgtgagc caccatgtgg ttgctgggat ttgaacttcg gaccttcgga
agagcagtcg 25200ggtgctctta cccactgagc catctcacca gccccgagat aaataaatta
taatgtatgc 25260gtaaggtggg atcatctcag tctccgggaa tcttgcctgt tactccttcg
ctctcccttc 25320tattcatgct tgggtaactg gccctggctg attgatgaga gctgatttcc
ccactgccct 25380gtggcaggga ccactgcgcc cacagggctc cctcaggatc ctcagtacag
agctgcacag 25440ctgggtggaa gtagagggct gcatatataa cacgatctca actttatttc
tttaaataaa 25500aattttattt aaattttata cagctctata taaacgaagg aactattgaa
ggttcagcaa 25560ggacctgcca acggttgtca agggtaatgg cgatgtagtg attttttttc
ccccttccat 25620tttacttcca tactttctac attaccccac aactggcaag tattatttta
aaatgaaagt 25680aaatagtgac agatgacttt gaaggaaaat tgaatcggta aaaagaaagc
tgagagacca 25740cccgggaagc ccaggctaaa tgtaatctgg gtcaggcctc ccaggcctgg
ggtctcaaga 25800tggtcagctg agggaccctg gtgaccctct tgggccagca gggacgggga
ggagccggaa 25860gctgagtacc caaagtgctc ctctgggctc caagggcctg cacagagact
gtgtgggaat 25920caaaggatac aggcatgagg actgaggcct gacgaaccca gctatcattc
gtcctagaac 25980aggaggcaga gctccaagag tccaaccaag aggcaggaag ctttgggacc
cgagatgggc 26040gatgggatta gaaaggcatg tttgcaaata ctttcaaatt tacgatgcac
actcactgga 26100aaccccaccc ctgggtgtcc cttccctgcc tcttgccaca cccaatagct
gacatcactg 26160gagaaagtcc caagaccagg ctggctggag ctcctgatag gttccaccct
cctgcagagg 26220gccctcgaag actagcttgc tcgcccacac cgccagatgt ctgtgtcttt
ctctcttttg 26280cctcccaccc tcgtctcttc ctccaacctc agtggagggt cccctgcttc
ctggggaaag 26340tagaacttgc cagtgctcac tgtaatgtcg tccctgtagg tgtcatggtc
ccccattact 26400gggagcaggt atgcctcaga tctccctcta ttcgctgccc tttcaggctg
tctcagtttc 26460tctctgacag ttcctctcct cctgaatcct gcttgttggc atgcgaacag
gctcaatatc 26520ttccatctca aaaaacaaac actgggaagg tgttgagaga cagagagcat
gggtaatggg 26580tgccccagct tggctgggaa ggggtaactt acaatgctct actgcccagt
agggtagctg 26640cagttgtcaa ttaattgtaa atttcaaaat agctagtaga gaggatttta
gatgttccca 26700atcccaacac aaagaaatga taaacattca aggcgatggg tatgctaatt
gctctgatct 26760gatcaccgca cattgtatac atgtttttga aatgtcaggc tgtaccccat
aaatatgtac 26820aattaccgtg cagtgattca agataaaaac tataatttta aaaagctaaa
aacagaagga 26880aatagctgcc cttgaccccc ccacccccac aaggtccttc ctgtttgtcc
agccacttaa 26940tgtcagagct tcctgtggga gggtggtttt ggtgtacaca gacactcctt
cctccctcct 27000tccccataag aggagtcacc cctgtcccac gatgccatgc agggccacat
gcgtgatatt 27060aaccagtaag atgtgagcag ggatgatacc tgtctcttat aacaaacgga
aaaaaaacca 27120caccaaacca aaaacaaaca aacaaacaaa caaacaaaaa cagggttggt
ctgtccctgt 27180gtcttttccc acataaagtt aagcacacaa agtagccacc atttatttat
ttgtcccctc 27240ccccacccct ccccgagaca atgtttctct gtataacagc cctagctgtc
ttggaactca 27300ttttgtagac caggctggcc tggaactcac agagacacag agattcacct
gcctctgcct 27360cccaaatgca gggattaaaa gcatgagcca cgaactaacc agtaccccag
agctcttgac 27420tctagctgca tacgtatcaa aagatgacct agttggccat cactggaaag
agaggcccat 27480tggacacgca aactgtatat gcctcagtac aggggaacgc cagggccaaa
aaaatgggaa 27540tgggtgggta gggaagtggg ggggagggta tggnnnnnnn nnnnnnnnnn
nnnnnnnnnn 27600nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 27660nnnnnnnnnn nnnggcagga tcctgtgttc atgtgcaaca ctcgatgcaa
gctgtgtagt 27720gtttggttct gagcacctga aggggaccaa gcaggctgat gcccaggcca
cgggttcttt 27780ctggccccac tgcccactcc caccctctgg catccccatg atgaacatgg
ccacagatca 27840cactactctg ctcctctccc agatccacgg agccataggg tccccagatt
catctctgca 27900gctaacaagc tgggcagtgt cacctccctc aaggttcctt tcctgctctg
agcagcagtg 27960tctcccacag tgagacactc atgtccactg gaagatattg tagccattaa
attcctgtgc 28020taaaataact agggggactt gtcaatcact acactcttag ccccggactt
ctgactcata 28080gagggtggtg acagctcagg gacctgcatt ctaccaaata gccatgtgtc
cctgatggag 28140gaactgcccc tggacaacct ctgcagcaac tgaaccctct gtggtctcct
agttcttctg 28200gacaggtgtg accccagtac ctagtgccag gtgagagagt gctagggcca
cactaagggg 28260tgacaggaca aggttggagc tggtagatgt ttgggccacc aaagagaaca
ggtcagtagt 28320aaaagccatc atggcctgag ccagcctgcg agtctcctct gcagttggga
cactcttgca 28380gtgtcctggg gacctcttga gggtagcatg gtcaccaaaa tcctacaagg
acagatcaga 28440agtcagtgag gtcaagggaa cagctctagg ttctctgtgt ccctcacgga
cctttttttt 28500tttttttttt tttttaagat ttatttattt attatatgta agtacactgt
agctgtcttc 28560agacagctcc agaagagggc atcagatttc gttacggatg gttgtgagcc
accatgtggt 28620tgctgggatt tgaactcagg accttcggaa gagcagtcgg tgctcttaac
cactgagcca 28680tctctccagc cccctctcag tcctgatgcg acagggcagc aaaggccttg
tcccagatct 28740gaggagagtc atgctgaagt ccttcctacc ccaccccttc cgaacccctg
aacatcagcc 28800ccataactac tgactccccc acccccattc ccttgcttcc actgatccgg
tcctcctctt 28860ccctctggcc ccacccattc ttccccagcc ccacctgatt gtacctggtt
gtccaacttg 28920aagagggcag gcaggggcag cttctgctgg gcctgctcac tcactggctg
tagaaatgag 28980aaaggagatg aagaaaaggc ccttcccatg ggtccccatc ttgccaagac
ataggtgagt 29040ccctttggct cttcccccta aacctctcac ttttgagtac ctgctggccc
gggagatcca 29100cggcgctcac cggagagaac tgttgagaaa agggagaaca gagaactcag
cgttcctccc 29160tctccaccct tctggcctct cccagatttg cccccgcccc ccagcatctc
cttcagcctg 29220actgaccact tcccactcag acctcagctc tgcctcaccg tgaaacaggg
accttgcagg 29280caggacaagc tgagtacgag gagcccccgg agcagtgcca tgttcctgta
tccagaacag 29340ggagtgttag ttcctacctc acgctcgaag gccaagcagt agactgctat
ccatgggttc 29400cttgaccgca ccaggctgcg gaacctggac tcaaaacata gcagctgtgg
acctcactca 29460ctctgagagg tgggatttcc ataagctttt tttttcacct gtacatttag
tcttcattct 29520tttcgtctta cactgtggat cagtcctggg ttcaaattta aagccctcat
cttgcaagag 29580gaccttgcgc atctcccttc atgcctttgg ctttaccctg tcttggtaat
tcatggcaga 29640agttcttcct gctcccatgt agatgttgag gacccaaata agaatctctg
taaatactga 29700gcatgatgcc tggcccccac cctagcaaag ccacctgacc tgttgttcat
ttcatccagc 29760ctttctcagg ctgccctggt cctacccaaa ggctctgaga gctaatctgg
gctggcaggg 29820cagccagaaa cttctttgtt gaccaatgaa tgactggccc agacaccttt
ggacttacgg 29880gaactacaag cctcatccca cttctgctcc aagttctgat ccagggtgct
tcggggaagc 29940ccagctggcg gaagggggga ggctctcagc ctagagagcc ttcctttcca
tcctcagccc 30000cctacccagg ccttatttca ggcaccagct cttctaaaag gtccttctgt
tatccctaga 30060cctccacaac tgtgttcaag aaccttcagc cagggcctca tctccaatct
ggatatatga 30120tttttctcgc caagagtagg cctccaggtt ttggagttct agaggtttct
cctggagctg 30180cctggacctc tgctcctcac caccccagga cgctgtgaag ctgcaggctc
cctgaataaa 30240ttcatccaga ccccttgcca aggtgccagc tgtctacttc ctctgctgcc
caagcagcag 30300gctgcaccac ccctccatcc tacctcttca ggcttcttag cgcagcacac
gcagcacacg 30360gtgttctcct ggaccagctt gctccccacg ctcccccagt gcagccagca
gggcctagct 30420ctctcctccc acaggacctt tgctctcagc caacccccgt tcagcttgtg
ttcagtgctg 30480gtaaatattg acctgtacat ccggttaaac attgatatgg gggccagaag
accctttccc 30540atcaaggcta cccagaaccc tgcctgagcc tggagaaggg gtttacagga
gcagataagt 30600gaggaggttg ggcctggcaa gccttctaat gatccctcaa cataggggat
tatccacagt 30660cagtgaggct cagagaggct gtgtggcctg tgtaagggcg cagagtgggc
tccagagtca 30720cagccaaagt cccaccacca ccaccaccac caccaccacc accaccacca
ctaccaccac 30780caccaccacc accaccacca ccaccaccac caccaccact accaccacca
ccaccaccac 30840caccaccacc accaccacca ccaccaccac cacctcatct acccatacta
anttgaggct 30900nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 30960nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tctatgaggg
ttacatttta 31020gaacatctct ccttttttcc tttttgagac aatcttacta tatttaggct
ccccttgaac 31080gtgtgatcct tctgcctctg cctcccaagt gctgaggtta caggcatgca
cagtcacatc 31140tgcctacaca atgtcttagc agcccttagc agcaccaggg gtcaggaagc
cctcaactgt 31200ccctttagct ggcttctctt gtgaagggct atgtcttctt ccccttccta
gcatggagac 31260ggctcttagc cccagagcct tccttccttc aggttaaaca gcaccagttt
ttggtgggac 31320ctcccatttc cctctatctc cctaagcaac gaccttttct gctctgactc
tcatctggca 31380cttggaccac aagacaaaac tgcagcctgg gctgtgtgtc ctcgcacatc
attcctgtgc 31440ccccctggag tcaggtctag gggaggaaga cagggttcac gactcagaaa
agaccactgg 31500ctgtcctagt gtgccctcac ccatcctata gcacgcacat gctgatgtgc
cccctccgct 31560ccatcaccat cctctcatgt acacgtgccc tccctcgcca gacacatgca
tcactaactt 31620ttctgacttc ccagaaaaat atctgatctg agaagttagg agtctgccat
catcagctat 31680ggtccttaaa attaagtcag acaatccatg ggacatgaag ggcaacaacg
agaagactcc 31740tcgttccttg ttcactctgc ttttggcagc accaccagca ggaaccaacc
tggctctccc 31800taatccctca tctatagcag gtctcccggt gggaatttta gggacctctg
tgttctcatc 31860caggggcact gccactcagc tgctcaggga gagacccctt agaacaacaa
agaaatcaat 31920gcagatttag gcttcttgtt tcccttccca gcccctccca tcacaggcaa
cagcctccct 31980tggctgagcc tcaggaggct gatttatcag agaggtgctc agagaggcac
ctctggtccc 32040tctgggtagg tagcaactga gacaggagga gatggtcacc ctgggcatcc
tctaccagga 32100agtaaatgag atacccttgc agatgggacc cctgaagttc ctcccggggg
cgggggtggt 32160ggtggtggga gtctaagtca cagatctttg ttaccacgtg gttagactga
ggactgaatc 32220tgaggtggga aatctgatgt gcatggggaa acacagaggt ccaatgctgg
ccaagagcta 32280caagcaggga caggtgctag ggggatgtct gaatgttcca ccccaagcca
caggaataac 32340ggaaatggag actctaaagg gcagaaagtg agggtgtgca gcaggggctg
cacaggacac 32400atgcaaggcc ctggctgcaa taactgggtt ggggaggcag tcattggcta
gccaggggca 32460ccaggacagt gatgccatcc tgtccaaagg gcagtgtcca agccagattt
ctaggctcca 32520gggggaggag ggtccgggga gaggggtcaa gattctcccc ctctgagtca
aggttggcct 32580tcccatgtgc cccaaatcag gaggcacaga aactgggatg ttgtggtctc
acatccaagc 32640tgagaagaca agtgggagcc agtacatgtg tttcagatta aacccagtcg
gagacaaaca 32700tgttgctcct cctcctccca gagccaagct gccttcaagc cacatggcag
tgaatatgcg 32760gacagtgcag gggaggacac ctctctctcc actggctcaa ggacagtttc
aaggggttca 32820ggctggctgg ctcatggcta cgccgctcac cccctggaca gtttggggtt
tttccctcct 32880gaaatcttgg aatctgaatc agcctgagat accccataat tgtacctccc
aacaccccca 32940gaaaggtcag ccctgcagaa cagaactctt tggtccccac ccatccccct
cagccctgga 33000ggctgaactg atgggcagct aaggtccaga cagtggctgg ctcttggaaa
gcctgtctct 33060ttcctttgac tcagaccact ccctgccgtg gcttacatca ggaggtgcaa
gggctgcagg 33120agggcagcca gaccccacaa accagctagg ctaaatggtg cttattgttc
gcaagaggcc 33180atgacctcat ttgtctccca gctcttttgg taagagagaa tgagaggaag
ctggacagag 33240aacctagcag gcctcaggca gcccactgct ccttgctgta agggaaccag
caccgatggt 33300tctgaaaagc agcgatccga atggagtcag gctgagctgc aggaagctca
ccttccttgc 33360tcactgctgg tggaagcaac ttcaggaaga gcccagccta tgggactata
gctcctccgg 33420ggtactgctg agtccagccc cagagcttag ctccctgctt cccaccaccc
accaccacat 33480cctttcccaa caccattcaa aaccccagtc cagcctctcc tactggtcta
cagtgagcgg 33540ctaatagagt cctgggcctc tgtcccccca attctctctc ccctctcatc
tgttcacctt 33600ggttcctaaa ctgcaggggc tactataacc ctacctccac ttccttgcac
ccctcttttc 33660tgctctctgg ggtgcccctg ccactcccag tccctctagc cagggagcct
cttccatatc 33720tgtcttcccc aggctagacc aggcgctgcc ttacctgtgg ttgcggcagc
ttctctcaca 33780gcctgcactc tgaggggctc caggaagcag tgaggggagt agctgcctct
caaccagcgt 33840ccagcaggct tcagattaca gctactcttt tcttaaagtg acctgactcc
atttggaatc 33900tgtgattgca tcattgtctg gtgttaactt taacccactg ctgcccttcc
gccatgtggc 33960tccaagacca cacgttggcc accctcctct cccaccacat ctcccttgga
tctttatctc 34020tcttcattgg gaccttcatt gggacatgat ggctaacttc aggggcactt
gggccagcct 34080ggggtaggtc atgagtctga acttgaacat ctgaaaggat tggctgagag
gcaggctgca 34140tggagagact gtgagccagc cggtatggag atgctgggtt cttccaggcg
cttggctctg 34200gctcactgca ggtgggagca aggtgattct tctcccctcc tcacctggaa
aatgaaggaa 34260tgggactgta cctgacagct ctgaaggttc caaaggacag tggggtgggg
actagagagt 34320tggcccagtg cttatgagca ctggctgctc tcgcagagga cctgagttct
gttcccagct 34380cacatagcaa ggactcgaaa ctgcttggaa ctccagctcc agagaatctg
acgctatctg 34440ctnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 34500nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtcagg
catggtggtg 34560catgccttta atcctagaac tcaggaggca aagcagatgg acatctgaat
ttgaggcccg 34620cattgtctac atagcaagtt ctaggctagc caggtacatc ataagaacct
ttctcaaaaa 34680ataaataggg ccagttggca aaatttagct tgccctccta acacaagaac
ccaaagtcaa 34740ccccagcaac catgtaaaaa gaagcaaggt gtggtggcac ttgcttgtga
tccagcattg 34800tcaaggtgga gacagacgga tccatggggc tcactggcca gccagctagc
tggtctactt 34860agtatgctcc cagccagtga gagactgaaa aataaataaa taaataaggg
gttaggaaga 34920ggtaacatgg tggctcagtg agaaaagata cttgtcatac aagcctagca
accctcaatt 34980ttcagtggcc actaaaggtg gaaggagaga accaactcca aagaattgtc
tcctgacagt 35040tttatgctgt ggtacacaca cacgcacaca cacacttgtt acatgcatat
gcatacaatt 35100aataatttaa aatgttatgt gtatgggtgt tttgcctaca tgcatatctt
tctgtgcacc 35160acatgtgtgc aatgcctgtg aaggctagaa gaggacatca gatcccctcg
gagttacaca 35220gggttgttag ctaccatgtg gattctggga acaaaaccat gggttttcca
aaagaggctc 35280ttaatcactg agccatctct ccatcccctc aatagtatat atttctgggg
ctggagagat 35340ggctcagagg ttaagagtat tgactgctct tccagaggtt ctgagttcaa
ttcccagcaa 35400ccacatggtg gctcacaacc atctgtaatg gaatctgatg ccctcttctg
ctgtgtctga 35460agacagctgc agtgtactca tatacataaa gtaaataaat aaaccttttt
tttttttgtt 35520ttgttttttt tgtttttcga gacagggaga cagggtttct ctgtatagcc
ctggctgtcc 35580tgaaactcgc tctgtagacc aggctggcct tgaactcaga aatccgcctg
cctctgcctc 35640ccaagtgctg ggattcaggg tttgcgccac cgccaccacc tccaaggctg
ctgctgcggc 35700caccaccacc ccaccaccac actacctgac tatttaactt ttaaaggcag
ccatctcatg 35760gaaaatgaca cctagcattg tcctctggtc cctacatgac cccatgtgca
aacacatacc 35820tgcataaaca cacataaata cataagtaaa cttagtctgg ttgttttgga
aatgtgctat 35880ggtttggatt gtgtcccccc aagggaaaaa ttgggtccca gagtagtact
attggaagag 35940agtagaactg ttagggttta ggcctggtgg gaagtggcca tggctagaga
gacatgccaa 36000ccaaggggaa tctctggctt catcttttcc ctttgctttc aggtcctaag
acagtcacaa 36060ggctgcttca ccacatgccc agaagcaagg ggagcagtca tggctgggac
cgctaatgca 36120gctgttgatt tccccagata tttgtagtag taagagacag gtgaggaacc
ccacagcaag 36180tgttagtaat tgtgtgtgga ggtgccctcc ggggacgggg gccctcctgg
ggcaggacgt 36240tcctcttcct catccacctg cactccgaga acaggaaatg gtgactttgg
cagagcttaa 36300gcagagcccg ttcatgttac aagtatgtaa attcataagg accagtttct
ctccatatga 36360aacagcttca aacaggagaa ggaagaagca aacattaagg aaaagctctt
ttattgcaga 36420ggctacactg aagctaccgg ccgccttcct ggaatgtata atcagcttcc
ctctgggggt 36480tctgtagagc actgagacat taagtactac tggggtccag gattctgcct
atgaagagga 36540gggcccccgt gtccgtgtcc ctcagaacaa agaggaaagg ttggttaagg
tgatagtcta 36600gcgggaaggt gaggcggacg ggctggaggc ctgggctggg gctgcttcct
gccccctctt 36660cattccactc gaaagcagcc ctgtgttcca cttgggtgag cttcacgggt
ttgccagtaa 36720tcttgctgaa gtcgggtgat tcaaacaacg actgtagctc tgtggagatt
cagagattcc 36780attaacacca cacacacaca cacacacaca cacacacaca cactccctgt
ttgtgtaggc 36840tgattttcaa gaaagcaagc tagaagtgga gtacctcaca gtgacttgtg
agctatgagg 36900cactctgtga caggctcagt gacctacctg agaacttata gccaagatgg
ctgaagccag 36960acctggcctg agagaatgtt ttgggctgtt ataggacaca tagagataca
cacacacaca 37020acacacacac acaccaagga ctgagtctaa tgggaggtgg ttcttcattc
ccctcccctg 37080taatggtgtc acatgttccc tgagccaccc tacaaagaaa gccacaggac
tcagttctgt 37140cagcaaggtg gcaggctcca agactcagcc ccgagcgcaa agtggccttg
caaacatact 37200catgtcctgc agagacttgg taagttcgcc ttcgaagctc agcttcagct
tggggacagt 37260cagcacagct tggatagtct tcagttctcg gtcgatgtca tgaatgaact
cagaggtgag 37320gctctcttct atcatggtca agttctgggt cacggtcagg ggcaggaaga
agatgatgct 37380catacttcct gtcaagggca gctgggcaat ctaacccaac agagatgcgc
acaggttagt 37440tgtgagccag aaaaaacaaa acaaacaaac aaaaaaacac caacagctgc
cttcccctct 37500gctgtaacgg ggccccagcc ttgtgctccc cagcctcagc ctgggctgta
ggctactggt 37560tactggcagt ccttccatga gtagggagtt ttcttctcag cctaaaaccc
acagaagttt 37620aatgaacaca cgtttgtttg tggttccgct acggtttcta ttgtgataaa
acatgactga 37680aagcaacttg gagaggaaag ggtttatttc atctgacaat tcgcagggtg
tcttctcatc 37740actaagggga ctcagggcag gaactgaagc ggaagccgtg gaggaacgct
gctttctggc 37800ttgctccccg tggcttctta gcctgctttt ttatgctatc cagaaccact
tgcccaggag 37860tgacactgcc cattgtgggc tgggcccccc cacatcaatc actaatctag
aaaatgaccc 37920acgggtttgc ccagaggcca gtctggtggg ggcattttat caattgagtt
tcacccttcc 37980aaatgactct aacttgtgtc aagttgacca cacgaatcag ggcctggttc
ttaggagctg 38040aagtggaatg tcccccagag actgcctgcc agcactgctg accatttgct
ttgtatagag 38100cattgaacca gaaatgaaca ataaaatgga tcctttgaac agatgtgttg
atcctagggc 38160ctgtggacac agcgactggg cttcccagag cccccatgga atcannnnnn
nnnnnnnnnn 38220nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 38280nnnnnnnnnn nnnnnnnnnn nnnntggcct catcaggtat cagagagaga
gagagagaga 38340gagagagaga gagagagaga ggataaaagg ttagcccagt ggtggtggca
cataccttta 38400attccaacac ttgagaggca gaggcagggg gagctctgtg ggccagtttg
gtttacagag 38460taagtttcag aatagccagg gctacacaga gaaaccctgt cttgaagaga
aacacacaca 38520cacacacaca cacacacaca cacacacaaa taagatcttt aagaagaaaa
gaaaggatag 38580tggggaaaca tctgagcaga ggaagaaatg gggtgcgcag gacacccacc
ctcagaggag 38640gccctcactg gaggtgtctg cacaggagaa cacttgcact cagcttgccc
tagggcgtca 38700gaactcagaa ttcagtttca aagcactgac aggagcagtg actggggacc
ccaggttgaa 38760tccccccttt atctaaaatg agtaagaacc aaaaaaacaa aagtgtttgg
gatttggaat 38820ctgggttatt tgcatctaca aaagaggtct tggggaggaa acccagttct
acccctggaa 38880ttcatcagtt tcctataccg ctgactacac aggggctgaa ggtaatctca
atgttttcat 38940aactggtgtg gtgctacttg ctcatgatcc caacacttgg aaggtaggtc
agaagttcaa 39000gagcagtctt gactactcag tgaatttgag gctagcctgg gctacatgaa
aactcataaa 39060acaataaaag aaaagaaaag gtggcgagtg aggtggccca tcaggtaaaa
gtgccaacca 39120cctcgcctga aagcctccac acggaagggg aaagccagct tctacatgtg
gtcctctgcc 39180ctccgcatgc accatggctc gtgcaccccc acacccaccc acccacccac
ccacatgaca 39240taaatacttg taatgattag tttctgaaga acaatatttt cgttgatctt
gtttagggaa 39300caaagttcgt gcacattgac ctgtcgaccg tgtagtacgg gatccgctcc
aggaagctaa 39360agattttggg atgcttcatg ttgcagctac tctgatgaac agtgttgctt
tctgggccag 39420gtaatggtgg catatacctt tgatcccagc acttgggagg cagaagcatg
tagatctctg 39480tgagttcgag atcagcctgg tctacagagt gagttccagg atatccaagg
ctatacagaa 39540aaacccctgt ctctaaaaat cactaattta aaaaaaaatt cctttctaaa
cctatataac 39600aaatgttttg taggctgcct taacaaagcc caatggccat tcagagaagg
ctcaaaagag 39660aaagtttagg ggactctaca agcatcctca ggaaggccac agaaagcaga
gcctgggcca 39720gtgagacttt gcagtgggca aggttcagct ctttatgtag gaagaagaga
gtcaacagtc 39780agagtccagc tttccataaa acctgtgcag ggcctctagg caaagccctg
tgttaggggc 39840aaaggcattt gcagtctaag cccggtgaca tgagctcaat ccttggaacc
caggtggaag 39900gagtgtgctg actccacaaa tttgtcctct gatctataca tgtatgcacg
tgcacgcaca 39960ctcacataca tgtccacatg cacacatgca tatacatgcg catgcgtgca
cccacacaca 40020gggtcaaaag cagcaagaga tgccctgtga aaaacgtctc attcagtctc
ccatcatcca 40080gtgccacact ctgagcacag gtggtactga tatcgttcct gattgatcga
tcagttgatt 40140tgagaccccg cctcactatg tagcccaggc tggcctggaa ctcacagtga
tcctcttgct 40200tctgcaagat gagcccatca tgcccaccat gttattgaag caataccatg
ctctataaag 40260caaacctagg caggcaggat ggtggactcc tgtaatctca ggacttgaaa
agtagaaggg 40320agatgaggag ttcacatcaa tctcccgtat gcgttggagg ctggagtggc
tgttccctgg 40380gcgcttctgc cagcacctga ccaatgcaga tgcagatgct cacagccaac
catcagactg 40440agctcgggac cccagtgagg gtgctggggg gaggactgga ggagctgaga
cgggattgca 40500agcccatagg aagaacaatg tcagctggcc aaaccaccca gagctcccag
ggactagacc 40560acgaaccgag gactgcacat gaagggatcc atggctccag atgcatatgc
agcagaggac 40620agccttgtct gacagcatgg gaggggaggc cattggtcct gtggaggttt
gatgccccag 40680tgttggagga tgctggagcg gtggggcagg agtgggtggg taggtgggga
gcaccttcat 40740agaggcaaaa gggatggggg agaaggcaga tgggatgggg gggttgtgga
ggggtaagaa 40800agaaaaaaga tgtctctgaa agtaaaaagt acttgtcact aagcatgagg
atatgagtca 40860acccccaagc cccacagggt ggaaggagag aaatgagtcc cacaagttat
tttctgatct 40920atacgtgcaa tccatggcat acgcagaaac gcaaagacag acaatgagtt
gggtgtggtg 40980gtgcacatgt aattccatca ttcaggagac agaagcagca gagttgttgg
aaatctaagg 41040ccaacctaaa gacctacacc caaagaagga caaactataa ggaaaaaggt
ggtcgaccaa 41100tgtaacatta aagttagaaa tctctcttca cactgtgtag atactgtaca
aggaagagaa 41160aaggcagcca catcaaaaca gtgtaaatca acgagaaaac cagaaacaac
tcaagagaag 41220gctgcagggg cctgaattct gttctcagaa cctgcatcaa gccaagagaa
tcaaaactgt 41280ctgtaactcc agctccctgg gatccaacac ccatttctgg cctccatcag
catcactcac 41340aggtgtgcac acatacacat caataaaaat caaaaccagg gatgaagggg
tagggaggtg 41400catgtggatc tgggaggagc tgagaggcac tgggtgaata caataaaaaa
tttggtgcat 41460ggtggtgcac gcctttaatc ccagcacttg ggaggcagag gcaggcgaat
ttctgagttc 41520aagaccagcc tggtctacag agttagttcc aggacagcca ggtctacaca
gagaaaccct 41580gtctcaaaaa aacaaaacag ccgggcggtg gtggcacacg cctttaatcc
cagcacttgg 41640gaggcagagg caggtggatt tctgagttcg aggccagcct ggtctacaaa
gtgagttcca 41700ggacagccag ggctacacag agaaaccctg tcttgaaata aataagcatt
tgttgctgtt 41760acagaaaact ccagcccagt ttccagcaca cacagggtga ctcacaacat
cataactcca 41820cttccagggg atccaatgcc ttcttctgac ctctgtgggc accaggattg
catacagtgc 41880acagacatgc acataggcaa aacactcaca aaataaaata aatctagcaa
aaaaaatttt 41940aactaataat ttaaagaaaa aaataaggaa gccgggggtg gtgtcgcacg
cctttaatcc 42000tagcacttgg gagacagagg caggcggatt tctgagttcg aggccagcct
ggtctacaaa 42060agtgagttcc aggacagcca gggctacaca gagaaaccct gtcttgaaat
aaataaataa 42120ataaaaaata aggccaagta attcttggaa gaatcccaag gggacactaa
gtgtatataa 42180aggcgttcca tagggctagg aatgaggctt agcgagagca acttcgctgg
tgtatgaaag 42240tccctcagct gcatgtggta cctttaatct aggctctccc gaagcagagg
cagaaggatt 42300tctgtgagtt caaggccagc ctggtgtaca tagctagttc caggacagaa
agggcgatat 42360aatagaaaca tcntacctag agcccngcca aanaaagggg agacctgaga
ccagagagat 42420gactcagtgg ctaagagcat tgactgctct tccagaagtc ntgagttcaa
ttcccagcta 42480aaaatttatt taaatgttta ttacttgtat tattatttaa atttaaataa
ataagtaaat 42540gggagcctag gtttgagtcc ccaaatcacc aagaaaaaat gttatcattg
ctaataatca 42600aattaagagc ataagaactt ctttttaaag aattcttatt tattttatgt
atgtaagaac 42660actgtagctg tcttcagaca caccagaaga gggcattgga tcccattaca
gatggttgtg 42720agccaccatg tagttgctgg gaattgacct caggacctct agaagagcag
tctgtgctct 42780taagtactga gccatctcta cagctcttat caggttgata aaatttaatc
tcgtggagcg 42840ctgagaccaa gaactaaagc tgggagattg aaaaatgcag accaccaagg
ccctgctcat 42900ttctccagtt ctgatcagct cccgtaccag gggtctaacc aggcctgtgt
ctgcttccct 42960gagtagacca gaggccccat ctaaacagcc tgcctgcagc agctcctctc
tctaggtgga 43020cagatgggaa tttcagacca atgtcatttc ccaggacatc aacacagcag
ccaaatttat 43080tggtgctgtg gctgccacag ttggtgtggc aggatcaggg gctggcattg
gcacagtgct 43140tgattattgg ctatgccagg aaccagtctc tcaagcagca gctcttctcc
tatgccatgc 43200tggggtttgc cctgtctgag gccatgggac tcttctgttt gatggtcgcc
ttcctcatcc 43260tcttcgccat gtgaggctcc ctggggtcac ccagccgtcc ctgctgcctt
gactccatgc 43320cagtcctggt gctggagtct actgagattt accattaaac agcaacgttt
ctctaaaata 43380ctattaatta attaattaat cacgtgacaa ccccagcgtc catatgggtg
tggaaaatga 43440ggaactctac ccatcataca tggcgactat gaagaacaat gtgacagaaa
atgctaacat 43500catgtgtgac cgcatgcatc agccctgact gctaaaagtg gacaagcccg
aagcgaaagc 43560ccaatgttct acttctaaat gcatgcacca aacgcctccc acaggaccag
aggtgcagct 43620ctgatagggt ccttgcctgg catgcatgaa gctgtggaca cgaggcatta
ttcgcaagaa 43680cattctagct gtctggaggg ccctcaatcc actgtgttcg cgctgttcca
gcaccagtgc 43740ctcctggggc tgcacctgaa aaaggggact gcttaagagg gctcctacca
agcctactgc 43800cacagatgca tgatgggaaa gccttctgga agcaactggc tgccaaaggc
tctggacaag 43860agatcaccct ctactggaaa ggtggtttca gtctaggttc tgtgggattc
caggaaatta 43920gacaacactg gcagtccaac agacagacga tctaaacttc caaggcacag
ctggtagaac 43980ttgctgcgga accagacaac aaggtacgag ctactcccat acaacataca
aaaaagcaga 44040gagagagtca gagacagaga cacacagaga gagacagaga gagagtctaa
agagagtcag 44100ggtctcagga ctgagggtat agtctactgt agagcatttc cccagcatat
acaagaccct 44160ggattcaatc tgaacacagg aaaaaagggg gggggggact tcgattatct
caaattctcc 44220tttttgtgac acacccctaa agtcactgcc tacttccctc accgccatga
agtaaagagc 44280tgtttgcgct tatgtctaca cagtctcggc tcccacttcc tcctcccctc
tgcttctgtg 44340ctcatctcct ctgaaaccac tgcagcaagt gacttgtgtt gactgccaca
cggaaactct 44400cctcagtagc aggcagcaga gcagagctct gtcttctcgg agcttcttct
ctcttgtcgc 44460cattttctcc cacccttaag taccctatct tctctgtctc tgcttgttga
tccttggacc 44520cttttccttt ctatgaacaa aatatctcct taaaggatct cttctagttc
agggtccccc 44580cgcccccact gtggagaaaa cccagggcct tgcacatgct cagcaggagc
tccatccagt 44640ctctagctcc atgacttaaa gcatctctgt gctgtcaaat atacacttcc
agcccttacc 44700aaaatattca gtcaactcct tgccattcaa aatggatgac ctcaaagcca
gagtcagcgg 44760tgctatgact cccagatcca tccacttggt agcccaggaa tgaactcann
nnnnnnnnnn 44820nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 44880nnnnnnnnnn nnnnnnnnnn nnnnnnnngg ttactgggat ttgaactcag
gacctttgga 44940agagcagcca gtgttcttaa ctgctgagcc atctctccag ccaccaccac
caccaccacc 45000accaccacca ccaccaccac caccccacca ccaccccgcc ccacctgctc
attcctgatt 45060ggttggttag tttagtctgt gagacaggag ctgtcccttt tctatagtgg
aaggtgaata 45120agaaactcct gaaagtgaag gcctacaaaa cagccacact tatttgttgg
aaaatactgt 45180aaatgtgaca tgtaaataca tgctaaaata attcgttaag tcagtgaaca
accttaaaac 45240acagtctgta gcctgaatta cagacacgac acgagccatg acagaggctg
aaataagacg 45300cctttgcaag gagaagggca gaagcttcca tccttgctag caaatctttg
ttccaagctt 45360tatcagattt tattgctttc tctttctgtt tttctttatc atatttgttt
atttgttggg 45420ggaaagccta atcttcatag cccatgtatg tgagcacttc agcatatgtg
tgtgaacacc 45480aacagcacac gtgtatgagc accacagcat aggtgtgtga gtaccacagc
acatgtgtgt 45540gagtacgata gcacgtgtat agagttcaga ggagaactga gagagtccgt
cttttcctcc 45600tactgtgtag gtctcagggg tggaacttgg gctcagcctt ggtggcaagc
tcctttatcc 45660acagagtcat cctgccagcc cagctttctc tttttctctc tgttatgtct
atccactctg 45720ttcaaggcta actcactgac tctgagttat cagaactgct tgtgagagca
ggagtaactt 45780tggacatctg tgctggtagg aacaccatcc ccactcggct tggatgacga
aggggaaaaa 45840aagcatcacc aaggagttcc accacctcaa ccagcaaata tttacctcct
atacatggat 45900aggtggggtg ggtgagcctt gtgatttatc gttaggatct catgggagtg
attacagctg 45960gtctactcca tgaccaaaat ggtgacggtg gctgaccaaa aagaaacagc
tacacctggc 46020tctagttttc tttctttctt ttttcttttt cttttacccc acggtactaa
ggattgaacc 46080caggaatgca agagctctgc caagtgagct acattcccag atctgttttt
ccatttcttt 46140ctttcctttt agattttatt tcgatttatt tgtctatgtt tgtgtgtatg
tgtgtatgct 46200tatgtgtacg tatcagtatg ccatgggtat acagaaacct gagaaggcca
gaagagagtg 46260tctgggttac aggagtttcg agctgtcctg tgggttctca aatgcagcag
caccaccacc 46320aatcaccccc acccccaccc agccttcgag ttcaattctc atcatcacaa
aaacacacac 46380acagacaagg gcctgcaaga tggctcagca ggtaacgaag ctcgtgtcat
aagcctgaga 46440acctgagttc actgtctgga acccgcgtaa agggggaagg gaagaatcaa
ctctatgatg 46500ttgtcctctg ctctccccat gtgtgccatg gaatgaacag ccctcccaaa
cacacatcaa 46560gaataaataa aactaaaatt agcttagtaa cttttatgtt gaaagtggtt
tttacatgcg 46620tgggcaacaa taacaccgag agtagaaagg caagcatgta tgtcactgaa
cagcattgaa 46680gaaaaaacaa acacatttcc tgtacatcgt tctgggagtc tgagttaggg
tttctatgct 46740gggataaaaa caccctgaca aaaattaacc tggggaggaa gctgtttatt
tcagctttta 46800tgtctacaac atgacctgtc acccagggaa gtcagggcag aaattcaaac
aggtcagaag 46860cctggaggta gaagctgata cagaagccat ggagctgctt ctggcttgct
ctagcacaca 46920ggagtactag cctaggggct gtactgccca cagtgggcta ggctctccca
cagcaatcat 46980aaatttagaa aatgcactac aggtttgcac acaggccaat ctggtagggc
cattttctca 47040attgaggttc cttcttccaa aaggacttta gcttgcatta tgttgacata
aaaactagcc 47100agcatattgg gattatagat attctcataa aaaaaagaca tttagattcc
cacataacac 47160catattcaga aattaactca atgtgaacca gaagctctga aagtaagagt
taaaactatg 47220aaaaattctt acaaccatcc ataacaaaaa tctgatgccc tcttctggag
tgtctgaaga 47280cagctacagt gtacacacat ataaataaat aaataaatat ttaaaaaaat
atatgaaaaa 47340tcaggctggt gagatggctc agtgggtaag agcacccgac tgctctttcg
aaagtccaga 47400gttcaaatcc cagcaaccac atggtggctc agaaccatcc gtaataagat
ctgactccct 47460cttctggagt gtctgaagac agctacagtg tacttacata taataaataa
ataaatctta 47520aaaaaaaaaa aaactatgaa gaactatgaa ctacaagaag tcaggaatag
ggctgggggt 47580gtaacccaac agaaaaacac ttgcctggcc tgcgtttggt ctctagcacc
accaacgtag 47640aaagagaaca gcagaggatg agggcatcct gacttgagtc aagtgacaag
tgataatcct 47700cgagacacca aaatcacaat gataaaagag atcaacaagt tgggctttat
ctgaataaag 47760agctgtgtcg ttaaatacca cgcaggaagt gaagaggagc tgagtctggt
aacacaggcc 47820tgaaatccaa gctactgggt ggactgaggg aggacaacag ctagctcaag
gcccacctgg 47880acgccagagt taactcagag agcagcttgg gtagctttaa tgagactctg
cccaggccag 47940tgcacaggag agatggctca gtggttaaga gcattactcc tcttgcaaag
gacctgagtt 48000caattcccag cacccacgtg ggcacttaca atcatccata actttagttt
caggggatcc 48060aatgcccttt tcacagtacc aggcatgtac acagtgcaat tacatacata
catgcatgca 48120tgcatacata cacaggcaaa acttacataa aatactaagc agataaatct
taaaagaagc 48180cgggcgtggt ggcgcatgct tttaatccca gcacttggga ggcagaggca
ggtgtatttc 48240tgagttcgag gtcagcatgg tctacagagt gagttccagg acagccagga
ctacacagag 48300aaaccctgtc ttgaagaaaa taaaaaaaaa aaagaaaaaa atcttaaaag
aaaaggagag 48360gactggagag atggctccac agttaagaac acttgttctg aggtctacag
agtgagttcc 48420aggacagcca ggactataca gagaaaccct gtttcgaaaa accaaaacca
aaacaacaac 48480aacaacaaca acaaaaccac ttgttcttac agaggacttt ggtttgattc
tcagaatcca 48540catgatggtt cacaaccatc agttgcaggg atccaaggtc ctgtcttctg
tgggcaccag 48600gcatatatgt ggtgtacata catgtataca ctcatataca taaaataaaa
agttttaaaa 48660aggaggctgg gtttgtagcg cagaggtaga ggtaaaaaga ctctagcttg
tttaatgttg 48720acatgaaaaa aaaaagacat ttagattcct gcatcacacc atatccaaaa
attaactcaa 48780tgtgaatcat aagctctgaa agtaagaata agcctagtat gcactgtaag
gctctgggtt 48840cactccccag cactgcaaaa gatcatgaaa ccagaaatgc agatcctctg
aaccacagca 48900tgggaatgta actcagccga tgcagtgctc acctgtcgta tacagagcac
aggataaatt 48960gattgtggtg gtgcatacct ataagctcac tacgtggaaa gtagaggcag
gacgaccaaa 49020ggttcagtga catccttggt cacatagaga atttgaggcc agtctggtct
gctggtctat 49080ttggaatgct gtctcaataa ataaaagaaa gaaagaaaaa gaaaagaaga
agtcctatga 49140ttgtcttaac ctctgacctc tgtgttcatc aagtctcctc ctcaggaact
cactggtcat 49200cttgtgaaaa cctaccccag agtctctgtt cagaggaccc aggctccagc
tgtggttacc 49260acataggatt tttatactag aaaaataaaa tgaataagta tgtatttttt
aaaaaggtgc 49320agagctggat atggtggtgt ctagttatag catccagaac tgagacagga
tagccatgag 49380gttgagaaca gctagactat acggtctcaa caaacaaaag taagggatct
gagtagatga 49440ggttttaatt tttttctttg tgtttgttac ctaacgtgta tggttgtttt
gaatacatgc 49500atgtctgtgt atcacttgtg tgcctgaaac ccaaggaagc cagaggaggg
catcgggtcc 49560cccggaagta ttattacaga aggttgtgag cagccatgtg ggtgctggga
atcaaatctg 49620aaagagccac ctcgggctgg agagatggct cagtggttaa gagcactcaa
tggctgctct 49680tccagaggtt tggagatcaa atcccagcaa ctacatggtg gctcacaacc
atatgtaatg 49740ggatccgatg ccctcttctg gtgtgtctga agacagctaa agtgtactca
aataaataga 49800tcaaaaaaga aaaaagaaac agccacctct ccactctccc tttttaaaat
cctcttgcct 49860ctgtccctta atgttaataa cacaggtata tgatactatg ccttgtttat
gaatagaaaa 49920tacacgtgct aaagcaagtg tgaaccttaa atacattatg ctgagtaaaa
ggagtgagtt 49980gcacacaaga cttttctgct caagagtatc tgtatgaagt attgaacatg
tgaactctga 50040aatcgggagc tgaggaagat atggggagtt ctaatggcta caacatttct
ttttggaatg 50100atgaggatgt tctagaactc aaaaatggtg ataactcagc atatatacta
aaactcattg 50160aattgtacac tttaaatgaa tgcaataaaa cttgtctcag taatgtggtt
tagaagatgt 50220acagacatgt gtgtgtgtgt gttaaaacat ttcttggcat ggcaataaaa
atacagtttt 50280agccaggtgg ttgtggctca aaaaataatg ataataacaa taataaaaat
aatgaaaaca 50340gaggctggag agatggctca gcggttaaga acactgactg ctcttccaga
ggtcctgagt 50400tcagttccca gtaaccacat ggtggttcac agacatctgt aatgggatct
gatgccctct 50460tctgatgtgt gtctggaaac agctacagtg aaagtcattg caaggacttt
acaatagtga 50520ccatgataac attgaagcta gacttgctac tactgctgag tgtgtctgct
ggctctttct 50580aaggagtaat gttagctttt tgtcctaaat ttgtttcctt cctttcctct
ctccctctgc 50640tgttttttct tacccctctt ttactttgct ttcccctctc atctcctctc
ttaacagagt 50700tgtcctatgc agcccaaatg ccatcttcct gcctcagcct ccccagtgtt
gaaaaatact 50760ctttccacag gttatgttag gagactggag tctgctcagt cggggaggga
gcctgggtca 50820agttctgagc tcaattcctt ttctttcttt ctttctctct ttctttcttt
ctttctctct 50880ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt
taagacaggg 50940tttctttgta taccctggct gtcctggaac tcactttgta gctggcctgg
aattcagaaa 51000tctgcctgcc tctgcctccc aagtgctggg attaaaggtg tgcaccacca
ctgcccagcc 51060ctgggctcaa ttcttaacat tgtggagaga aaagtattgt agctgttctg
gccacctgga 51120attactttgt ttctgatctt ttgctgcagt caaatccttc tcatccatct
ttcctcgtca 51180ggctataata tagactctcc ttgcaatact tggaaatgct ctacagtcag
ctacatcctc 51240agtcctgctc ctatattttt tcctaagctt ccttctaagg tctttattgg
tttatgattt 51300acacagaaca tttttttttc ttgtctatag catgcgttag agtgatcgtt
gccagataga 51360ggaaagagaa atgagagaan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 51420nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnt 51480cagctactga ttcctcctcc tccctcctcc ttcctccctc ctccccagcc
tcatgctctg 51540ctcatcttgg acttctgcgc atgtcctcag cccagacctt ctgctcttgc
ttctcctctc 51600cccagcagcc ccccagttct cttcctgaaa cttctgaggt actctccatc
acctcctttg 51660gctcctgctc tgattggtgt cacctgctgg ataggcttgc tcctgactcc
actgttcgtg 51720tctcaattag ggaccctcac cctctgatat accacacatt tccctagtgt
ctccacctcc 51780cacccccacc ctatacgcac atacacactt agctgcatca ggatcctaca
ccagggactt 51840cttacccttc taatcctccc caccggacac tgcccaggga cactggggct
ccagagggct 51900attgccacac ggacacacag gagatctcat caaggagatg tgcctacccc
agagggtagc 51960tctcaccatt cacaagcaca ccacttctgc ctccagcttc tactctctcg
caggaagtag 52020ccagcccggt gccaagtatc cccaactaca tccccaaaat tctcagacac
tgccagcctc 52080cagctgtcag cctggccccg gctggcgggc gcctgctcct ggcatagcga
ctagggtgta 52140attagaaacc cgctagctcc ctaattgcca gttctgagct gtccttgtta
ccggctgccc 52200gaggcacaca tagaggaaaa ggctgagagc tgagccaggc tggcatggag
gtagccctag 52260tagacctaga gaggactggc atgtggccag ggaccaaacg tggcacagag
agggctcagt 52320gcaatctgcc ccgtgggtgc ctcccagcca catccatttg cccagaactg
tgacgtcaaa 52380ccagcccggc ccattcattc tttattcagg tggcataaaa atcactacaa
aaactttaca 52440aaagagtctt gggagctaaa gggtcccttc cttgcctcag tccccaagat
tcctggcagg 52500ggaggacaag agagagaaga aggaggaaga ctcctggcag tgttggcatc
tccaaatacc 52560agaggggtga cttgggtgac aggacacagg ttggggacct gaatgtcttc
agcaagggac 52620actcttgtag ggtaggtcag cctccaacca tgaagtataa caccaaggcc
agtctaagct 52680tgggagacca acacttgtct ctccttttcc cacccagggt gtctggaata
tgtctaaaga 52740tggcctctcc agcctctgct tacaaatgtg gagggaccct aagttaggga
cttgcctaac 52800ctacctctag ccaaaactgt gtccacaagt gccagcccac aaaagatcac
cccctgagcc 52860ccttgggaag aaatgaagat tccccatgcc tgccttcctc caggccccac
cccacctgct 52920gcaagagaac agcttctaca ctggtgatgg tccttccggt cccaccctat
cccacaaagc 52980tggttagaaa gagtcacagg agctgagagg ctgatccagg tggggactca
ggatgctgct 53040gcccagggcc cctcctcact tgggggagct gaactggggg tagtcttcct
ccatgcgggg 53100tgcaagtttc aagtcaggac caaaggtctt gcctccatgg aagtcagctt
tgtcattctg 53160gcctatgagc ctgttgtcag gggaatctcg ctgttcctgg agctggggca
gcgcgctggg 53220gttagggttc ctcacactgc ccacaaagag gggcacgcct atggtgtcct
ccatgatgaa 53280gaagaggaag ggtcggttca cagtgaagga ggagagggac attcgattca
tggctacgct 53340ggtagctgcg gctgcctcca caccagcctc gctgagctcc atggtagact
gatgttgcac 53400gctagacacc accagattct gctcagagat cccacgaagg tctgggccct
ggaacaattc 53460ctgcaggcct gcccagaaca gcagatgact ggtcagtgct gccccaaggc
tatgtggatc 53520tgtctagcat cctggctaaa gggaacactt gaacccagcg gttgattgga
atctgttaga 53580cctcagtcta gacaacactt ctagaaacct tttttttttt tttttttttt
ttttaaatca 53640ggatctgcgc taggtacagg acagaaagtc tagaggagca tatcaaatgc
tcccatccag 53700gaagcagggc cacctctggc tcaggcacac tggcagctcc cgtactctgc
ccagaccacc 53760taggggcacc ctatccccaa gctccttacc cagttggctg agggtggcca
ccaggtccag 53820ctgctgttgc agatggagtt taggcagcca caccttggtg ggcctctcct
gcagcgaggg 53880atggtacaga gtatcccagg tcaggttggc tagtacctcg gacacgttcc
actcaaaata 53940agtgggcatc acgaccacaa agctcatgtt gttcttaaag gggaaatgag
ccacctacag 54000ataagaaaag gagagaacat gaggaccaga cagcacctgg acctgtctgg
agtctgggcc 54060aaaattactt ctgtactttt gagacaagag ccagaaattc agggttagca
tgctttcact 54120taactggtga agtggaataa taccacttac ccctttgcaa ggtgacatgg
gaccaaatga 54180gataatgctt ttacacctct ctgtgtgcac acataagcat atatgtttgt
atcggtgtga 54240gtgtgtttgc tcatgggtat atggagtcag aagtaggtaa acatcagtcg
tcttcctaca 54300ttgctctcca cttttttttt tttttttttg gtgttgccat ctttttgttg
ttgttatttc 54360aagacaggct ttctctgtgt agccctggct gtcctggaac tcactctgta
aatcaggctg 54420gcctcgaact tgcagagacc cacctgcctc tgcctcctga gtgctgggat
ctaagatgtg 54480tgtaactaca catagctccc tcttttttgg acacagggtc tcatggatcc
caagctggct 54540ttgaaatgac tgtttggggc tggagagatg gctcagcggc taagaacact
gactgctctt 54600ccaaaggtcc tgagttcaaa tcccagcaac cacatggtgg ctcacaacca
tccgtaacaa 54660gatctgactc cctcttctgg agtgtctgaa gacagctaga gtgtacttac
atgtaataaa 54720taaattaatc ttttttaaaa agagaaagaa atgatggcta catacttctc
tctcgtctct 54780ctgccccaag tgctgggatt acagagctgt acaacaagcc caagtttgtt
gtgttttaga 54840catgctaatg tatcccaggc tgtcctcaga ctctctatgt aattcagaac
gaccttgaac 54900ttcttttaag gtttattttt atcttatgtg tatgggtatt ttgcctgagc
atttgtctgt 54960gtaccgtgtc cttgcagtac cctcacagtc cagaggaggg caccatttcc
ccctgaactg 55020gttgtgagct gcatggtggg tgctgggaat caaaccctgg tcctctgcaa
gagaagccag 55080taagtactct taactgctga gccacttctc caccttgagc ttttcttcct
cctatctcga 55140tctaaaagta ctagggatgg cggatgtgcg ttcatgtgcc tggtttatgt
gttgctaagg 55200gttgaacaaa gggctttgtg catgccaggc aagcactcaa caactgagct
acacatcccg 55260acagactttg actcttctag tagtagtgtc tccactacag cctgagttct
ctatctgctg 55320tcagcaagct gtacaaacaa gctatgggcc ttcctgtcct tgcctctcag
ttctctccgc 55380aggtggggct actggctttc aaaatgaccc atagaggagc cacagcaaac
agtaggaagc 55440ttgcccctcg tctttcaccc tctcccagag agtcagctat aattcgagtt
tttttttcct 55500ctctctctct ttaaacagga tctggttatg tggccctaac tatcttcaac
ttcagtcttc 55560ctgcttcaac cttctgagtg ctgggattat ggtgtaagcc accacactca
gctcacacaa 55620cctttttttt tttttttttt tttaaagaat ccatgcagtt aggacagcat
ggaaatgacc 55680aggctcaggc ctccctgggt accagcataa tgcctgcagg cgggtcctct
gccagtgggg 55740ggatggaaag atggagccag aggatctttc ctctctgaac ctcaatgtcc
cacagtgaga 55800cactcatgtc cactgggaga tactgtagta ttcaaggaag aagcaacagg
aaggtgagag 55860ctaagtggag ctgagcaggc tcgtatcctc tcaccacggg ctacagagaa
gtctggctgc 55920cccctccaca tggctcctcc ctgcagaact ggcaatgctg ggcccggctt
gcccagtcaa 55980actaaccaac agaatggatg agcatgtgtg gtgccacaca cctgggaccc
cagcactcag 56040acagctgggg cagaagggtc atgagtccaa agcgaacttg tgtaacattg
tcagaccctc 56100gaacaaacaa aactagcccg tcctgttatc tcagccacag atgatgggcc
caaggatcag 56160tactctagcc aaggagtcac ggttaggcta gaagcaaggg aagccttagc
tgagacagct 56220tggcacggag cttcatccaa tcagaatgtt cagagcaata agctttgaaa
cccgacttcc 56280atctatgaag cactgtgtgg gaactcctct cttcccttac gagcagggcc
ctggtcctct 56340tgggctccgc taaaacccca gcacagagaa cagttacctg gcacgtgaca
aaaactcaat 56400atattttctt tgaggagatg aacctcaaag aagctgtgtc ctggatagac
acagcataat 56460aaacccttca ggagctacct acccagggac cagactttac ctcccagtac
caggcctcgt 56520ttgccagcca aaggcaaagt ccagactgac ctgtatctca ggttgctcca
gcaggaacca 56580tcgaagagga tatgacaccg cgtgcatcat gtccaccgac actgtgaacc
gctcatccag 56640gtggaagaaa tctttctggg tgaggctcgg gtcaaacttg gtcctccaga
aacctgcagc 56700caggcagagg gcaggagcca tgtaacataa aatcagcctn ctgcctgtct
tgcctagaac 56760ctatnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 56820nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnaaccaa
ggcaggtctt 56880ggaaaaagga atcttaaatt agaagatgcc ttgataagat tggcatgtag
gtatgtctca 56940ctaatgattg atgtggaaag tcacgaggga tggtgtcacc ctgggcagat
ggcctggggt 57000atataaaaac acaggctgaa caaaccacaa agcagtagtc ctcaatggct
tctgctttag 57060tttctgtctc aggttcctac cttgacttcc ctcagtgaag gcatgtcaca
tgagagttgt 57120aagaggaaat aaaccctttc ctccccacat agtttttggt tatgatgtta
tatgtcaaca 57180acagaaacta taactaatat agttggtttt ctttttttgt ttgttttgtt
ttgttttgag 57240acagggtttc tctgtatggc cctggctgtc ctggaactca ctttgtagac
caggctggcc 57300tcgaactcag aaatccacct gcctctgcct ctgcctccca agtggtggga
ttaaaggcat 57360gcgccaccat tgcctggctg gttttctttt ttttttaata catttataat
gcattttaga 57420tttaaaaaaa aaaatggcca tggcatataa tataaaaaga agtgcttaca
aatcaccatg 57480tgcccttgcc ataaattatg taaaaatttc catatggaca tcagtctcaa
gcttacaatc 57540tcagcactca tgagcctgag gcagaggcag gaggatggtg agctcaaggc
cagcttagtc 57600tacataacaa gatcctgtcc aaataataac aacagtaata atttcataca
tagaactaga 57660aggggccact gcaaagacag tatgacaaaa ccactggccc tgcctaattg
tattttaaat 57720aactgtcctc ctctctgtaa ttttcagttt ctaattttta cataactacc
atgtattctt 57780tttgtaattt taattagttt tttaataata gaaacaagct aagtgctaag
aatattttca 57840tatgaacatt ttcaaggcac ttgatacata cctcagattt gccctccagg
tgagcagtac 57900caattacgtg ccaccagcaa tgttagcttc cttttttccc taccatctga
ttctgtttca 57960gtctattcgt agttctgatc ttgttatatc cctttttatt gtttccctgg
gttccaacac 58020ctcccagttg agtgttctca ttgaatttca ttagcagctg tttcattaat
ggcacagaag 58080aaggattaca gtgttaacta ggatagactt tgacaaagaa ctatgagaac
atatcttatt 58140atctttgcat aaattctttt taatcaaagt tcctcaaaag cctctctctg
ttcccatctc 58200agggagtagg tctggccact gatgagtgtc caggccacag tacaggtgtg
cgtggttctg 58260tccctgtggg aagggcacat ctgtgttgta acaggattcc tgtcttaaca
agccttgctc 58320aggctctaag tggtcctgag ctagctaact gcccttggct ttcccttgat
taccagataa 58380ctattcactc ttctcatttt gcagagcact taccaggtag ctatgtcctg
gaagtacgaa 58440tgagtccttc tattgttttt cttttactta aatcccattt gaaatgcgcc
agggacactt 58500caatccaagg tacacttttg ctaaagaatc actcattttt atatgcaaaa
tgtcacctat 58560taactgcagc tgatatggta catacatatt ctctcttcct attatccact
aataggtgac 58620taatgcgaaa tattgagtaa tttttaaaaa tcaatactca attttttaga
aataattaga 58680gagacattca actctgacac cagcacccta ctcagttcct gagccttcct
ctgccggagg 58740agaatctata aataactcac gaagctgaca ttactcactg tgttgcagtc
atttttttct 58800gagaaaattt tagcaactgt tctaatagag cctgccagtt atcagtagtt
gagaatgcaa 58860gtcaactttt aattatgcag acgctgatta ttcagacgac aaattgttgg
tgcctgcacg 58920gctccttcct gctgcctacc tttaaccgtt ctcagtgctc attagcacat
gttccagaag 58980gtaggctttg gaggggcgga caggcactca aaccagctaa gcacttagag
aagctctgat 59040gaaagatgtt aatgcagttt gtagaattat tgactaaaat tgagtcattt
ggattccctg 59100tgaattgtat ttacatgccc tgtccctgtc ccccatagca acagataata
ggattgtctg 59160cagagagaca acatagttct tatatttaat tttttccttt gtcgaacatt
ttcacatgat 59220ggttcgtggt gtttcctttg ttcattacat ttgtatccag actagttact
tctgataagc 59280ggttagttag gattcctggc acgcggacag tgacaccaca gttgtctgat
cgtttcccac 59340ttttttacaa aaccgtttgc ctttaagagt cagtgttttg cacatttcac
ccagattatt 59400ggaaatatta tttccctcct gcttaaaccg aagctgtgat cataatttaa
gcctttctag 59460gtagccgatc ttacatgtat catacctatt tctggcatat gtttgtctat
tacaaagacc 59520tcgtaggtat gcagttagaa gcctctagtt aaatgaaatg ttgcgtgtgt
gatgaacctg 59580gagtggggat ggccttttgt gtgccccaag gctgttgtgt ttcacacagt
tgttttctgc 59640ctcctctggt ctatcactat cctgccactg ccagaaaacc ctgctgtgtg
ttccccgcgt 59700ggaggatctc tgcttctgaa cttctttggc ctgagaaact ccataaccaa
atcagttagc 59760attttgttta aagagcaggt aggctgttag agcttgggtc ttacatgtct
cccaggtcca 59820cttgccagcg ccttgaccac tgttaacttt tgttaaccaa ctcatctttt
gctgcctgtt 59880ttttgggggg tttttttggt tttgtttaag ccaagatcag ttatatggcc
caggctgagc 59940ctctcttccc agcctctcaa atgttagaat tacaagcatg catccctcag
catacctttc 60000ctttgctttt tttaaaatag agttttgcca tagcaacaga aatctaacct
aactaagcat 60060agccgtgcac atggtatgag gaactcacat atgtgtgaat ggaagttcat
agagaccggc 60120atcactgcct agaggcccct ttcttccttc cttgcagttg tcgtgctagc
tgactgtact 60180acaaaagagg ttgtctgagg cataagacta ccttcaataa aacatgcaca
gacagtttgc 60240ttctctgaga tttcagagca gtgactacct tcaataaaac atggacagac
ggtttgctta 60300cctgagactg cagagcagtt tccaaaaatt ttagacaaag ggtaggatga
agaaggctgc 60360ggggttttgc acacacttaa ggtgcgtaag taaataaact gagctacact
gacaggatgc 60420tcgttctagt agccaaccaa agagcagttg aaccaaagca cctagacttc
aaacatcgtg 60480gggagataat cttaggagtg ctatgcttct gcgtcctaca agtattatga
aactgtctag 60540aaagcacccc actggtaatc cctttttgat tatttttttt ataaattcta
gtcttggggt 60600tttgagtggc acacagacat aatggttagg cttcggtgtg tgctcattca
ctttgcttcc 60660tggggaccag agtttgcgat gagtcatgtt ccatctgatt tctgtcggat
ccggctgcag 60720agccatgact cagatgggct tcaggcccag ctgctcagtt catcttctgg
ggaatagatg 60780acaaggacgg gacaaatgtc ctgacgcaca tttccttctg ttcttgcact
tccagggtct 60840aacgagagca tcattaccaa cagcaggcag atacgccttg ccacaggcat
cttccctgtt 60900gtcagcctcc tgaaccactc ctgcaggccc aacaccagtg tgtccttcac
tggcactgtc 60960gccaccgtcc gggcagcaca gaggatcgca aaaggacagg agattctgca
ctgctatggt 61020gagccagcct ttctttccac taccctgctg tgcctcacac ctcacatgaa
aaggataagg 61080ggacaggaat cagcagatat gggcccagtg cctctactca tcctctgagt
ctttcctgga 61140aagggcaatg catccttggg ccaataaaaa aggtcttctg gctgtaataa
aaaagcccgt 61200tgagggcagt gagccatatc cctccatgcc ttgtagacag cctatcctga
aaatgagcga 61260ggagcacttt cttggcttct ttcttcctgc cccagcagct tggaaacgta
tccactttca 61320cccgtgtttt gttgtttttt ctgagatgat agggcagagt acccaacctc
atataggcta 61380ggctagtgtc tatcactgag ccaggacccc aacccagcac caccatgcca
gtcacgtgat 61440gactaggcca gcccctcggt agagtaggca ttgactctct tggtgtgact
aggaactgtg 61500ggtaatctct ctccagggcc tcacgagagc cggatgggcg ttgctgagag
gcagcagagg 61560ctgagttctc agtacttctt tgactgccgc tgtggggcct gtcacgctga
gacactgaga 61620gcagctgcag ctcccagatg ggaagccttc tgttgtaaga cttgcagagc
gctcatgcag 61680gtaaatctct gctgttccca ggggcagggc tccagctaaa ggttgtcagt
cgccaggaga 61740accattcctg cttcccttct tgtaactcct ccctacatgt cgcccggtcc
tgcagaaaac 61800acaggttgta tttcctaata ttttccctat aagtgacaca aaatcttaaa
ttacacaaag 61860ggaccaaaaa aaaaaaaaaa aaaaaagccc tagaaattta cttgctcaaa
taagtcatca 61920aaagttgtgc atcaggccta gcacttgggt actggtaacc ctagcactca
ggaggctgag 61980gaagaaggat ctcaagtcgg aggccagtct caagtgacac cccatctaag
agatcaccat 62040tccaaggagc tatttcagag atggtttaat ctggggaccc agattgtgga
ttttctgtct 62100gttcaattcc atctctctgt gctggcctca tcagacacac tctgtagtaa
ctgtgggaaa 62160atccgaccca catagttttc cctcagcctt tgacccagag ggaagagcca
cagtggagag 62220catgagagca gacccttggg tgctactgcc aggtaatggt gtagacactg
gagtcttcaa 62280cattcatgcc ccaatgcaaa atggtctcca caccagagca tggcattctc
attagaaata 62340agtaaatgga attggctgtg ttgaaaattg taaagccaag ggtcaagaat
gaagccttcc 62400ccagcatgtt ttgttttgtt ttgtgtttta ggcagcgtct ctctgtgtag
ccttggctcc 62460tgccctctgc tacctctccc aggtgtgcca ccatgctggg cctaagcgcc
ctgtgcatta 62520gtgctccctc gatcctgctc actcttgaga cagtcttcct tctactctgt
atccccagat 62580aacctagagt tcacttcaga gcccaggctg gcctcaaact tgagatcctc
gtgtcccagc 62640ttctcaaatg cagtgatatt tacaggccta cacctggctt tccctgatag
attcctagta 62700agatgattat cctttgagcc atatctctct tctgcttctt cctctcttcc
tgcagggttg 62760atctagaatt tattctaaag ctgactggcc tcagaattgc catccttctg
cctttagnnn 62820nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 62880nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaca tagtgtcaac
tttcaaattc 62940tgccttaaga gttctttgtt tatgggaatt tatgggaatg ttccacagaa
cccatccagc 63000ggagttctgg ctgttgtttt ttaatcttta ttcatcttgc gtgtgtgtgt
gtgtgtgtgt 63060gtgtgtatgc gcgcgtgctc aacttgcaaa attgcaaaat tcagtctcct
ctttccaccc 63120tgtaggtcct ggggatcaga ctctgttagg cttggtggta ggtgctttac
tgagccatct 63180tacaggcccc ccatggacaa ctttttcttt gaaaacctgt ttctggcttg
ggtgtgatag 63240ctcacacctg tgaccctacc accactcatg aggaagaggt aggaggacta
acagaattgg 63300aagccagcct ggactacaca gtgagtaaag gctatctata tactcaccac
atggcaagac 63360cccgttttaa aacactgggc aaggtgaaac aaaagtcaat taatttcaca
taaagtcaat 63420agcttcatta acggcctagt tatctttaaa actgtatgca ggttagtact
tggtttcaat 63480tttattactt tttctctgga acatttaaaa gtactttagg ggctggagag
tcagttaaga 63540acagtggctg ctgccaaagg actggagttc actcccaagc acccaggtgg
caatcacaac 63600tgtctgtcat ctaattctag gggatctgac accctcacag actcacaggc
agtggaacac 63660caatgtacat aaaataataa ttaaaaaaat gaaataaaat accaggcaag
gtggcacacg 63720cctttaaccc cagcactcag gaggcagagg caggcagatt tctgaattcg
aaggcagcct 63780ggtctacaga gtgagttcca ggacagccag ggctatacag agaaaccctg
tctcaaaaaa 63840aaaaaaaaaa aaggacttta aattgggctg gagagatgga ttaaaagcat
tggctgctct 63900tcccagaggt cctgggttca attcccagca ctcaaatggt ggctcacaac
tgtctatatc 63960aacgcaatct aacaccctct tcaggcatgc aggttacatg tagacaaaac
atccatatgc 64020ataaaataca taagtaaatg agtcttttaa tgtatactag aagctgggtg
gtggtgcatg 64080cctttaatcc cagcacttgg gaggcagagg caggtggatc tctgagttgg
aggccagcct 64140ggtctgagta aatagagcct tgtacttcta cttatcacta cagttacatt
ttataacttt 64200gggccctagt gcttccattt tccactgttt gcttaaccac tggggcctga
agcttttgtg 64260ctgacacttt tgttcgctaa tcatcaggca accaatggtc tctacactcc
atcaccatca 64320acacaaacaa aacaaaacac aacactacgg atcctggcat ggtggaacat
ctttagcccc 64380agtacgtggg cttgagttca aggccggcct ggtctacata gcaagttcta
ggatagtagg 64440gatagtcttt aaaacaaaac actattttat ttatgaacaa aacatgtaaa
gaaagaaaaa 64500aaactgcaaa tttatctatg aatgaagtct aagtaatact tcaatattgg
aaatagcttt 64560ctaaaatatt tttatttaaa gaaaactcag caaattattc aaacaacctt
ataaacgttc 64620gttataaaag taaagaatta tttgcaattg ccttaagggt ccaaggtggc
agcctcttaa 64680aattcagaac aatccaagct tcacattcca gttcaacatt tctacagccc
taacgtattc 64740aaatacctcc attctgacaa ctgtttcccc tcttcttttc ttctaagctg
cttagatgtc 64800tgtcccaggc ttttcatgat tttagtcatt cacacaacta gcaaacatta
tctagggact 64860aaaacttgcc agatactggg atatcaccct aaagggggac tgaaagtagc
tgcaggctac 64920agtctctaca atctcctgaa tgaaatacaa agtagctaat atttaccaaa
taaacatgta 64980cacctgtgat gattgctagc tgtactagca gaagctaaac actaaatcta
gaaactcagt 65040cctccaacta gccccttgct cggcttcagc ctcattttta caaacaaggg
aaagagtttg 65100gaatgttgcc caaagccata cataagtgaa caaaaaggag ttggagtctc
caaatgcatg 65160gatttgggct agttactttg ccaaccaact cagtaacaac tgagctgaac
aggaacactg 65220tggtagcaaa agaaactgga actatcaatg gcctctagag caaaaatata
tttaaaaaga 65280aaaaaacaaa caaggcctgg caaggagact gtgagaagag tgtgctgact
gaaattgact 65340agttcagcca acaaaagact attccagggc tggtgagatg gctcagtggg
taagagcacc 65400cgactgctct tccgaaggtc aggagttcaa atcccagcaa ccacatggtg
gctcacaacc 65460atccgtaaca agatctgact ccctcttctg gagtgtatga agacagctac
agtgtactta 65520catataatca ataaataaat ctttaaaaaa aaaaaagact attccagtgg
ggatggaaaa 65580gttaagtgtg gagttaaaat atacttcaac tggtgatgga ctaggtgtcc
agagtcgggc 65640aaaaggatgc tctgtggtag aggtgcctgc tgtgtaagcc cagctacctg
agctcaatcc 65700acagaatcca cagcggagtg ggaagagaaa caacgtccca gagttgtcct
ctggcatccg 65760acgcacattc gccatcccca agatgtcata catatgtgta catactacac
actggcgcac 65820gcgcacacac actctttttt aaaattcaga cttagaggga cataaaggat
ttgctctgat 65880atatgttcaa ttgaaaatga ctttgaagat agagggcaga tcgaaggaag
ctcagcagga 65940aagaattaat aacatgcagg tgaagggcta taaactagtc tgcagagggc
cttggctcga 66000caaaaaaatc tatggggttt gccggtaaaa taaggaaaaa gttgtcaaca
tgaaacacag 66060aacactagca agagaggagt gttagcagaa agaagccaac aagctcaaac
aattaggtcg 66120gctgaaaaat tttaaaatgt cttctgattt ggctactggg aagccactgg
tgacttcggt 66180cagcgttttc tctctcgtga ccagagagat gtctagtagc aataatgagt
taggaggatg 66240taaaagaagt aaaacagccg aaaacaagtc caaaaagttt ggggtgatgg
agaaagggag 66300gaaacagagg ccgccgaaga tagacagcgg catgtttatt tgtcttgttt
tcttagatgt 66360aaacaaacta aaaaaactcg tgagttcttc tgccagtacc gggttgcctc
cagcatcctc 66420tgatggtctt agagaccccg ggatgctccc ccgcggccgt ataatttcct
ccctgacgct 66480ctcccgatcg acagcggctc cctccccggg tcctctttgc accgctccaa
ggccgcgctg 66540ctagggccat cgagcccgct cagggtcgtc tccttacctc gatggccccc
tcgctcaggt 66600gtcccaccat ggctgcaccg ctaactcccg cgctcgcgct cttgcaccgc
ctgagcttct 66660ctgccggggt cccgcgggct gctcaacgat tggctagagc aactgtgcgt
gccgatccgc 66720ccccagcgtg agcgcggtgc gaggggcggg cctagacgcc gatagccacc
gcattggcta 66780ccgcgcggca ggcagagcac gtgactcttc cgaggccggg ttcgaggcct
agtggcggga 66840tggcgggacg tgagggcggg gcgctgggtc gcagtgcgcc tgtgtcagcg
cggtgctact 66900gagttgttcc cccgccagct gtcggaactt tgcccgccca gtcctttggc
ggacagacag 66960aatggcaacc cagggaacag tcggagctct cccctggtaa ctgctgctaa
atatagtcaa 67020agcagtgacc tgggtacttc ttcacgcagt gcgtgcccgg cgccggtgcc
aggcccagag 67080cttggcactg tgggataaac aaggtaaatc agactcagtc tccgccctct
tgagttccac 67140ctgagagttg tggccgcaag gaacccagcc tcaaggatgg tagacgcgat
atgggccaca 67200catgtggagc tccagagtgg gggtcaaaaa tcaatcaggc tttcgagagg
cgatgcggtt 67260tgaactgagt taaagtgtgt gtagaaattt gtcaggtgga ttccagtgag
gatagtgatg 67320ttcctaaaag cccaaatggc ctatgcaaaa gtattggaga gcctggcgtg
ctggctggct 67380ctgatctgtt tgtaatccca gcctttggga tgtagaagca gcaaaagttc
aaggtcaccc 67440ttaacaccgt tgagttcgag gtcaacctga actaaatgag accctgaaaa
atcaaaattt 67500gggacccagg cgtggtggca ttcgaggtaa aagcaggcag atctctgagt
tcgaagccag 67560ccaggctaac ataagatccg gtctcaaaaa aaaaaagtaa taaaaataaa
aagggagaga 67620ggctatatga actgaaagaa agacctggag atcaaaacag aaaactgagc
cgtctaagaa 67680atgaaaatat ttaacttcat agttgctgga gtaagaagtc tggaaaactt
tgggcaacta 67740aggtaaacag gtctagaaag actggaatag tagccatcta ctggtatttt
gatctctgtt 67800tgtacaacca caacctacta tagtttctca aacagttcca aagaatatgt
ctgggtgaat 67860tggtaccaca ccacagatta actctccttc agcatatcaa cagctataga
aaaccccaga 67920agaaatgatt ttggttgcgt gtcacttggt aggatgaaat ctcgattttc
tagaactatg 67980cattaataga aagctgaatc ttcatgttct gactttacag agctgcggca
gcatggatct 68040accggtggat gaatggaagt cctacctact taagaagtgg gcttcactcc
cgaagtctgt 68100gcaggacaca atttctacag cagagacttt gagcgacatc ttccttcctt
cttcttccct 68160tcttcagtaa gtgaatggaa acttcaggga aattttggtc tggaaaatgt
tctgccttgt 68220catttggtct gaatatctct tttttatagg agagagtagc tttatattct
ttatagtatg 68280gggcatttag cagttactgt tggttttcac gtttctccct agtctgtgat
tactagaatg 68340ggtaggcact aactgctttc ctcttttggc atgtgttata cttaaggaat
gtagtatctt 68400gctgtcgtcc cagtgctgtc actcatagga tctggtgcag gttgtgtagc
tgcccctaga 68460agctcattca gtcctaatgg ggagaaagaa ccctggcact tggttagttg
agacccanaa 68520cttctcaagt tctnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 68580nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnntttagtt 68640tccaagtgcc actttactgc aatgtgtcac cacatccaga gttctgtgtt
tgtttatttg 68700tctgtttttg agacagggtt tctctgtgta gccctggctg tcctggaact
cactctgtag 68760accaggctac cctcaaattc actgagatct gcctgcctct gcctccagag
tgctgctgta 68820cactaccacc acccatccag ctttctatat tggttttcct atggcgtttt
aaaatagcca 68880ttatacgtgt gtttatcatc taaagtcctg gtcccaaaag gagatgagag
gggctgctaa 68940ggtgaaaagg attacaaacg cttatcaatt ctgttcaaaa attaaacctc
agagtgggat 69000tagctgcttc tttcattaga attgctatca gaattcactc aggccttgtt
tgcgtgtgtt 69060attgaagaag tctcttcctc atcaggtcag tgactcctta gctcaagtac
atgcaatatg 69120cagtattgat aactgttctc gcttaggaat aaaaatagaa ctgacttcca
cagggaaatg 69180atgtgctgag ctgtagcaac gaatcttgca caaactctgt cagcagggac
cagctagtct 69240ctcgcctgca ggaccttcag caacaggtct gcatggccca gaagcttctc
cgaaccggta 69300aaccaggtga gattggctcc ctcgccctag gcctcagccc ttcccttgtt
tattttggta 69360tcaccttgcc ttactgagca gtcctcaata aatgactgag gacttgaatt
taattatccc 69420agcaccagcc acaagatggc tatgtaggcc agtgagacca gactgtgacc
agctgttact 69480ctggtgccct tgaaagtctt cctgatggtt taagctgtgt ctgctgcgcc
agatagttct 69540agcagctcga gcaccagaaa ggctgtctga cttccatggg ctttgtgtgg
ctccagaggt 69600ccaatgccat catctgattc ccagcttaag gacctaagct ccgagaaggt
tgctctgccc 69660tcagcagcag cagcaagtcc tgagtgctgc ctgggctcgt ggtgtgactc
aggagtagag 69720ctcggtagct agcctgagct gagagctgag agaaagaaag gactcctctc
ttttcagaaa 69780gggatttgca gaactcgatg ttagaccctg acatggtagg aatctgtttt
gactattcta 69840gcctagattc tgaagttgac ctttagccta gagtcaagaa aactaatgat
tacaggagga 69900atgtagagtt ggttgttaaa tgttggttgg aaaatggatg ttagaagccc
agggtaaatg 69960tgaggaagcc tcatctaaca cctcttttac tgaaagagaa aacataagca
accaacagct 70020tccctggaat gcccggctgt tgactccgtg agataaagag gcattttcac
tttgacctaa 70080ccgatagaga ccttgcaacg tggtctctcg tgtccaggac tagatctgta
tctgttgtga 70140ggcatttttc ctttgaatcc atagagcaag ccattcagca gttgttgcgg
tgccgggagg 70200ctgctgagag cttcttgtca gcagagcaca ccgtactggg ggaaattgaa
gatggcctgg 70260cccaggccca tgctacctta ggtatgctac cttaggtata gccggagttc
tccttccctg 70320ccgtgtgttc agtgcggccc ttgccttgtc tgtttggttc tctcttgcca
tctgaattga 70380cgctcttctc cctcccattc tgcattcctt gcccccagag ccttaggcta
atggtgtttc 70440ttttccggaa tgagacattt ctcttctcac agggaactgg ctaaagtctg
ctgcccatgt 70500acagaagagt ctccaggtgg ttgaaactcg ccatgggcca tccagtgttg
aaattggcca 70560tgagctcttc aaactggccc aagtcctatt caatgggtag gcctttcttt
ttcctagtgt 70620ttggccaggg cacacagtgc tctgtgtttt cctaggtgct tctgtgtatg
gctttttgct 70680acagtgcttt aaagcatgtt gaaactcttt tatttcctct ttaggacaga
tatttgccct 70740ctgcttcact gatagacttt aagctttgaa ttccttcctg aggatgtgga
gaaagccatt 70800aggtctgcat ggagcttccc agggaggatt tggaggcagc ctcacccgcc
tctagcattc 70860ctgtctgctt aatcacacct cccttggctg cctcagtccc tgctctctca
actccagggc 70920tcggcccttt ccctggtttg cctcttattc cttttaaagc agtggttttc
aactagaagg 70980gattgcaaat ggcatttggc agtgtttaga gacagttttg attgttatgg
ctgccagcat 71040ctagtaaagg ctaaacctac agtgcacagg accgcctcca cagtggagag
acccaagtta 71100gctatgtgaa ggctgagaat ccctgctttg gagattaaaa aaggaagctg
agggaaccac 71160tcagttggaa gcacccttgg tggcatgcac aaggccctgg ttctgtccct
agctctgcac 71220aaaaaataga atacaaggaa gagtaaccct aatgagctgg tccctcaccc
agtgtgccac 71280tgaggtcact tgaagggaag tctagcccca atttagtatt ttttgtggct
gccatacctc 71340cagccttgat caaatctcat ggtatacatt ggtaagaaaa agggtttgaa
acatagacct 71400gatactcgga catggaaaca gtatgtttgg tcagagagag cgaaggacct
gatagacgag 71460ggcaatatca gagagagggc atcagtcggg ttagacacga gcattccaca
gtgagcagct 71520ctggataagc ttttataaat gctggttaag gttttgaatt tgcccaattt
tgtcaggatc 71580ccagagtcta tcacaaacat acacagtttt ctcaaatctg ctttgcagta
tgcccgtgaa 71640tgtctcttat ctatactttc agatggtaag accctgaggg cagaggaact
cagacccttt 71700gtgccccctg taagaccctg ggggatgcag tggacccgac tttgtgttct
ctgcacagaa 71760aggagtccac tttcgttgag actaaggaag ggaactgaca agcttccctt
tctggcttca 71820ggttggcagt gcctgaagct ctgagtgcca tctggaaggc agaaaggatc
ctgttggtgc 71880actgtggccc tgagagtgag gaggtccggg agctccggga aatgaggtcc
tgcttactgg 71940actcgtcatt cgtccctgtg gggcccttgg tgtagagcaa tcatcctcac
cctcaagaag 72000gagctctggt gatgactgag atgttctgtt ggcttggagc tctcatcaga
gaggacggga 72060ccttcccacc tgacctgagc ctagtgtctg gcacagagag cacttgaaaa
cagattgaga 72120cactcacctg ccatgctggc tgctgcttgc aagagctaac tgccctctga
tggaaacccc 72180atgcccagaa aagactaaat ccagtatcta aaggctgctt taaagggttg
tcactgcagc 72240cgggcttggt ggcacacgcc tttaatccca gcactcggga ggcaggcgga
tttctgagtt 72300caaggccagc ctggtctaca aagtgagttc taggacagcc agggctacag
agaaaccctg 72360tcttgaaaaa ccaaaaaaat aaaaaataaa aataagtaaa aaataaataa
ataaataaat 72420aaagggttgt cactgatctg caggcagctc atgctagcct aggcttttgg
ctcgatttca 72480tctcactaaa cgatgaatct gtttccctgg aacattccta tggtttctag
tagtaatgaa 72540gtgctgtgtt ccactccagt gagaacttca attcttagtc ttgtattata
attgaaaaat 72600aatatatagc aagaaatcag tatgactgct tacctcaaga gacatacaat
tccacttaca 72660atatcctgct tccttaaatt tttcattaag actggtgata tataatttgt
gaatggagaa 72720ataaatacgt cttactgttg gcagtttctt cctgggatgg caactctgta
ttggtttcct 72780accagtgtcc taattcttac tcagtggctt tcattgagtg ttcttggcac
tcactgtcca 72840agcactgatg caaggcaacc ctgtagcatg acttcatagc acaggcctcc
ttgttagcac 72900acctgaaagc agaccactct ggctgtttca cttgcagaca gaatcttact
ctgtaagcca 72960gtctagcctc aaacaacatc ctcctgcctc agccttccaa gttctaggtt
tataggaaaa 73020ggccaccttg cccagcttga gactgcttct tactgccatg tctcttcagg
ctcacacatg 73080aagtccaggg cactccagga ggagccgtga gtctgtctgc agggcactcc
agggggagcc 73140atgagtctgt ctgcagggca ctccaggagg agccgtgagt ctgtctgcag
ggcactccag 73200gggaagccgt gagtctgtct gcagggcact ccagggggag ccatgagtct
gtctgcaggg 73260cactccaggg ggagccatga gtctgtctgc aaggcattcc aagagcagcc
atgggcgtca 73320ctcattggta gactgtgagg ctacatctcc agatgccccg agtgctgtgg
ttgtgagcac 73380tgctgctcat ggtttccaac tgagacagag ggaaggactt tgcccctttc
cctaaggatg 73440ggtagtaata gtccagacca caagggacag atagctatgg ggttttctga
ctcatcctta 73500gtacattatt gctgatgacc agtttgtttg gatgagttag tgggaaagaa
gacccaagtc 73560catacactct gctttttaga acttgctcat cctagccatg cccaaggagc
agccgttgac 73620tgtcatggca ttacagtgag gaaataaaca gtcctgaagg tgcctggcag
cagcttttca 73680agaagctggt gttaaaagac agtattcaaa catctgcgga ctgggaactg
ggcagcattt 73740gagtctcctg ctgtctgtta atttaccctg acaaggaggt gacttgaaag
gtttgttttg 73800tttggggtag agctttttca ggaaaaaagt ttagtcctac agacaactct
atagttattc 73860tagtccaaac tcatgccttg tgttttattc ctaaaagccc tgtcacactt
tgtaaaatag 73920gtgctcttcc tcaaaggata tatttaacgt tttatatatc aggccttatt
ctgtgcatgg 73980aagctttttt tagatgcttt gtaagatggc tcagtggtta agagcatgta
ctgctcttct 74040ggaagtcctg ggtttgattc tcagcagcta acaccagctg ttattccagt
tcctgggatc 74100tgatgccctc ttctggccta tgtgagcact gcatgtgcgt agtgcacaga
caaatgcagg 74160caaagcactc atacataaaa ctaaattcaa aaaactcttt cattgtctca
tgtgacctag 74220cttgagaata cctgtgctta tattataatc tagtatgagc cagccacggt
agcaacacac 74280ctattatctc agcactcaga agattgagac tagatggtca agagctagag
tctgggttac 74340aaaacacctg tctcaaaagt aaaagggctg aaaaagtgtc tcagcagcta
agagcacaca 74400ctgcttctcc agagggcctc atttcagttc ctaataccca caccgagtga
ctcaaccacc 74460tgtaactcca ggtccatgag atccaacacc tctggtcgtc tgcataagct
cctacactca 74520attatacaga gagagagaga gagagagaga gnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 74580nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 74640nnnnnnnnnn ntctagagtg tttcaggttt tttttgtttt tttttttttt
gagacaaggt 74700ctctctatta tgctgcctgg aactttctat gtagaccagg ctggactcaa
acttatagtg 74760atccactact tctgcctctc agtactggta ttgaaggcat gtgtcaccac
accccactac 74820ttcaagatct tagatttcca aagaagccgt agcctagaaa aggttaataa
gtactgattt 74880aaaacagaaa gaaatcaggt acacttagag ctgtagaatg tcagcatgtg
acatttgtga 74940caagttgtca aaactttgct cttaattcta aagagagaag ctgtcaaaag
acttgaactg 75000gggctgtagc caacttggtc gagcccttgc atgaagctgt gtgtttactc
cccagcactg 75060tggggtttga attgatttga acccagtaga ttcgtatatt tgaatgttta
cctcatgggg 75120aatgacatat tacaaggtgt ggccttgttg gaggaattgt caatttgggg
gtgagctttg 75180aggtctctct gctcaagctc tgcccagggt agaaagggag cctcctcctg
gctgtctaca 75240gaggacatag tctcctggct gccttcagat caagatgtag aactcttggc
tcctccagca 75300ccaagtctgc ctgcacaatg ccatgcttcc taccatgatg ataatgaact
gaacctctga 75360aactgtaagc cagccccaat taaatgtttg tctttataag agttgccttg
gtcatggtgt 75420ctcttcataa caataaaagc ctaactaaaa cacattcctg ctgggcagtg
gtggtgcacg 75480cctttaatcc cagcacttgg gaggcagagg caggaggatt tctgagttcg
aggccagcct 75540ggtctacaga gtgagttcca gaacagccag ggctacacag agaaaccctg
tctcaaaaaa 75600aaaacaaaaa caaacaagca aacaaatgcc agcatttggg aggtagagtt
aagaagattg 75660ggagtacaaa gtcgtctcag ctagtatgtt tgaggccagc atggaccaca
tgagacgttc 75720tcaaaacgaa agaaacgaat gaatagataa acatttgagt gtccagtttt
ttcctttctt 75780tcttgctttg tttttggcgg tgctgaggat taaacccagg accttgttca
tactaggcaa 75840gcattctcca ctgaggaaca ccctggcgag tgcctagtct gtctgtctgc
ctgcctgcct 75900gcctgcctgc ctgcttgtta tgtgtatgag tggtaacctg catgtctgtc
tgtataccac 75960agacatgcct ggtatctgca gaggccagaa gaggatgttg gatcgcctgg
aactgggatt 76020acaaatggtt gtaagctgcc atgtaggtat tcagaattga acctggtgct
ctgaaagagc 76080agccagtgct cttgttgttg gttttattgg gggcaggagg tagttatttg
gttggttggt 76140tggttggttg gttggttggt tttcttgaga cagggtttct ctatgtagcc
ttggctgtcc 76200tggaacttgc tctgtaggct caaactcaga gatctgcctg cccctgcctc
ccgagtgctg 76260ggataaagtc atgtgccacc aactccagac aagcagccag tactcttaac
cactgagcca 76320tcattccagc ccttctttgt gttttgagat ggtcacaaag tacaactcag
actgagctct 76380tgatcaccct ccctcagcct cctgactgct gggggttaca ggtgtgtcac
tgtcctcaat 76440tctgagtgtc agatcttgaa aacccattct cgtgaccttg atccttaaaa
caaaccctgg 76500gagaatgagt tctgataact atttctcact cctcttcaag aaaaggaaag
ccagagaaag 76560gggggggggc aagccccaga aacattgata acttgcccaa agttacacag
caaaattcag 76620acagcctgca catctcagtg gccatctgtg ccatatccac cctgcccttc
tctgacctcc 76680ccacctccat ccctacagac cttgcagttg agatcagagt ccaagccgta
tcgtaagatg 76740gccttaggat ctgacatcat ggggactctc acggtcctgt cctcgtccaa
atgaaaatcc 76800tggagggtcg tctttctcga gtcaaacttg gttacccact gccctggaag
gaaacagatg 76860gagcatcctg agccactgtc cccagaaagg ccacaggtcc acgctgtgcg
tccactggcc 76920aaagcaacct gagctgtcag cagcaagaac acaggagccg ctgggtccca
gcatgtgtgg 76980cacagaccat aggctccatg caccacgggt tctggctatc ctcctgtagt
aaactcagaa 77040ataagtgggt gttctctctc tgacttggat caccacgctc cttctgttta
aagtggcctt 77100taatatgctg gtgtgtggta cgtgcctgct ctcctgtccc ctggggactt
ggagtaggaa 77160gccccagggc tttcctctaa gttaggatcc actcttgcta ctactccata
agatggtcac 77220aaagcaacgt aaaatggaaa ttaatcaaac cattcctgcc acaagaataa
aacagatctc 77280aggggaggcc tgtggaaggg tctcctgagg ccttaccact gtctagaagg
aagttgacag 77340cagttcttga gcaggggtgc gactccagga gttgggggct gctctgagag
caggacagca 77400tgtattgtag agtgtctggg agggagctgt gttatcctta ccgtgaagat
gaggacacgg 77460gctcatgggg gcagagccag gattaaacct ggtctgattc aaaaagccag
agatctgtgc 77520ccagccccac gcagccattt cactggtcaa ctaattcaga aacacttggt
ctgatatgct 77580catatgctac aagcactgtg gccttcagat ctccctctgg cctggtacct
gcattcaggt 77640tcaccaccat caccacacac acacacacac acacacacac acactcggcc
agagacaagt 77700ggggaagccc tcacccttga agtaagccac gccaaggaga aggatgctga
gggcactggg 77760catttccctc gtggaccggg caatcttccc tttcatctgg gcctgcaccc
agttgttaat 77820ctcctgaagg tctactcgag ggttgcccgt gaggatccgg ggcctggtcc
cataggactt 77880ctccagaggg gcaacaaagc tggatttgac tcgaagttct tgagaggaaa
cagatcaaag 77940atgagagctg aatcagcacc ctcactttga aagcatgcca gaccccagct
tcctgctcag 78000catcttcctt tgacttgctg gggcatctgc cggcttgccc agaccctggc
tagggaacag 78060tggattccac cgtttgcatt ccccgtccca ggccctcctg ctgtctccca
gagcccactt 78120cctctttctg ttcctctgtg gtctcactgg ctctttcctg cccaccagtg
ccaggcctcg 78180cctgagcaca cacagcctat tgtttagaca tcatggaagc atacagacaa
cccaggccaa 78240tgaagcaact tcacgccagg cataatgggg cgtgcctgcc cttcagaagc
agaggcagct 78300ttatgagttg ggggaccagc tgagactcta tagactttga gaggtggggt
gggggtgggg 78360ctactgactc ctctcaaaca caattctgga agcactcttg aggttcttct
caggggcagt 78420aacagaggca aggagctcct tgtaggtgct gtggatgtca gggttggtga
tcaggtcgta 78480gtagagagcc cggtgaatga cagactctgt tcgatgttca gctcctgcca
gagagaaaag 78540gatgccaagc ttcataactg cccgtgaggc ccgcatcagg atagggacgt
tagacatcaa 78600tccttttgtc ctctgagagc ccgaggaggc cgatattgca gatgttttag
ctggacaaga 78660tcttcagggc gtggaaagaa ataatgaccg ccttgctagg aagagctcta
agacagggca 78720aggttatcag agctacagag agaagagtgg gatgtggtcc tgaagttctc
ccatcgtaac 78780ctaccctgtt ctgaggagga gccagctctg ctcacggcag ctgtacccct
agaacctggt 78840taaatgacta aaacacgata ggaggccact taaggaacca aggtcgagtg
ccacttacaa 78900agtggtaggg attgtgtgtg tggcccccac cgcccctttc ctgttcctct
gacggcggca 78960gcatggaaac tctgagtggg ggaaattcag gtccacctgc agccttcttc
agttgacact 79020cacccagaga aagggcagag agggccgtgg ccacgctgag tggagacagc
aggacgttgc 79080ccgttgggct ggcactggat ctcaggcggt acagatcgta gccgaagttg
gagacagctg 79140ctgccagctt gttcacaggg accttgaaga aggggtcctc ctcctccacg
ggctcgcccg 79200tgctgtccgg gactggggag ccctgggtta gaatacaagg accagtaggg
aggcacagtg 79260agtacatcac ctcctggttg ggttggtcct ctagtccctg gggccatgag
tctgaggtca 79320gaatgagtgt gtgctctctg actccacaac ctgtgtgctg ggaggtgggg
agtgggaagg 79380gcaacacaaa agggcttgcc agacctgaac tgtggtctga gaacctgaag
cctggcccac 79440tttaaaataa aacttgtagg gctggggaga tagcacagta gataaagtac
cagcatgcaa 79500gttcaaggac ctgggttcag tccccagagc tgggcacggg ggtgcatgct
tataatccca 79560acactgggga ggcagagatg ggcaggtcct ggggctcatt ggccaatcag
cctgaactaa 79620tcagcgtatt ccatctcagt gaggggtcct gtttcagagg gcctgaggaa
tgactctggg 79680ttgactacta gcctactctg tgtctgtttc tgtctgtctg tctgtctgtc
tgtctgtctc 79740cacccctctc tgtccccttc cctctgcagg gaacttcctc accaccacca
acccccaaag 79800aaacccaccc tcagaccagt cttccctatt cagcttgctg gctggtccta
gtctgcctag 79860gttctgctgt gacgccctcc ctgtctttcc tgacaagcca tcccctctga
ctagacccga 79920gaggaatttg tcgttttctg acctgttttc agtgtcagcc tcttccttat
gagactttct 79980gcttttttgt tttgttccca gggctttgga tcaaagctgg gctcttacat
acgttaggca 80040aatgcttggc cacccagctg tacctcccgt ccctgttgct tttcggtttg
gaggactttt 80100tttttagttt tctgtttggt ttttggtttt gattttttgt tgtttgtttg
tttgttttga 80160gacaggattt cgctatgtga ctctagctgt cctgggactc actatgtaga
ccaggctggc 80220cttagattca gagatccacc tgcctctgcc tcctaagtgc tgggattttt
agattttaat 80280ctgtacctac caacctcaaa ggaagtgtcc atggatagag ttcagtacta
catcatgtgt 80340gtaacatgtg tgagggcctg ggcttcaccc ccaacagaga agggggagtt
agtaggtgag 80400gaataagtga ctggctagtg gcaagacagt attgtctaag gtcactaagc
cttaagccac 80460acttaaagcc cacaatccag gtctaatatg cccatctgcc ttgtccttgt
gtgacatgac 80520cccaccccta cttcctccgt atagtggcag ctcctctgga tcctgaaagg
agagggaaga 80580tattcttgtc tcgatgttaa agtaaccaag gcctagaaga gtgaaggcca
aagcccaccc 80640tggatccagg gctgcctccc tgcactgtct cttctgctgt cccacctacc
caccctactg 80700acctcagagc tgctggggac gttctggctg ctgccgtgcc cgagcagggc
tccagtccag 80760aggagtagca ccagggcctg catcccggaa ctacaagaga aacaagagag
caaacgactc 80820ccctcaccca caccctcccc tgccactgca cattgcacac tgcacaggga
caagagtcag 80880gcagagtgag cccttcccct ccctccagct ctcagcccca agtggaccct
tgacttgagg 80940tcttccgtcc ctgacctgcc cctgcacttc tccttgagct gtgcccccat
gttggttcct 81000atcaggagac ccaccttccc atctaagctc cagcacaggg aagacccagc
agcaggctct 81060tcaggcccca agacaatgct ctagcacaaa cacacaccaa ggcttttccg
tggaggcaca 81120cggccagctc ctttggtagg atttggaacc ctgtctcagg atgggagcag
agcccaggtc 81180atagacttac agaacatctg gtctggtcct gctatccacc aatagttctc
tgaccaaagc 81240ctatgttaaa gacacacaca cactttcttc taggtaggtt cttgtgtatg
tagcccaggc 81300tagccttgaa gttgcagcta ccctacttca tttgcctcct gagtactaca
atgccaggtg 81360tgagccatca tgcctgactt gactcccttc ttctatttca agagcaattc
ttagttaaga 81420gggtatgaac cagggccaca ctgccnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 81480nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 81540nnnnntctta tttttagctt gtttgttttt cttacttgag acctggggag
gggaggtgta 81600tgtgtcttcc atttgcttct tctacctaat aaagtttctt ggtgatgttt
gggggggggg 81660gagggggtag gactcaagaa gggattctcc cataagctgt tccgtttggg
tagtactatg 81720taaggaagtt acaggtgggc agagctcggc tctgcctgac tggcgtgctc
tgaggtaaag 81780gtgagatggt gcaagatttg ggccctcagg agttggctct gttggccctg
taccttctgg 81840tctgtgggta aggatgacca gtaggtgaga gatgagggaa ccagaacaga
aggtgaaagt 81900tagtggggcg gagccccaga ctagtcaggt gggggtaaac tagatgactt
tctggaaccc 81960caaggggctc ggagactagt ggtgttggag aagacctcta atgtgttgta
aggcctctga 82020actcagtagc cgaacttgat gccagaaagc cccaaactgc taaacccaag
caggagcggg 82080acgccatccg ttccatggct tcacccgagg tggccccatg gctgcgccaa
tcaatgagca 82140gccgagagat aggggcgtgg acaagccagg aaaagttaca gcacgctgga
aagataatac 82200aggccaggaa gccccaggca cagcagggtg gaaaagctag atcccgattc
tgccggaggg 82260gggccccttc gaggtcccgg gcaccgggtg ccaggatcag agaaactgac
tgaaacctag 82320ctgacctgcc cagaccatgg catcctgggg actccttgtg gctggcgctt
ccttcacggc 82380gtttcgggga ctgcactggg ggctgcagct gctgcccacc ccgaaatctg
ttcgggaccg 82440ctggatgtgg cggaacattt tcgtttcgct gatacacagc ctactctctg
gagtaggggc 82500gctggtcggg tgcggaactt ggggactgac aaagcactga ggggcggggg
tggaaaagag 82560ggcctggaag actgaagttg gaaccttttg gaatggaact ggtttgggtt
gtggatgggt 82620gggagtaccc agtgggagaa tggatctagg tctgggagaa attgacctta
gctctttgtc 82680ttctccaggc tgtggcagtt tcctcaaatg gtcaccgacc caattaatga
tcacccaccg 82740tgggcacggg tcctagtagc agtgtcagtg ggtgagtgta cagaaaaggc
tgaatcggga 82800aaggccttgt tggaccggga attctaggtt cctcccccat ctttggaatg
gagcagatgt 82860tgctggaggt ttgctgtgag gaattaagga cctgagaaaa gtgggacttg
agatatctag 82920gctgtgcatc agctctgagc gaggagcctc atagtcttct ccggtgcctt
caggttattt 82980cgctgcagat ggagttgata tgctgtggaa ccagacattg gcccaggcct
gggaccttct 83040ctgtcaccat ttggcggtaa gactctgaag ggagaggcca ggtagtaagg
gagcatgtcc 83100aactcaaggg cccaacctct ctcttcagtg ttctgtcctc tgacttttcc
acaaagcccc 83160ctgaaaacct atcctctcag acttggattg agttggaggg aggttttgac
tggctagcca 83220ctcctgggca ctgcccaagg agtttggttc tccccacaaa cctccagctg
atcataaaaa 83280aaaaaaaaaa aaagccagga atgaaagcta gggtatgcta tgcaaatagt
gtggcttggg 83340gtaagagaac ctctggtcca gggctgctca tgccccctag ataagggtca
gcagaaaggt 83400caggattgga ggcagtccta aaaaatgctt gggtaatata aagtgaataa
ataaaaaata 83460aataaatact aatttttaaa aagctgatac ctggaaggat gaggcagaga
gtagaaaaaa 83520catgcgtggg tgtccctagg ataaggagct gggacttgtt gggcacaggt
catgcaaagc 83580ctgaaccttg aaccttgcct gcaggtagtg agctgcctca gcaccgctgt
tgtgtctggc 83640cactatgtgg gcttctctat ggtatccctg cttctggagc tgaactccat
ctgtttgcat 83700ctacggaagc tactgctgct ctcccataag gccccatcct tggccttcag
agtaagcagt 83760tgggccagcc tggccaccct ggtcctcttc cgccttctgc ctctgggatg
gatgagtctg 83820tggttgtccc ggcagcacta ccagctgtct cttgctctgg ttctgctttg
tgtggctggg 83880ctggtcaccg tgggcagcat aagcatctcc acagggatcc gaattctgac
caaggatatc 83940ttgcagtctc agccctaccc gtttatcctc atgcacaagg aaaccaagac
acgtgagcct 84000gttgccagga acacttccac tctcagtctg aaaggtgtgg aagttttctc
ttctgtcagc 84060ccccagggag gtggggctgg gaagaggaga tggtagccca ctgcatagtc
tactatgtag 84120caaggactag actgtatcat cagagagaga gagagagaga gagagagaga
gagagagaga 84180gagagagaga gaacattgta tgagatctcc attacagtca ggaaatcagg
agatctaaat 84240aactttaaaa gtcccacagt ctttacatat tcttaaaatt tcaatctctt
taaaatatcc 84300atctctttta aaattcaaag tctttttaca attaaaagtc tcaactgtgg
gctccactaa 84360aacagtttct tccttcaaga gggaaaatat cagggcacag tcacaatcaa
aagcaaaagt 84420caatctccaa ccgtccaatg tctgggatac aactcacgat cttctgggct
cctccaaggg 84480cttgggtcac ttctccagcc aggccctttg tagcacacgc gtcatcctct
aggctccaga 84540tacctgtact ccactgctgc tgctgctctt ggtggtcatc tcatggtact
ggcatctcca 84600aaacgctgca tgaccccttc agtcctgggc cttcaagaga gaagactaga
gcctggcaaa 84660gtggcacatg ctgataatgc tagcacttgg gaatgacaag cagaaggatc
agaagttcaa 84720ggccagcctg ggctacaaga gactctgttt caacaaacaa acaaacaaac
aaaccaaaga 84780agagaaagaa aaaactggac atgacagccg gaacattatc tgacattcat
aaggtcctga 84840gttcaatgcc aagttggcag tgcctagttt gataagggtc tagccactct
ggtaatacca 84900tggctgactg aacaccttac ccagcaactt gctgatagac tctgccttcc
agcaaaaggg 84960aggagcttcg ctgaggagag aacattgaac cctattgtat atgaataaat
tgctgtgcaa 85020atgatttcat cagtctcttg tgaatgtgat tgctttgagt catttttctt
ggctccagtg 85080ttatcctggt ctgcagtgtg gtgtggagtt gtggaagctt tgagttggga
gggtttcctg 85140ttaaggtttc tctggctctt ttctttcctc ccggtttttg ttttgtttgc
ctggtggggt 85200tctctggtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt
tagaagttgg 85260cggggggtgg agggggctgg agagatggct cagcggttaa gagcgccaac
tgctcttcca 85320aaggtcctga gttcaaatcc caacaaccac atggtggctc acaaccatcc
gtaacaaaaa 85380aatctgatgc cctcttctgg agtgtctgaa aacagctaca gtgtgcttac
atataataaa 85440taaataaata ttaaaaaaaa aagaagttgg catggatgat gtagtgaaga
ctggcattag 85500atatctctgg atccccctgc ctctacctct tagacactgt gagtatggaa
gtgtaccacc 85560gcaccaggcc aggctagaac attctctgat ctacaaatac ctagagtatt
attcctctat 85620gatcagaaaa cagacccagg gggccacaga aatgtcttag taggtaaaaa
cacttgcttt 85680caggcctgat aacctgcggt tttttgtttg ttctgggggg cgggagaggc
tggctggctg 85740gctggcctgg aattcacaga gatccacctg cctctgcctc ctgagtgtca
ggtaccagga 85800tcacaggtgt gtgccaccac acttggccta actgcctgag tttgagcatc
agtactcaca 85860tggtactgag gatagaatag actctcacca gctcttctga cttccacatg
tgccctgcag 85920catgggctct ccttccccaa aggaaaaata aatgtaagaa ttaaaaaaaa
aaaaaaaaag 85980caaacccagg tcttgtgtga tggctcagca tcaaagctac ctcccgccac
agctgaccac 86040ctggtgataa cttatagcct tgttatgctc tcctttgacc tccacgggca
tgctgtacac 86100gtatgtgtgc ccacacaaac acacaatcaa gaaataaatg cagccaggcg
aggtggcaca 86160cccctttaat cccagcactt gggaggcaga ggcaggtgga attctgagtt
cgaggccaac 86220ctggtctaca aagtgagttc caggacagcc agagctacac agagaaaccc
tgtttcgaaa 86280taaccaaaaa aaaaccactt taaatattat ttttattttg ttttgtttat
cctggaactt 86340ggtctgcaga ccaggctggc cttgaactca cagagatcca actgcttctg
cttcccaagc 86400acattaaagg atgtcccacc actgcctggc taaagattta ttttttcttt
ctttttgttt 86460tgttttgttt tgttttttct aaaaaatttt tttaaaaaga accatccctc
ctagcactca 86520ggagactctg aagtcagggc cagccaggtc tactgagtga gctctagggc
agccagggct 86580ccacaaagaa accctctctc aacaaacaaa caaaagagaa cagacccaac
cagacctgag 86640gacacacact tgtaatctaa gcccttgaga ggctgagaag ttcaaggcta
gccacaagtg 86700tgtggtgcat tcaagagcag cctgggtggg ctacagaaaa agaaagaggg
agagagagaa 86760tggttaatga agatgactct ggaaaagtga aactcaagag aaagcccctc
agatttgctt 86820aagacgagtt gagggtggag aaccgccaaa gcggacgagc cagacagaga
ctgccaacaa 86880agttcaatcg gttcaggtac attacttcca aaacgccatt gccacatcag
gatgcttcaa 86940tcagccaaac caacgcagcg actattgact tctgcatttc agagacttcc
gtctctgtcc 87000agggcaatgt cactttagct ttcctttgca gaaaggaaaa gtccctgcct
ctgatgtggt 87060agatcctcac acaccttctg ccagatccag acactggtat gactcagcct
cggggagctc 87120tatctacaga gataagggta caaggcgtgt gtgtttaaag tatgtgttta
aaagtacaaa 87180gtgagagtcc ctggaaaggg ctccctgccc tcaccatcac cgaaagcaca
aaccttaggg 87240taatatctga cattcctgga aatgtatgta tgtattcatt atgtagccct
gactgtcctg 87300gaatggggta taaaccagga tggcttcaca tctcagagac ccatttgcct
ctgcctccca 87360agaactaaga ttagaggcat gcactaccat acttggctca tgatttactt
aactttattt 87420tatgttcacg aatgttagcc tgcatgtatg tgtgtgcacc atgtgcatgc
ctggtgcccc 87480agaggccaga agaaggtgtt ggttggattt cctggagatg aagtcccaaa
caactgtaag 87540cagtccaatg tgtgtgctgg agatgaaact tggttcatcc acaagagcag
tatgtgctct 87600taactgtgga ggcatatctc cagcctcaga tttcccagtt aatgtttgct
ttcgcaccca 87660ggcccatctg cgcatgcgct ggagacctcc tttaccgcct tgagcctcat
tggccaattg 87720tggctgggag acttgcagat cccaagtggt acaagagaag aataaactgg
tgtgctatga 87780actcacctct tctctgtagc cattggctga gcatactttg cctcaaccta
ccgcccttcc 87840ttcccctaat cctaaatctt tgccctctcc aaatgtgctc ctcccccgca
gtaatccagt 87900ggtcgctggg gctctagaga gatggggggg gggggagcaa cgggtacagc
ttaaggcagc 87960tgcagcagaa cttttttgct gtatattgag tcttaaaaat tcatataaac
tttgtgttct 88020gtttctaaat ataaccccat ctgtttcaac acaaaatgca acaacaaaat
gtttcaaatt 88080gctatttgga ataattaaaa aatttcaata cttgatttaa aaatgcttta
actttttaaa 88140taaattttaa atgttattat ttttaaaaag ttacaagttt aaaaaaaaga
aagatagaaa 88200tcacataatg aaattaacca tacgcaagtg aggctcggtg cactggtaca
cagttacagt 88260agccatttgg agtggaggcc atggcgcttc acattgaatt ttatactttc
tttatagaat 88320attttttatg cacctatcta ctactgataa caaaacaccc atgagagagt
tagaattaga 88380catcaattag ctttgatcct ctgtcataac tcgtgtccac tccctgcctt
agtcctacct 88440catccctgtc ctcttttcta catcttatac tgaatccaca cactcagttg
tttacacaaa 88500cacatacatc actgtccann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 88560nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnag 88620tgcctggctc tgttttgctg ttgttgtttt tgtttgtgtg tgtgtttaga
tttggtttgg 88680tttggtttga ttttgttttt ttttagagaa tcttactatg tagctcaggc
tgtccttgaa 88740ctcacagaga tctcttgtct ctgccttcca agtgctgaga ttaaaggtat
acaccacctt 88800acctggcccc tttcatctat ctatctatct atctatctat ctatctatct
atctatcatc 88860tatctatcta tctaaaattt atctgtgtgt gtctgtgtgt acattcccca
gagcctgtgt 88920ggtagtcaat aagtaaccct cagaagttgg ctttctctaa tccttggatc
aaacttgaat 88980tgttaggctt ggtagcaagc atgtttaccc actgagccat ttatgacccc
atggcccagc 89040atcttccatg ggttctgggg acacaaatgt gtactttgat gtttacagga
caagcgctta 89100accaaccaag tcattttccc agccccatcc tgactcccat taagtgttct
ttcccccaac 89160ccaggaccaa atctagagga gtgtccatgc ccaacaaaca ctctgccaag
cctctcccct 89220tactgctctt ctcccttccc ttccttcatt tcttcgttcc ttcttttctt
tctttttgaa 89280acaggtcttt tctctgcatc ccaagctagc cttgaacttg tgatgtagct
caggctggct 89340ttgaactcac agctgtcctc ctacttcagc ttcccaaaca ctgggattat
agacctatgc 89400taccacacct ggctcatttt tcaaataaat aaaaagaaaa tcaaaaagtt
cctagaacag 89460tcacaggatt cacaaaaact ttggaaggag actaaaaatg gatttttaaa
aaatgcttga 89520agcacaaaga gttgttgaaa gaagagagaa gaggaaaagt tagcttagta
ggtagaagtc 89580aatcaagcct cacaccctga gttcaattcc tgaccctatg gtagaaggag
aagagcaatg 89640ccggaaacat tatcctctga cctccagacc cgctgtggca cgtgcatgca
cacacacaag 89700caggagccct tggagggaag tcctagaaat gaatcttact gaagcaggtc
tgccaggccc 89760tgtgctcagc cattttattt ttcctttgtg tacccgacac gcttccattc
tcaaagttgt 89820gagtctgaga ggaagtactc actgtgtccc cagtgagctt ctgtcttacc
ctgggtcact 89880tagatggggt cacttagtgg tagccttggt gtggagaaag agaacacagg
tccgagtagc 89940cagtagacct gagtctttat atctgcaaag ggtgttgggg cataatcaaa
tctccccccc 90000tccccggggt cctgatacca ggttgtatta agtgtatgtg catggtcctt
ccaagtcttg 90060acacatcatc caactccaag tggctttctc atttttcctt gccagtagcc
tcttggtgag 90120gaaatggctg aggaaaacag agttgcagaa agacagggcc atggcctggc
tgcaggcttt 90180ctctgagtct gaagagggtc agcgactctg agaaatgaag ctatttctga
gtgagagggg 90240ccaaagaagg aacacggcag agggagagcc cccgaggaga tggagacaga
agccggagag 90300ggaccctgtg cgaggctgga ggggaggaag aggggggagg agtgagaccc
actgtcatct 90360gttgggcaga gaggggctac attcatctgc agtatggtgt agaggggaca
gagagtgatg 90420gtaacaggaa aaatttgggg ttgagggggg cagcctgtag ggctgggccc
cagcagtgta 90480cagctaggta gagtacacag taactcccag aattctctgg ctccactaaa
tccctgttcc 90540gctccgtgca gagtaaaacc cacacagggt ggatttcagt ctcctttgca
cccccctcca 90600ccccccctcc acccccagct ctggtcacag ccagtcagag ttgggggtgg
ggcagatctt 90660gtaaaagagg ctggtgagga catcggaaag tctgtaccct ccactagcaa
agtgccagac 90720gctccgtgac actttaaatg cctcagataa aacagtgaga gactctcctg
gtggcaggca 90780agatgatggg tcagggacct cagcgcctct gaggctcaga caccaggata
aagaataaaa 90840acaccacgga gacccctgtg accccctcgt gcagagggag aatgccaatg
tggcccagct 90900agctgctgga gcgcaagcct caaggcctgt gctagttatg agtctactgc
tgctgccttg 90960tcccaataaa cccctttccg cccaggatta gtggacacgc cttgctcaag
ccgagtccct 91020gatctgccac cttatcacac acatacaaaa atcccttgag gatggttacc
atcctgggac 91080aaagcctcat tctctgctta acccaagtga cacctatatg gcagatccct
gtgtccttct 91140cctgatgata acaacccttg caatccaata gaggggaact cgggtttctg
tcagcttcct 91200ttatgctgat agaaatgtac tctgcatgtg gggagcctgc cttgctcacc
ctgagacccc 91260atggggctgg ctggggcttt gcacatcatt gggactcaga gatgttgact
acatgaacgt 91320cccacacttg gttgcacaag gcagaatgac aggatgttat gcctggtgtg
tgagtgtgtg 91380tgtgtctctg tgtgtgtaaa acgcctctct ctggagccct cctgtctgtc
tgcctcttgt 91440tcaatggctg cacaattgtc ctttctcttt ccaaggacct ctgtatgggt
gtgtccttca 91500ttcagtgcct ttcctctgtg ggtttgtcct gctagccccc tgtcactgag
aaagtcttct 91560gtctgtcctt gggttgtctg gctagaacac agacatcatt gtcttttttt
tttttttttt 91620ttttttttaa agatttattt atatgtaagt acactgtagc tgtcttcaga
cactccagaa 91680gagggagtca gatctcgtta ggatggttgt gagccaccat gtggttgctg
ggatttgaac 91740tccagacctt cggaagagca gtcgggtgct cttactcact gagccatctc
accagccccg 91800acatcattgt cttgcccacg actgctctcc agaatggtgg gcaggaggat
gtgacccccc 91860acccccaggc accgggacac aacatcttct acacgtgtag gtcttgtgca
ctggctttgc 91920tttcttcttc caagcaggtc tcccaggaaa tggcacttac agagattgaa
gagtttaata 91980catgtctcgc tgcctctctt ttcgggaacc ccccagaggg agcagcagaa
accagggctg 92040gcaggggctc taagctgcct gggcaaagga gcagggggta gcatggagcc
ttagccaatt 92100tggaaagcac tgtgacccaa gcacattttg cagcagtaat gtcaaattct
gccgttcagg 92160catgccattg atgtgcacgc tgccacacag aaaccagtga cacaaaggca
cagccttctc 92220caccctcctg gtgcttagga actaacggct ctaatgagaa atgagagctg
aaaggagaga 92280gacgggggcg ggccacagca gcgcaggctg gcactgcgtg ttggaggagg
ctgacccact 92340tctcgtagag gtaaggggcc cactgaaatg tcacttaaat tagccaccac
tcccaacact 92400agatctcctt tgtccccata cctcagcccc acgcttcttt ctttttttct
tctttttctt 92460ctcctctggg gcagcctcaa gcccagcacc cactttttag agctgtaaac
caccctggtc 92520ctagaagccc tcttacgtta ggggatgaca ggaggtagag atcaggaagg
agggagggag 92580gggaggagga aaggaaaagg gaggggagag agggaaagag atcgagagag
catgcattca 92640tcacaaagag ccctcttttc tggctttttg actgcactgt gagttattta
gccaacaata 92700gatgtttatg tattttttta gaacccgtat ttattaacag cctgaaagga
gagagacgga 92760gatttatata ggaagtgcag tgagttaagg ggggcaatta agagagcaga
aagagatacg 92820gaacacagac ttgtaaaggg ttttgtaaca tccaatcaaa ggtgcttcag
gtattttcca 92880aggaagcaga aggtaaaaaa aaaaaaaaat tgtcccatta gaagctgaca
ctggatggag 92940caatggccca ggcggaactc ctgcttgaaa gaaggtgaga agggagggac
acagaccagg 93000atccgatgag ccagagtgtg gccatagctg ggtcatgagg cccagggttg
gaaggacccc 93060actaaagtgt gcactggcct ttccttgaca aaggatgcac ctatagctag
gcgtggtggc 93120aagtggttgt tattctagta cttaggaggc tgaggcagga ggatcaccat
gagtgtatgc 93180ccagcctgga ctgcatagca acacccagtt tcaaaataac aacaaaagga
agtgggggtg 93240gggagggcaa catttggaat gtaaataaat aaaacatttt ttttaaaaaa
agaaaggggc 93300tagtgagtta gttcagcggt taagagcgct gactgctctt ccgaaggttc
tgagttcaaa 93360tcccaacaac cacatggtgg ctcacaacca tccataagga gatctacgcc
ctcttctggt 93420gtgtttaaag tcagctacaa tgtacttaca tataataata aataaattct
ggagtgaggg 93480ggccagagca agtagaggtc ctgagtttaa ttcccagcaa ccacatgatg
gctcacaacc 93540atctgtacaa ttacagtgca ctcatataca taaaataaat aaataaatct
ttaaaaaaag 93600aagaaagagg gtggggcagg ggagggaaga agaagaaagg taagaagcta
aataaaaggc 93660acagagatga gcttcatgtg gaaacacagg cctgtagtcc tggcactcag
gtggggttgg 93720ggggggctac agtgagagta tcatgagttc aaggtcaact tgggtgagac
cttgtctcaa 93780aaaatacata ngcnaaaaaa aaaaaaaaaa acatagccag gcatgatggt
atacatttat 93840agtcccagca cttagaggac tgaggcaggg cagaaagaaa aggaattcaa
gatcaggctg 93900agctgtatgc agtcctgatc ctatcccctc cccccccccc ccagagacag
acagacagac 93960agacagacag agagaaacac aaagaaaggg gccttcagat ggctcagcaa
ttaaaggcgc 94020ttgctattca gaccccatga cctgagctca aagcctggga cccaaggtag
aaggcaagag 94080ccaactccac agagctgttc tatgatctct atatgaatgc tggggcatgt
gcctacacta 94140tgttgtgcac acatgcacag attagaaaaa gaggaggaag aaaaacataa
gattgtttca 94200agaaaagaaa ggctggcttc ttccacgtca gtgtgagagg agggtctggc
ccctttgtag 94260ccaggtcctt cccagtccag tgggggctga actgaggcag cggaggaggc
aataacggag 94320ctttcccaac gcagtgtcca gcaaactcaa ctctacagcc tgtcctgatc
cacagagaag 94380ccttcctggc tccctcacca atgcgggggc attggctccc aggctcctgg
gccccccccc 94440acacctgtgg agtgctaggt gatttgctaa tgttgggcaa catttgccca
cgtggggttc 94500ttggctcttt ggtaatagac atgcctagca ggagggcgga gcttggaggg
gggagtcctg 94560gggttgcccg tggctccctg cagctggggt gtctggccag ctgaagaagg
agccatggca 94620cgcaaatggg agagcatgga acagaggctg tggatgctaa gcaatatggg
aggcagtcta 94680agcttggaag cagcaggtgt ctgggaacgg gcctgtggcc caggcagatt
tccagtgagc 94740actccagttt tttggcacaa ggaacaagct ggctgagccc aagaggcaag
tggtgataat 94800gaaacccgca gttgaggaac agcgggtaag ggtgccatgg gagcccatgt
gctcatgaag 94860aggctggggt gtgaagaaga gcccatgcag ggaagccaca catcccctcg
agttccaggc 94920agaggcagag tccctgagtg gggctccctg ggtctcccct tacctaacca
gtctcccggc 94980accccagcaa acaaaatccc atccataatt tgaggtttat agagacctca
aaggctgagc 95040tactgtgtgc cactaaccat cagcctaacc ctcccccact gtcttctcta
gctgcccctc 95100tttcttctga gactgtgata gtggcgggga cgggttggga gtgtgtgtga
agccctctcc 95160gactctccaa ccccagctga gccccttgtt ctgcagctca gtaacacagt
aacacaggct 95220cagttctaca ctggttgaga acactcacgg ctctctcagc tccttagaga
gcctgttttc 95280tcattttcct gtccccaaag cctagacaat ggctggtcca tttgtaagct
tatctgagga 95340tgccaggggc caccccatgt ctccactagg ctggcaatgt tctctgtcac
tgtagtacag 95400aagactgcct ggtgggaggt gagataagga aagggatggt ctcccctggg
gttcccacac 95460agtgctgagc ggaaaatggc agaatgggct gggaggtaac tctgttgcta
gagtacttgc 95520ctagcatgtg caaggaaggg cctgggttcc atccccagca ctacagaacc
caggcgtggt 95580ggttcatgct ggtattctca acattcagga ggtacagtca ggaagagcag
aagttcaagg 95640ccatcctcag ctacatagct agcttgagac cagcctgggc tatgtgagac
tttgtctcca 95700acaaacaaca acaaagcagc agaaggccaa ctggcaagag gagtattacg
taaagtaaat 95760ccatctcaaa aagcaagtag catgtatctt ctttcatttt tttttacatt
ctataaaggc 95820ctatcaagtc atgtatatat gcatgtatgt ttgtatgata tgaaagtagg
gggctggata 95880gatggctcag cagttgagag cacttggatg ctctttcaaa gaacctgggt
tcaattccta 95940gcacccacat ggcagctcac aactgtctgt aattccagtc tcaggggatc
tggcaccctc 96000acacagatat ccacgcacat aaacaccaat gcacataaaa taaataattt
ttaaaaaaag 96060aaattggaag taaaactctc taaggagaca aaagggactg aggggaagtg
ggaggggcat 96120gaagggggag ggcataggtg tgtggtgtgt ttaacatgca gaatacactt
ctataaaagc 96180tttggggttc attatgcaat gtatacatgt gtgggtgcaa gatgtaagct
gtgcatatgt 96240gtgggggcca aaggtctcct cctcaatccc tctctgcctt attttcattt
aaattataat 96300tattactatt agtgtgtggt gtgatgtgtg tgggtgtgtt aagccctcac
ggcaatcaga 96360ggatgtctgt ggtctgagga tgctctctta ccatgttcgt gtgggttctg
tggatggaac 96420tctggtagtc aggtttgcaa agctagtgtc tttatctgcc gagccacctt
gctggccttc 96480aaccttattt tttgcattga acatggaact tcctgagttg cctggacagc
aagtccccaa 96540gaccctcctg ttcctgcctc ccccntgtcn nnntcacang aggacacacn
gcttantggg 96600tntccggatt gctgcncacc tccccgccnc ccnagcctcc tgcctccccg
cccctcgccc 96660ccgctggncc ctcccccccc cccccccccc cccccccccc cttccccccc
ccccnnnnnn 96720nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 96780nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnncagaca cacacactca
aattaaatat 96840agctctaact gctgttaaat tcacactcct tcacatcccc acgctaggac
tctaaggagg 96900caccagcaag gcccaggtcc agcttgactt agagcaaagc atcctccccc
ctccacacaa 96960tggaaacgga cggaaagggg catggaagca gaaccagaca acagcagcct
agccaagccc 97020aggactctgc tccttccccc catgcctgcc gtgcaactgg ggaggcaaag
ccccagccgg 97080tgctttctga ccgcttagcg gaagacaagg ggagcctgtg attatgattt
ctgctgattt 97140gcaatgaaac actaatgcag tgggcttttc attaagccag atttattcaa
tctaaagatt 97200ttatttcctt tatgtagaaa gtgcatcttt atatgttgtt ggaggagcag
agatgtgata 97260aaaagaaatt tctcttatga actaatagca ctgatacata gtggtagcta
tgcctaggcc 97320tctctctctc tctctctctc tgtctcctgt gcatgtgtgt gtgtgtgtgt
gtgtgtgtgt 97380atgaatgcac acaaagtagc ccccccccat attatttctt ctgtgggatc
tccagactca 97440gcaaatggtg gtgactggga agtctggcca tgcaattctt gccttttctc
ttgccagccc 97500aatccctttg cattcaaacc cgggctgctt gctgtggcca gccctttcac
ctggagtcct 97560tcctcctcct tacctgtctt cccatccttt gcagacaatt atcctcaata
actagccaat 97620tacccttaag gacaattata ctcttccatc agcaaacacg ggtgttcttt
ccttgagtct 97680tttgatgaag tcgatattaa agagatgctt tatttacata aagtcaaata
gctccctttt 97740agaagggttt gggttcgatg tcaaagtttt aaaatcttaa ctagaggatg
ggtgtagagg 97800gcttttggct agggtagaaa agagatggag atacttattc tgatgttgct
ttaaaaggta 97860ggatgcccag agaaggtgga aggatggggg agggagggtc cctcctcaag
ctaatgaatc 97920taaaagcagg gatgagctgg gcgctaggag tggaaccagt cagaagtgtc
tgcctttgac 97980tgaccacagc tcctgccctc ccctccccca gtctctctgt gaaccgccag
cattaggagc 98040taatcgcttc agaaagccag attggaatgt gttgctcacc ctccactgct
cagaaaacct 98100ttattccagg caaggactga cccaaaccga tcatggcatc tgccaatcag
gaggccaaag 98160gtgccggcag ggcgggacct agctgtgcag aaacagctcc gttatggcgc
gcagaaaaag 98220ctggggggaa aggctaccgt tttatctctt ggcagatggc ttctctcttt
gatgctttgg 98280gccttacctg ttactgcctg cacttgactt gacctaggca aaaatagcag
cgagatacag 98340gttctcgaag ttagaaggaa aaaaaaaaag ccccaaacca caacacaacc
cggaagtgtg 98400cccccgctgt gtttctaaag agctgttttc ttcccaagct ctacagcgtg
gtggctctaa 98460tcggaaattt ctttttaatc atagcaggag tcccaattag cgtgttgggt
aatctttcaa 98520gtagagtggg agttccgtgg ccacagagag cagaggcaat attcagcata
aagccctaga 98580gaaagaggtg ttgtgggcct gtgcacacat gtgtgtcaac gcacatgtgg
cttgtggagg 98640ctggcttccc actctcaaga tgaggtgtgt gcaccccagg ccttttgatt
ctcaaagctt 98700tattaggacc agagggactg tgtgtgtgga ggggtgttgc tcacagtgca
gaaacccaaa 98760cctggcttct ccaggagccc acatgccaac aaacaggctg cacactcttg
ctagtacatc 98820ccctaaaggt atggggatga gggaccaagt gctttgcaag acagcaggca
cagagttctg 98880ggacgctcct gtaccccaga ctcagccgcc acccagggcc agctctgatc
tggcttgacc 98940tactttcttc tgttgttgtt tttggaagtg ctgatgtcaa tgcagaattc
agcagagtgg 99000ctgagtgaga aaaaagagga gagggaggaa aagggggggg ggacgggacg
ggccgaggcc 99060aacaggaaag ggcaggcaac aagacaatga ccacaaggtc cctgtaacta
cactaactgc 99120ttacctttcc tgacccccag ggcttagcca atatagctga gacccagtct
tggtgctgtg 99180gcttcaggct aagtaaacag ggaagagttg gacatgggtc tccattctct
ctcctcatcc 99240aacaagggga ggaggcagtg gccaggcagc catgcccacc gatgccatcc
ttctgggagg 99300agccagacat ttcaggcacc tctccttccc tgggtgccta gaggtgctgt
gtctgcatcc 99360atctgccatg cctgccatct gagagaggcc actgggactt ggtagagagg
ttctccacac 99420atgctggcct ggaggaaatt ggtctttagg gacactgaag gcagtttcct
ctgttcagtg 99480gctccttgga aacccacgtg acagagctcg catgacaact tgccggctct
caactcccat 99540tcttagctgc ctcaagcact gtaaggttta ggagagcccc agatgtaagt
atggatggga 99600agaccctcca gggagtcatt gcctaccctt ctgaactcta acatggtcca
gcttttccat 99660tccacaattg aggagacgcc agacctggca ggggagcaag cctttgtttc
tgacccattt 99720gcaaacccca gccactgagg aacttgcata caagaaactg cctctgggcc
tctcctggac 99780tgagccctgc ctcccagggg acaactgggc aacagatcct tccaggtggc
tgcagtgaca 99840gatccatgct tttatgacat agaaaggcct cagtctcagg atttcacaca
ctgtatttcc 99900ctcatcctgg ggaccaggga aggcgagcat cttctgctcc ccccaaacaa
gtgtgggaat 99960gattaaaatc attttttttt tctgctccat gaactcatac agttttcaga
taccgaggag 100020acaaagccct cctgtgctga aattagaccc cgaaaaatag gttagctgac
aattacttgt 100080ttctaagtgg agtgtgatgt agtggcagga gcgcaggatg ggctgccagg
gctgcagtct 100140cccccccccc aaacttactg tctcttaacc tctcgagtcc ctgggtttct
tgccgggatg 100200ataattctcc ccatctccct cctctggtgg gctggtggaa agcgtaatga
atcaacgctt 100260gaagcacgct gaagaggcca gactcgggat gccatgtaag tacacagcat
cgccagccac 100320ctctcaagtc tacacggagc tgatttattt acctcccgtg aaagagacaa
caatcatcat 100380atttacactt catgccgcag cttcctgcgt ggcacggcag caccccctcc
ctctccgctg 100440ctgaggactc catcaagcac gctgccttgc caggatgaca gcagcccact
ctcagcctct 100500ccctggcctc cttacagatc atgacctcct gccccgtgag gtctgtcacc
cgaaaaccac 100560ggtacaccgg gggctgcagc ctctctatgg gggaggctga ggaaatgaat
tccgtaggta 100620aaaggcttcc taggaaatca gacgctgcta gtaattaagg agcgaagcat
aggtgcgtga 100680aaggtaaatg gatgttattt aaatgttgcg tcatttaaag agtgtcctgg
tgcttcagtt 100740ccttgttacc atgcagggct gtggacgggt ggcaattagg ctggcacggg
tagagctcac 100800ctgctgagct gagggagggt ggggacacac cttccggtaa ttgctgctgg
gcagctctgg 100860gtctccccac ccccgccccc gccctcactc cccacccccc acttctttcc
tgacagctct 100920ttcatttgca gcagcttaca gggcttgttg cccttaccca gaaaatcacg
ttggaagaaa 100980tataagaaaa agaggaatga aagagaaagc cagaaaagtt catattaggt
tcggatctgc 101040ggccaaacct ggccgagaga atccatgacg gtccgcgcgc atataaccct
gtggcaacag 101100ggcccggcac aacagggccc gccacaagag cttcttgagt tgccacctgc
caggagacag 101160gatgaatgaa tggatcatct gtccttagag cacaagccag gcctgattct
ccaatattga 101220tgtgtgaggg agatgtcaac agaggttccc taaagaatga tgcttctatt
tccatgctaa 101280tcctggggcg tcagcttcag tcggaacagc cggaccgtta ccttagctct
gctgttctcc 101340tgtctgtaac ccgcagaggg aagggcgggg tcacccagca ttgccactcc
ccccaccctc 101400acgtggtcca gacccctctt gggttgatct gctcctgaaa aacagtgttg
gctcaagttt 101460gcctctgaag gtatgtcacc gctggctcag ccagcttatc tccccggtgc
tttcaagatc 101520aaaacaccca aacgaaagaa aaactttgtt tcaagagcag agtgtggtgc
caactctgat 101580caaagtgttt ttcagcatga caactcactg cccgtgacaa ccagtacttg
gctgttgtgg 101640ctcagagtga gatgcggagg gaagtggatg acaacagctg tatccaggtc
caaacagagt 101700agattcacgg ctggcagaaa atggctgaga gccttgggct gcatccctcc
tcccctcctg 101760cctctctctc ttttcaaggt ggtttttgga aatgtccttc ctgtgggttg
tgtgcctttt 101820ccatgtagga cctggggcct gtgcagatgg ccctgtgttc ctggtgctgc
tgttgagatg 101880tgaacgagtg ataggaaccc aggcactaaa cacacaatgt ggttgtatct
gactagaagc 101940aaggcaagag caggaggcat ttgagggtaa aggagtgtaa ggactgtgta
aagagatgag 102000ggttctatct gggaggcagg agtcccaatg ccagcaaata caatggactc
tcctggtcga 102060cccaaccaga gagaattcaa gatggcagag ggacaggctg tctgagtttc
ctatggctgc 102120accgataaat ggtcataagc agagtagagg aaaaccacag acagaaattc
atgccattga 102180gactagaaat ctagctcaag gttgtgtgtg gcagggttgg ttcctgggtg
ttcaaccttt 102240tcacactgtg acatgatgct gtggtctgca gatgtgttgg gctgcatcca
tagctaccct 102300gggacacatt catggaccgc aggttacaca tgctatttaa aaactccaag
ggaagggcta 102360gagaaatggc ctggtagtta agtatgcttg ctgatcttcc agaagacctg
agctctgttc 102420ctagcagcca tgttgggcag cttacaacta actatgactt ctgagctcca
aagctctctt 102480ctaatacata catacataca tacatacata catacataca tacatacata
cgtacacaca 102540cacacacaca cacacacaca cacacacact ttaaagaaaa aaattctggg
ttggagaggt 102600ggctcagcaa ttaagagcac tgactgctct tacagaggtg ctgagttcaa
ctctcaacca 102660catggtggct cacaaccatc tgtaatggga tctgatgccc tcttctggtg
tgcgctgaag 102720acagatacaa tgtactcata tacattaaat aaataaataa gaaagaaaga
aagaaagaaa 102780gaaagaaaga aagaaagtgt aaacgaggaa aattcctaat taaaaaagaa
agaaagaaag 102840aaagaaagaa agaaagaaag aaggaaagga attctgaggg agaatctgcc
ccttttccta 102900acttccaggg ctataggcaa cctgtggcct ggggaagctg tagacaacct
gtggcctggg 102960gaagctgtag acaacctgtg gcctggggaa gctgtagaca acctgtggcc
tggggaagct 103020gtagacaacc tgtggcctgg ggaagctgta gacaacctgt ggcctgtggc
agcatcatgt 103080caacgcctca ccctctgtgc ccaatttcct gttctctaag gacacatgcc
atcaaatgca 103140taggacactc tacatcaaga tgatcttgtc tcaagatgtt taacaaaatt
acatctgcaa 103200agacctatct ttacatgtga ggtcactcca caggttctag acatattttt
gaggagccac 103260catccaactc actatgtgac agagtcatct agagatttgt gtccaggaca
gactggctgt 103320atctgctctg agagtcccct gcctgcccgt gggaactccc cagtggtcct
taagggccct 103380gaggactttg gatctgcaaa gccacatctt ccaaaaccat tttcctcttt
tggagagcta 103440ctctaccctg aaaccctttt ctctgaggtg gcttttagag aggcaggtct
cagcagggca 103500ctgtgcccac aagaagtccc ggggagaagg gacccaaggg ccagtgctga
actatcgctg 103560agactgagaa cattgtgtct cacctaaaat cggtggtcgc aaggaccaag
caggctctat 103620aaatgtctta ctgcctttat tccttttcct ccgctccatc ttactcctca
tttttgtttg 103680tttgtgtgtt tgtttgtttt cttctgagat gtagcccagg ctggccttca
gctcactatg 103740taactaagga tgactttaaa cttctgatcc tttcttccct ccacttccag
agtcctgggg 103800caggtgtgtg ccaccgtacc ccagctttat ttgagactat gattcaggct
ccatacttca 103860tgcatattag gtaagcatgc taccaacttg gctatattcc cagcctttct
ttctttcttc 103920tttgagacaa tgtctttttt tttttaatta tatgagtaca ctgtatctgt
tttcagacac 103980accagaagaa ggcattggat cctattagag atggttgtga gccaccatgt
ggttgttggg 104040atttgaactc aggacctctg gaagagcagt cagtgctttt aaccgctgag
ccatctcgcc 104100agtccttgag acaatgtctt gctatatggc acatattggc ctcaaactca
gaatccttcc 104160gcttcagcct cctaaatact gggattacat gtgagccatg gtgtttggct
tctagccttt 104220cttccttccc tttcccttcc cttttccctt ccctttccct tttccctttc
ctttccttcc 104280cttcccttcc cttcccttcc cttcccctcc cctcccttcc cttcccttcc
cttcccttcc 104340cttcctttcc ctctctctct ctctccccct ctttcttttc tttcagagag
tttctctgtg 104400taatcctggc tgtcttggaa cttgctctgt agaccaggct ggcttgnnnn
nnnnnnnnnn 104460nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 104520nnnnnnnnnn nnnnnnnnnn nnnnnnaaga agaagaagaa gaagaagaag
acaacaacga 104580cgacaacacc ggcgccgctg cctccactgc catccacctg agacaggact
caaatccaga 104640ccaattttta aaaccagtgt ttcaagccgg tacactgaag tagtagtccc
acttgggatt 104700atagctcctt actttgtttt gctttgacgt tttgtgacat ggtgtgatgt
agtcttggct 104760gtcctagaac tcaatgtgta aattaggctg gccttgaact tgcttctgcc
tcctgctggg 104820atgatagact gatggtgtaa aactccactt aggaggcaga ggtgggagga
tcagaaattc 104880aaagtcatcc ttggctatgt tgtgagtttg aggaccaacc ttggctacat
gatatcctat 104940ctcaaaaaga aataaatgta ttgccgggca tggtggcaca cgcctttaat
cccagcactt 105000gggaggcaga ggcaggcaga ttttctgagt tcgaagccag cctggtctac
agagtgagtt 105060ccaggacagc cagggctaca cagagaaacc ctgtctccaa aaacaaaaaa
caaacaaaaa 105120agtgaacccc aacagtactg ccggacagtc tggtgtcttt cctaagtctc
ctttcaactc 105180tgtttaccca ggtgtaccca caaggtgtgt gagcagctct atacccagag
gtgatacggt 105240tgtttgaatg agagaaaagt ttcccatcag ctcgggtgtg tgaatactcg
gtccccagtt 105300ggcagtattg gctggagagg tgatggggag gtgtagcctt ccggtggaga
tgggctttgg 105360gagtttaaag cttcctccac gaagaccact gggctgatct tgttaccaac
agaattggat 105420tggcctgctc ttggctccgg cctcagttta tctaaaattt acatgttacc
tgatcaaaaa 105480ctgtttcctc ccccacccct ctccctgtct gtattccctg ccctctagtg
gtgctggctg 105540tatactacac tggtgatctt gactgtattt cagtttacct cttgttctct
ctctgctgac 105600tctagagatg tcctttcatg gctgggacct ggctcagaaa tttctaaggc
actgagcctt 105660cctccatctg aactttagga aacttcttgg cttaaaggtg tatttctgac
ttagtatgca 105720atgagacctt ggagcctgca ctttgttaag caccctgggt ggtggtggtg
gtggtggtgg 105780tggtggtggc agtagtagaa ctctgatgaa cagttagtta ttcaaggccc
atctagaaaa 105840agaaaggctt tgggtgcaca tggctataac tcagtggcga gagacgtgct
tcccatgaat 105900aataatgatg acgatgaaaa taattctctg tgactgttct ccccacttcc
ctctctctca 105960ccctagctct tatctaccga atccctgcac aagcacccat ggggtttaca
gaatctgggg 106020cggaacgtta gtcacttccc ttcgcctact tcagtattgt gtttccagaa
gtacccattt 106080tggctagtca ctgaggaaaa cggcagctgc ctgtgggcca ccagcccatg
ccaagtgagg 106140tcagcaagaa agaagctgac agcaaatgtg ccaactgtgg gtctgctgga
tttctactgt 106200gctaagtggt ttcaagaagt ttcttcttaa ccccctacaa gaaaccacaa
attttattat 106260ctacactgtt ttgtagatga agaaaacacc attccgaagc tcactgccag
tcagcactgg 106320aactggaatt tgtttggcta attcagtggt tctcaaccag ggtggatttg
gactccccag 106380gggatatttg gggacacttc tggtcgtcat aattgggatc atgtgctact
ggcacctagg 106440gtagaggcca gggtggtact gaccttccta ctatgaacag ggcagccatg
tacataaatt 106500ctctcgatca aaacatcaac agtgctaagg ttgagaaatc tcaggctgaa
accctgtcat 106560ttggccttga ggtgggtggg aggaggttag agagtggaat aaaatcagaa
gggccaccac 106620agaggcctcg agtgaggagg gaacagggct cctatgctag ggataatgga
gaatagggca 106680gcttgttgaa acttttcttt cttccaagct tggctagagc cctgctcaat
ttcccccaac 106740tctgccaagt cagtcccggg acttcgcact aagtttgtcc tggagtgacc
ctgactccag 106800cttggagctg gggcaacaca tttacctggg tcttcccagg agtgggttaa
aagtcaaaga 106860taagtggcat ctgagaagtt aaaagtgggg tgagtgggta aaggcaggag
gaccccatca 106920atcagctgac ctaggaagga agagagcaac tgaagcagca aagagctggt
ccaggagact 106980ggattctgac ccactaggct tatcttccac agcctttctt gtttaggctt
gggctcagtt 107040tccctgcatc atccgcagga gcccctggag cacccacatt cagccggccg
ccaggacagg 107100ctccccagca gtggcctccc cactaactga cagtggtgac aggaaatatc
tcccattcca 107160atctcctcag aagtctgaat aagtaaggga cagatgttgg ggagaggcgt
cactcttggg 107220ttgatgaaga aaagatcatg agaagcatac attttacccg ctatggttgg
ggttccatta 107280ccacccatgg cggggttgag ggggaagggc agaaaaaagg agatggagaa
ggacagacac 107340gtaagaagga ggatgtggtt agggctgatt cgtcccctgg ggtcgaacaa
tcagctgtta 107400ctggggcgga aggcaggaag tctccttaaa gacacaatat tctgaacgtt
gaactcagga 107460tttgaagcaa gcccagcagt caccttagtg gagcccatac ttaatttaac
acagaagcgg 107520ctatctcagg cttccctctc atttctttgt ctcagatgct ctcacttaga
aagcttagat 107580gctcttagaa atgactcaaa agtcaagaac cccaggccac aagtttctct
ttgggggtgg 107640ggagggagtg aagaggtggt cccagtcttg tccctttaaa taagcaattc
agcagctttt 107700gccaagtcat tgggttcatt tcggtttttg cccatccccc gcctttcaga
ctctgattgg 107760cccctaggga aggagccgcc tcttcattgg tctccacctt tgaaatcact
tccctaagta 107820ggcctgagtc agagaagcgt ttcggagggc gggactgaat gggtgttaat
cttagaaccg 107880ggtttctggt tgatactact ttggtaaaga tcttccccta atttttaaaa
agacgcttcc 107940tctctaaaag tgagggcgaa tcctttgtta agaacgtgcc ccttgagaag
ccgtgggctc 108000ttcagcgact aagacgagac attcactaga aaagatttca ctaaacccac
gagggataga 108060ctagacctcc agtgaagatt gggcctgtgc gggtgacatt tgtccctata
ccccgaagac 108120ctcgagctag ctctccagtg aagactgggg ccgtgcgagt gacagtggtc
cctatacccc 108180gaaaaaaaaa aagtcctatt tgtggaaaaa aaaaaagact tcgggtgttc
tgctgcatcg 108240gtggctggct tccatcttta gttctactca ctcctgttgc ttcgcgtgct
ccaccttcgc 108300ttagctcagg cctcctgtga atcagttttg aggctaaaag aagttccaag
aaggaggggc 108360tgtagccctt taaggacttc cccgcgaccg agtcagagat cagtttaaaa
atgccaactc 108420acagagcgcg ctgcattctg ggaagctgag tgtcaccgta agaacttcat
tgaccggaat 108480gcactgcaaa aatacacgcc tatacttcct tctgctcttt aaactgtagt
ttgacgtaaa 108540gctggtctaa gcaagtcgcc taggccgagg gttagccaca ccttttcagc
cattggccag 108600ttggttagtt ggtaggcgtg gcttagagaa gctcctccag gcaagggggt
ggcctccttg 108660ccaatcagag cccagacgcc tgaatgggcg ggagtaagca gaggtgctgg
cgcccccgag 108720tgggtgtggt cacgttgccc agcaatgggc ggtgattggc cctgggtggt
tcattcgcag 108780ctcgtgcgtc acgacgccgc cagctgatcg gagactggag ccggtgtgtg
ctgggcgctg 108840ggaagagaca gagcggtcgg ccgtgcggac aggtcgcagt gattttgctc
ctctgtccac 108900agcaaccccc gcacccagca tcaggtgggt gtgatctggg gacccggtca
tcccgggggg 108960aaccgcggta accgggtgat ggggaaagta gggtcctgac ggccacaccc
tgcccttctg 109020ggggagggga gagggggcgg cggggacagg ggcgctcttg ggagaggagc
ctggactctc 109080ccgagtagtg tgtctggacg tttaaagaga gagtcccgga caggagtcgt
ggcagaaggt 109140ttggagaagt aactggggag gaatatgaga ggccagaggg ccgggggcgt
ctaaccccga 109200cgccctttgg tttgaggatg cccgagctga ccatttagcc tagggaggat
ctggacgagc 109260gaggggtgcg gaggtgcatt gcctctaccg gcgctgactg ggtcagggcc
agttcaagtc 109320cctggcaggg aaggggtcgc tgggcggtcc ggcccctcct ctcgttccct
cccggggatg 109380ttatgtaagg ggggagggga aaggagtagg gggcggcggt gcggaggcct
tatgcaaccc 109440aaaggttagg gtttcaccgc gggttgggcg gaggttgggg ggggcggaca
ggaggagtgc 109500ctggaaactc tacccgcacc ccccctccca gcctaactgg ctgtcttgga
cagagagaag 109560gtcacctttg cacctccccc ctagtatgtc cggtagagag gcccctagcc
cgggcttggc 109620ctgactgcct gggaagccgg ctggctgggt ggggcgcctg ggttagtcat
cgctgggctc 109680cctctctccc cacctcctgg ccaactcttg gcccctcccc acggcctccg
gttaggctaa 109740cgttcccacc tccctctggc cctagtttca gtctccaact catttggcct
gtcaccctgg 109800ctgttagagt aggctagaag ctgtcatggt gccagagagt tgatggagca
gctggtcaga 109860gggtcagtgc cctgggccca ccccgccccg cagccaaggg cacctgcttg
gcacaaactc 109920tcagcagcca gtgaaccctg tggcctgaac agagctatcc tgggcagaga
gaagtggaca 109980gagactgatc acctaggaga aggaagatcc gacaaagttt atacttccca
agaggctttt 110040ggaatttgaa agttgcccac cctagtgtaa tctttccact ctctgaaaat
agaaatccca 110100aggcaaagtc tccttggccc ttctatctgg cagtggccat gtccttggac
tgactgtgca 110160gaaccaccct ctcgggctcc cagccctcta gcctgccacg cccccagccc
cctccctgag 110220ccatgctgta gggccccggc ttttactgct gattcatgcg ttggaactgt
gggggcgggg 110280cttggaactt ggaacaaagt tcagacgtgg aggggccggc agacagcctg
gaattcatac 110340cagatgtacc cggaatgtgc aagcggaatg cctggcatct ctagtcctga
ggaagctgcc 110400cagccaccct acccatacct ccctcccctc ctgcctttgg tcagctgtcc
tccctcagac 110460tcctgagagc ccctgctgac cttccaactc tagtgcccct cccatttcta
accctacaca 110520aaccctcctt gctgctgaat tccctaagaa caagtcattt gagttgatca
cagagctcat 110580atttctgaag tacatttttt tttttaactt gggacttggg ttctacaccc
tgccctttga 110640atgccgaaga tgctgggctc cttagcaggt tgccaagagt tgccagctcc
tagtctgtaa 110700aggggcacaa agcaagtgca tttagaagcc tcttgcttct tattcaagaa
cccctcatta 110760gaaggtactg aaagtcagct agagccaggt ttggatggcc tctgggtcgc
tggccctgtc 110820acccagcttt cctgtttttt tttttcctcc ccttcctttt aggaacctgt
gcctcccaca 110880ccctcacctg gctgagccgc agtagttctt cagtggcaag ctttatgtcc
tgacccagct 110940aaagctgcca gttgaagaac tgttgccctc tgcccctggc ttcgtggagg
aagaggagaa 111000gcagcagctt tgcctatcat ccggaaggtg acagaactgg ggtgggaagg
tctggacagc 111060tggggtgatg gctttatggg agggaaaccc tggtcctctg gggagccctt
acccccactg 111120gcccagtgaa agatttaggt taaaggcact gtctataaat tggggaatag
gtgactccac 111180ctccccaaga ttagttgatg tctgtgtggc agtgggaaga aatagaagga
aaagtctgtc 111240tgtttactga gacttccttg taggcctgcc tttcttatct tcatcatcac
catgccaaca 111300cacacacaca cacacacaca cacacacaca catacacaca catacacaca
cacacacaca 111360cacacacttt cctttccatg aggtccaaaa gtaaatgtac tcaggaaggg
ggacattgaa 111420actccgttct aagtagtcat ttgtgtattt actttttttg tttatttgtt
tgattgactt 111480tcgagacagg gtttctctgt atagccctgg ctgtcctgga actcactttg
tagactaggc 111540tggcctcgaa ctcagaaatc tgcctgcctc tgcctcccaa gtgctgggat
taaaggcgtg 111600tgccaccacc gcccggctgt atttacattt ctttatttat ttttagtctg
gcccagattt 111660tgggtttagg ggtacttacc cttacacctg tggatttttc cacctgtata
atggggaatc 111720ccatagataa gtaggcagga gggcattaaa agtccaccag tggtgactca
gagcctgggc 111780tcttcttctt ctcgtggatg gaaacgaaac agctcttcac atgaactgtt
gtccttcccc 111840caccccctga ctactcaccc agctcagggg gattaggatg gaaggaaagg
ctatggttaa 111900gtcccaggca agctcgtggg aggctagtcc tctactggct tctcaccatg
catgggtggt 111960ccaaggcttt ccctccacct aaagcaaaac tgtagctctt ggttgggttc
tagcaaccac 112020tgccatttat tttctgcctt tgctttccag gatagtgaga ctctgctcaa
tactgtgcag 112080gcaagaaatt gtcaggggag atgggttgta tgatatgagt cccttctgct
gcctctagct 112140cctgattcat tctcacgtat gggcttggtc tctgattgtg gttcaccttt
ggcccagtct 112200tcctaacaga agatgggttc agggggtaca ggaggctgtt tgttgtattt
gacaggagga 112260ggagttctag cctgttcccc atttgtgaga aactgaaagt cataggggag
actagatcat 112320ctaatccagc cccactgcag tctaagctga gggataggat gtgtaaggga
ctgtagcaga 112380cgggctgggg aggctgagtc ggctcacaca ttgcgacaaa gattgccctt
ccctcgacct 112440cgcttgcttt ctttcctcct cccttccctg gccacagtgt gtccctccag
cactgggtac 112500atggctctgc tgtcctcatc caacatggag cctcagaggt gagaaagggc
agcctggaag 112560caacagaggc aggcacaaga cagtggagga cctggcctgg aaccacaagg
gcctatccgg 112620acattggtca gagaggcacg tagaagcctg gagaacacca ggaaagagag
cagccagcca 112680gcctcagtga aagacacgtg cttccagcca tctcctctca ggacctgcct
tcctgggaga 112740tgaagggcct ccaggaagta tggtcccatc tctaccctgc agtttctata
aacagcctca 112800aggagcatga gccacctctg aaaggaaata cacagcaaat tcaaaaagag
attcaaatgt 112860gtaacactgt gggaaaacat atctatgact ggggttgtag ctcagttggt
aggtttgctt 112920aacatgcacc aagccctggt tctgtcttct gcattgcata aaactgaaca
ggttggccca 112980ggtctgcaat cccggcactc tggaggtggt ggcaaaggag cctacattca
aggtaatcct 113040ctgctataca atgagttctg agccagcctg ggctatatga gactgtctca
aaaaataaaa 113100caaaataaaa taaagcattg gttagtaatt caaagaaagc agatgtggct
gaaaccgttt 113160tccctgatca taatacaaca agcaaatgaa agccagaaga aggctcctgt
gccttgtgtg 113220tggcagtacc aaccattgtg agagatgcct ttggacctgg tagtttgctg
tcttagaaat 113280gtatcctaaa ataaggattt ggttataaaa tgttcatctc agggttgtaa
tagagaaaaa 113340tggaacgcag ctgtttgttt ggaagtccat tccttttctg ctgtcatgaa
aatgtatagc 113400tagggcttgc ctaagtaaat tatattcatc tgatggtggt gttctgtgca
gccatccaaa 113460gtcttacaga agaaaaattg agtggaaata taaatattga atactaaaaa
gattataaaa 113520gtatgagttt gtgactgttt ttaaaatatg aacacatact tgtaatatat
ttttttaaaa 113580accatccaat tgagtggaaa tataaatact gaatactaaa aagattatga
aaagtatgag 113640tttgtgactg tttttaaaat atgaatgcat acttgtaata tattttttaa
aaaaacactg 113700aaagtggatt caaaatgtta agaatggttg tttttgtatg gtgggatagt
acaattgtga 113760attttcccct tgttttttct gtctttctaa tttttaaata ttgtgcattg
ctttcatatg 113820ttaaataaaa tacaaaagac aaataaatgt tttaaaattt ttactctttt
atgagtgttt 113880tgcctgtgag tggcaggaat ccaacattgt cttctgaaag cagctagcgc
taacttctga 113940gccgtctctt cactccctct gtaattttta aaaaatatat ttgtatgtta
tattatgtgt 114000ctttgtgcac cagagtgtgg gtgcacattc tgcagaggcc agaagagggc
atcagattcc 114060ctggagctgc acgctgtttg gatcttctga tgtggatgct cagaatcgca
ctcaggtcct 114120ctagaagagc agcaaatgct cctagccact aaagccatct ctccctctag
tcctcattgt 114180catgtttaga ttttggagaa tttgcttgta ggaggatggg ctacaccaag
tgccaggtga 114240aagaaaatgt ttgcttggga tacctattgc ttcttgagtg tgtgtgtgca
tgcttgtgtg 114300tgtgtgtgta tgtgtgtgtg tacactggag ctaggaatca tatccagggg
ccttttcaag 114360ctccaccaca ctaaagtcaa ttctatgaac ttcattaatt gtctgaatcc
acctactctc 114420tacacacagg aagcattcct ctgactttct gactgtcagc cagctaagga
ggtgtggctt 114480agaataagaa agaagggaaa tgctcaaaac ctgtcactct ntggggnnnn
nnnnnnnnnn 114540nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 114600nnnnnnnnnn nnnnnnnnnn nnnnnncccc tattttacca agcattgcag
gataaataag 114660aacatagctt aaagaattct agagaaaaaa aaaaaagtcc actaatgttc
aatgttttaa 114720ttaattttta atgggaaaaa gtaatatcaa agaaatccat gtctacggca
tattaatgtg 114780gttaatgcaa acaggacaga ttatcttggc caaattaaac cacagtaaca
cgttttgaac 114840taaaagtcat ctatttatac gagatgaaat gtgaattagc gtgcgcgcgc
aggtgcacac 114900atgcacacat acaaacagag tctttttatt cttaagccta ccaaataact
ctttaagcag 114960taaataattt taagctctaa aatttaaaaa atagtgaaac cccttcaggc
tatgacagaa 115020tgctgctttt gccattcttt ctgataaaag tcccaaaggg tgtcataatc
tgtatccttc 115080ttagaaaagt aaggagcaca tcctatgagc tggtgacctg agttttacac
ccaagtcaca 115140cgtcagcaca cagcaatgtc ctggaccatt tgtgaggagc cccacgctgg
tcctgagcaa 115200caactcactt ggactggtac ggggtgaggc tggcgatggg caccactttg
gactgtgtcc 115260cacctgaagg ctgcagcagg ccagttccgg caggttttcc aaatggcttt
gaggcaccat 115320aagccttagc tgcagtggaa cctatgcaaa cacacaggaa aagggcagtc
attcccgttt 115380tatttctcga agtgaatgta caggcaccat ggcacgcatc tggaggtcag
atgataatct 115440caaaaaaatc caacttttgt gagacaggct cccttcattg tcttcccacc
acacttgcca 115500ggttaggggc acaagctccc aaggagtatc tgctctccta agggcactgc
catcagagac 115560actctgacta ctacacctgg cttttaggtg ggttctgagg atttgaacgc
agatcatcat 115620ggttgtatag caaattttta ttcaccaagc cagacaaccg ttttttaact
ttttttaaaa 115680aaagatttat ttttatgagt acagtgtggc tgtcttcaga cacaccagaa
gaagacatca 115740gatcccatta cagatggttg tgagccacca tgtggttgct gggaattgaa
ctcaggacct 115800ttggaagagc agtcagtgct cttaaccgct gagccatctc tccagccctt
atttaacttt 115860taaaagttta aaattaattc tcattatgca cgtgccacag ggaagtgtaa
cataagcaca 115920tgtgtttgaa gtatatatgg ggatgtagtt atgtgacggc cagaggacaa
ctctgtgaag 115980ttgattcttt cctttcactg tgggtcctcg gagcgtcaat agggtcagca
ggcattttac 116040cagctgagct atcttgtggg ccaccaaact tatttttaaa aagctagaag
ttggttaaag 116100agaaggaatt catgtaattg aaatagaaaa gcagagttga agagggagga
gaggaggtga 116160gtatgatatg gtaagaaaaa ccaagaaaaa acctctacaa gtaaatctag
tcgtgttcca 116220cagatcacta tgttcaagga tggtgtccca gaaaagggtc ctaaaacaaa
gcccaggaag 116280taatcaaaat gatcccatcc agacctgcta cagagaagct ggggggaagg
agggagaggc 116340cctgtaacct caagtccccg ggcagcacag cagagctgtg cccaatgatc
tccgtgatac 116400aggacaacag gatatctgag aaggagtccc agtgagcgtg tagcagaagc
cagaggcctc 116460gaacaaaacc aacaacttca ccatgatgaa catcatgaag atgtgtggac
aaaagggtgc 116520actctgggac acagtgtcac acactacagc ttccacctgg gacaaaaagt
cacacactac 116580agcttccacc tgggacacaa tgtcacacac tacagcttcc acagtcagat
tatttttctt 116640ctagcggaga ggctgcaagg gtggagggca ggtacaaggg aacaggggtg
agtgggatcg 116700gggtgcatga tgtgaaactt acaaagaacc aataagttag gaaagagaga
tgaagacata 116760gaagccatgg ccaacaaccc cagcactctg gaggcaggca ggagggtctc
catgccaaag 116820gtaatgaatg ctatgtctaa agagatgata aagtcccatg agagttacaa
agtcagagga 116880gaaacttgga cagaaaagcc aaatgtaaca aatgcatggt aggtggggaa
aggtggggat 116940ttaaaagaaa tttcacaaaa aggcacagaa tgtaaggtga aagcgagcga
ctaccagata 117000aagcatcaac taagaggagt caggaacagt gtattttaac tacctttata
aaaaaaatac 117060tcgggctggt gagatggctc agcggttaag agcaccgact gctcttccaa
tggtcctgag 117120ttcaaatccc aacaaccaca tggtggctca caaccatcca taacaaaatc
tgatacggtc 117180ttctggagtg tctgaaggca gctacaagtg tacttacata taataagtaa
ataaataaat 117240aagtattaaa aaatatgcaa taattggaat tattctttgc aattctacta
tgaatggagt 117300ttttgttcct gttgtttgtt ttccgagcct gtttcatgta atccaaactg
gcctggaacc 117360tgctaaatag tggaagatga ccttgaactt ctgattcccc tgtctccacc
tcccaagtgc 117420tggcctaggc tactctgcct gactcaggct cagtgaattt tcaaacactg
cttcaacctt 117480gacccctaac ttccatctct tgggtcccat cttacccatt cccagactcc
cattctgtgg 117540ctggggcttg ctggctggtg gtgtcgctgc agaggctggg gaagggactg
cttgctgctg 117600ctgctgctgc tgctgctgct gctgctgccc atatcctgga gatcaaaaca
ggccaattca 117660gtgcaaaccc agtcaagaga tttcaacacc agcagtaaat accttaggaa
aacccacctt 117720tggctcttag aggagcagac tgcaccacgc tgcacaccct gtgtctgcaa
agggctcact 117780tttttgtctg cgcatagtaa agactggttc catttccctt tcaactgttt
agaattgaca 117840gctttcagtt ttaatgaggt tctccctgaa gccttgccca tttctccttc
aagaacctgc 117900agtaaagtca ctgggctcaa tctgtgatac taccactatt tccatagcaa
caaatcgatg 117960tcatcaacag gacggtgtgt tcgtggcagg atccattaga agagaaagca
gggtggtaag 118020gaaaacctaa aagcagttca attgtctctc aagtgctttg gcttcaaagg
aaaaggagac 118080gtttacaaag ggttctaccc tttcccccaa acaaagcaac cttattttgc
aactgacagc 118140tggtggaaaa tatgagttga aagagtctca ggtaccgtga agtgctagtg
aactgtcaca 118200gcagggagag atgagaacag agttcagaag cagccgtgtc aaggcaacag
ggagataaaa 118260aggaggggga ccgagcacgc ttctaaagca gcaaataaga cgcgccaccc
tgtcggagtg 118320tgtatttgcc cttcaccctt tatacaaatg tttaccaagt tgaagaatgt
taacattgta 118380aattgcctac cactatttct aaaataacga gtttatgggg tttgatttta
cttctgtgat 118440tggctctgaa ggctgaagcc agcctcgttt aggctgctca cctggagtgc
ttgtagatga 118500caaaggcagc taaaaaaaaa aaaatgtccc agagctcctg aaacactaaa
actggtgcgc 118560acaggcagga agtctccctt gccactgagg ctctcccttc tccaactgta
agttctaact 118620cctgctgggc ctcctgaggt cccaactcac cgggcatgct acatgcccat
ggaattcatt 118680tttggcaaaa ttactcttaa gtagctcaag gaatgagacg taattgtgtt
gctggagaca 118740aatacattca gcacaaagtt gagaagatta aatagaattg attatgcttt
agttcatcct 118800acagagagaa aaagtgagac acctatttac ataggagagg ggcccaggct
actgacaatg 118860agcccttcct ttacagctca gccttcccat tatcactcaa ctcagacacc
cagcccagtt 118920cagtctatgt gaagcatttt taaaatcagc aagagaaagg tgagttgctc
agtagtacac 118980tgaaggaaag tagaaatgag cacacagtga cctgctgcat aagacacagt
ttaaaaggtg 119040acctatcttc cagcgaagtt cttgccttct ttaaaaagaa tgtggtattt
gctggtatgt 119100gcacctcacg aaggtgcttg cagctatgtg gagggcagag accatcctga
gggaatcggg 119160tctccttcca ccagagtcct agggatggca caaaggcacc gggcttgaca
gatgccttta 119220cctgctgagc cctttcccca gcctctcctt ttctacagtc tccaaattac
ctgaggtagg 119280agcctcattc tcagaaccat ttctgctgcg catgtctgac tgaactggat
atggtcactc 119340gactcaattt ctataaaaag ataactgagg agccggcttg gttggtagag
aaaacataaa 119400gacccgagtt ttgttcccac atggggcgtg cgtgctagag aggtgaggac
ggactaggca 119460gggagcagag gtaggcagat cccaggcctc aatagccagg cagcccaacc
tgagctctaa 119520gttcagtaag ggagcctgta accacagaag acactgtgcc gtgtaaccac
ggcaagcggc 119580ccgctgcagt ggcatctgct catagccaca gctacccagg aggctgagcc
acaagactca 119640ttgaagctgg aggtcaaggc cagcataggt accataggta gacccccatc
tcaaagttga 119700acagtaaaca tatattttac tacaattaaa aaaacaaggc cgggtgtggt
ggcgcatacc 119760tttaatccca gcactcggga ggcagaggca ggcggatttc tgagttcgag
gccagcctgg 119820tctacaaagt gagttccagg acagccaggg ctacacagag aaaccctgtc
tcgaaaaacc 119880aaaaaaaaaa aaaaaaaaaa aaaaaaccaa aaacccaaca catgtaccta
ttctaacata 119940aaattttcat tttttataaa aattacagtt ataattttta gttgaacata
atacaatgat 120000aagtctccaa cttgataatc tgaggctggg ggtgtagctc atttagtaga
gtacctgcct 120060agcagtcgct aagtccgggt agtcccttgc ctgtatccca gtgcttgggg
aacaggcaga 120120aagaggacca gaagttcaag gtgctcctcc tcttcaggta atgaggagtc
tgagccagcc 120180tgggatgcgt gagacacacc agtaacagca actactgcct ggagtgctca
ctctggacca 120240ggaacaagag ttaagtgcgt tactcccgta actgtctgca caatgatgga
gacagtacgt 120300catctttaac atctttggct gagaagagaa aacctggtat tcctcagctc
gttctggcta 120360agttcatgta cttcatcatc atgcagttaa tttgtaaaca accaatcctg
gcagcaataa 120420ctctaattat atagaacata agatgtggta attaggaaaa gctactaatc
cacttaatag 120480agtaaccttt atctcttgca aatctggtac aagagacagt cccaaatcaa
atgatggcaa 120540gaattccaga gcattgtaaa tagcagcaat tgcccttcaa ttaacgcatt
gtaacgcagc 120600agctgcccac aagacctcaa atcaatcagt ctatagctaa ggaaaaatct
ttctaaagcc 120660aaaaccattc tacaaagcag taacgtaggc tccgtttata ataacctgtt
ttgggccacc 120720tgcaaatcaa gctatcccag gaagccagat cgtaattctt aggctctgct
ggctacacac 120780tggtcccaag ccatgaggga actagattac agcaggctcc gccctcggtg
acctgctcat 120840agctatcatt cttacagtct attatggcaa gtgagctctg ggcagagaaa
aattcacaaa 120900caaacccacc aacttcccaa gcaagcattt tcttaacaaa cacaaagaat
aaataaatag 120960agcctgccat ggtagtgcac acctgtaatc tcagcatgtg ggaggcagag
gcaggcagat 121020ctctgccagg agttccatgc cagcctggtc tagacagttg caagatcaac
aacgctatat 121080ggtgaggccc tgtctcaact cccaaaccac tgaaaacaag taagacgata
tggatcaatg 121140caatatttct cagttcctac tgacagaaaa tggacacaat taggctgggt
ataattccaa 121200tatgaataat aagtatatta tacagtacct agtttaagtc tgagcaagat
attatcgcca 121260acagcacaga tacaggacac acacacagct gcaagcgtga aggatttaaa
ggcacccatc 121320cctacagtat atgcagcatt gactgtctag ttttatttcg cacactttga
atcatccatc 121380catttttgct caatatggca gcagtaataa aatgtatatt tgcattttga
tgcatggtgg 121440aattcttact agcctgggct gtgtttgctc actaactcca gtggacttct
gacgtaagag 121500gcgctggact agcttccaga ggatgtaaat ctaactttgg ttctcggcct
cccctgaagc 121560ctttgctgtg gtgaaaggtg ctgtttctga agccacgaca gtcccatggt
ggtttgagta 121620acaatactcc tggcttggta acaatgccaa aaaataccaa aaaaaccaaa
accaaaacca 121680aaaaaagcac agagctcaca tctgagccaa aaaaaccgac actccctatt
ttttgaagaa 121740ctcacagaaa tcaagaagaa aaacaagcaa acaaacagca cataacaggc
taacaacaac 121800aacaagtcca cttcagaggc caaaaaccaa aaggcaaagg ggctcttaaa
catttacaag 121860gacagacctc actcagaaca caagctatag ccatgatact gctcttcatc
catcaattct 121920taaagacaca gaagatgggc tgctggcatg actgggaaga gaccagtgtg
ctcactcatt 121980gctggggtac gctgtaacaa gcagaggaga atgtctaact gtgtctgcca
atcacaggca 122040cattcccagt taacccagca atcacttctg gcaaaaaaag cccactctgc
ccgcacactg 122100gtgtacccag gaggtaattc actgcggctt ctggtggatc ttttttcctt
cttgcagtgc 122160ttgggttcaa attaggatca agcacacttt tatagtcaga gcacggagac
aaaaggatct 122220aaccacagag ttggttaaat aaacaacaga acagccacac cagaactgct
ggggggcggg 122280gggagggggg ggagggaagg gggagaagct agaatcccag cattcaggat
gcagaggttg 122340gtaaacaagt ttcaggctca gtcggggtat aaggtaagac tttatttcac
aaaaataaat 122400aattttttta aagaggacag aaaaaagaca gcgtagagaa ctagctatct
tttatgtaaa 122460ccatgtgggg tacaggagca gcatttgcat ttggttactt ggcgaatagc
cctggtagga 122520taaagaaacc ttaaactgtg ttatctataa agggcagcag tgggtgcaca
atgcagtggg 122580taggagccca cccattgcac gccatagttt tgattacaaa gtcacgcagc
tctactcaaa 122640aattaaaaca aacattaatg cttgttaaag aaacagggct gcagagatga
tggctcagtg 122700gttaagagca cagcgcccat atggaggctc tcagccatct gcttctccaa
ttccagggga 122760agctaacgcc ctcttttggc cttcaagagc actgcatgca catggtacgc
ttacctacat 122820gcccgcaaac attcaaagaa aaatacaaac tgctataaaa cccatatatg
accatcttaa 122880gattctttca tttttttgag acagggtttc tctgtgtagc cctggctgtt
ccagaacttg 122940ctctgcagac caggctggcc tccaacccag agatctgcct gcctctgcct
ccacagtgct 123000ggaattaaag gtatttaaca cacacattat acatatcttc cttcttcttc
ttcttttttt 123060aaagcgtttt gttgtttttt gttaagtttc aattaaaaaa ctacatagtt
ttataggcaa 123120acataattaa aaatgccaat gtgaaataaa taatatatac atatataaca
ttctgtaata 123180gattcactca cacaacttat atacttaaat acaattttca caataatgaa
aagctttgaa 123240atgaagactt ctggatacat tagaaacgta ccctgaaaat cgcaaatgac
ggttttcatt 123300tctttgtgtc agacattagt gtgagtgtct aaacttgcat aaaggctctc
ttctctatca 123360cttcctacct attgcagtgg ttctcgctaa acctggaacc tggggctcct
gttccttggc 123420tggactacaa ggcagcaagt cccagcaatc ctcctgtctc acctttcttg
gaaccaatgt 123480tataagtgtg tgtgggcact agccttgtta catggctgct gggactagaa
ctctggtctt 123540caatattagg catcaagagc tcttaactgc taagccatct ttctaccctg
attagaattt 123600cttgaagcaa aagaaactca cagatggtca gagtttacac acacacacac
acacacacac 123660acatacacac acacacacac tcacacacca aggcttagtg accactgtga
aaagggaagt 123720gcggtgagga actgtaaaaa taaggtgtca ggaaagctct gcacaaaatg
gtgtcctctg 123780gacaagccag ggcctctgcc ctcatcagct cttagtaact atggttgcct
acagcaaacc 123840atgccaggga ccactctaac atggagctgg gatgggctcg agaggccctg
ttattaaagg 123900agaagctgta gagagttgat ttgatggatt ctaaagtagg gggaatcgat
ttcctttatc 123960gatatgggtc agccatgctc tagtgagtgg ccccacaccc acccatgagt
atgtgtggac 124020agcacacacc ggacctggca agtcaataaa acaaaaacaa aaacaaacaa
ataaacgttc 124080tggtccacca tagtggctgc tcgtggttgg gagctgtccc gcacttatgg
caggaagaca 124140gtggctgcaa agaggacaaa agtctctgga actgatcaac tctagagtct
gcttgttatg 124200agaactggga agtacccgct gggacagaag cagactctga aggtgatcag
gacagagatc 124260acagaaggag agactggtta tcggaggaaa tctgaaacat aactcgacgc
atactggtcc 124320aaactggtgc ccatcactac aacagcagta attgaattgg gcacaacatt
cagaaaacag 124380aaaaagacta cagagtacgc accctggcta tcatcaaccc aggcgattct
ggcactattg 124440gaagcaagcc agactggaga aaaggaaaca aaaagttatt taacaaaact
tcccagagca 124500tgttaaaaaa aaaaaaaaga aagccagaca tgggggtgca cgtttttaac
cctagcactc 124560aggaggcaga ggcaggtgag gcaggaggat ccatgagttc gaggccagcc
tggtctgtac 124620agtgggttcc aggaaagcca ggcaacaaag aaaccctgtc tcaaaaatca
ctgactgggg 124680agagaggaag tggatccaag agcaagagag agagagcaga gagagtgggg
gtttggatgt 124740cagcgttatt aaatgacagc agaaaagatg ggcccgacca atgacatccc
agaaatggca 124800aagatgaaaa aataaacaca agttctaaat atcattttaa taatggctgt
gtgtctgctg 124860gctcattctt gttatccaaa caaactaaag caggggtggc cgggatttac
ggccagccag 124920gactagagtg agacactgct ttaaaaaagc aatagatgca cgctaaccat
taatcagcgt 124980aactggagtt tgagggaggg agtgggcccg ggaagcctgt gcctataaac
ccaacctgcg 125040aggcctgaag cccgaaggtt gaactgagaa catctcaaga caaagcacag
gcacaatctc 125100ttacaaacag tttaaacaca ataccagcaa taaattgtca gctttatgac
agataggctg 125160acaggcatac cacaaagatc gggaagaaga aacgggattc ttgcacaaca
ttttacaaat 125220cgacacagct gagcctagtg acaggccgtg ataagctcaa aacatgcaca
ctgcaaacac 125280cacagcatgg ccagagtgac agggtcacag caatacagca acagagacag
taagaaacat 125340tggaaaaggg aagaaggcaa agagcaccac gagagaaaag ttaacccgcg
attccatatg 125400agggcccagc atccctctcc cacctgacag agaaaaccag gaggacgcag
ctgaaacact 125460gtacaactat aaatactcct tcctagtgta gacagagttt accaaagggt
atcgtaatct 125520gaagcacaaa cataacactg taagttacca gaagtttaga cacgttgaga
atttaataga 125580agttgagaca tcaaacacac gtttgatcct aggggaatta aattattggt
aacagaaaag 125640ctctttacaa agtctccaca tttataaacc aaatacactt gtatcagaat
tgttgtctga 125700gccagaggcg gcggcacaat ggtggctgag gcagcaggat caagtctgga
gaagcccctg 125760ggtccaggga gcactagtga attcaagtaa acattaaaca ttaaataaga
aagacggatt 125820ctacaggagc ctagacaccc tgtaaggtca gtactactaa gataccaaaa
ccaaactgac 125880tcagggggct gatgagatgg ctcagtggct aagggcatgc actgctcttg
cagaggacct 125940gagttcagtt cccagaaccc gtatcaggca gttcgcccac ctgtaatgta
ccagagaccc 126000taaacatttc tatcctccaa gcacagacac acataaacat aattaaaaat
aaatcttaaa 126060aaaaaatctc tcgtggacag acagcaaaaa atactgaacg aaatctaatc
aggtagactc 126120agcaaagcaa atccaaatat gcgaaaagga taatacaatg caaatccaga
cttatcccag 126180aaaccccagg tcgctttagc atccaaaaaa ttcaatcatc ataattcacc
gtattagcta 126240accgaaacag aaaaggcatt tgattattaa taaatacagg gaaagcattt
gacaacattg 126300accattcact cttaataaaa atgttaccaa acaggagaaa gaagtaaaat
ctcctcaacc 126360cgatgaacag gaacacaaac tggctgggca cagtggcaca tcctgaaacc
ccagcaccca 126420gatagctgcc agactgggct ggcatgagaa ccaagggtaa gaggcaccca
aacacttaat 126480gatccaaggt gctgcctccc ccccccccct gtgtcactta aatccgtaat
taaatttctg 126540ggaggagggg tttgacacag ggtctcactc tgtggcacag gctagcctga
atggagtcaa 126600tgctggcctc aaaactcttc tgcctcagtt ccagagtgag gggaaacaat
catgagccac 126660ctcacccagt tttaatgcca tttaacagta caaggttaaa cgctgcctga
aaagaacaag 126720acaaagaaag atcacacaga caaacatcac acaggtgctg tttgagacta
ggtctctagt 126780gatgtaggct ggtctcaaac ttgcacaaga aaagaatact tttgccatca
agatcaaggt 126840tgctatcacc catggtggca tttcggaagc agaggcagga agatcagtag
tacaaggtca 126900tcctcagcta ccatgggtgt gaggccagcc aaggcagcac gtgagccagc
tgtggaagca 126960cacatctgta accccaccac tcaggaggct gaagcaggaa gttcaagcca
cacaagagcc 127020acttaaaaat aaataaaaac ataaataagg ttggagagag ggnnnnnnnn
nnnnnnnnnn 127080nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 127140nnnnnnnnnn nnnnnnnnnn nnggaaaaca cccattagac tgctgaagag
actgagaaat 127200tgtcatcttg aagggttgaa atcactgagg aatggctaat gacaaaactg
cttaaggaga 127260attttattaa aaaacaaaac tccttctgga ggagtactgt gctcccttca
gcattaacta 127320ctgtgccagg gaacccactg ccagcagaca cccaacttca aggataatca
agcttgctac 127380gtaacatggc tgcttacgca gacaccacac acacttcctg tatgctgggc
atcacctcta 127440gcttacttat aatatcaaac ataagtacca tgtgagcgct acagggcctg
gccaatgaac 127500agcgatggga aaaataacct gcacatggtc agtaatgggc aatccactcc
ccagacactt 127560ctaacctcag ctgcagactt gtgggtactg agaacggacg tgactaagtc
aattcacaga 127620agcagagccc tgtgcctgtg ccagggaacg gaggacgctg ctcaatgggc
agagcttcag 127680ggcgggaaga gatggaaagt ggagctaggg cccgatgacg tgaatatggc
taatgccata 127740ttgtgtgtgt gtgtgtgtgt gtgtgtgtat gtatgtgtat gtggtgtgta
tgtatgtgtg 127800tgcatgtgtg tgtgtgtgta tgtatgtatg tgtgtgtatc caaaatggtt
ataaaaatcc 127860catacaatgg ctatgcttat gtgtctttta ccacaagtaa aaaattttaa
gtaatcttag 127920gaacacattt ctaaaatttg gaatacttgt tggggaagct atctgagccc
cagcatgaat 127980caggaaatgg gtaggagagg caggagagat ggcttagtgg ttaacacaca
catggtggcc 128040ctttcctggc accaactagg tcactcacaa ccaggatctg acagcctctc
ctggcctcct 128100caggcaccag gcaggaaagc ggccccacat ccctgcaggt aaaacatttg
tactgaaact 128160aaataaaaac ggagcagaag ctgggcctgg aagtccttta cctccagcac
cgggaagcag 128220gtggatattt tgtgagttcc gggtctacat agtaaaaact tgtcgccaag
taaaacaaaa 128280caaaaactgg gggctggtga gatggctcag tgggtaagag cacccgactg
ctcttccaaa 128340ggtccaaagt tcaaatccca gcaaccacat ggtggctcac aaccatccgc
aacaagatcc 128400tcttctggag tgtctgaaga cagcaacagt gtacttacat atattaataa
ataaatcttt 128460aaaaaaaata cctttaaaaa aacaaaacaa aaactcccac taaaataatt
ggaaaagtcc 128520agggaaagcc acactcgatg gcgcaagcgt gtgatcacag tattctgtag
gtaaggcaag 128580aggatcacag cgggacggag gccagcctca gctacacagg ccggtctggg
ctacagtgtg 128640agaccccggt ctcaaaacaa aacaaaacaa aaaaagcgtt cacattatca
tatactcaag 128700gccacaaaac accttcttct tcccaatcct tgaatgctat cgaatttggt
tacttttttt 128760tggtaatttt ttgtttgctt ttcaaggcag ggtttctctg tgtagccctg
gctatcctgg 128820aactcactct gtagaccagg ctggcctcta actcacagag atgtaattct
ttttcaaagc 128880tggattttga tttgggggtg tgtgggtgat accacaggga cacttgggcc
ccaaaccaaa 128940ccaaagaaaa aacacccccc ccccaaagta tgaacatcaa ctgtatataa
aactaacagt 129000tcatataagc taagtagcct gcaggtacat ggttatggaa acagctttat
catacagact 129060tctttcagta gatgccattt gagaaaaaaa aaaaaacaaa acattctttg
gattttcaac 129120atgtaaacga aaatatgtta aatcttaaaa aaccttaaag cagacgccac
tgctctttgg 129180ggtcagggaa gcacggctgc aggcccccag agcagcacat tccctgaggg
aagccttgtg 129240cgtcctcacc agcaccaaga acagccaact ataataagct cataaaaaat
cctgaaaggc 129300tgcccaagcc tgaaaatctg atatgaaaac agagggcaag acagataaaa
ggcaaactat 129360actttaaaag tcaattccag ctatctgtgt ggaaggactt ctcgggacgg
actatctgct 129420aagaccttgt gggtgattta aacagcgtgg agggcagaaa aagaaaggaa
agtgcaggaa 129480acaatggcaa aagctcctcc tccgtggcct ctacacgcat ccaagcgata
tggagggagg 129540agggaaggac actatacaca ggtaccaatc cccacgaact agaaaacaca
gcttttacta 129600actagtctat tttttttttt accttagcta gtgtctttct tatgtttggc
ataatttctg 129660ctctgcatta aataaaccta gtgtataaga aggcaaatga gtaaaggtaa
agtcagttaa 129720tgttcatttg ttttgactgt gtggtgtgca tgccaggcgc attcgtggag
gtcagaggac 129780aaccgcatgg agcaggtctt ctccttccac caccccaggg gtcgcagggg
tcccagggat 129840ggaactcggg tggtcagatt tgcttggcaa gggcttcact cactgaggcc
ccccaagggt 129900cccaattcac ctgtttactc taatgtatat atttttgaga cagggtcttg
ctatgtagtc 129960caggctagct ttgaactcac tgtggacact agctggcctg gaggtctgag
tgaacctcct 130020gtttccaccc aagtgctggg acaacagaca gcaaccatca aggcaaaatg
aagtcttccc 130080ttcacaccaa agtggcttcc tatgaccttc tcctgacaac cccaaacagc
aagtgcccgg 130140atatgcactt gcgtttctct ttttttactt caagctttga ctccactttc
ctaaaaggtg 130200tttactaggc tgaagcacac ttcaggagac accctgtagc tctggagtga
ccggaaacac 130260accagctctg atgcaaaaac aaaaaacagc gatggggatg gggcaaaggt
ggcttgtgct 130320tctgtgcaat ggcattccct gacgcattac ctcacaccag atacatacag
aaaagacaaa 130380ggtaagtcct cactcagtgc gcgcatccac cacacccaaa gcaagcattc
aacagctcag 130440tggcataagg ccagcattat cacagcgcgg acaaacacaa agccaacctt
ctctttgagt 130500tccccaaggt acatagtgtt ctgtccaagg atggcctctg cagactcctg
gaaacacgtc 130560acccactggt tctcttgaaa atctgcaata tttgcctaaa aaacacaact
ctattatttc 130620agcgaataaa taacaaaagg aagacacccc aaggacgagg gaagattatc
ttttcctact 130680taaagtaact tccaagggag aaatgacatt gccgccacac acagcccccg
actctttggt 130740taacggttcc cccttatgag gaagtattca cttctggtca ccccgtgttt
cacttttagt 130800acagtatcta atgaattaca ggagccactc aataccttta tcataaagtg
ggctttgtgt 130860taggtgattc tgaccaagcg taggctatgc aaatgctctg agcacactta
ggggaggctg 130920ggccgtgcta gccctgctac atgcaggtaa tgctacgcca gtcttacgcc
acaggaagaa 130980gcttaaaatg tggtcgaact caatctgcta gaggtgtgag tttagacctc
gggagacggg 131040ccaagtaaac ttccgcagat gtcggagcat cagacagagc tgtcccctcc
tgatgcaaaa 131100ggcttcgcac ggcaagtttt taatgagcct ccatggtaat agtgggttcc
tctctctcct 131160cctcccttat caatatgctt gacatctttt agttttttga gacagcaaat
aacgtagccc 131220aatctgcccg catacttact gggcagccac agctggccct gaactcctcc
tgcgtctcct 131280ttaccccgta acaagtgctg gggttacaga catatgccgc catgttcagt
gtaagtgacc 131340gctgcccagc agggcaaagt cttccttatt tacaaagcag cagccaagcc
ctgcagccca 131400ggcctatctg atttcctcag cacaccccca agggtctcac cgatggcagt
cagtccatga 131460acaccgtagg cttctcacaa tccacactac tcaccgataa gatcatgcgg
tatttgaaat 131520tgggaaattc ccggtcacac ttctcacagc ggtacaaccc attctgctgg
tcaatcactt 131580tcttattgca gtcctgggtt gggcaggcct ggtacataca gttctctttg
cggagaaaca 131640ccaccgctgc cacagtgctg aaatagtccg cctgcaggcg aaaggaagac
cgccatcagc 131700aagcaacaca ggtctggaac aggcaactca aagctgctct tccttgcaag
tggtgagcgc 131760gtgtacatcc tagctcccac gctcacgagc gatatgcaga atcactaact
ctggtttcag 131820aaatcacacg tgctacacgc agaacccaaa gaagtaacaa accggcactc
acgcacctca 131880ctttttccat tttggagacg gcggctcact agctagcctt gaactcagag
tttggtctgc 131940ctctgtctct atagagtgcc tggctaccac acctgggcac tttgttttcg
agacagggtt 132000tctccggaac tcactctgta gaccaggctg gcctcaaact cagaaatcca
cctgcctctg 132060cctcccaagc gctaggatta aaggcatgtg ccatcagcgc ctgacccaat
tctttttatt 132120tatagttatt atttgttaca tatgtgtatc agtgtgtatg acgtccatat
gtatatgact 132180gtgtgcagtg tgcacatgtg tgcaggtcag aggacaactc tcaggagtca
gttctctcct 132240cctactgtgg cgtttgggga actcaggttc caagatagca ggaaaagtgc
ctttaacagc 132300tgagttatct cgacactgac atgtaatgat tcagtgtgca cacagtggtt
ttgcatgtag 132360tcatataaac cacagcgtgc atgtgcaggc taggaaacaa cctgtgggag
ttggtgatct 132420cccctcacca tgtgagtgag ggagaggaac tcaggctgtc aggctcggtg
gcagtgcctt 132480tactcactga gttcccttgc tggccaagca ttatttataa gatggtgatg
tctaccctta 132540tctttaggga tcagtaagtt tttccccaag acaagccaga aaaacctttt
aggcccaatg 132600agccattcag tcagtctcta ctgccactcc tcaacctgtc tgtggggcag
gacaagccac 132660agacaacctg aagacggaag gtgagccaat aaaatcttac tgacagtaac
agccagccag 132720ctcacaggct tgacaggcaa ttcttggact gaatgcgttt aagagaatgc
agaattaccc 132780atgactaaga tcttctaaat ggaaaaatgt ctggttaagt aacccagcaa
ggagctaagt 132840cacgcaagcg gtggatacct gctttgcctc tgaaccctgc acaggttttg
gtttatgttc 132900aatcatgtca agtacctaca aaatccagat ctagcctgaa ttcaaagata
ggctacctac 132960cctgcccccg gacccccacc cggggtctca ctgtgtagtc ctggctgtcg
tagcgctctc 133020tatacagacc agctgttatc aaattcagag acccacttgc ctcccaagtg
ttgggattaa 133080aggcatttgc cactatgcct ggctctcact ggttgtacca gaggagcaaa
acaatgtggc 133140catttaaaga gacgaagcta gaaaccagtt tagacagctt tggggctgtg
ggtgtagctc 133200agtggaagag cttgcttagc atgcacaagc tgttggtttt aaccctcagc
gtgacagaac 133260cgaatacata agaaggctta gaggaggggg tggtggtgga gagatgggtt
agaggttaag 133320agctggttgc ttaatttccc agtccccaca tggtggctca caacatccat
aactgcagtt 133380ccaggggatc tgatgtcctc ttctgacctc cttggggaca tgcgactcat
ttggctcaca 133440tgcaggatgc ctcgggccac tatgctggct caggggtgaa ggtgcttgct
gccaagctgg 133500gtggatttga gtttggtccc tgggacccac aaggaagagt ttgacttcta
tacactgagg 133560tggaacatgc atgctctacc cgcaaattaa aaacttaaaa tttaaagagg
aagctgtaga 133620gaaatagctt ttaggaggat gcctaaggaa cttctctgcg ttttcaggtg
agattcagac 133680tcaaagccca atttaaaagt ttgagtgctg tcacgtgttc tgtatgccca
gttctggctg 133740tccgttgtct gtctttagtt taagagagca actgggtgag aagtaactga
gagtctagcc 133800gatgtttaat tctcaagatg tcctgtgata gcattataag ttgctgtgga
tgacagtgat 133860ggatcgagca cacagtgaaa tgagacagtg aggaagaaaa cctatattgt
atacgaggac 133920aataatggag cagtggcgag cgatattttg ggaatacaat agaaataaat
gattcaaata 133980tccaagagaa cgcaggttag accaaggtag gaagaggatc actgggctgg
agagatggct 134040cagtggttaa gagcactgac tgctcttctg aaggtcatga gttcaaatcc
cagcaaccac 134100gtggtggctc acaaccatcc gtaataatat ctgatgccct cttctggagt
gtctgaagac 134160ggctacagtg tacttacata taataaataa ataaatcttt aattaaaaaa
aaaaaaaaaa 134220aggaagagga ccactgagca cacattgaaa tggaagatgc aactgaaatg
caataccaga 134280gccagtctgt ggcatggcag caggaattac agtctgttca aaacccagca
cggacctgag 134340gctacattat gtccctctac actggccgtg actgaaaagc agcaacgtgg
taccctggag 134400caggcctgag gtcccaggta aggacacagc cctgacctgg cactaggaaa
caggtgtaaa 134460aactcaagcc cctgccgttc tctgggtgtc tggagcgagg ggtgcgaggt
accttgtctc 134520cctggcccag gttctcagat ttagcctcat gcaaagtttt ccagttggtg
ttgccccctc 134580cggcccctcc actcctgtgg tcagagatgg aaacaccatc taaggcttgt
ccttctgagt 134640caaacctagg ggagaaagaa acaaacgtac tgctgacagg aacttcacat
ccttctcaaa 134700tgtgctgtga atgcaccagc cccgagctgc gccccctcgg ctctcaccta
ggtctcagca 134760cacctagtcc actcaagaaa tgcaatgccg ttgctccttc catctcgtgt
ccttcaatgc 134820ccctcctcct ccacccgcaa ggtcagacaa actgccaaag taccacagac
cctttcccta 134880aactgggctc acattgacag cagccatgaa caaaggcagc aacagcaagc
cacactggca 134940cccctccccc caccctcggc ctcaagcctt caccccaatc accaaaatga
accagatgaa 135000aagggacatt tttgtttcat gagtctgcca aaaatgtttt catgggagaa
aatcttttaa 135060gcccaatcag ttatagaact caatcagaag gcattatctg ctgtgttaga
gatttgcatt 135120ctcagtgaga atttgtttat ggaaatatca aggttaaatt catttatatg
aaattattat 135180taaaattgac tatacttctt gaccatctat tggctaatct gaaggaaaag
aggccaggga 135240gaaggtagca aggacaggct agtggcagag gacagtgagc ctgagatgaa
aacacctcaa 135300ctacacgatt ggtggtaact aaagctgccc ccacacagga ctgactccct
aaggagactc 135360ccaacagagt agcggtggcc caaggcaagc agtcacatcc gtcagaggac
aaggtccata 135420aatccgtctc accactacta gccgctcaag tttaacttca cacaggcatc
ggcaaccaaa 135480gacggctcat caggagtgaa cagagccagg agacaggcag cctctcttat
agtaaaaggt 135540ccactcttag agccaagatc taaagatcgc taatccttgc ctgggtaggg
ggggtatact 135600gaacttgtag atcaatgaca gtattttgtt cgggggacag acatgcctgt
ttgtgatttc 135660tgtgatcaca gttactgaca gtgaaatatg aacaccttaa tgtagacaca
caattaacct 135720ggggttgtgc tcagagtctg ggagaaaaac caaccaacaa accagaccaa
tgccatggct 135780gtctctggtc agacttctgt cctacaatgc aacgctagct atcagcacag
ttacacagcc 135840agacccacac cactgaggat gccttaagtg acagacaccc cagatggatg
ctctaagtag 135900taaatattgt tattggttta catcagactc aaaacaaaaa tgaggagcgt
gaaatatgag 135960ctgtttttct tgagtctctt aaattctacg acacagctgg aaaccacaca
tgcccaccct 136020ggtactacag ctattcaatt tccttaagct tgggctaggt aaagtatttt
ctttgaccac 136080ctcaatcctt ctacacaaca ccctcaggat gaagtgctct tccaaggcag
agtttaatgc 136140cttaaaaagc tagatatggt gtggaggttc taacaagctc tcctcatctc
ctgagtgctt 136200tgggaatgac aatgtacagg caacaaagag tccattgttc tcactgtact
ttaccgagtc 136260cagtgacaac acaacagaag atgtgaccag ctccagagac tcagcagagt
aagaaagtac 136320attctacccc cacatcagct gcccacagtg caaactgaag atgctctcag
cactgttctt 136380cagcgcgcag ctgttaaagc cttgccgtac ctggcgggat caaggtcagc
acttagattc 136440taattgcttt ctctggctct ctatgttcat caggggatcg tgggtttgtt
ggtttaagag 136500ggctttcagg ccaccatctc agaagacata atgcctcgaa gaagtatagg
aaccactcca 136560ctcaatagca aaggcttggc aaaacagctt ggcagcagtc cacatgctga
cagtgtccag 136620ctctggttta ccagagcccg agcacagata acccctgagc tgagttgcac
aaaacctacc 136680agccacgaag cttataggcc tctgggatgt caggattcac aatgacagtg
ctggatgaga 136740ggaccgagag gctccgtcca ccgaagtcag agactcgggc tcctttgatg
gccatcacgg 136800gctgccgaga gccgtcaaac ttgtcagcct gcaaatcaaa cgcacgcaca
catcactgac 136860aagaagaatg tacaggatgg ttttatggag aagagtcagt caatgtctgt
ggactctaag 136920acagacctcc cactcgggaa ggagcattgc actacaagaa gctgcaataa
ccgatcatct 136980cacacagcga aggtcttcaa atacttactg gatagcacaa ctccctgagc
taaagcccca 137040ccctcaggac tcggctcagt gcaaggactc acatcttctc cccacagagt
tgtggtcacc 137100accttccctg acatgtccat caaatagata tttctcttag caacttctct
gttgttcgac 137160ttcactgtga ttttaatcga atcttcatag ctcttgcaga ttccaatgat
gtctgaaaca 137220aagaacactt tgtaagagct cccatcaagg ctcctcttta aaacacggca
gggacgaaag 137280gcaagcacgc tagtcacgca tcaccccaaa cactggagac aaaaatcacc
actctttgcc 137340ctcaaccctg gacgcaccta ctagtgcgtc tttagccttg ctctctaggt
caccgatccc 137400tgtgaaatca aactgaactg tgggtaagtg atggccatct tcacagggaa
ggacagaagt 137460ctcattattg aaggtcatct catagtcatt tttaacagcg gagaactgtt
tgttagcgat 137520cttcagggcg ccctttgaga agtaatacac ctgcaaaaga ggccaagtca
gggcaagcta 137580ctcagtccat caaagcccgc cccacaatgc agccttccaa actgtgctca
tggctctcac 137640agttcagcct gtcagctctg actcaacacc gtcttcttac cttgttcact
tcaataaggg 137700gaaagaactt gtccacttgc tcattgaaag cagtagctct gatttcaccc
tgcagaagca 137760agggagagca cattagaact gcgctgctag gccctcgctc tcaacagcga
gagcaggcca 137820ttttgacagt tagacaccat cctcggagca acagtccaag tgtaaagaga
ccctcacagg 137880gctgcagaca ttgcagaatc ccacatatac agttacattt tcaagctatc
agctctggta 137940aaacaaaagc agctcagcca cccgagccta cagttgtaag ttaacacata
aaacgaggga 138000gcctcgagat gatctggaag acaaaggtgt ttcctgtgcc aacctggcca
ccctagttta 138060atcccggaac cctcgggatg gagaagacaa tcgccccacc aagttattct
ccagcctttg 138120cacacgtgtg catgcataca cacgaaaata attcttacat cttatcaaaa
catttttaga 138180aggaatagct taatattccg ataatgaaac ctaatattct gggtcccgag
tgatgggctg 138240gaagcccaca ggaaggcgtg ctgaactcca atgattcagc actcttcctc
acgcccgcaa 138300cctaagtgac ggcttacttc acagtcagga gctggtgcca aaaatgttcc
aacaagtcat 138360gctttcaagc tgactgcact gttacttttc tgtcaggcaa tatgttacag
atggataaga 138420aacttaacta aatggctaaa accttaccca gacaagctgc aggaaattag
tttctgatac 138480tgaatctgcc atcacagtta agatatctgc tgtgctacac cgtgacaaga
ggaagaagag 138540ggggagagag ccccaaagct ttgcccttcc cggtatcatg gcttatttta
cgtgtgtggg 138600tgttttacca gcatgtatgt gtgagcatca ctgagtgtct gatgcctgag
gtgaccagga 138660gaggtactgg atcactgaca cgggagtcac agatggttat gtgctactct
gttggtgctg 138720agaattgagc ccaagtcctc tcaaagaaca gcgagtgctc ttaaatgctg
agcagtctct 138780ctggcccctg cattttgctt taagtaaagc ctagtgagta taaatgatca
gaagccctcc 138840ccgcagacta gttaaagctg agtagctgct gctccttctg ctggaggcaa
acccgccctg 138900ctcggcaggt atcaccccag ggcttcaact gtgcccctag caatgtaatt
agagcgctgc 138960agtctctgca gacggagact atttacagtc caccaatctg tattgctaca
gacgcgtccc 139020tttaggagca ctttatctca tgtctgaccc tgtgccccag atagcatcaa
aggtctccaa 139080cagaaggaaa gccacaatga ccgccatgtt cctcggagct gagccccacc
cgnnnnnnnn 139140nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 139200nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nntttttatt aattatgtat
agatatagat 139260atctctatat atgaatgagt atcctgtagt tatcttcaga cacaccagaa
gagggcatca 139320aatcccattt acaggtggtg gtgagccatc atgtaattgc tgggaattaa
actcaggacc 139380tctagaagag caatcagtgc tcttaacccc tgagccccct cccttttttt
tttttttttt 139440ggcagcttat ttttaattgt tttaaatcac gtatatgttt ctgtatgtag
ctatgttcac 139500ataagcgcag gcactcatag aggtcagaga tgtaagatgc ctttggagct
ggacttaaac 139560atggttgtga gccatctgat atgagcacca agtggacttg ggtcctctgg
accgccttgc 139620tgccttccac ctagaattac agtatatgca attagctgct cagccacccc
ctcagtcctc 139680cttagtactt tcaacatagt gtaaatcgga gagcactcct tggtgcaaag
attctgtgtg 139740ttctatcatt ctctacaccc aggcttcatt gggacgtgtg atgccagcga
tgattcattg 139800gcgtgaagta tgagttagag tcagacaatc agtgctctct gctctgagtg
ggtctccctg 139860cccctgtgtc tctgagacag gggaggattc tcctgttgga tcacatagtg
catagagccc 139920tgtgtgcaca gcgctctcaa ccatttgttg ccatttcaaa tacagtgact
tcagagtctt 139980ccttgaaata acagatttct ccattttgtt tgttcccttc tttcgtcaca
aagacctgag 140040gatgagatgt ttttgaagga actttctagt agttactcgg tggaaaagga
caatgatgct 140100cccctcttct acagagaaga aggaaacagg aaattccaag aaaaggagta
cacagatgct 140160gcagtgctgt actctaaggt aacgtgcgtc caaccagcag ttgaacaagg
cggaagatag 140220aggcaaagtg cagactcata accagtccat tgttttgctt tggttagttg
ggttggttgg 140280tttgtttgtt tgtttgtttc ttcttcttta ttagtcggat tggatttggg
aaaaatgtag 140340tagaattcta ttgttgatta tattcacaaa caaaagactg tttatatgtc
ctgaattttg 140400gcaagattct tcccatattc cttcagaccc atagcagcag caagcttagt
ggccctttgc 140460caggtattct cactgagctg tacagcattt gtctgaagaa ctgatgaaca
ttttagctta 140520tgctcctatg ttagtaaata gctaggtttt tagctagcca ccttgctagc
ttacaactag 140580aatgcctctg gctgagtttg gtggcacttg cctttagttc cagcactcag
gaggcagaag 140640caggtgaatc tctgagttca aggcagcctg gtctacataa caagttccag
cctggccagg 140700gatatgtagt aagactctgt ctcaataaag taaacagcca gaatgactta
aatattgcta 140760aaaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag
aaagaaagaa 140820agaaagaaga aaatgagtgg atttgccata gcctagtcaa actagatttt
cttggttata 140880tgttaacata ctactattaa caacaacaaa tactttacac taaccatgac
aagctatatt 140940tttaaattat tattttatat gtatggtgtg ggaagctgtt gcagagtggc
agttggctac 141000tgctggccac cacacataca taggcagtga aggttctttt gccaagacaa
gttaaccaat 141060cagatgtgag acacgcctct cctaggccta tgtaagcagc accagttctg
ggctcagggt 141120ctcttcgcct ctacaatcaa gctctcccaa taaacgtgtg cagaaggatc
ctgttgcagc 141180gtcgttcttc ctggccagtt gagcgcgcac aagagtatgg gttttcacct
acatttatgt 141240gaactgcatg tgcacttggt acccagggag cccagaagag ggcatcatat
cccctggaac 141300tagagtcaca ggttatgtat gggttctagg aatcaaaccc atgtcctctg
gaagaacagc 141360cagtgttctg gatttaactg ctgagtcatc tctccagccc caccatgatg
gatttaactg 141420ctgagtcatc tctccagccc caccaagctg gatttaactg ctgagtcatc
tctccagccc 141480caccacgctg gatttaactg ctgagtcatc tctccagccc caccacgctg
gatttaactg 141540ctgagtcatc tctccagccc caccacgctg gatttggaag cagaactgag
gctttgaaca 141600ctgtgttatt ctaactcttg cttccttgga accctgagaa aattccttct
attggctttt 141660cagagatctg tatgggctta aaacaagaat atgtccaact cttgatctct
gattttatat 141720attaaaagaa tagagtgagc ctaggagttt ggcacactcc tatgttccaa
gcttcaggaa 141780gggaagaagg tcaggagttg aaggctagtc agtcttagct ttgtgacaag
tttgaggcta 141840acgtgagctc tatgagaccc tgtctaggag aggagaaaga aaggggtaaa
gtgaggtgtg 141900gctccacgct accacttgcc taacacgcat gacgcccaag gtttccgttc
ccagccctgg 141960ggttcagagt attgtgcaca ccctgtatct ttaaaagtca atagtcaacc
cttggaatta 142020actttccgaa aaatattaaa gctacatcat ccactctaca aacttgtaaa
agcccttctt 142080tgtaatggtg taagaaagta tttggcttca gttttatggc ctttgcatcc
taaatgttac 142140cccatggaaa gttgcttaat gaaaacaggt aaaaataaga caggagggac
tgcaaagatg 142200ctcaggctca cgagtgctta ctgcccttgc agagaccatg gcaggcagtg
ggtctcccta 142260ctgcagggaa tccaaaaccc tcttttggcc tctgcaggca accacattaa
cacatgcaca 142320catacacata attttaaaaa taaaataaat cttttaaaat gagctctaga
acgagtttga 142380catcagtcta ggctacacaa gaccttgtct caagaagaaa gaaatgaagg
tttgctgtgg 142440atggagaggg agatgcactt cctattccat gaagctgcta ttttggtgat
tatggtactt 142500tgcaatttta tagagagctg ctattttctt tcttttaaag taatgcttgt
tgttttgact 142560gtaaaagtaa taaatgttac tttggaaaat atagagaagt ataaagagta
aaaaaaaaag 142620tcataaccag tgaagtaccg ttaacatttc tgcttctatc cggccagcca
gagtttttcc 142680tgtgagaatg tgtttttctt ttacaaaatt gggataatgc tgcacttact
gttttgtagc 142740ccactctttc ccttcacgat ttattgtacc cattttctca cagtattaaa
ttttcagctc 142800caagtaattt tccatagcta actctgtgtt ctgttacaga gaagaaatgt
acttaattta 142860agatctaata ttggcataat atttggcatt atgatgctat aataagcatc
tttctgtata 142920aatattttta tgtacagcca gtgttttgtt tgggatgaat tagcacaaga
gtaaagtgtg 142980gggtcaagcc tctgagtgac gctgaattgt tctcaggaaa ggactagtta
acatccattc 143040tgaaagaatg tgggaatgct catttcttaa gcaacactgg ttattattac
atactattat 143100tttgttagta ttatacattt atttaggggg ataggaaata tgctcatata
gttcaatttc 143160aagagttgca aaaatatatg gtgtggtttc ttttcctgtc ccctagttta
gtctcacttc 143220cagccccata atctaccttg ttaaatatac tgtatataag gcatatgtag
gtgaataaaa 143280aacagcctag tatggtggcg ttaacctcgg gttacttgag aactgctctt
gcagagaaca 143340tgactttggc tccgaaagcc ttcctggaat tccagctcca agggatgcag
tgcctctggc 143400atccttgggt acgtcactca tgtgcacaca tacacatttg gtttttaatc
ttaggaactc 143460caagtgggcc aatgagatgg ctccatgtat aaaggcagtt atgcaaaggt
ctggatgaca 143520tgagttcagt cttcagattc tgcaagataa caggagagga ccaacccctg
cgagttgtcc 143580tctgacctca gtacacatgt catggtacgt gtgtagtatg cacatgcaca
gaagtcccag 143640cactcgggag gcagaggcag gaggatctct gagttttagg ccagcctggt
ctacaaaacg 143700atttacagtt atataaagaa actctgtttt gaaaaacaaa acaggggttg
gggatttagc 143760tcagtggtag agcgcttgcc tagcaagcgc aaggccctgg gttcagtcct
caactctgga 143820agagagagag agagagggag agggaaaggg agagggagag ggaggagagg
gaaaggaagg 143880aaggaaggaa ggaaggaagg aaggaaggaa gaaagaaaga aagaaagaaa
gggaaagaaa 143940gaaagaaaga aagagagaaa gaaagaaagg gaaagaaaga aaggggaagg
aagaaagaaa 144000gaaagacaaa gcaaagcaaa actaaataaa atacatacaa tatattaaat
tttaagactt 144060gagggcatag atcagtgtta taatgcttac ctagcatgcg taaaactctt
ggcttctaaa 144120cctagcaccc taccgcaaaa tatttgctct gtcttgctaa attatattgc
tagttgtcag 144180actactgtgt actttcacta gcaacataat gagaatgttc actatcccac
tcctctgtca 144240agtaatctgt tcttggtttt atttttcttt gccaaattga tgggtgaaca
agtatttcag 144300ctagcctaga acacacagag aactctcttt atgaactcaa gtttcttatc
cttttatgat 144360ctccagaggt ttgtttttgt gggtttatta gtgttttttg tttggttggt
tgtttgtttg 144420tttggttggt tggttggttt tgctttactt tttcttattc atttttttat
ttctttattt 144480ttttgttttt aatttaatgg actggttcca tgtggccaag gatagcctca
actttgtagc 144540agaaactggc tttgaacttc tggtcttcct tcatctacct cccaagtgat
gggattaagg 144600cacgtgccac cacatctaac aatatctggg tttctttatt ggagtttgaa
agggattcct 144660ccagcattac tttgactctt catagtttct tccagagtta ttacactttc
atttgttaca 144720ttaagagttt gatccagggc tggagagatg gctcagtggc taagagcacc
aactgctctt 144780ccagaggtcc tgagttcaat tcccagcaac cacatggtgg ctcacaatcg
tctgtaatgg 144840gatctgatgc cctcttctgg tgtgtctgaa gacagctaca gtgtaatcat
ataaataaaa 144900taaataattc tttaaaaaaa aagagtttga tccatttaca ctggacttct
tgaggcagca 144960ggatcataaa ttcaaggtga gcctgggtga actggcagaa gtggcagaag
ctgtgtctca 145020cactgctgta ttcatttcct cattgatctt cagaggtttg ctaacgggaa
gtaagtggaa 145080cagaaggttc agtattcttt ttttcccaat tctattcagt ctttagtagt
agatccctca 145140ttatctgaga tgcagagtcc cctttattcc tgtgaccatc tcgttgtttt
tcagggagtg 145200tctcattcaa ggcctaacac tgaggacatt tcactgtgct atgccaatcg
ctctgcagcg 145260ctcttccatc tgggtcagta tgaagtgagt attgaagaac ctggtgtcct
gcctgtggct 145320gcagtggaaa atgagctcct ctctgttctt ctgcacacat tgaaatcaac
tagcttgcaa 145380acactgacat ccacccagac ccattctctc ttctgactca tgtcacctct
cataggtgac 145440cacaaacaat atgtagttga caagaagtag ctatgtcatt gtccacagtg
catggatttg 145500ttccaatagg ccagcacttc tgtgtccata tcagctagat gtgctgctga
tagtatttta 145560gattccaaaa tgtgtccaga tattacctcc ttcgtttgtt tcttctaaat
aagccaggca 145620caaagacttg aaagatggct tgatggctct tccagaggct gggtttgatt
cccagcccca 145680acatagcagc tcacaatagg ctataacgtc atttccaagg ggtctgactt
cctgttctgg 145740cctctacagg cagaaagcac agacatacat gcaggcaaaa cacataaaca
taattgaaag 145800aagatattaa ataatagccc acgcttgagc ttattcctct gatgatacag
ctctcctgaa 145860gtcatcatgg gcagtgtaaa agtaaaggtg ccccgccctg cccaggggca
tcagtgaggt 145920agctgactgt gagtggcttt cttctcatcc cctaactgct gcaacatcat
caactgtgga 145980gcattatatc cgtggatttt aagttaggaa atgacaaaga ttagatctat
ggccaggcac 146040tagtgcacgc ctttaatccc agcactcagg aagcagaagt aggtggatct
ctgtcagttt 146100gagtttacag agagtgtcta ggcagccagg gctatataga gaaattctat
ctggaaagaa 146160taaacaaatc agatctgtat ttcaggaaga tgcctatgag ctgtttgaca
tgtgtgatag 146220aggtccttaa ggacaggaaa gtattccaca tgtgtgcatt ttacagaaat
tggttataca 146280ctggagttaa ggactgttgg aatagttgaa tgttcacact cagtgttctt
caaatcaaat 146340agaagaacaa gaatctatct gggcatggtg atgcacaact gtattcctaa
catgtagaag 146400actgaggcat gctatgtgtt tgtggctaac ctgggctaca tagtcaatat
tggacagtca 146460gagctgctac taaaacctaa aaaacaaaaa tctaataatc tgggatttta
tattttcctt 146520tttttaaaaa aagagtgggc agaatgtctc tgattttgtt cagatggcca
cagaacctag 146580aaaaactgct gctgctgctg ctgctgcata gcacacagct aatatttgac
tatacgtata 146640taaattttgt tgtatacttt agctgtgctg tcaactttgg aaaaaaagta
tcccagttta 146700tcattttaaa ttggcactgt acagaaatta acagccatat tagtctagac
acattaaact 146760tcatttttcc atttatacag aagcaatgta ctgtattaaa tattcagtct
tatctacagg 146820ggtttgatta cagaaactat caaagtattc tctaaatgat gaaaaaagat
tcaagaatct 146880gactgtagat ccaaaggaca agtggagaaa aacttaggaa gaattttccc
tttatcccct 146940ccctatattg atcatctctt ttacttctaa taatagtggc catttattga
acatacccag 147000gagttccttt catcacttta gatatataat ttatctcatc ctaaaatgac
ctgttgatga 147060gtcatctctc tttcagatga gaaacaaaga cattgaaaaa tctaacttgc
ctgcataaga 147120tcacacctag cctcttactc actccatgaa tattcctttt ttttttttcc
tttgagacag 147180aatctcacta tgtagttctg gttgtcctag aactcaatat atagaccagg
ctagcctcaa 147240actcacagag atctgatagc ctctgcctgc cgagtgctag ggttaaatgg
atgtgtcacc 147300aagcccagca aaatttacct tcttaatttt ctaagacgtt tctcctctta
aaaaatggaa 147360ctattgagcc agtcatggtg agacaggctt aatctttaat ctcagcactt
aggaggcaaa 147420gacaggccta tgggttcggg gcagcctgat ctatagagag agttctatgg
gttaaagttt 147480agggttaaag ttttgagaca aaactttgtg tcaaaaacaa acaaacaaag
ccagactgct 147540taataagaca aatcagacat aatattataa acaagtatta gtgtcactca
attaaaaagt 147600cactcaggag gtgagatcaa tccccagcat aatgagggga ggaggggaga
gaaggaaatg 147660aatgggaggg gaggaggaag gaagagagaa aaaggaaaga tctcaaagca
gaacacagga 147720tgtaatttaa ggcctaagct ctcgactgaa gttgtccact tttaatgacc
cttttcatgc 147780tcatggtttt ctgtcttcgg tacatagtga agagtggaaa ccaagggtca
tcctggaatt 147840tccttttgtt ttcaggcatg tcttaaagac atagtggaag caggtatgca
tgggtatcct 147900gaaagactgc agcccaagat gatggtgcgt aagacagaat gcctggtgaa
cctggggaga 147960ctccaggagg caagacagac catcagtgat ctcgaaagca gcctcactgc
caagccaacc 148020ctggtgcttt cctcttacca gattctgcaa aggaatgtcc agcatctgaa
aataaagatc 148080caagaaaagg agactctccc agaacccatc cctgcagctc tcaccaatgc
cttcgaggat 148140atagccctgg gggaagagaa cacacagatt tctggggcct ccctctctgt
cagcttatgc 148200acacaccctt tgaaaggccg ccatctagtt gccacaaaag acattctccc
aggagaactg 148260ctggtgaagg aagatgcttt tgtaagtgtc cttatcccag gagaaatgcc
acgacctcat 148320cattgccttg agaacaagtg ggataccaga gttaccagtg gagacctcta
ctgtcaccga 148380tgtctgaagc acactttggc cacagtacct tgtggcagct gcagctatgc
caagtattgc 148440agccaggaat gtatgcagca ggcatgggac ctctaccata gcacagagtg
ttctcttggg 148500gggctgctcc tcacactcgg ggtcttctgc catgttgccc tgagaatgac
tcttttagcc 148560agatttgaag atgttgatag agttgtaagg atgctttgtg acgaggttgg
tagcacagac 148620acctgtttac ctgaaagcaa gaatctggtc aaggcatttg attacacaag
tcagggagag 148680agtgaagaga agagcaagat aggtgaaccc ccaattcctg gatgcaatgt
caatggaaag 148740tatggaagta attataatgc tatcttcagc cttttgcccc atactgaaaa
gcatagccca 148800gaacacagat tcatctgtgc catcagtgtc tccgcactgt gcagacaact
caaagctgac 148860agcgtgcagg cccaaacctt aaagtcccct aagctgaaag cagtgacccc
agggctgtgt 148920gcagatttga ctgtttgggg agcagccatg ctgcgacaca tgctacagct
gcagtgtaat 148980gcccaggcaa taacatccat atgtcacaca ggtaagtcag aaatggtttt
tacttacatt 149040attggtattt caagagctaa tgtttaagga gaaaaacact ataaaggaag
cctggcatca 149100aataaatcag tgacctaaaa ggaaaacaca gccgtcttat atatcattat
gctattgaga 149160agctttgagc acatttctgt gaacccagag cttgggaggt ggagatagga
tgattaggag 149220tctaagacag ctttagctat acagcacgtt tgaggtcagc ctgaactaca
tgagaacttg 149280tctcttaaaa acttgagcca gagccaggtg gtggtggcac atgcatttaa
ttctagtact 149340caagaggcaa aggcaggcag atccctgaat ccagcctcgt ctatatagtg
agatccccac 149400caggctacat agtaagatcc tgtttcaaat aaataaatat aacaaaaaca
gcaataataa 149460caatagcaac aaattaattt tttagatgta tttatttatt ttatgtatga
gtacaccatt 149520gcttttttca gacacaccag aagagggcat tggatcccat tacagatggt
tgtgagccac 149580catgtatgtg gttgctggga attgaactca acacctctgg aagagcagtc
ggtgctctta 149640gccactgagc catctctcca gtccattaat taaaaattta aaactagagt
atttttaaac 149700atttattcat tttgtgtgtg gtatacatac tataatacag gttcataagt
caattctctt 149760ctaccatgtg tgtcttggag atcaaactca ggttcttagg catgggagca
agtatttact 149820tcctgaacca tctccctagc catttctagt attcttttct tttgtcttga
aagatttatt 149880tattatatgt aagtacactg tagctgcctt caaataccgg aaaggggaat
caggtcttgt 149940tagagatgat tgtgagtcac catgtggttg ctgggatttg aactcaggcc
ctccagaaga 150000gcagtcagtg ctcttaactg ctgagctatc tccagcccca tttttagtat
tcttattaag 150060tggtttccat tttatccaaa gatgccttta agggcctggg aagatggctc
agtgggcatt 150120gaacttggtg tgtgagcatg aagaccagag ttcagatccc tagcacccag
gcagatgctg 150180aatgatggtg gcctgcctga gattccagga caacggagac agacaggggc
cctagctaac 150240catactacac actagctgag ctgtgtgctc aagagagcag ccctggctta
ctgtgcagga 150300tggagagtga tcatctccac aggcaagcac acacctgagc acacagacat
gcacaaagga 150360aagaaaagtc cttttaaggt ggtggtggtg ttgttgtttg ggggtctttt
gttttgtttt 150420tttctccccc tccctcattg tgatggcaca ttcctttaat ctcacatctg
ggacaaagag 150480gccggaggat ctttgtgaac tggaggtcag cctgttctac atagcaagcc
catttcagcc 150540aggacgacat agatataccc tgtctcaaac agacaaaaat tatttatttt
atatatttga 150600atgttttgcc tgcatgtatg tctatgcaca ttatgtctgg tgcccatgaa
agccagaaga 150660gggcatcaga tctctcagaa ctggaatttc agacacttat caagtactgc
ctgagtgcta 150720ggaatcaaac caaggtcttc tggaagagca gcaagtagtc tttttttttt
ttaatatttt 150780tttattacat attttcctca attacatttc caatgctatc ccaaaagtcc
cccataccct 150840cccccccccc ccccgagcag caagtattct tcattgctgg gccatctccc
catctccttt 150900tctagttaat taagctgaaa gggagggagg tagatgttgc ccaaacttag
gatttattga 150960cagattaata ctctgttagc ctaactacac tatagaagct tattctttag
actttcacat 151020tacactgtcc agattttgcc atcctttttg ngtgtatatg tctacagatc
ttaattcagc 151080tgccaattta tacagtgttt ataggtattc tttgtgacgt ggatctttta
cccatcttaa 151140agcagtagga tttgaaagct gacatttatg tggcctatgg tcctgttaaa
tcacatttca 151200agttagtctc tgtggtacac attttggggt ctatctgcgg ttccgcatct
cacacttttc 151260cctctcaggg tgtccagaag ctgctgcaca ctgggctgga aggatgaagt
ggagtccaga 151320gtgagtggaa ttctgcagca tcccggtcca gctgggagtg aatgctgggg
tcaggaggag 151380atgggtgaga gggccttctc caagggcctt cttagtgtta cagctctagg
caaaggcctt 151440ctctgacaat cttagcctgt gcatagtttt ttattcgaga tgagcttgta
tgcatacact 151500ttattggcag taaatcagag gttatccact cttagggaag gagataggaa
tacccaaggt 151560ggacagaggt cattggctga aggataacgt actgagatgc tcattagcac
ggggaggcat 151620cccaggaatc tcaggtgctt gctgactggg tttcttaggg ggttgagagg
gtagcagtga 151680tttcaccaag gttatgtatg gcagagggta taggggtttc aggctcccca
gacaaagaag 151740gagaaggaga agccctgctg ataagggaag tcccccattt tgagccactt
cagaaggcta 151800tcaagacact gatagacttt gtccttaatt agcagggccc aacaagtgtc
tgttttcttt 151860tctgccttca ttggctcttt gagccactgc tacaaaatat ccaaatctgg
gccaggaagc 151920tggctcggct agtaaaggtg tttgcctcta agcctgaagg cctgagtttt
cacttgattt 151980ggtttggttt ttgttttttt gagacaaggt ttctcagcat agccctgggt
attctggaac 152040tcactctgta gatcaggctc aacttgaatt cagagatctg cctgcttcta
catcccgagt 152100gcttagatta aagttgtgcg ccaacactgc ccacctaaaa aaaatatgag
gggctggtga 152160gatggctcag tgggtaagag cacccgactg ctcttccgaa ggtccgaagt
tcaaatccca 152220gcaaccacat ggtggctcac aaccacctgt gatgagatct gatgccctct
tctggtgcat 152280ctgaagacag ctacgggtgt acttacatat aataataaat aaatcttaaa
aaaaaaaaaa 152340aagaacacta ttcacaaggt acaacaacag tgtactagga aactttaaaa
tagcccatca 152400ttcattctag gggaaaaatc tttttaaact ttagtgtgta agaaagagag
aggggctgta 152460gaaaggccac agcacgtgta tccaggttag gagtcaactt ttcagaagcg
gagtctcccc 152520ttctacctgt ttttgaggca gtctcttgtt tctgccctac actttgtaca
tgaacttcaa 152580gatggttgtt ctatttctgc ctctcatgtt gccctatgca tgctgagctt
actgatgcca 152640gccaccacat aagcatggca ccagcactga gccaagggca ggcatctggc
tcacatggag 152700agttacccat caagccatct tgctagcccc agaaaatgta tttttgacag
gtgtggtggt 152760gcacatattt aatcccagca ctcaggaggc agaggcaggc agatctctgt
gtctgaagcc 152820agcccagttt acaaatcaag tcccagaata gccaaggcta catagagaaa
ccctgttttg 152880aaaaacaaac atccttctgt gttccagtca caatgactgc tgtaacaata
atatgaggat 152940ttgggcgtgt caaaaatcac aagtagcaag gtgtaatggg caggccttta
gtctcagcac 153000ttgggaggca gaggcaggag gatctctgtg agtttagcac agccagggct
gttacacaga 153060gaaaccctgt ctcaaaaaaa ccaagcaaaa atagaattac aagttaacca
gggtattggt 153120gttaatatga aatagcagaa ctcaagatag ctaatgaaac aaaggattct
attttataaa 153180ggagacattc catactgaaa tatatgcaga gcctgatgct tgcctagcaa
ctgaactaca 153240cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 153300nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngcctgggca
agagtaagca 153360aaccgtcagg ctcagaggca ggatggcagt gatgtgtttt ggggcacaag
gccccctttc 153420attaaagcac caaatcttgt actaaaaaca gccctctgct ctcgatctgc
agagatacga 153480gagaggcaca catagcttcc gggcccttgc cctctgctct cccggtcagt
tcctggactc 153540ctgggagaag cttgaagctc aagaactggc cgctgttgcc tttgtgctga
cgccagaacc 153600agcagctatt cagcagctgc ggctcttggg caagcttgga agtagctcgt
tccttctcct 153660tctgcagggg aaagacaagg agatggcact gagctgggta gccaggatgt
gggaagtaaa 153720ttgtgtctat gtgggtgagg gaagtgccgt gtgtgtgtgt gtgtgtgtgt
gtgtgtgtgt 153780gtgtgtgtgt gtgtgacaga gacagaaagg gagagtgcat gttcttgtgt
gttcctgtga 153840gtcagtgctc cctttggctc ctcctgccaa aaagcatgct gtgttctcag
agtcaagctt 153900ggccaggctc gcccacccca ccaaagccca ggatctgcca ccccattaag
aatgcggggt 153960taggaaataa aaaacagggt tcttgtccaa gagaattttt attattattt
tttctctctt 154020aattggattc agcatttctt actcctcagt atcctctctg gtagggaatt
caggtctgtc 154080tatgcagagc acaagagact gtgcttgcca gaaacccttg ggccaacagc
cctattccct 154140gactgggctt gcctgcaggc tgccttctgg gctgacccct gagtctggcc
ctctgacctc 154200tgcccgtcct ggggtcatcc aggaggagca ggaccactgt gatacagggt
tcctgagatg 154260gctactgaaa caccctcatc tgctaaggcc actagatttt tttcccactg
ccagactact 154320cagggccctc agtaggtcac tggccaaaga gcctagatgt taaaactgca
gctaaagcct 154380ctcctgaggc cagagctcag agcctccctg gcctgcacaa agtgctaaga
aggacattgt 154440ccatccaacc cattggacta acactgtaga cgctgccttg tctgccagct
tgaaaacccc 154500agaaacctct tccccagctc cgcctagctt gcctccagca ccctgacatc
cacttctctc 154560gctaatgtct gcagcttcta cagtaggggt gacggggtgc tccggggtgc
cagacaggcg 154620tgtgttgcat ataaaaacga ggtgatgttc taagtatcta agaatgttgg
tcccctgaag 154680tgattcttgc tgcttctctt ccttcccgac tcttcccact gacacctttg
cccccagcaa 154740gcccagggac tattgttgct ggctggggtt ctgaacaaaa ttgccgcagt
tttttgtttt 154800tttttttctt gctagaagtt accgatacag tccttaattg agcaaatata
ggttcccgta 154860tagtttataa acatgataag acatacagtt tggtaaggag tgggttgggg
gacctgtgca 154920tttatatatt tatatatata tatatattat gtgggtgtgt gggcagagtg
aggatatata 154980taagtggaca taggtataaa actgcacctt ctgtgtgact ctcattgcga
gtacagttct 155040aaatgtcatc cactggcgat ctctcctttg gtgattggtt cttggacccc
agcaggtctg 155100ccgggggctg cctgagacgt caggaatgag aggcattacc cccatggata
gggactgagg 155160gtggcatagg gttggacagg gcaggttaac taagtgatct cagacaagag
ccaagagtgc 155220tctgagattg ctggttgccc cagctggctc tgggagagcc ttgttctgag
tcctgctcct 155280tccaaaacca gcagggtcct tcagcccttc tctccaaatg accaggcttc
cgcagagccc 155340agcttcttca aggggcgcat gtcccgacac cactaatgac tcactttgcg
tgcctttgac 155400cactgtgctg gagtggatac ggtccagagg cgctcggtca ggacagccga
gtgagacgtg 155460atacccttcc cgtctacggc tgtacatttt gggcttataa tccaccagga
agggtccaga 155520cggtggctga aggcttcagc agccttttcc tgaaacccag cagatcttcc
acttaggaaa 155580aaaaaaagaa agaaagaaag aaaagaagaa aaaaattctg ttccttgctg
gacatgtggc 155640agtgctgggt gacggagccc tgggtcctca gcggagagtg actgccagcc
ccagtatcca 155700ggccaggagt ggggccccaa gggccgtgcc tgaggatgcc tgctgcaccc
cactgggggc 155760acggatgggg gtcctgcgag cacacttgcc cttcctcttg ggtcgtgcgg
tgggcatgat 155820gtcaaagctg aacttgtgct gatagtcggg ggcatagtcc tgcagttcgg
taagctcttt 155880cccagagctc accttagaga tctggttccg gttcctgtgg ctggtgcagt
tcttgcctgc 155940cttcttgtaa cctgacctgg agccaggcgg atggccatgt gggtggcctt
tgtccctgga 156000ggccccatgg gacggatggt gctccttgcg ggcagccctg tcagaggtgg
taagcgtgtg 156060agacttgatc tggtgaggag acactggtcc tgtgcagttc cggaagtcct
ccaccctcag 156120cagcttcaga tcctggcctt gccgcagctc gggggtcgcg caggggacag
cagagctaga 156180gccacggaac cttcgcagcc attcccacag ggaacgtgcc cggcagccac
agtcccaagc 156240attcccattg aggcgaagga actccaaggc caccaggggg gccagacagt
caccctgcag 156300ctcagtgagg ctgttgttga agagaaagag ggtggttagc ctgtggaggt
catggaaagc 156360cttgtggtga acccactgta gctggttctc atgcagcagc aaccggtcca
ggttcaccag 156420gccccggaag atgccttggc ccaggctcca tagcttgtta ccatggagaa
acaagtgact 156480gagattgacc aggtccacaa agatgtcatc ttggaggtac tcgatatggt
tgtcctgcaa 156540gtagagatac tgcaggctgt gcaggccacc aaagatgcct gcgggcaggg
cgctcagtcc 156600acacttatag aggtagaggg cgtgaagctt caccaggcct tggaaggtct
cgggtgccag 156660cgttcgcagc tgtcggttgt ctccaaggtc tagctcctcc agatgcacaa
agccctcgaa 156720ggtgttggga gcaatgaaag tgatgttgtt ggagtagatc cagagggtga
ccatggcggg 156780gctgaagtgg ccctgctgga ggaaggtgat gcgattgttc tgcaggaaga
tgcgctcact 156840gtcctctggg atgccctccg ggatggcagc aaagttgtgt gcctggcagc
tgacagtcat 156900gggcgcaggg tagcacacac agtctcgagg acaaccacca cccagaggta
gctctccagc 156960gagcagcaac agcagcaatt ccacacagca ccctggtggg gagagacaga
acagcagtga 157020ggggctgccc agaggaggtg gagatagatg gggaacagag ggtggaatgg
gggactggca 157080aatgactctg ttggctcaca gaggttctgt cctctgtatt gtatgcaggg
gtcccttgga 157140cagggcattt ggggccaagg cccacattat ctcctcacct ctttagctct
gtccctaaag 157200tctctaattc catccgaaca cttcttcaga ctgtcagccc caccgcaggg
tggaacacac 157260ttgtgaacac aggcgagtcg gcctccggct ctgggtccgg ctctgccact
cgctcactgt 157320tagctgcctt agcaaggaat gactctaaca aagcaaatct ggagtcctga
atgatcactt 157380tatttaaata attcctcaaa ataaagaaag cattgagtcc atggtaccaa
agcatgcctc 157440aataagcgcc tctttcacac tgtggtacaa aaacttcaaa cctacaactc
ctccatggct 157500gtttcctcat gagttaaaca cagttcacag ggctgtgtgt acagacaagg
cacatttctg 157560tgaggggctg tgagtgacac cagggctgac gcacgaggct tcccttgggg
ttcacagtac 157620tgccagatcg aggctgcatg ctcctctccc ccattcacac ccccccctcc
tgtgctggag 157680attgccaggc tgtggctgta aaaccgggcc ttgcctcttg actgtccaga
gcatttcctc 157740tgtagcttcc ctctaattgg gcattaatta ggcattcgtt aatggatcct
taaaataatt 157800attttcggat gtgtccagcc tgtggtcggg taataggcct atgctcatta
tggaagccgc 157860ctcattatgg acgattgtca ttacctgcct ttttccaggg tcacagctgc
cccaagtggc 157920ccagcaggcg cgtcaggaag atggggacag gctccaggcc tacgggcgcc
caccctgaag 157980ggccaggcag ccacgaccta tgtcgcctca gttggcctct tgccccttct
tttccagctt 158040gttcagctgg gactcttggg agagccaggg cccctggggg aatatgagct
gagctgaatc 158100ttcttgctgc tagctgtgct cagagcaagt ggaaggagca gggaccttct
gaccaggctt 158160cccacttggg gtcccaggcc cagggactgc ccaggccccg gcagagtagg
ttgccacctt 158220gacttctgac gccccccccc cattcccaac agaaacagca tcgtaagttg
acagctccca 158280gctgttggga gttatgggct cccagagagt ggcagctgct tctcgtccct
gtaatcaccc 158340ggcttcagct agaatgtttc tagcacataa aaatcatcgc atataattta
gtttttgcat 158400aattgggttc agttgtgatt tcagagcaat tatgacctca gcagcagggg
tggcacagca 158460taggacccct ttctgggccg ggccatgccc tgcaggccct ctgcagtgct
ttctgcccac 158520cggcccttag acagcatgca ggctataacc atctctgcct caatttcctg
ctccaaaacg 158580tgtctagatg ttactctgtc gatcttcctc ctcagcatcc tgggtgtggc
ctccagcctg 158640ccagcctctg tcctagggat cctgtgctgc agagggaggc acagtcggag
ggaggggagc 158700ctgccctgtg ccaccagcac tcacactggc tgcacagtcc acagacccac
agctccaacc 158760tccctgcttg gcttgcaccc tctcttccag gaaggccatt cttgccagaa
cctttcccaa 158820cggtcccctg ggaaagcctg gactctaggt tcaaggacat tcatgatgct
tgccccacat 158880tttatgctgg atgagacaca gcagagcctt cttcactggg gggtcctgtg
aaaatgaaag 158940cttttcttcc ccggcctgca gctgcaggca ggtaggggtt gcagtgggct
tatcactaat 159000accattcgac atttgtacag ctcatcggag tttacagagg gcttttgttg
taccctcaac 159060ttcccctgtg ttctcccacc tactgtggct gctctgtctc tgtgcacccc
aaaagaatct 159120ggaagtccct tggggagatt acccccctta cataggggcc tccaaggaat
acaggctaca 159180gctctattta ggaaaaaaaa aatcaaactg aaccaaacct caggtgtgga
cttagtaacc 159240agtttataaa cataccgtgg caagtggagg aggcaggcgg cagaacggca
tgaaggtagg 159300actggggttt tccctctaaa agggcaatgg ggggacaaag ggactctagc
caagacctga 159360tgcagaacca ggctcagttc ccctgttatc tcaaggctat catcactagg
aggcctaagg 159420caatggacca caaggacctt gtccttgtag gacagtcact tcctgcagtg
aagtgctctt 159480ctggaagcat actaatagga tgtaggctca ggacagctgg tctctgtcct
ttagtatttt 159540tccacatgcc agggatgtta accttccaag cctccatctc ttctaatggg
gggggggtgt 159600tgaggggctc agccactctg catccatgtc tttgaaagcc agtggtatta
ctccaggacc 159660ctgagcaagc tgtcctagtc agccctggcc acttctggac tccttgcctg
agtcagtagg 159720tgccaatcct aggattgtca ccagcaggtt tcttcctagg gaggcaagca
ctgtatcacc 159780atggcgcctt ctatgccccc tctatgaggc ccttgggagc cccgccccac
tgattgcctg 159840attaatgtac caacaatgag gatggagcct ttgccatgca ttttaacatt
gcaaattagc 159900aggaatccaa gtctctgtgg aggggccctg cacctcttct gccagactca
tcaagcgcct 159960cttgggcagg gctgccttct acttgagggg gcggaaggga gaagacccag
ttccactctc 160020cttcccctcc aggaggtgcc cttcatcgtg ttctgcttcg ttactctcaa
gcctccggcc 160080tcccacgcac gtgagctccc aaggggctct acagcctccg tcattccttc
ttccattcat 160140acttgccccc tagtctagga gagccatgga agacagtgtg ggaagggctt
gacaatgagc 160200atcatgcccc atttgcatat gcggtggcaa taccctggtg ggtaccagga
gagtataggg 160260gaaattaaga gaggggccta aggaaagcct ctgctatccc tgggctacca
gtcagcattg 160320cttggtcact gatcccctct gtaacaccag cccttctgca acctgccaga
gtttttgacc 160380tttgaactag ggctgagaag ggtctgctct gttcagctgc cttggctggg
aggggaatct 160440gctcagacct cagcacacac tcaacagaag gcatgcaagc aagggagcta
gcagtggcct 160500tgggtcagct ggcaagcccc aaactcttcc tgccaagctg agcatgaaaa
gccacctcac 160560catggtccca tgggaccaga cctggtagga taggtggcaa ggctaaggca
gcggaatagc 160620atgtgcaaag gcactggggt gggaaagggc ctgtgcttct caccccctct
aatggtgcag 160680agcctccaag gaatactgta acctcagctc agctgggctc gggtggccag
agagcttggc 160740accagaacca gcatcaacag ggcctgtctg ctaaacccag acctcacaag
ccagtttagt 160800aggggccctg tagcaccctg gccaccagaa ctaacgagga agatctgacg
ctgggaatat 160860gtctttaatg aaaagccctt ccggaagcca catttgcaca gaagaaaatg
aggtgcccag 160920agcatcagtg ggctggttgc agctggagaa cacagcaggg ggacaggtcc
taccaagcta 160980ccctgccttc aggctggggc tctagccagc tccctgatgc ctggagtagg
taaagcagcc 161040tcgaaatggg ctgggtcagc tttttcaggc tccaaagggt caggacagct
gctgcagctt 161100agcacccaag ggggctgccc ctctacccct aagtagaggc atccccatgg
cccctgggca 161160ggtcagtggg tctctctgaa gctttgtagg ctgcttcttg gccatgtagc
caatctctct 161220ggccttcagt cctccctccc tgcccccagc ctggccagct gctcttctct
gagcaatcga 161280tgttaaccga atgctctctt gctgtgggga tggcggcctc aggccaggcc
agctgcactc 161340ctggggctgc tggcgcctca ggccacttgg cacttgtgcc acttgtgttc
taaacacagg 161400ctccttcctg gctcggccct gaagacaaga aagctggcca gggaacagct
gggctcccat 161460ctcagcctcc actgctgtgc agagcggccg gcagcctcct atccatggct
gtgagtagaa 161520cagggctgtg gagccagagg cctaagttga atcctggctg ctccttttaa
tgcttgcagg 161580agcctcgttt tcctcacctg caaaatgggg cagtcattgg aagctcagcc
agtcctccag 161640cccacagata ctggtaccca cctgcccttc ccacatctcc atctgtctaa
ctgaaaccat 161700ccaaaccaag ctcttctctt cctctccggc ctcctccgtt cacaacttct
ccatcttggg 161760taaagatggt ccctttacat cagctgcctg ggaggaacac ctcagaatca
ccgtggctca 161820ctctggggaa attctgctgg ctagatttta gaatgtatcc agtatctatc
cacttatcat 161880actcgctatt gctaccattc acccagtagc ctcctggatg ccctcccccc
ccccccgcaa 161940gccctgcctc ctcaccccta cacctccttc aacaggaact agggtagtcc
agggaaagtg 162000agtcaggaag ggctgctcct tagtctgcat cctccagagt gcccatctaa
ctagaagctg 162060ccccaggtct ttctcccagc ccagaggccc tgcctctccc tggctacaca
gaccttgctg 162120ccgtccctcc cacatgccag ccctcagcct ccctcctgcc tttgtccatg
ctgttccatc 162180tacctggacc ggttcccagt gtgtctgcag ggctgcttcc cagagaagcc
actcttgagc 162240agcgatttgc aaggagcttc tttcctggga cattttactc tacagcacgc
gacctcttgg 162300acagcacgac ataagtcact tgtttccttt atgtccgctg ccactttgtt
ctgatgagtg 162360tgctccctgc acacagtagg tgctcattaa cagttctggg ggagggaatt
accttctcaa 162420gtctctggag aattgaatga tgacactcag gaagccctag gctcaagcct
ggggcctgga 162480tcatagtagg tgctcgataa atgttggttg taattagtcc tgggagactc
agagccttca 162540ggagaacaga cacctgaact tggctcacat caagactcct taggcccatc
aaggaagtga 162600ccgtttgttg gatgagcact ctgagaagga cccaatacca gtcctttcct
gggcaagggg 162660aaatagactg ggactgggga gtttcccaag gtatgggatt ttcagcacta
aactggaata 162720atgctaaaga aaaaaaaaag ttagtcactc taaaaggggc caggaccaaa
acctttcaaa 162780cagaaatgtc tgggtttatg aagagaggaa gccagatatg gtggggcaca
tctttaatcc 162840aggtgcttgg gaggcaaaga cagatggagc tctgtgagtt tgaggccagc
ctattctaca 162900aagtgagttc taggacagcc aaggctacat agagaaaccc acttgacttg
ccacccaaat 162960taaaaatctt agtgggggag cagtcaagga ggaaacacac acacacacac
acacacacac 163020acacacacac acgttagtat aatatcatac tatggctctg tgcctgcagt
ccaggaatga 163080gggctgaact cagagtgtta gtgtgtgcta gtggatattt gagctctgta
tttatgtgca 163140tgtctgtgta gatgtgtacc tgaggtgttt atgtgtacac aggtgttggc
ctgttgcata 163200tggatggaga catggttgtg tttgctaggc atttgcatgt gtctgagttc
atgcacataa 163260actcacatct acctctggag actgagagtg acaacccagg gcccttttat
cctgcagcac 163320cccaggccca gcaccccgac ccagcatccc aggcccagca ctccagaccc
agcaccccag 163380gcccagcatc ccacgcccag catcccaggc ccagcacccc agacccagca
tcccaggccc 163440agcaccccag acccagcacc ccaggcccag catcccaggc ccagcacccc
gacccagcat 163500cccaggccca gcaccccagg tccaagcact caatgcccag caccccgacc
cagcatccca 163560ggcccagcac cccaggccca gcatccaagg cccagcatcc cagggacagc
accccaggcc 163620cagcatccca ggcccagcat cccagggaca gcaccccagg cccagcatcc
caggtccagc 163680atcccaggga cagcacccca ggcccagtat cccagggaca gcaccccagg
cccagtatcc 163740cagggacagc accccaggcc cagtatccca gggacagcac cccaggcaca
gtatcccaca 163800tggaggcagc acatactgaa gatagggaat gtctctgagg cctcttatct
tggtccttac 163860cctcattgct ttcagcacct gctctcctca cactcggaat caaacaccct
gtgcaggttc 163920tcccagtacc aggattcccc tcagctgagg aatgggtagc taccattttg
gcttttgtct 163980gtctggggtt ggcagcccca tgctaattgg actgacagtt tctcctgaga
gcaatttggg 164040cagcacatcc tgcccattag gcctaacctt gcctgcaggg gtgtgctgta
ggggcaggga 164100tggagcctac cctgtatagc tctgtattga ggcactcccc caagctatga
cccatgccag 164160tgggagtcat ttcacctagg caactccaga tgggcacaaa aatctctcca
ataagggtag 164220gtatgggaat aggtaaggag agcatagtga gcctggctgg gcacctgaga
cctgagcagc 164280ctgcacggga gattgtgtca ctgtggttcc agactgccaa gacatcttgg
ctttcacccc 164340aactcaggat ggtccagaat ccagagctct taagagagca gatgctgaga
ggcacttaac 164400ccagggctaa gacccttcct tggacggttt cttggctttc tactctgtcc
tctgtcccag 164460tctgtcatcc ccatctgtgc ctaacagctc tctgtggaaa acatgaggcg
tatgagctct 164520ctacttctcc cagcatccca tgcccgcacc ccagctcact gtgtgccctc
atgttactca 164580aatcttctgc taggtttgag ggccccaggt tgaggctgtg ggtttccctc
catctgtccc 164640tcccttttac caccaccact aatcctcttc ctcctcttcc tcctcttcct
cctcctcctc 164700tttctncncn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 164760nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna
caaaggacgc 164820aaattgaact tacagaggaa acacaaatga gctgtaatga cagaaaagga
atactatttc 164880ccttgtgatc agaaagctat agattaaacc ccagcccttt ttgcctttgt
ttttcatatt 164940tagtggttct ggggactgaa cccaaggaag gacagctgct tggaatgttg
gagacctctc 165000aggcattgct tatgctaagc tgacaaaggc tttctgaatg catatggtag
tatctattca 165060tcttaagatg cctataacca tgggtggagc aattctctaa gagtctgtct
gaacttctta 165120agcatgtgct caaaaacaca cataaagatt gtttactgaa gcagtatttg
caacagtgaa 165180acaaaaccaa acaacaacaa aaagtactta atagctcatc agtaggaaaa
agaacaacaa 165240attgggacac atttatacta ttctcgttct atcacacgac tattaagaag
attaaggtgt 165300gaggatgaaa agcccctgca gaacaacgca cacaatgaaa ttgaatagag
aataaagaaa 165360aaaatatata gtgtatttat gtaaaacata cataaacaca taaaaagaat
agaaaactct 165420ttgactgtat atctttgaaa agaaccagca gagtgctatg gtttataaat
gaaaagttcc 165480cccaaaattc ctgagttgga ggttcaatgc agagttcata ctcttaggaa
gtaactaaat 165540caggaatgct ctgatctaat caatggattg accaactgat agattaataa
tctgaaggca 165600acacttccag gaggcagaaa caaggtgagg cctaactgga ggaagtttgt
caccagaggc 165660atgtccttgg agaggctacc tcatccttga tccttccctt tctccgcttc
ctggctgtaa 165720aagtatttat ttgactgatg taactgtcat cttggaggct tactggctcc
atcagctaac 165780ctaggcctag ccctggaagc ttctagcttc catacaatct aatccaagcc
tagaatgttc 165840cagcctttag gacttgctgc tgagatcacc gtttcctgtt ctttctgaac
tctagctggc 165900tgattcagtc cccctgttcc gggctcaaac tcctctcccc gatgatttta
ttcacaatct 165960gtcttttctc ttggcctctg aattgctctg cttggtctca aactaactct
agcaatcttt 166020tctaatctct tgtctccttc acactctctt gcttgttctg tctttactgt
gtctagtttg 166080ttctctcttc catccttctc tgtaaagctc tcccggtaaa cctgcctcct
cctccccctc 166140tgtgccgctc tactctccct ctcagctcta ctgcactgct ctccacagct
ctcctgtatc 166200ctgtgctgca ctctcttctc cggtaccacc tgtgtctccc ttacgtagct
tccctttcct 166260ctctcttctc ctgagggttg ggcagatcct atcctgtcaa acctttctct
gattcttcac 166320tttgtctgcc actcaattag acatcacttt caagcaggag tgcctcctct
acaaaccaac 166380tttaccttca ttgtttcaaa ttaaaggtga gtactaaggg tgtgtctctt
tttcagccag 166440tgagagtaaa gatgtgtgct aataaggctg agccaactct agctagaaat
agtttctttt 166500tctccataaa taacagaatc ttagggttca caatacgatc aaatatcctg
agacagctgg 166560ctgccatgtg gtgaagagaa tgcagagagt taaggatgtg ctcttgaggt
ttcagatggg 166620aataagcatt ttcttgggag ttggattaga gtcattcctg tgacactgtg
acaaaggact 166680cgactacatt ttgccatgcc ttgagactgt ggagggctga gcgatggatt
aatacactgg 166740agtgaatttt aaggcagcca aggctgtggt tggactgtta ctagctacgt
ttagctggat 166800ttatattaag aattgggaac aaaaaagcag agtagaaagg acagttttgg
cagaaggagg 166860tttcttcata tacaacttag tttttatagt tagcttttta tgttgctatg
accaaaatac 166920cttatggaaa ctgaagaata aaaattttta ctgaactctt cttcacccca
gaacccgacc 166980cctcccatct agagattgtt cccggaacac tcctgaactc ttcaccccag
aatgctttcc 167040tgaactcctc accctagagt tcgaaccctc ccaactaaaa actgttccaa
gaacattttt 167100gagataaggg cctcctaaaa caacctcaaa atgaaccggg tacattgcca
aataatagga 167160catgacccct tagttacgta gattcccttg gcagaacccc ttgtcccttg
acagaacccc 167220ctagtgatgt aaacttgtac tttccctgcc cagctctccc cccttgagtt
ttactatata 167280agcctatgaa aaatttggct ggtcgtcgat tctcctctac accactaggt
gcatgagttt 167340cgaccccaga gctctggtct atgttccatg tgctttcttg ctgttgttct
attaaatctt 167400gccttctaca ttttgagtac ggtctcagtg tcttcttggg tccgcggctg
tcccggggct 167460tgagtgcttg agtgagggtc tcccttcggg ggtctttcat tttggtgcat
tggccgggaa 167520acagcgcgac cacccagagg tcctagaccc acttagaggt aaggttcttt
gttctgtttt 167580ggtctgatgt ttgtgttctg tttctaagtt tggtgcgatc gcagtttcgg
ttttgcggat 167640gctcagtgag accgcgctcc gagagggaac gcggggtgga taaggataga
cgtgtccagg 167700tgtccaccgt ccgttcaccc tgggagacgt cccaggaaaa acaggggagg
accagggacg 167760cctggtggac ccctttggag gccaagagac catttggggt tgcgagatcg
tgggtttgag 167820tcccacctcg tgcccagttg cgagatcgtg ggttcgagtc ccacctcgcg
ttttgttgcg 167880agaccgtggg ttcaagtccc acctcgcgtt tggtcacgag atcgtgggtt
cgagtcccac 167940ctcgtgcaga gggtctcaat cggccggcct tagaaaggcc atctgattct
ttgagttgct 168000tgtggtcgac gcagagtcgc cgccgtttct ggtttctttt ttgtcttagt
ctcgtgtccg 168060ctcttgttgt gtctactgtt tttctagaaa tgggacaatc tgtgtccact
cccctttctc 168120tgactctgga gcattggaag gaggtgcggg tcagagccca caaccagtcg
gtggaagtca 168180gaaagggtcc gtggcagacc ttttgcgcct ccgagtggcc aacgtttaga
gtaggctggc 168240cacctgaggg tgcttttgac ttgtcactaa tcgctgccgt caggcgaatt
gtttttcagg 168300aggaaggggg tcaccctgat cagatcccct acattgtgac ctggcagaat
ctcgtccaat 168360tcccacctcc gtgggtcaag ccttggaccc caaactcttc gaaactgacg
gtcgcggttg 168420cccagtctga tgcagccgga aagtctggcc catcagcacc ccccaagatc
tatccagaga 168480ttgacgacct cctctggata gactcccaac ctccccctta ccccctgccc
caacagccac 168540ctgcagctgc cccaccacag ggaccaatag cgagaggggc tcagggaccg
gcgggggaga 168600ctcggagtcg ccgaggccga agccccgggg aggaaggggg gccagactca
acagttgcct 168660tgccactcag agcacatgtg agagggccag caccaggacc taatgatctc
attcctttac 168720agtactggcc tttttcctct tctgatttat ataattgaaa aactaaccac
cctcccttct 168780cagagaaccc ctctggactt actgggctcc ttgagtcact tatgttctcc
catcaaccca 168840cttgggatga ttgtcagcag cttttgcagg ttctttttac aacagaagaa
agaaaaagaa 168900tcctcataga ggcgagaaaa aatgttctgg gagaggacgg cacacccact
gccctcccta 168960acctcgtgga cgaggctttc cccttgaacc gccccaactg ggactacaac
accgcggaag 169020gtaggggacg cctccttgtc tatcgccgga ctctagtggc aggtctcaga
ggagccgcta 169080gacggcccac caatttggct aaggtaagag aggtcttgca ggggcagact
gaaccaccct 169140cagtcttcct tgagcgtcta atggaggcat ataggagata cacccctttt
gaccccttgt 169200cagaggggca gagagccgct gtagccatgg ccttcattgg tcagtccgct
cccgacatta 169260agaaaaagct gcaaaggctg gaggggctcc aagatcatac gctccaagat
ttagtaaaag 169320aagcagaaaa agtctatcat aagagggaaa cagaagaaga gaggcaggag
agagagaaga 169380aagaaataga ggagagggaa aatagacggg atcgccgtca ggagagaaat
ctgagtaaaa 169440ttttggccgc agttgtgaat gatagacagt caggaaaagg taaaataggg
ctcctgggca 169500acagggcagt gaaaccgcaa ggtggcaaaa agataccact ggaaaaagac
caatgcgcct 169560attgcaaaga gaaaggacac tgggctagag attgccctaa aaagcgggag
cgatccaagg 169620tcctaaccct agaagatgat tagggaagtc ggggctcaga ccccctccct
gagcctaggg 169680taactttgtc cgtggagggg actcccgtca acttcctgat agacaccgga
gcagaacatt 169740cagtactcac taacccccta ggcaagctag gctccaaaaa gaccatggta
attggagcca 169800ctggtagtaa attttacccc tggacgacca aacgagctct tcagatagac
aaaaatatag 169860tgacccactc ctttctggtg atacctgagt gccctgctcc cctcttgggg
cgcgatctgc 169920taaccaaact aaaggctcaa gtccaattta cttcagaagg cccacaagta
agctggggaa 169980aggcccctgt tgcctgcctt gtcctcaaca cagaaaaaga gtaccggttg
catgaagaac 170040aacccaaaaa tgcagtctct tcaggttggc taactgcgtt ccccaatgtc
tgggcagaac 170100aagcaggaat ggggttggct aaacaagtgc ctccggttgt ggtagaactt
aaagctgatg 170160ccacccccat ttcggtaaaa caatacccca tgagcaagga agctagaaaa
ggcatccggc 170220ctcatatcca gaggttgctg ggccaaggag ttttagtggc ctgtcagtcc
ccctggaata 170280caccacttct gccggttcaa aaaccaggga ccaatgacta tcgcccggta
caagacctcc 170340gggaggttaa caaaagggtc ctggacattc accccacagt cccgaacccg
tacaatttat 170400taagctctct cccacctgag agaacatggt atacagtcct agacttaaaa
gatgccttct 170460tttgcctgcg tttgcaccct aagagtcagc tcctgtttgc ttttaaatgg
agggacccag 170520agggcggaca gactggtcaa ctaacttgga ctaggctacc acaggggttc
aaaaattccc 170580ccaccctgtt tgacgaggcc ctccatcggg atcttgcgcc ttttcgcgct
cgaaaccctc 170640agcttaccct actacagtat gtagatgatc tcttggtcgc ggcggcctcg
aaggagctgt 170700gtcaccaggg aactgagagg ctcctcacag aactgagtga cttggggtat
cgagtttcgg 170760ctaaaaaggc acaaatctgt caaactgagg taaccttcct ggggtatacc
ctccgagggg 170820gcaaaagatg gctcacagag gcccggaaaa agactgttat gatgatccca
tcgccaacta 170880ccccacggca ggtacgtgag tttctgggga ctgctggctt ttgtagactc
tggattccag 170940gctttgcaac cctagcagca cctctatatc ctttgactaa ggaaggggtt
cctttcaagt 171000ggaaagaaga acaccaaaga gcttttgagg ctatcaagtc gtctctaatg
actgccccca 171060cgctagcatt accagacttg actaagcctt tcgtcctata tgtggacgag
agagcgggtg 171120tagccagggg agtattgaca caagcactgg gaccctgaaa aagacctgta
gcctatttgt 171180caaaaaaatt agatcctgtt gctagtggat ggcccacatg tctgaaagct
attgcagcag 171240tagccctgct gatcaaagat gctgacaaac tgacaatggg acagcaggtg
accgttgtag 171300cccctcatgc cttagaaagt atcgtgcgac agccacctga cagataagat
gacaaatgcc 171360cgaatgacac actatcagag cctgctgcta aatgagcgtg taacctttgc
gccccctgcc 171420atcctcaacc cagctaccct tctccctcta acaaatgatt ccgtcccagt
acatcaatgt 171480atggacatcc tcgctgaaga aactgggacc agaagtgacc tgactgacca
accctggcct 171540agagctccca gttggtacac ggacggcagc agtttcctga tagaggggaa
gcaaaaggct 171600ggagctgcgg tggtagacgg gaaaaaggta atttgggcaa gcgctttgcc
tgaaggaaca 171660tcggcacaaa aggctgaact tatagcgctt atacaagccc tccgagaggc
taaaggtaag 171720atcgttaata tctacactga cagccgatat gcttttgcta ccgcacacat
ccatggggcc 171780atctacaggc agcgagggct attgacctcg gctggtaaag acattaaaaa
caaagaaaaa 171840attctggccc tgttagaagc catacatgca cctaaaaagg tagccatcat
ccactgcccc 171900ggccacccaa aaaggagaaa acttggtggc caagggcaac cgaatggcag
acttagtggc 171960aaaacaagtt gctcaagggg ccatgatctt aactgaaaaa ggtgatccgc
ccaaaagccc 172020tgaggatggg aggtataaca taaaagagct atggtagacc agtgatcccc
tcccatactt 172080tttttgaaag aaaaatagaa ttaactcccg aagaaggaat aaaatttgta
aaaggactac 172140accaattcac ccacctggga gttgaaaaaa tgatgagact aattaaaaat
tcccgatacc 172200aagtccccaa cctgaagtca gtggctcaaa agattataga ctcctgcaaa
ccatgtgcat 172260tcactaatgc aactaaagcc tacagagaac ctggaaagag acaacgggga
gaccatcctg 172320gagtgtattg ggaggtagac tttactgaag ttaaacctga aatgtatggt
aacaagtatc 172380tgttagtatt tgtagacacc ttttcaggat gggttgaggc atttcccact
aaaacagaga 172440ctgcccagat tgtggccaag aagatccttg aagaaatcct gccaagattt
gaaatcccta 172500aggtaatcgg gtccgacaat ggaccagcct ttgttgccca ggtaagtcag
ggcttggcca 172560ctcagttggg catcgattgg aaattacact gtgcttaccg ccctcaaagc
tcaggacagg 172620tagagaagat aaataggacc ttaaaagaga ccttgactaa attagccatt
gagaccggca 172680gaaaagactg ggtggctctc cttcctcttg cgctcaaaca cccctggtcg
tttcgggctc 172740actccttttg aagttctgta tggaggacct ccccccttaa tggaagctgg
tggaacatta 172800gtttccgact ctgaccctgt cttaccctcc tctttgctta ttcatttaaa
ggccctaaaa 172860gtgattagga cccagatttg ggaccaactg aaagcagcct ataccccagg
gaccaccgca 172920gtaccccacg ggttccgagt tggagacaaa gtcttggtca gacggcatcg
aaccggtagc 172980cttgagccac ggtggaaggg accctatttg gtgttactga caacccctac
tgcggtaaaa 173040gttgacggaa tcgcctcctg gatccacgcc tcccacgtca agagggccgc
cagtcaagat 173100gaagaaaacc acgacgacaa ttggacagtg gcagtcactg acaatcctct
taagcttcgt 173160ctgcgccgca ggcgccactc tagacctagg gaaccttaac cctcatgctc
caattcaaca 173220gtcctgggag gtgcttaatg aaaaggaaaa cattgtatgg gcaaccactg
cagtccatcc 173280cctctggatt tggtggcctg atctcacgcc tgacatctgt aagttagcgg
caggatcccc 173340caattgggac ctctcagatc atactgatct tagcaaccca ccccctgagg
agcggtgtgt 173400cccaaatggg atagggagca catatgggtg ttcggggcag ttctaccgag
ctaatcttag 173460agctgcacat ttttatgttt gccctggtca gggtcagagc aaaaggcttc
aacaaaaatg 173520cgggggggca tcagattact tttgtggtaa atggacatgt gaaacgacag
gagatgctta 173580ctggaagccc tcctctaaat gggacctaat cacggtaaaa cgaggtagtg
gctatgataa 173640gtcaaacgaa ggagaaagaa acccctataa atatcaagag agtgggtgcg
cttttaaaaa 173700cagagcaccc tcaggaccat gcaaagataa atactgtaac cccctacgta
taaggttcac 173760cgagaacgga aaacaacacc gtctaagttg gcttaaagga aataggtggg
gttggcgagt 173820atacattcca ctaagagatc ctgggttcat tttcacgatc agattgacag
tgagagaccc 173880ggcagtgaca ctcgtagggc ccaacaaggt ccttataaaa caggggcccc
ccagtcgtac 173940tggctccccc aaaggtcccg actgtaccag ctccaccaac tccacagccc
aacacagtgg 174000taccctccct aggaactaat actctcctca taaagcctac cttggcttcc
ccaccgcccc 174060taggaacaga ggaccgtctg gtcagtctag tccaaggagc ttttttagtt
ctaaatagaa 174120ctaaccctaa tatgactcaa tcatgctggt tatgctatgc ctctagcccc
ccttattata 174180aaggaatagc tcagatcagg acttataata ctacttcaga tcattctcaa
tgcctttggg 174240gaaaaaacag aaagttgact ctagcagcag tttcaggaag agggctttgt
ctgggccggg 174300tacctcagga taaagggcac ctctgtaatc agacccagaa catccagtct
agcaaaagcg 174360gtcagtatct ggtgcctccc ctagacacag tgtgggcttg caataccggt
ctcactcctt 174420gtgtgtctat gtctgttttt aatagttcca aagatttctg cattttggtt
cagcttattc 174480ccagactctt gtatcatgat aatagttctt ttttagataa atttgaacat
cgggtccgct 174540gaaaaagaga acccgttacc ttaactttgg cagttctatt aggattggga
gtagcagctg 174600gagtaggtac aggaaccgct gccttaatta agaccccccc aatactatga
agaactacgt 174660gcagttatgg atattgatct tagaactata gaacagtcta taaccaaatt
agaagaatct 174720ttaacttccc tgtccgaagt ggtgctgcaa aatagaaggg aattagactt
attattcctt 174780aaaaaaagag gactctgtgc tgccttaaaa gaagaatgtt gtttttatgt
tgaccattca 174840ggagtaatca aagattctat ggctaaactt agagaacgcc tagatatacg
taaaagagaa 174900agaaaaagcc aacaaagatg gtttgaaagc tggtttaata agtccccttg
gctcaccact 174960ctcctctcca ctatagcagg acctttaatt acacttatgc ttttgcttac
ttttgggccc 175020tgcatcctta ataagttagt agcttttatt agaaaaagga taaacgcagt
ccaggttatg 175080gtactaaggc aacaatatcg ggtccttcag gaggttgaaa actcgctcta
agattagagc 175140tatctcctaa aagaagtggg gaatgaagaa taaaaatttt tactgaactc
ttcttcaccc 175200cagaacccga cccctcccat ctagagattg ttcccggaac actcctgaac
tcttcacccc 175260agaatgcatt cctgaactcc tcaccctaga gttcgaaccc tcccaactaa
aaactgttcc 175320tagaacattt ttgagataag ggcctcctaa aacaaccgca aaatgaaccg
ggtacattgc 175380caaataatag gacatgaccc cttagttacg tagattccct tggcagaacc
ccttgtcccc 175440tgacagaacc ccctagtgat gtaaacttgt actttccctg cccagctctc
cccccttgag 175500ttttactata taagcctgta aaaaatttgg ctggtcgtcg attctcctct
acaccactag 175560gtgcatgagt ttcgacccca gagctctggt ctatgttcca tgtgctttct
tgctgttgtt 175620ctattaaatc ttgccttcta cattttgagt acggtctcag tgtcttcttg
ggtccgcggc 175680tgtcccgggg cttgagtgct tgagtgaggg tctcccttcg ggggtctttc
aaaactactt 175740cagaggaaaa atgtattctg cctcatgggt tcagggggtt tccctcagca
aattcaggga 175800agacaagatg gaacagctca acctgctggc aggagggtgt gggaaaggac
aagtgttcat 175860tgtgtggtgg acaggaaaca gagagctgcc tacagtctta caggcctacc
accactgacc 175920tacctctgtc cgtcaggccc tacatcttaa aggatctaca gtttattaaa
agaacactac 175980cagataggaa ccaagtatca aaccaccagt ttgtagggga taaaaataca
aggaacacat 176040ctcaatagga gtgtgttcca ggatgtggac aaggagaaca cagttgttta
aaagcttaac 176100gctggccagg agagctgcac acctttaatt ccatcactcg taagagggaa
gcaggttcat 176160ctctgtgagt tcaaggcaag cctgggctat acaattctag attagccaga
gctacatcgt 176220aggagcctgt ttcaaaacaa acaaaaccaa accataaaaa agcatttctg
aggctttggg 176280tttaatcccc atgacctcaa atagccaaac agctctcctc agtccaaacc
aaactgcaaa 176340attggagcta gtgagatggc tcaacatatg aaagtccttc ccaaaaatat
tgacaactgt 176400agcttatctc tggggacaca cataatggga gaggaccaat ttctacaagt
taccctctga 176460cctccacaca tatgcctccc acaaataaga aaatatatat aataaaaaga
aagaagtcta 176520cagctgcaca tggtcatgca tgcctataat ccagcactcc agaggctgag
gcaggaggat 176580tattagtttg agatcgcata gcaagcagta ggctagacag ggctacatag
tgtaaacctg 176640ccttaaaaca caaaaatcaa ttaagcaaca ataacagtaa caaccacaac
aaaaacccaa 176700aagagtactt tgtagtaagg acaataccaa aaatgttcct ttaaggacag
ttctggaatc 176760agcaatagcc ttccgagtgc tcagggatgt ataaatactt agaaaacttc
ccctggagaa 176820atgagcacca gggtacactg ctctcagagc tgcccagaaa gttgtttatc
ctggattcat 176880ttcagccttc ctaactgctc aggcattcag aggtcacttc tgtagtagcc
aatgtctaaa 176940aaggctaaac tactgctcag catggctgtg gtacttggca ttatcatttt
gtgactggtt 177000ttgtagttat gcagaattca agagttatag catcatgaaa gtttccacca
agttcctgat 177060ccagtcacct cttaaaggtt ggatgcacca agtgcctttg gggtgataaa
ttatattcaa 177120ataatggtat tccaccctaa tccccaaaga cttctggcca tctcataatg
taaaatgctg 177180agccatcgca ccagcccatg gccttgaact cttgatggtc ctgtctcagc
ctgtgtttgg 177240attataaatc tgttggtgag gtattccttt gctgataata caagcaaatt
cttcaagctt 177300ccatcctaga ctgaagacca gcagctctcc aggagtcctc aatgcagact
ggcccagctg 177360ggacattgag cctcatggac tcagccgcta ctagattcgc aacctattca
gacaagccac 177420tgttggacta cccagacaat actatgtaag ccaatcccat tttaatacac
atattcatct 177480gggtgtgtgg cacacacctc tactcccagc acgcaagagg cagaggcagg
cagatctctg 177540atttcgaggc ctggtctata gagtgaattc caggccagcc agggctacac
agagaaaacc 177600tgtttcaaca aaaccaaaac cgtaaattca ttctatcagc tctatttcct
tagagaattc 177660taatacatgt gggtaccagg ggttgaactc aaagtcttca tgtttacgta
gcaagtttcc 177720ttctgctagc ctagtgaagc tgaggcaggt acagccggtt ctttactgct
ccttgcaaat 177780ggtcctcctg agctttcctt tgagagccta caaagaactc tttttttctt
taggtctcca 177840ggttttggtc ttaagaggtt ctggacttgg atctgtagct gtcatatcac
agacattcaa 177900catctggcaa atgtcttgac aaaggagatc acttgtgttt gctgcagtgt
cccttctggc 177960tgtgagattt tgctcctcac cactgcagga ctgcagatct attctgcctt
tttagttgac 178020ttttcattcc tgagaactgg ggaaaactga ctttgtattt gggctttgaa
tttgtccatt 178080tgtcaatcca tcacaccaga cctaaccaac tgccaagagt tctgctgact
tttgttttct 178140ctagggtggt cactttgctg ggctcatcct catccttggc ctgcagttta
tccccaggaa 178200agaaaatggc taacgactgc taagaagcag tctttccttc cagaaatttt
agtctatcta 178260gaccttgctg cagtctgaag tctttaaaat gtgtttgtta tggtagaata
ttttgagttg 178320ccttaggagt attgcttgct gtcacctatc atattctatc aggaagcaga
cgtcccattt 178380accaaatgtg aagaaatatg gcatcaatac ccactgcaaa aagtgtaaat
aaataataaa 178440aaaatagatt tattacagag tgcaagggaa aagaaaaaaa tcagccagtt
tcagaattgt 178500aactggacaa atgttggtac agttcatgaa gaggttctac aaaatggctg
ggggtgggaa 178560cataatgagt tagtttgctt ttttttttct ctttccttcc ctttcctttc
cttacaaggt 178620ctcatgtagt ctatggtctc aaactcacca ctgtaaatca ccttgaactt
ctgatccttc 178680tgcacgctgg caatgtaagc atgtgccacc aggcctggct cacacatttg
gtttttcaat 178740acagaatagc tctgtgatga ttaacttcaa tcatcaactt gacataacca
agaatcgtct 178800gaggaagagt ctcagtgact gggtgggcta agggcatgct cataagggat
tatcctgatt 178860gttaattgac atggaaagat caagtccatt gtgagcagca acacgccctg
aacagaagtc 178920ttctgaagta taagaggaga aagcttgatg agagcaagca ggcaagcaag
ccaggatcca 178980cgtgtttatt ctctgtctgt tcttgaccgt agatgtgatg gctgtcttgg
cttcctggga 179040aacatgaact gcaccctgga attgcaaggc aaacaaacct tttcctcttc
caagttgctt 179100tatgctaaga tattttatcg cagcaataga aatgaaactt agaacaggcc
cataactgcc 179160agctttggaa ctgaacctaa ggctgttata attcactagg atagggacca
ctggaagtga 179220atctgatttt gatggtaaaa tcatgtgttt gtttctggat atgatagatt
tatcaatttg 179280agactcagaa aagaagttag gacttgaatt ccgttttaga gacattccag
agaaaactga 179340tgtcattgtt ctgaatgtaa gtgcctcagc tgaaaataca aagagtacag
ggaagaaagc 179400ccaggctaga atctgaagga actcctctat tttttgtttg cttgtttgtt
tggttggttt 179460tttgagacag ggtttctctg tgtagccctg actgtcctgg aactcacttt
gtagaccagg 179520ctggcctcga actaagaaat ctgcctgcct ctgcttccca agtgctggga
ttaaaggcgt 179580gtgccaccac accaggctag gaactcgtct attacacatt aacacccctc
tttaattaac 179640tgttcctgcc aatgtaccaa atagtcaatt gattcctgtt tatttaccac
atgtttctgt 179700tagtaaacca gaataactta tctagccaaa gtctgcctat tagccatatt
ttcatcagtt 179760cccaaccatt tttggaattc tgtgagggga atccacagat gctgtagacc
gctttagaca 179820tttttcagct tttttcaagt tgcaggtcat gattcagtgg gtcatgaaat
taatttagtg 179880ggttctgatt agcatttcaa aatgaggcaa gcagagggca tattgtcaca
gcacagcaca 179940tgcggtaagc agccacacac tcttgcttgg aggcttagtc agtttctggc
tctaaacgcc 180000ccaggtttgt ttctctatcc taggcctctc tcttaaattc caaacatagt
tagacattac 180060cattggggca cgtgcaactc aaacacggag tgtgactcct ttccccatct
gcggttccca 180120gatttggcaa tgtcaccctc ctcccttctc cctagggtca gttttacctc
tcacactcca 180180caacacaaca cctctcatct caagaattgc cattagggct ggtgagatgg
ctcagaggtt 180240aagagcaccg actgctcttc tgaaggttct gagttcaaat cccagcaacc
acatggtggc 180300tcacaaccat ctgtaatggg atctgattac ctcttctggt gtgtctgaag
acagctacag 180360tgtactcaca tatattaaat aaataaatct aaaaaaaaaa aaaaaaagaa
ttgccattaa 180420atgtacctca gagtccaaat gcttcttcct cccctgacta cactcacgct
ggcctgagtc 180480cattttctta ttgaggttac tgcttctctg cttctaccct ggctccttct
gctgcctatc 180540cttgacacag cagacaagca gttctttaaa gcagggctca ggaccagtga
gactgatcgg 180600ctctggtggc acttcctgcc atgactgatg atctaaggtt aagcctagaa
cccacgaggt 180660agaagcaaag gacctactct ccaaagccgt cctctgacca ccatgtgtaa
actgcacatg 180720tacatgcatg cacatggtac acacacatac acagaagtaa aaagagattt
aaattgaaaa 180780tcattaaaaa gaaaaatcag ggctcagcaa actttccgtg tagaaaacta
gagtacttag 180840gctttgaaag ccaagaagtg gatattaatt atagttattc attatagcag
agatttctaa 180900aaccttttga caaaactaaa aaatataaca gagtgtattt tttttgtaat
gtaagtttac 180960taatggcagc agtgggatta gtttcttttt tagattattg ttattatttt
tattaattat 181020tagtgttttt gtgtttattc atattccaca gcatgtgtgt ggaattggat
ttctgcttcc 181080acctttgtgt gggtcctaga gattgaactc aagtcatcaa gcttgcacag
taggtggtca 181140ggcttacaca gtaggtggtc aggcttgtat ctttggaagg caagcatttt
acttcctgtg 181200ccagctcact ggccttcttt gtttaaaaaa agaaaaaaaa agtccttttt
tgtttaatta 181260gggttcatgg ccagtgctct ttatcttaaa atcaactgca aacttttatc
tggtaaaaag 181320ccatccttag ctgtggtcct aggagaaaaa catacagttg gatggcttta
tcctgcaggc 181380ttagtttgat catctctctt tgaagatata atcagctcac atcacactca
agcctctgcc 181440aacgagtttt ctacttctgt tcaacaaact acccaagctg agcagctcca
aacaacagcc 181500agttatgatc ctcacagtcc ggtgggtcag aagcctaagc gggcgtggct
acctcgctgc 181560tattgcctga ccctgctcgg tgcatccaca ttcacatcct ttcctggtga
gtgtggttct 181620ttgactggtt ttgttccaat ttttagtata tgtgctgctg aaacaatctt
tttgcctctg 181680cctccagact gcagggatta atgttcttga ctgccacaga gcactaatat
ttactgaaca 181740tgtgatcatg tggtgctcag cactcttgca cccaaggctc ggggaacatg
gaggaagagg 181800gggtggaaag attccaagaa ccagaggaag aagaaagtca gaggtgagac
tgcatctcct 181860agaaatgtca gggacatttc tagacctctg aagtctcaag aacaaggcct
gaaagtctta 181920tttatatagg ttaacctgaa aggggaaaaa attcttacag gggtccaacg
ttagacaaag 181980aactctaagc aactaaggaa tgttgggggg ggggtagtct tccccaggga
acactcctct 182040acccttcaag ccccacccaa gctggttatc caaaacaaac tggtcagtcc
tgaagccata 182100tacgcacaag taacatcata tggatgggca gattgcattt aggaatacac
acatacacac 182160acacaactta aaaagagagg ccatgaattt aagagagagc aaagcaaagt
gggaaggggt 182220acatgggaag gttggaggca gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 182280nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 182340nagagagaga gcctattatg tcgttggttg cttctaatca ttagaaaacc
actctcttag 182400gctgagtcag aactagccta gctgagacac tgtccaccca ctgtcccaga
gcaaggccat 182460cgctgtccca gatttgcctt tgggggccct tgaaatgaaa gtcaccagca
ggctccggaa 182520gctgcctcat gctaatcagt tgaggttgtg aaaatagccc tgcagtggtt
cctgggcctg 182580cagctgggcc agagccacta aggggagtct ggtcctttgg agcagagtta
acagtcatca 182640gtgctttttt ttttttttta aatgttccct gctttaggct cagtgctgtg
cgctacttct 182700aatccttgca ataggctgca agacaggcaa gaatatcatc cctgttttgc
cctcaggcaa 182760attctgaagt ctggcaaatg aaagatgtgg gatttgaaca cagacttgtt
tggccaaaga 182820attctcactc tacttctgcc tgtgccacct tcctctcatg cacggggagg
ggaggggagc 182880ccacctccca tgctcagggg ctaggaagtg gggagaagat ggatgtcctc
aaagcagggt 182940gagaatgaag tagaagccag cttcaaatct aaactaagca atgttttatt
tccatttccc 183000tgaacataaa gttcagttac atttggttta aaaaaaaaaa tccctacaca
actggttctt 183060gagaaatgtc aagtgctaca attcagtgga tgtggatgaa acaatcaaaa
tgttgaacac 183120ccccaaacag atacaaacct tcatcaaagt ctcttccaaa ggctgggtct
gaaaagagcg 183180actcatgttc cagcccagtt ggctccttct catgtgagct ccgacttcca
aagactgctt 183240gcaccaggag gaaatataat agatgtcctt tttaaggggg gtggggtctg
tctgacaacc 183300tcccacagtg actgtggata cagcccagtt agtagagttc tcgcctagca
agcgtgcggc 183360cctgggcttg agatctagca ccctaaggca tagtggtaca tgcccatgac
cacagcactt 183420gggacgtaga ggcagaagga tcagttcaag gtcagatgag gggtggggag
gcattccttt 183480aatgccagca tttgagaggc agaaacagat gaatttgtga gttcaaggcc
agcctggtct 183540acagactgag ttccaagaca gccaaggcta cacagagaaa ccctgtcttg
tcaggaaaaa 183600agatagtggg agagaattca aggttatctt ggactgcata agactttgat
tccaaaataa 183660acaaaaatgg agcatgaatg cttgcaactg tggacaatat tgggttcata
catattctgt 183720tttgtcacct acataccaat tatacaaatc agattcagct gggcccactg
gtgcatgttt 183780gttcccagca tctgggaggt agagatgggc agagctctat gactttaagg
ctagcctggt 183840ctacaaagta agttctagga cagccaaccc tacagagaga tacactactt
ctaaatcaat 183900caatcaatcc atcagtcaat catgggctgg agagatagct cagtgatcaa
tagtgccatt 183960cttccagagg acctgggttt gattcccagc acccacatgg cagctcacag
atgtctgtaa 184020ctccaatccc aagggacatg acaacttcta ctggtctctt tggtcaacag
gcatgcacgc 184080agcatacaat atatatatgg gtaaaatgct atatatataa aaatcagatt
cacaaatcaa 184140gtacagaaag agatcaacta taatgaaaca accataacac atactgtttt
aaaagctacc 184200tttcctgtct gacatctgac ttcttgcgtg acccagtgcc tccacaatga
gacaagcaga 184260tggtgtagtc cctgtgaccc aggattaggc aaccactgcc cctgggaagc
cattcctagt 184320aacaggatca aatcccacag tgctccactg taccccgcca gaggacgtga
agcctccctt 184380ccccggctgt ctgtgcagca ccgcccagat gcttggtgtc actgagtgac
catcggagcc 184440caactgagga gtgcctcagt gctcctcagg gcagtgtgca attgaaactt
gcatgtgatt 184500tctggaattt gtcatttgat atttccagac tccagctgac cttggataac
agcacagagg 184560gcagaactgc agaggaaaag gggcactact gtgactgtta tcccctgcat
gattatagaa 184620gggtctgtgc tttctcttac aaatcattgc ggcctctctt cacttccctc
ctttgaagtc 184680gaaaaaaata gatgcatcct cacagtacag gatggcaggt acgaggcggg
ttccgggact 184740gaggcaggac tcatcatgca gcctctttcc ctcaactaca ccgccgcccc
tgcagagttc 184800cctctgatca aatcagtttc aggcctggaa agaaacggcc actcaggctg
gggatgtggt 184860tccattagag gcctgcatgc acgaagctct ggatttgatc ttcagcacgg
gcataagccc 184920agtatggtag tatctgttta gcatggtgtg gaggtatacc tatctctgca
cagggagacc 184980agaagttcag agttatcctt gcacttacag taagttcaaa gctagcttgg
gcaacatgaa 185040gttttgtctt aacaaaacga aacaagaggg gccggggaga tgcttcgctg
tgctgagtca 185100tttgctgcca agtttgatga cctgagtttg gtcccttgag cttatggtgg
aaggagagaa 185160atgacgcttg aactccacgc acatgccaca gcccacgcat gaatgtgtat
acacacacac 185220actaagtgga taaatgttaa aaataaataa atcaaaggaa tggccactca
aaatctacca 185280tcgttgggaa gggaggggaa aaggcaggcg agggagatag ataaccctga
tatgaacacg 185340gaaagagcca gtgtgccacc aaagctgccc agtgtgccac caaagctgcc
cagtgtgcca 185400ccaaagctgc ccagtgtgcc accaaagctg cccagacttg attacagatt
tggccaggga 185460cacaggaggc cagcaggagc agccaggttc cacctcagag gtggagccac
aaacctggaa 185520atgaaacgtc tttccctttc ttcagaccac agcagtgaca gctgtcctgc
agagtctgga 185580gggctggcag ggctcatcca ctctagtgtg cctgtggcca gaacaggcct
cagtcacagg 185640tgcttttcca aggtcttagt gtctaattaa ggttagcagc caaattggag
agagaagggt 185700gctggacttt actctgctgt aaggactttg ggcattgttc cattccgtga
tcaaatacca 185760ctggctctgc caaccaccat gtcagtgggt cttcagaggt agaagaactc
atcctttttt 185820gagaggtttg gtctggtcct tgtctaatgc aaaatgcctg gggcaccagg
ttaatgtcaa 185880ctcaaaggca agtgctggtc cagcatgtgt ggaatcctaa gttcaatacc
catcagagcc 185940ccaagcccta gaggacagac atgctttaaa aaaaagtcat gctttaaaaa
aattctgttg 186000agggggctgg agaaatagct cagcagttaa gagcactagc tgctctttca
gaggacccag 186060attccgttcc taacattttc atggtggctc aaagctgtct ataattcaag
tcctagaggg 186120gaatctgttg ccctctctgg ttttctcagg caccaagaac acatgtggtg
caaacataca 186180cgcaggcaaa acactcatac atatgaaaca ttttttataa acctgctcgg
atgtggtggc 186240tcatgccagc gatgctctca gcactcagat ggcagaggca gggggatttt
gtgttgagcc 186300cagcctgagc tatagaatga gatgctgtct caaaaagaaa aaaaaaacaa
aaaaaaacaa 186360aaaacaaaaa caaatctgtg ggcttaatca ttcctagcca gaaggagctg
gctccagcaa 186420caatggatcc ctagagctct gctttgcccc ctggtctgga gagcttgctc
tagaaaggaa 186480ttctccatag accacatttc tattttgggg accactctgg gcctgcacat
ggaaaaatgg 186540agagttgagg tttacatggt catttttttt gttgttacag taatgagagc
tttgaaatga 186600tcacaaaagg aaaataggaa aatatgcctc ctaaaaagag ccgaggcaaa
ttattatagc 186660aaccgaatcc taaaaggaat gttcaagtaa aaaaaaaaaa atatggctca
tgcgaagttg 186720ttatggcaac tgacatttaa aagtaacagg atttgggcaa gtttctttct
ccaccctctc 186780tgcaaagtct tgcaggaact tcatgctaaa attatgcttt aattttaatt
agccaaatag 186840gtcataaata tagcttattc ccaaatcctt agaatttcta ccctgcaagg
agctgaacta 186900ctaatgagga aatatttcca caaaaaccca cttaccatta agaaccccca
tccgatattt 186960ttctaatata atttatcaat ttaaacacat tgcataatgt gccactctgt
agcattccat 187020taaaatgata aatagcaata gtggtgaggg gtggggggca aaaaccagga
gattaaacat 187080atccaaagca gtgtagctat atttaaatac ctcaatccat tgtagaggaa
aacacactgt 187140tctcatccgc agatacagtc tagactcaga gcagcatatc cttgactgta
agggtattaa 187200taggacagac gaaggggggc aataagaaat gacaggaaac ttcagaagaa
ataaaatttc 187260tattaggctt tgttataaga ttacatcaaa gcagttcata tgttttaatc
tggggaggaa 187320aaaaagcaac tacttggggt ttgcgcctgg gggctgcctc tgtgtactga
accagacagt 187380ttgcataatg aacaattttc attcaatcag gatctcagca gagatagctc
ctactcaaag 187440gaacccggca caggctcata gtttttatct cccagctcca cctgctggag
aaaccttgta 187500ttgcagggag agaaagcagt cgggaggcat tgtcctagtg gctgtgtacc
taaagttaca 187560gacctgactt taaacagttt ctctctggag gttgaaaggg gctctgtaag
ataccagagt 187620ggattgctct caaagactct cggactcctg ttacaggcaa gtaaggtcct
agcagatggt 187680agcatggatc tccggccctt ctcactgctt tcttgaatca gggatttaga
aattgctatt 187740tgcataccag gaggactgaa gtttggctcc cggtgaccag aggacaaggt
cattgtttaa 187800aaccacccaa actcatttcc gacttggttg gtcaaatttt caagtttccc
agcagtctaa 187860ggattcataa aataaggcag aggcagagaa acggagggtg tgtgtgtgtg
tgtgtgtgtg 187920tacccaaaat ggaactgcat tttcatgcac aatagaaaac ttaaagactg
aaccaatcat 187980tttggaaaac tggcacagct gacattggct agaggaagga acggccaggg
cgagccagct 188040gcaccaagac ccagggctga ggcctaatcc gcctttatcc gagggtttag
tgaggctccc 188100gccgctcacc aatcccggct ggagccgcag aagagctctc ttcacttggc
tcagtcccag 188160cacagtcgca ctatgctctc ccgtggggag gccgctccgg gagggggagc
gacatcaagc 188220tttgtgaaac tgttttcgaa aacctgggat gatcatttaa atgtttaaaa
tatgcacatg 188280gtaattcaaa actaattacc ctgagcacat ttgaaacatt tatgccatca
tcttggatcc 188340tgcctactga ttgtgcgctg cagctcactc tggtgtttct ataaactgct
tcagcgattt 188400taacttccag gctaaatcag gcagccacag gcgctgcctc cagccctggg
ttggtggaga 188460gacccccatc cctgacttcc aggcgaggag gcggcccgtt tctccagaga
gccgtttgtc 188520agggtcttgt agttctggct gccgaattat tgctcttatc cgtgttcata
attctcatct 188580gcattattta atttaggcta gaatgacctc tttccctccc gagtcttcct
ccctcattcc 188640catttcctct tcttcaattc gtggccccca ttttctgatt ggtccaaata
tatagacaaa 188700tatccttgat cgtcccaccc cacttggcta catcttcatc tgggagccaa
tgtggtgagt 188760tttctgggtt tgcaaggtgg tcaggtccac cagtcatcct aaggtgtgtg
agagaggtag 188820accaacatga gcggcgcaca gccgccatca ctgagagagc acgtgccctg
cagctcaggc 188880acaggcatgc acacaccggc agacatgtgc acatgcgctt tccccagcaa
accctgcttg 188940cagagtaatt aggcctaggc agttcctgaa gcaaattcat ttcccccttt
tccagaataa 189000aatgagttct cttcctttgg gggtgctaaa ccagcatgcc agtggctaga
agcctgagat 189060gggtgatgtg gctgaaacca tttctgcagc caagcctgtg ggcagaagct
aaccttgggc 189120tggggagctg cagtcggaag aggcacaatt ctgggatcaa gaaatgagca
ctggtttata 189180ggtacactcc cagaaataga cagatgaggg ctgcctcctt attagcgctt
tgaagatgcc 189240catggcgggt ttttagacat ttaggaatat aaaagtaggt tggattccca
cagtcagctg 189300aagtttgaca gagtgatatt accgggttta actagagcca ttaagagact
cttcattatc 189360ccacaccacc gccacccaag ttatcacatg agccataatg caagagaatt
ttcattccat 189420caacaagaga gggagccggt ctatctttgt ccaaaggaaa tgagcagccc
agcgtgaagc 189480ttgtgaggaa ttgagtgtac aacactccaa taacatcccc tgcaggattg
cctctgcgat 189540ttagtcggtg aagcaggggt aactgcgctc gagcagtctg cctgtgtacc
tggcttgcaa 189600gaacaccagc tcgaggaaca ccaaaaaggc cgattaatga caaaggacac
tcatagaggc 189660ccgaattcca cagggcttaa gtattaagcc ccaaagaaat caaggtctag
gccattctcc 189720tggcgctcag caatctcatt tattatttct ctacaaagat ccaacactca
atttcccagg 189780tatcccctgt atctgactca cattctcctg ctcagtaagc catcctggtt
tgaaacgggc 189840ctcccctcct cctgcctatg catgctttgc gtcttcacaa cgacagctgg
taatttgcaa 189900gaccccctcc actggactct ctcaccccac atacttggaa ctactccttg
gaactacttg 189960tttatcaagt gttctgttgg tgagccttct cttgcattaa agctgtgaga
aggaaccaca 190020gtttctattt cctttacatt tcttgtagcg tctcacatgg gagacaccca
ggttagatat 190080actgagggtc ctggtagttt tagagttgga gttagatgac ccagcaacat
gccttccccc 190140accacgcacc aagcaaaaat tgcacccacc cttccctcag atgttcctgg
catcttataa 190200ctcgcccaaa gccagattta ttgctcctgc tgtaaagtgt atcttctcta
agcctcactt 190260aaaagctacc acttggcaga agatcaagtc agaagtgcag gctagcaggt
gacggtgagg 190320acagggcggg atggggcggg tagggtggag cgaataattg aagctccaag
agttaccagc 190380tcaatattta acctaactgg taatttgctg tgacaattac gccatgaagg
gaacgctgcg 190440actatgcaag aatgttgctc tctaattaag agggctctgc atttcctagt
cacccgcact 190500ttaataacac acagaatgag ccttggctcc gggagctaaa ggttccatta
ggagcacggg 190560cagcatatgg ctgtgcacat aggccgtgag tgatgcagcc cagttaagcc
cgctaacacc 190620ttcaattcgt cctcagatag agcccagaga gcgcggctca ggccctcacg
ccacgagccc 190680catttgactg acaggcatct tcccggaaag cctgcgcgtg cctacactgc
aaatggacct 190740gcttcccaca gcccggcttt caaccaggaa ggcttggcgt gggtctgatc
cttcaagagt 190800aactttaata aggattttct cacagaaaga aaagtccatg ggaacaaatc
ctcctcttaa 190860gagcgtgaga caggaatggg gacacaagcc aacaccccaa ttgctaggct
aactctgata 190920tgagacaaaa gaatattaat atcttggcta tgaaggagga tggtgccatc
ttctgaattg 190980atgggagttt tgaggcatgg ctaagctggg caaaccattt tctttttttt
ctcttcttaa 191040ttagtggttc atttatggag ggcttgctgc ccggagagcc catcagaaga
gagctcgctt 191100tatggagatg tagcttataa aactactcag attttaaaca aacagtgcag
gaggccagag 191160gtagaagtgg tgggggtggg gtggggcaag agaacaattg catctgcaga
aggctagccc 191220tgcaccccaa gcctatgttt agggttgatc agcttcccga ggcaagccca
gaagcctcta 191280aaattttagg ccaatagaaa tgacctctgc accacggctg actgaagcta
taaataagcc 191340tcgagttgag cagtggtgtc aacggagaga gcagaggaaa gtccaatcag
agcttcattt 191400ttttttttta aagtccactt gcttgggact cacctgaagg cagggcattg
agtagagcct 191460tggctccctg cagcgagagg ctccagtttt cccaggcacc agcccatcgg
ttggttacct 191520aaccaccgaa agggaactgc acagcacaca agttaaatat aggctgggtt
atctgcattt 191580tacaagctct gagcaagcta tctgaagaag ctgtcatttt taatgacggc
acaaacttcc 191640aattaccgac tgggtaatcc actagggagc aggtagtttt ggaagaacag
ttcaccatta 191700ttaaaagttt acacaatcac ttttgagttg actataagta tttcacacga
ggcaggtggg 191760attagggact ttttgggtgg tttactcgag gctgcaacca acaatgagtg
ttttctcaag 191820aattatacat tgagatttgt caactgctgg ggagtagtgg agggtcctgg
taatgcagaa 191880aggttatgaa atggccaggt aaggttgggt gcttccaagt ctcaaatata
ctcctaaggc 191940cagctccaag tcataagctc aaacaagtct tcaaggggcc tggagagtta
agacaaataa 192000ggatcactta ggctacccac ggacaagcac ttctcataca aggaccggct
acctccaaca 192060ccatcttccc aacatggctt ctatgttgct tcaacaacca gggcagggtg
aattaggggt 192120gggtctctcc aatgtggact caaatcatga ctacagcntg gggttttttt
tttttttttt 192180tttttttttt tntttggttt ttcgagacag ggtttctcca tatagccctg
gctgtcctgg 192240aactcacttt gtagaccagg ctggcctcgg actcagaaac ccgcctgcct
ctgcctctgc 192300ctcctgagtg ctggaattaa aggtgtatgc taccacgccc ggccgagtcc
gtcttgataa 192360tgaagttccc agtgacctgg atgtcaactg aagttggatt ttactgtgat
gactactgag 192420tccggctcag aattttgggg ggacaaggta ccttgattta actgggcact
acacgactgt 192480aacccccaca ttgggagagg cagaggcaga ggcagaggca gaaagttggt
tggaggctag 192540gcaaggctac acagcaagaa gctgtctcaa aaccaaagac atctttcttg
atccaaatcc 192600tgtcggaggg tgtgaggcct tgggggccag aacaaggtgg tcaaggaaga
ccactgactc 192660tgtcctttgc tccattactt aatcagaatc gccatcacag atatagctag
gagattttaa 192720gccttggtgg ctgcaatctg catttaagag ctaagtggga taaactcagg
ggtgggccca 192780atgcctccct ccccaccctc cctgcatccc tccatttacc tgtttccagg
gatctgctta 192840atttacctgc cagcctttgg tgggacacag gcttagtggc ttagcgctgc
tcggggcacc 192900agagaccctc acagaagcac ctgaatgtac tttcagcgct gcagagcacg
cacggctcag 192960gcccatcaga agaacccagg cttatgctaa ggagccagaa agtagaagca
gctggcaaga 193020gtgattcagc cccataaatt tacacatccg tacagccaaa cccacttgaa
gtgatccaga 193080gccactttta ttgaaataga aaagatgcct attctggagt gctaagtggt
acaggagggt 193140gggtatataa gagataatcc catgttgtct ttgatgtggt gctagggaga
taacccagga 193200cctcacgcct gcctgcaagg tagccaccaa gccacaccca caacctctat
ttatacacac 193260actaagtgtg gaggtatgga taaaaaaaaa tgtcccaaga cctcacgaat
ctgcaaacat 193320ggtgcctggt tggtggcacc gtttggggag gcagtggacc atttggtctt
gcaggaggaa 193380gttatgtcac tgggtatggg ctttgagagt ttgtagcttt gctccccttc
cagttaactc 193440tgctctcgta aggttcctgc caccatgttt cctctgccat tatggacacc
tggtcctcta 193500gaactgtaag ccacttactc tcaggtctct ttcagtcctg gagtcttatc
atagcaatga 193560aaagtaactt gtgtggcagc cagctaagca agggctgtgg ccgactgctt
gggattatgg 193620ttgtgtctgt ctgtctgtct gtcattccat ttatatagtc ctgagaattg
aaccacttta 193680ccactgacat gtctcagtcc tcttggtatc atatattcac ttaagacaag
atctcattaa 193740gtcattcaga ctggttttga gcttgcaatc ctcctgcctc tgcctcaagg
cgataggatc 193800cctagggtac tcgaccagac tgggagtagc aggttctgtt ctcttagctt
tctacagtga 193860ttgtggatta tttgtgtata aagatctgat ggcccgaccg actcccttcc
ctttaagtga 193920acatcaacag tatttagcat caacttaata aactcatttg gtaaagccat
ctccccacct 193980cttgaacaaa tgaaaatcaa acagcagtac ctgttctcct agagcagcgg
ctctcagcct 194040tccggccttt taatacagtt cctcgtgttg cggtgacccc cccccccccc
agccgtagaa 194100ttatttcatt gcttaaccag agttaactgg aagggttaat aataaaacca
gtctgggaga 194160ctaaggttac ccaaccacgc taggaaggag aggaaagggc cactcgcaca
aacctgtctt 194220tgagatgaag aacaatcaac ataacaggga cagagcagtc cttgtaacaa
gtgcaaagga 194280gagagagagg ctgagtttct acttctataa ataaaccctt ggcaggcgga
tcactaaagg 194340aacacaagtc aatataaacc tttagacatg gggctgccaa acttcacttt
tcgacagtat 194400attaattatg tagtcaatag ccatgggttt cattagcgta ttaaatacca
cgatcaatat 194460tatttatact tttcgaagac aagccactca gggaaaaaat ggtgggggga
ggaggaggaa 194520caatttgacc ctgtagttca aaaaaagtca gaacagcaca ctagagatta
gcaagggttt 194580aatggaaggc ataaaacact ggaaatatgg acagaaatca gatccctgcc
ttcatttttc 194640tgccttttac aaagagactg gagggaattc agaaactatt taaaataaag
gcaaaatgat 194700tagagcccct ccctcccctc agctgcttaa cactggggtt gtggtggacg
caaaataagc 194760attgagctct aagtgataga tgagaatcag aacaggaaca gtgtttttga
ggcaaaatat 194820gtccaagaga attcaaagaa ctgtgggcca gaatctactt aggcagtcct
ctgggacccg 194880aatccctcac aggcgttaac agtggaacca atttccaagg cagccctgct
ggtgatctga 194940tttttgagta gggaaatctg ttaaacatcg tcccacgagg gagcccagct
ctttcactcc 195000ccacgggttt ctacatgcag ctgtgctaga tctgctgaag tggccggtga
ggaggtgtgg 195060ggattggttc agcgacctca gaggacattc ttgttcacta gccctcgtgc
actggggcga 195120tgaccgaatg ctgtgagcag gagatatcaa aggccggcta ctggactgaa
aactagatca 195180ccatctctaa cctgcaattt gtcaatctca gacagcaatg aagactgtga
ttttctagtc 195240aacgctttgt aagcaaggtc agatagaggc tccataaaaa ttgttcaggg
ttcaggcaga 195300gaatcaagtg taactcaatc cctatctcct gagattaggg aagggaagga
aggctgtgtc 195360tactaaacca gtgagcctca agcaaagcct gtctgttctc agcaaggtga
gccacccacc 195420aaagatgcca acagctaagg gccagggatg tagtgcaggg tgctgtgata
tcaacagctg 195480ggagacagaa acaggaggat caggacttca aggtagtttg ggctataaaa
tataagcttg 195540aagctaccca cttgaagact gtccccaaca aaacaaacaa gctgggtatg
gtggtcgatg 195600cttgtatttc tagtgtatga gacgaaggaa gaaagctcaa gcccgagacc
tgcttgggtt 195660acatagggaa gatggtgcct caaaaacagg acagccgagg agcagacaga
cagggcagac 195720agggtgcatc gatctaaatc cacatacctg gatttaaagt aatatctggg
agactggtct 195780gtgagggccg ttccagagat ttaataaaga cccaccctga ctgagtatgg
gcaacaccca 195840tgggtggccc aggggtccag actgaataaa gggaaaatgg gaggaagttc
agctggtagt 195900gtttccaagt attaggacca cagcctggcc cctggcatgt gctggccagc
tagtctagct 195960ccagtatcaa gcttcaggcc agcggcaggg cactggacag ttcccacaca
cgacacacac 196020acacacagag cactagcatt cacctcctgg tctcttcttg acaacagata
aaatgtaact 196080ggctgccaca gtgagagtcc cctaccttcc tcaccgttaa ggatggtaca
aactgtgagc 196140cagcagcagc catttctcca gtaacttgct ttccacagat actgttatag
cactaagaaa 196200agcaactgaa acatggggtg ctgtgacccc ttggcaccac aaagccatgg
caagctgaag 196260tgcacacatc acaggccagg cctgaagatg ctgggggact gcaatgctgc
ctggattctg 196320gcagagatgt gcagcagatg ccaagaggtg ggctgcagca accagagata
attaatatga 196380ttaggaacac actgagcagg catgctcttg ccgaatgaaa agcctcgcag
tgtaatgact 196440gttttcttcc tcgatcacgg tctccacgtt tcagagttgg cttggtgtta
ggctgccgcg 196500taaacatcaa tccaaccccg aggggccaga tcatcggtgt tcctgggctc
aatcgccttt 196560ccttttgtgt tttcattcat ttaaagatgc attccagggt tgcaaacatt
agtgagaatc 196620atctccaggc ctcagtctaa tctctgagtc tgtaatgagt taacatcttt
ccctagtgaa 196680tatttattat gaaggctaat taattgcttt ccagttacaa gaatccttta
cagtcaaaga 196740aagtaggatc cacaaagata tactgtttat tcaaacaaag caaaggaaac
aaagcttctt 196800tcttaaattc tatttaacat agctttaata aaggtacaca ggtccgcctg
gcaaccgaac 196860ggtaactgat gcaaactgaa gccatgctct gtagcagcct ggatgtccca
gtgccacctc 196920tgtctgcagg ctttgtcgga tttactaaga ttctgttatc ttcaaacagg
gattgtgtct 196980caagtaactg accccactat gtggataatg aagtaaatta tgcaatttgg
gggtttgctt 197040ttccccaagg ggacagcaag ccagtgctta tcagccgtcc tcagaggaga
caattctgat 197100taatatcaga gtcatctgac tcagtctatt aaacctatca aaccctgaag
gaaggatatt 197160cagatattaa cgataggcct ttgattaata attctacctt gttgccattc
taagcattaa 197220caaccatgca gtaactctgc aaaacagacc ctttgattcc aggcagacgc
accctctgaa 197280cacctgggtt ctcccctact cttctccccc caggaggaac tcaagacaaa
aaggtgccac 197340cactggaaaa gcacactcca ggttacataa tttgcctcat tatccagagt
ggggttaatg 197400acttgtgaca taatttctgt ttgaagataa caaaatttca tgaaatccga
caaagccgga 197460aggcaggagg aggggactgc tgccacacta ccggtggctg agaactggag
cggaaggttc 197520acacacagcc ctctgagctc actgtctttg cttatcagtg agtcccaaga
ggggcccaga 197580tgggttgcca gcctccccta gaggatcttc attgtggagc tgtcccatgg
ggcgggaagg 197640aagccattct atttctgttc ttctctcttc cgttctggcc accagtggta
cttgctccca 197700tcacatgttc ttcctgatgt tcgcgatcag ccgtctgcca tagtctctga
agtccacggg 197760cttcacgtcc atcacagtgg ccttaattcg agattcatcc tttagaaaag
agagaagctg 197820tttgtgagtg gcagagcctg gcgtgcagcg gaagagagaa ctttctttgc
ttcagtggct 197880tcaatgagtc cagcaggaag aaggaaagtt tacaagtctc agagagaaag
tgctgtgact 197940tcctggagtt gggccagatc ctcttccaca gaccctttcc ccatcctagg
tgccctgtgc 198000tcagacctag catcctcccg gagaagcctc tgtctttcta tgggtgcagt
gggggcccag 198060agcagacagg taactcaccc taaagcatca ctttcatcta gaggagctct
gtggtagtag 198120ggactgaggc ttctgctcca gctctgggca aggttacttc tctgctcttc
accattcctg 198180tccatcccag gaagacagaa aatccctaca ctctcccttg atctacccga
ctttctgaca 198240ccagcctacc tatgttcatt taatacaaca actaaaatat ctattcacag
gcactaagct 198300ggtgataacg cagaatgcac aaactctgcg gctgcagggg agacggcaga
gttcctcctc 198360cacttgtctc cttgaactaa acagtgtctt tgaggcagaa cagggtgaca
cctagggaca 198420cacaagtcta gctgggggcc ttcatgcttc catgtgctta gtaattaatt
actacatgca 198480ccgctgttta caagtatggt taggagcccg actgcctggg ttggcctctc
gcctctgcca 198540ctccatggct ttaggttcag agtcattctc tgcatgcctc tgcctgtctc
tccgttggta 198600aagcttgcaa caacagctcc aacacagaaa gtgctgtgag ggtcgacagt
ggatagatgg 198660ctagatagat ggggcaggac ggactgtcca gtaagcaggg ttcatcatgg
ctatgcagct 198720ctggacatca ggattagttt aaacacttgt caggtggggc acttttacca
gcacgtgcta 198780tttgtttaat attctgagtt ttagaaccta aactgtggga aacaagagtc
cacacataac 198840annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 198900nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngccttgaac
ttgcttctgc 198960ctcctgctgg ggttacaggc ttgagccacc atgacagctt tagcaatagc
tttgtaaatc 199020cacagtgtca agctggatat gatggcacat gcttgcaata ctaacctcca
aagattccct 199080gaactcgagt tggtgaaata gtccaccagg taaaagagct tgctgcccaa
gcctgaatct 199140gattccctgg tcacatgctt taaggaaaga acttgccgaa gatgtcctct
gagtgccacg 199200tgtaccgatg catgcttgca gccacccaca cacccacaca agtgcactgt
ctcacacagt 199260gagaacagca agtgaacaaa caaacaagcc gggggggggg ggattgtgac
cagaataatt 199320gagggggggt gtaaagctct tggcaggtgg ctggctcctg gtaacactcc
ataagtgggg 199380aagttccaca tgtaaggtca tgtgatcgag tacatctggg cctccaacag
tccttgnnga 199440agaaacagat gcagtctgtc atatctaaac cattgttgtc gtatctctgg
gtagtctttc 199500ttttctcctt cctttctttt ttctctccct ttctctttta aaaaattatt
tatttattat 199560tatatctaag tacactgtag ctgtcttcaa acacaccaga agagggtgtc
tgtcagatct 199620cattatggat ggttgtgagc catcatgtgg ttgctgggat ttgaactcag
gaccttcaga 199680agaacagtca gtgctcttaa ccgctgagcc atctctccat cccccaaccc
ctttctcttt 199740tgagttaggt tttgtgtagc cctgggtggc gttaccttaa ctacactggc
tttgaacttg 199800caatgatact ctgcctgatc tgtcttaatc attttgagat agggactcac
tacatagcct 199860ttgctggtct ggaactaaca gagatctgcc tgtttctgcc ttgcaaatgc
tgggaataaa 199920gttatgtacc accacacctg gagtttaagg gttttttgtt tgtttgtttt
tcgagacagg 199980gtttctctgg gtagtcctgg ctgtcctgga actcgctctg tagaccaggc
tggccttgaa 200040ctcagaaatc cgcctgcctc tgcctcccaa gtgctgggat taaaggcgtg
tgccaccacg 200100cccagtttaa gggttttttt ttgtttgttt tttcctgaga cagggtttct
ctgtgtagct 200160ctggatgttc tggaactcac tctgtagagc aggttagcct tgaactcaga
aatctgactg 200220cctctgcctc ccaagtgctg ggattaaagg cgtgtgccat cactgcccag
tgattttttt 200280ttttttaatg tgtgtatttg tatgggtgtg tgggtgcttg tggaagccag
gtgtcagatc 200340cccagagcgg aagtgttctt aaccgctgaa ccatctctct tccctcttcc
ctaactctga 200400ttttaaaggc accaaactct taggtaggag actatacaca cacacacaca
cacacacaca 200460cacacccgta cacacccgta cacaccacat gaccatgcct gagcacacaa
gtggttttat 200520tgctggtctg gcctgtgtat gagctggaac caaaaccttt gtcgggagat
ccgcagtctg 200580cagtttgagc acaggctctc tggtttctgt tctctgtcct gtgtcgcatc
ttgactagag 200640gcagagaagc atctgcaagg ctgtgaccac gctggctggt gctctgccat
ctacattttc 200700aacaggaaat ctcaggagag tatttccttt taagaacgcc agacttttgt
gcctgggcca 200760cttctctact tcccagaaca ttgtgtgcca agtggcaagt tattaaccaa
gtgctttgga 200820aaattaaact ccttggtttg cagagtagca tgggagcatt gagagggtgt
atgcctaaag 200880gcctggttct gctgctggca gagctgacac ttggctaaag ggctggcatt
tctgagatga 200940gcctcactag atccgcgtct cagagtctgc aggagaaatc agagagggga
gaaggtccag 201000tggcctgttc aggatgatct tcctctgcat ttaagggcgg ctggtttgcc
cacgtagccc 201060cagaaccaaa cgagcctcgg acgaagcccc ctaaaggcag taggagagac
tgagccttgg 201120ctcttcagca ggggtgggga caagagcaag aggcgggatc tcgcccggcc
ctttagagac 201180acgtgcggtt gtttccgtgt ctgggagatc acatgacccg catcagctga
cccgtcacgg 201240tggagctcag cgctggtgct tcgcgctccc cgccctgctg cgccccggag
cgcaggaccc 201300tgcggagggg taagaaaacc cccaggcttt ctttcctttg tcgctggttc
gcgcagtcac 201360ctgcacccta ccccccgctc ctcgttcatc ccagtcttcc cggcctggca
ccccggaagc 201420cactgcgagg agggccgtgg ccaggctcag ccttgcgctg cccccaggcg
gccaggacca 201480aatggcccag gggagcagaa ggcggaaagt ggttcttaca gcagggtccg
agggctggtc 201540cccttcctca ggacctgaca tggaggagct gctccggagc gtggagagag
atctgaacat 201600tgatgcccgg cagctggccc tggcgccggg gggcactcat gtagtggccc
tagtgtccac 201660gcgttggctg gctagtctcc gggagcgccg actgggaccc tgtccccggg
ctgagggcct 201720gggtgaagca gaagtcagga ctttactgca acgttcggta cagaggctgc
ccccaggctg 201780gactcgagtg gaggtgcatg ggctgcggaa acggagactg tcctacccgc
tgggtggagg 201840cgtgcccttt gaggaggggt cctgtagccc tgaaactctc actcggttca
tgcaggaggt 201900ggctgcccag aattaccgga acctgtggcg ccatgcatac cacacttatg
gacagcctta 201960cagccacagc actgccccct cagctctacc tgccctagac tctatacgac
aagctctcca 202020gagggtgtat ggatgcacct tcttgccagt gggtgaatcc atcccatgtc
tatcaaatgt 202080cagggatggg ccctgcccct ctcggggcag ccctgcctgc cccagccttt
tgcgagctga 202140ggctttgctg gagtcgcccg agatgctcta tgtggtacac ccttatgtgc
aattctccct 202200gcatgatgta gttaccttca gccctgccaa gctgaccaac agccaagcca
aggtgctctt 202260tcttctcttc cgtgttctga gggccatgga tgcctgtcac cgccaggggc
tggcctgtgg 202320ggctctgtct ttgcaccaca ttgctgtaga cgagaagcta tgcagtgagc
tccggctgga 202380cctgagcgct tacgagatgc cttccgagga tgaaaaccag gagggctctg
aagagaaaaa 202440tgggacaggc attaagtctg aaaaagaggg ggaagggaga actgagtgtc
ccacctgcca 202500gaaagaactt cggggccttg tgctagactg ggtccatggc cgaatcagca
acttccacta 202560cctcatgcag ctgaatcggt tggcaggtcg acggcagggg gatcccaact
atcacccagt 202620gctgccctgg gtggtggact ttaccacacc ttatgggcgc ttccgagacc
ttcgtaaatc 202680caagttccga ctcaacaagg gagataagca attggacttc acctatgaga
tgacccggca 202740ggcatttgtt gcaggtggtg caggaagtgg ggagccaccc catgttcctc
accacatctc 202800tgacgtgctc tctgacatca cgtactatgt atacaaggcc cgtcgcacac
cgcgctcggt 202860gctctgtgga catgtccgag cgcagtggga accccacgag tatcctgcca
ccatggagcg 202920gatgcagacc tggacaccgg atgagtgcat acccgagttc tacacggacc
cctctatctt 202980ttgctctatc caccctgaca tgcccgacct ggatgtgccg gcctggtgca
gttctaacca 203040ggaatttgtg gctgcccatc gagccctcct ggagagctgg gaggtgtccc
aagacctgca 203100tcactggatt gatcttacct ttggctacaa actccagggc aaagaagctg
tgaaggagaa 203160gaatgtgtgt ctgcacctgg tggacgctca cacccatctg accagctatg
gcgtggtaca 203220gctatttgat cagccacacc cccaacgcct ggctggatct cctgccctgg
cccctgaacc 203280tccactcatc ccccggctgt tggtccagcc tattcgggag gccacaggcc
aggaggacat 203340ttcaggacaa cttataaatg gtgcgggcag gcttgtcgta gaggccactc
catgtgagac 203400tggctggact agagataggc ctgggacagg agaagatgat ttagaacagg
ctacagaagc 203460tctggattcc atctccctcc ccgggaaagc aggtgaccag ccaggctctt
cctccagtca 203520agcatcacct ggcctgttgt ctttttctgc accctcgggg tctcgaccag
gccgtaggag 203580caaagctgcc gggttggacc ctggggaggg tgaagagggc aagattgtcc
ttccagaggg 203640cttcagtccc atacaggcct tggaagagct ggagaaagtg ggtaacttcc
tggccaaagg 203700cctagggagc cagttggagg agcctgaaaa gcctcacgcc cagccacctg
tgcacctgca 203760gagcctcttc catcgagaca tgcaggtcct gggtgtcctg ttggctgaga
tggtgtttgc 203820caccagggtc cggatactgc agcctgatgc acctttgtgg gtacgctttg
aggctgttcg 203880gggtctctgc atacgccact ccaaggacat ccccgtgtct ctgcagcctg
tgctagacac 203940actcctacag ctgagcggac ccaaaagtcc catggtgtcg aagaagggca
agctagaccc 204000actgtttgag tataggccgg tttcccaggg attaccccca cccagcccag
cccagctcct 204060cagccccttc agctccgtgg tccccttccc tccatacttc ccagcactgc
acaagttcat 204120tcttttatat caggcccggc gtgtggagga tgaggtccag ggtcgggagc
tggcgtttgc 204180tctgtggcag cagctgggtg cggtgttaaa tgacatcact cccgagggct
tagagatcct 204240cctgcctttc gtgctgtcgc tcatgtctga ggagcacacg gctgtgtaca
cagcctggta 204300cctatttgaa cccgttgcca aggccctggg ccccaaaaat gccaacaagt
acctcctgaa 204360gcctctcatc ggtgcctatg agagcccctg ccgcctgcat ggccgcttct
acctgtacac 204420cgactgtttt gtggcccagt tggtggtgcg gctgggcttg caggccttcc
tcacccacct 204480gctgccccat gtcctccagg tactggctgg ggtggaggct tcccaggagg
agggcaaagg 204540cctggtcggg accactgagg atgaggaaag tgagctcccg gtgtccgggc
ctggctcctg 204600tgcctttggg gaagagattc agatggatgg gcagccggct gcttcctcag
gactggggct 204660cccagactac aggtcgggcg tcagcttcca tgaccaggcc gacctgccgg
acacggagga 204720cttccaagct ggactctacg tggctgaatc tccacagccc caggaggctg
aggccgtgag 204780cctgggccag ctgagtgata agagcagtac cagcgaagcc tcccagggcg
aggagagggg 204840tggggatgat ggcggtgccc ctgcggacaa gaacagcgtc aagtcagggg
acagcagcca 204900ggacttgaag cagagcgaag gctctgagga agaggaggag gaggaaggct
gtgtggtgtt 204960ggaggaggac caggaggatg aagtcacggg aacatccgag ctcactctgt
ctgacacgat 205020gctgtccatg gagacggtgg tggctcctgg tgatgggaga gacagagaag
aggaagagga 205080gccgctgaca gagcagacag aaggcaaaga acaaaagatc ctccttggtg
agcccgtggg 205140ctgagggggc atgggtcagg tgcttttcct tcaggctctc atatgctggg
tgtgggtcca 205200accagatcca ctgtagcacg cacagccaca gtcagacaca gtgcatggaa
tgtggaagtg 205260ctgtgtgtga gtggaaagtg gggcttagat ttagctttca ggagacagaa
agctccttta 205320aaagccatac cttgggctga ggctgggagt ggagttgagt ggtagagcac
ttgtctggta 205380tactcgaggg ccctgggtgt cttatctcta gccccagaag aagtattaag
aaataaaagc 205440aagtggtggt tgagatgtga atggagccag aactggccgg aacagtcggg
tggaagtggg 205500aagagtgttc cagacaggga acagtgtgtg tgtacctctg aggctctcat
ggttccatca 205560gagaggcagg gaaaggctaa aatggttttc ttaagagagt ccagaagggc
tgggcttggt 205620ggtgcaggcc tttaatccca gcactcagga ggcagaggca ggcggatttt
tgagttcgag 205680gccagcctgg tctacaaagt gagttccagg acagccaggg ctatacagag
aaaccctgtc 205740tcgaaaaaaa aaaaaaaaaa aaagagtgta gaagggtgga agccagggac
aagtctgtac 205800aagaaggaac ttgggagcat tgccgaaagg atgacctctc tgcaggtcct
gcccgaggag 205860ccagtttctg gggaccttga ccatggctag gtgaatggac ccaggatggg
atggtcaggc 205920ttgctagcag agccacagcc gagttggctg ggtggggtgg ggtggggtgg
gaagggtgag 205980ttatctgatg agctcaggac cttttcctgc cctgcagata cagcctgcaa
gatggtccgc 206040tggctgtctg ccaagcttgg ccccacagta gcctctcgcc atgtggcccg
gaacctgctg 206100cgcctgctga catcttgtta tgttggtaag gtctgtggtt agtgctggag
accaggttcc 206160ccagccaggc ttctgcccat ccttagccct ctctaggcga ctccttccct
aacttcccag 206220cactccctga gcagggcctg ggtctcaccc attaagctgg gttttcttgg
gtaagtgggg 206280aagagcccag tattgaatga atagaagcca ccccacagtc tcagaaggcc
ggcttccctc 206340ctgccctcca ctggcttctc aacgctgctg cccttccttg gtagggccca
ctcgacagca 206400gttcaccgtc agcagtgatg acacccctcc actgaatgcc ggcaacatct
accagaagag 206460gccagtccta ggtgacatcg tgtcggggcc tgtgctcagc tgcctcctcc
acattgccta 206520cctgtatgga gaacccgttc tcacctacca gtacctgccc tacatcagct
acctggtcag 206580tccctggttc gtcaaacccc ggcttggggg tgggggcaag gatccaagga
ccagccccag 206640gtcttggggg ttccaggagg tctgtggggt gacctgtccc tccctcatct
attctgtggt 206700tctaggtagc cccagggagc aactcaaacc ccagccgact gaacagccgc
aaggaggccg 206760ggctgctggc agcggtgaca ctgacgcaga aaatcatcgt atacctctct
gacacgaccc 206820tcatggacat tctgccccgc attagccacg aggtcttgct gcctgtgctt
ggcttcctca 206880cctccttcgt cacagggtag gcccctgctg cttgggagag ccacctggct
gagggggccc 206940ccaggaaggg ctaggaagct cagggagaag cagataccgg cctgagtcat
ggttctgatg 207000ttgggggtag tggcacaggt ctttcattcc agcacccaga ggagggcaag
tttctgtgag 207060tctgagacta gcctggtcta cagagagagc tccaggctat ctaaggctcc
atagtaagac 207120tctgacttaa gaaaagagtc gtggttcatt ctgggttgtg ggtgtggctt
ggtgatggga 207180cactttccca gcatgcagga ggagctatgc ttgagttcca gcccttcaga
aaaacaaaaa 207240tgggggctgg aaagaatagc tcagggttta agagcactgg ttgctcttcc
agaggatcca 207300ggttagattc ccagctgcca catggtagct cataaccatc cggcagttct
atggaacctg 207360ccaccctcct tcggtctctg tgggcactgc aaacatgtgc acagacatac
atgcaggcag 207420aaaaaacacc catacacata aaattagacc aaaaaagttc atgttctctc
ctacctgtag 207480ctctgactaa gctacactgc ttccctgtgc ctcagtttcc tcccctggtc
tggactgatc 207540agccttacat gcagctcctg ttatttgaag ttcctggtaa attggtcaag
tccttcaggg 207600aagggctggg aactcttgca ctttgattct aggttcccca gtggggccca
ggcccggact 207660gtcctatgcg tgaaaaccat cagtctcatc gccctcatct gcttgcgcat
cgggcaggag 207720atggtccagc agcacctgag tgagccagtg gccaccttct tccaagtctt
ctctcatctg 207780catgagcttc ggcagcaggt aggcaggcag cttctgggct gggtgggcca
ggccaggcca 207840ggccaggcca gggcagtgga cccactgaat ctgtggtctt cctacccgca
ggatctgcca 207900ctggatccta agggctgtac tgagggccag ctgccagagg cgaccttctc
tgatgggcag 207960cgacgaccag tggaccccac cctgctggaa gagctgcaga aggtgttcac
cctggaaatg 208020gcgtacacaa tctacgtacc tttctcctgc ctgttgggta ttgcccatca
cgttcctttg 208080cacagagttg gtgactacat ctcttccctg gggtgggccc cgatgctttc
acctccagag 208140tcagcaatgg aatcttttta tttttatttt gacatggggt ctcatttagc
ccaggctgac 208200ctttaactcc agctccttcc agcttccacc gtctcctgtt ggcattgtag
tcatgtggca 208260ttgctcaggc ttcttncatg ttcttatttt taaatgacct gtgtgtgtgt
gtgatatctg 208320tgtgagtgtg gaggtgacag aataacagtt ggggggtcag cagatgcctt
gcctgatgag 208380catctctcta gatccagttt ttggttttgt gggcttttat gtgtgtgttt
gtttgtctgt 208440ttttgtagac agggtctctc tgtgtagcct ggccatcctg gaactcattc
agtagaccaa 208500gctggccttg agctcacaga gattcacctg cctctgcctc ccagtgctgg
gattaaaggc 208560gtgtaccact cctgcctggc tttgtttttg ttaaccacca tcctcctgcc
tcagcatctg 208620cctcccctgt gctgggatta caggtgtgtg ctatcacacc cagctaacag
tggatttaaa 208680cgtaggaatt ttaggatcag agtgaccaga tttggtccta gggcccaatt
tccacagtga 208740ttatctatct tagttaggat ctctgttgaa aatcatggtg gaacatcatt
accaaatgca 208800acttggggag gaaaaggttt attttgtctg acaactctca ggtcaccaag
ggaagtcagg 208860gcaggaactc gaggcagaag ctgaagcaaa agccatggaa gaactctggc
ttgttcctca 208920tggcttgctc agtctggtgt accccctccc cacccacctc cccacaatgg
tttctctgct 208980tatgcctggc tgtcctagaa ctcactctgt agactaggct ggcctcaaac
tcaagagatc 209040cccctgcctc tgcatctcaa gtgctaagat taaaggcggg tgccatcacc
cctgccccag 209100gggtggcact acccactgta aattggtccc ttcccatatc agttgttaaa
taagaaaact 209160cctccatagg ccaatctggt gggggaattt tctcagttga gggtttctct
tctcaaatga 209220ctgtagctga tgccaaattg ataaaacaaa tctcaaacca ccaccaccaa
caacaataaa 209280accaaacaaa ccaaacaact aaccaagaca gtgacttata aagagaatct
gaacattttc 209340cagcaggaaa ggctcaggag ctggccattc aagtctgggg aacagaatgt
aggggaatat 209400gatggtctcc agaagctacc tgcaaaggaa tgaacagctt gctgggtttt
gtggcttccc 209460ttatgggatg ggcgctgtac tgggcttctc tctgagtagg atgggccacc
ctgtagttgg 209520gaatattttg ctcctacaga attgtaagtt cccagaggca ggacacatct
gtcttattct 209580tcattgtgtg tctgatgcta gaatggtgcc tggcatacac gtgtgtgtct
ctatagagac 209640agcactcatg tctacgtatc gataaaggaa gctgttttgg ggggaggaaa
caggcttaca 209700gacgagaact taataaccca gagtagccca gtcagtacct tgccttggct
tctgttgttt 209760ctaagctctg ggtagatagc taccttgcca tcttccctga tcttagaact
ttccccactc 209820ccctgtaggt gacatcatcc ggaaaatcat ccccaaccat gagttggtcg
gggagctggc 209880agggctctat ctggaaagca tgagcccgag ctctcgaaac ccagccagca
tggaacccac 209940catggctagt gccggccctg aatgggaccc tcagagtggg agctgtctcc
aggacgatgg 210000ccactcaggg acctttggga gtgtcctggt tggaaatcgc atccagatcc
ctgactctca 210060gccccagagt cctgggccac tgggctccct ctctggagtg ggtagtagcg
gaggcctcag 210120caacaggaat gaagacaacg ccctgaagcg ggagctgcct cggagtgccc
atgggctgag 210180cgggaactgg ctggcgtact ggcagtacga gatcggtgtg agccagcagg
atgcccactt 210240ccacttccac cagatccgcc tgcagagctt cccagggcac acgggggccg
tcaaatgcgt 210300ggccgccctg agcagtgaag acttctttct gagtggcagc aaggaccgga
ctgtgcgcct 210360ctggccgctg tacaactatg gggacgggac caatgagacg gcttcccgcc
tcatctatgc 210420ccagcaccgc aaaagcgtct tctacgtggg ccagcttgag gccccgcagt
atgtggtgag 210480ctgtgatggg gcagtgcacg tctgggaccc cttcacaggt gagcgggccc
aggtgaggcc 210540tgttcgacgg ctgctttact gtgccttagc caggcctctg ggaacgggac
ctagtgcgaa 210600acgtacaatg gcgtattttg acggggaaga ttcagtgagg caggaagaga
agaagagtca 210660ggacttagaa tctgtgggac ccaagtttga atccactccc ccaacttacc
agcaatcggc 210720tcagttgctg caggcgtctg ccttctacct gtaagaacca aaaatttaga
agattccacg 210780agtatggctt tggcttcttg tacgacgtca cctgtcgtcg ttgtaaagag
aagtatcgag 210840tggaggaggg tcagggcaga cggaggtcgc agctagttag agcatgctat
gtgaagagag 210900cagactgttc tggggctgga cccttgactt cactgtggaa gcagcaagat
gagaaagccc 210960tgagattgtg ttttctgagg gtcactgggg aatgggatgc aggtgtgggg
tgagttggag 211020tttgaagtag ccagggctct ttgatagcca ctaagtcccc agatgtgtcc
tttttcagga 211080aagacccttc gcacagtgga tccttcagac agccgggtgc ccctgacggc
tgtggctgtc 211140atgcctgccc cacacaccag catcaccatg gccagctccg actccactct
gcgctttgtg 211200gactgcagga agccaggctt gcaggtcagg aggggtgcag ttcctgggct
actgggggtc 211260tctaggtacc agtcaggaaa gacactcagg ggactccacc aggaacgctg
cagtgacagg 211320cagccctgtg tgggtggggc gctggcacgg atggggcttt tctcttccgg
ggatggagtg 211380ggagggtcag gcctactggt ttcgtgggcc tgaatggggt gagctgcagt
agggtgggtg 211440gcagtgatgg atggcgacgg gcacttgaac acaatctcct cctatagcat
gagttccgac 211500tgggtggagg gctgaaccct gggcttgttc gctcgttggc cgtcagcccc
agtggccgga 211560gtgttgtggc tggcttctcc tcgggcttca tggtgctcct agatacccgc
acgggcctgg 211620ttctacgagg ctggccagcc catgaagggg acattctaca gatcaaggtg
actgactgcc 211680tgaggtccta tcctttcatt tctacttagg gcctggtctg ggagaggaca
ggtttatgct 211740ggtgtccctt ataactactc ggggacattc agtggggtgg gaaaatggcc
ctcgtaggcc 211800agctcaggaa ccagctgcac aggaggcagg ctaggggcag gaatcagggc
tagaactgac 211860cctgatgctc cacagcgatg ttctaatgag taacccttgt ccatatttgt
cttgcttgga 211920ggatcagggg tcacgccctg tccgtgaccc agttcaggtt aaataaagcc
aggaggctgt 211980ttactgcctg gagaccactg agcagagtcc atgccccctg ctgggctgtc
ctgatggggg 212040gcaggaacag gcgcaggcct gcgcatcgtg ttcctgcctc ctatattcaa
tcatagacct 212100cagagctcag caggttctgg gaggggagaa atagggctgc ttgtgggagg
atttctccct 212160gcagtgggaa ctctcctccc ccgccgtcca gatggaggtg aaagacaggc
actgttgctt 212220acagggaagg caggctgccc ccagctctat ccaggacccc aggggaccct
gggtctcagt 212280gtctctaaat cccaacattc taagaaagtg tcaggatggc tctggggtca
tcctgggtgt 212340tagtcccagc tctcggagtc ttctctgagc accagttctt ttcctatggg
aagtaaggac 212400atgccaggtg ttctttgaga ggacctgagt ttggttctca gcactgtcta
gctctggctc 212460cagggggttc aacacccttt atggcttctg tggacacata ttcttatgtg
gcctgcacgc 212520acacacacga acacaaataa aaataaatgt taaaagaaga cagcggcacc
ttgtacctca 212580catgttagta cgattggatg tggcagtgcc tcacagaatc tgtgggactt
tatttattta 212640tttatttttt ggtcttaaaa ttttaaaaga ttggtttagg agtggtggta
cacaccatta 212700atcgcaacac tcaggagcag aggcaggtgg atctctatgg gtttgaggcc
agcctggtct 212760acagagcaag tttcagggca gccaaggtta cacagagaaa ctctttctca
aaaaataaaa 212820acaaaatatt taaaagattt acttacttct tatttgagct aggatctccc
tattagccct 212880ggctgtcctg gaactcactg tatagaccag gctggcacct taaactcacg
aagatcctcc 212940tgcttctgcc tcctaagtgc tgagattaaa gtagtgttat accatgcccc
actattttct 213000ttatagattt ggtgttttgc ctgcttgtgt atatatgcac taccttcatg
cagtgctaat 213060aggggtcaga ggcgagtatc agctcttcct ggaactagag ttatggaagg
ttgggagtca 213120ccatgctggg actgggtcat ttgcaagagt tacaagtact tctgagccat
ctccagcccc 213180ctagagtttt tttcccccct ggctgtcctg aagtagaatc tgttcttgtt
ttgttttgtt 213240tttcgagaca gggtttctct acataggcct ggctgtcctg gaactcactc
tgtagaccag 213300gctggcctcg aactcagaaa tccgcctgcc tctggctctc agaatactgg
aattaaaggt 213360gtgcgccacc acgcctggct cagaatctgt ttttaaatga gagtaatagt
tacaggtttt 213420ttgttttgtt tttttctttt ctttttttgt ttttgttttt tggctcattt
gttttatttg 213480ttttgagaca ggtctcactc tgaaccccta ggtggcctgg agcttgctat
gtagaacaca 213540ctgactttaa acttgttttc tgagtgctgg atttatgggc ttgtgctatt
ttgcccagcc 213600tctgatggtt gttaataaca atattattta gcttttcttt tggagatagg
ctctcactgt 213660atatcaccca gacagtggct ggtctggaaa tcactgtgta ggccaggctg
accttgaatt 213720cacagagatc tgcctcccga gttcagaaat taaaagcact ctgggatggt
tttggagttt 213780ggtgagtacc caagcctcca ttgatgctat ctgtccctcc cgctctctgc
aggctgtaga 213840gggcagcgtg ctcatcagct cctcttccga ccattccttg actgtttgga
aggagctgga 213900acagaagccc acgcaccact acaagtcagc gtccgaccca atccacacct
ttgacctgta 213960cggcagcgag gtggtcaccg gcactgtagc caacaagatt ggtgtctgtt
ccctgcttga 214020gccaccctct caggccacca caaagctcag ttccgagaac ttccgtggca
cgctcactag 214080tctggctttg ctgcccacga aacgccacct cctgctgggc tcggacaatg
gcatcatccg 214140cctcctggca tagggccagc caggagttgg ctgagggcag ggcgagatga
catctctcag 214200ggcccgctcc tcattcttga tctcgaagcc gattcttcta ggcaagcccc
aggctctggc 214260tacccacatg gcctgctgtc tgggattgca cagctcctga atctccaaag
ccttgaagtg 214320gcttcatgaa actcgggaga tactgttcct aaccagcaag aattggggca
aggaaagcac 214380tgtgatcccc attgctcccc agttctgcct tctggattca catggggaca
gggcagctcc 214440aggaaatgaa aggagttggg cctttgctca gccagcttcc tctagccacg
ctctccttag 214500ctctgtttct cccttgggta ggaaactgct cctgtctagg gttctgatgg
tactgggact 214560ccaggctcag gagggctggc caggacctac gactttcagg gcttggtctg
gggttttagc 214620attcattcag ccaggtcttc agtatgggac cagaaaaaag gggatgtgag
aacagggcta 214680gggaaggggt tatatgggcc cagctggtcc aggaatgaat ccatgccttg
ccttggtacc 214740cctaaccaca gcgtttgtgc cttcagccgg ggaggcagcc cttgggacca
gcatccctag 214800ggacaggagg cagcgggaat catctctgta tctcgggttc tgcccagggg
atgggcagac 214860tctgccatct cttgagtgtt cgtttggaga agcctgagat gtggcccctg
ctgccttctc 214920actagttgca gtctatgtaa ataaggtcaa taaattcttt ggaagagcca
cggagctgag 214980tgaggctgtg ttgtgttttg ctttgcctag gctgggctca ggcagctctg
cctcagcctc 215040ccaaggagct ggggaactgg tatatgtcac tgtatatgtc actgtgcctg
gcttatggct 215100tggcttggct ttttttcaga tggtctcaag tgcctcaggt tggccttgat
cttgggatga 215160ccttcctgct tgaaacagag tagtgggctt ataggcatga cccaccaggt
ccaattttta 215220ttttttaaag gcattgattt ttatacgtgt atggttgttt tgcccacttg
tacatatgca 215280caccatactt gtgtctggtc cctgcggagg tcagaagagg gcatcgggat
cacctggaac 215340cgaagttaat gaatggttat gagccacatc tcgatgctga agattgaacc
tggatccttt 215400gcaagagcag ccagtgttct tacccactga gccatctcta agccccacac
ccagcttctt 215460ttgatacaag gtctggtagc tcaaacttga tatgcagccg aggaggttga
cctggtattc 215520cctacctacc ctcttctctc taccttccaa gtgctgatat tatacatagg
catggatagt 215580catgcccacc agtttgcctt gatggcacca gagtcaggaa agtccaaacc
tggtagttgc 215640aaacacagca agagggtaga ggcagccatt gtcctctggc tgccttggat
acagagcttc 215700tgggttgggt ggccttgggt cagttttccg aatggttcac ccttggggaa
agggaacact 215760gctgaagagg tgggaccctg ggagggccgg cctccagctg ggtctctcca
gccctcgcct 215820tggaacctag gctggaggga gccaaccagg atcctggact tgctacagtt
aggtgaacag 215880gctcctgcag cctccccttc ccttgggtag ctgtggtggt ggtggtggtg
gtggtggtgg 215940tggtggtggt ggtggtggtg gtgggggggg gggngnngnt
21598017473PRTMus sp. 17Met Lys Arg Ala Ser Ser Gly Gly Ser Arg
Leu Leu Ala Trp Val Leu 1 5 10
15Trp Leu Gln Ala Trp Arg Val Ala Thr Pro Cys Pro Gly Ala Cys Val
20 25 30Cys Tyr Asn Glu Pro
Lys Val Thr Thr Ser Cys Pro Gln Gln Gly Leu 35
40 45Gln Ala Val Pro Thr Gly Ile Pro Ala Ser Ser Gln Arg
Ile Phe Leu 50 55 60His Gly Asn Arg
Ile Ser His Val Pro Ala Ala Ser Phe Gln Ser Cys 65 70
75 80Arg Asn Leu Thr Ile Leu Trp Leu His
Ser Asn Ala Leu Ala Arg Ile 85 90
95Asp Ala Ala Ala Phe Thr Gly Leu Thr Leu Leu Glu Gln Leu Asp
Leu 100 105 110Ser Asp Asn Ala
Gln Leu His Val Val Asp Pro Thr Thr Phe His Gly 115
120 125Leu Gly His Leu His Thr Leu His Leu Asp Arg Cys
Gly Leu Arg Glu 130 135 140Leu Gly Pro
Gly Leu Phe Arg Gly Leu Ala Ala Leu Gln Tyr Leu Tyr145
150 155 160Leu Gln Asp Asn Asn Leu Gln
Ala Leu Pro Asp Asn Thr Phe Arg Asp 165
170 175Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn
Arg Ile Pro Ser 180 185 190Val
Pro Glu His Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 195
200 205Leu His Gln Asn His Val Ala Arg Val
His Pro His Ala Phe Arg Asp 210 215
220Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Met225
230 235 240Leu Pro Ala Glu
Val Leu Met Pro Leu Arg Ser Leu Gln Tyr Leu Arg 245
250 255Leu Asn Asp Asn Pro Trp Val Cys Asp Cys
Arg Ala Arg Pro Leu Trp 260 265
270Ala Trp Leu Gln Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Asn
275 280 285Leu Pro Gln Arg Leu Ala Asp
Arg Asp Leu Lys Arg Leu Ala Ala Ser 290 295
300Asp Leu Glu Gly Cys Ala Val Ala Ser Gly Pro Phe Arg Pro Ile
Gln305 310 315 320Thr Ser
Gln Leu Thr Asp Glu Glu Leu Leu Ser Leu Pro Lys Cys Cys
325 330 335Gln Pro Asp Ala Ala Asp Lys
Ala Ser Val Leu Glu Pro Gly Arg Pro 340 345
350Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly
Asp Thr 355 360 365Pro Pro Gly Asn
Gly Ser Gly Pro Arg His Ile Asn Asp Ser Pro Phe 370
375 380Gly Thr Leu Pro Ser Ser Ala Glu Pro Pro Leu Thr
Ala Leu Arg Pro385 390 395
400Gly Gly Ser Glu Pro Pro Gly Leu Pro Thr Thr Gly Pro Arg Arg Arg
405 410 415Pro Gly Cys Ser Arg
Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 420
425 430Gln Ala Gly Ser Gly Ala Ser Gly Thr Gly Asp Ala
Glu Gly Ser Gly 435 440 445Ala Leu
Pro Ala Leu Ala Cys Ser Leu Ala Pro Leu Gly Leu Ala Leu 450
455 460Val Leu Trp Thr Val Leu Gly Pro Cys465
47018283PRTArtificial SequenceDescription of Artificial Sequence
Consensus sequence 18Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa
Xaa Thr Xaa Ser 1 5 10
15Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa Pro Xaa Xaa
20 25 30Xaa Xaa Arg Xaa Phe Leu Xaa
Xaa Asn Xaa Ile Xaa Xaa Xaa Xaa Xaa 35 40
45Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa
Ser 50 55 60Asn Xaa Xaa Xaa Xaa Ile
Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa 65 70
75 80Leu Glu Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa
Leu Xaa Xaa Xaa Xaa 85 90
95Pro Xaa Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Xaa Leu Xaa Leu Xaa
100 105 110Xaa Cys Xaa Leu Xaa Xaa
Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa 115 120
125Xaa Leu Gln Tyr Leu Tyr Leu Gln Xaa Asn Xaa Xaa Xaa Xaa
Leu Xaa 130 135 140Asp Xaa Xaa Phe Xaa
Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His145 150
155 160Gly Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Phe Arg Gly Leu Xaa 165 170
175Xaa Leu Asp Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His
180 185 190Xaa Xaa Ala Phe Xaa
Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe 195
200 205Xaa Asn Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu
Xaa Xaa Leu Xaa 210 215 220Xaa Leu Xaa
Xaa Leu Arg Leu Asn Xaa Asn Xaa Trp Xaa Cys Xaa Cys225
230 235 240Arg Xaa Arg Xaa Leu Trp Xaa
Trp Xaa Xaa Xaa Xaa Arg Xaa Ser Ser 245
250 255Ser Xaa Val Xaa Cys Xaa Xaa Pro Xaa Xaa Xaa Xaa
Xaa Xaa Asp Leu 260 265 270Xaa
Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa Xaa Cys 275
2801950PRTArtificial SequenceDescription of Artificial Sequence Consensus
sequence 19Asn Xaa Trp Xaa Cys Xaa Cys Arg Ala Arg Xaa Leu Trp Xaa
Trp Xaa 1 5 10 15Xaa Xaa
Xaa Arg Xaa Ser Ser Ser Xaa Val Xaa Cys Xaa Xaa Pro Xaa 20
25 30Xaa Xaa Xaa Xaa Xaa Asp Leu Xaa Xaa
Leu Xaa Xaa Xaa Asp Xaa Xaa 35 40
45Xaa Cys 50
User Contributions:
Comment about this patent or add new information about this topic:
People who visited this patent also read: | |
Patent application number | Title |
---|---|
20140373227 | Blackberry plant named 'Drisblacksix' |
20140373226 | Ground cover rose plant named 'Poultc017' |
20140373225 | Compact Floribunda Rose Plant Named 'Poulpal056' |
20140373224 | Garden Rose Plant Named 'Poulren014' |
20140373223 | Garden rose plant named 'Poulren022' |